WorldWideScience

Sample records for haploid genome copies

  1. Dramatic improvement in genome assembly achieved using doubled-haploid genomes.

    Science.gov (United States)

    Zhang, Hong; Tan, Engkong; Suzuki, Yutaka; Hirose, Yusuke; Kinoshita, Shigeharu; Okano, Hideyuki; Kudoh, Jun; Shimizu, Atsushi; Saito, Kazuyoshi; Watabe, Shugo; Asakawa, Shuichi

    2014-10-27

    Improvement in de novo assembly of large genomes is still to be desired. Here, we improved draft genome sequence quality by employing doubled-haploid individuals. We sequenced wildtype and doubled-haploid Takifugu rubripes genomes, under the same conditions, using the Illumina platform and assembled contigs with SOAPdenovo2. We observed 5.4-fold and 2.6-fold improvement in the sizes of the N50 contig and scaffold of doubled-haploid individuals, respectively, compared to the wildtype, indicating that the use of a doubled-haploid genome aids in accurate genome analysis.

  2. A Trichosporonales genome tree based on 27 haploid and three evolutionarily conserved 'natural' hybrid genomes.

    Science.gov (United States)

    Takashima, Masako; Sriswasdi, Sira; Manabe, Ri-Ichiroh; Ohkuma, Moriya; Sugita, Takashi; Iwasaki, Wataru

    2018-01-01

    To construct a backbone tree consisting of basidiomycetous yeasts, draft genome sequences from 25 species of Trichosporonales (Tremellomycetes, Basidiomycota) were generated. In addition to the hybrid genomes of Trichosporon coremiiforme and Trichosporon ovoides that we described previously, we identified an interspecies hybrid genome in Cutaneotrichosporon mucoides (formerly Trichosporon mucoides). This hybrid genome had a gene retention rate of ~55%, and its closest haploid relative was Cutaneotrichosporon dermatis. After constructing the C. mucoides subgenomes, we generated a phylogenetic tree using genome data from the 27 haploid species and the subgenome data from the three hybrid genome species. It was a high-quality tree with 100% bootstrap support for all of the branches. The genome-based tree provided superior resolution compared with previous multi-gene analyses. Although our backbone tree does not include all Trichosporonales genera (e.g. Cryptotrichosporon), it will be valuable for future analyses of genome data. Interest in interspecies hybrid fungal genomes has recently increased because they may provide a basis for new technologies. The three Trichosporonales hybrid genomes described in this study are different from well-characterized hybrid genomes (e.g. those of Saccharomyces pastorianus and Saccharomyces bayanus) because these hybridization events probably occurred in the distant evolutionary past. Hence, they will be useful for studying genome stability following hybridization and speciation events. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.

  3. Selection on Optimal Haploid Value Increases Genetic Gain and Preserves More Genetic Diversity Relative to Genomic Selection.

    Science.gov (United States)

    Daetwyler, Hans D; Hayden, Matthew J; Spangenberg, German C; Hayes, Ben J

    2015-08-01

    Doubled haploids are routinely created and phenotypically selected in plant breeding programs to accelerate the breeding cycle. Genomic selection, which makes use of both phenotypes and genotypes, has been shown to further improve genetic gain through prediction of performance before or without phenotypic characterization of novel germplasm. Additional opportunities exist to combine genomic prediction methods with the creation of doubled haploids. Here we propose an extension to genomic selection, optimal haploid value (OHV) selection, which predicts the best doubled haploid that can be produced from a segregating plant. This method focuses selection on the haplotype and optimizes the breeding program toward its end goal of generating an elite fixed line. We rigorously tested OHV selection breeding programs, using computer simulation, and show that it results in up to 0.6 standard deviations more genetic gain than genomic selection. At the same time, OHV selection preserved a substantially greater amount of genetic diversity in the population than genomic selection, which is important to achieve long-term genetic gain in breeding populations. Copyright © 2015 by the Genetics Society of America.

  4. Generation of Mouse Haploid Somatic Cells by Small Molecules for Genome-wide Genetic Screening

    Directory of Open Access Journals (Sweden)

    Zheng-Quan He

    2017-08-01

    Full Text Available The recent success of derivation of mammalian haploid embryonic stem cells (haESCs has provided a powerful tool for large-scale functional analysis of the mammalian genome. However, haESCs rapidly become diploidized after differentiation, posing challenges for genetic analysis. Here, we show that the spontaneous diploidization of haESCs happens in metaphase due to mitotic slippage. Diploidization can be suppressed by small-molecule-mediated inhibition of CDK1 and ROCK. Through ROCK inhibition, we can generate haploid somatic cells of all three germ layers from haESCs, including terminally differentiated neurons. Using piggyBac transposon-based insertional mutagenesis, we generated a haploid neural cell library harboring genome-wide mutations for genetic screening. As a proof of concept, we screened for Mn2+-mediated toxicity and identified the Park2 gene. Our findings expand the applications of mouse haploid cell technology to somatic cell types and may also shed light on the mechanisms of ploidy maintenance.

  5. Selection on Optimal Haploid Value Increases Genetic Gain and Preserves More Genetic Diversity Relative to Genomic Selection

    OpenAIRE

    Daetwyler, Hans D.; Hayden, Matthew J.; Spangenberg, German C.; Hayes, Ben J.

    2015-01-01

    Doubled haploids are routinely created and phenotypically selected in plant breeding programs to accelerate the breeding cycle. Genomic selection, which makes use of both phenotypes and genotypes, has been shown to further improve genetic gain through prediction of performance before or without phenotypic characterization of novel germplasm. Additional opportunities exist to combine genomic prediction methods with the creation of doubled haploids. Here we propose an extension to genomic selec...

  6. A Perfect Match Genomic Landscape Provides a Unified Framework for the Precise Detection of Variation in Natural and Synthetic Haploid Genomes.

    Science.gov (United States)

    Palacios-Flores, Kim; García-Sotelo, Jair; Castillo, Alejandra; Uribe, Carina; Aguilar, Luis; Morales, Lucía; Gómez-Romero, Laura; Reyes, José; Garciarubio, Alejandro; Boege, Margareta; Dávila, Guillermo

    2018-04-01

    We present a conceptually simple, sensitive, precise, and essentially nonstatistical solution for the analysis of genome variation in haploid organisms. The generation of a Perfect Match Genomic Landscape (PMGL), which computes intergenome identity with single nucleotide resolution, reveals signatures of variation wherever a query genome differs from a reference genome. Such signatures encode the precise location of different types of variants, including single nucleotide variants, deletions, insertions, and amplifications, effectively introducing the concept of a general signature of variation. The precise nature of variants is then resolved through the generation of targeted alignments between specific sets of sequence reads and known regions of the reference genome. Thus, the perfect match logic decouples the identification of the location of variants from the characterization of their nature, providing a unified framework for the detection of genome variation. We assessed the performance of the PMGL strategy via simulation experiments. We determined the variation profiles of natural genomes and of a synthetic chromosome, both in the context of haploid yeast strains. Our approach uncovered variants that have previously escaped detection. Moreover, our strategy is ideally suited for further refining high-quality reference genomes. The source codes for the automated PMGL pipeline have been deposited in a public repository. Copyright © 2018 by the Genetics Society of America.

  7. Characterization of in vitro haploid and doubled haploid Chrysanthemum morifolium plants via unfertilized ovule culture for phenotypical traits and DNA methylation pattern.

    Directory of Open Access Journals (Sweden)

    Haibin eWang

    2014-12-01

    Full Text Available Chrysanthemum is one of important ornamental species in the world. Its highly heterozygous state complicates molecular analysis, so it is of interest to derive haploid forms. A total of 2,579 non-fertilized chrysanthemum ovules pollinated by Argyranthemum frutescens were cultured in vitro to isolate haploid progeny. One single regenerant emerged from each of three of the 105 calli produced. Chromosome counts and microsatellite fingerprinting showed that only one of the regenerants was a true haploid. Nine doubled haploid derivatives were subsequently generated by colchicine treatment of 80 in vitro cultured haploid nodal segments. Morphological screening showed that the haploid plant was shorter than the doubled haploids, and developed smaller leaves, flowers and stomata. An in vitro pollen germination test showed that few of the haploid's pollen were able to germinate and those which did so were abnormal. Both the haploid and the doubled haploids produced yellow flowers, whereas those of the maternal parental cultivar were mauve. Methylation-sensitive amplification polymorphism (MSAP profiling was further used to detect alterations in cytosine methylation caused by the haploidization and/or the chromosome doubling processes. While 52.2% of the resulting amplified fragments were cytosine methylated in the maternal parent's genome, the corresponding proportions for the haploid's and doubled haploids' genomes were, respectively, 47.0% and 51.7%, demonstrating a reduction in global cytosine methylation caused by haploidization and a partial recovery following chromosome doubling.

  8. Evaluating droplet digital PCR for the quantification of human genomic DNA: converting copies per nanoliter to nanograms nuclear DNA per microliter.

    Science.gov (United States)

    Duewer, David L; Kline, Margaret C; Romsos, Erica L; Toman, Blaza

    2018-05-01

    The highly multiplexed polymerase chain reaction (PCR) assays used for forensic human identification perform best when used with an accurately determined quantity of input DNA. To help ensure the reliable performance of these assays, we are developing a certified reference material (CRM) for calibrating human genomic DNA working standards. To enable sharing information over time and place, CRMs must provide accurate and stable values that are metrologically traceable to a common reference. We have shown that droplet digital PCR (ddPCR) limiting dilution end-point measurements of the concentration of DNA copies per volume of sample can be traceably linked to the International System of Units (SI). Unlike values assigned using conventional relationships between ultraviolet absorbance and DNA mass concentration, entity-based ddPCR measurements are expected to be stable over time. However, the forensic community expects DNA quantity to be stated in terms of mass concentration rather than entity concentration. The transformation can be accomplished given SI-traceable values and uncertainties for the number of nucleotide bases per human haploid genome equivalent (HHGE) and the average molar mass of a nucleotide monomer in the DNA polymer. This report presents the considerations required to establish the metrological traceability of ddPCR-based mass concentration estimates of human nuclear DNA. Graphical abstract The roots of metrological traceability for human nuclear DNA mass concentration results. Values for the factors in blue must be established experimentally. Values for the factors in red have been established from authoritative source materials. HHGE stands for "haploid human genome equivalent"; there are two HHGE per diploid human genome.

  9. Strategies for implementing genomic selection in family-based aquaculture breeding schemes: double haploid sib test populations

    Directory of Open Access Journals (Sweden)

    Nirea Kahsay G

    2012-10-01

    Full Text Available Abstract Background Simulation studies have shown that accuracy and genetic gain are increased in genomic selection schemes compared to traditional aquaculture sib-based schemes. In genomic selection, accuracy of selection can be maximized by increasing the precision of the estimation of SNP effects and by maximizing the relationships between test sibs and candidate sibs. Another means of increasing the accuracy of the estimation of SNP effects is to create individuals in the test population with extreme genotypes. The latter approach was studied here with creation of double haploids and use of non-random mating designs. Methods Six alternative breeding schemes were simulated in which the design of the test population was varied: test sibs inherited maternal (Mat, paternal (Pat or a mixture of maternal and paternal (MatPat double haploid genomes or test sibs were obtained by maximum coancestry mating (MaxC, minimum coancestry mating (MinC, or random (RAND mating. Three thousand test sibs and 3000 candidate sibs were genotyped. The test sibs were recorded for a trait that could not be measured on the candidates and were used to estimate SNP effects. Selection was done by truncation on genome-wide estimated breeding values and 100 individuals were selected as parents each generation, equally divided between both sexes. Results Results showed a 7 to 19% increase in selection accuracy and a 6 to 22% increase in genetic gain in the MatPat scheme compared to the RAND scheme. These increases were greater with lower heritabilities. Among all other scenarios, i.e. Mat, Pat, MaxC, and MinC, no substantial differences in selection accuracy and genetic gain were observed. Conclusions In conclusion, a test population designed with a mixture of paternal and maternal double haploids, i.e. the MatPat scheme, increases substantially the accuracy of selection and genetic gain. This will be particularly interesting for traits that cannot be recorded on the

  10. Genome Sequences of Industrially Relevant Saccharomyces cerevisiae Strain M3707, Isolated from a Sample of Distillers Yeast and Four Haploid Derivatives

    Energy Technology Data Exchange (ETDEWEB)

    Brown, Steven D.; Klingeman, Dawn M.; Johnson, Courtney M.; Clum, Alicia; Aerts, Andrea; Salamov, Asaf; Sharma, Aditi; Zane, Matthew; Barry, Kerrie; Grigoriev, Igor V.; Davison, Brian H.; Lynd, Lee R.; Gilna, Paul; Hau, Heidi; Hogsett, David A.; Froehlich, Allan C.

    2013-04-19

    Saccharomyces cerevisiae strain M3707 was isolated from a sample of commercial distillers yeast, and its genome sequence together with the genome sequences for the four derived haploid strains M3836, M3837, M3838, and M3839 has been determined. Yeasts have potential for consolidated bioprocessing (CBP) for biofuel production, and access to these genome sequences will facilitate their development.

  11. Genome Architecture and Its Roles in Human Copy Number Variation

    Directory of Open Access Journals (Sweden)

    Lu Chen

    2014-12-01

    Full Text Available Besides single-nucleotide variants in the human genome, large-scale genomic variants, such as copy number variations (CNVs, are being increasingly discovered as a genetic source of human diversity and the pathogenic factors of diseases. Recent experimental findings have shed light on the links between different genome architectures and CNV mutagenesis. In this review, we summarize various genomic features and discuss their contributions to CNV formation. Genomic repeats, including both low-copy and high-copy repeats, play important roles in CNV instability, which was initially known as DNA recombination events. Furthermore, it has been found that human genomic repeats can also induce DNA replication errors and consequently result in CNV mutations. Some recent studies showed that DNA replication timing, which reflects the high-order information of genomic organization, is involved in human CNV mutations. Our review highlights that genome architecture, from DNA sequence to high-order genomic organization, is an important molecular factor in CNV mutagenesis and human genomic instability.

  12. Induction of gynogenetic and androgenetic haploid and doubled haploid development in the brown trout (Salmo trutta Linnaeus 1758).

    Science.gov (United States)

    Michalik, O; Dobosz, S; Zalewski, T; Sapota, M; Ocalewicz, K

    2015-04-01

    Gynogenetic and androgenetic brown trout (Salmo trutta Linnaeus 1758) haploids (Hs) and doubled haploids (DHs) were produced in the present research. Haploid development was induced by radiation-induced genetic inactivation of spermatozoa (gynogenesis) or eggs (androgenesis) before insemination. To provide DHs, gynogenetic and androgenetic haploid zygotes were subjected to the high pressure shock to suppress the first mitotic cleavage. Among haploids, gynogenetic embryos were showing lower mortality when compared to the androgenetic embryos; however, most of them die before the first feeding stage. Gynogenetic doubled haploids provided in the course of the brown trout eggs activation performed by homologous and heterologous sperm (rainbow trout) were developing equally showing hatching rates of 14.76 ± 2.4% and 16.14 ± 2.90% and the survival rates at the first feeding stage of 10.48 ± 3.48% and 12.78 ± 2.18%, respectively. Significantly, lower survival rate was observed among androgenetic progenies from the diploid groups with only few specimens that survived to the first feeding stage. Cytogenetic survey showed that among embryos from the diploid variants of the research, only gynogenetic individuals possessed doubled sets of chromosomes. Thus, it is reasonable to assume that radiation employed for the genetic inactivation of the brown trout eggs misaligned mechanism responsible for the cell divisions and might have delayed or even arrested the first mitotic cleavage in the androgenetic brown trout zygotes. Moreover, protocol for the radiation-induced inactivation of the paternal and maternal genome should be adjusted as some of the cytogenetically surveyed gynogenetic and androgenetic embryos exhibited fragments of the irradiated chromosomes. © 2015 Blackwell Verlag GmbH.

  13. Breast tumor copy number aberration phenotypes and genomic instability

    International Nuclear Information System (INIS)

    Fridlyand, Jane; Jain, Ajay N; McLennan, Jane; Ziegler, John; Chin, Koei; Devries, Sandy; Feiler, Heidi; Gray, Joe W; Waldman, Frederic; Pinkel, Daniel; Albertson, Donna G; Snijders, Antoine M; Ylstra, Bauke; Li, Hua; Olshen, Adam; Segraves, Richard; Dairkee, Shanaz; Tokuyasu, Taku; Ljung, Britt Marie

    2006-01-01

    Genomic DNA copy number aberrations are frequent in solid tumors, although the underlying causes of chromosomal instability in tumors remain obscure. Genes likely to have genomic instability phenotypes when mutated (e.g. those involved in mitosis, replication, repair, and telomeres) are rarely mutated in chromosomally unstable sporadic tumors, even though such mutations are associated with some heritable cancer prone syndromes. We applied array comparative genomic hybridization (CGH) to the analysis of breast tumors. The variation in the levels of genomic instability amongst tumors prompted us to investigate whether alterations in processes/genes involved in maintenance and/or manipulation of the genome were associated with particular types of genomic instability. We discriminated three breast tumor subtypes based on genomic DNA copy number alterations. The subtypes varied with respect to level of genomic instability. We find that shorter telomeres and altered telomere related gene expression are associated with amplification, implicating telomere attrition as a promoter of this type of aberration in breast cancer. On the other hand, the numbers of chromosomal alterations, particularly low level changes, are associated with altered expression of genes in other functional classes (mitosis, cell cycle, DNA replication and repair). Further, although loss of function instability phenotypes have been demonstrated for many of the genes in model systems, we observed enhanced expression of most genes in tumors, indicating that over expression, rather than deficiency underlies instability. Many of the genes associated with higher frequency of copy number aberrations are direct targets of E2F, supporting the hypothesis that deregulation of the Rb pathway is a major contributor to chromosomal instability in breast tumors. These observations are consistent with failure to find mutations in sporadic tumors in genes that have roles in maintenance or manipulation of the genome

  14. Incidental copy-number variants identified by routine genome testing in a clinical population

    Science.gov (United States)

    Boone, Philip M.; Soens, Zachry T.; Campbell, Ian M.; Stankiewicz, Pawel; Cheung, Sau Wai; Patel, Ankita; Beaudet, Arthur L.; Plon, Sharon E.; Shaw, Chad A.; McGuire, Amy L.; Lupski, James R.

    2013-01-01

    Purpose Mutational load of susceptibility variants has not been studied on a genomic scale in a clinical population, nor has the potential to identify these mutations as incidental findings during clinical testing been systematically ascertained. Methods Array comparative genomic hybridization, a method for genome-wide detection of DNA copy-number variants, was performed clinically on DNA from 9,005 individuals. Copy-number variants encompassing or disrupting single genes were identified and analyzed for their potential to confer predisposition to dominant, adult-onset disease. Multigene copy-number variants affecting dominant, adult-onset cancer syndrome genes were also assessed. Results In our cohort, 83 single-gene copy-number variants affected 40 unique genes associated with dominant, adult-onset disorders and unrelated to the patients’ referring diagnoses (i.e., incidental) were found. Fourteen of these copy-number variants are likely disease-predisposing, 25 are likely benign, and 44 are of unknown clinical consequence. When incidental copy-number variants spanning up to 20 genes were considered, 27 copy-number variants affected 17 unique genes associated with dominant, adult-onset cancer predisposition. Conclusion Copy-number variants potentially conferring susceptibility to adult-onset disease can be identified as incidental findings during routine genome-wide testing. Some of these mutations may be medically actionable, enabling disease surveillance or prevention; however, most incidentally observed single-gene copy-number variants are currently of unclear significance to the patient. PMID:22878507

  15. Sparse representation and Bayesian detection of genome copy number alterations from microarray data.

    Science.gov (United States)

    Pique-Regi, Roger; Monso-Varona, Jordi; Ortega, Antonio; Seeger, Robert C; Triche, Timothy J; Asgharzadeh, Shahab

    2008-02-01

    Genomic instability in cancer leads to abnormal genome copy number alterations (CNA) that are associated with the development and behavior of tumors. Advances in microarray technology have allowed for greater resolution in detection of DNA copy number changes (amplifications or deletions) across the genome. However, the increase in number of measured signals and accompanying noise from the array probes present a challenge in accurate and fast identification of breakpoints that define CNA. This article proposes a novel detection technique that exploits the use of piece wise constant (PWC) vectors to represent genome copy number and sparse Bayesian learning (SBL) to detect CNA breakpoints. First, a compact linear algebra representation for the genome copy number is developed from normalized probe intensities. Second, SBL is applied and optimized to infer locations where copy number changes occur. Third, a backward elimination (BE) procedure is used to rank the inferred breakpoints; and a cut-off point can be efficiently adjusted in this procedure to control for the false discovery rate (FDR). The performance of our algorithm is evaluated using simulated and real genome datasets and compared to other existing techniques. Our approach achieves the highest accuracy and lowest FDR while improving computational speed by several orders of magnitude. The proposed algorithm has been developed into a free standing software application (GADA, Genome Alteration Detection Algorithm). http://biron.usc.edu/~piquereg/GADA

  16. Genetic dissection of mammalian ERAD through comparative haploid and CRISPR forward genetic screens

    DEFF Research Database (Denmark)

    Timms, Richard T.; Menzies, Sam A.; Tchasovnikarova, Iva A.

    2016-01-01

    The application of forward genetic screens to cultured human cells represents a powerful method to study gene function. The repurposing of the bacterial CRISPR/Cas9 system provides an effective method to disrupt gene function in mammalian cells, and has been applied to genome-wide screens. Here, we...... compare the efficacy of genome-wide CRISPR/Cas9-mediated forward genetic screens versus gene-trap mutagenesis screens in haploid human cells, which represent the existing ‘gold standard’ method. This head-to-head comparison aimed to identify genes required for the endoplasmic reticulum....../3-associated disulphide reductase. Genome-wide CRISPR/Cas9-mediated screens together with haploid genetic screens provide a powerful addition to the forward genetic toolbox....

  17. Effective Normalization for Copy Number Variation Detection from Whole Genome Sequencing

    NARCIS (Netherlands)

    Janevski, A.; Varadan, V.; Kamalakaran, S.; Banerjee, N.; Dimitrova, D.

    2012-01-01

    Background Whole genome sequencing enables a high resolution view ofthe human genome and provides unique insights into genome structureat an unprecedented scale. There have been a number of tools to infer copy number variation in the genome. These tools while validatedalso include a number of

  18. Transposition of a Ds element from a plasmid into the plant genome in Nicotiana plumbaginifolia protoplast-derived cells.

    Science.gov (United States)

    Houba-Hérin, N; Domin, M; Pédron, J

    1994-07-01

    Nicotiana plumbaginifolia haploid protoplasts were co-transformed with two plasmids, one with a NPT-II/Ds element and one with a gene encoding an amino-terminal truncated Ac transposase. It is shown that Ds can efficiently transpose from extrachromosomal DNA to N. plumbaginifolia chromosomes when the Ac transposase gene is present in trans. Ds has been shown to have transposed into the plant genome in a limited number of copies (1.9 copies per genome), for 21/32 transgenic lines tested. The flanking sequences present in the original plasmid are missing in these 21 plants. In only two of 21 plants was part of the transposase construct integrated. By segregation analysis of transgenic progeny, Ds was shown to be present in the heterozygous state in 10 lines even though haploid protoplasts had been originally transformed. This observation could indicate that integration occurred after or during DNA replication that leads to protoplast diploidization.

  19. Copy Number Variations in Tilapia Genomes.

    Science.gov (United States)

    Li, Bi Jun; Li, Hong Lian; Meng, Zining; Zhang, Yong; Lin, Haoran; Yue, Gen Hua; Xia, Jun Hong

    2017-02-01

    Discovering the nature and pattern of genome variation is fundamental in understanding phenotypic diversity among populations. Although several millions of single nucleotide polymorphisms (SNPs) have been discovered in tilapia, the genome-wide characterization of larger structural variants, such as copy number variation (CNV) regions has not been carried out yet. We conducted a genome-wide scan for CNVs in 47 individuals from three tilapia populations. Based on 254 Gb of high-quality paired-end sequencing reads, we identified 4642 distinct high-confidence CNVs. These CNVs account for 1.9% (12.411 Mb) of the used Nile tilapia reference genome. A total of 1100 predicted CNVs were found overlapping with exon regions of protein genes. Further association analysis based on linear model regression found 85 CNVs ranging between 300 and 27,000 base pairs significantly associated to population types (R 2  > 0.9 and P > 0.001). Our study sheds first insights on genome-wide CNVs in tilapia. These CNVs among and within tilapia populations may have functional effects on phenotypes and specific adaptation to particular environments.

  20. Allelic recombination between distinct genomic locations generates copy number diversity in human β-defensins

    Science.gov (United States)

    Bakar, Suhaili Abu; Hollox, Edward J.; Armour, John A. L.

    2009-01-01

    β-Defensins are small secreted antimicrobial and signaling peptides involved in the innate immune response of vertebrates. In humans, a cluster of at least 7 of these genes shows extensive copy number variation, with a diploid copy number commonly ranging between 2 and 7. Using a genetic mapping approach, we show that this cluster is at not 1 but 2 distinct genomic loci ≈5 Mb apart on chromosome band 8p23.1, contradicting the most recent genome assembly. We also demonstrate that the predominant mechanism of change in β-defensin copy number is simple allelic recombination occurring in the interval between the 2 distinct genomic loci for these genes. In 416 meiotic transmissions, we observe 3 events creating a haplotype copy number not found in the parent, equivalent to a germ-line rate of copy number change of ≈0.7% per gamete. This places it among the fastest-changing copy number variants currently known. PMID:19131514

  1. Genomic Copy Number Dictates a Gene-Independent Cell Response to CRISPR/Cas9 Targeting | Office of Cancer Genomics

    Science.gov (United States)

    The CRISPR/Cas9 system enables genome editing and somatic cell genetic screens in mammalian cells. We performed genome-scale loss-of-function screens in 33 cancer cell lines to identify genes essential for proliferation/survival and found a strong correlation between increased gene copy number and decreased cell viability after genome editing. Within regions of copy-number gain, CRISPR/Cas9 targeting of both expressed and unexpressed genes, as well as intergenic loci, led to significantly decreased cell proliferation through induction of a G2 cell-cycle arrest.

  2. Copy-number and gene dependency analysis reveals partial copy loss of wild-type SF3B1 as a novel cancer vulnerability. | Office of Cancer Genomics

    Science.gov (United States)

    Genomic instability is a hallmark of human cancer, and results in widespread somatic copy number alterations. We used a genome-scale shRNA viability screen in human cancer cell lines to systematically identify genes that are essential in the context of particular copy-number alterations (copy-number associated gene dependencies). The most enriched class of copy-number associated gene dependencies was CYCLOPS (Copy-number alterations Yielding Cancer Liabilities Owing to Partial losS) genes, and spliceosome components were the most prevalent.

  3. Mapping copy number variation by population-scale genome sequencing

    DEFF Research Database (Denmark)

    Mills, Ryan E.; Walter, Klaudia; Stewart, Chip

    2011-01-01

    Genomic structural variants (SVs) are abundant in humans, differing from other forms of variation in extent, origin and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (that is......, copy number variants) based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations. Our map encompassed 22,025 deletions and 6,000 additional SVs, including insertions and tandem duplications...

  4. Haploids in Conifer Species: Characterization and Chromosomal Integrity of a Maritime Pine Cell Line

    Directory of Open Access Journals (Sweden)

    José Antonio Cabezas

    2016-11-01

    Full Text Available Haploids are a valuable tool for genomic studies in higher plants, especially those with huge genome size and long juvenile periods, such as conifers. In these species, megagametophyte cultures have been widely used to obtain haploid callus and somatic embryogenic lines. One of the main problems associated with tissue culture is the potential genetic instability of the regenerants. Because of this, chromosomal stability of the callus and/or somatic embryos should also be assessed. To this end, chromosome counting, flow cytometry and genotyping using microsatellites have been reported. Here, we present an overview of the work done in conifers, with special emphasis on the production of a haploid cell line in maritime pine (Pinus pinaster L. and the use of a set of molecular markers, which includes Single Nucleotide Polymorphisms (SNPs and microsatellites or Single Sequence Repeats (SSRs, to validate chromosomal integrity confirming the presence of all chromosomic arms.

  5. Genome-wide patterns of copy number variation in the diversified chicken genomes using next-generation sequencing.

    Science.gov (United States)

    Yi, Guoqiang; Qu, Lujiang; Liu, Jianfeng; Yan, Yiyuan; Xu, Guiyun; Yang, Ning

    2014-11-07

    Copy number variation (CNV) is important and widespread in the genome, and is a major cause of disease and phenotypic diversity. Herein, we performed a genome-wide CNV analysis in 12 diversified chicken genomes based on whole genome sequencing. A total of 8,840 CNV regions (CNVRs) covering 98.2 Mb and representing 9.4% of the chicken genome were identified, ranging in size from 1.1 to 268.8 kb with an average of 11.1 kb. Sequencing-based predictions were confirmed at a high validation rate by two independent approaches, including array comparative genomic hybridization (aCGH) and quantitative PCR (qPCR). The Pearson's correlation coefficients between sequencing and aCGH results ranged from 0.435 to 0.755, and qPCR experiments revealed a positive validation rate of 91.71% and a false negative rate of 22.43%. In total, 2,214 (25.0%) predicted CNVRs span 2,216 (36.4%) RefSeq genes associated with specific biological functions. Besides two previously reported copy number variable genes EDN3 and PRLR, we also found some promising genes with potential in phenotypic variation. Two genes, FZD6 and LIMS1, related to disease susceptibility/resistance are covered by CNVRs. The highly duplicated SOCS2 may lead to higher bone mineral density. Entire or partial duplication of some genes like POPDC3 may have great economic importance in poultry breeding. Our results based on extensive genetic diversity provide a more refined chicken CNV map and genome-wide gene copy number estimates, and warrant future CNV association studies for important traits in chickens.

  6. Global repeat discovery and estimation of genomic copy number in a large, complex genome using a high-throughput 454 sequence survey

    Directory of Open Access Journals (Sweden)

    Varala Kranthi

    2007-05-01

    Full Text Available Abstract Background Extensive computational and database tools are available to mine genomic and genetic databases for model organisms, but little genomic data is available for many species of ecological or agricultural significance, especially those with large genomes. Genome surveys using conventional sequencing techniques are powerful, particularly for detecting sequences present in many copies per genome. However these methods are time-consuming and have potential drawbacks. High throughput 454 sequencing provides an alternative method by which much information can be gained quickly and cheaply from high-coverage surveys of genomic DNA. Results We sequenced 78 million base-pairs of randomly sheared soybean DNA which passed our quality criteria. Computational analysis of the survey sequences provided global information on the abundant repetitive sequences in soybean. The sequence was used to determine the copy number across regions of large genomic clones or contigs and discover higher-order structures within satellite repeats. We have created an annotated, online database of sequences present in multiple copies in the soybean genome. The low bias of pyrosequencing against repeat sequences is demonstrated by the overall composition of the survey data, which matches well with past estimates of repetitive DNA content obtained by DNA re-association kinetics (Cot analysis. Conclusion This approach provides a potential aid to conventional or shotgun genome assembly, by allowing rapid assessment of copy number in any clone or clone-end sequence. In addition, we show that partial sequencing can provide access to partial protein-coding sequences.

  7. Copy number variation in the bovine genome

    DEFF Research Database (Denmark)

    Fadista, João; Thomsen, Bo; Holm, Lars-Erik

    2010-01-01

    to genetic variation in cattle. Results We designed and used a set of NimbleGen CGH arrays that tile across the assayable portion of the cattle genome with approximately 6.3 million probes, at a median probe spacing of 301 bp. This study reports the highest resolution map of copy number variation...... in the cattle genome, with 304 CNV regions (CNVRs) being identified among the genomes of 20 bovine samples from 4 dairy and beef breeds. The CNVRs identified covered 0.68% (22 Mb) of the genome, and ranged in size from 1.7 to 2,031 kb (median size 16.7 kb). About 20% of the CNVs co-localized with segmental...... duplications, while 30% encompass genes, of which the majority is involved in environmental response. About 10% of the human orthologous of these genes are associated with human disease susceptibility and, hence, may have important phenotypic consequences. Conclusions Together, this analysis provides a useful...

  8. Insights into the genome structure and copy-number variation of Eimeria tenella

    Directory of Open Access Journals (Sweden)

    Lim Lik-Sin

    2012-08-01

    Full Text Available Abstract Background Eimeria is a genus of parasites in the same phylum (Apicomplexa as human parasites such as Toxoplasma, Cryptosporidium and the malaria parasite Plasmodium. As an apicomplexan whose life-cycle involves a single host, Eimeria is a convenient model for understanding this group of organisms. Although the genomes of the Apicomplexa are diverse, that of Eimeria is unique in being composed of large alternating blocks of sequence with very different characteristics - an arrangement seen in no other organism. This arrangement has impeded efforts to fully sequence the genome of Eimeria, which remains the last of the major apicomplexans to be fully analyzed. In order to increase the value of the genome sequence data and aid in the effort to gain a better understanding of the Eimeria tenella genome, we constructed a whole genome map for the parasite. Results A total of 1245 contigs representing 70.0% of the whole genome assembly sequences (Wellcome Trust Sanger Institute were selected and subjected to marker selection. Subsequently, 2482 HAPPY markers were developed and typed. Of these, 795 were considered as usable markers, and utilized in the construction of a HAPPY map. Markers developed from chromosomally-assigned genes were then integrated into the HAPPY map and this aided the assignment of a number of linkage groups to their respective chromosomes. BAC-end sequences and contigs from whole genome sequencing were also integrated to improve and validate the HAPPY map. This resulted in an integrated HAPPY map consisting of 60 linkage groups that covers approximately half of the estimated 60 Mb genome. Further analysis suggests that the segmental organization first seen in Chromosome 1 is present throughout the genome, with repeat-poor (P regions alternating with repeat-rich (R regions. Evidence of copy-number variation between strains was also uncovered. Conclusions This paper describes the application of a whole genome mapping

  9. TEGS-CN: A Statistical Method for Pathway Analysis of Genome-wide Copy Number Profile.

    Science.gov (United States)

    Huang, Yen-Tsung; Hsu, Thomas; Christiani, David C

    2014-01-01

    The effects of copy number alterations make up a significant part of the tumor genome profile, but pathway analyses of these alterations are still not well established. We proposed a novel method to analyze multiple copy numbers of genes within a pathway, termed Test for the Effect of a Gene Set with Copy Number data (TEGS-CN). TEGS-CN was adapted from TEGS, a method that we previously developed for gene expression data using a variance component score test. With additional development, we extend the method to analyze DNA copy number data, accounting for different sizes and thus various numbers of copy number probes in genes. The test statistic follows a mixture of X (2) distributions that can be obtained using permutation with scaled X (2) approximation. We conducted simulation studies to evaluate the size and the power of TEGS-CN and to compare its performance with TEGS. We analyzed a genome-wide copy number data from 264 patients of non-small-cell lung cancer. With the Molecular Signatures Database (MSigDB) pathway database, the genome-wide copy number data can be classified into 1814 biological pathways or gene sets. We investigated associations of the copy number profile of the 1814 gene sets with pack-years of cigarette smoking. Our analysis revealed five pathways with significant P values after Bonferroni adjustment (number data, and causal mechanisms of the five pathways require further study.

  10. Identification of copy number variants defining genomic differences among major human groups.

    Directory of Open Access Journals (Sweden)

    Lluís Armengol

    Full Text Available BACKGROUND: Understanding the genetic contribution to phenotype variation of human groups is necessary to elucidate differences in disease predisposition and response to pharmaceutical treatments in different human populations. METHODOLOGY/PRINCIPAL FINDINGS: We have investigated the genome-wide profile of structural variation on pooled samples from the three populations studied in the HapMap project by comparative genome hybridization (CGH in different array platforms. We have identified and experimentally validated 33 genomic loci that show significant copy number differences from one population to the other. Interestingly, we found an enrichment of genes related to environment adaptation (immune response, lipid metabolism and extracellular space within these regions and the study of expression data revealed that more than half of the copy number variants (CNVs translate into gene-expression differences among populations, suggesting that they could have functional consequences. In addition, the identification of single nucleotide polymorphisms (SNPs that are in linkage disequilibrium with the copy number alleles allowed us to detect evidences of population differentiation and recent selection at the nucleotide variation level. CONCLUSIONS: Overall, our results provide a comprehensive view of relevant copy number changes that might play a role in phenotypic differences among major human populations, and generate a list of interesting candidates for future studies.

  11. Whole genome DNA copy number changes identified by high density oligonucleotide arrays

    Directory of Open Access Journals (Sweden)

    Huang Jing

    2004-05-01

    Full Text Available Abstract Changes in DNA copy number are one of the hallmarks of the genetic instability common to most human cancers. Previous micro-array-based methods have been used to identify chromosomal gains and losses; however, they are unable to genotype alleles at the level of single nucleotide polymorphisms (SNPs. Here we describe a novel algorithm that uses a recently developed high-density oligonucleotide array-based SNP genotyping method, whole genome sampling analysis (WGSA, to identify genome-wide chromosomal gains and losses at high resolution. WGSA simultaneously genotypes over 10,000 SNPs by allele-specific hybridisation to perfect match (PM and mismatch (MM probes synthesised on a single array. The copy number algorithm jointly uses PM intensity and discrimination ratios between paired PM and MM intensity values to identify and estimate genetic copy number changes. Values from an experimental sample are compared with SNP-specific distributions derived from a reference set containing over 100 normal individuals to gain statistical power. Genomic regions with statistically significant copy number changes can be identified using both single point analysis and contiguous point analysis of SNP intensities. We identified multiple regions of amplification and deletion using a panel of human breast cancer cell lines. We verified these results using an independent method based on quantitative polymerase chain reaction and found that our approach is both sensitive and specific and can tolerate samples which contain a mixture of both tumour and normal DNA. In addition, by using known allele frequencies from the reference set, statistically significant genomic intervals can be identified containing contiguous stretches of homozygous markers, potentially allowing the detection of regions undergoing loss of heterozygosity (LOH without the need for a matched normal control sample. The coupling of LOH analysis, via SNP genotyping, with copy number

  12. In vitro wheat haploid embryo production by wheat x maize cross system under different environmental conditions

    International Nuclear Information System (INIS)

    Khan, M.A.; Ahmad, J.

    2011-01-01

    Haploids are helpful in studies for inter genomic relationship, identifying molecular markers, reducing time period of varietal development and increasing efficiency of breeding program. In case of bread wheat (Triticum aestivum L.), wheat x maize cross system is the most successful system due to its higher efficiency, more haploid embryo production and low genetic specificity. The haploid embryo production is affected by many factors i.e. light, temperature, relative humidity and tiller culture media. A study was carried out comprising 25 genotypes of bread wheat for haploid embryo production using 100 mgL/sup -1/ 2,4-D, 40Gl/sup -1/ Sucrose and 8mlL/sup -1/ Sulphurous acid. Haploid embryo production was observed at various levels of environmental factors i.e. maize pollen collection temperature, time of pollination after tiller emasculation, light intensity and relative humidity during haploid seed formation. Maximum haploid embryo formation recorded was 9.52%. Best temperature observed for pollination was 21-26 degree C, optimum time duration for pollination was 24 hours after emasculation, light intensity was 10,000 Lux and relative humidity was 60-65% at 20-22 degree C. (author)

  13. Evolutionary restoration of fertility in an interspecies hybrid yeast, by whole-genome duplication after a failed mating-type switch.

    Directory of Open Access Journals (Sweden)

    Raúl A Ortiz-Merino

    2017-05-01

    Full Text Available Many interspecies hybrids have been discovered in yeasts, but most of these hybrids are asexual and can replicate only mitotically. Whole-genome duplication has been proposed as a mechanism by which interspecies hybrids can regain fertility, restoring their ability to perform meiosis and sporulate. Here, we show that this process occurred naturally during the evolution of Zygosaccharomyces parabailii, an interspecies hybrid that was formed by mating between 2 parents that differed by 7% in genome sequence and by many interchromosomal rearrangements. Surprisingly, Z. parabailii has a full sexual cycle and is genetically haploid. It goes through mating-type switching and autodiploidization, followed by immediate sporulation. We identified the key evolutionary event that enabled Z. parabailii to regain fertility, which was breakage of 1 of the 2 homeologous copies of the mating-type (MAT locus in the hybrid, resulting in a chromosomal rearrangement and irreparable damage to 1 MAT locus. This rearrangement was caused by HO endonuclease, which normally functions in mating-type switching. With 1 copy of MAT inactivated, the interspecies hybrid now behaves as a haploid. Our results provide the first demonstration that MAT locus damage is a naturally occurring evolutionary mechanism for whole-genome duplication and restoration of fertility to interspecies hybrids. The events that occurred in Z. parabailii strongly resemble those postulated to have caused ancient whole-genome duplication in an ancestor of Saccharomyces cerevisiae.

  14. Haploid rice plants in mutation studies

    Energy Technology Data Exchange (ETDEWEB)

    Tanaka, S [Institute of Radiation Breeding, Ministry of Agriculture and Forestry, Ohmiya, Ibaraki-ken (Japan)

    1970-03-01

    Studies were made on chlorophyll-deficient sectors and diploid-like sectors in haploid rice plants exposed to chronic gamma irradiation, and on germinal mutations in diploid strains derived from the haploid plants. The induction and elimination of somatic mutations in haploid plants and the occurrence of drastic germinal mutations in diploid strains from haploid plants are discussed. (author)

  15. Multiplicity of genome equivalents in the radiation-resistant bacterium Micrococcus radiodurans.

    Science.gov (United States)

    Hansen, M T

    1978-01-01

    The complexity of the genome of Micrococcus radiodurans was determined to be (2.0 +/- 0.3) X 10(9) daltons by DNA renaturation kinetics. The number of genome equivalents of DNA per cell was calculated from the complexity and the content of DNA. A lower limit of four genome equivalents per cell was approached with decreasing growth rate. Thus, no haploid stage appeared to be realized in this organism. The replication time was estimated from the kinetics and amount of residual DNA synthesis after inhibiting initiation of new rounds of replication. From this, the redundancy of terminal genetic markers was calculated to vary with growth rate from four to approximately eight copies per cell. All genetic material, including the least abundant, is thus multiply represented in each cell. The potential significance of the maintenance in each cell of multiple gene copies is discussed in relation to the extreme radiation resistance of M. radiodurans. PMID:649572

  16. The role of epistatic interactions underpinning resistance to parasitic Varroa mites in haploid honey bee (Apis mellifera) drones.

    Science.gov (United States)

    Conlon, Benjamin H; Frey, Eva; Rosenkranz, Peter; Locke, Barbara; Moritz, Robin F A; Routtu, Jarkko

    2018-06-01

    The Red Queen hypothesis predicts that host-parasite coevolutionary dynamics can select for host resistance through increased genetic diversity, recombination and evolutionary rates. However, in haplodiploid organisms such as the honeybee (Apis mellifera), models suggest the selective pressure is weaker than in diploids. Haplodiploid sex determination, found in A. mellifera, can allow deleterious recessive alleles to persist in the population through the diploid sex with negative effects predominantly expressed in the haploid sex. To overcome these negative effects in haploid genomes, epistatic interactions have been hypothesized to play an important role. Here, we use the interaction between A. mellifera and the parasitic mite Varroa destructor to test epistasis in the expression of resistance, through the inhibition of parasite reproduction, in haploid drones. We find novel loci on three chromosomes which explain over 45% of the resistance phenotype. Two of these loci interact only additively, suggesting their expression is independent of each other, but both loci interact epistatically with the third locus. With drone offspring inheriting only one copy of the queen's chromosomes, the drones will only possess one of two queen alleles throughout the years-long lifetime of the honeybee colony. Varroa, in comparison, completes its highly inbred reproductive cycle in a matter of weeks, allowing it to rapidly evolve resistance. Faced with the rapidly evolving Varroa, a diversity of pathways and epistatic interactions for the inhibition of Varroa reproduction could therefore provide a selective advantage to the high levels of recombination seen in A. mellifera. This allows for the remixing of phenotypes despite a fixed queen genotype. © 2018 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2018 European Society For Evolutionary Biology.

  17. aCNViewer: Comprehensive genome-wide visualization of absolute copy number and copy neutral variations.

    Directory of Open Access Journals (Sweden)

    Victor Renault

    Full Text Available Copy number variations (CNV include net gains or losses of part or whole chromosomal regions. They differ from copy neutral loss of heterozygosity (cn-LOH events which do not induce any net change in the copy number and are often associated with uniparental disomy. These phenomena have long been reported to be associated with diseases and particularly in cancer. Losses/gains of genomic regions are often correlated with lower/higher gene expression. On the other hand, loss of heterozygosity (LOH and cn-LOH are common events in cancer and may be associated with the loss of a functional tumor suppressor gene. Therefore, identifying recurrent CNV and cn-LOH events can be important as they may highlight common biological components and give insights into the development or mechanisms of a disease. However, no currently available tools allow a comprehensive whole-genome visualization of recurrent CNVs and cn-LOH in groups of samples providing absolute quantification of the aberrations leading to the loss of potentially important information.To overcome these limitations, we developed aCNViewer (Absolute CNV Viewer, a visualization tool for absolute CNVs and cn-LOH across a group of samples. aCNViewer proposes three graphical representations: dendrograms, bi-dimensional heatmaps showing chromosomal regions sharing similar abnormality patterns, and quantitative stacked histograms facilitating the identification of recurrent absolute CNVs and cn-LOH. We illustrated aCNViewer using publically available hepatocellular carcinomas (HCCs Affymetrix SNP Array data (Fig 1A. Regions 1q and 8q present a similar percentage of total gains but significantly different copy number gain categories (p-value of 0.0103 with a Fisher exact test, validated by another cohort of HCCs (p-value of 5.6e-7 (Fig 2B.aCNViewer is implemented in python and R and is available with a GNU GPLv3 license on GitHub https://github.com/FJD-CEPH/aCNViewer and Docker https://hub.docker.com/r/fjdceph/acnviewer/.aCNViewer@cephb.fr.

  18. aCNViewer: Comprehensive genome-wide visualization of absolute copy number and copy neutral variations.

    Science.gov (United States)

    Renault, Victor; Tost, Jörg; Pichon, Fabien; Wang-Renault, Shu-Fang; Letouzé, Eric; Imbeaud, Sandrine; Zucman-Rossi, Jessica; Deleuze, Jean-François; How-Kit, Alexandre

    2017-01-01

    Copy number variations (CNV) include net gains or losses of part or whole chromosomal regions. They differ from copy neutral loss of heterozygosity (cn-LOH) events which do not induce any net change in the copy number and are often associated with uniparental disomy. These phenomena have long been reported to be associated with diseases and particularly in cancer. Losses/gains of genomic regions are often correlated with lower/higher gene expression. On the other hand, loss of heterozygosity (LOH) and cn-LOH are common events in cancer and may be associated with the loss of a functional tumor suppressor gene. Therefore, identifying recurrent CNV and cn-LOH events can be important as they may highlight common biological components and give insights into the development or mechanisms of a disease. However, no currently available tools allow a comprehensive whole-genome visualization of recurrent CNVs and cn-LOH in groups of samples providing absolute quantification of the aberrations leading to the loss of potentially important information. To overcome these limitations, we developed aCNViewer (Absolute CNV Viewer), a visualization tool for absolute CNVs and cn-LOH across a group of samples. aCNViewer proposes three graphical representations: dendrograms, bi-dimensional heatmaps showing chromosomal regions sharing similar abnormality patterns, and quantitative stacked histograms facilitating the identification of recurrent absolute CNVs and cn-LOH. We illustrated aCNViewer using publically available hepatocellular carcinomas (HCCs) Affymetrix SNP Array data (Fig 1A). Regions 1q and 8q present a similar percentage of total gains but significantly different copy number gain categories (p-value of 0.0103 with a Fisher exact test), validated by another cohort of HCCs (p-value of 5.6e-7) (Fig 2B). aCNViewer is implemented in python and R and is available with a GNU GPLv3 license on GitHub https://github.com/FJD-CEPH/aCNViewer and Docker https

  19. The Small Nuclear Genomes of Selaginella Are Associated with a Low Rate of Genome Size Evolution.

    Science.gov (United States)

    Baniaga, Anthony E; Arrigo, Nils; Barker, Michael S

    2016-06-03

    The haploid nuclear genome size (1C DNA) of vascular land plants varies over several orders of magnitude. Much of this observed diversity in genome size is due to the proliferation and deletion of transposable elements. To date, all vascular land plant lineages with extremely small nuclear genomes represent recently derived states, having ancestors with much larger genome sizes. The Selaginellaceae represent an ancient lineage with extremely small genomes. It is unclear how small nuclear genomes evolved in Selaginella We compared the rates of nuclear genome size evolution in Selaginella and major vascular plant clades in a comparative phylogenetic framework. For the analyses, we collected 29 new flow cytometry estimates of haploid genome size in Selaginella to augment publicly available data. Selaginella possess some of the smallest known haploid nuclear genome sizes, as well as the lowest rate of genome size evolution observed across all vascular land plants included in our analyses. Additionally, our analyses provide strong support for a history of haploid nuclear genome size stasis in Selaginella Our results indicate that Selaginella, similar to other early diverging lineages of vascular land plants, has relatively low rates of genome size evolution. Further, our analyses highlight that a rapid transition to a small genome size is only one route to an extremely small genome. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  20. Genome-Wide Association of Copy Number Polymorphisms and Kidney Function.

    Directory of Open Access Journals (Sweden)

    Man Li

    Full Text Available Genome-wide association studies (GWAS using single nucleotide polymorphisms (SNPs have identified more than 50 loci associated with estimated glomerular filtration rate (eGFR, a measure of kidney function. However, significant SNPs account for a small proportion of eGFR variability. Other forms of genetic variation have not been comprehensively evaluated for association with eGFR. In this study, we assess whether changes in germline DNA copy number are associated with GFR estimated from serum creatinine, eGFRcrea. We used hidden Markov models (HMMs to identify copy number polymorphic regions (CNPs from high-throughput SNP arrays for 2,514 African (AA and 8,645 European ancestry (EA participants in the Atherosclerosis Risk in Communities (ARIC study. Separately for the EA and AA cohorts, we used Bayesian Gaussian mixture models to estimate copy number at regions identified by the HMM or previously reported in the HapMap Project. We identified 312 and 464 autosomal CNPs among individuals of EA and AA, respectively. Multivariate models adjusted for SNP-derived covariates of population structure identified one CNP in the EA cohort near genome-wide statistical significance (Bonferroni-adjusted p = 0.067 located on chromosome 5 (876-880kb. Overall, our findings suggest a limited role of CNPs in explaining eGFR variability.

  1. Integrative Genomics Reveals Mechanisms of Copy Number Alterations Responsible for Transcriptional Deregulation in Colorectal Cancer

    Science.gov (United States)

    Camps, Jordi; Nguyen, Quang Tri; Padilla-Nash, Hesed M.; Knutsen, Turid; McNeil, Nicole E.; Wangsa, Danny; Hummon, Amanda B.; Grade, Marian; Ried, Thomas; Difilippantonio, Michael J.

    2016-01-01

    To evaluate the mechanisms and consequences of chromosomal aberrations in colorectal cancer (CRC), we used a combination of spectral karyotyping, array comparative genomic hybridization (aCGH), and array-based global gene expression profiling on 31 primary carcinomas and 15 established cell lines. Importantly, aCGH showed that the genomic profiles of primary tumors are recapitulated in the cell lines. We revealed a preponderance of chromosome breakpoints at sites of copy number variants (CNVs) in the CRC cell lines, a novel mechanism of DNA breakage in cancer. The integration of gene expression and aCGH led to the identification of 157 genes localized within high-level copy number changes whose transcriptional deregulation was significantly affected across all of the samples, thereby suggesting that these genes play a functional role in CRC. Genomic amplification at 8q24 was the most recurrent event and led to the overexpression of MYC and FAM84B. Copy number dependent gene expression resulted in deregulation of known cancer genes such as APC, FGFR2, and ERBB2. The identification of only 36 genes whose localization near a breakpoint could account for their observed deregulated expression demonstrates that the major mechanism for transcriptional deregulation in CRC is genomic copy number changes resulting from chromosomal aberrations. PMID:19691111

  2. Semiconservative quasispecies equations for polysomic genomes: The general case

    Science.gov (United States)

    Itan, Eran; Tannenbaum, Emmanuel

    2010-06-01

    This paper develops a formulation of the quasispecies equations appropriate for polysomic, semiconservatively replicating genomes. This paper is an extension of previous work on the subject, which considered the case of haploid genomes. Here, we develop a more general formulation of the quasispecies equations that is applicable to diploid and even polyploid genomes. Interestingly, with an appropriate classification of population fractions, we obtain a system of equations that is formally identical to the haploid case. As with the work for haploid genomes, we consider both random and immortal DNA strand chromosome segregation mechanisms. However, in contrast to the haploid case, we have found that an analytical solution for the mean fitness is considerably more difficult to obtain for the polyploid case. Accordingly, whereas for the haploid case we obtained expressions for the mean fitness for the case of an analog of the single-fitness-peak landscape for arbitrary lesion repair probabilities (thereby allowing for noncomplementary genomes), here we solve for the mean fitness for the restricted case of perfect lesion repair.

  3. Functional role of a highly repetitive DNA sequence in anchorage of the mouse genome.

    Science.gov (United States)

    Neuer-Nitsche, B; Lu, X N; Werner, D

    1988-09-12

    The major portion of the eukaryotic genome consists of various categories of repetitive DNA sequences which have been studied with respect to their base compositions, organizations, copy numbers, transcription and species specificities; their biological roles, however, are still unclear. A novel quality of a highly repetitive mouse DNA sequence is described which points to a functional role: All copies (approximately 50,000 per haploid genome) of this DNA sequence reside on genomic Alu I DNA fragments each associated with nuclear polypeptides that are not released from DNA by proteinase K, SDS and phenol extraction. By this quality the repetitive DNA sequence is classified as a member of the sub-set of DNA sequences involved in tight DNA-polypeptide complexes which have been previously shown to be components of the subnuclear structure termed 'nuclear matrix'. From these results it has to be concluded that the repetitive DNA sequence characterized in this report represents or comprises a signal for a large number of site specific attachment points of the mouse genome in the nuclear matrix.

  4. Construction and characterization of a yeast artificial chromosome library containing seven haploid human genome equivalents

    International Nuclear Information System (INIS)

    Albertsen, H.M.; Abderrahim, H.; Cann, H.M.; Dausset, J.; Le Paslier, D.; Cohen, D.

    1990-01-01

    Prior to constructing a library of yeast artificial chromosomes (YACs) containing very large human DNA fragments, the authors performed a series of preliminary experiments aimed at developing a suitable protocol. They found an inverse relationship between YAC insert size and transformation efficiency. Evidence of occasional rearrangement within YAC inserts was found resulting in clonally stable internal deletions or clonally unstable size variations. A protocol was developed for preparative electrophoretic enrichment of high molecular mass human DNA fragments from partial restriction digests and ligation with the YAC vector in agarose. A YAC library has been constructed from large fragments of DNA from an Epstein-Barr virus-transformed human lymphoblastoid cell line. The library presently contains 50,000 clones, 95% of which are greater than 250 kilobase pairs in size. The mean YAC size of the library, calculated from 132 randomly isolated clones, is 430 kilobase pairs. The library thus contains the equivalent of approximately seven haploid human genomes

  5. Copy number variation is a fundamental aspect of the placental genome.

    Directory of Open Access Journals (Sweden)

    Roberta L Hannibal

    2014-05-01

    Full Text Available Discovery of lineage-specific somatic copy number variation (CNV in mammals has led to debate over whether CNVs are mutations that propagate disease or whether they are a normal, and even essential, aspect of cell biology. We show that 1,000 N polyploid trophoblast giant cells (TGCs of the mouse placenta contain 47 regions, totaling 138 Megabases, where genomic copies are underrepresented (UR. UR domains originate from a subset of late-replicating heterochromatic regions containing gene deserts and genes involved in cell adhesion and neurogenesis. While lineage-specific CNVs have been identified in mammalian cells, classically in the immune system where V(DJ recombination occurs, we demonstrate that CNVs form during gestation in the placenta by an underreplication mechanism, not by recombination nor deletion. Our results reveal that large scale CNVs are a normal feature of the mammalian placental genome, which are regulated systematically during embryogenesis and are propagated by a mechanism of underreplication.

  6. Beta-defensin genomic copy number is not a modifier locus for cystic fibrosis

    Directory of Open Access Journals (Sweden)

    Burgess Juliana

    2005-12-01

    Full Text Available Abstract Human beta-defensin 2 (DEFB4, also known as DEFB2 or hBD-2 is a salt-sensitive antimicrobial protein that is expressed in lung epithelia. Previous work has shown that it is encoded in a cluster of beta-defensin genes at 8p23.1, which varies in copy number between 2 and 12 in different individuals. We determined the copy number of this locus in 355 patients with cystic fibrosis (CF, and tested for correlation between beta-defensin cluster genomic copy number and lung disease associated with CF. No significant association was found.

  7. Qtl mapping of wheat doubled haploids for chlorophyll content and chlorophyll fluorescence kinetics under drought stress imposed at anthesis stage

    International Nuclear Information System (INIS)

    Ilyas, M.; Ilyas, N.; Arshad, M.; Kazi, A.G.

    2014-01-01

    Drought stress is one of the major environmental constraints to crop plants including wheat worldwide. Synthetic hexaploid can act as a vehicle for improving crop tolerance against biotic and abiotic stresses. Doubled haploid population consisting of one hundred and forty individuals derived from cross of Opata and SH223 was used in the present study to identify genomic regions associated with various quantitative attributes of physiological nature. Doubled haploid mapping population was phenotyped for chlorophyll content and chlorophyll fluorescence kinetics under control and drought stress imposed at anthesis stage. Genotyping of population was accomplished by utilizing two hundred and sixty one polymorphic Gaterslaben wheat microsatellites and Beltsville agriculture research center simple sequence repeats. Linkage map of doubled haploid population comprising of 19 linkage groups and covering map length of two thousands six hundred and twenty six (2626) cM was constructed using map maker software. Major and minor QTLs associated with quantitative traits were identified using QGene software. Major QTL for chlorophyll content (QTc.wwc-1B-S11) of doubled haploid mapping population under anthesis drought stress was mapped on chromosome 1B and explained 10.09 percent of phenotypic variation at LOD score of 5.5. Seven major and minor QTLs for PCFK of doubled haploids were identified on chromosome 1B, 7A and 7D under control and drought stress at anthesis stage. The identified QTLs are of prime importance for high resolution mapping in synthetic hexaploid wheat. Genomic synteny of doubled haploids was observed with rice chromosome 2, 4, 7 and maize chromosome 7 owing to occurrence of orthologous QTLs for chlorophyll content and chlorophyll fluorescence respectively. (author)

  8. High copy number of highly similar mariner-like transposons in planarian (Platyhelminthe): evidence for a trans-phyla horizontal transfer.

    Science.gov (United States)

    Garcia-Fernàndez, J; Bayascas-Ramírez, J R; Marfany, G; Muñoz-Mármol, A M; Casali, A; Baguñà, J; Saló, E

    1995-05-01

    Several DNA sequences similar to the mariner element were isolated and characterized in the platyhelminthe Dugesia (Girardia) tigrina. They were 1,288 bp long, flanked by two 32 bp-inverted repeats, and contained a single 339 amino acid open-reading frame (ORF) encoding the transposase. The number of copies of this element is approximately 8,000 per haploid genome, constituting a member of the middle-repetitive DNA of Dugesia tigrina. Sequence analysis of several elements showed a high percentage of conservation between the different copies. Most of them presented an intact ORF and the standard signals of actively expressed genes, which suggests that some of them are or have recently been functional transposons. The high degree of similarity shared with other mariner elements from some arthropods, together with the fact that this element is undetectable in other planarian species, strongly suggests a case of horizontal transfer between these two distant phyla.

  9. Detecting single DNA copy number variations in complex genomes using one nanogram of starting DNA and BAC-array CGH.

    Science.gov (United States)

    Guillaud-Bataille, Marine; Valent, Alexander; Soularue, Pascal; Perot, Christine; Inda, Maria Mar; Receveur, Aline; Smaïli, Sadek; Roest Crollius, Hugues; Bénard, Jean; Bernheim, Alain; Gidrol, Xavier; Danglot, Gisèle

    2004-07-29

    Comparative genomic hybridization to bacterial artificial chromosome (BAC)-arrays (array-CGH) is a highly efficient technique, allowing the simultaneous measurement of genomic DNA copy number at hundreds or thousands of loci, and the reliable detection of local one-copy-level variations. We report a genome-wide amplification method allowing the same measurement sensitivity, using 1 ng of starting genomic DNA, instead of the classical 1 microg usually necessary. Using a discrete series of DNA fragments, we defined the parameters adapted to the most faithful ligation-mediated PCR amplification and the limits of the technique. The optimized protocol allows a 3000-fold DNA amplification, retaining the quantitative characteristics of the initial genome. Validation of the amplification procedure, using DNA from 10 tumour cell lines hybridized to BAC-arrays of 1500 spots, showed almost perfectly superimposed ratios for the non-amplified and amplified DNAs. Correlation coefficients of 0.96 and 0.99 were observed for regions of low-copy-level variations and all regions, respectively (including in vivo amplified oncogenes). Finally, labelling DNA using two nucleotides bearing the same fluorophore led to a significant increase in reproducibility and to the correct detection of one-copy gain or loss in >90% of the analysed data, even for pseudotriploid tumour genomes.

  10. Genome-wide copy number variation (CNV) in patients with autoimmune Addison's disease

    Science.gov (United States)

    2011-01-01

    Background Addison's disease (AD) is caused by an autoimmune destruction of the adrenal cortex. The pathogenesis is multi-factorial, involving genetic components and hitherto unknown environmental factors. The aim of the present study was to investigate if gene dosage in the form of copy number variation (CNV) could add to the repertoire of genetic susceptibility to autoimmune AD. Methods A genome-wide study using the Affymetrix GeneChip® Genome-Wide Human SNP Array 6.0 was conducted in 26 patients with AD. CNVs in selected genes were further investigated in a larger material of patients with autoimmune AD (n = 352) and healthy controls (n = 353) by duplex Taqman real-time polymerase chain reaction assays. Results We found that low copy number of UGT2B28 was significantly more frequent in AD patients compared to controls; conversely high copy number of ADAM3A was associated with AD. Conclusions We have identified two novel CNV associations to ADAM3A and UGT2B28 in AD. The mechanism by which this susceptibility is conferred is at present unclear, but may involve steroid inactivation (UGT2B28) and T cell maturation (ADAM3A). Characterization of these proteins may unravel novel information on the pathogenesis of autoimmunity. PMID:21851588

  11. Genome-wide copy number variation (CNV in patients with autoimmune Addison's disease

    Directory of Open Access Journals (Sweden)

    Brønstad Ingeborg

    2011-08-01

    Full Text Available Abstract Background Addison's disease (AD is caused by an autoimmune destruction of the adrenal cortex. The pathogenesis is multi-factorial, involving genetic components and hitherto unknown environmental factors. The aim of the present study was to investigate if gene dosage in the form of copy number variation (CNV could add to the repertoire of genetic susceptibility to autoimmune AD. Methods A genome-wide study using the Affymetrix GeneChip® Genome-Wide Human SNP Array 6.0 was conducted in 26 patients with AD. CNVs in selected genes were further investigated in a larger material of patients with autoimmune AD (n = 352 and healthy controls (n = 353 by duplex Taqman real-time polymerase chain reaction assays. Results We found that low copy number of UGT2B28 was significantly more frequent in AD patients compared to controls; conversely high copy number of ADAM3A was associated with AD. Conclusions We have identified two novel CNV associations to ADAM3A and UGT2B28 in AD. The mechanism by which this susceptibility is conferred is at present unclear, but may involve steroid inactivation (UGT2B28 and T cell maturation (ADAM3A. Characterization of these proteins may unravel novel information on the pathogenesis of autoimmunity.

  12. Fixation Probability in a Haploid-Diploid Population.

    Science.gov (United States)

    Bessho, Kazuhiro; Otto, Sarah P

    2017-01-01

    Classical population genetic theory generally assumes either a fully haploid or fully diploid life cycle. However, many organisms exhibit more complex life cycles, with both free-living haploid and diploid stages. Here we ask what the probability of fixation is for selected alleles in organisms with haploid-diploid life cycles. We develop a genetic model that considers the population dynamics using both the Moran model and Wright-Fisher model. Applying a branching process approximation, we obtain an accurate fixation probability assuming that the population is large and the net effect of the mutation is beneficial. We also find the diffusion approximation for the fixation probability, which is accurate even in small populations and for deleterious alleles, as long as selection is weak. These fixation probabilities from branching process and diffusion approximations are similar when selection is weak for beneficial mutations that are not fully recessive. In many cases, particularly when one phase predominates, the fixation probability differs substantially for haploid-diploid organisms compared to either fully haploid or diploid species. Copyright © 2017 by the Genetics Society of America.

  13. Gene copy number variation throughout the Plasmodium falciparum genome

    Directory of Open Access Journals (Sweden)

    Stewart Lindsay B

    2009-08-01

    Full Text Available Abstract Background Gene copy number variation (CNV is responsible for several important phenotypes of the malaria parasite Plasmodium falciparum, including drug resistance, loss of infected erythrocyte cytoadherence and alteration of receptor usage for erythrocyte invasion. Despite the known effects of CNV, little is known about its extent throughout the genome. Results We performed a whole-genome survey of CNV genes in P. falciparum using comparative genome hybridisation of a diverse set of 16 laboratory culture-adapted isolates to a custom designed high density Affymetrix GeneChip array. Overall, 186 genes showed hybridisation signals consistent with deletion or amplification in one or more isolate. There is a strong association of CNV with gene length, genomic location, and low orthology to genes in other Plasmodium species. Sub-telomeric regions of all chromosomes are strongly associated with CNV genes independent from members of previously described multigene families. However, ~40% of CNV genes were located in more central regions of the chromosomes. Among the previously undescribed CNV genes, several that are of potential phenotypic relevance are identified. Conclusion CNV represents a major form of genetic variation within the P. falciparum genome; the distribution of gene features indicates the involvement of highly non-random mutational and selective processes. Additional studies should be directed at examining CNV in natural parasite populations to extend conclusions to clinical settings.

  14. Exploiting a Reference Genome in Terms of Duplications: The Network of Paralogs and Single Copy Genes in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Mara Sangiovanni

    2013-12-01

    Full Text Available Arabidopsis thaliana became the model organism for plant studies because of its small diploid genome, rapid lifecycle and short adult size. Its genome was the first among plants to be sequenced, becoming the reference in plant genomics. However, the Arabidopsis genome is characterized by an inherently complex organization, since it has undergone ancient whole genome duplications, followed by gene reduction, diploidization events and extended rearrangements, which relocated and split up the retained portions. These events, together with probable chromosome reductions, dramatically increased the genome complexity, limiting its role as a reference. The identification of paralogs and single copy genes within a highly duplicated genome is a prerequisite to understand its organization and evolution and to improve its exploitation in comparative genomics. This is still controversial, even in the widely studied Arabidopsis genome. This is also due to the lack of a reference bioinformatics pipeline that could exhaustively identify paralogs and singleton genes. We describe here a complete computational strategy to detect both duplicated and single copy genes in a genome, discussing all the methodological issues that may strongly affect the results, their quality and their reliability. This approach was used to analyze the organization of Arabidopsis nuclear protein coding genes, and besides classifying computationally defined paralogs into networks and single copy genes into different classes, it unraveled further intriguing aspects concerning the genome annotation and the gene relationships in this reference plant species. Since our results may be useful for comparative genomics and genome functional analyses, we organized a dedicated web interface to make them accessible to the scientific community.

  15. Direct Differentiation of Human Pluripotent Stem Cells into Haploid Spermatogenic Cells

    Directory of Open Access Journals (Sweden)

    Charles A. Easley, IV

    2012-09-01

    Full Text Available Human embryonic stem cells (hESCs and induced pluripotent stem cells (hiPSCs have been shown to differentiate into primordial germ cells (PGCs but not into spermatogonia, haploid spermatocytes, or spermatids. Here, we show that hESCs and hiPSCs differentiate directly into advanced male germ cell lineages, including postmeiotic, spermatid-like cells, in vitro without genetic manipulation. Furthermore, our procedure mirrors spermatogenesis in vivo by differentiating PSCs into UTF1-, PLZF-, and CDH1-positive spermatogonia-like cells; HIWI- and HILI-positive spermatocyte-like cells; and haploid cells expressing acrosin, transition protein 1, and protamine 1 (proteins that are uniquely found in spermatids and/or sperm. These spermatids show uniparental genomic imprints similar to those of human sperm on two loci: H19 and IGF2. These results demonstrate that male PSCs have the ability to differentiate directly into advanced germ cell lineages and may represent a novel strategy for studying spermatogenesis in vitro.

  16. Copy-number analysis and inference of subclonal populations in cancer genomes using Sclust.

    Science.gov (United States)

    Cun, Yupeng; Yang, Tsun-Po; Achter, Viktor; Lang, Ulrich; Peifer, Martin

    2018-06-01

    The genomes of cancer cells constantly change during pathogenesis. This evolutionary process can lead to the emergence of drug-resistant mutations in subclonal populations, which can hinder therapeutic intervention in patients. Data derived from massively parallel sequencing can be used to infer these subclonal populations using tumor-specific point mutations. The accurate determination of copy-number changes and tumor impurity is necessary to reliably infer subclonal populations by mutational clustering. This protocol describes how to use Sclust, a copy-number analysis method with a recently developed mutational clustering approach. In a series of simulations and comparisons with alternative methods, we have previously shown that Sclust accurately determines copy-number states and subclonal populations. Performance tests show that the method is computationally efficient, with copy-number analysis and mutational clustering taking Linux/Unix command-line syntax should be able to carry out analyses of subclonal populations.

  17. Doubled haploid production in Flax (Linum usitatissimum L.).

    Science.gov (United States)

    Obert, Bohus; Zácková, Zuzana; Samaj, Jozef; Pretová, Anna

    2009-01-01

    There is a requirement of haploid and double haploid material and homozygous lines for cell culture studies and breeding in flax. Anther culture is currently the most successful method producing doubled haploid lines in flax. Recently, ovary culture was also described as a good source of doubled haploids. In this review we focus on tissue and plants regeneration using anther culture, and cultivation of ovaries containing unfertilized ovules. The effect of genotype, physiological status of donor plants, donor material pre-treatment and cultivation conditions for flax anthers and ovaries is discussed here. The process of plant regeneration from anther and ovary derived calli is also in the focus of this review. Attention is paid to the ploidy level of regenerated tissue and to the use of molecular markers for determining of gametic origin of flax plants derived from anther and ovary cultures. Finally, some future prospects on the use of doubled haploids in flax biotechnology are outlined here.

  18. Genomic copy concentrations of selected waterborne viruses in a slum environment in Kampala, Uganda.

    Science.gov (United States)

    Katukiza, A Y; Temanu, H; Chung, J W; Foppen, J W A; Lens, P N L

    2013-06-01

    The presence of viruses in a slum environment where sanitation is poor is a major concern. However, little is known of their occurrence and genomic copy concentration in the slum environment. The main objective of this study was to determine the genomic copy concentrations of human adenoviruses F and G, Rotavirus (RV), Hepatitis A virus (HAV), Hepatitis E virus (HEV) and human adenovirus species A,C,D,E, and F (HAdV-ACDEF) in Bwaise III, a typical slum in Kampala, Uganda. Forty-one samples from surface water, grey water and ground water were collected from 30 sampling locations. The virus particles were recovered by glass wool filtration with elution using beef extract. DNA and RNA viruses were detected by the real time quantitative polymerase chain reaction (qPCR) and the reverse transcription-qPCR (RT-qPCR), respectively. HAdV-F and G were detected in 70.7% of the samples with concentrations up to 2.65 × 10(1) genomic copies per mL (gc mL(-1)). RV and HAV were detected in 60.9% and 17.1% of the samples, respectively. The maximum concentration of RV was 1.87 × 10(2)gc mL(-1). In addition, 78% of the samples tested positive for the HAdV-ACDEF, but all samples tested negative for HEV. These new data are essential for quantitative microbial risk assessment, and for understanding the effects of environmental pollution in slums.

  19. Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly.

    Science.gov (United States)

    Schneider, Valerie A; Graves-Lindsay, Tina; Howe, Kerstin; Bouk, Nathan; Chen, Hsiu-Chuan; Kitts, Paul A; Murphy, Terence D; Pruitt, Kim D; Thibaud-Nissen, Françoise; Albracht, Derek; Fulton, Robert S; Kremitzki, Milinn; Magrini, Vincent; Markovic, Chris; McGrath, Sean; Steinberg, Karyn Meltz; Auger, Kate; Chow, William; Collins, Joanna; Harden, Glenn; Hubbard, Timothy; Pelan, Sarah; Simpson, Jared T; Threadgold, Glen; Torrance, James; Wood, Jonathan M; Clarke, Laura; Koren, Sergey; Boitano, Matthew; Peluso, Paul; Li, Heng; Chin, Chen-Shan; Phillippy, Adam M; Durbin, Richard; Wilson, Richard K; Flicek, Paul; Eichler, Evan E; Church, Deanna M

    2017-05-01

    The human reference genome assembly plays a central role in nearly all aspects of today's basic and clinical research. GRCh38 is the first coordinate-changing assembly update since 2009; it reflects the resolution of roughly 1000 issues and encompasses modifications ranging from thousands of single base changes to megabase-scale path reorganizations, gap closures, and localization of previously orphaned sequences. We developed a new approach to sequence generation for targeted base updates and used data from new genome mapping technologies and single haplotype resources to identify and resolve larger assembly issues. For the first time, the reference assembly contains sequence-based representations for the centromeres. We also expanded the number of alternate loci to create a reference that provides a more robust representation of human population variation. We demonstrate that the updates render the reference an improved annotation substrate, alter read alignments in unchanged regions, and impact variant interpretation at clinically relevant loci. We additionally evaluated a collection of new de novo long-read haploid assemblies and conclude that although the new assemblies compare favorably to the reference with respect to continuity, error rate, and gene completeness, the reference still provides the best representation for complex genomic regions and coding sequences. We assert that the collected updates in GRCh38 make the newer assembly a more robust substrate for comprehensive analyses that will promote our understanding of human biology and advance our efforts to improve health. © 2017 Schneider et al.; Published by Cold Spring Harbor Laboratory Press.

  20. Production of haploids and doubled haploids in oil palm

    Directory of Open Access Journals (Sweden)

    Croxford Adam E

    2010-10-01

    Full Text Available Abstract Background Oil palm is the world's most productive oil-food crop despite yielding well below its theoretical maximum. This maximum could be approached with the introduction of elite F1 varieties. The development of such elite lines has thus far been prevented by difficulties in generating homozygous parental types for F1 generation. Results Here we present the first high-throughput screen to identify spontaneously-formed haploid (H and doubled haploid (DH palms. We secured over 1,000 Hs and one DH from genetically diverse material and derived further DH/mixoploid palms from Hs using colchicine. We demonstrated viability of pollen from H plants and expect to generate 100% homogeneous F1 seed from intercrosses between DH/mixoploids once they develop female inflorescences. Conclusions This study has generated genetically diverse H/DH palms from which parental clones can be selected in sufficient numbers to enable the commercial-scale breeding of F1 varieties. The anticipated step increase in productivity may help to relieve pressure to extend palm cultivation, and limit further expansion into biodiverse rainforest.

  1. The mobile genetic element Alu in the human genome

    Energy Technology Data Exchange (ETDEWEB)

    Novick, G.E. [Florida International Univ., Miami, FL (United States); Batzer, M.A.; Deininger, P.L. [Louisiana State Univ. Medical Center, New Orleans, LA (United States)] [and others

    1996-01-01

    Genetic material has been traditionally envisioned as relatively static with the exception of occasional, often deleterious mutations. The sequence DNA-to-RNA-to-protein represented for many years the central dogma relating gene structure and function. Recently, the field of molecular genetics has provided revolutionary information on the dynamic role of repetitive elements in the function of the genetic material and the evolution of humans and other organisms. Alu sequences represent the largest family of short interspersed repetitive elements (SINEs) in humans, being present in an excess of 500,000 copies per haploid genome. Alu elements, as well as the other repetitive elements, were once considered to be useless. Today, the biology of Alu transposable elements is being widely examined in order to determine the molecular basis of a growing number of identified diseases and to provide new directions in genome mapping and biomedical research. 66 refs., 5 figs.

  2. Genome size variation in the pine fusiform rust pathogen Cronartium quercuum f.sp. fusiforme as determined by flow cytometry

    Science.gov (United States)

    Claire L Anderson; Thomas L Kubisiak; C Dana Nelson; Jason A Smith; John M Davis

    2010-01-01

    The genome size of the pine fusiform rust pathogen Cronartium quercuum f.sp. fusiforme (Cqf) was determined by flow cytometric analysis of propidium iodide-stained, intact haploid pycniospores with haploid spores of two genetically well characterized fungal species, Sclerotinia sclerotiorum and Puccinia graminis f.sp. tritici, as size standards. The Cqf haploid genome...

  3. Chromosomal Copy Number Variation in Saccharomyces pastorianus Is Evidence for Extensive Genome Dynamics in Industrial Lager Brewing Strains.

    Science.gov (United States)

    van den Broek, M; Bolat, I; Nijkamp, J F; Ramos, E; Luttik, M A H; Koopman, F; Geertman, J M; de Ridder, D; Pronk, J T; Daran, J-M

    2015-09-01

    Lager brewing strains of Saccharomyces pastorianus are natural interspecific hybrids originating from the spontaneous hybridization of Saccharomyces cerevisiae and Saccharomyces eubayanus. Over the past 500 years, S. pastorianus has been domesticated to become one of the most important industrial microorganisms. Production of lager-type beers requires a set of essential phenotypes, including the ability to ferment maltose and maltotriose at low temperature, the production of flavors and aromas, and the ability to flocculate. Understanding of the molecular basis of complex brewing-related phenotypic traits is a prerequisite for rational strain improvement. While genome sequences have been reported, the variability and dynamics of S. pastorianus genomes have not been investigated in detail. Here, using deep sequencing and chromosome copy number analysis, we showed that S. pastorianus strain CBS1483 exhibited extensive aneuploidy. This was confirmed by quantitative PCR and by flow cytometry. As a direct consequence of this aneuploidy, a massive number of sequence variants was identified, leading to at least 1,800 additional protein variants in S. pastorianus CBS1483. Analysis of eight additional S. pastorianus strains revealed that the previously defined group I strains showed comparable karyotypes, while group II strains showed large interstrain karyotypic variability. Comparison of three strains with nearly identical genome sequences revealed substantial chromosome copy number variation, which may contribute to strain-specific phenotypic traits. The observed variability of lager yeast genomes demonstrates that systematic linking of genotype to phenotype requires a three-dimensional genome analysis encompassing physical chromosomal structures, the copy number of individual chromosomes or chromosomal regions, and the allelic variation of copies of individual genes. Copyright © 2015, van den Broek et al.

  4. Genome-wide detection of copy number variations among diverse horse breeds by array CGH.

    Directory of Open Access Journals (Sweden)

    Wei Wang

    Full Text Available Recent studies have found that copy number variations (CNVs are widespread in human and animal genomes. CNVs are a significant source of genetic variation, and have been shown to be associated with phenotypic diversity. However, the effect of CNVs on genetic variation in horses is not well understood. In the present study, CNVs in 6 different breeds of mare horses, Mongolia horse, Abaga horse, Hequ horse and Kazakh horse (all plateau breeds and Debao pony and Thoroughbred, were determined using aCGH. In total, seven hundred CNVs were identified ranging in size from 6.1 Kb to 0.57 Mb across all autosomes, with an average size of 43.08 Kb and a median size of 15.11 Kb. By merging overlapping CNVs, we found a total of three hundred and fifty-three CNV regions (CNVRs. The length of the CNVRs ranged from 6.1 Kb to 1.45 Mb with average and median sizes of 38.49 Kb and 13.1 Kb. Collectively, 13.59 Mb of copy number variation was identified among the horses investigated and accounted for approximately 0.61% of the horse genome sequence. Five hundred and eighteen annotated genes were affected by CNVs, which corresponded to about 2.26% of all horse genes. Through the gene ontology (GO, genetic pathway analysis and comparison of CNV genes among different breeds, we found evidence that CNVs involving 7 genes may be related to the adaptation to severe environment of these plateau horses. This study is the first report of copy number variations in Chinese horses, which indicates that CNVs are ubiquitous in the horse genome and influence many biological processes of the horse. These results will be helpful not only in mapping the horse whole-genome CNVs, but also to further research for the adaption to the high altitude severe environment for plateau horses.

  5. Verification and characterization of chromosome duplication in haploid maize.

    Science.gov (United States)

    de Oliveira Couto, E G; Resende Von Pinho, E V; Von Pinho, R G; Veiga, A D; de Carvalho, M R; de Oliveira Bustamante, F; Nascimento, M S

    2015-06-26

    Doubled haploid technology has been used by various private companies. However, information regarding chromosome duplication methodologies, particularly those concerning techniques used to identify duplication in cells, is limited. Thus, we analyzed and characterized artificially doubled haploids using microsatellites molecular markers, pollen viability, and flow cytometry techniques. Evaluated material was obtained using two different chromosome duplication protocols in maize seeds considered haploids, resulting from the cross between the haploid inducer line KEMS and 4 hybrids (GNS 3225, GNS 3032, GNS 3264, and DKB 393). Fourteen days after duplication, plant samples were collected and assessed by flow cytometry. Further, the plants were transplanted to a field, and samples were collected for DNA analyses using microsatellite markers. The tassels were collected during anthesis for pollen viability analyses. Haploid, diploid, and mixoploid individuals were detected using flow cytometry, demonstrating that this technique was efficient for identifying doubled haploids. The microsatellites markers were also efficient for confirming the ploidies preselected by flow cytometry and for identifying homozygous individuals. Pollen viability showed a significant difference between the evaluated ploidies when the Alexander and propionic-carmin stains were used. The viability rates between the plodies analyzed show potential for fertilization.

  6. Identification and quantification of genetically modified Moonshade carnation lines using conventional and TaqMan real-time polymerase chain reaction methods.

    Science.gov (United States)

    Li, Peng; Jia, Junwei; Bai, Lan; Pan, Aihu; Tang, Xueming

    2013-07-01

    Genetically modified carnation (Dianthus caryophyllus L.) Moonshade was approved for planting and commercialization in several countries from 2004. Developing methods for analyzing Moonshade is necessary for implementing genetically modified organism labeling regulations. In this study, the 5'-transgene integration sequence was isolated using thermal asymmetric interlaced (TAIL)-PCR. Based upon the 5'-transgene integration sequence, conventional and TaqMan real-time PCR assays were established. The relative limit of detection for the conventional PCR assay was 0.05 % for Moonshade using 100 ng total carnation genomic DNA, corresponding to approximately 79 copies of the carnation haploid genome, and the limits of detection and quantification of the TaqMan real-time PCR assay were estimated to be 51 and 254 copies of haploid carnation genomic DNA, respectively. These results are useful for identifying and quantifying Moonshade and its derivatives.

  7. SVA retrotransposon insertion-associated deletion represents a novel mutational mechanism underlying large genomic copy number changes with non-recurrent breakpoints

    NARCIS (Netherlands)

    J. Vogt (Julia); K. Bengesser (Kathrin); K.B.M. Claes (Kathleen B.M.); K. Wimmer (Katharina); V.-F. Mautner (Victor-Felix); R. van Minkelen (Rick); E. Legius (Eric); H. Brems (Hilde); M. Upadhyaya (Meena); J. Högel (Josef); C. Lazaro (Conxi); T. Rosenbaum (Thorsten); S. Bammert (Simone); L. Messiaen (Ludwine); D.N. Cooper (David); H. Kehrer-Sawatzki (Hildegard)

    2014-01-01

    textabstractBackground: Genomic disorders are caused by copy number changes that may exhibit recurrent breakpoints processed by nonallelic homologous recombination. However, region-specific disease-associated copy number changes have also been observed which exhibit non-recurrent breakpoints. The

  8. Simple Meets Single: The Application of CRISPR/Cas9 in Haploid Embryonic Stem Cells

    Directory of Open Access Journals (Sweden)

    Zixi Yin

    2017-01-01

    Full Text Available The CRISPR/Cas9 system provides a powerful method for the genetic manipulation of the mammalian genome, allowing knockout of individual genes as well as the generation of genome-wide knockout cell libraries for genetic screening. However, the diploid status of most mammalian cells restricts the application of CRISPR/Cas9 in genetic screening. Mammalian haploid embryonic stem cells (haESCs have only one set of chromosomes per cell, avoiding the issue of heterozygous recessive mutations in diploid cells. Thus, the combination of haESCs and CRISPR/Cas9 facilitates the generation of genome-wide knockout cell libraries for genetic screening. Here, we review recent progress in CRISPR/Cas9 and haPSCs and discuss their applications in genetic screening.

  9. Potential Value of Genomic Copy Number Variations in Schizophrenia

    Directory of Open Access Journals (Sweden)

    Chuanjun Zhuo

    2017-06-01

    Full Text Available Schizophrenia is a devastating neuropsychiatric disorder affecting approximately 1% of the global population, and the disease has imposed a considerable burden on families and society. Although, the exact cause of schizophrenia remains unknown, several lines of scientific evidence have revealed that genetic variants are strongly correlated with the development and early onset of the disease. In fact, the heritability among patients suffering from schizophrenia is as high as 80%. Genomic copy number variations (CNVs are one of the main forms of genomic variations, ubiquitously occurring in the human genome. An increasing number of studies have shown that CNVs account for population diversity and genetically related diseases, including schizophrenia. The last decade has witnessed rapid advances in the development of novel genomic technologies, which have led to the identification of schizophrenia-associated CNVs, insight into the roles of the affected genes in their intervals in schizophrenia, and successful manipulation of the target CNVs. In this review, we focus on the recent discoveries of important CNVs that are associated with schizophrenia and outline the potential values that the study of CNVs will bring to the areas of schizophrenia research, diagnosis, and therapy. Furthermore, with the help of the novel genetic tool known as the Clustered Regularly Interspaced Short Palindromic Repeats-associated nuclease 9 (CRISPR/Cas9 system, the pathogenic CNVs as genomic defects could be corrected. In conclusion, the recent novel findings of schizophrenia-associated CNVs offer an exciting opportunity for schizophrenia research to decipher the pathological mechanisms underlying the onset and development of schizophrenia as well as to provide potential clinical applications in genetic counseling, diagnosis, and therapy for this complex mental disease.

  10. Wheat-specific gene, ribosomal protein l21, used as the endogenous reference gene for qualitative and real-time quantitative polymerase chain reaction detection of transgenes.

    Science.gov (United States)

    Liu, Yi-Ke; Li, He-Ping; Huang, Tao; Cheng, Wei; Gao, Chun-Sheng; Zuo, Dong-Yun; Zhao, Zheng-Xi; Liao, Yu-Cai

    2014-10-29

    Wheat-specific ribosomal protein L21 (RPL21) is an endogenous reference gene suitable for genetically modified (GM) wheat identification. This taxon-specific RPL21 sequence displayed high homogeneity in different wheat varieties. Southern blots revealed 1 or 3 copies, and sequence analyses showed one amplicon in common wheat. Combined analyses with sequences from common wheat (AABBDD) and three diploid ancestral species, Triticum urartu (AA), Aegilops speltoides (BB), and Aegilops tauschii (DD), demonstrated the presence of this amplicon in the AA genome. Using conventional qualitative polymerase chain reaction (PCR), the limit of detection was 2 copies of wheat haploid genome per reaction. In the quantitative real-time PCR assay, limits of detection and quantification were about 2 and 8 haploid genome copies, respectively, the latter of which is 2.5-4-fold lower than other reported wheat endogenous reference genes. Construct-specific PCR assays were developed using RPL21 as an endogenous reference gene, and as little as 0.5% of GM wheat contents containing Arabidopsis NPR1 were properly quantified.

  11. A comprehensive profile of DNA copy number variations in a Korean population: identification of copy number invariant regions among Koreans.

    Science.gov (United States)

    Jeon, Jae Pil; Shim, Sung Mi; Jung, Jong Sun; Nam, Hye Young; Lee, Hye Jin; Oh, Berm Seok; Kim, Kuchan; Kim, Hyung Lae; Han, Bok Ghee

    2009-09-30

    To examine copy number variations among the Korean population, we compared individual genomes with the Korean reference genome assembly using the publicly available Korean HapMap SNP 50 k chip data from 90 individuals. Korean individuals exhibited 123 copy number variation regions (CNVRs) covering 27.2 mb, equivalent to 1.0% of the genome in the copy number variation (CNV) analysis using the combined criteria of P value (Por= 0.25) among study subjects. In contrast, when compared to the Affymetrix reference genome assembly from multiple ethnic groups, considerably more CNVRs (n=643) were detected in larger proportions (5.0%) of the genome covering 135.1 mb even by more stringent criteria (Por=0.25), reflecting ethnic diversity of structural variations between Korean and other populations. Some CNVRs were validated by the quantitative multiplex PCR of short fluorescent fragment (QMPSF) method, and then copy number invariant regions were detected among the study subjects. These copy number invariant regions would be used as good internal controls for further CNV studies. Lastly, we demonstrated that the CNV information could stratify even a single ethnic population with a proper reference genome assembly from multiple heterogeneous populations.

  12. Diploidization of cucumber (Cucumis sativus L. haploids by colchicine treatment

    Directory of Open Access Journals (Sweden)

    Vesselina Nikolova

    2014-01-01

    Full Text Available Haploid cucumber plants are totally infertile and do not undergo spontaneous diploidization. The use of haploids depends on the possibility of doubling the chromosome number and the obtaining of stable doubled haploids (DH. Four haploids of different genotypes propagated vegetatively were treated with colchicine in order to obtain DH. The following procedures were used: 1 apical shoot meristem treatment, 2 soaking of shoot explants, 3 placing of shoot explants on medium with colchicine. Plants of the C1 generation were evaluated in respect to morphological and cytological characters and fertility. The best result of 20.9% DH was obtained after repeated treatment of the meristem with colchicine. A large group of chimeras (28.5% was also distinguished as were haploids and tetraploids. DH plants were fertile and gave uniform progeny. Chimeras had a decreased fertility and showed disturbances in meiotic divisions.

  13. Specific genomic regions are differentially affected by copy number alterations across distinct cancer types, in aggregated cytogenetic data.

    Science.gov (United States)

    Kumar, Nitin; Cai, Haoyang; von Mering, Christian; Baudis, Michael

    2012-01-01

    Regional genomic copy number alterations (CNA) are observed in the vast majority of cancers. Besides specifically targeting well-known, canonical oncogenes, CNAs may also play more subtle roles in terms of modulating genetic potential and broad gene expression patterns of developing tumors. Any significant differences in the overall CNA patterns between different cancer types may thus point towards specific biological mechanisms acting in those cancers. In addition, differences among CNA profiles may prove valuable for cancer classifications beyond existing annotation systems. We have analyzed molecular-cytogenetic data from 25579 tumors samples, which were classified into 160 cancer types according to the International Classification of Disease (ICD) coding system. When correcting for differences in the overall CNA frequencies between cancer types, related cancers were often found to cluster together according to similarities in their CNA profiles. Based on a randomization approach, distance measures from the cluster dendrograms were used to identify those specific genomic regions that contributed significantly to this signal. This approach identified 43 non-neutral genomic regions whose propensity for the occurrence of copy number alterations varied with the type of cancer at hand. Only a subset of these identified loci overlapped with previously implied, highly recurrent (hot-spot) cytogenetic imbalance regions. Thus, for many genomic regions, a simple null-hypothesis of independence between cancer type and relative copy number alteration frequency can be rejected. Since a subset of these regions display relatively low overall CNA frequencies, they may point towards second-tier genomic targets that are adaptively relevant but not necessarily essential for cancer development.

  14. Population-genetic nature of copy number variations in the human genome.

    Science.gov (United States)

    Kato, Mamoru; Kawaguchi, Takahisa; Ishikawa, Shumpei; Umeda, Takayoshi; Nakamichi, Reiichiro; Shapero, Michael H; Jones, Keith W; Nakamura, Yusuke; Aburatani, Hiroyuki; Tsunoda, Tatsuhiko

    2010-03-01

    Copy number variations (CNVs) are universal genetic variations, and their association with disease has been increasingly recognized. We designed high-density microarrays for CNVs, and detected 3000-4000 CNVs (4-6% of the genomic sequence) per population that included CNVs previously missed because of smaller sizes and residing in segmental duplications. The patterns of CNVs across individuals were surprisingly simple at the kilo-base scale, suggesting the applicability of a simple genetic analysis for these genetic loci. We utilized the probabilistic theory to determine integer copy numbers of CNVs and employed a recently developed phasing tool to estimate the population frequencies of integer copy number alleles and CNV-SNP haplotypes. The results showed a tendency toward a lower frequency of CNV alleles and that most of our CNVs were explained only by zero-, one- and two-copy alleles. Using the estimated population frequencies, we found several CNV regions with exceptionally high population differentiation. Investigation of CNV-SNP linkage disequilibrium (LD) for 500-900 bi- and multi-allelic CNVs per population revealed that previous conflicting reports on bi-allelic LD were unexpectedly consistent and explained by an LD increase correlated with deletion-allele frequencies. Typically, the bi-allelic LD was lower than SNP-SNP LD, whereas the multi-allelic LD was somewhat stronger than the bi-allelic LD. After further investigation of tag SNPs for CNVs, we conclude that the customary tagging strategy for disease association studies can be applicable for common deletion CNVs, but direct interrogation is needed for other types of CNVs.

  15. Gene expansion shapes genome architecture in the human pathogen Lichtheimia corymbifera: an evolutionary genomics analysis in the ancient terrestrial mucorales (Mucoromycotina).

    Science.gov (United States)

    Schwartze, Volker U; Winter, Sascha; Shelest, Ekaterina; Marcet-Houben, Marina; Horn, Fabian; Wehner, Stefanie; Linde, Jörg; Valiante, Vito; Sammeth, Michael; Riege, Konstantin; Nowrousian, Minou; Kaerger, Kerstin; Jacobsen, Ilse D; Marz, Manja; Brakhage, Axel A; Gabaldón, Toni; Böcker, Sebastian; Voigt, Kerstin

    2014-08-01

    Lichtheimia species are the second most important cause of mucormycosis in Europe. To provide broader insights into the molecular basis of the pathogenicity-associated traits of the basal Mucorales, we report the full genome sequence of L. corymbifera and compared it to the genome of Rhizopus oryzae, the most common cause of mucormycosis worldwide. The genome assembly encompasses 33.6 MB and 12,379 protein-coding genes. This study reveals four major differences of the L. corymbifera genome to R. oryzae: (i) the presence of an highly elevated number of gene duplications which are unlike R. oryzae not due to whole genome duplication (WGD), (ii) despite the relatively high incidence of introns, alternative splicing (AS) is not frequently observed for the generation of paralogs and in response to stress, (iii) the content of repetitive elements is strikingly low (<5%), (iv) L. corymbifera is typically haploid. Novel virulence factors were identified which may be involved in the regulation of the adaptation to iron-limitation, e.g. LCor01340.1 encoding a putative siderophore transporter and LCor00410.1 involved in the siderophore metabolism. Genes encoding the transcription factors LCor08192.1 and LCor01236.1, which are similar to GATA type regulators and to calcineurin regulated CRZ1, respectively, indicating an involvement of the calcineurin pathway in the adaption to iron limitation. Genes encoding MADS-box transcription factors are elevated up to 11 copies compared to the 1-4 copies usually found in other fungi. More findings are: (i) lower content of tRNAs, but unique codons in L. corymbifera, (ii) Over 25% of the proteins are apparently specific for L. corymbifera. (iii) L. corymbifera contains only 2/3 of the proteases (known to be essential virulence factors) in comparison to R. oryzae. On the other hand, the number of secreted proteases, however, is roughly twice as high as in R. oryzae.

  16. An analysis of radiation-induced damage in the spider mite. Relationship between mortality of haploid and diploid eggs in two successive generations

    International Nuclear Information System (INIS)

    Leenhouts, H.P.; Chadwick, K.H.

    1976-01-01

    Unfertilized females of the spider mite (Tetranychus urticae) produce only haploid eggs which develop into a haploid male. Fertilized females produce both haploid eggs (unfertilized), which develop into males, and diploid eggs (fertilized), which develop into females. Radiobiological experiments performed by A.M. Feldmann (Association Euratom-ITAL) made data available on the radiation-induced mortality of haploid and diploid eggs in the F 1 and F 2 generation following irradiation of either males or females with X rays or fast neutrons. The data have been analysed using the molecular theory of cell survival where it is assumed that DNA double strand breaks, induced randomly in the cell, are the critical radiation-induced lesions, which lead to cell death. Theoretical relationships are derived for the dose dependence of hatchability in haploid and diploid eggs in the first and second generations expressed as a function of the radiation damage in the parental genome. These theoretical relationships can be used to derive the inter-relationship between the different hatchabilities, and the results from the spider mite have been analysed using these considerations. It is concluded that the radiation-induced genetic damage arises from one type of initial lesion. The eventual radiobiological implications of this analysis are discussed, expecially with respect to the transmittance of radiation-induced genetic damage after low-level radiation. (author)

  17. Rare copy number alterations and copy-neutral loss of heterozygosity revealed in ameloblastomas by high-density whole-genome microarray analysis.

    Science.gov (United States)

    Diniz, Marina Gonçalves; Duarte, Alessandra Pires; Villacis, Rolando A; Guimarães, Bruna V A; Duarte, Luiz Cláudio Pires; Rogatto, Sílvia R; Gomez, Ricardo Santiago; Gomes, Carolina Cavaliéri

    2017-05-01

    Ameloblastoma (unicystic, UA, or multicystic, MA) is a rare tumor associated with bone destruction and facial deformity. Its malignant counterpart is the ameloblastic carcinoma (AC). The BRAFV600E mutation is highly prevalent in all these tumors subtypes and cannot account for their different clinical behaviors. We assessed copy number alterations (CNAs) and copy-neutral loss of heterozygosity (cnLOH) in UA (n = 2), MA (n = 3), and AC (n = 1) using the CytoScan HD Array (Affymetrix) and the BRAFV600E status. RT-qPCR was applied in four selected genes (B4GALT1, BAG1, PKD1L2, and PPP2R5A) covered by rare alterations, also including three MA and four normal oral tissues. Fifty-seven CNAs and cnLOH were observed in the ameloblastomas and six CNAs in the AC. Seven of the CNAs were rare (six in UA and one in MA), four of them encompassing genes (gains of 7q11.21, 1q32.3, and 9p21.1 and loss of 16q23.2). We found positive correlation between rare CNA gene dosage and the expression of B4GALT1, BAG1, PKD1L2, and PPP2R5A. The AC and 1 UA were BRAF wild-type; however, this UA showed rare genomic alterations encompassing genes associated with RAF/MAPK activation. Ameloblastomas show rare CNAs and cnLOH, presenting a specific genomic profile with no overlapping of the rare alterations among UA, MA, and AC. These genomic changes might play a role in tumor evolution and in BRAFV600E-negative tumors. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  18. Construction of a nurse shark (Ginglymostoma cirratum bacterial artificial chromosome (BAC library and a preliminary genome survey

    Directory of Open Access Journals (Sweden)

    Inoko Hidetoshi

    2006-05-01

    Full Text Available Abstract Background Sharks are members of the taxonomic class Chondrichthyes, the oldest living jawed vertebrates. Genomic studies of this group, in comparison to representative species in other vertebrate taxa, will allow us to theorize about the fundamental genetic, developmental, and functional characteristics in the common ancestor of all jawed vertebrates. Aims In order to obtain mapping and sequencing data for comparative genomics, we constructed a bacterial artificial chromosome (BAC library for the nurse shark, Ginglymostoma cirratum. Results The BAC library consists of 313,344 clones with an average insert size of 144 kb, covering ~4.5 × 1010 bp and thus providing an 11-fold coverage of the haploid genome. BAC end sequence analyses revealed, in addition to LINEs and SINEs commonly found in other animal and plant genomes, two new groups of nurse shark-specific repetitive elements, NSRE1 and NSRE2 that seem to be major components of the nurse shark genome. Screening the library with single-copy or multi-copy gene probes showed 6–28 primary positive clones per probe of which 50–90% were true positives, demonstrating that the BAC library is representative of the different regions of the nurse shark genome. Furthermore, some BAC clones contained multiple genes, making physical mapping feasible. Conclusion We have constructed a deep-coverage, high-quality, large insert, and publicly available BAC library for a cartilaginous fish. It will be very useful to the scientific community interested in shark genomic structure, comparative genomics, and functional studies. We found two new groups of repetitive elements specific to the nurse shark genome, which may contribute to the architecture and evolution of the nurse shark genome.

  19. Construction of a nurse shark (Ginglymostoma cirratum) bacterial artificial chromosome (BAC) library and a preliminary genome survey.

    Science.gov (United States)

    Luo, Meizhong; Kim, Hyeran; Kudrna, Dave; Sisneros, Nicholas B; Lee, So-Jeong; Mueller, Christopher; Collura, Kristi; Zuccolo, Andrea; Buckingham, E Bryan; Grim, Suzanne M; Yanagiya, Kazuyo; Inoko, Hidetoshi; Shiina, Takashi; Flajnik, Martin F; Wing, Rod A; Ohta, Yuko

    2006-05-03

    Sharks are members of the taxonomic class Chondrichthyes, the oldest living jawed vertebrates. Genomic studies of this group, in comparison to representative species in other vertebrate taxa, will allow us to theorize about the fundamental genetic, developmental, and functional characteristics in the common ancestor of all jawed vertebrates. In order to obtain mapping and sequencing data for comparative genomics, we constructed a bacterial artificial chromosome (BAC) library for the nurse shark, Ginglymostoma cirratum. The BAC library consists of 313,344 clones with an average insert size of 144 kb, covering ~4.5 x 1010 bp and thus providing an 11-fold coverage of the haploid genome. BAC end sequence analyses revealed, in addition to LINEs and SINEs commonly found in other animal and plant genomes, two new groups of nurse shark-specific repetitive elements, NSRE1 and NSRE2 that seem to be major components of the nurse shark genome. Screening the library with single-copy or multi-copy gene probes showed 6-28 primary positive clones per probe of which 50-90% were true positives, demonstrating that the BAC library is representative of the different regions of the nurse shark genome. Furthermore, some BAC clones contained multiple genes, making physical mapping feasible. We have constructed a deep-coverage, high-quality, large insert, and publicly available BAC library for a cartilaginous fish. It will be very useful to the scientific community interested in shark genomic structure, comparative genomics, and functional studies. We found two new groups of repetitive elements specific to the nurse shark genome, which may contribute to the architecture and evolution of the nurse shark genome.

  20. Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects

    DEFF Research Database (Denmark)

    Marshall, Christian R.; Howrigan, Daniel P.; Merico, Daniele

    2017-01-01

    Copy number variants (CNVs) have been strongly implicated in the genetic etiology of schizophrenia (SCZ). However, genome-wide investigation of the contribution of CNV to risk has been hampered by limited sample sizes. We sought to address this obstacle by applying a centralized analysis pipeline...

  1. Genome Wide Distributions and Functional Characterization of Copy Number Variations between Chinese and Western Pigs.

    Directory of Open Access Journals (Sweden)

    Hongyang Wang

    Full Text Available Copy number variations (CNVs refer to large insertions, deletions and duplications in the genomic structure ranging from one thousand to several million bases in size. Since the development of next generation sequencing technology, several methods have been well built for detection of copy number variations with high credibility and accuracy. Evidence has shown that CNV occurring in gene region could lead to phenotypic changes due to the alteration in gene structure and dosage. However, it still remains unexplored whether CNVs underlie the phenotypic differences between Chinese and Western domestic pigs. Based on the read-depth methods, we investigated copy number variations using 49 individuals derived from both Chinese and Western pig breeds. A total of 3,131 copy number variation regions (CNVRs were identified with an average size of 13.4 Kb in all individuals during domestication, harboring 1,363 genes. Among them, 129 and 147 CNVRs were Chinese and Western pig specific, respectively. Gene functional enrichments revealed that these CNVRs contribute to strong disease resistance and high prolificacy in Chinese domestic pigs, but strong muscle tissue development in Western domestic pigs. This finding is strongly consistent with the morphologic characteristics of Chinese and Western pigs, indicating that these group-specific CNVRs might have been preserved by artificial selection for the favored phenotypes during independent domestication of Chinese and Western pigs. In this study, we built high-resolution CNV maps in several domestic pig breeds and discovered the group specific CNVs by comparing Chinese and Western pigs, which could provide new insight into genomic variations during pigs' independent domestication, and facilitate further functional studies of CNV-associated genes.

  2. A review of methods for the production of haploids in seed plants

    International Nuclear Information System (INIS)

    Brunkener, L.

    1974-01-01

    This paper is a review of the methods which have been tried to produce haploids in seed plants. Because of their unique genetic constitution the haploids provide useful material for the study of various fundamental cytological and genetic problems. Thus the diploidization of haploids in conifers and subsequent intercrossing might allow the use of the same breeding technique as those which have been used with great success in maize breeding. Induced parthenogenesis in angiosperms has led to the formation of haploids in at least 25 genera but the number of plants obtained is generally too low for a practical utilization. The most promising method in angiosperms is in vitro cultivation of anthers which has hitherto yielded haploids in relatively large numbers in at least l5 genera but certainly can be applied with success in many more. Isolation of protoplasts or whole cells of diploid and haploid tissues and regeneration of entire plants from these protoplasts or cells have been accomplished in a few angiosperm genera. In gymnosperms all attempts to produce complete haploid plants have up to now failed. In this group there is possibly a relationship between polyembryonic seeds and the presence of small, haploid embryos. Therefore, the in vitro cultivation of small embryos of such seeds may result in the formation of independent haploid plants. In vitro cultivation of whole microsporangia or pollen has led to the production of calli in a few gymnosperm genera and in one case roots have even been induced. In some gymnosperms female gametophytes cultured on nutrient medium have yielded calli and in a few genera even root-like organs or small seedlings have arisen. (author)

  3. Integrative analysis of genome-wide gene copy number changes and gene expression in non-small cell lung cancer.

    Directory of Open Access Journals (Sweden)

    Verena Jabs

    Full Text Available Non-small cell lung cancer (NSCLC represents a genomically unstable cancer type with extensive copy number aberrations. The relationship of gene copy number alterations and subsequent mRNA levels has only fragmentarily been described. The aim of this study was to conduct a genome-wide analysis of gene copy number gains and corresponding gene expression levels in a clinically well annotated NSCLC patient cohort (n = 190 and their association with survival. While more than half of all analyzed gene copy number-gene expression pairs showed statistically significant correlations (10,296 of 18,756 genes, high correlations, with a correlation coefficient >0.7, were obtained only in a subset of 301 genes (1.6%, including KRAS, EGFR and MDM2. Higher correlation coefficients were associated with higher copy number and expression levels. Strong correlations were frequently based on few tumors with high copy number gains and correspondingly increased mRNA expression. Among the highly correlating genes, GO groups associated with posttranslational protein modifications were particularly frequent, including ubiquitination and neddylation. In a meta-analysis including 1,779 patients we found that survival associated genes were overrepresented among highly correlating genes (61 of the 301 highly correlating genes, FDR adjusted p<0.05. Among them are the chaperone CCT2, the core complex protein NUP107 and the ubiquitination and neddylation associated protein CAND1. In conclusion, in a comprehensive analysis we described a distinct set of highly correlating genes. These genes were found to be overrepresented among survival-associated genes based on gene expression in a large collection of publicly available datasets.

  4. SVA retrotransposon insertion-associated deletion represents a novel mutational mechanism underlying large genomic copy number changes with non-recurrent breakpoints

    Science.gov (United States)

    2014-01-01

    Background Genomic disorders are caused by copy number changes that may exhibit recurrent breakpoints processed by nonallelic homologous recombination. However, region-specific disease-associated copy number changes have also been observed which exhibit non-recurrent breakpoints. The mechanisms underlying these non-recurrent copy number changes have not yet been fully elucidated. Results We analyze large NF1 deletions with non-recurrent breakpoints as a model to investigate the full spectrum of causative mechanisms, and observe that they are mediated by various DNA double strand break repair mechanisms, as well as aberrant replication. Further, two of the 17 NF1 deletions with non-recurrent breakpoints, identified in unrelated patients, occur in association with the concomitant insertion of SINE/variable number of tandem repeats/Alu (SVA) retrotransposons at the deletion breakpoints. The respective breakpoints are refractory to analysis by standard breakpoint-spanning PCRs and are only identified by means of optimized PCR protocols designed to amplify across GC-rich sequences. The SVA elements are integrated within SUZ12P intron 8 in both patients, and were mediated by target-primed reverse transcription of SVA mRNA intermediates derived from retrotranspositionally active source elements. Both SVA insertions occurred during early postzygotic development and are uniquely associated with large deletions of 1 Mb and 867 kb, respectively, at the insertion sites. Conclusions Since active SVA elements are abundant in the human genome and the retrotranspositional activity of many SVA source elements is high, SVA insertion-associated large genomic deletions encompassing many hundreds of kilobases could constitute a novel and as yet under-appreciated mechanism underlying large-scale copy number changes in the human genome. PMID:24958239

  5. Characterization of a new high copy Stowaway family MITE, BRAMI-1 in Brassica genome

    Science.gov (United States)

    2013-01-01

    Background Miniature inverted-repeat transposable elements (MITEs) are expected to play important roles in evolution of genes and genome in plants, especially in the highly duplicated plant genomes. Various MITE families and their roles in plants have been characterized. However, there have been fewer studies of MITE families and their potential roles in evolution of the recently triplicated Brassica genome. Results We identified a new MITE family, BRAMI-1, belonging to the Stowaway super-family in the Brassica genome. In silico mapping revealed that 697 members are dispersed throughout the euchromatic regions of the B. rapa pseudo-chromosomes. Among them, 548 members (78.6%) are located in gene-rich regions, less than 3 kb from genes. In addition, we identified 516 and 15 members in the 470 Mb and 15 Mb genomic shotgun sequences currently available for B. oleracea and B. napus, respectively. The resulting estimated copy numbers for the entire genomes were 1440, 1464 and 2490 in B. rapa, B. oleracea and B. napus, respectively. Concurrently, only 70 members of the related Arabidopsis ATTIRTA-1 MITE family were identified in the Arabidopsis genome. Phylogenetic analysis revealed that BRAMI-1 elements proliferated in the Brassica genus after divergence from the Arabidopsis lineage. MITE insertion polymorphism (MIP) was inspected for 50 BRAMI-1 members, revealing high levels of insertion polymorphism between and within species of Brassica that clarify BRAMI-1 activation periods up to the present. Comparative analysis of the 71 genes harbouring the BRAMI-1 elements with their non-insertion paralogs (NIPs) showed that the BRAMI-1 insertions mainly reside in non-coding sequences and that the expression levels of genes with the elements differ from those of their NIPs. Conclusion A Stowaway family MITE, named as BRAMI-1, was gradually amplified and remained present in over than 1400 copies in each of three Brassica species. Overall, 78% of the members were identified in

  6. Novel type of linear mitochondrial genomes with dual flip-flop inversion system in apicomplexan parasites, Babesia microti and Babesia rodhaini

    Directory of Open Access Journals (Sweden)

    Hikosaka Kenji

    2012-11-01

    Full Text Available Abstract Background Mitochondrial (mt genomes vary considerably in size, structure and gene content. The mt genomes of the phylum Apicomplexa, which includes important human pathogens such as the malaria parasite Plasmodium, also show marked diversity of structure. Plasmodium has a concatenated linear mt genome of the smallest size (6-kb; Babesia and Theileria have a linear monomeric mt genome (6.5-kb to 8.2-kb with terminal inverted repeats; Eimeria, which is distantly related to Plasmodium and Babesia/Theileria, possesses a mt genome (6.2-kb with a concatemeric form similar to that of Plasmodium; Cryptosporidium, the earliest branching lineage within the phylum Apicomplexa, has no mt genome. We are interested in the evolutionary origin of linear mt genomes of Babesia/Theileria, and have investigated mt genome structures in members of archaeopiroplasmid, a lineage branched off earlier from Babesia/Theileria. Results The complete mt genomes of archaeopiroplasmid parasites, Babesia microti and Babesia rodhaini, were sequenced. The mt genomes of B. microti (11.1-kb and B. rodhaini (6.9-kb possess two pairs of unique inverted repeats, IR-A and IR-B. Flip-flop inversions between two IR-As and between two IR-Bs appear to generate four distinct genome structures that are present at an equi-molar ratio. An individual parasite contained multiple mt genome structures, with 20 copies and 2 – 3 copies per haploid nuclear genome in B. microti and B. rodhaini, respectively. Conclusion We found a novel linear monomeric mt genome structure of B. microti and B. rhodhaini equipped with dual flip-flop inversion system, by which four distinct genome structures are readily generated. To our knowledge, this study is the first to report the presence of two pairs of distinct IR sequences within a monomeric linear mt genome. The present finding provides insight into further understanding of evolution of mt genome structure.

  7. Single-Cell-Based Platform for Copy Number Variation Profiling through Digital Counting of Amplified Genomic DNA Fragments.

    Science.gov (United States)

    Li, Chunmei; Yu, Zhilong; Fu, Yusi; Pang, Yuhong; Huang, Yanyi

    2017-04-26

    We develop a novel single-cell-based platform through digital counting of amplified genomic DNA fragments, named multifraction amplification (mfA), to detect the copy number variations (CNVs) in a single cell. Amplification is required to acquire genomic information from a single cell, while introducing unavoidable bias. Unlike prevalent methods that directly infer CNV profiles from the pattern of sequencing depth, our mfA platform denatures and separates the DNA molecules from a single cell into multiple fractions of a reaction mix before amplification. By examining the sequencing result of each fraction for a specific fragment and applying a segment-merge maximum likelihood algorithm to the calculation of copy number, we digitize the sequencing-depth-based CNV identification and thus provide a method that is less sensitive to the amplification bias. In this paper, we demonstrate a mfA platform through multiple displacement amplification (MDA) chemistry. When performing the mfA platform, the noise of MDA is reduced; therefore, the resolution of single-cell CNV identification can be improved to 100 kb. We can also determine the genomic region free of allelic drop-out with mfA platform, which is impossible for conventional single-cell amplification methods.

  8. Assessing genome-wide copy number variation in the Han Chinese population.

    Science.gov (United States)

    Lu, Jianqi; Lou, Haiyi; Fu, Ruiqing; Lu, Dongsheng; Zhang, Feng; Wu, Zhendong; Zhang, Xi; Li, Changhua; Fang, Baijun; Pu, Fangfang; Wei, Jingning; Wei, Qian; Zhang, Chao; Wang, Xiaoji; Lu, Yan; Yan, Shi; Yang, Yajun; Jin, Li; Xu, Shuhua

    2017-10-01

    Copy number variation (CNV) is a valuable source of genetic diversity in the human genome and a well-recognised cause of various genetic diseases. However, CNVs have been considerably under-represented in population-based studies, particularly the Han Chinese which is the largest ethnic group in the world. To build a representative CNV map for the Han Chinese population. We conducted a genome-wide CNV study involving 451 male Han Chinese samples from 11 geographical regions encompassing 28 dialect groups, representing a less-biased panel compared with the currently available data. We detected CNVs by using 4.2M NimbleGen comparative genomic hybridisation array and whole-genome deep sequencing of 51 samples to optimise the filtering conditions in CNV discovery. A comprehensive Han Chinese CNV map was built based on a set of high-quality variants (positive predictive value >0.8, with sizes ranging from 369 bp to 4.16 Mb and a median of 5907 bp). The map consists of 4012 CNV regions (CNVRs), and more than half are novel to the 30 East Asian CNV Project and the 1000 Genomes Project Phase 3. We further identified 81 CNVRs specific to regional groups, which was indicative of the subpopulation structure within the Han Chinese population. Our data are complementary to public data sources, and the CNV map may facilitate in the identification of pathogenic CNVs and further biomedical research studies involving the Han Chinese population. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  9. Molecular cloning and expression of full-length DNA copies of the genomic RNAs of cowpea mosaic virus

    NARCIS (Netherlands)

    Vos, P.

    1987-01-01

    The experiments described in this thesis were designed to unravel various aspects of the mechanism of gene expression of cowpea mosaic virus (CPMV). For this purpose full-length DNA copies of both genomic RNAs of CPMV were constructed. Using powerful invitro

  10. A quantitative PCR approach for determining the ribosomal DNA copy number in the genome of Agave tequila Weber

    Directory of Open Access Journals (Sweden)

    Jorge Rubio-Piña

    2016-07-01

    Conclusions: Results show that the proposed method a can correctly detect the rDNA copy number, b could be used as species-specific markers and c might help in understanding the genetic diversity, genome organization and evolution of this species.

  11. Evaluasi keseragaaman, keragaman, dan kestabilan karakter agronomi galur-galur padi haploid ganda hasil kultur antera

    Directory of Open Access Journals (Sweden)

    PRIATNA SASMITA

    2011-05-01

    Full Text Available Sasmita P. 2011. Evaluasi keseragaman, keragaman, dan kestabilan karakter agronomi galur-galur padi haploid ganda hasil kultur antera. Bioteknologi 8: 10-17. Pembentukan galur haploid ganda dalam kultur antera bertujuan untuk mempercepat perolehan galur murni. Seleksi karakter yang diinginkan dapat dilakukan langsung terhadap progeni hasil kultur antera pada generasi awal. Percobaan ini bertujuan untuk mengetahui karakteristik agronomi, keseragaman, dan kestabilan galur haploid ganda, serta mendapatkan putatif galur-gallur haploid ganda sebagai bahan evaluasi lebih lanjut untuk mendapatkan galur harapan. Percobaan pertama menggunakan rancangan acak lengkap diulang lima kali. Perlakuannya aadalah 111 galur haploid ganda hasil kultur antera generasi pertama (DH1. Percobaan kedua menggunakan rancangan petak terpisah dengan perlakuan petak utama adalah galur haploid ganda hasil kultur antera dan perlakuan anak petaknya generasi galur haploid ganda kedua (DH2 hingga kelima (DDH5. Hasil percobaan menunjukkan bahwa setiap tanaman dalam galur yang sama memiliki karakter agronnomi seragam, sedangkan tanaman antar galur berbeda memiliki karakter agronomi beragam. Hasil evaluasi lebih lanjut terhadap tiga dari 111 galur haploid ganda yang berasal dari generasi kedua hingga kelima menunjukkan tidak terdapat perbedaan karakter antar generasi untuk setiap p galur yang sama. Hasil penelitian tersebut menunjukkan pula bahwa karakteristik agronomi galur haploid ganda stabil dari generasi ke generasi.

  12. Maximization of Markers Linked in Coupling for Tetraploid Potatoes via Monoparental Haploids

    Directory of Open Access Journals (Sweden)

    Annette M. Bartkiewicz

    2018-05-01

    Full Text Available Haploid potato populations derived from a single tetraploid donor constitute an efficient strategy to analyze markers segregating from a single donor genotype. Analysis of marker segregation in populations derived from crosses between polysomic tetraploids is complicated by a maximum of eight segregating alleles, multiple dosages of the markers and problems related to linkage analysis of marker segregation in repulsion. Here, we present data on two monoparental haploid populations generated by prickle pollination of two tetraploid cultivars with Solanum phureja and genotyped with the 12.8 k SolCAP single nucleotide polymorphism (SNP array. We show that in a population of monoparental haploids, the number of biallelic SNP markers segregating in linkage to loci from the tetraploid donor genotype is much larger than in putative crosses of this genotype to a diverse selection of 125 tetraploid cultivars. Although this strategy is more laborious than conventional breeding, the generation of haploid progeny for efficient marker analysis is straightforward if morphological markers and flow cytometry are utilized to select true haploid progeny. The level of introgressed fragments from S. phureja, the haploid inducer, is very low, supporting its suitability for genetic analysis. Mapping with single-dose markers allowed the analysis of quantitative trait loci (QTL for four phenotypic traits.

  13. NU-IN: Nucleotide evolution and input module for the EvolSimulator genome simulation platform

    Directory of Open Access Journals (Sweden)

    Barker Michael S

    2010-08-01

    Full Text Available Abstract Background There is increasing demand to test hypotheses that contrast the evolution of genes and gene families among genomes, using simulations that work across these levels of organization. The EvolSimulator program was developed recently to provide a highly flexible platform for forward simulations of amino acid evolution in multiple related lineages of haploid genomes, permitting copy number variation and lateral gene transfer. Synonymous nucleotide evolution is not currently supported, however, and would be highly advantageous for comparisons to full genome, transcriptome, and single nucleotide polymorphism (SNP datasets. In addition, EvolSimulator creates new genomes for each simulation, and does not allow the input of user-specified sequences and gene family information, limiting the incorporation of further biological realism and/or user manipulations of the data. Findings We present modified C++ source code for the EvolSimulator platform, which we provide as the extension module NU-IN. With NU-IN, synonymous and non-synonymous nucleotide evolution is fully implemented, and the user has the ability to use real or previously-simulated sequence data to initiate a simulation of one or more lineages. Gene family membership can be optionally specified, as well as gene retention probabilities that model biased gene retention. We provide PERL scripts to assist the user in deriving this information from previous simulations. We demonstrate the features of NU-IN by simulating genome duplication (polyploidy in the presence of ongoing copy number variation in an evolving lineage. This example is initiated with real genomic data, and produces output that we analyse directly with existing bioinformatic pipelines. Conclusions The NU-IN extension module is a publicly available open source software (GNU GPLv3 license extension to EvolSimulator. With the NU-IN module, users are now able to simulate both drift and selection at the nucleotide

  14. High Quality Genomic Copy Number Data from Archival Formalin-Fixed Paraffin-Embedded Leiomyosarcoma: Optimisation of Universal Linkage System Labelling

    Science.gov (United States)

    Salawu, Abdulazeez; Ul-Hassan, Aliya; Hammond, David; Fernando, Malee; Reed, Malcolm; Sisley, Karen

    2012-01-01

    Most soft tissue sarcomas are characterized by genetic instability and frequent genomic copy number aberrations that are not subtype-specific. Oligonucleotide microarray-based Comparative Genomic Hybridisation (array CGH) is an important technique used to map genome-wide copy number aberrations, but the traditional requirement for high-quality DNA typically obtained from fresh tissue has limited its use in sarcomas. Although large archives of Formalin-fixed Paraffin-embedded (FFPE) tumour samples are available for research, the degradative effects of formalin on DNA from these tissues has made labelling and analysis by array CGH technically challenging. The Universal Linkage System (ULS) may be used for a one-step chemical labelling of such degraded DNA. We have optimised the ULS labelling protocol to perform aCGH on archived FFPE leiomyosarcoma tissues using the 180k Agilent platform. Preservation age of samples ranged from a few months to seventeen years and the DNA showed a wide range of degradation (when visualised on agarose gels). Consistently high DNA labelling efficiency and low microarray probe-to-probe variation (as measured by the derivative log ratio spread) was seen. Comparison of paired fresh and FFPE samples from identical tumours showed good correlation of CNAs detected. Furthermore, the ability to macro-dissect FFPE samples permitted the detection of CNAs that were masked in fresh tissue. Aberrations were visually confirmed using Fluorescence in situ Hybridisation. These results suggest that archival FFPE tissue, with its relative abundance and attendant clinical data may be used for effective mapping for genomic copy number aberrations in such rare tumours as leiomyosarcoma and potentially unravel clues to tumour origins, progression and ultimately, targeted treatment. PMID:23209738

  15. Discrimination of haploid and diploid maize kernels via multispectral imaging

    DEFF Research Database (Denmark)

    De La Fuente, Gerald N.; Carstensen, Jens Michael; Adsetts Edberg Hansen, Michael

    2017-01-01

    sorting of haploids would increase the efficiency of DH line development. In this study, six inbred lines were crossed with the maternal haploid inducer ‘RWS/RWK-76’ and a sample of seed was sorted manually for each line. Using the VideometerLab 3 system, spectral imaging techniques were applied...

  16. Genome-wide analysis of macrosatellite repeat copy number variation in worldwide populations: Evidence for differences and commonalities in size distributions and size restrictions

    NARCIS (Netherlands)

    M. Schaap (Michiel); R.J.L.F. Lemmers (Richard); R. Maassen (Roel); P.J. van der Vliet (Patrick); L.F. Hoogerheide (Lennart); H.K. van Dijk (Herman); N. Basturk (Nalan); P. de Knijff (Peter); S.M. van der Maarel (Silvère)

    2013-01-01

    textabstractBackground: Macrosatellite repeats (MSRs), usually spanning hundreds of kilobases of genomic DNA, comprise a significant proportion of the human genome. Because of their highly polymorphic nature, MSRs represent an extreme example of copy number variation, but their structure and

  17. Microarray MAPH: accurate array-based detection of relative copy number in genomic DNA

    Directory of Open Access Journals (Sweden)

    Chan Alan

    2006-06-01

    Full Text Available Abstract Background Current methods for measurement of copy number do not combine all the desirable qualities of convenience, throughput, economy, accuracy and resolution. In this study, to improve the throughput associated with Multiplex Amplifiable Probe Hybridisation (MAPH we aimed to develop a modification based on the 3-Dimensional, Flow-Through Microarray Platform from PamGene International. In this new method, electrophoretic analysis of amplified products is replaced with photometric analysis of a probed oligonucleotide array. Copy number analysis of hybridised probes is based on a dual-label approach by comparing the intensity of Cy3-labelled MAPH probes amplified from test samples co-hybridised with similarly amplified Cy5-labelled reference MAPH probes. The key feature of using a hybridisation-based end point with MAPH is that discrimination of amplified probes is based on sequence and not fragment length. Results In this study we showed that microarray MAPH measurement of PMP22 gene dosage correlates well with PMP22 gene dosage determined by capillary MAPH and that copy number was accurately reported in analyses of DNA from 38 individuals, 12 of which were known to have Charcot-Marie-Tooth disease type 1A (CMT1A. Conclusion Measurement of microarray-based endpoints for MAPH appears to be of comparable accuracy to electrophoretic methods, and holds the prospect of fully exploiting the potential multiplicity of MAPH. The technology has the potential to simplify copy number assays for genes with a large number of exons, or of expanded sets of probes from dispersed genomic locations.

  18. Microarray MAPH: accurate array-based detection of relative copy number in genomic DNA.

    Science.gov (United States)

    Gibbons, Brian; Datta, Parikkhit; Wu, Ying; Chan, Alan; Al Armour, John

    2006-06-30

    Current methods for measurement of copy number do not combine all the desirable qualities of convenience, throughput, economy, accuracy and resolution. In this study, to improve the throughput associated with Multiplex Amplifiable Probe Hybridisation (MAPH) we aimed to develop a modification based on the 3-Dimensional, Flow-Through Microarray Platform from PamGene International. In this new method, electrophoretic analysis of amplified products is replaced with photometric analysis of a probed oligonucleotide array. Copy number analysis of hybridised probes is based on a dual-label approach by comparing the intensity of Cy3-labelled MAPH probes amplified from test samples co-hybridised with similarly amplified Cy5-labelled reference MAPH probes. The key feature of using a hybridisation-based end point with MAPH is that discrimination of amplified probes is based on sequence and not fragment length. In this study we showed that microarray MAPH measurement of PMP22 gene dosage correlates well with PMP22 gene dosage determined by capillary MAPH and that copy number was accurately reported in analyses of DNA from 38 individuals, 12 of which were known to have Charcot-Marie-Tooth disease type 1A (CMT1A). Measurement of microarray-based endpoints for MAPH appears to be of comparable accuracy to electrophoretic methods, and holds the prospect of fully exploiting the potential multiplicity of MAPH. The technology has the potential to simplify copy number assays for genes with a large number of exons, or of expanded sets of probes from dispersed genomic locations.

  19. The Number of Genomic Copies at the 16p11.2 Locus Modulates Language, Verbal Memory, and Inhibition.

    Science.gov (United States)

    Hippolyte, Loyse; Maillard, Anne M; Rodriguez-Herreros, Borja; Pain, Aurélie; Martin-Brevet, Sandra; Ferrari, Carina; Conus, Philippe; Macé, Aurélien; Hadjikhani, Nouchine; Metspalu, Andres; Reigo, Anu; Kolk, Anneli; Männik, Katrin; Barker, Mandy; Isidor, Bertrand; Le Caignec, Cédric; Mignot, Cyril; Schneider, Laurence; Mottron, Laurent; Keren, Boris; David, Albert; Doco-Fenzy, Martine; Gérard, Marion; Bernier, Raphael; Goin-Kochel, Robin P; Hanson, Ellen; Green Snyder, LeeAnne; Ramus, Franck; Beckmann, Jacques S; Draganski, Bogdan; Reymond, Alexandre; Jacquemont, Sébastien

    2016-07-15

    Deletions and duplications of the 16p11.2 BP4-BP5 locus are prevalent copy number variations (CNVs), highly associated with autism spectrum disorder and schizophrenia. Beyond language and global cognition, neuropsychological assessments of these two CNVs have not yet been reported. This study investigates the relationship between the number of genomic copies at the 16p11.2 locus and cognitive domains assessed in 62 deletion carriers, 44 duplication carriers, and 71 intrafamilial control subjects. IQ is decreased in deletion and duplication carriers, but we demonstrate contrasting cognitive profiles in these reciprocal CNVs. Deletion carriers present with severe impairments of phonology and of inhibition skills beyond what is expected for their IQ level. In contrast, for verbal memory and phonology, the data may suggest that duplication carriers outperform intrafamilial control subjects with the same IQ level. This finding is reminiscent of special isolated skills as well as contrasting language performance observed in autism spectrum disorder. Some domains, such as visuospatial and working memory, are unaffected by the 16p11.2 locus beyond the effect of decreased IQ. Neuroimaging analyses reveal that measures of inhibition covary with neuroanatomic structures previously identified as sensitive to 16p11.2 CNVs. The simultaneous study of reciprocal CNVs suggests that the 16p11.2 genomic locus modulates specific cognitive skills according to the number of genomic copies. Further research is warranted to replicate these findings and elucidate the molecular mechanisms modulating these cognitive performances. Copyright © 2016 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.

  20. Isolation of BAC Clones Containing Conserved Genes from Libraries of Three Distantly Related Moths: A Useful Resource for Comparative Genomics of Lepidoptera

    Directory of Open Access Journals (Sweden)

    Yuji Yasukochi

    2011-01-01

    Full Text Available Lepidoptera, butterflies and moths, is the second largest animal order and includes numerous agricultural pests. To facilitate comparative genomics in Lepidoptera, we isolated BAC clones containing conserved and putative single-copy genes from libraries of three pests, Heliothis virescens, Ostrinia nubilalis, and Plutella xylostella, harboring the haploid chromosome number, =31, which are not closely related with each other or with the silkworm, Bombyx mori, (=28, the sequenced model lepidopteran. A total of 108–184 clones representing 101–182 conserved genes were isolated for each species. For 79 genes, clones were isolated from more than two species, which will be useful as common markers for analysis using fluorescence in situ hybridization (FISH, as well as for comparison of genome sequence among multiple species. The PCR-based clone isolation method presented here is applicable to species which lack a sequenced genome but have a significant collection of cDNA or EST sequences.

  1. Probabilistic quantitative microbial risk assessment model of norovirus from wastewater irrigated vegetables in Ghana using genome copies and fecal indicator ratio conversion for estimating exposure dose.

    Science.gov (United States)

    Owusu-Ansah, Emmanuel de-Graft Johnson; Sampson, Angelina; Amponsah, Samuel K; Abaidoo, Robert C; Dalsgaard, Anders; Hald, Tine

    2017-12-01

    The need to replace the commonly applied fecal indicator conversions ratio (an assumption of 1:10 -5 virus to fecal indicator organism) in Quantitative Microbial Risk Assessment (QMRA) with models based on quantitative data on the virus of interest has gained prominence due to the different physical and environmental factors that might influence the reliability of using indicator organisms in microbial risk assessment. The challenges facing analytical studies on virus enumeration (genome copies or particles) have contributed to the already existing lack of data in QMRA modelling. This study attempts to fit a QMRA model to genome copies of norovirus data. The model estimates the risk of norovirus infection from the intake of vegetables irrigated with wastewater from different sources. The results were compared to the results of a corresponding model using the fecal indicator conversion ratio to estimate the norovirus count. In all scenarios of using different water sources, the application of the fecal indicator conversion ratio underestimated the norovirus disease burden, measured by the Disability Adjusted Life Years (DALYs), when compared to results using the genome copies norovirus data. In some cases the difference was >2 orders of magnitude. All scenarios using genome copies met the 10 -4 DALY per person per year for consumption of vegetables irrigated with wastewater, although these results are considered to be highly conservative risk estimates. The fecal indicator conversion ratio model of stream-water and drain-water sources of wastewater achieved the 10 -6 DALY per person per year threshold, which tends to indicate an underestimation of health risk when compared to using genome copies for estimating the dose. Copyright © 2017 Elsevier B.V. All rights reserved.

  2. Genome-wide analysis of macrosatellite repeat copy number variation in worldwide populations: evidence for differences and commonalities in size distributions and size restrictions

    NARCIS (Netherlands)

    Schaap, M.; Lemmers, R.J.L.F.; Maassen, R.; van der Vliet, P.J.; Hoogerheide, L.F.; van Dijk, H.K.; Basturk, N.; de Knijff, P.; van der Maarel, S.M.

    2013-01-01

    Background: Macrosatellite repeats (MSRs), usually spanning hundreds of kilobases of genomic DNA, comprise a significant proportion of the human genome. Because of their highly polymorphic nature, MSRs represent an extreme example of copy number variation, but their structure and function is largely

  3. Gene expansion shapes genome architecture in the human pathogen Lichtheimia corymbifera: an evolutionary genomics analysis in the ancient terrestrial mucorales (Mucoromycotina.

    Directory of Open Access Journals (Sweden)

    Volker U Schwartze

    2014-08-01

    Full Text Available Lichtheimia species are the second most important cause of mucormycosis in Europe. To provide broader insights into the molecular basis of the pathogenicity-associated traits of the basal Mucorales, we report the full genome sequence of L. corymbifera and compared it to the genome of Rhizopus oryzae, the most common cause of mucormycosis worldwide. The genome assembly encompasses 33.6 MB and 12,379 protein-coding genes. This study reveals four major differences of the L. corymbifera genome to R. oryzae: (i the presence of an highly elevated number of gene duplications which are unlike R. oryzae not due to whole genome duplication (WGD, (ii despite the relatively high incidence of introns, alternative splicing (AS is not frequently observed for the generation of paralogs and in response to stress, (iii the content of repetitive elements is strikingly low (<5%, (iv L. corymbifera is typically haploid. Novel virulence factors were identified which may be involved in the regulation of the adaptation to iron-limitation, e.g. LCor01340.1 encoding a putative siderophore transporter and LCor00410.1 involved in the siderophore metabolism. Genes encoding the transcription factors LCor08192.1 and LCor01236.1, which are similar to GATA type regulators and to calcineurin regulated CRZ1, respectively, indicating an involvement of the calcineurin pathway in the adaption to iron limitation. Genes encoding MADS-box transcription factors are elevated up to 11 copies compared to the 1-4 copies usually found in other fungi. More findings are: (i lower content of tRNAs, but unique codons in L. corymbifera, (ii Over 25% of the proteins are apparently specific for L. corymbifera. (iii L. corymbifera contains only 2/3 of the proteases (known to be essential virulence factors in comparison to R. oryzae. On the other hand, the number of secreted proteases, however, is roughly twice as high as in R. oryzae.

  4. Event-specific qualitative and quantitative PCR detection of the GMO carnation (Dianthus caryophyllus) variety Moonlite based upon the 5'-transgene integration sequence.

    Science.gov (United States)

    Li, P; Jia, J W; Jiang, L X; Zhu, H; Bai, L; Wang, J B; Tang, X M; Pan, A H

    2012-04-27

    To ensure the implementation of genetically modified organism (GMO)-labeling regulations, an event-specific detection method was developed based on the junction sequence of an exogenous integrant in the transgenic carnation variety Moonlite. The 5'-transgene integration sequence was isolated by thermal asymmetric interlaced PCR. Based upon the 5'-transgene integration sequence, the event-specific primers and TaqMan probe were designed to amplify the fragments, which spanned the exogenous DNA and carnation genomic DNA. Qualitative and quantitative PCR assays were developed employing the designed primers and probe. The detection limit of the qualitative PCR assay was 0.05% for Moonlite in 100 ng total carnation genomic DNA, corresponding to about 79 copies of the carnation haploid genome; the limit of detection and quantification of the quantitative PCR assay were estimated to be 38 and 190 copies of haploid carnation genomic DNA, respectively. Carnation samples with different contents of genetically modified components were quantified and the bias between the observed and true values of three samples were lower than the acceptance criterion (GMO detection method. These results indicated that these event-specific methods would be useful for the identification and quantification of the GMO carnation Moonlite.

  5. Evaluation of the Cow Rumen Metagenome: Assembly by Single Copy Gene Analysis and Single Cell Genome Assemblies (Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Energy Technology Data Exchange (ETDEWEB)

    Sczyrba, Alex

    2011-10-13

    DOE JGI's Alex Sczyrba on "Evaluation of the Cow Rumen Metagenome" and "Assembly by Single Copy Gene Analysis and Single Cell Genome Assemblies" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  6. [Analysis of genomic copy number variations in two sisters with primary amenorrhea and hyperandrogenism].

    Science.gov (United States)

    Zhang, Yanliang; Xu, Qiuyue; Cai, Xuemei; Li, Yixun; Song, Guibo; Wang, Juan; Zhang, Rongchen; Dai, Yong; Duan, Yong

    2015-12-01

    To analyze genomic copy number variations (CNVs) in two sisters with primary amenorrhea and hyperandrogenism. G-banding was performed for karyotype analysis. The whole genome of the two sisters were scanned and analyzed by array-based comparative genomic hybridization (array-CGH). The results were confirmed with real-time quantitative PCR (RT-qPCR). No abnormality was found by conventional G-banded chromosome analysis. Array-CGH has identified 11 identical CNVs from the sisters which, however, overlapped with CNVs reported by the Database of Genomic Variants (http://projects.tcag.ca/variation/). Therefore, they are likely to be benign. In addition, a -8.44 Mb 9p11.1-p13.1 duplication (38,561,587-47,002,387 bp, hg18) and a -80.9 kb 4q13.2 deletion (70,183,990-70,264,889 bp, hg18) were also detected in the elder and younger sister, respectively. The relationship between such CNVs and primary amenorrhea and hyperandrogenism was however uncertain. RT-qPCR results were in accordance with array-CGH. Two CNVs were detected in two sisters by array-CGH, for which further studies are needed to clarify their correlation with primary amenorrhea and hyperandrogenism.

  7. Expression of the hepatitis B virus genome in chronic hepatitis B carriers and patients with hepatocellular carcinoma

    International Nuclear Information System (INIS)

    Bowyer, S.M.; Dusheiko, G.M.; Schoub, B.D.; Kew, M.C.

    1987-01-01

    The authors examined the methylation status of CCGG sites in hepatitis B virus (HBV) DNA to determine whether methylation could be responsible for the selective expression of the HBV surface gene in chronic hepatitis B infection and hepatocellular carcinoma. Infected liver tissue from patients with low levels of viral replication was analyzed for HBV DNA copy number per haploid cell genome. Total cellular DNA, with sufficient HBV DNA, was digested with the restriction endonucleases Msp I and Hpa II, to determine whether the HBV DNA was methylated, or HindIII, to determine whether the HBV DNA was integrated or episomal. The cleavage fragments were analyzed by Southern blotting and hybridization to 32 P-labeled HBV DNA. In replicative chronic hepatitis B, hypomethylation of the HBV genome correlated with HBV expression in both virions and infected tissue. In carriers with nonreplicative infection, it was difficult to ascertain the role of methylation as copy number was low. HBV DNA copy number was also low in 17 out of 29 of the rumor tissues tested and as many as 14 out of 16 of the adjacent non-neoplastic tissues tested. Integrated sequences were hypermethylated in the PLC/PRF/5 cell line and in six of the tumor tissues suggesting that methylation plays a role in HBV gene repression. However, since DNA from five other tumors was hypomethylated, the belief that methylation per se is an absolute determinant of HBV core gene repression does not hold for human hepatocellular carcinoma tissue

  8. Variable Copy Number, Intra-Genomic Heterogeneities and Lateral Transfers of the 16S rRNA Gene in Pseudomonas

    Science.gov (United States)

    Bodilis, Josselin; Nsigue-Meilo, Sandrine; Besaury, Ludovic; Quillet, Laurent

    2012-01-01

    Even though the 16S rRNA gene is the most commonly used taxonomic marker in microbial ecology, its poor resolution is still not fully understood at the intra-genus level. In this work, the number of rRNA gene operons, intra-genomic heterogeneities and lateral transfers were investigated at a fine-scale resolution, throughout the Pseudomonas genus. In addition to nineteen sequenced Pseudomonas strains, we determined the 16S rRNA copy number in four other Pseudomonas strains by Southern hybridization and Pulsed-Field Gel Electrophoresis, and studied the intra-genomic heterogeneities by Denaturing Gradient Gel Electrophoresis and sequencing. Although the variable copy number (from four to seven) seems to be correlated with the evolutionary distance, some close strains in the P. fluorescens lineage showed a different number of 16S rRNA genes, whereas all the strains in the P. aeruginosa lineage displayed the same number of genes (four copies). Further study of the intra-genomic heterogeneities revealed that most of the Pseudomonas strains (15 out of 19 strains) had at least two different 16S rRNA alleles. A great difference (5 or 19 nucleotides, essentially grouped near the V1 hypervariable region) was observed only in two sequenced strains. In one of our strains studied (MFY30 strain), we found a difference of 12 nucleotides (grouped in the V3 hypervariable region) between copies of the 16S rRNA gene. Finally, occurrence of partial lateral transfers of the 16S rRNA gene was further investigated in 1803 full-length sequences of Pseudomonas available in the databases. Remarkably, we found that the two most variable regions (the V1 and V3 hypervariable regions) had probably been laterally transferred from another evolutionary distant Pseudomonas strain for at least 48.3 and 41.6% of the 16S rRNA sequences, respectively. In conclusion, we strongly recommend removing these regions of the 16S rRNA gene during the intra-genus diversity studies. PMID:22545126

  9. Plasticity of the Leishmania genome leading to gene copy number variations and drug resistance [version 1; referees: 5 approved

    Directory of Open Access Journals (Sweden)

    Marie-Claude N. Laffitte

    2016-09-01

    Full Text Available Leishmania has a plastic genome, and drug pressure can select for gene copy number variation (CNV. CNVs can apply either to whole chromosomes, leading to aneuploidy, or to specific genomic regions. For the latter, the amplification of chromosomal regions occurs at the level of homologous direct or inverted repeated sequences leading to extrachromosomal circular or linear amplified DNAs. This ability of Leishmania to respond to drug pressure by CNVs has led to the development of genomic screens such as Cos-Seq, which has the potential of expediting the discovery of drug targets for novel promising drug candidates.

  10. The evolution of sex chromosomes in organisms with separate haploid sexes.

    Science.gov (United States)

    Immler, Simone; Otto, Sarah Perin

    2015-03-01

    The evolution of dimorphic sex chromosomes is driven largely by the evolution of reduced recombination and the subsequent accumulation of deleterious mutations. Although these processes are increasingly well understood in diploid organisms, the evolution of dimorphic sex chromosomes in haploid organisms (U/V) has been virtually unstudied theoretically. We analyze a model to investigate the evolution of linkage between fitness loci and the sex-determining region in U/V species. In a second step, we test how prone nonrecombining regions are to degeneration due to accumulation of deleterious mutations. Our modeling predicts that the decay of recombination on the sex chromosomes and the addition of strata via fusions will be just as much a part of the evolution of haploid sex chromosomes as in diploid sex chromosome systems. Reduced recombination is broadly favored, as long as there is some fitness difference between haploid males and females. The degeneration of the sex-determining region due to the accumulation of deleterious mutations is expected to be slower in haploid organisms because of the absence of masking. Nevertheless, balancing selection often drives greater differentiation between the U/V sex chromosomes than in X/Y and Z/W systems. We summarize empirical evidence for haploid sex chromosome evolution and discuss our predictions in light of these findings. © 2015 The Author(s).

  11. Asexual populations of the human malaria parasite, Plasmodium falciparum, use a two-step genomic strategy to acquire accurate, beneficial DNA amplifications.

    Directory of Open Access Journals (Sweden)

    Jennifer L Guler

    Full Text Available Malaria drug resistance contributes to up to a million annual deaths. Judicious deployment of new antimalarials and vaccines could benefit from an understanding of early molecular events that promote the evolution of parasites. Continuous in vitro challenge of Plasmodium falciparum parasites with a novel dihydroorotate dehydrogenase (DHODH inhibitor reproducibly selected for resistant parasites. Genome-wide analysis of independently-derived resistant clones revealed a two-step strategy to evolutionary success. Some haploid blood-stage parasites first survive antimalarial pressure through fortuitous DNA duplications that always included the DHODH gene. Independently-selected parasites had different sized amplification units but they were always flanked by distant A/T tracks. Higher level amplification and resistance was attained using a second, more efficient and more accurate, mechanism for head-to-tail expansion of the founder unit. This second homology-based process could faithfully tune DNA copy numbers in either direction, always retaining the unique DNA amplification sequence from the original A/T-mediated duplication for that parasite line. Pseudo-polyploidy at relevant genomic loci sets the stage for gaining additional mutations at the locus of interest. Overall, we reveal a population-based genomic strategy for mutagenesis that operates in human stages of P. falciparum to efficiently yield resistance-causing genetic changes at the correct locus in a successful parasite. Importantly, these founding events arise with precision; no other new amplifications are seen in the resistant haploid blood stage parasite. This minimizes the need for meiotic genetic cleansing that can only occur in sexual stage development of the parasite in mosquitoes.

  12. Copy Number Variation in the Horse Genome

    Science.gov (United States)

    Ghosh, Sharmila; Qu, Zhipeng; Das, Pranab J.; Fang, Erica; Juras, Rytis; Cothran, E. Gus; McDonell, Sue; Kenney, Daniel G.; Lear, Teri L.; Adelson, David L.; Chowdhary, Bhanu P.; Raudsepp, Terje

    2014-01-01

    We constructed a 400K WG tiling oligoarray for the horse and applied it for the discovery of copy number variations (CNVs) in 38 normal horses of 16 diverse breeds, and the Przewalski horse. Probes on the array represented 18,763 autosomal and X-linked genes, and intergenic, sub-telomeric and chrY sequences. We identified 258 CNV regions (CNVRs) across all autosomes, chrX and chrUn, but not in chrY. CNVs comprised 1.3% of the horse genome with chr12 being most enriched. American Miniature horses had the highest and American Quarter Horses the lowest number of CNVs in relation to Thoroughbred reference. The Przewalski horse was similar to native ponies and draft breeds. The majority of CNVRs involved genes, while 20% were located in intergenic regions. Similar to previous studies in horses and other mammals, molecular functions of CNV-associated genes were predominantly in sensory perception, immunity and reproduction. The findings were integrated with previous studies to generate a composite genome-wide dataset of 1476 CNVRs. Of these, 301 CNVRs were shared between studies, while 1174 were novel and require further validation. Integrated data revealed that to date, 41 out of over 400 breeds of the domestic horse have been analyzed for CNVs, of which 11 new breeds were added in this study. Finally, the composite CNV dataset was applied in a pilot study for the discovery of CNVs in 6 horses with XY disorders of sexual development. A homozygous deletion involving AKR1C gene cluster in chr29 in two affected horses was considered possibly causative because of the known role of AKR1C genes in testicular androgen synthesis and sexual development. While the findings improve and integrate the knowledge of CNVs in horses, they also show that for effective discovery of variants of biomedical importance, more breeds and individuals need to be analyzed using comparable methodological approaches. PMID:25340504

  13. Copy number variation in the horse genome.

    Directory of Open Access Journals (Sweden)

    Sharmila Ghosh

    2014-10-01

    Full Text Available We constructed a 400K WG tiling oligoarray for the horse and applied it for the discovery of copy number variations (CNVs in 38 normal horses of 16 diverse breeds, and the Przewalski horse. Probes on the array represented 18,763 autosomal and X-linked genes, and intergenic, sub-telomeric and chrY sequences. We identified 258 CNV regions (CNVRs across all autosomes, chrX and chrUn, but not in chrY. CNVs comprised 1.3% of the horse genome with chr12 being most enriched. American Miniature horses had the highest and American Quarter Horses the lowest number of CNVs in relation to Thoroughbred reference. The Przewalski horse was similar to native ponies and draft breeds. The majority of CNVRs involved genes, while 20% were located in intergenic regions. Similar to previous studies in horses and other mammals, molecular functions of CNV-associated genes were predominantly in sensory perception, immunity and reproduction. The findings were integrated with previous studies to generate a composite genome-wide dataset of 1476 CNVRs. Of these, 301 CNVRs were shared between studies, while 1174 were novel and require further validation. Integrated data revealed that to date, 41 out of over 400 breeds of the domestic horse have been analyzed for CNVs, of which 11 new breeds were added in this study. Finally, the composite CNV dataset was applied in a pilot study for the discovery of CNVs in 6 horses with XY disorders of sexual development. A homozygous deletion involving AKR1C gene cluster in chr29 in two affected horses was considered possibly causative because of the known role of AKR1C genes in testicular androgen synthesis and sexual development. While the findings improve and integrate the knowledge of CNVs in horses, they also show that for effective discovery of variants of biomedical importance, more breeds and individuals need to be analyzed using comparable methodological approaches.

  14. Generation of genetically modified mice using CRISPR/Cas9 and haploid embryonic stem cell systems

    Directory of Open Access Journals (Sweden)

    Li-Fang JIN

    2016-07-01

    Full Text Available With the development of high-throughput sequencing technology in the post-genomic era, researchers have concentrated their efforts on elucidating the relationships between genes and their corresponding functions. Recently, important progress has been achieved in the generation of genetically modified mice based on CRISPR/Cas9 and haploid embryonic stem cell (haESC approaches, which provide new platforms for gene function analysis, human disease modeling, and gene therapy. Here, we review the CRISPR/Cas9 and haESC technology for the generation of genetically modified mice and discuss the key challenges in the application of these approaches.

  15. A mutation in the centriole-associated protein centrin causes genomic instability via increased chromosome loss in Chlamydomonas reinhardtii

    Directory of Open Access Journals (Sweden)

    Marshall Wallace F

    2005-05-01

    Full Text Available Abstract Background The role of centrioles in mitotic spindle function remains unclear. One approach to investigate mitotic centriole function is to ask whether mutation of centriole-associated proteins can cause genomic instability. Results We addressed the role of the centriole-associated EF-hand protein centrin in genomic stability using a Chlamydomonas reinhardtii centrin mutant that forms acentriolar bipolar spindles and lacks the centrin-based rhizoplast structures that join centrioles to the nucleus. Using a genetic assay for loss of heterozygosity, we found that this centrin mutant showed increased genomic instability compared to wild-type cells, and we determined that the increase in genomic instability was due to a 100-fold increase in chromosome loss rates compared to wild type. Live cell imaging reveals an increased rate in cell death during G1 in haploid cells that is consistent with an elevated rate of chromosome loss, and analysis of cell death versus centriole copy number argues against a role for multipolar spindles in this process. Conclusion The increased chromosome loss rates observed in a centrin mutant that forms acentriolar spindles suggests a role for centrin protein, and possibly centrioles, in mitotic fidelity.

  16. Use of intergeneric cross for production of doubled haploid wheat (triticum aestivum l.)

    International Nuclear Information System (INIS)

    Khan, M.A.; Shaukat, S.; Kashif, M.; Khan, A.S.

    2012-01-01

    The main purpose of conventional breeding or hybridisation is to bring about homozygosity, for which 6 to 7 years may be required. Wheat and maize crosses have proved to be more efficient in DH lines production than anther culture methods, because of its lower genetic specificity. Doubled haploid technique facilitates the development of homozygous plants within one generation. The system is developed through haploid production, followed by chromosome doubling, to produce homozygous plants in a single generation. For doubled haploid production method wheat and maize crossing system is better than anther culture and ovule culture because maize pollens are highly responsive and produce stable progeny population. Wheat is being used as female parent and maize as a male parent for the production of doubled haploid. Moreover, Silver Nitrate (AgNO/sub 3/) in tiller culture media can improve the frequency of haploid embryo production in this crossing system. Our result showed that DH production through wheat and maize crossing system was proved to be time saving (2 years) as compared to other conventional breeding methods (6 years). (author)

  17. Genomic copy number analysis of Chernobyl papillary thyroid carcinoma in the Ukrainian–American Cohort

    Science.gov (United States)

    Selmansberger, Martin; Braselmann, Herbert; Hess, Julia; Bogdanova, Tetiana; Abend, Michael; Tronko, Mykola; Brenner, Alina; Zitzelsberger, Horst; Unger, Kristian

    2015-01-01

    One of the major consequences of the 1986 Chernobyl reactor accident was a dramatic increase in papillary thyroid carcinoma (PTC) incidence, predominantly in patients exposed to the radioiodine fallout at young age. The present study is the first on genomic copy number alterations (CNAs) of PTCs of the Ukrainian–American cohort (UkrAm) generated by array comparative genomic hybridization (aCGH). Unsupervised hierarchical clustering of CNA profiles revealed a significant enrichment of a subgroup of patients with female gender, long latency (>17 years) and negative lymph node status. Further, we identified single CNAs that were significantly associated with latency, gender, radiation dose and BRAF V600E mutation status. Multivariate analysis revealed no interactions but additive effects of parameters gender, latency and dose on CNAs. The previously identified radiation-associated gain of the chromosomal bands 7q11.22-11.23 was present in 29% of cases. Moreover, comparison of our radiation-associated PTC data set with the TCGA data set on sporadic PTCs revealed altered copy numbers of the tumor driver genes NF2 and CHEK2. Further, we integrated the CNA data with transcriptomic data that were available on a subset of the herein analyzed cohort and did not find statistically significant associations between the two molecular layers. However, applying hierarchical clustering on a ‘BRAF-like/RAS-like’ transcriptome signature split the cases into four groups, one of which containing all BRAF-positive cases validating the signature in an independent data set. PMID:26320103

  18. Measurement of locus copy number by hybridisation with amplifiable probes

    Science.gov (United States)

    Armour, John A. L.; Sismani, Carolina; Patsalis, Philippos C.; Cross, Gareth

    2000-01-01

    Despite its fundamental importance in genome analysis, it is only recently that systematic approaches have been developed to assess copy number at specific genetic loci, or to examine genomic DNA for submicroscopic deletions of unknown location. In this report we show that short probes can be recovered and amplified quantitatively following hybridisation to genomic DNA. This simple observation forms the basis of a new approach to determining locus copy number in complex genomes. The power and specificity of multiplex amplifiable probe hybridisation is demonstrated by the simultaneous assessment of copy number at a set of 40 human loci, including detection of deletions causing Duchenne muscular dystrophy and Prader–Willi/Angelman syndromes. Assembly of other probe sets will allow novel, technically simple approaches to a wide variety of genetic analyses, including the potential for extension to high resolution genome-wide screens for deletions and amplifications. PMID:10606661

  19. Measurement of locus copy number by hybridisation with amplifiable probes.

    Science.gov (United States)

    Armour, J A; Sismani, C; Patsalis, P C; Cross, G

    2000-01-15

    Despite its fundamental importance in genome analysis, it is only recently that systematic approaches have been developed to assess copy number at specific genetic loci, or to examine genomic DNA for submicro-scopic deletions of unknown location. In this report we show that short probes can be recovered and amplified quantitatively following hybridisation to genomic DNA. This simple observation forms the basis of a new approach to determining locus copy number in complex genomes. The power and specificity of multiplex amplifiable probe hybridisation is demonstrated by the simultaneous assessment of copy number at a set of 40 human loci, including detection of deletions causing Duchenne muscular dystrophy and Prader-Willi/Angelman syndromes. Assembly of other probe sets will allow novel, technically simple approaches to a wide variety of genetic analyses, including the potential for extension to high resolution genome-wide screens for deletions and amplifications.

  20. Genome-wide association study identifies a maternal copy-number deletion in PSG11 enriched among preeclampsia patients

    Directory of Open Access Journals (Sweden)

    Zhao Linlu

    2012-06-01

    Full Text Available Abstract Background Specific genetic contributions for preeclampsia (PE are currently unknown. This genome-wide association study (GWAS aims to identify maternal single nucleotide polymorphisms (SNPs and copy-number variants (CNVs involved in the etiology of PE. Methods A genome-wide scan was performed on 177 PE cases (diagnosed according to National Heart, Lung and Blood Institute guidelines and 116 normotensive controls. White female study subjects from Iowa were genotyped on Affymetrix SNP 6.0 microarrays. CNV calls made using a combination of four detection algorithms (Birdseye, Canary, PennCNV, and QuantiSNP were merged using CNVision and screened with stringent prioritization criteria. Due to limited DNA quantities and the deleterious nature of copy-number deletions, it was decided a priori that only deletions would be selected for assay on the entire case-control dataset using quantitative real-time PCR. Results The top four SNP candidates had an allelic or genotypic p-value between 10-5 and 10-6, however, none surpassed the Bonferroni-corrected significance threshold. Three recurrent rare deletions meeting prioritization criteria detected in multiple cases were selected for targeted genotyping. A locus of particular interest was found showing an enrichment of case deletions in 19q13.31 (5/169 cases and 1/114 controls, which encompasses the PSG11 gene contiguous to a highly plastic genomic region. All algorithm calls for these regions were assay confirmed. Conclusions CNVs may confer risk for PE and represent interesting regions that warrant further investigation. Top SNP candidates identified from the GWAS, although not genome-wide significant, may be useful to inform future studies in PE genetics.

  1. Two circular chromosomes of unequal copy number make up the mitochondrial genome of the rotifer Brachionus plicatilis.

    Science.gov (United States)

    Suga, Koushirou; Mark Welch, David B; Tanaka, Yukari; Sakakura, Yoshitaka; Hagiwara, Atsushi

    2008-06-01

    The monogonont rotifer Brachionus plicatilis is an emerging model system for a diverse array of questions in limnological ecosystem dynamics, the evolution of sexual recombination, cryptic speciation, and the phylogeny of basal metazoans. We sequenced the complete mitochondrial genome of B. plicatilis sensu strictu NH1L and found that it is composed of 2 circular chromosomes, designated mtDNA-I (11,153 bp) and mtDNA-II (12,672 bp). Hybridization to DNA isolated from mitochondria demonstrated that mtDNA-I is present at 4 times the copy number of mtDNA-II. The only nucleotide similarity between the 2 chromosomes is a 4.9-kbp region of 99.5% identity including a transfer RNA (tRNA) gene and an extensive noncoding region that contains putative D-loop and control sequence. The mtDNA-I chromosome encodes 4 proteins (ATP6, COB, NAD1, and NAD2), 13 tRNAs, and the large and small subunit ribosomal RNAs; mtDNA-II encodes 8 proteins (COX1-3, NAD3-6, and NAD4L) and 9 tRNAs. Gene order is not conserved between B. plicatilis and its closest relative with a sequenced mitochondrial genome, the acanthocephalan Leptorhynchoides thecatus, or other sequenced mitochondrial genomes. Polymerase chain reaction assays and Southern hybridization to DNA from 18 strains of Brachionus suggest that the 2-chromosome structure has been stable for millions of years. The novel organization of the B. plicatilis mitochondrial genome into 2 nearly equal chromosomes of 4-fold different copy number may provide insight into the evolution of metazoan mitochondria and the phylogenetics of rotifers and other basal animal phyla.

  2. The distribution and impact of common copy-number variation in the genome of the domesticated apple, Malus x domestica Borkh.

    Science.gov (United States)

    Boocock, James; Chagné, David; Merriman, Tony R; Black, Michael A

    2015-10-23

    Copy number variation (CNV) is a common feature of eukaryotic genomes, and a growing body of evidence suggests that genes affected by CNV are enriched in processes that are associated with environmental responses. Here we use next generation sequence (NGS) data to detect copy-number variable regions (CNVRs) within the Malus x domestica genome, as well as to examine their distribution and impact. CNVRs were detected using NGS data derived from 30 accessions of M. x domestica analyzed using the read-depth method, as implemented in the CNVrd2 software. To improve the reliability of our results, we developed a quality control and analysis procedure that involved checking for organelle DNA, not repeat masking, and the determination of CNVR identity using a permutation testing procedure. Overall, we identified 876 CNVRs, which spanned 3.5 % of the apple genome. To verify that detected CNVRs were not artifacts, we analyzed the B- allele-frequencies (BAF) within a single nucleotide polymorphism (SNP) array dataset derived from a screening of 185 individual apple accessions and found the CNVRs were enriched for SNPs having aberrant BAFs (P apple scab. We present the first analysis and catalogue of CNVRs in the M. x domestica genome. The enrichment of the CNVRs with R gene models and their overlap with gene loci of agricultural significance draw attention to a form of unexplored genetic variation in apple. This research will underpin further investigation of the role that CNV plays within the apple genome.

  3. Copy number and loss of heterozygosity detected by SNP array of formalin-fixed tissues using whole-genome amplification.

    Directory of Open Access Journals (Sweden)

    Angela Stokes

    Full Text Available The requirement for large amounts of good quality DNA for whole-genome applications prohibits their use for small, laser capture micro-dissected (LCM, and/or rare clinical samples, which are also often formalin-fixed and paraffin-embedded (FFPE. Whole-genome amplification of DNA from these samples could, potentially, overcome these limitations. However, little is known about the artefacts introduced by amplification of FFPE-derived DNA with regard to genotyping, and subsequent copy number and loss of heterozygosity (LOH analyses. Using a ligation adaptor amplification method, we present data from a total of 22 Affymetrix SNP 6.0 experiments, using matched paired amplified and non-amplified DNA from 10 LCM FFPE normal and dysplastic oral epithelial tissues, and an internal method control. An average of 76.5% of SNPs were called in both matched amplified and non-amplified DNA samples, and concordance was a promising 82.4%. Paired analysis for copy number, LOH, and both combined, showed that copy number changes were reduced in amplified DNA, but were 99.5% concordant when detected, amplifications were the changes most likely to be 'missed', only 30% of non-amplified LOH changes were identified in amplified pairs, and when copy number and LOH are combined ∼50% of gene changes detected in the unamplified DNA were also detected in the amplified DNA and within these changes, 86.5% were concordant for both copy number and LOH status. However, there are also changes introduced as ∼20% of changes in the amplified DNA are not detected in the non-amplified DNA. An integrative network biology approach revealed that changes in amplified DNA of dysplastic oral epithelium localize to topologically critical regions of the human protein-protein interaction network, suggesting their functional implication in the pathobiology of this disease. Taken together, our results support the use of amplification of FFPE-derived DNA, provided sufficient samples are used

  4. Whole-genome copy number variation analysis in anophthalmia and microphthalmia.

    Science.gov (United States)

    Schilter, K F; Reis, L M; Schneider, A; Bardakjian, T M; Abdul-Rahman, O; Kozel, B A; Zimmerman, H H; Broeckel, U; Semina, E V

    2013-11-01

    Anophthalmia/microphthalmia (A/M) represent severe developmental ocular malformations. Currently, mutations in known genes explain less than 40% of A/M cases. We performed whole-genome copy number variation analysis in 60 patients affected with isolated or syndromic A/M. Pathogenic deletions of 3q26 (SOX2) were identified in four independent patients with syndromic microphthalmia. Other variants of interest included regions with a known role in human disease (likely pathogenic) as well as novel rearrangements (uncertain significance). A 2.2-Mb duplication of 3q29 in a patient with non-syndromic anophthalmia and an 877-kb duplication of 11p13 (PAX6) and a 1.4-Mb deletion of 17q11.2 (NF1) in two independent probands with syndromic microphthalmia and other ocular defects were identified; while ocular anomalies have been previously associated with 3q29 duplications, PAX6 duplications, and NF1 mutations in some cases, the ocular phenotypes observed here are more severe than previously reported. Three novel regions of possible interest included a 2q14.2 duplication which cosegregated with microphthalmia/microcornea and congenital cataracts in one family, and 2q21 and 15q26 duplications in two additional cases; each of these regions contains genes that are active during vertebrate ocular development. Overall, this study identified causative copy number mutations and regions with a possible role in ocular disease in 17% of A/M cases. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  5. Complete mitochondrial genome of endangered Yellow-shouldered Amazon (Amazona barbadensis): two control region copies in parrot species of the Amazona genus.

    Science.gov (United States)

    Urantowka, Adam Dawid; Hajduk, Kacper; Kosowska, Barbara

    2013-08-01

    Amazona barbadensis is an endangered species of parrot living in northern coastal Venezuela and in several Caribbean islands. In this study, we sequenced full mitochondrial genome of the considered species. The total length of the mitogenome was 18,983 bp and contained 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, duplicated control region, and degenerate copies of ND6 and tRNA (Glu) genes. High degree of identity between two copies of control region suggests their coincident evolution and functionality. Comparative analysis of both the control region sequences from four Amazona species revealed their 89.1% identity over a region of 1300 bp and indicates the presence of distinctive parts of two control region copies.

  6. Genome-wide assessment of the association of rare and common copy number variations to testicular germ cell cancer

    DEFF Research Database (Denmark)

    Edsgard, Stefan Daniel; Dalgaard, Marlene Danner; Weinhold, Nils

    2013-01-01

    Testicular germ cell cancer (TGCC) is one of the most heritable forms of cancer. Previous genome-wide association studies have focused on single nucleotide polymorphisms, largely ignoring the influence of copy number variants (CNVs). Here we present a genome-wide study of CNV on a cohort of 212...... of rare CNVs related to cell migration (false-discovery rate = 0.021, 1.8% of cases and 1.1% of controls). Dysregulation during migration of primordial germ cells has previously been suspected to be a part of TGCC development and this set of multiple rare variants may thereby have a minor contribution...

  7. Inferring Variation in Copy Number Using High Throughput Sequencing Data in R.

    Science.gov (United States)

    Knaus, Brian J; Grünwald, Niklaus J

    2018-01-01

    Inference of copy number variation presents a technical challenge because variant callers typically require the copy number of a genome or genomic region to be known a priori . Here we present a method to infer copy number that uses variant call format (VCF) data as input and is implemented in the R package vcfR . This method is based on the relative frequency of each allele (in both genic and non-genic regions) sequenced at heterozygous positions throughout a genome. These heterozygous positions are summarized by using arbitrarily sized windows of heterozygous positions, binning the allele frequencies, and selecting the bin with the greatest abundance of positions. This provides a non-parametric summary of the frequency that alleles were sequenced at. The method is applicable to organisms that have reference genomes that consist of full chromosomes or sub-chromosomal contigs. In contrast to other software designed to detect copy number variation, our method does not rely on an assumption of base ploidy, but instead infers it. We validated these approaches with the model system of Saccharomyces cerevisiae and applied it to the oomycete Phytophthora infestans , both known to vary in copy number. This functionality has been incorporated into the current release of the R package vcfR to provide modular and flexible methods to investigate copy number variation in genomic projects.

  8. Real-time polymerase chain reaction assay for endogenous reference gene for specific detection and quantification of common wheat-derived DNA (Triticum aestivum L.).

    Science.gov (United States)

    Vautrin, Sonia; Zhang, David

    2007-01-01

    A species-specific endogenous reference gene system was developed for polymerase chain reaction (PCR)-based analysis in common wheat (Triticum aestivum L.) by targeting the ALMT1 gene, an aluminium-activated malate transporter. The primers and probe were elaborated for real-time PCR-based qualitative and quantitative assay. The size of amplified product is 95 base pairs. The specificity was assessed on 17 monocot and dicot plant species. The established real-time PCR assay amplified only T. aestivum-derived DNA; no amplification occurred on other phylogenetically related species, including durum wheat (T. durum). The robustness of the system was tested on the DNA of 15 common wheat cultivars using 20 000 genomic copies per PCR the mean cycle threshold (Ct) values of 24.02 +/- 0.251 were obtained. The absolute limits of detection and quantification of the real-time PCR assay were estimated to 2 and 20 haploid genome copies of common wheat, respectively. The linearity was experimentally validated on 2-fold serial dilutions of DNA from 650 to 20 000 haploid genome copies. All these results show that the real-time PCR assay developed on the ALMT1 gene is suitable to be used as an endogenous reference gene for PCR-based specific detection and quantification of T. aestivum-derived DNA in various applications, in particular for the detection and quantification of genetically modified materials in common wheat.

  9. International collaborative study of the endogenous reference gene, sucrose phosphate synthase (SPS), used for qualitative and quantitative analysis of genetically modified rice.

    Science.gov (United States)

    Jiang, Lingxi; Yang, Litao; Zhang, Haibo; Guo, Jinchao; Mazzara, Marco; Van den Eede, Guy; Zhang, Dabing

    2009-05-13

    One rice ( Oryza sativa ) gene, sucrose phosphate synthase (SPS), has been proven to be a suitable endogenous reference gene for genetically modified (GM) rice detection in a previous study. Herein are the reported results of an international collaborative ring trial for validation of the SPS gene as an endogenous reference gene and its optimized qualitative and quantitative polymerase chain reaction (PCR) systems. A total of 12 genetically modified organism (GMO) detection laboratories from seven countries participated in the ring trial and returned their results. The validated results confirmed the species specificity of the method through testing 10 plant genomic DNAs, low heterogeneity, and a stable single-copy number of the rice SPS gene among 7 indica varieties and 5 japonica varieties. The SPS qualitative PCR assay was validated with a limit of detection (LOD) of 0.1%, which corresponded to about 230 copies of haploid rice genomic DNA, while the limit of quantification (LOQ) for the quantitative PCR system was about 23 copies of haploid rice genomic DNA, with acceptable PCR efficiency and linearity. Furthermore, the bias between the test and true values of eight blind samples ranged from 5.22 to 26.53%. Thus, we believe that the SPS gene is suitable for use as an endogenous reference gene for the identification and quantification of GM rice and its derivates.

  10. Competition between the sperm of a single male can increase the evolutionary rate of haploid expressed genes.

    Science.gov (United States)

    Ezawa, Kiyoshi; Innan, Hideki

    2013-07-01

    The population genetic behavior of mutations in sperm genes is theoretically investigated. We modeled the processes at two levels. One is the standard population genetic process, in which the population allele frequencies change generation by generation, depending on the difference in selective advantages. The other is the sperm competition during each genetic transmission from one generation to the next generation. For the sperm competition process, we formulate the situation where a huge number of sperm with alleles A and B, produced by a single heterozygous male, compete to fertilize a single egg. This "minimal model" demonstrates that a very slight difference in sperm performance amounts to quite a large difference between the alleles' winning probabilities. By incorporating this effect of paternity-sharing sperm competition into the standard population genetic process, we show that fierce sperm competition can enhance the fixation probability of a mutation with a very small phenotypic effect at the single-sperm level, suggesting a contribution of sperm competition to rapid amino acid substitutions in haploid-expressed sperm genes. Considering recent genome-wide demonstrations that a substantial fraction of the mammalian sperm genes are haploid expressed, our model could provide a potential explanation of rapid evolution of sperm genes with a wide variety of functions (as long as they are expressed in the haploid phase). Another advantage of our model is that it is applicable to a wide range of species, irrespective of whether the species is externally fertilizing, polygamous, or monogamous. The theoretical result was applied to mammalian data to estimate the selection intensity on nonsynonymous mutations in sperm genes.

  11. An initial comparative map of copy number variations in the goat (Capra hircus genome

    Directory of Open Access Journals (Sweden)

    Casadio Rita

    2010-11-01

    Full Text Available Abstract Background The goat (Capra hircus represents one of the most important farm animal species. It is reared in all continents with an estimated world population of about 800 million of animals. Despite its importance, studies on the goat genome are still in their infancy compared to those in other farm animal species. Comparative mapping between cattle and goat showed only a few rearrangements in agreement with the similarity of chromosome banding. We carried out a cross species cattle-goat array comparative genome hybridization (aCGH experiment in order to identify copy number variations (CNVs in the goat genome analysing animals of different breeds (Saanen, Camosciata delle Alpi, Girgentana, and Murciano-Granadina using a tiling oligonucleotide array with ~385,000 probes designed on the bovine genome. Results We identified a total of 161 CNVs (an average of 17.9 CNVs per goat, with the largest number in the Saanen breed and the lowest in the Camosciata delle Alpi goat. By aggregating overlapping CNVs identified in different animals we determined CNV regions (CNVRs: on the whole, we identified 127 CNVRs covering about 11.47 Mb of the virtual goat genome referred to the bovine genome (0.435% of the latter genome. These 127 CNVRs included 86 loss and 41 gain and ranged from about 24 kb to about 1.07 Mb with a mean and median equal to 90,292 bp and 49,530 bp, respectively. To evaluate whether the identified goat CNVRs overlap with those reported in the cattle genome, we compared our results with those obtained in four independent cattle experiments. Overlapping between goat and cattle CNVRs was highly significant (P Conclusions We describe a first map of goat CNVRs. This provides information on a comparative basis with the cattle genome by identifying putative recurrent interspecies CNVs between these two ruminant species. Several goat CNVs affect genes with important biological functions. Further studies are needed to evaluate the

  12. Determination of beta-defensin genomic copy number in different populations

    DEFF Research Database (Denmark)

    Fode, Peder; Jespersgaard, Cathrine; Hardwick, Robert J

    2011-01-01

    There have been conflicting reports in the literature on association of gene copy number with disease, including CCL3L1 and HIV susceptibility, and ß-defensins and Crohn's disease. Quantification of precise gene copy numbers is important in order to define any association of gene copy number with...

  13. Characterization of the exogenous insert and development of event-specific PCR detection methods for genetically modified Huanong No. 1 papaya.

    Science.gov (United States)

    Guo, Jinchao; Yang, Litao; Liu, Xin; Guan, Xiaoyan; Jiang, Lingxi; Zhang, Dabing

    2009-08-26

    Genetically modified (GM) papaya (Carica papaya L.), Huanong No. 1, was approved for commercialization in Guangdong province, China in 2006, and the development of the Huanong No. 1 papaya detection method is necessary for implementing genetically modified organism (GMO) labeling regulations. In this study, we reported the characterization of the exogenous integration of GM Huanong No. 1 papaya by means of conventional polymerase chain reaction (PCR) and thermal asymmetric interlaced (TAIL)-PCR strategies. The results suggested that one intact copy of the initial construction was integrated in the papaya genome and which probably resulted in one deletion (38 bp in size) of the host genomic DNA. Also, one unintended insertion of a 92 bp truncated NptII fragment was observed at the 5' end of the exogenous insert. Furthermore, we revealed its 5' and 3' flanking sequences between the insert DNA and the papaya genomic DNA, and developed the event-specific qualitative and quantitative PCR assays for GM Huanong No. 1 papaya based on the 5' integration flanking sequence. The relative limit of detection (LOD) of the qualitative PCR assay was about 0.01% in 100 ng of total papaya genomic DNA, corresponding to about 25 copies of papaya haploid genome. In the quantitative PCR, the limits of detection and quantification (LOD and LOQ) were as low as 12.5 and 25 copies of papaya haploid genome, respectively. In practical sample quantification, the quantified biases between the test and true values of three samples ranged from 0.44% to 4.41%. Collectively, we proposed that all of these results are useful for the identification and quantification of Huanong No. 1 papaya and its derivates.

  14. Creation of BAC genomic resources for cocoa ( Theobroma cacao L.) for physical mapping of RGA containing BAC clones.

    Science.gov (United States)

    Clément, D; Lanaud, C; Sabau, X; Fouet, O; Le Cunff, L; Ruiz, E; Risterucci, A M; Glaszmann, J C; Piffanelli, P

    2004-05-01

    We have constructed and validated the first cocoa ( Theobroma cacao L.) BAC library, with the aim of developing molecular resources to study the structure and evolution of the genome of this perennial crop. This library contains 36,864 clones with an average insert size of 120 kb, representing approximately ten haploid genome equivalents. It was constructed from the genotype Scavina-6 (Sca-6), a Forastero clone highly resistant to cocoa pathogens and a parent of existing mapping populations. Validation of the BAC library was carried out with a set of 13 genetically-anchored single copy and one duplicated markers. An average of nine BAC clones per probe was identified, giving an initial experimental estimation of the genome coverage represented in the library. Screening of the library with a set of resistance gene analogues (RGAs), previously mapped in cocoa and co-localizing with QTL for resistance to Phytophthora traits, confirmed at the physical level the tight clustering of RGAs in the cocoa genome and provided the first insights into the relationships between genetic and physical distances in the cocoa genome. This library represents an available BAC resource for structural genomic studies or map-based cloning of genes corresponding to important QTLs for agronomic traits such as resistance genes to major cocoa pathogens like Phytophthora spp ( palmivora and megakarya), Crinipellis perniciosa and Moniliophthora roreri.

  15. Genome-wide screening identifies a KCNIP1 copy number variant as a genetic predictor for atrial fibrillation

    Science.gov (United States)

    Tsai, Chia-Ti; Hsieh, Chia-Shan; Chang, Sheng-Nan; Chuang, Eric Y.; Ueng, Kwo-Chang; Tsai, Chin-Feng; Lin, Tsung-Hsien; Wu, Cho-Kai; Lee, Jen-Kuang; Lin, Lian-Yu; Wang, Yi-Chih; Yu, Chih-Chieh; Lai, Ling-Ping; Tseng, Chuen-Den; Hwang, Juey-Jen; Chiang, Fu-Tien; Lin, Jiunn-Lee

    2016-01-01

    Atrial fibrillation (AF) is the most common sustained cardiac arrhythmia. Previous genome-wide association studies had identified single-nucleotide polymorphisms in several genomic regions to be associated with AF. In human genome, copy number variations (CNVs) are known to contribute to disease susceptibility. Using a genome-wide multistage approach to identify AF susceptibility CNVs, we here show a common 4,470-bp diallelic CNV in the first intron of potassium interacting channel 1 gene (KCNIP1) is strongly associated with AF in Taiwanese populations (odds ratio=2.27 for insertion allele; P=6.23 × 10−24). KCNIP1 insertion is associated with higher KCNIP1 mRNA expression. KCNIP1-encoded protein potassium interacting channel 1 (KCHIP1) is physically associated with potassium Kv channels and modulates atrial transient outward current in cardiac myocytes. Overexpression of KCNIP1 results in inducible AF in zebrafish. In conclusions, a common CNV in KCNIP1 gene is a genetic predictor of AF risk possibly pointing to a functional pathway. PMID:26831368

  16. Performance Evaluation of NIPT in Detection of Chromosomal Copy Number Variants Using Low-Coverage Whole-Genome Sequencing of Plasma DNA

    DEFF Research Database (Denmark)

    Liu, Hongtai; Gao, Ya; Hu, Zhiyang

    2016-01-01

    , including 33 CNVs samples and 886 normal samples from September 1, 2011 to May 31, 2013, were enrolled in this study. The samples were randomly rearranged and blindly sequenced by low-coverage (about 7M reads) whole-genome sequencing of plasma DNA. Fetal CNVs were detected by Fetal Copy-number Analysis...

  17. Near-haploid and low-hypodiploid acute lymphoblastic leukemia

    DEFF Research Database (Denmark)

    Safavi, Setareh; Paulsson, Kajsa

    2017-01-01

    Hypodiploidy leukemia (ALL) in both children and adults. It has long been clear by cytogenetic analyses, and recently confirmed by mutational profiling, that these cases may be further subdivided into 2 subtypes: near-haploid ALL...

  18. The role of copy number variation in susceptibility to amyotrophic lateral sclerosis: genome-wide association study and comparison with published loci.

    Directory of Open Access Journals (Sweden)

    Louise V Wain

    2009-12-01

    Full Text Available The genetic contribution to sporadic amyotrophic lateral sclerosis (ALS has not been fully elucidated. There are increasing efforts to characterise the role of copy number variants (CNVs in human diseases; two previous studies concluded that CNVs may influence risk of sporadic ALS, with multiple rare CNVs more important than common CNVs. A little-explored issue surrounding genome-wide CNV association studies is that of post-calling filtering and merging of raw CNV calls. We undertook simulations to define filter thresholds and considered optimal ways of merging overlapping CNV calls for association testing, taking into consideration possibly overlapping or nested, but distinct, CNVs and boundary estimation uncertainty.In this study we screened Illumina 300K SNP genotyping data from 730 ALS cases and 789 controls for copy number variation. Following quality control filters using thresholds defined by simulation, a total of 11321 CNV calls were made across 575 cases and 621 controls. Using region-based and gene-based association analyses, we identified several loci showing nominally significant association. However, the choice of criteria for combining calls for association testing has an impact on the ranking of the results by their significance. Several loci which were previously reported as being associated with ALS were identified here. However, of another 15 genes previously reported as exhibiting ALS-specific copy number variation, only four exhibited copy number variation in this study. Potentially interesting novel loci, including EEF1D, a translation elongation factor involved in the delivery of aminoacyl tRNAs to the ribosome (a process which has previously been implicated in genetic studies of spinal muscular atrophy were identified but must be treated with caution due to concerns surrounding genomic location and platform suitability.Interpretation of CNV association findings must take into account the effects of filtering and combining

  19. Genome-wide analysis of esophageal adenocarcinoma yields specific copy number aberrations that correlate with prognosis.

    Science.gov (United States)

    Frankel, Adam; Armour, Nicola; Nancarrow, Derek; Krause, Lutz; Hayward, Nicholas; Lampe, Guy; Smithers, B Mark; Barbour, Andrew

    2014-04-01

    The incidence of esophageal adenocarcinoma (EAC) has been increasing rapidly for the past 3 decades in Western (Caucasian) populations. Curative treatment is based around esophagectomy, which has a major impact on quality of life. For those suitable for treatment with curative intent, 5-year survival is ∼30%. More accurate prognostic tools are therefore needed, and copy number aberrations (CNAs) may offer the ability to act as prospective biomarkers in this regard. We performed a genome-wide examination of CNAs in 54 samples of EAC using single-nucleotide polymorphism (SNP) arrays. Our aims were to describe frequent regions of CNA, to define driver CNAs, and to identify CNAs that correlated with survival. Regions of frequent amplification included oncogenes such as EGFR, MYC, KLF12, and ERBB2, while frequently deleted regions included tumor suppressor genes such as CDKN2A/B, PTPRD, FHIT, and SMAD4. The genomic identification of significant targets in cancer (GISTIC) algorithm identified 24 regions of gain and 28 regions of loss that were likely to contain driver changes. We discovered 61 genes in five regions that, when stratified by CNA type (gain or loss), correlated with a statistically significant difference in survival. Pathway analysis of the genes residing in both the GISTIC and prognostic regions showed they were significantly enriched for cancer-related networks. Finally, we discovered that copy-neutral loss of heterozygosity is a frequent mechanism of CNA in genes currently targetable by chemotherapy, potentially leading to under-reporting of cases suitable for such treatment. Copyright © 2014 Wiley Periodicals, Inc.

  20. Dana-Farber Cancer Institute (DFCI): Computational Correction of Copy-number Effect in CRISPR-Cas9 Essentiality Screens of Cancer Cells | Office of Cancer Genomics

    Science.gov (United States)

    Genome-wide CRISPR-Cas9 screens were performed in 341 cell lines. The results were processed with the CERES algorithm to produce copy-number and guide-efficacy corrected gene knockout effect estimates.

  1. Efficiency of anther culture technique in the production of wheat double haploids

    Directory of Open Access Journals (Sweden)

    Kondić-Špika Ankica Đ.

    2008-01-01

    Full Text Available The objective of the study was to investigate efficiency of anther culture in the production of spontaneous double haploids from randomly selected heterozygous genotypes of wheat (Triticum aestivum L.. Anthers of 20 F1 wheat combinations were grown in vitro on a modified Potato-2 medium. All of the examined genotypes have shown the ability to produce pollen calluses as well as to regenerate green plants. On average for the whole experiment material, 47.2 calluses were produced per 100 cultured anthers. The green plant regeneration ranged from 0.8 to 13.4 green plants per spike, with an overall mean of 5.8. From the total of 582 regenerated green plants, 47.9% (279 were spontaneous double haploids. The final average yield from the study was 2.8 double haploids per spike.

  2. Partitioning of copy-number genotypes in pedigrees

    Directory of Open Access Journals (Sweden)

    Andelfinger Gregor U

    2010-05-01

    Full Text Available Abstract Background Copy number variations (CNVs and polymorphisms (CNPs have only recently gained the genetic community's attention. Conservative estimates have shown that CNVs and CNPs might affect more than 10% of the genome and that they may be at least as important as single nucleotide polymorphisms in assessing human variability. Widely used tools for CNP analysis have been implemented in Birdsuite and PLINK for the purpose of conducting genetic association studies based on the unpartitioned total number of CNP copies provided by the intensities from Affymetrix's Genome-Wide Human SNP Array. Here, we are interested in partitioning copy number variations and polymorphisms in extended pedigrees for the purpose of linkage analysis on familial data. Results We have developed CNGen, a new software for the partitioning of copy number polymorphism using the integrated genotypes from Birdsuite with the Affymetrix platform. The algorithm applied to familial trios or extended pedigrees can produce partitioned copy number genotypes with distinct parental alleles. We have validated the algorithm using simulations on a complex pedigree structure using frequencies calculated from a real dataset of 300 genotyped samples from 42 pedigrees segregating a congenital heart defect phenotype. Conclusions CNGen is the first published software for the partitioning of copy number genotypes in pedigrees, making possible the use CNPs and CNVs for linkage analysis. It was implemented with the Python interpreter version 2.5.2. It was successfully tested on current Linux, Windows and Mac OS workstations.

  3. Autism genome-wide copy number variation reveals ubiquitin and neuronal genes.

    Science.gov (United States)

    Glessner, Joseph T; Wang, Kai; Cai, Guiqing; Korvatska, Olena; Kim, Cecilia E; Wood, Shawn; Zhang, Haitao; Estes, Annette; Brune, Camille W; Bradfield, Jonathan P; Imielinski, Marcin; Frackelton, Edward C; Reichert, Jennifer; Crawford, Emily L; Munson, Jeffrey; Sleiman, Patrick M A; Chiavacci, Rosetta; Annaiah, Kiran; Thomas, Kelly; Hou, Cuiping; Glaberson, Wendy; Flory, James; Otieno, Frederick; Garris, Maria; Soorya, Latha; Klei, Lambertus; Piven, Joseph; Meyer, Kacie J; Anagnostou, Evdokia; Sakurai, Takeshi; Game, Rachel M; Rudd, Danielle S; Zurawiecki, Danielle; McDougle, Christopher J; Davis, Lea K; Miller, Judith; Posey, David J; Michaels, Shana; Kolevzon, Alexander; Silverman, Jeremy M; Bernier, Raphael; Levy, Susan E; Schultz, Robert T; Dawson, Geraldine; Owley, Thomas; McMahon, William M; Wassink, Thomas H; Sweeney, John A; Nurnberger, John I; Coon, Hilary; Sutcliffe, James S; Minshew, Nancy J; Grant, Struan F A; Bucan, Maja; Cook, Edwin H; Buxbaum, Joseph D; Devlin, Bernie; Schellenberg, Gerard D; Hakonarson, Hakon

    2009-05-28

    Autism spectrum disorders (ASDs) are childhood neurodevelopmental disorders with complex genetic origins. Previous studies focusing on candidate genes or genomic regions have identified several copy number variations (CNVs) that are associated with an increased risk of ASDs. Here we present the results from a whole-genome CNV study on a cohort of 859 ASD cases and 1,409 healthy children of European ancestry who were genotyped with approximately 550,000 single nucleotide polymorphism markers, in an attempt to comprehensively identify CNVs conferring susceptibility to ASDs. Positive findings were evaluated in an independent cohort of 1,336 ASD cases and 1,110 controls of European ancestry. Besides previously reported ASD candidate genes, such as NRXN1 (ref. 10) and CNTN4 (refs 11, 12), several new susceptibility genes encoding neuronal cell-adhesion molecules, including NLGN1 and ASTN2, were enriched with CNVs in ASD cases compared to controls (P = 9.5 x 10(-3)). Furthermore, CNVs within or surrounding genes involved in the ubiquitin pathways, including UBE3A, PARK2, RFWD2 and FBXO40, were affected by CNVs not observed in controls (P = 3.3 x 10(-3)). We also identified duplications 55 kilobases upstream of complementary DNA AK123120 (P = 3.6 x 10(-6)). Although these variants may be individually rare, they target genes involved in neuronal cell-adhesion or ubiquitin degradation, indicating that these two important gene networks expressed within the central nervous system may contribute to the genetic susceptibility of ASD.

  4. Genomic copy number variations in three Southeast Asian populations.

    Science.gov (United States)

    Ku, Chee-Seng; Pawitan, Yudi; Sim, Xueling; Ong, Rick T H; Seielstad, Mark; Lee, Edmund J D; Teo, Yik-Ying; Chia, Kee-Seng; Salim, Agus

    2010-07-01

    Research on the role of copy number variations (CNVs) in the genetic risk of diseases in Asian populations has been hampered by a relative lack of reference CNV maps for Asian populations outside the East Asians. In this article, we report the population characteristics of CNVs in Chinese, Malay, and Asian Indian populations in Singapore. Using the Illumina Human 1M Beadchip array, we identify 1,174 CNV loci in these populations that corroborated with findings when the same samples were typed on the Affymetrix 6.0 platform. We identify 441 novel loci not previously reported in the Database of Genomic Variations (DGV). We observe a considerable number of loci that span all three populations and were previously unreported, as well as population-specific loci that are quite common in the respective populations. From this we observe the distribution of CNVs in the Asian Indian population to be considerably different from the Chinese and Malay populations. About half of the deletion loci and three-quarters of duplication loci overlap UCSC genes. Tens of loci show population differentiation and overlap with genes previously known to be associated with genetic risk of diseases. One of these loci is the CYP2A6 deletion, previously linked to reduced susceptibility to lung cancer. (c) 2010 Wiley-Liss, Inc.

  5. A set of haploid strains available for genetic studies of Saccharomyces cerevisiae flor yeasts.

    Science.gov (United States)

    Coi, Anna Lisa; Legras, Jean-Luc; Zara, Giacomo; Dequin, Sylvie; Budroni, Marilena

    2016-09-01

    Flor yeasts of Saccharomyces cerevisiae have been extensively studied for biofilm formation, however the lack of specific haploid model strains has limited the application of genetic approaches such as gene knockout, allelic replacement and Quantitative Trait Locus mapping for the deciphering of the molecular basis of velum formation under biological ageing. The aim of this work was to construct a set of flor isogenic haploid strains easy to manipulate genetically. The analysis of the allelic variations at 12 minisatellite loci of 174 Saccharomyces cerevisiae strains allowed identifying three flor parental strains with different phylogenic positions. These strains were characterized for sporulation efficiency, growth on galactose, adherence to polystyrene, agar invasion, growth on wine and ability to develop a biofilm. Interestingly, the inability to grow on galactose was found associated with a frameshift in GAL4 gene that seems peculiar of flor strains. From these wild flor strains, isogenic haploid strains were constructed by deleting HO gene with a loxP-KanMX-loxP cassette followed by the removal of the kanamycin cassette. Haploid strains obtained were characterized for their phenotypic and genetic properties and compared with the parental strains. Preliminary results showed that the haploid strains represent new tools for genetic studies and breeding programs on biofilm formation. © FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  6. Inter Simple Sequence Repeat DNA (ISSR) Polymorphism Utility in Haploid Nicotiana Alata Irradiated Plants for Finding Markers Associated with Gamma Irradiation and Salinity

    International Nuclear Information System (INIS)

    El-Fiki, A.; Adly, M.; El-Metabteb, G.

    2017-01-01

    Nicotiana alata is an ornamental plant. It is a member of family Solanasea. Tobacco (Nicotiana spp.) is one of the most important commercial crops in the world. Wild Nicotiana species, as a store house of genes for several diseases and pests, in addition to genes for several important phytochemicals and quality traits which are not present in cultivated varieties. Inter simple sequence repeat DNA (ISSR) analysis was used to determine the degree of genetic variation in treated haploid Nicotiana alata plants. Total genomic DNAs from different treated haploid plant lets were amplified using five specific primers. All primers were polymorphic. A total of 209 bands were amplified of which 135 (59.47%) polymorphic across the radiation treatments. Whilst, the level of polymorphism among the salinity treatments were 181 (85.6 %). Whereas, the polymorphism among the combined effects between gamma radiation doses and salinity concentrations were 283 ( 73.95% ). Treatments relationships were estimated through cluster analysis (UPGMA) based on ISSR data

  7. Schrödinger's Cheshire Cat: Are Haploid Emiliania huxleyi Cells Resistant to Viral Infection or Not?

    Science.gov (United States)

    Mordecai, Gideon J; Verret, Frederic; Highfield, Andrea; Schroeder, Declan C

    2017-03-18

    Emiliania huxleyi is the main calcite producer on Earth and is routinely infected by a virus (EhV); a double stranded DNA (dsDNA) virus belonging to the family Phycodnaviridae . E. huxleyi exhibits a haplodiploid life cycle; the calcified diploid stage is non-motile and forms extensive blooms. The haploid phase is a non-calcified biflagellated cell bearing organic scales. Haploid cells are thought to resist infection, through a process deemed the "Cheshire Cat" escape strategy; however, a recent study detected the presence of viral lipids in the same haploid strain. Here we report on the application of an E. huxleyi CCMP1516 EhV-86 combined tiling array (TA) that further confirms an EhV infection in the RCC1217 haploid strain, which grew without any signs of cell lysis. Reverse transcription polymerase chain reaction (RT-PCR) and PCR verified the presence of viral RNA in the haploid cells, yet indicated an absence of viral DNA, respectively. These infected cells are an alternative stage of the virus life cycle deemed the haplococcolithovirocell. In this instance, the host is both resistant to and infected by EhV, i.e., the viral transcriptome is present in haploid cells whilst there is no evidence of viral lysis. This superimposed state is reminiscent of Schrödinger's cat; of being simultaneously both dead and alive.

  8. Exploiting rRNA operon copy number to investigate bacterial reproductive strategies.

    Science.gov (United States)

    Roller, Benjamin R K; Stoddard, Steven F; Schmidt, Thomas M

    2016-09-12

    The potential for rapid reproduction is a hallmark of microbial life, but microbes in nature must also survive and compete when growth is constrained by resource availability. Successful reproduction requires different strategies when resources are scarce and when they are abundant 1,2 , but a systematic framework for predicting these reproductive strategies in bacteria has not been available. Here, we show that the number of ribosomal RNA operons (rrn) in bacterial genomes predicts two important components of reproduction-growth rate and growth efficiency-which are favoured under contrasting regimes of resource availability 3,4 . We find that the maximum reproductive rate of bacteria doubles with a doubling of rrn copy number, and the efficiency of carbon use is inversely related to maximal growth rate and rrn copy number. We also identify a feasible explanation for these patterns: the rate and yield of protein synthesis mirror the overall pattern in maximum growth rate and growth efficiency. Furthermore, comparative analysis of genomes from 1,167 bacterial species reveals that rrn copy number predicts traits associated with resource availability, including chemotaxis and genome streamlining. Genome-wide patterns of orthologous gene content covary with rrn copy number, suggesting convergent evolution in response to resource availability. Our findings imply that basic cellular processes adapt in contrasting ways to long-term differences in resource availability. They also establish a basis for predicting changes in bacterial community composition in response to resource perturbations using rrn copy number measurements 5 or inferences 6,7 .

  9. Proteomic strategy for the identification of critical actors in reorganization of the post-meiotic male genome.

    Science.gov (United States)

    Govin, Jerome; Gaucher, Jonathan; Ferro, Myriam; Debernardi, Alexandra; Garin, Jerome; Khochbin, Saadi; Rousseaux, Sophie

    2012-01-01

    After meiosis, during the final stages of spermatogenesis, the haploid male genome undergoes major structural changes, resulting in a shift from a nucleosome-based genome organization to the sperm-specific, highly compacted nucleoprotamine structure. Recent data support the idea that region-specific programming of the haploid male genome is of high importance for the post-fertilization events and for successful embryo development. Although these events constitute a unique and essential step in reproduction, the mechanisms by which they occur have remained completely obscure and the factors involved have mostly remained uncharacterized. Here, we sought a strategy to significantly increase our understanding of proteins controlling the haploid male genome reprogramming, based on the identification of proteins in two specific pools: those with the potential to bind nucleic acids (basic proteins) and proteins capable of binding basic proteins (acidic proteins). For the identification of acidic proteins, we developed an approach involving a transition-protein (TP)-based chromatography, which has the advantage of retaining not only acidic proteins due to the charge interactions, but also potential TP-interacting factors. A second strategy, based on an in-depth bioinformatic analysis of the identified proteins, was then applied to pinpoint within the lists obtained, male germ cells expressed factors relevant to the post-meiotic genome organization. This approach reveals a functional network of DNA-packaging proteins and their putative chaperones and sheds a new light on the way the critical transitions in genome organizations could take place. This work also points to a new area of research in male infertility and sperm quality assessments.

  10. Hapsembler: An Assembler for Highly Polymorphic Genomes

    Science.gov (United States)

    Donmez, Nilgun; Brudno, Michael

    As whole genome sequencing has become a routine biological experiment, algorithms for assembly of whole genome shotgun data has become a topic of extensive research, with a plethora of off-the-shelf methods that can reconstruct the genomes of many organisms. Simultaneously, several recently sequenced genomes exhibit very high polymorphism rates. For these organisms genome assembly remains a challenge as most assemblers are unable to handle highly divergent haplotypes in a single individual. In this paper we describe Hapsembler, an assembler for highly polymorphic genomes, which makes use of paired reads. Our experiments show that Hapsembler produces accurate and contiguous assemblies of highly polymorphic genomes, while performing on par with the leading tools on haploid genomes. Hapsembler is available for download at http://compbio.cs.toronto.edu/hapsembler.

  11. Standing at the Gateway to Europe - The Genetic Structure of Western Balkan Populations Based on Autosomal and Haploid Markers

    Science.gov (United States)

    Kovacevic, Lejla; Tambets, Kristiina; Ilumäe, Anne-Mai; Kushniarevich, Alena; Yunusbayev, Bayazit; Solnik, Anu; Bego, Tamer; Primorac, Dragan; Skaro, Vedrana; Leskovac, Andreja; Jakovski, Zlatko; Drobnic, Katja; Tolk, Helle-Viivi; Kovacevic, Sandra; Rudan, Pavao; Metspalu, Ene; Marjanovic, Damir

    2014-01-01

    Contemporary inhabitants of the Balkan Peninsula belong to several ethnic groups of diverse cultural background. In this study, three ethnic groups from Bosnia and Herzegovina - Bosniacs, Bosnian Croats and Bosnian Serbs - as well as the populations of Serbians, Croatians, Macedonians from the former Yugoslav Republic of Macedonia, Montenegrins and Kosovars have been characterized for the genetic variation of 660 000 genome-wide autosomal single nucleotide polymorphisms and for haploid markers. New autosomal data of the 70 individuals together with previously published data of 20 individuals from the populations of the Western Balkan region in a context of 695 samples of global range have been analysed. Comparison of the variation data of autosomal and haploid lineages of the studied Western Balkan populations reveals a concordance of the data in both sets and the genetic uniformity of the studied populations, especially of Western South-Slavic speakers. The genetic variation of Western Balkan populations reveals the continuity between the Middle East and Europe via the Balkan region and supports the scenario that one of the major routes of ancient gene flows and admixture went through the Balkan Peninsula. PMID:25148043

  12. Standing at the gateway to Europe--the genetic structure of Western balkan populations based on autosomal and haploid markers.

    Science.gov (United States)

    Kovacevic, Lejla; Tambets, Kristiina; Ilumäe, Anne-Mai; Kushniarevich, Alena; Yunusbayev, Bayazit; Solnik, Anu; Bego, Tamer; Primorac, Dragan; Skaro, Vedrana; Leskovac, Andreja; Jakovski, Zlatko; Drobnic, Katja; Tolk, Helle-Viivi; Kovacevic, Sandra; Rudan, Pavao; Metspalu, Ene; Marjanovic, Damir

    2014-01-01

    Contemporary inhabitants of the Balkan Peninsula belong to several ethnic groups of diverse cultural background. In this study, three ethnic groups from Bosnia and Herzegovina - Bosniacs, Bosnian Croats and Bosnian Serbs - as well as the populations of Serbians, Croatians, Macedonians from the former Yugoslav Republic of Macedonia, Montenegrins and Kosovars have been characterized for the genetic variation of 660 000 genome-wide autosomal single nucleotide polymorphisms and for haploid markers. New autosomal data of the 70 individuals together with previously published data of 20 individuals from the populations of the Western Balkan region in a context of 695 samples of global range have been analysed. Comparison of the variation data of autosomal and haploid lineages of the studied Western Balkan populations reveals a concordance of the data in both sets and the genetic uniformity of the studied populations, especially of Western South-Slavic speakers. The genetic variation of Western Balkan populations reveals the continuity between the Middle East and Europe via the Balkan region and supports the scenario that one of the major routes of ancient gene flows and admixture went through the Balkan Peninsula.

  13. The draft genome and transcriptome of Cannabis sativa.

    Science.gov (United States)

    van Bakel, Harm; Stout, Jake M; Cote, Atina G; Tallon, Carling M; Sharpe, Andrew G; Hughes, Timothy R; Page, Jonathan E

    2011-10-20

    Cannabis sativa has been cultivated throughout human history as a source of fiber, oil and food, and for its medicinal and intoxicating properties. Selective breeding has produced cannabis plants for specific uses, including high-potency marijuana strains and hemp cultivars for fiber and seed production. The molecular biology underlying cannabinoid biosynthesis and other traits of interest is largely unexplored. We sequenced genomic DNA and RNA from the marijuana strain Purple Kush using shortread approaches. We report a draft haploid genome sequence of 534 Mb and a transcriptome of 30,000 genes. Comparison of the transcriptome of Purple Kush with that of the hemp cultivar 'Finola' revealed that many genes encoding proteins involved in cannabinoid and precursor pathways are more highly expressed in Purple Kush than in 'Finola'. The exclusive occurrence of Δ9-tetrahydrocannabinolic acid synthase in the Purple Kush transcriptome, and its replacement by cannabidiolic acid synthase in 'Finola', may explain why the psychoactive cannabinoid Δ9-tetrahydrocannabinol (THC) is produced in marijuana but not in hemp. Resequencing the hemp cultivars 'Finola' and 'USO-31' showed little difference in gene copy numbers of cannabinoid pathway enzymes. However, single nucleotide variant analysis uncovered a relatively high level of variation among four cannabis types, and supported a separation of marijuana and hemp. The availability of the Cannabis sativa genome enables the study of a multifunctional plant that occupies a unique role in human culture. Its availability will aid the development of therapeutic marijuana strains with tailored cannabinoid profiles and provide a basis for the breeding of hemp with improved agronomic characteristics.

  14. Transcriptome analysis of functional differentiation between haploid and diploid cells of Emiliania huxleyi, a globally significant photosynthetic calcifying cell

    Science.gov (United States)

    2009-01-01

    Background Eukaryotes are classified as either haplontic, diplontic, or haplo-diplontic, depending on which ploidy levels undergo mitotic cell division in the life cycle. Emiliania huxleyi is one of the most abundant phytoplankton species in the ocean, playing an important role in global carbon fluxes, and represents haptophytes, an enigmatic group of unicellular organisms that diverged early in eukaryotic evolution. This species is haplo-diplontic. Little is known about the haploid cells, but they have been hypothesized to allow persistence of the species between the yearly blooms of diploid cells. We sequenced over 38,000 expressed sequence tags from haploid and diploid E. huxleyi normalized cDNA libraries to identify genes involved in important processes specific to each life phase (2N calcification or 1N motility), and to better understand the haploid phase of this prominent haplo-diplontic organism. Results The haploid and diploid transcriptomes showed a dramatic differentiation, with approximately 20% greater transcriptome richness in diploid cells than in haploid cells and only ≤ 50% of transcripts estimated to be common between the two phases. The major functional category of transcripts differentiating haploids included signal transduction and motility genes. Diploid-specific transcripts included Ca2+, H+, and HCO3- pumps. Potential factors differentiating the transcriptomes included haploid-specific Myb transcription factor homologs and an unusual diploid-specific histone H4 homolog. Conclusions This study permitted the identification of genes likely involved in diploid-specific biomineralization, haploid-specific motility, and transcriptional control. Greater transcriptome richness in diploid cells suggests they may be more versatile for exploiting a diversity of rich environments whereas haploid cells are intrinsically more streamlined. PMID:19832986

  15. Identification of networks of co-occurring, tumor-related DNA copy number changes using a genome-wide scoring approach.

    Directory of Open Access Journals (Sweden)

    Christiaan Klijn

    2010-01-01

    Full Text Available Tumorigenesis is a multi-step process in which normal cells transform into malignant tumors following the accumulation of genetic mutations that enable them to evade the growth control checkpoints that would normally suppress their growth or result in apoptosis. It is therefore important to identify those combinations of mutations that collaborate in cancer development and progression. DNA copy number alterations (CNAs are one of the ways in which cancer genes are deregulated in tumor cells. We hypothesized that synergistic interactions between cancer genes might be identified by looking for regions of co-occurring gain and/or loss. To this end we developed a scoring framework to separate truly co-occurring aberrations from passenger mutations and dominant single signals present in the data. The resulting regions of high co-occurrence can be investigated for between-region functional interactions. Analysis of high-resolution DNA copy number data from a panel of 95 hematological tumor cell lines correctly identified co-occurring recombinations at the T-cell receptor and immunoglobulin loci in T- and B-cell malignancies, respectively, showing that we can recover truly co-occurring genomic alterations. In addition, our analysis revealed networks of co-occurring genomic losses and gains that are enriched for cancer genes. These networks are also highly enriched for functional relationships between genes. We further examine sub-networks of these networks, core networks, which contain many known cancer genes. The core network for co-occurring DNA losses we find seems to be independent of the canonical cancer genes within the network. Our findings suggest that large-scale, low-intensity copy number alterations may be an important feature of cancer development or maintenance by affecting gene dosage of a large interconnected network of functionally related genes.

  16. Quadruplex MAPH: improvement of throughput in high-resolution copy number screening.

    Science.gov (United States)

    Tyson, Jess; Majerus, Tamsin Mo; Walker, Susan; Armour, John Al

    2009-09-28

    Copy number variation (CNV) in the human genome is recognised as a widespread and important source of human genetic variation. Now the challenge is to screen for these CNVs at high resolution in a reliable, accurate and cost-effective way. Multiplex Amplifiable Probe Hybridisation (MAPH) is a sensitive, high-resolution technology appropriate for screening for CNVs in a defined region, for a targeted population. We have developed MAPH to a highly multiplexed format ("QuadMAPH") that allows the user a four-fold increase in the number of loci tested simultaneously. We have used this method to analyse a genomic region of 210 kb, including the MSH2 gene and 120 kb of flanking DNA. We show that the QuadMAPH probes report copy number with equivalent accuracy to simplex MAPH, reliably demonstrating diploid copy number in control samples and accurately detecting deletions in Hereditary Non-Polyposis Colorectal Cancer (HNPCC) samples. QuadMAPH is an accurate, high-resolution method that allows targeted screening of large numbers of subjects without the expense of genome-wide approaches. Whilst we have applied this technique to a region of the human genome, it is equally applicable to the genomes of other organisms.

  17. Genetic control of the development of the haploid generation in Oenothera

    Directory of Open Access Journals (Sweden)

    Cornelia Harte

    2014-01-01

    Full Text Available The haploid generation of higher plants has to be considered in its own individuality. Special experimental designs are needed to investigate the developmental processes of the male and female gametophytes between meiosis and fertilization. Experiments on Oenothera demonstrate the existence of genes, which action can be described as influencing the competition between meiospores or between gametophytes, or as interaction between different individuals, the gametophytic-gametophytic and gametophytic-sporophytic incompatibility. The development of the haploid generation is regulated by genes. Some of these genes are active only in this phase of the life cycle.

  18. Survival and DNA repair in ultraviolet-irradiated haploid and diploid cultured frog cells

    International Nuclear Information System (INIS)

    Freed, J.J.; Hoess, R.H.; Angelosanto, F.A.; Massey, H.C. Jr.

    1979-01-01

    Survival and repair of DNA following ultraviolet (254-nm) radiation have been investigated in ICR 2A, a cultured cell line from haploid embryos of the grassfrog, Rana pipiens. Survival curves from cells recovering in the dark gave mean lethal dose value (D 0 ) in the range 1.5-1.7 Jm -2 for both haploid and diploid cell stocks. The only significant difference observed between haploids and diploids was in the extent of the shoulder at low fluence (Dsub(q)), the value for exponentially multiplying diploid cells (3.0 Jm -2 ) being higher than that found for haploids (1.2 Jm -2 ). Irradiation of cultures reversibly blocked in the G1 phase of the cell cycle gave survival-curve coefficients indistinguishable between haploids and diploids. Post-irradiation exposure to visible light restored colony-forming capacity and removed chromatographically estimated pyrimidine dimers from DNA at the same rates. After fluences killing 90% of the cells, complete restoration of survival was obtained after 60-min exposure to 500 foot-candles, indicating that in this range lethality is entirely photoreversible and therefore attributable to pyrimidine dimers in DNA. Dimer removal required illumination following ultraviolet exposure, intact cells and physiological temperature, implying that the photoreversal involved DNA photolyase activity. Excision-repair capacity was slight, since no loss of dimers could be detected chromoatographically during up to 48 h incubation in the dark and since autoradiographically detected 'unscheduled DNA synthesis' was limited to a 2-fold increase saturated at 10 Jm -2 . These properties make ICR 2A frog cells useful to explore how DNA-repair pathways influence mutant yield. (Auth.)

  19. An endogenous reference gene of common and durum wheat for detection of genetically modified wheat.

    Science.gov (United States)

    Imai, Shinjiro; Tanaka, Keiko; Nishitsuji, Yasuyuki; Kikuchi, Yosuke; Matsuoka, Yasuyuki; Arami, Shin-Ichiro; Sato, Megumi; Haraguchi, Hiroyuki; Kurimoto, Youichi; Mano, Junichi; Furui, Satoshi; Kitta, Kazumi

    2012-01-01

    To develop a method for detecting GM wheat that may be marketed in the near future, we evaluated the proline-rich protein (PRP) gene as an endogenous reference gene of common wheat (Triticum aestivum L.) and durum wheat (Triticum durum L.). Real-time PCR analysis showed that only DNA of wheat was amplified and no amplification product was observed for phylogenetically related cereals, indicating that the PRP detection system is specific to wheat. The intensities of the amplification products and Ct values among all wheat samples used in this study were very similar, with no nonspecific or additional amplification, indicating that the PRP detection system has high sequence stability. The limit of detection was estimated at 5 haploid genome copies. The PRP region was demonstrated to be present as a single or double copy in the common wheat haploid genome. Furthermore, the PRP detection system showed a highly linear relationship between Ct values and the amount of plasmid DNA, indicating that an appropriate calibration curve could be constructed for quantitative detection of GM wheat. All these results indicate that the PRP gene is a suitable endogenous reference gene for PCR-based detection of GM wheat.

  20. FISH Oracle: a web server for flexible visualization of DNA copy number data in a genomic context.

    Science.gov (United States)

    Mader, Malte; Simon, Ronald; Steinbiss, Sascha; Kurtz, Stefan

    2011-07-28

    The rapidly growing amount of array CGH data requires improved visualization software supporting the process of identifying candidate cancer genes. Optimally, such software should work across multiple microarray platforms, should be able to cope with data from different sources and should be easy to operate. We have developed a web-based software FISH Oracle to visualize data from multiple array CGH experiments in a genomic context. Its fast visualization engine and advanced web and database technology supports highly interactive use. FISH Oracle comes with a convenient data import mechanism, powerful search options for genomic elements (e.g. gene names or karyobands), quick navigation and zooming into interesting regions, and mechanisms to export the visualization into different high quality formats. These features make the software especially suitable for the needs of life scientists. FISH Oracle offers a fast and easy to use visualization tool for array CGH and SNP array data. It allows for the identification of genomic regions representing minimal common changes based on data from one or more experiments. FISH Oracle will be instrumental to identify candidate onco and tumor suppressor genes based on the frequency and genomic position of DNA copy number changes. The FISH Oracle application and an installed demo web server are available at http://www.zbh.uni-hamburg.de/fishoracle.

  1. Stratification of clear cell renal cell carcinoma (ccRCC) genomes by gene-directed copy number alteration (CNA) analysis.

    Science.gov (United States)

    Thiesen, H-J; Steinbeck, F; Maruschke, M; Koczan, D; Ziems, B; Hakenberg, O W

    2017-01-01

    Tumorigenic processes are understood to be driven by epi-/genetic and genomic alterations from single point mutations to chromosomal alterations such as insertions and deletions of nucleotides up to gains and losses of large chromosomal fragments including products of chromosomal rearrangements e.g. fusion genes and proteins. Overall comparisons of copy number alterations (CNAs) presented in 48 clear cell renal cell carcinoma (ccRCC) genomes resulted in ratios of gene losses versus gene gains between 26 ccRCC Fuhrman malignancy grades G1 (ratio 1.25) and 20 G3 (ratio 0.58). Gene losses and gains of 15762 CNA genes were mapped to 795 chromosomal cytoband loci including 280 KEGG pathways. CNAs were classified according to their contribution to Fuhrman tumour gradings G1 and G3. Gene gains and losses turned out to be highly structured processes in ccRCC genomes enabling the subclassification and stratification of ccRCC tumours in a genome-wide manner. CNAs of ccRCC seem to start with common tumour related gene losses flanked by CNAs specifying Fuhrman grade G1 losses and CNA gains favouring grade G3 tumours. The appearance of recurrent CNA signatures implies the presence of causal mechanisms most likely implicated in the pathogenesis and disease-outcome of ccRCC tumours distinguishing lower from higher malignant tumours. The diagnostic quality of initial 201 genes (108 genes supporting G1 and 93 genes G3 phenotypes) has been successfully validated on published Swiss data (GSE19949) leading to a restricted CNA gene set of 171 CNA genes of which 85 genes favour Fuhrman grade G1 and 86 genes Fuhrman grade G3. Regarding these gene sets overall survival decreased with the number of G3 related gene losses plus G3 related gene gains. CNA gene sets presented define an entry to a gene-directed and pathway-related functional understanding of ongoing copy number alterations within and between individual ccRCC tumours leading to CNA genes of prognostic and predictive value.

  2. Fully-Automated High-Throughput NMR System for Screening of Haploid Kernels of Maize (Corn by Measurement of Oil Content.

    Directory of Open Access Journals (Sweden)

    Hongzhi Wang

    Full Text Available One of the modern crop breeding techniques uses doubled haploid plants that contain an identical pair of chromosomes in order to accelerate the breeding process. Rapid haploid identification method is critical for large-scale selections of double haploids. The conventional methods based on the color of the endosperm and embryo seeds are slow, manual and prone to error. On the other hand, there exists a significant difference between diploid and haploid seeds generated by high oil inducer, which makes it possible to use oil content to identify the haploid. This paper describes a fully-automated high-throughput NMR screening system for maize haploid kernel identification. The system is comprised of a sampler unit to select a single kernel to feed for measurement of NMR and weight, and a kernel sorter to distribute the kernel according to the measurement result. Tests of the system show a consistent accuracy of 94% with an average screening time of 4 seconds per kernel. Field test result is described and the directions for future improvement are discussed.

  3. Fully-Automated High-Throughput NMR System for Screening of Haploid Kernels of Maize (Corn) by Measurement of Oil Content

    Science.gov (United States)

    Xu, Xiaoping; Huang, Qingming; Chen, Shanshan; Yang, Peiqiang; Chen, Shaojiang; Song, Yiqiao

    2016-01-01

    One of the modern crop breeding techniques uses doubled haploid plants that contain an identical pair of chromosomes in order to accelerate the breeding process. Rapid haploid identification method is critical for large-scale selections of double haploids. The conventional methods based on the color of the endosperm and embryo seeds are slow, manual and prone to error. On the other hand, there exists a significant difference between diploid and haploid seeds generated by high oil inducer, which makes it possible to use oil content to identify the haploid. This paper describes a fully-automated high-throughput NMR screening system for maize haploid kernel identification. The system is comprised of a sampler unit to select a single kernel to feed for measurement of NMR and weight, and a kernel sorter to distribute the kernel according to the measurement result. Tests of the system show a consistent accuracy of 94% with an average screening time of 4 seconds per kernel. Field test result is described and the directions for future improvement are discussed. PMID:27454427

  4. Re-annotation of the physical map of Glycine max for polyploid-like regions by BAC end sequence driven whole genome shotgun read assembly

    Directory of Open Access Journals (Sweden)

    Shultz Jeffry

    2008-07-01

    Full Text Available Abstract Background Many of the world's most important food crops have either polyploid genomes or homeologous regions derived from segmental shuffling following polyploid formation. The soybean (Glycine max genome has been shown to be composed of approximately four thousand short interspersed homeologous regions with 1, 2 or 4 copies per haploid genome by RFLP analysis, microsatellite anchors to BACs and by contigs formed from BAC fingerprints. Despite these similar regions,, the genome has been sequenced by whole genome shotgun sequence (WGS. Here the aim was to use BAC end sequences (BES derived from three minimum tile paths (MTP to examine the extent and homogeneity of polyploid-like regions within contigs and the extent of correlation between the polyploid-like regions inferred from fingerprinting and the polyploid-like sequences inferred from WGS matches. Results Results show that when sequence divergence was 1–10%, the copy number of homeologous regions could be identified from sequence variation in WGS reads overlapping BES. Homeolog sequence variants (HSVs were single nucleotide polymorphisms (SNPs; 89% and single nucleotide indels (SNIs 10%. Larger indels were rare but present (1%. Simulations that had predicted fingerprints of homeologous regions could be separated when divergence exceeded 2% were shown to be false. We show that a 5–10% sequence divergence is necessary to separate homeologs by fingerprinting. BES compared to WGS traces showed polyploid-like regions with less than 1% sequence divergence exist at 2.3% of the locations assayed. Conclusion The use of HSVs like SNPs and SNIs to characterize BACs wil improve contig building methods. The implications for bioinformatic and functional annotation of polyploid and paleopolyploid genomes show that a combined approach of BAC fingerprint based physical maps, WGS sequence and HSV-based partitioning of BAC clones from homeologous regions to separate contigs will allow reliable de

  5. Schrödinger’s Cheshire Cat: Are Haploid Emiliania huxleyi Cells Resistant to Viral Infection or Not?

    Directory of Open Access Journals (Sweden)

    Gideon J. Mordecai

    2017-03-01

    Full Text Available Emiliania huxleyi is the main calcite producer on Earth and is routinely infected by a virus (EhV; a double stranded DNA (dsDNA virus belonging to the family Phycodnaviridae. E. huxleyi exhibits a haplodiploid life cycle; the calcified diploid stage is non-motile and forms extensive blooms. The haploid phase is a non-calcified biflagellated cell bearing organic scales. Haploid cells are thought to resist infection, through a process deemed the “Cheshire Cat” escape strategy; however, a recent study detected the presence of viral lipids in the same haploid strain. Here we report on the application of an E. huxleyi CCMP1516 EhV-86 combined tiling array (TA that further confirms an EhV infection in the RCC1217 haploid strain, which grew without any signs of cell lysis. Reverse transcription polymerase chain reaction (RT-PCR and PCR verified the presence of viral RNA in the haploid cells, yet indicated an absence of viral DNA, respectively. These infected cells are an alternative stage of the virus life cycle deemed the haplococcolithovirocell. In this instance, the host is both resistant to and infected by EhV, i.e., the viral transcriptome is present in haploid cells whilst there is no evidence of viral lysis. This superimposed state is reminiscent of Schrödinger’s cat; of being simultaneously both dead and alive.

  6. Schrödinger’s Cheshire Cat: Are Haploid Emiliania huxleyi Cells Resistant to Viral Infection or Not?

    Science.gov (United States)

    Mordecai, Gideon J.; Verret, Frederic; Highfield, Andrea; Schroeder, Declan C.

    2017-01-01

    Emiliania huxleyi is the main calcite producer on Earth and is routinely infected by a virus (EhV); a double stranded DNA (dsDNA) virus belonging to the family Phycodnaviridae. E. huxleyi exhibits a haplodiploid life cycle; the calcified diploid stage is non-motile and forms extensive blooms. The haploid phase is a non-calcified biflagellated cell bearing organic scales. Haploid cells are thought to resist infection, through a process deemed the “Cheshire Cat” escape strategy; however, a recent study detected the presence of viral lipids in the same haploid strain. Here we report on the application of an E. huxleyi CCMP1516 EhV-86 combined tiling array (TA) that further confirms an EhV infection in the RCC1217 haploid strain, which grew without any signs of cell lysis. Reverse transcription polymerase chain reaction (RT-PCR) and PCR verified the presence of viral RNA in the haploid cells, yet indicated an absence of viral DNA, respectively. These infected cells are an alternative stage of the virus life cycle deemed the haplococcolithovirocell. In this instance, the host is both resistant to and infected by EhV, i.e., the viral transcriptome is present in haploid cells whilst there is no evidence of viral lysis. This superimposed state is reminiscent of Schrödinger’s cat; of being simultaneously both dead and alive. PMID:28335465

  7. Systematic biases in DNA copy number originate from isolation procedures

    NARCIS (Netherlands)

    van Heesch, S.; Mokry, M.; Boskova, V.; Junker, W.; Mehon, R.; Toonen, P.; de Bruijn, E.; Shull, J.D.; Aitman, T.J.; Cuppen, E.; Guryev, V.

    2013-01-01

    BACKGROUND: The ability to accurately detect DNA copy number variation in both a sensitive and quantitative manner is important in many research areas. However, genome-wide DNA copy number analyses are complicated by variations in detection signal. RESULTS: While GC content has been used to correct

  8. Genetical Studies On Haploid Production In Some Ornamental Plants

    International Nuclear Information System (INIS)

    MOSTAFA, M.A.M.

    2013-01-01

    Haploid are plants with a gametophytic chromosome number and doubled haploid are dihaploids that have undergone chromosome duplication. The production of haploid and doubled haploid (DHs) through gametic embryogenesis allows a single-step development of complete homozygous lines from heterozygous parents, shortening the time required to produce homozygous plants in comparison with the conventional breeding methods that employ several generations of selfing. The production of haploid and DHs provides a particularly attractive biotechnological tool, and the development of haploidy technology and protocols to produce homozygous plants has had a significant impact on agricultural systems. Nowadays, these bio technologies represent an integral part of the breeding programmes of many agronomically important crops. There are several available methods to obtain haploid and DHs, of which in vitro anther or isolated microspore culture are the most effective and widely used (Germana Maria 2011). Tissue culture techniques, particularly short-term culture procedures such as shoot-tip culture and regeneration from primary explants, have been proposed as methods for obtaining large numbers of plants identical to the plant used as an explant source( Evans et al., 1984). Nicotiana spp. are one of the most important commercial crops in the world ( Liu and Zhang, 2008). Nicotiana alata is member from family solanacea, it is ornamental plant and the diploid cells contains 18 chromosomes. Nitsch (1969) reported the first production of haploid plants through anther culture and regeneration of plants of Nicotiana alata, For these reasons they have been considered to suitable candidates for model species in somatic cell genetics research( Bourgin et al., 1979). Radiobiological studies on plant tissues in culture may provide information on the cell growth behavior, radiosensitivity and the induction of mutations. The radiosensitivity of plants and calli can be manifested mostly in three

  9. A genomic copy number signature predicts radiation exposure in post-Chernobyl breast cancer.

    Science.gov (United States)

    Wilke, Christina M; Braselmann, Herbert; Hess, Julia; Klymenko, Sergiy V; Chumak, Vadim V; Zakhartseva, Liubov M; Bakhanova, Elena V; Walch, Axel K; Selmansberger, Martin; Samaga, Daniel; Weber, Peter; Schneider, Ludmila; Fend, Falko; Bösmüller, Hans C; Zitzelsberger, Horst; Unger, Kristian

    2018-04-16

    Breast cancer is the second leading cause of cancer death among women worldwide and besides life style, age and genetic risk factors, exposure to ionizing radiation is known to increase the risk for breast cancer. Further, DNA copy number alterations (CNAs), which can result from radiation-induced double-strand breaks, are frequently occurring in breast cancer cells. We set out to identify a signature of CNAs discriminating breast cancers from radiation-exposed and non-exposed female patients. We analyzed resected breast cancer tissues from 68 exposed female Chernobyl clean-up workers and evacuees and 68 matched non-exposed control patients for CNAs by array comparative genomic hybridization analysis (aCGH). Using a stepwise forward-backward selection approach a non-complex CNA signature, that is, less than ten features, was identified in the training data set, which could be subsequently validated in the validation data set (p value < 0.05). The signature consisted of nine copy number regions located on chromosomal bands 7q11.22-11.23, 7q21.3, 16q24.3, 17q21.31, 20p11.23-11.21, 1p21.1, 2q35, 2q35, 6p22.2. The signature was independent of any clinical characteristics of the patients. In all, we identified a CNA signature that has the potential to allow identification of radiation-associated breast cancer at the individual level. © 2018 UICC.

  10. Accurate measurement of gene copy number for human alpha-defensin DEFA1A3.

    Science.gov (United States)

    Khan, Fayeza F; Carpenter, Danielle; Mitchell, Laura; Mansouri, Omniah; Black, Holly A; Tyson, Jess; Armour, John A L

    2013-10-20

    Multi-allelic copy number variants include examples of extensive variation between individuals in the copy number of important genes, most notably genes involved in immune function. The definition of this variation, and analysis of its impact on function, has been hampered by the technical difficulty of large-scale but accurate typing of genomic copy number. The copy-variable alpha-defensin locus DEFA1A3 on human chromosome 8 commonly varies between 4 and 10 copies per diploid genome, and presents considerable challenges for accurate high-throughput typing. In this study, we developed two paralogue ratio tests and three allelic ratio measurements that, in combination, provide an accurate and scalable method for measurement of DEFA1A3 gene number. We combined information from different measurements in a maximum-likelihood framework which suggests that most samples can be assigned to an integer copy number with high confidence, and applied it to typing 589 unrelated European DNA samples. Typing the members of three-generation pedigrees provided further reassurance that correct integer copy numbers had been assigned. Our results have allowed us to discover that the SNP rs4300027 is strongly associated with DEFA1A3 gene copy number in European samples. We have developed an accurate and robust method for measurement of DEFA1A3 copy number. Interrogation of rs4300027 and associated SNPs in Genome-Wide Association Study SNP data provides no evidence that alpha-defensin copy number is a strong risk factor for phenotypes such as Crohn's disease, type I diabetes, HIV progression and multiple sclerosis.

  11. GenomeCAT: a versatile tool for the analysis and integrative visualization of DNA copy number variants.

    Science.gov (United States)

    Tebel, Katrin; Boldt, Vivien; Steininger, Anne; Port, Matthias; Ebert, Grit; Ullmann, Reinhard

    2017-01-06

    The analysis of DNA copy number variants (CNV) has increasing impact in the field of genetic diagnostics and research. However, the interpretation of CNV data derived from high resolution array CGH or NGS platforms is complicated by the considerable variability of the human genome. Therefore, tools for multidimensional data analysis and comparison of patient cohorts are needed to assist in the discrimination of clinically relevant CNVs from others. We developed GenomeCAT, a standalone Java application for the analysis and integrative visualization of CNVs. GenomeCAT is composed of three modules dedicated to the inspection of single cases, comparative analysis of multidimensional data and group comparisons aiming at the identification of recurrent aberrations in patients sharing the same phenotype, respectively. Its flexible import options ease the comparative analysis of own results derived from microarray or NGS platforms with data from literature or public depositories. Multidimensional data obtained from different experiment types can be merged into a common data matrix to enable common visualization and analysis. All results are stored in the integrated MySQL database, but can also be exported as tab delimited files for further statistical calculations in external programs. GenomeCAT offers a broad spectrum of visualization and analysis tools that assist in the evaluation of CNVs in the context of other experiment data and annotations. The use of GenomeCAT does not require any specialized computer skills. The various R packages implemented for data analysis are fully integrated into GenomeCATs graphical user interface and the installation process is supported by a wizard. The flexibility in terms of data import and export in combination with the ability to create a common data matrix makes the program also well suited as an interface between genomic data from heterogeneous sources and external software tools. Due to the modular architecture the functionality of

  12. Quadruplex MAPH: improvement of throughput in high-resolution copy number screening

    Directory of Open Access Journals (Sweden)

    Walker Susan

    2009-09-01

    Full Text Available Abstract Background Copy number variation (CNV in the human genome is recognised as a widespread and important source of human genetic variation. Now the challenge is to screen for these CNVs at high resolution in a reliable, accurate and cost-effective way. Results Multiplex Amplifiable Probe Hybridisation (MAPH is a sensitive, high-resolution technology appropriate for screening for CNVs in a defined region, for a targeted population. We have developed MAPH to a highly multiplexed format ("QuadMAPH" that allows the user a four-fold increase in the number of loci tested simultaneously. We have used this method to analyse a genomic region of 210 kb, including the MSH2 gene and 120 kb of flanking DNA. We show that the QuadMAPH probes report copy number with equivalent accuracy to simplex MAPH, reliably demonstrating diploid copy number in control samples and accurately detecting deletions in Hereditary Non-Polyposis Colorectal Cancer (HNPCC samples. Conclusion QuadMAPH is an accurate, high-resolution method that allows targeted screening of large numbers of subjects without the expense of genome-wide approaches. Whilst we have applied this technique to a region of the human genome, it is equally applicable to the genomes of other organisms.

  13. Sequencing the CHO DXB11 genome reveals regional variations in genomic stability and haploidy

    DEFF Research Database (Denmark)

    Kaas, Christian Schrøder; Kristensen, Claus; Betenbaugh, Michael J.

    2015-01-01

    Background: The DHFR negative CHO DXB11 cell line (also known as DUX-B11 and DUKX) was historically the first CHO cell line to be used for large scale production of heterologous proteins and is still used for production of a number of complex proteins.  Results: Here we present the genomic sequence...... of the CHO DXB11 genome sequenced to a depth of 33x. Overall a significant genomic drift was seen favoring GC -> AT point mutations in line with the chemical mutagenesis strategy used for generation of the cell line. The sequencing depth for each gene in the genome revealed distinct peaks at sequencing...... in eight additional analyzed CHO genomes (15-20% haploidy) but not in the genome of the Chinese hamster. The dhfr gene is confirmed to be haploid in CHO DXB11; transcriptionally active and the remaining allele contains a G410C point mutation causing a Thr137Arg missense mutation. We find similar to 2...

  14. Genomic and karyotypic variation in Drosophila parasitoids (Hymenoptera, Cynipoidea, Figitidae

    Directory of Open Access Journals (Sweden)

    Vladimir Gokhman

    2011-08-01

    Full Text Available Drosophila melanogaster Meigen, 1830 has served as a model insect for over a century. Sequencing of the 11 additional Drosophila Fallen, 1823 species marks substantial progress in comparative genomics of this genus. By comparison, practically nothing is known about the genome size or genome sequences of parasitic wasps of Drosophila. Here, we present the first comparative analysis of genome size and karyotype structures of Drosophila parasitoids of the Leptopilina Förster, 1869 and Ganaspis Förster, 1869 species. The gametic genome size of Ganaspis xanthopoda (Ashmead, 1896 is larger than those of the three Leptopilina species studied. The genome sizes of all parasitic wasps studied here are also larger than those known for all Drosophila species. Surprisingly, genome sizes of these Drosophila parasitoids exceed the average value known for all previously studied Hymenoptera. The haploid chromosome number of both Leptopilina heterotoma (Thomson, 1862 and L. victoriae Nordlander, 1980 is ten. A chromosomal fusion appears to have produced a distinct karyotype for L. boulardi (Barbotin, Carton et Keiner-Pillault, 1979 (n = 9, whose genome size is smaller than that of wasps of the L. heterotoma clade. Like L. boulardi, the haploid chromosome number for G. xanthopoda is also nine. Our studies reveal a positive, but non linear, correlation between the genome size and total chromosome length in Drosophila parasitoids. These Drosophila parasitoids differ widely in their host range, and utilize different infection strategies to overcome host defense. Their comparative genomics, in relation to their exceptionally well-characterized hosts, will prove to be valuable for understanding the molecular basis of the host-parasite arms race and how such mechanisms shape the genetic structures of insect communities.

  15. Diploid, but not haploid, human embryonic stem cells can be derived from microsurgically repaired tripronuclear human zygotes

    Science.gov (United States)

    Fan, Yong; Li, Rong; Huang, Jin; Yu, Yang; Qiao, Jie

    2013-01-01

    Human embryonic stem cells have shown tremendous potential in regenerative medicine, and the recent progress in haploid embryonic stem cells provides new insights for future applications of embryonic stem cells. Disruption of normal fertilized embryos remains controversial; thus, the development of a new source for human embryonic stem cells is important for their usefulness. Here, we investigated the feasibility of haploid and diploid embryo reconstruction and embryonic stem cell derivation using microsurgically repaired tripronuclear human zygotes. Diploid and haploid zygotes were successfully reconstructed, but a large proportion of them still had a tripolar spindle assembly. The reconstructed embryos developed to the blastocyst stage, although the loss of chromosomes was observed in these zygotes. Finally, triploid and diploid human embryonic stem cells were derived from tripronuclear and reconstructed zygotes (from which only one pronucleus was removed), but haploid human embryonic stem cells were not successfully derived from the reconstructed zygotes when two pronuclei were removed. Both triploid and diploid human embryonic stem cells showed the general characteristics of human embryonic stem cells. These results indicate that the lower embryo quality resulting from abnormal spindle assembly contributed to the failure of the haploid embryonic stem cell derivation. However, the successful derivation of diploid embryonic stem cells demonstrated that microsurgical tripronuclear zygotes are an alternative source of human embryonic stem cells. In the future, improving spindle assembly will facilitate the application of triploid zygotes to the field of haploid embryonic stem cells. PMID:23255130

  16. Comparisons of Copy Number, Genomic Structure, and Conserved Motifs for α-Amylase Genes from Barley, Rice, and Wheat

    Directory of Open Access Journals (Sweden)

    Qisen Zhang

    2017-10-01

    Full Text Available Barley is an important crop for the production of malt and beer. However, crops such as rice and wheat are rarely used for malting. α-amylase is the key enzyme that degrades starch during malting. In this study, we compared the genomic properties, gene copies, and conserved promoter motifs of α-amylase genes in barley, rice, and wheat. In all three crops, α-amylase consists of four subfamilies designated amy1, amy2, amy3, and amy4. In wheat and barley, members of amy1 and amy2 genes are localized on chromosomes 6 and 7, respectively. In rice, members of amy1 genes are found on chromosomes 1 and 2, and amy2 genes on chromosome 6. The barley genome has six amy1 members and three amy2 members. The wheat B genome contains four amy1 members and three amy2 members, while the rice genome has three amy1 members and one amy2 member. The B genome has mostly amy1 and amy2 members among the three wheat genomes. Amy1 promoters from all three crop genomes contain a GA-responsive complex consisting of a GA-responsive element (CAATAAA, pyrimidine box (CCTTTT and TATCCAT/C box. This study has shown that amy1 and amy2 from both wheat and barley have similar genomic properties, including exon/intron structures and GA-responsive elements on promoters, but these differ in rice. Like barley, wheat should have sufficient amy activity to degrade starch completely during malting. Other factors, such as high protein with haze issues and the lack of husk causing Lauting difficulty, may limit the use of wheat for brewing.

  17. Genome-wide association study identified copy number variants important for appendicular lean mass.

    Science.gov (United States)

    Ran, Shu; Liu, Yong-Jun; Zhang, Lei; Pei, Yufang; Yang, Tie-Lin; Hai, Rong; Han, Ying-Ying; Lin, Yong; Tian, Qing; Deng, Hong-Wen

    2014-01-01

    Skeletal muscle is a major component of the human body. Age-related loss of muscle mass and function contributes to some public health problems such as sarcopenia and osteoporosis. Skeletal muscle, mainly composed of appendicular lean mass (ALM), is a heritable trait. Copy number variation (CNV) is a common type of human genome variant which may play an important role in the etiology of many human diseases. In this study, we performed genome-wide association analyses of CNV for ALM in 2,286 Caucasian subjects. We then replicated the major findings in 1,627 Chinese subjects. Two CNVs, CNV1191 and CNV2580, were detected to be associated with ALM (p = 2.26×10(-2) and 3.34×10(-3), respectively). In the Chinese replication sample, the two CNVs achieved p-values of 3.26×10(-2) and 0.107, respectively. CNV1191 covers a gene, GTPase of the immunity-associated protein family (GIMAP1), which is important for skeletal muscle cell survival/death in humans. CNV2580 is located in the Serine hydrolase-like protein (SERHL) gene, which plays an important role in normal peroxisome function and skeletal muscle growth in response to mechanical stimuli. In summary, our study suggested two novel CNVs and the related genes that may contribute to variation in ALM.

  18. Genome-wide association study identified copy number variants important for appendicular lean mass.

    Directory of Open Access Journals (Sweden)

    Shu Ran

    Full Text Available Skeletal muscle is a major component of the human body. Age-related loss of muscle mass and function contributes to some public health problems such as sarcopenia and osteoporosis. Skeletal muscle, mainly composed of appendicular lean mass (ALM, is a heritable trait. Copy number variation (CNV is a common type of human genome variant which may play an important role in the etiology of many human diseases. In this study, we performed genome-wide association analyses of CNV for ALM in 2,286 Caucasian subjects. We then replicated the major findings in 1,627 Chinese subjects. Two CNVs, CNV1191 and CNV2580, were detected to be associated with ALM (p = 2.26×10(-2 and 3.34×10(-3, respectively. In the Chinese replication sample, the two CNVs achieved p-values of 3.26×10(-2 and 0.107, respectively. CNV1191 covers a gene, GTPase of the immunity-associated protein family (GIMAP1, which is important for skeletal muscle cell survival/death in humans. CNV2580 is located in the Serine hydrolase-like protein (SERHL gene, which plays an important role in normal peroxisome function and skeletal muscle growth in response to mechanical stimuli. In summary, our study suggested two novel CNVs and the related genes that may contribute to variation in ALM.

  19. Engineered promoters enable constant gene expression at any copy number in bacteria.

    Science.gov (United States)

    Segall-Shapiro, Thomas H; Sontag, Eduardo D; Voigt, Christopher A

    2018-04-01

    The internal environment of growing cells is variable and dynamic, making it difficult to introduce reliable parts, such as promoters, for genetic engineering. Here, we applied control-theoretic ideas to design promoters that maintained constant levels of expression at any copy number. Theory predicts that independence to copy number can be achieved by using an incoherent feedforward loop (iFFL) if the negative regulation is perfectly non-cooperative. We engineered iFFLs into Escherichia coli promoters using transcription-activator-like effectors (TALEs). These promoters had near-identical expression in different genome locations and plasmids, even when their copy number was perturbed by genomic mutations or changes in growth medium composition. We applied the stabilized promoters to show that a three-gene metabolic pathway to produce deoxychromoviridans could retain function without re-tuning when the stabilized-promoter-driven genes were moved from a plasmid into the genome.

  20. Probabilistic quantitative microbial risk assessment model of norovirus from wastewater irrigated vegetables in Ghana using genome copies and fecal indicator ratio conversion for estimating exposure dose

    DEFF Research Database (Denmark)

    Owusu-Ansah, Emmanuel de-Graft Johnson; Sampson, Angelina; Amponsah, Samuel K.

    2017-01-01

    physical and environmental factors that might influence the reliability of using indicator organisms in microbial risk assessment. The challenges facing analytical studies on virus enumeration (genome copies or particles) have contributed to the already existing lack of data in QMRA modelling. This study......The need to replace the commonly applied fecal indicator conversions ratio (an assumption of 1:10− 5 virus to fecal indicator organism) in Quantitative Microbial Risk Assessment (QMRA) with models based on quantitative data on the virus of interest has gained prominence due to the different...... attempts to fit a QMRA model to genome copies of norovirus data. The model estimates the risk of norovirus infection from the intake of vegetables irrigated with wastewater from different sources. The results were compared to the results of a corresponding model using the fecal indicator conversion ratio...

  1. A genome-wide association study of copy number variations with umbilical hernia in swine.

    Science.gov (United States)

    Long, Yi; Su, Ying; Ai, Huashui; Zhang, Zhiyan; Yang, Bin; Ruan, Guorong; Xiao, Shijun; Liao, Xinjun; Ren, Jun; Huang, Lusheng; Ding, Nengshui

    2016-06-01

    Umbilical hernia (UH) is one of the most common congenital defects in pigs, leading to considerable economic loss and serious animal welfare problems. To test whether copy number variations (CNVs) contribute to pig UH, we performed a case-control genome-wide CNV association study on 905 pigs from the Duroc, Landrace and Yorkshire breeds using the Porcine SNP60 BeadChip and penncnv algorithm. We first constructed a genomic map comprising 6193 CNVs that pertain to 737 CNV regions. Then, we identified eight CNVs significantly associated with the risk for UH in the three pig breeds. Six of seven significantly associated CNVs were validated using quantitative real-time PCR. Notably, a rare CNV (CNV14:13030843-13059455) encompassing the NUGGC gene was strongly associated with UH (permutation-corrected P = 0.0015) in Duroc pigs. This CNV occurred exclusively in seven Duroc UH-affected individuals. SNPs surrounding the CNV did not show association signals, indicating that rare CNVs may play an important role in complex pig diseases such as UH. The NUGGC gene has been implicated in human omphalocele and inguinal hernia. Our finding supports that CNVs, including the NUGGC CNV, contribute to the pathogenesis of pig UH. © 2016 Stichting International Foundation for Animal Genetics.

  2. Consequences of normalizing transcriptomic and genomic libraries of plant genomes using a duplex-specific nuclease and tetramethylammonium chloride.

    Science.gov (United States)

    Matvienko, Marta; Kozik, Alexander; Froenicke, Lutz; Lavelle, Dean; Martineau, Belinda; Perroud, Bertrand; Michelmore, Richard

    2013-01-01

    Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a method that utilizes a thermostable duplex-specific nuclease for reducing the high-copy components in transcriptomic and genomic libraries prior to sequencing. This reduces the time, cost, and computational effort of obtaining informative transcriptomic and genomic sequence data for both fully sequenced and non-sequenced genomes. It also reduces contamination from organellar DNA in preparations of nuclear DNA. Hybridization in the presence of 3 M tetramethylammonium chloride (TMAC), which equalizes the rates of hybridization of GC and AT nucleotide pairs, reduced the bias against sequences with high GC content. Consequences of this method on the reduction of high-copy and enrichment of low-copy sequences are reported for Arabidopsis and lettuce.

  3. Consequences of normalizing transcriptomic and genomic libraries of plant genomes using a duplex-specific nuclease and tetramethylammonium chloride.

    Directory of Open Access Journals (Sweden)

    Marta Matvienko

    Full Text Available Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a method that utilizes a thermostable duplex-specific nuclease for reducing the high-copy components in transcriptomic and genomic libraries prior to sequencing. This reduces the time, cost, and computational effort of obtaining informative transcriptomic and genomic sequence data for both fully sequenced and non-sequenced genomes. It also reduces contamination from organellar DNA in preparations of nuclear DNA. Hybridization in the presence of 3 M tetramethylammonium chloride (TMAC, which equalizes the rates of hybridization of GC and AT nucleotide pairs, reduced the bias against sequences with high GC content. Consequences of this method on the reduction of high-copy and enrichment of low-copy sequences are reported for Arabidopsis and lettuce.

  4. Genome engineering in human cells.

    Science.gov (United States)

    Song, Minjung; Kim, Young-Hoon; Kim, Jin-Soo; Kim, Hyongbum

    2014-01-01

    Genome editing in human cells is of great value in research, medicine, and biotechnology. Programmable nucleases including zinc-finger nucleases, transcription activator-like effector nucleases, and RNA-guided engineered nucleases recognize a specific target sequence and make a double-strand break at that site, which can result in gene disruption, gene insertion, gene correction, or chromosomal rearrangements. The target sequence complexities of these programmable nucleases are higher than 3.2 mega base pairs, the size of the haploid human genome. Here, we briefly introduce the structure of the human genome and the characteristics of each programmable nuclease, and review their applications in human cells including pluripotent stem cells. In addition, we discuss various delivery methods for nucleases, programmable nickases, and enrichment of gene-edited human cells, all of which facilitate efficient and precise genome editing in human cells.

  5. Computational correction of copy number effect improves specificity of CRISPR-Cas9 essentiality screens in cancer cells. | Office of Cancer Genomics

    Science.gov (United States)

    The CRISPR-Cas9 system has revolutionized gene editing both at single genes and in multiplexed loss-of-function screens, thus enabling precise genome-scale identification of genes essential for proliferation and survival of cancer cells. However, previous studies have reported that a gene-independent antiproliferative effect of Cas9-mediated DNA cleavage confounds such measurement of genetic dependency, thereby leading to false-positive results in copy number-amplified regions.

  6. Cancer genomics

    DEFF Research Database (Denmark)

    Norrild, Bodil; Guldberg, Per; Ralfkiær, Elisabeth Methner

    2007-01-01

    Almost all cells in the human body contain a complete copy of the genome with an estimated number of 25,000 genes. The sequences of these genes make up about three percent of the genome and comprise the inherited set of genetic information. The genome also contains information that determines whe...

  7. Influence of time of auxin application on wheat haploid embrio formation

    Directory of Open Access Journals (Sweden)

    Prodanović Slaven

    2005-01-01

    Full Text Available A hybrid interspecies zygote appears after crosses between wheat and maize Zygote derived after usual self-fertilization in wheat is dividing by mitotic divisions into embryo. However, interspecies zygote aborts soon. Auxin treatment is widely used to promote its development. Growth hormones auxins have stimulative ortoxic effects on plant tissue sin relation to its concentration and the time of application. In this paper the effect of time of auxin dicamba application on embryo in wheat x maize crosses was investigated. Chromosomes of pollen donor parent are eliminated quickly in cells of such embryos and they become haploid. It was concluded that for the production of haploid embryos the best time for auxin application is one day after pollination with maize.

  8. The genome of the fire ant Solenopsis invicta

    DEFF Research Database (Denmark)

    Wurm, Yannick; Wang, John; Riba-Grognuz, Oksana

    2011-01-01

    Ants have evolved very complex societies and are key ecosystem members. Some ants, such as the fire ant Solenopsis invicta, are also major pests. Here, we present a draft genome of S. invicta, assembled from Roche 454 and Illumina sequencing reads obtained from a focal haploid male and his brothe...

  9. Multi-platform whole-genome microarray analyses refine the epigenetic signature of breast cancer metastasis with gene expression and copy number.

    Directory of Open Access Journals (Sweden)

    Joseph Andrews

    2010-01-01

    Full Text Available We have previously identified genome-wide DNA methylation changes in a cell line model of breast cancer metastasis. These complex epigenetic changes that we observed, along with concurrent karyotype analyses, have led us to hypothesize that complex genomic alterations in cancer cells (deletions, translocations and ploidy are superimposed over promoter-specific methylation events that are responsible for gene-specific expression changes observed in breast cancer metastasis.We undertook simultaneous high-resolution, whole-genome analyses of MDA-MB-468GFP and MDA-MB-468GFP-LN human breast cancer cell lines (an isogenic, paired lymphatic metastasis cell line model using Affymetrix gene expression (U133, promoter (1.0R, and SNP/CNV (SNP 6.0 microarray platforms to correlate data from gene expression, epigenetic (DNA methylation, and combination copy number variant/single nucleotide polymorphism microarrays. Using Partek Software and Ingenuity Pathway Analysis we integrated datasets from these three platforms and detected multiple hypomethylation and hypermethylation events. Many of these epigenetic alterations correlated with gene expression changes. In addition, gene dosage events correlated with the karyotypic differences observed between the cell lines and were reflected in specific promoter methylation patterns. Gene subsets were identified that correlated hyper (and hypo methylation with the loss (or gain of gene expression and in parallel, with gene dosage losses and gains, respectively. Individual gene targets from these subsets were also validated for their methylation, expression and copy number status, and susceptible gene pathways were identified that may indicate how selective advantage drives the processes of tumourigenesis and metastasis.Our approach allows more precisely profiling of functionally relevant epigenetic signatures that are associated with cancer progression and metastasis.

  10. Two tandemly repeated telomere-associated sequences in Nicotiana plumbaginifolia.

    Science.gov (United States)

    Chen, C M; Wang, C T; Wang, C J; Ho, C H; Kao, Y Y; Chen, C C

    1997-12-01

    Two tandemly repeated telomere-associated sequences, NP3R and NP4R, have been isolated from Nicotiana plumbaginifolia. The length of a repeating unit for NP3R and NP4R is 165 and 180 nucleotides respectively. The abundance of NP3R, NP4R and telomeric repeats is, respectively, 8.4 x 10(4), 6 x 10(3) and 1.5 x 10(6) copies per haploid genome of N. plumbaginifolia. Fluorescence in situ hybridization revealed that NP3R is located at the ends and/or in interstitial regions of all 10 chromosomes and NP4R on the terminal regions of three chromosomes in the haploid genome of N. plumbaginifolia. Sequence homology search revealed that not only are NP3R and NP4R homologous to HRS60 and GRS, respectively, two tandem repeats isolated from N. tabacum, but that NP3R and NP4R are also related to each other, suggesting that they originated from a common ancestral sequence. The role of these repeated sequences in chromosome healing is discussed based on the observation that two to three copies of a telomere-similar sequence were present in each repeating unit of NP3R and NP4R.

  11. CoNVaQ: a web tool for copy number variation-based association studies

    DEFF Research Database (Denmark)

    Larsen, Simon Jonas; do Canto, Luisa Matos; Rogatto, Silvia Regina

    2018-01-01

    Copy number variations (CNVs) are large segments of the genome that are duplicated or deleted. Structural variations in the genome have been linked to many complex diseases. Similar to how genome-wide association studies (GWAS) have helped discover single-nucleotide polymorphisms linked to diseas...

  12. Study on the in vitro culture of cut plants in wheat haploid embryo induction by a wheat × maize cross

    Institute of Scientific and Technical Information of China (English)

    Jian GU; Kun LIU; Shaoxiang LI; Yuxian TIAN; Hexian YANG; Mujun YANG

    2008-01-01

    The wheat × maize system is one of the most effective ways to produce haploids in wheat. Whether and how it could be successfully applied in practical breeding mostly depends upon the efficiency of haploid embryo pro-duction. To perfect the protocols of haploid embryo induc-tion, the efficiency of haploid embryo production between in vitro culture of cut plant and intact plant growth for hybrid spikes with two F1 wheat hybrids and two maize varieties was compared. Effects of different cutting plant times and formulas of nutrient solutions for cut plant cul-ture on haploid embryo formation were also studied. Results indicated that the embryo rate of in vitro culture was 3.29 times that of intact plant growth, with the figures of 31.6% vs 9.6%, respectively. The optimal time for cut plant culture was 24 h after pollination. Formulas of nutri-ent solutions significantly affected the efficiency of haploid embryo induction. With an embryo rate of 0-35.5%, add-could raise the caryopsis and embryo rates. According to this study, the best medium for cut plant culture was: phate, with which a caryopsis rate of 95% and an embryo rate of about 30% could be obtained.

  13. Genome-wide identification of significant aberrations in cancer genome.

    Science.gov (United States)

    Yuan, Xiguo; Yu, Guoqiang; Hou, Xuchu; Shih, Ie-Ming; Clarke, Robert; Zhang, Junying; Hoffman, Eric P; Wang, Roger R; Zhang, Zhen; Wang, Yue

    2012-07-27

    Somatic Copy Number Alterations (CNAs) in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC), a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1) exploiting the intrinsic correlation among consecutive probes to assign a score to each CNA unit instead of single probes; (2) performing permutations on CNA units that preserve correlations inherent in the copy number data; and (3) iteratively detecting Significant Copy Number Aberrations (SCAs) and estimating an unbiased null distribution by applying an SCA-exclusive permutation scheme. We test and compare the performance of SAIC against four peer methods (GISTIC, STAC, KC-SMART, CMDS) on a large number of simulation datasets. Experimental results show that SAIC outperforms peer methods in terms of larger area under the Receiver Operating Characteristics curve and increased detection power. We then apply SAIC to analyze structural genomic aberrations acquired in four real cancer genome-wide copy number data sets (ovarian cancer, metastatic prostate cancer, lung adenocarcinoma, glioblastoma). When compared with previously reported results, SAIC successfully identifies most SCAs known to be of biological significance and associated with oncogenes (e.g., KRAS, CCNE1, and MYC) or tumor suppressor genes (e.g., CDKN2A/B). Furthermore, SAIC identifies a number of novel SCAs in these copy number data that encompass tumor related genes and may warrant further studies. Supported by a well-grounded theoretical framework, SAIC has been developed and used to identify SCAs in various cancer copy number data sets, providing useful information to study the landscape of cancer genomes. Open-source and platform-independent SAIC software is

  14. Transformation of haploid, microspore-derived cell suspension protoplasts of rice (Oryza sativa L.).

    Science.gov (United States)

    Chaïr, H; Legavre, T; Guiderdoni, E

    1996-06-01

    We compared the transient activity of three cereal gene-derived promoter-gus fusions and the efficiency of selection mediated by three different selectable genes in a polyethylene glycol transformation system with haploid cell suspension protoplasts of rice. The maize ubiquitin promoter was found to be the most active in transformed protoplasts, and selection on ammonium glufosinate mediated by the bar gene was the most efficient for producing resistant calluses. Cotransformation of protoplasts with two separate plasmids carrying the gus and the bar genes, at either a 2∶1 or 1∶1 ratio, led to 0.8 × 10(-5) and 1.6 × 10(-5) resistant callus recovery frequencies and 59.7 and 37.9 cotransformation efficiencies respectively. No escapes were detected in dot blot analyses of 100 resistant calluses with a probe consisting of the bar coding region. Cotransformation efficiency, based on resistance to basta and β-glucuronidase staining of the leaf tissue of 115 regenerated plants, was 47%. Resistance tests and Southern analysis of seed progenies of three diploid transgenic plants demonstrated homozygous integration of multiple copies of the transgene at one locus at least in the first plant, heterozygous integration at one locus in the second plant and heterozygous integration at two loci in the third plant.

  15. Genetic Diversity in Haploid Nicotiana alata Induced by Gamma Irradiation, Salt Tolerance and Detection of These Differences by RAPD

    Directory of Open Access Journals (Sweden)

    Ayman EL-FIKI

    2016-03-01

    Full Text Available Haploid plants of Nicotiana alata were cultured in vitro on MS medium with IAA + KIN. The resulting plantlets were irradiated using gamma radiation doses of 10, 15, 20 and 25 Gy. Single node pieces were cut and transferred onto fresh MS medium. Gamma radiation doses caused the death of 9% and up to 28% of explants. NaCl concentrations caused the death of 8% up to 36% of explants, while the combined effect between gamma radiation doses and salinity had an impact suffused on the percentage of survival. The combined effect of gamma radiation doses 20 Gy and 25 Gy on NaCl concentrations of 100, 150 and 200 mM were deadly. Even more, the combined effect of gamma radiation doses and salinity had a severe negative impact on both the proline content and total soluble protein. Random amplified polymorphic DNA (RAPD analysis was used to determine the degree of genetic variation in treated haploid Nicotiana alata plants. Total genomic DNAs from different haploid plantlets treated were amplified using five arbitrary primers. Two hundred and seventy bands were detected from plantlets irradiated with doses of 15, 20 and 25 Gy, with polymorphic band number 226 (83.7%. The total number of bands resulted from plant grew on 150 mM and 200 mM NaCl were 260 bands with polymorphic bands 185 (85.6%. However, the total number of bands produced from combined effects between gamma rays and salinity (20 Gy X 50 mM NaCl, 20 Gy X 100 mM NaCl and 25 Gy X 50 mM NaCl were 270, with polymorphic band number 231 (85.5%. High similarity between treatments was revealed. Treatments relationships were estimated through cluster analysis (UPGMA based on RAPD data.

  16. The Complete Chloroplast Genome of Catha edulis: A Comparative Analysis of Genome Features with Related Species

    Directory of Open Access Journals (Sweden)

    Cuihua Gu

    2018-02-01

    Full Text Available Qat (Catha edulis, Celastraceae is a woody evergreen species with great economic and cultural importance. It is cultivated for its stimulant alkaloids cathine and cathinone in East Africa and southwest Arabia. However, genome information, especially DNA sequence resources, for C. edulis are limited, hindering studies regarding interspecific and intraspecific relationships. Herein, the complete chloroplast (cp genome of Catha edulis is reported. This genome is 157,960 bp in length with 37% GC content and is structurally arranged into two 26,577 bp inverted repeats and two single-copy areas. The size of the small single-copy and the large single-copy regions were 18,491 bp and 86,315 bp, respectively. The C. edulis cp genome consists of 129 coding genes including 37 transfer RNA (tRNA genes, 8 ribosomal RNA (rRNA genes, and 84 protein coding genes. For those genes, 112 are single copy genes and 17 genes are duplicated in two inverted regions with seven tRNAs, four rRNAs, and six protein coding genes. The phylogenetic relationships resolved from the cp genome of qat and 32 other species confirms the monophyly of Celastraceae. The cp genomes of C. edulis, Euonymus japonicus and seven Celastraceae species lack the rps16 intron, which indicates an intron loss took place among an ancestor of this family. The cp genome of C. edulis provides a highly valuable genetic resource for further phylogenomic research, barcoding and cp transformation in Celastraceae.

  17. The Complete Chloroplast Genome of Catha edulis: A Comparative Analysis of Genome Features with Related Species

    Science.gov (United States)

    Tembrock, Luke R.; Zheng, Shaoyu; Wu, Zhiqiang

    2018-01-01

    Qat (Catha edulis, Celastraceae) is a woody evergreen species with great economic and cultural importance. It is cultivated for its stimulant alkaloids cathine and cathinone in East Africa and southwest Arabia. However, genome information, especially DNA sequence resources, for C. edulis are limited, hindering studies regarding interspecific and intraspecific relationships. Herein, the complete chloroplast (cp) genome of Catha edulis is reported. This genome is 157,960 bp in length with 37% GC content and is structurally arranged into two 26,577 bp inverted repeats and two single-copy areas. The size of the small single-copy and the large single-copy regions were 18,491 bp and 86,315 bp, respectively. The C. edulis cp genome consists of 129 coding genes including 37 transfer RNA (tRNA) genes, 8 ribosomal RNA (rRNA) genes, and 84 protein coding genes. For those genes, 112 are single copy genes and 17 genes are duplicated in two inverted regions with seven tRNAs, four rRNAs, and six protein coding genes. The phylogenetic relationships resolved from the cp genome of qat and 32 other species confirms the monophyly of Celastraceae. The cp genomes of C. edulis, Euonymus japonicus and seven Celastraceae species lack the rps16 intron, which indicates an intron loss took place among an ancestor of this family. The cp genome of C. edulis provides a highly valuable genetic resource for further phylogenomic research, barcoding and cp transformation in Celastraceae. PMID:29425128

  18. Genomic structure and evolution of the mating type locus in the green seaweed Ulva partita.

    Science.gov (United States)

    Yamazaki, Tomokazu; Ichihara, Kensuke; Suzuki, Ryogo; Oshima, Kenshiro; Miyamura, Shinichi; Kuwano, Kazuyoshi; Toyoda, Atsushi; Suzuki, Yutaka; Sugano, Sumio; Hattori, Masahira; Kawano, Shigeyuki

    2017-09-15

    The evolution of sex chromosomes and mating loci in organisms with UV systems of sex/mating type determination in haploid phases via genes on UV chromosomes is not well understood. We report the structure of the mating type (MT) locus and its evolutionary history in the green seaweed Ulva partita, which is a multicellular organism with an isomorphic haploid-diploid life cycle and mating type determination in the haploid phase. Comprehensive comparison of a total of 12.0 and 16.6 Gb of genomic next-generation sequencing data for mt - and mt + strains identified highly rearranged MT loci of 1.0 and 1.5 Mb in size and containing 46 and 67 genes, respectively, including 23 gametologs. Molecular evolutionary analyses suggested that the MT loci diverged over a prolonged period in the individual mating types after their establishment in an ancestor. A gene encoding an RWP-RK domain-containing protein was found in the mt - MT locus but was not an ortholog of the chlorophycean mating type determination gene MID. Taken together, our results suggest that the genomic structure and its evolutionary history in the U. partita MT locus are similar to those on other UV chromosomes and that the MT locus genes are quite different from those of Chlorophyceae.

  19. Analysis of Genome-Wide Copy Number Variations in Chinese Indigenous and Western Pig Breeds by 60 K SNP Genotyping Arrays

    Science.gov (United States)

    Sun, Yaqi; Wang, Hongyang; Wang, Chao; Yu, Shaobo; Liu, Jing; Zhang, Yu; Fan, Bin; Li, Kui; Liu, Bang

    2014-01-01

    Copy number variations (CNVs) represent a substantial source of structural variants in mammals and contribute to both normal phenotypic variability and disease susceptibility. Although low-resolution CNV maps are produced in many domestic animals, and several reports have been published about the CNVs of porcine genome, the differences between Chinese and western pigs still remain to be elucidated. In this study, we used Porcine SNP60 BeadChip and PennCNV algorithm to perform a genome-wide CNV detection in 302 individuals from six Chinese indigenous breeds (Tongcheng, Laiwu, Luchuan, Bama, Wuzhishan and Ningxiang pigs), three western breeds (Yorkshire, Landrace and Duroc) and one hybrid (Tongcheng×Duroc). A total of 348 CNV Regions (CNVRs) across genome were identified, covering 150.49 Mb of the pig genome or 6.14% of the autosomal genome sequence. In these CNVRs, 213 CNVRs were found to exist only in the six Chinese indigenous breeds, and 60 CNVRs only in the three western breeds. The characters of CNVs in four Chinese normal size breeds (Luchuan, Tongcheng and Laiwu pigs) and two minipig breeds (Bama and Wuzhishan pigs) were also analyzed in this study. Functional annotation suggested that these CNVRs possess a great variety of molecular function and may play important roles in phenotypic and production traits between Chinese and western breeds. Our results are important complementary to the CNV map in pig genome, which provide new information about the diversity of Chinese and western pig breeds, and facilitate further research on porcine genome CNVs. PMID:25198154

  20. Response of haploid and diploid protoplasts from Datura innoxia Mill. and Petunia hybrida L. to treatment with X-rays and a chemical mutagen

    International Nuclear Information System (INIS)

    Krumbiegel, G.

    1979-01-01

    Haploid and diploid protoplasts of the two Solanaceous species Datura innoxia Mill. and Petunia Hybridia L., were exposed to two different mutagens, increased doses of X-rays and N-methyl-N'-nitro-N-nitrosoguanidine (MNNG). With both species the survival rates of haploid protoplasts decreased exponentially with increased doses of X-rays and increased concentrations of MNNG. Diploid protoplasts showed a higher resistance than haploids only at higher mutagen doses or concentrations. After the MNNG-treatment of haploid protoplasts from Datura innoxia, four mutants with altered pigment patterns were isolated. (author)

  1. Sequence diversity and copy number variation of Mutator-like transposases in wheat

    Directory of Open Access Journals (Sweden)

    Nobuaki Asakura

    2008-01-01

    Full Text Available Partial transposase-coding sequences of Mutator-like elements (MULEs were isolated from a wild einkorn wheat, Triticum urartu, by degenerate PCR. The isolated sequences were classified into a MuDR or Class I clade and divided into two distinct subclasses (subclass I and subclass II. The average pair-wise identity between members of both subclasses was 58.8% at the nucleotide sequence level. Sequence diversity of subclass I was larger than that of subclass II. DNA gel blot analysis showed that subclass I was present as low copy number elements in the genomes of all Triticum and Aegilops accessions surveyed, while subclass II was present as high copy number elements. These two subclasses seemed uncapable of recognizing each other for transposition. The number of copies of subclass II elements was much higher in Aegilops with the S, Sl and D genomes and polyploid Triticum species than in diploid Triticum with the A genome, indicating that active transposition occurred in S, Sl and D genomes before polyploidization. DNA gel blot analysis of six species selected from three subfamilies of Poaceae demonstrated that only the tribe Triticeae possessed both subclasses. These results suggest that the differentiation of these two subclasses occurred before or immediately after the establishment of the tribe Triticeae.

  2. Prediction of a deletion copy number variant by a dense SNP panel

    NARCIS (Netherlands)

    Kadri, N.K.; Koks, P.D.; Meuwissen, T.H.E.

    2012-01-01

    Background: A newly recognized type of genetic variation, Copy Number Variation (CNV), is detected in mammalian genomes, e.g. the cattle genome. This form of variation can potentially cause phenotypic variation. Our objective was to determine whether dense SNP (single nucleotide polymorphisms)

  3. A genome-wide copy number variant study of suicidal behavior.

    Directory of Open Access Journals (Sweden)

    Jeffrey A Gross

    Full Text Available Suicide and suicide attempts are complex behaviors that result from the interaction of different factors, including genetic variants that increase the predisposition to suicidal behaviors. Copy number variations (CNVs are deletions or duplications of a segment of DNA usually larger than one kilobase. These structural genetic changes, although quite rare, have been associated with genetic liability to mental disorders, such as autism, schizophrenia, and bipolar disorder. No genome-wide level studies have been published investigating the potential role of CNVs in suicidal behaviors. Based on single-nucleotide polymorphism array data, we followed the Penn-CNV standards to detect CNVs in 1,608 subjects, comprising 475 suicide and suicide attempt cases and 1,133 controls. Although the initial algorithms determined the presence of CNVs on chromosomes 6 and 12 in seven and eight cases, respectively, compared with none of the controls, visual inspection of the raw data did not support this finding. Furthermore we were unable to validate these findings by CNV-specific real-time polymerase chain reaction. Additionally, rare CNV burden analysis did not find an association between the frequency or length of rare CNVs and suicidal behavior in our sample population. Although our findings suggest CNVs do not play an important role in the etiology of suicidal behaviors, they are not inconsistent with the strong evidence from the literature suggesting that other genetic variants account for a portion of the total phenotypic variability in suicidal behavior.

  4. Evaluating the Production of Doubled Haploid Wheat Lines Using Various Methods of Wheat and Maize Crossing to Develop Heat-Tolerant Wheat Varieties

    Directory of Open Access Journals (Sweden)

    Tayebeh BAKHSHI

    2017-02-01

    Full Text Available Abstract. In this study, chromosome elimination method was used to develop doubled haploid wheat lines via crosses with maize. The plant materials used included 11, F1 wheat genotypes and maize genotype BC572. In these crosses, the maize plant was used as the male parent.Three methods of haploid production in wheat comprising conventional (A, detached-tiller culture (B and intermediate (C techniques were used and compared. The traits such as the number of seeds set, the number of embryos obtained and the number of haploid seedlings produced were studied. Comparisons showed that among various methods of storing wheat spikes, method (C was better than other techniques in terms of the percentage of seed production, embryo formation and haploid seedling production. Also, in all three methods, the percentage of seed production, the percentage of embryo formation and the percentage of haploid seedling production were respectively equal to 76.84, 25.22 and 51.89. Among the wheat genotypes in all three methods, genotype DH-133 with 87.28 percent seed set and genotype DH-132 with 32.71 percent embryo formation and 65.08 percent haploid seedling production were the best genotypes. A total of 92 doubled haploid lines were produced. In the field evaluations of 86 doubled haploid lines, traits such as growing season, plant height, lodging, kernel yield and 1000 kernel weight were examined. Finally, 3 lines were selected for adaptation and stability testing under heat stress conditions.Keywords: Wheat, Doubled haploid, Chromosome elimination, Detached-tiller culture Özet. Bu çalışmada, mısır ile çaprazlarla çift katlı haploid buğday hatlarının geliştirilmesi için kromozom eliminasyon yöntemi kullanılmıştır. Kullanılan bitki materyalleri 11, F1 buğday genotipleri ve BC572 mısır genotipini içermektedir. Bu çaprazlarda, mısır bitkisi erkek ebeveyn olarak kullanılmıştır. Geleneksel (A, ayrık-yeke kültürü (B ve ara (C

  5. A universal genomic coordinate translator for comparative genomics.

    Science.gov (United States)

    Zamani, Neda; Sundström, Görel; Meadows, Jennifer R S; Höppner, Marc P; Dainat, Jacques; Lantz, Henrik; Haas, Brian J; Grabherr, Manfred G

    2014-06-30

    Genomic duplications constitute major events in the evolution of species, allowing paralogous copies of genes to take on fine-tuned biological roles. Unambiguously identifying the orthology relationship between copies across multiple genomes can be resolved by synteny, i.e. the conserved order of genomic sequences. However, a comprehensive analysis of duplication events and their contributions to evolution would require all-to-all genome alignments, which increases at N2 with the number of available genomes, N. Here, we introduce Kraken, software that omits the all-to-all requirement by recursively traversing a graph of pairwise alignments and dynamically re-computing orthology. Kraken scales linearly with the number of targeted genomes, N, which allows for including large numbers of genomes in analyses. We first evaluated the method on the set of 12 Drosophila genomes, finding that orthologous correspondence computed indirectly through a graph of multiple synteny maps comes at minimal cost in terms of sensitivity, but reduces overall computational runtime by an order of magnitude. We then used the method on three well-annotated mammalian genomes, human, mouse, and rat, and show that up to 93% of protein coding transcripts have unambiguous pairwise orthologous relationships across the genomes. On a nucleotide level, 70 to 83% of exons match exactly at both splice junctions, and up to 97% on at least one junction. We last applied Kraken to an RNA-sequencing dataset from multiple vertebrates and diverse tissues, where we confirmed that brain-specific gene family members, i.e. one-to-many or many-to-many homologs, are more highly correlated across species than single-copy (i.e. one-to-one homologous) genes. Not limited to protein coding genes, Kraken also identifies thousands of newly identified transcribed loci, likely non-coding RNAs that are consistently transcribed in human, chimpanzee and gorilla, and maintain significant correlation of expression levels across

  6. Genome-Wide DNA Copy Number Analysis of Acute Lymphoblastic Leukemia Identifies New Genetic Markers Associated with Clinical Outcome.

    Directory of Open Access Journals (Sweden)

    Maribel Forero-Castro

    Full Text Available Identifying additional genetic alterations associated with poor prognosis in acute lymphoblastic leukemia (ALL is still a challenge.To characterize the presence of additional DNA copy number alterations (CNAs in children and adults with ALL by whole-genome oligonucleotide array (aCGH analysis, and to identify their associations with clinical features and outcome. Array-CGH was carried out in 265 newly diagnosed ALLs (142 children and 123 adults. The NimbleGen CGH 12x135K array (Roche was used to analyze genetic gains and losses. CNAs were analyzed with GISTIC and aCGHweb software. Clinical and biological variables were analyzed. Three of the patients showed chromothripsis (cth6, cth14q and cth15q. CNAs were associated with age, phenotype, genetic subtype and overall survival (OS. In the whole cohort of children, the losses on 14q32.33 (p = 0.019 and 15q13.2 (p = 0.04 were related to shorter OS. In the group of children without good- or poor-risk cytogenetics, the gain on 1p36.11 was a prognostic marker independently associated with shorter OS. In adults, the gains on 19q13.2 (p = 0.001 and Xp21.1 (p = 0.029, and the loss of 17p (p = 0.014 were independent markers of poor prognosis with respect to OS. In summary, CNAs are frequent in ALL and are associated with clinical parameters and survival. Genome-wide DNA copy number analysis allows the identification of genetic markers that predict clinical outcome, suggesting that detection of these genetic lesions will be useful in the management of patients newly diagnosed with ALL.

  7. Whole-genome sequencing of spermatocytic tumors provides insights into the mutational processes operating in the male germline

    DEFF Research Database (Denmark)

    Giannoulatou, Eleni; Maher, Geoffrey J; Ding, Zhihao

    2017-01-01

    Adult male germline stem cells (spermatogonia) proliferate by mitosis and, after puberty, generate spermatocytes that undertake meiosis to produce haploid spermatozoa. Germ cells are under evolutionary constraint to curtail mutations and maintain genome integrity. Despite constant turnover...

  8. Copy number variation identification and analysis of the chicken genome using a 60K SNP BeadChip.

    Science.gov (United States)

    Rao, Y S; Li, J; Zhang, R; Lin, X R; Xu, J G; Xie, L; Xu, Z Q; Wang, L; Gan, J K; Xie, X J; He, J; Zhang, X Q

    2016-08-01

    Copy number variation (CNV) is an important source of genetic variation in organisms and a main factor that affects phenotypic variation. A comprehensive study of chicken CNV can provide valuable information on genetic diversity and facilitate future analyses of associations between CNV and economically important traits in chickens. In the present study, an F2 full-sib chicken population (554 individuals), established from a cross between Xinghua and White Recessive Rock chickens, was used to explore CNV in the chicken genome. Genotyping was performed using a chicken 60K SNP BeadChip. A total of 1,875 CNV were detected with the PennCNV algorithm, and the average number of CNV was 3.42 per individual. The CNV were distributed across 383 independent CNV regions (CNVR) and covered 41 megabases (3.97%) of the chicken genome. Seven CNVR in 108 individuals were validated by quantitative real-time PCR, and 81 of these individuals (75%) also were detected with the PennCNV algorithm. In total, 274 CNVR (71.54%) identified in the current study were previously reported. Of these, 147 (38.38%) were reported in at least 2 studies. Additionally, 109 of the CNVR (28.46%) discovered here are novel. A total of 709 genes within or overlapping with the CNVR was retrieved. Out of the 2,742 quantitative trait loci (QTL) collected in the chicken QTL database, 43 QTL had confidence intervals overlapping with the CNVR, and 32 CNVR encompassed one or more functional genes. The functional genes located in the CNVR are likely to be the QTG that are associated with underlying economic traits. This study considerably expands our insight into the structural variation in the genome of chickens and provides an important resource for genomic variation, especially for genomic structural variation related to economic traits in chickens. © 2016 Poultry Science Association Inc.

  9. GRAbB : Selective Assembly of Genomic Regions, a New Niche for Genomic Research

    NARCIS (Netherlands)

    Brankovics, Balázs; Zhang, Hao; van Diepeningen, Anne D; van der Lee, Theo A J; Waalwijk, Cees; de Hoog, G Sybren

    GRAbB (Genomic Region Assembly by Baiting) is a new program that is dedicated to assemble specific genomic regions from NGS data. This approach is especially useful when dealing with multi copy regions, such as mitochondrial genome and the rDNA repeat region, parts of the genome that are often

  10. Haploid Barley from the Intergeneric Cross Hordeum vulgare x Psathyrostachys fragilis

    DEFF Research Database (Denmark)

    Bothmer, Roland; Jacobsen, Niels; Bagger Jørgensen, Rikke

    1984-01-01

    The intergeneric hybrid Hordeum vulgare x Psathyrostachys fragilis was fairly easily obtained. During each growing season the intermediate, perennial hybrid yielded haploid tillers of H. vulgare. Late in one season few, hybrid tillers headed. The morphology, cytology and enzymatic patterns...

  11. Advancing Eucalyptus genomics: identification and sequencing of lignin biosynthesis genes from deep-coverage BAC libraries

    Directory of Open Access Journals (Sweden)

    Kudrna David

    2011-03-01

    Full Text Available Abstract Background Eucalyptus species are among the most planted hardwoods in the world because of their rapid growth, adaptability and valuable wood properties. The development and integration of genomic resources into breeding practice will be increasingly important in the decades to come. Bacterial artificial chromosome (BAC libraries are key genomic tools that enable positional cloning of important traits, synteny evaluation, and the development of genome framework physical maps for genetic linkage and genome sequencing. Results We describe the construction and characterization of two deep-coverage BAC libraries EG_Ba and EG_Bb obtained from nuclear DNA fragments of E. grandis (clone BRASUZ1 digested with HindIII and BstYI, respectively. Genome coverages of 17 and 15 haploid genome equivalents were estimated for EG_Ba and EG_Bb, respectively. Both libraries contained large inserts, with average sizes ranging from 135 Kb (Eg_Bb to 157 Kb (Eg_Ba, very low extra-nuclear genome contamination providing a probability of finding a single copy gene ≥ 99.99%. Libraries were screened for the presence of several genes of interest via hybridizations to high-density BAC filters followed by PCR validation. Five selected BAC clones were sequenced and assembled using the Roche GS FLX technology providing the whole sequence of the E. grandis chloroplast genome, and complete genomic sequences of important lignin biosynthesis genes. Conclusions The two E. grandis BAC libraries described in this study represent an important milestone for the advancement of Eucalyptus genomics and forest tree research. These BAC resources have a highly redundant genome coverage (> 15×, contain large average inserts and have a very low percentage of clones with organellar DNA or empty vectors. These publicly available BAC libraries are thus suitable for a broad range of applications in genetic and genomic research in Eucalyptus and possibly in related species of Myrtaceae

  12. Tumor transcriptome sequencing reveals allelic expression imbalances associated with copy number alterations.

    Directory of Open Access Journals (Sweden)

    Brian B Tuch

    Full Text Available Due to growing throughput and shrinking cost, massively parallel sequencing is rapidly becoming an attractive alternative to microarrays for the genome-wide study of gene expression and copy number alterations in primary tumors. The sequencing of transcripts (RNA-Seq should offer several advantages over microarray-based methods, including the ability to detect somatic mutations and accurately measure allele-specific expression. To investigate these advantages we have applied a novel, strand-specific RNA-Seq method to tumors and matched normal tissue from three patients with oral squamous cell carcinomas. Additionally, to better understand the genomic determinants of the gene expression changes observed, we have sequenced the tumor and normal genomes of one of these patients. We demonstrate here that our RNA-Seq method accurately measures allelic imbalance and that measurement on the genome-wide scale yields novel insights into cancer etiology. As expected, the set of genes differentially expressed in the tumors is enriched for cell adhesion and differentiation functions, but, unexpectedly, the set of allelically imbalanced genes is also enriched for these same cancer-related functions. By comparing the transcriptomic perturbations observed in one patient to his underlying normal and tumor genomes, we find that allelic imbalance in the tumor is associated with copy number mutations and that copy number mutations are, in turn, strongly associated with changes in transcript abundance. These results support a model in which allele-specific deletions and duplications drive allele-specific changes in gene expression in the developing tumor.

  13. Hyb-Seq: Combining Target Enrichment and Genome Skimming for Plant Phylogenomics

    Directory of Open Access Journals (Sweden)

    Kevin Weitemier

    2014-08-01

    Full Text Available Premise of the study: Hyb-Seq, the combination of target enrichment and genome skimming, allows simultaneous data collection for low-copy nuclear genes and high-copy genomic targets for plant systematics and evolution studies. Methods and Results: Genome and transcriptome assemblies for milkweed (Asclepias syriaca were used to design enrichment probes for 3385 exons from 768 genes (>1.6 Mbp followed by Illumina sequencing of enriched libraries. Hyb-Seq of 12 individuals (10 Asclepias species and two related genera resulted in at least partial assembly of 92.6% of exons and 99.7% of genes and an average assembly length >2 Mbp. Importantly, complete plastomes and nuclear ribosomal DNA cistrons were assembled using off-target reads. Phylogenomic analyses demonstrated signal conflict between genomes. Conclusions: The Hyb-Seq approach enables targeted sequencing of thousands of low-copy nuclear exons and flanking regions, as well as genome skimming of high-copy repeats and organellar genomes, to efficiently produce genome-scale data sets for phylogenomics.

  14. Identification of genomic copy number variations associated with specific clinical features of head and neck cancer.

    Science.gov (United States)

    Zagradišnik, Boris; Krgović, Danijela; Herodež, Špela Stangler; Zagorac, Andreja; Ćižmarević, Bogdan; Vokač, Nadja Kokalj

    2018-01-01

    Copy number variations (CNSs) of large genomic regions are an important mechanism implicated in the development of head and neck cancer, however, for most changes their exact role is not well understood. The aim of this study was to find possible associations between gains/losses of genomic regions and clinically distinct subgroups of head and neck cancer patients. Array comparative genomic hybridization (aCGH) analysis was performed on DNA samples in 64 patients with cancer in oral cavity, oropharynx or hypopharynx. Overlapping genomic regions created from gains and losses were used for statistical analysis. Following regions were overrepresented: in tumors with stage I or II a gain of 2.98 Mb on 6p21.2-p11 and a gain of 7.4 Mb on 8q11.1-q11.23; in tumors with grade I histology a gain of 1.1 Mb on 8q24.13, a loss of a large part of p arm of chromosome 3, a loss of a 1.24 Mb on 6q14.3, and a loss of terminal 32 Mb region of 8p23.3; in cases with affected lymph nodes a gain of 0.75 Mb on 3q24, and a gain of 0.9 Mb on 3q26.32-q26.33; in cases with unaffected lymph nodes a gain of 1.1 Mb on 8q23.3, in patients not treated with surgery a gain of 12.2 Mb on 7q21.3-q22.3 and a gain of 0.33 Mb on 20q11.22. Our study identified several genomic regions of interest which appear to be associated with various clinically distinct subgroups of head and neck cancer. They represent a potentially important source of biomarkers useful for the clinical management of head and neck cancer. In particular, the PIK3CA and AGTR1 genes could be singled out to predict the lymph node involvement.

  15. Differences between the genomes of lymphoblastoid cell lines and blood-derived samples

    Directory of Open Access Journals (Sweden)

    Joesch-Cohen LM

    2017-02-01

    Full Text Available Lena M Joesch-Cohen, Gustavo Glusman Institute for Systems Biology, Seattle, WA, USA Abstract: Lymphoblastoid cell lines (LCLs represent a convenient research tool for expanding the amount of biologic material available from an individual. LCLs are commonly used as reference materials, most notably from the Genome in a Bottle Consortium. However, the question remains how faithfully LCL-derived genome assemblies represent the germline genome of the donor individual as compared to the genome assemblies derived from peripheral blood mononuclear cells. We present an in-depth comparison of a large collection of LCL- and peripheral blood mononuclear cell-derived genomes in terms of distributions of coverage and copy number alterations. We found significant differences in the depth of coverage and copy number calls, which may be driven by differential replication timing. Importantly, these copy number changes preferentially affect regions closer to genes and with higher GC content. This suggests that genomic studies based on LCLs may display locus-specific biases, and that conclusions based on analysis of depth of coverage and copy number variation may require further scrutiny. Keywords: genomics, whole-genome sequencing, viral transformation, copy number changes, bioinformatics

  16. Global copy number profiling of cancer genomes | Office of Cancer Genomics

    Science.gov (United States)

    In this article, we introduce a robust and efficient strategy for deriving global and allele-specific copy number alternations (CNA) from cancer whole exome sequencing data based on Log R ratios and B-allele frequencies. Applying the approach to the analysis of over 200 skin cancer samples, we demonstrate its utility for discovering distinct CNA events and for deriving ancillary information such as tumor purity. Availability and implementation: https://github.com/xfwang/CLOSE CONTACT: xuefeng.wang@stonybrook.edu or michael.krauthammer@yale.edu. (Publication Abstract)

  17. Genome-wide identification of significant aberrations in cancer genome

    Directory of Open Access Journals (Sweden)

    Yuan Xiguo

    2012-07-01

    Full Text Available Abstract Background Somatic Copy Number Alterations (CNAs in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC, a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1 exploiting the intrinsic correlation among consecutive probes to assign a score to each CNA unit instead of single probes; (2 performing permutations on CNA units that preserve correlations inherent in the copy number data; and (3 iteratively detecting Significant Copy Number Aberrations (SCAs and estimating an unbiased null distribution by applying an SCA-exclusive permutation scheme. Results We test and compare the performance of SAIC against four peer methods (GISTIC, STAC, KC-SMART, CMDS on a large number of simulation datasets. Experimental results show that SAIC outperforms peer methods in terms of larger area under the Receiver Operating Characteristics curve and increased detection power. We then apply SAIC to analyze structural genomic aberrations acquired in four real cancer genome-wide copy number data sets (ovarian cancer, metastatic prostate cancer, lung adenocarcinoma, glioblastoma. When compared with previously reported results, SAIC successfully identifies most SCAs known to be of biological significance and associated with oncogenes (e.g., KRAS, CCNE1, and MYC or tumor suppressor genes (e.g., CDKN2A/B. Furthermore, SAIC identifies a number of novel SCAs in these copy number data that encompass tumor related genes and may warrant further studies. Conclusions Supported by a well-grounded theoretical framework, SAIC has been developed and used to identify SCAs in various cancer copy number data sets, providing useful information to study the landscape of cancer genomes

  18. Parallel or convergent evolution in human population genomic data revealed by genotype networks

    OpenAIRE

    Vahdati, Ali R; Wagner, Andreas

    2016-01-01

    Background Genotype networks are representations of genetic variation data that are complementary to phylogenetic trees. A genotype network is a graph whose nodes are genotypes (DNA sequences) with the same broadly defined phenotype. Two nodes are connected if they differ in some minimal way, e.g., in a single nucleotide. Results We analyze human genome variation data from the 1,000 genomes project, and construct haploid genotype (haplotype) networks for 12,235 protein coding genes. The struc...

  19. Integrated genomic classification of melanocytic tumors of the central nervous system using mutation analysis, copy number alterations and DNA methylation profiling.

    Science.gov (United States)

    Griewank, Klaus; Koelsche, Christian; van de Nes, Johannes A P; Schrimpf, Daniel; Gessi, Marco; Möller, Inga; Sucker, Antje; Scolyer, Richard A; Buckland, Michael E; Murali, Rajmohan; Pietsch, Torsten; von Deimling, Andreas; Schadendorf, Dirk

    2018-06-11

    In the central nervous system, distinguishing primary leptomeningeal melanocytic tumors from melanoma metastases and predicting their biological behavior solely using histopathologic criteria can be challenging. We aimed to assess the diagnostic and prognostic value of integrated molecular analysis. Targeted next-generation-sequencing, array-based genome-wide methylation analysis and BAP1 immunohistochemistry was performed on the largest cohort of central nervous system melanocytic tumors analyzed to date, incl. 47 primary tumors of the central nervous system, 16 uveal melanomas. 13 cutaneous melanoma metastasis and 2 blue nevus-like melanomas. Gene mutation, DNA-methylation and copy-number profiles were correlated with clinicopathological features. Combining mutation, copy-number and DNA-methylation profiles clearly distinguished cutaneous melanoma metastases from other melanocytic tumors. Primary leptomeningeal melanocytic tumors, uveal melanomas and blue nevus-like melanoma showed common DNA-methylation, copy-number alteration and gene mutation signatures. Notably, tumors demonstrating chromosome 3 monosomy and BAP1 alterations formed a homogeneous subset within this group. Integrated molecular profiling aids in distinguishing primary from metastatic melanocytic tumors of the central nervous system. Primary leptomeningeal melanocytic tumors, uveal melanoma and blue nevus-like melanoma share molecular similarity with chromosome 3 and BAP1 alterations markers of poor prognosis. Copyright ©2018, American Association for Cancer Research.

  20. Genome-wide loss of heterozygosity and copy number alteration in esophageal squamous cell carcinoma using the Affymetrix GeneChip Mapping 10 K array

    Directory of Open Access Journals (Sweden)

    Goldstein Alisa M

    2006-11-01

    Full Text Available Abstract Background Esophageal squamous cell carcinoma (ESCC is a common malignancy worldwide. Comprehensive genomic characterization of ESCC will further our understanding of the carcinogenesis process in this disease. Results Genome-wide detection of chromosomal changes was performed using the Affymetrix GeneChip 10 K single nucleotide polymorphism (SNP array, including loss of heterozygosity (LOH and copy number alterations (CNA, for 26 pairs of matched germ-line and micro-dissected tumor DNA samples. LOH regions were identified by two methods – using Affymetrix's genotype call software and using Affymetrix's copy number alteration tool (CNAT software – and both approaches yielded similar results. Non-random LOH regions were found on 10 chromosomal arms (in decreasing order of frequency: 17p, 9p, 9q, 13q, 17q, 4q, 4p, 3p, 15q, and 5q, including 20 novel LOH regions (10 kb to 4.26 Mb. Fifteen CNA-loss regions (200 kb to 4.3 Mb and 36 CNA-gain regions (200 kb to 9.3 Mb were also identified. Conclusion These studies demonstrate that the Affymetrix 10 K SNP chip is a valid platform to integrate analyses of LOH and CNA. The comprehensive knowledge gained from this analysis will enable improved strategies to prevent, diagnose, and treat ESCC.

  1. Bayesian Nonparametric Hidden Markov Models with application to the analysis of copy-number-variation in mammalian genomes.

    Science.gov (United States)

    Yau, C; Papaspiliopoulos, O; Roberts, G O; Holmes, C

    2011-01-01

    We consider the development of Bayesian Nonparametric methods for product partition models such as Hidden Markov Models and change point models. Our approach uses a Mixture of Dirichlet Process (MDP) model for the unknown sampling distribution (likelihood) for the observations arising in each state and a computationally efficient data augmentation scheme to aid inference. The method uses novel MCMC methodology which combines recent retrospective sampling methods with the use of slice sampler variables. The methodology is computationally efficient, both in terms of MCMC mixing properties, and robustness to the length of the time series being investigated. Moreover, the method is easy to implement requiring little or no user-interaction. We apply our methodology to the analysis of genomic copy number variation.

  2. Quantification of ploidy in proteobacteria revealed the existence of monoploid, (mero-oligoploid and polyploid species.

    Directory of Open Access Journals (Sweden)

    Vito Pecoraro

    Full Text Available Bacteria are generally assumed to be monoploid (haploid. This assumption is mainly based on generalization of the results obtained with the most intensely studied model bacterium, Escherichia coli (a gamma-proteobacterium, which is monoploid during very slow growth. However, several species of proteobacteria are oligo- or polyploid, respectively. To get a better overview of the distribution of ploidy levels, genome copy numbers were quantified in four species of three different groups of proteobacteria. A recently developed Real Time PCR approach, which had been used to determine the ploidy levels of halophilic archaea, was optimized for the quantification of genome copy numbers of bacteria. Slow-growing (doubling time 103 minutes and fast-growing (doubling time 25 minutes E. coli cultures were used as a positive control. The copy numbers of the origin and terminus region of the chromosome were determined and the results were in excellent agreement with published data. The approach was also used to determine the ploidy levels of Caulobacter crescentus (an alpha-proteobacterium and Wolinella succinogenes (an epsilon-proteobacterium, both of which are monoploid. In contrast, Pseudomonas putida (a gamma-proteobacterium contains 20 genome copies and is thus polyploid. A survey of the proteobacteria with experimentally-determined genome copy numbers revealed that only three to four of 11 species are monoploid and thus monoploidy is not typical for proteobacteria. The ploidy level is not conserved within the groups of proteobacteria, and there are no obvious correlations between the ploidy levels with other parameters like genome size, optimal growth temperature or mode of life.

  3. Human Genome Sequencing in Health and Disease

    Science.gov (United States)

    Gonzaga-Jauregui, Claudia; Lupski, James R.; Gibbs, Richard A.

    2013-01-01

    Following the “finished,” euchromatic, haploid human reference genome sequence, the rapid development of novel, faster, and cheaper sequencing technologies is making possible the era of personalized human genomics. Personal diploid human genome sequences have been generated, and each has contributed to our better understanding of variation in the human genome. We have consequently begun to appreciate the vastness of individual genetic variation from single nucleotide to structural variants. Translation of genome-scale variation into medically useful information is, however, in its infancy. This review summarizes the initial steps undertaken in clinical implementation of personal genome information, and describes the application of whole-genome and exome sequencing to identify the cause of genetic diseases and to suggest adjuvant therapies. Better analysis tools and a deeper understanding of the biology of our genome are necessary in order to decipher, interpret, and optimize clinical utility of what the variation in the human genome can teach us. Personal genome sequencing may eventually become an instrument of common medical practice, providing information that assists in the formulation of a differential diagnosis. We outline herein some of the remaining challenges. PMID:22248320

  4. Hyb-Seq: Combining target enrichment and genome skimming for plant phylogenomics1

    Science.gov (United States)

    Weitemier, Kevin; Straub, Shannon C. K.; Cronn, Richard C.; Fishbein, Mark; Schmickl, Roswitha; McDonnell, Angela; Liston, Aaron

    2014-01-01

    • Premise of the study: Hyb-Seq, the combination of target enrichment and genome skimming, allows simultaneous data collection for low-copy nuclear genes and high-copy genomic targets for plant systematics and evolution studies. • Methods and Results: Genome and transcriptome assemblies for milkweed (Asclepias syriaca) were used to design enrichment probes for 3385 exons from 768 genes (>1.6 Mbp) followed by Illumina sequencing of enriched libraries. Hyb-Seq of 12 individuals (10 Asclepias species and two related genera) resulted in at least partial assembly of 92.6% of exons and 99.7% of genes and an average assembly length >2 Mbp. Importantly, complete plastomes and nuclear ribosomal DNA cistrons were assembled using off-target reads. Phylogenomic analyses demonstrated signal conflict between genomes. • Conclusions: The Hyb-Seq approach enables targeted sequencing of thousands of low-copy nuclear exons and flanking regions, as well as genome skimming of high-copy repeats and organellar genomes, to efficiently produce genome-scale data sets for phylogenomics. PMID:25225629

  5. Detection of copy number variations and their effects in Chinese bulls

    KAUST Repository

    Zhang, Liangzhi

    2014-06-17

    Background: Copy number variations (CNVs) are a main source of genomic structural variations underlying animal evolution and production traits. Here, with one pure-blooded Angus bull as reference, we describe a genome-wide analysis of CNVs based on comparative genomic hybridization arrays in 29 Chinese domesticated bulls and examined their effects on gene expression and cattle growth traits.Results: We identified 486 copy number variable regions (CNVRs), covering 2.45% of the bovine genome, in 24 taurine (Bos taurus), together with 161 ones in 2 yaks (Bos grunniens) and 163 ones in 3 buffaloes (Bubalus bubalis). Totally, we discovered 605 integrated CNVRs, with more " loss" events than both " gain" and " both" ones, and clearly clustered them into three cattle groups. Interestingly, we confirmed their uneven distributions across chromosomes, and the differences of mitochondrion DNA copy number (gain: taurine, loss: yak & buffalo). Furthermore, we confirmed approximately 41.8% (253/605) and 70.6% (427/605) CNVRs span cattle genes and quantitative trait loci (QTLs), respectively. Finally, we confirmed 6 CNVRs in 9 chosen ones by using quantitative PCR, and further demonstrated that CNVR22 had significantly negative effects on expression of PLA2G2D gene, and both CNVR22 and CNVR310 were associated with body measurements in Chinese cattle, suggesting their key effects on gene expression and cattle traits.Conclusions: The results advanced our understanding of CNV as an important genomic structural variation in taurine, yak and buffalo. This study provides a highly valuable resource for Chinese cattle\\'s evolution and breeding researches. 2014 Zhang et al.; licensee BioMed Central Ltd.

  6. A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing

    Directory of Open Access Journals (Sweden)

    Guangtu Gao

    2018-04-01

    Full Text Available Single-nucleotide polymorphisms (SNPs are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout (Oncorhynchus mykiss, SNP discovery has been previously done through sequencing of restriction-site associated DNA (RAD libraries, reduced representation libraries (RRL and RNA sequencing. Recently we have performed high coverage whole genome resequencing with 61 unrelated samples, representing a wide range of rainbow trout and steelhead populations, with 49 new samples added to 12 aquaculture samples from AquaGen (Norway that we previously used for SNP discovery. Of the 49 new samples, 11 were double-haploid lines from Washington State University (WSU and 38 represented wild and hatchery populations from a wide range of geographic distribution and with divergent migratory phenotypes. We then mapped the sequences to the new rainbow trout reference genome assembly (GCA_002163495.1 which is based on the Swanson YY doubled haploid line. Variant calling was conducted with FreeBayes and SAMtools mpileup, followed by filtering of SNPs based on quality score, sequence complexity, read depth on the locus, and number of genotyped samples. Results from the two variant calling programs were compared and genotypes of the double haploid samples were used for detecting and filtering putative paralogous sequence variants (PSVs and multi-sequence variants (MSVs. Overall, 30,302,087 SNPs were identified on the rainbow trout genome 29 chromosomes and 1,139,018 on unplaced scaffolds, with 4,042,723 SNPs having high minor allele frequency (MAF > 0.25. The average SNP density on the chromosomes was one SNP per 64 bp, or 15.6 SNPs per 1 kb. Results from the phylogenetic analysis that we conducted indicate that the SNP markers contain enough population-specific polymorphisms for recovering population relationships despite the small sample size used. Intra-Population polymorphism assessment revealed high level of polymorphism and

  7. Characters that differ between diploid and haploid honey bee (Apis mellifera) drones.

    Science.gov (United States)

    Herrmann, Matthias; Trenzcek, Tina; Fahrenhorst, Hartmut; Engels, Wolf

    2005-12-30

    Diploid males have long been considered a curiosity contradictory to the haplo-diploid mode of sex determination in the Hymenoptera. In Apis mellifera, 'false' diploid male larvae are eliminated by worker cannibalism immediately after hatching. A 'cannibalism substance' produced by diploid drone larvae to induce worker-assisted suicide has been hypothesized, but it has never been detected. Diploid drones are only removed some hours after hatching. Older larvae are evidently not regarded as 'false males' and instead are regularly nursed by the brood-attending worker bees. As the pheromonal cues presumably are located on the surface of newly hatched bee larvae, we extracted the cuticular secretions and analyzed their chemical composition by gas chromatograph-mass spectrometry (GC-MS) analyses. Larvae were sexed and then reared in vitro for up to three days. The GC-MS pattern that was obtained, with alkanes as the major compounds, was compared between diploid and haploid drone larvae. We also examined some physical parameters of adult drones. There was no difference between diploid and haploid males in their weight at the day of emergence. The diploid adult drones had fewer wing hooks and smaller testes. The sperm DNA content was 0.30 and 0.15 pg per nucleus, giving an exact 2:1 ratio for the gametocytes of diploid and haploid drones, respectively. Vitellogenin was found in the hemolymph of both types of imaginal drones at 5 to 6 days, with a significantly lower titer in the diploids.

  8. Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing.

    Science.gov (United States)

    Straub, Shannon C K; Fishbein, Mark; Livshultz, Tatyana; Foster, Zachary; Parks, Matthew; Weitemier, Kevin; Cronn, Richard C; Liston, Aaron

    2011-05-04

    Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and compliment these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp) and 5S rDNA (120 bp) sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp), with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae) unigenes (median coverage of 0.29×) and 66% of single copy orthologs (COSII) in asterids (median coverage of 0.14×). From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites) and phylogenetics (low-copy nuclear genes) studies were developed. The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species and its relatives. This study represents a first

  9. Performance of Molecular Inversion Probes (MIP) in Allele CopyNumber Determination

    Energy Technology Data Exchange (ETDEWEB)

    Wang, Yuker; Moorhead, Martin; Karlin-Neumann, George; Wang,Nicolas J.; Ireland, James; Lin, Steven; Chen, Chunnuan; Heiser, LauraM.; Chin, Koei; Esserman, Laura; Gray, Joe W.; Spellman, Paul T.; Faham,Malek

    2007-05-14

    We have developed a new protocol for using MolecularInversion Probes (MIP) to accurately and specifically measure allele copynumber (ACN). The new protocol provides for significant improvementsincluding the reduction of input DNA (from 2?g) by more than 25 fold (to75ng total genomic DNA), higher overall precision resulting in one orderof magnitude lower false positive rate, and greater dynamic range withaccurate absolute copy number up to 60 copies.

  10. Accurate, high-throughput typing of copy number variation using paralogue ratios from dispersed repeats.

    Science.gov (United States)

    Armour, John A L; Palla, Raquel; Zeeuwen, Patrick L J M; den Heijer, Martin; Schalkwijk, Joost; Hollox, Edward J

    2007-01-01

    Recent work has demonstrated an unexpected prevalence of copy number variation in the human genome, and has highlighted the part this variation may play in predisposition to common phenotypes. Some important genes vary in number over a high range (e.g. DEFB4, which commonly varies between two and seven copies), and have posed formidable technical challenges for accurate copy number typing, so that there are no simple, cheap, high-throughput approaches suitable for large-scale screening. We have developed a simple comparative PCR method based on dispersed repeat sequences, using a single pair of precisely designed primers to amplify products simultaneously from both test and reference loci, which are subsequently distinguished and quantified via internal sequence differences. We have validated the method for the measurement of copy number at DEFB4 by comparison of results from >800 DNA samples with copy number measurements by MAPH/REDVR, MLPA and array-CGH. The new Paralogue Ratio Test (PRT) method can require as little as 10 ng genomic DNA, appears to be comparable in accuracy to the other methods, and for the first time provides a rapid, simple and inexpensive method for copy number analysis, suitable for application to typing thousands of samples in large case-control association studies.

  11. Dicer is required for haploid male germ cell differentiation in mice.

    Directory of Open Access Journals (Sweden)

    Hanna M Korhonen

    Full Text Available BACKGROUND: The RNase III endonuclease Dicer is an important regulator of gene expression that processes microRNAs (miRNAs and small interfering RNAs (siRNAs. The best-characterized function of miRNAs is gene repression at the post-transcriptional level through the pairing with mRNAs of protein-encoding genes. Small RNAs can also act at the transcriptional level by controlling the epigenetic status of chromatin. Dicer and other mediators of small RNA pathways are present in mouse male germ cells, and several miRNAs and endogenous siRNAs are expressed in the testis, suggesting that Dicer-dependent small RNAs are involved in the control of the precisely timed and highly organised process of spermatogenesis. PRINCIPAL FINDINGS: Being interested in the Dicer-mediated functions during spermatogenesis, we have analysed here a male germ cell-specific Dicer1 knockout mouse model, in which the deletion of Dicer1 takes place during early postnatal development in spermatogonia. We found that Dicer1 knockout testes were reduced in size and spermatogenesis within the seminiferous tubules was disrupted. Dicer1 knockout epididymides contained very low number of mature sperm with pronounced morphological abnormalities. Spermatogonial differentiation appeared unaffected. However, the number of haploid cells was decreased in knockout testes, and an increased number of apoptotic spermatocytes was observed. The most prominent defects were found during late haploid differentiation, and Dicer was demonstrated to be critical for the normal organization of chromatin and nuclear shaping of elongating spermatids. CONCLUSIONS/SIGNIFICANCE: We demonstrate that Dicer and Dicer-dependent small RNAs are imperative regulators of haploid spermatid differentiation and essential for male fertility.

  12. Meiosis and haploid gametes in the pathogen Trypanosoma brucei.

    Science.gov (United States)

    Peacock, Lori; Bailey, Mick; Carrington, Mark; Gibson, Wendy

    2014-01-20

    In eukaryote pathogens, sex is an important driving force in spreading genes for drug resistance, pathogenicity, and virulence. For the parasitic trypanosomes that cause African sleeping sickness, mating occurs during transmission by the tsetse vector and involves meiosis, but haploid gametes have not yet been identified. Here, we show that meiosis is a normal part of development in the insect salivary glands for all subspecies of Trypanosoma brucei, including the human pathogens. By observing insect-derived trypanosomes during the window of peak expression of meiosis-specific genes, we identified promastigote-like (PL) cells that interacted with each other via their flagella and underwent fusion, as visualized by the mixing of cytoplasmic red and green fluorescent proteins. PL cells had a short, wide body, a very long anterior flagellum, and either one or two kinetoplasts, but only the anterior kinetoplast was associated with the flagellum. Measurement of nuclear DNA contents showed that PL cells were haploid relative to diploid metacyclics. Trypanosomes are among the earliest diverging eukaryotes, and our results support the hypothesis that meiosis and sexual reproduction are ubiquitous in eukaryotes and likely to have been early innovations. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

  13. A High-Throughput Computational Framework for Identifying Significant Copy Number Aberrations from Array Comparative Genomic Hybridisation Data

    Directory of Open Access Journals (Sweden)

    Ian Roberts

    2012-01-01

    Full Text Available Reliable identification of copy number aberrations (CNA from comparative genomic hybridization data would be improved by the availability of a generalised method for processing large datasets. To this end, we developed swatCGH, a data analysis framework and region detection heuristic for computational grids. swatCGH analyses sequentially displaced (sliding windows of neighbouring probes and applies adaptive thresholds of varying stringency to identify the 10% of each chromosome that contains the most frequently occurring CNAs. We used the method to analyse a published dataset, comparing data preprocessed using four different DNA segmentation algorithms, and two methods for prioritising the detected CNAs. The consolidated list of the most commonly detected aberrations confirmed the value of swatCGH as a simplified high-throughput method for identifying biologically significant CNA regions of interest.

  14. Copy number variation in obsessive-compulsive disorder and tourette syndrome: a cross-disorder study

    NARCIS (Netherlands)

    McGrath, Lauren M.; Yu, Dongmei; Marshall, Christian; Davis, Lea K.; Thiruvahindrapuram, Bhooma; Li, Bingbin; Cappi, Carolina; Gerber, Gloria; Wolf, Aaron; Schroeder, Frederick A.; Osiecki, Lisa; O'Dushlaine, Colm; Kirby, Andrew; Illmann, Cornelia; Haddad, Stephen; Gallagher, Patience; Fagerness, Jesen A.; Barr, Cathy L.; Bellodi, Laura; Benarroch, Fortu; Bienvenu, O. Joseph; Black, Donald W.; Bloch, Michael H.; Bruun, Ruth D.; Budman, Cathy L.; Camarena, Beatriz; Cath, Danielle C.; Cavallini, Maria C.; Chouinard, Sylvain; Coric, Vladimir; Cullen, Bernadette; Delorme, Richard; Denys, Damiaan; Derks, Eske M.; Dion, Yves; Rosário, Maria C.; Eapen, Valsama; Evans, Patrick; Falkai, Peter; Fernandez, Thomas V.; Garrido, Helena; Geller, Daniel; Grabe, Hans J.; Grados, Marco A.; Greenberg, Benjamin D.; Gross-Tsur, Varda; Grünblatt, Edna; Heiman, Gary A.; Hemmings, Sian M. J.; Herrera, Luis D.; Hounie, Ana G.; Jankovic, Joseph; Kennedy, James L.; King, Robert A.; Kurlan, Roger; Lanzagorta, Nuria; Leboyer, Marion; Leckman, James F.; Lennertz, Leonhard; Lochner, Christine; Lowe, Thomas L.; Lyon, Gholson J.; Macciardi, Fabio; Maier, Wolfgang; McCracken, James T.; McMahon, William; Murphy, Dennis L.; Naarden, Allan L.; Neale, Benjamin M.; Nurmi, Erika; Pakstis, Andrew J.; Pato, Michele T.; Pato, Carlos N.; Piacentini, John; Pittenger, Christopher; Pollak, Yehuda; Reus, Victor I.; Richter, Margaret A.; Riddle, Mark; Robertson, Mary M.; Rosenberg, David; Rouleau, Guy A.; Ruhrmann, Stephan; Sampaio, Aline S.; Samuels, Jack; Sandor, Paul; Sheppard, Brooke; Singer, Harvey S.; Smit, Jan H.; Stein, Dan J.; Tischfield, Jay A.; Vallada, Homero; Veenstra-Vanderweele, Jeremy; Walitza, Susanne; Wang, Ying; Wendland, Jens R.; Shugart, Yin Yao; Miguel, Euripedes C.; Nicolini, Humberto; Oostra, Ben A.; Moessner, Rainald; Wagner, Michael; Ruiz-Linares, Andres; Heutink, Peter; Nestadt, Gerald; Freimer, Nelson; Petryshen, Tracey; Posthuma, Danielle; Jenike, Michael A.; Cox, Nancy J.; Hanna, Gregory L.; Brentani, Helena; Scherer, Stephen W.; Arnold, Paul D.; Stewart, S. Evelyn; Mathews, Carol A.; Knowles, James A.; Cook, Edwin H.; Pauls, David L.; Wang, Kai; Scharf, Jeremiah M.

    2014-01-01

    Obsessive-compulsive disorder (OCD) and Tourette syndrome (TS) are heritable neurodevelopmental disorders with a partially shared genetic etiology. This study represents the first genome-wide investigation of large (>500 kb), rare ( <1%) copy number variants (CNVs) in OCD and the largest genome-wide

  15. Polyamine patterns in haploid and diploid tobacco tissues and in vitro cultures

    Directory of Open Access Journals (Sweden)

    Sílvia Bicudo Carone

    2010-04-01

    Full Text Available The aim of this work was to determine PAs levels in pith tissues and callus cultures from haploid and diploid tobacco plants, explanted from the apical and basal regions of the stem. These explants were cultured in an RM-64 medium supplied with IAA and kinetin, under light or in the dark, during successive subcultures. PAs levels followed a basipetal decrease in diploid and an increase in haploid, pith tissues. A similar pattern of total PAs (free + conjugated was observed for the callus of diploid and haploid plants maintained in the light, and for the haploid callus in the dark, whereas the diploid callus in the dark showed a constant increase in total PAs levels until the end of culture. The PA increase in the diploid callus in the dark was related to free Put levels increase. The ploidy status of the plants could express different PA gradients together with the plant pith and in vitro callus cultures.O objetivo deste trabalho foi determinar os níveis de PAs em tecidos de medula e cultura de calos de plantas haplóides e diplóides de tabaco, obtidas da região apical e basal do caule. Estes explantes foram cultivados em meio RM-64 suplementado com AIA e cinetina, na luz e no escuro, durante vários subcultivos. Nos tecidos medulares, os níveis de PAs apresentam um decréscimo basípeto em diplóides e um aumento em haplóides.Um padrão similar nos níveis de PAs totais (livres+ conjugadas foi observado em calos haplóides e diplóides mantidos na luz, e haplóides no escuro, enquanto os diplóides cultivados no escuro mostraram um aumento constante até o final do cultivo. O aumento no conteúdo de PAs nos calos diplóides no escuro, foi devido ao aumento do conteúdo de Put livre. Foi observado que a ploidia da planta pode expressar diferentes gradientes de PA ao longo do tecido medular e nas culturas de calos in vitro.

  16. Genome size of 14 species of fireflies (Insecta, Coleoptera, Lampyridae

    Directory of Open Access Journals (Sweden)

    Gui-Chun Liu

    2017-11-01

    Full Text Available Eukaryotic genome size data are important both as the basis for comparative research into genome evolution and as estimators of the cost and difficulty of genome sequencing programs for non-model organisms. In this study, the genome size of 14 species of fireflies (Lampyridae (two genera in Lampyrinae, three genera in Luciolinae, and one genus in subfamily incertae sedis were estimated by propidium iodide (PI-based flow cytometry. The haploid genome sizes of Lampyridae ranged from 0.42 to 1.31 pg, a 3.1-fold span. Genome sizes of the fireflies varied within the tested subfamilies and genera. Lamprigera and Pyrocoelia species had large and small genome sizes, respectively. No correlation was found between genome size and morphological traits such as body length, body width, eye width, and antennal length. Our data provide additional information on genome size estimation of the firefly family Lampyridae. Furthermore, this study will help clarify the cost and difficulty of genome sequencing programs for non-model organisms and will help promote studies on firefly genome evolution.

  17. The first genetic map of a synthesized allohexaploid Brassica with A, B and C genomes based on simple sequence repeat markers.

    Science.gov (United States)

    Yang, S; Chen, S; Geng, X X; Yan, G; Li, Z Y; Meng, J L; Cowling, W A; Zhou, W J

    2016-04-01

    We present the first genetic map of an allohexaploid Brassica species, based on segregating microsatellite markers in a doubled haploid mapping population generated from a hybrid between two hexaploid parents. This study reports the first genetic map of trigenomic Brassica. A doubled haploid mapping population consisting of 189 lines was obtained via microspore culture from a hybrid H16-1 derived from a cross between two allohexaploid Brassica lines (7H170-1 and Y54-2). Simple sequence repeat primer pairs specific to the A genome (107), B genome (44) and C genome (109) were used to construct a genetic linkage map of the population. Twenty-seven linkage groups were resolved from 274 polymorphic loci on the A genome (109), B genome (49) and C genome (116) covering a total genetic distance of 3178.8 cM with an average distance between markers of 11.60 cM. This is the first genetic framework map for the artificially synthesized Brassica allohexaploids. The linkage groups represent the expected complement of chromosomes in the A, B and C genomes from the original diploid and tetraploid parents. This framework linkage map will be valuable for QTL analysis and future genetic improvement of a new allohexaploid Brassica species, and in improving our understanding of the genetic control of meiosis in new polyploids.

  18. Algorithms and Complexity Results for Genome Mapping Problems.

    Science.gov (United States)

    Rajaraman, Ashok; Zanetti, Joao Paulo Pereira; Manuch, Jan; Chauve, Cedric

    2017-01-01

    Genome mapping algorithms aim at computing an ordering of a set of genomic markers based on local ordering information such as adjacencies and intervals of markers. In most genome mapping models, markers are assumed to occur uniquely in the resulting map. We introduce algorithmic questions that consider repeats, i.e., markers that can have several occurrences in the resulting map. We show that, provided with an upper bound on the copy number of repeated markers and with intervals that span full repeat copies, called repeat spanning intervals, the problem of deciding if a set of adjacencies and repeat spanning intervals admits a genome representation is tractable if the target genome can contain linear and/or circular chromosomal fragments. We also show that extracting a maximum cardinality or weight subset of repeat spanning intervals given a set of adjacencies that admits a genome realization is NP-hard but fixed-parameter tractable in the maximum copy number and the number of adjacent repeats, and tractable if intervals contain a single repeated marker.

  19. Application of Doubled Haploid (DH) Technique in Mutation and Conventional Wheat Breeding in Kenya

    International Nuclear Information System (INIS)

    Njau, P.N.

    2002-01-01

    Wheat is the second most important staple cereal in Kenya after maize.over the last six years wheat improvement for various stresses and agronomic characteristics have been undertaken through various biotechnological approaches which have been used as complements to the traditional breeding methods. The prime objective in any breeding program is the prevention of the debilitating effects of breeding. In self-pollinated crops such as wheat selection is more efficient homozygous lines than in segregating population. During repeated selfing, to develop homozygousity the vigour of the F1 of M1 plats is lost. Application of biotechnology in crop movement has been suggested as the useful tool in a faster variety development. The double haploid (DH) technique does not only shorten the time of developing homozygous lines but also maintains the heterosis of the F 1 , increase the selection of the efficiency of selection in mutants and increase the effectiveness of selection. in this study DHs were developed from F1 and M4 generation developed from drought tolerance.This was accomplished through the following step: (i)F 1 crosses were produced by crossing three drought tolerant varieties namely Kenya Mbweha, Duma and Ngamia with two highly yielding commercial varieties namely Kenya Chiriku and Kwale in 1998 while mutants were developed through gamma ray irradiation in 1995. (ii) The haploids were produced through chromosome elimination by crossing the F 1 s and the M 4 with maize pollen and (iii) the Double Haploid (DH) were produced by treating the haploid with colchicine. Twenty DH lines were produced from F 1 haploid and 5 from M 4 ones. The DH technique tend to increase uniformity, stability and distinctiveness of the mutants and the segregating populations. Most of the DHs showed wide variation indicating high potential of selection for various agronomic characteristics. Heterosis was realized on a number of characteristics in the DH lines. Through this technique the

  20. Draft genome sequence of the rubber tree Hevea brasiliensis

    Directory of Open Access Journals (Sweden)

    Rahman Ahmad Yamin Abdul

    2013-02-01

    Full Text Available Abstract Background Hevea brasiliensis, a member of the Euphorbiaceae family, is the major commercial source of natural rubber (NR. NR is a latex polymer with high elasticity, flexibility, and resilience that has played a critical role in the world economy since 1876. Results Here, we report the draft genome sequence of H. brasiliensis. The assembly spans ~1.1 Gb of the estimated 2.15 Gb haploid genome. Overall, ~78% of the genome was identified as repetitive DNA. Gene prediction shows 68,955 gene models, of which 12.7% are unique to Hevea. Most of the key genes associated with rubber biosynthesis, rubberwood formation, disease resistance, and allergenicity have been identified. Conclusions The knowledge gained from this genome sequence will aid in the future development of high-yielding clones to keep up with the ever increasing need for natural rubber.

  1. Hyb-Seq: combining target enrichment and genome skimming for plant phylogenomics

    Science.gov (United States)

    Kevin Weitemier; Shannon C.K. Straub; Richard C. Cronn; Mark Fishbein; Roswitha Schmickl; Angela McDonnell; Aaron. Liston

    2014-01-01

    • Premise of the study: Hyb-Seq, the combination of target enrichment and genome skimming, allows simultaneous data collection for low-copy nuclear genes and high-copy genomic targets for plant systematics and evolution studies. • Methods and Results: Genome and transcriptome assemblies for milkweed ( Asclepias syriaca ) were used to design enrichment probes for 3385...

  2. Novel applications of array comparative genomic hybridization in molecular diagnostics.

    Science.gov (United States)

    Cheung, Sau W; Bi, Weimin

    2018-05-31

    In 2004, the implementation of array comparative genomic hybridization (array comparative genome hybridization [CGH]) into clinical practice marked a new milestone for genetic diagnosis. Array CGH and single-nucleotide polymorphism (SNP) arrays enable genome-wide detection of copy number changes in a high resolution, and therefore microarray has been recognized as the first-tier test for patients with intellectual disability or multiple congenital anomalies, and has also been applied prenatally for detection of clinically relevant copy number variations in the fetus. Area covered: In this review, the authors summarize the evolution of array CGH technology from their diagnostic laboratory, highlighting exonic SNP arrays developed in the past decade which detect small intragenic copy number changes as well as large DNA segments for the region of heterozygosity. The applications of array CGH to human diseases with different modes of inheritance with the emphasis on autosomal recessive disorders are discussed. Expert commentary: An exonic array is a powerful and most efficient clinical tool in detecting genome wide small copy number variants in both dominant and recessive disorders. However, whole-genome sequencing may become the single integrated platform for detection of copy number changes, single-nucleotide changes as well as balanced chromosomal rearrangements in the near future.

  3. Clinical findings and genetic screening for copy number variation ...

    African Journals Online (AJOL)

    to the Unified Parkinson's Disease Rating Scale (UPDRS), and patients were classified according to motor features. Genomic DNA was extracted and multiplex ligation-dependent probe amplification was used for detection of copy number variation (CNV) mutations in the known PD-causing genes. Results. Sixteen patients ...

  4. Meiosis and Haploid Gametes in the Pathogen Trypanosoma brucei

    OpenAIRE

    Peacock, Lori; Bailey, Mick; Carrington, Mark; Gibson, Wendy

    2014-01-01

    Summary In eukaryote pathogens, sex is an important driving force in spreading genes for drug resistance, pathogenicity, and virulence [1]. For the parasitic trypanosomes that cause African sleeping sickness, mating occurs during transmission by the tsetse vector [2, 3] and involves meiosis [4], but haploid gametes have not yet been identified. Here, we show that meiosis is a normal part of development in the insect salivary glands for all subspecies of Trypanosoma brucei, including the human...

  5. Building a model: developing genomic resources for common milkweed (Asclepias syriaca with low coverage genome sequencing

    Directory of Open Access Journals (Sweden)

    Weitemier Kevin

    2011-05-01

    Full Text Available Abstract Background Milkweeds (Asclepias L. have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and compliment these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L. could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. Results A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp and 5S rDNA (120 bp sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp, with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae unigenes (median coverage of 0.29× and 66% of single copy orthologs (COSII in asterids (median coverage of 0.14×. From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites and phylogenetics (low-copy nuclear genes studies were developed. Conclusions The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species

  6. Genomics technologies to study structural variations in the grapevine genome

    Directory of Open Access Journals (Sweden)

    Cardone Maria Francesca

    2016-01-01

    Full Text Available Grapevine is one of the most important crop plants in the world. Recently there was great expansion of genomics resources about grapevine genome, thus providing increasing efforts for molecular breeding. Current cultivars display a great level of inter-specific differentiation that needs to be investigated to reach a comprehensive understanding of the genetic basis of phenotypic differences, and to find responsible genes selected by cross breeding programs. While there have been significant advances in resolving the pattern and nature of single nucleotide polymorphisms (SNPs on plant genomes, few data are available on copy number variation (CNV. Furthermore association between structural variations and phenotypes has been described in only a few cases. We combined high throughput biotechnologies and bioinformatics tools, to reveal the first inter-varietal atlas of structural variation (SV for the grapevine genome. We sequenced and compared four table grape cultivars with the Pinot noir inbred line PN40024 genome as the reference. We detected roughly 8% of the grapevine genome affected by genomic variations. Taken into account phenotypic differences existing among the studied varieties we performed comparison of SVs among them and the reference and next we performed an in-depth analysis of gene content of polymorphic regions. This allowed us to identify genes showing differences in copy number as putative functional candidates for important traits in grapevine cultivation.

  7. Genome-wide association study of multiplex schizophrenia pedigrees

    DEFF Research Database (Denmark)

    Levinson, Douglas F; Shi, Jianxin; Wang, Kai

    2012-01-01

    The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs).......The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs)....

  8. ReadDepth: a parallel R package for detecting copy number alterations from short sequencing reads.

    Directory of Open Access Journals (Sweden)

    Christopher A Miller

    2011-01-01

    Full Text Available Copy number alterations are important contributors to many genetic diseases, including cancer. We present the readDepth package for R, which can detect these aberrations by measuring the depth of coverage obtained by massively parallel sequencing of the genome. In addition to achieving higher accuracy than existing packages, our tool runs much faster by utilizing multi-core architectures to parallelize the processing of these large data sets. In contrast to other published methods, readDepth does not require the sequencing of a reference sample, and uses a robust statistical model that accounts for overdispersed data. It includes a method for effectively increasing the resolution obtained from low-coverage experiments by utilizing breakpoint information from paired end sequencing to do positional refinement. We also demonstrate a method for inferring copy number using reads generated by whole-genome bisulfite sequencing, thus enabling integrative study of epigenomic and copy number alterations. Finally, we apply this tool to two genomes, showing that it performs well on genomes sequenced to both low and high coverage. The readDepth package runs on Linux and MacOSX, is released under the Apache 2.0 license, and is available at http://code.google.com/p/readdepth/.

  9. Reduced Self-Diploidization and Improved Survival of Semi-cloned Mice Produced from Androgenetic Haploid Embryonic Stem Cells through Overexpression of Dnmt3b

    Directory of Open Access Journals (Sweden)

    Wenteng He

    2018-02-01

    Full Text Available Summary: Androgenetic haploid embryonic stem cells (AG-haESCs hold great promise for exploring gene functions and generating gene-edited semi-cloned (SC mice. However, the high incidence of self-diploidization and low efficiency of SC mouse production are major obstacles preventing widespread use of these cells. Moreover, although SC mice generation could be greatly improved by knocking out the differentially methylated regions of two imprinted genes, 50% of the SC mice did not survive into adulthood. Here, we found that the genome-wide DNA methylation level in AG-haESCs is extremely low. Subsequently, downregulation of both de novo methyltransferase Dnmt3b and other methylation-related genes was determined to be responsible for DNA hypomethylation. We further demonstrated that ectopic expression of Dnmt3b in AG-haESCs could effectively improve DNA methylation level, and the high incidence of self-diploidization could be markedly rescued. More importantly, the developmental potential of SC embryos was improved, and most SC mice could survive into adulthood. : Ectopic expression of Dnmt3b could rescue DNA methylation level in repetitive sequences of hypomethylated AG-haESCs, suppress high incidence of self-diploidization, and promote developmental potential of SC embryos, and most SC mice could survive into adulthood. Keywords: androgenetic haploid embryonic stem cells, self-diploidization, semi-cloned mice, DNA methylation, Dnmt3b

  10. On the effect of certain mutations on the radiosensitivity of haploid and diploid yeast cells

    International Nuclear Information System (INIS)

    Sokurova, E.N.; Korogodin, V.I.

    1978-01-01

    Mutation ade 1-6 in haploid cell Saccharomyces cerevisiae increases half as much against radioresistance of cells. Diploid cells lacking in adenine, homozygous by ade 1-6 mutation, are nearly twice as radiosensitive as prototrophic cells. Hence ade 1-6 mutation increases radioresistance of haploid cells and decreases that of diplois. These changes in radioresistance are not connected with variations in the extrapolation number of survival curve, the ability of cells to recover from radiation damages upon cultivation in an innutrient medium, and with the inactivation form ratio. Lack of adenine influences the radioresistance of diploid yeast irrespective of whether it is or it is not affected by homo- or heterozygosity by the locus of mating type

  11. Doubled haploid production from Spanish onion (Allium cepa L.) germplasm: embryogenesis induction, plant regeneration and chromosome doubling.

    Science.gov (United States)

    Fayos, Oreto; Vallés, María P; Garcés-Claver, Ana; Mallor, Cristina; Castillo, Ana M

    2015-01-01

    The use of doubled haploids in onion breeding is limited due to the low gynogenesis efficiency of this species. Gynogenesis capacity from Spanish germplasm, including the sweet cultivar Fuentes de Ebro, the highly pungent landrace BGHZ1354 and the two Valenciana type commercial varieties Recas and Rita, was evaluated and optimized in this study. The OH-1 population, characterized by a high gynogenesis induction, was used as control. Growing conditions of the donor plants were tested with a one-step protocol and field plants produced a slightly higher percentage of embryogenesis induction than growth chamber plants. A one-step protocol was compared with a two-step protocol for embryogenesis induction. Spanish germplasm produced a 2-3 times higher percentage of embryogenesis with the two-step protocol, Recas showing the highest percentage (2.09%) and Fuentes de Ebro the lowest (0.53%). These percentages were significantly lower than those from the OH-1 population, with an average of 15% independently of the protocol used. The effect of different containers on plant regeneration was tested using both protocols. The highest percentage of acclimated plants was obtained with the two-step protocol in combination with Eco2box (70%), whereas the lowest percentage was observed with glass tubes in the two protocols (20-23%). Different amiprofos-methyl (APM) treatments were applied to embryos for chromosome doubling. A similar number of doubled haploid plants were recovered with 25 or 50 μM APM in liquid medium. However, the application of 25 μM in solid medium for 24 h produced the highest number of doubled haploid plants. Somatic regeneration from flower buds of haploid and mixoploid plants proved to be a successful approach for chromosome doubling, since diploid plants were obtained from the four regenerated lines. In this study, doubled haploid plants were produced from the four Spanish cultivars, however further improvements are needed to increase their gynogenesis

  12. Doubled haploid production from Spanish onion (Allium cepa L. germplasm: embryogenesis induction, plant regeneration and chromosome doubling

    Directory of Open Access Journals (Sweden)

    Oreto eFayos

    2015-05-01

    Full Text Available The use of doubled haploids in onion breeding is limited due to the low gynogenesis efficiency of this species. Gynogenesis capacity from Spanish germplasm, including the sweet cultivar Fuentes de Ebro, the highly pungent landrace BGHZ1354 and the two Valenciana type commercial varieties Recas and Rita, was evaluated and optimized in this study. The OH-1 population, characterized by a high gynogenesis induction, was used as control. Growing conditions of the donor plants were tested with a one-step protocol and field plants produced a slightly higher percentage of embryogenesis induction than growth chamber plants. A one-step protocol was compared with a two-step protocol for embryogenesis induction. Spanish germplasm produced a 2 to 3 times higher percentage of embryogenesis with the two-step protocol, Recas showing the highest percentage (2.09% and Fuentes de Ebro the lowest (0.53%. These percentages were significantly lower than those from the OH-1 population, with an average of 15% independently of the protocol used. The effect of different containers on plant regeneration was tested using both protocols. The highest percentage of acclimated plants was obtained with the two-step protocol in combination with Eco2box (70%, whereas the lowest percentage was observed with glass tubes in the two protocols (20-23%. Different amiprofos-methyl (APM treatments were applied to embryos for chromosome doubling. A similar number of doubled haploid plants were recovered with 25 or 50 µM APM in liquid medium. However, the application of 25 µM in solid medium for 24 h produced the highest number of doubled haploid plants. Somatic regeneration from flower buds of haploid and mixoploid plants proved to be a successful approach for chromosome doubling, since diploid plants were obtained from the 4 regenerated lines. In this study, doubled haploid plants were produced from the four Spanish cultivars, however further improvements are needed to increase their

  13. Characterization of Chinese Hamster Ovary Cells Producing Coagulation Factor VIII Using Multi-omics Tools

    DEFF Research Database (Denmark)

    Kaas, Christian Schrøder

    The first public draft of a genome from Chinese hamster ovary (CHO) cells was published in 2011, an entire decade after the first draft of the human genome. This publication of a relevant CHO reference genome, in combination with the fact that the cost for DNA sequencing has dropped more than 10...... using omics tools. A wide range of methods were applied including whole-genome sequencing, targeted genome sequencing, mRNA sequencing, miRNA sequencing and mass spectrometry based shotgun proteomics on a number of clones in order to get a more holistic picture of the inner workings of these CHO...... transfectants. From the whole-genome sequencing of two CHO genomes (CHO DXB11 and the FVIII producing transfectant: F435) it was observed that roughly 20% of the genes in the genome were haploid and roughly 10% had a copy number of three or higher indicating extensive rearrangements compared to the Chinese...

  14. Effects of gamma irradiation of pollen on parthenogenetic haploid production in muskmelon (Cucumis melo L.)

    International Nuclear Information System (INIS)

    Cuny, F.; Grotte, M.; Dumas de Vaulx, R.; Rieu, A.

    1993-01-01

    The effects of increasing gamma ray exposures on muskmelon pollen of the Védrantais genotype were evaluated after autofertilization and hybridization with the F1.G1 genotype. Regardless of doses of between 0.15 and 1.6 kGy, fruit set and number of seeds per fruit were comparable to those of the control. The pollen tube from pollen irradiated with up to 2.5 kGy grew in styles and reached the ovules. When pollen was cultivated in vitro, relatively high doses of irradiation (1.6 kGy) were needed to reduce the level of germination. Radiation-induced changes in the generative nucleus led to the formation of two chromosomally unbalanced sperm cells (as indicated by the appearance of morphological dimorphism) which induced parthenogenetic development of the egg to form a haploid embryo. Haploid embryo production by gamma-irradiated pollen was genotype dependent. For exposures of between 0.15 and 2.5 kGy, the production of embryos was the same, about 3.4%; a maximum of 70% of these embryos placed in a specific culture medium produced haploid plants. The ploidy of the plantlets in vitro was determined by flow cytometry. No aneuploidy was detected. All resulting plants exhibited normal phenotypes. (author) [fr

  15. Stimulating effect of gamma radiation on haploid wheat production through microscope over co-culture system

    International Nuclear Information System (INIS)

    Naseri, M.; Rahimi, M.; Faramarz, M.

    2004-01-01

    Haploid production focuses on low plant regeneration in some wheat genotypes. Haploid application gamma ray as an electromagnetic ray has ionizing properties which can produce ions when passing through biological matter. It can produce genetic variation therefore, is applied in crop and ornamental improvement to enhance agronomic traits. The most important changes caused by gamma radiation is in DNA structure existing in the nucleus of cell. The desirable agronomic changes then will be passed on through generations. Another property of gamma ray can be it's stimulating effect which is the aim in this investigation. Microspore-overy co-culture of wheat along with application of low doses of gamma radiation 2,3 and 4 Gy as absorbing doses were implemented with the aim to evaluate wheat haploid production. Modified Morashig and Skoog medium was used as induction medium and 190-2 medium for regeneration. Two winter and two spring wheat cultivars were used as genetic material. Low doses of gamma radiation simulated microspore cell division and produced more calli relative to non-irradiated microspores only in winter type wheats. In microspore overy co-culture, filtered microspores were centrifuged and then plated in Petri dishes containing MMS+500 mg/I glutamine with 25 overies

  16. Structural genomic variation in ischemic stroke

    Science.gov (United States)

    Matarin, Mar; Simon-Sanchez, Javier; Fung, Hon-Chung; Scholz, Sonja; Gibbs, J. Raphael; Hernandez, Dena G.; Crews, Cynthia; Britton, Angela; Wavrant De Vrieze, Fabienne; Brott, Thomas G.; Brown, Robert D.; Worrall, Bradford B.; Silliman, Scott; Case, L. Douglas; Hardy, John A.; Rich, Stephen S.; Meschia, James F.; Singleton, Andrew B.

    2008-01-01

    Technological advances in molecular genetics allow rapid and sensitive identification of genomic copy number variants (CNVs). This, in turn, has sparked interest in the function such variation may play in disease. While a role for copy number mutations as a cause of Mendelian disorders is well established, it is unclear whether CNVs may affect risk for common complex disorders. We sought to investigate whether CNVs may modulate risk for ischemic stroke (IS) and to provide a catalog of CNVs in patients with this disorder by analyzing copy number metrics produced as a part of our previous genome-wide single-nucleotide polymorphism (SNP)-based association study of ischemic stroke in a North American white population. We examined CNVs in 263 patients with ischemic stroke (IS). Each identified CNV was compared with changes identified in 275 neurologically normal controls. Our analysis identified 247 CNVs, corresponding to 187 insertions (76%; 135 heterozygous; 25 homozygous duplications or triplications; 2 heterosomic) and 60 deletions (24%; 40 heterozygous deletions;3 homozygous deletions; 14 heterosomic deletions). Most alterations (81%) were the same as, or overlapped with, previously reported CNVs. We report here the first genome-wide analysis of CNVs in IS patients. In summary, our study did not detect any common genomic structural variation unequivocally linked to IS, although we cannot exclude that smaller CNVs or CNVs in genomic regions poorly covered by this methodology may confer risk for IS. The application of genome-wide SNP arrays now facilitates the evaluation of structural changes through the entire genome as part of a genome-wide genetic association study. PMID:18288507

  17. DNA replication stress restricts ribosomal DNA copy number.

    Science.gov (United States)

    Salim, Devika; Bradford, William D; Freeland, Amy; Cady, Gillian; Wang, Jianmin; Pruitt, Steven C; Gerton, Jennifer L

    2017-09-01

    Ribosomal RNAs (rRNAs) in budding yeast are encoded by ~100-200 repeats of a 9.1kb sequence arranged in tandem on chromosome XII, the ribosomal DNA (rDNA) locus. Copy number of rDNA repeat units in eukaryotic cells is maintained far in excess of the requirement for ribosome biogenesis. Despite the importance of the repeats for both ribosomal and non-ribosomal functions, it is currently not known how "normal" copy number is determined or maintained. To identify essential genes involved in the maintenance of rDNA copy number, we developed a droplet digital PCR based assay to measure rDNA copy number in yeast and used it to screen a yeast conditional temperature-sensitive mutant collection of essential genes. Our screen revealed that low rDNA copy number is associated with compromised DNA replication. Further, subculturing yeast under two separate conditions of DNA replication stress selected for a contraction of the rDNA array independent of the replication fork blocking protein, Fob1. Interestingly, cells with a contracted array grew better than their counterparts with normal copy number under conditions of DNA replication stress. Our data indicate that DNA replication stresses select for a smaller rDNA array. We speculate that this liberates scarce replication factors for use by the rest of the genome, which in turn helps cells complete DNA replication and continue to propagate. Interestingly, tumors from mini chromosome maintenance 2 (MCM2)-deficient mice also show a loss of rDNA repeats. Our data suggest that a reduction in rDNA copy number may indicate a history of DNA replication stress, and that rDNA array size could serve as a diagnostic marker for replication stress. Taken together, these data begin to suggest the selective pressures that combine to yield a "normal" rDNA copy number.

  18. DNA replication stress restricts ribosomal DNA copy number

    Science.gov (United States)

    Salim, Devika; Bradford, William D.; Freeland, Amy; Cady, Gillian; Wang, Jianmin

    2017-01-01

    Ribosomal RNAs (rRNAs) in budding yeast are encoded by ~100–200 repeats of a 9.1kb sequence arranged in tandem on chromosome XII, the ribosomal DNA (rDNA) locus. Copy number of rDNA repeat units in eukaryotic cells is maintained far in excess of the requirement for ribosome biogenesis. Despite the importance of the repeats for both ribosomal and non-ribosomal functions, it is currently not known how “normal” copy number is determined or maintained. To identify essential genes involved in the maintenance of rDNA copy number, we developed a droplet digital PCR based assay to measure rDNA copy number in yeast and used it to screen a yeast conditional temperature-sensitive mutant collection of essential genes. Our screen revealed that low rDNA copy number is associated with compromised DNA replication. Further, subculturing yeast under two separate conditions of DNA replication stress selected for a contraction of the rDNA array independent of the replication fork blocking protein, Fob1. Interestingly, cells with a contracted array grew better than their counterparts with normal copy number under conditions of DNA replication stress. Our data indicate that DNA replication stresses select for a smaller rDNA array. We speculate that this liberates scarce replication factors for use by the rest of the genome, which in turn helps cells complete DNA replication and continue to propagate. Interestingly, tumors from mini chromosome maintenance 2 (MCM2)-deficient mice also show a loss of rDNA repeats. Our data suggest that a reduction in rDNA copy number may indicate a history of DNA replication stress, and that rDNA array size could serve as a diagnostic marker for replication stress. Taken together, these data begin to suggest the selective pressures that combine to yield a “normal” rDNA copy number. PMID:28915237

  19. DNA replication stress restricts ribosomal DNA copy number.

    Directory of Open Access Journals (Sweden)

    Devika Salim

    2017-09-01

    Full Text Available Ribosomal RNAs (rRNAs in budding yeast are encoded by ~100-200 repeats of a 9.1kb sequence arranged in tandem on chromosome XII, the ribosomal DNA (rDNA locus. Copy number of rDNA repeat units in eukaryotic cells is maintained far in excess of the requirement for ribosome biogenesis. Despite the importance of the repeats for both ribosomal and non-ribosomal functions, it is currently not known how "normal" copy number is determined or maintained. To identify essential genes involved in the maintenance of rDNA copy number, we developed a droplet digital PCR based assay to measure rDNA copy number in yeast and used it to screen a yeast conditional temperature-sensitive mutant collection of essential genes. Our screen revealed that low rDNA copy number is associated with compromised DNA replication. Further, subculturing yeast under two separate conditions of DNA replication stress selected for a contraction of the rDNA array independent of the replication fork blocking protein, Fob1. Interestingly, cells with a contracted array grew better than their counterparts with normal copy number under conditions of DNA replication stress. Our data indicate that DNA replication stresses select for a smaller rDNA array. We speculate that this liberates scarce replication factors for use by the rest of the genome, which in turn helps cells complete DNA replication and continue to propagate. Interestingly, tumors from mini chromosome maintenance 2 (MCM2-deficient mice also show a loss of rDNA repeats. Our data suggest that a reduction in rDNA copy number may indicate a history of DNA replication stress, and that rDNA array size could serve as a diagnostic marker for replication stress. Taken together, these data begin to suggest the selective pressures that combine to yield a "normal" rDNA copy number.

  20. Cellular heredity in haploid cultures of somatic cells. Annual progress report, March 1, 1975--March 31, 1976

    International Nuclear Information System (INIS)

    Freed, J.J.

    1976-01-01

    In experiments with haploid and diploid derivatives from the haploid frog embryo cell line ICR 2A, we have investigated aspects of cell survival, DNA repair and mutant induction after exposure to 254 nm radiation. Survival curves for haploid and diploid cells in random growth or blocked in the Gl phase of the cell cycle were determined; the survival data do not differ sufficiently to permit the use of such comparisons as an index of recessive lethal induction. Studies of the induction of thymine dimers in DNA indicated that the incidence of dimers in DNA from haploid and diploid cells is similar after exposure of the cells to equal doses of ultraviolet. The cells are capable of photoreversing dimers but appear to be deficient in excision repair. In an attempt to examine the effect of the permitted mode of DNA repair on the yield of mutations, we compared the incidence of ouabain-resistant variants among survivors of ultraviolet exposure and of ultraviolet exposure followed by photoreversal. Although the yield of resistant colonies was small, the data suggest that photoreversal lowers the yield of resistant colonies and thus that the induction of this phenotype is related to dimer persistence in DNA. We have also observed by fluorescence microscopy that an acridine mustard mutagen, ICR 191, is preferentially accumulated in cytoplasmic granules having the intracellular distribution pattern of lysosomes. This form of incorporation may be significant in the apparently non-genetic early toxicity of this compound observed in experiments with cultured cells

  1. Industrial Relevance of Chromosomal Copy Number Variation in Saccharomyces Yeasts

    Science.gov (United States)

    Gorter de Vries, Arthur R.; Pronk, Jack T.

    2017-01-01

    ABSTRACT Chromosomal copy number variation (CCNV) plays a key role in evolution and health of eukaryotes. The unicellular yeast Saccharomyces cerevisiae is an important model for studying the generation, physiological impact, and evolutionary significance of CCNV. Fundamental studies of this yeast have contributed to an extensive set of methods for analyzing and introducing CCNV. Moreover, these studies provided insight into the balance between negative and positive impacts of CCNV in evolutionary contexts. A growing body of evidence indicates that CCNV not only frequently occurs in industrial strains of Saccharomyces yeasts but also is a key contributor to the diversity of industrially relevant traits. This notion is further supported by the frequent involvement of CCNV in industrially relevant traits acquired during evolutionary engineering. This review describes recent developments in genome sequencing and genome editing techniques and discusses how these offer opportunities to unravel contributions of CCNV in industrial Saccharomyces strains as well as to rationally engineer yeast chromosomal copy numbers and karyotypes. PMID:28341679

  2. Copy-number variants in neurodevelopmental disorders: promises and challenges.

    LENUS (Irish Health Repository)

    Merikangas, Alison K

    2012-02-01

    Copy-number variation (CNV) is the most prevalent type of structural variation in the human genome. There is emerging evidence that copy-number variants (CNVs) provide a new vista on understanding susceptibility to neuropsychiatric disorders. Some challenges in the interpretation of current CNV studies include the use of overlapping samples, differing phenotypic definitions, an absence of population norms for CNVs and a lack of consensus in methods for CNV detection and analysis. Here, we review current CNV association study methods and results in autism spectrum disorders (ASD) and schizophrenia, and provide suggestions for design approaches to future studies that might maximize the translation of this work to etiological understanding.

  3. Exploratory analysis of the copy number alterations in glioblastoma multiforme.

    Directory of Open Access Journals (Sweden)

    Pablo Freire

    Full Text Available The Cancer Genome Atlas project (TCGA has initiated the analysis of multiple samples of a variety of tumor types, starting with glioblastoma multiforme. The analytical methods encompass genomic and transcriptomic information, as well as demographic and clinical data about the sample donors. The data create the opportunity for a systematic screening of the components of the molecular machinery for features that may be associated with tumor formation. The wealth of existing mechanistic information about cancer cell biology provides a natural reference for the exploratory exercise.Glioblastoma multiforme DNA copy number data was generated by The Cancer Genome Atlas project for 167 patients using 227 aCGH experiments, and was analyzed to build a catalog of aberrant regions. Genome screening was performed using an information theory approach in order to quantify aberration as a deviation from a centrality without the bias of untested assumptions about its parametric nature. A novel Cancer Genome Browser software application was developed and is made public to provide a user-friendly graphical interface in which the reported results can be reproduced. The application source code and stand alone executable are available at (http://code.google.com/p/cancergenome and (http://bioinformaticstation.org, respectively.The most important known copy number alterations for glioblastoma were correctly recovered using entropy as a measure of aberration. Additional alterations were identified in different pathways, such as cell proliferation, cell junctions and neural development. Moreover, novel candidates for oncogenes and tumor suppressors were also detected. A detailed map of aberrant regions is provided.

  4. Exploratory analysis of the copy number alterations in glioblastoma multiforme.

    Science.gov (United States)

    Freire, Pablo; Vilela, Marco; Deus, Helena; Kim, Yong-Wan; Koul, Dimpy; Colman, Howard; Aldape, Kenneth D; Bogler, Oliver; Yung, W K Alfred; Coombes, Kevin; Mills, Gordon B; Vasconcelos, Ana T; Almeida, Jonas S

    2008-01-01

    The Cancer Genome Atlas project (TCGA) has initiated the analysis of multiple samples of a variety of tumor types, starting with glioblastoma multiforme. The analytical methods encompass genomic and transcriptomic information, as well as demographic and clinical data about the sample donors. The data create the opportunity for a systematic screening of the components of the molecular machinery for features that may be associated with tumor formation. The wealth of existing mechanistic information about cancer cell biology provides a natural reference for the exploratory exercise. Glioblastoma multiforme DNA copy number data was generated by The Cancer Genome Atlas project for 167 patients using 227 aCGH experiments, and was analyzed to build a catalog of aberrant regions. Genome screening was performed using an information theory approach in order to quantify aberration as a deviation from a centrality without the bias of untested assumptions about its parametric nature. A novel Cancer Genome Browser software application was developed and is made public to provide a user-friendly graphical interface in which the reported results can be reproduced. The application source code and stand alone executable are available at (http://code.google.com/p/cancergenome) and (http://bioinformaticstation.org), respectively. The most important known copy number alterations for glioblastoma were correctly recovered using entropy as a measure of aberration. Additional alterations were identified in different pathways, such as cell proliferation, cell junctions and neural development. Moreover, novel candidates for oncogenes and tumor suppressors were also detected. A detailed map of aberrant regions is provided.

  5. Identification of copy number variants in horses

    KAUST Repository

    Doan, R.

    2012-03-01

    Copy number variants (CNVs) represent a substantial source of genetic variation in mammals. However, the occurrence of CNVs in horses and their subsequent impact on phenotypic variation is unknown. We performed a study to identify CNVs in 16 horses representing 15 distinct breeds (Equus caballus) and an individual gray donkey (Equus asinus) using a whole-exome tiling array and the array comparative genomic hybridization methodology. We identified 2368 CNVs ranging in size from 197 bp to 3.5 Mb. Merging identical CNVs from each animal yielded 775 CNV regions (CNVRs), involving 1707 protein- and RNA-coding genes. The number of CNVs per animal ranged from 55 to 347, with median and mean sizes of CNVs of 5.3 kb and 99.4 kb, respectively. Approximately 6% of the genes investigated were affected by a CNV. Biological process enrichment analysis indicated CNVs primarily affected genes involved in sensory perception, signal transduction, and metabolism. CNVs also were identified in genes regulating blood group antigens, coat color, fecundity, lactation, keratin formation, neuronal homeostasis, and height in other species. Collectively, these data are the first report of copy number variation in horses and suggest that CNVs are common in the horse genome and may modulate biological processes underlying different traits observed among horses and horse breeds.

  6. The RNA 5 of Prunus necrotic ringspot virus is a biologically inactive copy of the 3'-UTR of the genomic RNA 3.

    Science.gov (United States)

    Di Terlizzi, B; Skrzeczkowski, L J; Mink, G I; Scott, S W; Zimmerman, M T

    2001-01-01

    In addition to the four RNAs known to be encapsidated by Prunus necrotic ringspot virus (PNRSV) and Apple mosaic virus (ApMV), an additional small RNA (RNA 5) was present in purified preparations of several isolates of both viruses. RNA 5 was always produced following infection of a susceptible host by an artificial mixture of RNAs 1, 2, 3, and 4 indicating that it was a product of viral replication. RNA 5 does not activate the infectivity of mixtures that contain the three genomic RNAs (RNA 1 + RNA 2 + RNA 3) nor does it appear to modify symptom expression. Results from hybridization studies suggested that RNA 5 had partial sequence homology with RNAs 1, 2, 3, and 4. Cloning and sequencing the RNA 5 of isolate CH 57/1-M of PNRSV, and the 3' termini of the RNA 1, RNA 2 and RNA 3 of this isolate indicated that it was a copy of the 3' untranslated terminal region (3'-UTR) of the genomic RNA 3.

  7. Brewing characteristics of haploid strains isolated from sake yeast Kyokai No. 7.

    Science.gov (United States)

    Katou, Taku; Kitagaki, Hiroshi; Akao, Takeshi; Shimoi, Hitoshi

    2008-11-01

    Sake yeast exhibit various characteristics that make them more suitable for sake brewing compared to other yeast strains. Since sake yeast strains are Saccharomyces cerevisiae heterothallic diploid strains, it is likely that they have heterozygous alleles on homologous chromosomes (heterozygosity) due to spontaneous mutations. If this is the case, segregation of phenotypic traits in haploid strains after sporulation and concomitant meiosis of sake yeast strains would be expected to occur. To examine this hypothesis, we isolated 100 haploid strains from Kyokai No. 7 (K7), a typical sake yeast strain in Japan, and compared their brewing characteristics in small-scale sake-brewing tests. Analyses of the resultant sake samples showed a smooth and continuous distribution of analytical values for brewing characteristics, suggesting that K7 has multiple heterozygosities that affect brewing characteristics and that these heterozygous alleles do segregate after sporulation. Correlation and principal component analyses suggested that the analytical parameters could be classified into two groups, indicating fermentation ability and sake flavour. (c) 2008 John Wiley & Sons, Ltd.

  8. Copy number variation of KIR genes influences HIV-1 control

    DEFF Research Database (Denmark)

    Pelak, Kimberly; Need, Anna C; Fellay, Jacques

    2011-01-01

    A genome-wide screen for large structural variants showed that a copy number variant (CNV) in the region encoding killer cell immunoglobulin-like receptors (KIR) associates with HIV-1 control as measured by plasma viral load at set point in individuals of European ancestry. This CNV encompasses t...

  9. DNA copy number, including telomeres and mitochondria, assayed using next-generation sequencing

    Directory of Open Access Journals (Sweden)

    Jackson Stuart

    2010-04-01

    Full Text Available Abstract Background DNA copy number variations occur within populations and aberrations can cause disease. We sought to develop an improved lab-automatable, cost-efficient, accurate platform to profile DNA copy number. Results We developed a sequencing-based assay of nuclear, mitochondrial, and telomeric DNA copy number that draws on the unbiased nature of next-generation sequencing and incorporates techniques developed for RNA expression profiling. To demonstrate this platform, we assayed UMC-11 cells using 5 million 33 nt reads and found tremendous copy number variation, including regions of single and homogeneous deletions and amplifications to 29 copies; 5 times more mitochondria and 4 times less telomeric sequence than a pool of non-diseased, blood-derived DNA; and that UMC-11 was derived from a male individual. Conclusion The described assay outputs absolute copy number, outputs an error estimate (p-value, and is more accurate than array-based platforms at high copy number. The platform enables profiling of mitochondrial levels and telomeric length. The assay is lab-automatable and has a genomic resolution and cost that are tunable based on the number of sequence reads.

  10. Cellular heredity in haploid cultures of somatic cells, March 1968-April 1981. Final report

    International Nuclear Information System (INIS)

    Freed, J.J.

    1982-03-01

    An account is given of the development and application to cell-culture genetics of unique haploid cell lines from frog embryo developed in this laboratory. Since 1968, the main aim of this project has been to develop the haploid cell system for studies of mutagenesis in culture, particularly by ultraviolet radiation. In the course of this work we isolated chromosomally stable cell lines, derived and characterized a number of variants, and adapted cell hybridization and other methods to this material. Particular emphasis was placed on ultraviolet photobiology, including studies of cell survival, mutagenesis, and pathways of repair of uv-damaged DNA. Although at present less widely used for genetic experiments than mammalian cell lines, the frog cells offer the advantages of authentic haploidy and a favorable repertory of DNA repair pathways for study of uv mutagenesis

  11. The positioning logic and copy number control of genes in bacteria under stress

    Science.gov (United States)

    Zhang, Qiucen; Austin, Robert; Vyawahare, Saurabh; Lau, Alexandra

    2013-03-01

    Escherichia coli (E. coli) cells when challenged with sublethal concentrations of the genotoxic antibiotic ciprofloxacin cease to divide and form long filaments which contain multiple bacterial chromosomes. These filaments are individual mesoscopic environmental niches which provide protection for a community of chromosomes (as opposed to cells) under mutagenic stress and can provide an evolutionary fitness advantage within the niche. We use comparative genomic hybridization to show that the mesoscopic niche evolves within 20 minutes of ciprofloxacin exposure via replication of multiple copies of genes expressing ATP dependent transporters. We show that this rapid genomic amplification is done in a time efficient manner via placement of the genes encoding the pumps near the origin of replication on the bacterial chromosome. The de-amplification of multiple copies back to the wild type number is a function of the duration is a function of the ciprofloxacin exposure duration: the longer the exposure, the slower the removal of the multiple copies. The project described was supported by the National Science Foundation and the National Cancer Institute

  12. Segregation distortion causes large-scale differences between male and female genomes in hybrid ants.

    Science.gov (United States)

    Kulmuni, Jonna; Seifert, Bernhard; Pamilo, Pekka

    2010-04-20

    Hybridization in isolated populations can lead either to hybrid breakdown and extinction or in some cases to speciation. The basis of hybrid breakdown lies in genetic incompatibilities between diverged genomes. In social Hymenoptera, the consequences of hybridization can differ from those in other animals because of haplodiploidy and sociality. Selection pressures differ between sexes because males are haploid and females are diploid. Furthermore, sociality and group living may allow survival of hybrid genotypes. We show that hybridization in Formica ants has resulted in a stable situation in which the males form two highly divergent gene pools whereas all the females are hybrids. This causes an exceptional situation with large-scale differences between male and female genomes. The genotype differences indicate strong transmission ratio distortion depending on offspring sex, whereby the mother transmits some alleles exclusively to her daughters and other alleles exclusively to her sons. The genetic differences between the sexes and the apparent lack of multilocus hybrid genotypes in males can be explained by recessive incompatibilities which cause the elimination of hybrid males because of their haploid genome. Alternatively, differentiation between sexes could be created by prezygotic segregation into male-forming and female-forming gametes in diploid females. Differentiation between sexes is stable and maintained throughout generations. The present study shows a unique outcome of hybridization and demonstrates that hybridization has the potential of generating evolutionary novelties in animals.

  13. Untangling nucleotide diversity and evolution of the H genome in polyploid Hordeum and Elymus species based on the single copy of nuclear gene DMC1.

    Directory of Open Access Journals (Sweden)

    Dongfa Sun

    Full Text Available Numerous hybrid and polypoid species are found within the Triticeae. It has been suggested that the H subgenome of allopolyploid Elymus (wheatgrass species originated from diploid Hordeum (barley species, but the role of hybridization between polyploid Elymus and Hordeum has not been studied. It is not clear whether gene flow across polyploid Hordeum and Elymus species has occurred following polyploid speciation. Answering these questions will provide new insights into the formation of these polyploid species, and the potential role of gene flow among polyploid species during polyploid evolution. In order to address these questions, disrupted meiotic cDNA1 (DMC1 data from the allopolyploid StH Elymus are analyzed together with diploid and polyploid Hordeum species. Phylogenetic analysis revealed that the H copies of DMC1 sequence in some Elymus are very close to the H copies of DMC1 sequence in some polyploid Hordeum species, indicating either that the H genome in theses Elymus and polyploid Hordeum species originated from same diploid donor or that gene flow has occurred among them. Our analysis also suggested that the H genomes in Elymus species originated from limited gene pool, while H genomes in Hordeum polyploids have originated from broad gene pools. Nucleotide diversity (π of the DMC1 sequences on H genome from polyploid species (π = 0.02083 in Elymus, π = 0.01680 in polyploid Hordeum is higher than that in diploid Hordeum (π = 0.01488. The estimates of Tajima's D were significantly departure from the equilibrium neutral model at this locus in diploid Hordeum species (P<0.05, suggesting an excess of rare variants in diploid species which may not contribute to the origination of polyploids. Nucleotide diversity (π of the DMC1 sequences in Elymus polyploid species (π = 0.02083 is higher than that in polyploid Hordeum (π = 0.01680, suggesting that the degree of relationships between two parents of a polyploid might be a factor

  14. A large-scale survey of genetic copy number variations among Han Chinese residing in Taiwan

    Directory of Open Access Journals (Sweden)

    Wu Jer-Yuarn

    2008-12-01

    Full Text Available Abstract Background Copy number variations (CNVs have recently been recognized as important structural variations in the human genome. CNVs can affect gene expression and thus may contribute to phenotypic differences. The copy number inferring tool (CNIT is an effective hidden Markov model-based algorithm for estimating allele-specific copy number and predicting chromosomal alterations from single nucleotide polymorphism microarrays. The CNIT algorithm, which was constructed using data from 270 HapMap multi-ethnic individuals, was applied to identify CNVs from 300 unrelated Han Chinese individuals in Taiwan. Results Using stringent selection criteria, 230 regions with variable copy numbers were identified in the Han Chinese population; 133 (57.83% had been reported previously, 64 displayed greater than 1% CNV allele frequency. The average size of the CNV regions was 322 kb (ranging from 1.48 kb to 5.68 Mb and covered a total of 2.47% of the human genome. A total of 196 of the CNV regions were simple deletions and 27 were simple amplifications. There were 449 genes and 5 microRNAs within these CNV regions; some of these genes are known to be associated with diseases. Conclusion The identified CNVs are characteristic of the Han Chinese population and should be considered when genetic studies are conducted. The CNV distribution in the human genome is still poorly characterized, and there is much diversity among different ethnic populations.

  15. The Yeast Deletion Collection: A Decade of Functional Genomics

    Science.gov (United States)

    Giaever, Guri; Nislow, Corey

    2014-01-01

    The yeast deletion collections comprise >21,000 mutant strains that carry precise start-to-stop deletions of ∼6000 open reading frames. This collection includes heterozygous and homozygous diploids, and haploids of both MATa and MATα mating types. The yeast deletion collection, or yeast knockout (YKO) set, represents the first and only complete, systematically constructed deletion collection available for any organism. Conceived during the Saccharomyces cerevisiae sequencing project, work on the project began in 1998 and was completed in 2002. The YKO strains have been used in numerous laboratories in >1000 genome-wide screens. This landmark genome project has inspired development of numerous genome-wide technologies in organisms from yeast to man. Notable spinoff technologies include synthetic genetic array and HIPHOP chemogenomics. In this retrospective, we briefly describe the yeast deletion project and some of its most noteworthy biological contributions and the impact that these collections have had on the yeast research community and on genomics in general. PMID:24939991

  16. Genome aliquoting with double cut and join

    Directory of Open Access Journals (Sweden)

    Sankoff David

    2008-01-01

    Full Text Available Abstract Background The genome aliquoting probem is, given an observed genome A with n copies of each gene, presumed to descend from an n-way polyploidization event from an ordinary diploid genome B, followed by a history of chromosomal rearrangements, to reconstruct the identity of the original genome B'. The idea is to construct B', containing exactly one copy of each gene, so as to minimize the number of rearrangements d(A, B' ⊕ B' ⊕ ... ⊕ B' necessary to convert the observed genome B' ⊕ B' ⊕ ... ⊕ B' into A. Results In this paper we make the first attempt to define and solve the genome aliquoting problem. We present a heuristic algorithm for the problem as well the data from our experiments demonstrating its validity. Conclusion The heuristic performs well, consistently giving a non-trivial result. The question as to the existence or non-existence of an exact solution to this problem remains open.

  17. Accurate measurement of transgene copy number in crop plants using droplet digital PCR.

    Science.gov (United States)

    Collier, Ray; Dasgupta, Kasturi; Xing, Yan-Ping; Hernandez, Bryan Tarape; Shao, Min; Rohozinski, Dominica; Kovak, Emma; Lin, Jeanie; de Oliveira, Maria Luiza P; Stover, Ed; McCue, Kent F; Harmon, Frank G; Blechl, Ann; Thomson, James G; Thilmony, Roger

    2017-06-01

    Genetic transformation is a powerful means for the improvement of crop plants, but requires labor- and resource-intensive methods. An efficient method for identifying single-copy transgene insertion events from a population of independent transgenic lines is desirable. Currently, transgene copy number is estimated by either Southern blot hybridization analyses or quantitative polymerase chain reaction (qPCR) experiments. Southern hybridization is a convincing and reliable method, but it also is expensive, time-consuming and often requires a large amount of genomic DNA and radioactively labeled probes. Alternatively, qPCR requires less DNA and is potentially simpler to perform, but its results can lack the accuracy and precision needed to confidently distinguish between one- and two-copy events in transgenic plants with large genomes. To address this need, we developed a droplet digital PCR-based method for transgene copy number measurement in an array of crops: rice, citrus, potato, maize, tomato and wheat. The method utilizes specific primers to amplify target transgenes, and endogenous reference genes in a single duplexed reaction containing thousands of droplets. Endpoint amplicon production in the droplets is detected and quantified using sequence-specific fluorescently labeled probes. The results demonstrate that this approach can generate confident copy number measurements in independent transgenic lines in these crop species. This method and the compendium of probes and primers will be a useful resource for the plant research community, enabling the simple and accurate determination of transgene copy number in these six important crop species. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.

  18. Rare Genome-Wide Copy Number Variation and Expression of Schizophrenia in 22q11.2 Deletion Syndrome.

    Science.gov (United States)

    Bassett, Anne S; Lowther, Chelsea; Merico, Daniele; Costain, Gregory; Chow, Eva W C; van Amelsvoort, Therese; McDonald-McGinn, Donna; Gur, Raquel E; Swillen, Ann; Van den Bree, Marianne; Murphy, Kieran; Gothelf, Doron; Bearden, Carrie E; Eliez, Stephan; Kates, Wendy; Philip, Nicole; Sashi, Vandana; Campbell, Linda; Vorstman, Jacob; Cubells, Joseph; Repetto, Gabriela M; Simon, Tony; Boot, Erik; Heung, Tracy; Evers, Rens; Vingerhoets, Claudia; van Duin, Esther; Zackai, Elaine; Vergaelen, Elfi; Devriendt, Koen; Vermeesch, Joris R; Owen, Michael; Murphy, Clodagh; Michaelovosky, Elena; Kushan, Leila; Schneider, Maude; Fremont, Wanda; Busa, Tiffany; Hooper, Stephen; McCabe, Kathryn; Duijff, Sasja; Isaev, Karin; Pellecchia, Giovanna; Wei, John; Gazzellone, Matthew J; Scherer, Stephen W; Emanuel, Beverly S; Guo, Tingwei; Morrow, Bernice E; Marshall, Christian R

    2017-11-01

    Chromosome 22q11.2 deletion syndrome (22q11.2DS) is associated with a more than 20-fold increased risk for developing schizophrenia. The aim of this study was to identify additional genetic factors (i.e., "second hits") that may contribute to schizophrenia expression. Through an international consortium, the authors obtained DNA samples from 329 psychiatrically phenotyped subjects with 22q11.2DS. Using a high-resolution microarray platform and established methods to assess copy number variation (CNV), the authors compared the genome-wide burden of rare autosomal CNV, outside of the 22q11.2 deletion region, between two groups: a schizophrenia group and those with no psychotic disorder at age ≥25 years. The authors assessed whether genes overlapped by rare CNVs were overrepresented in functional pathways relevant to schizophrenia. Rare CNVs overlapping one or more protein-coding genes revealed significant between-group differences. For rare exonic duplications, six of 19 gene sets tested were enriched in the schizophrenia group; genes associated with abnormal nervous system phenotypes remained significant in a stepwise logistic regression model and showed significant interactions with 22q11.2 deletion region genes in a connectivity analysis. For rare exonic deletions, the schizophrenia group had, on average, more genes overlapped. The additional rare CNVs implicated known (e.g., GRM7, 15q13.3, 16p12.2) and novel schizophrenia risk genes and loci. The results suggest that additional rare CNVs overlapping genes outside of the 22q11.2 deletion region contribute to schizophrenia risk in 22q11.2DS, supporting a multigenic hypothesis for schizophrenia. The findings have implications for understanding expression of psychotic illness and herald the importance of whole-genome sequencing to appreciate the overall genomic architecture of schizophrenia.

  19. Generation of meiomaps of genome-wide recombination and chromosome segregation in human oocytes

    DEFF Research Database (Denmark)

    Ottolini, Christian S; Capalbo, Antonio; Newnham, Louise

    2016-01-01

    We have developed a protocol for the generation of genome-wide maps (meiomaps) of recombination and chromosome segregation for the three products of human female meiosis: the first and second polar bodies (PB1 and PB2) and the corresponding oocyte. PB1 is biopsied and the oocyte is artificially......-nucleotide polymorphisms (SNPs) genome-wide by microarray. Informative maternal heterozygous SNPs are phased using a haploid PB2 or oocyte as a reference. A simple algorithm is then used to identify the maternal haplotypes for each chromosome, in all of the products of meiosis for each oocyte. This allows mapping...

  20. Production of haploid plant of 'Banpeiyu' pummelo [Citrus maxima (Burm.) Merr.] by pollination with soft X-ray-irradiated pollen

    International Nuclear Information System (INIS)

    Yahata, Masaki; Yasuda, Kiichi; Kunitake, Hisato; Nagasawa, Kohji; Harusaki, Seiichi; Komatsu, Haruki

    2010-01-01

    To induce haploid plants in Citrus maxima (Burm.) Merr. 'Banpeiyu', we evaluated the effect of pollination with soft X-ray-irradiated pollen on fruit set and seed development, and carried out ovule culture. When 'Banpeiyu' pummelo pistils were pollinated with X-ray-irradiated pollen of 'Fukuhara' sweet orange [C. sinensis (L.) Osbeck], the exposure doses affected the fruit set. The number of seeds per fruit was also affected by the exposure dose, and tended to decrease as the dose increased; however, all developed seeds obtained from these crosses were diploid. In the ovule culture of 'Banpeiyu' pummelo, six embryoids shown haploidy were obtained in all treatments. One haploid plantlet with 9 chromosomes was regenerated from an embryoid in a culture of ovules established 40 days after pollination with 400 Gray (Gy)-irradiated pollen of 'Tosa-buntan' pummelo (C. maxima). This haploid was suggested to be derived from 'Banpeiyu' pummelo by random amplified polymorphic DNA (RAPD) and cleaved amplified polymorphic sequence (CAPS) analysis. (author)

  1. New cytogenetically visible copy number variant in region 8q21.2

    Directory of Open Access Journals (Sweden)

    Ewers Elisabeth

    2011-01-01

    Full Text Available Abstract Background Cytogenetically visible unbalanced chromosomal abnormalities (UBCA, reported for >50 euchromatic regions of almost all human autosomes, are comprised of a few megabases of DNA, and carriers are in many cases clinically healthy. It may be speculated, that some of the UBCA may be similar or identical to copy number variants (CNV of the human genome. Results Here we report on a yet unreported cytogenetically visible copy number variant (CNV in the long arm of chromosome 8, region 8q21.2, detected in three unrelated clinically healthy carriers. Conclusion The first description of a cytogenetically visible CNV/UBCA in 8q21.2 shows that banding cytogenetics is far from being outdated. It is a cost efficient, up-to-date method for a single cell specific overview on the whole genome, still prepared to deliver unexpected findings.

  2. Production of haploid plants from ten hybrids of bread wheat (Triticum aestivum L. through wide hybridization with maize (Zea mays L. Producción de plantas haploides a partir de 10 híbridos de trigo para pan (Triticum aestivum L. mediante hibridación interespecífica con maíz (Zea mays L.

    Directory of Open Access Journals (Sweden)

    L.E. Torres

    2010-12-01

    Full Text Available The aim of this work was to obtain haploid plants of bread wheat through wide hybridization with maize. The experimental material included ten bread wheat hybrids (female parent and one population of maize (pollen donor. Two assays were carried out in two different seasons (summer and winter. Wheat spikes were manually emasculated, each spike was pollinated twice with fresh pollen of maize and a solution of 2,4-D (100 mg l-1 was sprayed on pollinated florets and injected in the upper internode. Fifteen and 21 days after pollination caryopses were removed and surface sterilized. Embryos were cultured in tubes containing B5 medium. The ten hybrid combinations produced caryopses, but only eight of these hybrids produced embryos and, in six of them, the recovered embryos developed into haploid plantlets. The results showed that there is genotypic influence of the wheat parents on the percentage of haploid embryo formation, in accordance with the results obtained by other authors. Regardless of the genotype, the sowing season and the harvest date, 69.4% of the pollinated flowers gave place to the formation of caryopses, 5.5% of these caryopses developed into presumably haploid embryos (for their morphological phenotypes and 26.1 % of the recovered embryos developed into haploid plantlets.El objetivo del presente trabajo fue obtener plantas haploides de trigo para pan mediante hibridación interespecífica con maíz. Se utilizaron 10 híbridos de trigo para pan (madre y una población de maíz (donante de polen; se llevaron a cabo dos ensayos en distintas estaciones de cultivo. Cada espiga de trigo fue emasculada manualmente y polinizada dos veces con polen fresco de maíz; las flores polinizadas se pulverizaron con una solución de 2,4-D (100 mg l-1, la que también se inyectó en la base de la espiga. Las semillas se cosecharon a los 15 y 21 días posteriores a la polinización. Los embriones recuperados se colocaron en tubos conteniendo medio de

  3. Breeding of a xylose-fermenting hybrid strain by mating genetically engineered haploid strains derived from industrial Saccharomyces cerevisiae.

    Science.gov (United States)

    Inoue, Hiroyuki; Hashimoto, Seitaro; Matsushika, Akinori; Watanabe, Seiya; Sawayama, Shigeki

    2014-12-01

    The industrial Saccharomyces cerevisiae IR-2 is a promising host strain to genetically engineer xylose-utilizing yeasts for ethanol fermentation from lignocellulosic hydrolysates. Two IR-2-based haploid strains were selected based upon the rate of xylulose fermentation, and hybrids were obtained by mating recombinant haploid strains harboring heterogeneous xylose dehydrogenase (XDH) (wild-type NAD(+)-dependent XDH or engineered NADP(+)-dependent XDH, ARSdR), xylose reductase (XR) and xylulose kinase (XK) genes. ARSdR in the hybrids selected for growth rates on yeast extract-peptone-dextrose (YPD) agar and YP-xylose agar plates typically had a higher activity than NAD(+)-dependent XDH. Furthermore, the xylose-fermenting performance of the hybrid strain SE12 with the same level of heterogeneous XDH activity was similar to that of a recombinant strain of IR-2 harboring a single set of genes, XR/ARSdR/XK. These results suggest not only that the recombinant haploid strains retain the appropriate genetic background of IR-2 for ethanol production from xylose but also that ARSdR is preferable for xylose fermentation.

  4. Genome-wide copy number variation analysis identified deletions in SFMBT1 associated with fasting plasma glucose in a Han Chinese population.

    Science.gov (United States)

    Chung, Ren-Hua; Chiu, Yen-Feng; Hung, Yi-Jen; Lee, Wen-Jane; Wu, Kwan-Dun; Chen, Hui-Ling; Lin, Ming-Wei; Chen, Yii-Der I; Quertermous, Thomas; Hsiung, Chao A

    2017-08-08

    Fasting glucose and fasting insulin are glycemic traits closely related to diabetes, and understanding the role of genetic factors in these traits can help reveal the etiology of type 2 diabetes. Although single nucleotide polymorphisms (SNPs) in several candidate genes have been found to be associated with fasting glucose and fasting insulin, copy number variations (CNVs), which have been reported to be associated with several complex traits, have not been reported for association with these two traits. We aimed to identify CNVs associated with fasting glucose and fasting insulin. We conducted a genome-wide CNV association analysis for fasting plasma glucose (FPG) and fasting plasma insulin (FPI) using a family-based genome-wide association study sample from a Han Chinese population in Taiwan. A family-based CNV association test was developed in this study to identify common CNVs (i.e., CNVs with frequencies ≥ 5%), and a generalized estimating equation approach was used to test the associations between the traits and counts of global rare CNVs (i.e., CNVs with frequencies <5%). We found a significant genome-wide association for common deletions with a frequency of 5.2% in the Scm-like with four mbt domains 1 (SFMBT1) gene with FPG (association p-value = 2×10 -4 and an adjusted p-value = 0.0478 for multiple testing). No significant association was observed between global rare CNVs and FPG or FPI. The deletions in 20 individuals with DNA samples available were successfully validated using PCR-based amplification. The association of the deletions in SFMBT1 with FPG was further evaluated using an independent population-based replication sample obtained from the Taiwan Biobank. An association p-value of 0.065, which was close to the significance level of 0.05, for FPG was obtained by testing 9 individuals with CNVs in the SFMBT1 gene region and 11,692 individuals with normal copies in the replication cohort. Previous studies have found that SNPs in SFMBT1 are

  5. Specific functions of the Rep and Rep' proteins of porcine circovirus during copy-release and rolling-circle DNA replication

    Science.gov (United States)

    The roles of two porcine circovirus replication initiator proteins, Rep and Rep', in generating copy-release and rolling-circle DNA replication intermediates were determined. Rep uses the supercoiled closed-circular genome (ccc) to initiate leading-strand synthesis (identical to copy-release replica...

  6. Transposon domestication versus mutualism in ciliate genome rearrangements.

    Directory of Open Access Journals (Sweden)

    Alexander Vogt

    Full Text Available Ciliated protists rearrange their genomes dramatically during nuclear development via chromosome fragmentation and DNA deletion to produce a trimmer and highly reorganized somatic genome. The deleted portion of the genome includes potentially active transposons or transposon-like sequences that reside in the germline. Three independent studies recently showed that transposase proteins of the DDE/DDD superfamily are indispensible for DNA processing in three distantly related ciliates. In the spirotrich Oxytricha trifallax, high copy-number germline-limited transposons mediate their own excision from the somatic genome but also contribute to programmed genome rearrangement through a remarkable transposon mutualism with the host. By contrast, the genomes of two oligohymenophorean ciliates, Tetrahymena thermophila and Paramecium tetraurelia, encode homologous PiggyBac-like transposases as single-copy genes in both their germline and somatic genomes. These domesticated transposases are essential for deletion of thousands of different internal sequences in these species. This review contrasts the events underlying somatic genome reduction in three different ciliates and considers their evolutionary origins and the relationships among their distinct mechanisms for genome remodeling.

  7. Development of a doubled haploid system for wheat through wheat ...

    African Journals Online (AJOL)

    La variété de maïs Kelvedon glory, avait la meilleure réponse parmi les variétés des maïs. Des considérables différences variétales dans le rendement de production d'haploides étaient évidentes à la fois dans les variétés de blé et de maïs. Mots Clés: Plantes homozygènes, protocole de rendement, Tritium aestivum, Zea ...

  8. A spatial haplotype copying model with applications to genotype imputation.

    Science.gov (United States)

    Yang, Wen-Yun; Hormozdiari, Farhad; Eskin, Eleazar; Pasaniuc, Bogdan

    2015-05-01

    Ever since its introduction, the haplotype copy model has proven to be one of the most successful approaches for modeling genetic variation in human populations, with applications ranging from ancestry inference to genotype phasing and imputation. Motivated by coalescent theory, this approach assumes that any chromosome (haplotype) can be modeled as a mosaic of segments copied from a set of chromosomes sampled from the same population. At the core of the model is the assumption that any chromosome from the sample is equally likely to contribute a priori to the copying process. Motivated by recent works that model genetic variation in a geographic continuum, we propose a new spatial-aware haplotype copy model that jointly models geography and the haplotype copying process. We extend hidden Markov models of haplotype diversity such that at any given location, haplotypes that are closest in the genetic-geographic continuum map are a priori more likely to contribute to the copying process than distant ones. Through simulations starting from the 1000 Genomes data, we show that our model achieves superior accuracy in genotype imputation over the standard spatial-unaware haplotype copy model. In addition, we show the utility of our model in selecting a small personalized reference panel for imputation that leads to both improved accuracy as well as to a lower computational runtime than the standard approach. Finally, we show our proposed model can be used to localize individuals on the genetic-geographical map on the basis of their genotype data.

  9. Integrative analysis of copy number and gene expression data suggests novel pathogenetic mechanisms in primary myelofibrosis.

    Science.gov (United States)

    Salati, Simona; Zini, Roberta; Nuzzo, Simona; Guglielmelli, Paola; Pennucci, Valentina; Prudente, Zelia; Ruberti, Samantha; Rontauroli, Sebastiano; Norfo, Ruggiero; Bianchi, Elisa; Bogani, Costanza; Rotunno, Giada; Fanelli, Tiziana; Mannarelli, Carmela; Rosti, Vittorio; Salmoiraghi, Silvia; Pietra, Daniela; Ferrari, Sergio; Barosi, Giovanni; Rambaldi, Alessandro; Cazzola, Mario; Bicciato, Silvio; Tagliafico, Enrico; Vannucchi, Alessandro M; Manfredini, Rossella

    2016-04-01

    Primary myelofibrosis (PMF) is a Myeloproliferative Neoplasm (MPN) characterized by megakaryocyte hyperplasia, progressive bone marrow fibrosis, extramedullary hematopoiesis and transformation to Acute Myeloid Leukemia (AML). A number of phenotypic driver (JAK2, CALR, MPL) and additional subclonal mutations have been described in PMF, pointing to a complex genomic landscape. To discover novel genomic lesions that can contribute to disease phenotype and/or development, gene expression and copy number signals were integrated and several genomic abnormalities leading to a concordant alteration in gene expression levels were identified. In particular, copy number gain in the polyamine oxidase (PAOX) gene locus was accompanied by a coordinated transcriptional up-regulation in PMF patients. PAOX inhibition resulted in rapid cell death of PMF progenitor cells, while sparing normal cells, suggesting that PAOX inhibition could represent a therapeutic strategy to selectively target PMF cells without affecting normal hematopoietic cells' survival. Moreover, copy number loss in the chromatin modifier HMGXB4 gene correlates with a concomitant transcriptional down-regulation in PMF patients. Interestingly, silencing of HMGXB4 induces megakaryocyte differentiation, while inhibiting erythroid development, in human hematopoietic stem/progenitor cells. These results highlight a previously un-reported, yet potentially interesting role of HMGXB4 in the hematopoietic system and suggest that genomic and transcriptional imbalances of HMGXB4 could contribute to the aberrant expansion of the megakaryocytic lineage that characterizes PMF patients. © 2015 UICC.

  10. Inhibition of colorectal cancer genomic copy number alterations and chromosomal fragile site tumor suppressor FHIT and WWOX deletions by DNA mismatch repair

    Science.gov (United States)

    Gelincik, Ozkan; Blecua, Pedro; Edelmann, Winfried; Kucherlapati, Raju; Zhou, Kathy; Jasin, Maria; Gümüş, Zeynep H.; Lipkin, Steven M.

    2017-01-01

    Homologous recombination (HR) enables precise DNA repair after DNA double strand breaks (DSBs) using identical sequence templates, whereas homeologous recombination (HeR) uses only partially homologous sequences. Homeologous recombination introduces mutations through gene conversion and genomic deletions through single-strand annealing (SSA). DNA mismatch repair (MMR) inhibits HeR, but the roles of mammalian MMR MutL homologues (MLH1, PMS2 and MLH3) proteins in HeR suppression are poorly characterized. Here, we demonstrate that mouse embryonic fibroblasts (MEFs) carrying Mlh1, Pms2, and Mlh3 mutations have higher HeR rates, by using 7,863 uniquely mapping paired direct repeat sequences (DRs) in the mouse genome as endogenous gene conversion and SSA reporters. Additionally, when DSBs are induced by gamma-radiation, Mlh1, Pms2 and Mlh3 mutant MEFs have higher DR copy number alterations (CNAs), including DR CNA hotspots previously identified in mouse MMR-deficient colorectal cancer (dMMR CRC). Analysis of The Cancer Genome Atlas CRC data revealed that dMMR CRCs have higher genome-wide DR HeR rates than MMR proficient CRCs, and that dMMR CRCs have deletion hotspots in tumor suppressors FHIT/WWOX at chromosomal fragile sites FRA3B and FRA16D (which have elevated DSB rates) flanked by paired homologous DRs and inverted repeats (IR). Overall, these data provide novel insights into the MMR-dependent HeR inhibition mechanism and its role in tumor suppression. PMID:29069730

  11. A quasispecies approach to the evolution of sexual replication in unicellular organisms.

    Science.gov (United States)

    Tannenbaum, Emmanuel; Fontanari, José F

    2008-03-01

    This study develops a simplified model describing the evolutionary dynamics of a population composed of obligate sexually and asexually reproducing, unicellular organisms. The model assumes that the organisms have diploid genomes consisting of two chromosomes, and that the sexual organisms replicate by first dividing into haploid intermediates, which then combine with other haploids, followed by the normal mitotic division of the resulting diploid into two new daughter cells. We assume that the fitness landscape of the diploids is analogous to the single-fitness-peak approach often used in single-chromosome studies. That is, we assume a master chromosome that becomes defective with just one point mutation. The diploid fitness then depends on whether the genome has zero, one, or two copies of the master chromosome. We also assume that only pairs of haploids with a master chromosome are capable of combining so as to produce sexual diploid cells, and that this process is described by second-order kinetics. We find that, in a range of intermediate values of the replication fidelity, sexually reproducing cells can outcompete asexual ones, provided the initial abundance of sexual cells is above some threshold value. The range of values where sexual reproduction outcompetes asexual reproduction increases with decreasing replication rate and increasing population density. We critically evaluate a common approach, based on a group selection perspective, used to study the competition between populations and show its flaws in addressing the evolution of sex problem.

  12. Establishment and application of event-specific polymerase chain reaction methods for two genetically modified soybean events, A2704-12 and A5547-127.

    Science.gov (United States)

    Li, Xiang; Pan, Liangwen; Li, Junyi; Zhang, Qigang; Zhang, Shuya; Lv, Rong; Yang, Litao

    2011-12-28

    For implementation of the issued regulations and labeling policies for genetically modified organism (GMO) supervision, the polymerase chain reaction (PCR) method has been widely used due to its high specificity and sensitivity. In particular, use of the event-specific PCR method based on the flanking sequence of transgenes has become the primary trend. In this study, both qualitative and quantitative PCR methods were established on the basis of the 5' flanking sequence of transgenic soybean A2704-12 and the 3' flanking sequence of transgenic soybean A5547-127, respectively. In qualitative PCR assays, the limits of detection (LODs) were 10 copies of haploid soybean genomic DNA for both A2704-12 and A5547-127. In quantitative real-time PCR assays, the LODs were 5 copies of haploid soybean genomic DNA for both A2704-12 and A5547-127, and the limits of quantification (LOQs) were 10 copies for both. Low bias and acceptable SD and RSD values were also achieved in quantification of four blind samples using the developed real-time PCR assays. In addition, the developed PCR assays for the two transgenic soybean events were used for routine analysis of soybean samples imported to Shanghai in a 6 month period from October 2010 to March 2011. A total of 27 lots of soybean from the United States and Argentina were analyzed: 8 lots from the Unites States were found to have the GM soybean A2704-12 event, and the GM contents were <1.5% in all eight analyzed lots. On the contrary, no GM soybean A5547-127 content was found in any of the eight lots. These results demonstrated that the established event-specific qualitative and quantitative PCR methods could be used effectively in routine identification and quantification of GM soybeans A2704-12 and A5547-127 and their derived products.

  13. Structural and sequence diversity of the transposon Galileo in the Drosophila willistoni genome.

    Science.gov (United States)

    Gonçalves, Juliana W; Valiati, Victor Hugo; Delprat, Alejandra; Valente, Vera L S; Ruiz, Alfredo

    2014-09-13

    Galileo is one of three members of the P superfamily of DNA transposons. It was originally discovered in Drosophila buzzatii, in which three segregating chromosomal inversions were shown to have been generated by ectopic recombination between Galileo copies. Subsequently, Galileo was identified in six of 12 sequenced Drosophila genomes, indicating its widespread distribution within this genus. Galileo is strikingly abundant in Drosophila willistoni, a neotropical species that is highly polymorphic for chromosomal inversions, suggesting a role for this transposon in the evolution of its genome. We carried out a detailed characterization of all Galileo copies present in the D. willistoni genome. A total of 191 copies, including 133 with two terminal inverted repeats (TIRs), were classified according to structure in six groups. The TIRs exhibited remarkable variation in their length and structure compared to the most complete copy. Three copies showed extended TIRs due to internal tandem repeats, the insertion of other transposable elements (TEs), or the incorporation of non-TIR sequences into the TIRs. Phylogenetic analyses of the transposase (TPase)-encoding and TIR segments yielded two divergent clades, which we termed Galileo subfamilies V and W. Target-site duplications (TSDs) in D. willistoni Galileo copies were 7- or 8-bp in length, with the consensus sequence GTATTAC. Analysis of the region around the TSDs revealed a target site motif (TSM) with a 15-bp palindrome that may give rise to a stem-loop secondary structure. There is a remarkable abundance and diversity of Galileo copies in the D. willistoni genome, although no functional copies were found. The TIRs in particular have a dynamic structure and extend in different ways, but their ends (required for transposition) are more conserved than the rest of the element. The D. willistoni genome harbors two Galileo subfamilies (V and W) that diverged ~9 million years ago and may have descended from an ancestral

  14. The effects of quantitative fecundity in the haploid stage on reproductive success and diploid fitness in the aquatic peat moss Sphagnum macrophyllum.

    Science.gov (United States)

    Johnson, M G; Shaw, A J

    2016-06-01

    A major question in evolutionary biology is how mating patterns affect the fitness of offspring. However, in animals and seed plants it is virtually impossible to investigate the effects of specific gamete genotypes. In bryophytes, haploid gametophytes grow via clonal propagation and produce millions of genetically identical gametes throughout a population. The main goal of this research was to test whether gamete identity has an effect on the fitness of their diploid offspring in a population of the aquatic peat moss Sphagnum macrophyllum. We observed a heavily male-biased sex ratio in gametophyte plants (ramets) and in multilocus microsatellite genotypes (genets). There was a steeper relationship between mating success (number of different haploid mates) and fecundity (number of diploid offspring) for male genets compared with female genets. At the sporophyte level, we observed a weak effect of inbreeding on offspring fitness, but no effect of brood size (number of sporophytes per maternal ramet). Instead, the identities of the haploid male and haploid female parents were significant contributors to variance in fitness of sporophyte offspring in the population. Our results suggest that intrasexual gametophyte/gamete competition may play a role in determining mating success in this population.

  15. The complete chloroplast genome sequence of Dendrobium nobile.

    Science.gov (United States)

    Yan, Wenjin; Niu, Zhitao; Zhu, Shuying; Ye, Meirong; Ding, Xiaoyu

    2016-11-01

    The complete chloroplast (cp) genome sequence of Dendrobium nobile, an endangered and traditional Chinese medicine with important economic value, is presented in this article. The total genome size is 150,793 bp, containing a large single copy (LSC) region (84,939 bp) and a small single copy region (SSC) (13,310 bp) which were separated by two inverted repeat (IRs) regions (26,272 bp). The overall GC contents of the plastid genome were 38.8%. In total, 130 unique genes were annotated and they were consisted of 76 protein-coding genes, 30 tRNA genes and 4 rRNA genes. Fourteen genes contained one or two introns.

  16. Improvement of ethanol-tolerance of haploid Saccharomyces diastaticus

    International Nuclear Information System (INIS)

    Song, S.H.; Kim, K.; Lee, M.W.

    1994-01-01

    Several mutation procedures have been compared to obtain an ethanol-tolerant Saccharomyces diastaticus strain secreting glucoamylase. These procedures include spontaneous mutation, EMS treatment, UV irradiation, and combination of EMS treatment and UV irradiation. All these methods were followed by adaptation of the yeast cells to gradually higher ethanol concentration. Among these procedures, the combined method of EMS treatment and UV irradiation gave the promising result, i.e. the ethanol tolerance of the yeast increased from 11.5%(v/v) to 14.0%(v/v). Respiratory deficient petite mutants of industrial and ethanol-tolerant yeast strains have been isolated and hybridized with haploid S. diastaticus strains. The resulting hybrids showed increased ethanol tolerance and starch-fermentability

  17. Screening for common copy-number variants in cancer genes.

    Science.gov (United States)

    Tyson, Jess; Majerus, Tamsin M O; Walker, Susan; Armour, John A L

    2010-12-01

    For most cases of colorectal cancer that arise without a family history of the disease, it is proposed that an appreciable heritable component of predisposition is the result of contributions from many loci. Although progress has been made in identifying single nucleotide variants associated with colorectal cancer risk, the involvement of low-penetrance copy number variants is relatively unexplored. We have used multiplex amplifiable probe hybridization (MAPH) in a fourfold multiplex (QuadMAPH), positioned at an average resolution of one probe per 2 kb, to screen a total of 1.56 Mb of genomic DNA for copy number variants around the genes APC, AXIN1, BRCA1, BRCA2, CTNNB1, HRAS, MLH1, MSH2, and TP53. Two deletion events were detected, one upstream of MLH1 in a control individual and the other in APC in a colorectal cancer patient, but these do not seem to correspond to copy number polymorphisms with measurably high population frequencies. In summary, by means of our QuadMAPH assay, copy number measurement data were of sufficient resolution and accuracy to detect any copy number variants with high probability. However, this study has demonstrated a very low incidence of deletion and duplication variants within intronic and flanking regions of these nine genes, in both control individuals and colorectal cancer patients. Copyright © 2010 Elsevier Inc. All rights reserved.

  18. The complete chloroplast genome sequence of Curcuma flaviflora (Curcuma).

    Science.gov (United States)

    Zhang, Yan; Deng, Jiabin; Li, Yangyi; Gao, Gang; Ding, Chunbang; Zhang, Li; Zhou, Yonghong; Yang, Ruiwu

    2016-09-01

    The complete chloroplast (cp) genome of Curcuma flaviflora, a medicinal plant in Southeast Asia, was sequenced. The genome size was 160 478 bp in length, with 36.3% GC content. A pair of inverted repeats (IRs) of 26 946 bp were separated by a large single copy (LSC) of 88 008 bp and a small single copy (SSC) of 18 578 bp, respectively. The cp genome contained 132 annotated genes, including 79 protein coding genes, 30 tRNA genes, and four rRNA genes. And 19 of these genes were duplicated in inverted repeat regions.

  19. Genomic hypomethylation in the human germline associates with selective structural mutability in the human genome.

    Directory of Open Access Journals (Sweden)

    Jian Li

    Full Text Available The hotspots of structural polymorphisms and structural mutability in the human genome remain to be explained mechanistically. We examine associations of structural mutability with germline DNA methylation and with non-allelic homologous recombination (NAHR mediated by low-copy repeats (LCRs. Combined evidence from four human sperm methylome maps, human genome evolution, structural polymorphisms in the human population, and previous genomic and disease studies consistently points to a strong association of germline hypomethylation and genomic instability. Specifically, methylation deserts, the ~1% fraction of the human genome with the lowest methylation in the germline, show a tenfold enrichment for structural rearrangements that occurred in the human genome since the branching of chimpanzee and are highly enriched for fast-evolving loci that regulate tissue-specific gene expression. Analysis of copy number variants (CNVs from 400 human samples identified using a custom-designed array comparative genomic hybridization (aCGH chip, combined with publicly available structural variation data, indicates that association of structural mutability with germline hypomethylation is comparable in magnitude to the association of structural mutability with LCR-mediated NAHR. Moreover, rare CNVs occurring in the genomes of individuals diagnosed with schizophrenia, bipolar disorder, and developmental delay and de novo CNVs occurring in those diagnosed with autism are significantly more concentrated within hypomethylated regions. These findings suggest a new connection between the epigenome, selective mutability, evolution, and human disease.

  20. Industrial Relevance of Chromosomal Copy Number Variation in Saccharomyces Yeasts.

    Science.gov (United States)

    Gorter de Vries, Arthur R; Pronk, Jack T; Daran, Jean-Marc G

    2017-06-01

    Chromosomal copy number variation (CCNV) plays a key role in evolution and health of eukaryotes. The unicellular yeast Saccharomyces cerevisiae is an important model for studying the generation, physiological impact, and evolutionary significance of CCNV. Fundamental studies of this yeast have contributed to an extensive set of methods for analyzing and introducing CCNV. Moreover, these studies provided insight into the balance between negative and positive impacts of CCNV in evolutionary contexts. A growing body of evidence indicates that CCNV not only frequently occurs in industrial strains of Saccharomyces yeasts but also is a key contributor to the diversity of industrially relevant traits. This notion is further supported by the frequent involvement of CCNV in industrially relevant traits acquired during evolutionary engineering. This review describes recent developments in genome sequencing and genome editing techniques and discusses how these offer opportunities to unravel contributions of CCNV in industrial Saccharomyce s strains as well as to rationally engineer yeast chromosomal copy numbers and karyotypes. Copyright © 2017 Gorter de Vries et al.

  1. Array-based genomic screening at diagnosis and during follow-up in chronic lymphocytic leukemia

    DEFF Research Database (Denmark)

    Gunnarsson, Rebeqa; Mansouri, Larry; Isaksson, Anders

    2011-01-01

    High-resolution genomic microarrays enable simultaneous detection of copy-number aberrations such as the known recurrent aberrations in chronic lymphocytic leukemia [del(11q), del(13q), del(17p) and trisomy 12], and copy-number neutral loss of heterozygosity. Moreover, comparison of genomic...

  2. Comparative genomics of Lactobacillus and other LAB

    DEFF Research Database (Denmark)

    Wassenaar, Trudy M.; Lukjancenko, Oksana

    2014-01-01

    that of the others, with the two Streptococcus species having the shortest genomes. The widest distribution in genome content was observed for Lactobacillus. The number of tRNA and rRNA gene copies varied considerably, with exceptional high numbers observed for Lb. delbrueckii, while these numbers were relatively......The genomes of 66 LABs, belonging to five different genera, were compared for genome size and gene content. The analyzed genomes included 37 Lactobacillus genomes of 17 species, six Lactococcus lactis genomes, four Leuconostoc genomes of three species, six Streptococcus genomes of two species...

  3. Distribution and functional impact of DNA copy number variation in the rat.

    NARCIS (Netherlands)

    Guryev, V.; Saar, K.; Adamovic, T.; Verheul, M.; van Heesch, S.; Cook, S.; Pravenec, M.; Aitman, T.; Jacob, H.; Shull, J.D.; Hubner, N.; Cuppen, E.

    2008-01-01

    The abundance and dynamics of copy number variants (CNVs) in mammalian genomes poses new challenges in the identification of their impact on natural and disease phenotypes. We used computational and experimental methods to catalog CNVs in rat and found that they share important functional

  4. The complete chloroplast genome of Sinopodophyllum hexandrum (Berberidaceae).

    Science.gov (United States)

    Li, Huie; Guo, Qiqiang

    2016-07-01

    The complete chloroplast (cp) genome of the Sinopodophyllum hexandrum (Berberidaceae) was determined in this study. The circular genome is 157,940 bp in size, and comprises a pair of inverted repeat (IR) regions of 26,077 bp each, a large single-copy (LSC) region of 86,460 bp and a small single-copy (SSC) region of 19,326 bp. The GC content of the whole cp genome was 38.5%. A total of 133 genes were identified, including 88 protein-coding genes, 37 tRNA genes and eight rRNA genes. The whole cp genome consists of 114 unique genes, and 19 genes are duplicated in the IR regions. The phylogenetic analysis revealed that S. hexandrum is closely related to Nandina domestica within the family Berberidaceae.

  5. Assessment of mutagenic damage by monofunctional alkylating agents and gamma radiation in haploid and diploid frogs, Xenopus laevis

    International Nuclear Information System (INIS)

    Hart, D.R.; Armstrong, J.B.

    1984-01-01

    Adult male South African clawed frogs, Xenopus laevis, were mutagenized by 3-day immersion in aqueous solutions of ethyl methanesulfonate (EMS), diethyl nitrosamine (DEN), or ethyl nitrosourea (ENU), or by acute exposure to gamma radiation. They were then spawned repeatedly at 2-week intervals with untreated females, and embryonic survival of the progeny was used to assess genetic damage. Recessive lethal effects were assessed from reduced survival of androgenetic haploid progeny. Neither recessive nor dominant lethal effects were obtained after exposure to 100 mg/liter EMS or 2 g/liter DEN. At 250 mg/liter EMS, peak dominant lethality occurred 3-5 weeks after treatment. Most embryos hatched, but many were abnormal and died shortly after hatching. Haploid survival was significantly reduced over a broader period, from 1 to 13 weeks after mutagenesis. Treatment with 75 mg/liter ENU produced effects similar to the 250-mg/liter EMS mutagenesis. At 400 mg/liter EMS, the frequency and severity of the effects on both diploid and haploid embryos were increased over the lower dose. Gamma irradiation at 1500 R produced effects similar to the 400-mg/liter mutagenesis, except that peak dominant lethality extended from 1 to 7 weeks

  6. Copy-Number Disorders Are a Common Cause of Congenital Kidney Malformations

    Science.gov (United States)

    Sanna-Cherchi, Simone; Kiryluk, Krzysztof; Burgess, Katelyn E.; Bodria, Monica; Sampson, Matthew G.; Hadley, Dexter; Nees, Shannon N.; Verbitsky, Miguel; Perry, Brittany J.; Sterken, Roel; Lozanovski, Vladimir J.; Materna-Kiryluk, Anna; Barlassina, Cristina; Kini, Akshata; Corbani, Valentina; Carrea, Alba; Somenzi, Danio; Murtas, Corrado; Ristoska-Bojkovska, Nadica; Izzi, Claudia; Bianco, Beatrice; Zaniew, Marcin; Flogelova, Hana; Weng, Patricia L.; Kacak, Nilgun; Giberti, Stefania; Gigante, Maddalena; Arapovic, Adela; Drnasin, Kristina; Caridi, Gianluca; Curioni, Simona; Allegri, Franca; Ammenti, Anita; Ferretti, Stefania; Goj, Vinicio; Bernardo, Luca; Jobanputra, Vaidehi; Chung, Wendy K.; Lifton, Richard P.; Sanders, Stephan; State, Matthew; Clark, Lorraine N.; Saraga, Marijan; Padmanabhan, Sandosh; Dominiczak, Anna F.; Foroud, Tatiana; Gesualdo, Loreto; Gucev, Zoran; Allegri, Landino; Latos-Bielenska, Anna; Cusi, Daniele; Scolari, Francesco; Tasic, Velibor; Hakonarson, Hakon; Ghiggeri, Gian Marco; Gharavi, Ali G.

    2012-01-01

    We examined the burden of large, rare, copy-number variants (CNVs) in 192 individuals with renal hypodysplasia (RHD) and replicated findings in 330 RHD cases from two independent cohorts. CNV distribution was significantly skewed toward larger gene-disrupting events in RHD cases compared to 4,733 ethnicity-matched controls (p = 4.8 × 10−11). This excess was attributable to known and novel (i.e., not present in any database or in the literature) genomic disorders. All together, 55/522 (10.5%) RHD cases harbored 34 distinct known genomic disorders, which were detected in only 0.2% of 13,839 population controls (p = 1.2 × 10−58). Another 32 (6.1%) RHD cases harbored large gene-disrupting CNVs that were absent from or extremely rare in the 13,839 population controls, identifying 38 potential novel or rare genomic disorders for this trait. Deletions at the HNF1B locus and the DiGeorge/velocardiofacial locus were most frequent. However, the majority of disorders were detected in a single individual. Genomic disorders were detected in 22.5% of individuals with multiple malformations and 14.5% of individuals with isolated urinary-tract defects; 14 individuals harbored two or more diagnostic or rare CNVs. Strikingly, the majority of the known CNV disorders detected in the RHD cohort have previous associations with developmental delay or neuropsychiatric diseases. Up to 16.6% of individuals with kidney malformations had a molecular diagnosis attributable to a copy-number disorder, suggesting kidney malformations as a sentinel manifestation of pathogenic genomic imbalances. A search for pathogenic CNVs should be considered in this population for the diagnosis of their specific genomic disorders and for the evaluation of the potential for developmental delay. PMID:23159250

  7. Cellular heredity in haploid cultures of somatic cells. Annual progress report, March 1, 1975--March 31, 1976. [UV radiation

    Energy Technology Data Exchange (ETDEWEB)

    Freed, J.J.

    1976-01-01

    In experiments with haploid and diploid derivatives from the haploid frog embryo cell line ICR 2A, we have investigated aspects of cell survival, DNA repair and mutant induction after exposure to 254 nm radiation. Survival curves for haploid and diploid cells in random growth or blocked in the Gl phase of the cell cycle were determined; the survival data do not differ sufficiently to permit the use of such comparisons as an index of recessive lethal induction. Studies of the induction of thymine dimers in DNA indicated that the incidence of dimers in DNA from haploid and diploid cells is similar after exposure of the cells to equal doses of ultraviolet. The cells are capable of photoreversing dimers but appear to be deficient in excision repair. In an attempt to examine the effect of the permitted mode of DNA repair on the yield of mutations, we compared the incidence of ouabain-resistant variants among survivors of ultraviolet exposure and of ultraviolet exposure followed by photoreversal. Although the yield of resistant colonies was small, the data suggest that photoreversal lowers the yield of resistant colonies and thus that the induction of this phenotype is related to dimer persistence in DNA. We have also observed by fluorescence microscopy that an acridine mustard mutagen, ICR 191, is preferentially accumulated in cytoplasmic granules having the intracellular distribution pattern of lysosomes. This form of incorporation may be significant in the apparently non-genetic early toxicity of this compound observed in experiments with cultured cells.

  8. SNP array analysis reveals novel genomic abnormalities including copy neutral loss of heterozygosity in anaplastic oligodendrogliomas.

    Directory of Open Access Journals (Sweden)

    Ahmed Idbaih

    Full Text Available Anaplastic oligodendrogliomas (AOD are rare glial tumors in adults with relative homogeneous clinical, radiological and histological features at the time of diagnosis but dramatically various clinical courses. Studies have identified several molecular abnormalities with clinical or biological relevance to AOD (e.g. t(1;19(q10;p10, IDH1, IDH2, CIC and FUBP1 mutations.To better characterize the clinical and biological behavior of this tumor type, the creation of a national multicentric network, named "Prise en charge des OLigodendrogliomes Anaplasiques (POLA," has been supported by the Institut National du Cancer (InCA. Newly diagnosed and centrally validated AOD patients and their related biological material (tumor and blood samples were prospectively included in the POLA clinical database and tissue bank, respectively.At the molecular level, we have conducted a high-resolution single nucleotide polymorphism array analysis, which included 83 patients. Despite a careful central pathological review, AOD have been found to exhibit heterogeneous genomic features. A total of 82% of the tumors exhibited a 1p/19q-co-deletion, while 18% harbor a distinct chromosome pattern. Novel focal abnormalities, including homozygously deleted, amplified and disrupted regions, have been identified. Recurring copy neutral losses of heterozygosity (CNLOH inducing the modulation of gene expression have also been discovered. CNLOH in the CDKN2A locus was associated with protein silencing in 1/3 of the cases. In addition, FUBP1 homozygous deletion was detected in one case suggesting a putative tumor suppressor role of FUBP1 in AOD.Our study showed that the genomic and pathological analyses of AOD are synergistic in detecting relevant clinical and biological subgroups of AOD.

  9. Analysis of high-identity segmental duplications in the grapevine genome

    Directory of Open Access Journals (Sweden)

    Carelli Francesco N

    2011-08-01

    Full Text Available Abstract Background Segmental duplications (SDs are blocks of genomic sequence of 1-200 kb that map to different loci in a genome and share a sequence identity > 90%. SDs show at the sequence level the same characteristics as other regions of the human genome: they contain both high-copy repeats and gene sequences. SDs play an important role in genome plasticity by creating new genes and modeling genome structure. Although data is plentiful for mammals, not much was known about the representation of SDs in plant genomes. In this regard, we performed a genome-wide analysis of high-identity SDs on the sequenced grapevine (Vitis vinifera genome (PN40024. Results We demonstrate that recent SDs (> 94% identity and >= 10 kb in size are a relevant component of the grapevine genome (85 Mb, 17% of the genome sequence. We detected mitochondrial and plastid DNA and genes (10% of gene annotation in segmentally duplicated regions of the nuclear genome. In particular, the nine highest copy number genes have a copy in either or both organelle genomes. Further we showed that several duplicated genes take part in the biosynthesis of compounds involved in plant response to environmental stress. Conclusions These data show the great influence of SDs and organelle DNA transfers in modeling the Vitis vinifera nuclear DNA structure as well as the impact of SDs in contributing to the adaptive capacity of grapevine and the nutritional content of grape products through genome variation. This study represents a step forward in the full characterization of duplicated genes important for grapevine cultural needs and human health.

  10. Interspersion of highly repetitive DNA with single copy DNA in the genome of the red crab, Geryon quinquedens

    Energy Technology Data Exchange (ETDEWEB)

    Christie, N.T. (Univ. of Tennessee, Oak Ridge); Skinner, D.M.

    1979-02-01

    Kinetic analysis of the reassociation of 420 nucleotide (NT) long fragments has shown that essentially all of the repetitive sequences of the DNA of the red crab Geryon quinquedens are highly repetitive. There are negligible amounts of low and intermediate repetitive DNAs. Though atypical of most eukaryotes, this pattern has been observed in al other brachyurans (true crabs) studied. The major repetitive component is subdivided into short runs of 300 NT and longer runs of greater than 1200 NT while the minor component has an average sequence length of 400 NT. Both components reassociate at rates commonly observed for satellite DNAs. Unique among eukaryotes the organization of the genome includes single copy DNA contiguous to short runs (300 NT) of both repetitive components. Although patent satellites are not present, subsets of the repetitive DNA have been isolated by either restriction endonuclease digestion or by centrifugation in Ag/sup +/ or Hg/sup 2 +//Cs/sub 2/SO/sub 4/ density gradients.

  11. Phenotypic manifestations of copy number variation in chromosome 16p13.11

    NARCIS (Netherlands)

    Nagamani, Sandesh C. Sreenath; Erez, Ayelet; Bader, Patricia; Lalani, Seema R.; Scott, Daryl A.; Scaglia, Fernando; Plon, Sharon E.; Tsai, Chun-Hui; Reimschisel, Tyler; Roeder, Elizabeth; Malphrus, Amy D.; Eng, Patricia A.; Hixson, Patricia M.; Kang, Sung-Hae L.; Stankiewicz, Pawel; Patel, Ankita; Cheung, Sau Wai

    The widespread clinical utilization of array comparative genome hybridization, has led to the unraveling of many new copy number variations (CNVs). Although some of these CNVs are clearly pathogenic, the phenotypic consequences of others, such as those in 16p13.11 remain unclear. Whereas deletions

  12. Techniques of radiation induced haploid breeding of wheat

    International Nuclear Information System (INIS)

    Xuan Pu; Xu Liyuan; Qu Shihong; Yu Guirong; Yin Chunrong; Yue Chunfang

    2000-01-01

    With the treatment of different doses of 60 Co γ-ray irradiation to F 1 hybrid seeds and donor plants from M 1 F 1 or M 2 F 2 , wheat anther culture was made based on the media of MW 14 and modified MS. A series of studies on the applied doses of radiation induction, low temperature treatment on donor spikes and calli, variable temperature induced incubation and yield of pollen callus and calli giving green plant lets, pollen plant lets control over summertime and pollen plant let transplantation were carried out in order to increase the efficiency of obtaining double haploid-pure diploid plant lets of wheat with stable heredity and propagation. Theses plant lets could be used directly in rapid breeding

  13. Polyploid genome of Camelina sativa revealed by isolation of fatty acid synthesis genes

    Directory of Open Access Journals (Sweden)

    Shewmaker Christine K

    2010-10-01

    Full Text Available Abstract Background Camelina sativa, an oilseed crop in the Brassicaceae family, has inspired renewed interest due to its potential for biofuels applications. Little is understood of the nature of the C. sativa genome, however. A study was undertaken to characterize two genes in the fatty acid biosynthesis pathway, fatty acid desaturase (FAD 2 and fatty acid elongase (FAE 1, which revealed unexpected complexity in the C. sativa genome. Results In C. sativa, Southern analysis indicates the presence of three copies of both FAD2 and FAE1 as well as LFY, a known single copy gene in other species. All three copies of both CsFAD2 and CsFAE1 are expressed in developing seeds, and sequence alignments show that previously described conserved sites are present, suggesting that all three copies of both genes could be functional. The regions downstream of CsFAD2 and upstream of CsFAE1 demonstrate co-linearity with the Arabidopsis genome. In addition, three expressed haplotypes were observed for six predicted single-copy genes in 454 sequencing analysis and results from flow cytometry indicate that the DNA content of C. sativa is approximately three-fold that of diploid Camelina relatives. Phylogenetic analyses further support a history of duplication and indicate that C. sativa and C. microcarpa might share a parental genome. Conclusions There is compelling evidence for triplication of the C. sativa genome, including a larger chromosome number and three-fold larger measured genome size than other Camelina relatives, three isolated copies of FAD2, FAE1, and the KCS17-FAE1 intergenic region, and three expressed haplotypes observed for six predicted single-copy genes. Based on these results, we propose that C. sativa be considered an allohexaploid. The characterization of fatty acid synthesis pathway genes will allow for the future manipulation of oil composition of this emerging biofuel crop; however, targeted manipulations of oil composition and general

  14. Mutation induction in haploid yeast after split-dose radiation-exposure. Pt. 1

    International Nuclear Information System (INIS)

    Schenk, K.; Zoelzer, F.; Kiefer, J.

    1989-01-01

    Mutation induction was investigated in wild-type haploid yeast Saccharomyces cerevisiae after split-dose UV-irradiation. Cells were exposed to fractionated 254 nm-UV-doses separated by intervals from 0 to 6 h with incubation either on non-nutrient or nutrient agar between. The test parameter was resistance to canavanine. If modifications of sensitivity due to incubation are appropriately taken into account there is no change of mutation frequency. (orig.)

  15. Leishmania naiffi and Leishmania guyanensis reference genomes highlight genome structure and gene evolution in the Viannia subgenus.

    Science.gov (United States)

    Coughlan, Simone; Taylor, Ali Shirley; Feane, Eoghan; Sanders, Mandy; Schonian, Gabriele; Cotton, James A; Downing, Tim

    2018-04-01

    The unicellular protozoan parasite Leishmania causes the neglected tropical disease leishmaniasis, affecting 12 million people in 98 countries. In South America, where the Viannia subgenus predominates, so far only L. ( Viannia ) braziliensis and L. ( V. ) panamensis have been sequenced, assembled and annotated as reference genomes. Addressing this deficit in molecular information can inform species typing, epidemiological monitoring and clinical treatment. Here, L. ( V. ) naiffi and L. ( V. ) guyanensis genomic DNA was sequenced to assemble these two genomes as draft references from short sequence reads. The methods used were tested using short sequence reads for L. braziliensis M2904 against its published reference as a comparison. This assembly and annotation pipeline identified 70 additional genes not annotated on the original M2904 reference. Phylogenetic and evolutionary comparisons of L. guyanensis and L. naiffi with 10 other Viannia genomes revealed four traits common to all Viannia : aneuploidy, 22 orthologous groups of genes absent in other Leishmania subgenera, elevated TATE transposon copies and a high NADH-dependent fumarate reductase gene copy number. Within the Viannia , there were limited structural changes in genome architecture specific to individual species: a 45 Kb amplification on chromosome 34 was present in all bar L. lainsoni , L. naiffi had a higher copy number of the virulence factor leishmanolysin, and laboratory isolate L. shawi M8408 had a possible minichromosome derived from the 3' end of chromosome 34 . This combination of genome assembly, phylogenetics and comparative analysis across an extended panel of diverse Viannia has uncovered new insights into the origin and evolution of this subgenus and can help improve diagnostics for leishmaniasis surveillance.

  16. The complete chloroplast genome of the Dendrobium strongylanthum (Orchidaceae: Epidendroideae).

    Science.gov (United States)

    Li, Jing; Chen, Chen; Wang, Zhe-Zhi

    2016-07-01

    Complete chloroplast genome sequence is very useful for studying the phylogenetic and evolution of species. In this study, the complete chloroplast genome of Dendrobium strongylanthum was constructed from whole-genome Illumina sequencing data. The chloroplast genome is 153 058 bp in length with 37.6% GC content and consists of two inverted repeats (IRs) of 26 316 bp. The IR regions are separated by large single-copy region (LSC, 85 836 bp) and small single-copy (SSC, 14 590 bp) region. A total of 130 chloroplast genes were successfully annotated, including 84 protein coding genes, 38 tRNA genes, and eight rRNA genes. Phylogenetic analyses showed that the chloroplast genome of Dendrobium strongylanthum is related to that of the Dendrobium officinal.

  17. Simulation and estimation of gene number in a biological pathway using almost complete saturation mutagenesis screening of haploid mouse cells.

    Science.gov (United States)

    Tokunaga, Masahiro; Kokubu, Chikara; Maeda, Yusuke; Sese, Jun; Horie, Kyoji; Sugimoto, Nakaba; Kinoshita, Taroh; Yusa, Kosuke; Takeda, Junji

    2014-11-24

    Genome-wide saturation mutagenesis and subsequent phenotype-driven screening has been central to a comprehensive understanding of complex biological processes in classical model organisms such as flies, nematodes, and plants. The degree of "saturation" (i.e., the fraction of possible target genes identified) has been shown to be a critical parameter in determining all relevant genes involved in a biological function, without prior knowledge of their products. In mammalian model systems, however, the relatively large scale and labor intensity of experiments have hampered the achievement of actual saturation mutagenesis, especially for recessive traits that require biallelic mutations to manifest detectable phenotypes. By exploiting the recently established haploid mouse embryonic stem cells (ESCs), we present an implementation of almost complete saturation mutagenesis in a mammalian system. The haploid ESCs were mutagenized with the chemical mutagen N-ethyl-N-nitrosourea (ENU) and processed for the screening of mutants defective in various steps of the glycosylphosphatidylinositol-anchor biosynthetic pathway. The resulting 114 independent mutant clones were characterized by a functional complementation assay, and were shown to be defective in any of 20 genes among all 22 known genes essential for this well-characterized pathway. Ten mutants were further validated by whole-exome sequencing. The predominant generation of single-nucleotide substitutions by ENU resulted in a gene mutation rate proportional to the length of the coding sequence, which facilitated the experimental design of saturation mutagenesis screening with the aid of computational simulation. Our study enables mammalian saturation mutagenesis to become a realistic proposition. Computational simulation, combined with a pilot mutagenesis experiment, could serve as a tool for the estimation of the number of genes essential for biological processes such as drug target pathways when a positive selection of

  18. Duplication and relocation of the functional DPY19L2 gene within low copy repeats

    Directory of Open Access Journals (Sweden)

    Cheung Joseph

    2006-03-01

    Full Text Available Abstract Background Low copy repeats (LCRs are thought to play an important role in recent gene evolution, especially when they facilitate gene duplications. Duplicate genes are fundamental to adaptive evolution, providing substrates for the development of new or shared gene functions. Moreover, silencing of duplicate genes can have an indirect effect on adaptive evolution by causing genomic relocation of functional genes. These changes are theorized to have been a major factor in speciation. Results Here we present a novel example showing functional gene relocation within a LCR. We characterize the genomic structure and gene content of eight related LCRs on human Chromosomes 7 and 12. Two members of a novel transmembrane gene family, DPY19L, were identified in these regions, along with six transcribed pseudogenes. One of these genes, DPY19L2, is found on Chromosome 12 and is not syntenic with its mouse orthologue. Instead, the human locus syntenic to mouse Dpy19l2 contains a pseudogene, DPY19L2P1. This indicates that the ancestral copy of this gene has been silenced, while the descendant copy has remained active. Thus, the functional copy of this gene has been relocated to a new genomic locus. We then describe the expansion and evolution of the DPY19L gene family from a single gene found in invertebrate animals. Ancient duplications have led to multiple homologues in different lineages, with three in fish, frogs and birds and four in mammals. Conclusion Our results show that the DPY19L family has expanded throughout the vertebrate lineage and has undergone recent primate-specific evolution within LCRs.

  19. 3D modeling of genome macroorganization on the basis of its structural changes after action of radiation

    International Nuclear Information System (INIS)

    Aleksandrov, I.D.; Aleksandrova, M.V.; Zaikin, N.S.; Koren'kov, V.V.; Pervushova, O.V.; Stepanenko, V.A.

    2006-01-01

    At present, after 120 years of the theoretical and experimental works, the issue of the genome macroarchitecture as the highest level of interphase chromosome organization in somatic cell nuclei remains still unresolved. The problem of the spatial arrangement of interphase chromosomes in haploid germ cells has never even been studied. A 3D simulation of packaging of the entire second chromosome in Drosophila mature sperms has been performed by using mathematical approaches and visualization methods to present macromolecular structure data. As genetic markers for simulation, frequency and location of the second inversion breakpoints for 72 structural υg mutants induced by ionizing radiation were used supposing that both ends of each inversion are topologically brought together forming loop of appropriate size. For the account of a degree of spatial affinity and visualization of chromosomal loops modern 3D-modeling methods with application of splines, libraries OpenGL, language Delphi, program Gmax were used. According to the model proposed, the entire second chromosome within mature sperm nuclei seems to be packaged in the form of a megarosette-loop structure which may be a basic principle of organization of the genome macro-architecture in animal haploid germ cells

  20. Estimating the Probability of Traditional Copying, Conditional on Answer-Copying Statistics.

    Science.gov (United States)

    Allen, Jeff; Ghattas, Andrew

    2016-06-01

    Statistics for detecting copying on multiple-choice tests produce p values measuring the probability of a value at least as large as that observed, under the null hypothesis of no copying. The posterior probability of copying is arguably more relevant than the p value, but cannot be derived from Bayes' theorem unless the population probability of copying and probability distribution of the answer-copying statistic under copying are known. In this article, the authors develop an estimator for the posterior probability of copying that is based on estimable quantities and can be used with any answer-copying statistic. The performance of the estimator is evaluated via simulation, and the authors demonstrate how to apply the formula using actual data. Potential uses, generalizability to other types of cheating, and limitations of the approach are discussed.

  1. Broad genomic and transcriptional analysis reveals a highly derived genome in dinoflagellate mitochondria

    Directory of Open Access Journals (Sweden)

    Keeling Patrick J

    2007-09-01

    Full Text Available Abstract Background Dinoflagellates comprise an ecologically significant and diverse eukaryotic phylum that is sister to the phylum containing apicomplexan endoparasites. The mitochondrial genome of apicomplexans is uniquely reduced in gene content and size, encoding only three proteins and two ribosomal RNAs (rRNAs within a highly compacted 6 kb DNA. Dinoflagellate mitochondrial genomes have been comparatively poorly studied: limited available data suggest some similarities with apicomplexan mitochondrial genomes but an even more radical type of genomic organization. Here, we investigate structure, content and expression of dinoflagellate mitochondrial genomes. Results From two dinoflagellates, Crypthecodinium cohnii and Karlodinium micrum, we generated over 42 kb of mitochondrial genomic data that indicate a reduced gene content paralleling that of mitochondrial genomes in apicomplexans, i.e., only three protein-encoding genes and at least eight conserved components of the highly fragmented large and small subunit rRNAs. Unlike in apicomplexans, dinoflagellate mitochondrial genes occur in multiple copies, often as gene fragments, and in numerous genomic contexts. Analysis of cDNAs suggests several novel aspects of dinoflagellate mitochondrial gene expression. Polycistronic transcripts were found, standard start codons are absent, and oligoadenylation occurs upstream of stop codons, resulting in the absence of termination codons. Transcripts of at least one gene, cox3, are apparently trans-spliced to generate full-length mRNAs. RNA substitutional editing, a process previously identified for mRNAs in dinoflagellate mitochondria, is also implicated in rRNA expression. Conclusion The dinoflagellate mitochondrial genome shares the same gene complement and fragmentation of rRNA genes with its apicomplexan counterpart. However, it also exhibits several unique characteristics. Most notable are the expansion of gene copy numbers and their arrangements

  2. Generation of doubled haploid transgenic wheat lines by microspore transformation.

    Directory of Open Access Journals (Sweden)

    Rhoda A T Brew-Appiah

    Full Text Available Microspores can be induced to develop homozygous doubled haploid plants in a single generation. In the present experiments androgenic microspores of wheat have been genetically transformed and developed into mature homozygous transgenic plants. Two different transformation techniques were investigated, one employing electroporation and the other co-cultivation with Agrobacterium tumefaciens. Different tissue culture and transfection conditions were tested on nine different wheat cultivars using four different constructs. A total of 19 fertile transformants in five genotypes from four market classes of common wheat were recovered by the two procedures. PCR followed by DNA sequencing of the products, Southern blot analyses and bio/histo-chemical and histological assays of the recombinant enzymes confirmed the presence of the transgenes in the T0 transformants and their stable inheritance in homozygous T1∶2 doubled haploid progenies. Several decisive factors determining the transformation and regeneration efficiency with the two procedures were determined: (i pretreatment of immature spikes with CuSO4 solution (500 mg/L at 4°C for 10 days; (ii electroporation of plasmid DNA in enlarged microspores by a single pulse of ∼375 V; (iii induction of microspores after transfection at 28°C in NPB-99 medium and regeneration at 26°C in MMS5 medium; (iv co-cultivation with Agrobacterium AGL-1 cells for transfer of plasmid T-DNA into microspores at day 0 for <24 hours; and (v elimination of AGL-1 cells after co-cultivation with timentin (200-400 mg/L.

  3. Array-based genomic screening at diagnosis and during follow-up in chronic lymphocytic leukemia

    DEFF Research Database (Denmark)

    Gunnarsson, Rebeqa; Mansouri, Larry; Isaksson, Anders

    2011-01-01

    High-resolution genomic microarrays enable simultaneous detection of copy-number aberrations such as the known recurrent aberrations in chronic lymphocytic leukemia [del(11q), del(13q), del(17p) and trisomy 12], and copy-number neutral loss of heterozygosity. Moreover, comparison of genomic...... profiles from sequential patients' samples allows detection of clonal evolution....

  4. A Poisson hierarchical modelling approach to detecting copy number variation in sequence coverage data

    OpenAIRE

    Sep?lveda, Nuno; Campino, Susana G; Assefa, Samuel A; Sutherland, Colin J; Pain5, Arnab; Clark, Taane G

    2013-01-01

    BACKGROUND: The advent of next generation sequencing technology has accelerated efforts to map and catalogue copy number variation (CNV) in genomes of important micro-organisms for public health. A typical analysis of the sequence data involves mapping reads onto a reference genome, calculating the respective coverage, and detecting regions with too-low or too-high coverage (deletions and amplifications, respectively). Current CNV detection methods rely on statistical assumptions (e.g., a Poi...

  5. Copy number variation in obsessive-compulsive disorder and tourette syndrome: A cross-disorder study

    NARCIS (Netherlands)

    L.M. McGrath; D. Yu (D.); C.R. Marshall (Christian); L.K. Davis (Lea); B. Thiruvahindrapuram (Bhooma); B. Li (Bingbin); C. Cappi (Carolina); G. Gerber (Gloria); A. de Wolf (Anneke); F.A. Schroeder (Frederick); L. Osiecki (Lisa); C. O'Dushlaine (Colm); A. Kirby (Andrew); C. Illmann (Cornelia); S. Haddad (Stephen); P. Gallagher (Patience); J. Fagerness (Jesen); C.L. Barr (Cathy); L. Bellodi (Laura); F. Benarroch (Fortu); O.J. Bienvenu (Oscar); D.W. Black (Donald); J. Bloch (Jocelyne); R.D. Bruun (Ruth); C.L. Budman (Cathy); B. Camarena (Beatriz); D. Cath (Daniëlle); M.C. Cavallini (Maria); S. Chouinard; V. Coric (Vladimir); C. Cullen; R. Delorme (Richard); D.A.J.P. Denys (Damiaan); E.M. Derks (Eske); Y. Dion (Yves); M.C. Rosário (Maria); C.E. Eapen (Chundamannil Eapen); P. Evans; P. Falkai (Peter); T.V. Fernandez (Thomas); H. Garrido (Helena); D. Geller (Daniel); H.J. Grabe (Hans Jörgen); M. Grados (Marco); B.D. Greenberg (Benjamin); V. Gross-Tsur (Varda); E. Grünblatt (Edna); M.L. Heiman (Mark); S.M.J. Hemmings (Sian); L.D. Herrera (Luis); A.G. Hounie (Ana); J. Jankovic (Joseph); J.L. Kennedy; R.A. King; R. Kurlan; N. Lanzagorta (Nuria); M. Leboyer (Marion); J.F. Leckman; L. Lennertz (Leonhard); C. Lochner (Christine); T.L. Lowe (Thomas); H.N. Lyon (Helen); F. MacCiardi (Fabio); W. Maier (Wolfgang); J.T. McCracken (James); W.M. McMahon (William); D.L. Murphy (Dennis); A.L. Naarden (Allan); E. Nurmi (Erika); A.J. Pakstis; C. Pato (Carlos); C. Pato (Carlos); J. Piacentini (John); C. Pittenger (Christopher); M.N. Pollak (Michael); V.I. Reus (Victor); M.A. Richter (Margaret); M. Riddle (Mark); M.M. Robertson; D. Rosenberg (David); G.A. Rouleau; S. Ruhrmann (Stephan); A.S. Sampaio (Aline); J. Samuels (Jonathan); P. Sandor (Paul); B. Sheppard (Brooke); H.S. Singer (Harvey); J.H. Smit (Jan); D.J. Stein (Dan); J.A. Tischfield (Jay); H. Vallada (Homero); J. Veenstra-Vanderweele (Jeremy); S. Walitza (Susanne); Y. Wang (Ying); A. Wendland (Annika); Y.Y. Shugart; E.C. Miguel (Euripedes); H. Nicolini (Humberto); B.A. Oostra (Ben); R. Moessner (Rainald); M. Wagner (Michael); A. Ruiz-Linares (Andres); P. Heutink (Peter); G. Nestadt (Gerald); N.B. Freimer (Nelson); T.L. Petryshen (Tracey); D. Posthuma (Danielle); M.A. Jenike (Michael); N.J. Cox (Nancy); G.L. Hanna (Gregory); H. Brentani (Helena); S.W. Scherer (Stephen); P.D. Arnold (Paul); S.E. Stewart; C. Mathews; J.A. Knowles (James A); E.H. Cook (Edwin); D.L. Pauls (David); K. Wang (Kai); J.M. Scharf; B.M. Neale (Benjamin)

    2014-01-01

    textabstractObjective Obsessive-compulsive disorder (OCD) and Tourette syndrome (TS) are heritable neurodevelopmental disorders with a partially shared genetic etiology. This study represents the first genome-wide investigation of large (>500 kb), rare (<1%) copy number variants (CNVs) in OCD and

  6. Inference of haplotypic phase and missing genotypes in polyploid organisms and variable copy number genomic regions

    Directory of Open Access Journals (Sweden)

    Balding David J

    2008-12-01

    Full Text Available Abstract Background The power of haplotype-based methods for association studies, identification of regions under selection, and ancestral inference, is well-established for diploid organisms. For polyploids, however, the difficulty of determining phase has limited such approaches. Polyploidy is common in plants and is also observed in animals. Partial polyploidy is sometimes observed in humans (e.g. trisomy 21; Down's syndrome, and it arises more frequently in some human tissues. Local changes in ploidy, known as copy number variations (CNV, arise throughout the genome. Here we present a method, implemented in the software polyHap, for the inference of haplotype phase and missing observations from polyploid genotypes. PolyHap allows each individual to have a different ploidy, but ploidy cannot vary over the genomic region analysed. It employs a hidden Markov model (HMM and a sampling algorithm to infer haplotypes jointly in multiple individuals and to obtain a measure of uncertainty in its inferences. Results In the simulation study, we combine real haplotype data to create artificial diploid, triploid, and tetraploid genotypes, and use these to demonstrate that polyHap performs well, in terms of both switch error rate in recovering phase and imputation error rate for missing genotypes. To our knowledge, there is no comparable software for phasing a large, densely genotyped region of chromosome from triploids and tetraploids, while for diploids we found polyHap to be more accurate than fastPhase. We also compare the results of polyHap to SATlotyper on an experimentally haplotyped tetraploid dataset of 12 SNPs, and show that polyHap is more accurate. Conclusion With the availability of large SNP data in polyploids and CNV regions, we believe that polyHap, our proposed method for inferring haplotypic phase from genotype data, will be useful in enabling researchers analysing such data to exploit the power of haplotype-based analyses.

  7. Parallel Evolution of Copy-Number Variation across Continents in Drosophila melanogaster

    Science.gov (United States)

    Schrider, Daniel R.; Hahn, Matthew W.; Begun, David J.

    2016-01-01

    Genetic differentiation across populations that is maintained in the presence of gene flow is a hallmark of spatially varying selection. In Drosophila melanogaster, the latitudinal clines across the eastern coasts of Australia and North America appear to be examples of this type of selection, with recent studies showing that a substantial portion of the D. melanogaster genome exhibits allele frequency differentiation with respect to latitude on both continents. As of yet there has been no genome-wide examination of differentiated copy-number variants (CNVs) in these geographic regions, despite their potential importance for phenotypic variation in Drosophila and other taxa. Here, we present an analysis of geographic variation in CNVs in D. melanogaster. We also present the first genomic analysis of geographic variation for copy-number variation in the sister species, D. simulans, in order to investigate patterns of parallel evolution in these close relatives. In D. melanogaster we find hundreds of CNVs, many of which show parallel patterns of geographic variation on both continents, lending support to the idea that they are influenced by spatially varying selection. These findings support the idea that polymorphic CNVs contribute to local adaptation in D. melanogaster. In contrast, we find very few CNVs in D. simulans that are geographically differentiated in parallel on both continents, consistent with earlier work suggesting that clinal patterns are weaker in this species. PMID:26809315

  8. Crossing the LINE toward genomic instability: LINE-1 retrotransposition in cancer

    Science.gov (United States)

    Kemp, Jacqueline; Longworth, Michelle

    2015-12-01

    Retrotransposons are repetitive DNA sequences that are positioned throughout the human genome. Retrotransposons are capable of copying themselves and mobilizing new copies to novel genomic locations in a process called retrotransposition. While most retrotransposon sequences in the human genome are incomplete and incapable of mobilization, the LINE-1 retrotransposon, which comprises approximately 17% of the human genome, remains active. The disruption of cellular mechanisms that suppress retrotransposon activity is linked to the generation of aneuploidy, a potential driver of tumor development. When retrotransposons insert into a novel genomic region, they have the potential to disrupt the coding sequence of endogenous genes and alter gene expression, which can lead to deleterious consequences for the organism. Additionally, increased LINE-1 copy numbers provide more chances for recombination events to occur between retrotransposons, which can lead to chromosomal breaks and rearrangements. LINE-1 activity is increased in various cancer cell lines and in patient tissues resected from primary tumors. LINE-1 activity also correlates with increased cancer metastasis. This review aims to give a brief overview of the connections between LINE-1 retrotransposition and the loss of genome stability. We will also discuss the mechanisms that repress retrotransposition in human cells and their links to cancer.

  9. Genome fluctuations in cyanobacteria reflect evolutionary, developmental and adaptive traits

    Science.gov (United States)

    2011-01-01

    Background Cyanobacteria belong to an ancient group of photosynthetic prokaryotes with pronounced variations in their cellular differentiation strategies, physiological capacities and choice of habitat. Sequencing efforts have shown that genomes within this phylum are equally diverse in terms of size and protein-coding capacity. To increase our understanding of genomic changes in the lineage, the genomes of 58 contemporary cyanobacteria were analysed for shared and unique orthologs. Results A total of 404 protein families, present in all cyanobacterial genomes, were identified. Two of these are unique to the phylum, corresponding to an AbrB family transcriptional regulator and a gene that escapes functional annotation although its genomic neighbourhood is conserved among the organisms examined. The evolution of cyanobacterial genome sizes involves a mix of gains and losses in the clade encompassing complex cyanobacteria, while a single event of reduction is evident in a clade dominated by unicellular cyanobacteria. Genome sizes and gene family copy numbers evolve at a higher rate in the former clade, and multi-copy genes were predominant in large genomes. Orthologs unique to cyanobacteria exhibiting specific characteristics, such as filament formation, heterocyst differentiation, diazotrophy and symbiotic competence, were also identified. An ancestral character reconstruction suggests that the most recent common ancestor of cyanobacteria had a genome size of approx. 4.5 Mbp and 1678 to 3291 protein-coding genes, 4%-6% of which are unique to cyanobacteria today. Conclusions The different rates of genome-size evolution and multi-copy gene abundance suggest two routes of genome development in the history of cyanobacteria. The expansion strategy is driven by gene-family enlargment and generates a broad adaptive potential; while the genome streamlining strategy imposes adaptations to highly specific niches, also reflected in their different functional capacities. A few

  10. Genome fluctuations in cyanobacteria reflect evolutionary, developmental and adaptive traits

    Directory of Open Access Journals (Sweden)

    Nylander Johan AA

    2011-06-01

    Full Text Available Abstract Background Cyanobacteria belong to an ancient group of photosynthetic prokaryotes with pronounced variations in their cellular differentiation strategies, physiological capacities and choice of habitat. Sequencing efforts have shown that genomes within this phylum are equally diverse in terms of size and protein-coding capacity. To increase our understanding of genomic changes in the lineage, the genomes of 58 contemporary cyanobacteria were analysed for shared and unique orthologs. Results A total of 404 protein families, present in all cyanobacterial genomes, were identified. Two of these are unique to the phylum, corresponding to an AbrB family transcriptional regulator and a gene that escapes functional annotation although its genomic neighbourhood is conserved among the organisms examined. The evolution of cyanobacterial genome sizes involves a mix of gains and losses in the clade encompassing complex cyanobacteria, while a single event of reduction is evident in a clade dominated by unicellular cyanobacteria. Genome sizes and gene family copy numbers evolve at a higher rate in the former clade, and multi-copy genes were predominant in large genomes. Orthologs unique to cyanobacteria exhibiting specific characteristics, such as filament formation, heterocyst differentiation, diazotrophy and symbiotic competence, were also identified. An ancestral character reconstruction suggests that the most recent common ancestor of cyanobacteria had a genome size of approx. 4.5 Mbp and 1678 to 3291 protein-coding genes, 4%-6% of which are unique to cyanobacteria today. Conclusions The different rates of genome-size evolution and multi-copy gene abundance suggest two routes of genome development in the history of cyanobacteria. The expansion strategy is driven by gene-family enlargment and generates a broad adaptive potential; while the genome streamlining strategy imposes adaptations to highly specific niches, also reflected in their different

  11. Genome-wide massively parallel sequencing of formaldehyde fixed-paraffin embedded (FFPE tumor tissues for copy-number- and mutation-analysis.

    Directory of Open Access Journals (Sweden)

    Michal R Schweiger

    Full Text Available BACKGROUND: Cancer re-sequencing programs rely on DNA isolated from fresh snap frozen tissues, the preparation of which is combined with additional preservation efforts. Tissue samples at pathology departments are routinely stored as formalin-fixed and paraffin-embedded (FFPE samples and their use would open up access to a variety of clinical trials. However, FFPE preparation is incompatible with many down-stream molecular biology techniques such as PCR based amplification methods and gene expression studies. METHODOLOGY/PRINCIPAL FINDINGS: Here we investigated the sample quality requirements of FFPE tissues for massively parallel short-read sequencing approaches. We evaluated key variables of pre-fixation, fixation related and post-fixation processes that occur in routine medical service (e.g. degree of autolysis, duration of fixation and of storage. We also investigated the influence of tissue storage time on sequencing quality by using material that was up to 18 years old. Finally, we analyzed normal and tumor breast tissues using the Sequencing by Synthesis technique (Illumina Genome Analyzer, Solexa to simultaneously localize genome-wide copy number alterations and to detect genomic variations such as substitutions and point-deletions and/or insertions in FFPE tissue samples. CONCLUSIONS/SIGNIFICANCE: The application of second generation sequencing techniques on small amounts of FFPE material opens up the possibility to analyze tissue samples which have been collected during routine clinical work as well as in the context of clinical trials. This is in particular important since FFPE samples are amply available from surgical tumor resections and histopathological diagnosis, and comprise tissue from precursor lesions, primary tumors, lymphogenic and/or hematogenic metastases. Large-scale studies using this tissue material will result in a better prediction of the prognosis of cancer patients and the early identification of patients which

  12. Clinical significance of rare copy number variations in epilepsy: a case-control survey using microarray-based comparative genomic hybridization.

    Science.gov (United States)

    Striano, Pasquale; Coppola, Antonietta; Paravidino, Roberta; Malacarne, Michela; Gimelli, Stefania; Robbiano, Angela; Traverso, Monica; Pezzella, Marianna; Belcastro, Vincenzo; Bianchi, Amedeo; Elia, Maurizio; Falace, Antonio; Gazzerro, Elisabetta; Ferlazzo, Edoardo; Freri, Elena; Galasso, Roberta; Gobbi, Giuseppe; Molinatto, Cristina; Cavani, Simona; Zuffardi, Orsetta; Striano, Salvatore; Ferrero, Giovanni Battista; Silengo, Margherita; Cavaliere, Maria Luigia; Benelli, Matteo; Magi, Alberto; Piccione, Maria; Dagna Bricarelli, Franca; Coviello, Domenico A; Fichera, Marco; Minetti, Carlo; Zara, Federico

    2012-03-01

    To perform an extensive search for genomic rearrangements by microarray-based comparative genomic hybridization in patients with epilepsy. Prospective cohort study. Epilepsy centers in Italy. Two hundred seventy-nine patients with unexplained epilepsy, 265 individuals with nonsyndromic mental retardation but no epilepsy, and 246 healthy control subjects were screened by microarray-based comparative genomic hybridization. Identification of copy number variations (CNVs) and gene enrichment. Rare CNVs occurred in 26 patients (9.3%) and 16 healthy control subjects (6.5%) (P = .26). The CNVs identified in patients were larger (P = .03) and showed higher gene content (P = .02) than those in control subjects. The CNVs larger than 1 megabase (P = .002) and including more than 10 genes (P = .005) occurred more frequently in patients than in control subjects. Nine patients (34.6%) among those harboring rare CNVs showed rearrangements associated with emerging microdeletion or microduplication syndromes. Mental retardation and neuropsychiatric features were associated with rare CNVs (P = .004), whereas epilepsy type was not. The CNV rate in patients with epilepsy and mental retardation or neuropsychiatric features is not different from that observed in patients with mental retardation only. Moreover, significant enrichment of genes involved in ion transport was observed within CNVs identified in patients with epilepsy. Patients with epilepsy show a significantly increased burden of large, rare, gene-rich CNVs, particularly when associated with mental retardation and neuropsychiatric features. The limited overlap between CNVs observed in the epilepsy group and those observed in the group with mental retardation only as well as the involvement of specific (ion channel) genes indicate a specific association between the identified CNVs and epilepsy. Screening for CNVs should be performed for diagnostic purposes preferentially in patients with epilepsy and mental retardation or

  13. Assessment of copy number variations in 120 patients with Poland syndrome.

    Science.gov (United States)

    Vaccari, Carlotta Maria; Tassano, Elisa; Torre, Michele; Gimelli, Stefania; Divizia, Maria Teresa; Romanini, Maria Victoria; Bossi, Simone; Musante, Ilaria; Valle, Maura; Senes, Filippo; Catena, Nunzio; Bedeschi, Maria Francesca; Baban, Anwar; Calevo, Maria Grazia; Acquaviva, Massimo; Lerone, Margherita; Ravazzolo, Roberto; Puliti, Aldamaria

    2016-11-25

    Poland Syndrome (PS) is a rare congenital disorder presenting with agenesis/hypoplasia of the pectoralis major muscle variably associated with thoracic and/or upper limb anomalies. Most cases are sporadic, but familial recurrence, with different inheritance patterns, has been observed. The genetic etiology of PS remains unknown. Karyotyping and array-comparative genomic hybridization (CGH) analyses can identify genomic imbalances that can clarify the genetic etiology of congenital and neurodevelopmental disorders. We previously reported a chromosome 11 deletion in twin girls with pectoralis muscle hypoplasia and skeletal anomalies, and a chromosome six deletion in a patient presenting a complex phenotype that included pectoralis muscle hypoplasia. However, the contribution of genomic imbalances to PS remains largely unknown. To investigate the prevalence of chromosomal imbalances in PS, standard cytogenetic and array-CGH analyses were performed in 120 PS patients. Following the application of stringent filter criteria, 14 rare copy number variations (CNVs) were identified in 14 PS patients in different regions outside known common copy number variations: seven genomic duplications and seven genomic deletions, enclosing the two previously reported PS associated chromosomal deletions. These CNVs ranged from 0.04 to 4.71 Mb in size. Bioinformatic analysis of array-CGH data indicated gene enrichment in pathways involved in cell-cell adhesion, DNA binding and apoptosis processes. The analysis also provided a number of candidate genes possibly causing the developmental defects observed in PS patients, among others REV3L, a gene coding for an error-prone DNA polymerase previously associated with Möbius Syndrome with variable phenotypes including pectoralis muscle agenesis. A number of rare CNVs were identified in PS patients, and these involve genes that represent candidates for further evaluation. Rare inherited CNVs may contribute to, or represent risk factors of PS

  14. SINEs as driving forces in genome evolution.

    Science.gov (United States)

    Schmitz, J

    2012-01-01

    SINEs are short interspersed elements derived from cellular RNAs that repetitively retropose via RNA intermediates and integrate more or less randomly back into the genome. SINEs propagate almost entirely vertically within their host cells and, once established in the germline, are passed on from generation to generation. As non-autonomous elements, their reverse transcription (from RNA to cDNA) and genomic integration depends on the activity of the enzymatic machinery of autonomous retrotransposons, such as long interspersed elements (LINEs). SINEs are widely distributed in eukaryotes, but are especially effectively propagated in mammalian species. For example, more than a million Alu-SINE copies populate the human genome (approximately 13% of genomic space), and few master copies of them are still active. In the organisms where they occur, SINEs are a challenge to genomic integrity, but in the long term also can serve as beneficial building blocks for evolution, contributing to phenotypic heterogeneity and modifying gene regulatory networks. They substantially expand the genomic space and introduce structural variation to the genome. SINEs have the potential to mutate genes, to alter gene expression, and to generate new parts of genes. A balanced distribution and controlled activity of such properties is crucial to maintaining the organism's dynamic and thriving evolution. Copyright © 2012 S. Karger AG, Basel.

  15. The complete chloroplast genome of a medicinal plant Epimedium koreanum Nakai (Berberidaceae).

    Science.gov (United States)

    Lee, Jung-Hoon; Kim, Kyunghee; Kim, Na-Rae; Lee, Sang-Choon; Yang, Tae-Jin; Kim, Young-Dong

    2016-11-01

    Epimedium koreanum is a perennial medicinal plant distributed in Eastern Asia. The complete chloroplast genome sequences of E. koreanum was obtained by de novo assembly using whole genome next-generation sequences. The chloroplast genome of E. koreanum was 157 218 bp in length and separated into four distinct regions such as large single copy region (89 600 bp), small single copy region (17 222 bp) and a pair of inverted repeat regions (25 198 bp). The genome contained a total of 112 genes including 78 protein-coding genes, 30 tRNA genes, and 4 rRNA genes. Phylogenetic analysis with the reported chloroplast genomes revealed that E. koreanum is most closely related to Berberis bealei, a traditional medicinal plant in the Berberidaceae family.

  16. Meraculous: De Novo Genome Assembly with Short Paired-End Reads

    Energy Technology Data Exchange (ETDEWEB)

    Chapman, Jarrod A.; Ho, Isaac; Sunkara, Sirisha; Luo, Shujun; Schroth, Gary P.; Rokhsar, Daniel S.; Salzberg, Steven L.

    2011-08-18

    We describe a new algorithm, meraculous, for whole genome assembly of deep paired-end short reads, and apply it to the assembly of a dataset of paired 75-bp Illumina reads derived from the 15.4 megabase genome of the haploid yeast Pichia stipitis. More than 95% of the genome is recovered, with no errors; half the assembled sequence is in contigs longer than 101 kilobases and in scaffolds longer than 269 kilobases. Incorporating fosmid ends recovers entire chromosomes. Meraculous relies on an efficient and conservative traversal of the subgraph of the k-mer (deBruijn) graph of oligonucleotides with unique high quality extensions in the dataset, avoiding an explicit error correction step as used in other short-read assemblers. A novel memory-efficient hashing scheme is introduced. The resulting contigs are ordered and oriented using paired reads separated by ~280 bp or ~3.2 kbp, and many gaps between contigs can be closed using paired-end placements. Practical issues with the dataset are described, and prospects for assembling larger genomes are discussed.

  17. Metabolic engineering of a haploid strain derived from a triploid industrial yeast for producing cellulosic ethanol.

    Science.gov (United States)

    Kim, Soo Rin; Skerker, Jeffrey M; Kong, In Iok; Kim, Heejin; Maurer, Matthew J; Zhang, Guo-Chang; Peng, Dairong; Wei, Na; Arkin, Adam P; Jin, Yong-Su

    2017-03-01

    Many desired phenotypes for producing cellulosic biofuels are often observed in industrial Saccharomyces cerevisiae strains. However, many industrial yeast strains are polyploid and have low spore viability, making it difficult to use these strains for metabolic engineering applications. We selected the polyploid industrial strain S. cerevisiae ATCC 4124 exhibiting rapid glucose fermentation capability, high ethanol productivity, strong heat and inhibitor tolerance in order to construct an optimal yeast strain for producing cellulosic ethanol. Here, we focused on developing a general approach and high-throughput screening method to isolate stable haploid segregants derived from a polyploid parent, such as triploid ATCC 4124 with a poor spore viability. Specifically, we deleted the HO genes, performed random sporulation, and screened the resulting segregants based on growth rate, mating type, and ploidy. Only one stable haploid derivative (4124-S60) was isolated, while 14 other segregants with a stable mating type were aneuploid. The 4124-S60 strain inherited only a subset of desirable traits present in the parent strain, same as other aneuploids, suggesting that glucose fermentation and specific ethanol productivity are likely to be genetically complex traits and/or they might depend on ploidy. Nonetheless, the 4124-60 strain did inherit the ability to tolerate fermentation inhibitors. When additional genetic perturbations known to improve xylose fermentation were introduced into the 4124-60 strain, the resulting engineered strain (IIK1) was able to ferment a Miscanthus hydrolysate better than a previously engineered laboratory strain (SR8), built by making the same genetic changes. However, the IIK1 strain showed higher glycerol and xylitol yields than the SR8 strain. In order to decrease glycerol and xylitol production, an NADH-dependent acetate reduction pathway was introduced into the IIK1 strain. By consuming 2.4g/L of acetate, the resulting strain (IIK1A

  18. Diversity and population-genetic properties of copy number variations and multicopy genes in cattle

    Science.gov (United States)

    Bickhart, Derek M.; Xu, Lingyang; Hutchison, Jana L.; Cole, John B.; Null, Daniel J.; Schroeder, Steven G.; Song, Jiuzhou; Garcia, Jose Fernando; Sonstegard, Tad S.; Van Tassell, Curtis P.; Schnabel, Robert D.; Taylor, Jeremy F.; Lewin, Harris A.; Liu, George E.

    2016-01-01

    The diversity and population genetics of copy number variation (CNV) in domesticated animals are not well understood. In this study, we analysed 75 genomes of major taurine and indicine cattle breeds (including Angus, Brahman, Gir, Holstein, Jersey, Limousin, Nelore, and Romagnola), sequenced to 11-fold coverage to identify 1,853 non-redundant CNV regions. Supported by high validation rates in array comparative genomic hybridization (CGH) and qPCR experiments, these CNV regions accounted for 3.1% (87.5 Mb) of the cattle reference genome, representing a significant increase over previous estimates of the area of the genome that is copy number variable (∼2%). Further population genetics and evolutionary genomics analyses based on these CNVs revealed the population structures of the cattle taurine and indicine breeds and uncovered potential diversely selected CNVs near important functional genes, including AOX1, ASZ1, GAT, GLYAT, and KRTAP9-1. Additionally, 121 CNV gene regions were found to be either breed specific or differentially variable across breeds, such as RICTOR in dairy breeds and PNPLA3 in beef breeds. In contrast, clusters of the PRP and PAG genes were found to be duplicated in all sequenced animals, suggesting that subfunctionalization, neofunctionalization, or overdominance play roles in diversifying those fertility-related genes. These CNV results provide a new glimpse into the diverse selection histories of cattle breeds and a basis for correlating structural variation with complex traits in the future. PMID:27085184

  19. The complete chloroplast genome sequence of Hibiscus syriacus.

    Science.gov (United States)

    Kwon, Hae-Yun; Kim, Joon-Hyeok; Kim, Sea-Hyun; Park, Ji-Min; Lee, Hyoshin

    2016-09-01

    The complete chloroplast genome sequence of Hibiscus syriacus L. is presented in this study. The genome is composed of 161 019 bp in length, with a typical circular structure containing a pair of inverted repeats of 25 745 bp of length separated by a large single-copy region and a small single-copy region of 89 698 bp and 19 831 bp of length, respectively. The overall GC content is 36.8%. One hundred and fourteen genes were annotated, including 81 protein-coding genes, 4 ribosomal RNA genes and 29 transfer RNA genes.

  20. Copy number variation in obsessive-compulsive disorder and tourette syndrome: a cross-disorder study.

    NARCIS (Netherlands)

    McGrath, L.M.; Yu, D.; Marshall, C.; Davis, L.K.; Thiruvahindrapuram, B.; Li, B.; Cappi, C.; Gerber, G.; Wolf, A.; Schroeder, F.A.; Osiecki, L.; O'Dushlaine, C.; Kirby, A.; Illmann, C.; Haddad, S.; Gallagher, P.; Fagerness, J.A.; Barr, C.L.; Bellodi, L.; Benarroch, F.; Bienvenu, O.J.; Black, D. W.; Bloch, M.H.; Bruun, R.D.; Budman, C.L.; Camarena, B.; Cath, D.C.; Cavallini, M.C.; Chouinard, S.; Coric, V.; Cullen, B.; Delorme, R.; Denys, D.; Derks, E.M.; Dion, Y.; Rosário, M.C.; Eapen, V.; Evans, P.; Falkai, P.; Fernandez, T.V.; Garrido, H.; Geller, D.; Grabe, H.J.; Grados, M.A.; Greenberg, B.D.; Gross-Tsur, V.; Grünblatt, E.; Heiman, G.A.; Hemmings, S.M.; Herrera, L.D.; Hounie, A.G.; Jankovic, J.; Kennedy, J.L.; King, R.A.; Kurlan, R.; Lanzagorta, N.; Leboyer, M.; Leckman, J.F.; Lennertz, L.; Lochner, C.; Lowe, T.L.; Lyon, G.J.; Macciardi, F.; Maier, W.; McCracken, J.T.; McMahon, W.; Murphy, D.L.; Naarden, A.L.; Neale, B. M.; Nurmi, E.; Pakstis, A.J.; Pato, M. T.; Piacentini, J.; Pittenger, C.; Pollak, Y.; Reus, V.I.; Richter, M.A.; Riddle, M.; Robertson, M.M.; Rosenberg, D.; Rouleau, G.A.; Ruhrmann, S.; Sampaio, A.S.; Samuels, J.; Sandor, P.; Sheppard, B.; Singer, H.S.; Smit, J.H.; Stein, D.J.; Tischfield, J.A.; Vallada, H.; Veenstra-Vanderweele, J.; Walitza, S.; Wang, Y.; Wendland, J.R.; Shugart, Y.Y.; Miguel, E.C.; Nicolini, H.; Oostra, B.A.; Moessner, R.; Wagner, M.; Ruiz-Linares, A.; Heutink, P.; Nestadt, G.; Freimer, N.; Petryshen, T.; Posthuma, D.; Jenike, M.A.; Cox, N.J.; Hanna, G.L.; Brentani, H.; Scherer, S.W.; Arnold, P.D.; Stewart, S.E.; Mathews, C.A.; Knowles, J.A.; Cook, E.H.; Pauls, D.L.; Wang, K.; Scharf, J.M.

    2014-01-01

    Objective Obsessive-compulsive disorder (OCD) and Tourette syndrome (TS) are heritable neurodevelopmental disorders with a partially shared genetic etiology. This study represents the first genome-wide investigation of large (>500 kb), rare (<1%) copy number variants (CNVs) in OCD and the largest

  1. Copy number variation in obsessive-compulsive disorder and tourette syndrome : a cross-disorder study

    NARCIS (Netherlands)

    McGrath, Lauren M; Yu, Dongmei; Marshall, Christian; Davis, Lea K; Thiruvahindrapuram, Bhooma; Li, Bingbin; Cappi, Carolina; Gerber, Gloria; Wolf, Aaron; Schroeder, Frederick A; Osiecki, Lisa; O'Dushlaine, Colm; Kirby, Andrew; Illmann, Cornelia; Haddad, Stephen; Gallagher, Patience; Fagerness, Jesen A; Barr, Cathy L; Bellodi, Laura; Benarroch, Fortu; Bienvenu, O Joseph; Black, Donald W; Bloch, Michael H; Bruun, Ruth D; Budman, Cathy L; Camarena, Beatriz; Cath, Danielle C; Cavallini, Maria C; Chouinard, Sylvain; Coric, Vladimir; Cullen, Bernadette; Delorme, Richard; Denys, D.; Derks, Eske M; Dion, Yves; Rosário, Maria C; Eapen, Valsama; Evans, Patrick; Falkai, Peter; Fernandez, Thomas V; Garrido, Helena; Geller, Daniel; Grabe, Hans J; Grados, Marco A; Greenberg, Benjamin D; Gross-Tsur, Varda; Grünblatt, Edna; Heiman, Gary A; Hemmings, Sian M J; Herrera, Luis D; Hounie, Ana G; Jankovic, Joseph; Kennedy, James L; King, Robert A; Kurlan, Roger; Lanzagorta, Nuria; Leboyer, Marion; Leckman, James F; Lennertz, Leonhard; Lochner, Christine; Lowe, Thomas L; Lyon, Gholson J; Macciardi, Fabio; Maier, Wolfgang; McCracken, James T; McMahon, William; Murphy, Dennis L; Naarden, Allan L; Neale, Benjamin M; Nurmi, Erika; Pakstis, Andrew J; Pato, Michele T; Pato, Carlos N; Piacentini, John; Pittenger, Christopher; Pollak, Yehuda; Reus, Victor I; Richter, Margaret A; Riddle, Mark; Robertson, Mary M; Rosenberg, David; Rouleau, Guy A; Ruhrmann, Stephan; Sampaio, Aline S; Samuels, Jack; Sandor, Paul; Sheppard, Brooke; Singer, Harvey S; Smit, Jan H; Stein, Dan J; Tischfield, Jay A; Vallada, Homero; Veenstra-VanderWeele, Jeremy; Walitza, Susanne; Wang, Ying; Wendland, Jens R; Shugart, Yin Yao; Miguel, Euripedes C; Nicolini, Humberto; Oostra, Ben A; Moessner, Rainald; Wagner, Michael; Ruiz-Linares, Andres; Heutink, Peter; Nestadt, Gerald; Freimer, Nelson; Petryshen, Tracey; Posthuma, Danielle; Jenike, Michael A; Cox, Nancy J; Hanna, Gregory L; Brentani, Helena; Scherer, Stephen W; Arnold, Paul D; Stewart, S Evelyn; Mathews, Carol A; Knowles, James A; Cook, Edwin H; Pauls, David L; Wang, Kai; Scharf, Jeremiah M

    OBJECTIVE: Obsessive-compulsive disorder (OCD) and Tourette syndrome (TS) are heritable neurodevelopmental disorders with a partially shared genetic etiology. This study represents the first genome-wide investigation of large (>500 kb), rare (<1%) copy number variants (CNVs) in OCD and the largest

  2. Inferring mechanisms of copy number change from haplotype structures at the human DEFA1A3 locus.

    Science.gov (United States)

    Black, Holly A; Khan, Fayeza F; Tyson, Jess; Al Armour, John

    2014-07-21

    The determination of structural haplotypes at copy number variable regions can indicate the mechanisms responsible for changes in copy number, as well as explain the relationship between gene copy number and expression. However, obtaining spatial information at regions displaying extensive copy number variation, such as the DEFA1A3 locus, is complex, because of the difficulty in the phasing and assembly of these regions. The DEFA1A3 locus is intriguing in that it falls within a region of high linkage disequilibrium, despite its high variability in copy number (n = 3-16); hence, the mechanisms responsible for changes in copy number at this locus are unclear. In this study, a region flanking the DEFA1A3 locus was sequenced across 120 independent haplotypes with European ancestry, identifying five common classes of DEFA1A3 haplotype. Assigning DEFA1A3 class to haplotypes within the 1000 Genomes project highlights a significant difference in DEFA1A3 class frequencies between populations with different ancestry. The features of each DEFA1A3 class, for example, the associated DEFA1A3 copy numbers, were initially assessed in a European cohort (n = 599) and replicated in the 1000 Genomes samples, showing within-class similarity, but between-class and between-population differences in the features of the DEFA1A3 locus. Emulsion haplotype fusion-PCR was used to generate 61 structural haplotypes at the DEFA1A3 locus, showing a high within-class similarity in structure. Structural haplotypes across the DEFA1A3 locus indicate that intra-allelic rearrangement is the predominant mechanism responsible for changes in DEFA1A3 copy number, explaining the conservation of linkage disequilibrium across the locus. The identification of common structural haplotypes at the DEFA1A3 locus could aid studies into how DEFA1A3 copy number influences expression, which is currently unclear.

  3. Assessing Mitochondrial DNA Variation and Copy Number in Lymphocytes of ~2,000 Sardinians Using Tailored Sequencing Analysis Tools.

    Directory of Open Access Journals (Sweden)

    Jun Ding

    2015-07-01

    Full Text Available DNA sequencing identifies common and rare genetic variants for association studies, but studies typically focus on variants in nuclear DNA and ignore the mitochondrial genome. In fact, analyzing variants in mitochondrial DNA (mtDNA sequences presents special problems, which we resolve here with a general solution for the analysis of mtDNA in next-generation sequencing studies. The new program package comprises 1 an algorithm designed to identify mtDNA variants (i.e., homoplasmies and heteroplasmies, incorporating sequencing error rates at each base in a likelihood calculation and allowing allele fractions at a variant site to differ across individuals; and 2 an estimation of mtDNA copy number in a cell directly from whole-genome sequencing data. We also apply the methods to DNA sequence from lymphocytes of ~2,000 SardiNIA Project participants. As expected, mothers and offspring share all homoplasmies but a lesser proportion of heteroplasmies. Both homoplasmies and heteroplasmies show 5-fold higher transition/transversion ratios than variants in nuclear DNA. Also, heteroplasmy increases with age, though on average only ~1 heteroplasmy reaches the 4% level between ages 20 and 90. In addition, we find that mtDNA copy number averages ~110 copies/lymphocyte and is ~54% heritable, implying substantial genetic regulation of the level of mtDNA. Copy numbers also decrease modestly but significantly with age, and females on average have significantly more copies than males. The mtDNA copy numbers are significantly associated with waist circumference (p-value = 0.0031 and waist-hip ratio (p-value = 2.4×10-5, but not with body mass index, indicating an association with central fat distribution. To our knowledge, this is the largest population analysis to date of mtDNA dynamics, revealing the age-imposed increase in heteroplasmy, the relatively high heritability of copy number, and the association of copy number with metabolic traits.

  4. Specific functions of the Rep and Rep׳ proteins of porcine circovirus during copy-release and rolling-circle DNA replication.

    Science.gov (United States)

    Cheung, Andrew K

    2015-07-01

    The roles of two porcine circovirus replication initiator proteins, Rep and Rep׳, in generating copy-release and rolling-circle DNA replication intermediates were determined. Rep uses the supercoiled closed-circular genome (ccc) to initiate leading-strand synthesis (identical to copy-release replication) and generates the single-stranded circular (ssc) genome from the displaced DNA strand. In the process, a minus-genome primer (MGP) necessary for complementary-strand synthesis, from ssc to ccc, is synthesized. Rep׳ cleaves the growing nascent-strand to regenerate the parent ccc molecule. In the process, a Rep׳-DNA hybrid containing the right palindromic sequence (at the origin of DNA replication) is generated. Analysis of the virus particle showed that it is composed of four components: ssc, MGP, capsid protein and a novel Rep-related protein (designated Protein-3). Copyright © 2015. Published by Elsevier Inc.

  5. The complete chloroplast genome sequences of Lychnis wilfordii and Silene capitata and comparative analyses with other Caryophyllaceae genomes.

    Science.gov (United States)

    Kang, Jong-Soo; Lee, Byoung Yoon; Kwak, Myounghai

    2017-01-01

    The complete chloroplast genomes of Lychnis wilfordii and Silene capitata were determined and compared with ten previously reported Caryophyllaceae chloroplast genomes. The chloroplast genome sequences of L. wilfordii and S. capitata contain 152,320 bp and 150,224 bp, respectively. The gene contents and orders among 12 Caryophyllaceae species are consistent, but several microstructural changes have occurred. Expansion of the inverted repeat (IR) regions at the large single copy (LSC)/IRb and small single copy (SSC)/IR boundaries led to partial or entire gene duplications. Additionally, rearrangements of the LSC region were caused by gene inversions and/or transpositions. The 18 kb inversions, which occurred three times in different lineages of tribe Sileneae, were thought to be facilitated by the intermolecular duplicated sequences. Sequence analyses of the L. wilfordii and S. capitata genomes revealed 39 and 43 repeats, respectively, including forward, palindromic, and reverse repeats. In addition, a total of 67 and 56 simple sequence repeats were discovered in the L. wilfordii and S. capitata chloroplast genomes, respectively. Finally, we constructed phylogenetic trees of the 12 Caryophyllaceae species and two Amaranthaceae species based on 73 protein-coding genes using both maximum parsimony and likelihood methods.

  6. Transcript levels of ten caste-related genes in adult diploid males of Melipona quadrifasciata (Hymenoptera, Apidae: a comparison with haploid males, queens and workers

    Directory of Open Access Journals (Sweden)

    Andreia A. Borges

    2011-01-01

    Full Text Available In Hymenoptera, homozygosity at the sex locus results in the production of diploid males. In social species, these pose a double burden by having low fitness and drawing resources normally spent for increasing the work force of a colony. Yet, diploid males are of academic interest as they can elucidate effects of ploidy (normal males are haploid, whereas the female castes, the queens and workers, are diploid on morphology and life history. Herein we investigated expression levels of ten caste-related genes in the stingless bee Melipona quadrifasciata, comparing newly emerged and 5-day-old diploid males with haploid males, queens and workers. In diploid males, transcript levels for dunce and paramyosin were increased during the first five days of adult life, while those for diacylglycerol kinase and the transcriptional co-repressor groucho diminished. Two general trends were apparent, (i gene expression patterns in diploid males were overall more similar to haploid ones and workers than to queens, and (ii in queens and workers, more genes were up-regulated after emergence until day five, whereas in diploid and especially so in haploid males more genes were down-regulated. This difference between the sexes may be related to longevity, which is much longer in females than in males.

  7. Transcript levels of ten caste-related genes in adult diploid males of Melipona quadrifasciata (Hymenoptera, Apidae) - A comparison with haploid males, queens and workers.

    Science.gov (United States)

    Borges, Andreia A; Humann, Fernanda C; Oliveira Campos, Lucio A; Tavares, Mara G; Hartfelder, Klaus

    2011-10-01

    In Hymenoptera, homozygosity at the sex locus results in the production of diploid males. In social species, these pose a double burden by having low fitness and drawing resources normally spent for increasing the work force of a colony. Yet, diploid males are of academic interest as they can elucidate effects of ploidy (normal males are haploid, whereas the female castes, the queens and workers, are diploid) on morphology and life history. Herein we investigated expression levels of ten caste-related genes in the stingless bee Melipona quadrifasciata, comparing newly emerged and 5-day-old diploid males with haploid males, queens and workers. In diploid males, transcript levels for dunce and paramyosin were increased during the first five days of adult life, while those for diacylglycerol kinase and the transcriptional co-repressor groucho diminished. Two general trends were apparent, (i) gene expression patterns in diploid males were overall more similar to haploid ones and workers than to queens, and (ii) in queens and workers, more genes were up-regulated after emergence until day five, whereas in diploid and especially so in haploid males more genes were down-regulated. This difference between the sexes may be related to longevity, which is much longer in females than in males.

  8. The First Complete Chloroplast Genome Sequences in Actinidiaceae: Genome Structure and Comparative Analysis.

    Science.gov (United States)

    Yao, Xiaohong; Tang, Ping; Li, Zuozhou; Li, Dawei; Liu, Yifei; Huang, Hongwen

    2015-01-01

    Actinidia chinensis is an important economic plant belonging to the basal lineage of the asterids. Availability of a complete Actinidia chloroplast genome sequence is crucial to understanding phylogenetic relationships among major lineages of angiosperms and facilitates kiwifruit genetic improvement. We report here the complete nucleotide sequences of the chloroplast genomes for Actinidia chinensis and A. chinensis var deliciosa obtained through de novo assembly of Illumina paired-end reads produced by total DNA sequencing. The total genome size ranges from 155,446 to 157,557 bp, with an inverted repeat (IR) of 24,013 to 24,391 bp, a large single copy region (LSC) of 87,984 to 88,337 bp and a small single copy region (SSC) of 20,332 to 20,336 bp. The genome encodes 113 different genes, including 79 unique protein-coding genes, 30 tRNA genes and 4 ribosomal RNA genes, with 16 duplicated in the inverted repeats, and a tRNA gene (trnfM-CAU) duplicated once in the LSC region. Comparisons of IR boundaries among four asterid species showed that IR/LSC borders were extended into the 5' portion of the psbA gene and IR contraction occurred in Actinidia. The clap gene has been lost from the chloroplast genome in Actinidia, and may have been transferred to the nucleus during chloroplast evolution. Twenty-seven polymorphic simple sequence repeat (SSR) loci were identified in the Actinidia chloroplast genome. Maximum parsimony analyses of a 72-gene, 16 taxa angiosperm dataset strongly support the placement of Actinidiaceae in Ericales within the basal asterids.

  9. Diversity and population-genetic properties of copy number variations and multicopy genes in cattle

    Science.gov (United States)

    The diversity and population-genetics of copy number variation (CNV) in domesticated animals are not well understood. In this study, we analyzed 75 genomes of major taurine and indicine cattle breeds (including Angus, Brahman, Gir, Holstein, Jersey, Limousin, Nelore, Romagnola), sequenced to 11-fold...

  10. In vitro selection of rape variants resistant to oxalic acid using haploid stem apexes

    International Nuclear Information System (INIS)

    Wang Yifei; Huang Jianhua; Lu Ruiju; Sun Yuefang; Zhou Runmei; Zhou Zhijiang; Xie Zhujie; Liu Chenghong

    2002-01-01

    Mutagenic treatment was made of the haploid stem apexes rape strain '9841' and '9885' with Pingyangmycin. As a result of positive selection with oxalic acid providing selection pressure, variants with significantly higher tolerance to oxalic acid than the original ones were obtained. 3 germplasm with significantly higher resistance to Sclerotinia sclerotiorum than cultivar Hu You 12 were selected from field test

  11. Efficient Algorithms for Analyzing Segmental Duplications, Deletions, and Inversions in Genomes

    Science.gov (United States)

    Kahn, Crystal L.; Mozes, Shay; Raphael, Benjamin J.

    Segmental duplications, or low-copy repeats, are common in mammalian genomes. In the human genome, most segmental duplications are mosaics consisting of pieces of multiple other segmental duplications. This complex genomic organization complicates analysis of the evolutionary history of these sequences. Earlier, we introduced a genomic distance, called duplication distance, that computes the most parsimonious way to build a target string by repeatedly copying substrings of a source string. We also showed how to use this distance to describe the formation of segmental duplications according to a two-step model that has been proposed to explain human segmental duplications. Here we describe polynomial-time exact algorithms for several extensions of duplication distance including models that allow certain types of substring deletions and inversions. These extensions will permit more biologically realistic analyses of segmental duplications in genomes.

  12. The whole chloroplast genome of wild rice (Oryza australiensis).

    Science.gov (United States)

    Wu, Zhiqiang; Ge, Song

    2016-01-01

    The whole chloroplast genome of wild rice (Oryza australiensis) is characterized in this study. The genome size is 135,224  bp, exhibiting a typical circular structure including a pair of 25,776  bp inverted repeats (IRa,b) separated by a large single-copy region (LSC) of 82,212  bp and a small single-copy region (SSC) of 12,470  bp. The overall GC content of the genome is 38.95%. 110 unique genes were annotated, including 76 protein-coding genes, 4 ribosomal RNA genes, and 30t RNA genes. Among these, 18 are duplicated in the inverted repeat regions, 13 genes contain one intron, and 2 genes (rps12 and ycf3) have two introns.

  13. SNP detection for massively parallel whole-genome resequencing

    DEFF Research Database (Denmark)

    Li, Ruiqiang; Li, Yingrui; Fang, Xiaodong

    2009-01-01

    -genome or target region resequencing. Here, we have developed a consensus-calling and SNP-detection method for sequencing-by-synthesis Illumina Genome Analyzer technology. We designed this method by carefully considering the data quality, alignment, and experimental errors common to this technology. All...... of this information was integrated into a single quality score for each base under Bayesian theory to measure the accuracy of consensus calling. We tested this methodology using a large-scale human resequencing data set of 36x coverage and assembled a high-quality nonrepetitive consensus sequence for 92.......25% of the diploid autosomes and 88.07% of the haploid X chromosome. Comparison of the consensus sequence with Illumina human 1M BeadChip genotyped alleles from the same DNA sample showed that 98.6% of the 37,933 genotyped alleles on the X chromosome and 98% of 999,981 genotyped alleles on autosomes were covered...

  14. The complete chloroplast genome sequence of the medicinal plant Andrographis paniculata.

    Science.gov (United States)

    Ding, Ping; Shao, Yanhua; Li, Qian; Gao, Junli; Zhang, Runjing; Lai, Xiaoping; Wang, Deqin; Zhang, Huiye

    2016-07-01

    The complete chloroplast genome of Andrographis paniculata, an important medicinal plant with great economic value, has been studied in this article. The genome size is 150,249 bp in length, with 38.3% GC content. A pair of inverted repeats (IRs, 25,300 bp) are separated by a large single copy region (LSC, 82,459 bp) and a small single-copy region (SSC, 17,190 bp). The chloroplast genome contains 114 unique genes, 80 protein-coding genes, 30 tRNA genes and 4 rRNA genes. In these genes, 15 genes contained 1 intron and 3 genes comprised of 2 introns.

  15. Gauge field copies

    International Nuclear Information System (INIS)

    Bollini, C.G.; Giambiagi, J.J.; Tiomno, J.

    1979-01-01

    The construction of field strength copies without any gauge constraint is discussed. Several examples are given, one of which is not only a field strength copy but also (at the same time) a 'current copy'. (author) [pt

  16. Genetic mapping of centromeres in the nine Citrus clementina chromosomes using half-tetrad analysis and recombination patterns in unreduced and haploid gametes.

    Science.gov (United States)

    Aleza, Pablo; Cuenca, José; Hernández, María; Juárez, José; Navarro, Luis; Ollitrault, Patrick

    2015-03-08

    Mapping centromere locations in plant species provides essential information for the analysis of genetic structures and population dynamics. The centromere's position affects the distribution of crossovers along a chromosome and the parental heterozygosity restitution by 2n gametes is a direct function of the genetic distance to the centromere. Sexual polyploidisation is relatively frequent in Citrus species and is widely used to develop new seedless triploid cultivars. The study's objectives were to (i) map the positions of the centromeres of the nine Citrus clementina chromosomes; (ii) analyse the crossover interference in unreduced gametes; and (iii) establish the pattern of genetic recombination in haploid clementine gametes along each chromosome and its relationship with the centromere location and distribution of genic sequences. Triploid progenies were derived from unreduced megagametophytes produced by second-division restitution. Centromere positions were mapped genetically for all linkage groups using half-tetrad analysis. Inference of the physical locations of centromeres revealed one acrocentric, four metacentric and four submetacentric chromosomes. Crossover interference was observed in unreduced gametes, with variation seen between chromosome arms. For haploid gametes, a strong decrease in the recombination rate occurred in centromeric and pericentromeric regions, which contained a low density of genic sequences. In chromosomes VIII and IX, these low recombination rates extended beyond the pericentromeric regions. The genomic region corresponding to a genetic distance recombination pattern along each chromosome. However, regions with low recombination rates extended beyond the pericentromeric regions of some chromosomes into areas richer in genic sequences. The persistence of strong linkage disequilibrium between large numbers of genes promotes the stability of epistatic interactions and multilocus-controlled traits over successive generations but

  17. Integrated analysis of gene expression, CpG island methylation, and gene copy number in breast cancer cells by deep sequencing.

    Directory of Open Access Journals (Sweden)

    Zhifu Sun

    Full Text Available We used deep sequencing technology to profile the transcriptome, gene copy number, and CpG island methylation status simultaneously in eight commonly used breast cell lines to develop a model for how these genomic features are integrated in estrogen receptor positive (ER+ and negative breast cancer. Total mRNA sequence, gene copy number, and genomic CpG island methylation were carried out using the Illumina Genome Analyzer. Sequences were mapped to the human genome to obtain digitized gene expression data, DNA copy number in reference to the non-tumor cell line (MCF10A, and methylation status of 21,570 CpG islands to identify differentially expressed genes that were correlated with methylation or copy number changes. These were evaluated in a dataset from 129 primary breast tumors. Gene expression in cell lines was dominated by ER-associated genes. ER+ and ER- cell lines formed two distinct, stable clusters, and 1,873 genes were differentially expressed in the two groups. Part of chromosome 8 was deleted in all ER- cells and part of chromosome 17 amplified in all ER+ cells. These loci encoded 30 genes that were overexpressed in ER+ cells; 9 of these genes were overexpressed in ER+ tumors. We identified 149 differentially expressed genes that exhibited differential methylation of one or more CpG islands within 5 kb of the 5' end of the gene and for which mRNA abundance was inversely correlated with CpG island methylation status. In primary tumors we identified 84 genes that appear to be robust components of the methylation signature that we identified in ER+ cell lines. Our analyses reveal a global pattern of differential CpG island methylation that contributes to the transcriptome landscape of ER+ and ER- breast cancer cells and tumors. The role of gene amplification/deletion appears to more modest, although several potentially significant genes appear to be regulated by copy number aberrations.

  18. The Role of Constitutional Copy Number Variants in Breast Cancer

    Science.gov (United States)

    Walker, Logan C.; Wiggins, George A.R.; Pearson, John F.

    2015-01-01

    Constitutional copy number variants (CNVs) include inherited and de novo deviations from a diploid state at a defined genomic region. These variants contribute significantly to genetic variation and disease in humans, including breast cancer susceptibility. Identification of genetic risk factors for breast cancer in recent years has been dominated by the use of genome-wide technologies, such as single nucleotide polymorphism (SNP)-arrays, with a significant focus on single nucleotide variants. To date, these large datasets have been underutilised for generating genome-wide CNV profiles despite offering a massive resource for assessing the contribution of these structural variants to breast cancer risk. Technical challenges remain in determining the location and distribution of CNVs across the human genome due to the accuracy of computational prediction algorithms and resolution of the array data. Moreover, better methods are required for interpreting the functional effect of newly discovered CNVs. In this review, we explore current and future application of SNP array technology to assess rare and common CNVs in association with breast cancer risk in humans. PMID:27600231

  19. Supervised classification of combined copy number and gene expression data

    Directory of Open Access Journals (Sweden)

    Riccadonna S.

    2007-12-01

    Full Text Available In this paper we apply a predictive profiling method to genome copy number aberrations (CNA in combination with gene expression and clinical data to identify molecular patterns of cancer pathophysiology. Predictive models and optimal feature lists for the platforms are developed by a complete validation SVM-based machine learning system. Ranked list of genome CNA sites (assessed by comparative genomic hybridization arrays – aCGH and of differentially expressed genes (assessed by microarray profiling with Affy HG-U133A chips are computed and combined on a breast cancer dataset for the discrimination of Luminal/ ER+ (Lum/ER+ and Basal-like/ER- classes. Different encodings are developed and applied to the CNA data, and predictive variable selection is discussed. We analyze the combination of profiling information between the platforms, also considering the pathophysiological data. A specific subset of patients is identified that has a different response to classification by chromosomal gains and losses and by differentially expressed genes, corroborating the idea that genomic CNA can represent an independent source for tumor classification.

  20. Dynamic Copy Number Evolution of X- and Y-Linked Ampliconic Genes in Human Populations

    DEFF Research Database (Denmark)

    Lucotte, Elise A; Skov, Laurits; Jensen, Jacob Malte

    2018-01-01

    we explore the evolution of human X- and Y-linked ampliconic genes by investigating copy number variation (CNV) and coding variation between populations using the Simons Genome Diversity Project. We develop a method to assess CNVs using the read-depth on modified X and Y chromosome targets containing...... related Y haplogroups, that diversified less than 50,000 years ago. Moreover, X and Y-linked ampliconic genes seem to have a faster amplification dynamic than autosomal multicopy genes. Looking at expression data from another study, we also find that XY-linked ampliconic genes with extensive copy number...

  1. Accurate, high-throughput typing of copy number variation using paralogue ratios from dispersed repeats.

    NARCIS (Netherlands)

    Armour, J.A.; Palla, R.; Zeeuwen, P.L.J.M.; Heijer, M. den; Schalkwijk, J.; Hollox, E.J.

    2007-01-01

    Recent work has demonstrated an unexpected prevalence of copy number variation in the human genome, and has highlighted the part this variation may play in predisposition to common phenotypes. Some important genes vary in number over a high range (e.g. DEFB4, which commonly varies between two and

  2. Copy number variation analysis of matched ovarian primary tumors and peritoneal metastasis.

    Directory of Open Access Journals (Sweden)

    Joel A Malek

    Full Text Available Ovarian cancer is the most deadly gynecological cancer. The high rate of mortality is due to the large tumor burden with extensive metastatic lesion of the abdominal cavity. Despite initial chemosensitivity and improved surgical procedures, abdominal recurrence remains an issue and results in patients' poor prognosis. Transcriptomic and genetic studies have revealed significant genome pathologies in the primary tumors and yielded important information regarding carcinogenesis. There are, however, few studies on genetic alterations and their consequences in peritoneal metastatic tumors when compared to their matched ovarian primary tumors. We used high-density SNP arrays to investigate copy number variations in matched primary and metastatic ovarian cancer from 9 patients. Here we show that copy number variations acquired by ovarian tumors are significantly different between matched primary and metastatic tumors and these are likely due to different functional requirements. We show that these copy number variations clearly differentially affect specific pathways including the JAK/STAT and cytokine signaling pathways. While many have shown complex involvement of cytokines in the ovarian cancer environment we provide evidence that ovarian tumors have specific copy number variation differences in many of these genes.

  3. Insights into structural variations and genome rearrangements in prokaryotic genomes.

    Science.gov (United States)

    Periwal, Vinita; Scaria, Vinod

    2015-01-01

    Structural variations (SVs) are genomic rearrangements that affect fairly large fragments of DNA. Most of the SVs such as inversions, deletions and translocations have been largely studied in context of genetic diseases in eukaryotes. However, recent studies demonstrate that genome rearrangements can also have profound impact on prokaryotic genomes, leading to altered cell phenotype. In contrast to single-nucleotide variations, SVs provide a much deeper insight into organization of bacterial genomes at a much better resolution. SVs can confer change in gene copy number, creation of new genes, altered gene expression and many other functional consequences. High-throughput technologies have now made it possible to explore SVs at a much refined resolution in bacterial genomes. Through this review, we aim to highlight the importance of the less explored field of SVs in prokaryotic genomes and their impact. We also discuss its potential applicability in the emerging fields of synthetic biology and genome engineering where targeted SVs could serve to create sophisticated and accurate genome editing. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  4. Mutagenesis and haploid culture for disease resistance in Brassica napus

    Energy Technology Data Exchange (ETDEWEB)

    MacDonald, M V; Ahmad, I; Ingram, D S [Botany School, University of Cambridge, Cambridge (United Kingdom)

    1990-01-01

    Full text: Most winter oilseed rape cultivars share parentage and therefore show little genetic diversity. There is no known resistance to Alternaria spp. in oilseed rape or in any related Brassica species. Experiments with tissue culture yielded only transient, non-genetic resistance. Therefore, mutagenesis may be used to generate heritable resistance to Alternaria spp. Gamma irradiation was applied to seeds of 'Bienvenue', secondary embryoids of cvs 'Primor' and 'Rapora', and buds of cvs 'Primor' and 'Ariana'. Isolated microspores from cv 'Ariana' and rapid cycling B. napus were also treated. The doses used ranged from 0-100 Gy for isolated microspores and buds, up to 600 Gy for seeds and 960 Gy for secondary embryoids. EMS was used to treat seeds of line WRG-42 (supplied by Nickersons RPB) and microspores of cv 'Bienvenue' and rapid cycling B. napus. Seeds were treated with up to 2.0% EMS for 0.2 h. before plating them on the culture medium. Seed irradiation up to 600 Gy did not reduce germination. M{sub 1} and M{sub 2} progenies were tested both in the laboratory and in field trials, and none of these were found to be resistant to Alternaria. However, considerable variation for other characters was observed. Haploid cultures from these plants were extremely difficult to regenerate, and for this reason no regenerant plants have been tested for resistance. For irradiated secondary embryoids the regeneration capacity decreased with increasing dose. Regenerated plants have been tested for resistance to Alternaria, but stable resistance was not observed. Haploid cultures were obtained from irradiated buds, using both anther and microspore culture. Low irradiation treatment was beneficial to developing embryoids. Some regenerants have been obtained from EMS treated microspores and seeds. Four plants have repeatedly given increased levels of resistance to A. brassicicola, and progenies are being tested to determine the genetic nature of the resistance. (author)

  5. Mutagenesis and haploid culture for disease resistance in Brassica napus

    International Nuclear Information System (INIS)

    MacDonald, M.V.; Ahmad, I.; Ingram, D.S.

    1990-01-01

    Full text: Most winter oilseed rape cultivars share parentage and therefore show little genetic diversity. There is no known resistance to Alternaria spp. in oilseed rape or in any related Brassica species. Experiments with tissue culture yielded only transient, non-genetic resistance. Therefore, mutagenesis may be used to generate heritable resistance to Alternaria spp. Gamma irradiation was applied to seeds of 'Bienvenue', secondary embryoids of cvs 'Primor' and 'Rapora', and buds of cvs 'Primor' and 'Ariana'. Isolated microspores from cv 'Ariana' and rapid cycling B. napus were also treated. The doses used ranged from 0-100 Gy for isolated microspores and buds, up to 600 Gy for seeds and 960 Gy for secondary embryoids. EMS was used to treat seeds of line WRG-42 (supplied by Nickersons RPB) and microspores of cv 'Bienvenue' and rapid cycling B. napus. Seeds were treated with up to 2.0% EMS for 0.2 h. before plating them on the culture medium. Seed irradiation up to 600 Gy did not reduce germination. M 1 and M 2 progenies were tested both in the laboratory and in field trials, and none of these were found to be resistant to Alternaria. However, considerable variation for other characters was observed. Haploid cultures from these plants were extremely difficult to regenerate, and for this reason no regenerant plants have been tested for resistance. For irradiated secondary embryoids the regeneration capacity decreased with increasing dose. Regenerated plants have been tested for resistance to Alternaria, but stable resistance was not observed. Haploid cultures were obtained from irradiated buds, using both anther and microspore culture. Low irradiation treatment was beneficial to developing embryoids. Some regenerants have been obtained from EMS treated microspores and seeds. Four plants have repeatedly given increased levels of resistance to A. brassicicola, and progenies are being tested to determine the genetic nature of the resistance. (author)

  6. Genome duplication and mutations in ACE2 cause multicellular, fast-sedimenting phenotypes in evolved Saccharomyces cerevisiae.

    Science.gov (United States)

    Oud, Bart; Guadalupe-Medina, Victor; Nijkamp, Jurgen F; de Ridder, Dick; Pronk, Jack T; van Maris, Antonius J A; Daran, Jean-Marc

    2013-11-05

    Laboratory evolution of the yeast Saccharomyces cerevisiae in bioreactor batch cultures yielded variants that grow as multicellular, fast-sedimenting clusters. Knowledge of the molecular basis of this phenomenon may contribute to the understanding of natural evolution of multicellularity and to manipulating cell sedimentation in laboratory and industrial applications of S. cerevisiae. Multicellular, fast-sedimenting lineages obtained from a haploid S. cerevisiae strain in two independent evolution experiments were analyzed by whole genome resequencing. The two evolved cell lines showed different frameshift mutations in a stretch of eight adenosines in ACE2, which encodes a transcriptional regulator involved in cell cycle control and mother-daughter cell separation. Introduction of the two ace2 mutant alleles into the haploid parental strain led to slow-sedimenting cell clusters that consisted of just a few cells, thus representing only a partial reconstruction of the evolved phenotype. In addition to single-nucleotide mutations, a whole-genome duplication event had occurred in both evolved multicellular strains. Construction of a diploid reference strain with two mutant ace2 alleles led to complete reconstruction of the multicellular-fast sedimenting phenotype. This study shows that whole-genome duplication and a frameshift mutation in ACE2 are sufficient to generate a fast-sedimenting, multicellular phenotype in S. cerevisiae. The nature of the ace2 mutations and their occurrence in two independent evolution experiments encompassing fewer than 500 generations of selective growth suggest that switching between unicellular and multicellular phenotypes may be relevant for competitiveness of S. cerevisiae in natural environments.

  7. Ploidy Variation in Kluyveromyces marxianus Separates Dairy and Non-dairy Isolates

    Directory of Open Access Journals (Sweden)

    Raúl A. Ortiz-Merino

    2018-03-01

    Full Text Available Kluyveromyces marxianus is traditionally associated with fermented dairy products, but can also be isolated from diverse non-dairy environments. Because of thermotolerance, rapid growth and other traits, many different strains are being developed for food and industrial applications but there is, as yet, little understanding of the genetic diversity or population genetics of this species. K. marxianus shows a high level of phenotypic variation but the only phenotype that has been clearly linked to a genetic polymorphism is lactose utilisation, which is controlled by variation in the LAC12 gene. The genomes of several strains have been sequenced in recent years and, in this study, we sequenced a further nine strains from different origins. Analysis of the Single Nucleotide Polymorphisms (SNPs in 14 strains was carried out to examine genome structure and genetic diversity. SNP diversity in K. marxianus is relatively high, with up to 3% DNA sequence divergence between alleles. It was found that the isolates include haploid, diploid, and triploid strains, as shown by both SNP analysis and flow cytometry. Diploids and triploids contain long genomic tracts showing loss of heterozygosity (LOH. All six isolates from dairy environments were diploid or triploid, whereas 6 out 7 isolates from non-dairy environment were haploid. This also correlated with the presence of functional LAC12 alleles only in dairy haplotypes. The diploids were hybrids between a non-dairy and a dairy haplotype, whereas triploids included three copies of a dairy haplotype.

  8. The first complete chloroplast genome sequence of a lycophyte,Huperzia lucidula (Lycopodiaceae)

    Energy Technology Data Exchange (ETDEWEB)

    Wolf, Paul G.; Karol, Kenneth G.; Mandoli, Dina F.; Kuehl,Jennifer V.; Arumuganathan, K.; Ellis, Mark W.; Mishler, Brent D.; Kelch,Dean G.; Olmstead, Richard G.; Boore, Jeffrey L.

    2005-02-01

    We used a unique combination of techniques to sequence the first complete chloroplast genome of a lycophyte, Huperzia lucidula. This plant belongs to a significant clade hypothesized to represent the sister group to all other vascular plants. We used fluorescence-activated cell sorting (FACS) to isolate the organelles, rolling circle amplification (RCA) to amplify the genome, and shotgun sequencing to 8x depth coverage to obtain the complete chloroplast genome sequence. The genome is 154,373bp, containing inverted repeats of 15,314 bp each, a large single-copy region of 104,088 bp, and a small single-copy region of 19,671 bp. Gene order is more similar to those of mosses, liverworts, and hornworts than to gene order for other vascular plants. For example, the Huperziachloroplast genome possesses the bryophyte gene order for a previously characterized 30 kb inversion, thus supporting the hypothesis that lycophytes are sister to all other extant vascular plants. The lycophytechloroplast genome data also enable a better reconstruction of the basaltracheophyte genome, which is useful for inferring relationships among bryophyte lineages. Several unique characters are observed in Huperzia, such as movement of the gene ndhF from the small single copy region into the inverted repeat. We present several analyses of evolutionary relationships among land plants by using nucleotide data, amino acid sequences, and by comparing gene arrangements from chloroplast genomes. The results, while still tentative pending the large number of chloroplast genomes from other key lineages that are soon to be sequenced, are intriguing in themselves, and contribute to a growing comparative database of genomic and morphological data across the green plants.

  9. Exceptional diversity, non-random distribution, and rapid evolution of retroelements in the B73 maize genome.

    Directory of Open Access Journals (Sweden)

    Regina S Baucom

    2009-11-01

    Full Text Available Recent comprehensive sequence analysis of the maize genome now permits detailed discovery and description of all transposable elements (TEs in this complex nuclear environment. Reiteratively optimized structural and homology criteria were used in the computer-assisted search for retroelements, TEs that transpose by reverse transcription of an RNA intermediate, with the final results verified by manual inspection. Retroelements were found to occupy the majority (>75% of the nuclear genome in maize inbred B73. Unprecedented genetic diversity was discovered in the long terminal repeat (LTR retrotransposon class of retroelements, with >400 families (>350 newly discovered contributing >31,000 intact elements. The two other classes of retroelements, SINEs (four families and LINEs (at least 30 families, were observed to contribute 1,991 and approximately 35,000 copies, respectively, or a combined approximately 1% of the B73 nuclear genome. With regard to fully intact elements, median copy numbers for all retroelement families in maize was 2 because >250 LTR retrotransposon families contained only one or two intact members that could be detected in the B73 draft sequence. The majority, perhaps all, of the investigated retroelement families exhibited non-random dispersal across the maize genome, with LINEs, SINEs, and many low-copy-number LTR retrotransposons exhibiting a bias for accumulation in gene-rich regions. In contrast, most (but not all medium- and high-copy-number LTR retrotransposons were found to preferentially accumulate in gene-poor regions like pericentromeric heterochromatin, while a few high-copy-number families exhibited the opposite bias. Regions of the genome with the highest LTR retrotransposon density contained the lowest LTR retrotransposon diversity. These results indicate that the maize genome provides a great number of different niches for the survival and procreation of a great variety of retroelements that have evolved to

  10. Nuclear fusion and genome encounter during yeast zygote formation.

    Science.gov (United States)

    Tartakoff, Alan Michael; Jaiswal, Purnima

    2009-06-01

    When haploid cells of Saccharomyces cerevisiae are crossed, parental nuclei congress and fuse with each other. To investigate underlying mechanisms, we have developed assays that evaluate the impact of drugs and mutations. Nuclear congression is inhibited by drugs that perturb the actin and tubulin cytoskeletons. Nuclear envelope (NE) fusion consists of at least five steps in which preliminary modifications are followed by controlled flux of first outer and then inner membrane proteins, all before visible dilation of the waist of the nucleus or coalescence of the parental spindle pole bodies. Flux of nuclear pore complexes occurs after dilation. Karyogamy requires both the Sec18p/NSF ATPase and ER/NE luminal homeostasis. After fusion, chromosome tethering keeps tagged parental genomes separate from each other. The process of NE fusion and evidence of genome independence in yeast provide a prototype for understanding related events in higher eukaryotes.

  11. Exact Markov chains versus diffusion theory for haploid random mating.

    Science.gov (United States)

    Tyvand, Peder A; Thorvaldsen, Steinar

    2010-05-01

    Exact discrete Markov chains are applied to the Wright-Fisher model and the Moran model of haploid random mating. Selection and mutations are neglected. At each discrete value of time t there is a given number n of diploid monoecious organisms. The evolution of the population distribution is given in diffusion variables, to compare the two models of random mating with their common diffusion limit. Only the Moran model converges uniformly to the diffusion limit near the boundary. The Wright-Fisher model allows the population size to change with the generations. Diffusion theory tends to under-predict the loss of genetic information when a population enters a bottleneck. 2010 Elsevier Inc. All rights reserved.

  12. Figure 4 from Integrative Genomics Viewer: Visualizing Big Data | Office of Cancer Genomics

    Science.gov (United States)

    Gene-list view of genomic data. The gene-list view allows users to compare data across a set of loci. The data in this figure includes copy number, mutation, and clinical data from 202 glioblastoma samples from TCGA. Adapted from Figure 7; Thorvaldsdottir H et al. 2012

  13. Reduced representation approaches to interrogate genome diversity in large repetitive plant genomes.

    Science.gov (United States)

    Hirsch, Cory D; Evans, Joseph; Buell, C Robin; Hirsch, Candice N

    2014-07-01

    Technology and software improvements in the last decade now provide methodologies to access the genome sequence of not only a single accession, but also multiple accessions of plant species. This provides a means to interrogate species diversity at the genome level. Ample diversity among accessions in a collection of species can be found, including single-nucleotide polymorphisms, insertions and deletions, copy number variation and presence/absence variation. For species with small, non-repetitive rich genomes, re-sequencing of query accessions is robust, highly informative, and economically feasible. However, for species with moderate to large sized repetitive-rich genomes, technical and economic barriers prevent en masse genome re-sequencing of accessions. Multiple approaches to access a focused subset of loci in species with larger genomes have been developed, including reduced representation sequencing, exome capture and transcriptome sequencing. Collectively, these approaches have enabled interrogation of diversity on a genome scale for large plant genomes, including crop species important to worldwide food security. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  14. The bipartite mitochondrial genome of Ruizia karukerae (Rhigonematomorpha, Nematoda).

    Science.gov (United States)

    Kim, Taeho; Kern, Elizabeth; Park, Chungoo; Nadler, Steven A; Bae, Yeon Jae; Park, Joong-Ki

    2018-05-10

    Mitochondrial genes and whole mitochondrial genome sequences are widely used as molecular markers in studying population genetics and resolving both deep and shallow nodes in phylogenetics. In animals the mitochondrial genome is generally composed of a single chromosome, but mystifying exceptions sometimes occur. We determined the complete mitochondrial genome of the millipede-parasitic nematode Ruizia karukerae and found its mitochondrial genome consists of two circular chromosomes, which is highly unusual in bilateral animals. Chromosome I is 7,659 bp and includes six protein-coding genes, two rRNA genes and nine tRNA genes. Chromosome II comprises 7,647 bp, with seven protein-coding genes and 16 tRNA genes. Interestingly, both chromosomes share a 1,010 bp sequence containing duplicate copies of cox2 and three tRNA genes (trnD, trnG and trnH), and the nucleotide sequences between the duplicated homologous gene copies are nearly identical, suggesting a possible recent genesis for this bipartite mitochondrial genome. Given that little is known about the formation, maintenance or evolution of abnormal mitochondrial genome structures, R. karukerae mtDNA may provide an important early glimpse into this process.

  15. The complete chloroplast genome sequence of Dodonaea viscosa: comparative and phylogenetic analyses.

    Science.gov (United States)

    Saina, Josphat K; Gichira, Andrew W; Li, Zhi-Zhong; Hu, Guang-Wan; Wang, Qing-Feng; Liao, Kuo

    2018-02-01

    The plant chloroplast (cp) genome is a highly conserved structure which is beneficial for evolution and systematic research. Currently, numerous complete cp genome sequences have been reported due to high throughput sequencing technology. However, there is no complete chloroplast genome of genus Dodonaea that has been reported before. To better understand the molecular basis of Dodonaea viscosa chloroplast, we used Illumina sequencing technology to sequence its complete genome. The whole length of the cp genome is 159,375 base pairs (bp), with a pair of inverted repeats (IRs) of 27,099 bp separated by a large single copy (LSC) 87,204 bp, and small single copy (SSC) 17,972 bp. The annotation analysis revealed a total of 115 unique genes of which 81 were protein coding, 30 tRNA, and four ribosomal RNA genes. Comparative genome analysis with other closely related Sapindaceae members showed conserved gene order in the inverted and single copy regions. Phylogenetic analysis clustered D. viscosa with other species of Sapindaceae with strong bootstrap support. Finally, a total of 249 SSRs were detected. Moreover, a comparison of the synonymous (Ks) and nonsynonymous (Ka) substitution rates in D. viscosa showed very low values. The availability of cp genome reported here provides a valuable genetic resource for comprehensive further studies in genetic variation, taxonomy and phylogenetic evolution of Sapindaceae family. In addition, SSR markers detected will be used in further phylogeographic and population structure studies of the species in this genus.

  16. Mutation induction in haploid yeast after split-dose radiation-exposure. I. Fractionated UV-irradiation.

    Science.gov (United States)

    Schenk, K; Zölzer, F; Kiefer, J

    1989-01-01

    Mutation induction was investigated in wild-type haploid yeast Saccharomyces cerevisiae after split-dose UV-irradiation. Cells were exposed to fractionated 254 nm-UV-doses separated by intervals from 0 to 6 h with incubation either on non-nutrient or nutrient agar between. The test parameter was resistance to canavanine. If modifications of sensitivity due to incubation are appropriately taken into account there is no change of mutation frequency.

  17. Citrus sinensis annotation project (CAP): a comprehensive database for sweet orange genome.

    Science.gov (United States)

    Wang, Jia; Chen, Dijun; Lei, Yang; Chang, Ji-Wei; Hao, Bao-Hai; Xing, Feng; Li, Sen; Xu, Qiang; Deng, Xiu-Xin; Chen, Ling-Ling

    2014-01-01

    Citrus is one of the most important and widely grown fruit crop with global production ranking firstly among all the fruit crops in the world. Sweet orange accounts for more than half of the Citrus production both in fresh fruit and processed juice. We have sequenced the draft genome of a double-haploid sweet orange (C. sinensis cv. Valencia), and constructed the Citrus sinensis annotation project (CAP) to store and visualize the sequenced genomic and transcriptome data. CAP provides GBrowse-based organization of sweet orange genomic data, which integrates ab initio gene prediction, EST, RNA-seq and RNA-paired end tag (RNA-PET) evidence-based gene annotation. Furthermore, we provide a user-friendly web interface to show the predicted protein-protein interactions (PPIs) and metabolic pathways in sweet orange. CAP provides comprehensive information beneficial to the researchers of sweet orange and other woody plants, which is freely available at http://citrus.hzau.edu.cn/.

  18. Genome-wide gene copy number and expression analysis of primary gastric tumors and gastric cancer cell lines

    International Nuclear Information System (INIS)

    Junnila, Siina; Kokkola, Arto; Karjalainen-Lindsberg, Marja-Liisa; Puolakkainen, Pauli; Monni, Outi

    2010-01-01

    Gastric cancer is one of the most common malignancies worldwide and the second most common cause of cancer related death. Gene copy number alterations play an important role in the development of gastric cancer and a change in gene copy number is one of the main mechanisms for a cancer cell to control the expression of potential oncogenes and tumor suppressor genes. To highlight genes of potential biological and clinical relevance in gastric cancer, we carried out a systematic array-based survey of gene expression and copy number levels in primary gastric tumors and gastric cancer cell lines and validated the results using an affinity capture based transcript analysis (TRAC assay) and real-time qRT-PCR. Integrated microarray analysis revealed altogether 256 genes that were located in recurrent regions of gains or losses and had at least a 2-fold copy number- associated change in their gene expression. The expression levels of 13 of these genes, ALPK2, ASAP1, CEACAM5, CYP3A4, ENAH, ERBB2, HHIPL2, LTB4R, MMP9, PERLD1, PNMT, PTPRA, and OSMR, were validated in a total of 118 gastric samples using either the qRT-PCR or TRAC assay. All of these 13 genes were differentially expressed between cancerous samples and nonmalignant tissues (p < 0.05) and the association between copy number and gene expression changes was validated for nine (69.2%) of these genes (p < 0.05). In conclusion, integrated gene expression and copy number microarray analysis highlighted genes that may be critically important for gastric carcinogenesis. TRAC and qRT-PCR analyses validated the microarray results and therefore the role of these genes as potential biomarkers for gastric cancer

  19. Scaling up Copy Detection

    OpenAIRE

    Li, Xian; Dong, Xin Luna; Lyons, Kenneth B.; Meng, Weiyi; Srivastava, Divesh

    2015-01-01

    Recent research shows that copying is prevalent for Deep-Web data and considering copying can significantly improve truth finding from conflicting values. However, existing copy detection techniques do not scale for large sizes and numbers of data sources, so truth finding can be slowed down by one to two orders of magnitude compared with the corresponding techniques that do not consider copying. In this paper, we study {\\em how to improve scalability of copy detection on structured data}. Ou...

  20. Molecular characterization, genomic distribution and evolutionary dynamics of Short INterspersed Elements in the termite genome.

    Science.gov (United States)

    Luchetti, Andrea; Mantovani, Barbara

    2011-02-01

    Short INterspersed Elements (SINEs) in invertebrates, and especially in animal inbred genomes such that of termites, are poorly known; in this paper we characterize three new SINE families (Talub, Taluc and Talud) through the analyses of 341 sequences, either isolated from the Reticulitermes lucifugus genome or drawn from EST Genbank collection. We further add new data to the only isopteran element known so far, Talua. These SINEs are tRNA-derived elements, with an average length ranging from 258 to 372 bp. The tails are made up by poly(A) or microsatellite motifs. Their copy number varies from 7.9 × 10(3) to 10(5) copies, well within the range observed for other metazoan genomes. Species distribution, age and target site duplication analysis indicate Talud as the oldest, possibly inactive SINE originated before the onset of Isoptera (~150 Myr ago). Taluc underwent to substantial sequence changes throughout the evolution of termites and data suggest it was silenced and then re-activated in the R. lucifugus lineage. Moreover, Taluc shares a conserved sequence block with other unrelated SINEs, as observed for some vertebrate and cephalopod elements. The study of genomic environment showed that insertions are mainly surrounded by microsatellites and other SINEs, indicating a biased accumulation within non-coding regions. The evolutionary dynamics of Talu~ elements is explained through selective mechanisms acting in an inbred genome; in this respect, the study of termites' SINEs activity may provide an interesting framework to address the (co)evolution of mobile elements and the host genome.

  1. Retrotransposon Domestication and Control in Dictyostelium discoideum

    Directory of Open Access Journals (Sweden)

    Marek Malicki

    2017-10-01

    Full Text Available Transposable elements, identified in all eukaryotes, are mobile genetic units that can change their genomic position. Transposons usually employ an excision and reintegration mechanism, by which they change position, but not copy number. In contrast, retrotransposons amplify via RNA intermediates, increasing their genomic copy number. Hence, they represent a particular threat to the structural and informational integrity of the invaded genome. The social amoeba Dictyostelium discoideum, model organism of the evolutionary Amoebozoa supergroup, features a haploid, gene-dense genome that offers limited space for damage-free transposition. Several of its contemporary retrotransposons display intrinsic integration preferences, for example by inserting next to transfer RNA genes or other retroelements. Likely, any retrotransposons that invaded the genome of the amoeba in a non-directed manner were lost during evolution, as this would result in decreased fitness of the organism. Thus, the positional preference of the Dictyostelium retroelements might represent a domestication of the selfish elements. Likewise, the reduced danger of such domesticated transposable elements led to their accumulation, and they represent about 10% of the current genome of D. discoideum. To prevent the uncontrolled spreading of retrotransposons, the amoeba employs control mechanisms including RNA interference and heterochromatization. Here, we review TRE5-A, DIRS-1 and Skipper-1, as representatives of the three retrotransposon classes in D. discoideum, which make up 5.7% of the Dictyostelium genome. We compile open questions with respect to their mobility and cellular regulation, and suggest strategies, how these questions might be addressed experimentally.

  2. An estimation of the prevalence of genomic disorders using chromosomal microarray data.

    Science.gov (United States)

    Gillentine, Madelyn A; Lupo, Philip J; Stankiewicz, Pawel; Schaaf, Christian P

    2018-04-24

    Multiple genomic disorders result from recurrent deletions or duplications between low copy repeat (LCR) clusters, mediated by nonallelic homologous recombination. These copy number variants (CNVs) often exhibit variable expressivity and/or incomplete penetrance. However, the population prevalence of many genomic disorders has not been estimated accurately. A subset of genomic disorders similarly characterized by CNVs between LCRs have been studied epidemiologically, including Williams-Beuren syndrome (7q11.23), Smith-Magenis syndrome (17p11.2), velocardiofacial syndrome (22q11.21), Prader-Willi/Angelman syndromes (15q11.2q12), 17q12 deletion syndrome, and Charcot-Marie-Tooth neuropathy type 1/hereditary neuropathy with liability to pressure palsy (PMP22, 17q11.2). We have generated a method to estimate prevalence of highly penetrant genomic disorders by (1) leveraging epidemiological data for genomic disorders with previously reported prevalence estimates, (2) obtaining chromosomal microarray data on genomic disorders from a large medical genetics clinic; and (3) utilizing these in a linear regression model to determine the prevalence of this syndromic copy number change among the general population. Using our algorithm, the prevalence for five clinically relevant recurrent genomic disorders: 1q21.1 microdeletion (1/6882 live births) and microduplication syndromes (1/6309), 15q13.3 microdeletion syndrome (1/5525), and 16p11.2 microdeletion (1/3021) and microduplication syndromes (1/4216), were determined. These findings will inform epidemiological strategies for evaluating those conditions, and our method may be useful to evaluate the prevalence of other highly penetrant genomic disorders.

  3. Inheritance patterns of the response to in vitro doubled haploid induction in perennial ryegrass (Lolium perenne L.)

    Czech Academy of Sciences Publication Activity Database

    Begheyn, R. F.; Roulund, N.; Vangsgaard, K.; Kopecký, David; Studer, B.

    2017-01-01

    Roč. 130, č. 3 (2017), s. 667-679 ISSN 0167-6857 Institutional support: RVO:61389030 Keywords : Androgenesis * Androgenic capacity * Anther culture * Doubled haploid (DH) * Perennial ryegrass (Lolium perenne L.) Subject RIV: EB - Genetics ; Molecular Biology OBOR OECD: Plant sciences, botany Impact factor: 2.002, year: 2016

  4. Generation of single-copy transgenic mouse embryos directly from ES cells by tetraploid embryo complementation

    Directory of Open Access Journals (Sweden)

    Zhao Roong

    2001-12-01

    Full Text Available Abstract Background Transgenic mice have been used extensively to analyze gene function. Unfortunately, traditional transgenic procedures have only limited use in analyzing alleles that cause lethality because lines of founder mice cannot be established. This is frustrating given that such alleles often reveal crucial aspects of gene function. For this reason techniques that facilitate the generation of embryos expressing such alleles would be of enormous benefit. Although the transient generation of transgenic embryos has allowed limited analysis of lethal alleles, it is expensive, time consuming and technically challenging. Moreover a fundamental limitation with this approach is that each embryo generated is unique and transgene expression is highly variable due to the integration of different transgene copy numbers at random genomic sites. Results Here we describe an alternative method that allows the generation of clonal mouse embryos harboring a single-copy transgene at a defined genomic location. This was facilitated through the production of Hprt negative embryonic stem cells that allow the derivation of embryos by tetraploid embryo complementation. We show that targeting transgenes to the hprt locus in these ES cells by homologous recombination can be efficiently selected by growth in HAT medium. Moreover, embryos derived solely from targeted ES cells containing a single copy LacZ transgene under the control of the α-myosin heavy chain promoter exhibited the expected cardiac specific expression pattern. Conclusion Our results demonstrate that tetraploid embryo complementation by F3 hprt negative ES cells facilitates the generation of transgenic mouse embryos containing a single copy gene at a defined genomic locus. This approach is simple, extremely efficient and bypasses any requirement to generate chimeric mice. Moreover embryos generated by this procedure are clonal in that they are all derived from a single ES cell lines. This

  5. The human homolog of S. cerevisiae CDC27, CDC27 Hs, is encoded by a highly conserved intronless gene present in multiple copies in the human genome

    Energy Technology Data Exchange (ETDEWEB)

    Devor, E.J.; Dill-Devor, R.M. [Univ. of Iowa College of Medicine, Iowa City (United States)

    1994-09-01

    We have obtained a number of unique sequences via PCR amplification of human genomic DNA using degenerate primers under low stringency (42{degrees}C). One of these, an 853 bp product, has been identified as a partial genomic sequence of the human homolog of the S. cerevisiae CDC27 gene, CDC27Hs (GenBank No. U00001). This gene, reported by Turgendreich et al. is also designated EST00556 from Adams et al. We have undertaken a more detailed examination of our sequence, MCP34N, and have found that: 1. the genomic sequence is nearly identical to CDC27Hs over its entire 853 bp length; 2. an MCP34N-specific PCR assay of several non-human primate species reveals amplification products in chimpanzee and gorilla genomes having greater than 90% sequence identity with CDC27Hs; and 3. an MCP34N-specific PCR assay of the BIOS hybrid cell line panel gives a discordancy pattern suggesting multiple loci. Based upon these data, we present the following initial characterization: 1. the complete MCP34N sequence identity with CDC27Hs indicates that the latter is encoded by an intronless gene; 2. CDC27Hs is highly conserved among higher primates; and 3. CDC27Hs is present in multiple copies in the human genome. These characteristics, taken together with those initially reported for CDC27Hs, suggest that this is an old gene that carries out an important but, as yet, unknown function in the human brain.

  6. Parallel Evolution of Copy-Number Variation across Continents in Drosophila melanogaster.

    Science.gov (United States)

    Schrider, Daniel R; Hahn, Matthew W; Begun, David J

    2016-05-01

    Genetic differentiation across populations that is maintained in the presence of gene flow is a hallmark of spatially varying selection. In Drosophila melanogaster, the latitudinal clines across the eastern coasts of Australia and North America appear to be examples of this type of selection, with recent studies showing that a substantial portion of the D. melanogaster genome exhibits allele frequency differentiation with respect to latitude on both continents. As of yet there has been no genome-wide examination of differentiated copy-number variants (CNVs) in these geographic regions, despite their potential importance for phenotypic variation in Drosophila and other taxa. Here, we present an analysis of geographic variation in CNVs in D. melanogaster. We also present the first genomic analysis of geographic variation for copy-number variation in the sister species, D. simulans, in order to investigate patterns of parallel evolution in these close relatives. In D. melanogaster we find hundreds of CNVs, many of which show parallel patterns of geographic variation on both continents, lending support to the idea that they are influenced by spatially varying selection. These findings support the idea that polymorphic CNVs contribute to local adaptation in D. melanogaster In contrast, we find very few CNVs in D. simulans that are geographically differentiated in parallel on both continents, consistent with earlier work suggesting that clinal patterns are weaker in this species. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  7. Genome-wide copy number variation study associates metabotropic glutamate receptor gene networks with attention deficit hyperactivity disorder

    Science.gov (United States)

    Elia, Josephine; Glessner, Joseph T; Wang, Kai; Takahashi, Nagahide; Shtir, Corina J; Hadley, Dexter; Sleiman, Patrick M A; Zhang, Haitao; Kim, Cecilia E; Robison, Reid; Lyon, Gholson J; Flory, James H; Bradfield, Jonathan P; Imielinski, Marcin; Hou, Cuiping; Frackelton, Edward C; Chiavacci, Rosetta M; Sakurai, Takeshi; Rabin, Cara; Middleton, Frank A; Thomas, Kelly A; Garris, Maria; Mentch, Frank; Freitag, Christine M; Steinhausen, Hans-Christoph; Todorov, Alexandre A; Reif, Andreas; Rothenberger, Aribert; Franke, Barbara; Mick, Eric O; Roeyers, Herbert; Buitelaar, Jan; Lesch, Klaus-Peter; Banaschewski, Tobias; Ebstein, Richard P; Mulas, Fernando; Oades, Robert D; Sergeant, Joseph; Sonuga-Barke, Edmund; Renner, Tobias J; Romanos, Marcel; Romanos, Jasmin; Warnke, Andreas; Walitza, Susanne; Meyer, Jobst; Pálmason, Haukur; Seitz, Christiane; Loo, Sandra K; Smalley, Susan L; Biederman, Joseph; Kent, Lindsey; Asherson, Philip; Anney, Richard J L; Gaynor, J William; Shaw, Philip; Devoto, Marcella; White, Peter S; Grant, Struan F A; Buxbaum, Joseph D; Rapoport, Judith L; Williams, Nigel M; Nelson, Stanley F; Faraone, Stephen V; Hakonarson, Hakon

    2014-01-01

    Attention deficit hyperactivity disorder (ADHD) is a common, heritable neuropsychiatric disorder of unknown etiology. We performed a whole-genome copy number variation (CNV) study on 1,013 cases with ADHD and 4,105 healthy children of European ancestry using 550,000 SNPs. We evaluated statistically significant findings in multiple independent cohorts, with a total of 2,493 cases with ADHD and 9,222 controls of European ancestry, using matched platforms. CNVs affecting metabotropic glutamate receptor genes were enriched across all cohorts (P = 2.1 × 10−9). We saw GRM5 (encoding glutamate receptor, metabotropic 5) deletions in ten cases and one control (P = 1.36 × 10−6). We saw GRM7 deletions in six cases, and we saw GRM8 deletions in eight cases and no controls. GRM1 was duplicated in eight cases. We experimentally validated the observed variants using quantitative RT-PCR. A gene network analysis showed that genes interacting with the genes in the GRM family are enriched for CNVs in ~10% of the cases (P = 4.38 × 10−10) after correction for occurrence in the controls. We identified rare recurrent CNVs affecting glutamatergic neurotransmission genes that were overrepresented in multiple ADHD cohorts. PMID:22138692

  8. GeneBreak: detection of recurrent DNA copy number aberration-associated chromosomal breakpoints within genes [version 2; referees: 2 approved

    Directory of Open Access Journals (Sweden)

    Evert van den Broek

    2017-07-01

    Full Text Available Development of cancer is driven by somatic alterations, including numerical and structural chromosomal aberrations. Currently, several computational methods are available and are widely applied to detect numerical copy number aberrations (CNAs of chromosomal segments in tumor genomes. However, there is lack of computational methods that systematically detect structural chromosomal aberrations by virtue of the genomic location of CNA-associated chromosomal breaks and identify genes that appear non-randomly affected by chromosomal breakpoints across (large series of tumor samples. ‘GeneBreak’ is developed to systematically identify genes recurrently affected by the genomic location of chromosomal CNA-associated breaks by a genome-wide approach, which can be applied to DNA copy number data obtained by array-Comparative Genomic Hybridization (CGH or by (low-pass whole genome sequencing (WGS. First, ‘GeneBreak’ collects the genomic locations of chromosomal CNA-associated breaks that were previously pinpointed by the segmentation algorithm that was applied to obtain CNA profiles. Next, a tailored annotation approach for breakpoint-to-gene mapping is implemented. Finally, dedicated cohort-based statistics is incorporated with correction for covariates that influence the probability to be a breakpoint gene. In addition, multiple testing correction is integrated to reveal recurrent breakpoint events. This easy-to-use algorithm, ‘GeneBreak’, is implemented in R (www.cran.r-project.org and is available from Bioconductor (www.bioconductor.org/packages/release/bioc/html/GeneBreak.html.

  9. Functional noncoding sequences derived from SINEs in the mammalian genome.

    Science.gov (United States)

    Nishihara, Hidenori; Smit, Arian F A; Okada, Norihiro

    2006-07-01

    Recent comparative analyses of mammalian sequences have revealed that a large number of nonprotein-coding genomic regions are under strong selective constraint. Here, we report that some of these loci have been derived from a newly defined family of ancient SINEs (short interspersed repetitive elements). This is a surprising result, as SINEs and other transposable elements are commonly thought to be genomic parasites. We named the ancient SINE family AmnSINE1, for Amniota SINE1, because we found it to be present in mammals as well as in birds, and some copies predate the mammalian-bird split 310 million years ago (Mya). AmnSINE1 has a chimeric structure of a 5S rRNA and a tRNA-derived SINE, and is related to five tRNA-derived SINE families that we characterized here in the coelacanth, dogfish shark, hagfish, and amphioxus genomes. All of the newly described SINE families have a common central domain that is also shared by zebrafish SINE3, and we collectively name them the DeuSINE (Deuterostomia SINE) superfamily. Notably, of the approximately 1000 still identifiable copies of AmnSINE1 in the human genome, 105 correspond to loci phylogenetically highly conserved among mammalian orthologs. The conservation is strongest over the central domain. Thus, AmnSINE1 appears to be the best example of a transposable element of which a significant fraction of the copies have acquired genomic functionality.

  10. Genomic characterization of recurrent high-grade astroblastoma.

    Science.gov (United States)

    Bale, Tejus A; Abedalthagafi, Malak; Bi, Wenya Linda; Kang, Yun Jee; Merrill, Parker; Dunn, Ian F; Dubuc, Adrian; Charbonneau, Sarah K; Brown, Loreal; Ligon, Azra H; Ramkissoon, Shakti H; Ligon, Keith L

    2016-01-01

    Astroblastomas are rare primary brain tumors, diagnosed based on histologic features. Not currently assigned a WHO grade, they typically display indolent behavior, with occasional variants taking a more aggressive course. We characterized the immunohistochemical characteristics, copy number (high-resolution array comparative genomic hybridization, OncoCopy) and mutational profile (targeted next-generation exome sequencing, OncoPanel) of a cohort of seven biopsies from four patients to identify recurrent genomic events that may help distinguish astroblastomas from other more common high-grade gliomas. We found that tumor histology was variable across patients and between primary and recurrent tumor samples. No common molecular features were identified among the four tumors. Mutations commonly observed in astrocytic tumors (IDH1/2, TP53, ATRX, and PTEN) or ependymoma were not identified. However one case with rapid clinical progression displayed mutations more commonly associated with GBM (NF1(N1054H/K63)*, PIK3CA(R38H) and ERG(A403T)). Conversely, another case, originally classified as glioblastoma with nine-year survival before recurrence, lacked a GBM mutational profile. Other mutations frequently seen in lower grade gliomas (BCOR, BCORL1, ERBB3, MYB, ATM) were also present in several tumors. Copy number changes were variable across tumors. Our findings indicate that astroblastomas have variable growth patterns and morphologic features, posing significant challenges to accurate classification in the absence of diagnostically specific copy number alterations and molecular features. Their histopathologic overlap with glioblastoma will likely confound the observation of long-term GBM "survivors". Further genomic profiling is needed to determine whether these tumors represent a distinct entity and to guide management strategies. Copyright © 2016 Elsevier Inc. All rights reserved.

  11. Global gene transcription patterns in in vitro-cultured fertilized embryos and diploid and haploid murine parthenotes

    International Nuclear Information System (INIS)

    Cui Xiangshun; Li Xingyu; Kim, Nam-Hyung

    2007-01-01

    To gain insights into the roles the paternal genome and chromosome number play in pre-implantation development, we cultured fertilized embryos and diploid and haploid parthenotes (DPs and HPs, respectively), and compared their development and gene expression patterns. The DPs and fertilized embryos did not differ in developmental ability but HPs development was slower and characterized by impaired compaction and blastocoel formation. Microarray analysis revealed that fertilized blastocysts expressed several genes at higher levels than DP blastocysts; these included the Y-chromosome-specific gene eukaryotic translation initiation factor 2, subunit 3, structural gene Y-linked (Eif2s3y) and the imprinting gene U2 small nuclear ribonucleoprotein auxiliary factor 1, related sequence 1 (U2af1-rs1). We also found that when DPs and HPs were both harvested at 44 and 58 h of culture, they differed in the expression of 38 and 665 genes, respectively. However, when DPs and HPs were harvested at the midpoints of 4-cell stage (44 and 49 h, respectively), no differences in expression was observed. Similarly, when the DPs and HPs were harvested when they became blastocysts (102 and 138 h, respectively), only 15 genes showed disparate expression. These results suggest that while transcripts needed for early development are delayed in HPs, it does progress sufficiently for the generation of the various developmental stages despite the lack of genetic components

  12. Sexagesimal scale for mapping human genome Escala sexagesimal para mapear el genoma humano

    Directory of Open Access Journals (Sweden)

    RICARDO CRUZ-COKE

    2001-03-01

    Full Text Available In a previous work I designed a diagram of the human genome based on a circular ideogram of the haploid set of chromosomes, using a low resolution scale of Megabase units. The purpose of this work is to draft a new scale to measure the physical map of the human genome at the highest resolution level. The entire length of the haploid genome of males is deployed in a circumference, marked with a sexagesimal scale with 360 degrees and 1296000 arc seconds. The radio of this circunference displays a semilogaritmic metric scale from 1 m up to the nanometer level. The base pair level of DNA sequences, 10-9 of this circunsference, is measured in milliarsec unit (mas, equivalent to a thousand of arcsecond. The "mas" unit, correspond to 1.27 nanometers (nm or 0.427 base pair (bp and it is the framework for measure DNA sequences. Thus the three billion base pairs of the human genome may be identified by 1296000000 "mas" units in continous correlation from number 1 to number 1296000000. This sexagesimal scale covers all the levels of the nuclear genetic material, from nucleotides to chromosomes. The locations of every codon and every gene may be numbered in the physical map of chomosome regions according to this new scale, instead of the partial kilobase and Megabase scales used today. The advantage of the new scale is the unification of the set of chromosomes under a continous scale of measurement at the DNA level, facilitating the correlation with the phenotypes of man and other speciesEn un trabajo anterior yo diseñé un diagrama del genoma humano basado en un ideograma circular del conjunto haploide de cromosomas, usando una escala de baja resolución en megabases. El propósito de este trabajo es el de diseñar una nueva escala para medir el mapa físico del genoma humano al más alto nivel de resolución. La longitud completa del genoma haploide del varon es extendido en una circunsferencia, marcada con una escala sexagesimal de 360 grados y 1296000

  13. SAGE: String-overlap Assembly of GEnomes.

    Science.gov (United States)

    Ilie, Lucian; Haider, Bahlul; Molnar, Michael; Solis-Oba, Roberto

    2014-09-15

    De novo genome assembly of next-generation sequencing data is one of the most important current problems in bioinformatics, essential in many biological applications. In spite of significant amount of work in this area, better solutions are still very much needed. We present a new program, SAGE, for de novo genome assembly. As opposed to most assemblers, which are de Bruijn graph based, SAGE uses the string-overlap graph. SAGE builds upon great existing work on string-overlap graph and maximum likelihood assembly, bringing an important number of new ideas, such as the efficient computation of the transitive reduction of the string overlap graph, the use of (generalized) edge multiplicity statistics for more accurate estimation of read copy counts, and the improved use of mate pairs and min-cost flow for supporting edge merging. The assemblies produced by SAGE for several short and medium-size genomes compared favourably with those of existing leading assemblers. SAGE benefits from innovations in almost every aspect of the assembly process: error correction of input reads, string-overlap graph construction, read copy counts estimation, overlap graph analysis and reduction, contig extraction, and scaffolding. We hope that these new ideas will help advance the current state-of-the-art in an essential area of research in genomics.

  14. Figure 2 from Integrative Genomics Viewer: Visualizing Big Data | Office of Cancer Genomics

    Science.gov (United States)

    Grouping and sorting genomic data in IGV. The IGV user interface displaying 202 glioblastoma samples from TCGA. Samples are grouped by tumor subtype (second annotation column) and data type (first annotation column) and sorted by copy number of the EGFR locus (middle column). Adapted from Figure 1; Robinson et al. 2011

  15. Comparative analysis of genome maintenance genes in naked mole rat, mouse, and human

    NARCIS (Netherlands)

    S.L. Macrae (Sheila L.); Q. Zhang (Quanwei); C. Lemetre (Christophe); I. Seim (Inge); R.B. Calder (Robert B.); J.H.J. Hoeijmakers (Jan); Y. Suh (Yousin); V.N. Gladyshev (Vadim N.); A. Seluanov (Andrei); V. Gorbunova (Vera); J. Vijg (Jan); Z.D. Zhang (Zhengdong D.)

    2015-01-01

    textabstractGenome maintenance (GM) is an essential defense system against aging and cancer, as both are characterized by increased genome instability. Here, we compared the copy number variation and mutation rate of 518 GM-associated genes in the naked mole rat (NMR), mouse, and human genomes. GM

  16. Genome sequencing and annotation of Stenotrophomonas sp. SAM8

    Directory of Open Access Journals (Sweden)

    Samy Selim

    2015-12-01

    Full Text Available We report draft genome sequence of Stenotrophomonas sp. strain SAM8, isolated from environmental water. The draft genome size is 3,665,538 bp with a G + C content of 67.2% and contains 6 rRNA sequence (single copies of 5S, 16S & 23S rRNA. The genome sequence can be accessed at DDBJ/EMBL/GenBank under the accession no. LDAV00000000.

  17. The peculiar landscape of repetitive sequences in the olive (Olea europaea L.) genome.

    Science.gov (United States)

    Barghini, Elena; Natali, Lucia; Cossu, Rosa Maria; Giordani, Tommaso; Pindo, Massimo; Cattonaro, Federica; Scalabrin, Simone; Velasco, Riccardo; Morgante, Michele; Cavallini, Andrea

    2014-04-01

    Analyzing genome structure in different species allows to gain an insight into the evolution of plant genome size. Olive (Olea europaea L.) has a medium-sized haploid genome of 1.4 Gb, whose structure is largely uncharacterized, despite the growing importance of this tree as oil crop. Next-generation sequencing technologies and different computational procedures have been used to study the composition of the olive genome and its repetitive fraction. A total of 2.03 and 2.3 genome equivalents of Illumina and 454 reads from genomic DNA, respectively, were assembled following different procedures, which produced more than 200,000 differently redundant contigs, with mean length higher than 1,000 nt. Mapping Illumina reads onto the assembled sequences was used to estimate their redundancy. The genome data set was subdivided into highly and medium redundant and nonredundant contigs. By combining identification and mapping of repeated sequences, it was established that tandem repeats represent a very large portion of the olive genome (∼31% of the whole genome), consisting of six main families of different length, two of which were first discovered in these experiments. The other large redundant class in the olive genome is represented by transposable elements (especially long terminal repeat-retrotransposons). On the whole, the results of our analyses show the peculiar landscape of the olive genome, related to the massive amplification of tandem repeats, more than that reported for any other sequenced plant genome.

  18. Copy number variation of KIR genes influences HIV-1 control

    DEFF Research Database (Denmark)

    Pelak, Kimberly; Need, Anna C; Fellay, Jacques

    2011-01-01

    A genome-wide screen for large structural variants showed that a copy number variant (CNV) in the region encoding killer cell immunoglobulin-like receptors (KIR) associates with HIV-1 control as measured by plasma viral load at set point in individuals of European ancestry. This CNV encompasses...... the KIR3DL1-KIR3DS1 locus, encoding receptors that interact with specific HLA-Bw4 molecules to regulate the activation of lymphocyte subsets including natural killer (NK) cells. We quantified the number of copies of KIR3DS1 and KIR3DL1 in a large HIV-1 positive cohort, and showed that an increase in KIR3...... amounts of these activating and inhibitory KIR play a role in regulating the peripheral expansion of highly antiviral KIR3DS1+ NK cells, which may determine differences in HIV-1 control following infection....

  19. Comparative analysis of genome maintenance genes in naked mole rat, mouse, and human.

    Science.gov (United States)

    MacRae, Sheila L; Zhang, Quanwei; Lemetre, Christophe; Seim, Inge; Calder, Robert B; Hoeijmakers, Jan; Suh, Yousin; Gladyshev, Vadim N; Seluanov, Andrei; Gorbunova, Vera; Vijg, Jan; Zhang, Zhengdong D

    2015-04-01

    Genome maintenance (GM) is an essential defense system against aging and cancer, as both are characterized by increased genome instability. Here, we compared the copy number variation and mutation rate of 518 GM-associated genes in the naked mole rat (NMR), mouse, and human genomes. GM genes appeared to be strongly conserved, with copy number variation in only four genes. Interestingly, we found NMR to have a higher copy number of CEBPG, a regulator of DNA repair, and TINF2, a protector of telomere integrity. NMR, as well as human, was also found to have a lower rate of germline nucleotide substitution than the mouse. Together, the data suggest that the long-lived NMR, as well as human, has more robust GM than mouse and identifies new targets for the analysis of the exceptional longevity of the NMR. © 2015 The Authors. Aging Cell published by the Anatomical Society and John Wiley & Sons Ltd.

  20. Quantitative trait loci mapping of heat tolerance in a doubled haploid population of broccoli using genotyping-by-sequencing

    Science.gov (United States)

    Broccoli is a cool weather vegetable crop with a vernalization requirement to initiate and maintain floral development. Breeding for heat tolerance in broccoli has the potential to both expand viable production areas and extend the growing season. A doubled haploid (DH) population of broccoli (Bras...

  1. Clinical array-based karyotyping of breast cancer with equivocal HER2 status resolves gene copy number and reveals chromosome 17 complexity

    International Nuclear Information System (INIS)

    Gunn, Shelly; Gorre, Mercedes; Mohammed, Mansoor; Yeh, I-Tien; Lytvak, Irina; Tirtorahardjo, Budi; Dzidic, Natasha; Zadeh, Soheila; Kim, Jaeweon; McCaskill, Chris; Lim, Lony

    2010-01-01

    HER2 gene copy status, and concomitant administration of trastuzumab (Herceptin), remains one of the best examples of targeted cancer therapy based on understanding the genomic etiology of disease. However, newly diagnosed breast cancer cases with equivocal HER2 results present a challenge for the oncologist who must make treatment decisions despite the patient's unresolved HER2 status. In some cases both immunohistochemistry (IHC) and fluorescence in situ hybridization (FISH) are reported as equivocal, whereas in other cases IHC results and FISH are discordant for positive versus negative results. The recent validation of array-based, molecular karyotyping for clinical oncology testing provides an alternative method for determination of HER2 gene copy number status in cases remaining unresolved by traditional methods. In the current study, DNA extracted from 20 formalin fixed paraffin embedded (FFPE) tissue samples from newly diagnosed cases of invasive ductal carcinoma referred to our laboratory with unresolved HER2 status, were analyzed using a clinically validated genomic array containing 127 probes covering the HER2 amplicon, the pericentromeric regions, and both chromosome 17 arms. Array-based comparative genomic hybridization (array CGH) analysis of chromosome 17 resolved HER2 gene status in [20/20] (100%) of cases and revealed additional chromosome 17 copy number changes in [18/20] (90%) of cases. Array CGH analysis also revealed two false positives and one false negative by FISH due to 'ratio skewing' caused by chromosomal gains and losses in the centromeric region. All cases with complex rearrangements of chromosome 17 showed genome-wide chromosomal instability. These results illustrate the analytical power of array-based genomic analysis as a clinical laboratory technique for resolution of HER2 status in breast cancer cases with equivocal results. The frequency of complex chromosome 17 abnormalities in these cases suggests that the two

  2. Genome-wide association study identified CNP12587 region underlying height variation in Chinese females.

    Directory of Open Access Journals (Sweden)

    Yin-Ping Zhang

    Full Text Available Human height is a highly heritable trait considered as an important factor for health. There has been limited success in identifying the genetic factors underlying height variation. We aim to identify sequence variants associated with adult height by a genome-wide association study of copy number variants (CNVs in Chinese.Genome-wide CNV association analyses were conducted in 1,625 unrelated Chinese adults and sex specific subgroup for height variation, respectively. Height was measured with a stadiometer. Affymetrix SNP6.0 genotyping platform was used to identify copy number polymorphisms (CNPs. We constructed a genomic map containing 1,009 CNPs in Chinese individuals and performed a genome-wide association study of CNPs with height.We detected 10 significant association signals for height (p<0.05 in the whole population, 9 and 11 association signals for Chinese female and male population, respectively. A copy number polymorphism (CNP12587, chr18:54081842-54086942, p = 2.41 × 10(-4 was found to be significantly associated with height variation in Chinese females even after strict Bonferroni correction (p = 0.048. Confirmatory real time PCR experiments lent further support for CNV validation. Compared to female subjects with two copies of the CNP, carriers of three copies had an average of 8.1% decrease in height. An important candidate gene, ubiquitin-protein ligase NEDD4-like (NEDD4L, was detected at this region, which plays important roles in bone metabolism by binding to bone formation regulators.Our findings suggest the important genetic variants underlying height variation in Chinese.

  3. Long identical multispecies elements in plant and animal genomes.

    Science.gov (United States)

    Reneker, Jeff; Lyons, Eric; Conant, Gavin C; Pires, J Chris; Freeling, Michael; Shyu, Chi-Ren; Korkin, Dmitry

    2012-05-08

    Ultraconserved elements (UCEs) are DNA sequences that are 100% identical (no base substitutions, insertions, or deletions) and located in syntenic positions in at least two genomes. Although hundreds of UCEs have been found in animal genomes, little is known about the incidence of ultraconservation in plant genomes. Using an alignment-free information-retrieval approach, we have comprehensively identified all long identical multispecies elements (LIMEs), which include both syntenic and nonsyntenic regions, of at least 100 identical base pairs shared by at least two genomes. Among six animal genomes, we found the previously known syntenic UCEs as well as previously undescribed nonsyntenic elements. In contrast, among six plant genomes, we only found nonsyntenic LIMEs. LIMEs can also be classified as either simple (repetitive) or complex (nonrepetitive), they may occur in multiple copies in a genome, and they are often spread across multiple chromosomes. Although complex LIMEs were found in both animal and plant genomes, they differed significantly in their composition and copy number. Further analyses of plant LIMEs revealed their functional diversity, encompassing elements found near rRNA and enzyme-coding genes, as well as those found in transposons and noncoding DNA. We conclude that despite the common presence of LIMEs in both animal and plant lineages, the evolutionary processes involved in the creation and maintenance of these elements differ in the two groups and are likely attributable to several mechanisms, including transfer of genetic material from organellar to nuclear genomes, de novo sequence manufacturing, and purifying selection.

  4. Intragenomic polymorphisms among high-copy loci: a genus-wide study of nuclear ribosomal DNA in Asclepias (Apocynaceae).

    Science.gov (United States)

    Weitemier, Kevin; Straub, Shannon C K; Fishbein, Mark; Liston, Aaron

    2015-01-01

    Despite knowledge that concerted evolution of high-copy loci is often imperfect, studies that investigate the extent of intragenomic polymorphisms and comparisons across a large number of species are rarely made. We present a bioinformatic pipeline for characterizing polymorphisms within an individual among copies of a high-copy locus. Results are presented for nuclear ribosomal DNA (nrDNA) across the milkweed genus, Asclepias. The 18S-26S portion of the nrDNA cistron of Asclepias syriaca served as a reference for assembly of the region from 124 samples representing 90 species of Asclepias. Reads were mapped back to each individual's consensus and at each position reads differing from the consensus were tallied using a custom perl script. Low frequency polymorphisms existed in all individuals (mean = 5.8%). Most nrDNA positions (91%) were polymorphic in at least one individual, with polymorphic sites being less frequent in subunit regions and loops. Highly polymorphic sites existed in each individual, with highest abundance in the "noncoding" ITS regions. Phylogenetic signal was present in the distribution of intragenomic polymorphisms across the genus. Intragenomic polymorphisms in nrDNA are common in Asclepias, being found at higher frequency than any other study to date. The high and variable frequency of polymorphisms across species highlights concerns that phylogenetic applications of nrDNA may be error-prone. The new analytical approach provided here is applicable to other taxa and other high-copy regions characterized by low coverage genome sequencing (genome skimming).

  5. Engineered ribosomal RNA operon copy-number variants of E. coli reveal the evolutionary trade-offs shaping rRNA operon number

    Science.gov (United States)

    Gyorfy, Zsuzsanna; Draskovits, Gabor; Vernyik, Viktor; Blattner, Frederick F.; Gaal, Tamas; Posfai, Gyorgy

    2015-01-01

    Ribosomal RNA (rrn) operons, characteristically present in several copies in bacterial genomes (7 in E. coli), play a central role in cellular physiology. We investigated the factors determining the optimal number of rrn operons in E. coli by constructing isogenic variants with 5–10 operons. We found that the total RNA and protein content, as well as the size of the cells reflected the number of rrn operons. While growth parameters showed only minor differences, competition experiments revealed a clear pattern: 7–8 copies were optimal under conditions of fluctuating, occasionally rich nutrient influx and lower numbers were favored in stable, nutrient-limited environments. We found that the advantages of quick adjustment to nutrient availability, rapid growth and economic regulation of ribosome number all contribute to the selection of the optimal rrn operon number. Our results suggest that the wt rrn operon number of E. coli reflects the natural, ‘feast and famine’ life-style of the bacterium, however, different copy numbers might be beneficial under different environmental conditions. Understanding the impact of the copy number of rrn operons on the fitness of the cell is an important step towards the creation of functional and robust genomes, the ultimate goal of synthetic biology. PMID:25618851

  6. Original Copies

    DEFF Research Database (Denmark)

    Sørensen, Tim Flohr

    2013-01-01

    of similarity by looking at artefactual similarity as the results of prototyping and as a production of simulacra. In this light, the concept of copying turns out to be more than simply a matter of trying to imitate an exotic or prestigious original, and it fundamentally raises the question how different a copy...

  7. Quantum copying: A review

    Directory of Open Access Journals (Sweden)

    Mark Hillery

    2000-07-01

    Full Text Available Quantum information is stored in two-level quantum systems known as qubits. The no-cloning theorem states that the state of an unknown qubit cannot be copied. This is in contrast to classical information which can be copied. If one drops the requirement that the copies be perfect it is possible to design quantum copiers. This paper presents a short review of the theory of quantum copying.

  8. Haplotype phasing and inheritance of copy number variants in nuclear families.

    Science.gov (United States)

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring.

  9. Haplotype phasing and inheritance of copy number variants in nuclear families.

    Directory of Open Access Journals (Sweden)

    Priit Palta

    Full Text Available DNA copy number variants (CNVs that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i phase normal and CNV-carrying haplotypes in the copy number variable regions, ii resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring.

  10. biomvRhsmm: Genomic Segmentation with Hidden Semi-Markov Model

    Directory of Open Access Journals (Sweden)

    Yang Du

    2014-01-01

    Full Text Available High-throughput technologies like tiling array and next-generation sequencing (NGS generate continuous homogeneous segments or signal peaks in the genome that represent transcripts and transcript variants (transcript mapping and quantification, regions of deletion and amplification (copy number variation, or regions characterized by particular common features like chromatin state or DNA methylation ratio (epigenetic modifications. However, the volume and output of data produced by these technologies present challenges in analysis. Here, a hidden semi-Markov model (HSMM is implemented and tailored to handle multiple genomic profile, to better facilitate genome annotation by assisting in the detection of transcripts, regulatory regions, and copy number variation by holistic microarray or NGS. With support for various data distributions, instead of limiting itself to one specific application, the proposed hidden semi-Markov model is designed to allow modeling options to accommodate different types of genomic data and to serve as a general segmentation engine. By incorporating genomic positions into the sojourn distribution of HSMM, with optional prior learning using annotation or previous studies, the modeling output is more biologically sensible. The proposed model has been compared with several other state-of-the-art segmentation models through simulation benchmarking, which shows that our efficient implementation achieves comparable or better sensitivity and specificity in genomic segmentation.

  11. Resource allocation for maximizing prediction accuracy and genetic gain of genomic selection in plant breeding: a simulation experiment.

    Science.gov (United States)

    Lorenz, Aaron J

    2013-03-01

    Allocating resources between population size and replication affects both genetic gain through phenotypic selection and quantitative trait loci detection power and effect estimation accuracy for marker-assisted selection (MAS). It is well known that because alleles are replicated across individuals in quantitative trait loci mapping and MAS, more resources should be allocated to increasing population size compared with phenotypic selection. Genomic selection is a form of MAS using all marker information simultaneously to predict individual genetic values for complex traits and has widely been found superior to MAS. No studies have explicitly investigated how resource allocation decisions affect success of genomic selection. My objective was to study the effect of resource allocation on response to MAS and genomic selection in a single biparental population of doubled haploid lines by using computer simulation. Simulation results were compared with previously derived formulas for the calculation of prediction accuracy under different levels of heritability and population size. Response of prediction accuracy to resource allocation strategies differed between genomic selection models (ridge regression best linear unbiased prediction [RR-BLUP], BayesCπ) and multiple linear regression using ordinary least-squares estimation (OLS), leading to different optimal resource allocation choices between OLS and RR-BLUP. For OLS, it was always advantageous to maximize population size at the expense of replication, but a high degree of flexibility was observed for RR-BLUP. Prediction accuracy of doubled haploid lines included in the training set was much greater than of those excluded from the training set, so there was little benefit to phenotyping only a subset of the lines genotyped. Finally, observed prediction accuracies in the simulation compared well to calculated prediction accuracies, indicating these theoretical formulas are useful for making resource allocation

  12. Chromosome length scaling in haploid, asexual reproduction

    International Nuclear Information System (INIS)

    Oliveira, P M C de

    2007-01-01

    We study the genetic behaviour of a population formed by haploid individuals which reproduce asexually. The genetic information for each individual is stored along a bit-string (or chromosome) with L bits, where 0-bits represent the wild allele and 1-bits correspond to harmful mutations. Each newborn inherits this chromosome from its parent with a few random mutations: on average a fixed number m of bits are flipped. Selection is implemented according to the number N of 1-bits counted along the individual's chromosome: the smaller N the higher the probability an individual has to survive a new time step. Such a population evolves, with births and deaths, and its genetic distribution becomes stabilized after sufficiently many generations have passed. The question we pose concerns the procedure of increasing L. The aim is to get the same distribution of genetic loads N/L among the equilibrated population, in spite of a larger L. Should we keep the same mutation rate m/L for different values of L? The answer is yes, which intuitively seems to be plausible. However, this conclusion is not trivial, according to our simulation results: the question also involves the population size

  13. Complete Chloroplast Genome Sequence of Aquilaria sinensis (Lour.) Gilg and Evolution Analysis within the Malvales Order.

    Science.gov (United States)

    Wang, Ying; Zhan, Di-Feng; Jia, Xian; Mei, Wen-Li; Dai, Hao-Fu; Chen, Xiong-Ting; Peng, Shi-Qing

    2016-01-01

    Aquilaria sinensis (Lour.) Gilg is an important medicinal woody plant producing agarwood, which is widely used in traditional Chinese medicine. High-throughput sequencing of chloroplast (cp) genomes enhanced the understanding about evolutionary relationships within plant families. In this study, we determined the complete cp genome sequences for A. sinensis. The size of the A. sinensis cp genome was 159,565 bp. This genome included a large single-copy region of 87,482 bp, a small single-copy region of 19,857 bp, and a pair of inverted repeats (IRa and IRb) of 26,113 bp each. The GC content of the genome was 37.11%. The A. sinensis cp genome encoded 113 functional genes, including 82 protein-coding genes, 27 tRNA genes, and 4 rRNA genes. Seven genes were duplicated in the protein-coding genes, whereas 11 genes were duplicated in the RNA genes. A total of 45 polymorphic simple-sequence repeat loci and 60 pairs of large repeats were identified. Most simple-sequence repeats were located in the noncoding sections of the large single-copy/small single-copy region and exhibited high A/T content. Moreover, 33 pairs of large repeat sequences were located in the protein-coding genes, whereas 27 pairs were located in the intergenic regions. Aquilaria sinensis cp genome bias ended with A/T on the basis of codon usage. The distribution of codon usage in A. sinensis cp genome was most similar to that in the Gonystylus bancanus cp genome. Comparative results of 82 protein-coding genes from 29 species of cp genomes demonstrated that A. sinensis was a sister species to G. bancanus within the Malvales order. Aquilaria sinensis cp genome presented the highest sequence similarity of >90% with the G. bancanus cp genome by using CGView Comparison Tool. This finding strongly supports the placement of A. sinensis as a sister to G. bancanus within the Malvales order. The complete A. sinensis cp genome information will be highly beneficial for further studies on this traditional medicinal

  14. A Three-Dimensional Model of the Yeast Genome

    Science.gov (United States)

    Noble, William; Duan, Zhi-Jun; Andronescu, Mirela; Schutz, Kevin; McIlwain, Sean; Kim, Yoo Jung; Lee, Choli; Shendure, Jay; Fields, Stanley; Blau, C. Anthony

    Layered on top of information conveyed by DNA sequence and chromatin are higher order structures that encompass portions of chromosomes, entire chromosomes, and even whole genomes. Interphase chromosomes are not positioned randomly within the nucleus, but instead adopt preferred conformations. Disparate DNA elements co-localize into functionally defined aggregates or factories for transcription and DNA replication. In budding yeast, Drosophila and many other eukaryotes, chromosomes adopt a Rabl configuration, with arms extending from centromeres adjacent to the spindle pole body to telomeres that abut the nuclear envelope. Nonetheless, the topologies and spatial relationships of chromosomes remain poorly understood. Here we developed a method to globally capture intra- and inter-chromosomal interactions, and applied it to generate a map at kilobase resolution of the haploid genome of Saccharomyces cerevisiae. The map recapitulates known features of genome organization, thereby validating the method, and identifies new features. Extensive regional and higher order folding of individual chromosomes is observed. Chromosome XII exhibits a striking conformation that implicates the nucleolus as a formidable barrier to interaction between DNA sequences at either end. Inter-chromosomal contacts are anchored by centromeres and include interactions among transfer RNA genes, among origins of early DNA replication and among sites where chromosomal breakpoints occur. Finally, we constructed a three-dimensional model of the yeast genome. Our findings provide a glimpse of the interface between the form and function of a eukaryotic genome.

  15. Phylogenomic approaches to common problems encountered in the analysis of low copy repeats: The sulfotransferase 1A gene family example

    Directory of Open Access Journals (Sweden)

    Benner Steven A

    2005-03-01

    Full Text Available Abstract Background Blocks of duplicated genomic DNA sequence longer than 1000 base pairs are known as low copy repeats (LCRs. Identified by their sequence similarity, LCRs are abundant in the human genome, and are interesting because they may represent recent adaptive events, or potential future adaptive opportunities within the human lineage. Sequence analysis tools are needed, however, to decide whether these interpretations are likely, whether a particular set of LCRs represents nearly neutral drift creating junk DNA, or whether the appearance of LCRs reflects assembly error. Here we investigate an LCR family containing the sulfotransferase (SULT 1A genes involved in drug metabolism, cancer, hormone regulation, and neurotransmitter biology as a first step for defining the problems that those tools must manage. Results Sequence analysis here identified a fourth sulfotransferase gene, which may be transcriptionally active, located on human chromosome 16. Four regions of genomic sequence containing the four human SULT1A paralogs defined a new LCR family. The stem hominoid SULT1A progenitor locus was identified by comparative genomics involving complete human and rodent genomes, and a draft chimpanzee genome. SULT1A expansion in hominoid genomes was followed by positive selection acting on specific protein sites. This episode of adaptive evolution appears to be responsible for the dopamine sulfonation function of some SULT enzymes. Each of the conclusions that this bioinformatic analysis generated using data that has uncertain reliability (such as that from the chimpanzee genome sequencing project has been confirmed experimentally or by a "finished" chromosome 16 assembly, both of which were published after the submission of this manuscript. Conclusion SULT1A genes expanded from one to four copies in hominoids during intra-chromosomal LCR duplications, including (apparently one after the divergence of chimpanzees and humans. Thus, LCRs may

  16. The complete chloroplast genome sequence of Maddenia hypoleuca koehne (Prunoideae, Rosaceae).

    Science.gov (United States)

    Chen, Tao; Zhang, Jing; Liu, Yin; Wang, Hao; Wang, Juan; Chen, Qing; Tang, Hao-Ru; Wang, Xiao-Rong

    2016-11-01

    Maddenia hypoleuca Koehne belonging to family Rosaceae is a native species in China. The complete chloroplast (cp) genome was generated by de novo assembly using low coverage whole genome sequencing data and manual correction. The cp genome was 158 084 bp in length, with GC content of 36.63%. It exhibited a typical quadripartite structure: a pair of large inverted repeat regions (IRs, 26 246 bp each), a large single-copy region (LSC, 86 713 bp), and a small single-copy region (SSC, 18 879 bp). A total of 114 genes were predicted, which included 80 protein-coding genes, 30 tRNA genes, and four rRNA genes. Phylogenetic analysis indicated that M. hypoleuca is most closely related to Prunus padus within the Prunoideae subfamily, which conforms to the traditional classification.

  17. Expanding probe repertoire and improving reproducibility in human genomic hybridization

    Science.gov (United States)

    Dorman, Stephanie N.; Shirley, Ben C.; Knoll, Joan H. M.; Rogan, Peter K.

    2013-01-01

    Diagnostic DNA hybridization relies on probes composed of single copy (sc) genomic sequences. Sc sequences in probe design ensure high specificity and avoid cross-hybridization to other regions of the genome, which could lead to ambiguous results that are difficult to interpret. We examine how the distribution and composition of repetitive sequences in the genome affects sc probe performance. A divide and conquer algorithm was implemented to design sc probes. With this approach, sc probes can include divergent repetitive elements, which hybridize to unique genomic targets under higher stringency experimental conditions. Genome-wide custom probe sets were created for fluorescent in situ hybridization (FISH) and microarray genomic hybridization. The scFISH probes were developed for detection of copy number changes within small tumour suppressor genes and oncogenes. The microarrays demonstrated increased reproducibility by eliminating cross-hybridization to repetitive sequences adjacent to probe targets. The genome-wide microarrays exhibited lower median coefficients of variation (17.8%) for two HapMap family trios. The coefficients of variations of commercial probes within 300 nt of a repetitive element were 48.3% higher than the nearest custom probe. Furthermore, the custom microarray called a chromosome 15q11.2q13 deletion more consistently. This method for sc probe design increases probe coverage for FISH and lowers variability in genomic microarrays. PMID:23376933

  18. The complete chloroplast genome sequence of Abies nephrolepis (Pinaceae: Abietoideae

    Directory of Open Access Journals (Sweden)

    Dong-Keun Yi

    2016-06-01

    Full Text Available The plant chloroplast (cp genome has maintained a relatively conserved structure and gene content throughout evolution. Cp genome sequences have been used widely for resolving evolutionary and phylogenetic issues at various taxonomic levels of plants. Here, we report the complete cp genome of Abies nephrolepis. The A. nephrolepis cp genome is 121,336 base pairs (bp in length including a pair of short inverted repeat regions (IRa and IRb of 139 bp each separated by a small single copy (SSC region of 54,323 bp (SSC and a large single copy region of 66,735 bp (LSC. It contains 114 genes, 68 of which are protein coding genes, 35 tRNA and four rRNA genes, six open reading frames, and one pseudogene. Seventeen repeat units and 64 simple sequence repeats (SSR have been detected in A. nephrolepis cp genome. Large IR sequences locate in 42-kb inversion points (1186 bp. The A. nephrolepis cp genome is identical to Abies koreana’s which is closely related to taxa. Pairwise comparison between two cp genomes revealed 140 polymorphic sites in each. Complete cp genome sequence of A. nephrolepis has a significant potential to provide information on the evolutionary pattern of Abietoideae and valuable data for development of DNA markers for easy identification and classification.

  19. Effect of gamma-radiations on haploid cultured cells of Datura innoxia

    International Nuclear Information System (INIS)

    Jain, R.K.; Maherchandani, N.; Sharma, D.R.; Chowdhury, V.K.

    1981-01-01

    The effects of gamma-radiations were studied in haploid cultured cells of Datura innoxia. Growth of callus cultures and shoot differentiation were stimulated at low doses (0.2 and 1.0 kR), while the higher dose (5.0 kR) was inhibitory. Root differentiation was observed only in cultures exposed to 1.0 kR dose. Enzyme activities of alpha-amylase, peroxidase, malate dehydrogenase and phosphatases, and the amounts of buffer extractable proteins were stimulated at 0.2 and 1.0 kR and inhibited at 5.0 kR. Mitotic index too decreased at 5.0 kR. Mean nuclear volume increased with increase in radiation dose, probably due to increased ploidy. (author)

  20. Genomic Diversity Using Copy Number Variations in Worldwide Chicken Populations

    Directory of Open Access Journals (Sweden)

    Erica Gorla

    2018-06-01

    Full Text Available Recently, many studies in livestock have focused on the identification of Copy Number Variants (CNVs using high-density Single Nucleotide Polymorphism (SNP arrays, but few have focused on studying chicken ecotypes coming from many locations. CNVs are polymorphisms, which may influence phenotype and are an important source of genetic variation in populations. The aim of this study was to explore the genetic difference and structure, using a high density SNP chip in 936 individuals from seven different countries (Brazil, Italy, Egypt, Mexico, Rwanda, Sri Lanka and Uganda. The DNA was genotyped with the Affymetrix Axiom®600k Chicken Genotyping Array and processed with stringent quality controls to obtain 559,201 SNPs in 915 individuals. The Log R Ratio (LRR and the B Allele Frequency of SNPs were used to perform the CNV calling with PennCNV software based on a Hidden Markov Model analysis and the LRR was used to perform CNV detection with SVS Golden Helix software.After filtering, a total of 19,027 CNVs were detected with the SVS software, while 9,065 CNVs were identified with the Penn CNV software. The CNVs were summarized in 7,001 Copy Number Variant Regions (CNVRs and 4,414 CNVRs, using the software BedTool.The consensus analysis across the CNVRs allowed the identification of 2,820 consensus CNVR, of which 1,721 were gain, 637 loss and 462 complex, for a total length of 53 Mb corresponding to the 5 % of the GalGal5 chicken autosomes. Only the consensus CNV regions obtained from both detections were considered for further analysis.The intersection analysis performed between the chicken gene database (Gallus_gallus-5.0 and the 1,927 consensus CNVRs allowed the identification (within or partial overlap of a total of 2,354 unique genes with an official gene ID.  The CNVRs identified here represent the first comprehensive mapping in several worldwide populations, using a high-density SNP chip.

  1. Classification of human cancers based on DNA copy number amplification modeling

    Directory of Open Access Journals (Sweden)

    Knuutila Sakari

    2008-05-01

    Full Text Available Abstract Background DNA amplifications alter gene dosage in cancer genomes by multiplying the gene copy number. Amplifications are quintessential in a considerable number of advanced cancers of various anatomical locations. The aims of this study were to classify human cancers based on their amplification patterns, explore the biological and clinical fundamentals behind their amplification-pattern based classification, and understand the characteristics in human genomic architecture that associate with amplification mechanisms. Methods We applied a machine learning approach to model DNA copy number amplifications using a data set of binary amplification records at chromosome sub-band resolution from 4400 cases that represent 82 cancer types. Amplification data was fused with background data: clinical, histological and biological classifications, and cytogenetic annotations. Statistical hypothesis testing was used to mine associations between the data sets. Results Probabilistic clustering of each chromosome identified 111 amplification models and divided the cancer cases into clusters. The distribution of classification terms in the amplification-model based clustering of cancer cases revealed cancer classes that were associated with specific DNA copy number amplification models. Amplification patterns – finite or bounded descriptions of the ranges of the amplifications in the chromosome – were extracted from the clustered data and expressed according to the original cytogenetic nomenclature. This was achieved by maximal frequent itemset mining using the cluster-specific data sets. The boundaries of amplification patterns were shown to be enriched with fragile sites, telomeres, centromeres, and light chromosome bands. Conclusions Our results demonstrate that amplifications are non-random chromosomal changes and specifically selected in tumor tissue microenvironment. Furthermore, statistical evidence showed that specific chromosomal features

  2. Getting DNA copy numbers without control samples.

    Science.gov (United States)

    Ortiz-Estevez, Maria; Aramburu, Ander; Rubio, Angel

    2012-08-16

    The selection of the reference to scale the data in a copy number analysis has paramount importance to achieve accurate estimates. Usually this reference is generated using control samples included in the study. However, these control samples are not always available and in these cases, an artificial reference must be created. A proper generation of this signal is crucial in terms of both noise and bias.We propose NSA (Normality Search Algorithm), a scaling method that works with and without control samples. It is based on the assumption that genomic regions enriched in SNPs with identical copy numbers in both alleles are likely to be normal. These normal regions are predicted for each sample individually and used to calculate the final reference signal. NSA can be applied to any CN data regardless the microarray technology and preprocessing method. It also finds an optimal weighting of the samples minimizing possible batch effects. Five human datasets (a subset of HapMap samples, Glioblastoma Multiforme (GBM), Ovarian, Prostate and Lung Cancer experiments) have been analyzed. It is shown that using only tumoral samples, NSA is able to remove the bias in the copy number estimation, to reduce the noise and therefore, to increase the ability to detect copy number aberrations (CNAs). These improvements allow NSA to also detect recurrent aberrations more accurately than other state of the art methods. NSA provides a robust and accurate reference for scaling probe signals data to CN values without the need of control samples. It minimizes the problems of bias, noise and batch effects in the estimation of CNs. Therefore, NSA scaling approach helps to better detect recurrent CNAs than current methods. The automatic selection of references makes it useful to perform bulk analysis of many GEO or ArrayExpress experiments without the need of developing a parser to find the normal samples or possible batches within the data. The method is available in the open-source R package

  3. Genetic transformation and gene silencing mediated by multiple copies of a transgene in eastern white pine.

    Science.gov (United States)

    Tang, Wei; Newton, Ronald J; Weidner, Douglas A

    2007-01-01

    An efficient transgenic eastern white pine (Pinus strobus L.) plant regeneration system has been established using Agrobacterium tumefaciens strain GV3850-mediated transformation and the green fluorescent protein (gfp) gene as a reporter in this investigation. Stable integration of transgenes in the plant genome of pine was confirmed by polymerase chain reaction (PCR), Southern blot, and northern blot analyses. Transgene expression was analysed in pine T-DNA transformants carrying different numbers of copies of T-DNA insertions. Post-transcriptional gene silencing (PTGS) was mostly obtained in transgenic lines with more than three copies of T-DNA, but not in transgenic lines with one copy of T-DNA. In situ hybridization chromosome analysis of transgenic lines demonstrated that silenced transgenic lines had two or more T-DNA insertions in the same chromosome. These results suggest that two or more T-DNA insertions in the same chromosome facilitate efficient gene silencing in transgenic pine cells expressing green fluorescent protein. There were no differences in shoot differentiation and development between transgenic lines with multiple T-DNA copies and transgenic lines with one or two T-DNA copies.

  4. Copy number variations and genetic admixtures in three Xinjiang ethnic minority groups.

    Science.gov (United States)

    Lou, Haiyi; Li, Shilin; Jin, Wenfei; Fu, Ruiqing; Lu, Dongsheng; Pan, Xinwei; Zhou, Huaigu; Ping, Yuan; Jin, Li; Xu, Shuhua

    2015-04-01

    Xinjiang is geographically located in central Asia, and it has played an important historical role in connecting eastern Eurasian (EEA) and western Eurasian (WEA) people. However, human population genomic studies in this region have been largely underrepresented, especially with respect to studies of copy number variations (CNVs). Here we constructed the first CNV map of the three major ethnic minority groups, the Uyghur, Kazakh and Kirgiz, using Affymetrix Genome-Wide Human SNP Array 6.0. We systematically compared the properties of CNVs we identified in the three groups with the data from representatives of EEA and WEA. The analyses indicated a typical genetic admixture pattern in all three groups with ancestries from both EEA and WEA. We also identified several CNV regions showing significant deviation of allele frequency from the expected genome-wide distribution, which might be associated with population-specific phenotypes. Our study provides the first genome-wide perspective on the CNVs of three major Xinjiang ethnic minority groups and has implications for both evolutionary and medical studies.

  5. Inferring species trees from incongruent multi-copy gene trees using the Robinson-Foulds distance

    Science.gov (United States)

    2013-01-01

    Background Constructing species trees from multi-copy gene trees remains a challenging problem in phylogenetics. One difficulty is that the underlying genes can be incongruent due to evolutionary processes such as gene duplication and loss, deep coalescence, or lateral gene transfer. Gene tree estimation errors may further exacerbate the difficulties of species tree estimation. Results We present a new approach for inferring species trees from incongruent multi-copy gene trees that is based on a generalization of the Robinson-Foulds (RF) distance measure to multi-labeled trees (mul-trees). We prove that it is NP-hard to compute the RF distance between two mul-trees; however, it is easy to calculate this distance between a mul-tree and a singly-labeled species tree. Motivated by this, we formulate the RF problem for mul-trees (MulRF) as follows: Given a collection of multi-copy gene trees, find a singly-labeled species tree that minimizes the total RF distance from the input mul-trees. We develop and implement a fast SPR-based heuristic algorithm for the NP-hard MulRF problem. We compare the performance of the MulRF method (available at http://genome.cs.iastate.edu/CBL/MulRF/) with several gene tree parsimony approaches using gene tree simulations that incorporate gene tree error, gene duplications and losses, and/or lateral transfer. The MulRF method produces more accurate species trees than gene tree parsimony approaches. We also demonstrate that the MulRF method infers in minutes a credible plant species tree from a collection of nearly 2,000 gene trees. Conclusions Our new phylogenetic inference method, based on a generalized RF distance, makes it possible to quickly estimate species trees from large genomic data sets. Since the MulRF method, unlike gene tree parsimony, is based on a generic tree distance measure, it is appealing for analyses of genomic data sets, in which many processes such as deep coalescence, recombination, gene duplication and losses as

  6. Chloroplast microsatellite markers for Pseudotaxus chienii developed from the whole chloroplast genome of Taxus chinensis var. mairei (Taxaceae).

    Science.gov (United States)

    Deng, Qi; Zhang, Hanrui; He, Yipeng; Wang, Ting; Su, Yingjuan

    2017-03-01

    Pseudotaxus chienii (Taxaceae) is an old rare species endemic to China that has adapted well to ecological heterogeneity with high genetic diversity in its nuclear genome. However, the genetic variation in its chloroplast genome is unknown. Eighteen chloroplast microsatellite markers (cpSSRs) were developed from the whole chloroplast genome of Taxus chinensis var. mairei and successfully amplified in four P. chienii populations and one T. chinensis var. mairei population. Of these loci, 10 were polymorphic in P. chienii , whereas six were polymorphic in T. chinensis var. mairei . The unbiased haploid diversity per locus ranged from 0.000 to 0.641 and 0.000 to 0.545 for P. chienii and T. chinensis var. mairei , respectively. The 18 cpSSRs will be used to further investigate the chloroplast genetic structure and adaptive evolution in P. chienii populations.

  7. Genome sequencing and annotation of Proteus sp. SAS71

    Directory of Open Access Journals (Sweden)

    Samy Selim

    2015-12-01

    Full Text Available We report draft genome sequence of Proteus sp. strain SAS71, isolated from water spring in Aljouf region, Saudi Arabia. The draft genome size is 3,037,704 bp with a G + C content of 39.3% and contains 6 rRNA sequence (single copies of 5S, 16S & 23S rRNA. The genome sequence can be accessed at DDBJ/EMBL/GenBank under the accession no. LDIU00000000.

  8. Analysis of copy number loss of the ErbB4 receptor tyrosine kinase in glioblastoma.

    Directory of Open Access Journals (Sweden)

    DeAnalisa C Jones

    Full Text Available Current treatments for glioblastoma multiforme (GBM-an aggressive form of brain cancer-are minimally effective and yield a median survival of 14.6 months and a two-year survival rate of 30%. Given the severity of GBM and the limitations of its treatment, there is a need for the discovery of novel drug targets for GBM and more personalized treatment approaches based on the characteristics of an individual's tumor. Most receptor tyrosine kinases-such as EGFR-act as oncogenes, but publicly available data from the Cancer Cell Line Encyclopedia (CCLE indicates copy number loss in the ERBB4 RTK gene across dozens of GBM cell lines, suggesting a potential tumor suppressor role. This loss is mutually exclusive with loss of its cognate ligand NRG1 in CCLE as well, more strongly suggesting a functional role. The availability of higher resolution copy number data from clinical GBM patients in The Cancer Genome Atlas (TCGA revealed that a region in Intron 1 of the ERBB4 gene was deleted in 69.1% of tumor samples harboring ERBB4 copy number loss; however, it was also found to be deleted in the matched normal tissue samples from these GBM patients (n = 81. Using the DECIPHER Genome Browser, we also discovered that this mutation occurs at approximately the same frequency in the general population as it does in the disease population. We conclude from these results that this loss in Intron 1 of the ERBB4 gene is neither a de novo driver mutation nor a predisposing factor to GBM, despite the indications from CCLE. A biological role of this significantly occurring genetic alteration is still unknown. While this is a negative result, the broader conclusion is that while copy number data from large cell line-based data repositories may yield compelling hypotheses, careful follow up with higher resolution copy number assays, patient data, and general population analyses are essential to codify initial hypotheses prior to investing experimental resources.

  9. In-vitro Androgenesis in Rice: Advantages, Constraints and Future Prospects

    Directory of Open Access Journals (Sweden)

    Rukmini Mishra

    2016-03-01

    Full Text Available In vitro androgenesis is an important component of plant biotechnology when the pollen grains are forced to switch from their normal pollen developmental pathway towards an embryogenic route. Haploid and doubled haploid produced through androgenesis have long been recognized as a valuable tool in plant breeding as it can shorten the breeding cycle, fix agronomic characters in homozygous state and enhance the selection efficiency of useful recessive agronomic traits. Recently, doubled haploids have been largely recognized as an important component of crop improvement through genome mapping, quantitative trait locus analysis, and genetic mutation, and as targets for genetic transformation programs. Thus, this review is focused mainly on various facets of doubled haploid in the chief staple food crop rice and sights its recent applications in plant breeding, genetics and genomics.

  10. Cytonuclear Coordination Is Not Immediate upon Allopolyploid Formation in Tragopogon miscellus (Asteraceae) Allopolyploids

    Science.gov (United States)

    Sehrish, Tina; Symonds, V. Vaughan; Soltis, Douglas E.; Soltis, Pamela S.; Tate, Jennifer A.

    2015-01-01

    Allopolyploids, formed by hybridization and chromosome doubling, face the immediate challenge of having duplicated nuclear genomes that interact with the haploid and maternally inherited cytoplasmic (plastid and mitochondrial) genomes. Most of our knowledge of the genomic consequences of allopolyploidy has focused on the fate of the duplicated nuclear genes without regard to their potential interactions with cytoplasmic genomes. As a step toward understanding the fates of nuclear-encoded subunits that are plastid-targeted, here we examine the retention and expression of the gene encoding the small subunit of Ribulose-1, 5-bisphosphate carboxylase/oxygenase (Rubisco; rbcS) in multiple populations of allotetraploid Tragopogon miscellus (Asteraceae). These polyploids formed recently (~80 years ago) and repeatedly from T. dubius and T. pratensis in the northwestern United States. Examination of 79 T. miscellus individuals from 10 natural populations, as well as 25 synthetic allotetraploids, including reciprocally formed plants, revealed a low percentage of naturally occurring individuals that show a bias in either gene (homeolog) loss (12%) or expression (16%), usually toward maintaining the maternal nuclear copy of rbcS. For individuals showing loss, seven retained the maternally derived rbcS homeolog only, while three had the paternally derived copy. All of the synthetic polyploid individuals examined (S0 and S1 generations) retained and expressed both parental homeologs. These results demonstrate that cytonuclear coordination does not happen immediately upon polyploid formation in Tragopogon miscellus. PMID:26646761

  11. Comparative genome analysis of Pseudogymnoascus spp. reveals primarily clonal evolution with small genome fragments exchanged between lineages.

    Science.gov (United States)

    Leushkin, Evgeny V; Logacheva, Maria D; Penin, Aleksey A; Sutormin, Roman A; Gerasimov, Evgeny S; Kochkina, Galina A; Ivanushkina, Natalia E; Vasilenko, Oleg V; Kondrashov, Alexey S; Ozerskaya, Svetlana M

    2015-05-21

    Pseudogymnoascus spp. is a wide group of fungi lineages in the family Pseudorotiaceae including an aggressive pathogen of bats P. destructans. Although several lineages of P. spp. were shown to produce ascospores in culture, the vast majority of P. spp. demonstrates no evidence of sexual reproduction. P. spp. can tolerate a wide range of different temperatures and salinities and can survive even in permafrost layer. Adaptability of P. spp. to different environments is accompanied by extremely variable morphology and physiology. We sequenced genotypes of 14 strains of P. spp., 5 of which were extracted from permafrost, 1 from a cryopeg, a layer of unfrozen ground in permafrost, and 8 from temperate surface environments. All sequenced genotypes are haploid. Nucleotide diversity among these genomes is very high, with a typical evolutionary distance at synonymous sites dS ≈ 0.5, suggesting that the last common ancestor of these strains lived >50 Mya. The strains extracted from permafrost do not form a separate clade. Instead, each permafrost strain has close relatives from temperate environments. We observed a strictly clonal population structure with no conflicting topologies for ~99% of genome sequences. However, there is a number of short (~100-10,000 nt) genomic segments with the total length of 67.6 Kb which possess phylogenetic patterns strikingly different from the rest of the genome. The most remarkable case is a MAT-locus, which has 2 distinct alleles interspersed along the whole-genome phylogenetic tree. Predominantly clonal structure of genome sequences is consistent with the observations that sexual reproduction is rare in P. spp. Small number of regions with noncanonical phylogenies seem to arise due to some recombination events between derived lineages of P. spp., with MAT-locus being transferred on multiple occasions. All sequenced strains have heterothallic configuration of MAT-locus.

  12. Integration of transcript expression, copy number and LOH analysis of infiltrating ductal carcinoma of the breast

    Directory of Open Access Journals (Sweden)

    Hawthorn Lesleyann

    2010-08-01

    Full Text Available Abstract Background A major challenge in the interpretation of genomic profiling data generated from breast cancer samples is the identification of driver genes as distinct from bystander genes which do not impact tumorigenesis. One way to assess the relative importance of alterations in the transcriptome profile is to combine parallel analyses that assess changes in the copy number alterations (CNAs. This integrated analysis permits the identification of genes with altered expression that map within specific chromosomal regions which demonstrate copy number alterations, providing a mechanistic approach to identify the 'driver genes'. Methods We have performed whole genome analysis of CNAs using the Affymetrix 250K Mapping array on 22 infiltrating ductal carcinoma samples (IDCs. Analysis of transcript expression alterations was performed using the Affymetrix U133 Plus2.0 array on 16 IDC samples. Fourteen IDC samples were analyzed using both platforms and the data integrated. We also incorporated data from loss of heterozygosity (LOH analysis to identify genes showing altered expression in LOH regions. Results Common chromosome gains and amplifications were identified at 1q21.3, 6p21.3, 7p11.2-p12.1, 8q21.11 and 8q24.3. A novel amplicon was identified at 5p15.33. Frequent losses were found at 1p36.22, 8q23.3, 11p13, 11q23, and 22q13. Over 130 genes were identified with concurrent increases or decreases in expression that mapped to these regions of copy number alterations. LOH analysis revealed three tumors with whole chromosome or p arm allelic loss of chromosome 17. Genes were identified that mapped to copy neutral LOH regions. LOH with accompanying copy loss was detected on Xp24 and Xp25 and genes mapping to these regions with decreased expression were identified. Gene expression data highlighted the PPARα/RXRα Activation Pathway as down-regulated in the tumor samples. Conclusion We have demonstrated the utility of the application of

  13. Intragenomic polymorphisms among high-copy loci: a genus-wide study of nuclear ribosomal DNA in Asclepias (Apocynaceae

    Directory of Open Access Journals (Sweden)

    Kevin Weitemier

    2015-01-01

    Full Text Available Despite knowledge that concerted evolution of high-copy loci is often imperfect, studies that investigate the extent of intragenomic polymorphisms and comparisons across a large number of species are rarely made. We present a bioinformatic pipeline for characterizing polymorphisms within an individual among copies of a high-copy locus. Results are presented for nuclear ribosomal DNA (nrDNA across the milkweed genus, Asclepias. The 18S-26S portion of the nrDNA cistron of Asclepias syriaca served as a reference for assembly of the region from 124 samples representing 90 species of Asclepias. Reads were mapped back to each individual’s consensus and at each position reads differing from the consensus were tallied using a custom perl script. Low frequency polymorphisms existed in all individuals (mean = 5.8%. Most nrDNA positions (91% were polymorphic in at least one individual, with polymorphic sites being less frequent in subunit regions and loops. Highly polymorphic sites existed in each individual, with highest abundance in the “noncoding” ITS regions. Phylogenetic signal was present in the distribution of intragenomic polymorphisms across the genus. Intragenomic polymorphisms in nrDNA are common in Asclepias, being found at higher frequency than any other study to date. The high and variable frequency of polymorphisms across species highlights concerns that phylogenetic applications of nrDNA may be error-prone. The new analytical approach provided here is applicable to other taxa and other high-copy regions characterized by low coverage genome sequencing (genome skimming.

  14. High-resolution copy number variation analysis of schizophrenia in Japan.

    Science.gov (United States)

    Kushima, I; Aleksic, B; Nakatochi, M; Shimamura, T; Shiino, T; Yoshimi, A; Kimura, H; Takasaki, Y; Wang, C; Xing, J; Ishizuka, K; Oya-Ito, T; Nakamura, Y; Arioka, Y; Maeda, T; Yamamoto, M; Yoshida, M; Noma, H; Hamada, S; Morikawa, M; Uno, Y; Okada, T; Iidaka, T; Iritani, S; Yamamoto, T; Miyashita, M; Kobori, A; Arai, M; Itokawa, M; Cheng, M-C; Chuang, Y-A; Chen, C-H; Suzuki, M; Takahashi, T; Hashimoto, R; Yamamori, H; Yasuda, Y; Watanabe, Y; Nunokawa, A; Someya, T; Ikeda, M; Toyota, T; Yoshikawa, T; Numata, S; Ohmori, T; Kunimoto, S; Mori, D; Iwata, N; Ozaki, N

    2017-03-01

    Recent schizophrenia (SCZ) studies have reported an increased burden of de novo copy number variants (CNVs) and identified specific high-risk CNVs, although with variable phenotype expressivity. However, the pathogenesis of SCZ has not been fully elucidated. Using array comparative genomic hybridization, we performed a high-resolution genome-wide CNV analysis on a mainly (92%) Japanese population (1699 SCZ cases and 824 controls) and identified 7066 rare CNVs, 70.0% of which were small (history of congenital/developmental phenotypes, and the rate of treatment resistance was significantly higher (odds ratio=2.79, P=0.0036). We found more severe clinical manifestations in patients with two clinically significant CNVs. Gene set analysis replicated previous findings (e.g., synapse, calcium signaling) and identified novel biological pathways including oxidative stress response, genomic integrity, kinase and small GTPase signaling. Furthermore, involvement of multiple SCZ candidate genes and biological pathways in the pathogenesis of SCZ was suggested in established SCZ-associated CNV loci. Our study shows the high genetic heterogeneity of SCZ and its clinical features and raises the possibility that genomic instability is involved in its pathogenesis, which may be related to the increased burden of de novo CNVs and variable expressivity of CNVs.

  15. The importance of copy number variation in congenital heart disease

    Science.gov (United States)

    Costain, Gregory; Silversides, Candice K; Bassett, Anne S

    2016-01-01

    Congenital heart disease (CHD) is the most common class of major malformations in humans. The historical association with large chromosomal abnormalities foreshadowed the role of submicroscopic rare copy number variations (CNVs) as important genetic causes of CHD. Recent studies have provided robust evidence for these structural variants as genome-wide contributors to all forms of CHD, including CHD that appears isolated without extra-cardiac features. Overall, a CNV-related molecular diagnosis can be made in up to one in eight patients with CHD. These include de novo and inherited variants at established (chromosome 22q11.2), emerging (chromosome 1q21.1), and novel loci across the genome. Variable expression of rare CNVs provides support for the notion of a genetic spectrum of CHD that crosses traditional anatomic classification boundaries. Clinical genetic testing using genome-wide technologies (e.g., chromosomal microarray analysis) is increasingly employed in prenatal, paediatric and adult settings. CNV discoveries in CHD have translated to changes to clinical management, prognostication and genetic counselling. The convergence of findings at individual gene and at pathway levels is shedding light on the mechanisms that govern human cardiac morphogenesis. These clinical and research advances are helping to inform whole-genome sequencing, the next logical step in delineating the genetic architecture of CHD. PMID:28706735

  16. Development, optimization, and single laboratory validation of an event-specific real-time PCR method for the detection and quantification of Golden Rice 2 using a novel taxon-specific assay.

    Science.gov (United States)

    Jacchia, Sara; Nardini, Elena; Savini, Christian; Petrillo, Mauro; Angers-Loustau, Alexandre; Shim, Jung-Hyun; Trijatmiko, Kurniawan; Kreysa, Joachim; Mazzara, Marco

    2015-02-18

    In this study, we developed, optimized, and in-house validated a real-time PCR method for the event-specific detection and quantification of Golden Rice 2, a genetically modified rice with provitamin A in the grain. We optimized and evaluated the performance of the taxon (targeting rice Phospholipase D α2 gene)- and event (targeting the 3' insert-to-plant DNA junction)-specific assays that compose the method as independent modules, using haploid genome equivalents as unit of measurement. We verified the specificity of the two real-time PCR assays and determined their dynamic range, limit of quantification, limit of detection, and robustness. We also confirmed that the taxon-specific DNA sequence is present in single copy in the rice genome and verified its stability of amplification across 132 rice varieties. A relative quantification experiment evidenced the correct performance of the two assays when used in combination.

  17. Low AMY1 Gene Copy Number Is Associated with Increased Body Mass Index in Prepubertal Boys.

    Directory of Open Access Journals (Sweden)

    M Loredana Marcovecchio

    Full Text Available Genome-wide association studies have identified more than 60 single nucleotide polymorphisms associated with Body Mass Index (BMI. Additional genetic variants, such as copy number variations (CNV, have also been investigated in relation to BMI. Recently, the highly polymorphic CNV in the salivary amylase (AMY1 gene, encoding an enzyme implicated in the first step of starch digestion, has been associated with obesity in adults and children. We assessed the potential association between AMY1 copy number and a wide range of BMI in a population of Italian school-children.744 children (354 boys, 390 girls, mean age (±SD: 8.4±1.4years underwent anthropometric assessments (height, weight and collection of saliva samples for DNA extraction. AMY1 copies were evaluated by quantitative PCR.A significant increase of BMI z-score by decreasing AMY1 copy number was observed in boys (β: -0.117, p = 0.033, but not in girls. Similarly, waist circumference (β: -0.155, p = 0.003, adjusted for age was negatively influenced by AMY1 copy number in boys. Boys with 8 or more AMY1 copy numbers presented a significant lower BMI z-score (p = 0.04 and waist circumference (p = 0.01 when compared to boys with less than 8 copy numbers.In this pediatric-only, population-based study, a lower AMY1 copy number emerged to be associated with increased BMI in boys. These data confirm previous findings from adult studies and support a potential role of a higher copy number of the salivary AMY1 gene in protecting from excess weight gain.

  18. Nature and distribution of feline sarcoma virus nucleotide sequences.

    Science.gov (United States)

    Frankel, A E; Gilbert, J H; Porzig, K J; Scolnick, E M; Aaronson, S A

    1979-01-01

    The genomes of three independent isolates of feline sarcoma virus (FeSV) were compared by molecular hybridization techniques. Using complementary DNAs prepared from two strains, SM- and ST-FeSV, common complementary DNA'S were selected by sequential hybridization to FeSV and feline leukemia virus RNAs. These DNAs were shown to be highly related among the three independent sarcoma virus isolates. FeSV-specific complementary DNAs were prepared by selection for hybridization by the homologous FeSV RNA and against hybridization by fline leukemia virus RNA. Sarcoma virus-specific sequences of SM-FeSV were shown to differ from those of either ST- or GA-FeSV strains, whereas ST-FeSV-specific DNA shared extensive sequence homology with GA-FeSV. By molecular hybridization, each set of FeSV-specific sequences was demonstrated to be present in normal cat cellular DNA in approximately one copy per haploid genome and was conserved throughout Felidae. In contrast, FeSV-common sequences were present in multiple DNA copies and were found only in Mediterranean cats. The present results are consistent with the concept that each FeSV strain has arisen by a mechanism involving recombination between feline leukemia virus and cat cellular DNA sequences, the latter represented within the cat genome in a manner analogous to that of a cellular gene. PMID:225544

  19. Novel candidate genes and regions for childhood apraxia of speech identified by array comparative genomic hybridization.

    Science.gov (United States)

    Laffin, Jennifer J S; Raca, Gordana; Jackson, Craig A; Strand, Edythe A; Jakielski, Kathy J; Shriberg, Lawrence D

    2012-11-01

    The goal of this study was to identify new candidate genes and genomic copy-number variations associated with a rare, severe, and persistent speech disorder termed childhood apraxia of speech. Childhood apraxia of speech is the speech disorder segregating with a mutation in FOXP2 in a multigenerational London pedigree widely studied for its role in the development of speech-language in humans. A total of 24 participants who were suspected to have childhood apraxia of speech were assessed using a comprehensive protocol that samples speech in challenging contexts. All participants met clinical-research criteria for childhood apraxia of speech. Array comparative genomic hybridization analyses were completed using a customized 385K Nimblegen array (Roche Nimblegen, Madison, WI) with increased coverage of genes and regions previously associated with childhood apraxia of speech. A total of 16 copy-number variations with potential consequences for speech-language development were detected in 12 or half of the 24 participants. The copy-number variations occurred on 10 chromosomes, 3 of which had two to four candidate regions. Several participants were identified with copy-number variations in two to three regions. In addition, one participant had a heterozygous FOXP2 mutation and a copy-number variation on chromosome 2, and one participant had a 16p11.2 microdeletion and copy-number variations on chromosomes 13 and 14. Findings support the likelihood of heterogeneous genomic pathways associated with childhood apraxia of speech.

  20. Whole-genome analysis of a patient with early-stage small-cell lung cancer.

    Science.gov (United States)

    Han, J-Y; Lee, Y-S; Kim, B C; Lee, G K; Lee, S; Kim, E-H; Kim, H-M; Bhak, J

    2014-12-01

    We performed whole-genome sequencing (WGS) of a case of early-stage small-cell lung cancer (SCLC) to analyze the genomic features. WGS revealed a lot of single-nucleotide variations (SNVs), small insertion/deletions and chromosomal abnormality. Chromosomes 4p, 5q, 13q, 15q, 17p and 22q contained many block deletions. Especially, copy loss was observed in tumor suppressor genes RB1 and TP53, and copy gain in oncogene hTERT. Somatic mutations were found in TP53 and CREBBP. Novel nonsynonymous (ns) SNVs in C6ORF103 and SLC5A4 genes were also found. Sanger sequencing of the SLC5A4 gene in 23 independent SCLC samples showed another nsSNV in the SLC5A4 gene, indicating that nsSNVs in the SLC5A4 gene are recurrent in SCLC. WGS of an early-stage SCLC identified novel recurrent mutations and validated known variations, including copy number variations. These findings provide insight into the genomic landscape contributing to SCLC development.

  1. Diversity in copy number and structure of a silkworm morphogenetic gene as a result of domestication.

    Science.gov (United States)

    Sakudoh, Takashi; Nakashima, Takeharu; Kuroki, Yoko; Fujiyama, Asao; Kohara, Yuji; Honda, Naoko; Fujimoto, Hirofumi; Shimada, Toru; Nakagaki, Masao; Banno, Yutaka; Tsuchida, Kozo

    2011-03-01

    The carotenoid-binding protein (CBP) of the domesticated silkworm, Bombyx mori, a major determinant of cocoon color, is likely to have been substantially influenced by domestication of this species. We analyzed the structure of the CBP gene in multiple strains of B. mori, in multiple individuals of the wild silkworm, B. mandarina (the putative wild ancestor of B. mori), and in a number of other lepidopterans. We found the CBP gene copy number in genomic DNA to vary widely among B. mori strains, ranging from 1 to 20. The copies of CBP are of several types, based on the presence of a retrotransposon or partial deletion of the coding sequence. In contrast to B. mori, B. mandarina was found to possess a single copy of CBP without the retrotransposon insertion, regardless of habitat. Several other lepidopterans were found to contain sequences homologous to CBP, revealing that this gene is evolutionarily conserved in the lepidopteran lineage. Thus, domestication can generate significant diversity of gene copy number and structure over a relatively short evolutionary time. © 2011 by the Genetics Society of America

  2. Genome size estimation: a new methodology

    Science.gov (United States)

    Álvarez-Borrego, Josué; Gallardo-Escárate, Crisitian; Kober, Vitaly; López-Bonilla, Oscar

    2007-03-01

    Recently, within the cytogenetic analysis, the evolutionary relations implied in the content of nuclear DNA in plants and animals have received a great attention. The first detailed measurements of the nuclear DNA content were made in the early 40's, several years before Watson and Crick proposed the molecular structure of the DNA. In the following years Hewson Swift developed the concept of "C-value" in reference to the haploid phase of DNA in plants. Later Mirsky and Ris carried out the first systematic study of genomic size in animals, including representatives of the five super classes of vertebrates as well as of some invertebrates. From these preliminary results it became evident that the DNA content varies enormously between the species and that this variation does not bear relation to the intuitive notion from the complexity of the organism. Later, this observation was reaffirmed in the following years as the studies increased on genomic size, thus denominating to this characteristic of the organisms like the "Paradox of the C-value". Few years later along with the no-codification discovery of DNA the paradox was solved, nevertheless, numerous questions remain until nowadays unfinished, taking to denominate this type of studies like the "C-value enigma". In this study, we reported a new method for genome size estimation by quantification of fluorescence fading. We measured the fluorescence intensity each 1600 milliseconds in DAPI-stained nuclei. The estimation of the area under the graph (integral fading) during fading period was related with the genome size.

  3. A genome-wide investigation of copy number variation in patients with sporadic brain arteriovenous malformation.

    Directory of Open Access Journals (Sweden)

    Nasrine Bendjilali

    Full Text Available Brain arteriovenous malformations (BAVM are clusters of abnormal blood vessels, with shunting of blood from the arterial to venous circulation and a high risk of rupture and intracranial hemorrhage. Most BAVMs are sporadic, but also occur in patients with Hereditary Hemorrhagic Telangiectasia, a Mendelian disorder caused by mutations in genes in the transforming growth factor beta (TGFβ signaling pathway.To investigate whether copy number variations (CNVs contribute to risk of sporadic BAVM, we performed a genome-wide association study in 371 sporadic BAVM cases and 563 healthy controls, all Caucasian. Cases and controls were genotyped using the Affymetrix 6.0 array. CNVs were called using the PennCNV and Birdsuite algorithms and analyzed via segment-based and gene-based approaches. Common and rare CNVs were evaluated for association with BAVM.A CNV region on 1p36.13, containing the neuroblastoma breakpoint family, member 1 gene (NBPF1, was significantly enriched with duplications in BAVM cases compared to controls (P = 2.2×10(-9; NBPF1 was also significantly associated with BAVM in gene-based analysis using both PennCNV and Birdsuite. We experimentally validated the 1p36.13 duplication; however, the association did not replicate in an independent cohort of 184 sporadic BAVM cases and 182 controls (OR = 0.81, P = 0.8. Rare CNV analysis did not identify genes significantly associated with BAVM.We did not identify common CNVs associated with sporadic BAVM that replicated in an independent cohort. Replication in larger cohorts is required to elucidate the possible role of common or rare CNVs in BAVM pathogenesis.

  4. The Art of Copying

    DEFF Research Database (Denmark)

    Christensen, Hans Dam

    2017-01-01

    This article discusses copies within the field of art museums by way of mapping strategies for copy practices. This mapping leans heavily towards parts of the writings of Jacques Derrida (1930–2004). Against the backdrop of this theoretical premise, the article distinguishes five main strategies....... An informational copy is just as unique as an original object of art, and at the same time, it defines the original and is itself defined by this opposition. Lastly, the strategy for the imagined relation between original and copy follows. This strategy is dependent upon several of the previous approaches, and...

  5. Discovery of previously unidentified genomic disorders from the duplication architecture of the human genome.

    Science.gov (United States)

    Sharp, Andrew J; Hansen, Sierra; Selzer, Rebecca R; Cheng, Ze; Regan, Regina; Hurst, Jane A; Stewart, Helen; Price, Sue M; Blair, Edward; Hennekam, Raoul C; Fitzpatrick, Carrie A; Segraves, Rick; Richmond, Todd A; Guiver, Cheryl; Albertson, Donna G; Pinkel, Daniel; Eis, Peggy S; Schwartz, Stuart; Knight, Samantha J L; Eichler, Evan E

    2006-09-01

    Genomic disorders are characterized by the presence of flanking segmental duplications that predispose these regions to recurrent rearrangement. Based on the duplication architecture of the genome, we investigated 130 regions that we hypothesized as candidates for previously undescribed genomic disorders. We tested 290 individuals with mental retardation by BAC array comparative genomic hybridization and identified 16 pathogenic rearrangements, including de novo microdeletions of 17q21.31 found in four individuals. Using oligonucleotide arrays, we refined the breakpoints of this microdeletion, defining a 478-kb critical region containing six genes that were deleted in all four individuals. We mapped the breakpoints of this deletion and of four other pathogenic rearrangements in 1q21.1, 15q13, 15q24 and 17q12 to flanking segmental duplications, suggesting that these are also sites of recurrent rearrangement. In common with the 17q21.31 deletion, these breakpoint regions are sites of copy number polymorphism in controls, indicating that these may be inherently unstable genomic regions.

  6. Robust Adaptable Video Copy Detection

    DEFF Research Database (Denmark)

    Assent, Ira; Kremer, Hardy

    2009-01-01

    in contrast). Our query processing combines filtering and indexing structures for efficient multistep computation of video copies under this model. We show that our model successfully identifies altered video copies and does so more reliably than existing models.......Video copy detection should be capable of identifying video copies subject to alterations e.g. in video contrast or frame rates. We propose a video copy detection scheme that allows for adaptable detection of videos that are altered temporally (e.g. frame rate change) and/or visually (e.g. change...

  7. A sparse regulatory network of copy-number driven gene expression reveals putative breast cancer oncogenes.

    Science.gov (United States)

    Yuan, Yinyin; Curtis, Christina; Caldas, Carlos; Markowetz, Florian

    2012-01-01

    Copy number aberrations are recognized to be important in cancer as they may localize to regions harboring oncogenes or tumor suppressors. Such genomic alterations mediate phenotypic changes through their impact on expression. Both cis- and transacting alterations are important since they may help to elucidate putative cancer genes. However, amidst numerous passenger genes, trans-effects are less well studied due to the computational difficulty in detecting weak and sparse signals in the data, and yet may influence multiple genes on a global scale. We propose an integrative approach to learn a sparse interaction network of DNA copy-number regions with their downstream transcriptional targets in breast cancer. With respect to goodness of fit on both simulated and real data, the performance of sparse network inference is no worse than other state-of-the-art models but with the advantage of simultaneous feature selection and efficiency. The DNA-RNA interaction network helps to distinguish copy-number driven expression alterations from those that are copy-number independent. Further, our approach yields a quantitative copy-number dependency score, which distinguishes cis- versus trans-effects. When applied to a breast cancer data set, numerous expression profiles were impacted by cis-acting copy-number alterations, including several known oncogenes such as GRB7, ERBB2, and LSM1. Several trans-acting alterations were also identified, impacting genes such as ADAM2 and BAGE, which warrant further investigation. An R package named lol is available from www.markowetzlab.org/software/lol.html.

  8. Bead-probe complex capture a couple of SINE and LINE family from genomes of two closely related species of East Asian cyprinid directly using magnetic separation

    Science.gov (United States)

    Tong, Chaobo; Guo, Baocheng; He, Shunping

    2009-01-01

    Background Short and long interspersed elements (SINEs and LINEs, respectively), two types of retroposons, are active in shaping the architecture of genomes and powerful tools for studies of phylogeny and population biology. Here we developed special protocol to apply biotin-streptavidin bead system into isolation of interspersed repeated sequences rapidly and efficiently, in which SINEs and LINEs were captured directly from digested genomic DNA by hybridization to bead-probe complex in solution instead of traditional strategy including genomic library construction and screening. Results A new couple of SINEs and LINEs that shared an almost identical 3'tail was isolated and characterized in silver carp and bighead carp of two closely related species. These SINEs (34 members), designated HAmo SINE family, were little divergent in sequence and flanked by obvious TSD indicated that HAmo SINE was very young family. The copy numbers of this family was estimated to 2 × 105 and 1.7 × 105 per haploid genome by Real-Time qPCR, respectively. The LINEs, identified as the homologs of LINE2 in other fishes, had a conserved primary sequence and secondary structures of the 3'tail region that was almost identical to that of HAmo SINE. These evidences suggest that HAmo SINEs are active and amplified recently utilizing the enzymatic machinery for retroposition of HAmoL2 through the recognition of higher-order structures of the conserved 42-tail region. We analyzed the possible structures of HAmo SINE that lead to successful amplification in genome and then deduced that HAmo SINE, SmaI SINE and FokI SINE that were similar in sequence each other, were probably generated independently and created by LINE family within the same lineage of a LINE phylogeny in the genomes of different hosts. Conclusion The presented results show the advantage of the novel method for retroposons isolation and a pair of young SINE family and its partner LINE family in two carp fishes, which strengthened

  9. Bead-probe complex capture a couple of SINE and LINE family from genomes of two closely related species of East Asian cyprinid directly using magnetic separation

    Directory of Open Access Journals (Sweden)

    Guo Baocheng

    2009-02-01

    Full Text Available Abstract Background Short and long interspersed elements (SINEs and LINEs, respectively, two types of retroposons, are active in shaping the architecture of genomes and powerful tools for studies of phylogeny and population biology. Here we developed special protocol to apply biotin-streptavidin bead system into isolation of interspersed repeated sequences rapidly and efficiently, in which SINEs and LINEs were captured directly from digested genomic DNA by hybridization to bead-probe complex in solution instead of traditional strategy including genomic library construction and screening. Results A new couple of SINEs and LINEs that shared an almost identical 3'tail was isolated and characterized in silver carp and bighead carp of two closely related species. These SINEs (34 members, designated HAmo SINE family, were little divergent in sequence and flanked by obvious TSD indicated that HAmo SINE was very young family. The copy numbers of this family was estimated to 2 × 105 and 1.7 × 105 per haploid genome by Real-Time qPCR, respectively. The LINEs, identified as the homologs of LINE2 in other fishes, had a conserved primary sequence and secondary structures of the 3'tail region that was almost identical to that of HAmo SINE. These evidences suggest that HAmo SINEs are active and amplified recently utilizing the enzymatic machinery for retroposition of HAmoL2 through the recognition of higher-order structures of the conserved 42-tail region. We analyzed the possible structures of HAmo SINE that lead to successful amplification in genome and then deduced that HAmo SINE, SmaI SINE and FokI SINE that were similar in sequence each other, were probably generated independently and created by LINE family within the same lineage of a LINE phylogeny in the genomes of different hosts. Conclusion The presented results show the advantage of the novel method for retroposons isolation and a pair of young SINE family and its partner LINE family in two carp

  10. The complete chloroplast genome of Capsicum annuum var. glabriusculum using Illumina sequencing.

    Science.gov (United States)

    Raveendar, Sebastin; Na, Young-Wang; Lee, Jung-Ro; Shim, Donghwan; Ma, Kyung-Ho; Lee, Sok-Young; Chung, Jong-Wook

    2015-07-20

    Chloroplast (cp) genome sequences provide a valuable source for DNA barcoding. Molecular phylogenetic studies have concentrated on DNA sequencing of conserved gene loci. However, this approach is time consuming and more difficult to implement when gene organization differs among species. Here we report the complete re-sequencing of the cp genome of Capsicum pepper (Capsicum annuum var. glabriusculum) using the Illumina platform. The total length of the cp genome is 156,817 bp with a 37.7% overall GC content. A pair of inverted repeats (IRs) of 50,284 bp were separated by a small single copy (SSC; 18,948 bp) and a large single copy (LSC; 87,446 bp). The number of cp genes in C. annuum var. glabriusculum is the same as that in other Capsicum species. Variations in the lengths of LSC; SSC and IR regions were the main contributors to the size variation in the cp genome of this species. A total of 125 simple sequence repeat (SSR) and 48 insertions or deletions variants were found by sequence alignment of Capsicum cp genome. These findings provide a foundation for further investigation of cp genome evolution in Capsicum and other higher plants.

  11. Comparative Genome Analysis of Lolium-Festuca Complex Species

    DEFF Research Database (Denmark)

    Czaban, Adrian; Byrne, Stephen; Sharma, Sapna

    2015-01-01

    , winter hardiness, drought tolerance and resistance to grazing. In this study we have sequenced and assembled the low copy fraction of the genomes of Lolium westerwoldicum, Lolium multiflorum, Festuca pratensis and Lolium temulentum. We have also generated de-novo transcriptome assemblies for each species......, and these have aided in the annotation of the genomic sequence. Using this data we were able to generate annotated assemblies of the gene rich regions of the four species to complement the already sequenced Lolium perenne genome. Using these gene models we have identified orthologous genes between the species...

  12. A Near-Complete Haplotype-Phased Genome of the Dikaryotic Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici Reveals High Interhaplotype Diversity.

    Science.gov (United States)

    Schwessinger, Benjamin; Sperschneider, Jana; Cuddy, William S; Garnica, Diana P; Miller, Marisa E; Taylor, Jennifer M; Dodds, Peter N; Figueroa, Melania; Park, Robert F; Rathjen, John P

    2018-02-20

    A long-standing biological question is how evolution has shaped the genomic architecture of dikaryotic fungi. To answer this, high-quality genomic resources that enable haplotype comparisons are essential. Short-read genome assemblies for dikaryotic fungi are highly fragmented and lack haplotype-specific information due to the high heterozygosity and repeat content of these genomes. Here, we present a diploid-aware assembly of the wheat stripe rust fungus Puccinia striiformis f. sp. tritici based on long reads using the FALCON-Unzip assembler. Transcriptome sequencing data sets were used to infer high-quality gene models and identify virulence genes involved in plant infection referred to as effectors. This represents the most complete Puccinia striiformis f. sp. tritici genome assembly to date (83 Mb, 156 contigs, N 50 of 1.5 Mb) and provides phased haplotype information for over 92% of the genome. Comparisons of the phase blocks revealed high interhaplotype diversity of over 6%. More than 25% of all genes lack a clear allelic counterpart. When we investigated genome features that potentially promote the rapid evolution of virulence, we found that candidate effector genes are spatially associated with conserved genes commonly found in basidiomycetes. Yet, candidate effectors that lack an allelic counterpart are more distant from conserved genes than allelic candidate effectors and are less likely to be evolutionarily conserved within the P. striiformis species complex and Pucciniales In summary, this haplotype-phased assembly enabled us to discover novel genome features of a dikaryotic plant-pathogenic fungus previously hidden in collapsed and fragmented genome assemblies. IMPORTANCE Current representations of eukaryotic microbial genomes are haploid, hiding the genomic diversity intrinsic to diploid and polyploid life forms. This hidden diversity contributes to the organism's evolutionary potential and ability to adapt to stress conditions. Yet, it is

  13. DR-Integrator: a new analytic tool for integrating DNA copy number and gene expression data.

    Science.gov (United States)

    Salari, Keyan; Tibshirani, Robert; Pollack, Jonathan R

    2010-02-01

    DNA copy number alterations (CNA) frequently underlie gene expression changes by increasing or decreasing gene dosage. However, only a subset of genes with altered dosage exhibit concordant changes in gene expression. This subset is likely to be enriched for oncogenes and tumor suppressor genes, and can be identified by integrating these two layers of genome-scale data. We introduce DNA/RNA-Integrator (DR-Integrator), a statistical software tool to perform integrative analyses on paired DNA copy number and gene expression data. DR-Integrator identifies genes with significant correlations between DNA copy number and gene expression, and implements a supervised analysis that captures genes with significant alterations in both DNA copy number and gene expression between two sample classes. DR-Integrator is freely available for non-commercial use from the Pollack Lab at http://pollacklab.stanford.edu/ and can be downloaded as a plug-in application to Microsoft Excel and as a package for the R statistical computing environment. The R package is available under the name 'DRI' at http://cran.r-project.org/. An example analysis using DR-Integrator is included as supplemental material. Supplementary data are available at Bioinformatics online.

  14. Prognostic Impact of Array-based Genomic Profiles in Esophageal Squamous Cell Cancer

    International Nuclear Information System (INIS)

    Carneiro, Ana; Isinger, Anna; Karlsson, Anna; Johansson, Jan; Jönsson, Göran; Bendahl, Pär-Ola; Falkenback, Dan; Halvarsson, Britta; Nilbert, Mef

    2008-01-01

    Esophageal squamous cell carcinoma (ESCC) is a genetically complex tumor type and a major cause of cancer related mortality. Although distinct genetic alterations have been linked to ESCC development and prognosis, the genetic alterations have not gained clinical applicability. We applied array-based comparative genomic hybridization (aCGH) to obtain a whole genome copy number profile relevant for identifying deranged pathways and clinically applicable markers. A 32 k aCGH platform was used for high resolution mapping of copy number changes in 30 stage I-IV ESCC. Potential interdependent alterations and deranged pathways were identified and copy number changes were correlated to stage, differentiation and survival. Copy number alterations affected median 19% of the genome and included recurrent gains of chromosome regions 5p, 7p, 7q, 8q, 10q, 11q, 12p, 14q, 16p, 17p, 19p, 19q, and 20q and losses of 3p, 5q, 8p, 9p and 11q. High-level amplifications were observed in 30 regions and recurrently involved 7p11 (EGFR), 11q13 (MYEOV, CCND1, FGF4, FGF3, PPFIA, FAD, TMEM16A, CTTS and SHANK2) and 11q22 (PDFG). Gain of 7p22.3 predicted nodal metastases and gains of 1p36.32 and 19p13.3 independently predicted poor survival in multivariate analysis. aCGH profiling verified genetic complexity in ESCC and herein identified imbalances of multiple central tumorigenic pathways. Distinct gains correlate with clinicopathological variables and independently predict survival, suggesting clinical applicability of genomic profiling in ESCC

  15. New Genome Similarity Measures based on Conserved Gene Adjacencies.

    Science.gov (United States)

    Doerr, Daniel; Kowada, Luis Antonio B; Araujo, Eloi; Deshpande, Shachi; Dantas, Simone; Moret, Bernard M E; Stoye, Jens

    2017-06-01

    Many important questions in molecular biology, evolution, and biomedicine can be addressed by comparative genomic approaches. One of the basic tasks when comparing genomes is the definition of measures of similarity (or dissimilarity) between two genomes, for example, to elucidate the phylogenetic relationships between species. The power of different genome comparison methods varies with the underlying formal model of a genome. The simplest models impose the strong restriction that each genome under study must contain the same genes, each in exactly one copy. More realistic models allow several copies of a gene in a genome. One speaks of gene families, and comparative genomic methods that allow this kind of input are called gene family-based. The most powerful-but also most complex-models avoid this preprocessing of the input data and instead integrate the family assignment within the comparative analysis. Such methods are called gene family-free. In this article, we study an intermediate approach between family-based and family-free genomic similarity measures. Introducing this simpler model, called gene connections, we focus on the combinatorial aspects of gene family-free genome comparison. While in most cases, the computational costs to the general family-free case are the same, we also find an instance where the gene connections model has lower complexity. Within the gene connections model, we define three variants of genomic similarity measures that have different expression powers. We give polynomial-time algorithms for two of them, while we show NP-hardness for the third, most powerful one. We also generalize the measures and algorithms to make them more robust against recent local disruptions in gene order. Our theoretical findings are supported by experimental results, proving the applicability and performance of our newly defined similarity measures.

  16. Sex bias in copy number variation of olfactory receptor gene family depends on ethnicity

    Directory of Open Access Journals (Sweden)

    Farideh eShadravan

    2013-03-01

    Full Text Available Gender plays a pivotal role in the human genetic identity and is also manifested in many genetic disorders particularly mental retardation. In this study its effect on copy number variation (CNV, known to cause genetic disorders was explored. As the olfactory receptor (OR repertoire comprises the largest human gene family, it was selected for this study, which was carried out within and between three populations, derived from 150 individuals from the 1000 Genome Project. Analysis of 3872 CNVs detected among 791 OR loci, in which 307 loci showed CNV, revealed the following novel findings: Sex bias in CNV was significantly more prevalent in uncommon than common CNV variants of OR pseudogenes, in which the male genome showed more CNVs; and in one-copy number loss compared to complete deletion of OR pseudogenes; both findings implying a more recent evolutionary role for gender. Sex bias in copy number gain was also detected. Another novel finding was that the observed six bias was largely dependent on ethnicity and was in general absent in East Asians. Using a CNV public database for sick children (ISCA the application of these findings for improving clinical molecular diagnostics is discussed by showing an example of sex bias in CNV among kids with autism. Additional clinical relevance is discussed, as the most polymorphic CNV-enriched OR cluster in the human genome, located on chr 15q11.2, is found near the PWS/AS bi-directionally imprinted region associated with two well-known mental retardation syndromes. As olfaction represents the primitive cognition in most mammals, arguably in competition with the development of a larger brain, the extensive retention of OR pseudogenes in females of this study, might point to a parent-of-origin indirect regulatory role for OR pseudogenes in the embryonic development of human brain. Thus any perturbation in the temporal regulation of olfactory system could lead to developmental delay disorders including

  17. DNA Replication in Engineered Escherichia coli Genomes with Extra Replication Origins.

    Science.gov (United States)

    Milbredt, Sarah; Farmani, Neda; Sobetzko, Patrick; Waldminghaus, Torsten

    2016-10-21

    The standard outline of bacterial genomes is a single circular chromosome with a single replication origin. From the bioengineering perspective, it appears attractive to extend this basic setup. Bacteria with split chromosomes or multiple replication origins have been successfully constructed in the last few years. The characteristics of these engineered strains will largely depend on the respective DNA replication patterns. However, the DNA replication has not been investigated systematically in engineered bacteria with multiple origins or split replicons. Here we fill this gap by studying a set of strains consisting of (i) E. coli strains with an extra copy of the native replication origin (oriC), (ii) E. coli strains with an extra copy of the replication origin from the secondary chromosome of Vibrio cholerae (oriII), and (iii) a strain in which the E. coli chromosome is split into two linear replicons. A combination of flow cytometry, microarray-based comparative genomic hybridization (CGH), and modeling revealed silencing of extra oriC copies and differential timing of ectopic oriII copies compared to the native oriC. The results were used to derive construction rules for future multiorigin and multireplicon projects.

  18. Multiply to conquer: Copy number variations at Ppd-B1 and Vrn-A1 facilitate global adaptation in wheat.

    Science.gov (United States)

    Würschum, Tobias; Boeven, Philipp H G; Langer, Simon M; Longin, C Friedrich H; Leiser, Willmar L

    2015-07-29

    Copy number variation was found to be a frequent type of DNA polymorphism in the human genome often associated with diseases but its importance in crops and the effects on agronomic traits are still largely unknown. Here, we employed a large worldwide panel of 1110 winter wheat varieties to assess the frequency and the geographic distribution of copy number variants at the Photoperiod-B1 (Ppd-B1) and the Vernalization-A1 (Vrn-A1) loci as well as their effects on flowering time under field conditions. We identified a novel four copy variant of Vrn-A1 and based on the phylogenetic relationships among the lines show that the higher copy variants at both loci are likely to have arisen independently multiple times. In addition, we found that the frequency of the different copy number variants at both loci reflects the environmental conditions in the varieties' region of origin and based on multi-location field trials show that Ppd-B1 copy number has a substantial effect on the fine-tuning of flowering time. In conclusion, our results show the importance of copy number variation at Ppd-B1 and Vrn-A1 for the global adaptation of wheat making it a key factor for wheat success in a broad range of environments and in a wider context substantiate the significant role of copy number variation in crops.

  19. Copy-Number Disorders Are a Common Cause of Congenital Kidney Malformations

    OpenAIRE

    Sanna-Cherchi, Simone; Kiryluk, Krzysztof; Burgess, Katelyn E.; Bodria, Monica; Sampson, Matthew G.; Hadley, Dexter; Nees, Shannon N.; Verbitsky, Miguel; Perry, Brittany J.; Sterken, Roel; Lozanovski, Vladimir J.; Materna-Kiryluk, Anna; Barlassina, Cristina; Kini, Akshata; Corbani, Valentina

    2012-01-01

    We examined the burden of large, rare, copy-number variants (CNVs) in 192 individuals with renal hypodysplasia (RHD) and replicated findings in 330 RHD cases from two independent cohorts. CNV distribution was significantly skewed toward larger gene-disrupting events in RHD cases compared to 4,733 ethnicity-matched controls (p = 4.8 × 10−11). This excess was attributable to known and novel (i.e., not present in any database or in the literature) genomic disorders. All together, 55/522 (10.5%) ...

  20. Genome sequence of the endosymbiont Rickettsia peacockii and comparison with virulent Rickettsia rickettsii: identification of virulence factors.

    Directory of Open Access Journals (Sweden)

    Roderick F Felsheim

    2009-12-01

    Full Text Available Rickettsia peacockii, also known as the East Side Agent, is a non-pathogenic obligate intracellular bacterium found as an endosymbiont in Dermacentor andersoni ticks in the western USA and Canada. Its presence in ticks is correlated with reduced prevalence of Rickettsia rickettsii, the agent of Rocky Mountain Spotted Fever. It has been proposed that a virulent SFG rickettsia underwent changes to become the East Side Agent. We determined the genome sequence of R. peacockii and provide a comparison to a closely related virulent R. rickettsii. The presence of 42 chromosomal copies of the ISRpe1 transposon in the genome of R. peacockii is associated with a lack of synteny with the genome of R. rickettsii and numerous deletions via recombination between transposon copies. The plasmid contains a number of genes from distantly related organisms, such as part of the glycosylation island of Pseudomonas aeruginosa. Genes deleted or mutated in R. peacockii which may relate to loss of virulence include those coding for an ankyrin repeat containing protein, DsbA, RickA, protease II, OmpA, ScaI, and a putative phosphoethanolamine transferase. The gene coding for the ankyrin repeat containing protein is especially implicated as it is mutated in R. rickettsii strain Iowa, which has attenuated virulence. Presence of numerous copies of the ISRpe1 transposon, likely acquired by lateral transfer from a Cardinium species, are associated with extensive genomic reorganization and deletions. The deletion and mutation of genes possibly involved in loss of virulence have been identified by this genomic comparison. It also illustrates that the introduction of a transposon into the genome can have varied effects; either correlating with an increase in pathogenicity as in Francisella tularensis or a loss of pathogenicity as in R. peacockii and the recombination enabled by multiple transposon copies can cause significant deletions in some genomes while not in others.

  1. Whole Genome and Tandem Duplicate Retention facilitated Glucosinolate Pathway Diversification in the Mustard Family.

    NARCIS (Netherlands)

    Hofberger, J.A.; Lyons, E.; Edger, P.P.; Pires, J.C.; Schranz, M.E.

    2013-01-01

    Plants share a common history of successive whole genome duplication (WGD) events retaining genomic patterns of duplicate gene copies (ohnologs) organized in conserved syntenic blocks. Duplication was often proposed to affect the origin of novel traits during evolution. However, genetic evidence

  2. Comprehensive analysis of copy number aberrations in microsatellite stable colon cancer in view of stromal component.

    Science.gov (United States)

    Alonso, M Henar; Aussó, Susanna; Lopez-Doriga, Adriana; Cordero, David; Guinó, Elisabet; Solé, Xavier; Barenys, Mercè; de Oca, Javier; Capella, Gabriel; Salazar, Ramón; Sanz-Pamplona, Rebeca; Moreno, Victor

    2017-07-25

    Somatic copy number aberrations (CNAs) are common acquired changes in cancer cells having an important role in the progression of colon cancer (colorectal cancer, CRC). This study aimed to perform a characterisation of CNA and their impact in gene expression. Copy number aberrations were inferred from SNP array data in a series of 99 CRC. Copy number aberration events were calculated and used to assess the association between copy number dosage, clinical and molecular characteristics of the tumours, and gene expression changes. All analyses were adjusted for the quantity of stroma in each sample, which was inferred from gene expression data. High heterogeneity among samples was observed; the proportion of altered genome ranged between 0.04 and 26.6%. Recurrent CNA regions with gains were frequent in chromosomes 7p, 8q, 13q, and 20, whereas 8p, 17p, and 18 cumulated losses. A significant positive correlation was observed between the number of somatic mutations and total CNA (Spearman's r=0.42, P=0.006). Approximately 37% of genes located in CNA regions changed their level of expression and the average partial correlation (adjusted for stromal content) with copy number was 0.54 (interquartile range 0.20 to 0.81). Altered genes showed enrichment in pathways relevant for CRC. Tumours classified as CMS2 and CMS4 by the consensus molecular subtyping showed higher frequency of CNA. Losses of one small region in 1p36.33, with gene CDK11B, were associated with poor prognosis. More than 66% of the recurrent CNA were validated in the The Cancer Genome Atlas (TCGA) data when analysed with the same procedure. Furthermore, 79% of the genes with altered expression in our data were validated in the TCGA. Although CNA are frequent events in microsatellite stable CRC, few focal recurrent regions were found. These aberrations have strong effects on gene expression and contribute to deregulate relevant cancer pathways. Owing to the diploid nature of stromal cells, it is important to

  3. Apomixis and the problem of obtaining haploids and homozygote diploids in pear (Pyrus communis L.

    Directory of Open Access Journals (Sweden)

    Є. О. Долматов

    2013-02-01

    Full Text Available The article highlights results of research over simulative apomixes in pear and its utilization for obtaining haploids and homozygote diploids. It has been established that over 50% pear varieties with failed remote hybridization are capable of generating seeds of apomictic origin producing diploid plants. Genotypes displaying maximal inclination to apomixes have been singled out. Apomictic pear seedlings obtained from foreign pollination within the limits of the same combination are inherent in profound morphological diversity. Fruit-bearing apomicts originated from one and the same maternal plant differ to the same extent as hybrid seedlings of the same family. Genetic markers have enabled to establish that these are embryo sacs in which meiosis has completed that give rise to apomictic seeds. In vitro method as used for the purpose of increasing apomictic plants output has been illustrated. The greatest induction of apomictic shoots in vitro has been reached by alternation of BAP cytokinin at concentration of 1mg/l and 2 mg/l on the background of GA3 amounting to 1,5 mg/l. Grafting with shoots in vitro on non-sterile rootstocks of pear (Pyrus communis has increased the output of plants up to 80%. A cytological assessment of 9 apomictic samples is provided. The cytological analysis of samples of apomictic forms has certified the presence of simulative haploid parthenogenesis in pear.

  4. Phylogenomic relationship of feijoa (Acca sellowiana (O.Berg) Burret) with other Myrtaceae based on complete chloroplast genome sequences.

    Science.gov (United States)

    Machado, Lilian de Oliveira; Vieira, Leila do Nascimento; Stefenon, Valdir Marcos; Oliveira Pedrosa, Fábio de; Souza, Emanuel Maltempi de; Guerra, Miguel Pedro; Nodari, Rubens Onofre

    2017-04-01

    Given their distribution, importance, and richness, Myrtaceae species comprise a model system for studying the evolution of tropical plant diversity. In addition, chloroplast (cp) genome sequencing is an efficient tool for phylogenetic relationship studies. Feijoa [Acca sellowiana (O. Berg) Burret; CN: pineapple-guava] is a Myrtaceae species that occurs naturally in southern Brazil and northern Uruguay. Feijoa is known for its exquisite perfume and flavorful fruits, pharmacological properties, ornamental value and increasing economic relevance. In the present work, we reported the complete cp genome of feijoa. The feijoa cp genome is a circular molecule of 159,370 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC 88,028 bp) and a Small Single Copy region (SSC 18,598 bp) separated by Inverted Repeat regions (IRs 26,372 bp). The genome structure, gene order, GC content and codon usage are similar to those of typical angiosperm cp genomes. When compared to other cp genome sequences of Myrtaceae, feijoa showed closest relationship with pitanga (Eugenia uniflora L.). Furthermore, a comparison of pitanga synonymous (Ks) and nonsynonymous (Ka) substitution rates revealed extremely low values. Maximum Likelihood and Bayesian Inference analyses produced phylogenomic trees identical in topology. These trees supported monophyly of three Myrtoideae clades.

  5. Draft genome sequence of ramie, Boehmeria nivea (L.) Gaudich.

    Science.gov (United States)

    Luan, Ming-Bao; Jian, Jian-Bo; Chen, Ping; Chen, Jun-Hui; Chen, Jian-Hua; Gao, Qiang; Gao, Gang; Zhou, Ju-Hong; Chen, Kun-Mei; Guang, Xuan-Min; Chen, Ji-Kang; Zhang, Qian-Qian; Wang, Xiao-Fei; Fang, Long; Sun, Zhi-Min; Bai, Ming-Zhou; Fang, Xiao-Dong; Zhao, Shan-Cen; Xiong, He-Ping; Yu, Chun-Ming; Zhu, Ai-Guo

    2018-05-01

    Ramie, Boehmeria nivea (L.) Gaudich, family Urticaceae, is a plant native to eastern Asia, and one of the world's oldest fibre crops. It is also used as animal feed and for the phytoremediation of heavy metal-contaminated farmlands. Thus, the genome sequence of ramie was determined to explore the molecular basis of its fibre quality, protein content and phytoremediation. For further understanding ramie genome, different paired-end and mate-pair libraries were combined to generate 134.31 Gb of raw DNA sequences using the Illumina whole-genome shotgun sequencing approach. The highly heterozygous B. nivea genome was assembled using the Platanus Genome Assembler, which is an effective tool for the assembly of highly heterozygous genome sequences. The final length of the draft genome of this species was approximately 341.9 Mb (contig N50 = 22.62 kb, scaffold N50 = 1,126.36 kb). Based on ramie genome annotations, 30,237 protein-coding genes were predicted, and the repetitive element content was 46.3%. The completeness of the final assembly was evaluated by benchmarking universal single-copy orthologous genes (BUSCO); 90.5% of the 1,440 expected embryophytic genes were identified as complete, and 4.9% were identified as fragmented. Phylogenetic analysis based on single-copy gene families and one-to-one orthologous genes placed ramie with mulberry and cannabis, within the clade of urticalean rosids. Genome information of ramie will be a valuable resource for the conservation of endangered Boehmeria species and for future studies on the biogeography and characteristic evolution of members of Urticaceae. © 2018 John Wiley & Sons Ltd.

  6. rDNA Copy Number Variants Are Frequent Passenger Mutations in Saccharomyces cerevisiae Deletion Collections and de Novo Transformants

    Directory of Open Access Journals (Sweden)

    Elizabeth X. Kwan

    2016-09-01

    Full Text Available The Saccharomyces cerevisiae ribosomal DNA (rDNA locus is known to exhibit greater instability relative to the rest of the genome. However, wild-type cells preferentially maintain a stable number of rDNA copies, suggesting underlying genetic control of the size of this locus. We performed a screen of a subset of the Yeast Knock-Out (YKO single gene deletion collection to identify genetic regulators of this locus and to determine if rDNA copy number correlates with yeast replicative lifespan. While we found no correlation between replicative lifespan and rDNA size, we identified 64 candidate strains with significant rDNA copy number differences. However, in the process of validating candidate rDNA variants, we observed that independent isolates of our de novo gene deletion strains had unsolicited but significant changes in rDNA copy number. Moreover, we were not able to recapitulate rDNA phenotypes from the YKO yeast deletion collection. Instead, we found that the standard lithium acetate transformation protocol is a significant source of rDNA copy number variation, with lithium acetate exposure being the treatment causing variable rDNA copy number events after transformation. As the effects of variable rDNA copy number are being increasingly reported, our finding that rDNA is affected by lithium acetate exposure suggested that rDNA copy number variants may be influential passenger mutations in standard strain construction in S. cerevisiae.

  7. rDNA Copy Number Variants Are Frequent Passenger Mutations in Saccharomyces cerevisiae Deletion Collections and de Novo Transformants.

    Science.gov (United States)

    Kwan, Elizabeth X; Wang, Xiaobin S; Amemiya, Haley M; Brewer, Bonita J; Raghuraman, M K

    2016-09-08

    The Saccharomyces cerevisiae ribosomal DNA (rDNA) locus is known to exhibit greater instability relative to the rest of the genome. However, wild-type cells preferentially maintain a stable number of rDNA copies, suggesting underlying genetic control of the size of this locus. We performed a screen of a subset of the Yeast Knock-Out (YKO) single gene deletion collection to identify genetic regulators of this locus and to determine if rDNA copy number correlates with yeast replicative lifespan. While we found no correlation between replicative lifespan and rDNA size, we identified 64 candidate strains with significant rDNA copy number differences. However, in the process of validating candidate rDNA variants, we observed that independent isolates of our de novo gene deletion strains had unsolicited but significant changes in rDNA copy number. Moreover, we were not able to recapitulate rDNA phenotypes from the YKO yeast deletion collection. Instead, we found that the standard lithium acetate transformation protocol is a significant source of rDNA copy number variation, with lithium acetate exposure being the treatment causing variable rDNA copy number events after transformation. As the effects of variable rDNA copy number are being increasingly reported, our finding that rDNA is affected by lithium acetate exposure suggested that rDNA copy number variants may be influential passenger mutations in standard strain construction in S. cerevisiae. Copyright © 2016 Kwan et al.

  8. A haploid genetic screen identifies the G1/S regulatory machinery as a determinant of Wee1 inhibitor sensitivity.

    Science.gov (United States)

    Heijink, Anne Margriet; Blomen, Vincent A; Bisteau, Xavier; Degener, Fabian; Matsushita, Felipe Yu; Kaldis, Philipp; Foijer, Floris; van Vugt, Marcel A T M

    2015-12-08

    The Wee1 cell cycle checkpoint kinase prevents premature mitotic entry by inhibiting cyclin-dependent kinases. Chemical inhibitors of Wee1 are currently being tested clinically as targeted anticancer drugs. Wee1 inhibition is thought to be preferentially cytotoxic in p53-defective cancer cells. However, TP53 mutant cancers do not respond consistently to Wee1 inhibitor treatment, indicating the existence of genetic determinants of Wee1 inhibitor sensitivity other than TP53 status. To optimally facilitate patient selection for Wee1 inhibition and uncover potential resistance mechanisms, identification of these currently unknown genes is necessary. The aim of this study was therefore to identify gene mutations that determine Wee1 inhibitor sensitivity. We performed a genome-wide unbiased functional genetic screen in TP53 mutant near-haploid KBM-7 cells using gene-trap insertional mutagenesis. Insertion site mapping of cells that survived long-term Wee1 inhibition revealed enrichment of G1/S regulatory genes, including SKP2, CUL1, and CDK2. Stable depletion of SKP2, CUL1, or CDK2 or chemical Cdk2 inhibition rescued the γ-H2AX induction and abrogation of G2 phase as induced by Wee1 inhibition in breast and ovarian cancer cell lines. Remarkably, live cell imaging showed that depletion of SKP2, CUL1, or CDK2 did not rescue the Wee1 inhibition-induced karyokinesis and cytokinesis defects. These data indicate that the activity of the DNA replication machinery, beyond TP53 mutation status, determines Wee1 inhibitor sensitivity, and could serve as a selection criterion for Wee1-inhibitor eligible patients. Conversely, loss of the identified S-phase genes could serve as a mechanism of acquired resistance, which goes along with development of severe genomic instability.

  9. The complete plastid genomes of the two 'dinotoms' Durinskia baltica and Kryptoperidinium foliaceum.

    Directory of Open Access Journals (Sweden)

    Behzad Imanian

    2010-05-01

    Full Text Available In one small group of dinoflagellates, photosynthesis is carried out by a tertiary endosymbiont derived from a diatom, giving rise to a complex cell that we collectively refer to as a 'dinotom'. The endosymbiont is separated from its host by a single membrane and retains plastids, mitochondria, a large nucleus, and many other eukaryotic organelles and structures, a level of complexity suggesting an early stage of integration. Although the evolution of these endosymbionts has attracted considerable interest, the plastid genome has not been examined in detail, and indeed no tertiary plastid genome has yet been sequenced.Here we describe the complete plastid genomes of two closely related dinotoms, Durinskia baltica and Kryptoperidinium foliaceum. The D. baltica (116470 bp and K. foliaceum (140426 bp plastid genomes map as circular molecules featuring two large inverted repeats that separate distinct single copy regions. The organization and gene content of the D. baltica plastid closely resemble those of the pennate diatom Phaeodactylum tricornutum. The K. foliaceum plastid genome is much larger, has undergone more reorganization, and encodes a putative tyrosine recombinase (tyrC also found in the plastid genome of the heterokont Heterosigma akashiwo, and two putative serine recombinases (serC1 and serC2 homologous to recombinases encoded by plasmids pCf1 and pCf2 in another pennate diatom, Cylindrotheca fusiformis. The K. foliaceum plastid genome also contains an additional copy of serC1, two degenerate copies of another plasmid-encoded ORF, and two non-coding regions whose sequences closely resemble portions of the pCf1 and pCf2 plasmids.These results suggest that while the plastid genomes of two dinotoms share very similar gene content and genome organization with that of the free-living pennate diatom P. tricornutum, the K. folicaeum plastid genome has absorbed two exogenous plasmids. Whether this took place before or after the tertiary

  10. The complete chloroplast genome of Sinopodophyllum hexandrum Ying (Berberidaceae).

    Science.gov (United States)

    Meng, Lihua; Liu, Ruijuan; Chen, Jianbing; Ding, Chenxu

    2017-05-01

    The complete nucleotide sequence of the Sinopodophyllum hexandrum Ying chloroplast genome (cpDNA) was determined based on next-generation sequencing technologies in this study. The genome was 157 203 bp in length, containing a pair of inverted repeat (IRa and IRb) regions of 25 960 bp, which were separated by a large single-copy (LSC) region of 87 065 bp and a small single-copy (SSC) region of 18 218 bp, respectively. The cpDNA contained 148 genes, including 96 protein-coding genes, 8 ribosomal RNA genes, and 44 tRNA genes. In these genes, eight harbored a single intron, and two (ycf3 and clpP) contained a couple of introns. The cpDNA AT content of S. hexandrum cpDNA is 61.5%.

  11. The complete chloroplast genome sequence of Euonymus japonicus (Celastraceae).

    Science.gov (United States)

    Choi, Kyoung Su; Park, SeonJoo

    2016-09-01

    The complete chloroplast (cp) genome sequence of the Euonymus japonicus, the first sequenced of the genus Euonymus, was reported in this study. The total length was 157 637 bp, containing a pair of 26 678 bp inverted repeat region (IR), which were separated by small single copy (SSC) region and large single copy (LSC) region of 18 340 bp and 85 941 bp, respectively. This genome contains 107 unique genes, including 74 coding genes, four rRNA genes, and 29 tRNA genes. Seventeen genes contain intron of E. japonicus, of which three genes (clpP, ycf3, and rps12) include two introns. The maximum likelihood (ML) phylogenetic analysis revealed that E. japonicus was closely related to Manihot and Populus.

  12. Population Genomics of Infectious and Integrated Wolbachia pipientis Genomes in Drosophila ananassae

    Science.gov (United States)

    Choi, Jae Young; Bubnell, Jaclyn E.; Aquadro, Charles F.

    2015-01-01

    Coevolution between Drosophila and its endosymbiont Wolbachia pipientis has many intriguing aspects. For example, Drosophila ananassae hosts two forms of W. pipientis genomes: One being the infectious bacterial genome and the other integrated into the host nuclear genome. Here, we characterize the infectious and integrated genomes of W. pipientis infecting D. ananassae (wAna), by genome sequencing 15 strains of D. ananassae that have either the infectious or integrated wAna genomes. Results indicate evolutionarily stable maternal transmission for the infectious wAna genome suggesting a relatively long-term coevolution with its host. In contrast, the integrated wAna genome showed pseudogene-like characteristics accumulating many variants that are predicted to have deleterious effects if present in an infectious bacterial genome. Phylogenomic analysis of sequence variation together with genotyping by polymerase chain reaction of large structural variations indicated several wAna variants among the eight infectious wAna genomes. In contrast, only a single wAna variant was found among the seven integrated wAna genomes examined in lines from Africa, south Asia, and south Pacific islands suggesting that the integration occurred once from a single infectious wAna genome and then spread geographically. Further analysis revealed that for all D. ananassae we examined with the integrated wAna genomes, the majority of the integrated wAna genomic regions is represented in at least two copies suggesting a double integration or single integration followed by an integrated genome duplication. The possible evolutionary mechanism underlying the widespread geographical presence of the duplicate integration of the wAna genome is an intriguing question remaining to be answered. PMID:26254486

  13. Genomic context drives transcription of insertion sequences in the bacterial endosymbiont Wolbachia wVulC.

    Science.gov (United States)

    Cerveau, Nicolas; Gilbert, Clément; Liu, Chao; Garrett, Roger A; Grève, Pierre; Bouchon, Didier; Cordaux, Richard

    2015-06-10

    Transposable elements (TEs) are DNA pieces that are present in almost all the living world at variable genomic density. Due to their mobility and density, TEs are involved in a large array of genomic modifications. In eukaryotes, TE expression has been studied in detail in several species. In prokaryotes, studies of IS expression are generally linked to particular copies that induce a modification of neighboring gene expression. Here we investigated global patterns of IS transcription in the Alphaproteobacterial endosymbiont Wolbachia wVulC, using both RT-PCR and bioinformatic analyses. We detected several transcriptional promoters in all IS groups. Nevertheless, only one of the potentially functional IS groups possesses a promoter located upstream of the transposase gene, that could lead up to the production of a functional protein. We found that the majority of IS groups are expressed whatever their functional status. RT-PCR analyses indicate that the transcription of two IS groups lacking internal promoters upstream of the transposase start codon may be driven by the genomic environment. We confirmed this observation with the transcription analysis of individual copies of one IS group. These results suggest that the genomic environment is important for IS expression and it could explain, at least partly, copy number variability of the various IS groups present in the wVulC genome and, more generally, in bacterial genomes. Copyright © 2015 Elsevier B.V. All rights reserved.

  14. Hard-copy versus soft-copy with and without simple image manipulation for detection of pulmonary nodules and masses

    International Nuclear Information System (INIS)

    Kosuda, S.; Kaji, T.; Iwasaki, Y.; Kusano, S.; Kobayashi, H.; Watanabe, M.

    2000-01-01

    To compare interpretation performance on soft-copy presentations, with and without simple image manipulation, and on unmodified hard-copy presentations with regard to detection of pulmonary nodules and masses. Material and Methods: Fifty chest digital radiograph combinations of patients with a total of 60 nodules, 32 of which were 2.0 cm in diameter, were selected for the study. Three readers evaluated three separate image formats: unmodified hard- and soft-copies, and soft-copies with simple image manipulation of lung and mediastinum window settings, and zooming. The screen display was 1600x1200 pixels with 8 bits/pixel. Results: The sensitivity, accuracy, detectability, and Az value of the soft-copy systems were clearly inferior to hard-copy evaluation. The mean Az values were 0.921 for unmodified hard-copy, 0.820 for image-manipulated soft-copy, and 0.781 for unmodified soft-copy. Conclusion: Soft-copy interpretations were not as sensitive in detecting pulmonary nodules and masses as hard-copy evaluation

  15. Whole-genome sequencing and comprehensive molecular profiling identify new driver mutations in gastric cancer

    NARCIS (Netherlands)

    Wang, Kai; Yuen, Siu Tsan; Xu, Jiangchun; Lee, Siu Po; Yan, Helen H N; Shi, Stephanie T; Siu, Hoi Cheong; Deng, Shibing; Chu, Kent Man; Law, Simon; Chan, Kok Hoe; Chan, Annie S Y; Tsui, Wai Yin; Ho, Siu Lun; Chan, Anthony K W; Man, Jonathan L K; Foglizzo, Valentina; Ng, Man Kin; Chan, April S; Ching, Yick Pang; Cheng, Grace H W; Xie, Tao; Fernandez, Julio; Li, Vivian S W; Clevers, Hans; Rejto, Paul A; Mao, Mao; Leung, Suet Yi

    Gastric cancer is a heterogeneous disease with diverse molecular and histological subtypes. We performed whole-genome sequencing in 100 tumor-normal pairs, along with DNA copy number, gene expression and methylation profiling, for integrative genomic analysis. We found subtype-specific genetic and

  16. Exploration of large, rare copy number variants associated with psychiatric and neurodevelopmental disorders in individuals with anorexia nervosa

    NARCIS (Netherlands)

    Yilmaz, Zeynep; Szatkiewicz, Jin P; Crowley, James J; Ancalade, NaEshia; Brandys, Marek K; van Elburg, Annemarie; de Kovel, Carolien G F; Adan, Roger A H; Hinney, Anke; Hebebrand, Johannes; Gratacos, Monica; Fernandez-Aranda, Fernando; Escaramis, Georgia; Gonzalez, Juan R; Estivill, Xavier; Zeggini, Eleftheria; Sullivan, Patrick F; Bulik, Cynthia M; Genetic Consortium for Anorexia Nervosa, Wellcome Trust Case Control Consortium 3

    Anorexia nervosa (AN) is a serious and heritable psychiatric disorder. To date, studies of copy number variants (CNVs) have been limited and inconclusive because of small sample sizes. We conducted a case-only genome-wide CNV survey in 1983 female AN cases included in the Genetic Consortium for

  17. Complete sequences of organelle genomes from the medicinal plant Rhazya stricta (Apocynaceae) and contrasting patterns of mitochondrial genome evolution across asterids.

    Science.gov (United States)

    Park, Seongjun; Ruhlman, Tracey A; Sabir, Jamal S M; Mutwakil, Mohammed H Z; Baeshen, Mohammed N; Sabir, Meshaal J; Baeshen, Nabih A; Jansen, Robert K

    2014-05-28

    Rhazya stricta is native to arid regions in South Asia and the Middle East and is used extensively in folk medicine to treat a wide range of diseases. In addition to generating genomic resources for this medicinally important plant, analyses of the complete plastid and mitochondrial genomes and a nuclear transcriptome from Rhazya provide insights into inter-compartmental transfers between genomes and the patterns of evolution among eight asterid mitochondrial genomes. The 154,841 bp plastid genome is highly conserved with gene content and order identical to the ancestral organization of angiosperms. The 548,608 bp mitochondrial genome exhibits a number of phenomena including the presence of recombinogenic repeats that generate a multipartite organization, transferred DNA from the plastid and nuclear genomes, and bidirectional DNA transfers between the mitochondrion and the nucleus. The mitochondrial genes sdh3 and rps14 have been transferred to the nucleus and have acquired targeting presequences. In the case of rps14, two copies are present in the nucleus; only one has a mitochondrial targeting presequence and may be functional. Phylogenetic analyses of both nuclear and mitochondrial copies of rps14 across angiosperms suggests Rhazya has experienced a single transfer of this gene to the nucleus, followed by a duplication event. Furthermore, the phylogenetic distribution of gene losses and the high level of sequence divergence in targeting presequences suggest multiple, independent transfers of both sdh3 and rps14 across asterids. Comparative analyses of mitochondrial genomes of eight sequenced asterids indicates a complicated evolutionary history in this large angiosperm clade with considerable diversity in genome organization and size, repeat, gene and intron content, and amount of foreign DNA from the plastid and nuclear genomes. Organelle genomes of Rhazya stricta provide valuable information for improving the understanding of mitochondrial genome evolution

  18. Getting DNA copy numbers without control samples

    Directory of Open Access Journals (Sweden)

    Ortiz-Estevez Maria

    2012-08-01

    Full Text Available Abstract Background The selection of the reference to scale the data in a copy number analysis has paramount importance to achieve accurate estimates. Usually this reference is generated using control samples included in the study. However, these control samples are not always available and in these cases, an artificial reference must be created. A proper generation of this signal is crucial in terms of both noise and bias. We propose NSA (Normality Search Algorithm, a scaling method that works with and without control samples. It is based on the assumption that genomic regions enriched in SNPs with identical copy numbers in both alleles are likely to be normal. These normal regions are predicted for each sample individually and used to calculate the final reference signal. NSA can be applied to any CN data regardless the microarray technology and preprocessing method. It also finds an optimal weighting of the samples minimizing possible batch effects. Results Five human datasets (a subset of HapMap samples, Glioblastoma Multiforme (GBM, Ovarian, Prostate and Lung Cancer experiments have been analyzed. It is shown that using only tumoral samples, NSA is able to remove the bias in the copy number estimation, to reduce the noise and therefore, to increase the ability to detect copy number aberrations (CNAs. These improvements allow NSA to also detect recurrent aberrations more accurately than other state of the art methods. Conclusions NSA provides a robust and accurate reference for scaling probe signals data to CN values without the need of control samples. It minimizes the problems of bias, noise and batch effects in the estimation of CNs. Therefore, NSA scaling approach helps to better detect recurrent CNAs than current methods. The automatic selection of references makes it useful to perform bulk analysis of many GEO or ArrayExpress experiments without the need of developing a parser to find the normal samples or possible batches within the

  19. Beneficial effect of a high number of copies of salivary amylase AMY1 gene on obesity risk in Mexican children.

    Science.gov (United States)

    Mejía-Benítez, María A; Bonnefond, Amélie; Yengo, Loïc; Huyvaert, Marlène; Dechaume, Aurélie; Peralta-Romero, Jesús; Klünder-Klünder, Miguel; García Mena, Jaime; El-Sayed Moustafa, Julia S; Falchi, Mario; Cruz, Miguel; Froguel, Philippe

    2015-02-01

    Childhood obesity is a major public health problem in Mexico, affecting one in every three children. Genome-wide association studies identified genetic variants associated with childhood obesity, but a large missing heritability remains to be elucidated. We have recently shown a strong association between a highly polymorphic copy number variant encompassing the salivary amylase gene (AMY1 also known as AMY1A) and obesity in European and Asian adults. In the present study, we aimed to evaluate the association between AMY1 copy number and obesity in Mexican children. We evaluated the number of AMY1 copies in 597 Mexican children (293 obese children and 304 normal weight controls) through highly sensitive digital PCR. The effect of AMY1 copy number on obesity status was assessed using a logistic regression model adjusted for age and sex. We identified a marked effect of AMY1 copy number on reduced risk of obesity (OR per estimated copy 0.84, with the number of copies ranging from one to 16 in this population; p = 4.25 × 10(-6)). The global association between AMY1 copy number and reduced risk of obesity seemed to be mostly driven by the contribution of the highest AMY1 copy number. Strikingly, all children with >10 AMY1 copies were normal weight controls. Salivary amylase initiates the digestion of dietary starch, which is highly consumed in Mexico. Our current study suggests putative benefits of high number of AMY1 copies (and related production of salivary amylase) on energy metabolism in Mexican children.

  20. Hybrid incompatibilities are affected by dominance and dosage in the haplodiploid wasp Nasonia

    Science.gov (United States)

    Beukeboom, Leo W.; Koevoets, Tosca; Morales, Hernán E.; Ferber, Steven; van de Zande, Louis

    2015-01-01

    Study of genome incompatibilities in species hybrids is important for understanding the genetic basis of reproductive isolation and speciation. According to Haldane's rule hybridization affects the heterogametic sex more than the homogametic sex. Several theories have been proposed that attribute asymmetry in hybridization effects to either phenotype (sex) or genotype (heterogamety). Here we investigate the genetic basis of hybrid genome incompatibility in the haplodiploid wasp Nasonia using the powerful features of haploid males and sex reversal. We separately investigate the effects of heterozygosity (ploidy level) and sex by generating sex reversed diploid hybrid males and comparing them to genotypically similar haploid hybrid males and diploid hybrid females. Hybrid effects of sterility were more pronounced than of inviability, and were particularly strong in haploid males, but weak to absent in diploid males and females, indicating a strong ploidy level but no sex specific effect. Molecular markers identified a number of genomic regions associated with hybrid inviability in haploid males that disappeared under diploidy in both hybrid males and females. Hybrid inviability was rescued by dominance effects at some genomic regions, but aggravated or alleviated by dosage effects at other regions, consistent with cytonuclear incompatibilities. Dosage effects underlying Bateson–Dobzhansky–Muller (BDM) incompatibilities need more consideration in explaining Haldane's rule in diploid systems. PMID:25926847

  1. Detection and correction of false segmental duplications caused by genome mis-assembly

    Science.gov (United States)

    2010-01-01

    Diploid genomes with divergent chromosomes present special problems for assembly software as two copies of especially polymorphic regions may be mistakenly constructed, creating the appearance of a recent segmental duplication. We developed a method for identifying such false duplications and applied it to four vertebrate genomes. For each genome, we corrected mis-assemblies, improved estimates of the amount of duplicated sequence, and recovered polymorphisms between the sequenced chromosomes. PMID:20219098

  2. Selective regain of egfr gene copies in CD44+/CD24-/low breast cancer cellular model MDA-MB-468

    International Nuclear Information System (INIS)

    Agelopoulos, Konstantin; Buerger, Horst; Brandt, Burkhard; Greve, Burkhard; Schmidt, Hartmut; Pospisil, Heike; Kurtz, Stefan; Bartkowiak, Kai; Andreas, Antje; Wieczorek, Marek; Korsching, Eberhard

    2010-01-01

    Increased transcription of oncogenes like the epidermal growth factor receptor (EGFR) is frequently caused by amplification of the whole gene or at least of regulatory sequences. Aim of this study was to pinpoint mechanistic parameters occurring during egfr copy number gains leading to a stable EGFR overexpression and high sensitivity to extracellular signalling. A deeper understanding of those marker events might improve early diagnosis of cancer in suspect lesions, early detection of cancer progression and the prediction of egfr targeted therapies. The basal-like/stemness type breast cancer cell line subpopulation MDA-MB-468 CD44 high /CD24 -/low , carrying high egfr amplifications, was chosen as a model system in this study. Subclones of the heterogeneous cell line expressing low and high EGF receptor densities were isolated by cell sorting. Genomic profiling was carried out for these by means of SNP array profiling, qPCR and FISH. Cell cycle analysis was performed using the BrdU quenching technique. Low and high EGFR expressing MDA-MB-468 CD44 + /CD24 -/low subpopulations separated by cell sorting showed intermediate and high copy numbers of egfr, respectively. However, during cell culture an increase solely for egfr gene copy numbers in the intermediate subpopulation occurred. This shift was based on the formation of new cells which regained egfr gene copies. By two parametric cell cycle analysis clonal effects mediated through growth advantage of cells bearing higher egfr gene copy numbers could most likely be excluded for being the driving force. Subsequently, the detection of a fragile site distal to the egfr gene, sustaining uncapped telomere-less chromosomal ends, the ladder-like structure of the intrachromosomal egfr amplification and a broader range of egfr copy numbers support the assumption that dynamic chromosomal rearrangements, like breakage-fusion-bridge-cycles other than proliferation drive the gain of egfr copies. Progressive genome modulation

  3. The complete chloroplast genome of banana (Musa acuminata, Zingiberales): insight into plastid monocotyledon evolution.

    Science.gov (United States)

    Martin, Guillaume; Baurens, Franc-Christophe; Cardi, Céline; Aury, Jean-Marc; D'Hont, Angélique

    2013-01-01

    Banana (genus Musa) is a crop of major economic importance worldwide. It is a monocotyledonous member of the Zingiberales, a sister group of the widely studied Poales. Most cultivated bananas are natural Musa inter-(sub-)specific triploid hybrids. A Musa acuminata reference nuclear genome sequence was recently produced based on sequencing of genomic DNA enriched in nucleus. The Musa acuminata chloroplast genome was assembled with chloroplast reads extracted from whole-genome-shotgun sequence data. The Musa chloroplast genome is a circular molecule of 169,972 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC, 88,338 bp) and a Small Single Copy region (SSC, 10,768 bp) separated by Inverted Repeat regions (IRs, 35,433 bp). Two forms of the chloroplast genome relative to the orientation of SSC versus LSC were found. The Musa chloroplast genome shows an extreme IR expansion at the IR/SSC boundary relative to the most common structures found in angiosperms. This expansion consists of the integration of three additional complete genes (rps15, ndhH and ycf1) and part of the ndhA gene. No such expansion has been observed in monocots so far. Simple Sequence Repeats were identified in the Musa chloroplast genome and a new set of Musa chloroplastic markers was designed. The complete sequence of M. acuminata ssp malaccensis chloroplast we reported here is the first one for the Zingiberales order. As such it provides new insight in the evolution of the chloroplast of monocotyledons. In particular, it reinforces that IR/SSC expansion has occurred independently several times within monocotyledons. The discovery of new polymorphic markers within Musa chloroplast opens new perspectives to better understand the origin of cultivated triploid bananas.

  4. The complete chloroplast genome of banana (Musa acuminata, Zingiberales: insight into plastid monocotyledon evolution.

    Directory of Open Access Journals (Sweden)

    Guillaume Martin

    Full Text Available Banana (genus Musa is a crop of major economic importance worldwide. It is a monocotyledonous member of the Zingiberales, a sister group of the widely studied Poales. Most cultivated bananas are natural Musa inter-(sub-specific triploid hybrids. A Musa acuminata reference nuclear genome sequence was recently produced based on sequencing of genomic DNA enriched in nucleus.The Musa acuminata chloroplast genome was assembled with chloroplast reads extracted from whole-genome-shotgun sequence data. The Musa chloroplast genome is a circular molecule of 169,972 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC, 88,338 bp and a Small Single Copy region (SSC, 10,768 bp separated by Inverted Repeat regions (IRs, 35,433 bp. Two forms of the chloroplast genome relative to the orientation of SSC versus LSC were found. The Musa chloroplast genome shows an extreme IR expansion at the IR/SSC boundary relative to the most common structures found in angiosperms. This expansion consists of the integration of three additional complete genes (rps15, ndhH and ycf1 and part of the ndhA gene. No such expansion has been observed in monocots so far. Simple Sequence Repeats were identified in the Musa chloroplast genome and a new set of Musa chloroplastic markers was designed.The complete sequence of M. acuminata ssp malaccensis chloroplast we reported here is the first one for the Zingiberales order. As such it provides new insight in the evolution of the chloroplast of monocotyledons. In particular, it reinforces that IR/SSC expansion has occurred independently several times within monocotyledons. The discovery of new polymorphic markers within Musa chloroplast opens new perspectives to better understand the origin of cultivated triploid bananas.

  5. Cell-free mitochondrial DNA copy number variation in head and neck squamous cell carcinoma: A study of non-invasive biomarker from Northeast India.

    Science.gov (United States)

    Kumar, Manish; Srivastava, Shilpee; Singh, Seram Anil; Das, Anup Kumar; Das, Ganesh Chandra; Dhar, Bishal; Ghosh, Sankar Kumar; Mondal, Rosy

    2017-10-01

    Head and neck squamous cell carcinoma is the most commonly diagnosed cancer worldwide. The lifestyle, food habits, and customary practices manifest the Northeast Indian population toward higher susceptibility to develop head and neck squamous cell carcinoma. Here, we have investigated the association of smoke and smokeless tobacco, and alcohol with copy number variation of cell-free mitochondrial DNA and cell-free nuclear DNA in cases and controls. Cell-free DNA from plasma was isolated from 50 head and neck squamous cell carcinoma cases and 50 controls with informed written consent using QIAamp Circulating Nucleic Acid Kit. Real-time polymerase chain reaction was done for copy number variation in cell-free mitochondrial DNA and cell-free nuclear DNA. Receiver operating characteristic curve analysis was performed to evaluate the diagnostic application between the two study groups using clinicopathological parameters. The levels of cell-free nuclear DNA and cell-free mitochondrial DNA of cases in association with smoke and smokeless tobacco, alcohol with smoking (p squamous cell carcinoma cases and controls, we distinguished cell-free mitochondrial DNA (cutoff: 19.84 raw Ct; sensitivity: 84%; specificity: 100%; p < 0.001) and cell-free nuclear DNA (cutoff: 463,282 genomic equivalent/mL; sensitivity: 53%; specificity: 87%; p < 0.001). The copy number variation in cases (cell-free nuclear DNA: 5451.66 genomic equivalent/mL and cell-free mitochondrial DNA: 29,103,476.15 genomic equivalent/mL) and controls (cell-free nuclear DNA: 1650.9 genomic equivalent/mL and cell-free mitochondrial DNA: 9,189,312.54 genomic equivalent/mL), respectively. Our result indicates that the cell-free mitochondrial DNA content is highly associated with smoke and smokeless tobacco, betel quid chewing, and alcohol which shows greater promises, holding the key characteristics of diagnostic biomarkers, that is, minimal invasiveness, high specificity, and sensitivity.

  6. Roadmap for annotating transposable elements in eukaryote genomes.

    Science.gov (United States)

    Permal, Emmanuelle; Flutre, Timothée; Quesneville, Hadi

    2012-01-01

    Current high-throughput techniques have made it feasible to sequence even the genomes of non-model organisms. However, the annotation process now represents a bottleneck to genome analysis, especially when dealing with transposable elements (TE). Combined approaches, using both de novo and knowledge-based methods to detect TEs, are likely to produce reasonably comprehensive and sensitive results. This chapter provides a roadmap for researchers involved in genome projects to address this issue. At each step of the TE annotation process, from the identification of TE families to the annotation of TE copies, we outline the tools and good practices to be used.

  7. Analysis of Copy Number Variation in the Abp Gene Regions of Two House Mouse Subspecies Suggests Divergence during the Gene Family Expansions.

    Science.gov (United States)

    Pezer, Željka; Chung, Amanda G; Karn, Robert C; Laukaitis, Christina M

    2017-06-01

    The Androgen-binding protein ( Abp ) gene region of the mouse genome contains 64 genes, some encoding pheromones that influence assortative mating between mice from different subspecies. Using CNVnator and quantitative PCR, we explored copy number variation in this gene family in natural populations of Mus musculus domesticus ( Mmd ) and Mus musculus musculus ( Mmm ), two subspecies of house mice that form a narrow hybrid zone in Central Europe. We found that copy number variation in the center of the Abp gene region is very common in wild Mmd , primarily representing the presence/absence of the final duplications described for the mouse genome. Clustering of Mmd individuals based on this variation did not reflect their geographical origin, suggesting no population divergence in the Abp gene cluster. However, copy number variation patterns differ substantially between Mmd and other mouse taxa. Large blocks of Abp genes are absent in Mmm , Mus musculus castaneus and an outgroup, Mus spretus , although with differences in variation and breakpoint locations. Our analysis calls into question the reliance on a reference genome for interpreting the detailed organization of genes in taxa more distant from the Mmd reference genome. The polymorphic nature of the gene family expansion in all four taxa suggests that the number of Abp genes, especially in the central gene region, is not critical to the survival and reproduction of the mouse. However, Abp haplotypes of variable length may serve as a source of raw genetic material for new signals influencing reproductive communication and thus speciation of mice. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  8. Genome size evolution at the speciation level: the cryptic species complex Brachionus plicatilis (Rotifera).

    Science.gov (United States)

    Stelzer, Claus-Peter; Riss, Simone; Stadler, Peter

    2011-04-07

    Studies on genome size variation in animals are rarely done at lower taxonomic levels, e.g., slightly above/below the species level. Yet, such variation might provide important clues on the tempo and mode of genome size evolution. In this study we used the flow-cytometry method to study the evolution of genome size in the rotifer Brachionus plicatilis, a cryptic species complex consisting of at least 14 closely related species. We found an unexpectedly high variation in this species complex, with genome sizes ranging approximately seven-fold (haploid '1C' genome sizes: 0.056-0.416 pg). Most of this variation (67%) could be ascribed to the major clades of the species complex, i.e. clades that are well separated according to most species definitions. However, we also found substantial variation (32%) at lower taxonomic levels--within and among genealogical species--and, interestingly, among species pairs that are not completely reproductively isolated. In one genealogical species, called B. 'Austria', we found greatly enlarged genome sizes that could roughly be approximated as multiples of the genomes of its closest relatives, which suggests that whole-genome duplications have occurred early during separation of this lineage. Overall, genome size was significantly correlated to egg size and body size, even though the latter became non-significant after controlling for phylogenetic non-independence. Our study suggests that substantial genome size variation can build up early during speciation, potentially even among isolated populations. An alternative, but not mutually exclusive interpretation might be that reproductive isolation tends to build up unusually slow in this species complex.

  9. The complete chloroplast genome sequence of Dendrobium officinale.

    Science.gov (United States)

    Yang, Pei; Zhou, Hong; Qian, Jun; Xu, Haibin; Shao, Qingsong; Li, Yonghua; Yao, Hui

    2016-01-01

    The complete chloroplast sequence of Dendrobium officinale, an endangered and economically important traditional Chinese medicine, was reported and characterized. The genome size is 152,018 bp, with 37.5% GC content. A pair of inverted repeats (IRs) of 26,284 bp are separated by a large single-copy region (LSC, 84,944 bp) and a small single-copy region (SSC, 14,506 bp). The complete cp DNA contains 83 protein-coding genes, 39 tRNA genes and 8 rRNA genes. Fourteen genes contained one or two introns.

  10. Striking structural dynamism and nucleotide sequence variation of the transposon Galileo in the genome of Drosophila mojavensis.

    Science.gov (United States)

    Marzo, Mar; Bello, Xabier; Puig, Marta; Maside, Xulio; Ruiz, Alfredo

    2013-02-04

    Galileo is a transposable element responsible for the generation of three chromosomal inversions in natural populations of Drosophila buzzatii. Although the most characteristic feature of Galileo is the long internally-repetitive terminal inverted repeats (TIRs), which resemble the Drosophila Foldback element, its transposase-coding sequence has led to its classification as a member of the P-element superfamily (Class II, subclass 1, TIR order). Furthermore, Galileo has a wide distribution in the genus Drosophila, since it has been found in 6 of the 12 Drosophila sequenced genomes. Among these species, D. mojavensis, the one closest to D. buzzatii, presented the highest diversity in sequence and structure of Galileo elements. In the present work, we carried out a thorough search and annotation of all the Galileo copies present in the D. mojavensis sequenced genome. In our set of 170 Galileo copies we have detected 5 Galileo subfamilies (C, D, E, F, and X) with different structures ranging from nearly complete, to only 2 TIR or solo TIR copies. Finally, we have explored the structural and length variation of the Galileo copies that point out the relatively frequent rearrangements within and between Galileo elements. Different mechanisms responsible for these rearrangements are discussed. Although Galileo is a transposable element with an ancient history in the D. mojavensis genome, our data indicate a recent transpositional activity. Furthermore, the dynamism in sequence and structure, mainly affecting the TIRs, suggests an active exchange of sequences among the copies. This exchange could lead to new subfamilies of the transposon, which could be crucial for the long-term survival of the element in the genome.

  11. Evaluation of plasmid and genomic DNA calibrants used for the quantification of genetically modified organisms.

    Science.gov (United States)

    Caprioara-Buda, M; Meyer, W; Jeynov, B; Corbisier, P; Trapmann, S; Emons, H

    2012-07-01

    The reliable quantification of genetically modified organisms (GMOs) by real-time PCR requires, besides thoroughly validated quantitative detection methods, sustainable calibration systems. The latter establishes the anchor points for the measured value and the measurement unit, respectively. In this paper, the suitability of two types of DNA calibrants, i.e. plasmid DNA and genomic DNA extracted from plant leaves, for the certification of the GMO content in reference materials as copy number ratio between two targeted DNA sequences was investigated. The PCR efficiencies and coefficients of determination of the calibration curves as well as the measured copy number ratios for three powder certified reference materials (CRMs), namely ERM-BF415e (NK603 maize), ERM-BF425c (356043 soya), and ERM-BF427c (98140 maize), originally certified for their mass fraction of GMO, were compared for both types of calibrants. In all three systems investigated, the PCR efficiencies of plasmid DNA were slightly closer to the PCR efficiencies observed for the genomic DNA extracted from seed powders rather than those of the genomic DNA extracted from leaves. Although the mean DNA copy number ratios for each CRM overlapped within their uncertainties, the DNA copy number ratios were significantly different using the two types of calibrants. Based on these observations, both plasmid and leaf genomic DNA calibrants would be technically suitable as anchor points for the calibration of the real-time PCR methods applied in this study. However, the most suitable approach to establish a sustainable traceability chain is to fix a reference system based on plasmid DNA.

  12. Introns: The Functional Benefits of Introns in Genomes

    Directory of Open Access Journals (Sweden)

    Bong-Seok Jo

    2015-12-01

    Full Text Available The intron has been a big biological mystery since it was first discovered in several aspects. First, all of the completely sequenced eukaryotes harbor introns in the genomic structure, whereas no prokaryotes identified so far carry introns. Second, the amount of total introns varies in different species. Third, the length and number of introns vary in different genes, even within the same species genome. Fourth, all introns are copied into RNAs by transcription and DNAs by replication processes, but intron sequences do not participate in protein-coding sequences. The existence of introns in the genome should be a burden to some cells, because cells have to consume a great deal of energy to copy and excise them exactly at the correct positions with the help of complicated spliceosomal machineries. The existence throughout the long evolutionary history is explained, only if selective advantages of carrying introns are assumed to be given to cells to overcome the negative effect of introns. In that regard, we summarize previous research about the functional roles or benefits of introns. Additionally, several other studies strongly suggesting that introns should not be junk will be introduced.

  13. Introns: The Functional Benefits of Introns in Genomes.

    Science.gov (United States)

    Jo, Bong-Seok; Choi, Sun Shim

    2015-12-01

    The intron has been a big biological mystery since it was first discovered in several aspects. First, all of the completely sequenced eukaryotes harbor introns in the genomic structure, whereas no prokaryotes identified so far carry introns. Second, the amount of total introns varies in different species. Third, the length and number of introns vary in different genes, even within the same species genome. Fourth, all introns are copied into RNAs by transcription and DNAs by replication processes, but intron sequences do not participate in protein-coding sequences. The existence of introns in the genome should be a burden to some cells, because cells have to consume a great deal of energy to copy and excise them exactly at the correct positions with the help of complicated spliceosomal machineries. The existence throughout the long evolutionary history is explained, only if selective advantages of carrying introns are assumed to be given to cells to overcome the negative effect of introns. In that regard, we summarize previous research about the functional roles or benefits of introns. Additionally, several other studies strongly suggesting that introns should not be junk will be introduced.

  14. The complete chloroplast genome of an irreplaceable dietary and model crop, foxtail millet (Setaria italica).

    Science.gov (United States)

    Wang, Shuo; Gao, Li-Zhi

    2016-11-01

    The complete chloroplast genome sequence of foxtail millet (Setaria italica), an important food and fodder crop in the family Poaceae, is first reported in this study. The genome consists of 1 35 516 bp containing a pair of inverted repeats (IRs) of 21 804 bp separated by a large single-copy (LSC) region and a small single-copy (SSC) region of 79 896 bp and 12 012 bp, respectively. Coding sequences constitute 58.8% of the genome harboring 111 unique genes, 71 of which are protein-coding genes, 4 are rRNA genes, and 36 are tRNA genes. Phylogenetic analysis indicated foxtail millet clustered with Panicum virgatum and Echinochloa crus-galli belonging to the tribe Paniceae of the subfamily Panicoideae. This newly determined chloroplast genome will provide valuable information for the future breeding programs of valuable cereal crops in the family Poaceae.

  15. Complete chloroplast genome sequence of a tree fern Alsophila spinulosa: insights into evolutionary changes in fern chloroplast genomes.

    Science.gov (United States)

    Gao, Lei; Yi, Xuan; Yang, Yong-Xia; Su, Ying-Juan; Wang, Ting

    2009-06-11

    Ferns have generally been neglected in studies of chloroplast genomics. Before this study, only one polypod and two basal ferns had their complete chloroplast (cp) genome reported. Tree ferns represent an ancient fern lineage that first occurred in the Late Triassic. In recent phylogenetic analyses, tree ferns were shown to be the sister group of polypods, the most diverse group of living ferns. Availability of cp genome sequence from a tree fern will facilitate interpretation of the evolutionary changes of fern cp genomes. Here we have sequenced the complete cp genome of a scaly tree fern Alsophila spinulosa (Cyatheaceae). The Alsophila cp genome is 156,661 base pairs (bp) in size, and has a typical quadripartite structure with the large (LSC, 86,308 bp) and small single copy (SSC, 21,623 bp) regions separated by two copies of an inverted repeat (IRs, 24,365 bp each). This genome contains 117 different genes encoding 85 proteins, 4 rRNAs and 28 tRNAs. Pseudogenes of ycf66 and trnT-UGU are also detected in this genome. A unique trnR-UCG gene (derived from trnR-CCG) is found between rbcL and accD. The Alsophila cp genome shares some unusual characteristics with the previously sequenced cp genome of the polypod fern Adiantum capillus-veneris, including the absence of 5 tRNA genes that exist in most other cp genomes. The genome shows a high degree of synteny with that of Adiantum, but differs considerably from two basal ferns (Angiopteris evecta and Psilotum nudum). At one endpoint of an ancient inversion we detected a highly repeated 565-bp-region that is absent from the Adiantum cp genome. An additional minor inversion of the trnD-GUC, which is possibly shared by all ferns, was identified by comparison between the fern and other land plant cp genomes. By comparing four fern cp genome sequences it was confirmed that two major rearrangements distinguish higher leptosporangiate ferns from basal fern lineages. The Alsophila cp genome is very similar to that of the

  16. Complete chloroplast genome sequence of a tree fern Alsophila spinulosa: insights into evolutionary changes in fern chloroplast genomes

    Directory of Open Access Journals (Sweden)

    Yang Yong-Xia

    2009-06-01

    Full Text Available Abstract Background Ferns have generally been neglected in studies of chloroplast genomics. Before this study, only one polypod and two basal ferns had their complete chloroplast (cp genome reported. Tree ferns represent an ancient fern lineage that first occurred in the Late Triassic. In recent phylogenetic analyses, tree ferns were shown to be the sister group of polypods, the most diverse group of living ferns. Availability of cp genome sequence from a tree fern will facilitate interpretation of the evolutionary changes of fern cp genomes. Here we have sequenced the complete cp genome of a scaly tree fern Alsophila spinulosa (Cyatheaceae. Results The Alsophila cp genome is 156,661 base pairs (bp in size, and has a typical quadripartite structure with the large (LSC, 86,308 bp and small single copy (SSC, 21,623 bp regions separated by two copies of an inverted repeat (IRs, 24,365 bp each. This genome contains 117 different genes encoding 85 proteins, 4 rRNAs and 28 tRNAs. Pseudogenes of ycf66 and trnT-UGU are also detected in this genome. A unique trnR-UCG gene (derived from trnR-CCG is found between rbcL and accD. The Alsophila cp genome shares some unusual characteristics with the previously sequenced cp genome of the polypod fern Adiantum capillus-veneris, including the absence of 5 tRNA genes that exist in most other cp genomes. The genome shows a high degree of synteny with that of Adiantum, but differs considerably from two basal ferns (Angiopteris evecta and Psilotum nudum. At one endpoint of an ancient inversion we detected a highly repeated 565-bp-region that is absent from the Adiantum cp genome. An additional minor inversion of the trnD-GUC, which is possibly shared by all ferns, was identified by comparison between the fern and other land plant cp genomes. Conclusion By comparing four fern cp genome sequences it was confirmed that two major rearrangements distinguish higher leptosporangiate ferns from basal fern lineages. The

  17. Characterization of canine osteosarcoma by array comparative genomic hybridization and RT-qPCR: signatures of genomic imbalance in canine osteosarcoma parallel the human counterpart.

    Science.gov (United States)

    Angstadt, Andrea Y; Motsinger-Reif, Alison; Thomas, Rachael; Kisseberth, William C; Guillermo Couto, C; Duval, Dawn L; Nielsen, Dahlia M; Modiano, Jaime F; Breen, Matthew

    2011-11-01

    Osteosarcoma (OS) is the most commonly diagnosed malignant bone tumor in humans and dogs, characterized in both species by extremely complex karyotypes exhibiting high frequencies of genomic imbalance. Evaluation of genomic signatures in human OS using array comparative genomic hybridization (aCGH) has assisted in uncovering genetic mechanisms that result in disease phenotype. Previous low-resolution (10-20 Mb) aCGH analysis of canine OS identified a wide range of recurrent DNA copy number aberrations, indicating extensive genomic instability. In this study, we profiled 123 canine OS tumors by 1 Mb-resolution aCGH to generate a dataset for direct comparison with current data for human OS, concluding that several high frequency aberrations in canine and human OS are orthologous. To ensure complete coverage of gene annotation, we identified the human refseq genes that map to these orthologous aberrant dog regions and found several candidate genes warranting evaluation for OS involvement. Specifically, subsequenct FISH and qRT-PCR analysis of RUNX2, TUSC3, and PTEN indicated that expression levels correlated with genomic copy number status, showcasing RUNX2 as an OS associated gene and TUSC3 as a possible tumor suppressor candidate. Together these data demonstrate the ability of genomic comparative oncology to identify genetic abberations which may be important for OS progression. Large scale screening of genomic imbalance in canine OS further validates the use of the dog as a suitable model for human cancers, supporting the idea that dysregulation discovered in canine cancers will provide an avenue for complementary study in human counterparts. Copyright © 2011 Wiley-Liss, Inc.

  18. First Insights into the Large Genome of Epimedium sagittatum (Sieb. et Zucc Maxim, a Chinese Traditional Medicinal Plant

    Directory of Open Access Journals (Sweden)

    Gong Xiao

    2013-06-01

    Full Text Available Epimedium sagittatum (Sieb. et Zucc Maxim is a member of the Berberidaceae family of basal eudicot plants, widely distributed and used as a traditional medicinal plant in China for therapeutic effects on many diseases with a long history. Recent data shows that E. sagittatum has a relatively large genome, with a haploid genome size of ~4496 Mbp, divided into a small number of only 12 diploid chromosomes (2n = 2x = 12. However, little is known about Epimedium genome structure and composition. Here we present the analysis of 691 kb of high-quality genomic sequence derived from 672 randomly selected plasmid clones of E. sagittatum genomic DNA, representing ~0.0154% of the genome. The sampled sequences comprised at least 78.41% repetitive DNA elements and 2.51% confirmed annotated gene sequences, with a total GC% content of 39%. Retrotransposons represented the major class of transposable element (TE repeats identified (65.37% of all TE repeats, particularly LTR (Long Terminal Repeat retrotransposons (52.27% of all TE repeats. Chromosome analysis and Fluorescence in situ Hybridization of Gypsy-Ty3 retrotransposons were performed to survey the E. sagittatum genome at the cytological level. Our data provide the first insights into the composition and structure of the E. sagittatum genome, and will facilitate the functional genomic analysis of this valuable medicinal plant.

  19. First Insights into the Large Genome of Epimedium sagittatum (Sieb. et Zucc) Maxim, a Chinese Traditional Medicinal Plant

    Science.gov (United States)

    Liu, Di; Zeng, Shao-Hua; Chen, Jian-Jun; Zhang, Yan-Jun; Xiao, Gong; Zhu, Lin-Yao; Wang, Ying

    2013-01-01

    Epimedium sagittatum (Sieb. et Zucc) Maxim is a member of the Berberidaceae family of basal eudicot plants, widely distributed and used as a traditional medicinal plant in China for therapeutic effects on many diseases with a long history. Recent data shows that E. sagittatum has a relatively large genome, with a haploid genome size of ~4496 Mbp, divided into a small number of only 12 diploid chromosomes (2n = 2x = 12). However, little is known about Epimedium genome structure and composition. Here we present the analysis of 691 kb of high-quality genomic sequence derived from 672 randomly selected plasmid clones of E. sagittatum genomic DNA, representing ~0.0154% of the genome. The sampled sequences comprised at least 78.41% repetitive DNA elements and 2.51% confirmed annotated gene sequences, with a total GC% content of 39%. Retrotransposons represented the major class of transposable element (TE) repeats identified (65.37% of all TE repeats), particularly LTR (Long Terminal Repeat) retrotransposons (52.27% of all TE repeats). Chromosome analysis and Fluorescence in situ Hybridization of Gypsy-Ty3 retrotransposons were performed to survey the E. sagittatum genome at the cytological level. Our data provide the first insights into the composition and structure of the E. sagittatum genome, and will facilitate the functional genomic analysis of this valuable medicinal plant. PMID:23807511

  20. GRAbB: Selective Assembly of Genomic Regions, a New Niche for Genomic Research.

    Directory of Open Access Journals (Sweden)

    Balázs Brankovics

    2016-06-01

    Full Text Available GRAbB (Genomic Region Assembly by Baiting is a new program that is dedicated to assemble specific genomic regions from NGS data. This approach is especially useful when dealing with multi copy regions, such as mitochondrial genome and the rDNA repeat region, parts of the genome that are often neglected or poorly assembled, although they contain interesting information from phylogenetic or epidemiologic perspectives, but also single copy regions can be assembled. The program is capable of targeting multiple regions within a single run. Furthermore, GRAbB can be used to extract specific loci from NGS data, based on homology, like sequences that are used for barcoding. To make the assembly specific, a known part of the region, such as the sequence of a PCR amplicon or a homologous sequence from a related species must be specified. By assembling only the region of interest, the assembly process is computationally much less demanding and may lead to assemblies of better quality. In this study the different applications and functionalities of the program are demonstrated such as: exhaustive assembly (rDNA region and mitochondrial genome, extracting homologous regions or genes (IGS, RPB1, RPB2 and TEF1a, as well as extracting multiple regions within a single run. The program is also compared with MITObim, which is meant for the exhaustive assembly of a single target based on a similar query sequence. GRAbB is shown to be more efficient than MITObim in terms of speed, memory and disk usage. The other functionalities (handling multiple targets simultaneously and extracting homologous regions of the new program are not matched by other programs. The program is available with explanatory documentation at https://github.com/b-brankovics/grabb. GRAbB has been tested on Ubuntu (12.04 and 14.04, Fedora (23, CentOS (7.1.1503 and Mac OS X (10.7. Furthermore, GRAbB is available as a docker repository: brankovics/grabb (https://hub.docker.com/r/brankovics/grabb/.