WorldWideScience

Sample records for apospory-specific genomic region

  1. Evolution of the apomixis transmitting chromosome in Pennisetum

    Directory of Open Access Journals (Sweden)

    Yamada-Akiyama Hitomi

    2011-10-01

    Full Text Available Abstract Background Apomixis is an intriguing trait in plants that results in maternal clones through seed reproduction. Apomixis is an elusive, but potentially revolutionary, trait for plant breeding and hybrid seed production. Recent studies arguing that apomicts are not evolutionary dead ends have generated further interest in the evolution of asexual flowering plants. Results In the present study, we investigate karyotypic variation in a single chromosome responsible for transmitting apomixis, the Apospory-Specific Genomic Region carrier chromosome, in relation to species phylogeny in the genera Pennisetum and Cenchrus. A 1 kb region from the 3' end of the ndhF gene and a 900 bp region from trnL-F were sequenced from 12 apomictic and eight sexual species in the genus Pennisetum and allied genus Cenchrus. An 800 bp region from the Apospory-Specific Genomic Region also was sequenced from the 12 apomicts. Molecular cytological analysis was conducted in sixteen Pennisetum and two Cenchrus species. Our results indicate that the Apospory-Specific Genomic Region is shared by all apomictic species while it is absent from all sexual species or cytotypes. Contrary to our previous observations in Pennisetum squamulatum and Cenchrus ciliaris, retrotransposon sequences of the Opie-2-like family were not closely associated with the Apospory-Specific Genomic Region in all apomictic species, suggesting that they may have been accumulated after the Apospory-Specific Genomic Region originated. Conclusions Given that phylogenetic analysis merged Cenchrus and newly investigated Pennisetum species into a single clade containing a terminal cluster of Cenchrus apomicts, the presumed monophyletic origin of Cenchrus is supported. The Apospory-Specific Genomic Region likely preceded speciation in Cenchrus and its lateral transfer through hybridization and subsequent chromosome repatterning may have contributed to further speciation in the two genera.

  2. GRAbB : Selective Assembly of Genomic Regions, a New Niche for Genomic Research

    NARCIS (Netherlands)

    Brankovics, Balázs; Zhang, Hao; van Diepeningen, Anne D; van der Lee, Theo A J; Waalwijk, Cees; de Hoog, G Sybren

    2016-01-01

    GRAbB (Genomic Region Assembly by Baiting) is a new program that is dedicated to assemble specific genomic regions from NGS data. This approach is especially useful when dealing with multi copy regions, such as mitochondrial genome and the rDNA repeat region, parts of the genome that are often negle

  3. Human-mouse comparative genomics: successes and failures to reveal functional regions of the human genome

    Energy Technology Data Exchange (ETDEWEB)

    Pennacchio, Len A.; Baroukh, Nadine; Rubin, Edward M.

    2003-05-15

    Deciphering the genetic code embedded within the human genome remains a significant challenge despite the human genome consortium's recent success at defining its linear sequence (Lander et al. 2001; Venter et al. 2001). While useful strategies exist to identify a large percentage of protein encoding regions, efforts to accurately define functional sequences in the remaining {approx}97 percent of the genome lag. Our primary interest has been to utilize the evolutionary relationship and the universal nature of genomic sequence information in vertebrates to reveal functional elements in the human genome. This has been achieved through the combined use of vertebrate comparative genomics to pinpoint highly conserved sequences as candidates for biological activity and transgenic mouse studies to address the functionality of defined human DNA fragments. Accordingly, we describe strategies and insights into functional sequences in the human genome through the use of comparative genomics coupled wit h functional studies in the mouse.

  4. The transcriptionally active regions in the genome of Bacillus subtilis

    DEFF Research Database (Denmark)

    Rasmussen, Simon; Nielsen, Henrik Bjørn; Jarmer, Hanne Østergaard

    2009-01-01

    The majority of all genes have so far been identified and annotated systematically through in silico gene finding. Here we report the finding of 3662 strand-specific transcriptionally active regions (TARs) in the genome of Bacillus subtilis by the use of tiling arrays. We have measured the genome...

  5. Origin of the duplicated regions in the yeast genomes

    DEFF Research Database (Denmark)

    Piskur, Jure

    2001-01-01

    The genome of Saccharomyces cerevisiae contains several duplicated regions. The recent sequencing results of several yeast species suggest that the duplicated regions found in the modern Saccharomyces species are probably the result of a single gross duplication, as well as a series of sporadic...

  6. Targeted genome-wide enrichment of functional regions.

    Directory of Open Access Journals (Sweden)

    Periannan Senapathy

    Full Text Available Only a small fraction of large genomes such as that of the human contains the functional regions such as the exons, promoters, and polyA sites. A platform technique for selective enrichment of functional genomic regions will enable several next-generation sequencing applications that include the discovery of causal mutations for disease and drug response. Here, we describe a powerful platform technique, termed "functional genomic fingerprinting" (FGF, for the multiplexed genomewide isolation and analysis of targeted regions such as the exome, promoterome, or exon splice enhancers. The technique employs a fixed part of a uniquely designed Fixed-Randomized primer, while the randomized part contains all the possible sequence permutations. The Fixed-Randomized primers bind with full sequence complementarity at multiple sites where the fixed sequence (such as the splice signals occurs within the genome, and multiplex amplify many regions bounded by the fixed sequences (e.g., exons. Notably, validation of this technique using cardiac myosin binding protein-C (MYBPC3 gene as an example strongly supports the application and efficacy of this method. Further, assisted by genomewide computational analyses of such sequences, the FGF technique may provide a unique platform for high-throughput sample production and analysis of targeted genomic regions by the next-generation sequencing techniques, with powerful applications in discovering disease and drug response genes.

  7. Analysis of Human Accelerated DNA Regions Using Archaic Hominin Genomes

    Science.gov (United States)

    Burbano, Hernán A.; Green, Richard E.; Maricic, Tomislav; Lalueza-Fox, Carles; de la Rasilla, Marco; Rosas, Antonio; Kelso, Janet; Pollard, Katherine S.; Lachmann, Michael; Pääbo, Svante

    2012-01-01

    Several previous comparisons of the human genome with other primate and vertebrate genomes identified genomic regions that are highly conserved in vertebrate evolution but fast-evolving on the human lineage. These human accelerated regions (HARs) may be regions of past adaptive evolution in humans. Alternatively, they may be the result of non-adaptive processes, such as biased gene conversion. We captured and sequenced DNA from a collection of previously published HARs using DNA from an Iberian Neandertal. Combining these new data with shotgun sequence from the Neandertal and Denisova draft genomes, we determine at least one archaic hominin allele for 84% of all positions within HARs. We find that 8% of HAR substitutions are not observed in the archaic hominins and are thus recent in the sense that the derived allele had not come to fixation in the common ancestor of modern humans and archaic hominins. Further, we find that recent substitutions in HARs tend to have come to fixation faster than substitutions elsewhere in the genome and that substitutions in HARs tend to cluster in time, consistent with an episodic rather than a clock-like process underlying HAR evolution. Our catalog of sequence changes in HARs will help prioritize them for functional studies of genomic elements potentially responsible for modern human adaptations. PMID:22412940

  8. Harnessing genomics to improve health in the Eastern Mediterranean Region - an executive course in genomics policy.

    Science.gov (United States)

    Acharya, Tara; Rab, Mohammed Abdur; Singer, Peter A; Daar, Abdallah S

    2005-01-21

    BACKGROUND: While innovations in medicine, science and technology have resulted in improved health and quality of life for many people, the benefits of modern medicine continue to elude millions of people in many parts of the world. To assess the potential of genomics to address health needs in EMR, the World Health Organization's Eastern Mediterranean Regional Office and the University of Toronto Joint Centre for Bioethics jointly organized a Genomics and Public Health Policy Executive Course, held September 20th-23rd, 2003, in Muscat, Oman. The 4-day course was sponsored by WHO-EMRO with additional support from the Canadian Program in Genomics and Global Health. The overall objective of the course was to collectively explore how to best harness genomics to improve health in the region. This article presents the course findings and recommendations for genomics policy in EMR. METHODS: The course brought together senior representatives from academia, biotechnology companies, regulatory bodies, media, voluntary, and legal organizations to engage in discussion. Topics covered included scientific advances in genomics, followed by innovations in business models, public sector perspectives, ethics, legal issues and national innovation systems. RESULTS: A set of recommendations, summarized below, was formulated for the Regional Office, the Member States and for individuals.* Advocacy for genomics and biotechnology for political leadership;* Networking between member states to share information, expertise, training, and regional cooperation in biotechnology; coordination of national surveys for assessment of health biotechnology innovation systems, science capacity, government policies, legislation and regulations, intellectual property policies, private sector activity;* Creation in each member country of an effective National Body on genomics, biotechnology and health to:- formulate national biotechnology strategies- raise biotechnology awareness- encourage teaching and

  9. Linkage disequilibrium of evolutionarily conserved regions in the human genome

    Directory of Open Access Journals (Sweden)

    Johnson Todd A

    2006-12-01

    Full Text Available Abstract Background The strong linkage disequilibrium (LD recently found in genic or exonic regions of the human genome demonstrated that LD can be increased by evolutionary mechanisms that select for functionally important loci. This suggests that LD might be stronger in regions conserved among species than in non-conserved regions, since regions exposed to natural selection tend to be conserved. To assess this hypothesis, we used genome-wide polymorphism data from the HapMap project and investigated LD within DNA sequences conserved between the human and mouse genomes. Results Unexpectedly, we observed that LD was significantly weaker in conserved regions than in non-conserved regions. To investigate why, we examined sequence features that may distort the relationship between LD and conserved regions. We found that interspersed repeats, and not other sequence features, were associated with the weak LD tendency in conserved regions. To appropriately understand the relationship between LD and conserved regions, we removed the effect of repetitive elements and found that the high degree of sequence conservation was strongly associated with strong LD in coding regions but not with that in non-coding regions. Conclusion Our work demonstrates that the degree of sequence conservation does not simply increase LD as predicted by the hypothesis. Rather, it implies that purifying selection changes the polymorphic patterns of coding sequences but has little influence on the patterns of functional units such as regulatory elements present in non-coding regions, since the former are generally restricted by the constraint of maintaining a functional protein product across multiple exons while the latter may exist more as individually isolated units.

  10. Genomic Regions Affecting Cheese Making Properties Identified in Danish Holsteins

    DEFF Research Database (Denmark)

    Gregersen, Vivi Raundahl; Bertelsen, Henriette Pasgaard; Poulsen, Nina Aagaard

    The cheese renneting process is affected by a number of factors associated to milk composition and a number of Danish Holsteins has previously been identified to have poor milk coagulation ability. Therefore, the aim of this study was to identify genomic regions affecting the technological...

  11. Differentiation of regions with atypical oligonucleotide composition in bacterial genomes

    Directory of Open Access Journals (Sweden)

    Reva Oleg N

    2005-10-01

    Full Text Available Abstract Background Complete sequencing of bacterial genomes has become a common technique of present day microbiology. Thereafter, data mining in the complete sequence is an essential step. New in silico methods are needed that rapidly identify the major features of genome organization and facilitate the prediction of the functional class of ORFs. We tested the usefulness of local oligonucleotide usage (OU patterns to recognize and differentiate types of atypical oligonucleotide composition in DNA sequences of bacterial genomes. Results A total of 163 bacterial genomes of eubacteria and archaea published in the NCBI database were analyzed. Local OU patterns exhibit substantial intrachromosomal variation in bacteria. Loci with alternative OU patterns were parts of horizontally acquired gene islands or ancient regions such as genes for ribosomal proteins and RNAs. OU statistical parameters, such as local pattern deviation (D, pattern skew (PS and OU variance (OUV enabled the detection and visualization of gene islands of different functional classes. Conclusion A set of approaches has been designed for the statistical analysis of nucleotide sequences of bacterial genomes. These methods are useful for the visualization and differentiation of regions with atypical oligonucleotide composition prior to or accompanying gene annotation.

  12. Characterization of copy number variation in genomic regions containing STR loci using array comparative genomic hybridization.

    Science.gov (United States)

    Repnikova, Elena A; Rosenfeld, Jill A; Bailes, Andrea; Weber, Cecilia; Erdman, Linda; McKinney, Aimee; Ramsey, Sarah; Hashimoto, Sayaka; Lamb Thrush, Devon; Astbury, Caroline; Reshmi, Shalini C; Shaffer, Lisa G; Gastier-Foster, Julie M; Pyatt, Robert E

    2013-09-01

    Short tandem repeat (STR) loci are commonly used in forensic casework, familial analysis for human identification, and for monitoring hematopoietic cell engraftment after bone marrow transplant. Unexpected genetic variation leading to sequence and length differences in STR loci can complicate STR typing, and presents challenges in casework interpretation. Copy number variation (CNV) is a relatively recently identified form of genetic variation consisting of genomic regions present at variable copy numbers within an individual compared to a reference genome. Large scale population studies have demonstrated that likely all individuals carry multiple regions with CNV of 1kb in size or greater in their genome. To date, no study correlating genomic regions containing STR loci with CNV has been conducted. In this study, we analyzed results from 32,850 samples sent for clinical array comparative genomic hybridization (CGH) analysis for the presence of CNV at regions containing the 13 CODIS (Combined DNA Index System) STR, and the Amelogenin X (AMELX) and Amelogenin Y (AMELY) loci. Thirty-two individuals with CNV involving STR loci on chromosomes 2, 4, 7, 11, 12, 13, 16, and 21, and twelve with CNV involving the AMELX/AMELY loci were identified. These results were correlated with data from publicly available databases housing information on CNV identified in normal populations and additional clinical cases. These collective results demonstrate the presence of CNV in regions containing 9 of the 13 CODIS STR and AMELX/Y loci. Further characterization of STR profiles within regions of CNV, additional cataloging of these variants in multiple populations, and contributing such examples to the public domain will provide valuable information for reliable use of these loci.

  13. Identification of candidate genome regions controlling disease resistance in Arachis

    Directory of Open Access Journals (Sweden)

    Pike Jodie

    2009-08-01

    Full Text Available Abstract Background Worldwide, diseases are important reducers of peanut (Arachis hypogaea yield. Sources of resistance against many diseases are available in cultivated peanut genotypes, although often not in farmer preferred varieties. Wild species generally harbor greater levels of resistance and even apparent immunity, although the linkage of agronomically un-adapted wild alleles with wild disease resistance genes is inevitable. Marker-assisted selection has the potential to facilitate the combination of both cultivated and wild resistance loci with agronomically adapted alleles. However, in peanut there is an almost complete lack of knowledge of the regions of the Arachis genome that control disease resistance. Results In this work we identified candidate genome regions that control disease resistance. For this we placed candidate disease resistance genes and QTLs against late leaf spot disease on the genetic map of the A-genome of Arachis, which is based on microsatellite markers and legume anchor markers. These marker types are transferable within the genus Arachis and to other legumes respectively, enabling this map to be aligned to other Arachis maps and to maps of other legume crops including those with sequenced genomes. In total, 34 sequence-confirmed candidate disease resistance genes and five QTLs were mapped. Conclusion Candidate genes and QTLs were distributed on all linkage groups except for the smallest, but the distribution was not even. Groupings of candidate genes and QTLs for late leaf spot resistance were apparent on the upper region of linkage group 4 and the lower region of linkage group 2, indicating that these regions are likely to control disease resistance.

  14. GRAbB: Selective Assembly of Genomic Regions, a New Niche for Genomic Research.

    Directory of Open Access Journals (Sweden)

    Balázs Brankovics

    2016-06-01

    Full Text Available GRAbB (Genomic Region Assembly by Baiting is a new program that is dedicated to assemble specific genomic regions from NGS data. This approach is especially useful when dealing with multi copy regions, such as mitochondrial genome and the rDNA repeat region, parts of the genome that are often neglected or poorly assembled, although they contain interesting information from phylogenetic or epidemiologic perspectives, but also single copy regions can be assembled. The program is capable of targeting multiple regions within a single run. Furthermore, GRAbB can be used to extract specific loci from NGS data, based on homology, like sequences that are used for barcoding. To make the assembly specific, a known part of the region, such as the sequence of a PCR amplicon or a homologous sequence from a related species must be specified. By assembling only the region of interest, the assembly process is computationally much less demanding and may lead to assemblies of better quality. In this study the different applications and functionalities of the program are demonstrated such as: exhaustive assembly (rDNA region and mitochondrial genome, extracting homologous regions or genes (IGS, RPB1, RPB2 and TEF1a, as well as extracting multiple regions within a single run. The program is also compared with MITObim, which is meant for the exhaustive assembly of a single target based on a similar query sequence. GRAbB is shown to be more efficient than MITObim in terms of speed, memory and disk usage. The other functionalities (handling multiple targets simultaneously and extracting homologous regions of the new program are not matched by other programs. The program is available with explanatory documentation at https://github.com/b-brankovics/grabb. GRAbB has been tested on Ubuntu (12.04 and 14.04, Fedora (23, CentOS (7.1.1503 and Mac OS X (10.7. Furthermore, GRAbB is available as a docker repository: brankovics/grabb (https://hub.docker.com/r/brankovics/grabb/.

  15. GRAbB: Selective Assembly of Genomic Regions, a New Niche for Genomic Research.

    Science.gov (United States)

    Brankovics, Balázs; Zhang, Hao; van Diepeningen, Anne D; van der Lee, Theo A J; Waalwijk, Cees; de Hoog, G Sybren

    2016-06-01

    GRAbB (Genomic Region Assembly by Baiting) is a new program that is dedicated to assemble specific genomic regions from NGS data. This approach is especially useful when dealing with multi copy regions, such as mitochondrial genome and the rDNA repeat region, parts of the genome that are often neglected or poorly assembled, although they contain interesting information from phylogenetic or epidemiologic perspectives, but also single copy regions can be assembled. The program is capable of targeting multiple regions within a single run. Furthermore, GRAbB can be used to extract specific loci from NGS data, based on homology, like sequences that are used for barcoding. To make the assembly specific, a known part of the region, such as the sequence of a PCR amplicon or a homologous sequence from a related species must be specified. By assembling only the region of interest, the assembly process is computationally much less demanding and may lead to assemblies of better quality. In this study the different applications and functionalities of the program are demonstrated such as: exhaustive assembly (rDNA region and mitochondrial genome), extracting homologous regions or genes (IGS, RPB1, RPB2 and TEF1a), as well as extracting multiple regions within a single run. The program is also compared with MITObim, which is meant for the exhaustive assembly of a single target based on a similar query sequence. GRAbB is shown to be more efficient than MITObim in terms of speed, memory and disk usage. The other functionalities (handling multiple targets simultaneously and extracting homologous regions) of the new program are not matched by other programs. The program is available with explanatory documentation at https://github.com/b-brankovics/grabb. GRAbB has been tested on Ubuntu (12.04 and 14.04), Fedora (23), CentOS (7.1.1503) and Mac OS X (10.7). Furthermore, GRAbB is available as a docker repository: brankovics/grabb (https://hub.docker.com/r/brankovics/grabb/).

  16. Nucleolar organizer regions: genomic 'dark matter' requiring illumination.

    Science.gov (United States)

    McStay, Brian

    2016-07-15

    Nucleoli form around tandem arrays of a ribosomal gene repeat, termed nucleolar organizer regions (NORs). During metaphase, active NORs adopt a characteristic undercondensed morphology. Recent evidence indicates that the HMG-box-containing DNA-binding protein UBF (upstream binding factor) is directly responsible for this morphology and provides a mitotic bookmark to ensure rapid nucleolar formation beginning in telophase in human cells. This is likely to be a widely employed strategy, as UBF is present throughout metazoans. In higher eukaryotes, NORs are typically located within regions of chromosomes that form perinucleolar heterochromatin during interphase. Typically, the genomic architecture of NORs and the chromosomal regions within which they lie is very poorly described, yet recent evidence points to a role for context in their function. In Arabidopsis, NOR silencing appears to be controlled by sequences outside the rDNA (ribosomal DNA) array. Translocations reveal a role for context in the expression of the NOR on the X chromosome in Drosophila Recent work has begun on characterizing the genomic architecture of human NORs. A role for distal sequences located in perinucleolar heterochromatin has been inferred, as they exhibit a complex transcriptionally active chromatin structure. Links between rDNA genomic stability and aging in Saccharomyces cerevisiae are now well established, and indications are emerging that this is important in aging and replicative senescence in higher eukaryotes. This, combined with the fact that rDNA arrays are recombinational hot spots in cancer cells, has focused attention on DNA damage responses in NORs. The introduction of DNA double-strand breaks into rDNA arrays leads to a dramatic reorganization of nucleolar structure. Damaged rDNA repeats move from the nucleolar interior to form caps at the nucleolar periphery, presumably to facilitate repair, suggesting that the chromosomal context of human NORs contributes to their genomic

  17. Forces shaping the fastest evolving regions in the human genome.

    Directory of Open Access Journals (Sweden)

    Katherine S Pollard

    2006-10-01

    Full Text Available Comparative genomics allow us to search the human genome for segments that were extensively changed in the last approximately 5 million years since divergence from our common ancestor with chimpanzee, but are highly conserved in other species and thus are likely to be functional. We found 202 genomic elements that are highly conserved in vertebrates but show evidence of significantly accelerated substitution rates in human. These are mostly in non-coding DNA, often near genes associated with transcription and DNA binding. Resequencing confirmed that the five most accelerated elements are dramatically changed in human but not in other primates, with seven times more substitutions in human than in chimp. The accelerated elements, and in particular the top five, show a strong bias for adenine and thymine to guanine and cytosine nucleotide changes and are disproportionately located in high recombination and high guanine and cytosine content environments near telomeres, suggesting either biased gene conversion or isochore selection. In addition, there is some evidence of directional selection in the regions containing the two most accelerated regions. A combination of evolutionary forces has contributed to accelerated evolution of the fastest evolving elements in the human genome.

  18. Genome-wide comparisons of phylogenetic similarities between partial genomic regions and the full-length genome in Hepatitis E virus genotyping.

    Directory of Open Access Journals (Sweden)

    Shuai Wang

    Full Text Available Besides the complete genome, different partial genomic sequences of Hepatitis E virus (HEV have been used in genotyping studies, making it difficult to compare the results based on them. No commonly agreed partial region for HEV genotyping has been determined. In this study, we used a statistical method to evaluate the phylogenetic performance of each partial genomic sequence from a genome wide, by comparisons of evolutionary distances between genomic regions and the full-length genomes of 101 HEV isolates to identify short genomic regions that can reproduce HEV genotype assignments based on full-length genomes. Several genomic regions, especially one genomic region at the 3'-terminal of the papain-like cysteine protease domain, were detected to have relatively high phylogenetic correlations with the full-length genome. Phylogenetic analyses confirmed the identical performances between these regions and the full-length genome in genotyping, in which the HEV isolates involved could be divided into reasonable genotypes. This analysis may be of value in developing a partial sequence-based consensus classification of HEV species.

  19. Chromosome region-specific libraries for human genome analysis

    Energy Technology Data Exchange (ETDEWEB)

    Kao, Fa-Ten.

    1991-01-01

    We have made important progress since the beginning of the current grant year. We have further developed the microdissection and PCR- assisted microcloning techniques using the linker-adaptor method. We have critically evaluated the microdissection libraries constructed by this microtechnology and proved that they are of high quality. We further demonstrated that these microdissection clones are useful in identifying corresponding YAC clones for a thousand-fold expansion of the genomic coverage and for contig construction. We are also improving the technique of cloning the dissected fragments in test tube by the TDT method. We are applying both of these PCR cloning technique to human chromosomes 2 and 5 to construct region-specific libraries for physical mapping purposes of LLNL and LANL. Finally, we are exploring efficient procedures to use unique sequence microclones to isolate cDNA clones from defined chromosomal regions as valuable resources for identifying expressed gene sequences in the human genome. We believe that we are making important progress under the auspices of this DOE human genome program grant and we will continue to make significant contributions in the coming year. 4 refs., 4 figs.

  20. Searching for additional disease loci in a genomic region.

    Science.gov (United States)

    Thomson, Glenys; Barcellos, Lisa F; Valdes, Ana M

    2008-01-01

    Our aim is to review methods to optimize detection of all disease genes in a genetic region. As a starting point, we assume there is sufficient evidence from linkage and/or association studies, based on significance levels or replication studies, for the involvement in disease risk of the genetic region under study. For closely linked markers, there will often be multiple associations with disease, and linkage analyses identify a region rather than the specific disease-predisposing gene. Hence, the first task is to identify the primary (major) disease-predisposing gene or genes in a genetic region, and single nucleotide polymorphisms thereof, that is, how to distinguish true associations from those that are just due to linkage disequilibrium with the actual disease-predisposing variants. Then, how do we detect additional disease genes in this genetic region? These two issues are of course very closely interrelated. No existing programs, either individually or in aggregate, can handle the magnitude and complexity of the analyses needed using currently available methods. Further, even with modern computers, one cannot study every possible combination of genetic markers and their haplotypes across the genome, or even within a genetic region. Although we must rely heavily on computers, in the final analysis of multiple effects in a genetic region and/or interaction or independent effects between unlinked genes, manipulation of the data by the individual investigator will play a crucial role. We recommend a multistrategy approach using a variety of complementary methods described below.

  1. Comparative Genomic Analyses of the Human NPHP1 Locus Reveal Complex Genomic Architecture and Its Regional Evolution in Primates.

    Directory of Open Access Journals (Sweden)

    Bo Yuan

    2015-12-01

    Full Text Available Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100 is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases-about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual's susceptibility to acquiring disease-associated alleles.

  2. High-Throughput resequencing of maize landraces at genomic regions associated with flowering time

    Science.gov (United States)

    Despite the reduction in the price of sequencing, it remains expensive to sequence and assemble whole, complex genomes of multiple samples for population studies, particularly for large genomes like those of many crop species. Enrichment of target genome regions coupled with next generation sequenci...

  3. Genome-wide expression profiling of complex regional pain syndrome.

    Directory of Open Access Journals (Sweden)

    Eun-Heui Jin

    Full Text Available Complex regional pain syndrome (CRPS is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, there genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II and 5 controls (cut-off value: 1.5-fold change and p<0.05. Most of those genes were associated with signal transduction, developmental processes, cell structure and motility, and immunity and defense. The expression levels of major histocompatibility complex class I A subtype (HLA-A29.1, matrix metalloproteinase 9 (MMP9, alanine aminopeptidase N (ANPEP, l-histidine decarboxylase (HDC, granulocyte colony-stimulating factor 3 receptor (G-CSF3R, and signal transducer and activator of transcription 3 (STAT3 genes selected from the microarray were confirmed in 24 CRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR. We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4×10(-4. The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS may help in the understanding of the pathophysiology of CRPS pain progression.

  4. Pan-genome sequence analysis using Panseq: an online tool for the rapid analysis of core and accessory genomic regions

    Directory of Open Access Journals (Sweden)

    Villegas Andre

    2010-09-01

    Full Text Available Abstract Background The pan-genome of a bacterial species consists of a core and an accessory gene pool. The accessory genome is thought to be an important source of genetic variability in bacterial populations and is gained through lateral gene transfer, allowing subpopulations of bacteria to better adapt to specific niches. Low-cost and high-throughput sequencing platforms have created an exponential increase in genome sequence data and an opportunity to study the pan-genomes of many bacterial species. In this study, we describe a new online pan-genome sequence analysis program, Panseq. Results Panseq was used to identify Escherichia coli O157:H7 and E. coli K-12 genomic islands. Within a population of 60 E. coli O157:H7 strains, the existence of 65 accessory genomic regions identified by Panseq analysis was confirmed by PCR. The accessory genome and binary presence/absence data, and core genome and single nucleotide polymorphisms (SNPs of six L. monocytogenes strains were extracted with Panseq and hierarchically clustered and visualized. The nucleotide core and binary accessory data were also used to construct maximum parsimony (MP trees, which were compared to the MP tree generated by multi-locus sequence typing (MLST. The topology of the accessory and core trees was identical but differed from the tree produced using seven MLST loci. The Loci Selector module found the most variable and discriminatory combinations of four loci within a 100 loci set among 10 strains in 1 s, compared to the 449 s required to exhaustively search for all possible combinations; it also found the most discriminatory 20 loci from a 96 loci E. coli O157:H7 SNP dataset. Conclusion Panseq determines the core and accessory regions among a collection of genomic sequences based on user-defined parameters. It readily extracts regions unique to a genome or group of genomes, identifies SNPs within shared core genomic regions, constructs files for use in phylogeny programs

  5. Computational Comparison of Human Genomic Sequence Assemblies for a Region of Chromosome 4

    OpenAIRE

    Semple, Colin; Stewart W. Morris; Porteous, David J.; Evans, Kathryn L.

    2002-01-01

    Much of the available human genomic sequence data exist in a fragmentary draft state following the completion of the initial high-volume sequencing performed by the International Human Genome Sequencing Consortium (IHGSC) and Celera Genomics (CG). We compared six draft genome assemblies over a region of chromosome 4p (D4S394–D4S403), two consecutive releases by the IHGSC at University of California, Santa Cruz (UCSC), two consecutive releases from the National Centre for Biotechnology Informa...

  6. Forces shaping the fastest evolving regions in the human genome

    DEFF Research Database (Denmark)

    Pollard, Katherine S; Salama, Sofie R; King, Bryan;

    2006-01-01

    Comparative genomics allow us to search the human genome for segments that were extensively changed in the last approximately 5 million years since divergence from our common ancestor with chimpanzee, but are highly conserved in other species and thus are likely to be functional. We found 202 gen...... contributed to accelerated evolution of the fastest evolving elements in the human genome.......Comparative genomics allow us to search the human genome for segments that were extensively changed in the last approximately 5 million years since divergence from our common ancestor with chimpanzee, but are highly conserved in other species and thus are likely to be functional. We found 202...... genomic elements that are highly conserved in vertebrates but show evidence of significantly accelerated substitution rates in human. These are mostly in non-coding DNA, often near genes associated with transcription and DNA binding. Resequencing confirmed that the five most accelerated elements...

  7. Augmenting Chinese hamster genome assembly by identifying regions of high confidence.

    Science.gov (United States)

    Vishwanathan, Nandita; Bandyopadhyay, Arpan A; Fu, Hsu-Yuan; Sharma, Mohit; Johnson, Kathryn C; Mudge, Joann; Ramaraj, Thiruvarangan; Onsongo, Getiria; Silverstein, Kevin A T; Jacob, Nitya M; Le, Huong; Karypis, George; Hu, Wei-Shou

    2016-09-01

    Chinese hamster Ovary (CHO) cell lines are the dominant industrial workhorses for therapeutic recombinant protein production. The availability of genome sequence of Chinese hamster and CHO cells will spur further genome and RNA sequencing of producing cell lines. However, the mammalian genomes assembled using shot-gun sequencing data still contain regions of uncertain quality due to assembly errors. Identifying high confidence regions in the assembled genome will facilitate its use for cell engineering and genome engineering. We assembled two independent drafts of Chinese hamster genome by de novo assembly from shotgun sequencing reads and by re-scaffolding and gap-filling the draft genome from NCBI for improved scaffold lengths and gap fractions. We then used the two independent assemblies to identify high confidence regions using two different approaches. First, the two independent assemblies were compared at the sequence level to identify their consensus regions as "high confidence regions" which accounts for at least 78 % of the assembled genome. Further, a genome wide comparison of the Chinese hamster scaffolds with mouse chromosomes revealed scaffolds with large blocks of collinearity, which were also compiled as high-quality scaffolds. Genome scale collinearity was complemented with EST based synteny which also revealed conserved gene order compared to mouse. As cell line sequencing becomes more commonly practiced, the approaches reported here are useful for assessing the quality of assembly and potentially facilitate the engineering of cell lines.

  8. Comprehensive repertoire of foldable regions within whole genomes.

    Directory of Open Access Journals (Sweden)

    Guilhem Faure

    2013-10-01

    Full Text Available In order to get a comprehensive repertoire of foldable domains within whole proteomes, including orphan domains, we developed a novel procedure, called SEG-HCA. From only the information of a single amino acid sequence, SEG-HCA automatically delineates segments possessing high densities in hydrophobic clusters, as defined by Hydrophobic Cluster Analysis (HCA. These hydrophobic clusters mainly correspond to regular secondary structures, which together form structured or foldable regions. Genome-wide analyses revealed that SEG-HCA is opposite of disorder predictors, both addressing distinct structural states. Interestingly, there is however an overlap between the two predictions, including small segments of disordered sequences, which undergo coupled folding and binding. SEG-HCA thus gives access to these specific domains, which are generally poorly represented in domain databases. Comparison of the whole set of SEG-HCA predictions with the Conserved Domain Database (CDD also highlighted a wide proportion of predicted large (length >50 amino acids segments, which are CDD orphan. These orphan sequences may either correspond to highly divergent members of already known families or belong to new families of domains. Their comprehensive description thus opens new avenues to investigate new functional and/or structural features, which remained so far uncovered. Altogether, the data described here provide new insights into the protein architecture and organization throughout the three kingdoms of life.

  9. New genomic resources for switchgrass: a BAC library and comparative analysis of homoeologous genomic regions harboring bioenergy traits

    Directory of Open Access Journals (Sweden)

    Feltus Frank A

    2011-07-01

    Full Text Available Abstract Background Switchgrass, a C4 species and a warm-season grass native to the prairies of North America, has been targeted for development into an herbaceous biomass fuel crop. Genetic improvement of switchgrass feedstock traits through marker-assisted breeding and biotechnology approaches calls for genomic tools development. Establishment of integrated physical and genetic maps for switchgrass will accelerate mapping of value added traits useful to breeding programs and to isolate important target genes using map based cloning. The reported polyploidy series in switchgrass ranges from diploid (2X = 18 to duodecaploid (12X = 108. Like in other large, repeat-rich plant genomes, this genomic complexity will hinder whole genome sequencing efforts. An extensive physical map providing enough information to resolve the homoeologous genomes would provide the necessary framework for accurate assembly of the switchgrass genome. Results A switchgrass BAC library constructed by partial digestion of nuclear DNA with EcoRI contains 147,456 clones covering the effective genome approximately 10 times based on a genome size of 3.2 Gigabases (~1.6 Gb effective. Restriction digestion and PFGE analysis of 234 randomly chosen BACs indicated that 95% of the clones contained inserts, ranging from 60 to 180 kb with an average of 120 kb. Comparative sequence analysis of two homoeologous genomic regions harboring orthologs of the rice OsBRI1 locus, a low-copy gene encoding a putative protein kinase and associated with biomass, revealed that orthologous clones from homoeologous chromosomes can be unambiguously distinguished from each other and correctly assembled to respective fingerprint contigs. Thus, the data obtained not only provide genomic resources for further analysis of switchgrass genome, but also improve efforts for an accurate genome sequencing strategy. Conclusions The construction of the first switchgrass BAC library and comparative analysis of

  10. Harnessing genomics to improve health in the Eastern Mediterranean Region – an executive course in genomics policy

    Directory of Open Access Journals (Sweden)

    Singer Peter A

    2005-01-01

    Full Text Available Abstract Background While innovations in medicine, science and technology have resulted in improved health and quality of life for many people, the benefits of modern medicine continue to elude millions of people in many parts of the world. To assess the potential of genomics to address health needs in EMR, the World Health Organization's Eastern Mediterranean Regional Office and the University of Toronto Joint Centre for Bioethics jointly organized a Genomics and Public Health Policy Executive Course, held September 20th–23rd, 2003, in Muscat, Oman. The 4-day course was sponsored by WHO-EMRO with additional support from the Canadian Program in Genomics and Global Health. The overall objective of the course was to collectively explore how to best harness genomics to improve health in the region. This article presents the course findings and recommendations for genomics policy in EMR. Methods The course brought together senior representatives from academia, biotechnology companies, regulatory bodies, media, voluntary, and legal organizations to engage in discussion. Topics covered included scientific advances in genomics, followed by innovations in business models, public sector perspectives, ethics, legal issues and national innovation systems. Results A set of recommendations, summarized below, was formulated for the Regional Office, the Member States and for individuals. • Advocacy for genomics and biotechnology for political leadership; • Networking between member states to share information, expertise, training, and regional cooperation in biotechnology; coordination of national surveys for assessment of health biotechnology innovation systems, science capacity, government policies, legislation and regulations, intellectual property policies, private sector activity; • Creation in each member country of an effective National Body on genomics, biotechnology and health to: - formulate national biotechnology strategies - raise

  11. Structured RNAs and synteny regions in the pig genome

    DEFF Research Database (Denmark)

    Anthon, Christian; Tafer, Hakim; Havgaard, Jakob Hull;

    2014-01-01

    for Laurasiatheria (pig, cow, dolphin, horse, cat, dog, hedgehog). CONCLUSIONS: We have obtained one of the most comprehensive annotations for structured ncRNAs of a mammalian genome, which is likely to play central roles in both health modelling and production. The core annotation is available in Ensembl 70......BACKGROUND: Annotating mammalian genomes for noncoding RNAs (ncRNAs) is nontrivial since far from all ncRNAs are known and the computational models are resource demanding. Currently, the human genome holds the best mammalian ncRNA annotation, a result of numerous efforts by several groups. However...

  12. Database of Periodic DNA Regions in Major Genomes

    Directory of Open Access Journals (Sweden)

    Felix E. Frenkel

    2017-01-01

    Full Text Available Summary. We analyzed several prokaryotic and eukaryotic genomes looking for the periodicity sequences availability and employing a new mathematical method. The method envisaged using the random position weight matrices and dynamic programming. Insertions and deletions were allowed inside periodicities, thus adding a novelty to the results we obtained. A periodicity length, one of the key periodicity features, varied from 2 to 50 nt. Totally over 60,000 periodicity sequences were found in 15 genomes including some chromosomes of the H. sapiens (partial, C. elegans, D. melanogaster, and A. thaliana genomes.

  13. Database of Periodic DNA Regions in Major Genomes

    Science.gov (United States)

    2017-01-01

    Summary. We analyzed several prokaryotic and eukaryotic genomes looking for the periodicity sequences availability and employing a new mathematical method. The method envisaged using the random position weight matrices and dynamic programming. Insertions and deletions were allowed inside periodicities, thus adding a novelty to the results we obtained. A periodicity length, one of the key periodicity features, varied from 2 to 50 nt. Totally over 60,000 periodicity sequences were found in 15 genomes including some chromosomes of the H. sapiens (partial), C. elegans, D. melanogaster, and A. thaliana genomes. PMID:28182099

  14. Annotation of the protein coding regions of the equine genome

    DEFF Research Database (Denmark)

    Hestand, Matthew S.; Kalbfleisch, Theodore S.; Coleman, Stephen J.

    2015-01-01

    Current gene annotation of the horse genome is largely derived from in silico predictions and cross-species alignments. Only a small number of genes are annotated based on equine EST and mRNA sequences. To expand the number of equine genes annotated from equine experimental evidence, we sequenced m...... and appear to be small errors in the equine reference genome, since they are also identified as homozygous variants by genomic DNA resequencing of the reference horse. Taken together, we provide a resource of equine mRNA structures and protein coding variants that will enhance equine and cross...

  15. Identification of Low-Confidence Regions in the Pig Reference Genome (Sscrofa10.2)

    Science.gov (United States)

    Warr, Amanda; Robert, Christelle; Hume, David; Archibald, Alan L.; Deeb, Nader; Watson, Mick

    2015-01-01

    Many applications of high throughput sequencing rely on the availability of an accurate reference genome. Variant calling often produces large data sets that cannot be realistically validated and which may contain large numbers of false-positives. Errors in the reference assembly increase the number of false-positives. While resources are available to aid in the filtering of variants from human data, for other species these do not yet exist and strict filtering techniques must be employed which are more likely to exclude true-positives. This work assesses the accuracy of the pig reference genome (Sscrofa10.2) using whole genome sequencing reads from the Duroc sow whose genome the assembly was based on. Indicators of structural variation including high regional coverage, unexpected insert sizes, improper pairing and homozygous variants were used to identify low quality (LQ) regions of the assembly. Low coverage (LC) regions were also identified and analyzed separately. The LQ regions covered 13.85% of the genome, the LC regions covered 26.6% of the genome and combined (LQLC) they covered 33.07% of the genome. Over half of dbSNP variants were located in the LQLC regions. Of copy number variable regions identified in a previous study, 86.3% were located in the LQLC regions. The regions were also enriched for gene predictions from RNA-seq data with 42.98% falling in the LQLC regions. Excluding variants in the LQ, LC, or LQLC from future analyses will help reduce the number of false-positive variant calls. Researchers using WGS data should be aware that the current pig reference genome does not give an accurate representation of the copy number of alleles in the original Duroc sow’s genome. PMID:26640477

  16. Identification of Low-Confidence Regions in the Pig Reference Genome (Sscrofa10.2).

    Science.gov (United States)

    Warr, Amanda; Robert, Christelle; Hume, David; Archibald, Alan L; Deeb, Nader; Watson, Mick

    2015-01-01

    Many applications of high throughput sequencing rely on the availability of an accurate reference genome. Variant calling often produces large data sets that cannot be realistically validated and which may contain large numbers of false-positives. Errors in the reference assembly increase the number of false-positives. While resources are available to aid in the filtering of variants from human data, for other species these do not yet exist and strict filtering techniques must be employed which are more likely to exclude true-positives. This work assesses the accuracy of the pig reference genome (Sscrofa10.2) using whole genome sequencing reads from the Duroc sow whose genome the assembly was based on. Indicators of structural variation including high regional coverage, unexpected insert sizes, improper pairing and homozygous variants were used to identify low quality (LQ) regions of the assembly. Low coverage (LC) regions were also identified and analyzed separately. The LQ regions covered 13.85% of the genome, the LC regions covered 26.6% of the genome and combined (LQLC) they covered 33.07% of the genome. Over half of dbSNP variants were located in the LQLC regions. Of copy number variable regions identified in a previous study, 86.3% were located in the LQLC regions. The regions were also enriched for gene predictions from RNA-seq data with 42.98% falling in the LQLC regions. Excluding variants in the LQ, LC, or LQLC from future analyses will help reduce the number of false-positive variant calls. Researchers using WGS data should be aware that the current pig reference genome does not give an accurate representation of the copy number of alleles in the original Duroc sow's genome.

  17. Identification of low-confidence regions in the pig reference genome (Sscrofa10.2

    Directory of Open Access Journals (Sweden)

    Amanda eWarr

    2015-11-01

    Full Text Available Many applications of high throughput sequencing rely on the availability of an accurate reference genome. Variant calling often produces large data sets that cannot be realistically validated and which may contain large numbers of false-positives. Errors in the reference assembly increase the number of false-positives. While resources are available to aid in the filtering of variants from human data, for other species these do not yet exist and strict filtering techniques must be employed which are more likely to exclude true-positives. This work assesses the accuracy of the pig reference genome (Sscrofa10.2 using whole genome sequencing reads from the Duroc sow whose genome the assembly was based on. Indicators of structural variation including high regional coverage, unexpected insert sizes, improper pairing and homozygous variants were used to identify low quality (LQ regions of the assembly. Low coverage (LC regions were also identified and analyzed separately. The LQ regions covered 13.85% of the genome, the LC regions covered 26.6% of the genome and combined (LQLC they covered 33.07% of the genome. Over half of dbSNP variants were located in the LQLC regions. Of CNVRs identified in a previous study, 86.3% were located in the LQLC regions. The regions were also enriched for gene predictions from RNA-seq data with 42.98% falling in the LQLC regions. Excluding variants in the LQ, LC or LQLC from future analyses will help reduce the number of false-positive variant calls. Researchers using WGS data should be aware that the current pig reference genome does not give an accurate representation of the copy number of alleles in the original Duroc sow’s genome.

  18. Breaking Good: Accounting for Fragility of Genomic Regions in Rearrangement Distance Estimation.

    Science.gov (United States)

    Biller, Priscila; Guéguen, Laurent; Knibbe, Carole; Tannier, Eric

    2016-01-01

    Models of evolution by genome rearrangements are prone to two types of flaws: One is to ignore the diversity of susceptibility to breakage across genomic regions, and the other is to suppose that susceptibility values are given. Without necessarily supposing their precise localization, we call "solid" the regions that are improbably broken by rearrangements and "fragile" the regions outside solid ones. We propose a model of evolution by inversions where breakage probabilities vary across fragile regions and over time. It contains as a particular case the uniform breakage model on the nucleotidic sequence, where breakage probabilities are proportional to fragile region lengths. This is very different from the frequently used pseudouniform model where all fragile regions have the same probability to break. Estimations of rearrangement distances based on the pseudouniform model completely fail on simulations with the truly uniform model. On pairs of amniote genomes, we show that identifying coding genes with solid regions yields incoherent distance estimations, especially with the pseudouniform model, and to a lesser extent with the truly uniform model. This incoherence is solved when we coestimate the number of fragile regions with the rearrangement distance. The estimated number of fragile regions is surprisingly small, suggesting that a minority of regions are recurrently used by rearrangements. Estimations for several pairs of genomes at different divergence times are in agreement with a slowly evolvable colocalization of active genomic regions in the cell.

  19. Identification and annotation of promoter regions in microbial genome sequences on the basis of DNA stability

    Indian Academy of Sciences (India)

    Vetriselvi Rangannan; Manju Bansal

    2007-08-01

    Analysis of various predicted structural properties of promoter regions in prokaryotic as well as eukaryotic genomes had earlier indicated that they have several common features, such as lower stability, higher curvature and less bendability, when compared with their neighboring regions. Based on the difference in stability between neighboring upstream and downstream regions in the vicinity of experimentally determined transcription start sites, a promoter prediction algorithm has been developed to identify prokaryotic promoter sequences in whole genomes. The average free energy (E) over known promoter sequences and the difference (D) between E and the average free energy over the entire genome (G) are used to search for promoters in the genomic sequences. Using these cutoff values to predict promoter regions across entire Escherichia coli genome, we achieved a reliability of 70% when the predicted promoters were cross verified against the 960 transcription start sites (TSSs) listed in the Ecocyc database. Annotation of the whole E. coli genome for promoter region could be carried out with 49% accuracy. The method is quite general and it can be used to annotate the promoter regions of other prokaryotic genomes.

  20. Genomic regions associated with necrotic enteritis resistance in Fayoumi and White Leghorn chickens

    Science.gov (United States)

    In this study, we used two breeds of chicken to identify genomic regions corresponding to necrotic enteritis (NE) resistance. We scanned the genomes of a resistant and susceptible line of Fayoumi and White Leghorn chicken using a chicken 60K Illumina SNP panel. A total of 235 loci with divergently ...

  1. Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions.

    Science.gov (United States)

    Wang, Yu; Li, Wei; Xia, Yingying; Wang, Chongzhi; Tang, Y Tom; Guo, Wenying; Li, Jinliang; Zhao, Xia; Sun, Yepeng; Hu, Juan; Zhen, Hefu; Zhang, Xiandong; Chen, Chao; Shi, Yujian; Li, Lin; Cao, Hongzhi; Du, Hongli; Li, Jian

    2014-01-01

    Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS), is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs). In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information.

  2. Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions.

    Directory of Open Access Journals (Sweden)

    Wei Li

    Full Text Available Copy-number variations (CNV, loss of heterozygosity (LOH, and uniparental disomy (UPD are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS, is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs. In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information.

  3. Localising loci underlying complex trait variation using Regional Genomic Relationship Mapping.

    Directory of Open Access Journals (Sweden)

    Yoshitaka Nagamine

    Full Text Available The limited proportion of complex trait variance identified in genome-wide association studies may reflect the limited power of single SNP analyses to detect either rare causative alleles or those of small effect. Motivated by studies that demonstrate that loci contributing to trait variation may contain a number of different alleles, we have developed an analytical approach termed Regional Genomic Relationship Mapping that, like linkage-based family methods, integrates variance contributed by founder gametes within a pedigree. This approach takes advantage of very distant (and unrecorded relationships, and this greatly increases the power of the method, compared with traditional pedigree-based linkage analyses. By integrating variance contributed by founder gametes in the population, our approach provides an estimate of the Regional Heritability attributable to a small genomic region (e.g. 100 SNP window covering ca. 1 Mb of DNA in a 300000 SNP GWAS and has the power to detect regions containing multiple alleles that individually contribute too little variance to be detectable by GWAS as well as regions with single common GWAS-detectable SNPs. We use genome-wide SNP array data to obtain both a genome-wide relationship matrix and regional relationship ("identity by state" or IBS matrices for sequential regions across the genome. We then estimate a heritability for each region sequentially in our genome-wide scan. We demonstrate by simulation and with real data that, when compared to traditional ("individual SNP" GWAS, our method uncovers new loci that explain additional trait variation. We analysed data from three Southern European populations and from Orkney for exemplar traits - serum uric acid concentration and height. We show that regional heritability estimates are correlated with results from genome-wide association analysis but can capture more of the genetic variance segregating in the population and identify additional trait loci.

  4. Targeted enrichment of genomic DNA regions for next generation sequencing

    NARCIS (Netherlands)

    Mertens, F.; El-Sharawy, A.; Sauer, S.; Van Helvoort, J.; Van der Zaag, P.J.; Franke, A.; Nilsson, M.; Lehrach. H.; Brookes, A.

    2011-01-01

    In this review we discuss the latest targeted enrichment methods, and aspects of their utilization along with second generation sequencing for complex genome analysis. In doing so we provide an overview of issues involved in detecting genetic variation, for which targeted enrichment has become a pow

  5. Structured RNAs and synteny regions in the pig genome

    DEFF Research Database (Denmark)

    Anthon, Christian; Tafer, Hakim; Havgaard, Jakob H

    2014-01-01

    for Laurasiatheria (pig, cow, dolphin, horse, cat, dog, hedgehog). CONCLUSIONS: We have obtained one of the most comprehensive annotations for structured ncRNAs of a mammalian genome, which is likely to play central roles in both health modelling and production. The core annotation is available in Ensembl 70...

  6. Annotation of the protein coding regions of the equine genome

    DEFF Research Database (Denmark)

    Hestand, Matthew S.; Kalbfleisch, Theodore S.; Coleman, Stephen J.;

    2015-01-01

    Current gene annotation of the horse genome is largely derived from in silico predictions and cross-species alignments. Only a small number of genes are annotated based on equine EST and mRNA sequences. To expand the number of equine genes annotated from equine experimental evidence, we sequenced...

  7. Estimation of (co)variances for genomic regions of flexible sizes

    DEFF Research Database (Denmark)

    Sørensen, Lars P; Janss, Luc; Madsen, Per;

    2012-01-01

    traits such as mammary disease traits in dairy cattle. METHODS: Data on progeny means of six traits related to mastitis resistance in dairy cattle (general mastitis resistance and five pathogen-specific mastitis resistance traits) were analyzed using a bivariate Bayesian SNP-based genomic model......)variances of mastitis resistance traits in dairy cattle using multivariate genomic models......., per chromosome, and in regions of 100 SNP on a chromosome. RESULTS: Genomic proportions of the total variance differed between traits. Genomic correlations were lower than pedigree-based genetic correlations and they were highest between general mastitis and pathogen-specific traits because...

  8. A genome-wide analysis of genetic diversity in Trypanosoma cruzi intergenic regions.

    Directory of Open Access Journals (Sweden)

    Leonardo G Panunzi

    2014-05-01

    Full Text Available BACKGROUND: Trypanosoma cruzi is the causal agent of Chagas Disease. Recently, the genomes of representative strains from two major evolutionary lineages were sequenced, allowing the construction of a detailed genetic diversity map for this important parasite. However this map is focused on coding regions of the genome, leaving a vast space of regulatory regions uncharacterized in terms of their evolutionary conservation and/or divergence. METHODOLOGY: Using data from the hybrid CL Brener and Sylvio X10 genomes (from the TcVI and TcI Discrete Typing Units, respectively, we identified intergenic regions that share a common evolutionary ancestry, and are present in both CL Brener haplotypes (TcII-like and TcIII-like and in the TcI genome; as well as intergenic regions that were conserved in only two of the three genomes/haplotypes analyzed. The genetic diversity in these regions was characterized in terms of the accumulation of indels and nucleotide changes. PRINCIPAL FINDINGS: Based on this analysis we have identified i a core of highly conserved intergenic regions, which remained essentially unchanged in independently evolving lineages; ii intergenic regions that show high diversity in spite of still retaining their corresponding upstream and downstream coding sequences; iii a number of defined sequence motifs that are shared by a number of unrelated intergenic regions. A fraction of indels explains the diversification of some intergenic regions by the expansion/contraction of microsatellite-like repeats.

  9. CGHScan: finding variable regions using high-density microarray comparative genomic hybridization data

    Directory of Open Access Journals (Sweden)

    Rajashekara Gireesh

    2006-04-01

    Full Text Available Abstract Background Comparative genomic hybridization can rapidly identify chromosomal regions that vary between organisms and tissues. This technique has been applied to detecting differences between normal and cancerous tissues in eukaryotes as well as genomic variability in microbial strains and species. The density of oligonucleotide probes available on current microarray platforms is particularly well-suited for comparisons of organisms with smaller genomes like bacteria and yeast where an entire genome can be assayed on a single microarray with high resolution. Available methods for analyzing these experiments typically confine analyses to data from pre-defined annotated genome features, such as entire genes. Many of these methods are ill suited for datasets with the number of measurements typical of high-density microarrays. Results We present an algorithm for analyzing microarray hybridization data to aid identification of regions that vary between an unsequenced genome and a sequenced reference genome. The program, CGHScan, uses an iterative random walk approach integrating multi-layered significance testing to detect these regions from comparative genomic hybridization data. The algorithm tolerates a high level of noise in measurements of individual probe intensities and is relatively insensitive to the choice of method for normalizing probe intensity values and identifying probes that differ between samples. When applied to comparative genomic hybridization data from a published experiment, CGHScan identified eight of nine known deletions in a Brucella ovis strain as compared to Brucella melitensis. The same result was obtained using two different normalization methods and two different scores to classify data for individual probes as representing conserved or variable genomic regions. The undetected region is a small (58 base pair deletion that is below the resolution of CGHScan given the array design employed in the study

  10. Identifying genomic regions for fine-mapping using genome scan meta-analysis (GSMA) to identify the minimum regions of maximum significance (MRMS) across populations.

    Science.gov (United States)

    Cooper, Margaret E; Goldstein, Toby H; Maher, Brion S; Marazita, Mary L

    2005-12-30

    In order to detect linkage of the simulated complex disease Kofendrerd Personality Disorder across studies from multiple populations, we performed a genome scan meta-analysis (GSMA). Using the 7-cM microsatellite map, nonparametric multipoint linkage analyses were performed separately on each of the four simulated populations independently to determine p-values. The genome of each population was divided into 20-cM bin regions, and each bin was rank-ordered based on the most significant linkage p-value for that population in that region. The bin ranks were then averaged across all four studies to determine the most significant 20-cM regions over all studies. Statistical significance of the averaged bin ranks was determined from a normal distribution of randomly assigned rank averages. To narrow the region of interest for fine-mapping, the meta-analysis was repeated two additional times, with each of the 20-cM bins offset by 7 cM and 13 cM, respectively, creating regions of overlap with the original method. The 6-7 cM shared regions, where the highest averaged 20-cM bins from each of the three offsets overlap, designated the minimum region of maximum significance (MRMS). Application of the GSMA-MRMS method revealed genome wide significance (p-values refer to the average rank assigned to the bin) at regions including or adjacent to all of the simulated disease loci: chromosome 1 (p value value value < 0.05 for 7-14 cM, the region adjacent to D4). This GSMA analysis approach demonstrates the power of linkage meta-analysis to detect multiple genes simultaneously for a complex disorder. The MRMS method enhances this powerful tool to focus on more localized regions of linkage.

  11. A novel statistical method to estimate the effective SNP size in vertebrate genomes and categorized genomic regions

    Directory of Open Access Journals (Sweden)

    Zhao Zhongming

    2006-12-01

    Full Text Available Abstract Background The local environment of single nucleotide polymorphisms (SNPs contains abundant genetic information for the study of mechanisms of mutation, genome evolution, and causes of diseases. Recent studies revealed that neighboring-nucleotide biases on SNPs were strong and the genome-wide bias patterns could be represented by a small subset of the total SNPs. It remains unsolved for the estimation of the effective SNP size, the number of SNPs that are sufficient to represent the bias patterns observed from the whole SNP data. Results To estimate the effective SNP size, we developed a novel statistical method, SNPKS, which considers both the statistical and biological significances. SNPKS consists of two major steps: to obtain an initial effective size by the Kolmogorov-Smirnov test (KS test and to find an intermediate effective size by interval evaluation. The SNPKS algorithm was implemented in computer programs and applied to the real SNP data. The effective SNP size was estimated to be 38,200, 39,300, 38,000, and 38,700 in the human, chimpanzee, dog, and mouse genomes, respectively, and 39,100, 39,600, 39,200, and 42,200 in human intergenic, genic, intronic, and CpG island regions, respectively. Conclusion SNPKS is the first statistical method to estimate the effective SNP size. It runs efficiently and greatly outperforms the algorithm implemented in SNPNB. The application of SNPKS to the real SNP data revealed the similar small effective SNP size (38,000 – 42,200 in the human, chimpanzee, dog, and mouse genomes as well as in human genomic regions. The findings suggest strong influence of genetic factors across vertebrate genomes.

  12. LD-Spline: Mapping SNPs on genotyping platforms to genomic regions using patterns of linkage disequilibrium

    Directory of Open Access Journals (Sweden)

    Bush William S

    2009-12-01

    Full Text Available Abstract Background Gene-centric analysis tools for genome-wide association study data are being developed both to annotate single locus statistics and to prioritize or group single nucleotide polymorphisms (SNPs prior to analysis. These approaches require knowledge about the relationships between SNPs on a genotyping platform and genes in the human genome. SNPs in the genome can represent broader genomic regions via linkage disequilibrium (LD, and population-specific patterns of LD can be exploited to generate a data-driven map of SNPs to genes. Methods In this study, we implemented LD-Spline, a database routine that defines the genomic boundaries a particular SNP represents using linkage disequilibrium statistics from the International HapMap Project. We compared the LD-Spline haplotype block partitioning approach to that of the four gamete rule and the Gabriel et al. approach using simulated data; in addition, we processed two commonly used genome-wide association study platforms. Results We illustrate that LD-Spline performs comparably to the four-gamete rule and the Gabriel et al. approach; however as a SNP-centric approach LD-Spline has the added benefit of systematically identifying a genomic boundary for each SNP, where the global block partitioning approaches may falter due to sampling variation in LD statistics. Conclusion LD-Spline is an integrated database routine that quickly and effectively defines the genomic region marked by a SNP using linkage disequilibrium, with a SNP-centric block definition algorithm.

  13. Host susceptibility to periodontitis: mapping murine genomic regions.

    Science.gov (United States)

    Shusterman, A; Durrant, C; Mott, R; Polak, D; Schaefer, A; Weiss, E I; Iraqi, F A; Houri-Haddad, Y

    2013-05-01

    Host susceptibility to periodontal infection is controlled by genetic factors. As a step toward identifying and cloning these factors, we generated an A/J x BALB/cJ F2 mouse resource population. A genome-wide search for Quantitative Trait Loci (QTL) associated with periodontitis was performed. We aimed to quantify the phenotypic response of the progenies to periodontitis by microCT analysis, to perform a genome-wide search for QTL associated with periodontitis, and, finally, to suggest candidate genes for periodontitis. We were able to produce 408 F2 mice. All mice were co-infected with Porphyromonas gingivalis and Fusobacterium nucleatum bacteria. Six weeks following infection, alveolar bone loss was quantified by computerized tomography (microCT) technology. We found normal distribution of the phenotype, with 2 highly significant QTL on chromosomes 5 and 3. A third significant QTL was found on chromosome 1. Candidate genes were suggested, such as Toll-like receptors (TLR) 1 and 6, chemokines, and bone-remodeling genes (enamelin, ameloblastin, and amelotin). This report shows that periodontitis in mice is a polygenic trait with highly significant mapped QTL.

  14. Genomic shotgun array: a procedure linking large-scale DNA sequencing with regional transcript mapping.

    Science.gov (United States)

    Li, Ling-Hui; Li, Jian-Chiuan; Lin, Yung-Feng; Lin, Chung-Yen; Chen, Chung-Yung; Tsai, Shih-Feng

    2004-02-11

    To facilitate transcript mapping and to investigate alterations in genomic structure and gene expression in a defined genomic target, we developed a novel microarray-based method to detect transcriptional activity of the human chromosome 4q22-24 region. Loss of heterozygosity of human 4q22-24 is frequently observed in hepatocellular carcinoma (HCC). One hundred and eighteen well-characterized genes have been identified from this region. We took previously sequenced shotgun subclones as templates to amplify overlapping sequences for the genomic segment and constructed a chromosome-region-specific microarray. Using genomic DNA fragments as probes, we detected transcriptional activity from within this region among five different tissues. The hybridization results indicate that there are new transcripts that have not yet been identified by other methods. The existence of new transcripts encoded by genes in this region was confirmed by PCR cloning or cDNA library screening. The procedure reported here allows coupling of shotgun sequencing with transcript mapping and, potentially, detailed analysis of gene expression and chromosomal copy of the genomic sequence for the putative HCC tumor suppressor gene(s) in the 4q candidate region.

  15. CpG islands undermethylation in human genomic regions under selective pressure.

    Directory of Open Access Journals (Sweden)

    Sergio Cocozza

    Full Text Available DNA methylation at CpG islands (CGIs is one of the most intensively studied epigenetic mechanisms. It is fundamental for cellular differentiation and control of transcriptional potential. DNA methylation is involved also in several processes that are central to evolutionary biology, including phenotypic plasticity and evolvability. In this study, we explored the relationship between CpG islands methylation and signatures of selective pressure in Homo Sapiens, using a computational biology approach. By analyzing methylation data of 25 cell lines from the Encyclopedia of DNA Elements (ENCODE Consortium, we compared the DNA methylation of CpG islands in genomic regions under selective pressure with the methylation of CpG islands in the remaining part of the genome. To define genomic regions under selective pressure, we used three different methods, each oriented to provide distinct information about selective events. Independently of the method and of the cell type used, we found evidences of undermethylation of CGIs in human genomic regions under selective pressure. Additionally, by analyzing SNP frequency in CpG islands, we demonstrated that CpG islands in regions under selective pressure show lower genetic variation. Our findings suggest that the CpG islands in regions under selective pressure seem to be somehow more "protected" from methylation when compared with other regions of the genome.

  16. Attenuation of Sindbis virus neurovirulence by using defined mutations in nontranslated regions of the genome RNA

    NARCIS (Netherlands)

    Kuhn, R J; Griffin, D E; Zhang, H; Niesters, Hubert G. M.; Strauss, J H

    1992-01-01

    We examined a panel of Sindbis virus mutants containing defined mutations in the 5' nontranslated region of the genome RNA, in the 3' nontranslated region, or in both for their growth in cultured cells and virulence in newborn mice. In cultured cells, these viruses all had defects in RNA synthesis a

  17. ECRbase: Database of Evolutionary Conserved Regions, Promoters, and Transcription Factor Binding Sites in Vertebrate Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Loots, G; Ovcharenko, I

    2006-08-08

    Evolutionary conservation of DNA sequences provides a tool for the identification of functional elements in genomes. We have created a database of evolutionary conserved regions (ECRs) in vertebrate genomes entitled ECRbase that is constructed from a collection of pairwise vertebrate genome alignments produced by the ECR Browser database. ECRbase features a database of syntenic blocks that recapitulate the evolution of rearrangements in vertebrates and a collection of promoters in all vertebrate genomes presented in the database. The database also contains a collection of annotated transcription factor binding sites (TFBS) in all ECRs and promoter elements. ECRbase currently includes human, rhesus macaque, dog, opossum, rat, mouse, chicken, frog, zebrafish, and two pufferfish genomes. It is freely accessible at http://ECRbase.dcode.org.

  18. Structure-infectivity analysis of the human rhinovirus genomic RNA 3' non-coding region.

    OpenAIRE

    1996-01-01

    The specific recognition of genomic positive strand RNAS as templates for the synthesis of intermediate negative strands by the picornavirus replication machinery is presumably mediated by cis-acting sequences within the genomic RNA 3' non-coding region (NCR). A structure-infectivity analysis was conducted on the 44 nt human rhinovirus 14 (HRV14) 3' NCR to identify the primary sequence and/or secondary structure determinants required for viral replication. Using biochemical RNA secondary stru...

  19. Regions of homozygosity in the porcine genome: consequence of demography and the recombination landscape.

    Directory of Open Access Journals (Sweden)

    Mirte Bosse

    Full Text Available Inbreeding has long been recognized as a primary cause of fitness reduction in both wild and domesticated populations. Consanguineous matings cause inheritance of haplotypes that are identical by descent (IBD and result in homozygous stretches along the genome of the offspring. Size and position of regions of homozygosity (ROHs are expected to correlate with genomic features such as GC content and recombination rate, but also direction of selection. Thus, ROHs should be non-randomly distributed across the genome. Therefore, demographic history may not fully predict the effects of inbreeding. The porcine genome has a relatively heterogeneous distribution of recombination rate, making Sus scrofa an excellent model to study the influence of both recombination landscape and demography on genomic variation. This study utilizes next-generation sequencing data for the analysis of genomic ROH patterns, using a comparative sliding window approach. We present an in-depth study of genomic variation based on three different parameters: nucleotide diversity outside ROHs, the number of ROHs in the genome, and the average ROH size. We identified an abundance of ROHs in all genomes of multiple pigs from commercial breeds and wild populations from Eurasia. Size and number of ROHs are in agreement with known demography of the populations, with population bottlenecks highly increasing ROH occurrence. Nucleotide diversity outside ROHs is high in populations derived from a large ancient population, regardless of current population size. In addition, we show an unequal genomic ROH distribution, with strong correlations of ROH size and abundance with recombination rate and GC content. Global gene content does not correlate with ROH frequency, but some ROH hotspots do contain positive selected genes in commercial lines and wild populations. This study highlights the importance of the influence of demography and recombination on homozygosity in the genome to understand

  20. Transcription Restores DNA Repair to Heterochromatin, Determining Regional Mutation Rates in Cancer Genomes

    Directory of Open Access Journals (Sweden)

    Christina L. Zheng

    2014-11-01

    Full Text Available Somatic mutations in cancer are more frequent in heterochromatic and late-replicating regions of the genome. We report that regional disparities in mutation density are virtually abolished within transcriptionally silent genomic regions of cutaneous squamous cell carcinomas (cSCCs arising in an XPC−/− background. XPC−/− cells lack global genome nucleotide excision repair (GG-NER, thus establishing differential access of DNA repair machinery within chromatin-rich regions of the genome as the primary cause for the regional disparity. Strikingly, we find that increasing levels of transcription reduce mutation prevalence on both strands of gene bodies embedded within H3K9me3-dense regions, and only to those levels observed in H3K9me3-sparse regions, also in an XPC-dependent manner. Therefore, transcription appears to reduce mutation prevalence specifically by relieving the constraints imposed by chromatin structure on DNA repair. We model this relationship among transcription, chromatin state, and DNA repair, revealing a new, personalized determinant of cancer risk.

  1. DNA sequence comparative analysis of the 3pter-p26 region of human genome

    Institute of Scientific and Technical Information of China (English)

    LUO; Chunqing; LI; Yan; ZHANG; Xiaowei; ZHANG; Yilin; ZHAN

    2005-01-01

    Most proterminal regions of human chromosomes are GC-rich and gene-rich. Chromosome 3p is an exception. Its proterminal region is GC-poor, and likely to lose heterozygosity, thus causing a number of fatal diseases. Except one gap left in the telomeric position, the proterminal region of human chromosome 3p has been completely sequenced. The detailed sequence analysis showed: (i) the GC content of this region was 38.5%, being the lowest among all the human proterminal regions; (ii) this region contained 20 known genes and 22 predicted genes, with an average gene size of 97.5 kb. The previously mapped gene Cntn3 was not found in this region, but instead located in the 74 Mb position of human chromosome 3p; (iii) the interspersed repeats of this region were more active than the average level of the whole human genome, especially (TA)n, the content of which was twice the genome average; (iv) this region had a conserved synteny extending from 104.1 Mb to 112.4 Mb on the mouse chromosome 6, which was 8% larger in size, not in accordance with the whole genome comparison, probably because the 3pter-p26 region was more likely to lose neocleitides and its mouse synteny had more active interspersed repeats.

  2. De Novo Identification of Regulatory Regions in Intergenic Spaces of Prokaryotic Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Chain, P; Garcia, E; Mcloughlin, K; Ovcharenko, I

    2007-02-20

    This project was begun to implement, test, and experimentally validate the results of a novel algorithm for genome-wide identification of candidate transcription-factor binding sites in prokaryotes. Most techniques used to identify regulatory regions rely on conservation between different genomes or have a predetermined sequence motif(s) to perform a genome-wide search. Therefore, such techniques cannot be used with new genome sequences, where information regarding such motifs has not yet been discovered. This project aimed to apply a de novo search algorithm to identify candidate binding-site motifs in intergenic regions of prokaryotic organisms, initially testing the available genomes of the Yersinia genus. We retrofitted existing nucleotide pattern-matching algorithms, analyzed the candidate sites identified by these algorithms as well as their target genes to screen for meaningful patterns. Using properly annotated prokaryotic genomes, this project aimed to develop a set of procedures to identify candidate intergenic sites important for gene regulation. We planned to demonstrate this in Yersinia pestis, a model biodefense, Category A Select Agent pathogen, and then follow up with experimental evidence that these regions are indeed involved in regulation. The ability to quickly characterize transcription-factor binding sites will help lead to a better understanding of how known virulence pathways are modulated in biodefense-related organisms, and will help our understanding and exploration of regulons--gene regulatory networks--and novel pathways for metabolic processes in environmental microbes.

  3. Regional Regulation of Transcription in the Bovine Genome

    NARCIS (Netherlands)

    Kommadath, A.; Nie, H.; Groenen, M.A.M.; Pas, te M.F.W.; Veerkamp, R.F.; Smits, M.A.

    2011-01-01

    Eukaryotic genes are distributed along chromosomes as clusters of highly expressed genes termed RIDGEs (Regions of IncreaseD Gene Expression) and lowly expressed genes termed anti-RIDGEs, interspersed among genes expressed at intermediate levels or not expressed. Previous studies based on this obser

  4. An improved method for detecting and delineating genomic regions with altered gene expression in cancer

    OpenAIRE

    2008-01-01

    Genomic regions with altered gene expression are a characteristic feature of cancer cells. We present a novel method for identifying such regions in gene expression maps. This method is based on total variation minimization, a classical signal restoration technique. In systematic evaluations, we show that our method combines top-notch detection performance with an ability to delineate relevant regions without excessive over-segmentation, making it a significant advance over existing methods. ...

  5. Identification of genomic regions associated with phenotypic variation between dog breeds using selection mapping

    DEFF Research Database (Denmark)

    Vaysse, Amaury; Ratnakumar, Abhirami; Derrien, Thomas;

    2011-01-01

    across the genome in dog breeds are the result of both selection and genetic drift, but extended blocks of homozygosity on a megabase scale appear to be best explained by selection. Further elucidation of the variants under selection will help to uncover the genetic basis of complex traits and disease....... breeds using a newly developed high-density genotyping array consisting of >170,000 evenly spaced SNPs. We first identify 44 genomic regions exhibiting extreme differentiation across multiple breeds. Genetic variation in these regions correlates with variation in several phenotypic traits that vary...... to provide a list of variants that may directly affect these traits. This study provides a catalogue of genomic regions showing extreme reduction in genetic variation or population differentiation in dogs, including many linked to phenotypic variation. The many blocks of reduced haplotype diversity observed...

  6. Specific amplification by PCR of rearranged genomic variable regions of immunoglobulin genes from mouse hybridoma cells.

    Science.gov (United States)

    Berdoz, J; Monath, T P; Kraehenbuhl, J P

    1995-04-01

    We have designed a novel strategy for the isolation of the rearranged genomic fragments encoding the L-VH-D-JH and L-V kappa/lambda-J kappa/lambda regions of mouse immunoglobulin genes. This strategy is based on the PCR amplification of genomic DNA from mouse hybridomas using multiple specific primers chosen in the 5'-untranslated region and in the intron downstream of the rearranged JH/J kappa/lambda sequences. Variable regions with intact coding sequences, including full-length leader peptides (L) can be obtained without previous DNA sequencing. Our strategy is based on a genomic template that produces fragments that do not need to be adapted for recombinant antibody expression, thus facilitating the generation of chimeric and isotype-switched immunoglobulins.

  7. Detection of genomic variation by selection of a 9 mb DNA region and high throughput sequencing.

    Directory of Open Access Journals (Sweden)

    Sergey I Nikolaev

    Full Text Available Detection of the rare polymorphisms and causative mutations of genetic diseases in a targeted genomic area has become a major goal in order to understand genomic and phenotypic variability. We have interrogated repeat-masked regions of 8.9 Mb on human chromosomes 21 (7.8 Mb and 7 (1.1 Mb from an individual from the International HapMap Project (NA12872. We have optimized a method of genomic selection for high throughput sequencing. Microarray-based selection and sequencing resulted in 260-fold enrichment, with 41% of reads mapping to the target region. 83% of SNPs in the targeted region had at least 4-fold sequence coverage and 54% at least 15-fold. When assaying HapMap SNPs in NA12872, our sequence genotypes are 91.3% concordant in regions with coverage > or = 4-fold, and 97.9% concordant in regions with coverage > or = 15-fold. About 81% of the SNPs recovered with both thresholds are listed in dbSNP. We observed that regions with low sequence coverage occur in close proximity to low-complexity DNA. Validation experiments using Sanger sequencing were performed for 46 SNPs with 15-20 fold coverage, with a confirmation rate of 96%, suggesting that DNA selection provides an accurate and cost-effective method for identifying rare genomic variants.

  8. Genomic variation in the porcine immunoglobulin lambda variable region.

    Science.gov (United States)

    Guo, Xi; Schwartz, John C; Murtaugh, Michael P

    2016-04-01

    Production of a vast antibody repertoire is essential for the protection against pathogens. Variable region germline complexity contributes to repertoire diversity and is a standard feature of mammalian immunoglobulin loci, but functional V region genes are limited in swine. For example, the porcine lambda light chain locus is composed of 23 variable (V) genes and 4 joining (J) genes, but only 10 or 11 V and 2 J genes are functional. Allelic variation in V and J may increase overall diversity within a population, yet lead to repertoire holes in individuals lacking key alleles. Previous studies focused on heavy chain genetic variation, thus light chain allelic diversity is not known. We characterized allelic variation of the porcine immunoglobulin lambda variable (IGLV) region genes. All intact IGLV genes in 81 pigs were amplified, sequenced, and analyzed to determine their allelic variation and functionality. We observed mutational variation across the entire length of the IGLV genes, in both framework and complementarity determining regions (CDRs). Three recombination hotspot motifs were also identified suggesting that non-allelic homologous recombination is an evolutionarily alternative mechanism for generating germline antibody diversity. Functional alleles were greatest in the most highly expressed families, IGLV3 and IGLV8. At the population level, allelic variation appears to help maintain the potential for broad antibody repertoire diversity in spite of reduced gene segment choices and limited germline sequence modification. The trade-off may be a reduction in repertoire diversity within individuals that could result in an increased variation in immunity to infectious disease and response to vaccination.

  9. Definition of Soybean Genomic Regions That Control Seed Phytoestrogen Amounts

    Directory of Open Access Journals (Sweden)

    Kassem My A.

    2004-01-01

    Full Text Available Soybean seeds contain large amounts of isoflavones or phytoestrogens such as genistein, daidzein, and glycitein that display biological effects when ingested by humans and animals. In seeds, the total amount, and amount of each type, of isoflavone varies by 5 fold between cultivars and locations. Isoflavone content and quality are one key to the biological effects of soy foods, dietary supplements, and nutraceuticals. Previously we had identified 6 loci (QTL controlling isoflavone content using 150 DNA markers. This study aimed to identify and delimit loci underlying heritable variation in isoflavone content with additional DNA markers. We used a recombinant inbred line (RIL population ( n=100 derived from the cross of “Essex” by “Forrest,” two cultivars that contrast for isoflavone content. Seed isoflavone content of each RIL was determined by HPLC and compared against 240 polymorphic microsatellite markers by one-way analysis of variance. Two QTL that underlie seed isoflavone content were newly discovered. The additional markers confirmed and refined the positions of the six QTL already reported. The first new region anchored by the marker BARC-Satt063 was significantly associated with genistein ( P=0.009 , R 2 =29.5% and daidzein ( P=0.007 , R 2 =17.0% . The region is located on linkage group B2 and derived the beneficial allele from Essex. The second new region defined by the marker BARC-Satt129 was significantly associated with total glycitein ( P=0.0005 , R 2 =32.0% . The region is located on linkage group D1a+Q and also derived the beneficial allele from Essex. Jointly the eight loci can explain the heritable variation in isoflavone content. The loci may be used to stabilize seed isoflavone content by selection and to isolate the underlying genes.

  10. Acute hepatitis C in a chronically HIV-infected patient: Evolution of different viral genomic regions

    Institute of Scientific and Technical Information of China (English)

    Diego Flichman; Veronica Kott; Silvia Sookoian; Rodolfo Campos

    2003-01-01

    AIM: To analyze the molecular evolution of different viral genomic regions of HCV in an acute HCV infected patient chronically infected with HIV through a 42-month follow-up.METHODS: Serum samples of a chronically HIV infected patient that seroconverted to anti HCV antibodies were sequenced, from the event of superinfection through a period of 17 months and in a late sample (42nd month). Hypervariable genomic regions of HIV (V3 loop of the gp120) and HCV (HVR-1 on the E2 glycoprotein gene) were studied. In order to analyze genomic regions involved in different biological functions and with the cellular immune response, HCV core and NS5A were also chosen to be sequenced. Amplification of the different regions was done by RT-PCR and directly sequenced. Confirmation of sequences was done on reamplified material. Nucleotide sequences of the different time points were aligned with CLUSTAL W 1.5, and the corresponding amino acid ones were deduced.RESULTS: Hypervariable genomic regions of both viruses (HVR1 and gp120 V3 loop) presented several nonsynonymous changes but, while in the gp120 V3 loop mutations were detected in the sample obtained right after HCV superinfection and maintained throughout, they occurred following a sequential and cumulative pattern in the HVR1. In the NS5A region of HCV, two amino acid changes were detected during the follow-up period, whereas the core region presented several amino acid replacements, once the HCV chronic infection had been established.CONCLUSION: During the HIV-HCV superinfection, each genomic region analyzed shows a different evolutionary pattem.Most of the nucleotide substitutions observed are nonsynonymous and clustered in previously described epitopes,thus suggesting an immune-driven evolutionary process.

  11. Genetics/Genomics Research in the Central Region

    Science.gov (United States)

    ,

    2006-01-01

    Genetics-based research within the Biological Resources Discipline (BRD) Science Centers in the Central Region incorporates many aspects of the field of genetics. Research activities range from documenting patterns of genetic variation in order to investigate relationships among species, populations and individuals to investigating the structure, function and expression of genes and their response to environmental stressors. Research in the broad areas of genetics requires multidisciplinary expertise and specialized equipment and instrumentation. Brief summaries of the capabilities of the five BRD Centers are given below.

  12. Identification of the most informative regions of the mitochondrial genome for phylogenetic and coalescent analyses.

    Science.gov (United States)

    Non, A L; Kitchen, A; Mulligan, C J

    2007-09-01

    Analysis of complete mitochondrial genome sequences is becoming increasingly common in genetic studies. The availability of full genome datasets enables an analysis of the information content distributed throughout the mitochondrial genome in order to optimize the research design of future evolutionary studies. The goal of our study was to identify informative regions of the human mitochondrial genome using two criteria: (1) accurate reconstruction of a phylogeny and (2) consistent estimates of time to most recent common ancestor (TMRCA). We created two series of datasets by deleting individual genes of varied length and by deleting 10 equal-size fragments throughout the coding region. Phylogenies were statistically compared to the full-coding-region tree, while coalescent methods were used to estimate the TMRCA and associated credible intervals. Individual fragments important for maintaining a phylogeny similar to the full-coding-region tree encompassed bp 577-2122 and 11,399-16,023, including all or part of 12S rRNA, 16S rRNA, ND4, ND5, ND6, and cytb. The control region only tree was the most poorly resolved with the majority of the tree manifest as an unresolved polytomy. Coalescent estimates of TMRCA were less sensitive to removal of any particular fragment(s) than reconstruction of a consistent phylogeny. Overall, we discovered that half the genome, i.e., bp 3669-11,398, could be removed with no significant change in the phylogeny (p(AU)=0.077) while still maintaining overlap of TMRCA 95% credible intervals. Thus, sequencing a contiguous fragment from bp 11,399 through the control region to bp 3668 would create a dataset that optimizes the information necessary for phylogenetic and coalescent analyses and also takes advantage of the wealth of data already available on the control region.

  13. Genome-wide analysis of regions similar to promoters of histone genes

    KAUST Repository

    Chowdhary, Rajesh

    2010-05-28

    Background: The purpose of this study is to: i) develop a computational model of promoters of human histone-encoding genes (shortly histone genes), an important class of genes that participate in various critical cellular processes, ii) use the model so developed to identify regions across the human genome that have similar structure as promoters of histone genes; such regions could represent potential genomic regulatory regions, e.g. promoters, of genes that may be coregulated with histone genes, and iii/ identify in this way genes that have high likelihood of being coregulated with the histone genes.Results: We successfully developed a histone promoter model using a comprehensive collection of histone genes. Based on leave-one-out cross-validation test, the model produced good prediction accuracy (94.1% sensitivity, 92.6% specificity, and 92.8% positive predictive value). We used this model to predict across the genome a number of genes that shared similar promoter structures with the histone gene promoters. We thus hypothesize that these predicted genes could be coregulated with histone genes. This hypothesis matches well with the available gene expression, gene ontology, and pathways data. Jointly with promoters of the above-mentioned genes, we found a large number of intergenic regions with similar structure as histone promoters.Conclusions: This study represents one of the most comprehensive computational analyses conducted thus far on a genome-wide scale of promoters of human histone genes. Our analysis suggests a number of other human genes that share a high similarity of promoter structure with the histone genes and thus are highly likely to be coregulated, and consequently coexpressed, with the histone genes. We also found that there are a large number of intergenic regions across the genome with their structures similar to promoters of histone genes. These regions may be promoters of yet unidentified genes, or may represent remote control regions that

  14. Mutational signatures of de-differentiation in functional non-coding regions of melanoma genomes.

    Directory of Open Access Journals (Sweden)

    Stephen C J Parker

    Full Text Available Much emphasis has been placed on the identification, functional characterization, and therapeutic potential of somatic variants in tumor genomes. However, the majority of somatic variants lie outside coding regions and their role in cancer progression remains to be determined. In order to establish a system to test the functional importance of non-coding somatic variants in cancer, we created a low-passage cell culture of a metastatic melanoma tumor sample. As a foundation for interpreting functional assays, we performed whole-genome sequencing and analysis of this cell culture, the metastatic tumor from which it was derived, and the patient-matched normal genomes. When comparing somatic mutations identified in the cell culture and tissue genomes, we observe concordance at the majority of single nucleotide variants, whereas copy number changes are more variable. To understand the functional impact of non-coding somatic variation, we leveraged functional data generated by the ENCODE Project Consortium. We analyzed regulatory regions derived from multiple different cell types and found that melanocyte-specific regions are among the most depleted for somatic mutation accumulation. Significant depletion in other cell types suggests the metastatic melanoma cells de-differentiated to a more basal regulatory state. Experimental identification of genome-wide regulatory sites in two different melanoma samples supports this observation. Together, these results show that mutation accumulation in metastatic melanoma is nonrandom across the genome and that a de-differentiated regulatory architecture is common among different samples. Our findings enable identification of the underlying genetic components of melanoma and define the differences between a tissue-derived tumor sample and the cell culture created from it. Such information helps establish a broader mechanistic understanding of the linkage between non-coding genomic variations and the cellular

  15. Mutational Signatures of De-Differentiation in Functional Non-Coding Regions of Melanoma Genomes

    Science.gov (United States)

    Parker, Stephen C. J.; Gartner, Jared; Cardenas-Navia, Isabel; Wei, Xiaomu; Ozel Abaan, Hatice; Ajay, Subramanian S.; Hansen, Nancy F.; Song, Lingyun; Bhanot, Umesh K.; Killian, J. Keith; Gindin, Yevgeniy; Walker, Robert L.; Meltzer, Paul S.; Mullikin, James C.; Furey, Terrence S.; Crawford, Gregory E.; Rosenberg, Steven A.; Samuels, Yardena; Margulies, Elliott H.

    2012-01-01

    Much emphasis has been placed on the identification, functional characterization, and therapeutic potential of somatic variants in tumor genomes. However, the majority of somatic variants lie outside coding regions and their role in cancer progression remains to be determined. In order to establish a system to test the functional importance of non-coding somatic variants in cancer, we created a low-passage cell culture of a metastatic melanoma tumor sample. As a foundation for interpreting functional assays, we performed whole-genome sequencing and analysis of this cell culture, the metastatic tumor from which it was derived, and the patient-matched normal genomes. When comparing somatic mutations identified in the cell culture and tissue genomes, we observe concordance at the majority of single nucleotide variants, whereas copy number changes are more variable. To understand the functional impact of non-coding somatic variation, we leveraged functional data generated by the ENCODE Project Consortium. We analyzed regulatory regions derived from multiple different cell types and found that melanocyte-specific regions are among the most depleted for somatic mutation accumulation. Significant depletion in other cell types suggests the metastatic melanoma cells de-differentiated to a more basal regulatory state. Experimental identification of genome-wide regulatory sites in two different melanoma samples supports this observation. Together, these results show that mutation accumulation in metastatic melanoma is nonrandom across the genome and that a de-differentiated regulatory architecture is common among different samples. Our findings enable identification of the underlying genetic components of melanoma and define the differences between a tissue-derived tumor sample and the cell culture created from it. Such information helps establish a broader mechanistic understanding of the linkage between non-coding genomic variations and the cellular evolution of cancer

  16. Regulation of sex determination in mice by a non-coding genomic region.

    Science.gov (United States)

    Arboleda, Valerie A; Fleming, Alice; Barseghyan, Hayk; Délot, Emmanuèle; Sinsheimer, Janet S; Vilain, Eric

    2014-07-01

    To identify novel genomic regions that regulate sex determination, we utilized the powerful C57BL/6J-Y(POS) (B6-Y(POS)) model of XY sex reversal where mice with autosomes from the B6 strain and a Y chromosome from a wild-derived strain, Mus domesticus poschiavinus (Y(POS)), show complete sex reversal. In B6-Y(POS), the presence of a 55-Mb congenic region on chromosome 11 protects from sex reversal in a dose-dependent manner. Using mouse genetic backcross designs and high-density SNP arrays, we narrowed the congenic region to a 1.62-Mb genomic region on chromosome 11 that confers 80% protection from B6-Y(POS) sex reversal when one copy is present and complete protection when two copies are present. It was previously believed that the protective congenic region originated from the 129S1/SviMJ (129) strain. However, genomic analysis revealed that this region is not derived from 129 and most likely is derived from the semi-inbred strain POSA. We show that the small 1.62-Mb congenic region that protects against B6-Y(POS) sex reversal is located within the Sox9 promoter and promotes the expression of Sox9, thereby driving testis development within the B6-Y(POS) background. Through 30 years of backcrossing, this congenic region was maintained, as it promoted male sex determination and fertility despite the female-promoting B6-Y(POS) genetic background. Our findings demonstrate that long-range enhancer regions are critical to developmental processes and can be used to identify the complex interplay between genome variants, epigenetics, and developmental gene regulation.

  17. Thousands of corresponding human and mouse genomic regions unalignable in primary sequence contain common RNA structure

    DEFF Research Database (Denmark)

    Torarinsson, Elfar; Sawera, Milena; Havgaard, Jakob Hull;

    2006-01-01

    Human and mouse genome sequences contain roughly 100,000 regions that are unalignable in primary sequence and neighbor corresponding alignable regions between both organisms. These pairs are generally assumed to be nonconserved, although the level of structural conservation between these has never...... been investigated. Owing to the limitations in computational methods, comparative genomics has been lacking the ability to compare such nonconserved sequence regions for conserved structural RNA elements. We have investigated the presence of structural RNA elements by conducting a local structural...... overlapped by transfrags than regions that are not overlapped by transfrags. To verify the coexpression between predicted candidates in human and mouse, we conducted expression studies by RT-PCR and Northern blotting on mouse candidates, which overlap with transfrags on human chromosome 20. RT-PCR results...

  18. Detecting genomic regions associated with a disease using variability functions and Adjusted Rand Index

    Directory of Open Access Journals (Sweden)

    Makarenkov Vladimir

    2011-10-01

    Full Text Available Abstract Background The identification of functional regions contained in a given multiple sequence alignment constitutes one of the major challenges of comparative genomics. Several studies have focused on the identification of conserved regions and motifs. However, most of existing methods ignore the relationship between the functional genomic regions and the external evidence associated with the considered group of species (e.g., carcinogenicity of Human Papilloma Virus. In the past, we have proposed a method that takes into account the prior knowledge on an external evidence (e.g., carcinogenicity or invasivity of the considered organisms and identifies genomic regions related to a specific disease. Results and conclusion We present a new algorithm for detecting genomic regions that may be associated with a disease. Two new variability functions and a bipartition optimization procedure are described. We validate and weigh our results using the Adjusted Rand Index (ARI, and thus assess to what extent the selected regions are related to carcinogenicity, invasivity, or any other species classification, given as input. The predictive power of different hit region detection functions was assessed on synthetic and real data. Our simulation results suggest that there is no a single function that provides the best results in all practical situations (e.g., monophyletic or polyphyletic evolution, and positive or negative selection, and that at least three different functions might be useful. The proposed hit region identification functions that do not benefit from the prior knowledge (i.e., carcinogenicity or invasivity of the involved organisms can provide equivalent results than the existing functions that take advantage of such a prior knowledge. Using the new algorithm, we examined the Neisseria meningitidis FrpB gene product for invasivity and immunologic activity, and human papilloma virus (HPV E6 oncoprotein for carcinogenicity, and confirmed

  19. Goldilocks: a tool for identifying genomic regions that are ‘just right’

    OpenAIRE

    Nicholls, Samuel M.; Clare, Amanda; Randall, Joshua C.

    2016-01-01

    Summary: We present Goldilocks: a Python package providing functionality for collecting summary statistics, identifying shifts in variation, discovering outlier regions and locating and extracting interesting regions from one or more arbitrary genomes for further analysis, for a user-provided definition of interesting. Availability and implementation: Goldilocks is freely available open-source software distributed under the MIT licence. Source code is hosted publicly at https://github.com/Sam...

  20. Genomic Characterization and Comparison of Multi-Regional and Pooled Tumor Biopsy Specimens.

    Science.gov (United States)

    Joung, Je-Gun; Bae, Joon Seol; Kim, Sang Cheol; Jung, HyunChul; Park, Woong-Yang; Song, Sang-Yong

    2016-01-01

    A single tumor biopsy specimen is typically used in cancer genome studies. However, it may represent incompletely the underlying mutational and transcriptional profiles of tumor biology. Multi-regional biopsies have the advantage of increased sensitivity for genomic profiling, but they are not cost-effective. The concept of an alternative method such as the pooling of multiple biopsies is a challenge. In order to determine if the pooling of distinct regions is representative at the genomic and transcriptome level, we performed sequencing of four regional samples and pooled samples for four cancer types including colon, stomach, kidney and liver cancer. Subsequently, a comparative analysis was conducted to explore differences in mutations and gene expression profiles between multiple regional biopsies and pooled biopsy for each tumor. Our analysis revealed a marginal level of regional difference in detected variants, but in those with low allele frequency, considerable discrepancies were observed. In conclusion, sequencing pooled samples has the benefit of detecting many variants with moderate allele frequency that occur in partial regions, but it is not applicable for detecting low-frequency mutations that require deep sequencing.

  1. Genic regions of a large salamander genome contain long introns and novel genes

    Directory of Open Access Journals (Sweden)

    Bryant Susan V

    2009-01-01

    Full Text Available Abstract Background The basis of genome size variation remains an outstanding question because DNA sequence data are lacking for organisms with large genomes. Sixteen BAC clones from the Mexican axolotl (Ambystoma mexicanum: c-value = 32 × 109 bp were isolated and sequenced to characterize the structure of genic regions. Results Annotation of genes within BACs showed that axolotl introns are on average 10× longer than orthologous vertebrate introns and they are predicted to contain more functional elements, including miRNAs and snoRNAs. Loci were discovered within BACs for two novel EST transcripts that are differentially expressed during spinal cord regeneration and skin metamorphosis. Unexpectedly, a third novel gene was also discovered while manually annotating BACs. Analysis of human-axolotl protein-coding sequences suggests there are 2% more lineage specific genes in the axolotl genome than the human genome, but the great majority (86% of genes between axolotl and human are predicted to be 1:1 orthologs. Considering that axolotl genes are on average 5× larger than human genes, the genic component of the salamander genome is estimated to be incredibly large, approximately 2.8 gigabases! Conclusion This study shows that a large salamander genome has a correspondingly large genic component, primarily because genes have incredibly long introns. These intronic sequences may harbor novel coding and non-coding sequences that regulate biological processes that are unique to salamanders.

  2. Identification and analysis of genomic regions with large between-population differentiation in humans.

    Science.gov (United States)

    Myles, S; Tang, K; Somel, M; Green, R E; Kelso, J; Stoneking, M

    2008-01-01

    The primary aim of genetic association and linkage studies is to identify genetic variants that contribute to phenotypic variation within human populations. Since the overwhelming majority of human genetic variation is found within populations, these methods are expected to be effective and can likely be extrapolated from one human population to another. However, they may lack power in detecting the genetic variants that contribute to phenotypes that differ greatly between human populations. Phenotypes that show large differences between populations are expected to be associated with genomic regions exhibiting large allele frequency differences between populations. Thus, from genome-wide polymorphism data genomic regions with large allele frequency differences between populations can be identified, and evaluated as candidates for large between-population phenotypic differences. Here we use allele frequency data from approximately 1.5 million SNPs from three human populations, and present an algorithm that identifies genomic regions containing SNPs with extreme Fst. We demonstrate that our candidate regions have reduced heterozygosity in Europeans and Chinese relative to African-Americans, and are likely enriched with genes that have experienced positive natural selection. We identify genes that are likely responsible for phenotypes known to differ dramatically between human populations and present several candidates worthy of future investigation. Our list of high Fst genomic regions is a first step in identifying the genetic variants that contribute to large phenotypic differences between populations, many of which have likely experienced positive natural selection. Our approach based on between population differences can compliment traditional within population linkage and association studies to uncover novel genotype-phenotype relationships.

  3. Full-genome sequences of hepatitis B virus subgenotype D3 isolates from the Brazilian Amazon Region

    OpenAIRE

    Natália Spitz; Francisco CA Mello; Natalia Motta Araujo

    2015-01-01

    The Brazilian Amazon Region is a highly endemic area for hepatitis B virus (HBV). However, little is known regarding the genetic variability of the strains circulating in this geographical region. Here, we describe the first full-length genomes of HBV isolated in the Brazilian Amazon Region; these genomes are also the first complete HBV subgenotype D3 genomes reported for Brazil. The genomes of the five Brazilian isolates were all 3,182 base pairs in length and the isolates were classified as...

  4. Identification of genomic regions associated with phenotypic variation between dog breeds using selection mapping.

    Directory of Open Access Journals (Sweden)

    Amaury Vaysse

    2011-10-01

    Full Text Available The extraordinary phenotypic diversity of dog breeds has been sculpted by a unique population history accompanied by selection for novel and desirable traits. Here we perform a comprehensive analysis using multiple test statistics to identify regions under selection in 509 dogs from 46 diverse breeds using a newly developed high-density genotyping array consisting of >170,000 evenly spaced SNPs. We first identify 44 genomic regions exhibiting extreme differentiation across multiple breeds. Genetic variation in these regions correlates with variation in several phenotypic traits that vary between breeds, and we identify novel associations with both morphological and behavioral traits. We next scan the genome for signatures of selective sweeps in single breeds, characterized by long regions of reduced heterozygosity and fixation of extended haplotypes. These scans identify hundreds of regions, including 22 blocks of homozygosity longer than one megabase in certain breeds. Candidate selection loci are strongly enriched for developmental genes. We chose one highly differentiated region, associated with body size and ear morphology, and characterized it using high-throughput sequencing to provide a list of variants that may directly affect these traits. This study provides a catalogue of genomic regions showing extreme reduction in genetic variation or population differentiation in dogs, including many linked to phenotypic variation. The many blocks of reduced haplotype diversity observed across the genome in dog breeds are the result of both selection and genetic drift, but extended blocks of homozygosity on a megabase scale appear to be best explained by selection. Further elucidation of the variants under selection will help to uncover the genetic basis of complex traits and disease.

  5. Evaluation of Apis mellifera syriaca Levant region honeybee conservation using comparative genome hybridization.

    Science.gov (United States)

    Haddad, Nizar Jamal; Batainh, Ahmed; Saini, Deepti; Migdadi, Osama; Aiyaz, Mohamed; Manchiganti, Rushiraj; Krishnamurthy, Venkatesh; Al-Shagour, Banan; Brake, Mohammad; Bourgeois, Lelania; De Guzman, Lilia; Rinderer, Thomas; Hamouri, Zayed Mahoud

    2016-06-01

    Apis mellifera syriaca is the native honeybee subspecies of Jordan and much of the Levant region. It expresses behavioral adaptations to a regional climate with very high temperatures, nectar dearth in summer, attacks of the Oriental wasp and is resistant to Varroa mites. The A. m. syriaca control reference sample (CRS) in this study was originally collected and stored since 2001 from "Wadi Ben Hammad", a remote valley in the southern region of Jordan. Morphometric and mitochondrial DNA markers of these honeybees had shown highest similarity to reference A. m. syriaca samples collected in 1952 by Brother Adam of samples collected from the Middle East. Samples 1-5 were collected from the National Center for Agricultural Research and Extension breeding apiary which was established for the conservation of A. m. syriaca. Our objective was to determine the success of an A. m. syriaca honey bee conservation program using genomic information from an array-based comparative genomic hybridization platform to evaluate genetic similarities to a historic reference collection (CRS). Our results had shown insignificant genomic differences between the current population in the conservation program and the CRS indicated that program is successfully conserving A. m. syriaca. Functional genomic variations were identified which are useful for conservation monitoring and may be useful for breeding programs designed to improve locally adapted strains of A. m. syriaca.

  6. Divergence is focused on few genomic regions early in speciation: incipient speciation of sunflower ecotypes.

    Science.gov (United States)

    Andrew, Rose L; Rieseberg, Loren H

    2013-09-01

    Early in speciation, as populations undergo the transition from local adaptation to incipient species, is when a number of transient, but potentially important, processes appear to be most easily detected. These include signatures of selective sweeps that can point to asymmetry in selection between habitats, divergence hitchhiking, and associations of adaptive genes with environments. In a genomic comparison of ecotypes of the prairie sunflower, Helianthus petiolaris, occurring at Great Sand Dunes National Park and Preserve (Colorado), we found that selective sweeps were mainly restricted to the dune ecotype and that there was variation across the genome in whether proximity to the nondune population constrained or promoted divergence. The major regions of divergence were few and large between ecotypes, in contrast with an interspecific comparison between H. petiolaris and a sympatric congener, Helianthus annuus. In general, the large regions of divergence observed in the ecotypic comparison swamped locus-specific associations with environmental variables. In both comparisons, regions of high divergence occurred in portions of the genetic map with high marker density, probably reflecting regions of low recombination. The difference in genomic distributions of highly divergent regions between ecotypic and interspecific comparisons highlights the value of studies spanning the spectrum of speciation in related taxa.

  7. Sequence based polymorphic (SBP marker technology for targeted genomic regions: its application in generating a molecular map of the Arabidopsis thaliana genome

    Directory of Open Access Journals (Sweden)

    Sahu Binod B

    2012-01-01

    Full Text Available Abstract Background Molecular markers facilitate both genotype identification, essential for modern animal and plant breeding, and the isolation of genes based on their map positions. Advancements in sequencing technology have made possible the identification of single nucleotide polymorphisms (SNPs for any genomic regions. Here a sequence based polymorphic (SBP marker technology for generating molecular markers for targeted genomic regions in Arabidopsis is described. Results A ~3X genome coverage sequence of the Arabidopsis thaliana ecotype, Niederzenz (Nd-0 was obtained by applying Illumina's sequencing by synthesis (Solexa technology. Comparison of the Nd-0 genome sequence with the assembled Columbia-0 (Col-0 genome sequence identified putative single nucleotide polymorphisms (SNPs throughout the entire genome. Multiple 75 base pair Nd-0 sequence reads containing SNPs and originating from individual genomic DNA molecules were the basis for developing co-dominant SBP markers. SNPs containing Col-0 sequences, supported by transcript sequences or sequences from multiple BAC clones, were compared to the respective Nd-0 sequences to identify possible restriction endonuclease enzyme site variations. Small amplicons, PCR amplified from both ecotypes, were digested with suitable restriction enzymes and resolved on a gel to reveal the sequence based polymorphisms. By applying this technology, 21 SBP markers for the marker poor regions of the Arabidopsis map representing polymorphisms between Col-0 and Nd-0 ecotypes were generated. Conclusions The SBP marker technology described here allowed the development of molecular markers for targeted genomic regions of Arabidopsis. It should facilitate isolation of co-dominant molecular markers for targeted genomic regions of any animal or plant species, whose genomic sequences have been assembled. This technology will particularly facilitate the development of high density molecular marker maps, essential for

  8. Structured RNAs in the ENCODE selected regions of the human genome

    DEFF Research Database (Denmark)

    Washietl, Stefan; Pedersen, Jakob Skou; Korbel, Jan O;

    2007-01-01

    several thousand candidate structures (corresponding to approximately 2.7% of the ENCODE regions). EvoFold has its highest sensitivity in highly conserved and relatively AU-rich regions, while RNAz favors slightly GC-rich regions, resulting in a relatively small overlap between methods. Comparison...... with the GENCODE annotation points to functional RNAs in all genomic contexts, with a slightly increased density in 3'-UTRs. While we estimate a significant false discovery rate of approximately 50%-70% many of the predictions can be further substantiated by additional criteria: 248 loci are predicted by both RNAz...

  9. PCR primers for 30 novel gene regions in the nuclear genomes of Lepidoptera

    OpenAIRE

    Wahlberg, Niklas; Peña, Carlos; Ahola, Milla; Wheat, Christopher W.; Rota, Jadranka

    2016-01-01

    Abstract We report primer pairs for 30 new gene regions in the nuclear genomes of Lepidoptera that can be amplified using a standard PCR protocol. The new primers were tested across diverse Lepidoptera , including nonditrysians and a wide selection of ditrysians. These new gene regions give a total of 11,043 bp of DNA sequence data and they show similar variability to traditionally used nuclear gene regions in studies of Lepidoptera . We feel that a PCR-based approach still has its place in m...

  10. Tumorigenic poxviruses: genomic organization and DNA sequence of the telomeric region of the Shope fibroma virus genome.

    Science.gov (United States)

    Upton, C; DeLange, A M; McFadden, G

    1987-09-01

    Shope fibroma virus (SFV), a tumorigenic poxvirus, has a 160-kb linear double-stranded DNA genome and possesses terminal inverted repeats (TIRs) of 12.4 kb. The DNA sequence of the terminal 5.5 kb of the viral genome is presented and together with previously published sequences completes the entire sequence of the SFV TIR. The terminal 400-bp region contains no major open reading frames (ORFs) but does possess five related imperfect palindromes. The remaining 5.1 kb of the sequence contains seven tightly clustered and tandemly oriented ORFs, four larger than 100 amino acids in length (T1, T2, T4, and T5) and three smaller ORFs (T3A, T3B, and T3C). All are transcribed toward the viral hairpin and almost all possess the consensus sequence TTTTTNT near their 3' ends which has been implicated for the transcription termination of vaccinia virus early genes. Searches of the published DNA database revealed no sequences with significant homology with this region of the SFV genome but when the protein database was searched with the translation products of ORFs T1-T5 it was found that the N-terminus of the putative T4 polypeptide is closely related to the signal sequence of the hemagglutinin precursor from influenza A virus, suggesting that the T4 polypeptide may be secreted from SFV-infected cells. Examination of other SFV ORFs shows that T1 and T2 also possess signal-like hydrophobic amino acid stretches close to their N-termini. The protein database search also revealed that the putative T2 protein has significant homology to the insulin family of polypeptides. In terms of sequence repetitions, seven tandemly repeated copies of the hexanucleotide ATTGTT and three flanking regions of dyad symmetry were detected, all in ORF T3C. A search for palindromic sequences also revealed two clusters, one in ORF T3A/B and a second in ORF T2. ORF T2 harbors five short sequence domains, each of which consists of a 6-bp short palindrome and a 10- to 18-bp larger palindrome. The

  11. The Evolution of Orphan Regions in Genomes of a Fungal Pathogen of Wheat

    Directory of Open Access Journals (Sweden)

    Clémence Plissonneau

    2016-10-01

    Full Text Available Fungal plant pathogens rapidly evolve virulence on resistant hosts through mutations in genes encoding proteins that modulate the host immune responses. The mutational spectrum likely includes chromosomal rearrangements responsible for gains or losses of entire genes. However, the mechanisms creating adaptive structural variation in fungal pathogen populations are poorly understood. We used complete genome assemblies to quantify structural variants segregating in the highly polymorphic fungal wheat pathogen Zymoseptoria tritici. The genetic basis of virulence in Z. tritici is complex, and populations harbor significant genetic variation for virulence; hence, we aimed to identify whether structural variation led to functional differences. We combined single-molecule real-time sequencing, genetic maps, and transcriptomics data to generate a fully assembled and annotated genome of the highly virulent field isolate 3D7. Comparative genomics analyses against the complete reference genome IPO323 identified large chromosomal inversions and the complete gain or loss of transposable-element clusters, explaining the extensive chromosomal-length polymorphisms found in this species. Both the 3D7 and IPO323 genomes harbored long tracts of sequences exclusive to one of the two genomes. These orphan regions contained 296 genes unique to the 3D7 genome and not previously known for this species. These orphan genes tended to be organized in clusters and showed evidence of mutational decay. Moreover, the orphan genes were enriched in genes encoding putative effectors and included a gene that is one of the most upregulated putative effector genes during wheat infection. Our study showed that this pathogen species harbored extensive chromosomal structure polymorphism that may drive the evolution of virulence.

  12. The Rhodomonas salina mitochondrial genome: bacteria-like operons, compact gene arrangement and complex repeat region.

    Science.gov (United States)

    Hauth, Amy M; Maier, Uwe G; Lang, B Franz; Burger, Gertraud

    2005-01-01

    To gain insight into the mitochondrial genome structure and gene content of a putatively ancestral group of eukaryotes, the cryptophytes, we sequenced the complete mitochondrial DNA of Rhodomonas salina. The 48 063 bp circular-mapping molecule codes for 2 rRNAs, 27 tRNAs and 40 proteins including 23 components of oxidative phosphorylation, 15 ribosomal proteins and two subunits of tat translocase. One potential protein (ORF161) is without assigned function. Only two introns occur in the genome; both are present within cox1 belong to group II and contain RT open reading frames. Primitive genome features include bacteria-like rRNAs and tRNAs, ribosomal protein genes organized in large clusters resembling bacterial operons and the presence of the otherwise rare genes such as rps1 and tatA. The highly compact gene organization contrasts with the presence of a 4.7 kb long, repeat-containing intergenic region. Repeat motifs approximately 40-700 bp long occur up to 31 times, forming a complex repeat structure. Tandem repeats are the major arrangement but the region also includes a large, approximately 3 kb, inverted repeat and several potentially stable approximately 40-80 bp long hairpin structures. We provide evidence that the large repeat region is involved in replication and transcription initiation, predict a promoter motif that occurs in three locations and discuss two likely scenarios of how this highly structured repeat region might have evolved.

  13. Read clouds uncover variation in complex regions of the human genome.

    Science.gov (United States)

    Bishara, Alex; Liu, Yuling; Weng, Ziming; Kashef-Haghighi, Dorna; Newburger, Daniel E; West, Robert; Sidow, Arend; Batzoglou, Serafim

    2015-10-01

    Although an increasing amount of human genetic variation is being identified and recorded, determining variants within repeated sequences of the human genome remains a challenge. Most population and genome-wide association studies have therefore been unable to consider variation in these regions. Core to the problem is the lack of a sequencing technology that produces reads with sufficient length and accuracy to enable unique mapping. Here, we present a novel methodology of using read clouds, obtained by accurate short-read sequencing of DNA derived from long fragment libraries, to confidently align short reads within repeat regions and enable accurate variant discovery. Our novel algorithm, Random Field Aligner (RFA), captures the relationships among the short reads governed by the long read process via a Markov Random Field. We utilized a modified version of the Illumina TruSeq synthetic long-read protocol, which yielded shallow-sequenced read clouds. We test RFA through extensive simulations and apply it to discover variants on the NA12878 human sample, for which shallow TruSeq read cloud sequencing data are available, and on an invasive breast carcinoma genome that we sequenced using the same method. We demonstrate that RFA facilitates accurate recovery of variation in 155 Mb of the human genome, including 94% of 67 Mb of segmental duplication sequence and 96% of 11 Mb of transcribed sequence, that are currently hidden from short-read technologies.

  14. Genomic Regions Associated With Interspecies Communication in Dogs Contain Genes Related to Human Social Disorders

    Science.gov (United States)

    Persson, Mia E.; Wright, Dominic; Roth, Lina S. V.; Batakis, Petros; Jensen, Per

    2016-01-01

    Unlike their wolf ancestors, dogs have unique social skills for communicating and cooperating with humans. Previously, significant heritabilities for human-directed social behaviors have been found in laboratory beagles. Here, a Genome-Wide Association Study identified two genomic regions associated with dog’s human-directed social behaviors. We recorded the propensity of laboratory beagles, bred, kept and handled under standardized conditions, to initiate physical interactions with a human during an unsolvable problem-task, and 190 individuals were genotyped with an HD Canine SNP-chip. One genetic marker on chromosome 26 within the SEZ6L gene was significantly associated with time spent close to, and in physical contact with, the human. Two suggestive markers on chromosome 26, located within the ARVCF gene, were also associated with human contact seeking. Strikingly, four additional genes present in the same linkage blocks affect social abilities in humans, e.g., SEZ6L has been associated with autism and COMT affects aggression in adolescents with ADHD. This is, to our knowledge, the first genome-wide study presenting candidate genomic regions for dog sociability and inter-species communication. These results advance our understanding of dog domestication and raise the use of the dog as a novel model system for human social disorders. PMID:27685260

  15. Small tumor virus genomes are integrated near nuclear matrix attachment regions in transformed cells.

    Science.gov (United States)

    Shera, K A; Shera, C A; McDougall, J K

    2001-12-01

    More than 15% of human cancers have a viral etiology. In benign lesions induced by the small DNA tumor viruses, viral genomes are typically maintained extrachromosomally. Malignant progression is often associated with viral integration into host cell chromatin. To study the role of viral integration in tumorigenesis, we analyzed the positions of integrated viral genomes in tumors and tumor cell lines induced by the small oncogenic viruses, including the high-risk human papillomaviruses, hepatitis B virus, simian virus 40, and human T-cell leukemia virus type 1. We show that viral integrations in tumor cells lie near cellular sequences identified as nuclear matrix attachment regions (MARs), while integrations in nonneoplastic cells show no significant correlation with these regions. In mammalian cells, the nuclear matrix functions in gene expression and DNA replication. MARs play varied but poorly understood roles in eukaryotic gene expression. Our results suggest that integrated tumor virus genomes are subject to MAR-mediated transcriptional regulation, providing insight into mechanisms of viral carcinogenesis. Furthermore, the viral oncoproteins serve as invaluable tools for the study of mechanisms controlling cellular growth. Similarly, our demonstration that integrated viral genomes may be subject to MAR-mediated transcriptional effects should facilitate elucidation of fundamental mechanisms regulating eukaryotic gene expression.

  16. Genomic Regions Associated With Interspecies Communication in Dogs Contain Genes Related to Human Social Disorders.

    Science.gov (United States)

    Persson, Mia E; Wright, Dominic; Roth, Lina S V; Batakis, Petros; Jensen, Per

    2016-09-29

    Unlike their wolf ancestors, dogs have unique social skills for communicating and cooperating with humans. Previously, significant heritabilities for human-directed social behaviors have been found in laboratory beagles. Here, a Genome-Wide Association Study identified two genomic regions associated with dog's human-directed social behaviors. We recorded the propensity of laboratory beagles, bred, kept and handled under standardized conditions, to initiate physical interactions with a human during an unsolvable problem-task, and 190 individuals were genotyped with an HD Canine SNP-chip. One genetic marker on chromosome 26 within the SEZ6L gene was significantly associated with time spent close to, and in physical contact with, the human. Two suggestive markers on chromosome 26, located within the ARVCF gene, were also associated with human contact seeking. Strikingly, four additional genes present in the same linkage blocks affect social abilities in humans, e.g., SEZ6L has been associated with autism and COMT affects aggression in adolescents with ADHD. This is, to our knowledge, the first genome-wide study presenting candidate genomic regions for dog sociability and inter-species communication. These results advance our understanding of dog domestication and raise the use of the dog as a novel model system for human social disorders.

  17. Deciphering heterogeneity in pig genome assembly Sscrofa9 by isochore and isochore-like region analyses.

    Directory of Open Access Journals (Sweden)

    Wenqian Zhang

    Full Text Available BACKGROUND: The isochore, a large DNA sequence with relatively small GC variance, is one of the most important structures in eukaryotic genomes. Although the isochore has been widely studied in humans and other species, little is known about its distribution in pigs. PRINCIPAL FINDINGS: In this paper, we construct a map of long homogeneous genome regions (LHGRs, i.e., isochores and isochore-like regions, in pigs to provide an intuitive version of GC heterogeneity in each chromosome. The LHGR pattern study not only quantifies heterogeneities, but also reveals some primary characteristics of the chromatin organization, including the followings: (1 the majority of LHGRs belong to GC-poor families and are in long length; (2 a high gene density tends to occur with the appearance of GC-rich LHGRs; and (3 the density of LINE repeats decreases with an increase in the GC content of LHGRs. Furthermore, a portion of LHGRs with particular GC ranges (50%-51% and 54%-55% tend to have abnormally high gene densities, suggesting that biased gene conversion (BGC, as well as time- and energy-saving principles, could be of importance to the formation of genome organization. CONCLUSION: This study significantly improves our knowledge of chromatin organization in the pig genome. Correlations between the different biological features (e.g., gene density and repeat density and GC content of LHGRs provide a unique glimpse of in silico gene and repeats prediction.

  18. A hybrid neural network system for prediction and recognition of promoter regions in human genome

    Institute of Scientific and Technical Information of China (English)

    CHEN Chuan-bo; LI Tao

    2005-01-01

    This paper proposes a high specificity and sensitivity algorithm called PromPredictor for recognizing promoter regions in the human genome. PromPredictor extracts compositional features and CpG islands information from genomic sequence,feeding these features as input for a hybrid neural network system (HNN) and then applies the HNN for prediction. It combines a novel promoter recognition model, coding theory, feature selection and dimensionality reduction with machine learning algorithm.Evaluation on Human chromosome 22 was ~66% in sensitivity and ~48% in specificity. Comparison with two other systems revealed that our method had superior sensitivity and specificity in predicting promoter regions. PromPredictor is written in MATLAB and requires Matlab to run. PromPredictor is freely available at http://www.whtelecom.com/Prompredictor.htm.

  19. New insights into the origin of the B genome of hexaploid wheat: Evolutionary relationships at the SPA genomic region with the S genome of the diploid relative Aegilops speltoides

    Directory of Open Access Journals (Sweden)

    Charmet Gilles

    2008-11-01

    Full Text Available Abstract Background Several studies suggested that the diploid ancestor of the B genome of tetraploid and hexaploid wheat species belongs to the Sitopsis section, having Aegilops speltoides (SS, 2n = 14 as the closest identified relative. However molecular relationships based on genomic sequence comparison, including both coding and non-coding DNA, have never been investigated. In an attempt to clarify these relationships, we compared, in this study, sequences of the Storage Protein Activator (SPA locus region of the S genome of Ae. speltoides (2n = 14 to that of the A, B and D genomes co-resident in the hexaploid wheat species (Triticum aestivum, AABBDD, 2n = 42. Results Four BAC clones, spanning the SPA locus of respectively the A, B, D and S genomes, were isolated and sequenced. Orthologous genomic regions were identified as delimited by shared non-transposable elements and non-coding sequences surrounding the SPA gene and correspond to 35 268, 22 739, 43 397 and 53 919 bp for the A, B, D and S genomes, respectively. Sequence length discrepancies within and outside the SPA orthologous regions are the result of non-shared transposable elements (TE insertions, all of which inserted after the progenitors of the four genomes divergence. Conclusion On the basis of conserved sequence length as well as identity of the shared non-TE regions and the SPA coding sequence, Ae speltoides appears to be more evolutionary related to the B genome of T. aestivum than the A and D genomes. However, the differential insertions of TEs, none of which are conserved between the two genomes led to the conclusion that the S genome of Ae. speltoides has diverged very early from the progenitor of the B genome which remains to be identified.

  20. The structure of the Morganella morganii lipopolysaccharide core region and identification of its genomic loci.

    Science.gov (United States)

    Vinogradov, Evgeny; Nash, John H E; Foote, Simon; Young, N Martin

    2015-01-30

    The core region of the lipopolysaccharide of Morganella morganii serotype O:1ab was obtained by hydrolysis of the LPS and studied by 2D NMR, ESI MS, and chemical methods. Its structure was highly homologous to those from the two major members of the same Proteeae tribe, Proteus mirabilis and Providencia alcalifaciens, and analysis of the M. morganii genome disclosed that the loci for its outer core, lipid A and Ara4N moieties are similarly conserved.

  1. The SeqWord Genome Browser: an online tool for the identification and visualization of atypical regions of bacterial genomes through oligonucleotide usage

    Directory of Open Access Journals (Sweden)

    Tümmler Burkhard

    2008-08-01

    Full Text Available Abstract Background Data mining in large DNA sequences is a major challenge in microbial genomics and bioinformatics. Oligonucleotide usage (OU patterns provide a wealth of information for large scale sequence analysis and visualization. The purpose of this research was to make OU statistical analysis available as a novel web-based tool for functional genomics and annotation. The tool is also available as a downloadable package. Results The SeqWord Genome Browser (SWGB was developed to visualize the natural compositional variation of DNA sequences. The applet is also used for identification of divergent genomic regions both in annotated sequences of bacterial chromosomes, plasmids, phages and viruses, and in raw DNA sequences prior to annotation by comparing local and global OU patterns. The applet allows fast and reliable identification of clusters of horizontally transferred genomic islands, large multi-domain genes and genes for ribosomal RNA. Within the majority of genomic fragments (also termed genomic core sequence, regions enriched with housekeeping genes, ribosomal proteins and the regions rich in pseudogenes or genetic vestiges may be contrasted. Conclusion The SWGB applet presents a range of comprehensive OU statistical parameters calculated for a range of bacterial species, plasmids and phages. It is available on the Internet at http://www.bi.up.ac.za/SeqWord/mhhapplet.php.

  2. Tandem repeat regions within the Burkholderia pseudomallei genome and their application for high resolution genotyping

    Directory of Open Access Journals (Sweden)

    Harvey Steven P

    2007-03-01

    Full Text Available Abstract Background The facultative, intracellular bacterium Burkholderia pseudomallei is the causative agent of melioidosis, a serious infectious disease of humans and animals. We identified and categorized tandem repeat arrays and their distribution throughout the genome of B. pseudomallei strain K96243 in order to develop a genetic typing method for B. pseudomallei. We then screened 104 of the potentially polymorphic loci across a diverse panel of 31 isolates including B. pseudomallei, B. mallei and B. thailandensis in order to identify loci with varying degrees of polymorphism. A subset of these tandem repeat arrays were subsequently developed into a multiple-locus VNTR analysis to examine 66 B. pseudomallei and 21 B. mallei isolates from around the world, as well as 95 lineages from a serial transfer experiment encompassing ~18,000 generations. Results B. pseudomallei contains a preponderance of tandem repeat loci throughout its genome, many of which are duplicated elsewhere in the genome. The majority of these loci are composed of repeat motif lengths of 6 to 9 bp with 4 to 10 repeat units and are predominately located in intergenic regions of the genome. Across geographically diverse B. pseudomallei and B.mallei isolates, the 32 VNTR loci displayed between 7 and 28 alleles, with Nei's diversity values ranging from 0.47 and 0.94. Mutation rates for these loci are comparable (>10-5 per locus per generation to that of the most diverse tandemly repeated regions found in other less diverse bacteria. Conclusion The frequency, location and duplicate nature of tandemly repeated regions within the B. pseudomallei genome indicate that these tandem repeat regions may play a role in generating and maintaining adaptive genomic variation. Multiple-locus VNTR analysis revealed extensive diversity within the global isolate set containing B. pseudomallei and B. mallei, and it detected genotypic differences within clonal lineages of both species that were

  3. Complete mitochondrial genome of the mudskipper Boleophthalmus pectinirostris (Perciformes, Gobiidae): repetitive sequences in the control region.

    Science.gov (United States)

    Liu, Zhi Zhi; Wang, Cong Tao; Ma, Ling Bo; He, An Yuan; Yang, Jin Quan; Tang, Wen Qiao

    2012-02-01

    The mudskipper, Boleophthalmus pectinirostris (Perciformes, Gobiidae), is an amphibious gobioid fish. In this paper, the complete mitochondrial genome of B. pectinirostris was firstly determined. The mitogenome (17,111 bp) comprises 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes and 1 putative control region. 130-bp tandem repeat was identified in the control region, which was almost identical among the 10 individuals examined, and three different frequencies of the repeat unit (five, six or seven) were found among these individuals.

  4. Contrast features of CpG islands in the promoter and other regions in the dog genome.

    Science.gov (United States)

    Han, Leng; Zhao, Zhongming

    2009-08-01

    The recent release of the domestic dog genome provides us with an ideal opportunity to investigate dog-specific genomic features. In this study, we performed a systematic analysis of CpG islands (CGIs), which are often considered gene markers, in the dog genome. Relative to the human and mouse genomes, the dog genome has a remarkably large number of CGIs and high CGI density, which is contributed by its noncoding sequences. Surprisingly, the dog genome has fewer CGIs associated with the promoter regions of genes than the human or the mouse. Further examination of functional features of dog-human-mouse homologous genes suggests that the dog might have undergone a faster erosion rate of promoter-associated CGIs than the human or mouse. Some genetic or genomic factors such as local recombination rate and karyotype may be related to the unique dog CGI features.

  5. The latent origin of replication of Epstein-Barr virus directs viral genomes to active regions of the nucleus.

    Science.gov (United States)

    Deutsch, Manuel J; Ott, Elisabeth; Papior, Peer; Schepers, Aloys

    2010-03-01

    The Epstein-Barr virus efficiently infects human B cells. The EBV genome is maintained extrachromosomally and replicates synchronously with the host's chromosomes. The latent origin of replication (oriP) guarantees plasmid stability by mediating two basic functions: replication and segregation of the viral genome. While the segregation process of EBV genomes is well understood, little is known about its chromatin association and nuclear distribution during interphase. Here, we analyzed the nuclear localization of EBV genomes and the role of functional oriP domains FR and DS for basic functions such as the transformation of primary cells, their role in targeting EBV genomes to distinct nuclear regions, and their association with epigenetic domains. Fluorescence in situ hybridization visualized the localization of extrachromosomal EBV genomes in the regions adjacent to chromatin-dense territories called the perichromatin. Further, immunofluorescence experiments demonstrated a preference of the viral genome for histone 3 lysine 4-trimethylated (H3K4me3) and histone 3 lysine 9-acetylated (H3K9ac) nuclear regions. To determine the role of FR and DS for establishment and subnuclear localization of EBV genomes, we transformed primary human B lymphocytes with recombinant mini-EBV genomes containing different oriP mutants. The loss of DS results in a slightly increased association in H3K27me3 domains. This study demonstrates that EBV genomes or oriP-based extrachromosomal vector systems are integrated into the higher order nuclear organization. We found that viral genomes are not randomly distributed in the nucleus. FR but not DS is crucial for the localization of EBV in perichromatic regions that are enriched for H3K4me3 and H3K9ac, which are hallmarks of transcriptionally active regions.

  6. Variability among the most rapidly evolving plastid genomic regions is lineage-specific: implications of pairwise genome comparisons in Pyrus (Rosaceae and other angiosperms for marker choice.

    Directory of Open Access Journals (Sweden)

    Nadja Korotkova

    Full Text Available Plastid genomes exhibit different levels of variability in their sequences, depending on the respective kinds of genomic regions. Genes are usually more conserved while noncoding introns and spacers evolve at a faster pace. While a set of about thirty maximum variable noncoding genomic regions has been suggested to provide universally promising phylogenetic markers throughout angiosperms, applications often require several regions to be sequenced for many individuals. Our project aims to illuminate evolutionary relationships and species-limits in the genus Pyrus (Rosaceae-a typical case with very low genetic distances between taxa. In this study, we have sequenced the plastid genome of Pyrus spinosa and aligned it to the already available P. pyrifolia sequence. The overall p-distance of the two Pyrus genomes was 0.00145. The intergenic spacers between ndhC-trnV, trnR-atpA, ndhF-rpl32, psbM-trnD, and trnQ-rps16 were the most variable regions, also comprising the highest total numbers of substitutions, indels and inversions (potentially informative characters. Our comparative analysis of further plastid genome pairs with similar low p-distances from Oenothera (representing another rosid, Olea (asterids and Cymbidium (monocots showed in each case a different ranking of genomic regions in terms of variability and potentially informative characters. Only two intergenic spacers (ndhF-rpl32 and trnK-rps16 were consistently found among the 30 top-ranked regions. We have mapped the occurrence of substitutions and microstructural mutations in the four genome pairs. High AT content in specific sequence elements seems to foster frequent mutations. We conclude that the variability among the fastest evolving plastid genomic regions is lineage-specific and thus cannot be precisely predicted across angiosperms. The often lineage-specific occurrence of stem-loop elements in the sequences of introns and spacers also governs lineage-specific mutations. Sequencing

  7. Genomic region operation kit for flexible processing of deep sequencing data.

    Science.gov (United States)

    Ovaska, Kristian; Lyly, Lauri; Sahu, Biswajyoti; Jänne, Olli A; Hautaniemi, Sampsa

    2013-01-01

    Computational analysis of data produced in deep sequencing (DS) experiments is challenging due to large data volumes and requirements for flexible analysis approaches. Here, we present a mathematical formalism based on set algebra for frequently performed operations in DS data analysis to facilitate translation of biomedical research questions to language amenable for computational analysis. With the help of this formalism, we implemented the Genomic Region Operation Kit (GROK), which supports various DS-related operations such as preprocessing, filtering, file conversion, and sample comparison. GROK provides high-level interfaces for R, Python, Lua, and command line, as well as an extension C++ API. It supports major genomic file formats and allows storing custom genomic regions in efficient data structures such as red-black trees and SQL databases. To demonstrate the utility of GROK, we have characterized the roles of two major transcription factors (TFs) in prostate cancer using data from 10 DS experiments. GROK is freely available with a user guide from >http://csbi.ltdk.helsinki.fi/grok/.

  8. Genomic Regions Associated with Feed Efficiency Indicator Traits in an Experimental Nellore Cattle Population

    Science.gov (United States)

    Olivieri, Bianca Ferreira; Mercadante, Maria Eugênia Zerlotti; Cyrillo, Joslaine Noely dos Santos Gonçalves; Branco, Renata Helena; Bonilha, Sarah Figueiredo Martins; de Albuquerque, Lucia Galvão; Silva, Rafael Medeiros de Oliveira; Baldi, Fernando

    2016-01-01

    The objective of this study was to identify genomic regions and metabolic pathways associated with dry matter intake, average daily gain, feed efficiency and residual feed intake in an experimental Nellore cattle population. The high-density SNP chip (Illumina High-Density Bovine BeadChip, 777k) was used to genotype the animals. The SNP markers effects and their variances were estimated using the single-step genome wide association method. The (co)variance components were estimated by Bayesian inference. The chromosome segments that are responsible for more than 1.0% of additive genetic variance were selected to explore and determine possible quantitative trait loci. The bovine genome Map Viewer was used to identify genes. In total, 51 genomic regions were identified for all analyzed traits. The heritability estimated for feed efficiency was low magnitude (0.13±0.06). For average daily gain, dry matter intake and residual feed intake, heritability was moderate to high (0.43±0.05; 0.47±0.05, 0.18±0.05, respectively). A total of 8, 17, 14 and 12 windows that are responsible for more than 1% of the additive genetic variance for dry matter intake, average daily gain, feed efficiency and residual feed intake, respectively, were identified. Candidate genes GOLIM4, RFX6, CACNG7, CACNG6, CAPN8, CAPN2, AKT2, GPRC6A, and GPR45 were associated with feed efficiency traits. It was expected that the response to selection would be higher for residual feed intake than for feed efficiency. Genomic regions harboring possible QTL for feed efficiency indicator traits were identified. Candidate genes identified are involved in energy use, metabolism protein, ion transport, transmembrane transport, the olfactory system, the immune system, secretion and cellular activity. The identification of these regions and their respective candidate genes should contribute to the formation of a genetic basis in Nellore cattle for feed efficiency indicator traits, and these results would support

  9. Sardinians genetic background explained by runs of homozygosity and genomic regions under positive selection.

    Directory of Open Access Journals (Sweden)

    Cornelia Di Gaetano

    Full Text Available The peculiar position of Sardinia in the Mediterranean sea has rendered its population an interesting biogeographical isolate. The aim of this study was to investigate the genetic population structure, as well as to estimate Runs of Homozygosity and regions under positive selection, using about 1.2 million single nucleotide polymorphisms genotyped in 1077 Sardinian individuals. Using four different methods--fixation index, inflation factor, principal component analysis and ancestry estimation--we were able to highlight, as expected for a genetic isolate, the high internal homogeneity of the island. Sardinians showed a higher percentage of genome covered by RoHs>0.5 Mb (F(RoH%0.5 when compared to peninsular Italians, with the only exception of the area surrounding Alghero. We furthermore identified 9 genomic regions showing signs of positive selection and, we re-captured many previously inferred signals. Other regions harbor novel candidate genes for positive selection, like TMEM252, or regions containing long non coding RNA. With the present study we confirmed the high genetic homogeneity of Sardinia that may be explained by the shared ancestry combined with the action of evolutionary forces.

  10. Comparative genomic analysis of duplicated homoeologous regions involved in the resistance of Brassica napus to stem canker

    Directory of Open Access Journals (Sweden)

    Berline eFopa Fomeju

    2015-09-01

    Full Text Available All crop species are current or ancient polyploids. Following whole genome duplication, structural and functional modifications result in differential gene content or regulation in the duplicated regions, which can play a fundamental role in the diversification of genes underlying complex traits. We have investigated this issue in Brassica napus, a species with a highly duplicated genome, with the aim of studying the structural and functional organization of duplicated regions involved in quantitative resistance to stem canker, a disease caused by the fungal pathogen Leptosphaeria maculans. Genome-wide association analysis on two oilseed rape panels confirmed that duplicated regions of ancestral blocks E, J, R, U and W were involved in resistance to stem canker. The structural analysis of the duplicated genomic regions showed a higher gene density on the A genome than on the C genome and a better collinearity between homoeologous regions than paralogous regions, as overall in the whole B. napus genome. The three ancestral sub-genomes were involved in the resistance to stem canker and the fractionation profile of the duplicated regions corresponded to what was expected from results on the B. napus progenitors. About 60% of the genes identified in these duplicated regions were single-copy genes while less than 5% were retained in all the duplicated copies of a given ancestral block. Genes retained in several copies were mainly involved in response to stress, signaling or transcription regulation. Genes with resistance-associated markers were mainly retained in more than two copies. These results suggested that some genes underlying quantitative resistance to stem canker might be duplicated genes. Genes with a hydrolase activity that were retained in one copy or R-like genes might also account for resistance in some regions. Further analyses need to be conducted to indicate to what extent duplicated genes contribute to the expression of the

  11. Drosophila duplication hotspots are associated with late-replicating regions of the genome.

    Directory of Open Access Journals (Sweden)

    Margarida Cardoso-Moreira

    2011-11-01

    Full Text Available Duplications play a significant role in both extremes of the phenotypic spectrum of newly arising mutations: they can have severe deleterious effects (e.g. duplications underlie a variety of diseases but can also be highly advantageous. The phenotypic potential of newly arisen duplications has stimulated wide interest in both the mutational and selective processes shaping these variants in the genome. Here we take advantage of the Drosophila simulans-Drosophila melanogaster genetic system to further our understanding of both processes. Regarding mutational processes, the study of two closely related species allows investigation of the potential existence of shared duplication hotspots, and the similarities and differences between the two genomes can be used to dissect its underlying causes. Regarding selection, the difference in the effective population size between the two species can be leveraged to ask questions about the strength of selection acting on different classes of duplications. In this study, we conducted a survey of duplication polymorphisms in 14 different lines of D. simulans using tiling microarrays and combined it with an analogous survey for the D. melanogaster genome. By integrating the two datasets, we identified duplication hotspots conserved between the two species. However, unlike the duplication hotspots identified in mammalian genomes, Drosophila duplication hotspots are not associated with sequences of high sequence identity capable of mediating non-allelic homologous recombination. Instead, Drosophila duplication hotspots are associated with late-replicating regions of the genome, suggesting a link between DNA replication and duplication rates. We also found evidence supporting a higher effectiveness of selection on duplications in D. simulans than in D. melanogaster. This is also true for duplications segregating at high frequency, where we find evidence in D. simulans that a sizeable fraction of these mutations is

  12. High-Throughput Resequencing of Maize Landraces at Genomic Regions Associated with Flowering Time

    Science.gov (United States)

    Jamann, Tiffany M.; Sood, Shilpa; Wisser, Randall J.; Holland, James B.

    2017-01-01

    Despite the reduction in the price of sequencing, it remains expensive to sequence and assemble whole, complex genomes of multiple samples for population studies, particularly for large genomes like those of many crop species. Enrichment of target genome regions coupled with next generation sequencing is a cost-effective strategy to obtain sequence information for loci of interest across many individuals, providing a less expensive approach to evaluating sequence variation at the population scale. Here we evaluate amplicon-based enrichment coupled with semiconductor sequencing on a validation set consisting of three maize inbred lines, two hybrids and 19 landrace accessions. We report the use of a multiplexed panel of 319 PCR assays that target 20 candidate loci associated with photoperiod sensitivity in maize while requiring 25 ng or less of starting DNA per sample. Enriched regions had an average on-target sequence read depth of 105 with 98% of the sequence data mapping to the maize ‘B73’ reference and 80% of the reads mapping to the target interval. Sequence reads were aligned to B73 and 1,486 and 1,244 variants were called using SAMtools and GATK, respectively. Of the variants called by both SAMtools and GATK, 30% were not previously reported in maize. Due to the high sequence read depth, heterozygote genotypes could be called with at least 92.5% accuracy in hybrid materials using GATK. The genetic data are congruent with previous reports of high total genetic diversity and substantial population differentiation among maize landraces. In conclusion, semiconductor sequencing of highly multiplexed PCR reactions is a cost-effective strategy for resequencing targeted genomic loci in diverse maize materials. PMID:28045987

  13. Genomic study of the critical region of chromosome 21 associated to Down syndrome

    Directory of Open Access Journals (Sweden)

    Julio César Montoya

    2011-03-01

    Full Text Available Introduction: Previous reports have identified a region of chromosome 21 known as Down ayndrome critical region (DSCR in which the expression of some genes would modulate the main clinical characteristics of this pathology. In this sense, there is currently limited information on the architecture of the DSCR associated.Objective: To obtain in silico a detailed vision of the chromatin structure associated with the evaluation of genomic covariables contained in public data bases.Methods: Taking as reference the information consigned in the National Center for Biotechnology Information, the Genome Browser from the University of California at Santa Cruz and from the HapMap project, a chromosome walk along 21 Mb of the distal portion of chromosome 21q arm was performed. In this distal portion, the number of single nucleotide polymorphisms (SNP, number of CpG islands, repetitive elements, recombination frequencies, and topographical state of that chromatin were recorded.Results: The frequency of CpG islands and Ref genes increased in the more distal 1.2 Mb DSCR that contrast with those localized near to the centromere. The highest level of recombination calculated for women was registered in the 21q22.12 to 22.3 bands. DSCR 6 and 9 genes showed a high percentage of methylation in CpG islands in DNA from normal and trisomic fibroblasts. The DSCR2 gene exhibited high levels of open chromatin and also methylation in some lysine residues of the histone H3 as relevant characteristics.Conclusion: The existence of a genomic environment characterized by high values of recombination frequencies and CpG methylation in DSCR 6 and 9 and also DSCR2 genes led us to postulate that in non-disjunction detected in Down syndrome, complex genomic, epigenetic and environmental relationships regulate some processes of meiosis.

  14. Genomic study of the critical region of chromosome 21 associated to Down syndrome

    Directory of Open Access Journals (Sweden)

    Julio César Montoya

    2011-04-01

    Full Text Available Introduction: Previous reports have identified a region of chromosome 21 known as Down ayndrome critical region (DSCR in which the expression of some genes would modulate the main clinical characteristics of this pathology. In this sense, there is currently limited information on the architecture of the DSCR associated. Objective: To obtain in silico a detailed vision of the chromatin structure associated with the evaluation of genomic covariables contained in public data bases. Methods: Taking as reference the information consigned in the National Center for Biotechnology Information, the Genome Browser from the University of California at Santa Cruz and from the HapMap project, a chromosome walk along 21 Mb of the distal portion of chromosome 21q arm was performed. In this distal portion, the number of single nucleotide polymorphisms (SNP, number of CpG islands, repetitive elements, recombination frequencies, and topographical state of that chromatin were recorded. Results: The frequency of CpG islands and Ref genes increased in the more distal 1.2 Mb DSCR that contrast with those localized near to the centromere. The highest level of recombination calculated for women was registered in the 21q22.12 to 22.3 bands. DSCR 6 and 9 genes showed a high percentage of methylation in CpG islands in DNA from normal and trisomic fibroblasts. The DSCR2 gene exhibited high levels of open chromatin and also methylation in some lysine residues of the histone H3 as relevant characteristics. Conclusion: The existence of a genomic environment characterized by high values of recombination frequencies and CpG methylation in DSCR 6 and 9 and also DSCR2 genes led us to postulate that in non-disjunction detected in Down syndrome, complex genomic, epigenetic and environmental relationships regulate some processes of meiosis.

  15. Complete genome sequence of Deltapapillomavirus 4 (bovine papillomavirus 2 from a bovine papillomavirus lesion in Amazon Region, Brazil

    Directory of Open Access Journals (Sweden)

    Cíntia Daudt

    2016-04-01

    Full Text Available The complete genome sequence of bovine papillomavirus 2 (BPV2 from Brazilian Amazon Region was determined using multiple-primed rolling circle amplification followed by Illumina sequencing. The genome is 7,947 bp long, with 45.9% GC content. It encodes seven early (E1, E2,E4, E5, E6,E7, and E8 and two late (L1 and L2 genes. The complete genome of a BPV2 can help in future studies since this BPV type is highly reported worldwide although the lack of complete genome sequences available.

  16. Complete genome sequence of Deltapapillomavirus 4 (bovine papillomavirus 2) from a bovine papillomavirus lesion in Amazon Region, Brazil

    Science.gov (United States)

    Daudt, Cíntia; da Silva, Flavio RC; Cibulski, Samuel P; Weber, Matheus N; Mayer, Fabiana Q; Varela, Ana Paula M; Roehe, Paulo M; Canal, Cláudio W

    2016-01-01

    The complete genome sequence of bovine papillomavirus 2 (BPV2) from Brazilian Amazon Region was determined using multiple-primed rolling circle amplification followed by Illumina sequencing. The genome is 7,947 bp long, with 45.9% GC content. It encodes seven early (E1, E2,E4, E5, E6,E7, and E8) and two late (L1 and L2) genes. The complete genome of a BPV2 can help in future studies since this BPV type is highly reported worldwide although the lack of complete genome sequences available. PMID:27074259

  17. Rapid genome evolution in Pms1 region of rice revealed by comparative sequence analysis

    Institute of Scientific and Technical Information of China (English)

    YU JinSheng; FAN YouRong; LIU Nan; SHAN Yan; LI XiangHua; ZHANG QiFa

    2007-01-01

    Pms1, a locus for photoperiod sensitive genic male sterility in rice, was identified and mapped to chromosome 7 in previous studies. Here we report an effort to identify the candidate genes for Pms1 by comparative sequencing of BAC clones from two cultivars Minghui 63 and Nongken 58, the parents for the initial mapping population. Annotation and comparison of the sequences of the two clones resulted in a total of five potential candidates which should be functionally tested. We also conducted comparative analysis of sequences of these two cultivars with two other cultivars, Nipponbare and 93-11,for which sequence data were available in public databases. The analysis revealed large differences in sequence composition among the four genotypes in the Pms1 region primarily due to retroelement activity leading to rapid recent growth and divergence of the genomes. High levels of polymorphism in the forms of indels and SNPs were found both in intra- and inter-subspecific comparisons. Dating analysis using LTRs of the retroelements in this region showed that the substitution rate of LTRs was much higher than reported in the literature. The results provided strong evidence for rapid genomic evolution of this region as a consequence of natural and artificial selection.

  18. Identification of genomic regions associated with phenotypic variation between dog breeds using selection mapping

    DEFF Research Database (Denmark)

    Vaysse, Amaury; Ratnakumar, Abhirami; Derrien, Thomas;

    2011-01-01

    The extraordinary phenotypic diversity of dog breeds has been sculpted by a unique population history accompanied by selection for novel and desirable traits. Here we perform a comprehensive analysis using multiple test statistics to identify regions under selection in 509 dogs from 46 diverse...... across the genome in dog breeds are the result of both selection and genetic drift, but extended blocks of homozygosity on a megabase scale appear to be best explained by selection. Further elucidation of the variants under selection will help to uncover the genetic basis of complex traits and disease....

  19. Global identification and characterization of transcriptionally active regions in the rice genome.

    Directory of Open Access Journals (Sweden)

    Lei Li

    Full Text Available Genome tiling microarray studies have consistently documented rich transcriptional activity beyond the annotated genes. However, systematic characterization and transcriptional profiling of the putative novel transcripts on the genome scale are still lacking. We report here the identification of 25,352 and 27,744 transcriptionally active regions (TARs not encoded by annotated exons in the rice (Oryza. sativa subspecies japonica and indica, respectively. The non-exonic TARs account for approximately two thirds of the total TARs detected by tiling arrays and represent transcripts likely conserved between japonica and indica. Transcription of 21,018 (83% japonica non-exonic TARs was verified through expression profiling in 10 tissue types using a re-array in which annotated genes and TARs were each represented by five independent probes. Subsequent analyses indicate that about 80% of the japonica TARs that were not assigned to annotated exons can be assigned to various putatively functional or structural elements of the rice genome, including splice variants, uncharacterized portions of incompletely annotated genes, antisense transcripts, duplicated gene fragments, and potential non-coding RNAs. These results provide a systematic characterization of non-exonic transcripts in rice and thus expand the current view of the complexity and dynamics of the rice transcriptome.

  20. Mitochondrial genome of the Levant Region honeybee, Apis mellifera syriaca (Hymenoptera: Apidae).

    Science.gov (United States)

    Haddad, Nizar Jamal

    2016-11-01

    The mitochondrial genome sequence of Levant Region honeybee, Apis mellifera syriaca, is analyzed and presented for the public for the first time. The genome of this honeybee is 15,428 bp in its length, containing 13 protein-coding genes, 22 transfer RNA genes and 2 ribosomal RNA genes. The overall base composition is A (42.88%), C (9.97%), G (5.85%), and T (41.3%), the percentage of A and T being higher than that of G and C. Percentage of non-ATGC characters is 0.007. All the genes are encoded on H-strand, except for four subunit genes (ND1, ND4, ND4L, and ND5), two rRNA genes and eight tRNA genes. The publication of the mitochondrial genome sequence will play a vital role in the conservation genetic projects of A. mellifera, in general, and Apis mellifera syriaca, in particular; moreover, it will be useful for further phylogenetic analysis.

  1. Full-genome sequences of hepatitis B virus subgenotype D3 isolates from the Brazilian Amazon Region

    Directory of Open Access Journals (Sweden)

    Natália Spitz

    2015-02-01

    Full Text Available The Brazilian Amazon Region is a highly endemic area for hepatitis B virus (HBV. However, little is known regarding the genetic variability of the strains circulating in this geographical region. Here, we describe the first full-length genomes of HBV isolated in the Brazilian Amazon Region; these genomes are also the first complete HBV subgenotype D3 genomes reported for Brazil. The genomes of the five Brazilian isolates were all 3,182 base pairs in length and the isolates were classified as belonging to subgenotype D3, subtypes ayw2 (n = 3 and ayw3 (n = 2. Phylogenetic analysis suggested that the Brazilian sequences are not likely to be closely related to European D3 sequences. Such results will contribute to further epidemiological and evolutionary studies of HBV.

  2. Full-genome sequences of hepatitis B virus subgenotype D3 isolates from the Brazilian Amazon Region.

    Science.gov (United States)

    Spitz, Natália; Mello, Francisco C A; Araujo, Natalia Motta

    2015-02-01

    The Brazilian Amazon Region is a highly endemic area for hepatitis B virus (HBV). However, little is known regarding the genetic variability of the strains circulating in this geographical region. Here, we describe the first full-length genomes of HBV isolated in the Brazilian Amazon Region; these genomes are also the first complete HBV subgenotype D3 genomes reported for Brazil. The genomes of the five Brazilian isolates were all 3,182 base pairs in length and the isolates were classified as belonging to subgenotype D3, subtypes ayw2 (n = 3) and ayw3 (n = 2). Phylogenetic analysis suggested that the Brazilian sequences are not likely to be closely related to European D3 sequences. Such results will contribute to further epidemiological and evolutionary studies of HBV.

  3. Dynamic nucleotide mutation gradients and control region usage in squamate reptile mitochondrial genomes.

    Science.gov (United States)

    Castoe, T A; Gu, W; de Koning, A P J; Daza, J M; Jiang, Z J; Parkinson, C L; Pollock, D D

    2009-01-01

    Gradients of nucleotide bias and substitution rates occur in vertebrate mitochondrial genomes due to the asymmetric nature of the replication process. The evolution of these gradients has previously been studied in detail in primates, but not in other vertebrate groups. From the primate study, the strengths of these gradients are known to evolve in ways that can substantially alter the substitution process, but it is unclear how rapidly they evolve over evolutionary time or how different they may be in different lineages or groups of vertebrates. Given the importance of mitochondrial genomes in phylogenetics and molecular evolutionary research, a better understanding of how asymmetric mitochondrial substitution gradients evolve would contribute key insights into how this gradient evolution may mislead evolutionary inferences, and how it may also be incorporated into new evolutionary models. Most snake mitochondrial genomes have an additional interesting feature, 2 nearly identical control regions, which vary among different species in the extent that they are used as origins of replication. Given the expanded sampling of complete snake genomes currently available, together with 2 additional snakes sequenced in this study, we reexamined gradient strength and CR usage in alethinophidian snakes as well as several lizards that possess dual CRs. Our results suggest that nucleotide substitution gradients (and corresponding nucleotide bias) and CR usage is highly labile over the approximately 200 m.y. of squamate evolution, and demonstrates greater overall variability than previously shown in primates. The evidence for the existence of such gradients, and their ability to evolve rapidly and converge among unrelated species suggests that gradient dynamics could easily mislead phylogenetic and molecular evolutionary inferences, and argues strongly that these dynamics should be incorporated into phylogenetic models.

  4. Selection for Unequal Densities of Sigma70 Promoter-like Signalsin Different Regions of Large Bacterial Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Huerta, Araceli M.; Francino, M. Pilar; Morett, Enrique; Collado-Vides, Julio

    2006-03-01

    The evolutionary processes operating in the DNA regions that participate in the regulation of gene expression are poorly understood. In Escherichia coli, we have established a sequence pattern that distinguishes regulatory from nonregulatory regions. The density of promoter-like sequences, that are recognizable by RNA polymerase and may function as potential promoters, is high within regulatory regions, in contrast to coding regions and regions located between convergently-transcribed genes. Moreover, functional promoter sites identified experimentally are often found in the subregions of highest density of promoter-like signals, even when individual sites with higher binding affinity for RNA polymerase exist elsewhere within the regulatory region. In order to investigate the generality of this pattern, we have used position weight matrices describing the -35 and -10 promoter boxes of E. coli to search for these motifs in 43 additional genomes belonging to most established bacterial phyla, after specific calibration of the matrices according to the base composition of the noncoding regions of each genome. We have found that all bacterial species analyzed contain similar promoter-like motifs, and that, in most cases, these motifs follow the same genomic distribution observed in E. coli. Differential densities between regulatory and nonregulatory regions are detectable in most bacterial genomes, with the exception of those that have experienced evolutionary extreme genome reduction. Thus, the phylogenetic distribution of this pattern mirrors that of genes and other genomic features that require weak selection to be effective in order to persist. On this basis, we suggest that the loss of differential densities in the reduced genomes of host-restricted pathogens and symbionts is the outcome of a process of genome degradation resulting from the decreased efficiency of purifying selection in highly structured small populations. This implies that the differential

  5. A genomic region involved in the formation of adhesin fibers in Bacillus cereus biofilms

    Directory of Open Access Journals (Sweden)

    Joaquín eCaro-Astorga

    2015-01-01

    Full Text Available Bacillus cereus is a bacterial pathogen that is responsible for many recurrent disease outbreaks due to food contamination. Spores and biofilms are considered the most important reservoirs of B. cereus in contaminated fresh vegetables and fruits. Biofilms are bacterial communities that are difficult to eradicate from biotic and abiotic surfaces because of their stable and extremely strong extracellular matrix. These extracellular matrixes contain exopolysaccharides, proteins, extracellular DNA, and other minor components. Although B. cereus can form biofilms, the bacterial features governing assembly of the protective extracellular matrix are not known. Using the well-studied bacterium B. subtilis as a model, we identified two genomic loci in B. cereus, which encodes two orthologs of the amyloid-like protein TasA of B. subtilis and a SipW signal peptidase. Deletion of this genomic region in B. cereus inhibited biofilm assembly; notably, mutation of the putative signal peptidase SipW caused the same phenotype. However, mutations in tasA or calY did not completely prevent biofilm formation; strains that were mutated for either of these genes formed phenotypically different surface attached biofilms. Electron microscopy studies revealed that TasA polymerizes to form long and abundant fibers on cell surfaces, whereas CalY does not aggregate similarly. Heterologous expression of this amyloid-like cassette in a B. subtilis strain lacking the factors required for the assembly of TasA amyloid-like fibers revealed i the involvement of this B. cereus genomic region in formation of the air-liquid interphase pellicles and ii the intrinsic ability of TasA to form fibers similar to the amyloid-like fibers produced by its B. subtilis ortholog.

  6. Microcollinearity in an ethylene receptor coding gene region of the Coffea canephora genome is extensively conserved with Vitis vinifera and other distant dicotyledonous sequenced genomes

    Directory of Open Access Journals (Sweden)

    Campa Claudine

    2009-02-01

    Full Text Available Abstract Background Coffea canephora, also called Robusta, belongs to the Rubiaceae, the fourth largest angiosperm family. This diploid species (2x = 2n = 22 has a fairly small genome size of ≈ 690 Mb and despite its extreme economic importance, particularly for developing countries, knowledge on the genome composition, structure and evolution remain very limited. Here, we report the 160 kb of the first C. canephora Bacterial Artificial Chromosome (BAC clone ever sequenced and its fine analysis. Results This clone contains the CcEIN4 gene, encoding an ethylene receptor, and twenty other predicted genes showing a high gene density of one gene per 7.8 kb. Most of them display perfect matches with C. canephora expressed sequence tags or show transcriptional activities through PCR amplifications on cDNA libraries. Twenty-three transposable elements, mainly Class II transposon derivatives, were identified at this locus. Most of these Class II elements are Miniature Inverted-repeat Transposable Elements (MITE known to be closely associated with plant genes. This BAC composition gives a pattern similar to those found in gene rich regions of Solanum lycopersicum and Medicago truncatula genomes indicating that the CcEIN4 regions may belong to a gene rich region in the C. canephora genome. Comparative sequence analysis indicated an extensive conservation between C. canephora and most of the reference dicotyledonous genomes studied in this work, such as tomato (S. lycopersicum, grapevine (V. vinifera, barrel medic M. truncatula, black cottonwood (Populus trichocarpa and Arabidopsis thaliana. The higher degree of microcollinearity was found between C. canephora and V. vinifera, which belong respectively to the Asterids and Rosids, two clades that diverged more than 114 million years ago. Conclusion This study provides a first glimpse of C. canephora genome composition and evolution. Our data revealed a remarkable conservation of the microcollinearity

  7. Lost region in amyloid precursor protein (APP) through TALEN-mediated genome editing alters mitochondrial morphology.

    Science.gov (United States)

    Wang, Yajie; Wu, Fengyi; Pan, Haining; Zheng, Wenzhong; Feng, Chi; Wang, Yunfu; Deng, Zixin; Wang, Lianrong; Luo, Jie; Chen, Shi

    2016-02-29

    Alzheimer's disease (AD) is characterized by amyloid-β (Aβ) deposition in the brain. Aβ plaques are produced through sequential β/γ cleavage of amyloid precursor protein (APP), of which there are three main APP isoforms: APP695, APP751 and APP770. KPI-APPs (APP751 and APP770) are known to be elevated in AD, but the reason remains unclear. Transcription activator-like (TAL) effector nucleases (TALENs) induce mutations with high efficiency at specific genomic loci, and it is thus possible to knock out specific regions using TALENs. In this study, we designed and expressed TALENs specific for the C-terminus of APP in HeLa cells, in which KPI-APPs are predominantly expressed. The KPI-APP mutants lack a 12-aa region that encompasses a 5-aa trans-membrane (TM) region and 7-aa juxta-membrane (JM) region. The mutated KPI-APPs exhibited decreased mitochondrial localization. In addition, mitochondrial morphology was altered, resulting in an increase in spherical mitochondria in the mutant cells through the disruption of the balance between fission and fusion. Mitochondrial dysfunction, including decreased ATP levels, disrupted mitochondrial membrane potential, increased ROS generation and impaired mitochondrial dehydrogenase activity, was also found. These results suggest that specific regions of KPI-APPs are important for mitochondrial localization and function.

  8. Sequence Analysis of SSR-Flanking Regions Identifies Genome Affinities between Pasture Grass Fungal Endophyte Taxa

    Directory of Open Access Journals (Sweden)

    Eline van Zijll de Jong

    2011-01-01

    Full Text Available Fungal species of the Neotyphodium and Epichloë genera are endophytes of pasture grasses showing complex differences of life-cycle and genetic architecture. Simple sequence repeat (SSR markers have been developed from endophyte-derived expressed sequence tag (EST collections. Although SSR array size polymorphisms are appropriate for phenetic analysis to distinguish between taxa, the capacity to resolve phylogenetic relationships is limited by both homoplasy and heteroploidy effects. In contrast, nonrepetitive sequence regions that flank SSRs have been effectively implemented in this study to demonstrate a common evolutionary origin of grass fungal endophytes. Consistent patterns of relationships between specific taxa were apparent across multiple target loci, confirming previous studies of genome evolution based on variation of individual genes. Evidence was obtained for the definition of endophyte taxa not only through genomic affinities but also by relative gene content. Results were compatible with the current view that some asexual Neotyphodium species arose following interspecific hybridisation between sexual Epichloë ancestors. Phylogenetic analysis of SSR-flanking regions, in combination with the results of previous studies with other EST-derived SSR markers, further permitted characterisation of Neotyphodium isolates that could not be assigned to known taxa on the basis of morphological characteristics.

  9. Detailed comparative map of human chromosome 19q and related regions of the mouse genome

    Energy Technology Data Exchange (ETDEWEB)

    Stubbs, L.; Shannon, M.E.; Kim, Joomyeong [Oak Ridge National Lab., TN (United States)] [and others

    1996-08-01

    One of the larger contiguous blocks of mouse-human genomic homology includes the proximal portion of mouse chromosome 7 and the long arm of human chromosome 19. Previous studies have demonstrated the close relationship between the two regions, but have also indicated significant rearrangements in the relative orders of homologous mouse and human genes. Here we present the genetic locations of the homologs of 42 human chromosome 19q markers in the mouse, with an emphasis on genes also included in the human chromosome 19 physical map. Our results demonstrate that despite an overall inversion of sequences relative to the centromere, apparent {open_quotes}transpositions{close_quotes} of three gene-rich segments, and a local inversion of markers mapping near the 19q telomere, gene content, order, and spacing are remarkably well conserved throughout the lengths of these related mouse and humans regions. Although most human 19q markers have remained genetically linked in mouse, one small human segment forms a separate region of homology between human chromosome 19q and mouse chromosome 17. Three of the four rearrangements of mouse versus human 19q sequences involve segments that are located directly adjacent to each other in 19q13.3-q13.4, suggesting either the coincident occurrence of these events or their common association with unstable DNA sequences. These data permit an unusually in-depth examination of this large region of mouse-human genomic homology and provide an important new tool to aid in the mapping of genes and associated phenotypes in both species. 66 refs., 3 figs., 1 tab.

  10. Epigenetic Mechanisms of Genomic Imprinting: Common Themes in the Regulation of Imprinted Regions in Mammals, Plants, and Insects

    Directory of Open Access Journals (Sweden)

    William A. MacDonald

    2012-01-01

    Full Text Available Genomic imprinting is a form of epigenetic inheritance whereby the regulation of a gene or chromosomal region is dependent on the sex of the transmitting parent. During gametogenesis, imprinted regions of DNA are differentially marked in accordance to the sex of the parent, resulting in parent-specific expression. While mice are the primary research model used to study genomic imprinting, imprinted regions have been described in a broad variety of organisms, including other mammals, plants, and insects. Each of these organisms employs multiple, interrelated, epigenetic mechanisms to maintain parent-specific expression. While imprinted genes and imprint control regions are often species and locus-specific, the same suites of epigenetic mechanisms are often used to achieve imprinted expression. This review examines some examples of the epigenetic mechanisms responsible for genomic imprinting in mammals, plants, and insects.

  11. Diversity and selective sweep in the OsAMT1;1 genomic region of rice

    Directory of Open Access Journals (Sweden)

    Chen Sheng

    2011-03-01

    Full Text Available Abstract Background Ammonium is one of the major forms in which nitrogen is available for plant growth. OsAMT1;1 is a high-affinity ammonium transporter in rice (Oryza sativa L., responsible for ammonium uptake at low nitrogen concentration. The expression pattern of the gene has been reported. However, variations in its nucleotides and the evolutionary pathway of its descent from wild progenitors are yet to be elucidated. In this study, nucleotide diversity of the gene OsAMT1;1 and the diversity pattern of seven gene fragments spanning a genomic region approximately 150 kb long surrounding the gene were surveyed by sequencing a panel of 216 rice accessions including both cultivated rice and wild relatives. Results Nucleotide polymorphism (Pi of OsAMT1;1 was as low as 0.00004 in cultivated rice (Oryza sativa, only 2.3% of that in the common wild rice (O. rufipogon. A single dominant haplotype was fixed at the locus in O. sativa. The test values for neutrality were significantly negative in the entire region stretching 5' upstream and 3' downstream of the gene in all accessions. The value of linkage disequilibrium remained high across a 100 kb genomic region around OsAMT1;1 in O. sativa, but fell rapidly in O. rufipogon on either side of the promoter of OsAMT1;1, demonstrating a strong natural selection within or nearby the ammonium transporter. Conclusions The severe reduction in nucleotide variation at OsAMT1;1 in rice was caused by a selective sweep around OsAMT1;1, which may reflect the nitrogen uptake system under strong selection by the paddy soil during the domestication of rice. Purifying selection also occurred before the wild rice diverged into its two subspecies, namely indica and japonica. These findings would provide useful insights into the processes of evolution and domestication of nitrogen uptake genes in rice.

  12. Mitochondrial Genome Analyses Suggest Multiple Trichuris Species in Humans, Baboons, and Pigs from Different Geographical Regions.

    Directory of Open Access Journals (Sweden)

    Mohamed B F Hawash

    Full Text Available The whipworms Trichuris trichiura and Trichuris suis are two parasitic nematodes of humans and pigs, respectively. Although whipworms in human and non-human primates historically have been referred to as T. trichiura, recent reports suggest that several Trichuris spp. are found in primates.We sequenced and annotated complete mitochondrial genomes of Trichuris recovered from a human in Uganda, an olive baboon in the US, a hamadryas baboon in Denmark, and two pigs from Denmark and Uganda. Comparative analyses using other published mitochondrial genomes of Trichuris recovered from a human and a porcine host in China and from a françois' leaf-monkey (China were performed, including phylogenetic analyses and pairwise genetic and amino acid distances. Genetic and protein distances between human Trichuris in Uganda and China were high (~19% and 15%, respectively suggesting that they represented different species. Trichuris from the olive baboon in US was genetically related to human Trichuris in China, while the other from the hamadryas baboon in Denmark was nearly identical to human Trichuris from Uganda. Baboon-derived Trichuris was genetically distinct from Trichuris from françois' leaf monkey, suggesting multiple whipworm species circulating among non-human primates. The genetic and protein distances between pig Trichuris from Denmark and other regions were roughly 9% and 6%, respectively, while Chinese and Ugandan whipworms were more closely related.Our results indicate that Trichuris species infecting humans and pigs are phylogenetically distinct across geographical regions, which might have important implications for the implementation of suitable and effective control strategies in different regions. Moreover, we provide support for the hypothesis that Trichuris infecting primates represents a complex of cryptic species with some species being able to infect both humans and non-human primates.

  13. Proteins Encoded in Genomic Regions Associated with Immune-Mediated Disease Physically Interact and Suggest Underlying Biology

    DEFF Research Database (Denmark)

    Rossin, Elizabeth J.; Hansen, Kasper Lage; Raychaudhuri, Soumya

    2011-01-01

    Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these r......Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed......-immune traits to assess its applicability to complex traits in general. We find that genes in loci associated to height and lipid levels assemble into significantly connected networks but did not detect excess connectivity among Type 2 Diabetes (T2D) loci beyond chance. Taken together, our results constitute...

  14. Genomic regions associated with ventro-cranial chronic pleuritis in pig.

    Science.gov (United States)

    Sørensen, K K; Gregersen, V R; Christensen, O F; Velander, I H; Bendixen, C

    2011-08-01

    Ventro-cranial chronic pleuritis can be a result of pleuropneumonia and enzootic pneumonia. These diseases cause severe losses in intensive pig production worldwide, but host resistance is difficult to breed for. It could be beneficial to use marker-assisted selection, and a step towards this is to identify genomic regions associated with the trait. For this purpose, 7304 pigs from 11 boar families were analysed for associations between single nucleotide polymorphisms and ventro-cranial chronic pleuritis. The pigs were genotyped by the use of the iSelect Custom 7 K porcine SNP Chip. Quantitative trait loci (QTL), significant at the chromosome-wide level, were identified on Sus scrofa chromosomes (SSC) 2, 4, 11, 12 and 13 in four different boar families. The QTL on SSC 4 in family G was also significant at the genome-wide threshold according to Bonferroni correction. We have identified a number of candidate genes, but the causative mutations still need to be identified. Markers closely associated with the resistance traits have a strong potential for use in breeding towards animals with improved characteristics concerning ventro-cranial chronic pleuritis.

  15. HYBRIDCHECK: software for the rapid detection, visualization and dating of recombinant regions in genome sequence data.

    Science.gov (United States)

    Ward, Ben J; van Oosterhout, Cock

    2016-03-01

    HYBRIDCHECK is a software package to visualize the recombination signal in large DNA sequence data set, and it can be used to analyse recombination, genetic introgression, hybridization and horizontal gene transfer. It can scan large (multiple kb) contigs and whole-genome sequences of three or more individuals. HYBRIDCHECK is written in the r software for OS X, Linux and Windows operating systems, and it has a simple graphical user interface. In addition, the r code can be readily incorporated in scripts and analysis pipelines. HYBRIDCHECK implements several ABBA-BABA tests and visualizes the effects of hybridization and the resulting mosaic-like genome structure in high-density graphics. The package also reports the following: (i) the breakpoint positions, (ii) the number of mutations in each introgressed block, (iii) the probability that the identified region is not caused by recombination and (iv) the estimated age of each recombination event. The divergence times between the donor and recombinant sequence are calculated using a JC, K80, F81, HKY or GTR correction, and the dating algorithm is exceedingly fast. By estimating the coalescence time of introgressed blocks, it is possible to distinguish between hybridization and incomplete lineage sorting. HYBRIDCHECK is libré software and it and its manual are free to download from http://ward9250.github.io/HybridCheck/.

  16. Mapping of the genomic regions controlling seed storability in soybean (Glycine max L.)

    Indian Academy of Sciences (India)

    Hamidreza Dargahi; Patcharin Tanya; Peerasak Srinives

    2014-08-01

    Seed storability is especially important in the tropics due to high temperature and relative humidity of storage environment that cause rapid deterioration of seeds in storage. The objective of this study was to use SSR markers to identify genomic regions associated with quantitative trait loci (QTLs) controlling seed storability based on relative germination rate in the F2:3 population derived from a cross between vegetable soybean line (MJ0004-6) with poor longevity and landrace cultivar from Myanmar (R18500) with good longevity. The F2:4 seeds harvested in 2011 and 2012 were used to investigate seed storability. The F2 population was genotyped with 148 markers and the genetic map consisted of 128 SSR loci which converged into 38 linkage groups covering 1664.3 cM of soybean genome. Single marker analysis revealed that 13 markers from six linkage groups (C1, D2, E, F, J and L) were associated with seed storability. Composite interval mapping identified a total of three QTLs on linkage groups C1, F and L with phenotypic variance explained ranging from 8.79 to 13.43%. The R18500 alleles increased seed storability at all of the detected QTLs. No common QTLs were found for storability of seeds harvested in 2011 and 2012. This study agreed with previous reports in other crops that genotype by environment interaction plays an important role in expression of seed storability.

  17. Molecular markers detect stable genomic regions underlying tomato fruit shelf life and weight

    Directory of Open Access Journals (Sweden)

    Guillermo Raúl Pratta

    2011-01-01

    Full Text Available Incorporating wild germplasm such as S. pimpinellifolium is an alternative strategy to prolong tomato fruit shelf life(SL without reducing fruit quality. A set of recombinant inbred lines with discrepant values of SL and weight (FW were derived byantagonistic-divergent selection from an interspecific cross. The general objective of this research was to evaluate Genotype x Year(GY and Marker x Year (MY interaction in these new genetic materials for both traits. Genotype and year principal effects and GYinteraction were statistically significant for SL. Genotype and year principal effects were significant for FW but GY interaction wasnot. The marker principal effect was significant for SL and FW but both year principal effect and MY interaction were not significant.Though SL was highly influenced by year conditions, some genome regions appeared to maintain a stable effect across years ofevaluation. Fruit weight, instead, was more independent of year effect.

  18. Novel Altered Region for Biomarker Discovery in Hepatocellular Carcinoma (HCC Using Whole Genome SNP Array

    Directory of Open Access Journals (Sweden)

    Esraa M. Hashem

    2016-04-01

    Full Text Available cancer represents one of the greatest medical causes of mortality. The majority of Hepatocellular carcinoma arises from the accumulation of genetic abnormalities, and possibly induced by exterior etiological factors especially HCV and HBV infections. There is a need for new tools to analysis the large sum of data to present relevant genetic changes that may be critical for both understanding how cancers develop and determining how they could ultimately be treated. Gene expression profiling may lead to new biomarkers that may help develop diagnostic accuracy for detecting Hepatocellular carcinoma. In this work, statistical technique (discrete stationary wavelet transform for detection of copy number alternations to analysis high-density single-nucleotide polymorphism array of 30 cell lines on specific chromosomes, which are frequently detected in Hepatocellular carcinoma have been proposed. The results demonstrate the feasibility of whole-genome fine mapping of copy number alternations via high-density single-nucleotide polymorphism genotyping, Results revealed that a novel altered chromosomal region is discovered; region amplification (4q22.1 have been detected in 22 out of 30-Hepatocellular carcinoma cell lines (73%. This region strike, AFF1 and DSPP, tumor suppressor genes. This finding has not previously reported to be involved in liver carcinogenesis; it can be used to discover a new HCC biomarker, which helps in a better understanding of hepatocellular carcinoma.

  19. Qualitative, quantitative and structural analysis of non- coding regions of classical swine fever virus genome

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    Classical swine fever virus (CSFV) is the pathogen of the swine fever. Understanding of the replication and expression of its genome is the basis for research of the pathogenicity for CSFV and development of antiviral drug. The noncoding regions (NCRs) of CSFV are the main regulatory regions for replication and expression. Qualitative, quantitative and structural analysis of 3′ NCRs and 5′ NCRs was done in order to locate the regulatory region in the NCRs and to character the NCRs. The sites, conserved sequences and structural elements related to the initiation of replication and expression were extracted from 17 3′ NCRs and 56 5′ NCRs. Those cis-elements may be initial recognition sites for replication, binding sites for transcription factors of host cell and interacting sites for initiation of protein synthesis, based on which a mechanism for the replication and expression of CSFV was brought forth. This research offers the direction for further experiment and lays down a basis for the research on hepatitis C virus (HCV), other pestiviruses and plus-strand RNA viruses.

  20. Characterization of the Helicoverpa assulta nucleopolyhedrovirus genome and sequence analysis of the polyhedrin gene region

    Indian Academy of Sciences (India)

    Soo-Dong Woo; Jae Young Choi; Yeon Ho Je; Byung Rae Jin

    2006-09-01

    A local strain of Helicoverpa assulta nucleopolyhedrovirus (HasNPV) was isolated from infected H. assulta larvae in Korea. Restriction endonuclease fragment analysis, using 4 restriction enzymes, estimated that the total genome size of HasNPV is about 138 kb. A degenerate polymerase chain reaction (PCR) primer set for the polyhedrin gene successfully amplified the partial polyhedrin gene of HasNPV. The sequencing results showed that the about 430 bp PCR product was a fragment of the corresponding polyhedrin gene. Using HasNPV partial predicted polyhedrin to probe the Southern blots, we identified the location of the polyhedrin gene within the 6 kb EcoRI, 15 kb NcoI, 20 kb XhoI, 17 kb BglII and 3 kb ClaI fragments, respectively. The 3 kb ClaI fragment was cloned and the nucleotide sequences of the polyhedrin coding region and its flaking regions were determined. Nucleotide sequence analysis indicated the presence of an open reading frame of 735 nucleotides which could encode 245 amino acids with a predicted molecular mass of 29 kDa. The nucleotide sequences within the coding region of HasNPV polyhedrin shared 73.7% identity with the polyhedrin gene from Autographa californica NPV but were most closely related to Helicoverpa and Heliothis species NPVs with over 99% sequence identity.

  1. Differential DNA methylation regions in cytokine and transcription factor genomic loci associate with childhood physical aggression.

    Directory of Open Access Journals (Sweden)

    Nadine Provençal

    Full Text Available BACKGROUND: Animal and human studies suggest that inflammation is associated with behavioral disorders including aggression. We have recently shown that physical aggression of boys during childhood is strongly associated with reduced plasma levels of cytokines IL-1α, IL-4, IL-6, IL-8 and IL-10, later in early adulthood. This study tests the hypothesis that there is an association between differential DNA methylation regions in cytokine genes in T cells and monocytes DNA in adult subjects and a trajectory of physical aggression from childhood to adolescence. METHODOLOGY/PRINCIPAL FINDINGS: We compared the methylation profiles of the entire genomic loci encompassing the IL-1α, IL-6, IL-4, IL-10 and IL-8 and three of their regulatory transcription factors (TF NFkB1, NFAT5 and STAT6 genes in adult males on a chronic physical aggression trajectory (CPA and males with the same background who followed a normal physical aggression trajectory (control group from childhood to adolescence. We used the method of methylated DNA immunoprecipitation with comprehensive cytokine gene loci and TF loci microarray hybridization, statistical analysis and false discovery rate correction. We found differentially methylated regions to associate with CPA in both the cytokine loci as well as in their transcription factors loci analyzed. Some of these differentially methylated regions were located in known regulatory regions whereas others, to our knowledge, were previously unknown as regulatory areas. However, using the ENCODE database, we were able to identify key regulatory elements in many of these regions that indicate that they might be involved in the regulation of cytokine expression. CONCLUSIONS: We provide here the first evidence for an association between differential DNA methylation in cytokines and their regulators in T cells and monocytes and male physical aggression.

  2. Genome-wide association study identified a narrow chromosome 1 region associated with chicken growth traits.

    Directory of Open Access Journals (Sweden)

    Liang Xie

    Full Text Available Chicken growth traits are important economic traits in broilers. A large number of studies are available on finding genetic factors affecting chicken growth. However, most of these studies identified chromosome regions containing putative quantitative trait loci and finding causal mutations is still a challenge. In this genome-wide association study (GWAS, we identified a narrow 1.5 Mb region (173.5-175 Mb of chicken (Gallus gallus chromosome (GGA 1 to be strongly associated with chicken growth using 47,678 SNPs and 489 F2 chickens. The growth traits included aggregate body weight (BW at 0-90 d of age measured weekly, biweekly average daily gains (ADG derived from weekly body weight, and breast muscle weight (BMW, leg muscle weight (LMW and wing weight (WW at 90 d of age. Five SNPs in the 1.5 Mb KPNA3-FOXO1A region at GGA1 had the highest significant effects for all growth traits in this study, including a SNP at 8.9 Kb upstream of FOXO1A for BW at 22-48 d and 70 d, a SNP at 1.9 Kb downstream of FOXO1A for WW, a SNP at 20.9 Kb downstream of ENSGALG00000022732 for ADG at 29-42 d, a SNP in INTS6 for BW at 90 d, and a SNP in KPNA3 for BMW and LMW. The 1.5 Mb KPNA3-FOXO1A region contained two microRNA genes that could bind to messenger ribonucleic acid (mRNA of IGF1, FOXO1A and KPNA3. It was further indicated that the 1.5 Mb GGA1 region had the strongest effects on chicken growth during 22-42 d.

  3. Identifying relationships among genomic disease regions: predicting genes at pathogenic SNP associations and rare deletions.

    Directory of Open Access Journals (Sweden)

    Soumya Raychaudhuri

    2009-06-01

    Full Text Available Translating a set of disease regions into insight about pathogenic mechanisms requires not only the ability to identify the key disease genes within them, but also the biological relationships among those key genes. Here we describe a statistical method, Gene Relationships Among Implicated Loci (GRAIL, that takes a list of disease regions and automatically assesses the degree of relatedness of implicated genes using 250,000 PubMed abstracts. We first evaluated GRAIL by assessing its ability to identify subsets of highly related genes in common pathways from validated lipid and height SNP associations from recent genome-wide studies. We then tested GRAIL, by assessing its ability to separate true disease regions from many false positive disease regions in two separate practical applications in human genetics. First, we took 74 nominally associated Crohn's disease SNPs and applied GRAIL to identify a subset of 13 SNPs with highly related genes. Of these, ten convincingly validated in follow-up genotyping; genotyping results for the remaining three were inconclusive. Next, we applied GRAIL to 165 rare deletion events seen in schizophrenia cases (less than one-third of which are contributing to disease risk. We demonstrate that GRAIL is able to identify a subset of 16 deletions containing highly related genes; many of these genes are expressed in the central nervous system and play a role in neuronal synapses. GRAIL offers a statistically robust approach to identifying functionally related genes from across multiple disease regions--that likely represent key disease pathways. An online version of this method is available for public use (http://www.broad.mit.edu/mpg/grail/.

  4. siRNA Targeting the 2Apro Genomic Region Prevents Enterovirus 71 Replication In Vitro

    Science.gov (United States)

    Kong, Zhenzhen; Shao, Qixiang; Su, Zhaoliang; Wang, Shengjun; Chen, Jianguo

    2016-01-01

    Enterovirus 71 (EV71) is the most important etiological agent of hand, foot, and mouth disease (HFMD) in young children, which is associated with severe neurological complications and has caused significant mortalities in recent HFMD outbreaks in Asia. However, there is no effective antiviral therapy against EV71. In this study, RNA interference (RNAi) was used as an antiviral strategy to inhibit EV71 replication. Three small interfering RNAs (siRNAs) targeting the 2Apro region of the EV71 genome were designed and synthesized. All the siRNAs were transfected individually into rhabdomyosarcoma (RD) cells, which were then infected with strain EV71-2006-52-9. The cytopathic effects (CPEs) in the infected RD cells, cell viability, viral titer, and viral RNA and protein expression were examined to evaluate the specific viral inhibition by the siRNAs. The results of cytopathogenicity and MTT tests indicated that the RD cells transfected with the three siRNAs showed slight CPEs and significantly high viability. The 50% tissue culture infective dose (TCID50) values demonstrated that the viral titer of the groups treated with three siRNAs were lower than those of the control groups. qRT–PCR and western blotting revealed that the levels of viral RNA and protein in the RD cells treated with the three siRNAs were lower than those in the controls. When RD cells transfected with siRNAs were also infected with strain EV71-2008-43-16, the expression of the VP1 protein was significantly inhibited. The levels of interferon α (IFN-α) and IFN-β did not differ significantly in any group. These results suggest that siRNAs targeting the 2Apro region of the EV71 genome exerted antiviral effects in vitro. PMID:26886455

  5. An exploration of the sequence of a 2.9-Mb region of the genome of Drosophila melanogaster: The Adh region

    Energy Technology Data Exchange (ETDEWEB)

    Ashburner, M.; Misra, S.; Roote, J.; Lewis, S.E.; Blazej, R.; Davis, T.; Doyle, C.; Galle, R.; George, R.; Harris, N.; Hartzell, G.; Harvey, D.; Hong, L.; Houston, K.; Hoskins, R.; Johnson, G.; Martin, C.; Moshrefi, A.; Palazzolo, M.; Reese, M.G.; Spradling, A.; Tsang, G.; Wan, K.; Whitelaw, K.; Kimmel, B.; Celniker, S.; Rubin, G.M.

    1999-03-24

    A contiguous sequence of nearly 3 Mb from the genome of Drosophila melanogaster has been sequenced from a series of overlapping P1 and BAC clones. This region covers 69 chromosome polytene bands on chromosome arm 2L, including the genetically well-characterized

  6. Genome-wide association study confirms SNPs in SNCA and the MAPT region as common risk factors for Parkinson disease.

    Science.gov (United States)

    Edwards, Todd L; Scott, William K; Almonte, Cherylyn; Burt, Amber; Powell, Eric H; Beecham, Gary W; Wang, Liyong; Züchner, Stephan; Konidari, Ioanna; Wang, Gaofeng; Singer, Carlos; Nahab, Fatta; Scott, Burton; Stajich, Jeffrey M; Pericak-Vance, Margaret; Haines, Jonathan; Vance, Jeffery M; Martin, Eden R

    2010-03-01

    Parkinson disease (PD) is a chronic neurodegenerative disorder with a cumulative prevalence of greater than one per thousand. To date three independent genome-wide association studies (GWAS) have investigated the genetic susceptibility to PD. These studies implicated several genes as PD risk loci with strong, but not genome-wide significant, associations. In this study, we combined data from two previously published GWAS of Caucasian subjects with our GWAS of 604 cases and 619 controls for a joint analysis with a combined sample size of 1752 cases and 1745 controls. SNPs in SNCA (rs2736990, p-value = 6.7 x 10(-8); genome-wide adjusted p = 0.0109, odds ratio (OR) = 1.29 [95% CI: 1.17-1.42] G vs. A allele, population attributable risk percent (PAR%) = 12%) and the MAPT region (rs11012, p-value = 5.6 x 10(-8); genome-wide adjusted p = 0.0079, OR = 0.70 [95% CI: 0.62-0.79] T vs. C allele, PAR%= 8%) were genome-wide significant. No other SNPs were genome-wide significant in this analysis. This study confirms that SNCA and the MAPT region are major genes whose common variants are influencing risk of PD.

  7. [Mutation frequencies in HIV-1 subtype-A genome in regions containing efficient RNAi targets].

    Science.gov (United States)

    Kravatsky, Y V; Chechetkin, V R; Fedoseeva, D M; Gorbacheva, M A; Kretova, O V; Tchurikov, N A

    2016-01-01

    The development of gene-therapy technology using RNAi for AIDS/HIV-1 treatment is a prospective alternative to traditional anti-retroviral therapy. RNAi targets could be selected in HIV-1 transcripts and in CCR5 mRNA. Previously, we experimentally selected a number of efficient siRNAs that target HIV-1 RNAs. The viral genome mutates frequently, and RNAi strength is very sensitive, even for a single mismatches. That is why it is important to study nucleotide sequences of targets in clinical isolates of HIV-1. In the present study, we analyzed mutations in 6 of about 300-bp regions containing RNAi targets from HIV-1 subtype A isolates in Russia. Estimates of the mean frequencies of mutations in the targets were obtained and the frequencies of mutations in the different codon positions were compared. The frequencies of mutations in the vicinity of the targets and directly within the targets were also compared and have been shown to be approximately the same. The frequencies of indels in the chosen regions have been assessed. Their frequencies have proved to be two to three orders of magnitude less compared to that for mutations.

  8. Genome-wide function of H2B ubiquitylation in promoter and genic regions.

    Science.gov (United States)

    Batta, Kiran; Zhang, Zhenhai; Yen, Kuangyu; Goffman, David B; Pugh, B Franklin

    2011-11-01

    Nucleosomal organization in and around genes may contribute substantially to transcriptional regulation. The contribution of histone modifications to genome-wide nucleosomal organization has not been systematically evaluated. In the present study, we examine the role of H2BK123 ubiquitylation, a key regulator of several histone modifications, on nucleosomal organization at promoter, genic, and transcription termination regions in Saccharomyces cerevisiae. Using high-resolution MNase chromatin immunoprecipitation and sequencing (ChIP-seq), we map nucleosome positioning and occupancy in mutants of the H2BK123 ubiquitylation pathway. We found that H2B ubiquitylation-mediated nucleosome formation and/or stability inhibits the assembly of the transcription machinery at normally quiescent promoters, whereas ubiquitylation within highly active gene bodies promotes transcription elongation. This regulation does not proceed through ubiquitylation-regulated histone marks at H3K4, K36, and K79. Our findings suggest that mechanistically similar functions of H2B ubiquitylation (nucleosome assembly) elicit different functional outcomes on genes depending on its positional context in promoters (repressive) versus transcribed regions (activating).

  9. Gametic phase estimation over large genomic regions using an adaptive window approach

    Directory of Open Access Journals (Sweden)

    Excoffier Laurent

    2003-11-01

    Full Text Available Abstract The authors present ELB, an easy to programme and computationally fast algorithm for inferring gametic phase in population samples of multilocus genotypes. Phase updates are made on the basis of a window of neighbouring loci, and the window size varies according to the local level of linkage disequilibrium. Thus, ELB is particularly well suited to problems involving many loci and/or relatively large genomic regions, including those with variable recombination rate. The authors have simulated population samples of single nucleotide polymorphism genotypes with varying levels of recombination and marker density, and find that ELB provides better local estimation of gametic phase than the PHASE or HTYPER programs, while its global accuracy is broadly similar. The relative improvement in local accuracy increases both with increasing recombination and with increasing marker density. Short tandem repeat (STR, or microsatellite simulation studies demonstrate ELB's superiority over PHASE both globally and locally. Missing data are handled by ELB; simulations show that phase recovery is virtually unaffected by up to 2 per cent of missing data, but that phase estimation is noticeably impaired beyond this amount. The authors also applied ELB to datasets obtained from random pairings of 42 human X chromosomes typed at 97 diallelic markers in a 200 kb low-recombination region. Once again, they found ELB to have consistently better local accuracy than PHASE or HTYPER, while its global accuracy was close to the best.

  10. Genome size and base composition of five Pinus species from the Balkan region.

    Science.gov (United States)

    Bogunic, F; Muratovic, E; Brown, S C; Siljak-Yakovlev, S

    2003-08-01

    The 2C DNA content and base composition of five Pinus (2 n=24) species and two Pinus subspecies from the Balkan region have been estimated by flow cytometry. P. heldreichii (five populations) and P. peuce (one population) were assessed for the first time, as also were subspecies of P. nigra (three populations-two of subspecies nigra and one of subspecies dalmatica) along with P. sylvestris, and P. mugo from the same region. The 2C DNA values of these Pinus ranged from 42.5 pg to 54.9 pg (41.7-53.8 x 10(9)bp), and the base composition was quite stable (about 39.5% GC). Significant differences were observed between two subspecies of P. nigra and even between two populations of subsp. nigra. The two other species (P. sylvestris and P. mugo) had 2C values of 42.5 pg and 42.8 pg, respectively, while that of P. peuce was 54.9 pg. These genome sizes are in accordance with published values except for P. sylvestris, which was 20% below estimates made by other authors.

  11. Selection Under Domestication: Evidence for a Sweep in the Rice Waxy Genomic Region

    Science.gov (United States)

    Olsen, Kenneth M.; Caicedo, Ana L.; Polato, Nicholas; McClung, Anna; McCouch, Susan; Purugganan, Michael D.

    2006-01-01

    Rice (Oryza sativa) was cultivated by Asian Neolithic farmers >11,000 years ago, and different cultures have selected for divergent starch qualities in the rice grain during and after the domestication process. An intron 1 splice donor site mutation of the Waxy gene is responsible for the absence of amylose in glutinous rice varieties. This mutation appears to have also played an important role in the origin of low amylose, nonglutinous temperate japonica rice varieties, which form a primary component of Northeast Asian cuisines. Waxy DNA sequence analyses indicate that the splice donor mutation is prevalent in temperate japonica rice varieties, but rare or absent in tropical japonica, indica, aus, and aromatic varieties. Sequence analysis across a 500-kb genomic region centered on Waxy reveals patterns consistent with a selective sweep in the temperate japonicas associated with the mutation. The size of the selective sweep (>250 kb) indicates very strong selection in this region, with an inferred selection coefficient that is higher than similar estimates from maize domestication genes or wild species. These findings demonstrate that selection pressures associated with crop domestication regimes can exceed by one to two orders of magnitude those observed for genes under even strong selection in natural systems. PMID:16547098

  12. Visualization of shared genomic regions and meiotic recombination in high-density SNP data.

    Directory of Open Access Journals (Sweden)

    Elisha D O Roberson

    Full Text Available BACKGROUND: A fundamental goal of single nucleotide polymorphism (SNP genotyping is to determine the sharing of alleles between individuals across genomic loci. Such analyses have diverse applications in defining the relatedness of individuals (including unexpected relationships in nominally unrelated individuals, or consanguinity within pedigrees, analyzing meiotic crossovers, and identifying a broad range of chromosomal anomalies such as hemizygous deletions and uniparental disomy, and analyzing population structure. PRINCIPAL FINDINGS: We present SNPduo, a command-line and web accessible tool for analyzing and visualizing the relatedness of any two individuals using identity by state. Using identity by state does not require prior knowledge of allele frequencies or pedigree information, and is more computationally tractable and is less affected by population stratification than calculating identity by descent probabilities. The web implementation visualizes shared genomic regions, and generates UCSC viewable tracks. The command-line version requires pedigree information for compatibility with existing software and determining specified relationships even though pedigrees are not required for IBS calculation, generates no visual output, is written in portable C++, and is well-suited to analyzing large datasets. We demonstrate how the SNPduo web tool identifies meiotic crossover positions in siblings, and confirm our findings by visualizing meiotic recombination in synthetic three-generation pedigrees. We applied SNPduo to 210 nominally unrelated Phase I / II HapMap samples and, consistent with previous findings, identified six undeclared pairs of related individuals. We further analyzed identity by state in 2,883 individuals from multiplex families with autism and identified a series of anomalies including related parents, an individual with mosaic loss of chromosome 18, an individual with maternal heterodisomy of chromosome 16, and

  13. Organization and evolution of a gene-rich region of the mouse genome: a 12.7-Mb region deleted in the Del(13)Svea36H mouse.

    Science.gov (United States)

    Mallon, Ann-Marie; Wilming, Laurens; Weekes, Joseph; Gilbert, James G R; Ashurst, Jennifer; Peyrefitte, Sandrine; Matthews, Lucy; Cadman, Matthew; McKeone, Richard; Sellick, Chris A; Arkell, Ruth; Botcherby, Marc R M; Strivens, Mark A; Campbell, R Duncan; Gregory, Simon; Denny, Paul; Hancock, John M; Rogers, Jane; Brown, Steve D M

    2004-10-01

    Del(13)Svea36H (Del36H) is a deletion of approximately 20% of mouse chromosome 13 showing conserved synteny with human chromosome 6p22.1-6p22.3/6p25. The human region is lost in some deletion syndromes and is the site of several disease loci. Heterozygous Del36H mice show numerous phenotypes and may model aspects of human genetic disease. We describe 12.7 Mb of finished, annotated sequence from Del36H. Del36H has a higher gene density than the draft mouse genome, reflecting high local densities of three gene families (vomeronasal receptors, serpins, and prolactins) which are greatly expanded relative to human. Transposable elements are concentrated near these gene families. We therefore suggest that their neighborhoods are gene factories, regions of frequent recombination in which gene duplication is more frequent. The gene families show different proportions of pseudogenes, likely reflecting different strengths of purifying selection and/or gene conversion. They are also associated with relatively low simple sequence concentrations, which vary across the region with a periodicity of approximately 5 Mb. Del36H contains numerous evolutionarily conserved regions (ECRs). Many lie in noncoding regions, are detectable in species as distant as Ciona intestinalis, and therefore are candidate regulatory sequences. This analysis will facilitate functional genomic analysis of Del36H and provides insights into mouse genome evolution.

  14. Genomic characterization of Sinorhizobium meliloti AK21, a wild isolate from the Aral Sea Region.

    Science.gov (United States)

    Molina-Sánchez, María Dolores; López-Contreras, José Antonio; Toro, Nicolás; Fernández-López, Manuel

    2015-01-01

    The symbiotic, nitrogen-fixing bacterium Sinorhizobium meliloti has been widely studied due to its ability to improve crop yields through direct interactions with leguminous plants. S. meliloti AK21 is a wild type strain that forms nodules on Medicago plants in saline and drought conditions in the Aral Sea Region. The aim of this work was to establish the genetic similarities and differences between S. meliloti AK21 and the reference strain S. meliloti 1021. Comparative genome hybridization with the model reference strain S. meliloti 1021 yielded 365 variable genes, grouped into 11 regions in the three main replicons in S. meliloti AK21. The most extensive regions of variability were found in the symbiotic plasmid pSymA, which also contained the largest number of orthologous and polymorphic sequences identified by suppression subtractive hybridization. This procedure identified a large number of divergent sequences and others without homology in the databases, the further investigation of which could provide new insight into the alternative metabolic pathways present in S. meliloti AK21. We identified a plasmid replication module from the repABC replicon family, together with plasmid mobilization-related genes (traG and a VirB9-like protein), which suggest that this indigenous isolate harbors an accessory plasmid. Furthermore, the transcriptomic profiles reflected differences in gene content and regulation between S. meliloti AK21 and S. meliloti 1021 (ExpR and PhoB regulons), but provided evidence for an as yet unknown, alternative mechanism involving activation of the cbb3 terminal oxidase. Finally, phenotypic microarrays characterization revealed a greater versatility of substrate use and chemical degradation than for S. meliloti 1021.

  15. Genome-scale prediction of proteins with long intrinsically disordered regions.

    Science.gov (United States)

    Peng, Zhenling; Mizianty, Marcin J; Kurgan, Lukasz

    2014-01-01

    Proteins with long disordered regions (LDRs), defined as having 30 or more consecutive disordered residues, are abundant in eukaryotes, and these regions are recognized as a distinct class of biologically functional domains. LDRs facilitate various cellular functions and are important for target selection in structural genomics. Motivated by the lack of methods that directly predict proteins with LDRs, we designed Super-fast predictor of proteins with Long Intrinsically DisordERed regions (SLIDER). SLIDER utilizes logistic regression that takes an empirically chosen set of numerical features, which consider selected physicochemical properties of amino acids, sequence complexity, and amino acid composition, as its inputs. Empirical tests show that SLIDER offers competitive predictive performance combined with low computational cost. It outperforms, by at least a modest margin, a comprehensive set of modern disorder predictors (that can indirectly predict LDRs) and is 16 times faster compared to the best currently available disorder predictor. Utilizing our time-efficient predictor, we characterized abundance and functional roles of proteins with LDRs over 110 eukaryotic proteomes. Similar to related studies, we found that eukaryotes have many (on average 30.3%) proteins with LDRs with majority of proteomes having between 25 and 40%, where higher abundance is characteristic to proteomes that have larger proteins. Our first-of-its-kind large-scale functional analysis shows that these proteins are enriched in a number of cellular functions and processes including certain binding events, regulation of catalytic activities, cellular component organization, biogenesis, biological regulation, and some metabolic and developmental processes. A webserver that implements SLIDER is available at http://biomine.ece.ualberta.ca/SLIDER/.

  16. Identification of nine genomic regions of amplification in urothelial carcinoma, correlation with stage, and potential prognostic and therapeutic value.

    Directory of Open Access Journals (Sweden)

    Yvonne Chekaluk

    Full Text Available We performed a genome wide analysis of 164 urothelial carcinoma samples and 27 bladder cancer cell lines to identify copy number changes associated with disease characteristics, and examined the association of amplification events with stage and grade of disease. Multiplex inversion probe (MIP analysis, a recently developed genomic technique, was used to study 80 urothelial carcinomas to identify mutations and copy number changes. Selected amplification events were then analyzed in a validation cohort of 84 bladder cancers by multiplex ligation-dependent probe assay (MLPA. In the MIP analysis, 44 regions of significant copy number change were identified using GISTIC. Nine gene-containing regions of amplification were selected for validation in the second cohort by MLPA. Amplification events at these 9 genomic regions were found to correlate strongly with stage, being seen in only 2 of 23 (9% Ta grade 1 or 1-2 cancers, in contrast to 31 of 61 (51% Ta grade 3 and T2 grade 2 cancers, p<0.001. These observations suggest that analysis of genomic amplification of these 9 regions might help distinguish non-invasive from invasive urothelial carcinoma, although further study is required. Both MIP and MLPA methods perform well on formalin-fixed paraffin-embedded DNA, enhancing their potential clinical use. Furthermore several of the amplified genes identified here (ERBB2, MDM2, CCND1 are potential therapeutic targets.

  17. Phylogenetic placement of Cynomorium in Rosales inferred from sequences of the inverted repeat region of the chloroplast genome

    Institute of Scientific and Technical Information of China (English)

    Zhi-Hong ZHANG; Chun-Qi LI; Jian-hua LI

    2009-01-01

    Cynomorium is a herbaceous holoparasite that has been placed in Santalales, Saxifragales, Myrtales, or Sapindales. The inverted repeat (IR) region of the chloroplast genome region is slow evolving and, unlike mitochondrial genes, the chloroplast genome experiences few horizontal gene transfers between the host and parasite. Thus, in the present study, we used sequences of the IR region to test the phylogenetic placements of Cynomorium. Phylogenetic analyses of the chloroplast IR sequences generated largely congruent ordinal relationships with those from previous studies of angiosperm phylogeny based on single or multiple genes. Santalales was closely related to Caryophyllales and asterids. Saxifragales formed a clade where Peridiscus was sister to the remainder of the order, whereas Paeonia was sister to the woody clade of Saxifragales. Cynomorium is not closely related to Santalales, Saxifragales, Myrtales, or Sapindales; instead, it is included in Rosales and sister to Rosaceae. The various placements of the holoparasite on the basis of different regions of the mitochondrial genome may indicate the heterogeneous nature of the genome in the parasite. However, it is unlikely that the placement of Cynomorium in Rosales is the result of chloroplast gene transfer because Cynomorium does not parasitize on rosaceous plants and there is no chloroplast gene transfer between Cynomorium and Nitraria, a confirmed host of Cynomorium and a member of Sapindales.

  18. Fractionation of Synteny in a Genomic Region Containing Tandemly Duplicated Genes Across Glycine max, Medicago truncatula and Arabidopsis thaliana

    Science.gov (United States)

    Extended comparison of gene sequences found on homeologous soybean BACs to Medicago truncatula and Arabidopsis thaliana genomic sequences demonstrated a network of synteny within conserved regions interrupted by gene addition and/or deletions. Consolidation of gene order among all three species prov...

  19. Two distinct plastid genome configurations and unprecedented intraspecies length variation in the accD coding region in Medicago truncatula.

    Science.gov (United States)

    Gurdon, Csanad; Maliga, Pal

    2014-08-01

    We fully sequenced four and partially sequenced six additional plastid genomes of the model legume Medicago truncatula. Three accessions, Jemalong 2HA, Borung and Paraggio, belong to ssp. truncatula, and R108 to ssp. tricycla. We report here that the R108 ptDNA has a ~45-kb inversion compared with the ptDNA in ssp. truncatula, mediated by a short, imperfect repeat. DNA gel blot analyses of seven additional ssp. tricycla accessions detected only one of the two alternative genome arrangements, represented by three and four accessions each. Furthermore, we found a variable number of repeats in the essential accD and ycf1 coding regions. The repeats within accD are recombinationally active, yielding variable-length insertions and deletions in the central part of the coding region. The length of ACCD was distinct in each of the 10 sequenced ecotypes, ranging between 650 and 796 amino acids. The repeats in the ycf1 coding region are also recombinationally active, yielding short indels in 10 regions of the reading frames. Thus, the plastid genome variability we report here could be linked to repeat-mediated genome rearrangements. However, the rate of recombination was sufficiently low, so that no heterogeneity of ptDNA could be observed in populations maintained by single-seed descent.

  20. The highly recombinogenic bz locus lies in an unusually gene-rich region of the maize genome.

    Science.gov (United States)

    Fu, H; Park, W; Yan, X; Zheng, Z; Shen, B; Dooner, H K

    2001-07-17

    The bronze (bz) locus exhibits the highest rate of recombination of any gene in higher plants. To investigate the possible basis of this high rate of recombination, we have analyzed the physical organization of the region around the bz locus. Two adjacent bacterial artificial chromosome clones, comprising a 240-kb contig centered around the Bz-McC allele, were isolated, and 60 kb of contiguous DNA spanning the two bacterial artificial chromosome clones was sequenced. We find that the bz locus lies in an unusually gene-rich region of the maize genome. Ten genes, at least eight of which are shown to be transcribed, are contained in a 32-kb stretch of DNA that is uninterrupted by retrotransposons. We have isolated nearly full length cDNAs corresponding to the five proximal genes in the cluster. The average intertranscript distance between them is just 1 kb, revealing a surprisingly compact packaging of adjacent genes in this part of the genome. At least 11 small insertions, including several previously described miniature inverted repeat transposable elements, were detected in the introns and 3' untranslated regions of genes and between genes. The gene-rich region is flanked at the proximal and distal ends by retrotransposon blocks. Thus, the maize genome appears to have scattered regions of high gene density similar to those found in other plants. The unusually high rate of intragenic recombination seen in bz may be related to the very high gene density of the region.

  1. Large-insert BAC/YAC libraries for selective re-isolation of genomic regions by homologous recombination in yeast.

    Science.gov (United States)

    Zeng, C; Kouprina, N; Zhu, B; Cairo, A; Hoek, M; Cross, G; Osoegawa, K; Larionov, V; de Jong, P

    2001-09-01

    We constructed representative large-insert bacterial artificial chromosome (BAC) libraries of two human pathogens (Trypanosoma brucei and Giardia lamblia) using a new hybrid vector, pTARBAC1, containing a yeast artificial chromosome (YAC) cassette (a yeast selectable marker and a centromere). The cassette allows transferring of BACs into yeast for their further modification. Furthermore, the new hybrid vector provides the opportunity to re-isolate each DNA insert without construction of a new library of random clones. Digestion of a BAC DNA by an endonuclease that has no recognition site in the vector, but which deletes most of the internal insert sequence and leaves the unique flanking sequences, converts a BAC into a TAR vector, thus allowing direct gene isolation. Cotransformation of a TAR vector and genomic DNA into yeast spheroplasts, and subsequent recombination between the TAR vector's flanking ends and a specific genomic fragment, allows rescue of the fragment as a circular YAC/BAC molecule. Here we prove a new cloning strategy by re-isolation of randomly chosen genomic fragments of different size from T. brucei cloned in BACs. We conclude that genomic regions of unicellular eukaryotes can be easily re-isolated using this technique, which provides an opportunity to study evolution of these genomes and the role of genome instability in pathogenicity.

  2. Lung adenocarcinoma of never smokers and smokers harbor differential regions of genetic alteration and exhibit different levels of genomic instability.

    Directory of Open Access Journals (Sweden)

    Kelsie L Thu

    Full Text Available Recent evidence suggests that the observed clinical distinctions between lung tumors in smokers and never smokers (NS extend beyond specific gene mutations, such as EGFR, EML4-ALK, and KRAS, some of which have been translated into targeted therapies. However, the molecular alterations identified thus far cannot explain all of the clinical and biological disparities observed in lung tumors of NS and smokers. To this end, we performed an unbiased genome-wide, comparative study to identify novel genomic aberrations that differ between smokers and NS. High resolution whole genome DNA copy number profiling of 69 lung adenocarcinomas from smokers (n = 39 and NS (n = 30 revealed both global and regional disparities in the tumor genomes of these two groups. We found that NS lung tumors had a greater proportion of their genomes altered than those of smokers. Moreover, copy number gains on chromosomes 5q, 7p, and 16p occurred more frequently in NS. We validated our findings in two independently generated public datasets. Our findings provide a novel line of evidence distinguishing genetic differences between smoker and NS lung tumors, namely, that the extent of segmental genomic alterations is greater in NS tumors. Collectively, our findings provide evidence that these lung tumors are globally and genetically different, which implies they are likely driven by distinct molecular mechanisms.

  3. Exploring an Annotated Sequence Assembly of the Perennial Ryegrass Genome for Genomic Regions Enriched for Trait Associated Variants

    DEFF Research Database (Denmark)

    Byrne, Stephen; Cericola, Fabio; Janss, Luc;

    2015-01-01

    Perennial ryegrass (Lolium perenne L.) is an outbreeding diploid species and one of the most important forage crops used in temperate agriculture. We have developed a draft sequence assembly of the perennial ryegrass genome and annotated it with the aid of RNA-seq data from various genotypes, plant...... components, and treatments. We predicted 39,795 high quality proteins originating from 28,182 genetic loci. We wanted to use the annotated assembly to study if SNPs falling within various annotation classes explain differing proportions of the variance for traits such as heading date and rust resistance...

  4. Sequencing of 15 622 gene-bearing BACs clarifies the gene-dense regions of the barley genome.

    Science.gov (United States)

    Muñoz-Amatriaín, María; Lonardi, Stefano; Luo, MingCheng; Madishetty, Kavitha; Svensson, Jan T; Moscou, Matthew J; Wanamaker, Steve; Jiang, Tao; Kleinhofs, Andris; Muehlbauer, Gary J; Wise, Roger P; Stein, Nils; Ma, Yaqin; Rodriguez, Edmundo; Kudrna, Dave; Bhat, Prasanna R; Chao, Shiaoman; Condamine, Pascal; Heinen, Shane; Resnik, Josh; Wing, Rod; Witt, Heather N; Alpert, Matthew; Beccuti, Marco; Bozdag, Serdar; Cordero, Francesca; Mirebrahim, Hamid; Ounit, Rachid; Wu, Yonghui; You, Frank; Zheng, Jie; Simková, Hana; Dolezel, Jaroslav; Grimwood, Jane; Schmutz, Jeremy; Duma, Denisa; Altschmied, Lothar; Blake, Tom; Bregitzer, Phil; Cooper, Laurel; Dilbirligi, Muharrem; Falk, Anders; Feiz, Leila; Graner, Andreas; Gustafson, Perry; Hayes, Patrick M; Lemaux, Peggy; Mammadov, Jafar; Close, Timothy J

    2015-10-01

    Barley (Hordeum vulgare L.) possesses a large and highly repetitive genome of 5.1 Gb that has hindered the development of a complete sequence. In 2012, the International Barley Sequencing Consortium released a resource integrating whole-genome shotgun sequences with a physical and genetic framework. However, because only 6278 bacterial artificial chromosome (BACs) in the physical map were sequenced, fine structure was limited. To gain access to the gene-containing portion of the barley genome at high resolution, we identified and sequenced 15 622 BACs representing the minimal tiling path of 72 052 physical-mapped gene-bearing BACs. This generated ~1.7 Gb of genomic sequence containing an estimated 2/3 of all Morex barley genes. Exploration of these sequenced BACs revealed that although distal ends of chromosomes contain most of the gene-enriched BACs and are characterized by high recombination rates, there are also gene-dense regions with suppressed recombination. We made use of published map-anchored sequence data from Aegilops tauschii to develop a synteny viewer between barley and the ancestor of the wheat D-genome. Except for some notable inversions, there is a high level of collinearity between the two species. The software HarvEST:Barley provides facile access to BAC sequences and their annotations, along with the barley-Ae. tauschii synteny viewer. These BAC sequences constitute a resource to improve the efficiency of marker development, map-based cloning, and comparative genomics in barley and related crops. Additional knowledge about regions of the barley genome that are gene-dense but low recombination is particularly relevant.

  5. Genomic variability of Helicobacter pylori isolates of gastric regions from two Colombian populations

    Science.gov (United States)

    Matta, Andrés Jenuer; Pazos, Alvaro Jairo; Bustamante-Rengifo, Javier Andrés; Bravo, Luis Eduardo

    2017-01-01

    AIM To compare the genomic variability and the multiple colonization of Helicobacter pylori (H. pylori) in patients with chronic gastritis from two Colombian populations with contrast in the risk of developing gastric cancer (GC): Túquerres-Nariño (High risk) and Tumaco-Nariño (Low risk). METHODS Four hundred and nine patients from both genders with dyspeptic symptoms were studied. Seventy-two patients were included in whom H. pylori was isolated from three anatomic regions of the gastric mucosa, (31/206) of the high risk population of GC (Túquerres) and (41/203) of the low risk population of GC (Tumaco). The isolates were genotyped by PCR-RAPD. Genetic diversity between the isolates was evaluated by conglomerates analysis and multiple correspondence analyses. RESULTS The proportion of virulent genotypes of H. pylori was 99% in Túquerres and 94% in Tumaco. The coefficient of similarity of Nei-Li showed greater genetic diversity among isolates of Túquerres (0.13) than those of Tumaco (0.07). After adjusting by age, gender and type of gastritis, the multiple colonization was 1.7 times more frequent in Túquerres than in Tumaco (P = 0.05). CONCLUSION In Túquerres, high risk of GC there was a greater probability of multiple colonization by H. pylori. From the analysis of the results of the PCR-RAPD, it was found higher genetic variability in the isolates of H. pylori in the population of high risk for the development of GC. PMID:28223724

  6. Genomic regions associated with ventro-cranial chronic pleuritis in pig

    DEFF Research Database (Denmark)

    Sørensen, Kirsten Kørup; Gregersen, Vivi Raundahl; Christensen, Ole Fredslund

    2011-01-01

    Ventro-cranial chronic pleuritis can be a result of pleuropneumonia and enzootic pneumonia. These diseases cause severe losses in intensive pig production worldwide, but host resistance is difficult to breed for. It could be beneficial to use marker-assisted selection, and a step towards this is ......Ventro-cranial chronic pleuritis can be a result of pleuropneumonia and enzootic pneumonia. These diseases cause severe losses in intensive pig production worldwide, but host resistance is difficult to breed for. It could be beneficial to use marker-assisted selection, and a step towards...... this is to identify genomic regions associated with the trait. For this purpose, 7304 pigs from 11 boar families were analysed for associations between single nucleotide polymorphisms and ventro-cranial chronic pleuritis. The pigs were genotyped by the use of the iSelect Custom 7 K porcine SNP Chip. Quantitative...... of candidate genes, but the causative mutations still need to be identified. Markers closely associated with the resistance traits have a strong potential for use in breeding towards animals with improved characteristics concerning ventro-cranial chronic pleuritis...

  7. Genomic and network patterns of schizophrenia genetic variation in human evolutionary accelerated regions.

    Science.gov (United States)

    Xu, Ke; Schadt, Eric E; Pollard, Katherine S; Roussos, Panos; Dudley, Joel T

    2015-05-01

    The population persistence of schizophrenia despite associated reductions in fitness and fecundity suggests that the genetic basis of schizophrenia has a complex evolutionary history. A recent meta-analysis of schizophrenia genome-wide association studies offers novel opportunities for assessment of the evolutionary trajectories of schizophrenia-associated loci. In this study, we hypothesize that components of the genetic architecture of schizophrenia are attributable to human lineage-specific evolution. Our results suggest that schizophrenia-associated loci enrich in genes near previously identified human accelerated regions (HARs). Specifically, we find that genes near HARs conserved in nonhuman primates (pHARs) are enriched for schizophrenia-associated loci, and that pHAR-associated schizophrenia genes are under stronger selective pressure than other schizophrenia genes and other pHAR-associated genes. We further evaluate pHAR-associated schizophrenia genes in regulatory network contexts to investigate associated molecular functions and mechanisms. We find that pHAR-associated schizophrenia genes significantly enrich in a GABA-related coexpression module that was previously found to be differentially regulated in schizophrenia affected individuals versus healthy controls. In another two independent networks constructed from gene expression profiles from prefrontal cortex samples, we find that pHAR-associated schizophrenia genes are located in more central positions and their average path lengths to the other nodes are significantly shorter than those of other schizophrenia genes. Together, our results suggest that HARs are associated with potentially important functional roles in the genetic architecture of schizophrenia.

  8. Molecular analysis of the Adh region of the genome of Drosophila melanogaster.

    Science.gov (United States)

    Chia, W; Karp, R; McGill, S; Ashburner, M

    1985-12-20

    A small region of the genome of Drosophila melanogaster has been cloned in a series of overlapping phage. A length of 165 X 10(3) base-pairs of contiguous DNA that spans polytene chromosome region 35A4 to 35B1 and includes the structural gene for alcohol dehydrogenase (Adh) as well as at least two other genes, outspread (osp) and no-ocelli (noc), has been characterized by mapping chromosome aberrations to the DNA. The relationship between osp and Adh is surprising: of nine osp alleles associated with chromosome breakpoints, five map distal (i.e. 5') to Adh and four map proximal (i.e. 3') to this gene. None affects the expression of Adh. As defined by these and other breakpoints, the osp gene spans at least 52 X 10(3) base-pairs and overlaps the Adh gene. The noc gene, as defined by the mapping of nearly 30 breakpoints, is at least 50 X 10(3) base-pairs in size. Alleles of noc and noc- deletions show either of two kinds of interaction with the recessive lethality of l(2)br29ScoR+1, a lethal that maps immediately distal to noc. One class of noc allele is viable when heterozygous with ScoR+1, while the other class is lethal or semi-lethal. Both classes, however, are homozygous or hemizygous viable. The locations of these two classes of noc allele on the DNA fall into two clusters, with those that are viable with ScoR+1 located proximal to those that are not. The physical boundary between these classes lies at a site just distal to that of the breakpoint of the inversion associated with ScoR+1 itself.

  9. Allelic variation in a willow warbler genomic region is associated with climate clines.

    Directory of Open Access Journals (Sweden)

    Keith W Larson

    Full Text Available Local adaptation is an important process contributing to population differentiation which can occur in continuous or isolated populations connected by various amounts of gene flow. The willow warbler (Phylloscopus trochilus is one of the most common songbirds in Fennoscandia. It has a continuous breeding distribution where it is found in all forested habitats from sea level to the tree line and therefore constitutes an ideal species for the study of locally adapted genes associated with environmental gradients. Previous studies in this species identified a genetic marker (AFLP-WW1 that showed a steep north-south cline in central Sweden with one allele associated with coastal lowland habitats and the other with mountainous habitats. It was further demonstrated that this marker is embedded in a highly differentiated chromosome region that spans several megabases. In the present study, we sampled 2,355 individuals at 128 sites across all of Fennoscandia to study the geographic and climatic variables associated with the allele frequency distributions of WW1. Our results demonstrate that 1 allele frequency patterns significantly differ between mountain and lowland populations, 2 these allele differences coincide with extreme temperature conditions and the short growing season in the mountains, and milder conditions in coastal areas, and 3 the northern-allele or "altitude variant" of WW1 occurs in willow warblers that occupy mountainous habitat regardless of subspecies. Finally these results suggest that climate may exert selection on the genomic region associated with these alleles and would allow us to develop testable predictions for the distribution of the genetic marker based on climate change scenarios.

  10. Combination of native and denaturing PAGE for the detection of protein binding regions in long fragments of genomic DNA

    Directory of Open Access Journals (Sweden)

    Metsis Madis

    2008-06-01

    Full Text Available Abstract Background In a traditional electrophoresis mobility shift assay (EMSA a 32P-labeled double-stranded DNA oligonucleotide or a restriction fragment bound to a protein is separated from the unbound DNA by polyacrylamide gel electrophoresis (PAGE in nondenaturing conditions. An extension of this method uses the large population of fragments derived from long genomic regions (approximately 600 kb for the identification of fragments containing protein binding regions. With this method, genomic DNA is fragmented by restriction enzymes, fragments are amplified by PCR, radiolabeled, incubated with nuclear proteins and the resulting DNA-protein complexes are separated by two-dimensional PAGE. Shifted DNA fragments containing protein binding sites are identified by using additional procedures, i. e. gel elution, PCR amplification, cloning and sequencing. Although the method allows simultaneous analysis of a large population of fragments, it is relatively laborious and can be used to detect only high affinity protein binding sites. Here we propose an alternative and straightforward strategy which is based on a combination of native and denaturing PAGE. This strategy allows the identification of DNA fragments containing low as well as high affinity protein binding regions, derived from genomic DNA ( Results We have combined an EMSA-based selection step with subsequent denaturing PAGE for the localization of protein binding regions in long (up to10 kb fragments of genomic DNA. Our strategy consists of the following steps: digestion of genomic DNA with a 4-cutter restriction enzyme (AluI, BsuRI, TruI, etc, separation of low and high molecular weight fractions of resultant DNA fragments, 32P-labeling with Klenow polymerase, traditional EMSA, gel elution and identification of the shifted bands (or smear by denaturing PAGE. The identification of DNA fragments containing protein binding sites is carried out by running the gel-eluted fragments alongside

  11. A genomic region of lactococcal temperate bacteriophage TP901-1 encoding major virion proteins

    DEFF Research Database (Denmark)

    Johnsen, Mads G.; Appel, Karen Fuglede; Madsen, Hans Peter Lynge;

    1996-01-01

    Two major structural proteins, MHP (major head protein) and MTP (major tail protein), from the lactococcal temperate phage TP901-1 were sequenced at their amino acid termini, and derived degenerate oligonucleotides were used to locate the corresponding genes in the phage genome. This genomic regi...

  12. Identifying selected regions from heterozygosity and divergence using a light-coverage genomic dataset from two human populations.

    Directory of Open Access Journals (Sweden)

    Taras K Oleksyk

    Full Text Available When a selective sweep occurs in the chromosomal region around a target gene in two populations that have recently separated, it produces three dramatic genomic consequences: 1 decreased multi-locus heterozygosity in the region; 2 elevated or diminished genetic divergence (F(ST of multiple polymorphic variants adjacent to the selected locus between the divergent populations, due to the alternative fixation of alleles; and 3 a consequent regional increase in the variance of F(ST (S(2F(ST for the same clustered variants, due to the increased alternative fixation of alleles in the loci surrounding the selection target. In the first part of our study, to search for potential targets of directional selection, we developed and validated a resampling-based computational approach; we then scanned an array of 31 different-sized moving windows of SNP variants (5-65 SNPs across the human genome in a set of European and African American population samples with 183,997 SNP loci after correcting for the recombination rate variation. The analysis revealed 180 regions of recent selection with very strong evidence in either population or both. In the second part of our study, we compared the newly discovered putative regions to those sites previously postulated in the literature, using methods based on inspecting patterns of linkage disequilibrium, population divergence and other methodologies. The newly found regions were cross-validated with those found in nine other studies that have searched for selection signals. Our study was replicated especially well in those regions confirmed by three or more studies. These validated regions were independently verified, using a combination of different methods and different databases in other studies, and should include fewer false positives. The main strength of our analysis method compared to others is that it does not require dense genotyping and therefore can be used with data from population-based genome SNP scans

  13. The complete genome sequence of a Crimean-Congo Hemorrhagic Fever virus isolated from an endemic region in Kosovo

    Directory of Open Access Journals (Sweden)

    Dedushaj Iusuf

    2008-01-01

    Full Text Available Abstract The Balkan region and Kosovo in particular, is a well-known Crimean-Congo hemorrhagic fever (CCHF endemic region, with frequent epidemic outbreaks and sporadic cases occurring with a hospitalized case fatality of approximately 30%. Recent analysis of complete genome sequences of diverse CCHF virus strains showed that the genome plasticity of the virus is surprisingly high for an arthropod-borne virus. High levels of nucleotide and amino acid differences, frequent RNA segment reassortment and even RNA recombination have been recently described. This diversity illustrates the need to determine the complete genome sequence of CCHF virus representatives of all geographically distinct endemic areas, particularly in light of the high pathogenicity of the virus and its listing as a potential bioterrorism threat. Here we describe the first complete CCHF virus genome sequence of a virus (strain Kosova Hoti isolated from a hemorrhagic fever case in the Balkans. This virus strain was isolated from a fatal CCHF case, and passaged only twice on Vero E6 cells prior to sequence analysis. The virus total genome was found to be 19.2 kb in length, consisting of a 1672 nucleotide (nt S segment, a 5364 nt M segment and a 12150 nt L segment. Phylogenetic analysis of CCHF virus complete genomes placed the Kosova Hoti strain in the Europe/Turkey group, with highest similarity seen with Russian isolates. The virus M segments are the most diverse with up to 31 and 27% differences seen at the nt and amino acid levels, and even 1.9% amino acid difference found between the Kosova Hoti and another strain from Kosovo (9553-01. This suggests that distinct virus strains can coexist in highly endemic areas.

  14. Genome Regions Associated with Functional Performance of Soybean Stem Fibers in Polypropylene Thermoplastic Composites.

    Science.gov (United States)

    Reinprecht, Yarmilla; Arif, Muhammad; Simon, Leonardo C; Pauls, K Peter

    2015-01-01

    Plant fibers can be used to produce composite materials for automobile parts, thus reducing plastic used in their manufacture, overall vehicle weight and fuel consumption when they replace mineral fillers and glass fibers. Soybean stem residues are, potentially, significant sources of inexpensive, renewable and biodegradable natural fibers, but are not curretly used for biocomposite production due to the functional properties of their fibers in composites being unknown. The current study was initiated to investigate the effects of plant genotype on the performance characteristics of soybean stem fibers when incorporated into a polypropylene (PP) matrix using a selective phenotyping approach. Fibers from 50 lines of a recombinant inbred line population (169 RILs) grown in different environments were incorporated into PP at 20% (wt/wt) by extrusion. Test samples were injection molded and characterized for their mechanical properties. The performance of stem fibers in the composites was significantly affected by genotype and environment. Fibers from different genotypes had significantly different chemical compositions, thus composites prepared with these fibers displayed different physical properties. This study demonstrates that thermoplastic composites with soybean stem-derived fibers have mechanical properties that are equivalent or better than wheat straw fiber composites currently being used for manufacturing interior automotive parts. The addition of soybean stem residues improved flexural, tensile and impact properties of the composites. Furthermore, by linkage and in silico mapping we identified genomic regions to which quantitative trait loci (QTL) for compositional and functional properties of soybean stem fibers in thermoplastic composites, as well as genes for cell wall synthesis, were co-localized. These results may lead to the development of high value uses for soybean stem residue.

  15. Genome Regions Associated with Functional Performance of Soybean Stem Fibers in Polypropylene Thermoplastic Composites.

    Directory of Open Access Journals (Sweden)

    Yarmilla Reinprecht

    Full Text Available Plant fibers can be used to produce composite materials for automobile parts, thus reducing plastic used in their manufacture, overall vehicle weight and fuel consumption when they replace mineral fillers and glass fibers. Soybean stem residues are, potentially, significant sources of inexpensive, renewable and biodegradable natural fibers, but are not curretly used for biocomposite production due to the functional properties of their fibers in composites being unknown. The current study was initiated to investigate the effects of plant genotype on the performance characteristics of soybean stem fibers when incorporated into a polypropylene (PP matrix using a selective phenotyping approach. Fibers from 50 lines of a recombinant inbred line population (169 RILs grown in different environments were incorporated into PP at 20% (wt/wt by extrusion. Test samples were injection molded and characterized for their mechanical properties. The performance of stem fibers in the composites was significantly affected by genotype and environment. Fibers from different genotypes had significantly different chemical compositions, thus composites prepared with these fibers displayed different physical properties. This study demonstrates that thermoplastic composites with soybean stem-derived fibers have mechanical properties that are equivalent or better than wheat straw fiber composites currently being used for manufacturing interior automotive parts. The addition of soybean stem residues improved flexural, tensile and impact properties of the composites. Furthermore, by linkage and in silico mapping we identified genomic regions to which quantitative trait loci (QTL for compositional and functional properties of soybean stem fibers in thermoplastic composites, as well as genes for cell wall synthesis, were co-localized. These results may lead to the development of high value uses for soybean stem residue.

  16. Genes involved in complex adaptive processes tend to have highly conserved upstream regions in mammalian genomes

    Directory of Open Access Journals (Sweden)

    Kohane Isaac

    2005-11-01

    Full Text Available Abstract Background Recent advances in genome sequencing suggest a remarkable conservation in gene content of mammalian organisms. The similarity in gene repertoire present in different organisms has increased interest in studying regulatory mechanisms of gene expression aimed at elucidating the differences in phenotypes. In particular, a proximal promoter region contains a large number of regulatory elements that control the expression of its downstream gene. Although many studies have focused on identification of these elements, a broader picture on the complexity of transcriptional regulation of different biological processes has not been addressed in mammals. The regulatory complexity may strongly correlate with gene function, as different evolutionary forces must act on the regulatory systems under different biological conditions. We investigate this hypothesis by comparing the conservation of promoters upstream of genes classified in different functional categories. Results By conducting a rank correlation analysis between functional annotation and upstream sequence alignment scores obtained by human-mouse and human-dog comparison, we found a significantly greater conservation of the upstream sequence of genes involved in development, cell communication, neural functions and signaling processes than those involved in more basic processes shared with unicellular organisms such as metabolism and ribosomal function. This observation persists after controlling for G+C content. Considering conservation as a functional signature, we hypothesize a higher density of cis-regulatory elements upstream of genes participating in complex and adaptive processes. Conclusion We identified a class of functions that are associated with either high or low promoter conservation in mammals. We detected a significant tendency that points to complex and adaptive processes were associated with higher promoter conservation, despite the fact that they have emerged

  17. Chromosome region-specific libraries for human genome analysis. Final progress report, 1 March 1991--28 February 1994

    Energy Technology Data Exchange (ETDEWEB)

    Kao, F.T.

    1994-04-01

    The objectives of this grant proposal include (1) development of a chromosome microdissection and PCR-mediated microcloning technology, (2) application of this microtechnology to the construction of region-specific libraries for human genome analysis. During this grant period, the authors have successfully developed this microtechnology and have applied it to the construction of microdissection libraries for the following chromosome regions: a whole chromosome 21 (21E), 2 region-specific libraries for the long arm of chromosome 2, 2q35-q37 (2Q1) and 2q33-q35 (2Q2), and 4 region-specific libraries for the entire short arm of chromosome 2, 2p23-p25 (2P1), 2p21-p23 (2P2), 2p14-p16 (wP3) and 2p11-p13 (2P4). In addition, 20--40 unique sequence microclones have been isolated and characterized for genomic studies. These region-specific libraries and the single-copy microclones from the library have been used as valuable resources for (1) isolating microsatellite probes in linkage analysis to further refine the disease locus; (2) isolating corresponding clones with large inserts, e.g. YAC, BAC, P1, cosmid and phage, to facilitate construction of contigs for high resolution physical mapping; and (3) isolating region-specific cDNA clones for use as candidate genes. These libraries are being deposited in the American Type Culture Collection (ATCC) for general distribution.

  18. Genome-Based Identification of Chromosomal Regions Specific for Salmonella spp.

    OpenAIRE

    Hansen-Wester, Imke; Hensel, Michael

    2002-01-01

    Acquisition of genomic elements by horizontal gene transfer represents an important mechanism in the evolution of bacterial species. Pathogenicity islands are a subset of horizontally acquired elements present in various pathogens. These elements are frequently located adjacent to tRNA genes. We performed a comparative genome analysis of Salmonella enterica serovars Typhi and Typhimurium and Escherichia coli and scanned tRNA loci for the presence of species-specific, horizontally acquired gen...

  19. An exploration of the sequence of a 2.9-Mb region of the genome of Drosophila melanogaster: the Adh region.

    Science.gov (United States)

    Ashburner, M; Misra, S; Roote, J; Lewis, S E; Blazej, R; Davis, T; Doyle, C; Galle, R; George, R; Harris, N; Hartzell, G; Harvey, D; Hong, L; Houston, K; Hoskins, R; Johnson, G; Martin, C; Moshrefi, A; Palazzolo, M; Reese, M G; Spradling, A; Tsang, G; Wan, K; Whitelaw, K; Celniker, S

    1999-01-01

    A contiguous sequence of nearly 3 Mb from the genome of Drosophila melanogaster has been sequenced from a series of overlapping P1 and BAC clones. This region covers 69 chromosome polytene bands on chromosome arm 2L, including the genetically well-characterized "Adh region." A computational analysis of the sequence predicts 218 protein-coding genes, 11 tRNAs, and 17 transposable element sequences. At least 38 of the protein-coding genes are arranged in clusters of from 2 to 6 closely related genes, suggesting extensive tandem duplication. The gene density is one protein-coding gene every 13 kb; the transposable element density is one element every 171 kb. Of 73 genes in this region identified by genetic analysis, 49 have been located on the sequence; P-element insertions have been mapped to 43 genes. Ninety-five (44%) of the known and predicted genes match a Drosophila EST, and 144 (66%) have clear similarities to proteins in other organisms. Genes known to have mutant phenotypes are more likely to be represented in cDNA libraries, and far more likely to have products similar to proteins of other organisms, than are genes with no known mutant phenotype. Over 650 chromosome aberration breakpoints map to this chromosome region, and their nonrandom distribution on the genetic map reflects variation in gene spacing on the DNA. This is the first large-scale analysis of the genome of D. melanogaster at the sequence level. In addition to the direct results obtained, this analysis has allowed us to develop and test methods that will be needed to interpret the complete sequence of the genome of this species.Before beginning a Hunt, it is wise to ask someone what you are looking for before you begin looking for it. Milne 1926 PMID:10471707

  20. "Beijing Region" (3pter-D3S3397) of the Human Genome: Complete sequence and analysis

    Institute of Scientific and Technical Information of China (English)

    The; Chinese; Human; Genome; Sequencing; Consortium

    2005-01-01

    The goal of the Human Genome Project (HGP) is to determine a complete and high-quality sequence of the human genome. China, as one of the six member states, takes a region between 3pter and D3S3397 of the human chromosome 3 as its share of this historic project, referred as "Beijing Region". The complete sequence of this region comprises of 17.4 megabasepairs (Mb) with an average GC content of 42% and an average recombination rate of 2.14 cM/Mb. Within Beijing Region, 122 known and 20 novel genes are identified, as well as 42607 single nucleotide polymorphisms (SNPs). Comprehensive analyses also reveal: (i) gene density and GC-content of Beijing Region are in agreement with human cytogenetic maps, i.e. G-minus bands are GC-rich and of a high gene density, whereas G-plus bands are GC-poor and of a relatively low gene density; (ii) the average recombination rate within Beijing Region is relatively high compared with other regions of chromosome 3, with the highest recombination rate of 6.06 cM/Mb in the subtelomeric area; (iii) it is most likely that a large gene, associated with the mammary gland, may reside in the 1.1 Mb gene-poor area near the telomere; (iv) many disease-related genes are genetically mapped to Beijing Region, including those associated with cancers and metabolic syndromes. All make Beijing Region an important target for in-depth molecular investigations with a purpose of medical applications.

  1. Genome-wide genetic diversity and differentially selected regions among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep.

    Science.gov (United States)

    Zhang, Lifan; Mousel, Michelle R; Wu, Xiaolin; Michal, Jennifer J; Zhou, Xiang; Ding, Bo; Dodson, Michael V; El-Halawany, Nermin K; Lewis, Gregory S; Jiang, Zhihua

    2013-01-01

    Sheep are among the major economically important livestock species worldwide because the animals produce milk, wool, skin, and meat. In the present study, the Illumina OvineSNP50 BeadChip was used to investigate genetic diversity and genome selection among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds from the United States. After quality-control filtering of SNPs (single nucleotide polymorphisms), we used 48,026 SNPs, including 46,850 SNPs on autosomes that were in Hardy-Weinberg equilibrium and 1,176 SNPs on chromosome × for analysis. Phylogenetic analysis based on all 46,850 SNPs clearly separated Suffolk from Rambouillet, Columbia, Polypay, and Targhee, which was not surprising as Rambouillet contributed to the synthesis of the later three breeds. Based on pair-wise estimates of F(ST), significant genetic differentiation appeared between Suffolk and Rambouillet (F(ST) = 0.1621), while Rambouillet and Targhee had the closest relationship (F(ST) = 0.0681). A scan of the genome revealed 45 and 41 differentially selected regions (DSRs) between Suffolk and Rambouillet and among Rambouillet-related breed populations, respectively. Our data indicated that regions 13 and 24 between Suffolk and Rambouillet might be good candidates for evaluating breed differences. Furthermore, ovine genome v3.1 assembly was used as reference to link functionally known homologous genes to economically important traits covered by these differentially selected regions. In brief, our present study provides a comprehensive genome-wide view on within- and between-breed genetic differentiation, biodiversity, and evolution among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds. These results may provide new guidance for the synthesis of new breeds with different breeding objectives.

  2. Genome-wide genetic diversity and differentially selected regions among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep.

    Directory of Open Access Journals (Sweden)

    Lifan Zhang

    Full Text Available Sheep are among the major economically important livestock species worldwide because the animals produce milk, wool, skin, and meat. In the present study, the Illumina OvineSNP50 BeadChip was used to investigate genetic diversity and genome selection among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds from the United States. After quality-control filtering of SNPs (single nucleotide polymorphisms, we used 48,026 SNPs, including 46,850 SNPs on autosomes that were in Hardy-Weinberg equilibrium and 1,176 SNPs on chromosome × for analysis. Phylogenetic analysis based on all 46,850 SNPs clearly separated Suffolk from Rambouillet, Columbia, Polypay, and Targhee, which was not surprising as Rambouillet contributed to the synthesis of the later three breeds. Based on pair-wise estimates of F(ST, significant genetic differentiation appeared between Suffolk and Rambouillet (F(ST = 0.1621, while Rambouillet and Targhee had the closest relationship (F(ST = 0.0681. A scan of the genome revealed 45 and 41 differentially selected regions (DSRs between Suffolk and Rambouillet and among Rambouillet-related breed populations, respectively. Our data indicated that regions 13 and 24 between Suffolk and Rambouillet might be good candidates for evaluating breed differences. Furthermore, ovine genome v3.1 assembly was used as reference to link functionally known homologous genes to economically important traits covered by these differentially selected regions. In brief, our present study provides a comprehensive genome-wide view on within- and between-breed genetic differentiation, biodiversity, and evolution among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds. These results may provide new guidance for the synthesis of new breeds with different breeding objectives.

  3. Pan-Genome Analysis of Human Gastric Pathogen H. pylori: Comparative Genomics and Pathogenomics Approaches to Identify Regions Associated with Pathogenicity and Prediction of Potential Core Therapeutic Targets

    DEFF Research Database (Denmark)

    Ali, Amjad; Naz, Anam; Soares, Siomar C.

    2015-01-01

    . Pan-genome analyses of the global representative H. pylori isolates consisting of 39 complete genomes are presented in this paper. Phylogenetic analyses have revealed close relationships among geographically diverse strains of H. pylori. The conservation among these genomes was further analyzed by pan-genome...

  4. Genome wide signatures of positive selection: The comparison of independent samples and the identification of regions associated to traits

    Directory of Open Access Journals (Sweden)

    Thomas Merle B

    2009-04-01

    Full Text Available Abstract Background The goal of genome wide analyses of polymorphisms is to achieve a better understanding of the link between genotype and phenotype. Part of that goal is to understand the selective forces that have operated on a population. Results In this study we compared the signals of selection, identified through population divergence in the Bovine HapMap project, to those found in an independent sample of cattle from Australia. Evidence for population differentiation across the genome, as measured by FST, was highly correlated in the two data sets. Nevertheless, 40% of the variance in FST between the two studies was attributed to the differences in breed composition. Seventy six percent of the variance in FST was attributed to differences in SNP composition and density when the same breeds were compared. The difference between FST of adjacent loci increased rapidly with the increase in distance between SNP, reaching an asymptote after 20 kb. Using 129 SNP that have highly divergent FST values in both data sets, we identified 12 regions that had additive effects on the traits residual feed intake, beef yield or intramuscular fatness measured in the Australian sample. Four of these regions had effects on more than one trait. One of these regions includes the R3HDM1 gene, which is under selection in European humans. Conclusion Firstly, many different populations will be necessary for a full description of selective signatures across the genome, not just a small set of highly divergent populations. Secondly, it is necessary to use the same SNP when comparing the signatures of selection from one study to another. Thirdly, useful signatures of selection can be obtained where many of the groups have only minor genetic differences and may not be clearly separated in a principal component analysis. Fourthly, combining analyses of genome wide selection signatures and genome wide associations to traits helps to define the trait under selection or

  5. Differentially Methylated Genomic Regions in Birth-Weight Discordant Twin Pairs

    DEFF Research Database (Denmark)

    Chen, Mubo; Baumbach, Jan; Vandin, Fabio;

    2016-01-01

    Poor nutrition during critical growth phases may alter the structural and physiologic development of vital organs thus “programming” the susceptibility to adult-onset diseases and disease-related health conditions. Epigenome-wide association studies have been performed in birth-weight discordant...... twin pairs to find evidence for such “programming” effects, but no significant results emerged. We further investigated this issue using a new computational approach: Instead of probing single genomic sites for significant alterations in epigenetic marks, we scan for differentially methylated genomic...

  6. Draft Genome Sequence of Pseudomonas hussainii Strain MB3, a Denitrifying Aerobic Bacterium Isolated from the Rhizospheric Region of Mangrove Trees in the Andaman Islands, India.

    Science.gov (United States)

    Jaiswal, Shubham K; Saxena, Rituja; Mittal, Parul; Gupta, Ankit; Sharma, Vineet K

    2017-02-02

    The genome sequence of Pseudomonas hussainii MB3, isolated from the rhizospheric region of mangroves in the Andaman Islands, is comprised of 3,644,788 bp and 3,159 protein coding genes. Draft genome analysis indicates that MB3 is an aerobic bacterium capable of performing assimilatory sulfate reduction, dissimilatory nitrate reduction, and denitrification.

  7. Genomic DNA pooling strategy for next-generation sequencing-based rare variant discovery in abdominal aortic aneurysm regions of interest-challenges and limitations

    NARCIS (Netherlands)

    Harakalova, M.; Nijman, I.J.; Medic, J.; Mokry, M.; Renkens, I.; Blankensteijn, J.D.; Kloosterman, W.P.; Baas, A.F.; Cuppen, E.

    2011-01-01

    The costs and efforts for sample preparation of hundreds of individuals, their genomic enrichment for regions of interest, and sufficient deep sequencing bring a significant burden to next-generation sequencing-based experiments. We investigated whether pooling of samples at the level of genomic DNA

  8. Genome analysis of Excretory/Secretory proteins in Taenia solium reveals their Abundance of Antigenic Regions (AAR).

    Science.gov (United States)

    Gomez, Sandra; Adalid-Peralta, Laura; Palafox-Fonseca, Hector; Cantu-Robles, Vito Adrian; Soberón, Xavier; Sciutto, Edda; Fragoso, Gladis; Bobes, Raúl J; Laclette, Juan P; Yauner, Luis del Pozo; Ochoa-Leyva, Adrián

    2015-05-19

    Excretory/Secretory (ES) proteins play an important role in the host-parasite interactions. Experimental identification of ES proteins is time-consuming and expensive. Alternative bioinformatics approaches are cost-effective and can be used to prioritize the experimental analysis of therapeutic targets for parasitic diseases. Here we predicted and functionally annotated the ES proteins in T. solium genome using an integration of bioinformatics tools. Additionally, we developed a novel measurement to evaluate the potential antigenicity of T. solium secretome using sequence length and number of antigenic regions of ES proteins. This measurement was formalized as the Abundance of Antigenic Regions (AAR) value. AAR value for secretome showed a similar value to that obtained for a set of experimentally determined antigenic proteins and was different to the calculated value for the non-ES proteins of T. solium genome. Furthermore, we calculated the AAR values for known helminth secretomes and they were similar to that obtained for T. solium. The results reveal the utility of AAR value as a novel genomic measurement to evaluate the potential antigenicity of secretomes. This comprehensive analysis of T. solium secretome provides functional information for future experimental studies, including the identification of novel ES proteins of therapeutic, diagnosis and immunological interest.

  9. Characterization of the genomic organization of the region bordering the centromere of chromosome V of Podospora anserina by direct sequencing.

    Science.gov (United States)

    Silar, Philippe; Barreau, Christian; Debuchy, Robert; Kicka, Sébastien; Turcq, Béatrice; Sainsard-Chanet, Annie; Sellem, Carole H; Billault, Alain; Cattolico, Laurence; Duprat, Simone; Weissenbach, Jean

    2003-08-01

    A Podospora anserina BAC library of 4800 clones has been constructed in the vector pBHYG allowing direct selection in fungi. Screening of the BAC collection for centromeric sequences of chromosome V allowed the recovery of clones localized on either sides of the centromere, but no BAC clone was found to contain the centromere. Seven BAC clones containing 322,195 and 156,244bp from either sides of the centromeric region were sequenced and annotated. One 5S rRNA gene, 5 tRNA genes, and 163 putative coding sequences (CDS) were identified. Among these, only six CDS seem specific to P. anserina. The gene density in the centromeric region is approximately one gene every 2.8kb. Extrapolation of this gene density to the whole genome of P. anserina suggests that the genome contains about 11,000 genes. Synteny analyses between P. anserina and Neurospora crassa show that co-linearity extends at the most to a few genes, suggesting rapid genome rearrangements between these two species.

  10. Non-coding genomic regions possessing enhancer and silencer potential are associated with healthy aging and exceptional survival.

    Science.gov (United States)

    Kim, Sangkyu; Welsh, David A; Myers, Leann; Cherry, Katie E; Wyckoff, Jennifer; Jazwinski, S Michal

    2015-02-28

    We have completed a genome-wide linkage scan for healthy aging using data collected from a family study, followed by fine-mapping by association in a separate population, the first such attempt reported. The family cohort consisted of parents of age 90 or above and their children ranging in age from 50 to 80. As a quantitative measure of healthy aging, we used a frailty index, called FI34, based on 34 health and function variables. The linkage scan found a single significant linkage peak on chromosome 12. Using an independent cohort of unrelated nonagenarians, we carried out a fine-scale association mapping of the region suggestive of linkage and identified three sites associated with healthy aging. These healthy-aging sites (HASs) are located in intergenic regions at 12q13-14. HAS-1 has been previously associated with multiple diseases, and an enhancer was recently mapped and experimentally validated within the site. HAS-2 is a previously uncharacterized site possessing genomic features suggestive of enhancer activity. HAS-3 contains features associated with Polycomb repression. The HASs also contain variants associated with exceptional longevity, based on a separate analysis. Our results provide insight into functional genomic networks involving non-coding regulatory elements that are involved in healthy aging and longevity.

  11. Genome-environment association study suggests local adaptation to climate at the regional scale in Fagus sylvatica.

    Science.gov (United States)

    Pluess, Andrea R; Frank, Aline; Heiri, Caroline; Lalagüe, Hadrien; Vendramin, Giovanni G; Oddou-Muratorio, Sylvie

    2016-04-01

    The evolutionary potential of long-lived species, such as forest trees, is fundamental for their local persistence under climate change (CC). Genome-environment association (GEA) analyses reveal if species in heterogeneous environments at the regional scale are under differential selection resulting in populations with potential preadaptation to CC within this area. In 79 natural Fagus sylvatica populations, neutral genetic patterns were characterized using 12 simple sequence repeat (SSR) markers, and genomic variation (144 single nucleotide polymorphisms (SNPs) out of 52 candidate genes) was related to 87 environmental predictors in the latent factor mixed model, logistic regressions and isolation by distance/environmental (IBD/IBE) tests. SSR diversity revealed relatedness at up to 150 m intertree distance but an absence of large-scale spatial genetic structure and IBE. In the GEA analyses, 16 SNPs in 10 genes responded to one or several environmental predictors and IBE, corrected for IBD, was confirmed. The GEA often reflected the proposed gene functions, including indications for adaptation to water availability and temperature. Genomic divergence and the lack of large-scale neutral genetic patterns suggest that gene flow allows the spread of advantageous alleles in adaptive genes. Thereby, adaptation processes are likely to take place in species occurring in heterogeneous environments, which might reduce their regional extinction risk under CC.

  12. Genome analysis of Excretory/Secretory proteins in Taenia solium reveals their Abundance of Antigenic Regions (AAR)

    Science.gov (United States)

    Gomez, Sandra; Adalid-Peralta, Laura; Palafox-Fonseca, Hector; Cantu-Robles, Vito Adrian; Soberón, Xavier; Sciutto, Edda; Fragoso, Gladis; Bobes, Raúl J.; Laclette, Juan P.; Yauner, Luis del Pozo; Ochoa-Leyva, Adrián

    2015-01-01

    Excretory/Secretory (ES) proteins play an important role in the host-parasite interactions. Experimental identification of ES proteins is time-consuming and expensive. Alternative bioinformatics approaches are cost-effective and can be used to prioritize the experimental analysis of therapeutic targets for parasitic diseases. Here we predicted and functionally annotated the ES proteins in T. solium genome using an integration of bioinformatics tools. Additionally, we developed a novel measurement to evaluate the potential antigenicity of T. solium secretome using sequence length and number of antigenic regions of ES proteins. This measurement was formalized as the Abundance of Antigenic Regions (AAR) value. AAR value for secretome showed a similar value to that obtained for a set of experimentally determined antigenic proteins and was different to the calculated value for the non-ES proteins of T. solium genome. Furthermore, we calculated the AAR values for known helminth secretomes and they were similar to that obtained for T. solium. The results reveal the utility of AAR value as a novel genomic measurement to evaluate the potential antigenicity of secretomes. This comprehensive analysis of T. solium secretome provides functional information for future experimental studies, including the identification of novel ES proteins of therapeutic, diagnosis and immunological interest. PMID:25989346

  13. DNA Barcoding: Amplification and sequence analysis of rbcl and matK genome regions in three divergent plant species

    Directory of Open Access Journals (Sweden)

    Javed Iqbal Wattoo

    2016-11-01

    Full Text Available Background: DNA barcoding is a novel method of species identification based on nucleotide diversity of conserved sequences. The establishment and refining of plant DNA barcoding systems is more challenging due to high genetic diversity among different species. Therefore, targeting the conserved nuclear transcribed regions would be more reliable for plant scientists to reveal genetic diversity, species discrimination and phylogeny. Methods: In this study, we amplified and sequenced the chloroplast DNA regions (matk+rbcl of Solanum nigrum, Euphorbia helioscopia and Dalbergia sissoo to study the functional annotation, homology modeling and sequence analysis to allow a more efficient utilization of these sequences among different plant species. These three species represent three families; Solanaceae, Euphorbiaceae and Fabaceae respectively. Biological sequence homology and divergence of amplified sequences was studied using Basic Local Alignment Tool (BLAST. Results: Both primers (matk+rbcl showed good amplification in three species. The sequenced regions reveled conserved genome information for future identification of different medicinal plants belonging to these species. The amplified conserved barcodes revealed different levels of biological homology after sequence analysis. The results clearly showed that the use of these conserved DNA sequences as barcode primers would be an accurate way for species identification and discrimination. Conclusion: The amplification and sequencing of conserved genome regions identified a novel sequence of matK in native species of Solanum nigrum. The findings of the study would be applicable in medicinal industry to establish DNA based identification of different medicinal plant species to monitor adulteration.

  14. Integration of multiethnic fine-mapping and genomic annotation to prioritize candidate functional SNPs at prostate cancer susceptibility regions

    Science.gov (United States)

    Han, Ying; Hazelett, Dennis J.; Wiklund, Fredrik; Schumacher, Fredrick R.; Stram, Daniel O.; Berndt, Sonja I.; Wang, Zhaoming; Rand, Kristin A.; Hoover, Robert N.; Machiela, Mitchell J.; Yeager, Merideth; Burdette, Laurie; Chung, Charles C.; Hutchinson, Amy; Yu, Kai; Xu, Jianfeng; Travis, Ruth C.; Key, Timothy J.; Siddiq, Afshan; Canzian, Federico; Takahashi, Atsushi; Kubo, Michiaki; Stanford, Janet L.; Kolb, Suzanne; Gapstur, Susan M.; Diver, W. Ryan; Stevens, Victoria L.; Strom, Sara S.; Pettaway, Curtis A.; Al Olama, Ali Amin; Kote-Jarai, Zsofia; Eeles, Rosalind A.; Yeboah, Edward D.; Tettey, Yao; Biritwum, Richard B.; Adjei, Andrew A.; Tay, Evelyn; Truelove, Ann; Niwa, Shelley; Chokkalingam, Anand P.; Isaacs, William B.; Chen, Constance; Lindstrom, Sara; Le Marchand, Loic; Giovannucci, Edward L.; Pomerantz, Mark; Long, Henry; Li, Fugen; Ma, Jing; Stampfer, Meir; John, Esther M.; Ingles, Sue A.; Kittles, Rick A.; Murphy, Adam B.; Blot, William J.; Signorello, Lisa B.; Zheng, Wei; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie; Nemesure, Barbara; Carpten, John; Leske, M. Cristina; Wu, Suh-Yuh; Hennis, Anselm J. M.; Rybicki, Benjamin A.; Neslund-Dudas, Christine; Hsing, Ann W.; Chu, Lisa; Goodman, Phyllis J.; Klein, Eric A.; Zheng, S. Lilly; Witte, John S.; Casey, Graham; Riboli, Elio; Li, Qiyuan; Freedman, Matthew L.; Hunter, David J.; Gronberg, Henrik; Cook, Michael B.; Nakagawa, Hidewaki; Kraft, Peter; Chanock, Stephen J.; Easton, Douglas F.; Henderson, Brian E.; Coetzee, Gerhard A.; Conti, David V.; Haiman, Christopher A.

    2015-01-01

    Interpretation of biological mechanisms underlying genetic risk associations for prostate cancer is complicated by the relatively large number of risk variants (n = 100) and the thousands of surrogate SNPs in linkage disequilibrium. Here, we combined three distinct approaches: multiethnic fine-mapping, putative functional annotation (based upon epigenetic data and genome-encoded features), and expression quantitative trait loci (eQTL) analyses, in an attempt to reduce this complexity. We examined 67 risk regions using genotyping and imputation-based fine-mapping in populations of European (cases/controls: 8600/6946), African (cases/controls: 5327/5136), Japanese (cases/controls: 2563/4391) and Latino (cases/controls: 1034/1046) ancestry. Markers at 55 regions passed a region-specific significance threshold (P-value cutoff range: 3.9 × 10−4–5.6 × 10−3) and in 30 regions we identified markers that were more significantly associated with risk than the previously reported variants in the multiethnic sample. Novel secondary signals (P < 5.0 × 10−6) were also detected in two regions (rs13062436/3q21 and rs17181170/3p12). Among 666 variants in the 55 regions with P-values within one order of magnitude of the most-associated marker, 193 variants (29%) in 48 regions overlapped with epigenetic or other putative functional marks. In 11 of the 55 regions, cis-eQTLs were detected with nearby genes. For 12 of the 55 regions (22%), the most significant region-specific, prostate-cancer associated variant represented the strongest candidate functional variant based on our annotations; the number of regions increased to 20 (36%) and 27 (49%) when examining the 2 and 3 most significantly associated variants in each region, respectively. These results have prioritized subsets of candidate variants for downstream functional evaluation. PMID:26162851

  15. Integration of multiethnic fine-mapping and genomic annotation to prioritize candidate functional SNPs at prostate cancer susceptibility regions.

    Science.gov (United States)

    Han, Ying; Hazelett, Dennis J; Wiklund, Fredrik; Schumacher, Fredrick R; Stram, Daniel O; Berndt, Sonja I; Wang, Zhaoming; Rand, Kristin A; Hoover, Robert N; Machiela, Mitchell J; Yeager, Merideth; Burdette, Laurie; Chung, Charles C; Hutchinson, Amy; Yu, Kai; Xu, Jianfeng; Travis, Ruth C; Key, Timothy J; Siddiq, Afshan; Canzian, Federico; Takahashi, Atsushi; Kubo, Michiaki; Stanford, Janet L; Kolb, Suzanne; Gapstur, Susan M; Diver, W Ryan; Stevens, Victoria L; Strom, Sara S; Pettaway, Curtis A; Al Olama, Ali Amin; Kote-Jarai, Zsofia; Eeles, Rosalind A; Yeboah, Edward D; Tettey, Yao; Biritwum, Richard B; Adjei, Andrew A; Tay, Evelyn; Truelove, Ann; Niwa, Shelley; Chokkalingam, Anand P; Isaacs, William B; Chen, Constance; Lindstrom, Sara; Le Marchand, Loic; Giovannucci, Edward L; Pomerantz, Mark; Long, Henry; Li, Fugen; Ma, Jing; Stampfer, Meir; John, Esther M; Ingles, Sue A; Kittles, Rick A; Murphy, Adam B; Blot, William J; Signorello, Lisa B; Zheng, Wei; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie; Nemesure, Barbara; Carpten, John; Leske, M Cristina; Wu, Suh-Yuh; Hennis, Anselm J M; Rybicki, Benjamin A; Neslund-Dudas, Christine; Hsing, Ann W; Chu, Lisa; Goodman, Phyllis J; Klein, Eric A; Zheng, S Lilly; Witte, John S; Casey, Graham; Riboli, Elio; Li, Qiyuan; Freedman, Matthew L; Hunter, David J; Gronberg, Henrik; Cook, Michael B; Nakagawa, Hidewaki; Kraft, Peter; Chanock, Stephen J; Easton, Douglas F; Henderson, Brian E; Coetzee, Gerhard A; Conti, David V; Haiman, Christopher A

    2015-10-01

    Interpretation of biological mechanisms underlying genetic risk associations for prostate cancer is complicated by the relatively large number of risk variants (n = 100) and the thousands of surrogate SNPs in linkage disequilibrium. Here, we combined three distinct approaches: multiethnic fine-mapping, putative functional annotation (based upon epigenetic data and genome-encoded features), and expression quantitative trait loci (eQTL) analyses, in an attempt to reduce this complexity. We examined 67 risk regions using genotyping and imputation-based fine-mapping in populations of European (cases/controls: 8600/6946), African (cases/controls: 5327/5136), Japanese (cases/controls: 2563/4391) and Latino (cases/controls: 1034/1046) ancestry. Markers at 55 regions passed a region-specific significance threshold (P-value cutoff range: 3.9 × 10(-4)-5.6 × 10(-3)) and in 30 regions we identified markers that were more significantly associated with risk than the previously reported variants in the multiethnic sample. Novel secondary signals (P < 5.0 × 10(-6)) were also detected in two regions (rs13062436/3q21 and rs17181170/3p12). Among 666 variants in the 55 regions with P-values within one order of magnitude of the most-associated marker, 193 variants (29%) in 48 regions overlapped with epigenetic or other putative functional marks. In 11 of the 55 regions, cis-eQTLs were detected with nearby genes. For 12 of the 55 regions (22%), the most significant region-specific, prostate-cancer associated variant represented the strongest candidate functional variant based on our annotations; the number of regions increased to 20 (36%) and 27 (49%) when examining the 2 and 3 most significantly associated variants in each region, respectively. These results have prioritized subsets of candidate variants for downstream functional evaluation.

  16. Characterization of promoter region and genomic structure of the murine and human genes encoding Src like adapter protein.

    Science.gov (United States)

    Kratchmarova, I; Sosinowski, T; Weiss, A; Witter, K; Vincenz, C; Pandey, A

    2001-01-10

    Src-like adapter protein (SLAP) was identified as a signaling molecule in a yeast two-hybrid system using the cytoplasmic domain of EphA2, a receptor protein tyrosine kinase (Pandey et al., 1995. Characterization of a novel Src-like adapter protein that associates with the Eck receptor tyrosine kinase. J. Biol. Chem. 270, 19201-19204). It is very similar to members of the Src family of cytoplasmic tyrosine kinases in that it contains very homologous SH3 and SH2 domains (Abram and Courtneidge, 2000. Src family tyrosine kinases and growth factor signaling. Exp. Cell. Res. 254, 1-13.). However, instead of a kinase domain at the C-terminus, it contains a unique C-terminal region. In order to exclude the possibility that an alternative form exists, we have isolated genomic clones containing the murine Slap gene as well as the human SLA gene. The coding regions of murine Slap and human SLA genes contain seven exons and six introns. Absence of any kinase domain in the genomic region confirm its designation as an adapter protein. Additionally, we have cloned and sequenced approximately 2.6 kb of the region 5' to the initiator methionine of the murine Slap gene. When subcloned upstream of a luciferase gene, this fragment increased the transcriptional activity about 6-fold in a human Jurkat T cell line and approximately 52-fold in a murine T cell line indicating that this region contains promoter elements that dictate SLAP expression. We have also cloned the promoter region of the human SLA gene. Since SLAP is transcriptionally regulated by retinoic acid and by activation of B cells, the cloning of its promoter region will permit a detailed analysis of the elements required for its transcriptional regulation.

  17. Isolation and characterization of the genomic region from Drosophila kuntzei containing the Adh and Adhr genes

    NARCIS (Netherlands)

    Oppentocht, JE; van Delden, W; van de Zande, L

    2002-01-01

    The nucleotide sequences of the Adh and Adhr genes of Drosophila kuntzei were derived from combined overlapping sequences of clones isolated from a genomic library and from cloned PCR and inverse-PCR fragments. Only a proximal promoter was detected upstream of the Adh gene, indicating that D. kuntze

  18. Mapping of 5q35 chromosomal rearrangements within a genomically unstable region

    DEFF Research Database (Denmark)

    Buysse, Karen; Crepel, An; Menten, Björn

    2008-01-01

    BACKGROUND: Recent molecular studies of breakpoints of recurrent chromosome rearrangements revealed the role of genomic architecture in their formation. In particular, segmental duplications representing blocks of >1 kb with >90% sequence homology were shown to mediate non-allelic homologous reco...

  19. Development and validation of new SSR markers from expressed regions in the garlic genome

    Science.gov (United States)

    Limited number of simple sequence repeat (SSR) markers is available for the genome of garlic (Allium sativum L.) although SSR markers have become one of the most preferred marker systems because they are typically co-dominant, reproducible, cross species transferable and highly polymorphic. In this ...

  20. The evolution of sex ratio distorter suppression affects a 25 cM genomic region in the butterfly Hypolimnas bolina.

    Directory of Open Access Journals (Sweden)

    Emily A Hornett

    2014-12-01

    Full Text Available Symbionts that distort their host's sex ratio by favouring the production and survival of females are common in arthropods. Their presence produces intense Fisherian selection to return the sex ratio to parity, typified by the rapid spread of host 'suppressor' loci that restore male survival/development. In this study, we investigated the genomic impact of a selective event of this kind in the butterfly Hypolimnas bolina. Through linkage mapping, we first identified a genomic region that was necessary for males to survive Wolbachia-induced male-killing. We then investigated the genomic impact of the rapid spread of suppression, which converted the Samoan population of this butterfly from a 100:1 female-biased sex ratio in 2001 to a 1:1 sex ratio by 2006. Models of this process revealed the potential for a chromosome-wide effect. To measure the impact of this episode of selection directly, the pattern of genetic variation before and after the spread of suppression was compared. Changes in allele frequencies were observed over a 25 cM region surrounding the suppressor locus, with a reduction in overall diversity observed at loci that co-segregate with the suppressor. These changes exceeded those expected from drift and occurred alongside the generation of linkage disequilibrium. The presence of novel allelic variants in 2006 suggests that the suppressor was likely to have been introduced via immigration rather than through de novo mutation. In addition, further sampling in 2010 indicated that many of the introduced variants were lost or had declined in frequency since 2006. We hypothesize that this loss may have resulted from a period of purifying selection, removing deleterious material that introgressed during the initial sweep. Our observations of the impact of suppression of sex ratio distorting activity reveal a very wide genomic imprint, reflecting its status as one of the strongest selective forces in nature.

  1. Isolation of a Genomic Region Affecting Most Components of Metabolic Syndrome in a Chromosome-16 Congenic Rat Model.

    Directory of Open Access Journals (Sweden)

    Lucie Šedová

    Full Text Available Metabolic syndrome is a highly prevalent human disease with substantial genomic and environmental components. Previous studies indicate the presence of significant genetic determinants of several features of metabolic syndrome on rat chromosome 16 (RNO16 and the syntenic regions of human genome. We derived the SHR.BN16 congenic strain by introgression of a limited RNO16 region from the Brown Norway congenic strain (BN-Lx into the genomic background of the spontaneously hypertensive rat (SHR strain. We compared the morphometric, metabolic, and hemodynamic profiles of adult male SHR and SHR.BN16 rats. We also compared in silico the DNA sequences for the differential segment in the BN-Lx and SHR parental strains. SHR.BN16 congenic rats had significantly lower weight, decreased concentrations of total triglycerides and cholesterol, and improved glucose tolerance compared with SHR rats. The concentrations of insulin, free fatty acids, and adiponectin were comparable between the two strains. SHR.BN16 rats had significantly lower systolic (18-28 mmHg difference and diastolic (10-15 mmHg difference blood pressure throughout the experiment (repeated-measures ANOVA, P < 0.001. The differential segment spans approximately 22 Mb of the telomeric part of the short arm of RNO16. The in silico analyses revealed over 1200 DNA variants between the BN-Lx and SHR genomes in the SHR.BN16 differential segment, 44 of which lead to missense mutations, and only eight of which (in Asb14, Il17rd, Itih1, Syt15, Ercc6, RGD1564958, Tmem161a, and Gatad2a genes are predicted to be damaging to the protein product. Furthermore, a number of genes within the RNO16 differential segment associated with metabolic syndrome components in human studies showed polymorphisms between SHR and BN-Lx (including Lpl, Nrg3, Pbx4, Cilp2, and Stab1. Our novel congenic rat model demonstrates that a limited genomic region on RNO16 in the SHR significantly affects many of the features of metabolic

  2. Sequencing of a QTL-rich region of the Theobroma cacao genome using pooled BACs and the identification of trait specific candidate genes

    Directory of Open Access Journals (Sweden)

    Blackmon Barbara P

    2011-07-01

    Full Text Available Abstract Background BAC-based physical maps provide for sequencing across an entire genome or a selected sub-genomic region of biological interest. Such a region can be approached with next-generation whole-genome sequencing and assembly as if it were an independent small genome. Using the minimum tiling path as a guide, specific BAC clones representing the prioritized genomic interval are selected, pooled, and used to prepare a sequencing library. Results This pooled BAC approach was taken to sequence and assemble a QTL-rich region, of ~3 Mbp and represented by twenty-seven BACs, on linkage group 5 of the Theobroma cacao cv. Matina 1-6 genome. Using various mixtures of read coverages from paired-end and linear 454 libraries, multiple assemblies of varied quality were generated. Quality was assessed by comparing the assembly of 454 reads with a subset of ten BACs individually sequenced and assembled using Sanger reads. A mixture of reads optimal for assembly was identified. We found, furthermore, that a quality assembly suitable for serving as a reference genome template could be obtained even with a reduced depth of sequencing coverage. Annotation of the resulting assembly revealed several genes potentially responsible for three T. cacao traits: black pod disease resistance, bean shape index, and pod weight. Conclusions Our results, as with other pooled BAC sequencing reports, suggest that pooling portions of a minimum tiling path derived from a BAC-based physical map is an effective method to target sub-genomic regions for sequencing. While we focused on a single QTL region, other QTL regions of importance could be similarly sequenced allowing for biological discovery to take place before a high quality whole-genome assembly is completed.

  3. A 5'-proximal region of the Citrus tristeza virus genome encoding two leader proteases is involved in virus superinfection exclusion.

    Science.gov (United States)

    Atallah, Osama O; Kang, Sung-Hwan; El-Mohtar, Choaa A; Shilts, Turksen; Bergua, María; Folimonova, Svetlana Y

    2016-02-01

    Superinfection exclusion (SIE), a phenomenon in which a primary virus infection prevents a secondary infection with the same or closely related virus, has been observed with various viruses. Earlier we demonstrated that SIE by Citrus tristeza virus (CTV) requires viral p33 protein. In this work we show that p33 alone is not sufficient for virus exclusion. To define the additional viral components that are involved in this phenomenon, we engineered a hybrid virus in which a 5'-proximal region in the genome of the T36 isolate containing coding sequences for the two leader proteases L1 and L2 has been substituted with a corresponding region from the genome of a heterologous T68-1 isolate. Sequential inoculation of plants pre-infected with the CTV L1L2T68 hybrid with T36 CTV resulted in superinfection with the challenge virus, which indicated that the substitution of the L1-L2 coding region affected SIE ability of the virus.

  4. The complete mitochondrial genome of bighead croaker, Collichthys niveatus (Perciformes, Sciaenidae): structure of control region and phylogenetic considerations.

    Science.gov (United States)

    Xu, Tian-Jun; Cheng, Yuan-Zhi; Sun, Yue-Na; Shi, Ge; Wang, Ri-Xin

    2011-10-01

    Sciaenidae is a diverse, commercially important family. To understand the phylogenetic position of Collichthys niveatus in this family, we present its complete mitochondrial genome sequence. The genome is 16469 bp in length and contains 37 mitochondrial genes (13 protein-coding genes, 2 ribosomal RNA genes and 22 transfer RNA genes) and a control region (CR) as in other bony fishes. Further sequencing for the complete control region was performed on Collichthys lucida. Although the conserved sequence domains such as extend termination associated sequence (ETAS) and conserved sequence block domains (CSB-1, CSB-2 and CSB-3) are recognized in the control region of the two congeneric species, the typical central conserved blocks (CSB-F, CSB-E and CSB-D) could not be detected, while they are found in Miichthys miiuy and Cynoscion acoupa of Sciaenidae and other Percoidei fishes. Phylogenetic analyses do not support the monophyly of Pseudosciaeniae, which is against with the morphological results. C. niveatus is most closely related to Larimichthys polyactis, and Collichthys and Larimichthys may be merged into one genus, based on the current datasets.

  5. swDMR: A Sliding Window Approach to Identify Differentially Methylated Regions Based on Whole Genome Bisulfite Sequencing.

    Directory of Open Access Journals (Sweden)

    Zhen Wang

    Full Text Available DNA methylation is a widespread epigenetic modification that plays an essential role in gene expression through transcriptional regulation and chromatin remodeling. The emergence of whole genome bisulfite sequencing (WGBS represents an important milestone in the detection of DNA methylation. Characterization of differential methylated regions (DMRs is fundamental as well for further functional analysis. In this study, we present swDMR (http://sourceforge.net/projects/swDMR/ for the comprehensive analysis of DMRs from whole genome methylation profiles by a sliding window approach. It is an integrated tool designed for WGBS data, which not only implements accessible statistical methods to perform hypothesis test adapted to two or more samples without replicates, but false discovery rate was also controlled by multiple test correction. Downstream analysis tools were also provided, including cluster, annotation and visualization modules. In summary, based on WGBS data, swDMR can produce abundant information of differential methylated regions. As a convenient and flexible tool, we believe swDMR will bring us closer to unveil the potential functional regions involved in epigenetic regulation.

  6. Mitochondrial genome analyses suggest multiple Trichuris species in humans, baboons, and pigs from different geographical regions

    DEFF Research Database (Denmark)

    Hawash, Mohamed B. F.; Andersen, Lee O.; Gasser, Robin B.;

    2015-01-01

    BACKGROUND: The whipworms Trichuris trichiura and Trichuris suis are two parasitic nematodes of humans and pigs, respectively. Although whipworms in human and non-human primates historically have been referred to as T. trichiura, recent reports suggest that several Trichuris spp. are found...... in primates. METHODS AND FINDINGS: We sequenced and annotated complete mitochondrial genomes of Trichuris recovered from a human in Uganda, an olive baboon in the US, a hamadryas baboon in Denmark, and two pigs from Denmark and Uganda. Comparative analyses using other published mitochondrial genomes...... of Trichuris recovered from a human and a porcine host in China and from a françois' leaf-monkey (China) were performed, including phylogenetic analyses and pairwise genetic and amino acid distances. Genetic and protein distances between human Trichuris in Uganda and China were high (~19% and 15%, respectively...

  7. Multiple recent horizontal transfers of a large genomic region in cheese making fungi.

    Science.gov (United States)

    Cheeseman, Kevin; Ropars, Jeanne; Renault, Pierre; Dupont, Joëlle; Gouzy, Jérôme; Branca, Antoine; Abraham, Anne-Laure; Ceppi, Maurizio; Conseiller, Emmanuel; Debuchy, Robert; Malagnac, Fabienne; Goarin, Anne; Silar, Philippe; Lacoste, Sandrine; Sallet, Erika; Bensimon, Aaron; Giraud, Tatiana; Brygoo, Yves

    2014-01-01

    While the extent and impact of horizontal transfers in prokaryotes are widely acknowledged, their importance to the eukaryotic kingdom is unclear and thought by many to be anecdotal. Here we report multiple recent transfers of a huge genomic island between Penicillium spp. found in the food environment. Sequencing of the two leading filamentous fungi used in cheese making, P. roqueforti and P. camemberti, and comparison with the penicillin producer P. rubens reveals a 575 kb long genomic island in P. roqueforti--called Wallaby--present as identical fragments at non-homologous loci in P. camemberti and P. rubens. Wallaby is detected in Penicillium collections exclusively in strains from food environments. Wallaby encompasses about 250 predicted genes, some of which are probably involved in competition with microorganisms. The occurrence of multiple recent eukaryotic transfers in the food environment provides strong evidence for the importance of this understudied and probably underestimated phenomenon in eukaryotes.

  8. Tandem Repeat Regions within the Burkholderia pseudomallei Genome and their Application for High-Resolution Genotyping

    Science.gov (United States)

    2007-03-30

    multilocus sequence typing (MLST) [6]. RAPD detects differences in genomes by amplifying segments of unknown DNA. Drawbacks to this technique include the...Australian isolates using MLST exhibited no overlap between sequence types for the two countries [50]. However, phylogenetic analysis of these data...Aanensen DM, Pitt TL, Kinoshita R, Spratt BG: Multilocus sequence typing and evolutionary rela- tionships among the causative agents of melioidosis

  9. Genomic and Network Patterns of Schizophrenia Genetic Variation in Human Evolutionary Accelerated Regions

    OpenAIRE

    Xu, Ke; Schadt, Eric E.; Pollard, Katherine S.; Roussos, Panos; Joel T Dudley

    2015-01-01

    The population persistence of schizophrenia despite associated reductions in fitness and fecundity suggests that the genetic basis of schizophrenia has a complex evolutionary history. A recent meta-analysis of schizophrenia genome-wide association studies offers novel opportunities for assessment of the evolutionary trajectories of schizophrenia-associated loci. In this study, we hypothesize that components of the genetic architecture of schizophrenia are attributable to human lineage-specifi...

  10. Genome Sequences of 11 Brucella abortus Isolates from Persistently Infected Italian Regions

    OpenAIRE

    2015-01-01

    Bovine brucellosis, typically caused by Brucella abortus, has been eradicated from much of the developed world. However, the disease remains prevalent in southern Italy, persisting as a public and livestock health concern. We report here the whole-genome sequences of 11 isolates from cattle (Bos taurus) and water buffalo (Bubalus bubalis) that are representative of the current genetic diversity of B. abortus lineages circulating in Italy.

  11. Scanning for genes in large genomic regions: cosmid-based exon trapping of multiple exons in a single product.

    OpenAIRE

    Datson, N.A.; Vosse, E van de; Dauwerse, H.G.; Bout, M; van Ommen, G J; J T den Dunnen

    1996-01-01

    To facilitate the scanning of large genomic regions for the presence of exonic gene segments we have constructed a cosmid-based exon trap vector. The vector serves a dual purpose since it is also suitable for contig construction and physical mapping. The exon trap cassette of vector sCOGH1 consists of the human growth hormone gene driven by the mouse mettallothionein-1 promoter. Inserts are cloned in the multicloning site located in intron 2 of the hGH gene. The efficiency of the system is de...

  12. Genome-wide association of bipolar disorder suggests an enrichment of replicable associations in regions near genes.

    Directory of Open Access Journals (Sweden)

    Erin N Smith

    2011-06-01

    Full Text Available Although a highly heritable and disabling disease, bipolar disorder's (BD genetic variants have been challenging to identify. We present new genotype data for 1,190 cases and 401 controls and perform a genome-wide association study including additional samples for a total of 2,191 cases and 1,434 controls. We do not detect genome-wide significant associations for individual loci; however, across all SNPs, we show an association between the power to detect effects calculated from a previous genome-wide association study and evidence for replication (P = 1.5×10(-7. To demonstrate that this result is not likely to be a false positive, we analyze replication rates in a large meta-analysis of height and show that, in a large enough study, associations replicate as a function of power, approaching a linear relationship. Within BD, SNPs near exons exhibit a greater probability of replication, supporting an enrichment of reproducible associations near functional regions of genes. These results indicate that there is likely common genetic variation associated with BD near exons (±10 kb that could be identified in larger studies and, further, provide a framework for assessing the potential for replication when combining results from multiple studies.

  13. Relative effects of mutability and selection on single nucleotide polymorphisms in transcribed regions of the human genome

    Directory of Open Access Journals (Sweden)

    Amos Christopher I

    2008-06-01

    Full Text Available Abstract Motivation Single nucleotide polymorphisms (SNPs are the most common type of genetic variation in humans. However, the factors that affect SNP density are poorly understood. The goal of this study was to estimate the relative effects of mutability and selection on SNP density in transcribed regions of human genes. It is important for prediction of the regions that harbor functional polymorphisms. Results We used frequency-validated SNPs resulting from single-nucleotide substitutions. SNPs were subdivided into five functional categories: (i 5' untranslated region (UTR SNPs, (ii 3' UTR SNPs, (iii synonymous SNPs, (iv SNPs producing conservative missense mutations, and (v SNPs producing radical missense mutations. Each of these categories was further subdivided into nine mutational categories on the basis of the single-nucleotide substitution type. Thus, 45 functional/mutational categories were analyzed. The relative mutation rate in each mutational category was estimated on the basis of published data. The proportion of segregating sites (PSSs for each functional/mutational category was estimated by dividing the observed number of SNPs by the number of potential sites in the genome for a given functional/mutational category. By analyzing each functional group separately, we found significant positive correlations between PSSs and relative mutation rates (Spearman's correlation coefficient, at least r = 0.96, df = 9, P P = 0.001, suggesting that selection affects SNP density in transcribed regions of the genome. We used analyses of variance and covariance to estimate the relative effects of selection (functional category and mutability (relative mutation rate on the PSSs and found that approximately 87% of variation in PSS was due to variation in the mutation rate and approximately 13% was due to selection, suggesting that the probability that a site located in a transcribed region of a gene is polymorphic mostly depends on the mutability

  14. Genomic regions showing copy number variations associate with resistance or susceptibility to gastrointestinal nematodes in Angus cattle.

    Science.gov (United States)

    Hou, Yali; Liu, George E; Bickhart, Derek M; Matukumalli, Lakshmi K; Li, Congjun; Song, Jiuzhou; Gasbarre, Louis C; Van Tassell, Curtis P; Sonstegard, Tad S

    2012-03-01

    Genomic structural variation is an important and abundant source of genetic and phenotypic variation. We previously reported an initial analysis of copy number variations (CNVs) in Angus cattle selected for resistance or susceptibility to gastrointestinal nematodes. In this study, we performed a large-scale analysis of CNVs using SNP genotyping data from 472 animals of the same population. We detected 811 candidate CNV regions, which represent 141.8 Mb (~4.7%) of the genome. To investigate the functional impacts of CNVs, we created 2 groups of 100 individual animals with extremely low or high estimated breeding values of eggs per gram of feces and referred to these groups as parasite resistant (PR) or parasite susceptible (PS), respectively. We identified 297 (~51 Mb) and 282 (~48 Mb) CNV regions from PR and PS groups, respectively. Approximately 60% of the CNV regions were specific to the PS group or PR group of animals. Selected PR- or PS-specific CNVs were further experimentally validated by quantitative PCR. A total of 297 PR CNV regions overlapped with 437 Ensembl genes enriched in immunity and defense, like WC1 gene which uniquely expresses on gamma/delta T cells in cattle. Network analyses indicated that the PR-specific genes were predominantly involved in gastrointestinal disease, immunological disease, inflammatory response, cell-to-cell signaling and interaction, lymphoid tissue development, and cell death. By contrast, the 282 PS CNV regions contained 473 Ensembl genes which are overrepresented in environmental interactions. Network analyses indicated that the PS-specific genes were particularly enriched for inflammatory response, immune cell trafficking, metabolic disease, cell cycle, and cellular organization and movement.

  15. Genomic analysis of a 1 Mb region near the telomere of Hessian fly chromosome X2 and avirulence gene vH13

    Directory of Open Access Journals (Sweden)

    Chen Ming-Shun

    2006-01-01

    Full Text Available Abstract Background To have an insight into the Mayetiola destructor (Hessian fly genome, we performed an in silico comparative genomic analysis utilizing genetic mapping, genomic sequence and EST sequence data along with data available from public databases. Results Chromosome walking and FISH were utilized to identify a contig of 50 BAC clones near the telomere of the short arm of Hessian fly chromosome X2 and near the avirulence gene vH13. These clones enabled us to correlate physical and genetic distance in this region of the Hessian fly genome. Sequence data from these BAC ends encompassing a 760 kb region, and a fully sequenced and assembled 42.6 kb BAC clone, was utilized to perform a comparative genomic study. In silico gene prediction combined with BLAST analyses was used to determine putative orthology to the sequenced dipteran genomes of the fruit fly, Drosophila melanogaster, and the malaria mosquito, Anopheles gambiae, and to infer evolutionary relationships. Conclusion This initial effort enables us to advance our understanding of the structure, composition and evolution of the genome of this important agricultural pest and is an invaluable tool for a whole genome sequencing effort.

  16. Inflammatory peeling skin syndrome caused by homozygous genomic deletion in the PSORS1 region encompassing the CDSN gene.

    Science.gov (United States)

    Ishida-Yamamoto, Akemi; Furio, Laetitia; Igawa, Satomi; Honma, Masaru; Tron, Elodie; Malan, Valerie; Murakami, Masamoto; Hovnanian, Alain

    2014-01-01

    Peeling skin syndrome (PSS) type B is a rare recessive genodermatosis characterized by lifelong widespread, reddish peeling of the skin with pruritus. The disease is caused by small-scale mutations in the Corneodesmosin gene (CDSN) leading to premature termination codons. We report for the first time a Japanese case resulting from complete deletion of CDSN. Corneodesmosin was undetectable in the epidermis, and CDSN was unamplifiable by PCR. QMPSF analysis demonstrated deletion of CDSN exons inherited from each parent. Deletion mapping using microsatellite haplotyping, CGH array and PCR analysis established that the genomic deletion spanned 49-72 kb between HCG22 and TCF19, removing CDSN as well as five other genes within the psoriasis susceptibility region 1 (PSORS1) on 6p21.33. This observation widens the spectrum of molecular defects underlying PSS type B and shows that loss of these five genes from the PSORS1 region does not result in an additional cutaneous phenotype.

  17. Whole genome comparisons suggest random distribution of Mycobacterium ulcerans genotypes in a Buruli ulcer endemic region of Ghana.

    Science.gov (United States)

    Ablordey, Anthony S; Vandelannoote, Koen; Frimpong, Isaac A; Ahortor, Evans K; Amissah, Nana Ama; Eddyani, Miriam; Durnez, Lies; Portaels, Françoise; de Jong, Bouke C; Leirs, Herwig; Porter, Jessica L; Mangas, Kirstie M; Lam, Margaret M C; Buultjens, Andrew; Seemann, Torsten; Tobias, Nicholas J; Stinear, Timothy P

    2015-03-01

    Efforts to control the spread of Buruli ulcer--an emerging ulcerative skin infection caused by Mycobacterium ulcerans--have been hampered by our poor understanding of reservoirs and transmission. To help address this issue, we compared whole genomes from 18 clinical M. ulcerans isolates from a 30 km2 region within the Asante Akim North District, Ashanti region, Ghana, with 15 other M. ulcerans isolates from elsewhere in Ghana and the surrounding countries of Ivory Coast, Togo, Benin and Nigeria. Contrary to our expectations of finding minor DNA sequence variations among isolates representing a single M. ulcerans circulating genotype, we found instead two distinct genotypes. One genotype was closely related to isolates from neighbouring regions of Amansie West and Densu, consistent with the predicted local endemic clone, but the second genotype (separated by 138 single nucleotide polymorphisms [SNPs] from other Ghanaian strains) most closely matched M. ulcerans from Nigeria, suggesting another introduction of M. ulcerans to Ghana, perhaps from that country. Both the exotic genotype and the local Ghanaian genotype displayed highly restricted intra-strain genetic variation, with less than 50 SNP differences across a 5.2 Mbp core genome within each genotype. Interestingly, there was no discernible spatial clustering of genotypes at the local village scale. Interviews revealed no obvious epidemiological links among BU patients who had been infected with identical M. ulcerans genotypes but lived in geographically separate villages. We conclude that M. ulcerans is spread widely across the region, with multiple genotypes present in any one area. These data give us new perspectives on the behaviour of possible reservoirs and subsequent transmission mechanisms of M. ulcerans. These observations also show for the first time that M. ulcerans can be mobilized, introduced to a new area and then spread within a population. Potential reservoirs of M. ulcerans thus might include

  18. Whole genome comparisons suggest random distribution of Mycobacterium ulcerans genotypes in a Buruli ulcer endemic region of Ghana.

    Directory of Open Access Journals (Sweden)

    Anthony S Ablordey

    2015-03-01

    Full Text Available Efforts to control the spread of Buruli ulcer--an emerging ulcerative skin infection caused by Mycobacterium ulcerans--have been hampered by our poor understanding of reservoirs and transmission. To help address this issue, we compared whole genomes from 18 clinical M. ulcerans isolates from a 30 km2 region within the Asante Akim North District, Ashanti region, Ghana, with 15 other M. ulcerans isolates from elsewhere in Ghana and the surrounding countries of Ivory Coast, Togo, Benin and Nigeria. Contrary to our expectations of finding minor DNA sequence variations among isolates representing a single M. ulcerans circulating genotype, we found instead two distinct genotypes. One genotype was closely related to isolates from neighbouring regions of Amansie West and Densu, consistent with the predicted local endemic clone, but the second genotype (separated by 138 single nucleotide polymorphisms [SNPs] from other Ghanaian strains most closely matched M. ulcerans from Nigeria, suggesting another introduction of M. ulcerans to Ghana, perhaps from that country. Both the exotic genotype and the local Ghanaian genotype displayed highly restricted intra-strain genetic variation, with less than 50 SNP differences across a 5.2 Mbp core genome within each genotype. Interestingly, there was no discernible spatial clustering of genotypes at the local village scale. Interviews revealed no obvious epidemiological links among BU patients who had been infected with identical M. ulcerans genotypes but lived in geographically separate villages. We conclude that M. ulcerans is spread widely across the region, with multiple genotypes present in any one area. These data give us new perspectives on the behaviour of possible reservoirs and subsequent transmission mechanisms of M. ulcerans. These observations also show for the first time that M. ulcerans can be mobilized, introduced to a new area and then spread within a population. Potential reservoirs of M. ulcerans

  19. Genomic organization of duplicated major histocompatibility complex class I regions in Atlantic salmon (Salmo salar

    Directory of Open Access Journals (Sweden)

    Phillips Ruth B

    2007-07-01

    Full Text Available Abstract Background We have previously identified associations between major histocompatibility complex (MHC class I and resistance towards bacterial and viral pathogens in Atlantic salmon. To evaluate if only MHC or also closely linked genes contributed to the observed resistance we ventured into sequencing of the duplicated MHC class I regions of Atlantic salmon. Results Nine BACs covering more than 500 kb of the two duplicated MHC class I regions of Atlantic salmon were sequenced and the gene organizations characterized. Both regions contained the proteasome components PSMB8, PSMB9, PSMB9-like and PSMB10 in addition to the transporter for antigen processing TAP2, as well as genes for KIFC1, ZBTB22, DAXX, TAPBP, BRD2, COL11A2, RXRB and SLC39A7. The IA region contained the recently reported MHC class I Sasa-ULA locus residing approximately 50 kb upstream of the major Sasa-UBA locus. The duplicated class IB region contained an MHC class I locus resembling the rainbow trout UCA locus, but although transcribed it was a pseudogene. No other MHC class I-like genes were detected in the two duplicated regions. Two allelic BACs spanning the UBA locus had 99.2% identity over 125 kb, while the IA region showed 82.5% identity over 136 kb to the IB region. The Atlantic salmon IB region had an insert of 220 kb in comparison to the IA region containing three chitin synthase genes. Conclusion We have characterized the gene organization of more than 500 kb of the two duplicated MHC class I regions in Atlantic salmon. Although Atlantic salmon and rainbow trout are closely related, the gene organization of their IB region has undergone extensive gene rearrangements. The Atlantic salmon has only one class I UCA pseudogene in the IB region while trout contains the four MHC UCA, UDA, UEA and UFA class I loci. The large differences in gene content and most likely function of the salmon and trout class IB region clearly argues that sequencing of salmon will not

  20. DNA-guided establishment of nucleosome patterns within coding regions of a eukaryotic genome.

    Science.gov (United States)

    Beh, Leslie Y; Müller, Manuel M; Muir, Tom W; Kaplan, Noam; Landweber, Laura F

    2015-11-01

    A conserved hallmark of eukaryotic chromatin architecture is the distinctive array of well-positioned nucleosomes downstream from transcription start sites (TSS). Recent studies indicate that trans-acting factors establish this stereotypical array. Here, we present the first genome-wide in vitro and in vivo nucleosome maps for the ciliate Tetrahymena thermophila. In contrast with previous studies in yeast, we find that the stereotypical nucleosome array is preserved in the in vitro reconstituted map, which is governed only by the DNA sequence preferences of nucleosomes. Remarkably, this average in vitro pattern arises from the presence of subsets of nucleosomes, rather than the whole array, in individual Tetrahymena genes. Variation in GC content contributes to the positioning of these sequence-directed nucleosomes and affects codon usage and amino acid composition in genes. Given that the AT-rich Tetrahymena genome is intrinsically unfavorable for nucleosome formation, we propose that these "seed" nucleosomes--together with trans-acting factors--may facilitate the establishment of nucleosome arrays within genes in vivo, while minimizing changes to the underlying coding sequences.

  1. The Genomic Ancestry of Individuals from Different Geographical Regions of Brazil Is More Uniform Than Expected

    Science.gov (United States)

    Pena, Sérgio D. J.; Di Pietro, Giuliano; Fuchshuber-Moraes, Mateus; Genro, Julia Pasqualini; Hutz, Mara H.; Kehdy, Fernanda de Souza Gomes; Kohlrausch, Fabiana; Magno, Luiz Alexandre Viana; Montenegro, Raquel Carvalho; Moraes, Manoel Odorico; de Moraes, Maria Elisabete Amaral; de Moraes, Milene Raiol; Ojopi, Élida B.; Perini, Jamila A.; Racciopi, Clarice; Ribeiro-dos-Santos, Ândrea Kely Campos; Rios-Santos, Fabrício; Romano-Silva, Marco A.; Sortica, Vinicius A.; Suarez-Kurtz, Guilherme

    2011-01-01

    Based on pre-DNA racial/color methodology, clinical and pharmacological trials have traditionally considered the different geographical regions of Brazil as being very heterogeneous. We wished to ascertain how such diversity of regional color categories correlated with ancestry. Using a panel of 40 validated ancestry-informative insertion-deletion DNA polymorphisms we estimated individually the European, African and Amerindian ancestry components of 934 self-categorized White, Brown or Black Brazilians from the four most populous regions of the Country. We unraveled great ancestral diversity between and within the different regions. Especially, color categories in the northern part of Brazil diverged significantly in their ancestry proportions from their counterparts in the southern part of the Country, indicating that diverse regional semantics were being used in the self-classification as White, Brown or Black. To circumvent these regional subjective differences in color perception, we estimated the general ancestry proportions of each of the four regions in a form independent of color considerations. For that, we multiplied the proportions of a given ancestry in a given color category by the official census information about the proportion of that color category in the specific region, to arrive at a “total ancestry” estimate. Once such a calculation was performed, there emerged a much higher level of uniformity than previously expected. In all regions studied, the European ancestry was predominant, with proportions ranging from 60.6% in the Northeast to 77.7% in the South. We propose that the immigration of six million Europeans to Brazil in the 19th and 20th centuries - a phenomenon described and intended as the “whitening of Brazil” - is in large part responsible for dissipating previous ancestry dissimilarities that reflected region-specific population histories. These findings, of both clinical and sociological importance for Brazil, should also be

  2. QTL mapping in white spruce: gene maps and genomic regions underlying adaptive traits across pedigrees, years and environments

    Directory of Open Access Journals (Sweden)

    Meirmans Patrick G

    2011-03-01

    Full Text Available Abstract Background The genomic architecture of bud phenology and height growth remains poorly known in most forest trees. In non model species, QTL studies have shown limited application because most often QTL data could not be validated from one experiment to another. The aim of our study was to overcome this limitation by basing QTL detection on the construction of genetic maps highly-enriched in gene markers, and by assessing QTLs across pedigrees, years, and environments. Results Four saturated individual linkage maps representing two unrelated mapping populations of 260 and 500 clonally replicated progeny were assembled from 471 to 570 markers, including from 283 to 451 gene SNPs obtained using a multiplexed genotyping assay. Thence, a composite linkage map was assembled with 836 gene markers. For individual linkage maps, a total of 33 distinct quantitative trait loci (QTLs were observed for bud flush, 52 for bud set, and 52 for height growth. For the composite map, the corresponding numbers of QTL clusters were 11, 13, and 10. About 20% of QTLs were replicated between the two mapping populations and nearly 50% revealed spatial and/or temporal stability. Three to four occurrences of overlapping QTLs between characters were noted, indicating regions with potential pleiotropic effects. Moreover, some of the genes involved in the QTLs were also underlined by recent genome scans or expression profile studies. Overall, the proportion of phenotypic variance explained by each QTL ranged from 3.0 to 16.4% for bud flush, from 2.7 to 22.2% for bud set, and from 2.5 to 10.5% for height growth. Up to 70% of the total character variance could be accounted for by QTLs for bud flush or bud set, and up to 59% for height growth. Conclusions This study provides a basic understanding of the genomic architecture related to bud flush, bud set, and height growth in a conifer species, and a useful indicator to compare with Angiosperms. It will serve as a basic

  3. Potential virulence determinants in terminal regions of variola smallpox virus genome.

    Science.gov (United States)

    Massung, R F; Esposito, J J; Liu, L I; Qi, J; Utterback, T R; Knight, J C; Aubin, L; Yuran, T E; Parsons, J M; Loparev, V N

    Smallpox eradication culminated the most successful antimicrobial campaign in medical history. To characterize further the linear double-stranded DNA genome of the aetiological agent of smallpox, we have determined the entire nucleotide sequence of the highly virulent variola major virus, strain Bangladesh-1975 (VAR-BSH; 186,102 base pairs, 33.7% G + C; Genbank accession number, L22579). Here we highlight features of the molecule and focus on a few of the 187 putative proteins that probably contribute to pathogenicity and virus host-range properties. One hundred and fifty proteins were markedly similar to those of vaccinia virus (smallpox vaccine), for which a complete sequence has been reported for strain Copenhagen (VAC-CPN; 191,636 base pairs, 33.3% G + C). The remaining 37 proteins reflected variola-specific sequences or open reading frame divergences for variant proteins, which are often truncated or elongated compared with their vaccinia counterparts.

  4. Rearrangements of archetypal regulatory regions in JC virus genomes from urine.

    Science.gov (United States)

    Agostini, H T; Ryschkewitsch, C F; Stoner, G L

    1998-01-01

    The regulatory region of progressive multifocal leukoencephalopathy-type JC virus (JCV) is rearranged in each host by a process of deletion and duplication. Of the more than 40 that have been examined, no two regulatory regions have been rearranged identically in the brain. The substrate for this rearrangement appears to be a highly stable archetypal regulatory region excreted in the urine. Its role as the transmissible form of the virus, although inferred, has never been proven. We have now amplified by PCR and cycle-sequenced the regulatory regions from 48 urinary strains of the virus. We find that the urinary form of the regulatory region is not entirely stable. Short deletions and duplications in the range of 2-16 bp were observed in seven of these strains. One of these, an inverted repeat, is a pattern of rearrangement not yet found in the brain. Two others (#208 and 230) showed a 2-bp deletion at position nos. 221 and 222, and an unusual mutation at position no. 219. These two urines were collected in different states of the USA at different times and analysed months apart. It is very unlikely that these unusual changes represent sample contamination or that they arose independently. This finding indicates that archetypal forms of the JCV regulatory region are infectious, despite their relative inactivity in tissue culture. While changes in the archetypal structure can be found, it is clear that rearrangements in the kidney are rare or rarely infectious.

  5. Mapping codon usage of the translation initiation region in porcine reproductive and respiratory syndrome virus genome

    Directory of Open Access Journals (Sweden)

    Dou Yong-xi

    2011-10-01

    Full Text Available Abstract Background Porcine reproductive and respitatory syndrome virus (PRRSV is a recently emerged pathogen and severely affects swine populations worldwide. The replication of PRRSV is tightly controlled by viral gene expression and the codon usage of translation initiation region within each gene could potentially regulate the translation rate. Therefore, a better understanding of the codon usage pattern of the initiation translation region would shed light on the regulation of PRRSV gene expression. Results In this study, the codon usage in the translation initiation region and in the whole coding sequence was compared in PRRSV ORF1a and ORFs2-7. To investigate the potential role of codon usage in affecting the translation initiation rate, we established a codon usage model for PRRSV translation initiation region. We observed that some non-preferential codons are preferentially used in the translation initiation region in particular ORFs. Although some positions vary with codons, they intend to use codons with negative CUB. Furthermore, our model of codon usage showed that the conserved pattern of CUB is not directly consensus with the conserved sequence, but shaped under the translation selection. Conclusions The non-variation pattern with negative CUB in the PRRSV translation initiation region scanned by ribosomes is considered the rate-limiting step in the translation process.

  6. The Variable Regions of Lactobacillus rhamnosus Genomes Reveal the Dynamic Evolution of Metabolic and Host-Adaptation Repertoires

    NARCIS (Netherlands)

    Ceapa, C.D.; Davids, M.; Ritari, Jarmo; Lambert, J.; Wels, M.; Douillard, François P.; Smokvina, Tamara; Vos, de Willem M.; Knol, J.; Kleerebezem, M.

    2016-01-01

    Lactobacillus rhamnosus is a diverse Gram-positive species with strains isolated from different ecological niches. Here, we report the genome sequence analysis of 40 diverse strains of L. rhamnosus and their genomic comparison, with a focus on the variable genome. Genomic comparison of 40 L. rhamnos

  7. Genome sequence of the acid-tolerant Desulfovibrio sp. DV isolated from the sediments of a Pb-Zn mine tailings dam in the Chita region, Russia

    Directory of Open Access Journals (Sweden)

    Anastasiia Kovaliova

    2017-03-01

    Full Text Available Here we report the draft genome sequence of the acid-tolerant Desulfovibrio sp. DV isolated from the sediments of a Pb-Zn mine tailings dam in the Chita region, Russia. The draft genome has a size of 4.9 Mb and encodes multiple K+-transporters and proton-consuming decarboxylases. The phylogenetic analysis based on concatenated ribosomal proteins revealed that strain DV clusters together with the acid-tolerant Desulfovibrio sp. TomC and Desulfovibrio magneticus. The draft genome sequence and annotation have been deposited at GenBank under the accession number MLBG00000000.

  8. Identification and characterization of genomic regions on chromosomes 4 and 8 that control the rate of photosynthesis in rice leaves.

    Science.gov (United States)

    Adachi, Shunsuke; Tsuru, Yukiko; Nito, Naoko; Murata, Kazumasa; Yamamoto, Toshio; Ebitani, Takeshi; Ookawa, Taiichiro; Hirasawa, Tadashi

    2011-03-01

    DNA marker-assisted selection appears to be a promising strategy for improving rates of leaf photosynthesis in rice. The rate of leaf photosynthesis was significantly higher in a high-yielding indica variety, Habataki, than in the most popular Japanese variety, Koshihikari, at the full heading stage as a result of the higher level of leaf nitrogen at the same rate of application of nitrogen and the higher stomatal conductance even when the respective levels of leaf nitrogen were the same. The higher leaf nitrogen content of Habataki was caused by the greater accumulation of nitrogen by plants. The higher stomatal conductance of Habataki was caused by the higher hydraulic conductance. Using progeny populations and selected lines derived from a cross between Koshihikari and Habataki, it was possible to identify the genomic regions responsible for the rate of photosynthesis within a 2.1 Mb region between RM17459 and RM17552 and within a 1.2 Mb region between RM6999 and RM22529 on the long arm of chromosome 4 and on the short arm of chromosome 8, respectively. The designated region on chromosome 4 of Habataki was responsible for both the increase in the nitrogen content of leaves and hydraulic conductance in the plant by increasing the root surface area. The designated region on chromosome 8 of Habataki was responsible for the increase in hydraulic conductance by increasing the root hydraulic conductivity. The results suggest that it may be possible to improve photosynthesis in rice leaves by marker-assisted selection that focuses on these regions of chromosomes 4 and 8.

  9. Physical mapping of a large plant genome using global high-information-content-fingerprinting: the distal region of the wheat ancestor Aegilops tauschii chromosome 3DS

    Directory of Open Access Journals (Sweden)

    You Frank M

    2010-06-01

    Full Text Available Abstract Background Physical maps employing libraries of bacterial artificial chromosome (BAC clones are essential for comparative genomics and sequencing of large and repetitive genomes such as those of the hexaploid bread wheat. The diploid ancestor of the D-genome of hexaploid wheat (Triticum aestivum, Aegilops tauschii, is used as a resource for wheat genomics. The barley diploid genome also provides a good model for the Triticeae and T. aestivum since it is only slightly larger than the ancestor wheat D genome. Gene co-linearity between the grasses can be exploited by extrapolating from rice and Brachypodium distachyon to Ae. tauschii or barley, and then to wheat. Results We report the use of Ae. tauschii for the construction of the physical map of a large distal region of chromosome arm 3DS. A physical map of 25.4 Mb was constructed by anchoring BAC clones of Ae. tauschii with 85 EST on the Ae. tauschii and barley genetic maps. The 24 contigs were aligned to the rice and B. distachyon genomic sequences and a high density SNP genetic map of barley. As expected, the mapped region is highly collinear to the orthologous chromosome 1 in rice, chromosome 2 in B. distachyon and chromosome 3H in barley. However, the chromosome scale of the comparative maps presented provides new insights into grass genome organization. The disruptions of the Ae. tauschii-rice and Ae. tauschii-Brachypodium syntenies were identical. We observed chromosomal rearrangements between Ae. tauschii and barley. The comparison of Ae. tauschii physical and genetic maps showed that the recombination rate across the region dropped from 2.19 cM/Mb in the distal region to 0.09 cM/Mb in the proximal region. The size of the gaps between contigs was evaluated by comparing the recombination rate along the map with the local recombination rates calculated on single contigs. Conclusions The physical map reported here is the first physical map using fingerprinting of a complete

  10. Isolation of Specific Genomic Regions and Identification of Their Associated Molecules by Engineered DNA-Binding Molecule-Mediated Chromatin Immunoprecipitation (enChIP) Using the CRISPR System and TAL Proteins.

    Science.gov (United States)

    Fujii, Hodaka; Fujita, Toshitsugu

    2015-09-09

    Comprehensive understanding of genome functions requires identification of molecules (proteins, RNAs, genomic regions, etc.) bound to specific genomic regions of interest in vivo. To perform biochemical and molecular biological analysis of specific genomic regions, we developed engineered DNA-binding molecule-mediated chromatin immunoprecipitation (enChIP) to purify genomic regions of interest. In enChIP, specific genomic regions are tagged for biochemical purification using engineered DNA-binding molecules, such as transcription activator-like (TAL) proteins and a catalytically inactive form of the clustered regularly interspaced short palindromic repeats (CRISPR) system. enChIP is a comprehensive approach that emphasizes non-biased search using next-generation sequencing (NGS), microarrays, mass spectrometry (MS), and other methods. Moreover, this approach is not restricted to cultured cell lines and can be easily extended to organisms. In this review, we discuss applications of enChIP to elucidating the molecular mechanisms underlying genome functions.

  11. Characterization of untranslated regions of the salmonid alphavirus 3 (SAV3 genome and construction of a SAV3 based replicon

    Directory of Open Access Journals (Sweden)

    Rimstad Espen

    2009-10-01

    Full Text Available Abstract Salmonid alphavirus (SAV causes disease in farmed salmonid fish and is divided into different genetic subtypes (SAV1-6. Here we report the cloning and characterization of the 5'- and 3'- untranslated regions (UTR of a SAV3 isolated from Atlantic salmon in Norway. The sequences of the UTRs are very similar to those of SAV1 and SAV2, but single nucleotide polymorphisms are present, also in the 3' - conserved sequence element (3'-CSE. Prediction of the RNA secondary structure suggested putative stem-loop structures in both the 5'- and 3'-ends, similar to those of alphaviruses from the terrestrial environment, indicating that the general genome replication initiation strategy for alphaviruses is also utilized by SAV. A DNA replicon vector, pmSAV3, based upon a pVAX1 backbone and the SAV3 genome was constructed, and the SAV3 non-structural proteins were used to express a reporter gene controlled by the SAV3 subgenomic promoter. Transfection of pmSAV3 into CHSE and BF2 cell lines resulted in expression of the reporter protein, confirming that the cloned SAV3 replication apparatus and UTRs are functional in fish cells.

  12. Characterization of untranslated regions of the salmonid alphavirus 3 (SAV3) genome and construction of a SAV3 based replicon.

    Science.gov (United States)

    Karlsen, Marius; Villoing, Stephane; Rimstad, Espen; Nylund, Are

    2009-10-27

    Salmonid alphavirus (SAV) causes disease in farmed salmonid fish and is divided into different genetic subtypes (SAV1-6). Here we report the cloning and characterization of the 5'- and 3'- untranslated regions (UTR) of a SAV3 isolated from Atlantic salmon in Norway. The sequences of the UTRs are very similar to those of SAV1 and SAV2, but single nucleotide polymorphisms are present, also in the 3' - conserved sequence element (3'-CSE). Prediction of the RNA secondary structure suggested putative stem-loop structures in both the 5'- and 3'-ends, similar to those of alphaviruses from the terrestrial environment, indicating that the general genome replication initiation strategy for alphaviruses is also utilized by SAV. A DNA replicon vector, pmSAV3, based upon a pVAX1 backbone and the SAV3 genome was constructed, and the SAV3 non-structural proteins were used to express a reporter gene controlled by the SAV3 subgenomic promoter. Transfection of pmSAV3 into CHSE and BF2 cell lines resulted in expression of the reporter protein, confirming that the cloned SAV3 replication apparatus and UTRs are functional in fish cells.

  13. Identification and characterization of regions of difference between the Salmonella Gallinarum biovar Gallinarum and the Salmonella Gallinarum biovar Pullorum genomes.

    Science.gov (United States)

    Batista, Diego Felipe Alves; Freitas Neto, Oliveiro Caetano; Barrow, Paul Andrew; Oliveira, Marcos Túlio de; Almeida, Adriana Maria; Ferraudo, Antonio Sergio; Berchieri, Angelo

    2015-03-01

    Salmonella Gallinarum is the causative agent of fowl typhoid, a severe septicaemic disease that affects birds of all ages, whereas S. Pullorum causes pullorum disease, a systemic disorder affecting primarily young birds. A proportion of birds with pullorum disease become carriers and are thereby able to transmit S. Pullorum vertically. Although these two pathogens cause distinct diseases, they are otherwise phenotypically and genetically similar. Therefore, the small variations that lead to the differences in virulence must have a genetic basis which currently is unknown. In the present study, we compared the genome sequences of S. Gallinarum (strains: SG287/91 and SG9) and S. Pullorum (strains: SP_CDC, SP_RKS, SP_FCAV, SP_S06) and identified 223 regions of difference (RODs), characterized by indels which were detected by using the software Artemis Comparison Tool. Some of the RODs led to pseudogenes frequently formed by frameshifts and premature stop codons in genes primarily involved in virulence and metabolism. We further verified the presence of some conserved RODs by PCR in 26 isolates of S. Gallinarum and 17 of S. Pullorum in order to extrapolate data analyses from genome comparison to field strains. The variations observed in virulence-related genes of S. Gallinarum and S. Pullorum appear not to be sufficient to explain the differences between the distinct biology of infection of fowl typhoid and pullorum disease. Thus, we suggest that the identified pseudogenes affecting metabolism might play a greater role during infection than previously thought.

  14. Microalterations of inherently unstable genomic regions in rat mammary carcinomas as revealed by long oligonucleotide array-based comparative genomic hybridization

    NARCIS (Netherlands)

    Adamovic, T.; McAllister, D.; Guryev, V.; Wang, X.; Andrae, J.W.; Cuppen, E.; Jacob, H.; Sugg, S.L.

    2009-01-01

    The presence of copy number variants in normal genomes poses a challenge to identify small genuine somatic copy number changes in high-resolution cancer genome profiling studies due to the use of unpaired reference DNA. Another problem is the well-known rearrangements of immunoglobulin and T-cell re

  15. Microalterations of Inherently Unstable Genomic Regions in Rat Mammary Carcinomas as Revealed by Long Oligonucleotide Array-Based Comparative Genomic Hybridization

    NARCIS (Netherlands)

    Adamovic, Tatjana; McAllister, Donna; Guryev, Victor; Wang, Xujing; Andrae, Jaime Wendt; Cuppen, Edwin; Jacob, Howard J.; Sugg, Sonia L.

    2009-01-01

    The presence of copy number variants in normal genomes poses a challenge to identify small genuine somatic copy number changes in high-resolution cancer genome profiling studies due to the use of unpaired reference DNA. Another problem is the well-known rearrangements of immunoglobulin and T-cell re

  16. DNA copy number analysis of fresh and formalin-fixed specimens by shallow whole-genome sequencing with identification and exclusion of problematic regions in the genome assembly

    NARCIS (Netherlands)

    Scheinin, I.; Sie, D.; Bengtsson, H.; Wiel, M.A. van de; Olshen, A.B.; Thuijl, H.F. van; Essen, H.F. van; Eijk, P.P.; Rustenburg, F.; Meijer, G.A.; Reijneveld, J.C.; Wesseling, P.; Pinkel, D.; Albertson, D.G.; Ylstra, B.

    2014-01-01

    Detection of DNA copy number aberrations by shallow whole-genome sequencing (WGS) faces many challenges, including lack of completion and errors in the human reference genome, repetitive sequences, polymorphisms, variable sample quality, and biases in the sequencing procedures. Formalin-fixed paraff

  17. Genome-wide association study reveals regions associated with gestation length in two pig populations.

    Science.gov (United States)

    Hidalgo, A M; Lopes, M S; Harlizius, B; Bastiaansen, J W M

    2016-04-01

    Reproduction traits, such as gestation length (GLE), play an important role in dam line breeding in pigs. The objective of our study was to identify single nucleotide polymorphisms (SNPs) that are associated with GLE in two pig populations. Genotypes and deregressed breeding values were available for 2081 Dutch Landrace-based (DL) and 2301 Large White-based (LW) pigs. We identified two QTL regions for GLE, one in each population. For DL, three associated SNPs were detected in one QTL region spanning 0.52 Mbp on Sus scrofa chromosome (SSC) 2. For LW, four associated SNPs were detected in one region of 0.14 Mbp on SSC5. The region on SSC2 contains the heparin-binding EGF-like growth factor (HBEGF) gene, which promotes embryo implantation and has been described to be involved in embryo survival throughout gestation. The associated SNP can be used for marker-assisted selection in the studied populations, and further studies of the HBEGF gene are warranted to investigate its role in GLE.

  18. Comparative Genomics of the Conjugation Region of F-like Plasmids: Five Shades of F

    Science.gov (United States)

    Fernandez-Lopez, Raul; de Toro, Maria; Moncalian, Gabriel; Garcillan-Barcia, M. Pilar; de la Cruz, Fernando

    2016-01-01

    The F plasmid is the foremost representative of a large group of conjugative plasmids, prevalent in Escherichia coli, and widely distributed among the Enterobacteriaceae. These plasmids are of clinical relevance, given their frequent association with virulence determinants, colicins, and antibiotic resistance genes. Originally defined by their sensitivity to certain male-specific phages, IncF plasmids share a conserved conjugative system and regulatory circuits. In order to determine whether the genetic architecture and regulation circuits are preserved among these plasmids, we analyzed the natural diversity of F-like plasmids. Using the relaxase as a phylogenetic marker, we identified 256 plasmids belonging to the IncF/ MOBF12group, present as complete DNA sequences in the NCBI database. By comparative genomics, we identified five major groups of F-like plasmids. Each shows a particular operon structure and alternate regulatory systems. Results show that the IncF/MOBF12 conjugation gene cluster conforms a diverse and ancient group, which evolved alternative regulatory schemes in its adaptation to different environments and bacterial hosts. PMID:27891505

  19. Genetic and physical mapping of the genomic region spanning CMT4A

    Energy Technology Data Exchange (ETDEWEB)

    Othmane, K.B.; Loeb, D.; Roses, A.D. [Duke Univ. Medical Center, Durham, NC (United States)] [and others

    1994-09-01

    Autosomal recessive Charcot-Marie-Tooth disease (CMT4) is a severe childhood neuropathy classified into three types: A, B, and C. We previously mapped CMT4A to chromosome 8q13-q21 in four large Tunisian families. Analysis of recombination events suggested the order: cent.-D8S279-(D8S286,D8S164, CMT4A)-D8S84-tel. Families with types B and C were subsequently typed and linkage for these types was excluded for the CMT4A region and other known CMT loci. Recently, the gene for a major peripheral myelin protein (PMP2) was mapped by FISH to chromosome 8q21-q22 and therefore appeared to be a strong candidate gene for CMT4A. We used SSCP analysis, DNA sequencing, FISH and YAC mapping analysis, and demonstrated that PMP2 is not the defect in CMT4A. Using physical mapping data, we sublocalized a new genethon marker (D8S548) to the CMT4A region between D8S286 and D8S164. All affected CMT4A patients were homozygotes for this polymorphic microsatellite as expected from its physical localization. We screened the CEPH megabase YAC library using the closest markers; over 30 YACs were isolated and characterized by PFGE. FISH analysis revealed about 16% chimeras. The YACs span the 8 cM region between D8S279 and PMP2 (mapped distal to D8S84), with a current 1 cM gap between D8S164 and D8S84. We are currently using Alu-PCR and vectorette to develop end clones in order to identify new YACs in the region and further close this gap. Alu-PCR fragments have identified several new microsatellites in the region which can be used for additional mapping of the CMT4A gene.

  20. Transcriptional analyses of the region of the equine herpesvirus type 4 genome encoding glycoproteins I and E.

    Science.gov (United States)

    Damiani, A M; Jang, H K; Matsumura, T; Yokoyama, N; Miyazawa, T; Mikami, T

    1999-01-01

    To map the transcripts encoding the equine herpesvirus type 4 (EHV-4) glycoproteins I (gI) and E (gE), transcriptional analyses were performed at the right part of the unique short segment of EHV-4 genome. The results revealed that the gI gene is encoded by a 1.6-kb transcript which is 3' coterminal with a 3.0-kb gD mRNA while the gE gene is encoded by two transcripts of 3.5- and 2.4-kb in size. The transcriptional patterns described in this study for the EHV-4 gI and gE are similar to those found in the equivalent region of herpes simplex virus type 1 and feline herpesvirus type 1. Characterization of EHV-4 gI and gE glycoprotein genes may facilitate future studies to define their roles in the EHV-4 infection.

  1. High-throughput engineering of a mammalian genome reveals building principles of methylation states at CG rich regions.

    Science.gov (United States)

    Krebs, Arnaud R; Dessus-Babus, Sophie; Burger, Lukas; Schübeler, Dirk

    2014-09-26

    The majority of mammalian promoters are CpG islands; regions of high CG density that require protection from DNA methylation to be functional. Importantly, how sequence architecture mediates this unmethylated state remains unclear. To address this question in a comprehensive manner, we developed a method to interrogate methylation states of hundreds of sequence variants inserted at the same genomic site in mouse embryonic stem cells. Using this assay, we were able to quantify the contribution of various sequence motifs towards the resulting DNA methylation state. Modeling of this comprehensive dataset revealed that CG density alone is a minor determinant of their unmethylated state. Instead, these data argue for a principal role for transcription factor binding sites, a prediction confirmed by testing synthetic mutant libraries. Taken together, these findings establish the hierarchy between the two cis-encoded mechanisms that define the DNA methylation state and thus the transcriptional competence of CpG islands.

  2. Comparative sequence analysis of the Ghd7 orthologous regions revealed movement of Ghd7 in the grass genomes.

    Directory of Open Access Journals (Sweden)

    Lu Yang

    Full Text Available Ghd7 is an important rice gene that has a major effect on several agronomic traits, including yield. To reveal the origin of Ghd7 and sequence evolution of this locus, we performed a comparative sequence analysis of the Ghd7 orthologous regions from ten diploid Oryza species, Brachypodium distachyon, sorghum and maize. Sequence analysis demonstrated high gene collinearity across the genus Oryza and a disruption of collinearity among non-Oryza species. In particular, Ghd7 was not present in orthologous positions except in Oryza species. The Ghd7 regions were found to have low gene densities and high contents of repetitive elements, and that the sizes of orthologous regions varied tremendously. The large transposable element contents resulted in a high frequency of pseudogenization and gene movement events surrounding the Ghd7 loci. Annotation information and cytological experiments have indicated that Ghd7 is a heterochromatic gene. Ghd7 orthologs were identified in B. distachyon, sorghum and maize by phylogenetic analysis; however, the positions of orthologous genes differed dramatically as a consequence of gene movements in grasses. Rather, we identified sequence remnants of gene movement of Ghd7 mediated by illegitimate recombination in the B. distachyon genome.

  3. Re-annotation of the physical map of Glycine max for polyploid-like regions by BAC end sequence driven whole genome shotgun read assembly

    Directory of Open Access Journals (Sweden)

    Shultz Jeffry

    2008-07-01

    Full Text Available Abstract Background Many of the world's most important food crops have either polyploid genomes or homeologous regions derived from segmental shuffling following polyploid formation. The soybean (Glycine max genome has been shown to be composed of approximately four thousand short interspersed homeologous regions with 1, 2 or 4 copies per haploid genome by RFLP analysis, microsatellite anchors to BACs and by contigs formed from BAC fingerprints. Despite these similar regions,, the genome has been sequenced by whole genome shotgun sequence (WGS. Here the aim was to use BAC end sequences (BES derived from three minimum tile paths (MTP to examine the extent and homogeneity of polyploid-like regions within contigs and the extent of correlation between the polyploid-like regions inferred from fingerprinting and the polyploid-like sequences inferred from WGS matches. Results Results show that when sequence divergence was 1–10%, the copy number of homeologous regions could be identified from sequence variation in WGS reads overlapping BES. Homeolog sequence variants (HSVs were single nucleotide polymorphisms (SNPs; 89% and single nucleotide indels (SNIs 10%. Larger indels were rare but present (1%. Simulations that had predicted fingerprints of homeologous regions could be separated when divergence exceeded 2% were shown to be false. We show that a 5–10% sequence divergence is necessary to separate homeologs by fingerprinting. BES compared to WGS traces showed polyploid-like regions with less than 1% sequence divergence exist at 2.3% of the locations assayed. Conclusion The use of HSVs like SNPs and SNIs to characterize BACs wil improve contig building methods. The implications for bioinformatic and functional annotation of polyploid and paleopolyploid genomes show that a combined approach of BAC fingerprint based physical maps, WGS sequence and HSV-based partitioning of BAC clones from homeologous regions to separate contigs will allow reliable de

  4. Integrated pathway-based approach identifies association between genomic regions at CTCF and CACNB2 and schizophrenia.

    Directory of Open Access Journals (Sweden)

    Dilafruz Juraeva

    2014-06-01

    Full Text Available In the present study, an integrated hierarchical approach was applied to: (1 identify pathways associated with susceptibility to schizophrenia; (2 detect genes that may be potentially affected in these pathways since they contain an associated polymorphism; and (3 annotate the functional consequences of such single-nucleotide polymorphisms (SNPs in the affected genes or their regulatory regions. The Global Test was applied to detect schizophrenia-associated pathways using discovery and replication datasets comprising 5,040 and 5,082 individuals of European ancestry, respectively. Information concerning functional gene-sets was retrieved from the Kyoto Encyclopedia of Genes and Genomes, Gene Ontology, and the Molecular Signatures Database. Fourteen of the gene-sets or pathways identified in the discovery dataset were confirmed in the replication dataset. These include functional processes involved in transcriptional regulation and gene expression, synapse organization, cell adhesion, and apoptosis. For two genes, i.e. CTCF and CACNB2, evidence for association with schizophrenia was available (at the gene-level in both the discovery study and published data from the Psychiatric Genomics Consortium schizophrenia study. Furthermore, these genes mapped to four of the 14 presently identified pathways. Several of the SNPs assigned to CTCF and CACNB2 have potential functional consequences, and a gene in close proximity to CACNB2, i.e. ARL5B, was identified as a potential gene of interest. Application of the present hierarchical approach thus allowed: (1 identification of novel biological gene-sets or pathways with potential involvement in the etiology of schizophrenia, as well as replication of these findings in an independent cohort; (2 detection of genes of interest for future follow-up studies; and (3 the highlighting of novel genes in previously reported candidate regions for schizophrenia.

  5. Origin of the CMS gene locus in rapeseed cybrid mitochondria: active and inactive recombination produces the complex CMS gene region in the mitochondrial genomes of Brassicaceae.

    Science.gov (United States)

    Oshima, Masao; Kikuchi, Rie; Imamura, Jun; Handa, Hirokazu

    2010-01-01

    CMS (cytoplasmic male sterile) rapeseed is produced by asymmetrical somatic cell fusion between the Brassica napus cv. Westar and the Raphanus sativus Kosena CMS line (Kosena radish). The CMS rapeseed contains a CMS gene, orf125, which is derived from Kosena radish. Our sequence analyses revealed that the orf125 region in CMS rapeseed originated from recombination between the orf125/orfB region and the nad1C/ccmFN1 region by way of a 63 bp repeat. A precise sequence comparison among the related sequences in CMS rapeseed, Kosena radish and normal rapeseed showed that the orf125 region in CMS rapeseed consisted of the Kosena orf125/orfB region and the rapeseed nad1C/ccmFN1 region, even though Kosena radish had both the orf125/orfB region and the nad1C/ccmFN1 region in its mitochondrial genome. We also identified three tandem repeat sequences in the regions surrounding orf125, including a 63 bp repeat, which were involved in several recombination events. Interestingly, differences in the recombination activity for each repeat sequence were observed, even though these sequences were located adjacent to each other in the mitochondrial genome. We report results indicating that recombination events within the mitochondrial genomes are regulated at the level of specific repeat sequences depending on the cellular environment.

  6. Pan-Genome Analysis of Human Gastric Pathogen H. pylori: Comparative Genomics and Pathogenomics Approaches to Identify Regions Associated with Pathogenicity and Prediction of Potential Core Therapeutic Targets

    Directory of Open Access Journals (Sweden)

    Amjad Ali

    2015-01-01

    Full Text Available Helicobacter pylori is a human gastric pathogen implicated as the major cause of peptic ulcer and second leading cause of gastric cancer (~70% around the world. Conversely, an increased resistance to antibiotics and hindrances in the development of vaccines against H. pylori are observed. Pan-genome analyses of the global representative H. pylori isolates consisting of 39 complete genomes are presented in this paper. Phylogenetic analyses have revealed close relationships among geographically diverse strains of H. pylori. The conservation among these genomes was further analyzed by pan-genome approach; the predicted conserved gene families (1,193 constitute ~77% of the average H. pylori genome and 45% of the global gene repertoire of the species. Reverse vaccinology strategies have been adopted to identify and narrow down the potential core-immunogenic candidates. Total of 28 nonhost homolog proteins were characterized as universal therapeutic targets against H. pylori based on their functional annotation and protein-protein interaction. Finally, pathogenomics and genome plasticity analysis revealed 3 highly conserved and 2 highly variable putative pathogenicity islands in all of the H. pylori genomes been analyzed.

  7. Genomic scan of selective sweeps in thin and fat tail sheep breeds for identifying of candidate regions associated with fat deposition

    Directory of Open Access Journals (Sweden)

    Moradi Mohammad Hossein

    2012-02-01

    Full Text Available Abstract Background Identification of genomic regions that have been targets of selection for phenotypic traits is one of the most important and challenging areas of research in animal genetics. However, currently there are relatively few genomic regions identified that have been subject to positive selection. In this study, a genome-wide scan using ~50,000 Single Nucleotide Polymorphisms (SNPs was performed in an attempt to identify genomic regions associated with fat deposition in fat-tail breeds. This trait and its modification are very important in those countries grazing these breeds. Results Two independent experiments using either Iranian or Ovine HapMap genotyping data contrasted thin and fat tail breeds. Population differentiation using FST in Iranian thin and fat tail breeds revealed seven genomic regions. Almost all of these regions overlapped with QTLs that had previously been identified as affecting fat and carcass yield traits in beef and dairy cattle. Study of selection sweep signatures using FST in thin and fat tail breeds sampled from the Ovine HapMap project confirmed three of these regions located on Chromosomes 5, 7 and X. We found increased homozygosity in these regions in favour of fat tail breeds on chromosome 5 and X and in favour of thin tail breeds on chromosome 7. Conclusions In this study, we were able to identify three novel regions associated with fat deposition in thin and fat tail sheep breeds. Two of these were associated with an increase of homozygosity in the fat tail breeds which would be consistent with selection for mutations affecting fat tail size several thousand years after domestication.

  8. RNA interactions in the 5' region of the HIV-1 genome

    DEFF Research Database (Denmark)

    Damgaard, Christian Kroun; Andersen, Ebbe Sloth; Knudsen, Bjarne;

    2004-01-01

    present a secondary structure model for the entire 744 nucleotide fragment, which incorporates previously recognized and novel structural elements. In particular, our data provided strong evidence for a long-distance interaction between the region encompassing the AUG Gag initiation codon and an upstream...... a combination of bioinformatics, enzymatic probing, native gel electrophoresis, and UV-crosslinking experiments. We used a recently developed RNA folding algorithm (Pfold) to predict the common secondary structure of an alignment of 20 divergent HIV-1 sequences. Combining this analysis with biochemical data, we...

  9. Positive selection in the chromosome 16 VKORC1 genomic region has contributed to the variability of anticoagulant response in humans.

    Directory of Open Access Journals (Sweden)

    Blandine Patillon

    Full Text Available VKORC1 (vitamin K epoxide reductase complex subunit 1, 16p11.2 is the main genetic determinant of human response to oral anticoagulants of antivitamin K type (AVK. This gene was recently suggested to be a putative target of positive selection in East Asian populations. In this study, we genotyped the HGDP-CEPH Panel for six VKORC1 SNPs and downloaded chromosome 16 genotypes from the HGDP-CEPH database in order to characterize the geographic distribution of footprints of positive selection within and around this locus. A unique VKORC1 haplotype carrying the promoter mutation associated with AVK sensitivity showed especially high frequencies in all the 17 HGDP-CEPH East Asian population samples. VKORC1 and 24 neighboring genes were found to lie in a 505 kb region of strong linkage disequilibrium in these populations. Patterns of allele frequency differentiation and haplotype structure suggest that this genomic region has been submitted to a near complete selective sweep in all East Asian populations and only in this geographic area. The most extreme scores of the different selection tests are found within a smaller 45 kb region that contains VKORC1 and three other genes (BCKDK, MYST1 (KAT8, and PRSS8 with different functions. Because of the strong linkage disequilibrium, it is not possible to determine if VKORC1 or one of the three other genes is the target of this strong positive selection that could explain present-day differences among human populations in AVK dose requirement. Our results show that the extended region surrounding a presumable single target of positive selection should be analyzed for genetic variation in a wide range of genetically diverse populations in order to account for other neighboring and confounding selective events and the hitchhiking effect.

  10. Positive selection in the chromosome 16 VKORC1 genomic region has contributed to the variability of anticoagulant response in humans.

    Science.gov (United States)

    Patillon, Blandine; Luisi, Pierre; Blanché, Hélène; Patin, Etienne; Cann, Howard M; Génin, Emmanuelle; Sabbagh, Audrey

    2012-01-01

    VKORC1 (vitamin K epoxide reductase complex subunit 1, 16p11.2) is the main genetic determinant of human response to oral anticoagulants of antivitamin K type (AVK). This gene was recently suggested to be a putative target of positive selection in East Asian populations. In this study, we genotyped the HGDP-CEPH Panel for six VKORC1 SNPs and downloaded chromosome 16 genotypes from the HGDP-CEPH database in order to characterize the geographic distribution of footprints of positive selection within and around this locus. A unique VKORC1 haplotype carrying the promoter mutation associated with AVK sensitivity showed especially high frequencies in all the 17 HGDP-CEPH East Asian population samples. VKORC1 and 24 neighboring genes were found to lie in a 505 kb region of strong linkage disequilibrium in these populations. Patterns of allele frequency differentiation and haplotype structure suggest that this genomic region has been submitted to a near complete selective sweep in all East Asian populations and only in this geographic area. The most extreme scores of the different selection tests are found within a smaller 45 kb region that contains VKORC1 and three other genes (BCKDK, MYST1 (KAT8), and PRSS8) with different functions. Because of the strong linkage disequilibrium, it is not possible to determine if VKORC1 or one of the three other genes is the target of this strong positive selection that could explain present-day differences among human populations in AVK dose requirement. Our results show that the extended region surrounding a presumable single target of positive selection should be analyzed for genetic variation in a wide range of genetically diverse populations in order to account for other neighboring and confounding selective events and the hitchhiking effect.

  11. Complete genome sequence of mandarin decline Citrus tristeza virus of the Northeastern Himalayan hill region of India: comparative analyses determine recombinant.

    Science.gov (United States)

    Biswas, Kajal K; Tarafdar, Avijit; Sharma, Susheel K

    2012-03-01

    The complete genome sequence of a mandarin (Citrus reticulata) decline CTV isolate, Kpg3, of the Darjeeling hills of the Northeastern Himalayan region of India is reported for the first time. The complete Kpg3 genome has 19253 nt, and its nucleotide sequence identity ranged from 79% with the Florida CTV isolate T36 to 94% with the Israel isolate VT, whereas its identity to B165, the other Indian isolate, was 89%. Phylogenetic analysis indicated that the Kpg3 genome is closely related to isolate VT and distantly to T36 and B165. Recombination analysis indicated that Kpg3 is recombinant and originated through multiple recombination events in which parts of the genome were exchanged between divergent CTV sequences.

  12. Engineered chromosome-based genetic mapping establishes a 3.7 Mb critical genomic region for Down syndrome-associated heart defects in mice.

    Science.gov (United States)

    Liu, Chunhong; Morishima, Masae; Jiang, Xiaoling; Yu, Tao; Meng, Kai; Ray, Debjit; Pao, Annie; Ye, Ping; Parmacek, Michael S; Yu, Y Eugene

    2014-06-01

    Trisomy 21 (Down syndrome, DS) is the most common human genetic anomaly associated with heart defects. Based on evolutionary conservation, DS-associated heart defects have been modeled in mice. By generating and analyzing mouse mutants carrying different genomic rearrangements in human chromosome 21 (Hsa21) syntenic regions, we found the triplication of the Tiam1-Kcnj6 region on mouse chromosome 16 (Mmu16) resulted in DS-related cardiovascular abnormalities. In this study, we developed two tandem duplications spanning the Tiam1-Kcnj6 genomic region on Mmu16 using recombinase-mediated genome engineering, Dp(16)3Yey and Dp(16)4Yey, spanning the 2.1 Mb Tiam1-Il10rb and 3.7 Mb Ifnar1-Kcnj6 regions, respectively. We found that Dp(16)4Yey/+, but not Dp(16)3Yey/+, led to heart defects, suggesting the triplication of the Ifnar1-Kcnj6 region is sufficient to cause DS-associated heart defects. Our transcriptional analysis of Dp(16)4Yey/+ embryos showed that the Hsa21 gene orthologs located within the duplicated interval were expressed at the elevated levels, reflecting the consequences of the gene dosage alterations. Therefore, we have identified a 3.7 Mb genomic region, the smallest critical genomic region, for DS-associated heart defects, and our results should set the stage for the final step to establish the identities of the causal gene(s), whose elevated expression(s) directly underlie this major DS phenotype.

  13. In situ optical sequencing and structure analysis of a trinucleotide repeat genome region by localization microscopy after specific COMBO-FISH nano-probing

    Science.gov (United States)

    Stuhlmüller, M.; Schwarz-Finsterle, J.; Fey, E.; Lux, J.; Bach, M.; Cremer, C.; Hinderhofer, K.; Hausmann, M.; Hildenbrand, G.

    2015-10-01

    Trinucleotide repeat expansions (like (CGG)n) of chromatin in the genome of cell nuclei can cause neurological disorders such as for example the Fragile-X syndrome. Until now the mechanisms are not clearly understood as to how these expansions develop during cell proliferation. Therefore in situ investigations of chromatin structures on the nanoscale are required to better understand supra-molecular mechanisms on the single cell level. By super-resolution localization microscopy (Spectral Position Determination Microscopy; SPDM) in combination with nano-probing using COMBO-FISH (COMBinatorial Oligonucleotide FISH), novel insights into the nano-architecture of the genome will become possible. The native spatial structure of trinucleotide repeat expansion genome regions was analysed and optical sequencing of repetitive units was performed within 3D-conserved nuclei using SPDM after COMBO-FISH. We analysed a (CGG)n-expansion region inside the 5' untranslated region of the FMR1 gene. The number of CGG repeats for a full mutation causing the Fragile-X syndrome was found and also verified by Southern blot. The FMR1 promotor region was similarly condensed like a centromeric region whereas the arrangement of the probes labelling the expansion region seemed to indicate a loop-like nano-structure. These results for the first time demonstrate that in situ chromatin structure measurements on the nanoscale are feasible. Due to further methodological progress it will become possible to estimate the state of trinucleotide repeat mutations in detail and to determine the associated chromatin strand structural changes on the single cell level. In general, the application of the described approach to any genome region will lead to new insights into genome nano-architecture and open new avenues for understanding mechanisms and their relevance in the development of heredity diseases.

  14. A novel mitochondrial genome architecture in thrips (Insecta: Thysanoptera): extreme size asymmetry among chromosomes and possible recent control region duplication

    Science.gov (United States)

    Multi-partite mitochondrial genomes are very rare in animals but have been found previously in two insect orders with highly rearranged genomes, the Phthiraptera (parasitic lice), and the Psocoptera (booklice/barklice). We provide the first report of a multi-partite mitochondrial genome architecture...

  15. Genomic plasticity of the immune-related Mhc class I B region in macaque species

    Directory of Open Access Journals (Sweden)

    Bontrop Ronald E

    2008-10-01

    Full Text Available Abstract Background In sharp contrast to humans and great apes, the expanded Mhc-B region of rhesus and cynomolgus macaques is characterized by the presence of differential numbers and unique combinations of polymorphic class I B genes per haplotype. The MIB microsatellite is closely linked to the single class I B gene in human and in some great apes studied. The physical map of the Mhc of a heterozygous rhesus monkey provides unique material to analyze MIB and Mamu-B copy number variation and then allows one to decipher the compound evolutionary history of this region in primate species. Results In silico research pinpointed 12 MIB copies (duplicons, most of which are associated with expressed B-genes that cluster in a separate clade in the phylogenetic tree. Generic primers tested on homozygous rhesus and pedigreed cynomolgus macaques allowed the identification of eight to eleven MIB copies per individual. The number of MIB copies present per haplotype varies from a minimum of three to six in cynomolgus macaques and from five to eight copies in rhesus macaques. Phylogenetic analyses highlight a strong transpecific sharing of MIB duplicons. Using the physical map, we observed that, similar to MIB duplicons, highly divergent Mamu-B genes can be present on the same haplotype. Haplotype variation as reflected by the copy number variation of class I B loci is best explained by recombination events, which are found to occur between MIBs and Mamu-B. Conclusion The data suggest the existence of highly divergent MIB and Mamu-B lineages on a given haplotype, as well as variable MIB and B copy numbers and configurations, at least in rhesus macaque. Recombination seems to occur between MIB and Mamu-B loci, and the resulting haplotypic plasticity at the individual level may be a strategy to better cope with pathogens. Therefore, evolutionary inferences based on the multiplicated MIB loci but also other markers close to B-genes appear to be promising for

  16. Regional mosaic genomic heterogeneity in the elderly and in Alzheimer's disease as a correlate of neuronal vulnerability.

    Science.gov (United States)

    Arendt, Thomas; Brückner, Martina K; Lösche, Andreas

    2015-10-01

    Alzheimer's disease (AD) is a neurodegenerative disorder characterized by fibrillary aggregates of Aβ peptide and tau protein. The distribution of these pathological hallmarks throughout the brain is not random; it follows a predictive pattern that is used for pathological staging. However, most etiopathogenetic concepts, irrespective of whether they focus on Aβ or tau pathology, leave a key question unanswered: what is the explanation for the different vulnerabilities of brain regions in AD? The pattern of regional progression of neurofibrillary degeneration in AD to some extent inversely recapitulates ontogenetic and phylogenetic brain development. Accordingly, degeneration preferentially affects brain areas that have recently been acquired or restructured during anthropoid evolution, which means that the involvement of a neurodevelopmental mechanism is highly likely. Since evolutionary expansion of the neocortex is based on a substantial extension of the mitotic activity of progenitor cells, we propose a conceptual link between neurogenesis in anthropoid primates and a higher risk of accumulating mitotic errors that give rise to genomic aberrations commonly referred to as DNA content variation (DCV). If increased rates of DCV make neurons more vulnerable to AD-related pathology, one might expect there to be a higher rate of DCV in areas that are affected very early during the course of AD, as compared to areas which are hardly affected or are affected only during the most advanced stages. Therefore, in the present study, we comparatively analyzed the DCV in five different cortical areas that are affected during the early stage (entorhinal cortex), the intermediate stage (temporal, frontal, and parietal association cortex), and the late stage (primary sensory occipital cortex) of AD in both normal elderly subjects and AD patients. On average, we observed about 10 % neuronal mosaic DCV in the normal elderly and a two- to threefold increase in DCV in AD patients

  17. A maximum likelihood QTL analysis reveals common genome regions controlling resistance to Salmonella colonization and carrier-state

    Directory of Open Access Journals (Sweden)

    Thanh-Son Tran

    2012-05-01

    Full Text Available Abstract Background The serovars Enteritidis and Typhimurium of the Gram-negative bacterium Salmonella enterica are significant causes of human food poisoning. Fowl carrying these bacteria often show no clinical disease, with detection only established post-mortem. Increased resistance to the carrier state in commercial poultry could be a way to improve food safety by reducing the spread of these bacteria in poultry flocks. Previous studies identified QTLs for both resistance to carrier state and resistance to Salmonella colonization in the same White Leghorn inbred lines. Until now, none of the QTLs identified was common to the two types of resistance. All these analyses were performed using the F2 inbred or backcross option of the QTLExpress software based on linear regression. In the present study, QTL analysis was achieved using Maximum Likelihood with QTLMap software, in order to test the effect of the QTL analysis method on QTL detection. We analyzed the same phenotypic and genotypic data as those used in previous studies, which were collected on 378 animals genotyped with 480 genome-wide SNP markers. To enrich these data, we added eleven SNP markers located within QTLs controlling resistance to colonization and we looked for potential candidate genes co-localizing with QTLs. Results In our case the QTL analysis method had an important impact on QTL detection. We were able to identify new genomic regions controlling resistance to carrier-state, in particular by testing the existence of two segregating QTLs. But some of the previously identified QTLs were not confirmed. Interestingly, two QTLs were detected on chromosomes 2 and 3, close to the locations of the major QTLs controlling resistance to colonization and to candidate genes involved in the immune response identified in other, independent studies. Conclusions Due to the lack of stability of the QTLs detected, we suggest that interesting regions for further studies are those that were

  18. Sequence analysis of the E3 region and fiber gene of human adenovirus genome type 7h.

    Science.gov (United States)

    Kajon, A E; Wadell, G

    1996-01-15

    Adenovirus type 7h is currently the predominant virulent genome type of serotype 7 isolated in Argentina, Chile, and Uruguay in association with severe infantile pneumonia. In order to characterize possible molecular determinants of pathogenicity, the nucleotide sequence of a 5904-bp fragment (76 to 93 mu) containing the entire E3 region and the fiber gene of Ad7h was established. The organization of the ORFs within the E3 region was similar to that reported for the prototype strains of Ad7 and Ad3. A comparison of the nucleotide and amino acid sequences of all ORFs revealed a higher homology between Ad7h and Ad7p than between Ad7h and Ad3 for 12.0K and 16.1K, whereas the 15.3K ORF and the adjacent fiber gene were strikingly more homologous to those of Ad3 (99.5 vs 81.1% and 98.2 vs 66.6%, respectively). The equivalent to ORF 7.7K in Ad7p was missing in Ad7h due to a deletion and a mutation affecting the start codon (ATG-->ATT). Although the hemagglutinin of the Ad7h fiber could not be characterized due to its lack of activity on monkey erythrocytes, our results indicate that Ad7h is an intermediate strain 7-3.

  19. Robust physical methods that enrich genomic regions identical by descent for linkage studies: confirmation of a locus for osteogenesis imperfecta

    Directory of Open Access Journals (Sweden)

    Cohen Nadine

    2009-03-01

    Full Text Available Abstract Background The monogenic disease osteogenesis imperfecta (OI is due to single mutations in either of the collagen genes ColA1 or ColA2, but within the same family a given mutation is accompanied by a wide range of disease severity. Although this phenotypic variability implies the existence of modifier gene variants, genome wide scanning of DNA from OI patients has not been reported. Promising genome wide marker-independent physical methods for identifying disease-related loci have lacked robustness for widespread applicability. Therefore we sought to improve these methods and demonstrate their performance to identify known and novel loci relevant to OI. Results We have improved methods for enriching regions of identity-by-descent (IBD shared between related, afflicted individuals. The extent of enrichment exceeds 10- to 50-fold for some loci. The efficiency of the new process is shown by confirmation of the identification of the Col1A2 locus in osteogenesis imperfecta patients from Amish families. Moreover the analysis revealed additional candidate linkage loci that may harbour modifier genes for OI; a locus on chromosome 1q includes COX-2, a gene implicated in osteogenesis. Conclusion Technology for physical enrichment of IBD loci is now robust and applicable for finding genes for monogenic diseases and genes for complex diseases. The data support the further investigation of genetic loci other than collagen gene loci to identify genes affecting the clinical expression of osteogenesis imperfecta. The discrimination of IBD mapping will be enhanced when the IBD enrichment procedure is coupled with deep resequencing.

  20. Origin, evolution, and biological role of miRNA cluster in DLK-DIO3 genomic region in placental mammals.

    Science.gov (United States)

    Glazov, Evgeny A; McWilliam, Sean; Barris, Wesley C; Dalrymple, Brian P

    2008-05-01

    MicroRNAs (miRNAs) are a rapidly growing family of small regulatory RNAs modulating gene expression in plants and animals. In animals, most of the miRNAs discovered in early studies were found to be evolutionarily conserved across the whole kingdom. More recent studies, however, have identified many miRNAs that are specific to a particular group of organisms or even a single species. These present a question about evolution of the individual miRNAs and their role in establishing and maintaining lineage-specific functions and characteristics. In this study, we describe a detailed analysis of the miRNA cluster (hereafter mir-379/mir-656 cluster) located within the imprinted DLK-DIO3 region on human chromosome 14. We show that orthologous miRNA clusters are present in all sequenced genomes of the placental (eutherian) mammals but not in the marsupial (metatherian), monotreme (prototherian), or any other vertebrate genomes. We provide evidence that the locus encompassing this cluster emerged in an early eutherian ancestor prior to the radiation of modern placental mammals by tandem duplication of the ancient precursor sequence. The original amplified cluster may have contained in excess of 250 miRNA precursor sequences, most of which now appear to be inactive. Examination of the eutherian genomes showed that the cluster has been maintained in evolution for approximately 100 Myr. Analysis of genes that contain predicted evolutionarily conserved targets for miRNAs from this cluster revealed significant overrepresentation of the Gene Ontology terms associated with biological processes such as neurogenesis, embryonic development, transcriptional regulation, and RNA metabolism. Consistent with these findings, a survey of the miRNA expression data within the cluster demonstrates a strong bias toward brain and placenta samples from adult organisms and some embryonic tissues. Our results suggest that emergence of the mir-379/mir-656 miRNA cluster was one of the factors that

  1. A Legionella pneumophila effector protein encoded in a region of genomic plasticity binds to Dot/Icm-modified vacuoles.

    Directory of Open Access Journals (Sweden)

    Shira Ninio

    2009-01-01

    Full Text Available Legionella pneumophila is an opportunistic pathogen that can cause a severe pneumonia called Legionnaires' disease. In the environment, L. pneumophila is found in fresh water reservoirs in a large spectrum of environmental conditions, where the bacteria are able to replicate within a variety of protozoan hosts. To survive within eukaryotic cells, L. pneumophila require a type IV secretion system, designated Dot/Icm, that delivers bacterial effector proteins into the host cell cytoplasm. In recent years, a number of Dot/Icm substrate proteins have been identified; however, the function of most of these proteins remains unknown, and it is unclear why the bacterium maintains such a large repertoire of effectors to promote its survival. Here we investigate a region of the L. pneumophila chromosome that displays a high degree of plasticity among four sequenced L. pneumophila strains. Analysis of GC content suggests that several genes encoded in this region were acquired through horizontal gene transfer. Protein translocation studies establish that this region of genomic plasticity encodes for multiple Dot/Icm effectors. Ectopic expression studies in mammalian cells indicate that one of these substrates, a protein called PieA, has unique effector activities. PieA is an effector that can alter lysosome morphology and associates specifically with vacuoles that support L. pneumophila replication. It was determined that the association of PieA with vacuoles containing L. pneumophila requires modifications to the vacuole mediated by other Dot/Icm effectors. Thus, the localization properties of PieA reveal that the Dot/Icm system has the ability to spatially and temporally control the association of an effector with vacuoles containing L. pneumophila through activities mediated by other effector proteins.

  2. Systematic analysis of enhancer and critical cis-acting RNA elements in the protein-encoding region of the hepatitis C virus genome.

    Science.gov (United States)

    Chu, Derrick; Ren, Songyang; Hu, Stacy; Wang, Wei Gang; Subramanian, Aparna; Contreras, Deisy; Kanagavel, Vidhya; Chung, Eric; Ko, Justine; Amirtham Jacob Appadorai, Ranjit Singh; Sinha, Sanjeev; Jalali, Ziba; Hardy, David W; French, Samuel W; Arumugaswami, Vaithilingaraja

    2013-05-01

    Hepatitis C virus (HCV) causes chronic hepatitis, cirrhosis, and liver cancer. cis-acting RNA elements of the HCV genome are critical for translation initiation and replication of the viral genome. We hypothesized that the coding regions of nonstructural proteins harbor enhancer and essential cis-acting replication elements (CRE). In order to experimentally identify new cis RNA elements, we utilized an unbiased approach to introduce synonymous substitutions. The HCV genome coding for nonstructural proteins (nucleotide positions 3872 to 9097) was divided into 17 contiguous segments. The wobble nucleotide positions of each codon were replaced, resulting in 33% to 41% nucleotide changes. The HCV genome containing one of each of 17 mutant segments (S1 to S17) was tested for genome replication and infectivity. We observed that silent mutations in segment 13 (S13) (nucleotides [nt] 7457 to 7786), S14 (nt 7787 to 8113), S15 (nt 8114 to 8440), S16 (nt 8441 to 8767), and S17 (nt 8768 to 9097) resulted in impaired genome replication, suggesting CRE structures are enriched in the NS5B region. Subsequent high-resolution mutational analysis of NS5B (nt 7787 to 9289) using approximately 51-nucleotide contiguous subsegment mutant viruses having synonymous mutations revealed that subsegments SS8195-8245, SS8654-8704, and SS9011-9061 were required for efficient viral growth, suggesting that these regions act as enhancer elements. Covariant nucleotide substitution analysis of a stem-loop, JFH-SL9098, revealed the formation of an extended stem structure, which we designated JFH-SL9074. We have identified new enhancer RNA elements and an extended stem-loop in the NS5B coding region. Genetic modification of enhancer RNA elements can be utilized for designing attenuated HCV vaccine candidates.

  3. ChIP-seq Analysis in R (CSAR): An R package for the statistical detection of protein-bound genomic regions

    NARCIS (Netherlands)

    Muino, J.M.; Kaufmann, K.; Ham, van R.C.H.J.; Angenent, G.C.; Krajewski, P.

    2011-01-01

    Background In vivo detection of protein-bound genomic regions can be achieved by combining chromatin-immunoprecipitation with next-generation sequencing technology (ChIP-seq). The large amount of sequence data produced by this method needs to be analyzed in a statistically proper and computationally

  4. Engineered chromosome-based genetic mapping establishes a 3.7-Mb critical genomic region for Down syndrome-associated heart defects in mice

    OpenAIRE

    Liu, Chunhong; Morishima, Masae; Jiang,Xiaoling; Yu, Tao; Meng, Kai; Ray, Debjit; Pao, Annie; Ye, Ping; Parmacek, Michael S.; Yu, Y. Eugene

    2013-01-01

    Trisomy 21 (Down syndrome, DS) is the most common human genetic anomaly associated with heart defects. Based on evolutionary conservation, DS-associated heart defects have been modeled in mice. By generating and analyzing mouse mutants carrying different genomic rearrangements in human chromosome 21 (Hsa21) syntenic regions, we found the triplication of the Tiam1-Kcnj6 region on mouse chromosome 16 (Mmu16) resulted in DS-related cardiovascular abnormalities. In this study, we developed two ta...

  5. Remnants of the Legume Ancestral Genome Preserved in Gene-Rich Regions: Insights from Lupinus angustifolius Physical, Genetic, and Comparative Mapping.

    Science.gov (United States)

    Książkiewicz, Michał; Zielezinski, Andrzej; Wyrwa, Katarzyna; Szczepaniak, Anna; Rychel, Sandra; Karlowski, Wojciech; Wolko, Bogdan; Naganowska, Barbara

    The narrow-leafed lupin (Lupinus angustifolius) was recently considered as a legume reference species. Genetic resources have been developed, including a draft genome sequence, linkage maps, nuclear DNA libraries, and cytogenetic chromosome-specific landmarks. Here, we used a complex approach, involving DNA fingerprinting, sequencing, genetic mapping, and molecular cytogenetics, to localize and analyze L. angustifolius gene-rich regions (GRRs). A L. angustifolius genomic bacterial artificial chromosome (BAC) library was screened with short sequence repeat (SSR)-based probes. Selected BACs were fingerprinted and assembled into contigs. BAC-end sequence (BES) annotation allowed us to choose clones for sequencing, targeting GRRs. Additionally, BESs were aligned to the scaffolds of the genome sequence. The genetic map was supplemented with 35 BES-derived markers, distributed in 14 linkage groups and tagging 37 scaffolds. The identified GRRs had an average gene density of 19.6 genes/100 kb and physical-to-genetic distance ratios of 11 to 109 kb/cM. Physical and genetic mapping was supported by multi-BAC-fluorescence in situ hybridization (FISH), and five new linkage groups were assigned to the chromosomes. Syntenic links to the genome sequences of five legume species (Medicago truncatula, Glycine max, Lotus japonicus, Phaseolus vulgaris, and Cajanus cajan) were identified. The comparative mapping of the two largest lupin GRRs provides novel evidence for ancient duplications in all of the studied species. These regions are conserved among representatives of the main clades of Papilionoideae. Furthermore, despite the complex evolution of legumes, some segments of the nuclear genome were not substantially modified and retained their quasi-ancestral structures. Cytogenetic markers anchored in these regions constitute a platform for heterologous mapping of legume genomes.

  6. Structural organization of poliovirus RNA replication is mediated by viral proteins of the P2 genomic region

    Energy Technology Data Exchange (ETDEWEB)

    Bienz, K.; Egger, D.; Troxler, M.; Pasamontes, L. (Univ. of Basel (Switzerland))

    1990-03-01

    Transcriptionally active replication complexes bound to smooth membrane vesicles were isolated from poliovirus-infected cells. In electron microscopic, negatively stained preparations, the replication complex appeared as an irregularly shaped, oblong structure attached to several virus-induced vesicles of a rosettelike arrangement. Electron microscopic immunocytochemistry of such preparations demonstrated that the poliovirus replication complex contains the proteins coded by the P2 genomic region (P2 proteins) in a membrane-associated form. In addition, the P2 proteins are also associated with viral RNA, and they can be cross-linked to viral RNA by UV irradiation. Guanidine hydrochloride prevented the P2 proteins from becoming membrane bound but did not change their association with viral RNA. The findings allow the conclusion that the protein 2C or 2C-containing precursor(s) is responsible for the attachment of the viral RNA to the vesicular membrane and for the spatial organization of the replication complex necessary for its proper functioning in viral transcription. A model for the structure of the viral replication complex and for the function of the 2C-containing P2 protein(s) and the vesicular membranes is proposed.

  7. Scanning for genes in large genomic regions: cosmid-based exon trapping of multiple exons in a single product.

    Science.gov (United States)

    Datson, N A; van de Vosse, E; Dauwerse, H G; Bout, M; van Ommen, G J; den Dunnen, J T

    1996-03-15

    To facilitate the scanning of large genomic regions for the presence of exonic gene segments we have constructed a cosmid-based exon trap vector. The vector serves a dual purpose since it is also suitable for contig construction and physical mapping. The exon trap cassette of vector sCOGH1 consists of the human growth hormone gene driven by the mouse mettallothionein-1 promoter. Inserts are cloned in the multicloning site located in intron 2 of the hGH gene. The efficiency of the system is demonstrated with cosmids containing multiple exons of the Duchenne Muscular Dystrophy gene. All exons present in the inserts were successfully retrieved and no cryptic products were detected. Up to seven exons were isolated simultaneously in a single spliced product. The system has been extended by a transcription-translation-test protocol to determine the presence of large open reading frames in the trapped products, using a combination of tailed PCR primers directing protein synthesis in three different reading frames, followed by in vitro transcription-translation. Having larger stretches of coding sequence in a single exon trap product rather than small single exons greatly facilitates further analysis of potential genes and offers new possibilities for direct mutation analysis of exon trap material.

  8. Comparative genomics of marine cyanomyoviruses reveals the widespread occurrence of Synechococcus host genes localized to a hyperplastic region: implications for mechanisms of cyanophage evolution.

    Science.gov (United States)

    Millard, Andrew D; Zwirglmaier, Katrin; Downey, Mike J; Mann, Nicholas H; Scanlan, Dave J

    2009-09-01

    The vast majority of cyanophages isolated to date are cyanomyoviruses, a group related to bacteriophage T4. Comparative genome analysis of five cyanomyoviruses, including a newly sequenced cyanophage S-RSM4, revealed a 'core genome' of 64 genes, the majority of which are also found in other T4-like phages. Subsequent comparative genomic hybridization analysis using a pilot microarray showed that a number of 'host' genes are widespread in cyanomyovirus isolates. Furthermore, a hyperplastic region was identified between genes g15-g18, within a highly conserved structural gene module, which contained a variable number of inserted genes that lacked conservation in gene order. Several of these inserted genes were host-like and included ptoX, gnd, zwf and petE encoding plastoquinol terminal oxidase, 6-phosphogluconate dehydrogenase, glucose 6-phosphate dehydrogenase and plastocyanin respectively. Phylogenetic analyses suggest that these genes were acquired independently of each other, even though they have become localized within the same genomic region. This hyperplastic region contains no detectable sequence features that might be mechanistically involved with the acquisition of host-like genes, but does appear to be a site specifically associated with the acquisition process and may represent a novel facet of the evolution of marine cyanomyoviruses.

  9. Frequent Loss of Genome Gap Region in 4p16.3 Subtelomere in Early-Onset Type 2 Diabetes Mellitus

    Directory of Open Access Journals (Sweden)

    Hirohito Kudo

    2011-01-01

    Full Text Available A small portion of Type 2 diabetes mellitus (T2DM is familial, but the majority occurs as sporadic disease. Although causative genes are found in some rare forms, the genetic basis for sporadic T2DM is largely unknown. We searched for a copy number abnormality in 100 early-onset Japanese T2DM patients (onset age <35 years by whole-genome screening with a copy number variation BeadChip. Within the 1.3-Mb subtelomeric region on chromosome 4p16.3, we found copy number losses in early-onset T2DM (13 of 100 T2DM versus one of 100 controls. This region surrounds a genome gap, which is rich in multiple low copy repeats. Subsequent region-targeted high-density custom-made oligonucleotide microarray experiments verified the copy number losses and delineated structural changes in the 1.3-Mb region. The results suggested that copy number losses of the genes in the deleted region around the genome gap in 4p16.3 may play significant roles in the etiology of T2DM.

  10. Comparative analysis of the base compositions of the pre-mRNA 3' cleaved-off region and the mRNA 3' untranslated region relative to the genomic base composition in animals and plants.

    Science.gov (United States)

    Li, Xiu-Qing

    2014-01-01

    The precursor messenger RNA (pre-mRNA) three-prime cleaved-off region (3'COR) and the mRNA three-prime untranslated region (3'UTR) play critical roles in regulating gene expression. The differences in base composition between these regions and the corresponding genomes are still largely uncharacterized in animals and plants. In this study, the base compositions of non-redundant 3'CORs and 3'UTRs were compared with the corresponding whole genomes of eleven animals, four dicotyledonous plants, and three monocotyledonous (cereal) plants. Among the four bases (A, C, G, and U for adenine, cytosine, guanine, and uracil, respectively), U (which corresponds to T, for thymine, in DNA) was the most frequent, A the second most frequent, G the third most frequent, and C the least frequent in most of the species in both the 3'COR and 3'UTR regions. In comparison with the whole genomes, in both regions the U content was usually the most overrepresented (particularly in the monocotyledonous plants), and the C content was the most underrepresented. The order obtained for the species groups, when ranked from high to low according to the U contents in the 3'COR and 3'UTR was as follows: dicotyledonous plants, monocotyledonous plants, non-mammal animals, and mammals. In contrast, the genomic T content was highest in dicotyledonous plants, lowest in monocotyledonous plants, and intermediate in animals. These results suggest the following: 1) there is a mechanism operating in both animals and plants which is biased toward U and against C in the 3'COR and 3'UTR; 2) the 3'UTR and 3'COR, as functional units, minimized the difference between dicotyledonous and monocotyledonous plants, while the dicotyledonous and monocotyledonous genomes evolved into two extreme groups in terms of base composition.

  11. Genomic structure analysis of SNC6, a progesterone-receptor associated protein gene, and cloning and characterization of its 5'-flanking region . 

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    Objective: To analyze the genomic structure of SNC6, a progesterone-receptor associated protein gene and its regulatory elements in its 5'-flanking region. Methods: Genomic sequence from GenBank database (accession number: Z98048) covering the whole SNC6 gene was used to analyze the genomic structure of SNC6 and design primers for PCR amplification of its 5'-flanking region. A 1894 bp fragment of the 5'-flanking region (-1814 to +75) was cloned by PCR using genomic DNA from a healthy donor peripheral blood lymphocyte as template. This fragment, as well as 3 shorter derivative fragments (1423 bp, 632 bp and 416 bp, which correspond to -1344 to +75, -552 to +75 and -337 to +75 respectively), were subcloned into pGL2 series luciferase reporter vectors. These constructs were introduced into colorectal cancer cell line SW620 for transient expression of reporter gene and luciferase activities were measured. Results: The genomic structure analysis showed there are 12 exons for SNC6 gene, which spans 32017 bp (nt71529 to nt39513 in Z98048 sequence). All transfected SW620 cells with the above 5-flanking region-containing constructs showed luciferase activities. The highest luciferase activities were measured in transfected cells with vectors containing 1894 bp fragments, and the lowest luciferase activities were measured in transfected cells with vectors containing 416 bp fragments. Luciferase activities were higher in transfected cells with vectors containing 632 bp fragments than that in transfected cells with vectors containing 1423 bp fragments. Conclusion: The basic transcription-promoting element (promoter) for SNC6 expression resides between 0 to -337, and two transcription-enhancing elements (enhancer) resides between -337 to -552 and -1344 to -1814, whereas one transcription-inhibiting element (silencer) exists between -552 to -1344.

  12. Conserved cis-regulatory regions in a large genomic landscape control SHH and BMP-regulated Gremlin1 expression in mouse limb buds

    Directory of Open Access Journals (Sweden)

    Zuniga Aimée

    2012-08-01

    Full Text Available Abstract Background Mouse limb bud is a prime model to study the regulatory interactions that control vertebrate organogenesis. Major aspects of limb bud development are controlled by feedback loops that define a self-regulatory signalling system. The SHH/GREM1/AER-FGF feedback loop forms the core of this signalling system that operates between the posterior mesenchymal organiser and the ectodermal signalling centre. The BMP antagonist Gremlin1 (GREM1 is a critical node in this system, whose dynamic expression is controlled by BMP, SHH, and FGF signalling and key to normal progression of limb bud development. Previous analysis identified a distant cis-regulatory landscape within the neighbouring Formin1 (Fmn1 locus that is required for Grem1 expression, reminiscent of the genomic landscapes controlling HoxD and Shh expression in limb buds. Results Three highly conserved regions (HMCO1-3 were identified within the previously defined critical genomic region and tested for their ability to regulate Grem1 expression in mouse limb buds. Using a combination of BAC and conventional transgenic approaches, a 9 kb region located ~70 kb downstream of the Grem1 transcription unit was identified. This region, termed Grem1 Regulatory Sequence 1 (GRS1, is able to recapitulate major aspects of Grem1 expression, as it drives expression of a LacZ reporter into the posterior and, to a lesser extent, in the distal-anterior mesenchyme. Crossing the GRS1 transgene into embryos with alterations in the SHH and BMP pathways established that GRS1 depends on SHH and is modulated by BMP signalling, i.e. integrates inputs from these pathways. Chromatin immunoprecipitation revealed interaction of endogenous GLI3 proteins with the core cis-regulatory elements in the GRS1 region. As GLI3 is a mediator of SHH signal transduction, these results indicated that SHH directly controls Grem1 expression through the GRS1 region. Finally, all cis-regulatory regions within the Grem1

  13. Sequencing of 15,622 gene-bearing BACs clarifies the gene-dense regions of the barley genome

    Science.gov (United States)

    Barley (Hordeum vulgare L.) possesses a large and highly repetitive genome of 5.1 Gb that has hindered the development of a complete sequence. In 2012, the International Barley Sequencing Consortium released a resource integrating whole-genome shotgun sequences with a physical and genetic framework....

  14. The role of genomic oxidative-reductive balance as predictor of complex regional pain syndrome development: a novel theory.

    Science.gov (United States)

    Smith, Howard S

    2010-01-01

    The aftermath of sequencing the human genome has birthed many efforts to utilize an individual's genetic information in order to tailor optimal treatment strategies - so-called personalized medicine. An individual's genetic information may eventually help diagnosis and treatment, as well selecting optimal pharmacologic agents based partly on how well they reach their target, how well they will bind to and produce an effect at their targets, how well they will be metabolized, and the profile of their adverse effects. It also appears that clinicians may be able to utilize an individual's genetic information to ascertain a subject's risk or susceptibility of developing a particular medical condition. Although, this has not been widely utilized in pain medicine at this point, the future may revolutionize the role of genetic information in the evaluation and management of various pain conditions. One reason for variations in therapeutic outcomes from different pharmacologic pain treatments is the different genetic disposition of patient to develop pain or to respond to analgesics. The patient's phenotype may represent a conglomerate of several different genetic variants concomitantly present in an individual. Genetic variants may modulate the risk of developing a painful condition, or may modulate the perception of pain (e.g. OPRM1 or GCH1 variants conferring modest "protection" from pain by increasing the tone of the endogenous opioid system or decreasing nitric oxide formation). Other genetic polymorphisms may alter pharmacokinetic mechanisms (e.g. CYP2D6 related prodrug activation of codeine to morphine), alter pharmacodynamic mechanisms (e.g. opioid receptor mutations), or alter other analgesic effects (e.g. diminished euphoric effects from opioids potentially due to DRD2 polymorphisms decreasing the functioning of the dopaminergic reward system). This article theorizes that genetic alterations including functional polymorphisms of Nrf2 (a master regulator of the

  15. Combined Analysis of Variation in Core, Accessory and Regulatory Genome Regions Provides a Super-Resolution View into the Evolution of Bacterial Populations

    Science.gov (United States)

    McNally, Alan; Oren, Yaara; Kelly, Darren; Sreecharan, Tristan; Vehkala, Minna; Välimäki, Niko; Prentice, Michael B.; Ashour, Amgad; Avram, Oren; Pupko, Tal; Literak, Ivan; Guenther, Sebastian; Schaufler, Katharina; Wieler, Lothar H.; Zhiyong, Zong; Sheppard, Samuel K.; Corander, Jukka

    2016-01-01

    The use of whole-genome phylogenetic analysis has revolutionized our understanding of the evolution and spread of many important bacterial pathogens due to the high resolution view it provides. However, the majority of such analyses do not consider the potential role of accessory genes when inferring evolutionary trajectories. Moreover, the recently discovered importance of the switching of gene regulatory elements suggests that an exhaustive analysis, combining information from core and accessory genes with regulatory elements could provide unparalleled detail of the evolution of a bacterial population. Here we demonstrate this principle by applying it to a worldwide multi-host sample of the important pathogenic E. coli lineage ST131. Our approach reveals the existence of multiple circulating subtypes of the major drug–resistant clade of ST131 and provides the first ever population level evidence of core genome substitutions in gene regulatory regions associated with the acquisition and maintenance of different accessory genome elements. PMID:27618184

  16. The evolutionary rates of HCV estimated with subtype 1a and 1b sequences over the ORF length and in different genomic regions.

    Directory of Open Access Journals (Sweden)

    Manqiong Yuan

    Full Text Available BACKGROUND: Considerable progress has been made in the HCV evolutionary analysis, since the software BEAST was released. However, prior information, especially the prior evolutionary rate, which plays a critical role in BEAST analysis, is always difficult to ascertain due to various uncertainties. Providing a proper prior HCV evolutionary rate is thus of great importance. METHODS/RESULTS: 176 full-length sequences of HCV subtype 1a and 144 of 1b were assembled by taking into consideration the balance of the sampling dates and the even dispersion in phylogenetic trees. According to the HCV genomic organization and biological functions, each dataset was partitioned into nine genomic regions and two routinely amplified regions. A uniform prior rate was applied to the BEAST analysis for each region and also the entire ORF. All the obtained posterior rates for 1a are of a magnitude of 10(-3 substitutions/site/year and in a bell-shaped distribution. Significantly lower rates were estimated for 1b and some of the rate distribution curves resulted in a one-sided truncation, particularly under the exponential model. This indicates that some of the rates for subtype 1b are less accurate, so they were adjusted by including more sequences to improve the temporal structure. CONCLUSION: Among the various HCV subtypes and genomic regions, the evolutionary patterns are dissimilar. Therefore, an applied estimation of the HCV epidemic history requires the proper selection of the rate priors, which should match the actual dataset so that they can fit for the subtype, the genomic region and even the length. By referencing the findings here, future evolutionary analysis of the HCV subtype 1a and 1b datasets may become more accurate and hence prove useful for tracing their patterns.

  17. Plant Genome Duplication Database.

    Science.gov (United States)

    Lee, Tae-Ho; Kim, Junah; Robertson, Jon S; Paterson, Andrew H

    2017-01-01

    Genome duplication, widespread in flowering plants, is a driving force in evolution. Genome alignments between/within genomes facilitate identification of homologous regions and individual genes to investigate evolutionary consequences of genome duplication. PGDD (the Plant Genome Duplication Database), a public web service database, provides intra- or interplant genome alignment information. At present, PGDD contains information for 47 plants whose genome sequences have been released. Here, we describe methods for identification and estimation of dates of genome duplication and speciation by functions of PGDD.The database is freely available at http://chibba.agtec.uga.edu/duplication/.

  18. The dark matter of the cancer genome: aberrations in regulatory elements, untranslated regions, splice sites, non-coding RNA and synonymous mutations.

    Science.gov (United States)

    Diederichs, Sven; Bartsch, Lorenz; Berkmann, Julia C; Fröse, Karin; Heitmann, Jana; Hoppe, Caroline; Iggena, Deetje; Jazmati, Danny; Karschnia, Philipp; Linsenmeier, Miriam; Maulhardt, Thomas; Möhrmann, Lino; Morstein, Johannes; Paffenholz, Stella V; Röpenack, Paula; Rückert, Timo; Sandig, Ludger; Schell, Maximilian; Steinmann, Anna; Voss, Gjendine; Wasmuth, Jacqueline; Weinberger, Maria E; Wullenkord, Ramona

    2016-01-01

    Cancer is a disease of the genome caused by oncogene activation and tumor suppressor gene inhibition. Deep sequencing studies including large consortia such as TCGA and ICGC identified numerous tumor-specific mutations not only in protein-coding sequences but also in non-coding sequences. Although 98% of the genome is not translated into proteins, most studies have neglected the information hidden in this "dark matter" of the genome. Malignancy-driving mutations can occur in all genetic elements outside the coding region, namely in enhancer, silencer, insulator, and promoter as well as in 5'-UTR and 3'-UTR Intron or splice site mutations can alter the splicing pattern. Moreover, cancer genomes contain mutations within non-coding RNA, such as microRNA, lncRNA, and lincRNA A synonymous mutation changes the coding region in the DNA and RNA but not the protein sequence. Importantly, oncogenes such as TERT or miR-21 as well as tumor suppressor genes such as TP53/p53, APC, BRCA1, or RB1 can be affected by these alterations. In summary, coding-independent mutations can affect gene regulation from transcription, splicing, mRNA stability to translation, and hence, this largely neglected area needs functional studies to elucidate the mechanisms underlying tumorigenesis. This review will focus on the important role and novel mechanisms of these non-coding or allegedly silent mutations in tumorigenesis.

  19. A Novel Phytophthora sojae Resistance Rps12 Gene Mapped to a Genomic Region That Contains Several Rps Genes

    Science.gov (United States)

    Sahoo, Dipak K.; Abeysekara, Nilwala S.; Cianzio, Silvia R.; Robertson, Alison E.

    2017-01-01

    Phytophthora sojae Kaufmann and Gerdemann, which causes Phytophthora root rot, is a widespread pathogen that limits soybean production worldwide. Development of Phytophthora resistant cultivars carrying Phytophthora resistance Rps genes is a cost-effective approach in controlling this disease. For this mapping study of a novel Rps gene, 290 recombinant inbred lines (RILs) (F7 families) were developed by crossing the P. sojae resistant cultivar PI399036 with the P. sojae susceptible AR2 line, and were phenotyped for responses to a mixture of three P. sojae isolates that overcome most of the known Rps genes. Of these 290 RILs, 130 were homozygous resistant, 12 heterzygous and segregating for Phytophthora resistance, and 148 were recessive homozygous and susceptible. From this population, 59 RILs homozygous for Phytophthora sojae resistance and 61 susceptible to a mixture of P. sojae isolates R17 and Val12-11 or P7074 that overcome resistance encoded by known Rps genes mapped to Chromosome 18 were selected for mapping novel Rps gene. A single gene accounted for the 1:1 segregation of resistance and susceptibility among the RILs. The gene encoding the Phytophthora resistance mapped to a 5.8 cM interval between the SSR markers BARCSOYSSR_18_1840 and Sat_064 located in the lower arm of Chromosome 18. The gene is mapped 2.2 cM proximal to the NBSRps4/6-like sequence that was reported to co-segregate with the Phytophthora resistance genes Rps4 and Rps6. The gene is mapped to a highly recombinogenic, gene-rich genomic region carrying several nucleotide binding site-leucine rich repeat (NBS-LRR)-like genes. We named this novel gene as Rps12, which is expected to be an invaluable resource in breeding soybeans for Phytophthora resistance. PMID:28081566

  20. Recombination and evolution of duplicate control regions in the mitochondrial genome of the Asian big-headed turtle, Platysternon megacephalum.

    Directory of Open Access Journals (Sweden)

    Chenfei Zheng

    Full Text Available Complete mitochondrial (mt genome sequences with duplicate control regions (CRs have been detected in various animal species. In Testudines, duplicate mtCRs have been reported in the mtDNA of the Asian big-headed turtle, Platysternon megacephalum, which has three living subspecies. However, the evolutionary pattern of these CRs remains unclear. In this study, we report the completed sequences of duplicate CRs from 20 individuals belonging to three subspecies of this turtle and discuss the micro-evolutionary analysis of the evolution of duplicate CRs. Genetic distances calculated with MEGA 4.1 using the complete duplicate CR sequences revealed that within turtle subspecies, genetic distances between orthologous copies from different individuals were 0.63% for CR1 and 1.2% for CR2app:addword:respectively, and the average distance between paralogous copies of CR1 and CR2 was 4.8%. Phylogenetic relationships were reconstructed from the CR sequences, excluding the variable number of tandem repeats (VNTRs at the 3' end using three methods: neighbor-joining, maximum likelihood algorithm, and Bayesian inference. These data show that any two CRs within individuals were more genetically distant from orthologous genes in different individuals within the same subspecies. This suggests independent evolution of the two mtCRs within each P. megacephalum subspecies. Reconstruction of separate phylogenetic trees using different CR components (TAS, CD, CSB, and VNTRs suggested the role of recombination in the evolution of duplicate CRs. Consequently, recombination events were detected using RDP software with break points at ≈290 bp and ≈1,080 bp. Based on these results, we hypothesize that duplicate CRs in P. megacephalum originated from heterological ancestral recombination of mtDNA. Subsequent recombination could have resulted in homogenization during independent evolutionary events, thus maintaining the functions of duplicate CRs in the mtDNA of P

  1. Evaluation of a partial genome screening of two asthma susceptibility regions using bayesian network based bayesian multilevel analysis of relevance.

    Directory of Open Access Journals (Sweden)

    Ildikó Ungvári

    Full Text Available Genetic studies indicate high number of potential factors related to asthma. Based on earlier linkage analyses we selected the 11q13 and 14q22 asthma susceptibility regions, for which we designed a partial genome screening study using 145 SNPs in 1201 individuals (436 asthmatic children and 765 controls. The results were evaluated with traditional frequentist methods and we applied a new statistical method, called bayesian network based bayesian multilevel analysis of relevance (BN-BMLA. This method uses bayesian network representation to provide detailed characterization of the relevance of factors, such as joint significance, the type of dependency, and multi-target aspects. We estimated posteriors for these relations within the bayesian statistical framework, in order to estimate the posteriors whether a variable is directly relevant or its association is only mediated.With frequentist methods one SNP (rs3751464 in the FRMD6 gene provided evidence for an association with asthma (OR = 1.43(1.2-1.8; p = 3×10(-4. The possible role of the FRMD6 gene in asthma was also confirmed in an animal model and human asthmatics.In the BN-BMLA analysis altogether 5 SNPs in 4 genes were found relevant in connection with asthma phenotype: PRPF19 on chromosome 11, and FRMD6, PTGER2 and PTGDR on chromosome 14. In a subsequent step a partial dataset containing rhinitis and further clinical parameters was used, which allowed the analysis of relevance of SNPs for asthma and multiple targets. These analyses suggested that SNPs in the AHNAK and MS4A2 genes were indirectly associated with asthma. This paper indicates that BN-BMLA explores the relevant factors more comprehensively than traditional statistical methods and extends the scope of strong relevance based methods to include partial relevance, global characterization of relevance and multi-target relevance.

  2. [Isolation and characteristics of DNA fragments for the region of the tissue plasminogen activator genes and areas adjacent to it in the human genome].

    Science.gov (United States)

    Sarafanov, A G; Timofeeva, M Ia; Aleshkov, S B; Kupriianova, N S; Bannikov, V M; Zakhar'ev, V M; Baev, A A

    1994-01-01

    Fragments overlapping the tPA gene and its 5'- and 3'-flanking regions were isolated from human liver DNA library cloned in lambda Charon4A vector. A BglII fragment comprising the 3' end and the adjacent genomic region (total length 3.7 kb) was subcloned in plasmid pUC19 and its restriction map was determined. The nucleotide sequence of the 5' region of this fragment was compared with the 3' end region of the tPA gene and the corresponding regions of five published variants of tPA mRNA cDNA from different tissues; discrepancies in seven positions were revealed, which might be caused by intragenomic polymorphism.

  3. A microsatellite linkage map for the cultivated strawberry (Fragaria × ananassa) suggests extensive regions of homozygosity in the genome that may have resulted from breeding and selection.

    Science.gov (United States)

    Sargent, D J; Passey, T; Surbanovski, N; Lopez Girona, E; Kuchta, P; Davik, J; Harrison, R; Passey, A; Whitehouse, A B; Simpson, D W

    2012-05-01

    The linkage maps of the cultivated strawberry, Fragaria × ananassa (2n = 8x = 56) that have been reported to date have been developed predominantly from AFLPs, along with supplementation with transferrable microsatellite (SSR) markers. For the investigation of the inheritance of morphological characters in the cultivated strawberry and for the development of tools for marker-assisted breeding and selection, it is desirable to populate maps of the genome with an abundance of transferrable molecular markers such as microsatellites (SSRs) and gene-specific markers. Exploiting the recent release of the genome sequence of the diploid F. vesca, and the publication of an extensive number of polymorphic SSR markers for the genus Fragaria, we have extended the linkage map of the 'Redgauntlet' × 'Hapil' (RG × H) mapping population to include a further 330 loci, generated from 160 primer pairs, to create a linkage map for F. × ananassa containing 549 loci, 490 of which are transferrable SSR or gene-specific markers. The map covers 2140.3 cM in the expected 28 linkage groups for an integrated map (where one group is composed of two separate male and female maps), which represents an estimated 91% of the cultivated strawberry genome. Despite the relative saturation of the linkage map on the majority of linkage groups, regions of apparent extensive homozygosity were identified in the genomes of 'Redgauntlet' and 'Hapil' which may be indicative of allele fixation during the breeding and selection of modern F. × ananassa cultivars. The genomes of the octoploid and diploid Fragaria are largely collinear, but through comparison of mapped markers on the RG × H linkage map to their positions on the genome sequence of F. vesca, a number of inversions were identified that may have occurred before the polyploidisation event that led to the evolution of the modern octoploid strawberry species.

  4. Genome-wide DNA methylation analysis of neuroblastic tumors reveals clinically relevant epigenetic events and large-scale epigenomic alterations localized to telomeric regions.

    Science.gov (United States)

    Buckley, Patrick G; Das, Sudipto; Bryan, Kenneth; Watters, Karen M; Alcock, Leah; Koster, Jan; Versteeg, Rogier; Stallings, Raymond L

    2011-05-15

    The downregulation of specific genes through DNA hypermethylation is a major hallmark of cancer, although the extent and genomic distribution of hypermethylation occurring within cancer genomes is poorly understood. We report on the first genome-wide analysis of DNA methylation alterations in different neuroblastic tumor subtypes and cell lines, revealing higher order organization and clinically relevant alterations of the epigenome. The methylation status of 33,485 discrete loci representing all annotated CpG islands and RefSeq gene promoters was assessed in primary neuroblastic tumors and cell lines. A comparison of genes that were hypermethylated exclusively in the clinically favorable ganglioneuroma/ganglioneuroblastoma tumors revealed that nine genes were associated with poor clinical outcome when overexpressed in the unfavorable neuroblastoma (NB) tumors. Moreover, an integrated DNA methylation and copy number analysis identified 80 genes that were recurrently concomitantly deleted and hypermethylated in NB, with 37 reactivated by 5-aza-deoxycytidine. Lower expression of four of these genes was correlated with poor clinical outcome, further implicating their inactivation in aggressive disease pathogenesis. Analysis of genome-wide hypermethylation patterns revealed 70 recurrent large-scale blocks of contiguously hypermethylated promoters/CpG islands, up to 590 kb in length, with a distribution bias toward telomeric regions. Genome-wide hypermethylation events in neuroblastic tumors are extensive and frequently occur in large-scale blocks with a significant bias toward telomeric regions, indicating that some methylation alterations have occurred in a coordinated manner. Our results indicate that methylation contributes toward the clinicopathological features of neuroblastic tumors, revealing numerous genes associated with poor patient survival in NB.

  5. Frequent loss of genome gap region in 4p16.3 subtelomere in early-onset type 2 diabetes mellitus.

    Science.gov (United States)

    Kudo, Hirohito; Emi, Mitsuru; Ishigaki, Yasushi; Tsunoda, Uiko; Hinokio, Yoshinori; Ishii, Miho; Sato, Hidenori; Yamada, Tetsuya; Katagiri, Hideki; Oka, Yoshitomo

    2011-01-01

    A small portion of Type 2 diabetes mellitus (T2DM) is familial, but the majority occurs as sporadic disease. Although causative genes are found in some rare forms, the genetic basis for sporadic T2DM is largely unknown. We searched for a copy number abnormality in 100 early-onset Japanese T2DM patients (onset age experiments verified the copy number losses and delineated structural changes in the 1.3-Mb region. The results suggested that copy number losses of the genes in the deleted region around the genome gap in 4p16.3 may play significant roles in the etiology of T2DM.

  6. MAF45, a highly polymorphic marker for the pseudoautosomal region of the sheep genome, is not linked to the FecXI (Inverdale) gene.

    Science.gov (United States)

    Swarbrick, P A; Schmack, A E; Crawford, A M

    1992-07-01

    A highly polymorphic dinucleotide repeat, or microsatellite, that shows partial sex-linked inheritance in sheep has been isolated from the sheep genome. Our data indicate that the locus is in the pseudoautosomal region approximately 13 cm from the boundary with the sex-linked regions. The locus, designated MAF45, has 12 alleles with a PIC of 0.84. The same primers amplify a single polymorphic locus in cattle and goats. This locus was not linked to the Inverdale gene, an X-linked gene that increases the ovulation rate in sheep.

  7. Sequence characterization of hypervariable regions in the soybean genome: leucine-rich repeats and simple sequence repeats

    Directory of Open Access Journals (Sweden)

    Everaldo G. de Barros

    2000-06-01

    Full Text Available The genetic basis of cultivated soybean is rather narrow. This observation has been confirmed by analysis of agronomic traits among different genotypes, and more recently by the use of molecular markers. During the construction of an RFLP soybean map (Glycine soja x Glycine max the two progenitors were analyzed with over 2,000 probes, of which 25% were polymorphic. Among the probes that revealed polymorphisms, a small proportion, about 0.5%, hybridized to regions that were highly polymorphic. Here we report the sequencing and analysis of five of these probes. Three of the five contain segments that encode leucine-rich repeat (LRR sequence homologous to known disease resistance genes in plants. Two other probes are relatively AT-rich and contain segments of (An/(Tn. DNA segments corresponding to one of the probes (A45-10 were amplified from nine soybean genotypes. Partial sequencing of these amplicons suggests that deletions and/or insertions are responsible for the extensive polymorphism observed. We propose that genes encoding LRR proteins and simple sequence repeat region prone to slippage are some of the most hypervariable regions of the soybean genome.A base genética da soja cultivada é relativamente estreita. Essa observação foi confirmada por análises de características agronômicas entre diferentes genótipos e, mais recentemente, pelo uso de marcadores moleculares. Durante a construção de um mapa de RFLP da soja (Glycine soja x Glycine max, os dois progenitores foram analisados com mais de 2000 sondas, das quais 25% eram polimórficas. Entre as sondas que revelaram polimorfismos, uma pequena proporção, cerca de 0,5%, hibridizou com regiões que eram altamente polimórficas. Neste trabalho, são apresentados o seqüenciamento e análise de cinco dessas sondas. Três dessas sondas contêm segmentos que codificam repetições ricas em leucina que são homólogas a genes de resistência a doenças já conhecidos em plantas. As duas

  8. Identification of novel SNPs by next-generation sequencing of the genomic region containing the APC gene in colorectal cancer patients in China.

    Science.gov (United States)

    Cheng, Yin; Wang, Jun; Shao, Jiaofang; Chen, Qiyun; Mo, Fan; Ma, Liang; Han, Xu; Zhang, Jing; Chen, Chen; Zhang, Cixiong; Lin, Shuyong; Yu, Jiekai; Zheng, Shu; Lin, Sheng-Cai; Lin, Biaoyang

    2010-06-01

    We described an approach of identifying single nucleotide polymorphisms (SNPs) in complete genomic regions of key genes including promoters, exons, introns, and downstream sequences by combining long-range polymerase chain reaction (PCR) or NimbleGen sequence capture with next-generation sequencing. Using the adenomatous polyposis coli (APC) gene as an example, we identified 210 highly reliable SNPs by next-generation sequencing analysis program MAQ and Samtools, of which 69 were novel ones, in the 123-kb APC genomic region in 27 pair of colorectal cancers and normal adjacent tissues. We confirmed all of the eight randomly selected high-quality SNPs by allele-specific PCR, suggesting that our false discovery rate is negligible. We identified 11 SNPs in the exonic region, including one novel SNP that was not previously reported. Although 10 of them are synonymous, they were predicted to affect splicing by creating or removing exonic splicing enhancers or exonic splicing silencers. We also identified seven SNPs in the upstream region of the APC gene, three of which were only identified in the cancer tissues. Six of these upstream SNPs were predicted to affect transcription factor binding. We also observed that long-range PCR was better in capturing GC-rich regions than the NimbleGen sequence capture technique.

  9. Genome-wide association study of ulcerative colitis identifies three new susceptibility loci, including the HNF4A region

    OpenAIRE

    Barrett, Jeffrey C.; Lee, James C.; Lees, Charles W.; Prescott, Natalie J.; Anderson, Carl A.; Phillips, Anne; Wesley, Emma; Parnell, Kirstie; Zhang, Hu; DRUMMOND, HAZEL; Elaine R Nimmo; Massey, Dunecan; Blaszczyk, Kasia; Elliott, Timothy; Cotterill, Lynn

    2009-01-01

    Ulcerative colitis is a common form of inflammatory bowel disease with a complex etiology. As part of the Wellcome Trust Case Control Consortium 2, we performed a genome-wide association scan for ulcerative colitis in 2,361 cases and 5,417 controls. Loci showing evidence of association at P < 1 x 10(-5) were followed up by genotyping in an independent set of 2,321 cases and 4,818 controls. We find genome-wide significant evidence of association at three new loci, each containing at least o...

  10. Comparative Genomics of H. pylori and Non-Pylori Helicobacter Species to Identify New Regions Associated with Its Pathogenicity and Adaptability

    Directory of Open Access Journals (Sweden)

    De-Min Cao

    2016-01-01

    Full Text Available The genus Helicobacter is a group of Gram-negative, helical-shaped pathogens consisting of at least 36 bacterial species. Helicobacter pylori (H. pylori, infecting more than 50% of the human population, is considered as the major cause of gastritis, peptic ulcer, and gastric cancer. However, the genetic underpinnings of H. pylori that are responsible for its large scale epidemic and gastrointestinal environment adaption within human beings remain unclear. Core-pan genome analysis was performed among 75 representative H. pylori and 24 non-pylori Helicobacter genomes. There were 1173 conserved protein families of H. pylori and 673 of all 99 Helicobacter genus strains. We found 79 genome unique regions, a total of 202,359bp, shared by at least 80% of the H. pylori but lacked in non-pylori Helicobacter species. The operons, genes, and sRNAs within the H. pylori unique regions were considered as potential ones associated with its pathogenicity and adaptability, and the relativity among them has been partially confirmed by functional annotation analysis. However, functions of at least 54 genes and 10 sRNAs were still unclear. Our analysis of protein-protein interaction showed that 30 genes within them may have the cooperation relationship.

  11. Genomic insights into hybridization in a localized region of sympatr y between pewee sister species (Contopus sordidulus × C. virens) and their chromosomal patterns of differentiation

    Institute of Scientific and Technical Information of China (English)

    Joseph D Manthey; and Mark B Robbins

    2016-01-01

    Background: The Great Plains of the United States includes a large number of hybrid and contact zones between bird species. The amount of gene lfow between sister species in these zones ranges from very rare hybridization events to widespread and prevalent introgression. Some of these avian systems have been studied extensively, while others have been indeterminate of whether hybridization exists in areas of sympatry. Using genomic-level approaches allows investigation of genomic patterns of hybridization and gene lfow between species—or lack thereof. Methods: We investigated a narrow zone of sympatry in Nebraska, USA between pewee species (Contopus sordidu-lus and C. virens), for which no hybridization has been conifrmed. We used thousands of single nucleotide polymor-phisms to identify potential hybridization and investigate genomic patterns of differentiation between these two species. Results: We found evidence of multiple hybrid individuals in the contact zone. Little genomic variation was ifxed between species, but a large proportion had differentiated allele frequencies between species. There was a positive relationship between genetic differentiation and chromosome size. Conclusions: We provided the ifrst conclusive evidence of hybridization between C. sordidulus and C. virens, in a region where secondary contact likely occurred due to human disturbance and habitat modiifcation. The genomic patterns of differentiation affrm that these species split in the relatively recent past. Finally, the relationship of chro-mosome size and genetic differentiation may have resulted from differential rates of chromosomal recombination in songbirds and genetic differentiation between species largely due to genetic drift (possibly in concert with selection).

  12. An easy PCR-based genome-walking method for getting the unknown 5’ flanking region of a Scenedesmus sp

    Institute of Scientific and Technical Information of China (English)

    Ahmed Elsayed Gomma; Jin Man Kim; Seung HwanYang; Gyuhwa Chung

    2015-01-01

    Objective: To develop the current single primer PCR-based genome-walking method with Scenedesmus sp. Methods: The unknown 5’ and/or 3’ flanking regions for a specific conserved sequence were optimized and the current single primer PCR-based genome-walking method were developed. Alignment was between the related species of microalga and Scenedesmus sp. For 18S rDNA, we selected the species Scenedesmus sp., Chlorella sp., and Chlamydomonas sp. For the rbcL gene from the chloroplast genome, alignment was done between Scenedesmus sp., and Chlamydomonas sp. Results: Obtaining a small conserved sequence for any gene family is something that can be achieved quite easily. However, identifying the whole gene is often difficult. After investigating and testing, some of the current protocols using to get the unknown 5’ and/or 3’ flanking regions for a specific conserved sequence, we developed the current single primer PCR-based genome-walking method. We performed two consecutive PCR reactions; band extraction and the PCR product were sequenced. We got our results by testing the method on three genes from the total DNA of Scenedesmus sp.; two genes had a fully known sequence in gene bank (18S rDNA and rbcL), but the third one has not yet been identified (rbcS). We designed our primers based on the alignment between the related species and to each other. We also tested two different DNA polymerases Ex Taq and TLA polymerase. Conclusions: Results from our study suggest that Ex Taq is the most suitable polymerase for the current protocol.

  13. The complete mitochondrial genome of the Senegal sole, Solea senegalensis Kaup. Comparative analysis of tandem repeats in the control region among soles.

    Science.gov (United States)

    Manchado, Manuel; Catanese, Gaetano; Ponce, Marian; Funes, Victoria; Infante, Carlos

    2007-06-01

    The complete nucleotide sequence of the mitochondrial genome for the Senegal sole Solea senegalensis Kaup was determined. The mitochondrial DNA was 16,659 base pairs (bp) in length. Sequence features of the 13 protein-coding genes, two ribosomal RNAs and 22 transfer RNAs are described. The non-coding control region (1017 bp) was compared with those of the closely related soles Solea solea and Solea lascaris. The typical conservative blocks were identified. A cluster of 42 and 22 tandemly arrayed repeats was detected near the 3' end of control region in S. solea and S. lascaris, respectively. On the contrary, only two (93.8% of haplotypes) or three copies (6.2%) of an 8-bp repeated sequence motif was found in S. senegalensis. Phylogenetic analysis showed that 7 out of 9 of haplotypes bearing three copies grouped in a separate cluster. Possible mechanisms influencing the evolution of control region among soles are discussed.

  14. Analysis of gene order data supports vertical inheritance of the leukotoxin operon and genome rearrangements in the 5' flanking region in genus Mannheimia

    DEFF Research Database (Denmark)

    Larsen, Jesper; Kuhnert, Peter; Frey, Joachim;

    2007-01-01

    , the supposed sister group, lives as a commensal in the ovine rumen. We have tested the hypothesis that vertical inheritance of the leukotoxin (lktCABD) operon has occurred from the last common ancestor of genus Mannheimia to any ancestor of the diverging subclades by exploring gene order data. RESULTS: We...... examined the gene order in the 5' flanking region of the leukotoxin operon and found that the 5' flanking gene strings, hslVU-lapB-artJ-lktC and xylAB-lktC, are peculiar to M. haemolytica + M. glucosida and M. granulomatis, respectively, whereas the gene string hslVU-lapB-lktC is present in M. ruminalis...... subclades, thus reaffirming the hypothesis of vertical inheritance of the leukotoxin operon. The presence of individual 5' flanking regions in M. haemolytica + M. glucosida and M. granulomatis reflects later genome rearrangements within each subclade. The evolution of the novel 5' flanking region in M...

  15. The complete mitochondrial genome of the common sea slater, Ligia oceanica (Crustacea, Isopoda bears a novel gene order and unusual control region features

    Directory of Open Access Journals (Sweden)

    Podsiadlowski Lars

    2006-09-01

    Full Text Available Abstract Background Sequence data and other characters from mitochondrial genomes (gene translocations, secondary structure of RNA molecules are useful in phylogenetic studies among metazoan animals from population to phylum level. Moreover, the comparison of complete mitochondrial sequences gives valuable information about the evolution of small genomes, e.g. about different mechanisms of gene translocation, gene duplication and gene loss, or concerning nucleotide frequency biases. The Peracarida (gammarids, isopods, etc. comprise about 21,000 species of crustaceans, living in many environments from deep sea floor to arid terrestrial habitats. Ligia oceanica is a terrestrial isopod living at rocky seashores of the european North Sea and Atlantic coastlines. Results The study reveals the first complete mitochondrial DNA sequence from a peracarid crustacean. The mitochondrial genome of Ligia oceanica is a circular double-stranded DNA molecule, with a size of 15,289 bp. It shows several changes in mitochondrial gene order compared to other crustacean species. An overview about mitochondrial gene order of all crustacean taxa yet sequenced is also presented. The largest non-coding part (the putative mitochondrial control region of the mitochondrial genome of Ligia oceanica is unexpectedly not AT-rich compared to the remainder of the genome. It bears two repeat regions (4× 10 bp and 3× 64 bp, and a GC-rich hairpin-like secondary structure. Some of the transfer RNAs show secondary structures which derive from the usual cloverleaf pattern. While some tRNA genes are putative targets for RNA editing, trnR could not be localized at all. Conclusion Gene order is not conserved among Peracarida, not even among isopods. The two isopod species Ligia oceanica and Idotea baltica show a similarly derived gene order, compared to the arthropod ground pattern and to the amphipod Parhyale hawaiiensis, suggesting that most of the translocation events were already

  16. Developmental roles of 21 Drosophila transcription factors are determined by quantitative differences in binding to an overlapping set of thousands of genomic regions

    Energy Technology Data Exchange (ETDEWEB)

    MacArthur, Stewart; Li, Xiao-Yong; Li, Jingyi; Brown, James B.; Chu, Hou Cheng; Zeng, Lucy; Grondona, Brandi P.; Hechmer, Aaron; Simirenko, Lisa; Keranen, Soile V.E.; Knowles, David W.; Stapleton, Mark; Bickel, Peter; Biggin, Mark D.; Eisen, Michael B.

    2009-05-15

    BACKGROUND: We previously established that six sequence-specific transcription factors that initiate anterior/posterior patterning in Drosophila bind to overlapping sets of thousands of genomic regions in blastoderm embryos. While regions bound at high levels include known and probable functional targets, more poorly bound regions are preferentially associated with housekeeping genes and/or genes not transcribed in the blastoderm, and are frequently found in protein coding sequences or in less conserved non-coding DNA, suggesting that many are likely non-functional. RESULTS: Here we show that an additional 15 transcription factors that regulate other aspects of embryo patterning show a similar quantitative continuum of function and binding to thousands of genomic regions in vivo. Collectively, the 21 regulators show a surprisingly high overlap in the regions they bind given that they belong to 11 DNA binding domain families, specify distinct developmental fates, and can act via different cis-regulatory modules. We demonstrate, however, that quantitative differences in relative levels of binding to shared targets correlate with the known biological and transcriptional regulatory specificities of these factors. CONCLUSIONS: It is likely that the overlap in binding of biochemically and functionally unrelated transcription factors arises from the high concentrations of these proteins in nuclei, which, coupled with their broad DNA binding specificities, directs them to regions of open chromatin. We suggest that most animal transcription factors will be found to show a similar broad overlapping pattern of binding in vivo, with specificity achieved by modulating the amount, rather than the identity, of bound factor.

  17. MutMap-Gap: whole-genome resequencing of mutant F2 progeny bulk combined with de novo assembly of gap regions identifies the rice blast resistance gene Pii.

    Science.gov (United States)

    Takagi, Hiroki; Uemura, Aiko; Yaegashi, Hiroki; Tamiru, Muluneh; Abe, Akira; Mitsuoka, Chikako; Utsushi, Hiroe; Natsume, Satoshi; Kanzaki, Hiroyuki; Matsumura, Hideo; Saitoh, Hiromasa; Yoshida, Kentaro; Cano, Liliana M; Kamoun, Sophien; Terauchi, Ryohei

    2013-10-01

    Next-generation sequencing allows the identification of mutations responsible for mutant phenotypes by whole-genome resequencing and alignment to a reference genome. However, when the resequenced cultivar/line displays significant structural variation from the reference genome, mutations in the genome regions missing from the reference (gaps) cannot be identified by simple alignment. Here we report on a method called 'MutMap-Gap', which involves delineating a candidate region harboring a mutation of interest using the recently reported MutMap method, followed by de novo assembly, alignment, and identification of the mutation within genome gaps. We applied MutMap-Gap to isolate the blast resistant gene Pii from the rice cv Hitomebore using mutant lines that have lost Pii function. MutMap-Gap should prove useful for cloning genes that exhibit significant structural variations such as disease resistance genes of the nucleotide-binding site-leucine rich repeat (NBS-LRR) class.

  18. The complete mitochondrial genome of the mantid shrimp Oratosquilla oratoria (Crustacea: Malacostraca: Stomatopoda): Novel non-coding regions features and phylogenetic implications of the Stomatopoda.

    Science.gov (United States)

    Liu, Yuan; Cui, Zhaoxia

    2010-09-01

    The complete mitochondrial (mt) genome sequence of Oratosquilla oratoria (Crustacea: Malacostraca: Stomatopoda) was determined; a circular molecule of 15,783 bp in length. The gene content and arrangement are consistent with the pancrustacean ground pattern. The mt control region of O. oratoria is characterized by no GA-block near the 3' end and different position of [TA(A)]n-blocks compared with other reported Stomatopoda species. The sequence of the second hairpin structure is relative conserved which suggests this region may be a synapomorphic character for the Stomatopoda. In addition, a relative large intergenic spacer (101 bp) with higher A+T content than that in control region was identified between the tRNA(Glu) and tRNA(Phe) genes. Phylogenetic analyses based on the current dataset of complete mt genomes strongly support the Stomatopoda is closely related to Euphausiacea. They in turn cluster with Penaeoidea and Caridea clades while other decapods form a separate group, which rejects the monophyly of Decapoda. This challenges the suitability of Stomatopoda as an outgroup of Decapoda in phylogenetic analyses. The basal position of Stomatopoda within Eumalacostraca according to the morphological characters is also questioned.

  19. Intrinsically disordered region of influenza A NP regulates viral genome packaging via interactions with viral RNA and host PI(4,5)P2.

    Science.gov (United States)

    Kakisaka, Michinori; Yamada, Kazunori; Yamaji-Hasegawa, Akiko; Kobayashi, Toshihide; Aida, Yoko

    2016-09-01

    To be incorporated into progeny virions, the viral genome must be transported to the inner leaflet of the plasma membrane (PM) and accumulate there. Some viruses utilize lipid components to assemble at the PM. For example, simian virus 40 (SV40) targets the ganglioside GM1 and human immunodeficiency virus type 1 (HIV-1) utilizes phosphatidylinositol (4,5) bisphosphate [PI(4,5)P2]. Recent studies clearly indicate that Rab11-mediated recycling endosomes are required for influenza A virus (IAV) trafficking of vRNPs to the PM but it remains unclear how IAV vRNP localized or accumulate underneath the PM for viral genome incorporation into progeny virions. In this study, we found that the second intrinsically disordered region (IDR2) of NP regulates two binding steps involved in viral genome packaging. First, IDR2 facilitates NP oligomer binding to viral RNA to form vRNP. Secondly, vRNP assemble by interacting with PI(4,5)P2 at the PM via IDR2. These findings suggest that PI(4,5)P2 functions as the determinant of vRNP accumulation at the PM.

  20. Z-DNA-forming sites identified by ChIP-Seq are associated with actively transcribed regions in the human genome

    Science.gov (United States)

    Shin, So-I.; Ham, Seokjin; Park, Jihwan; Seo, Seong Hye; Lim, Chae Hyun; Jeon, Hyeongrin; Huh, Jounghyun; Roh, Tae-Young

    2016-01-01

    Z-DNA, a left-handed double helical DNA is structurally different from the most abundant B-DNA. Z-DNA has been known to play a significant role in transcription and genome stability but the biological meaning and positions of Z-DNA-forming sites (ZFSs) in the human genome has not been fully explored. To obtain genome-wide map of ZFSs, Zaa with two Z-DNA-binding domains was used for ChIP-Seq analysis. A total of 391 ZFSs were found and their functions were examined in vivo. A large portion of ZFSs was enriched in the promoter regions and contain sequences with high potential to form Z-DNA. Genes containing ZFSs were occupied by RNA polymerase II at the promoters and showed high levels of expression. Moreover, ZFSs were significantly related to active histone marks such as H3K4me3 and H3K9ac. The association of Z-DNA with active transcription was confirmed by the reporter assay system. Overall, our results suggest that Z-DNA formation depends on chromatin structure as well as sequence composition, and is associated with active transcription in human cells. The global information about ZFSs positioning will provide a useful resource for further understanding of DNA structure-dependent transcriptional regulation. PMID:27374614

  1. Application of semi-nested polymerase chain reaction targeting internal transcribed spacer region for rapid detection of panfungal genome directly from ocular specimens

    Directory of Open Access Journals (Sweden)

    Bagyalakshmi R

    2007-01-01

    Full Text Available Background: The incidence of fungal endophthalmitis has dramatically increased in recent years and rapid detection of fungi using nucleic acid-based amplification techniques is helpful in management. Aim: To evaluate semi-nested polymerase chain reaction (PCR targeting internal transcribed spacer (ITS region for detection of panfungal genome in ocular specimens. Statistical analysis used: Z test for two proportion. Materials and Methods: Standardization of PCR targeting ITS primers was carried out by determining analytical sensitivity and specificity. The sensitivity and specificity of PCR was determined by serial tenfold dilutions of C. albicans (ATCC 24433 DNA and DNA extracts of laboratory isolates of Aspergillus fumigatus , Fusarium lichenicola (4, other fungal and closely related bacterial strains and also human DNA. Semi-nested PCR was applied onto a total of 168 ocular specimens with clinically suspected fungal etiology during 2003-2005. Results and Conclusions: PCR was specific and sensitive to detect 1fg of fungal DNA with ITS primers. PCR detected fungal genome in 90 (53.57% in comparison with the conventional technique, positive in 34 (20.23% by smear examination and in 42 (25% by culture. The increase in clinical sensitivity by 28.57% using PCR was found to be statistically significant { P < 0.001 using Z test for two proportion}. The accuracy of the test was found to be 70.85%. PCR proved to be a rapid diagnostic technique for detection of panfungal genome directly from clinical specimens

  2. Mini-genome rescue of Crimean-Congo hemorrhagic fever virus and research into the evolutionary patterns of its untranslated regions.

    Science.gov (United States)

    Zhao, Jiuru; Xia, Han; Zhang, Yujiang; Yin, Shiyu; Zhang, Zhong; Tang, Shuang; Kou, Zheng; Yu, Jingfeng; Fan, Zhaojun; Li, Tianxian

    2013-10-01

    Crimean-Congo hemorrhagic fever virus (CCHFV) is a member of genus Nairovirus, family Bunyaviridae, which are distributed widely in Africa, Europe and Asia with several genotypes. As a BSL-4 level pathogen, the requirement of high-level biosafety facilities severely constrains researches on live virus manipulation. In this study, we developed a helper-virus-independent mini-genome rescue system for the Chinese YL04057 strain. Based on the enhanced green fluorescent protein (EGFP)-derived mini-genome plasmids, this polymerase I driven system permits easy observation and quantification. Unlike previous report, gradually reduced levels of activity of the CCHFV L, M and S untranslated regions (UTRs) were observed in our system. We also demonstrated that the UTRs at both ends were indispensable for mini-genome background expression. In addition, we phylogentically analyzed all six UTRs of CCHFV and showed that L-UTRs were clustered together approximately corresponding to their original geographical continents. The UTRs of M segment showed a similar branch structure to its open reading frames (ORFs), and nearly an identical tree was generated with 5' UTRs of S segment compared with its ORFs. However, the 3' UTRs of S segment formed new divergent groups. Compatibility tests of YL04057 strain nucleocapsid protein and L protein expression plasmids with Nigerian strain IbAr10200 mini-genomes revealed lower compatibility of L-UTRs without an obvious effect on M-UTRs. Moreover, we demonstrated that the L-UTRs could tolerate certain nucleotide mutations. This system may provide a foundation for future studies of the viral replication cycle, pathogenic mechanisms and evolutionary patterns of CCHFV.

  3. The Campylobacter jejuni Oxidative Stress Regulator RrpB Is Associated with a Genomic Hypervariable Region and Altered Oxidative Stress Resistance

    Science.gov (United States)

    Gundogdu, Ozan; da Silva, Daiani T.; Mohammad, Banaz; Elmi, Abdi; Wren, Brendan W.; van Vliet, Arnoud H. M.; Dorrell, Nick

    2016-01-01

    Campylobacter jejuni is the leading cause of bacterial foodborne diarrhoeal disease worldwide. Despite the microaerophilic nature of the bacterium, C. jejuni can survive the atmospheric oxygen conditions in the environment. Bacteria that can survive either within a host or in the environment like C. jejuni require variable responses to survive the stresses associated with exposure to different levels of reactive oxygen species. The MarR-type transcriptional regulators RrpA and RrpB have recently been shown to play a role in controlling both the C. jejuni oxidative and aerobic stress responses. Analysis of 3,746 C. jejuni and 486 C. coli genome sequences showed that whilst rrpA is present in over 99% of C. jejuni strains, the presence of rrpB is restricted and appears to correlate with specific MLST clonal complexes (predominantly ST-21 and ST-61). C. coli strains in contrast lack both rrpA and rrpB. In C. jejuni rrpB+ strains, the rrpB gene is located within a variable genomic region containing the IF subtype of the type I Restriction-Modification (hsd) system, whilst this variable genomic region in C. jejuni rrpB- strains contains the IAB subtype hsd system and not the rrpB gene. C. jejuni rrpB- strains exhibit greater resistance to peroxide and aerobic stress than C. jejuni rrpB+ strains. Inactivation of rrpA resulted in increased sensitivity to peroxide stress in rrpB+ strains, but not in rrpB- strains. Mutation of rrpA resulted in reduced killing of Galleria mellonella larvae and enhanced biofilm formation independent of rrpB status. The oxidative and aerobic stress responses of rrpB- and rrpB+ strains suggest adaptation of C. jejuni within different hosts and niches that can be linked to specific MLST clonal complexes. PMID:28082970

  4. Sequence determination of cDNA clones of transcripts from the tumor-associated region of the Marek's disease virus genome.

    Science.gov (United States)

    Iwata, A; Ueda, S; Ishihama, A; Hirai, K

    1992-04-01

    The number of 132-bp tandem direct repeats within the long inverted repeat region of the Marek's disease virus type 1 (MDV1) genome increases concomitantly with the loss of oncogenicity during serial passages in cultured cells. Twelve clones carrying the 132-bp sequence were isolated from a cDNA library constructed from chicken embryo fibroblasts infected with the MDV1 Md5 strain. Through sequence analysis of a cDNA clone and primer extension analysis, the corresponding mRNA was found to be a linear transcript which included the two 132-bp tandem direct repeats. Two open reading frames were found in this transcript. One had a week homology with v-fms. The other should increase its size concomitantly with expansion of the 132-bp tandem direct repeat. PCR analysis of both cDNA clones and RNA gave amplified products which were as large as that produced from the genomic clone, indicating that a majority of mRNA from this region is composed of unspliced transcripts.

  5. The influence of landscape configuration and environment on population genetic structure in a sedentary passerine: insights from loci located in different genomic regions.

    Science.gov (United States)

    Ferrer, E S; García-Navas, V; Bueno-Enciso, J; Barrientos, R; Serrano-Davies, E; Cáliz-Campal, C; Sanz, J J; Ortego, J

    2016-01-01

    The study of the factors structuring genetic variation can help to infer the neutral and adaptive processes shaping the demographic and evolutionary trajectories of natural populations. Here, we analyse the role of isolation by distance (IBD), isolation by resistance (IBR, defined by landscape composition) and isolation by environment (IBE, estimated as habitat and elevation dissimilarity) in structuring genetic variation in 25 blue tit (Cyanistes caeruleus) populations. We typed 1385 individuals at 26 microsatellite loci classified into two groups by considering whether they are located into genomic regions that are actively (TL; 12 loci) or not (NTL; 14 loci) transcribed to RNA. Population genetic differentiation was mostly detected using the panel of NTL. Landscape genetic analyses showed a pattern of IBD for all loci and the panel of NTL, but genetic differentiation estimated at TL was only explained by IBR models considering high resistance for natural vegetation and low resistance for agricultural lands. Finally, the absence for IBE suggests a lack of divergent selection pressures associated with differences in habitat and elevation. Overall, our study shows that markers located in different genomic regions can yield contrasting inferences on landscape-level patterns of realized gene flow in natural populations.

  6. Organization, structure and evolution of 41kb of genomic DNA spanning the D-J-C region of the sheep TRB locus.

    Science.gov (United States)

    Antonacci, R; Di Tommaso, S; Lanave, C; Cribiu, E P; Ciccarese, S; Massari, S

    2008-01-01

    A genomic region of 41,045 bp encompassing the 3'-end of the sheep T cell receptor beta chain was sequenced. Extensive molecular analysis has revealed that this region retains a unique structural feature for the presence of a third D-J-C cluster, never detected in any other mammalian species examined so far. A total of 3 TRBD, 18 TRBJ and 3 substantially identical TRBC genes were identified in about 28kb. At 13kb, downstream from the last TRBC gene, in an inverted transcriptional orientation, lies a TRBV gene. Sequence comparison and phylogenetic analyses have demonstrated that the extra D-J-C cluster originated from an unequal crossing over between the two ancestral TRBC genes. Interspersed repeats spanning 22.2% of the sequence, contribute to the wider size of the sheep TRB locus with respect to the other mammalian counterparts, without modifying the general genomic architecture. The nucleotide and predicted amino acid sequences from peripheral T cells cDNA clones indicated that the genes from cluster 3 are fully implicated in the beta chain recombination machinery. Closer inspections of the transcripts have also shown that inter-cluster rearrangements and splice variants, involving the additional cluster, increase the functional diversity of the sheep beta chain repertoire.

  7. High abundance of Serine/Threonine-rich regions predicted to be hyper-O-glycosylated in the secretory proteins coded by eight fungal genomes

    Directory of Open Access Journals (Sweden)

    González Mario

    2012-09-01

    Full Text Available Abstract Background O-glycosylation of secretory proteins has been found to be an important factor in fungal biology and virulence. It consists in the addition of short glycosidic chains to Ser or Thr residues in the protein backbone via O-glycosidic bonds. Secretory proteins in fungi frequently display Ser/Thr rich regions that could be sites of extensive O-glycosylation. We have analyzed in silico the complete sets of putatively secretory proteins coded by eight fungal genomes (Botrytis cinerea, Magnaporthe grisea, Sclerotinia sclerotiorum, Ustilago maydis, Aspergillus nidulans, Neurospora crassa, Trichoderma reesei, and Saccharomyces cerevisiae in search of Ser/Thr-rich regions as well as regions predicted to be highly O-glycosylated by NetOGlyc (http://www.cbs.dtu.dk. Results By comparison with experimental data, NetOGlyc was found to overestimate the number of O-glycosylation sites in fungi by a factor of 1.5, but to be quite reliable in the prediction of highly O-glycosylated regions. About half of secretory proteins have at least one Ser/Thr-rich region, with a Ser/Thr content of at least 40% over an average length of 40 amino acids. Most secretory proteins in filamentous fungi were predicted to be O-glycosylated, sometimes in dozens or even hundreds of sites. Residues predicted to be O-glycosylated have a tendency to be grouped together forming hyper-O-glycosylated regions of varying length. Conclusions About one fourth of secretory fungal proteins were predicted to have at least one hyper-O-glycosylated region, which consists of 45 amino acids on average and displays at least one O-glycosylated Ser or Thr every four residues. These putative highly O-glycosylated regions can be found anywhere along the proteins but have a slight tendency to be at either one of the two ends.

  8. An N-terminal region of Lassa virus L protein plays a critical role in transcription but not replication of the virus genome.

    Science.gov (United States)

    Lelke, Michaela; Brunotte, Linda; Busch, Carola; Günther, Stephan

    2010-02-01

    The central domain of the 200-kDa Lassa virus L protein is a putative RNA-dependent RNA polymerase. N- and C-terminal domains may harbor enzymatic functions important for viral mRNA synthesis, including capping enzymes or cap-snatching endoribonucleases. In the present study, we have employed a large-scale mutagenesis approach to map functionally relevant residues in these regions. The main targets were acidic (Asp and Glu) and basic residues (Lys and Arg) known to form catalytic and binding sites of capping enzymes and endoribonucleases. A total of 149 different mutants were generated and tested in the Lassa virus replicon system. Nearly 25% of evolutionarily highly conserved acidic and basic side chains were dispensable for function of L protein in the replicon context. The vast majority of the remaining mutants had defects in both transcription and replication. Seven residues (Asp-89, Glu-102, Asp-119, Lys-122, Asp-129, Glu-180, and Arg-185) were selectively important for mRNA synthesis. The phenotype was particularly pronounced for Asp-89, Glu-102, and Asp-129, which were indispensable for transcription but could be replaced by a variety of amino acid residues without affecting genome replication. Bioinformatics disclosed the remote similarity of this region to type IIs endonucleases. The mutagenesis was complemented by experiments with the RNA polymerase II inhibitor alpha-amanitin, demonstrating dependence of viral transcription from the cellular mRNA pool. In conclusion, this paper describes an N-terminal region in L protein being important for mRNA, but not genome synthesis. Bioinformatics and cell biological experiments lend support to the hypothesis that this region could be part of a cap-snatching enzyme.

  9. Genomic rearrangements and functional diversification of lecA and lecB lectin-coding regions impacting the efficacy of glycomimetics directed against Pseudomonas aeruginosa

    Directory of Open Access Journals (Sweden)

    Amine M Boukerb

    2016-05-01

    Full Text Available LecA and LecB tetrameric lectins take part in oligosaccharide-mediated adhesion-processes of Pseudomonas aeruginosa. Glycomimetics have been designed to block these interactions. The great versatility of P. aeruginosa suggests that the range of application of these glycomimetics could be restricted to genotypes with particular lectin types. The likelihood of having genomic and genetic changes impacting LecA and LecB interactions with glycomimetics such as galactosylated and fucosylated calix[4]arene was investigated over a collection of strains from the main clades of P. aeruginosa. Lectin types were defined, and their ligand specificities were inferred. These analyses showed a loss of lecA among the PA7 clade. Genomic changes impacting lec loci were thus assessed using strains of this clade, and by making comparisons with the PAO1 genome. The lecA regions were found challenged by phage attacks and PAGI-2 (genomic island integrations. A prophage was linked to the loss of lecA. The lecB regions were found less impacted by such rearrangements but greater lecB than lecA genetic divergences were recorded. Sixteen combinations of LecA and LecB types were observed. Amino acid variations were mapped on PAO1 crystal structures. Most significant changes were observed on LecBPA7, and found close to the fucose binding site. Glycan array analyses were performed with purified LecBPA7. LecBPA7 was found less specific for fucosylated oligosaccharides than LecBPAO1, with a preference for H type 2 rather than type 1, and Lewisa rather than Lewisx. Comparison of the crystal structures of LecBPA7 and LecBPAO1 in complex with Lewisa showed these changes in specificity to have resulted from a modification of the water network between the lectin, galactose and GlcNAc residues. Incidence of these modifications on the interactions with calix[4]arene glycomimetics at the cell level was investigated. An aggregation test was used to establish the efficacy of these ligands

  10. Genomic Rearrangements and Functional Diversification of lecA and lecB Lectin-Coding Regions Impacting the Efficacy of Glycomimetics Directed against Pseudomonas aeruginosa

    Science.gov (United States)

    Boukerb, Amine M.; Decor, Aude; Ribun, Sébastien; Tabaroni, Rachel; Rousset, Audric; Commin, Loris; Buff, Samuel; Doléans-Jordheim, Anne; Vidal, Sébastien; Varrot, Annabelle; Imberty, Anne; Cournoyer, Benoit

    2016-01-01

    LecA and LecB tetrameric lectins take part in oligosaccharide-mediated adhesion-processes of Pseudomonas aeruginosa. Glycomimetics have been designed to block these interactions. The great versatility of P. aeruginosa suggests that the range of application of these glycomimetics could be restricted to genotypes with particular lectin types. The likelihood of having genomic and genetic changes impacting LecA and LecB interactions with glycomimetics such as galactosylated and fucosylated calix[4]arene was investigated over a collection of strains from the main clades of P. aeruginosa. Lectin types were defined, and their ligand specificities were inferred. These analyses showed a loss of lecA among the PA7 clade. Genomic changes impacting lec loci were thus assessed using strains of this clade, and by making comparisons with the PAO1 genome. The lecA regions were found challenged by phage attacks and PAGI-2 (genomic island) integrations. A prophage was linked to the loss of lecA. The lecB regions were found less impacted by such rearrangements but greater lecB than lecA genetic divergences were recorded. Sixteen combinations of LecA and LecB types were observed. Amino acid variations were mapped on PAO1 crystal structures. Most significant changes were observed on LecBPA7, and found close to the fucose binding site. Glycan array analyses were performed with purified LecBPA7. LecBPA7 was found less specific for fucosylated oligosaccharides than LecBPAO1, with a preference for H type 2 rather than type 1, and Lewisa rather than Lewisx. Comparison of the crystal structures of LecBPA7 and LecBPAO1 in complex with Lewisa showed these changes in specificity to have resulted from a modification of the water network between the lectin, galactose and GlcNAc residues. Incidence of these modifications on the interactions with calix[4]arene glycomimetics at the cell level was investigated. An aggregation test was used to establish the efficacy of these ligands. Great variations

  11. Variations in the G6PC2/ABCB11 genomic region are associated with fasting glucose levels

    DEFF Research Database (Denmark)

    Chen, Wei-Min; Erdos, Michael R; Jackson, Anne U

    2008-01-01

    Identifying the genetic variants that regulate fasting glucose concentrations may further our understanding of the pathogenesis of diabetes. We therefore investigated the association of fasting glucose levels with SNPs in 2 genome-wide scans including a total of 5,088 nondiabetic individuals from...... Finland and Sardinia. We found a significant association between the SNP rs563694 and fasting glucose concentrations (P = 3.5 x 10(-7)). This association was further investigated in an additional 18,436 nondiabetic individuals of mixed European descent from 7 different studies. The combined P value...... for association in these follow-up samples was 6.9 x 10(-26), and combining results from all studies resulted in an overall P value for association of 6.4 x 10(-33). Across these studies, fasting glucose concentrations increased 0.01-0.16 mM with each copy of the major allele, accounting for approximately 1...

  12. Regulatory genomic regions active in immune cell types explain a large proportion of the genetic risk of multiple sclerosis.

    Science.gov (United States)

    Elangovan, Ramyiadarsini I; Disanto, Giulio; Berlanga-Taylor, Antonio J; Ramagopalan, Sreeram V; Handunnetthi, Lahiru

    2014-04-01

    There is little understanding of how genetic variants discovered in recent genome-wide association studies are involved in the pathogenesis of multiple sclerosis (MS). We aimed to investigate which chromatin states and cell types explain genetic risk in MS. We used genotype data from 1854 MS patients and 5164 controls produced by the International Multiple Sclerosis Genetics Consortium and Wellcome Trust Case Control Consortium. We estimated the proportion of phenotypic variance between cases and controls explained by cell-specific chromatin state and DNase I hypersensitivity sites (DHSs) using the Genome-wide Complex Trait Analysis software. A large proportion of variance was explained by single-nucleotide polymorphisms (SNPs) in strong enhancer (SE) elements of immortalized B lymphocytes (5.39%). Three independent SNPs located within SE showed suggestive evidence of association with MS: rs12928822 (odds ratio (OR)=0.81, 95% confidence interval (CI)=0.73-0.89, P=2.48E-05), rs727263 (OR=0.75, 95% CI=0.66-0.85, P=3.26E-06) and rs4674923 (OR=0.85, 95% CI=0.79-0.92, P=1.63E-05). Genetic variants located within DHSs of CD19+ B cells explained the greatest proportion of variance. Genetic variants influencing the risk of MS are located within regulatory elements active in immune cells. This study also identifies a number of immune cell types likely to be involved in the causal cascade and that carry important implications for future studies of therapeutic design.

  13. Multiple sex-associated regions and a putative sex chromosome in zebrafish revealed by RAD mapping and population genomics.

    Directory of Open Access Journals (Sweden)

    Jennifer L Anderson

    Full Text Available Within vertebrates, major sex determining genes can differ among taxa and even within species. In zebrafish (Danio rerio, neither heteromorphic sex chromosomes nor single sex determination genes of large effect, like Sry in mammals, have yet been identified. Furthermore, environmental factors can influence zebrafish sex determination. Although progress has been made in understanding zebrafish gonad differentiation (e.g. the influence of germ cells on gonad fate, the primary genetic basis of zebrafish sex determination remains poorly understood. To identify genetic loci associated with sex, we analyzed F(2 offspring of reciprocal crosses between Oregon *AB and Nadia (NA wild-type zebrafish stocks. Genome-wide linkage analysis, using more than 5,000 sequence-based polymorphic restriction site associated (RAD-tag markers and population genomic analysis of more than 30,000 single nucleotide polymorphisms in our *ABxNA crosses revealed a sex-associated locus on the end of the long arm of chr-4 for both cross families, and an additional locus in the middle of chr-3 in one cross family. Additional sequencing showed that two SNPs in dmrt1 previously suggested to be functional candidates for sex determination in a cross of ABxIndia wild-type zebrafish, are not associated with sex in our AB fish. Our data show that sex determination in zebrafish is polygenic and that different genes may influence sex determination in different strains or that different genes become more important under different environmental conditions. The association of the end of chr-4 with sex is remarkable because, unique in the karyotype, this chromosome arm shares features with known sex chromosomes: it is highly heterochromatic, repetitive, late replicating, and has reduced recombination. Our results reveal that chr-4 has functional and structural properties expected of a sex chromosome.

  14. Multiple sex-associated regions and a putative sex chromosome in zebrafish revealed by RAD mapping and population genomics.

    Science.gov (United States)

    Anderson, Jennifer L; Rodríguez Marí, Adriana; Braasch, Ingo; Amores, Angel; Hohenlohe, Paul; Batzel, Peter; Postlethwait, John H

    2012-01-01

    Within vertebrates, major sex determining genes can differ among taxa and even within species. In zebrafish (Danio rerio), neither heteromorphic sex chromosomes nor single sex determination genes of large effect, like Sry in mammals, have yet been identified. Furthermore, environmental factors can influence zebrafish sex determination. Although progress has been made in understanding zebrafish gonad differentiation (e.g. the influence of germ cells on gonad fate), the primary genetic basis of zebrafish sex determination remains poorly understood. To identify genetic loci associated with sex, we analyzed F(2) offspring of reciprocal crosses between Oregon *AB and Nadia (NA) wild-type zebrafish stocks. Genome-wide linkage analysis, using more than 5,000 sequence-based polymorphic restriction site associated (RAD-tag) markers and population genomic analysis of more than 30,000 single nucleotide polymorphisms in our *ABxNA crosses revealed a sex-associated locus on the end of the long arm of chr-4 for both cross families, and an additional locus in the middle of chr-3 in one cross family. Additional sequencing showed that two SNPs in dmrt1 previously suggested to be functional candidates for sex determination in a cross of ABxIndia wild-type zebrafish, are not associated with sex in our AB fish. Our data show that sex determination in zebrafish is polygenic and that different genes may influence sex determination in different strains or that different genes become more important under different environmental conditions. The association of the end of chr-4 with sex is remarkable because, unique in the karyotype, this chromosome arm shares features with known sex chromosomes: it is highly heterochromatic, repetitive, late replicating, and has reduced recombination. Our results reveal that chr-4 has functional and structural properties expected of a sex chromosome.

  15. BAC array CGH in patients with Velocardiofacial syndrome-like features reveals genomic aberrations on chromosome region 1q21.1

    Directory of Open Access Journals (Sweden)

    Estivill Xavier

    2009-12-01

    Full Text Available Abstract Background Microdeletion of the chromosome 22q11.2 region is the most common genetic aberration among patients with velocardiofacial syndrome (VCFS but a subset of subjects do not show alterations of this chromosome region. Methods We analyzed 18 patients with VCFS-like features by comparative genomic hybridisation (aCGH array and performed a face-to-face slide hybridization with two different arrays: a whole genome and a chromosome 22-specific BAC array. Putative rearrangements were confirmed by FISH and MLPA assays. Results One patient carried a combination of rearrangements on 1q21.1, consisting in a microduplication of 212 kb and a close microdeletion of 1.15 Mb, previously reported in patients with variable phenotypes, including mental retardation, congenital heart defects (CHD and schizophrenia. While 326 control samples were negative for both 1q21.1 rearrangements, one of 73 patients carried the same 212-kb microduplication, reciprocal to TAR microdeletion syndrome. Also, we detected four copy number variants (CNVs inherited from one parent (a 744-kb duplication on 10q11.22; a 160 kb duplication and deletion on 22q11.21 in two cases; and a gain of 140 kb on 22q13.2, not present in control subjects, raising the potential role of these CNVs in the VCFS-like phenotype. Conclusions Our results confirmed aCGH as a successful strategy in order to characterize additional submicroscopic aberrations in patients with VCF-like features that fail to show alterations in 22q11.2 region. We report a 212-kb microduplication on 1q21.1, detected in two patients, which may contribute to CHD.

  16. hSmad5 gene, a human hSmad family member: its full length cDNA, genomic structure, promoter region and mutation analysis in human tumors.

    Science.gov (United States)

    Gemma, A; Hagiwara, K; Vincent, F; Ke, Y; Hancock, A R; Nagashima, M; Bennett, W P; Harris, C C

    1998-02-19

    hSmad (mothers against decapentaplegic)-related proteins are important messengers within the Transforming Growth Factor-beta1 (TGF-beta1) superfamily signal transduction pathways. To further characterize a member of this family, we obtained a full length cDNA of the human hSmad5 (hSmad5) gene by rapid amplification of cDNA ends (RACE) and then determined the genomic structure of the gene. There are eight exons and two alternative transcripts; the shorter transcript lacks exon 2. We identified the hSmad5 promoter region from a human genomic YAC clone by obtaining the nucleotide sequence extending 1235 base pairs upstream of the 5' end of the cDNA. We found a CpG island consistent with a promoter region, and we demonstrated promoter activity in a 1232 bp fragment located upstream of the transcription initiation site. To investigate the frequency of somatic hSmad5 mutations in human cancers, we designed intron-based primers to examine coding regions by polymerase chain reaction-single strand conformation polymorphism (PCR-SSCP) analysis. Neither homozygous deletions or point mutations were found in 40 primary gastric tumors and 51 cell lines derived from diverse types of human cancer including 20 cell lines resistant to the growth inhibitory effects of TGF-beta1. These results suggest that the hSmad5 gene is not commonly mutated and that other genetic alterations mediate the loss of TGF-beta1 responsiveness in human cancers.

  17. Organization and expression of genes in the genomic region surrounding the glutamine synthetase gene Gln1 from Lotus japonicus

    DEFF Research Database (Denmark)

    Thykjaer, T; Danielsen, D; She, Q

    1997-01-01

    a segment carrying two apparently non-functional, fragmented copia-like retroelements, dRtp1 and dRtp2, was identified. Sequence similarity to reverse transcriptase- and RNaseH-coding regions defined the defective retro-elements dRtp1 and dRtp2 within this segment. Terminal repeats were not found but three...

  18. Genome wide expression profiling of the mesodiencephalic region identifies novel factors involved in early and late dopaminergic development

    Directory of Open Access Journals (Sweden)

    Koushik Chakrabarty

    2012-05-01

    Meso-diencephalic dopaminergic (mdDA neurons are critical for motor control and cognitive functioning and their loss or dysfunction is associated with disorders such as Parkinson's disease (PD, schizophrenia and addiction. However, relatively little is known about the molecular mechanisms underlying mdDA neuron development and maintenance. Here, we determined the spatiotemporal map of genes involved in the development of mdDA neurons to gain further insight into their molecular programming. Genome-wide gene expression profiles of the developing ventral mesencephalon (VM were compared at different developmental stages leading to the identification of novel regulatory roles of neuronal signaling through nicotinic acthylcholine receptors (Chrna6 and Chrnb3 subunits and the identification of novel transcription factors (Oc2 and 3 involved in the generation of the mdDA neuronal field. We show here that Pitx3, in cooperation with Nurr1, is the critical component in the activation of the Chrna6 and Chrnb3 subunits in mdDA neurons. Furthermore, we provide evidence of two divergent regulatory pathways resulting in the expression of Chrna6 and Chrnb3 respectively.

  19. Differentiation of Plum pox virus isolates by single-strand conformation polymorphism and low-stringency single specific primer PCR analysis of HC-Pro genome region.

    Science.gov (United States)

    Gadiou, S; Safárová, D; Navrátil, M

    2009-01-01

    Single-strand conformation polymorphism (SSCP) and low-stringency single specific primer (LSSP)-PCR were assessed for suitability and reliability in genotyping of Plum pox virus (PPV) isolates. Examined PPV isolates included 16 PPV-D, 12 PPV-M, and 14 PPV-Rec isolates collected in Czech Republic. The analysis was performed on the helper component protease (HC-Pro) region of the PPV genome. SSCP and LSSP-PCR allowed the differentiation of PPV strain, but SSCP was not able to distinguish isolates within the same strain. The individual genotyping of each PPV isolate was obtained by LSSP-PCR. Nevertheless, both SSCP and LSSP-PCR techniques are suitable for preliminary screening of genetic variability of plant RNA viruses.

  20. Progression from Sustained BK Viruria to Sustained BK Viremia with Immunosuppression Reduction Is Not Associated with Changes in the Noncoding Control Region of the BK Virus Genome

    Science.gov (United States)

    Memon, Imran A.; Parikh, Bijal A.; Gaudreault-Keener, Monique; Skelton, Rebecca; Storch, Gregory A.; Brennan, Daniel C.

    2012-01-01

    Changes in the BK virus archetypal noncoding control region (NCCR) have been associated with BK-virus-associated nephropathy (BKVAN). Whether sustained viremia, a surrogate for BKVAN, is associated with significant changes in the BK-NCCR is unknown. We performed PCR amplification and sequencing of (1) stored urine and (2) plasma samples from the time of peak viremia from 11 patients with sustained viremia who participated in a 200-patient clinical trial. The antimetabolite was withdrawn for BK viremia and reduction of the calcineurin inhibitor for sustained BK viremia. DNA sequencing from the 11 patients with sustained viremia revealed 8 insertions, 16 transversions, 3 deletions, and 17 transitions. None were deemed significant. No patient developed clinically evident BKVAN. Our data support, at a genomic level, the effectiveness of reduction of immunosuppression for prevention of progression from viremia to BKVAN. PMID:22701777

  1. The sequences of the spacer region between the atpF and atpA genes in the plastid genome allows discrimination among three varieties of medicinal Angelica.

    Science.gov (United States)

    Hosokawa, Keizo; Hishida, Atsuyuki; Nakamura, Ikuo; Shibata, Toshiro

    2006-05-01

    The dried roots of Angelica acutiloba Kitagawa var. acutiloba Kitagawa, A. acutiloba Kitagawa var. iwatensis Hikino and A. acutiloba Kitagawa var. sugiyamae Hikino have been used as the herbal medicine known in Japan as Japanese Angelica Root. The respective morphological features of, in particular, A. acutiloba var. sugiyamae and A. acutiloba var. iwatensis are similar, and they are not easy to distinguish morphologically from each other. In an attempt to find a method for discriminating among these three varieties, we compared the nucleotide sequence of the spacer region between the atpF and atpA genes among the respective plastid genomes. Comparison of these sequences allowed us to identify each of the three varieties unequivocally.

  2. Unusual features of control region and a novel NADH 6 genes in mitochondrial genome of the finespot goby, Chaeturichthys stigmatias (Perciformes, Gobiidae).

    Science.gov (United States)

    Sun, Yuena; Wei, Tao; Jin, Xiaoxiao

    2015-01-01

    In this article, we determined the complete mitogenome of finespot goby Chaeturichthys stigmatias with emphasis on the arranged gene order and gene feature with published Gobiidae species. The C. stigmatias mtDNA was 18,562 bp in length (56.94% AT), and comprised 37 genes (13 protein genes, 2 rRNAs and 22 tRNAs) that was typical for mitochondrial genome of Gobiidae species. Unusually, the NADH 6 gene was very large in length compared with other Gobiidae species. Mitogenome of C. stigmatias had a long putative control region with high AT content (71.28%). Within this sequence, we determined repeat regions, the termination-associated sequence and the conserved sequence block for this region. The origin of L-strand replication in C. stigmatias was located in a cluster of five tRNA genes (WANCY). The conserved motif (5'-GCCGG-3') was also determined at the base of the stem in the tRNA-Cys gene. This study will provide a better understanding of Gobiidae mitogenomes and offer useful information for future studies concerning Gobiidae mitogenome evolution.

  3. Most significant genome regions involved in the control of earliness traits in bread wheat, as revealed by QTL meta-analysis.

    Science.gov (United States)

    Hanocq, E; Laperche, A; Jaminon, O; Lainé, A-L; Le Gouis, J

    2007-02-01

    Earliness is one of the most important adaptation traits in plant breeding. Our purpose was to identify the genome regions of bread wheat involved in the control of earliness and its three components: photoperiod sensitivity (PS), vernalization requirement (VR) and intrinsic earliness (IE). A QTL meta-analysis was carried out to examine the replicability of QTL across 13 independent studies and to propose meta-QTL (MQTL). Initial QTL were projected on a recent consensus map (2004). Quality criteria were proposed to assess the reliability of this projection. These criteria were based on the distances between markers in the QTL regions. Chromosomes of groups 2 and 5 had a greater incidence on earliness control as they carry the known, major genes Ppd and Vrn. Other chromosome regions played an intermediate role in earliness control: 4A [heading date (HD) Meta-QTL], 4B (HD MQTL), 2B (VR MQTL) and 5B (IE MQTL). Markers at this four MQTL should prove helpful in marker-assisted selection, to better control earliness.

  4. Smoking correlates with increased cytoskeletal protein-related coding region mutations in the lung and head and neck datasets of the cancer genome atlas.

    Science.gov (United States)

    Yavorski, John M; Blanck, George

    2016-12-01

    Cancer from smoking tobacco is considered dependent on mutagens, but significant molecular aspects of smoking-specific, cancer development remain unknown. We defined sets of coding regions for oncoproteins, tumor suppressor proteins, and cytoskeletal-related proteins that were compared between nonsmokers and smokers, for mutation occurrences, in the lung adenocarcinoma (LUAD), head and neck squamous carcinoma (HNSC), bladder carcinoma (BLCA), and pancreatic adenocarcinoma ( PAAD) datasets from the cancer genome atlas (TCGA). We uncovered significant differences in overall mutation rates, and in mutation rates in cytoskeletal protein-related coding regions (CPCRs, including extracellular matrix protein coding regions), between nonsmokers and smokers in LUAD and HNSC (P < 0.001), raising the question of whether the CPCR mutation differences lead to different clinical courses for nonsmoker and smoker cancers. Another important question inspired by these results is, whether high smoker cancer mutation rates would facilitate genotoxicity or neoantigen-based therapies. No significant, mutation-based differences were found in the BLCA or PAAD datasets, between nonsmokers and smokers. However, a significant difference was uncovered for the average number of overall cancer mutations, in LUAD, for persons who stopped smoking more than 15 years ago, compared with more recent smokers (P < 0.032).

  5. Inter- and intraspecific genomic variability of the 16S-23S intergenic spacer regions (ISR) in representatives of Acidithiobacillus thiooxidans and Acidithiobacillus ferrooxidans.

    Science.gov (United States)

    Ni, Yong-Qing; Yang, Yuan; Bao, Jing-Ting; He, Kai-Yu; Li, Hong-Yu

    2007-05-01

    The complete sequences of 32 intergenic spacer regions (ISR) from Acidithiobacillus strains, including 29 field strains isolated from coal, copper, molybdenum mine wastes or sediment of different geoclimatic regions in China, reference strain ATCC19859 and the type strains of the two species were determined. These data, together with other sequences available in the GenBank database, were used to carry out the first detailed assessment of the inter- and intraspecific genomic variability of the ISR sequences and to infer phylogenetic relationships within the genus. The total length of the 16S-23S rRNA intergenic spacer regions of the Acidithiobacillus thiooxidans and Acidithiobacillus ferrooxidans strains ranged from 451 to 490 bp, and from 434 to 456 bp, respectively. The degree of intrageneric ISR sequence similarity was higher than the degree of intergeneric similarity, and the overall similarity values of the ISRs varied from 60.49% to 84.71% between representatives of different species of the genus Acidithiobacillus. Sequences from the spacer of the A. thiooxidans and A. ferrooxidans strains ranged from 86.71% to 99.56% and 92.36% to 100% similarity, respectively. All Acidithiobacillus strains were separated into three phylogenetic major clusters and seven phylogenetic groups. ISR may be a potential target for the development of in situ hybridization probe aimed at accurately detecting acidithiobacilli in the various acidic environments.

  6. ChIP-seq Analysis in R (CSAR: An R package for the statistical detection of protein-bound genomic regions

    Directory of Open Access Journals (Sweden)

    van Ham Roeland CHJ

    2011-05-01

    Full Text Available Abstract Background In vivo detection of protein-bound genomic regions can be achieved by combining chromatin-immunoprecipitation with next-generation sequencing technology (ChIP-seq. The large amount of sequence data produced by this method needs to be analyzed in a statistically proper and computationally efficient manner. The generation of high copy numbers of DNA fragments as an artifact of the PCR step in ChIP-seq is an important source of bias of this methodology. Results We present here an R package for the statistical analysis of ChIP-seq experiments. Taking the average size of DNA fragments subjected to sequencing into account, the software calculates single-nucleotide read-enrichment values. After normalization, sample and control are compared using a test based on the ratio test or the Poisson distribution. Test statistic thresholds to control the false discovery rate are obtained through random permutations. Computational efficiency is achieved by implementing the most time-consuming functions in C++ and integrating these in the R package. An analysis of simulated and experimental ChIP-seq data is presented to demonstrate the robustness of our method against PCR-artefacts and its adequate control of the error rate. Conclusions The software ChIP-seq Analysis in R (CSAR enables fast and accurate detection of protein-bound genomic regions through the analysis of ChIP-seq experiments. Compared to existing methods, we found that our package shows greater robustness against PCR-artefacts and better control of the error rate.

  7. Evolution of the rpoB-psbZ region in fern plastid genomes: notable structural rearrangements and highly variable intergenic spacers

    Directory of Open Access Journals (Sweden)

    Su Ying-Juan

    2011-04-01

    Full Text Available Abstract Background The rpoB-psbZ (BZ region of some fern plastid genomes (plastomes has been noted to go through considerable genomic changes. Unraveling its evolutionary dynamics across all fern lineages will lead to clarify the fundamental process shaping fern plastome structure and organization. Results A total of 24 fern BZ sequences were investigated with taxon sampling covering all the extant fern orders. We found that: (i a tree fern Plagiogyria japonica contained a novel gene order that can be generated from either the ancestral Angiopteris type or the derived Adiantum type via a single inversion; (ii the trnY-trnE intergenic spacer (IGS of the filmy fern Vandenboschia radicans was expanded 3-fold due to the tandem 27-bp repeats which showed strong sequence similarity with the anticodon domain of trnY; (iii the trnY-trnE IGSs of two horsetail ferns Equisetum ramosissimum and E. arvense underwent an unprecedented 5-kb long expansion, more than a quarter of which was consisted of a single type of direct repeats also relevant to the trnY anticodon domain; and (iv ycf66 has independently lost at least four times in ferns. Conclusions Our results provided fresh insights into the evolutionary process of fern BZ regions. The intermediate BZ gene order was not detected, supporting that the Adiantum type was generated by two inversions occurring in pairs. The occurrence of Vandenboschia 27-bp repeats represents the first evidence of partial tRNA gene duplication in fern plastomes. Repeats potentially forming a stem-loop structure play major roles in the expansion of the trnY-trnE IGS.

  8. Quantitative trait loci (QTL study identifies novel genomic regions associated to Chiari-like malformation in Griffon Bruxellois dogs.

    Directory of Open Access Journals (Sweden)

    Philippe Lemay

    Full Text Available Chiari-like malformation (CM is a developmental abnormality of the craniocervical junction that is common in the Griffon Bruxellois (GB breed with an estimated prevalence of 65%. This disease is characterized by overcrowding of the neural parenchyma at the craniocervical junction and disturbance of cerebrospinal fluid (CSF flow. The most common clinical sign is pain either as a direct consequence of CM or neuropathic pain as a consequence of secondary syringomyelia. The etiology of CM remains unknown but genetic factors play an important role. To investigate the genetic complexity of the disease, a quantitative trait locus (QTL approach was adopted. A total of 14 quantitative skull and atlas measurements were taken and were tested for association to CM. Six traits were found to be associated to CM and were subjected to a whole-genome association study using the Illumina canine high density bead chip in 74 GB dogs (50 affected and 24 controls. Linear and mixed regression analyses identified associated single nucleotide polymorphisms (SNPs on 5 Canis Familiaris Autosomes (CFAs: CFA2, CFA9, CFA12, CFA14 and CFA24. A reconstructed haplotype of 0.53 Mb on CFA2 strongly associated to the height of the cranial fossa (diameter F and an haplotype of 2.5 Mb on CFA14 associated to both the height of the rostral part of the caudal cranial fossa (AE and the height of the brain (FG were significantly associated to CM after 10 000 permutations strengthening their candidacy for this disease (P = 0.0421, P = 0.0094 respectively. The CFA2 QTL harbours the Sall-1 gene which is an excellent candidate since its orthologue in humans is mutated in Townes-Brocks syndrome which has previously been associated to Chiari malformation I. Our study demonstrates the implication of multiple traits in the etiology of CM and has successfully identified two new QTL associated to CM and a potential candidate gene.

  9. The 3' untranslated regions of influenza genomic sequences are 5'PPP-independent ligands for RIG-I.

    Directory of Open Access Journals (Sweden)

    William G Davis

    Full Text Available Retinoic acid inducible gene-I (RIG-I is a key regulator of antiviral immunity. RIG-I is generally thought to be activated by ssRNA species containing a 5'-triphosphate (PPP group or by unphosphorylated dsRNA up to ~300 bp in length. However, it is not yet clear how changes in the length, nucleotide sequence, secondary structure, and 5' end modification affect the abilities of these ligands to bind and activate RIG-I. To further investigate these parameters in the context of naturally occurring ligands, we examined RNA sequences derived from the 5' and 3' untranslated regions (UTR of the influenza virus NS1 gene segment. As expected, RIG-I-dependent interferon-β (IFN-β induction by sequences from the 5' UTR of the influenza cRNA or its complement (26 nt in length required the presence of a 5'PPP group. In contrast, activation of RIG-I by the 3' UTR cRNA sequence or its complement (172 nt exhibited only a partial 5'PPP-dependence, as capping the 5' end or treatment with CIP showed a modest reduction in RIG-I activation. Furthermore, induction of IFN-β by a smaller, U/A-rich region within the 3' UTR was completely 5'PPP-independent. Our findings demonstrated that RNA sequence, length, and secondary structure all contributed to whether or not the 5'PPP moiety is needed for interferon induction by RIG-I.

  10. Genome-wide DNA methylation analysis of neuroblastic tumors reveals clinically relevant epigenetic events and large-scale epigenomic alterations localized to telomeric regions

    NARCIS (Netherlands)

    P.G. Buckley; S. Das; K. Bryan; K.M. Watters; L. Alcock; J. Koster; R. Versteeg; R.L. Stallings

    2011-01-01

    The downregulation of specific genes through DNA hypermethylation is a major hallmark of cancer, although the extent and genomic distribution of hypermethylation occurring within cancer genomes is poorly understood. We report on the first genome-wide analysis of DNA methylation alterations in differ

  11. Investigating the prehistory of Tungusic peoples of Siberia and the Amur-Ussuri region with complete mtDNA genome sequences and Y-chromosomal markers.

    Directory of Open Access Journals (Sweden)

    Ana T Duggan

    Full Text Available Evenks and Evens, Tungusic-speaking reindeer herders and hunter-gatherers, are spread over a wide area of northern Asia, whereas their linguistic relatives the Udegey, sedentary fishermen and hunter-gatherers, are settled to the south of the lower Amur River. The prehistory and relationships of these Tungusic peoples are as yet poorly investigated, especially with respect to their interactions with neighbouring populations. In this study, we analyse over 500 complete mtDNA genome sequences from nine different Evenk and even subgroups as well as their geographic neighbours from Siberia and their linguistic relatives the Udegey from the Amur-Ussuri region in order to investigate the prehistory of the Tungusic populations. These data are supplemented with analyses of Y-chromosomal haplogroups and STR haplotypes in the Evenks, Evens, and neighbouring Siberian populations. We demonstrate that whereas the North Tungusic Evenks and Evens show evidence of shared ancestry both in the maternal and in the paternal line, this signal has been attenuated by genetic drift and differential gene flow with neighbouring populations, with isolation by distance further shaping the maternal genepool of the Evens. The Udegey, in contrast, appear quite divergent from their linguistic relatives in the maternal line, with a mtDNA haplogroup composition characteristic of populations of the Amur-Ussuri region. Nevertheless, they show affinities with the Evenks, indicating that they might be the result of admixture between local Amur-Ussuri populations and Tungusic populations from the north.

  12. BAC-end microsatellites from intra and inter-genic regions of the common bean genome and their correlation with cytogenetic features.

    Directory of Open Access Journals (Sweden)

    Matthew Wohlgemuth Blair

    Full Text Available Highly polymorphic markers such as simple sequence repeats (SSRs or microsatellites are very useful for genetic mapping. In this study novel SSRs were identified in BAC-end sequences (BES from non-contigged, non-overlapping bacterial artificial clones (BACs in common bean (Phaseolus vulgaris L.. These so called "singleton" BACs were from the G19833 Andean gene pool physical map and the new BES-SSR markers were used for the saturation of the inter-gene pool, DOR364×G19833 genetic map. A total of 899 SSR loci were found among the singleton BES, but only 346 loci corresponded to the single di- or tri-nucleotide motifs that were likely to be polymorphic (ATT or AG motifs, principally and useful for primer design and individual marker mapping. When these novel SSR markers were evaluated in the DOR364×G19833 population parents, 136 markers revealed polymorphism and 106 were mapped. Genetic mapping resulted in a map length of 2291 cM with an average distance between markers of 5.2 cM. The new genetic map was compared to the most recent cytogenetic analysis of common bean chromosomes. We found that the new singleton BES-SSR were helpful in filling peri-centromeric spaces on the cytogenetic map. Short genetic distances between some new singleton-derived BES-SSR markers was common showing suppressed recombination in these regions compared to other parts of the genome. The correlation of singleton-derived SSR marker distribution with other cytogenetic features of the bean genome is discussed.

  13. Genome-wide DNA methylation analyses in the brain reveal four differentially methylated regions between humans and non-human primates

    Directory of Open Access Journals (Sweden)

    Wang Jinkai

    2012-08-01

    Full Text Available Abstract Background The highly improved cognitive function is the most significant change in human evolutionary history. Recently, several large-scale studies reported the evolutionary roles of DNA methylation; however, the role of DNA methylation on brain evolution is largely unknown. Results To test if DNA methylation has contributed to the evolution of human brain, with the use of MeDIP-Chip and SEQUENOM MassARRAY, we conducted a genome-wide analysis to identify differentially methylated regions (DMRs in the brain between humans and rhesus macaques. We first identified a total of 150 candidate DMRs by the MeDIP-Chip method, among which 4 DMRs were confirmed by the MassARRAY analysis. All 4 DMRs are within or close to the CpG islands, and a MIR3 repeat element was identified in one DMR, but no repeat sequence was observed in the other 3 DMRs. For the 4 DMR genes, their proteins tend to be conserved and two genes have neural related functions. Bisulfite sequencing and phylogenetic comparison among human, chimpanzee, rhesus macaque and rat suggested several regions of lineage specific DNA methylation, including a human specific hypomethylated region in the promoter of K6IRS2 gene. Conclusions Our study provides a new angle of studying human brain evolution and understanding the evolutionary role of DNA methylation in the central nervous system. The results suggest that the patterns of DNA methylation in the brain are in general similar between humans and non-human primates, and only a few DMRs were identified.

  14. Nucleotide sequence and transcript organization of a region of the vaccinia virus genome which encodes a constitutively expressed gene required for DNA replication.

    Science.gov (United States)

    Roseman, N A; Hruby, D E

    1987-05-01

    A vaccinia virus (VV) gene required for DNA replication has been mapped to the left side of the 16-kilobase (kb) VV HindIII D DNA fragment by marker rescue of a DNA- temperature-sensitive mutant, ts17, using cloned fragments of the viral genome. The region of VV DNA containing the ts17 locus (3.6 kb) was sequenced. This nucleotide sequence contains one complete open reading frame (ORF) and two incomplete ORFs reading from left to right. Analysis of this region at early times revealed that transcription from the incomplete upstream ORF terminates coincidentally with the complete ORF encoding the ts17 gene product, which is directly downstream. The predicted proteins encoded by this region correlate well with polypeptides mapped by in vitro translation of hybrid-selected early mRNA. The nucleotide sequences of a 1.3-kb BglII fragment derived from ts17 and from two ts17 revertants were also determined, and the nature of the ts17 mutation was identified. S1 nuclease protection studies were carried out to determine the 5' and 3' ends of the transcripts and to examine the kinetics of expression of the ts17 gene during viral infection. The ts17 transcript is present at both early and late times postinfection, indicating that this gene is constitutively expressed. Surprisingly, the transcriptional start throughout infection occurs at the proposed late regulatory element TAA, which immediately precedes the putative initiation codon ATG. Although the biological activity of the ts17-encoded polypeptide was not identified, it was noted that in ts17-infected cells, expression of a nonlinked VV immediate-early gene (thymidine kinase) was deregulated at the nonpermissive temperature. This result may indicate that the ts17 gene product is functionally required at an early step of the VV replicative cycle.

  15. The database of chromosome imbalance regions and genes resided in lung cancer from Asian and Caucasian identified by array-comparative genomic hybridization

    Directory of Open Access Journals (Sweden)

    Lo Fang-Yi

    2012-06-01

    Full Text Available Abstract Background Cancer-related genes show racial differences. Therefore, identification and characterization of DNA copy number alteration regions in different racial groups helps to dissect the mechanism of tumorigenesis. Methods Array-comparative genomic hybridization (array-CGH was analyzed for DNA copy number profile in 40 Asian and 20 Caucasian lung cancer patients. Three methods including MetaCore analysis for disease and pathway correlations, concordance analysis between array-CGH database and the expression array database, and literature search for copy number variation genes were performed to select novel lung cancer candidate genes. Four candidate oncogenes were validated for DNA copy number and mRNA and protein expression by quantitative polymerase chain reaction (qPCR, chromogenic in situ hybridization (CISH, reverse transcriptase-qPCR (RT-qPCR, and immunohistochemistry (IHC in more patients. Results We identified 20 chromosomal imbalance regions harboring 459 genes for Caucasian and 17 regions containing 476 genes for Asian lung cancer patients. Seven common chromosomal imbalance regions harboring 117 genes, included gain on 3p13-14, 6p22.1, 9q21.13, 13q14.1, and 17p13.3; and loss on 3p22.2-22.3 and 13q13.3 were found both in Asian and Caucasian patients. Gene validation for four genes including ARHGAP19 (10q24.1 functioning in Rho activity control, FRAT2 (10q24.1 involved in Wnt signaling, PAFAH1B1 (17p13.3 functioning in motility control, and ZNF322A (6p22.1 involved in MAPK signaling was performed using qPCR and RT-qPCR. Mean gene dosage and mRNA expression level of the four candidate genes in tumor tissues were significantly higher than the corresponding normal tissues (PP=0.06. In addition, CISH analysis of patients indicated that copy number amplification indeed occurred for ARHGAP19 and ZNF322A genes in lung cancer patients. IHC analysis of paraffin blocks from Asian Caucasian patients demonstrated that the frequency of

  16. Report from the Maryland epidemiology schizophrenia linkage study: No evidence for linkage between schizophrenia and a number of candidate and other genomic regions using a complex dominant model

    Energy Technology Data Exchange (ETDEWEB)

    Karayiorgou, M.; Hwang, J.; Elango, R. [Massachusetts Institute of Technology, Cambridge, MA (United States)] [and others

    1994-12-15

    Our collaborative group has undertaken a linkage study of schizophrenia, using a systematic sample of patients admitted to Maryland hospitals. An initial sample of 39 families, each having two or more affecteds, was available for genotyping candidate genes, candidate regions, and highly polymorphic markers randomly distributed throughout the genome. We used a single complex dominant model (with a disease gene frequency of 0.005 and age-dependent penetrance for affected phenotype: for under 35, penetrance = .45; for 35 and older, penetrance = .85). We report here 130 markers which met the exclusion criteria of LOD score < -2.00 at theta > 0.01 in at least 10 informative families, and no evidence for heterogeneity. We also report here markers that were tested as candidates for linkage to the schizophrenic phenotype. They were selected based on the following criteria: (a) proximity to reported chromosomal rearrangements (both 5q and 11q), (b) suggestions of linkage from other families (5q), or (c) presence of a candidate gene (5q, 11q, 3q: dopamine receptors 1, 2, and 3, respectively). We also tested for mutations of codon 717 in exon 17 of the amyloid precursor protein (APP) gene and were unable to detect the C to T substitution in our schizophrenic group. 48 refs., 2 tabs.

  17. Inhibition of hepatitis B virus by the CRISPR/Cas9 system via targeting the conserved regions of the viral genome.

    Science.gov (United States)

    Liu, Xing; Hao, Ruidong; Chen, Shuliang; Guo, Deyin; Chen, Yu

    2015-08-01

    Hepatitis B virus (HBV) remains a global health threat as chronic HBV infection may lead to liver cirrhosis or cancer. Current antiviral therapies with nucleoside analogues can inhibit the replication of HBV, but do not disrupt the already existing HBV covalently closed circular DNA. The newly developed CRISPR (clustered regularly interspaced short palindromic repeats)/Cas9 (CRISPR-associated 9) system is a powerful tool to target cellular genome DNA for gene editing. In order to investigate the possibility of using the CRISPR/Cas9 system to disrupt the HBV DNA templates, we designed eight guide RNAs (gRNAs) that targeted the conserved regions of different HBV genotypes, which could significantly inhibit HBV replication both in vitro and in vivo. Moreover, the HBV-specific gRNA/Cas9 system could inhibit the replication of HBV of different genotypes in cells, and the viral DNA was significantly reduced by a single gRNA/Cas9 system and cleared by a combination of different gRNA/Cas9 systems.

  18. The pathogenicity determinant of Citrus tristeza virus causing the seedling yellows syndrome maps at the 3'-terminal region of the viral genome.

    Science.gov (United States)

    Albiach-Marti, Maria R; Robertson, Cecile; Gowda, Siddarame; Tatineni, Satyanarayana; Belliure, Belén; Garnsey, Stephen M; Folimonova, Svetlana Y; Moreno, Pedro; Dawson, William O

    2010-01-01

    Citrus tristeza virus (CTV) (genus Closterovirus, family Closteroviridae) causes some of the more important viral diseases of citrus worldwide. The ability to map disease-inducing determinants of CTV is needed to develop better diagnostic and disease control procedures. A distinctive phenotype of some isolates of CTV is the ability to induce seedling yellows (SY) in sour orange, lemon and grapefruit seedlings. In Florida, the decline isolate of CTV, T36, induces SY, whereas a widely distributed mild isolate, T30, does not. To delimit the viral sequences associated with the SY syndrome, we created a number of T36/T30 hybrids by substituting T30 sequences into different regions of the 3' half of the genome of an infectious cDNA of T36. Eleven T36/T30 hybrids replicated in Nicotiana benthamiana protoplasts. Five of these hybrids formed viable virions that were mechanically transmitted to Citrus macrophylla, a permissive host for CTV. All induced systemic infections, similar to that of the parental T36 clone. Tissues from these C. macrophylla source plants were then used to graft inoculate sour orange and grapefruit seedlings. Inoculation with three of the T30/T36 hybrid constructs induced SY symptoms identical to those of T36; however, two hybrids with T30 substitutions in the p23-3' nontranslated region (NTR) (nucleotides 18 394-19 296) failed to induce SY. Sour orange seedlings infected with a recombinant non-SY p23-3' NTR hybrid also remained symptomless when challenged with the parental virus (T36), demonstrating the potential feasibility of using engineered constructs of CTV to mitigate disease.

  19. Genome-wide association study to identify chromosomal regions associated with antibody response to Mycobacterium avium subspecies paratuberculosis in milk of Dutch Holstein-Friesians.

    Science.gov (United States)

    van Hulzen, K J E; Schopen, G C B; van Arendonk, J A M; Nielen, M; Koets, A P; Schrooten, C; Heuven, H C M

    2012-05-01

    Heritability of susceptibility to Johne's disease in cattle has been shown to vary from 0.041 to 0.159. Although the presence of genetic variation involved in susceptibility to Johne's disease has been demonstrated, the understanding of genes contributing to the genetic variance is far from complete. The objective of this study was to contribute to further understanding of genetic variation involved in susceptibility to Johne's disease by identifying associated chromosomal regions using a genome-wide association approach. Log-transformed ELISA test results of 265,290 individual Holstein-Friesian cows from 3,927 herds from the Netherlands were analyzed to obtain sire estimated breeding values for Mycobacterium avium subspecies paratuberculosis (MAP)-specific antibody response in milk using a sire-maternal grandsire model with fixed effects for parity, year of birth, lactation stage, and herd; a covariate for milk yield on test day; and random effects for sire, maternal grandsire, and error. For 192 sires with estimated breeding values with a minimum reliability of 70%, single nucleotide polymorphism (SNP) typing was conducted by a multiple SNP analysis with a random polygenic effect fitting 37,869 SNP simultaneously. Five SNP associated with MAP-specific antibody response in milk were identified distributed over 4 chromosomal regions (chromosome 4, 15, 18, and 28). Thirteen putative SNP associated with MAP-specific antibody response in milk were identified distributed over 10 chromosomes (chromosome 4, 14, 16, 18, 19, 20, 21, 26, 27, and 29). This knowledge contributes to the current understanding of genetic variation involved in Johne's disease susceptibility and facilitates control of Johne's disease and improvement of health status by breeding.

  20. Genetically based location from triploid populations and gene ontology of a 3.3-mb genome region linked to Alternaria brown spot resistance in citrus reveal clusters of resistance genes.

    Directory of Open Access Journals (Sweden)

    José Cuenca

    Full Text Available Genetic analysis of phenotypical traits and marker-trait association in polyploid species is generally considered as a challenge. In the present work, different approaches were combined taking advantage of the particular genetic structures of 2n gametes resulting from second division restitution (SDR to map a genome region linked to Alternaria brown spot (ABS resistance in triploid citrus progeny. ABS in citrus is a serious disease caused by the tangerine pathotype of the fungus Alternaria alternata. This pathogen produces ACT-toxin, which induces necrotic lesions on fruit and young leaves, defoliation and fruit drop in susceptible genotypes. It is a strong concern for triploid breeding programs aiming to produce seedless mandarin cultivars. The monolocus dominant inheritance of susceptibility, proposed on the basis of diploid population studies, was corroborated in triploid progeny. Bulk segregant analysis coupled with genome scan using a large set of genetically mapped SNP markers and targeted genetic mapping by half tetrad analysis, using SSR and SNP markers, allowed locating a 3.3 Mb genomic region linked to ABS resistance near the centromere of chromosome III. Clusters of resistance genes were identified by gene ontology analysis of this genomic region. Some of these genes are good candidates to control the dominant susceptibility to the ACT-toxin. SSR and SNP markers were developed for efficient early marker-assisted selection of ABS resistant hybrids.

  1. Genetically based location from triploid populations and gene ontology of a 3.3-mb genome region linked to Alternaria brown spot resistance in citrus reveal clusters of resistance genes.

    Science.gov (United States)

    Cuenca, José; Aleza, Pablo; Vicent, Antonio; Brunel, Dominique; Ollitrault, Patrick; Navarro, Luis

    2013-01-01

    Genetic analysis of phenotypical traits and marker-trait association in polyploid species is generally considered as a challenge. In the present work, different approaches were combined taking advantage of the particular genetic structures of 2n gametes resulting from second division restitution (SDR) to map a genome region linked to Alternaria brown spot (ABS) resistance in triploid citrus progeny. ABS in citrus is a serious disease caused by the tangerine pathotype of the fungus Alternaria alternata. This pathogen produces ACT-toxin, which induces necrotic lesions on fruit and young leaves, defoliation and fruit drop in susceptible genotypes. It is a strong concern for triploid breeding programs aiming to produce seedless mandarin cultivars. The monolocus dominant inheritance of susceptibility, proposed on the basis of diploid population studies, was corroborated in triploid progeny. Bulk segregant analysis coupled with genome scan using a large set of genetically mapped SNP markers and targeted genetic mapping by half tetrad analysis, using SSR and SNP markers, allowed locating a 3.3 Mb genomic region linked to ABS resistance near the centromere of chromosome III. Clusters of resistance genes were identified by gene ontology analysis of this genomic region. Some of these genes are good candidates to control the dominant susceptibility to the ACT-toxin. SSR and SNP markers were developed for efficient early marker-assisted selection of ABS resistant hybrids.

  2. Genetically Based Location from Triploid Populations and Gene Ontology of a 3.3-Mb Genome Region Linked to Alternaria Brown Spot Resistance in Citrus Reveal Clusters of Resistance Genes

    Science.gov (United States)

    Cuenca, José; Aleza, Pablo; Vicent, Antonio; Brunel, Dominique; Ollitrault, Patrick; Navarro, Luis

    2013-01-01

    Genetic analysis of phenotypical traits and marker-trait association in polyploid species is generally considered as a challenge. In the present work, different approaches were combined taking advantage of the particular genetic structures of 2n gametes resulting from second division restitution (SDR) to map a genome region linked to Alternaria brown spot (ABS) resistance in triploid citrus progeny. ABS in citrus is a serious disease caused by the tangerine pathotype of the fungus Alternaria alternata. This pathogen produces ACT-toxin, which induces necrotic lesions on fruit and young leaves, defoliation and fruit drop in susceptible genotypes. It is a strong concern for triploid breeding programs aiming to produce seedless mandarin cultivars. The monolocus dominant inheritance of susceptibility, proposed on the basis of diploid population studies, was corroborated in triploid progeny. Bulk segregant analysis coupled with genome scan using a large set of genetically mapped SNP markers and targeted genetic mapping by half tetrad analysis, using SSR and SNP markers, allowed locating a 3.3 Mb genomic region linked to ABS resistance near the centromere of chromosome III. Clusters of resistance genes were identified by gene ontology analysis of this genomic region. Some of these genes are good candidates to control the dominant susceptibility to the ACT-toxin. SSR and SNP markers were developed for efficient early marker-assisted selection of ABS resistant hybrids. PMID:24116149

  3. Genomic sequence analysis of the 238-kb swine segment with a cluster of TRIM and olfactory receptor genes located, but with no class I genes, at the distal end of the SLA class I region.

    Science.gov (United States)

    Ando, Asako; Shigenari, Atsuko; Kulski, Jerzy K; Renard, Christine; Chardon, Patrick; Shiina, Takashi; Inoko, Hidetoshi

    2005-12-01

    Continuous genomic sequence has been previously determined for the swine leukocyte antigen (SLA) class I region from the TNF gene cluster at the border between the major histocompatibility complex (MHC) class III and class I regions to the UBD gene at the telomeric end of the classical class I gene cluster (SLA-1 to SLA-5, SLA-9, SLA-11). To complete the genomic sequence of the entire SLA class I genomic region, we have analyzed the genomic sequences of two BAC clones carrying a continuous 237,633-bp-long segment spanning from the TRIM15 gene to the UBD gene located on the telomeric side of the classical SLA class I gene cluster. Fifteen non-class I genes, including the zinc finger and the tripartite motif (TRIM) ring-finger-related family genes and olfactory receptor genes, were identified in the 238-kilobase (kb) segment, and their location in the segment was similar to their apparent human homologs. In contrast, a human segment (alpha block) spanning about 375 kb from the gene ETF1P1 and from the HLA-J to HLA-F genes was absent from the 238-kb swine segment. We conclude that the gene organization of the MHC non-class I genes located in the telomeric side of the classical SLA class I gene cluster is remarkably similar between the swine and the human segments, although the swine lacks a 375-kb segment corresponding to the human alpha block.

  4. Genomics of Clostridium tetani.

    Science.gov (United States)

    Brüggemann, Holger; Brzuszkiewicz, Elzbieta; Chapeton-Montes, Diana; Plourde, Lucile; Speck, Denis; Popoff, Michel R

    2015-05-01

    Genomic information about Clostridium tetani, the causative agent of the tetanus disease, is scarce. The genome of strain E88, a strain used in vaccine production, was sequenced about 10 years ago. One additional genome (strain 12124569) has recently been released. Here we report three new genomes of C. tetani and describe major differences among all five C. tetani genomes. They all harbor tetanus-toxin-encoding plasmids that contain highly conserved genes for TeNT (tetanus toxin), TetR (transcriptional regulator of TeNT) and ColT (collagenase), but substantially differ in other plasmid regions. The chromosomes share a large core genome that contains about 85% of all genes of a given chromosome. The non-core chromosome comprises mainly prophage-like genomic regions and genes encoding environmental interaction and defense functions (e.g. surface proteins, restriction-modification systems, toxin-antitoxin systems, CRISPR/Cas systems) and other fitness functions (e.g. transport systems, metabolic activities). This new genome information will help to assess the level of genome plasticity of the species C. tetani and provide the basis for detailed comparative studies.

  5. Array comparative genomic hybridisation analysis of boys with X linked hypopituitarism identifies a 3.9 Mb duplicated critical region at Xq27 containing SOX3.

    NARCIS (Netherlands)

    Solomon, N.M.; Ross, S.; Morgan, T.; Belsky, J.L.; Hol, F.A.; Karnes, P.; Hopwood, N.J.; Myers, S.E.; Tan, A.; Warne, G.L.; Forrest, S.M.; Thomas, P.Q.

    2004-01-01

    INTRODUCTION: Array comparative genomic hybridisation (array CGH) is a powerful method that detects alteration of gene copy number with greater resolution and efficiency than traditional methods. However, its ability to detect disease causing duplications in constitutional genomic DNA has not been s

  6. Bin mapping of tomato diversity array (DArT) markers to genomic regions of Solanum lycopersicum × Solanum pennellii introgression lines

    NARCIS (Netherlands)

    Schalkwyk, A.; Wenzl, P.; Smit, S.; Lopez-Cobollo, R.; Kilian, A.; Bishop, G.; Hefer, C.; Berger, D.K.

    2012-01-01

    Marker-trait association studies in tomato have progressed rapidly due to the availability of several populations developed between wild species and domesticated tomato. However, in the absence of whole genome sequences for each wild species, molecular marker methods for whole genome comparisons and

  7. Complete mitochondrial genome of the Tristram's Bunting, Emberiza tristrami (Aves: Passeriformes): the first representative of the family Emberizidae with six boxes in the central conserved domain II of control region.

    Science.gov (United States)

    Kan, Xianzhao; Yuan, Jian; Zhang, Liqin; Li, Xifeng; Yu, Lei; Chen, Lei; Guo, Zhichun; Yang, Jianke

    2013-12-01

    Mitochondrial genome has proven to be a powerful tool for phylogenetic inference, phylogeography, and molecular evolution. In this study, we determined the complete mitochondrial genome of Emberiza tristrami (Passeriformes: Emberizidae) for use in future phylogenetic analyses. This circular mitochondrial genome is 16,789 bp in length and composed of 13 typical protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and 1 putative control region (CR). One extra nucleotide "C" of nad3 is not detected in the mitogenome of E. tristrami. The CR of E. tristrami can be divided into three domains: ETAS (extended termination-associated sequence) domain I (nt 1-431), central conserved domain II (nt 432-847), and CSB (conserved sequence block) domain III (nt 848-1217). Six conserved sequence boxes in the central conserved domain II were identified as boxes F, E, D, C, b, and B.

  8. Genome evolution of Oryza

    Directory of Open Access Journals (Sweden)

    Tieyan Liu

    2014-01-01

    Full Text Available The genus Oryza is composed of approximately 24 species. Wild species of Oryza contain a largely untapped resource of agronomically important genes. As an increasing number of genomes of wild rice species have been or will be sequenced, Oryza is becoming a model system for plant comparative, functional and evolutionary genomics studies. Comparative analyses of large genomic regions and whole-genome sequences have revealed molecular mechanisms involved in genome size variation, gene movement, genome evolution of polyploids, transition of euchromatin to heterochromatin and centromere evolution in the genus Oryza. Transposon activity and removal of transposable elements by unequal recombination or illegitimate recombination are two important factors contributing to expansion or contraction of Oryza genomes. Double-strand break repair mediated gene movement, especially non-homologous end joining, is an important source of non-colinear genes. Transition of euchromatin to heterochromatin is accompanied by transposable element amplification, segmental and tandem duplication of genic segments, and acquisition of heterochromatic genes from other genomic locations. Comparative analyses of multiple genomes dramatically improve the precision and sensitivity of evolutionary inference than single-genome analyses can provide. Further investigations on the impact of structural variation, lineage-specific genes and evolution of agriculturally important genes on phenotype diversity and adaptation in the genus Oryza should facilitate molecular breeding and genetic improvement of rice.

  9. QTL analysis of novel genomic regions associated with yield and yield related traits in new plant type based recombinant inbred lines of rice (Oryza sativa L.

    Directory of Open Access Journals (Sweden)

    Marathi Balram

    2012-08-01

    Full Text Available Abstract Background Rice is staple food for more than half of the world’s population including two billion Asians, who obtain 60-70% of their energy intake from rice and its derivatives. To meet the growing demand from human population, rice varieties with higher yield potential and greater yield stability need to be developed. The favourable alleles for yield and yield contributing traits are distributed among two subspecies i.e., indica and japonica of cultivated rice (Oryza sativa L.. Identification of novel favourable alleles in indica/japonica will pave way to marker-assisted mobilization of these alleles in to a genetic background to break genetic barriers to yield. Results A new plant type (NPT based mapping population of 310 recombinant inbred lines (RILs was used to map novel genomic regions and QTL hotspots influencing yield and eleven yield component traits. We identified major quantitative trait loci (QTLs for days to 50% flowering (R2 = 25%, LOD = 14.3, panicles per plant (R2 = 19%, LOD = 9.74, flag leaf length (R2 = 22%, LOD = 3.05, flag leaf width (R2 = 53%, LOD = 46.5, spikelets per panicle (R2 = 16%, LOD = 13.8, filled grains per panicle (R2 = 22%, LOD = 15.3, percent spikelet sterility (R2 = 18%, LOD = 14.24, thousand grain weight (R2 = 25%, LOD = 12.9 and spikelet setting density (R2 = 23%, LOD = 15 expressing over two or more locations by using composite interval mapping. The phenotypic variation (R2 ranged from 8 to 53% for eleven QTLs expressing across all three locations. 19 novel QTLs were contributed by the NPT parent, Pusa1266. 15 QTL hotpots on eight chromosomes were identified for the correlated traits. Six epistatic QTLs effecting five traits at two locations were identified. A marker interval (RM3276-RM5709 on chromosome 4 harboring major QTLs for four traits was identified. Conclusions The present study reveals that favourable alleles for

  10. Association and haplotype analysis of candidate genes in five genomic regions linked to sow maternal infanticide in a white Duroc × Erhualian resource population

    Directory of Open Access Journals (Sweden)

    Ding Nengshui

    2011-02-01

    Full Text Available Abstract Background Maternal infanticide is an extreme and failed maternal behavior, which is defined as an active attack on piglets using the jaws, resulting in serious or fatal bite wounds. It brings big economic loss to the pig industry and severe problems to piglets' welfare. But little is known about the genetic background of this behavior. Quantitative trait loci (QTL for maternal infanticide were identified in a White Duroc × Erhualian intercross by a non-parametric linkage analysis (NPL in our previous study. In this study, associations of 194 microsatellite markers used in NPL analysis with maternal infanticide behavior were further analyzed by transmission-disequilibrium test (TDT. On this basis, seven genes (ESR2, EAAT2, BDNF, OXTR, 5-HTR2C, DRD1 and GABRA6 at five genomic regions were selected and further analyzed. Associations of single nucleotide polymorphisms (SNPs and haplotypes in each gene with maternal infanticide behavior were evaluated. Results Microsatellite markers on pig chromosome (SSC 2, 13, 15, and X displayed significance at P ESR2 SNPs had nominal evidence for association (P A at EAAT2 g. 233G > A and allele T at DRD1 g.1013C > G > T also showed evidence of overtransmission to infanticidal sows. In the overall tests of association of haplotypes, candidate genes of ESR2, EAAT2 and DRD1 achieved overall significance level (P ESR2, EAAT2 and DRD1 showed higher frequencies to infanticidal sows (P Conclusions From association tests of SNPs and haplotypes, ESR2, EAAT2 and DRD1 showed significant associations with maternal infanticide. This result supported the existence of QTL for maternal infanticide behavior on SSC1, SSC2 and SSC16.

  11. Expansion of a unique region in the Marek's disease virus genome occurs concomitantly with attenuation but is not sufficient to cause attenuation.

    Science.gov (United States)

    Silva, R F; Reddy, S M; Lupiani, B

    2004-01-01

    Pathogenic Marek's disease viruses (MDVs) have two head-to-tail copies of a 132-bp repeat. As MDV is serially passaged in cell culture, the virus becomes attenuated and the number of copies of the 132-bp repeat increases from 2 to often more than 20 copies. To determine the role of the repeats in attenuation, we used five overlapping cosmid clones that spanned the MDV genome to reconstitute infectious virus (rMd5). By mutating the appropriate cosmids, we generated clones of infectious MDVs that contained zero copies of the 132-bp repeats, rMd5(Delta132); nine copies of the 132-bp repeats, rMd5(9-132); and nine copies of the 132-bp repeats inserted in the reverse orientation, rMd5(rev9-132). After two passages in cell culture, wild-type Md5, rMd5, and rMd5(Delta132) were stable. However, rMd5(9-132) and rMd5(rev9-132) contained a population of viruses that contained from 3 to over 20 copies of the repeats. A major 1.8-kb mRNA, containing two copies of the 132-bp repeat, was present in wild-type Md5 and rMd5 but was not present in rMd5(Delta132), rMd5(9-132), rMd5(rev9-132), or an attenuated MDV. Instead, the RNAs transcribed from the 132-bp repeat region in rMd5(9-132) and rMd5(rev9-132) closely resembled the pattern of RNAs transcribed in attenuated MDVs. When inoculated into susceptible day-old chicks, all viruses produced various lesions. Thus, expansion of the number of copies of 132-bp repeats, which accompanies attenuation, is not sufficient in itself to attenuate pathogenic MDVs.

  12. Did androgen-binding protein paralogs undergo neo- and/or Subfunctionalization as the Abp gene region expanded in the mouse genome?

    Science.gov (United States)

    Karn, Robert C; Chung, Amanda G; Laukaitis, Christina M

    2014-01-01

    The Androgen-binding protein (Abp) region of the mouse genome contains 30 Abpa genes encoding alpha subunits and 34 Abpbg genes encoding betagamma subunits, their products forming dimers composed of an alpha and a betagamma subunit. We endeavored to determine how many Abp genes are expressed as proteins in tears and saliva, and as transcripts in the exocrine glands producing them. Using standard PCR, we amplified Abp transcripts from cDNA libraries of C57BL/6 mice and found fifteen Abp gene transcripts in the lacrimal gland and five in the submandibular gland. Proteomic analyses identified proteins corresponding to eleven of the lacrimal gland transcripts, all of them different from the three salivary ABPs reported previously. Our qPCR results showed that five of the six transcripts that lacked corresponding proteins are expressed at very low levels compared to those transcripts with proteins. We found 1) no overlap in the repertoires of expressed Abp paralogs in lacrimal gland/tears and salivary glands/saliva; 2) substantial sex-limited expression of lacrimal gland/tear expressed-paralogs in males but no sex-limited expression in females; and 3) that the lacrimal gland/tear expressed-paralogs are found exclusively in ancestral clades 1, 2 and 3 of the five clades described previously while the salivary glands/saliva expressed-paralogs are found only in clade 5. The number of instances of extremely low levels of transcription without corresponding protein production in paralogs specific to tears and saliva suggested the role of subfunctionalization, a derived condition wherein genes that may have been expressed highly in both glands ancestrally were down-regulated subsequent to duplication. Thus, evidence for subfunctionalization can be seen in our data and we argue that the partitioning of paralog expression between lacrimal and salivary glands that we report here occurred as the result of adaptive evolution.

  13. Did androgen-binding protein paralogs undergo neo- and/or Subfunctionalization as the Abp gene region expanded in the mouse genome?

    Directory of Open Access Journals (Sweden)

    Robert C Karn

    Full Text Available The Androgen-binding protein (Abp region of the mouse genome contains 30 Abpa genes encoding alpha subunits and 34 Abpbg genes encoding betagamma subunits, their products forming dimers composed of an alpha and a betagamma subunit. We endeavored to determine how many Abp genes are expressed as proteins in tears and saliva, and as transcripts in the exocrine glands producing them. Using standard PCR, we amplified Abp transcripts from cDNA libraries of C57BL/6 mice and found fifteen Abp gene transcripts in the lacrimal gland and five in the submandibular gland. Proteomic analyses identified proteins corresponding to eleven of the lacrimal gland transcripts, all of them different from the three salivary ABPs reported previously. Our qPCR results showed that five of the six transcripts that lacked corresponding proteins are expressed at very low levels compared to those transcripts with proteins. We found 1 no overlap in the repertoires of expressed Abp paralogs in lacrimal gland/tears and salivary glands/saliva; 2 substantial sex-limited expression of lacrimal gland/tear expressed-paralogs in males but no sex-limited expression in females; and 3 that the lacrimal gland/tear expressed-paralogs are found exclusively in ancestral clades 1, 2 and 3 of the five clades described previously while the salivary glands/saliva expressed-paralogs are found only in clade 5. The number of instances of extremely low levels of transcription without corresponding protein production in paralogs specific to tears and saliva suggested the role of subfunctionalization, a derived condition wherein genes that may have been expressed highly in both glands ancestrally were down-regulated subsequent to duplication. Thus, evidence for subfunctionalization can be seen in our data and we argue that the partitioning of paralog expression between lacrimal and salivary glands that we report here occurred as the result of adaptive evolution.

  14. Identification and characterization of a highly variable region in mitochondrial genomes of fusarium species and analysis of power generation from microbial fuel cells

    Science.gov (United States)

    Hamzah, Haider Mousa

    In the microbial fuel cell (MFC) project, power generation from Shewanella oneidensis MR-1 was analyzed looking for a novel system for both energy generation and sustainability. The results suggest the possibility of generating electricity from different organic substances, which include agricultural and industrial by-products. Shewanella oneidensis MR-1 generates usable electrons at 30°C using both submerged and solid state cultures. In the MFC biocathode experiment, most of the CO2 generated at the anodic chamber was converted into bicarbonate due the activity of carbonic anhydrase (CA) of the Gluconobacter sp.33 strain. These findings demonstrate the possibility of generation of electricity while at the same time allowing the biomimetic sequestration of CO2 using bacterial CA. In the mitochondrial genomes project, the filamentous fungal species Fusarium oxysporum was used as a model. This species causes wilt of several important agricultural crops. A previous study revealed that a highly variable region (HVR) in the mitochondrial DNA (mtDNA) of three species of Fusarium contained a large, variable unidentified open reading frame (LV-uORF). Using specific primers for two regions of the LV-uORF, six strains were found to contain the ORF by PCR and database searches identified 18 other strains outside of the Fusarium oxysporum species complex. The LV-uORF was also identified in three isolates of the F. oxysporum species complex. Interestingly, several F. oxysporum isolates lack the LV-uORF and instead contain 13 ORFs in the HVR, nine of which are unidentified. The high GC content and codon usage of the LV-uORF indicate that it did not co-evolve with other mt genes and was horizontally acquired and was introduced to the Fusarium lineage prior to speciation. The nonsynonymous/synonymous (dN/dS) ratio of the LV-uORFs (0.43) suggests it is under purifying selection and the putative polypeptide is predicted to be located in the mitochondrial membrane. Growth assays

  15. Draft Genome Sequence of Acinetobacter sp. Strain BMW17, a Cellulolytic and Plant Growth-Promoting Bacterium Isolated from the Rhizospheric Region of Phragmites karka of Chilika Lake, India.

    Science.gov (United States)

    Mishra, Samir R; Ray, Lopamudra; Panda, Ananta Narayan; Sahu, Neha; Xess, Sonal S; Jadhao, Sudhir; Suar, Mrutyunjay; Adhya, Tapan Kumar; Rastogi, Gurdeep; Pattnaik, Ajit Kumar; Raina, Vishakha

    2016-06-30

    We report the 3.16 Mb draft genome of Acinetobacter sp. strain BMW17, a Gram-negative bacterium in the class of Gammaproteobacteria, isolated from the rhizospheric region of Phragmites karka, an invasive weed in Chilika Lake, Odisha, India. The strain BMW17(T) is capable of degrading cellulose and is also an efficient plant growth promoter that can be useful for various phytoremedial and commercial applications.

  16. Genomic Database Searching.

    Science.gov (United States)

    Hutchins, James R A

    2017-01-01

    The availability of reference genome sequences for virtually all species under active research has revolutionized biology. Analyses of genomic variations in many organisms have provided insights into phenotypic traits, evolution and disease, and are transforming medicine. All genomic data from publicly funded projects are freely available in Internet-based databases, for download or searching via genome browsers such as Ensembl, Vega, NCBI's Map Viewer, and the UCSC Genome Browser. These online tools generate interactive graphical outputs of relevant chromosomal regions, showing genes, transcripts, and other genomic landmarks, and epigenetic features mapped by projects such as ENCODE.This chapter provides a broad overview of the major genomic databases and browsers, and describes various approaches and the latest resources for searching them. Methods are provided for identifying genomic locus and sequence information using gene names or codes, identifiers for DNA and RNA molecules and proteins; also from karyotype bands, chromosomal coordinates, sequences, motifs, and matrix-based patterns. Approaches are also described for batch retrieval of genomic information, performing more complex queries, and analyzing larger sets of experimental data, for example from next-generation sequencing projects.

  17. Recurring genomic breaks in independent lineages support genomic fragility

    Directory of Open Access Journals (Sweden)

    Hannenhalli Sridhar

    2006-11-01

    Full Text Available Abstract Background Recent findings indicate that evolutionary breaks in the genome are not randomly distributed, and that certain regions, so-called fragile regions, are predisposed to breakages. Previous approaches to the study of genomic fragility have examined the distribution of breaks, as well as the coincidence of breaks with segmental duplications and repeats, within a single species. In contrast, we investigate whether this regional fragility is an inherent genomic characteristic and is thus conserved over multiple independent lineages. Results We do this by quantifying the extent to which certain genomic regions are disrupted repeatedly in independent lineages. Our investigation, based on Human, Chimp, Mouse, Rat, Dog and Chicken, suggests that the propensity of a chromosomal region to break is significantly correlated among independent lineages, even when covariates are considered. Furthermore, the fragile regions are enriched for segmental duplications. Conclusion Based on a novel methodology, our work provides additional support for the existence of fragile regions.

  18. Sequence analysis for the complete proviral genome of subgroup J Avian Leukosis virus associated with hemangioma: a special 11 bp deletion was observed in U3 region of 3'UTR

    Directory of Open Access Journals (Sweden)

    Zou Nianli

    2011-04-01

    Full Text Available Abstract Background Avian Leukosis virus (ALV of subgroup J (ALV-J belong to retroviruses, which could induce tumors in domestic and wild birds. Myelocytomatosis was the most common neoplasma observed in infected flocks; however, few cases of hemangioma caused by ALV-J were reported in recent year. Results An ALV-J strain SCDY1 associated with hemangioma was isolated and its proviral genomic sequences were determined. The full proviral sequence of SCDY1 was 7489 nt long. Homology analysis of the env, pol and gag gene between SCDY1 and other strains in GenBank were 90.3-94.2%, 96.6-97.6%, and 94.3-96.5% at nucleotide level, respectively; while 85.1-90.7%, 97.4-98.7%, and 96.2-98.4% at amino acid level, respectively. Alignment analysis of the genomic sequence of ALV-J strains by using HPRS-103 as reference showed that a special 11 bp deletion was observed in U3 region of 3'UTR of SCDY1 and another ALV-J strain NHH isolated from case of hemangioma, and the non-functional TM and E element were absent in the genome of SCDY1, but the transcriptional regulatory elements including C/EBP, E2BP, NFAP-1, CArG box and Y box were highly conserved. Phylogenetic analysis revealed that all analyzed ALV-J strains could be separated into four groups, and SCDY1 as well as another strain NHH were included in the same cluster. Conclusion The variation in envelope glycoprotein was higher than other genes. The genome sequence of SCDY1 has a close relationship with that of another ALV-J strain NHH isolated from case of hemangioma. A 11 bp deletion observed in U3 region of 3'UTR of genome of ALV-J isolated from case of hemangioma is interesting, which may be associated with the occurrence of hemangioma.

  19. A gene-based high-resolution comparative radiation hybrid map as a framework for genome sequence assembly of a bovine chromosome 6 region associated with QTL for growth, body composition, and milk performance traits

    Directory of Open Access Journals (Sweden)

    Laurent Pascal

    2006-03-01

    Full Text Available Abstract Background A number of different quantitative trait loci (QTL for various phenotypic traits, including milk production, functional, and conformation traits in dairy cattle as well as growth and body composition traits in meat cattle, have been mapped consistently in the middle region of bovine chromosome 6 (BTA6. Dense genetic and physical maps and, ultimately, a fully annotated genome sequence as well as their mutual connections are required to efficiently identify genes and gene variants responsible for genetic variation of phenotypic traits. A comprehensive high-resolution gene-rich map linking densely spaced bovine markers and genes to the annotated human genome sequence is required as a framework to facilitate this approach for the region on BTA6 carrying the QTL. Results Therefore, we constructed a high-resolution radiation hybrid (RH map for the QTL containing chromosomal region of BTA6. This new RH map with a total of 234 loci including 115 genes and ESTs displays a substantial increase in loci density compared to existing physical BTA6 maps. Screening the available bovine genome sequence resources, a total of 73 loci could be assigned to sequence contigs, which were already identified as specific for BTA6. For 43 loci, corresponding sequence contigs, which were not yet placed on the bovine genome assembly, were identified. In addition, the improved potential of this high-resolution RH map for BTA6 with respect to comparative mapping was demonstrated. Mapping a large number of genes on BTA6 and cross-referencing them with map locations in corresponding syntenic multi-species chromosome segments (human, mouse, rat, dog, chicken achieved a refined accurate alignment of conserved segments and evolutionary breakpoints across the species included. Conclusion The gene-anchored high-resolution RH map (1 locus/300 kb for the targeted region of BTA6 presented here will provide a valuable platform to guide high-quality assembling and

  20. Unique and conserved genome regions in Vibrio harveyi and related species in comparison with the shrimp pathogen Vibrio harveyi CAIM 1792

    DEFF Research Database (Denmark)

    Valles, Iliana Espinoza; Vora, Gary J; Lin, Baochuan

    2015-01-01

    . The proteome of CAIM 1792 had higher similarity to those of other V. harveyi strains (78 %) than to those of the other closely related species Vibrio owensii (67 %), Vibrio rotiferianus (63 %) and Vibrio campbellii (59 %). Pan-genome ORFans trees showed the best fit with the accepted phylogeny based on DNA......-DNA hybridization and multi-locus sequence analysis of 11 concatenated housekeeping genes. SNP analysis clustered 34/38 genomes within their accepted species. The pangenomic and SNP trees showed that V. harveyi is the most conserved of the four species studied and V. campbellii may be divided into at least three...

  1. 鹿、牛基因组不同区域的微小变异研究%Micro-Variation in Different Regions between Deer and Cattle Genome Sequences

    Institute of Scientific and Technical Information of China (English)

    巴恒星; 杨福合; 李春义

    2012-01-01

    本研究借助鹿和牛基因组序列,通过同源序列比较的方法研究远缘物种基因组不同区域的微小变异,包括单碱基突变、小片段插入和删除.研究结果验证了基因组功能区的点突变和删除变异相对非功能区是保守的普遍性结论.点突变变异与删除变异在鹿、牛基因组不同区域上表现强的正相关性.比较近缘物种人、黑猩猩基因组点突变变异数据,表明牛、鹿基因组的功能区和非功能区突变速率各自保持大致恒定,符合分子钟理论.%In this study, sika deer genome sequence was used to compare with the cattle reference genome using the methods of homology comparison, which was used to identify the Micro-Variation in different regions between distant species genomes in the past. These Micro-Variations consisted of point mutation variations (SNP) and small nucleotide insertion and deletion variations (Indel). The results verified the general conclusion that the variations of SNP and Indel in the functional regions are more conservative than in the non-functional regions. Also, the results showed a strong positive correlation between the SNP variations and Indel variations among different regions. The variation data between human and chimpanzee are compared with those of this study. The results verified that the mutation rate has kept in consistent in the genome functional regions and non-functional regions respectively. The results validated once again the theory of molecular clock.

  2. Genomic relationships of Actinobacillus pleuropneumoniae serotype 2 strains evaluated by ribotyping, sequence analysis of ribosomal intergenic regions, and pulsed-field gel electrophoresis

    DEFF Research Database (Denmark)

    Fussing, V.

    1998-01-01

    The aim of the present study was to examine the genomic relationship among 112 Actinobacillus pleuropneumoniae serotype 2 strains obtained throughout Europe and North America. HindIII ribotyping of the strains resulted in five ribotypes of high similarity (87-98%). Sequence analysis of the riboso...

  3. Cancer genomics

    DEFF Research Database (Denmark)

    Norrild, Bodil; Guldberg, Per; Ralfkiær, Elisabeth Methner

    2007-01-01

    Almost all cells in the human body contain a complete copy of the genome with an estimated number of 25,000 genes. The sequences of these genes make up about three percent of the genome and comprise the inherited set of genetic information. The genome also contains information that determines whe...

  4. Genome-wide meta-analysis identifies regions on 7p21 (AHR and 15q24 (CYP1A2 as determinants of habitual caffeine consumption.

    Directory of Open Access Journals (Sweden)

    Marilyn C Cornelis

    2011-04-01

    Full Text Available We report the first genome-wide association study of habitual caffeine intake. We included 47,341 individuals of European descent based on five population-based studies within the United States. In a meta-analysis adjusted for age, sex, smoking, and eigenvectors of population variation, two loci achieved genome-wide significance: 7p21 (P = 2.4 × 10(-19, near AHR, and 15q24 (P = 5.2 × 10(-14, between CYP1A1 and CYP1A2. Both the AHR and CYP1A2 genes are biologically plausible candidates as CYP1A2 metabolizes caffeine and AHR regulates CYP1A2.

  5. Genome-wide blood DNA methylation alterations at regulatory elements and heterochromatic regions in monozygotic twins discordant for obesity and liver fat

    OpenAIRE

    Ollikainen, M; Ismail, K.; Gervin, Kristina; Harris, Jennifer; Lyle, Robert

    2015-01-01

    Background: The current epidemic of obesity and associated diseases calls for swift actions to better understand the mechanisms by which genetics and environmental factors affect metabolic health in humans. Monozygotic (MZ) twin pairs showing discordance for obesity suggest that epigenetic influences represent one such mechanism. We studied genome-wide leukocyte DNA methylation variation in 30 clinically healthy young adult MZ twin pairs discordant for body mass index (BMI; average within-pai...

  6. The 5′ Untranslated Region and Gag product of Idefix, a Long Terminal Repeat-Retrotransposon from Drosophila melanogaster, Act Together To Initiate a Switch between Translated and Untranslated States of the Genomic mRNA

    OpenAIRE

    Meignin, Carine; Bailly, Jean-Luc; Arnaud, Frédérick; Dastugue, Bernard; Vaury, Chantal

    2003-01-01

    Idefix is a long terminal repeat (LTR)-retrotransposon present in Drosophila melanogaster which shares similarities with vertebrates retroviruses both in its genomic arrangement and in the mechanism of transposition. Like in retroviruses, its two LTRs flank a long 5′ untranslated region (5′UTR) and three open reading frames referred to as the gag, pol, and env genes. Here we report that its 5′UTR, located upstream of the gag gene, can fold into highly structured domains that are known to be i...

  7. Yeast genome sequencing:

    DEFF Research Database (Denmark)

    Piskur, Jure; Langkjær, Rikke Breinhold

    2004-01-01

    For decades, unicellular yeasts have been general models to help understand the eukaryotic cell and also our own biology. Recently, over a dozen yeast genomes have been sequenced, providing the basis to resolve several complex biological questions. Analysis of the novel sequence data has shown...... of closely related species helps in gene annotation and to answer how many genes there really are within the genomes. Analysis of non-coding regions among closely related species has provided an example of how to determine novel gene regulatory sequences, which were previously difficult to analyse because...... they are short and degenerate and occupy different positions. Comparative genomics helps to understand the origin of yeasts and points out crucial molecular events in yeast evolutionary history, such as whole-genome duplication and horizontal gene transfer(s). In addition, the accumulating sequence data provide...

  8. Sequencing intractable DNA to close microbial genomes.

    Directory of Open Access Journals (Sweden)

    Richard A Hurt

    Full Text Available Advancement in high throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult to sequence regions in microbial genomes are ruled "intractable" resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such problematic regions in the "non-contiguous finished" Desulfovibrio desulfuricans ND132 genome (6 intractable gaps and the Desulfovibrio africanus genome (1 intractable gap. The polynucleotides surrounding each gap formed GC rich secondary structures making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. The developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.

  9. Sequencing intractable DNA to close microbial genomes.

    Science.gov (United States)

    Hurt, Richard A; Brown, Steven D; Podar, Mircea; Palumbo, Anthony V; Elias, Dwayne A

    2012-01-01

    Advancement in high throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult to sequence regions in microbial genomes are ruled "intractable" resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such problematic regions in the "non-contiguous finished" Desulfovibrio desulfuricans ND132 genome (6 intractable gaps) and the Desulfovibrio africanus genome (1 intractable gap). The polynucleotides surrounding each gap formed GC rich secondary structures making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. The developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.

  10. Molecular characterization and phylogeny of Shiga toxin-producing Escherichia coli isolates obtained from two Dutch regions using whole genome sequencing.

    Science.gov (United States)

    Ferdous, M; Friedrich, A W; Grundmann, H; de Boer, R F; Croughs, P D; Islam, M A; Kluytmans-van den Bergh, M F Q; Kooistra-Smid, A M D; Rossen, J W A

    2016-07-01

    Shiga toxin-producing Escherichia coli (STEC) is one of the major causes of human gastrointestinal disease and has been implicated in sporadic cases and outbreaks of diarrhoea, haemorrhagic colitis and haemolytic uremic syndrome worldwide. In this study, we determined the molecular characteristics and phylogenetic relationship of STEC isolates, and their genetic diversity was compared to that of other E. coli populations. Whole genome sequencing was performed on 132 clinical STEC isolates obtained from the faeces of 129 Dutch patients with gastrointestinal complaints. STEC isolates of this study belonged to 44 different sequence types (STs), 42 serogenotypes and 14 stx subtype combinations. Antibiotic resistance genes were more frequently present in stx1-positive isolates compared to stx2 and stx1 + stx2-positive isolates. The iha, mchB, mchC, mchF, subA, ireA, senB, saa and sigA genes were significantly more frequently present in eae-negative than in eae-positive STEC isolates. Presence of virulence genes encoding type III secretion proteins and adhesins was associated with isolates obtained from patients with bloody diarrhoea. Core genome phylogenetic analysis showed that isolates clustered according to their ST or serogenotypes irrespective of stx subtypes. Isolates obtained from patients with bloody diarrhoea were from diverse phylogenetic backgrounds. Some STEC isolates shared common ancestors with non-STEC isolates. Whole genome sequencing is a powerful tool for clinical microbiology, allowing high-resolution molecular typing, population structure analysis and detailed molecular characterization of strains. STEC isolates of a substantial genetic diversity and of distinct phylogenetic groups were observed in this study.

  11. Genome-Wide Analysis in Swine Associates Corneal Graft Rejection with Donor-Recipient Mismatches in Three Novel Histocompatibility Regions and One Locus Homologous to the Mouse H-3 Locus.

    Directory of Open Access Journals (Sweden)

    Susan Nicholls

    Full Text Available In rodents, immune responses to minor histocompatibility antigens are the most important drivers of corneal graft rejection. However, this has not been confirmed in humans or in a large animal model and the genetic loci are poorly characterised, even in mice. The gene sequence data now available for a range of relevant species permits the use of genome-wide association (GWA techniques to identify minor antigens associated with transplant rejection. We have used this technique in a pre-clinical model of corneal transplantation in semi-inbred NIH minipigs and Babraham swine to search for novel minor histocompatibility loci and to determine whether rodent findings have wider applicability. DNA from a cohort of MHC-matched and MHC-mismatched donors and recipients was analysed for single nucleotide polymorphisms (SNPs. The level of SNP homozygosity for each line was assessed. Genome-wide analysis of the association of SNP disparities with rejection was performed using log-likelihood ratios. Four genomic blocks containing four or more SNPs significantly linked to rejection were identified (on chromosomes 1, 4, 6 and 9, none at the location of the MHC. One block of 36 SNPs spanned a region that exhibits conservation of synteny with the mouse H-3 histocompatibility locus and contains the pig homologue of the mouse Zfp106 gene, which encodes peptide epitopes known to mediate corneal graft rejection. The other three regions are novel minor histocompatibility loci. The results suggest that rejection can be predicted from SNP analysis prior to transplant in this model and that a similar GWA analysis is merited in humans.

  12. Progression from Sustained BK Viruria to Sustained BK Viremia with Immunosuppression Reduction Is Not Associated with Changes in the Noncoding Control Region of the BK Virus Genome

    Directory of Open Access Journals (Sweden)

    Imran A. Memon

    2012-01-01

    We performed PCR amplification and sequencing of (1 stored urine and (2 plasma samples from the time of peak viremia from 11 patients with sustained viremia who participated in a 200-patient clinical trial. The antimetabolite was withdrawn for BK viremia and reduction of the calcineurin inhibitor for sustained BK viremia. DNA sequencing from the 11 patients with sustained viremia revealed 8 insertions, 16 transversions, 3 deletions, and 17 transitions. None were deemed significant. No patient developed clinically evident BKVAN. Our data support, at a genomic level, the effectiveness of reduction of immunosuppression for prevention of progression from viremia to BKVAN.

  13. A bias-reducing pathway enrichment analysis of genome-wide association data confirmed association of the MHC region with schizophrenia.

    LENUS (Irish Health Repository)

    Jia, Peilin

    2012-02-01

    After the recent successes of genome-wide association studies (GWAS), one key challenge is to identify genetic variants that might have a significant joint effect on complex diseases but have failed to be identified individually due to weak to moderate marginal effect. One popular and effective approach is gene set based analysis, which investigates the joint effect of multiple functionally related genes (eg, pathways). However, a typical gene set analysis method is biased towards long genes, a problem that is especially severe in psychiatric diseases.

  14. cDNA Cloning of c33-c Antigen Gene Derived From NS3 Region of Chinese HCV Genome, Expression in Escherichia coli and Development of HCV EIA Second-Generation Diagnostic Kit

    Institute of Scientific and Technical Information of China (English)

    杨永平; 刘崇柏; 金冬雁; 詹美云; 汤权; 夏宁邵; 曹经媛; 李景源< Author> YANG Yong-Ping LIU Chong-Bai JIN Dong-YanZHAN Mei-Yun TANG Quan; XIA Ning-ShaoCAO Jing-Yuan and LI Jing-Yuan(Institute of Virology; Chinese Academy of Preventive Medicine; Beijing 100052; PRC)

    1994-01-01

    A cDNA fragment of about 860 bp corresponding to the c33-c gene in the non-structural region 3 (NS3) of HCV genome was obtained from one plasma derived from a Chinese HCV carrier who came from Tai’an of Shandong Province, China by the application of reverse transcription (RT) and polymerase chain reaction (PCR) techniques. After the sequence of the cDNA fragment was determined and compared with the equivalent region of. the HCV-I (HCV-US) and HCV-II (HCV-BK) genomes, the nucleotide/ amino acid sequence homologies were found to be 79. 2%/91. 3% and 91. 3%/93. 9%, respectively. The prokaryotic expression vector pBV220 was employed for the overproduction of c33-c native recombinant protein in E. coli cells. The expression products were detected by enzyme-linked immunosorbent assay (ELISA) and Western blotting with antisera of chronic hepatitis C patients, and a molecular weight 31 kD of c33-c viral protein was shown to account for 14% of the total cellular soluble proteins. This product was extracted from

  15. Specific binding of Fusarium graminearum Hex1 protein to untranslated regions of the genomic RNA of Fusarium graminearum virus 1 correlates with increased accumulation of both strands of viral RNA.

    Science.gov (United States)

    Son, Moonil; Choi, Hoseong; Kim, Kook-Hyung

    2016-02-01

    The HEX1 gene of Fusarium graminearum was previously reported to be required for the efficient accumulation of Fusarium graminearum virus 1 (FgV1) RNA in its host. To investigate the molecular mechanism underlying the production of FgHEX1 and the replication of FgV1 viral RNA, we conducted electrophoretic mobility shift assays (EMSA) with recombinant FgHex1 protein and RNA sequences derived from various regions of FgV1 genomic RNA. These analyses demonstrated that FgHex1 and both the 5'- and 3'-untranslated regions of plus-strand FgV1 RNA formed complexes. To determine whether FgHex1 protein affects FgV1 replication, we quantified accumulation viral RNAs in protoplasts and showed that both (+)- and (-)-strands of FgV1 RNAs were increased in the over-expression mutant and decreased in the deletion mutant. These results indicate that the FgHex1 functions in the synthesis of both strands of FgV1 RNA and therefore in FgV1 replication probably by specifically binding to the FgV1 genomic RNA.

  16. Comparative genome research between maize and rice using genomic in situ hybridization

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    Using the genomic DNAs of maize and rice as probes respectively,the homology of maize and rice genomes was assessed by genomic in situ hybridization. When rice genomic DNAs were hybridized to maize, all chromosomes displayed many multiple discrete regions, while each rice chromosome delineated a single consecutive chromosomal region after they were hybridized with maize genomic DNAs. The results indicate that the genomes of maize and rice share high homology, and confirm the proposal that maize and rice are diverged from a common ancestor.

  17. Organizational heterogeneity of vertebrate genomes.

    Directory of Open Access Journals (Sweden)

    Svetlana Frenkel

    Full Text Available Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.

  18. A Parthenogenesis Gene Candidate and Evidence for Segmental Allopolyploidy in Apomictic Brachiaria decumbens.

    Science.gov (United States)

    Worthington, Margaret; Heffelfinger, Christopher; Bernal, Diana; Quintero, Constanza; Zapata, Yeny Patricia; Perez, Juan Guillermo; De Vega, Jose; Miles, John; Dellaporta, Stephen; Tohme, Joe

    2016-07-01

    Apomixis, asexual reproduction through seed, enables breeders to identify and faithfully propagate superior heterozygous genotypes by seed without the disadvantages of vegetative propagation or the expense and complexity of hybrid seed production. The availability of new tools such as genotyping by sequencing and bioinformatics pipelines for species lacking reference genomes now makes the construction of dense maps possible in apomictic species, despite complications including polyploidy, multisomic inheritance, self-incompatibility, and high levels of heterozygosity. In this study, we developed saturated linkage maps for the maternal and paternal genomes of an interspecific Brachiaria ruziziensis (R. Germ. and C. M. Evrard) × B. decumbens Stapf. F1 mapping population in order to identify markers linked to apomixis. High-resolution molecular karyotyping and comparative genomics with Setaria italica (L.) P. Beauv provided conclusive evidence for segmental allopolyploidy in B. decumbens, with strong preferential pairing of homologs across the genome and multisomic segregation relatively more common in chromosome 8. The apospory-specific genomic region (ASGR) was mapped to a region of reduced recombination on B. decumbens chromosome 5. The Pennisetum squamulatum (L.) R.Br. PsASGR-BABY BOOM-like (psASGR-BBML)-specific primer pair p779/p780 was in perfect linkage with the ASGR in the F1 mapping population and diagnostic for reproductive mode in a diversity panel of known sexual and apomict Brachiaria (Trin.) Griseb. and P. maximum Jacq. germplasm accessions and cultivars. These findings indicate that ASGR-BBML gene sequences are highly conserved across the Paniceae and add further support for the postulation of the ASGR-BBML as candidate genes for the apomictic function of parthenogenesis.

  19. Domestication genomics: evidence from animals.

    Science.gov (United States)

    Wang, Guo-Dong; Xie, Hai-Bing; Peng, Min-Sheng; Irwin, David; Zhang, Ya-Ping

    2014-02-01

    Animal domestication has far-reaching significance for human society. The sequenced genomes of domesticated animals provide critical resources for understanding the genetic basis of domestication. Various genomic analyses have shed a new light on the mechanism of artificial selection and have allowed the mapping of genes involved in important domestication traits. Here, we summarize the published genomes of domesticated animals that have been generated over the past decade, as well as their origins, from a phylogenomic point of view. This review provides a general description of the genomic features encountered under a two-stage domestication process. We also introduce recent findings for domestication traits based on results from genome-wide association studies and selective-sweep scans for artificially selected genomic regions. Particular attention is paid to issues relating to the costs of domestication and the convergent evolution of genes between domesticated animals and humans.

  20. A physical map of the human genome

    Energy Technology Data Exchange (ETDEWEB)

    McPherson, J.D.; Marra, M.; Hillier, L.; Waterston, R.H.; Chinwalla, A.; Wallis, J.; Sekhon, M.; Wylie, K.; Mardis, E.R.; Wilson, R.K.; Fulton, R.; Kucaba, T.A.; Wagner-McPherson, C.; Barbazuk, W.B.; Gregory, S.G.; Humphray, S.J.; French, L.; Evans, R.S.; Bethel, G.; Whittaker, A.; Holden, J.L.; McCann, O.T.; Dunham, A.; Soderlund, C.; Scott, C.E.; Bentley, D.R.; Schuler, G.; Chen, H.-C.; Jang, W.; Green, E.D.; Idol, J.R.; Maduro, V.V. Braden; Montgomery, K.T.; Lee, E.; Miller, A.; Emerling, S.; Kucherlapati; Gibbs, R.; Scherer, S.; Gorrell, J.H.; Sodergren, E.; Clerc-Blankenburg, K.; Tabor, P.; Naylor, S.; Garcia, D.; de Jong, P.J.; Catanese, J.J.; Nowak, N.; Osoegawa, K.; Qin, S.; Rowen, L.; Madan, A.; Dors, M.; Hood, L.; Trask, B.; Friedman, C.; Massa, H.; Cheung, V.G.; Kirsch, I.R.; Reid, T.; Yonescu, R.; Weissenbach, J.; Bruls, T.; Heilig, R.; Branscomb, E.; Olsen, A.; Doggett, N.; Cheng, J.F.; Hawkins, T.; Myers, R.M.; Shang, J.; Ramirez, L.; Schmutz, J.; Velasquez, O.; Dixon, K.; Stone, N.E.; Cox, D.R.; Haussler, D.; Kent, W.J.; Furey, T.; Rogic, S.; Kennedy, S.; Jones, S.; Rosenthal, A.; Wen, G.; Schilhabel, M.; Gloeckner, G.; Nyakatura, G.; Siebert, R.; Schlegelberger, B.; Korenberg, J.; Chen, X.N.; Fujiyama, A.; Hattori, M.; Toyoda, A.; Yada, T.; Park, H.S.; Sakaki, Y.; Shimizu, N.; Asakawa, S.; Kawasaki, K.; Sasaki, T.; Shintani, A.; Shimizu, A.; Shibuya, K.; Kudoh, J.; Minoshima, S.; Ramser, J.; Seranski, P.; Hoff, C.; Poustka, A.; Reinhardt, R.; Lehrach, H.

    2001-01-01

    The human genome is by far the largest genome to be sequenced, and its size and complexity present many challenges for sequence assembly. The International Human Genome Sequencing Consortium constructed a map of the whole genome to enable the selection of clones for sequencing and for the accurate assembly of the genome sequence. Here we report the construction of the whole-genome bacterial artificial chromosome (BAC) map and its integration with previous landmark maps and information from mapping efforts focused on specific chromosomal regions. We also describe the integration of sequence data with the map.

  1. PredictSNP2: A Unified Platform for Accurately Evaluating SNP Effects by Exploiting the Different Characteristics of Variants in Distinct Genomic Regions.

    Science.gov (United States)

    Bendl, Jaroslav; Musil, Miloš; Štourač, Jan; Zendulka, Jaroslav; Damborský, Jiří; Brezovský, Jan

    2016-05-01

    An important message taken from human genome sequencing projects is that the human population exhibits approximately 99.9% genetic similarity. Variations in the remaining parts of the genome determine our identity, trace our history and reveal our heritage. The precise delineation of phenotypically causal variants plays a key role in providing accurate personalized diagnosis, prognosis, and treatment of inherited diseases. Several computational methods for achieving such delineation have been reported recently. However, their ability to pinpoint potentially deleterious variants is limited by the fact that their mechanisms of prediction do not account for the existence of different categories of variants. Consequently, their output is biased towards the variant categories that are most strongly represented in the variant databases. Moreover, most such methods provide numeric scores but not binary predictions of the deleteriousness of variants or confidence scores that would be more easily understood by users. We have constructed three datasets covering different types of disease-related variants, which were divided across five categories: (i) regulatory, (ii) splicing, (iii) missense, (iv) synonymous, and (v) nonsense variants. These datasets were used to develop category-optimal decision thresholds and to evaluate six tools for variant prioritization: CADD, DANN, FATHMM, FitCons, FunSeq2 and GWAVA. This evaluation revealed some important advantages of the category-based approach. The results obtained with the five best-performing tools were then combined into a consensus score. Additional comparative analyses showed that in the case of missense variations, protein-based predictors perform better than DNA sequence-based predictors. A user-friendly web interface was developed that provides easy access to the five tools' predictions, and their consensus scores, in a user-understandable format tailored to the specific features of different categories of variations. To

  2. An original SERPINA3 gene cluster: Elucidation of genomic organization and gene expression in the Bos taurus 21q24 region

    Directory of Open Access Journals (Sweden)

    Ouali Ahmed

    2008-04-01

    Full Text Available Abstract Background The superfamily of serine proteinase inhibitors (serpins is involved in numerous fundamental biological processes as inflammation, blood coagulation and apoptosis. Our interest is focused on the SERPINA3 sub-family. The major human plasma protease inhibitor, α1-antichymotrypsin, encoded by the SERPINA3 gene, is homologous to genes organized in clusters in several mammalian species. However, although there is a similar genic organization with a high degree of sequence conservation, the reactive-centre-loop domains, which are responsible for the protease specificity, show significant divergences. Results We provide additional information by analyzing the situation of SERPINA3 in the bovine genome. A cluster of eight genes and one pseudogene sharing a high degree of identity and the same structural organization was characterized. Bovine SERPINA3 genes were localized by radiation hybrid mapping on 21q24 and only spanned over 235 Kilobases. For all these genes, we propose a new nomenclature from SERPINA3-1 to SERPINA3-8. They share approximately 70% of identity with the human SERPINA3 homologue. In the cluster, we described an original sub-group of six members with an unexpected high degree of conservation for the reactive-centre-loop domain, suggesting a similar peptidase inhibitory pattern. Preliminary expression analyses of these bovSERPINA3s showed different tissue-specific patterns and diverse states of glycosylation and phosphorylation. Finally, in the context of phylogenetic analyses, we improved our knowledge on mammalian SERPINAs evolution. Conclusion Our experimental results update data of the bovine genome sequencing, substantially increase the bovSERPINA3 sub-family and enrich the phylogenetic tree of serpins. We provide new opportunities for future investigations to approach the biological functions of this unusual subset of serine proteinase inhibitors.

  3. Genome-wide association study identifies two novel regions at 11p15.5-p13 and 1p31 with major impact on acute-phase serum amyloid A.

    Directory of Open Access Journals (Sweden)

    Carola Marzi

    2010-11-01

    Full Text Available Elevated levels of acute-phase serum amyloid A (A-SAA cause amyloidosis and are a risk factor for atherosclerosis and its clinical complications, type 2 diabetes, as well as various malignancies. To investigate the genetic basis of A-SAA levels, we conducted the first genome-wide association study on baseline A-SAA concentrations in three population-based studies (KORA, TwinsUK, Sorbs and one prospective case cohort study (LURIC, including a total of 4,212 participants of European descent, and identified two novel genetic susceptibility regions at 11p15.5-p13 and 1p31. The region at 11p15.5-p13 (rs4150642; p = 3.20×10(-111 contains serum amyloid A1 (SAA1 and the adjacent general transcription factor 2 H1 (GTF2H1, Hermansky-Pudlak Syndrome 5 (HPS5, lactate dehydrogenase A (LDHA, and lactate dehydrogenase C (LDHC. This region explains 10.84% of the total variation of A-SAA levels in our data, which makes up 18.37% of the total estimated heritability. The second region encloses the leptin receptor (LEPR gene at 1p31 (rs12753193; p = 1.22×10(-11 and has been found to be associated with CRP and fibrinogen in previous studies. Our findings demonstrate a key role of the 11p15.5-p13 region in the regulation of baseline A-SAA levels and provide confirmative evidence of the importance of the 1p31 region for inflammatory processes and the close interplay between A-SAA, leptin, and other acute-phase proteins.

  4. Domestication and plant genomes.

    Science.gov (United States)

    Tang, Haibao; Sezen, Uzay; Paterson, Andrew H

    2010-04-01

    The techniques of plant improvement have been evolving with the advancement of technology, progressing from crop domestication by Neolithic humans to scientific plant breeding, and now including DNA-based genotyping and genetic engineering. Archeological findings have shown that early human ancestors often unintentionally selected for and finally fixed a few major domestication traits over time. Recent advancement of molecular and genomic tools has enabled scientists to pinpoint changes to specific chromosomal regions and genetic loci that are responsible for dramatic morphological and other transitions that distinguish crops from their wild progenitors. Extensive studies in a multitude of additional crop species, facilitated by rapid progress in sequencing and resequencing(s) of crop genomes, will further our understanding of the genomic impact from both the unusual population history of cultivated plants and millennia of human selection.

  5. Genomic profiling of papillary renal cell tumours identifies small regions of DNA alterations: a possible role of HNF1B in tumour development

    NARCIS (Netherlands)

    Szponar, A.; Yusenko, M.V.; Kuiper, R.P.; Geurts van Kessel, A.H.M.; Kovacs, G.

    2011-01-01

    AIMS: Papillary renal cell tumours (RCT) are characterized by specific trisomies. The aim of this study was to identify small regions of duplication marking putative tumour genes. METHODS AND RESULTS: Full-tiling path bacterial artificial chromosome (BAC) array hybridization of 20 papillary RCTs con

  6. Congenital diaphragmatic hernia and chromosome 15q26: determination of a candidate region by use of fluorescent in situ hybridization and array-based comparative genomic hybridization

    NARCIS (Netherlands)

    M. Klaassens (Merel); C. Wouters (Cokkie); M.F. van Dooren (Marieke); H.J.F.M.M. Eussen (Bert); H. Douben (Hannie); J.E.M.M. de Klein (Annelies); A.T. den Dekker (Alexander); C. Lee; P.K. Donahoe; D. Tibboel (Dick); R-J.H. Galjaard (Robert-Jan); N.N.T. Goemaere (Natascha); B.A. Oostra (Ben); R.R. de Krijger (Ronald); J. Wauters (Jan)

    2005-01-01

    textabstractCongenital diaphragmatic hernia (CDH) has an incidence of 1 in 3,000 births and a high mortality rate (33%-58%). Multifactorial inheritance, teratogenic agents, and genetic abnormalities have all been suggested as possible etiologic factors. To define candidate regions

  7. A genome-wide association study identifies a region at chromosome 12 as a potential susceptibility locus for restenosis after percutaneous coronary intervention

    Science.gov (United States)

    Sampietro, M. Lourdes; Trompet, Stella; Verschuren, Jeffrey J.W.; Talens, Rudolf P.; Deelen, Joris; Heijmans, Bastiaan T.; de Winter, Robbert J.; Tio, Rene A.; Doevendans, Pieter A.F.M.; Ganesh, Santhi K.; Nabel, Elizabeth G.; Westra, Harm-Jan; Franke, Lude; van den Akker, Erik B.; Westendorp, Rudi G.J.; Zwinderman, Aeilko H.; Kastrati, Adnan; Koch, Werner; Slagboom, P.Eline; de Knijff, Peter; Jukema, J. Wouter

    2011-01-01

    Percutaneous coronary intervention (PCI) has become an effective therapy to treat obstructive coronary artery diseases (CAD). However, one of the major drawbacks of PCI is the occurrence of restenosis in 5–25% of all initially treated patients. Restenosis is defined as the re-narrowing of the lumen of the blood vessel, resulting in renewed symptoms and the need for repeated intervention. To identify genetic variants that are associated with restenosis, a genome-wide association study (GWAS) was conducted in 295 patients who developed restenosis (cases) and 571 who did not (controls) from the GENetic Determinants of Restenosis (GENDER) study. Analysis of ∼550 000 single nucleotide polymorphisms (SNPs) in GENDER was followed by a replication phase in three independent case–control populations (533 cases and 3067 controls). A potential susceptibility locus for restenosis at chromosome 12, including rs10861032 (Pcombined = 1.11 × 10−7) and rs9804922 (Pcombined = 1.45 × 10−6), was identified in the GWAS and replication phase. In addition, both SNPs were also associated with coronary events (rs10861032, Padditive = 0.005; rs9804922, Padditive = 0.023) in a trial based cohort set of elderly patients with (enhanced risk of) CAD (PROSPER) and all-cause mortality in PROSPER (rs10861032, Padditive = 0.007; rs9804922, Padditive = 0.013) and GENDER (rs10861032, Padditive = 0.005; rs9804922, Padditive = 0.023). Further analysis suggests that this locus could be involved in regulatory functions. PMID:21878436

  8. The OXA1L gene that controls cytochrome oxidase assembly maps to the 14q11.2 region of the human genome

    Energy Technology Data Exchange (ETDEWEB)

    Molina-Gomes, D.; Viegas-Pequignot, E. [INSERM, Paris (France); Bonnefoy, N.; Dujardin, G. [Universite Paris, Gif sur Yvette (France)] [and others

    1995-11-20

    Cytochrome-c oxidase, the terminal complex of the mitochondrial respiratory chain that transfers electrons from cytochrome c to oxygen, has a critical role in cellular energy metabolism. In eukaryotes, the cytochrome-c oxidase complex is composed of from 7 to 13 subunits (in mammals), and its assembly depends on several nuclear-encoded proteins. The 0XA1 gene, which was first isolated in Saccharomyces cerevisiae, encodes a protein essential for cytochrome-c oxidase assembly. The human OXA1-like (OXA1L, previously designated OXA1Hs) cDNA was isolated by functional complementation of an oxa1{sup -} mutation in yeast. The deduced sequences of the two Oxa1 and Oxa1L proteins share 33% identity. Oxygen consumption measurements and cytochrome absorption spectra show that replacement of the yeast protein with the human homolog leads to the correct assembly of cytochrome-c oxidase, suggesting that these proteins play essentially the same role in both organisms. In this report, we have used both somatic cell hybrid mapping and in situ hybridization to localize the OXA1L gene on the human genome. 7 refs., 2 figs.

  9. A genome wide association study for backfat thickness in Italian Large White pigs highlights new regions affecting fat deposition including neuronal genes

    Directory of Open Access Journals (Sweden)

    Fontanesi Luca

    2012-11-01

    Full Text Available Abstract Background Carcass fatness is an important trait in most pig breeding programs. Following market requests, breeding plans for fresh pork consumption are usually designed to reduce carcass fat content and increase lean meat deposition. However, the Italian pig industry is mainly devoted to the production of Protected Designation of Origin dry cured hams: pigs are slaughtered at around 160 kg of live weight and the breeding goal aims at maintaining fat coverage, measured as backfat thickness to avoid excessive desiccation of the hams. This objective has shaped the genetic pool of Italian heavy pig breeds for a few decades. In this study we applied a selective genotyping approach within a population of ~ 12,000 performance tested Italian Large White pigs. Within this population, we selectively genotyped 304 pigs with extreme and divergent backfat thickness estimated breeding value by the Illumina PorcineSNP60 BeadChip and performed a genome wide association study to identify loci associated to this trait. Results We identified 4 single nucleotide polymorphisms with P≤5.0E-07 and additional 119 ones with 5.0E-07 Conclusions Further investigations are needed to evaluate the effects of the identified single nucleotide polymorphisms associated with backfat thickness on other traits as a pre-requisite for practical applications in breeding programs. Reported results could improve our understanding of the biology of fat metabolism and deposition that could also be relevant for other mammalian species including humans, confirming the role of neuronal genes on obesity.

  10. The genomic landscapes of histone H3-Lys9 modifications of gene promoter regions and expression profiles in human bone marrow mesenchymal stem cells

    Institute of Scientific and Technical Information of China (English)

    2008-01-01

    Mesenchymal stem cells (MSCs) of nonembryortic origins possess the proliferation and multi-lineage differentiation potentials. It has been established that epigenetic mechanisms could be critical for determining the fate of stem ceils, and MSCs derived from different origins exhibited different expression profiles individually to a certain extent. In this study, ChiP-on-chip was used to generate genome-wide historic H3-Lys9 acetylation and dimethylation profiles at gene promoters in human bone marrow MSCs. We showed that modifications of histone H3-Lys9 at gene promoters correlated well with mRNA expression in human bone marrow MSCs. Functional analysis revealed that many key cellular pathways in human bone marrow MSC self-renewal, such as the canonical signaling pathways,cell cycle pathways and cytokine related pathways may be regulated by H3-Lys9 modifications. These data suggest that gene activation and silencing affected by H3-Lys9 acetylation and dimethylation, respectively, may be essential to the maintenance of human bone marrow MSC self-renewal and multi-potency.

  11. Accounting for discovery bias in genomic prediction

    Science.gov (United States)

    Our objective was to evaluate an approach to mitigating discovery bias in genomic prediction. Accuracy may be improved by placing greater emphasis on regions of the genome expected to be more influential on a trait. Methods emphasizing regions result in a phenomenon known as “discovery bias” if info...

  12. Smoking correlates with increased cytoskeletal protein‐related coding region mutations in the lung and head and neck datasets of the cancer genome atlas

    OpenAIRE

    Yavorski, John M.; Blanck, George

    2016-01-01

    Abstract Cancer from smoking tobacco is considered dependent on mutagens, but significant molecular aspects of smoking‐specific, cancer development remain unknown. We defined sets of coding regions for oncoproteins, tumor suppressor proteins, and cytoskeletal‐related proteins that were compared between nonsmokers and smokers, for mutation occurrences, in the lung adenocarcinoma (LUAD), head and neck squamous carcinoma (HNSC), bladder carcinoma (BLCA), and pancreatic adenocarcinoma ( PAAD) dat...

  13. Rhipicephalus (Boophilus) microplus strain Deutsch, whole genome shotgun sequencing project first submission of genome sequence

    Science.gov (United States)

    The size and repetitive nature of the Rhipicephalus microplus genome makes obtaining a full genome sequence difficult. Cot filtration/selection techniques were used to reduce the repetitive fraction of the tick genome and enrich for the fraction of DNA with gene-containing regions. The Cot-selected ...

  14. The bonobo genome compared with the chimpanzee and human genomes.

    Science.gov (United States)

    Prüfer, Kay; Munch, Kasper; Hellmann, Ines; Akagi, Keiko; Miller, Jason R; Walenz, Brian; Koren, Sergey; Sutton, Granger; Kodira, Chinnappa; Winer, Roger; Knight, James R; Mullikin, James C; Meader, Stephen J; Ponting, Chris P; Lunter, Gerton; Higashino, Saneyuki; Hobolth, Asger; Dutheil, Julien; Karakoç, Emre; Alkan, Can; Sajjadian, Saba; Catacchio, Claudia Rita; Ventura, Mario; Marques-Bonet, Tomas; Eichler, Evan E; André, Claudine; Atencia, Rebeca; Mugisha, Lawrence; Junhold, Jörg; Patterson, Nick; Siebauer, Michael; Good, Jeffrey M; Fischer, Anne; Ptak, Susan E; Lachmann, Michael; Symer, David E; Mailund, Thomas; Schierup, Mikkel H; Andrés, Aida M; Kelso, Janet; Pääbo, Svante

    2012-06-28

    Two African apes are the closest living relatives of humans: the chimpanzee (Pan troglodytes) and the bonobo (Pan paniscus). Although they are similar in many respects, bonobos and chimpanzees differ strikingly in key social and sexual behaviours, and for some of these traits they show more similarity with humans than with each other. Here we report the sequencing and assembly of the bonobo genome to study its evolutionary relationship with the chimpanzee and human genomes. We find that more than three per cent of the human genome is more closely related to either the bonobo or the chimpanzee genome than these are to each other. These regions allow various aspects of the ancestry of the two ape species to be reconstructed. In addition, many of the regions that overlap genes may eventually help us understand the genetic basis of phenotypes that humans share with one of the two apes to the exclusion of the other.

  15. The bonobo genome compared with the chimpanzee and human genomes

    Science.gov (United States)

    Prüfer, Kay; Munch, Kasper; Hellmann, Ines; Akagi, Keiko; Miller, Jason R.; Walenz, Brian; Koren, Sergey; Sutton, Granger; Kodira, Chinnappa; Winer, Roger; Knight, James R.; Mullikin, James C.; Meader, Stephen J.; Ponting, Chris P.; Lunter, Gerton; Higashino, Saneyuki; Hobolth, Asger; Dutheil, Julien; Karakoç, Emre; Alkan, Can; Sajjadian, Saba; Catacchio, Claudia Rita; Ventura, Mario; Marques-Bonet, Tomas; Eichler, Evan E.; André, Claudine; Atencia, Rebeca; Mugisha, Lawrence; Junhold, Jörg; Patterson, Nick; Siebauer, Michael; Good, Jeffrey M.; Fischer, Anne; Ptak, Susan E.; Lachmann, Michael; Symer, David E.; Mailund, Thomas; Schierup, Mikkel H.; Andrés, Aida M.; Kelso, Janet; Pääbo, Svante

    2012-01-01

    Two African apes are the closest living relatives of humans: the chimpanzee (Pan troglodytes) and the bonobo (Pan paniscus). Although they are similar in many respects, bonobos and chimpanzees differ strikingly in key social and sexual behaviours1–4, and for some of these traits they show more similarity with humans than with each other. Here we report the sequencing and assembly of the bonobo genome to study its evolutionary relationship with the chimpanzee and human genomes. We find that more than three per cent of the human genome is more closely related to either the bonobo or the chimpanzee genome than these are to each other. These regions allow various aspects of the ancestry of the two ape species to be reconstructed. In addition, many of the regions that overlap genes may eventually help us understand the genetic basis of phenotypes that humans share with one of the two apes to the exclusion of the other. PMID:22722832

  16. Analysis of gene order data supports vertical inheritance of the leukotoxin operon and genome rearrangements in the 5' flanking region in genus Mannheimia

    DEFF Research Database (Denmark)

    Larsen, Jesper; Kuhnert, Peter; Frey, Joachim;

    2007-01-01

    examined the gene order in the 5' flanking region of the leukotoxin operon and found that the 5' flanking gene strings, hslVU-lapB-artJ-lktC and xylAB-lktC, are peculiar to M. haemolytica + M. glucosida and M. granulomatis, respectively, whereas the gene string hslVU-lapB-lktC is present in M. ruminalis......, the supposed sister group of M. haemolytica + M. glucosida, and in the most ancient subclade M. varigena. In M. granulomatis, we found remnants of the gene string hslVU-lapB-lktC in the xylB-lktC intergenic region. CONCLUSIONS: These observations indicate that the gene string hslVU-lapB-lktC is more ancient...... than the hslVU-lapB-artJ-lktC and xylAB-lktC gene strings. The presence of (remnants of) the ancient gene string hslVU-lapB-lktC among any subclades within genus Mannheimia supports that it has been vertically inherited from the last common ancestor of genus Mannheimia to any ancestor of the diverging...

  17. Listeria Genomics

    Science.gov (United States)

    Cabanes, Didier; Sousa, Sandra; Cossart, Pascale

    The opportunistic intracellular foodborne pathogen Listeria monocytogenes has become a paradigm for the study of host-pathogen interactions and bacterial adaptation to mammalian hosts. Analysis of L. monocytogenes infection has provided considerable insight into how bacteria invade cells, move intracellularly, and disseminate in tissues, as well as tools to address fundamental processes in cell biology. Moreover, the vast amount of knowledge that has been gathered through in-depth comparative genomic analyses and in vivo studies makes L. monocytogenes one of the most well-studied bacterial pathogens. This chapter provides an overview of progress in the exploration of genomic, transcriptomic, and proteomic data in Listeria spp. to understand genome evolution and diversity, as well as physiological aspects of metabolism used by bacteria when growing in diverse environments, in particular in infected hosts.

  18. Marine genomics

    DEFF Research Database (Denmark)

    Oliveira Ribeiro, Ângela Maria; Foote, Andrew D.; Kupczok, Anne

    2017-01-01

    Marine ecosystems occupy 71% of the surface of our planet, yet we know little about their diversity. Although the inventory of species is continually increasing, as registered by the Census of Marine Life program, only about 10% of the estimated two million marine species are known. This lag......-throughput sequencing approaches have been helping to improve our knowledge of marine biodiversity, from the rich microbial biota that forms the base of the tree of life to a wealth of plant and animal species. In this review, we present an overview of the applications of genomics to the study of marine life, from...... evolutionary biology of non-model organisms to species of commercial relevance for fishing, aquaculture and biomedicine. Instead of providing an exhaustive list of available genomic data, we rather set to present contextualized examples that best represent the current status of the field of marine genomics....

  19. Sequence variations of the locus-specific 5' untranslated regions of SLA class I genes and the development of a comprehensive genomic DNA-based high-resolution typing method for SLA-2.

    Science.gov (United States)

    Choi, H; Le, M T; Lee, H; Choi, M-K; Cho, H-S; Nagasundarapandian, S; Kwon, O-J; Kim, J-H; Seo, K; Park, J-K; Lee, J-H; Ho, C-S; Park, C

    2015-10-01

    The genetic diversity of the major histocompatibility complex (MHC) class I molecules of pigs has not been well characterized. Therefore, the influence of MHC genetic diversity on the immune-related traits of pigs, including disease resistance and other MHC-dependent traits, is not well understood. Here, we attempted to develop an efficient method for systemic analysis of the polymorphisms in the epitope-binding region of swine leukocyte antigens (SLA) class I genes. We performed a comparative analysis of the last 92 bp of the 5' untranslated region (UTR) to the beginning of exon 4 of six SLA classical class I-related genes, SLA-1, -2, -3, -4, -5, and -9, from 36 different sequences. Based on this information, we developed a genomic polymerase chain reaction (PCR) and direct sequencing-based comprehensive typing method for SLA-2. We successfully typed SLA-2 from 400 pigs and 8 cell lines, consisting of 9 different pig breeds, and identified 49 SLA-2 alleles, including 31 previously reported alleles and 18 new alleles. We observed differences in the composition of SLA-2 alleles among different breeds. Our method can be used to study other SLA class I loci and to deepen our knowledge of MHC class I genes in pigs.

  20. Ancient genomics

    DEFF Research Database (Denmark)

    Der Sarkissian, Clio; Allentoft, Morten Erik; Avila Arcos, Maria del Carmen

    2015-01-01

    by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans......, archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when...

  1. Genome Sequencing

    DEFF Research Database (Denmark)

    Sato, Shusei; Andersen, Stig Uggerhøj

    2014-01-01

    The current Lotus japonicus reference genome sequence is based on a hybrid assembly of Sanger TAC/BAC, Sanger shotgun and Illumina shotgun sequencing data generated from the Miyakojima-MG20 accession. It covers nearly all expressed L. japonicus genes and has been annotated mainly based on transcr......The current Lotus japonicus reference genome sequence is based on a hybrid assembly of Sanger TAC/BAC, Sanger shotgun and Illumina shotgun sequencing data generated from the Miyakojima-MG20 accession. It covers nearly all expressed L. japonicus genes and has been annotated mainly based...

  2. PromBase: a web resource for various genomic features and predicted promoters in prokaryotic genomes

    Directory of Open Access Journals (Sweden)

    Bansal Manju

    2011-07-01

    Full Text Available Abstract Background As more and more genomes are being sequenced, an overview of their genomic features and annotation of their functional elements, which control the expression of each gene or transcription unit of the genome, is a fundamental challenge in genomics and bioinformatics. Findings Relative stability of DNA sequence has been used to predict promoter regions in 913 microbial genomic sequences with GC-content ranging from 16.6% to 74.9%. Irrespective of the genome GC-content the relative stability based promoter prediction method has already been proven to be robust in terms of recall and precision. The predicted promoter regions for the 913 microbial genomes have been accumulated in a database called PromBase. Promoter search can be carried out in PromBase either by specifying the gene name or the genomic position. Each predicted promoter region has been assigned to a reliability class (low, medium, high, very high and highest based on the difference between its average free energy and the downstream region. The recall and precision values for each class are shown graphically in PromBase. In addition, PromBase provides detailed information about base composition, CDS and CG/TA skews for each genome and various DNA sequence dependent structural properties (average free energy, curvature and bendability in the vicinity of all annotated translation start sites (TLS. Conclusion PromBase is a database, which contains predicted promoter regions and detailed analysis of various genomic features for 913 microbial genomes. PromBase can serve as a valuable resource for comparative genomics study and help the experimentalist to rapidly access detailed information on various genomic features and putative promoter regions in any given genome. This database is freely accessible for academic and non- academic users via the worldwide web http://nucleix.mbu.iisc.ernet.in/prombase/.

  3. Array-based comparative genomic hybridization for genome-wide screening of DNA copy number in bladder tumors.

    NARCIS (Netherlands)

    Veltman, J.A.; Fridlyand, J.; Pejavar, S.; Olshen, A.B.; Korkola, J.E.; Vries, S. de; Carroll, P.; Kuo, W.L.; Pinkel, D.; Albertson, D.; Cordon-Cardo, C.; Jain, A.N.; Waldman, F.M.

    2003-01-01

    Genome-wide copy number profiles were characterized in 41 primary bladder tumors using array-based comparative genomic hybridization (array CGH). In addition to previously identified alterations in large chromosomal regions, alterations were identified in many small genomic regions, some with high-l

  4. Stsub>2sub>-80 - A new FISH marker for St genome and genome analysis in Triticeae.

    Science.gov (United States)

    Wang, Long; Shi, Qinghua; Su, Handong; Wang, Yi; Lina, Sha; Fan, Xing; Houyang, Kang; Haiqin, Zhang; Zhou, Yong-Hong

    2017-03-17

    St genome is one of the most fundamental genomes in Triticeae. Repetitive sequences are widely used to distinguish different genomes or species. The primary objectives of this study are (1) to screen a new sequence that can easily distinguish the chromosome of St and other genome by fluorescence in situ hybridization (FISH); (2) to investigate the genome constitutions of some species remains uncertain and controversial. We used degenerated oligonucleotide primer PCR (Dop-PCR), Dot-blot and FISH to screen new marker for St genome and tested efficiency of this marker in detection St chromosome at different ploidy level. Signals of new FISH marker (denoted Stsub>2sub>-80) were found in the entire arm of chromosomes in St genome, except in centromeric region; by contrast, Stsub>2sub>-80 signals were found in the terminal region of chromosome in E, H, P and Y genomes. No signal was detected in A and B genomes, and only a teeny signals were detected in the terminal region of chromosomes in D genome. Stsub>2sub>-80 signals were obvious and stable in chromosomes of different genomes in either diploid or polyploid. Therefore, Stsub>2sub>-80 is a potential and useful FISH marker that can be used to the distinguish St and other genomes in Triticeae.

  5. 基于柑橘及其近缘属植物DNA条形码的叶绿体编码序列筛选%Screening Potential DNA Barcode Regions of Chloroplast Coding Genome for Citrus and Its Related Genera

    Institute of Scientific and Technical Information of China (English)

    于杰; 闫化学; 鲁振华; 周志钦

    2011-01-01

    [Objective] Four coding regions of chloroplast genome of Citrus and its close relatives were analyzed in an attempt to find suitable DNA barcoding markers for species identification and lay a foundation for further study of non-coding region.[ Method ] Four chloroplast DNA regions (matK, rpoB, rpoC1 and rbcL ) of 59 Citrus accessions were sequenced, the intergeneric,interspecific, intraspecific genetic distances were calculated, and the phylogenetic tree of all the accessions tested was built based on the distance data obtained. [Result] The intergeneric and interspecific sequence variations of matK were the highest among four coding regions tested, and had significant difference from other regions studied. On the contrary, no obvious variations were found in the rpoB and rpoC1 regions. The sequence variation of rbcL was medium among the fragments sequenced. [Conclusion] The matK sequence could be used as potential candidate fragment for future DNA barcoding study of Citrus and its closely related genera.%[目的]通过对柑橘及其近缘属植物叶绿体4种编码序列的测定分析,获得能进行DNA条形编码的特征序列,为进一步研究叶绿体非编码区序列奠定基础.[方法]对柑橘及其近缘属植物59份样品进行matK、rpoB、rpoC1、rbcL测序,序列比对与人工校正,计算属间,种同、种内的遗传距离,比较序列间的差异,建立系统发育树.[结果]4种序列中,matK序列在属间、种间差异最大,与其它序列相比具有显著性差异,rbcL序列次之,而rpoB、rpoC1序列两者间没有显著性差异.[结论]matK序列是柑橘及其近缘属植物DNA条形码的未来研究中一个重要的候选片段.

  6. Cephalopod genomics

    DEFF Research Database (Denmark)

    Albertin, Caroline B.; Bonnaud, Laure; Brown, C. Titus

    2012-01-01

    The Cephalopod Sequencing Consortium (CephSeq Consortium) was established at a NESCent Catalysis Group Meeting, ``Paths to Cephalopod Genomics-Strategies, Choices, Organization,'' held in Durham, North Carolina, USA on May 24-27, 2012. Twenty-eight participants representing nine countries (Austri...

  7. Ancient genomics

    DEFF Research Database (Denmark)

    Der Sarkissian, Clio; Allentoft, Morten Erik; Avila Arcos, Maria del Carmen;

    2015-01-01

    , archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when...

  8. High Species C Human Adenovirus Genome Copy Numbers in the Treated Water Supply of a Neotropical Area of the Central-West Region of Brazil.

    Science.gov (United States)

    Silva, Hugo D; Fongaro, Gislaine; Garcíazapata, Marco T A; Melo, Arthur T O; Silveira-Lacerda, Elisângela P; de Faria, Karla M S; Anunciação, Carlos E

    2015-09-01

    There is little information about the presence of human adenovirus (HAdV) in drinking water in Neotropical regions. Thus, the present study sought to conduct quantification and molecular characterization of HAdVs detected in treated water samples from an area of the Cerrado ecoregion of Brazil. Between August and November 2012, samples were collected from four treated water reservoirs and their respective sites along the water distribution network of the city of Goiânia, for a total of 80 samples. All samples were concentrated and analyzed by qPCR, and selected samples were sequenced. Overall, 76.6 (10(0)-10(9) GC mL(-1)) and 37.5% (10(1)-10(8) GC mL(-1)) of samples drawn from reservoirs and their distribution sites, respectively, were positive for virus by qPCR. All samples selected for sequencing were characterized as species C human adenovirus. Such high HAdV counts have in treated water samples. This finding merits special attention, particularly from the sanitation authorities, because the high number of GC mL(-1) may be an indicative of risk to human health.

  9. Identification of genome regions controlling cotyledon, pod wall/seed coat and pod wall resistance to pea weevil through QTL mapping.

    Science.gov (United States)

    Aryamanesh, N; Zeng, Y; Byrne, O; Hardie, D C; Al-Subhi, A M; Khan, T; Siddique, K H M; Yan, G

    2013-11-15

    Pea weevil, Bruchus pisorum, is one of the limiting factors for field pea (Pisum sativum) cultivation in the world with pesticide application the only available method for its control. Resistance to pea weevil has been found in an accession of Pisum fulvum but transfer of this resistance to cultivated pea (P. sativum) is limited due to a lack of easy-to-use techniques for screening interspecific breeding populations. To address this problem, an interspecific population was created from a cross between cultivated field pea and P. fulvum (resistance source). Quantitative trait locus (QTL) mapping was performed to discover the regions associated with resistance to cotyledon, pod wall/seed coat and pod wall resistance. Three major QTLs, located on linkage groups LG2, LG4 and LG5 were found for cotyledon resistance explaining approximately 80 % of the phenotypic variation. Two major QTLs were found for pod wall/seed coat resistance on LG2 and LG5 explaining approximately 70 % of the phenotypic variation. Co-linearity of QTLs for cotyledon and pod wall/seed coat resistance suggested that the mechanism of resistance for these two traits might act through the same pathways. Only one QTL was found for pod wall resistance on LG7 explaining approximately 9 % of the phenotypic variation. This is the first report on the development of QTL markers to probe Pisum germplasm for pea weevil resistance genes. These flanking markers will be useful in accelerating the process of screening when breeding for pea weevil resistance.

  10. Contrasting patterns of population connectivity between regions in a commercially important mollusc Haliotis rubra: integrating population genetics, genomics and marine LiDAR data.

    Science.gov (United States)

    Miller, A D; van Rooyen, A; Rašić, G; Ierodiaconou, D A; Gorfine, H K; Day, R; Wong, C; Hoffmann, A A; Weeks, A R

    2016-08-01

    Estimating contemporary genetic structure and population connectivity in marine species is challenging, often compromised by genetic markers that lack adequate sensitivity, and unstructured sampling regimes. We show how these limitations can be overcome via the integration of modern genotyping methods and sampling designs guided by LiDAR and SONAR data sets. Here we explore patterns of gene flow and local genetic structure in a commercially harvested abalone species (Haliotis rubra) from southeastern Australia, where the viability of fishing stocks is believed to be dictated by recruitment from local sources. Using a panel of microsatellite and genomewide SNP markers, we compare allele frequencies across a replicated hierarchical sampling area guided by bathymetric LiDAR imagery. Results indicate high levels of gene flow and no significant genetic structure within or between benthic reef habitats across 1400 km of coastline. These findings differ to those reported for other regions of the fishery indicating that larval supply is likely to be spatially variable, with implications for management and long-term recovery from stock depletion. The study highlights the utility of suitably designed genetic markers and spatially informed sampling strategies for gaining insights into recruitment patterns in benthic marine species, assisting in conservation planning and sustainable management of fisheries.

  11. The Drosophila SRF homolog is expressed in a subset of tracheal cells and maps within a genomic region required for tracheal development.

    Science.gov (United States)

    Affolter, M; Montagne, J; Walldorf, U; Groppe, J; Kloter, U; LaRosa, M; Gehring, W J

    1994-04-01

    The Drosophila homolog of the vertebrate serum response factor (SRF) was isolated by low stringency hybridization. Nucleotide sequence analysis revealed that the Drosophila SRF homolog (DSRF) codes for a protein that displays 93% sequence identity with human SRF in the MADS domain, the region required for DNA binding, dimerization and interaction with accessory factors. The DSRF gene is expressed during several phases of embryonic development. In the egg, both the RNA and the protein are maternal in origin and slowly decrease in amount during gastrulation. After germ band retraction, high levels of zygotic expression are observed in a distinct subset of peripheral tracheal cells distributed throughout the embryo. Many of these cells are at the tip of tracheal branches and are in direct contact with the target tissues. The DSRF gene was mapped to position 60C on the second chromosome, and overlapping deficiencies which remove the gene were identified. Analysis of tracheal development in embryos carrying these deletions revealed a degeneration of most of the major branches of the tracheal system. Although the initial migration of tracheal cells was not affected in those deficient embryos, many tracheal cells appeared not to maintain their correct position and continued to migrate. Thus, the DSRF gene might play a role in the proper formation and maintenance of the trachea.

  12. Classifying Genomic Sequences by Sequence Feature Analysis

    Institute of Scientific and Technical Information of China (English)

    Zhi-Hua Liu; Dian Jiao; Xiao Sun

    2005-01-01

    Traditional sequence analysis depends on sequence alignment. In this study, we analyzed various functional regions of the human genome based on sequence features, including word frequency, dinucleotide relative abundance, and base-base correlation. We analyzed the human chromosome 22 and classified the upstream,exon, intron, downstream, and intergenic regions by principal component analysis and discriminant analysis of these features. The results show that we could classify the functional regions of genome based on sequence feature and discriminant analysis.

  13. Sub-megabase resolution tiling (SMRT array-based comparative genomic hybridization profiling reveals novel gains and losses of chromosomal regions in Hodgkin Lymphoma and Anaplastic Large Cell Lymphoma cell lines

    Directory of Open Access Journals (Sweden)

    Lam Wan L

    2008-01-01

    Full Text Available Abstract Background Hodgkin lymphoma (HL and Anaplastic Large Cell Lymphoma (ALCL, are forms of malignant lymphoma defined by unique morphologic, immunophenotypic, genotypic, and clinical characteristics, but both overexpress CD30. We used sub-megabase resolution tiling (SMRT array-based comparative genomic hybridization to screen HL-derived cell lines (KMH2 and L428 and ALCL cell lines (DEL and SR-786 in order to identify disease-associated gene copy number gains and losses. Results Significant copy number gains and losses were observed on several chromosomes in all four cell lines. Assessment of copy number alterations with 26,819 DNA segments identified an average of 20 genetic alterations. Of the recurrent minimally altered regions identified, 11 (55% were within previously published regions of chromosomal alterations in HL and ALCL cell lines while 9 (45% were novel alterations not previously reported. HL cell lines L428 and KMH2 shared gains in chromosome cytobands 2q23.1-q24.2, 7q32.2-q36.3, 9p21.3-p13.3, 12q13.13-q14.1, and losses in 13q12.13-q12.3, and 18q21.32-q23. ALCL cell lines SR-786 and DEL, showed gains in cytobands 5p15.32-p14.3, 20p12.3-q13.11, and 20q13.2-q13.32. Both pairs of HL and ALCL cell lines showed losses in 18q21.32-18q23. Conclusion This study is considered to be the first one describing HL and ALCL cell line genomes at sub-megabase resolution. This high-resolution analysis allowed us to propose novel candidate target genes that could potentially contribute to the pathogenesis of HL and ALCL. FISH was used to confirm the amplification of all three isoforms of the trypsin gene (PRSS1/PRSS2/PRSS3 in KMH2 and L428 (HL and DEL (ALCL cell lines. These are novel findings that have not been previously reported in the lymphoma literature, and opens up an entirely new area of research that has not been previously associated with lymphoma biology. The findings raise interesting possibilities about the role of signaling

  14. The CGView Server: a comparative genomics tool for circular genomes.

    Science.gov (United States)

    Grant, Jason R; Stothard, Paul

    2008-07-01

    The CGView Server generates graphical maps of circular genomes that show sequence features, base composition plots, analysis results and sequence similarity plots. Sequences can be supplied in raw, FASTA, GenBank or EMBL format. Additional feature or analysis information can be submitted in the form of GFF (General Feature Format) files. The server uses BLAST to compare the primary sequence to up to three comparison genomes or sequence sets. The BLAST results and feature information are converted to a graphical map showing the entire sequence, or an expanded and more detailed view of a region of interest. Several options are included to control which types of features are displayed and how the features are drawn. The CGView Server can be used to visualize features associated with any bacterial, plasmid, chloroplast or mitochondrial genome, and can aid in the identification of conserved genome segments, instances of horizontal gene transfer, and differences in gene copy number. Because a collection of sequences can be used in place of a comparison genome, maps can also be used to visualize regions of a known genome covered by newly obtained sequence reads. The CGView Server can be accessed at http://stothard.afns.ualberta.ca/cgview_server/

  15. Reticulate evolution of the rye genome.

    Science.gov (United States)

    Martis, Mihaela M; Zhou, Ruonan; Haseneyer, Grit; Schmutzer, Thomas; Vrána, Jan; Kubaláková, Marie; König, Susanne; Kugler, Karl G; Scholz, Uwe; Hackauf, Bernd; Korzun, Viktor; Schön, Chris-Carolin; Dolezel, Jaroslav; Bauer, Eva; Mayer, Klaus F X; Stein, Nils

    2013-10-01

    Rye (Secale cereale) is closely related to wheat (Triticum aestivum) and barley (Hordeum vulgare). Due to its large genome (~8 Gb) and its regional importance, genome analysis of rye has lagged behind other cereals. Here, we established a virtual linear gene order model (genome zipper) comprising 22,426 or 72% of the detected set of 31,008 rye genes. This was achieved by high-throughput transcript mapping, chromosome survey sequencing, and integration of conserved synteny information of three sequenced model grass genomes (Brachypodium distachyon, rice [Oryza sativa], and sorghum [Sorghum bicolor]). This enabled a genome-wide high-density comparative analysis of rye/barley/model grass genome synteny. Seventeen conserved syntenic linkage blocks making up the rye and barley genomes were defined in comparison to model grass genomes. Six major translocations shaped the modern rye genome in comparison to a putative Triticeae ancestral genome. Strikingly dissimilar conserved syntenic gene content, gene sequence diversity signatures, and phylogenetic networks were found for individual rye syntenic blocks. This indicates that introgressive hybridizations (diploid or polyploidy hybrid speciation) and/or a series of whole-genome or chromosome duplications played a role in rye speciation and genome evolution.

  16. Pseudomonas aeruginosa genomic structure and diversity

    Directory of Open Access Journals (Sweden)

    Jens eKlockgether

    2011-07-01

    Full Text Available The Pseudomonas aeruginosa genome (G + C content 65-67%, size 5.5 – 7 Mbp is made up of a single circular chromosome and a variable number of plasmids. Sequencing of complete genomes or blocks of the accessory genome has revealed that the genome encodes a large repertoire of transporters, transcriptional regulators and two-component regulatory systems which reflects its metabolic diversity to utilize a broad range of nutrients. The conserved core component of the genome is largely collinear among P. aeruginosa strains and exhibits an interclonal sequence diversity of 0.5 – 0.7%. Only a few loci of the core genome are subject to diversifying selection. Genome diversity is mainly caused by accessory DNA elements located in 79 regions of genome plasticity that are scattered around the genome and show an anomalous usage of mono- to tetradecanucleotides. Genomic islands of the pKLC102/PAGI-2 family that integrate into tRNALys or tRNAGly genes represent hotspots of inter- and intraclonal genomic diversity. The individual islands differ in their repertoire of metabolic genes that make a large contribution to the pangenome. In order to unravel intraclonal diversity of P. aeruginosa, the genomes of two members of the PA14 clonal complex from diverse habitats and geographic origin were compared. The genome sequences differed by less than 0.01% from each other. 198 of the 231 SNPs were non-randomly distributed in the genome. Non-synonymous SNPs were mainly found in an integrated Pf1-like phage and in genes involved in transcriptional regulation, membrane and extracellular constituents, transport and secretion. In summary, P. aeruginosa is endowed with a highly conserved core genome of low sequence diversity and a highly variable accessory genome that communicates with other pseudomonads and genera via horizontal gene transfer.

  17. Reticulate Evolution of the Rye Genome

    OpenAIRE

    2013-01-01

    Rye (Secale cereale) is closely related to wheat (Triticum aestivum) and barley (Hordeum vulgare). Due to its large genome (similar to 8 Gb) and its regional importance, genome analysis of rye has lagged behind other cereals. Here, we established a virtual linear gene order model (genome zipper) comprising 22,426 or 72% of the detected set of 31,008 rye genes. This was achieved by high-throughput transcript mapping, chromosome survey sequencing, and integration of conserved synteny informatio...

  18. The genome of Eucalyptus grandis

    Energy Technology Data Exchange (ETDEWEB)

    Myburg, Alexander A.; Grattapaglia, Dario; Tuskan, Gerald A.; Hellsten, Uffe; Hayes, Richard D.; Grimwood, Jane; Jenkins, Jerry; Lindquist, Erika; Tice, Hope; Bauer, Diane; Goodstein, David M.; Dubchak, Inna; Poliakov, Alexandre; Mizrachi, Eshchar; Kullan, Anand R. K.; Hussey, Steven G.; Pinard, Desre; van der Merwe, Karen; Singh, Pooja; van Jaarsveld, Ida; Silva-Junior, Orzenil B.; Togawa, Roberto C.; Pappas, Marilia R.; Faria, Danielle A.; Sansaloni, Carolina P.; Petroli, Cesar D.; Yang, Xiaohan; Ranjan, Priya; Tschaplinski, Timothy J.; Ye, Chu-Yu; Li, Ting; Sterck, Lieven; Vanneste, Kevin; Murat, Florent; Soler, Marçal; Clemente, Hélène San; Saidi, Naijib; Cassan-Wang, Hua; Dunand, Christophe; Hefer, Charles A.; Bornberg-Bauer, Erich; Kersting, Anna R.; Vining, Kelly; Amarasinghe, Vindhya; Ranik, Martin; Naithani, Sushma; Elser, Justin; Boyd, Alexander E.; Liston, Aaron; Spatafora, Joseph W.; Dharmwardhana, Palitha; Raja, Rajani; Sullivan, Christopher; Romanel, Elisson; Alves-Ferreira, Marcio; Külheim, Carsten; Foley, William; Carocha, Victor; Paiva, Jorge; Kudrna, David; Brommonschenkel, Sergio H.; Pasquali, Giancarlo; Byrne, Margaret; Rigault, Philippe; Tibbits, Josquin; Spokevicius, Antanas; Jones, Rebecca C.; Steane, Dorothy A.; Vaillancourt, René E.; Potts, Brad M.; Joubert, Fourie; Barry, Kerrie; Pappas, Georgios J.; Strauss, Steven H.; Jaiswal, Pankaj; Grima-Pettenati, Jacqueline; Salse, Jérôme; Van de Peer, Yves; Rokhsar, Daniel S.; Schmutz, Jeremy

    2014-06-11

    Eucalypts are the world s most widely planted hardwood trees. Their broad adaptability, rich species diversity, fast growth and superior multipurpose wood, have made them a global renewable resource of fiber and energy that mitigates human pressures on natural forests. We sequenced and assembled >94% of the 640 Mbp genome of Eucalyptus grandis into its 11 chromosomes. A set of 36,376 protein coding genes were predicted revealing that 34% occur in tandem duplications, the largest proportion found thus far in any plant genome. Eucalypts also show the highest diversity of genes for plant specialized metabolism that act as chemical defence against biotic agents and provide unique pharmaceutical oils. Resequencing of a set of inbred tree genomes revealed regions of strongly conserved heterozygosity, likely hotspots of inbreeding depression. The resequenced genome of the sister species E. globulus underscored the high inter-specific genome colinearity despite substantial genome size variation in the genus. The genome of E. grandis is the first reference for the early diverging Rosid order Myrtales and is placed here basal to the Eurosids. This resource expands knowledge on the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology.

  19. Comparative genetic mapping revealed powdery mildew resistance gene MlWE4 derived from wild emmer is located in same genomic region of Pm36 and Ml3D232 on chromosome 5BL

    Institute of Scientific and Technical Information of China (English)

    ZHANG Dong; WANG Yong; CHEN Yong-xing; LIU Zhi-yong; OUYANG Shu-hong; WANG Li-li; CUI Yu; WU Qiu-hong; LIANG Yong; WANG Zhen-zhong; XIE Jing-zhong; ZHANG De-yun

    2015-01-01

    Powdery mildew, caused by Blumeria graminis f. sp. tritici, is one of the most devastating wheat diseases. Wild emmer wheat (Triticum turgidum ssp. dicoccoides) is a promising source of disease resistance for wheat. A powdery mildew resistance gene conferring resistance to B. graminis f. sp. tritici isolate E09, originating from wild emmer wheat, has been transferred into the hexaploid wheat line WE4 through crossing and backcrossing. Genetic analyses indicated that the powdery mildew resistance was control ed by a single dominant gene, temporarily designated MlWE4. By mean of comparative genomics and bulked segregant analysis, a genetic linkage map of MlWE4 was constructed, and MlWE4 was mapped on the distal region of chromosome arm 5BL. Comparative genetic linkage maps showed that genes MlWE4, Pm36 and Ml3D232 were co-segregated with markers XBD37670 and XBD37680, indicating they are likely the same gene or al eles in the same locus. The co-segregated markers provide a starting point for chromosome landing and map-based cloning of MlWE4, Pm36 and Ml3D232.

  20. The 5' untranslated region and Gag product of Idefix, a long terminal repeat-retrotransposon from Drosophila melanogaster, act together to initiate a switch between translated and untranslated states of the genomic mRNA.

    Science.gov (United States)

    Meignin, Carine; Bailly, Jean-Luc; Arnaud, Frédérick; Dastugue, Bernard; Vaury, Chantal

    2003-11-01

    Idefix is a long terminal repeat (LTR)-retrotransposon present in Drosophila melanogaster which shares similarities with vertebrates retroviruses both in its genomic arrangement and in the mechanism of transposition. Like in retroviruses, its two LTRs flank a long 5' untranslated region (5'UTR) and three open reading frames referred to as the gag, pol, and env genes. Here we report that its 5'UTR, located upstream of the gag gene, can fold into highly structured domains that are known to be incompatible with efficient translation by ribosome scanning. Using dicistronic plasmids analyzed by both (i) in vitro transcription and translation in rabbit reticulocyte or wheat germ lysates and (ii) in vivo expression in transgenic flies, we show that the 5'UTR of Idefix exhibits an internal ribosome entry site (IRES) activity that is able to promote translation of a downstream cistron in a cap-independent manner. The functional state of this novel IRES depends on eukaryotic factors that are independent of their host origin. However, in vivo, its function can be down-regulated by trans-acting factors specific to tissues or developmental stages of its host. We identify one of these trans-acting factors as the Gag protein encoded by Idefix itself. Our data support a model in which nascent Gag is able to block translation initiated from the viral mRNA and thus its own translation. These data highlight the fact that LTR-retrotransposons may autoregulate their replication cycle through their Gag production.

  1. The genomic region of rbcLS in Synechococcus sp. PCC 7942 contains genes involved in the ability to grow under low CO2 concentration and in chlorophyll biosynthesis.

    Science.gov (United States)

    Ronen-Tarazi, M; Lieman-Hurwitz, J; Gabay, C; Orus, M I; Kaplan, A

    1995-08-01

    Several genes involved in the ability of Synechococcus sp. PCC 7942 to grow under different CO2 concentrations were mapped in the genomic region of rbcLS (the operon encoding the large and small subunits of ribulose-1,5-bisphosphate carboxylase/oxygenase). Insertion of a cartridge encoding kanamycin resistance within open reading frame (ORF) 78, designated ccmJ, located 7 kb upstream of rbcLS, resulted in a kanamycin-resistant, high-CO2-requiring mutant, M3, which does not contain normal carboxysomes. ccmJ shows significant homology to csoS1 encoding a carboxysomal shell polypeptide in Thiobacillus neopolitanus. Analysis of the polypeptide pattern of a carboxysome-enriched fraction indicated several differences between the wild type and the mutant. The amount of the ribulose-1,5-bisphosphate carboxylase/oxygenase subunits was considerably smaller in the carboxysomal fraction of the mutant when compared to the wild type. On the basis of the sequence analyses, ORF286 and ORF466, located downstream of ccmJ, were identified as chlL and chlN, respectively, which are involved in chlorophyll biosynthesis in the dark.

  2. The function genomics study

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    @@ Genomics is a biology term appeared ten years ago, used to describe the researches of genomic mapping, sequencing, and structure analysis, etc. Genomics, the first journal for publishing papers on genomics research was born in 1986. In the past decade, the concept of genomics has been widely accepted by scientists who are engaging in biology research. Meanwhile, the research scope of genomics has been extended continuously, from simple gene mapping and sequencing to function genomics study. To reflect the change, genomics is divided into two parts now, the structure genomics and the function genomics.

  3. Visualization for genomics: the Microbial Genome Viewer.

    NARCIS (Netherlands)

    Kerkhoven, R.; Enckevort, F.H.J. van; Boekhorst, J.; Molenaar, D.; Siezen, R.J.

    2004-01-01

    SUMMARY: A Web-based visualization tool, the Microbial Genome Viewer, is presented that allows the user to combine complex genomic data in a highly interactive way. This Web tool enables the interactive generation of chromosome wheels and linear genome maps from genome annotation data stored in a My

  4. Genome analysis methods - PGDBj Registered plant list, Marker list, QTL list, Plant DB link & Genome analysis methods | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available [ Credits ] BLAST Search Image Search Home About Archive Update History Contact us PGDBj Registered...ear Year of genome analysis Sequencing method Sequencing method Read counts Read counts Covered genome region Covered...otation method Number of predicted genes Number of predicted genes Genome database Genome database informati... License Update History of This Database Site Policy | Contact Us Genome analysis... methods - PGDBj Registered plant list, Marker list, QTL list, Plant DB link & Genome analysis methods | LSDB Archive ...

  5. Is the pan-genome also a pan-selectome?

    DEFF Research Database (Denmark)

    Ussery, David; Rodriguez-Valera, Francisco

    2012-01-01

    The comparative genomics of prokaryotes has shown the presence of conserved regions containing highly similar genes (the 'core genome') and other regions that vary in gene content (the 'flexible' regions). A significant part of the latter is involved in surface structures that are phage recogniti...

  6. Selfish drive can trump function when animal mitochondrial genomes compete.

    Science.gov (United States)

    Ma, Hansong; O'Farrell, Patrick H

    2016-07-01

    Mitochondrial genomes compete for transmission from mother to progeny. We explored this competition by introducing a second genome into Drosophila melanogaster to follow transmission. Competitions between closely related genomes favored those functional in electron transport, resulting in a host-beneficial purifying selection. In contrast, matchups between distantly related genomes often favored those with negligible, negative or lethal consequences, indicating selfish selection. Exhibiting powerful selfish selection, a genome carrying a detrimental mutation displaced a complementing genome, leading to population death after several generations. In a different pairing, opposing selfish and purifying selection counterbalanced to give stable transmission of two genomes. Sequencing of recombinant mitochondrial genomes showed that the noncoding region, containing origins of replication, governs selfish transmission. Uniparental inheritance prevents encounters between distantly related genomes. Nonetheless, in each maternal lineage, constant competition among sibling genomes selects for super-replicators. We suggest that this relentless competition drives positive selection, promoting change in the sequences influencing transmission.

  7. Patterns of genome size variation in snapping shrimp.

    Science.gov (United States)

    Jeffery, Nicholas W; Hultgren, Kristin; Chak, Solomon Tin Chi; Gregory, T Ryan; Rubenstein, Dustin R

    2016-06-01

    Although crustaceans vary extensively in genome size, little is known about how genome size may affect the ecology and evolution of species in this diverse group, in part due to the lack of large genome size datasets. Here we investigate interspecific, intraspecific, and intracolony variation in genome size in 39 species of Synalpheus shrimps, representing one of the largest genome size datasets for a single genus within crustaceans. We find that genome size ranges approximately 4-fold across Synalpheus with little phylogenetic signal, and is not related to body size. In a subset of these species, genome size is related to chromosome size, but not to chromosome number, suggesting that despite large genomes, these species are not polyploid. Interestingly, there appears to be 35% intraspecific genome size variation in Synalpheus idios among geographic regions, and up to 30% variation in Synalpheus duffyi genome size within the same colony.

  8. Mitochondrial genomes of parasitic flatworms.

    Science.gov (United States)

    Le, Thanh H; Blair, David; McManus, Donald P

    2002-05-01

    Complete or near-complete mitochondrial genomes are now available for 11 species or strains of parasitic flatworms belonging to the Trematoda and the Cestoda. The organization of these genomes is not strikingly different from those of other eumetazoans, although one gene (atp8) commonly found in other phyla is absent from flatworms. The gene order in most flatworms has similarities to those seen in higher protostomes such as annelids. However, the gene order has been drastically altered in Schistosoma mansoni, which obscures this possible relationship. Among the sequenced taxa, base composition varies considerably, creating potential difficulties for phylogeny reconstruction. Long non-coding regions are present in all taxa, but these vary in length from only a few hundred to approximately 10000 nucleotides. Among Schistosoma spp., the long non-coding regions are rich in repeats and length variation among individuals is known. Data from mitochondrial genomes are valuable for studies on species identification, phylogenies and biogeography.

  9. Autopolyploidy genome duplication preserves other ancient genome duplications in Atlantic salmon (Salmo salar)

    Science.gov (United States)

    Davidson, William S.

    2017-01-01

    Salmonids (e.g. Atlantic salmon, Pacific salmon, and trouts) have a long legacy of genome duplication. In addition to three ancient genome duplications that all teleosts are thought to share, salmonids have had one additional genome duplication. We explored a methodology for untangling these duplications from each other to better understand them in Atlantic salmon. In this methodology, homeologous regions (paralogous/duplicated genomic regions originating from a whole genome duplication) from the most recent genome duplication were assumed to have duplicated genes at greater density and have greater sequence similarity. This assumption was used to differentiate duplicated gene pairs in Atlantic salmon that are either from the most recent genome duplication or from earlier duplications. From a comparison with multiple vertebrate species, it is clear that Atlantic salmon have retained more duplicated genes from ancient genome duplications than other vertebrates--often at higher density in the genome and containing fewer synonymous mutations. It may be that polysomic inheritance is the mechanism responsible for maintaining ancient gene duplicates in salmonids. Polysomic inheritance (when multiple chromosomes pair during meiosis) is thought to be relatively common in salmonids compared to other vertebrate species. These findings illuminate how genome duplications may not only increase the number of duplicated genes, but may also be involved in the maintenance of them from previous genome duplications as well. PMID:28241055

  10. Complete chloroplast genome of Sedum sarmentosum and chloroplast genome evolution in Saxifragales.

    Directory of Open Access Journals (Sweden)

    Wenpan Dong

    Full Text Available Comparative chloroplast genome analyses are mostly carried out at lower taxonomic levels, such as the family and genus levels. At higher taxonomic levels, chloroplast genomes are generally used to reconstruct phylogenies. However, little attention has been paid to chloroplast genome evolution within orders. Here, we present the chloroplast genome of Sedum sarmentosum and take advantage of several available (or elucidated chloroplast genomes to examine the evolution of chloroplast genomes in Saxifragales. The chloroplast genome of S. sarmentosum is 150,448 bp long and includes 82,212 bp of a large single-copy (LSC region, 16.670 bp of a small single-copy (SSC region, and a pair of 25,783 bp sequences of inverted repeats (IRs.The genome contains 131 unique genes, 18 of which are duplicated within the IRs. Based on a comparative analysis of chloroplast genomes from four representative Saxifragales families, we observed two gene losses and two pseudogenes in Paeonia obovata, and the loss of an intron was detected in the rps16 gene of Penthorum chinense. Comparisons among the 72 common protein-coding genes confirmed that the chloroplast genomes of S. sarmentosum and Paeonia obovata exhibit accelerated sequence evolution. Furthermore, a strong correlation was observed between the rates of genome evolution and genome size. The detected genome size variations are predominantly caused by the length of intergenic spacers, rather than losses of genes and introns, gene pseudogenization or IR expansion or contraction. The genome sizes of these species are negatively correlated with nucleotide substitution rates. Species with shorter duration of the life cycle tend to exhibit shorter chloroplast genomes than those with longer life cycles.

  11. The Giardia lamblia genome.

    Science.gov (United States)

    Adam, R D

    2000-04-10

    Giardia lamblia is a protozoan parasite of humans and other mammals that is thought to be one of the most primitive extant eukaryotic organisms. Although distinctly eukaryotic, it is notable for its lack of mitochondria, nucleoli, and perixosomes. It has been suggested that Giardia spp. are pre-mitochondriate organisms, but the identification of genes in G. lamblia thought to be of mitochondrial origin has generated controversy regarding that designation. Giardi lamblia trophozoites have two nuclei that are identical in all ways that have been studied. They are polyploid with at least four, and perhaps eight or more, copies of each of five chromosomes per organism and have an estimated genome complexity of 1.2x10(7)bp of DNA, and GC content of 46%. There is evidence for recombination at the telomeres of some of the chromosomes, and multiple size variants of single chromosomes have been identified within cloned isolates. However, the internal regions of the chromosomes demonstrate no evidence of recombination. For example, there is no evidence for control of vsp gene expression by DNA recombination, and no evidence for rapid mutation in the vsp genes. Single pass sequences of approximately 9% of the G. lamblia genome have already been obtained. An ongoing genome project plans to obtain approximately 95% of the genome by a random approach, as well as a complete physical map using a bacterial artificial chromosome library. The results will facilitate a better understanding of the biology of Giardia spp. as well as their phylogenetic relationship to other primitive organisms.

  12. Genomic landscape of liposarcoma.

    Science.gov (United States)

    Kanojia, Deepika; Nagata, Yasunobu; Garg, Manoj; Lee, Dhong Hyun; Sato, Aiko; Yoshida, Kenichi; Sato, Yusuke; Sanada, Masashi; Mayakonda, Anand; Bartenhagen, Christoph; Klein, Hans-Ulrich; Doan, Ngan B; Said, Jonathan W; Mohith, S; Gunasekar, Swetha; Shiraishi, Yuichi; Chiba, Kenichi; Tanaka, Hiroko; Miyano, Satoru; Myklebost, Ola; Yang, Henry; Dugas, Martin; Meza-Zepeda, Leonardo A; Silberman, Allan W; Forscher, Charles; Tyner, Jeffrey W; Ogawa, Seishi; Koeffler, H Phillip

    2015-12-15

    Liposarcoma (LPS) is the most common type of soft tissue sarcoma accounting for 20% of all adult sarcomas. Due to absence of clinically effective treatment options in inoperable situations and resistance to chemotherapeutics, a critical need exists to identify novel therapeutic targets. We analyzed LPS genomic landscape using SNP arrays, whole exome sequencing and targeted exome sequencing to uncover the genomic information for development of specific anti-cancer targets. SNP array analysis indicated known amplified genes (MDM2, CDK4, HMGA2) and important novel genes (UAP1, MIR557, LAMA4, CPM, IGF2, ERBB3, IGF1R). Carboxypeptidase M (CPM), recurrently amplified gene in well-differentiated/de-differentiated LPS was noted as a putative oncogene involved in the EGFR pathway. Notable deletions were found at chromosome 1p (RUNX3, ARID1A), chromosome 11q (ATM, CHEK1) and chromosome 13q14.2 (MIR15A, MIR16-1). Significantly and recurrently mutated genes (false discovery rate < 0.05) included PLEC (27%), MXRA5 (21%), FAT3 (24%), NF1 (20%), MDC1 (10%), TP53 (7%) and CHEK2 (6%). Further, in vitro and in vivo functional studies provided evidence for the tumor suppressor role for Neurofibromin 1 (NF1) gene in different subtypes of LPS. Pathway analysis of recurrent mutations demonstrated signaling through MAPK, JAK-STAT, Wnt, ErbB, axon guidance, apoptosis, DNA damage repair and cell cycle pathways were involved in liposarcomagenesis. Interestingly, we also found mutational and copy number heterogeneity within a primary LPS tumor signifying the importance of multi-region sequencing for cancer-genome guided therapy. In summary, these findings provide insight into the genomic complexity of LPS and highlight potential druggable pathways for targeted therapeutic approach.

  13. Development of a suspension microarray for the genotyping of African swine fever virus targeting the SNPs in the C-terminal end of the p72 gene region of the genome.

    Science.gov (United States)

    Leblanc, N; Cortey, M; Fernandez Pinero, J; Gallardo, C; Masembe, C; Okurut, A R; Heath, L; van Heerden, J; Sánchez-Vizcaino, J M; Ståhl, K; Belák, S

    2013-08-01

    African swine fever virus (ASFV) causes one of the most dreaded transboundary animal diseases (TADs) in Suidae. African swine fever (ASF) often causes high rates of morbidity and mortality, which can reach 100% in domestic swine. To date, serological diagnosis has the drawback of not being able to differentiate variants of this virus. Previous studies have identified the 22 genotypes based on sequence variation in the C-terminal region of the p72 gene, which has become the standard for categorizing ASFVs. This article describes a genotyping assay developed using a segment of PCR-amplified genomic DNA of approximately 450 bp, which encompasses the C-terminal end of the p72 gene. Complementary paired DNA probes of 15 or 17 bp in length, which are identical except for a single nucleotide polymorphism (SNP) in the central position, were designed to either individually or in combination differentiate between the 22 genotypes. The assay was developed using xMAP technology; probes were covalently linked to microspheres, hybridized to PCR product, labelled with a reporter and read in the Luminex 200 analyzer. Characterization of the sample was performed by comparing fluorescence of the paired SNP probes, that is, the probe with higher fluorescence in a complementary pair identified the SNP that a particular sample possessed. In the final assay, a total of 52 probes were employed, 24 SNP pairs and 4 for general detection. One or more samples from each of the 22 genotypes were tested. The assay was able to detect and distinguish all 22 genotypes. This novel assay provides a powerful novel tool for the simultaneous rapid diagnosis and genotypic differentiation of ASF.

  14. Identification of Sesame Genomic Variations from Genome Comparison of Landrace and Variety.

    Science.gov (United States)

    Wei, Xin; Zhu, Xiaodong; Yu, Jingyin; Wang, Linhai; Zhang, Yanxin; Li, Donghua; Zhou, Rong; Zhang, Xiurong

    2016-01-01

    Sesame (Sesamum indicum L.) is one of the main oilseed crops, providing vegetable oil and protein to human. Landrace is the gene source of variety, carrying many desire alleles for genetic improvement. Despite the importance of sesame landrace, genome of sesame landrace remains unexplored and genomic variations between landrace and variety still is not clear. To identify the genomic variations between sesame landrace and variety, two representative sesame landrace accessions, "Baizhima" and "Mishuozhima," were selected and re-sequenced. The genome sequencing and de novo assembling of the two sesame landraces resulted in draft genomes of 267 Mb and 254 Mb, respectively, with the contig N50 more than 47 kb. Totally, 1,332,025 SNPs and 506,245 InDels were identified from the genome of "Baizhima" and "Mishuozhima" by comparison of the genome of a variety "Zhongzhi13." Among the genomic variations, 70,018 SNPs and 8311 InDels were located in the coding regions of genes. Genomic variations may contribute to variation of sesame agronomic traits such as flowering time, plant height, and oil content. The identified genomic variations were successfully used in the QTL mapping and the black pigment synthesis gene, PPO, was found to be the candidate gene of sesame seed coat color. The comprehensively compared genomes of sesame landrace and modern variety produced massive useful genomic information, constituting a powerful tool to support genetic research, and molecular breeding of sesame.

  15. Combining image analysis, genome wide association studies and different field trials to reveal stable genetic regions related to panicle architecture and the number of spikelets per panicle in rice

    Directory of Open Access Journals (Sweden)

    Maria Camila Rebolledo

    2016-09-01

    Full Text Available Number of spikelets per panicle (NSP is a key trait to increase yield potential in rice (O. sativa. The architecture of the rice inflorescence which is mainly determined by the length and number of primary (PBL and PBN and secondary (SBL and SBN branches can influence NSP. Although several genes controlling panicle architecture and NSP in rice have been identified, there is little evidence of (i the genetic control of panicle architecture and NSP in different environments and (ii the presence of stable genetic associations with panicle architecture across environments. This study combines image phenotyping of 225 accessions belonging to a genetic diversity array of indica rice grown under irrigated field condition in two different environments and Genome Wide Association Studies (GWAS based on the genotyping of the diversity panel, providing 83,374 SNPs.Accessions sown under direct seeding in one environement had reduced Panicle Length (PL, NSP, PBN, PBL, SBN and SBL compared to those established under transplanting in the second environment. Across environments, NSP was significantly and positively correlated with PBN, SBN and PBL. However, the length of branches (PBL and SBL was not significantly correlated with variables related to number of branches (PBN and SBN , suggesting independent genetic control.Twenty- three GWAS sites were detected with P-values ≤ 1.0E-04 and 27 GWAS sites with p ≤ 5.9E-04. We found 17 GWAS sites related to NSP, 10 for PBN and 11 for SBN, 7 for PBL and 11 for SBL. This study revealed new regions related to NSP, but only three associations were related to both branching number (PBN and SBN and NSP. Two GWAS sites associated with SBL and SBN were stable across contrasting environments and were not related to genes previously reported.The new regions reported in this study can help improving NSP in rice for both direct seeded and transplanted conditions. The integrated approach of high-throughput phenotyping, multi

  16. The Oxytricha trifallax macronuclear genome: a complex eukaryotic genome with 16,000 tiny chromosomes.

    Directory of Open Access Journals (Sweden)

    Estienne C Swart

    Full Text Available The macronuclear genome of the ciliate Oxytricha trifallax displays an extreme and unique eukaryotic genome architecture with extensive genomic variation. During sexual genome development, the expressed, somatic macronuclear genome is whittled down to the genic portion of a small fraction (∼5% of its precursor "silent" germline micronuclear genome by a process of "unscrambling" and fragmentation. The tiny macronuclear "nanochromosomes" typically encode single, protein-coding genes (a small portion, 10%, encode 2-8 genes, have minimal noncoding regions, and are differentially amplified to an average of ∼2,000 copies. We report the high-quality genome assembly of ∼16,000 complete nanochromosomes (∼50 Mb haploid genome size that vary from 469 bp to 66 kb long (mean ∼3.2 kb and encode ∼18,500 genes. Alternative DNA fragmentation processes ∼10% of the nanochromosomes into multiple isoforms that usually encode complete genes. Nucleotide diversity in the macronucleus is very high (SNP heterozygosity is ∼4.0%, suggesting that Oxytricha trifallax may have one of the largest known effective population sizes of eukaryotes. Comparison to other ciliates with nonscrambled genomes and long macronuclear chromosomes (on the order of 100 kb suggests several candidate proteins that could be involved in genome rearrangement, including domesticated MULE and IS1595-like DDE transposases. The assembly of the highly fragmented Oxytricha macronuclear genome is the first completed genome with such an unusual architecture. This genome sequence provides tantalizing glimpses into novel molecular biology and evolution. For example, Oxytricha maintains tens of millions of telomeres per cell and has also evolved an intriguing expansion of telomere end-binding proteins. In conjunction with the micronuclear genome in progress, the O. trifallax macronuclear genome will provide an invaluable resource for investigating programmed genome rearrangements, complementing

  17. The Arabidopsis lyrata genome sequence and the basis of rapid genome size change

    Energy Technology Data Exchange (ETDEWEB)

    Hu, Tina T.; Pattyn, Pedro; Bakker, Erica G.; Cao, Jun; Cheng, Jan-Fang; Clark, Richard M.; Fahlgren, Noah; Fawcett, Jeffrey A.; Grimwood, Jane; Gundlach, Heidrun; Haberer, Georg; Hollister, Jesse D.; Ossowski, Stephan; Ottilar, Robert P.; Salamov, Asaf A.; Schneeberger, Korbinian; Spannagl, Manuel; Wang, Xi; Yang, Liang; Nasrallah, Mikhail E.; Bergelson, Joy; Carrington, James C.; Gaut, Brandon S.; Schmutz, Jeremy; Mayer, Klaus F. X.; Van de Peer, Yves; Grigoriev, Igor V.; Nordborg, Magnus; Weigel, Detlef; Guo, Ya-Long

    2011-04-29

    In our manuscript, we present a high-quality genome sequence of the Arabidopsis thaliana relative, Arabidopsis lyrata, produced by dideoxy sequencing. We have performed the usual types of genome analysis (gene annotation, dN/dS studies etc. etc.), but this is relegated to the Supporting Information. Instead, we focus on what was a major motivation for sequencing this genome, namely to understand how A. thaliana lost half its genome in a few million years and lived to tell the tale. The rather surprising conclusion is that there is not a single genomic feature that accounts for the reduced genome, but that every aspect centromeres, intergenic regions, transposable elements, gene family number is affected through hundreds of thousands of cuts. This strongly suggests that overall genome size in itself is what has been under selection, a suggestion that is strongly supported by our demonstration (using population genetics data from A. thaliana) that new deletions seem to be driven to fixation.

  18. Genomic heritability: what is it?

    Directory of Open Access Journals (Sweden)

    Gustavo de Los Campos

    2015-05-01

    Full Text Available Whole-genome regression methods are being increasingly used for the analysis and prediction of complex traits and diseases. In human genetics, these methods are commonly used for inferences about genetic parameters, such as the amount of genetic variance among individuals or the proportion of phenotypic variance that can be explained by regression on molecular markers. This is so even though some of the assumptions commonly adopted for data analysis are at odds with important quantitative genetic concepts. In this article we develop theory that leads to a precise definition of parameters arising in high dimensional genomic regressions; we focus on the so-called genomic heritability: the proportion of variance of a trait that can be explained (in the population by a linear regression on a set of markers. We propose a definition of this parameter that is framed within the classical quantitative genetics theory and show that the genomic heritability and the trait heritability parameters are equal only when all causal variants are typed. Further, we discuss how the genomic variance and genomic heritability, defined as quantitative genetic parameters, relate to parameters of statistical models commonly used for inferences, and indicate potential inferential problems that are assessed further using simulations. When a large proportion of the markers used in the analysis are in LE with QTL the likelihood function can be misspecified. This can induce a sizable finite-sample bias and, possibly, lack of consistency of likelihood (or Bayesian estimates. This situation can be encountered if the individuals in the sample are distantly related and linkage disequilibrium spans over short regions. This bias does not negate the use of whole-genome regression models as predictive machines; however, our results indicate that caution is needed when using marker-based regressions for inferences about population parameters such as the genomic heritability.

  19. Genomics of sex determination.

    Science.gov (United States)

    Zhang, Jisen; Boualem, Adnane; Bendahmane, Abdelhafid; Ming, Ray

    2014-04-01

    Sex determination is a major switch in the evolutionary history of angiosperm, resulting 11% monoecious and dioecious species. The genomic sequences of papaya sex chromosomes unveiled the molecular basis of recombination suppression in the sex determination region, and candidate genes for sex determination. Identification and analyses of sex determination genes in cucurbits and maize demonstrated conservation of sex determination mechanism in one lineage and divergence between the two systems. Epigenetic control and hormonal influence of sex determination were elucidated in both plants and animals. Intensive investigation of potential sex determination genes in model species will improve our understanding of sex determination gene network. Such network will in turn accelerate the identification of sex determination genes in dioecious species with sex chromosomes, which are burdensome due to no recombination in sex determining regions. The sex determination genes in dioecious species are crucial for understanding the origin of dioecy and sex chromosomes, particularly in their early stage of evolution.

  20. Simple sequence repeats in mycobacterial genomes

    Indian Academy of Sciences (India)

    Vattipally B Sreenu; Pankaj Kumar; Javaregowda Nagaraju; Hampapathalu A Nagarajaram

    2007-01-01

    Simple sequence repeats (SSRs) or microsatellites are the repetitive nucleotide sequences of motifs of length 1–6 bp. They are scattered throughout the genomes of all the known organisms ranging from viruses to eukaryotes. Microsatellites undergo mutations in the form of insertions and deletions (INDELS) of their repeat units with some bias towards insertions that lead to microsatellite tract expansion. Although prokaryotic genomes derive some plasticity due to microsatellite mutations they have in-built mechanisms to arrest undue expansions of microsatellites and one such mechanism is constituted by post-replicative DNA repair enzymes MutL, MutH and MutS. The mycobacterial genomes lack these enzymes and as a null hypothesis one could expect these genomes to harbour many long tracts. It is therefore interesting to analyse the mycobacterial genomes for distribution and abundance of microsatellites tracts and to look for potentially polymorphic microsatellites. Available mycobacterial genomes, Mycobacterium avium, M. leprae, M. bovis and the two strains of M. tuberculosis (CDC1551 and H37Rv) were analysed for frequencies and abundance of SSRs. Our analysis revealed that the SSRs are distributed throughout the mycobacterial genomes at an average of 220–230 SSR tracts per kb. All the mycobacterial genomes contain few regions that are conspicuously denser or poorer in microsatellites compared to their expected genome averages. The genomes distinctly show scarcity of long microsatellites despite the absence of a post-replicative DNA repair system. Such severe scarcity of long microsatellites could arise as a result of strong selection pressures operating against long and unstable sequences although influence of GC-content and role of point mutations in arresting microsatellite expansions can not be ruled out. Nonetheless, the long tracts occasionally found in coding as well as non-coding regions may account for limited genome plasticity in these genomes.

  1. Genome size evolution in pufferfish: an insight from BAC clone-based Diodon holocanthus genome sequencing

    Directory of Open Access Journals (Sweden)

    Gan Xiaoni

    2010-06-01

    Full Text Available Abstract Background Variations in genome size within and between species have been observed since the 1950 s in diverse taxonomic groups. Serving as model organisms, smooth pufferfish possess the smallest vertebrate genomes. Interestingly, spiny pufferfish from its sister family have genome twice as large as smooth pufferfish. Therefore, comparative genomic analysis between smooth pufferfish and spiny pufferfish is useful for our understanding of genome size evolution in pufferfish. Results Ten BAC clones of a spiny pufferfish Diodon holocanthus were randomly selected and shotgun sequenced. In total, 776 kb of non-redundant sequences without gap representing 0.1% of the D. holocanthus genome were identified, and 77 distinct genes were predicted. In the sequenced D. holocanthus genome, 364 kb is homologous with 265 kb of the Takifugu rubripes genome, and 223 kb is homologous with 148 kb of the Tetraodon nigroviridis genome. The repetitive DNA accounts for 8% of the sequenced D. holocanthus genome, which is higher than that in the T. rubripes genome (6.89% and that in the Te. nigroviridis genome (4.66%. In the repetitive DNA, 76% is retroelements which account for 6% of the sequenced D. holocanthus genome and belong to known families of transposable elements. More than half of retroelements were distributed within genes. In the non-homologous regions, repeat element proportion in D. holocanthus genome increased to 10.6% compared with T. rubripes and increased to 9.19% compared with Te. nigroviridis. A comparison of 10 well-defined orthologous genes showed that the average intron size (566 bp in D. holocanthus genome is significantly longer than that in the smooth pufferfish genome (435 bp. Conclusion Compared with the smooth pufferfish, D. holocanthus has a low gene density and repeat elements rich genome. Genome size variation between D. holocanthus and the smooth pufferfish exhibits as length variation between homologous region and different

  2. Rodent malaria parasites : genome organization & comparative genomics

    NARCIS (Netherlands)

    Kooij, Taco W.A.

    2006-01-01

    The aim of the studies described in this thesis was to investigate the genome organization of rodent malaria parasites (RMPs) and compare the organization and gene content of the genomes of RMPs and the human malaria parasite P. falciparum. The release of the complete genome sequence of P. falciparu

  3. A new database (GCD) on genome composition for eukaryote and prokaryote genome sequences and their initial analyses.

    Science.gov (United States)

    Kryukov, Kirill; Sumiyama, Kenta; Ikeo, Kazuho; Gojobori, Takashi; Saitou, Naruya

    2012-01-01

    Eukaryote genomes contain many noncoding regions, and they are quite complex. To understand these complexities, we constructed a database, Genome Composition Database, for the whole genome composition statistics for 101 eukaryote genome data, as well as more than 1,000 prokaryote genomes. Frequencies of all possible one to ten oligonucleotides were counted for each genome, and these observed values were compared with expected values computed under observed oligonucleotide frequencies of length 1-4. Deviations from expected values were much larger for eukaryotes than prokaryotes, except for fungal genomes. Mammalian genomes showed the largest deviation among animals. The results of comparison are available online at http://esper.lab.nig.ac.jp/genome-composition-database/.

  4. Sinbase: an integrated database to study genomics, genetics and comparative genomics in Sesamum indicum.

    Science.gov (United States)

    Wang, Linhai; Yu, Jingyin; Li, Donghua; Zhang, Xiurong

    2015-01-01

    Sesame (Sesamum indicum L.) is an ancient and important oilseed crop grown widely in tropical and subtropical areas. It belongs to the gigantic order Lamiales, which includes many well-known or economically important species, such as olive (Olea europaea), leonurus (Leonurus japonicus) and lavender (Lavandula spica), many of which have important pharmacological properties. Despite their importance, genetic and genomic analyses on these species have been insufficient due to a lack of reference genome information. The now available S. indicum genome will provide an unprecedented opportunity for studying both S. indicum genetic traits and comparative genomics. To deliver S. indicum genomic information to the worldwide research community, we designed Sinbase, a web-based database with comprehensive sesame genomic, genetic and comparative genomic information. Sinbase includes sequences of assembled sesame pseudomolecular chromosomes, protein-coding genes (27,148), transposable elements (372,167) and non-coding RNAs (1,748). In particular, Sinbase provides unique and valuable information on colinear regions with various plant genomes, including Arabidopsis thaliana, Glycine max, Vitis vinifera and Solanum lycopersicum. Sinbase also provides a useful search function and data mining tools, including a keyword search and local BLAST service. Sinbase will be updated regularly with new features, improvements to genome annotation and new genomic sequences, and is freely accessible at http://ocri-genomics.org/Sinbase/.

  5. Funding Opportunity: Genomic Data Centers

    Science.gov (United States)

    Funding Opportunity CCG, Funding Opportunity Center for Cancer Genomics, CCG, Center for Cancer Genomics, CCG RFA, Center for cancer genomics rfa, genomic data analysis network, genomic data analysis network centers,

  6. Ontology for Genome Comparison and Genomic Rearrangements

    Directory of Open Access Journals (Sweden)

    Anil Wipat

    2006-04-01

    Full Text Available We present an ontology for describing genomes, genome comparisons, their evolution and biological function. This ontology will support the development of novel genome comparison algorithms and aid the community in discussing genomic evolution. It provides a framework for communication about comparative genomics, and a basis upon which further automated analysis can be built. The nomenclature defined by the ontology will foster clearer communication between biologists, and also standardize terms used by data publishers in the results of analysis programs. The overriding aim of this ontology is the facilitation of consistent annotation of genomes through computational methods, rather than human annotators. To this end, the ontology includes definitions that support computer analysis and automated transfer of annotations between genomes, rather than relying upon human mediation.

  7. Genome Mapping in Plant Comparative Genomics.

    Science.gov (United States)

    Chaney, Lindsay; Sharp, Aaron R; Evans, Carrie R; Udall, Joshua A

    2016-09-01

    Genome mapping produces fingerprints of DNA sequences to construct a physical map of the whole genome. It provides contiguous, long-range information that complements and, in some cases, replaces sequencing data. Recent advances in genome-mapping technology will better allow researchers to detect large (>1kbp) structural variations between plant genomes. Some molecular and informatics complications need to be overcome for this novel technology to achieve its full utility. This technology will be useful for understanding phenotype responses due to DNA rearrangements and will yield insights into genome evolution, particularly in polyploids. In this review, we outline recent advances in genome-mapping technology, including the processes required for data collection and analysis, and applications in plant comparative genomics.

  8. Enabling functional genomics with genome engineering.

    Science.gov (United States)

    Hilton, Isaac B; Gersbach, Charles A

    2015-10-01

    Advances in genome engineering technologies have made the precise control over genome sequence and regulation possible across a variety of disciplines. These tools can expand our understanding of fundamental biological processes and create new opportunities for therapeutic designs. The rapid evolution of these methods has also catalyzed a new era of genomics that includes multiple approaches to functionally characterize and manipulate the regulation of genomic information. Here, we review the recent advances of the most widely adopted genome engineering platforms and their application to functional genomics. This includes engineered zinc finger proteins, TALEs/TALENs, and the CRISPR/Cas9 system as nucleases for genome editing, transcription factors for epigenome editing, and other emerging applications. We also present current and potential future applications of these tools, as well as their current limitations and areas for future advances.

  9. Characterization of genetic rearrangements in esophageal squamous carcinoma cell lines by a combination of M-FISH and array-CGH: further confirmation of some split genomic regions in primary tumors

    Directory of Open Access Journals (Sweden)

    Hao Jia-Jie

    2012-08-01

    Full Text Available Abstract Background Chromosomal and genomic aberrations are common features of human cancers. However, chromosomal numerical and structural aberrations, breakpoints and disrupted genes have yet to be identified in esophageal squamous cell carcinoma (ESCC. Methods Using multiplex-fluorescence in situ hybridization (M-FISH and oligo array-based comparative hybridization (array-CGH, we identified aberrations and breakpoints in six ESCC cell lines. Furthermore, we detected recurrent breakpoints in primary tumors by dual-color FISH. Results M-FISH and array-CGH results revealed complex numerical and structural aberrations. Frequent gains occurred at 3q26.33-qter, 5p14.1-p11, 7pter-p12.3, 8q24.13-q24.21, 9q31.1-qter, 11p13-p11, 11q11-q13.4, 17q23.3-qter, 18pter-p11, 19 and 20q13.32-qter. Losses were frequent at 18q21.1-qter. Breakpoints that clustered within 1 or 2 Mb were identified, including 9p21.3, 11q13.3-q13.4, 15q25.3 and 3q28. By dual-color FISH, we observed that several recurrent breakpoint regions in cell lines were also present in ESCC tumors. In particular, breakpoints clustered at 11q13.3-q13.4 were identified in 43.3% (58/134 of ESCC tumors. Both 11q13.3-q13.4 splitting and amplification were significantly correlated with lymph node metastasis (LNM (P = 0.004 and 0.022 and advanced stages (P = 0.004 and 0.039. Multivariate logistic regression analysis revealed that only 11q13.3-q13.4 splitting was an independent predictor for LNM (P = 0.026. Conclusions The combination of M-FISH and array-CGH helps produce more accurate karyotypes. Our data provide significant, detailed information for appropriate uses of these ESCC cell lines for cytogenetic and molecular biological studies. The aberrations and breakpoints detected in both the cell lines and primary tumors will contribute to identify affected genes involved in the development and progression of ESCC.

  10. The lincRNA HOTAIRM1, located in the HOXA genomic region, is expressed in acute myeloid leukemia, impacts prognosis in patients in the intermediate-risk cytogenetic category, and is associated with a distinctive microRNA signature

    Science.gov (United States)

    Díaz-Beyá, Marina; Brunet, Salut; Nomdedéu, Josep; Pratcorona, Marta; Cordeiro, Anna; Gallardo, David; Escoda, Lourdes; Tormo, Mar; Heras, Inmaculada; Ribera, Josep Maria; Duarte, Rafael; de Llano, María Paz Queipo; Bargay, Joan; Sampol, Antonia; Nomdedeu, Mertixell; Risueño, Ruth M.; Hoyos, Montserrat; Sierra, Jorge; Monzo, Mariano; Navarro, Alfons; Esteve, Jordi

    2015-01-01

    Long non-coding RNAs (lncRNAs) are deregulated in several tumors, although their role in acute myeloid leukemia (AML) is mostly unknown. We have examined the expression of the lncRNA HOX antisense intergenic RNA myeloid 1 (HOTAIRM1) in 241 AML patients. We have correlated HOTAIRM1 expression with a miRNA expression profile. We have also analyzed the prognostic value of HOTAIRM1 expression in 215 intermediate-risk AML (IR-AML) patients. The lowest expression level was observed in acute promyelocytic leukemia (P < 0.001) and the highest in t(6;9) AML (P = 0.005). In 215 IR-AML patients, high HOTAIRM1 expression was independently associated with shorter overall survival (OR:2.04;P = 0.001), shorter leukemia-free survival (OR:2.56; P < 0.001) and a higher cumulative incidence of relapse (OR:1.67; P = 0.046). Moreover, HOTAIRM1 maintained its independent prognostic value within the favorable molecular subgroup (OR: 3.43; P = 0.009). Interestingly, HOTAIRM1 was overexpressed in NPM1-mutated AML (P < 0.001) and within this group retained its prognostic value (OR: 2.21; P = 0.01). Moreover, HOTAIRM1 expression was associated with a specific 33- microRNA signature that included miR-196b (P < 0.001). miR-196b is located in the HOX genomic region and has previously been reported to have an independent prognostic value in AML. miR-196b and HOTAIRM1 in combination as a prognostic factor can classify patients as high-, intermediate-, or low-risk (5-year OS: 24% vs 42% vs 70%; P = 0.004). Determination of HOTAIRM1 level at diagnosis provided relevant prognostic information in IR-AML and allowed refinement of risk stratification based on common molecular markers. The prognostic information provided by HOTAIRM1 was strengthened when combined with miR-196b expression. Furthermore, HOTAIRM1 correlated with a 33-miRNA signature. PMID:26436590

  11. The complete mitochondrial genome of the Yorkshire pig (Sus scrofa).

    Science.gov (United States)

    Xu, Dong; Yang, Hu; Ma, Haiming

    2016-01-01

    This study aims to identify the complete nucleotide sequence of mitochondrial genome in the Yorkshire pig. Sequence analysis indicates that the genome structure is in accordance with other pig breeds, and it contains 22 tRNA genes, 2 ribosomal RNA genes, 13 protein-coding genes and 1 control region (D-loop region). The complete mitochondrial genome sequence of the Yorkshire pig provides an important record set for further study on genetic mechanism.

  12. Exploring Other Genomes: Bacteria.

    Science.gov (United States)

    Flannery, Maura C.

    2001-01-01

    Points out the importance of genomes other than the human genome project and provides information on the identified bacterial genomes Pseudomonas aeuroginosa, Leprosy, Cholera, Meningitis, Tuberculosis, Bubonic Plague, and plant pathogens. Considers the computer's use in genome studies. (Contains 14 references.) (YDS)

  13. Long- and short-term selective forces on malaria parasite genomes

    DEFF Research Database (Denmark)

    Nygaard, Sanne; Braunstein, Alexander; Malsen, Gareth;

    2010-01-01

    a significant impact on malaria control, the selective pressures within Plasmodium genomes are poorly understood, particularly in the non-protein-coding portion of the genome. We use evolutionary methods to describe selective processes in both the coding and non-coding regions of these genomes. Based on genome...

  14. GENOMIC MEDICINE

    Directory of Open Access Journals (Sweden)

    Ignacio Briceño Balcázar

    2011-03-01

    Full Text Available Until the twilight of the 20th century, genetics was a branch of medicine applied to diseases of rare occurrence. The advent of the human genome sequence and the possibility of studying it at affordable costs for patients and healthcare institutions, has permitted its application in high-priority diseases like cancer, cardiovascular disease, diabetes, and Alzheimer’s, among others.There is great potential in predictive and preventive medicine, through studying polymorphic genetic variants associated to risks for different diseases. Currently, clinical laboratories offer studies of over 30,000 variants associated with susceptibilities, to which individuals can access without much difficulty because a medical prescription is not required. These exams permit conducting a specific plan of preventive medicine. For example, upon the possibility of finding a deleterious mutation in the BRCA1 and BRCA2 genes, the patient can prevent the breast cancer by mastectomy or chemoprophylaxis and in the presence of polymorphisms associated to cardiovascular risk preventive action may be undertaken through changes in life style (diet, exercise, etc..Legal aspects are also present in this new conception of medicine. For example, currently there is legislation for medications to indicate on their labels the different responses such medication can offer regarding the genetic variants of the patients, given that similar doses may provoke adverse reactions in an individual, while for another such dosage may be insufficient. This scenario would allow verifying the polymorphisms of drug response prior to administering medications like anticoagulants, hyperlipidemia treatments, or chemotherapy, among others.We must specially mention recessive diseases, produced by the presence of two alleles of a mutated gene, which are inherited from the mother, as well as the father. By studying the mutations, we may learn if a couple is at risk of bearing children with the disease

  15. Genomic Medicine

    Directory of Open Access Journals (Sweden)

    Ignacio Briceño Balcázar

    2011-04-01

    Full Text Available Until the twilight of the 20th century, genetics was a branch of medicine applied to diseases of rare occurrence.  The advent of the human genome sequence and the possibility of studying it at affordable costs for patients and healthcare institutions, has permitted its application in high-priority diseases like cancer, cardiovascular disease, diabetes, and Alzheimer’s, among others. There is great potential in predictive and preventive medicine, through studying polymorphic genetic variants associated to risks for different diseases. Currently, clinical laboratories offer studies of over 30,000 variants associated with susceptibilities, to which individuals can access without much difficulty because a medical prescription is not required. These exams permit conducting a specific plan of preventive medicine.  For example, upon the possibility of finding a deleterious mutation in the BRCA1 and BRCA2 genes, the patient can prevent the breast cancer by mastectomy or chemoprophylaxis and in the presence of polymorphisms associated to cardiovascular risk preventive action may be undertaken through changes in life style (diet, exercise, etc.. Legal aspects are also present in this new conception of medicine.  For example, currently there is legislation for medications to indicate on their labels the different responses such medication can offer regarding the genetic variants of the patients, given that similar doses may provoke adverse reactions in an individual, while for another such dosage may be insufficient. This scenario would allow verifying the polymorphisms of drug response prior to administering medications like anticoagulants, hyperlipidemia treatments, or chemotherapy, among others. We must specially mention recessive diseases, produced by the presence of two alleles of a mutated gene, which are inherited from the mother, as well as the father. By studying the mutations, we may learn if a couple is at risk of bearing children with the

  16. Between Two Fern Genomes

    OpenAIRE

    Sessa, Emily B.; Banks, Jo; Michael S Barker; Der, Joshua P; Duffy, Aaron M; Graham, Sean W.; Hasebe, Mitsuyasu; Langdale, Jane; Li, Fay-Wei; Marchant, D; Kathleen M. Pryer; Rothfels, Carl J.; Roux, Stanley J.; Salmi, Mari L; Sigel, Erin M.

    2014-01-01

    Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, a