WorldWideScience

Sample records for analyses genomes comparison

  1. Use of gene sequence analyses and genome comparisons for yeast systematics

    Science.gov (United States)

    Detection, identification, and classification of yeasts has undergone a major transformation in the past decade and a half following application of gene sequence analyses and genome comparisons. Development of a database (barcode) of easily determined gene sequences from domains 1 and 2 of large sub...

  2. Comparison of analyses of the XVth QTLMAS common dataset III: Genomic Estimations of Breeding Values

    Directory of Open Access Journals (Sweden)

    Demeure Olivier

    2012-05-01

    Full Text Available Abstract Background The QTLMAS XVth dataset consisted of pedigree, marker genotypes and quantitative trait performances of animals with a sib family structure. Pedigree and genotypes concerned 3,000 progenies among those 2,000 were phenotyped. The trait was regulated by 8 QTLs which displayed additive, imprinting or epistatic effects. The 1,000 unphenotyped progenies were considered as candidates to selection and their Genomic Estimated Breeding Values (GEBV were evaluated by participants of the XVth QTLMAS workshop. This paper aims at comparing the GEBV estimation results obtained by seven participants to the workshop. Methods From the known QTL genotypes of each candidate, two "true" genomic values (TV were estimated by organizers: the genotypic value of the candidate (TGV and the expectation of its progeny genotypic values (TBV. GEBV were computed by the participants following different statistical methods: random linear models (including BLUP and Ridge Regression, selection variable techniques (LASSO, Elastic Net and Bayesian methods. Accuracy was evaluated by the correlation between TV (TGV or TBV and GEBV presented by participants. Rank correlation of the best 10% of individuals and error in predictions were also evaluated. Bias was tested by regression of TV on GEBV. Results Large differences between methods were found for all criteria and type of genetic values (TGV, TBV. In general, the criteria ranked consistently methods belonging to the same family. Conclusions Bayesian methods - A

  3. Comparison of Genome-Wide Association Methods in Analyses of Admixed Populations with Complex Familial Relationships

    DEFF Research Database (Denmark)

    Kadri, Naveen; Guldbrandtsen, Bernt; Sørensen, Peter;

    2014-01-01

    Population structure is known to cause false-positive detection in association studies. We compared the power, precision, and type-I error rates of various association models in analyses of a simulated dataset with structure at the population (admixture from two populations; P) and family (K......) levels. We also compared type-I error rates among models in analyses of publicly available human and dog datasets. The models corrected for none, one, or both structure levels. Correction for K was performed with linear mixed models incorporating familial relationships estimated from pedigrees or genetic...... markers. Linear models that ignored K were also tested. Correction for P was performed using principal component or structured association analysis. In analyses of simulated and real data, linear mixed models that corrected for K were able to control for type-I error, regardless of whether they also...

  4. Gene discovery and transcript analyses in the corn smut pathogen Ustilago maydis: expressed sequence tag and genome sequence comparison

    Directory of Open Access Journals (Sweden)

    Saville Barry J

    2007-09-01

    Full Text Available Abstract Background Ustilago maydis is the basidiomycete fungus responsible for common smut of corn and is a model organism for the study of fungal phytopathogenesis. To aid in the annotation of the genome sequence of this organism, several expressed sequence tag (EST libraries were generated from a variety of U. maydis cell types. In addition to utility in the context of gene identification and structure annotation, the ESTs were analyzed to identify differentially abundant transcripts and to detect evidence of alternative splicing and anti-sense transcription. Results Four cDNA libraries were constructed using RNA isolated from U. maydis diploid teliospores (U. maydis strains 518 × 521 and haploid cells of strain 521 grown under nutrient rich, carbon starved, and nitrogen starved conditions. Using the genome sequence as a scaffold, the 15,901 ESTs were assembled into 6,101 contiguous expressed sequences (contigs; among these, 5,482 corresponded to predicted genes in the MUMDB (MIPS Ustilago maydis database, while 619 aligned to regions of the genome not yet designated as genes in MUMDB. A comparison of EST abundance identified numerous genes that may be regulated in a cell type or starvation-specific manner. The transcriptional response to nitrogen starvation was assessed using RT-qPCR. The results of this suggest that there may be cross-talk between the nitrogen and carbon signalling pathways in U. maydis. Bioinformatic analysis identified numerous examples of alternative splicing and anti-sense transcription. While intron retention was the predominant form of alternative splicing in U. maydis, other varieties were also evident (e.g. exon skipping. Selected instances of both alternative splicing and anti-sense transcription were independently confirmed using RT-PCR. Conclusion Through this work: 1 substantial sequence information has been provided for U. maydis genome annotation; 2 new genes were identified through the discovery of 619

  5. The complete mitochondrial genome of Strongylus equinus (Chromadorea: Strongylidae): Comparison with other closely related species and phylogenetic analyses.

    Science.gov (United States)

    Xu, Wen-Wen; Qiu, Jian-Hua; Liu, Guo-Hua; Zhang, Yan; Liu, Ze-Xuan; Duan, Hong; Yue, Dong-Mei; Chang, Qiao-Cheng; Wang, Chun-Ren; Zhao, Xing-Cun

    2015-12-01

    The roundworms of genus Strongylus are the common parasitic nematodes in the large intestine of equine, causing significant economic losses to the livestock industries. In spite of its importance, the genetic data and epidemiology of this parasite are not entirely understood. In the present study, the complete S. equinus mitochondrial (mt) genome was determined. The length of S. equinus mt genome DNA sequence is 14,545 bp, containing 36 genes, of which 12 code for protein, 22 for transfer RNA, and two for ribosomal RNA, but lacks atp8 gene. All 36 genes are encoded in the same direction which is consistent with all other Chromadorea nematode mtDNAs published to date. Phylogenetic analysis based on concatenated amino acid sequence data of all 12 protein-coding genes showed that there were two large branches in the Strongyloidea nematodes, and S. equinus is genetically closer to S. vulgaris than to Cylicocyclus insignis in Strongylidae. This new mt genome provides a source of genetic markers for the molecular phylogeny and population genetics of equine strongyles. PMID:26366671

  6. Comparative genomic analyses in Asparagus.

    Science.gov (United States)

    Kuhl, Joseph C; Havey, Michael J; Martin, William J; Cheung, Foo; Yuan, Qiaoping; Landherr, Lena; Hu, Yi; Leebens-Mack, James; Town, Christopher D; Sink, Kenneth C

    2005-12-01

    Garden asparagus (Asparagus officinalis L.) belongs to the monocot family Asparagaceae in the order Asparagales. Onion (Allium cepa L.) and Asparagus officinalis are 2 of the most economically important plants of the core Asparagales, a well supported monophyletic group within the Asparagales. Coding regions in onion have lower GC contents than the grasses. We compared the GC content of 3374 unique expressed sequence tags (ESTs) from A. officinalis with Lycoris longituba and onion (both members of the core Asparagales), Acorus americanus (sister to all other monocots), the grasses, and Arabidopsis. Although ESTs in A. officinalis and Acorus had a higher average GC content than Arabidopsis, Lycoris, and onion, all were clearly lower than the grasses. The Asparagaceae have the smallest nuclear genomes among all plants in the core Asparagales, which typically have huge genomes. Within the Asparagaceae, European Asparagus species have approximately twice the nuclear DNA of that of southern African Asparagus species. We cloned and sequenced 20 genomic amplicons from European A. officinalis and the southern African species Asparagus plumosus and observed no clear evidence for a recent genome doubling in A. officinalis relative to A. plumosus. These results indicate that members of the genus Asparagus with smaller genomes may be useful genomic models for plants in the core Asparagales. PMID:16391674

  7. Genome-Facilitated Analyses of Geomicrobial Processes

    Energy Technology Data Exchange (ETDEWEB)

    Kenneth H. Nealson

    2012-05-02

    that makes up chitin, virtually all of the strains were in fact capable. This led to the discovery of a great many new genes involved with chitin and NAG metabolism (7). In a similar vein, a detailed study of the sugar utilization pathway revealed a major new insight into the regulation of sugar metabolism in this genus (19). Systems Biology and Comparative Genomics of the shewanellae: Several publications were put together describing the use of comparative genomics for analyses of the group Shewanella, and these were a logical culmination of our genomic-driven research (10,15,18). Eight graduate students received their Ph.D. degrees doing part of the work described here, and four postdoctoral fellows were supported. In addition, approximately 20 undergraduates took part in projects during the grant period.

  8. Comparison of the genome sequences and the phylogenetic analyses of the GP78 and the Vellore P20778 isolates of Japanese encephalitis virus from India

    Indian Academy of Sciences (India)

    Sudhanshu Vrati

    2000-09-01

    The nucleotide sequence of the complete genomes of two Indian isolates of Japanese encephalitis virus were compared. One of these isolates, GP78 was obtained from northern India in 1978. The other, the Vellore P20778 isolate, was obtained from southern India in 1958. There was 4.40% nucleotide sequence divergence between the two Indian isolates that resulted in a 1.86% amino acid sequence divergence. Phylogenetic analyses showed that in evolutionary terms the north Indian GP78 isolate was close to the SA14 isolate from China whereas the south Indian Vellore P20778 isolate was close to the Beijing-1 isolate, also from China. The two Indian isolates, however, appear to have evolved independently.

  9. Analysing complex Triticeae genomes – concepts and strategies

    OpenAIRE

    Spannagl, Manuel; Martis, Mihaela M.; Pfeifer, Matthias; Nussbaumer, Thomas; Mayer, Klaus FX

    2013-01-01

    The genomic sequences of many important Triticeae crop species are hard to assemble and analyse due to their large genome sizes, (in part) polyploid genomes and high repeat content. Recently, the draft genomes of barley and bread wheat were reported thanks to cost-efficient and fast NGS technologies. The genome of barley is estimated to be 5 Gb in size whereas the genome of bread wheat accounts for 17 Gb and harbours an allo-hexaploid genome. Direct assembly of the sequence reads and access t...

  10. Genome size analyses of Pucciniales reveal the largest fungal genomes

    Directory of Open Access Journals (Sweden)

    Silvia eTavares

    2014-08-01

    Full Text Available Rust fungi (Basidiomycota, Pucciniales are biotrophic plant pathogens which exhibit diverse complexities in their life cycles and host ranges. The completion of genome sequencing of a few rust fungi has revealed the occurrence of large genomes. Sequencing efforts for other rust fungi have been hampered by uncertainty concerning their genome sizes. Flow cytometry was recently applied to estimate the genome size of a few rust fungi, and confirmed the occurrence of large genomes in this order (averaging 151.5 Mbp, while the average for Basidiomycota was 49.9 Mbp and was 37.7 Mbp for all fungi. In this work, we have used an innovative and simple approach to simultaneously isolate nuclei from the rust and its host plant in order to estimate the genome size of 30 rust species by flow cytometry. Genome sizes varied over 10-fold, from 70 to 893 Mbp, with an average genome size value of 380.2 Mbp. Compared to the genome sizes of over 1,800 fungi, Gymnosporangium confusum possesses the largest fungal genome ever reported (893.2 Mbp. Moreover, even the smallest rust genome determined in this study is larger than the vast majority of fungal genomes (94 %. The average genome size of the Pucciniales is now of 305.5 Mbp, while the average Basidiomycota genome size has shifted to 70.4 Mbp and the average for all fungi reached 44.2 Mbp. Despite the fact that no correlation could be drawn between the genome sizes, the phylogenomics or the life cycle of rust fungi, it is interesting to note that rusts with Fabaceae hosts present genomes clearly larger than those with Poaceae hosts. Although this study comprises only a small fraction of the more than 7,000 rust species described, it seems already evident that the Pucciniales represent a group where genome size expansion could be a common characteristic. This is in sharp contrast to sister taxa, placing this order in a relevant position in fungal genomics research.

  11. BEACON: automated tool for Bacterial GEnome Annotation ComparisON

    KAUST Repository

    Kalkatawi, Manal Matoq Saeed

    2015-08-18

    Background Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). Results The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACON’s utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27 %, while the number of genes without any function assignment is reduced. Conclusions We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/

  12. Cactus Graphs for Genome Comparisons

    Science.gov (United States)

    Paten, Benedict; Diekhans, Mark; Earl, Dent; St. John, John; Ma, Jian; Suh, Bernard; Haussler, David

    We introduce a data structure, analysis and visualization scheme called a cactus graph for comparing sets of related genomes. Cactus graphs capture some of the advantages of de Bruijn and breakpoint graphs in one unified framework. They naturally decompose the common substructures in a set of related genomes into a hierarchy of chains that can be visualized as multiple alignments and nets that can be visualized in circular genome plots.

  13. Insights from Human/Mouse genome comparisons

    Energy Technology Data Exchange (ETDEWEB)

    Pennacchio, Len A.

    2003-03-30

    Large-scale public genomic sequencing efforts have provided a wealth of vertebrate sequence data poised to provide insights into mammalian biology. These include deep genomic sequence coverage of human, mouse, rat, zebrafish, and two pufferfish (Fugu rubripes and Tetraodon nigroviridis) (Aparicio et al. 2002; Lander et al. 2001; Venter et al. 2001; Waterston et al. 2002). In addition, a high-priority has been placed on determining the genomic sequence of chimpanzee, dog, cow, frog, and chicken (Boguski 2002). While only recently available, whole genome sequence data have provided the unique opportunity to globally compare complete genome contents. Furthermore, the shared evolutionary ancestry of vertebrate species has allowed the development of comparative genomic approaches to identify ancient conserved sequences with functionality. Accordingly, this review focuses on the initial comparison of available mammalian genomes and describes various insights derived from such analysis.

  14. Whole-Genome Analyses of lung function, height and smoking

    DEFF Research Database (Denmark)

    Janss, Luc; Sigsgaard, Torben; Sorensen, Daniel

    2014-01-01

    quantify the relative contributions of genomic and environmental factors to the relationship between FEV1 and height. Smoking status was analysed using a probit random regression model and a second goal of the study was to estimate the genomic heritability of smoking status. Estimates of genomic...... heritabilities for height and FEV1 are equal to 0.47 and to 0.30, respectively. The estimates of the genomic and environmental correlations between height and FEV1 are 0.78 and 0.34, respectively. The posterior mean of the genomic heritability of smoking status is equal to 0.14 and provides evidence for the...... presence of genetic factors associated with the trait. Under the data augmentation strategy introduced, the joint posterior distribution of FEV1 and height factorises into two independent posterior distributions. This simplifies programming and results in excellent numerical behaviour. The approach can be...

  15. Hal: an Automated Pipeline for Phylogenetic Analyses of Genomic Data

    OpenAIRE

    Robbertse, Barbara; Yoder, Ryan J.; Boyd, Alex; Reeves, John; Spatafora, Joseph W.

    2011-01-01

    The rapid increase in genomic and genome-scale data is resulting in unprecedented levels of discrete sequence data available for phylogenetic analyses. Major analytical impasses exist, however, prior to analyzing these data with existing phylogenetic software. Obstacles include the management of large data sets without standardized naming conventions, identification and filtering of orthologous clusters of proteins or genes, and the assembly of alignments of orthologous sequence data into ind...

  16. Sequencing and comparative analyses of the genomes of zoysiagrasses.

    Science.gov (United States)

    Tanaka, Hidenori; Hirakawa, Hideki; Kosugi, Shunichi; Nakayama, Shinobu; Ono, Akiko; Watanabe, Akiko; Hashiguchi, Masatsugu; Gondo, Takahiro; Ishigaki, Genki; Muguerza, Melody; Shimizu, Katsuya; Sawamura, Noriko; Inoue, Takayasu; Shigeki, Yuichi; Ohno, Naoki; Tabata, Satoshi; Akashi, Ryo; Sato, Shusei

    2016-04-01

    Zoysiais a warm-season turfgrass, which comprises 11 allotetraploid species (2n= 4x= 40), each possessing different morphological and physiological traits. To characterize the genetic systems ofZoysiaplants and to analyse their structural and functional differences in individual species and accessions, we sequenced the genomes ofZoysiaspecies using HiSeq and MiSeq platforms. As a reference sequence ofZoysiaspecies, we generated a high-quality draft sequence of the genome ofZ. japonicaaccession 'Nagirizaki' (334 Mb) in which 59,271 protein-coding genes were predicted. In parallel, draft genome sequences ofZ. matrella'Wakaba' andZ. pacifica'Zanpa' were also generated for comparative analyses. To investigate the genetic diversity among theZoysiaspecies, genome sequence reads of three additional accessions,Z. japonica'Kyoto',Z. japonica'Miyagi' andZ. matrella'Chiba Fair Green', were accumulated, and aligned against the reference genome of 'Nagirizaki' along with those from 'Wakaba' and 'Zanpa'. As a result, we detected 7,424,163 single-nucleotide polymorphisms and 852,488 short indels among these species. The information obtained in this study will be valuable for basic studies on zoysiagrass evolution and genetics as well as for the breeding of zoysiagrasses, and is made available in the 'Zoysia Genome Database' athttp://zoysia.kazusa.or.jp. PMID:26975196

  17. Comparison of elastic and inelastic analyses

    International Nuclear Information System (INIS)

    The use of inelastic analysis methods instead of the traditional elastic analysis methods in the design of radioactive material (RAM) transport packagings leads to a better understanding of the response of the package to mechanical loadings. Thus, better assessment of the containment, thermal protection, and shielding integrity of the package after a structure accident event can be made. A more accurate prediction of the package response can lead to enhanced safety and also allow for a more efficient use of materials, possibly leading to a package with higher capacity or lower weight. This paper discusses the advantages and disadvantages of using inelastic analysis in the design of RAM shipping packages. The use of inelastic analysis presents several problems to the package designer. When using inelastic analysis the entire nonlinear response of the material must be known, including the effects of temperature changes and strain rate. Another problem is that there currently is not an acceptance criteria for this type of analysis that is approved by regulatory agencies. Inelastic analysis acceptance criteria based on failure stress, failure strain , or plastic energy density could be developed. For both elastic and inelastic analyses it is also important to include other sources of stress in the analyses, such as fabrication stresses, thermal stresses, stresses from bolt preloading, and contact stresses at material interfaces. Offsetting these added difficulties is the improved knowledge of the package behavior. This allows for incorporation of a more uniform margin of safety, which can result in weight savings and a higher level of confidence in the post-accident configuration of the package. In this paper, comparisons between elastic and inelastic analyses are made for a simple ring structure and for a package to transport a large quantity of RAM by rail (rail cask) with lead gamma shielding to illustrate the differences in the two analysis techniques

  18. Screening synteny blocks in pairwise genome comparisons through integer programming

    Directory of Open Access Journals (Sweden)

    Paterson Andrew H

    2011-04-01

    Full Text Available Abstract Background It is difficult to accurately interpret chromosomal correspondences such as true orthology and paralogy due to significant divergence of genomes from a common ancestor. Analyses are particularly problematic among lineages that have repeatedly experienced whole genome duplication (WGD events. To compare multiple "subgenomes" derived from genome duplications, we need to relax the traditional requirements of "one-to-one" syntenic matchings of genomic regions in order to reflect "one-to-many" or more generally "many-to-many" matchings. However this relaxation may result in the identification of synteny blocks that are derived from ancient shared WGDs that are not of interest. For many downstream analyses, we need to eliminate weak, low scoring alignments from pairwise genome comparisons. Our goal is to objectively select subset of synteny blocks whose total scores are maximized while respecting the duplication history of the genomes in comparison. We call this "quota-based" screening of synteny blocks in order to appropriately fill a quota of syntenic relationships within one genome or between two genomes having WGD events. Results We have formulated the synteny block screening as an optimization problem known as "Binary Integer Programming" (BIP, which is solved using existing linear programming solvers. The computer program QUOTA-ALIGN performs this task by creating a clear objective function that maximizes the compatible set of synteny blocks under given constraints on overlaps and depths (corresponding to the duplication history in respective genomes. Such a procedure is useful for any pairwise synteny alignments, but is most useful in lineages affected by multiple WGDs, like plants or fish lineages. For example, there should be a 1:2 ploidy relationship between genome A and B if genome B had an independent WGD subsequent to the divergence of the two genomes. We show through simulations and real examples using plant genomes

  19. Whole-Genome Analyses of LUNG FUNCTION, HEIGHT and SMOKING

    OpenAIRE

    Janss, Luc; Sigsgaard, Torben; Sorensen, Daniel

    2014-01-01

    A joint analysis of FEV1 (Forced Expiratory Volume after one second) and height is reported using novel methodology, as well as a single-trait analysis of smoking status. A first goal of the study was to incorporate dense genetic marker information in a random regression (Bayesian) model to quantify the relative contributions of genomic and environmental factors to the relationship between FEV1 and height. Smoking status was analysed using a probit random regression model and a second goal of...

  20. Quantitative metagenomic analyses based on average genome size normalization

    DEFF Research Database (Denmark)

    Frank, Jeremy Alexander; Sørensen, Søren Johannes

    2011-01-01

    Over the past quarter-century, microbiologists have used DNA sequence information to aid in the characterization of microbial communities. During the last decade, this has expanded from single genes to microbial community genomics, or metagenomics, in which the gene content of an environment can...... provide not just a census of the community members but direct information on metabolic capabilities and potential interactions among community members. Here we introduce a method for the quantitative characterization and comparison of microbial communities based on the normalization of metagenomic data...... by estimating average genome sizes. This normalization can relieve comparative biases introduced by differences in community structure, number of sequencing reads, and sequencing read lengths between different metagenomes. We demonstrate the utility of this approach by comparing metagenomes from two different...

  1. Integrated syntenic and phylogenomic analyses reveal an ancient genome duplication in monocots.

    Science.gov (United States)

    Jiao, Yuannian; Li, Jingping; Tang, Haibao; Paterson, Andrew H

    2014-07-01

    Unraveling widespread polyploidy events throughout plant evolution is a necessity for inferring the impacts of whole-genome duplication (WGD) on speciation, functional innovations, and to guide identification of true orthologs in divergent taxa. Here, we employed an integrated syntenic and phylogenomic analyses to reveal an ancient WGD that shaped the genomes of all commelinid monocots, including grasses, bromeliads, bananas (Musa acuminata), ginger, palms, and other plants of fundamental, agricultural, and/or horticultural interest. First, comprehensive phylogenomic analyses revealed 1421 putative gene families that retained ancient duplication shared by Musa (Zingiberales) and grass (Poales) genomes, indicating an ancient WGD in monocots. Intergenomic synteny blocks of Musa and Oryza were investigated, and 30 blocks were shown to be duplicated before Musa-Oryza divergence an estimated 120 to 150 million years ago. Synteny comparisons of four monocot (rice [Oryza sativa], sorghum [Sorghum bicolor], banana, and oil palm [Elaeis guineensis]) and two eudicot (grape [Vitis vinifera] and sacred lotus [Nelumbo nucifera]) genomes also support this additional WGD in monocots, herein called Tau (τ). Integrating synteny and phylogenomic comparisons achieves better resolution of ancient polyploidy events than either approach individually, a principle that is exemplified in the disambiguation of a WGD series of rho (ρ)-sigma (σ)-tau (τ) in the grass lineages that echoes the alpha (α)-beta (β)-gamma (γ) series previously revealed in the Arabidopsis thaliana lineage. PMID:25082857

  2. Comparison with Russian analyses of meteor impact

    Energy Technology Data Exchange (ETDEWEB)

    Canavan, G.H.

    1997-06-01

    The inversion model for meteor impacts is used to discuss Russian analyses and compare principal results. For common input parameters, the models produce consistent estimates of impactor parameters. Directions for future research are discussed and prioritized.

  3. Comparative chloroplast genomics: Analyses including new sequencesfrom the angiosperms Nuphar advena and Ranunculus macranthus

    Energy Technology Data Exchange (ETDEWEB)

    Raubeso, Linda A.; Peery, Rhiannon; Chumley, Timothy W.; Dziubek,Chris; Fourcade, H. Matthew; Boore, Jeffrey L.; Jansen, Robert K.

    2007-03-01

    The number of completely sequenced plastid genomes available is growing rapidly. This new array of sequences presents new opportunities to perform comparative analyses. In comparative studies, it is most useful to compare across wide phylogenetic spans and, within angiosperms, to include representatives from basally diverging lineages such as the new genomes reported here: Nuphar advena (from a basal-most lineage) and Ranunculus macranthus (from the basal group of eudicots). We report these two new plastid genome sequences and make comparisons (within angiosperms, seed plants, or all photosynthetic lineages) to evaluate features such as the status of ycf15 and ycf68 as protein coding genes, the distribution of simple sequence repeats (SSRs) and longer dispersed repeats (SDR), and patterns of nucleotide composition.

  4. Comparison of veterinary import risk analyses studies

    NARCIS (Netherlands)

    Vos-de Jong, de C.J.; Conraths, F.J.; Adkin, A.; Jones, E.M.; Hallgren, G.S.; Paisley, L.G.

    2011-01-01

    Twenty-two veterinary import risk analyses (IRAs) were audited: a) for inclusion of the main elements of risk analysis; b) between different types of IRAs; c) between reviewers' scores. No significant differences were detected between different types of IRAs, although quantitative IRAs and IRAs publ

  5. Genome-wide analyses of small noncoding RNAs in streptococci

    Directory of Open Access Journals (Sweden)

    Nadja ePatenge

    2015-05-01

    Full Text Available Streptococci represent a diverse group of Gram-positive bacteria, which colonize a wide range of hosts among animals and humans. Streptococcal species occur as commensal as well as pathogenic organisms. Many of the pathogenic species can cause severe, invasive infections in their hosts leading to a high morbidity and mortality. The consequence is a tremendous suffering on the part of men and livestock besides the significant financial burden in the agricultural and healthcare sectors. An environmentally stimulated and tightly controlled expression of virulence factor genes is of fundamental importance for streptococcal pathogenicity. Bacterial small noncoding RNAs (sRNAs modulate the expression of genes involved in stress response, sugar metabolism, surface composition, and other properties that are related to bacterial virulence. Even though the regulatory character is shared by this class of RNAs, variation on the molecular level results in a high diversity of functional mechanisms. The knowledge about the role of sRNAs in streptococci is still limited, but in recent years, genome-wide screens for sRNAs have been conducted in an increasing number of species. Bioinformatics prediction approaches have been employed as well as expression analyses by classical array techniques or next generation sequencing. This review will give an overview of whole genome screens for sRNAs in streptococci with a focus on describing the different methods and comparing their outcome considering sRNA conservation among species, functional similarities, and relevance for streptococcal infection.

  6. Genome-wide comparisons of phylogenetic similarities between partial genomic regions and the full-length genome in Hepatitis E virus genotyping.

    Directory of Open Access Journals (Sweden)

    Shuai Wang

    Full Text Available Besides the complete genome, different partial genomic sequences of Hepatitis E virus (HEV have been used in genotyping studies, making it difficult to compare the results based on them. No commonly agreed partial region for HEV genotyping has been determined. In this study, we used a statistical method to evaluate the phylogenetic performance of each partial genomic sequence from a genome wide, by comparisons of evolutionary distances between genomic regions and the full-length genomes of 101 HEV isolates to identify short genomic regions that can reproduce HEV genotype assignments based on full-length genomes. Several genomic regions, especially one genomic region at the 3'-terminal of the papain-like cysteine protease domain, were detected to have relatively high phylogenetic correlations with the full-length genome. Phylogenetic analyses confirmed the identical performances between these regions and the full-length genome in genotyping, in which the HEV isolates involved could be divided into reasonable genotypes. This analysis may be of value in developing a partial sequence-based consensus classification of HEV species.

  7. Analysis of health trait data from on-farm computer systems in the U.S. II: Comparison of genomic analyses including two-stage and single-step methods

    Science.gov (United States)

    The development of genomic selection methodology, with accompanying substantial gains in reliability for low-heritability traits, may dramatically improve the feasibility of genetic improvement of dairy cow health. Many methods for genomic analysis have now been developed, including the “Bayesian Al...

  8. Genomic Analyses of Transport Proteins in Ralstonia metallidurans

    Directory of Open Access Journals (Sweden)

    Milton H. Saier

    2006-04-01

    Full Text Available Ralstonia (Wautersia, Cupriavidus metallidurans (Rme is better able to withstand high concentrations of heavy metals than any other well-studied organism. This fact renders it a potential agent of bioremediation as well as an ideal model organism for understanding metal resistance phenotypes. We have analysed the genome of Rme for genes encoding homologues of established and putative transport proteins; 13% of all genes in Rme encode such homologues. Nearly one-third of the transporters identified (32% appear to function in inorganic ion transport with three-quarters of these acting on cations. Transporters specific for amino acids outnumber sugar transporters nearly 3 : 1, and this fact plus the large number of uptake systems for organic acids indicates the heterotrophic preferences of these bacteria. Putative drug efflux pumps comprise 10% of the encoded transporters, but numerous efflux pumps for heavy metals, metabolites and macromolecules were also identified. The results presented should facilitate genetic manipulation and mechanistic studies of transport in this remarkable bacterium.

  9. The Burkholderia Genome Database: facilitating flexible queries and comparative analyses

    OpenAIRE

    Winsor, Geoffrey L.; Khaira, Bhavjinder; Van Rossum, Thea; Lo, Raymond; Whiteside, Matthew D.; Fiona S.L. Brinkman

    2008-01-01

    Summary: As the genome sequences of multiple strains of a given bacterial species are obtained, more generalized bacterial genome databases may be complemented by databases that are focused on providing more information geared for a distinct bacterial phylogenetic group and its associated research community. The Burkholderia Genome Database represents a model for such a database, providing a powerful, user-friendly search and comparative analysis interface that contains features not found in ...

  10. Whole-genome analyses of speciation events in pathogenic Brucellae

    Energy Technology Data Exchange (ETDEWEB)

    Chain, Patrick S. G. [Lawrence Livermore National Laboratory (LLNL); Comerci, Diego J. [Universidad Nacional de General San Martin; Tolmasky, Marcelo E. [California State University; Larimer, Frank W [ORNL; Malfatti, Stephanie [Lawrence Livermore National Laboratory (LLNL); Vergez, Lisa [Lawrence Livermore National Laboratory (LLNL); Aguero, Fernan [Universidad Nacional de General San Martin; Land, Miriam L [ORNL; Ugalde, Rodolfo A. [Universidad Nacional de General San Martin; Garcia, Emilio [Lawrence Livermore National Laboratory (LLNL)

    2005-12-01

    Despite their high DNA identity and a proposal to group classical Brucella species as biovars of Brucella melitensis, the commonly recognized Brucella species can be distinguished by distinct biochemical and fatty acid characters, as well as by a marked host range (e.g., Brucella suis for swine, B. melitensis for sheep and goats, and Brucella abortus for cattle). Here we present the genome of B. abortus 2308, the virulent prototype biovar 1 strain, and its comparison to the two other human pathogenic Brucella species and to B. abortus field isolate 9-941. The global distribution of pseudogenes, deletions, and insertions supports previous indications that B. abortus and B. melitensis share a common ancestor that diverged from B. suis. With the exception of a dozen genes, the genetic complements of both B. abortus strains are identical, whereas the three species differ in gene content and pseudogenes. The pattern of species-specific gene inactivations affecting transcriptional regulators and outer membrane proteins suggests that these inactivations may play an important role in the establishment of host specificity and may have been a primary driver of speciation in the genus Brucella. Despite being nonmotile, the brucellae contain flagellum gene clusters and display species-specific flagellar gene inactivations, which lead to the putative generation of different versions of flagellum-derived structures and may contribute to differences in host specificity and virulence. Metabolic changes such as the lack of complete metabolic pathways for the synthesis of numerous compounds (e.g., glycogen, biotin, NAD, and choline) are consistent with adaptation of brucellae to an intracellular life-style.

  11. Screening synteny blocks in pairwise genome comparisons through integer programming

    OpenAIRE

    Paterson Andrew H; Schnable James C; Pedersen Brent; Lyons Eric; Tang Haibao; Freeling Michael

    2011-01-01

    Abstract Background It is difficult to accurately interpret chromosomal correspondences such as true orthology and paralogy due to significant divergence of genomes from a common ancestor. Analyses are particularly problematic among lineages that have repeatedly experienced whole genome duplication (WGD) events. To compare multiple "subgenomes" derived from genome duplications, we need to relax the traditional requirements of "one-to-one" syntenic matchings of genomic regions in order to refl...

  12. Interpreting Mammalian Evolution using Fugu Genome Comparisons

    Energy Technology Data Exchange (ETDEWEB)

    Stubbs, L; Ovcharenko, I; Loots, G G

    2004-04-02

    Comparative sequence analysis of the human and the pufferfish Fugu rubripes (fugu) genomes has revealed several novel functional coding and noncoding regions in the human genome. In particular, the fugu genome has been extremely valuable for identifying transcriptional regulatory elements in human loci harboring unusually high levels of evolutionary conservation to rodent genomes. In such regions, the large evolutionary distance between human and fishes provides an additional filter through which functional noncoding elements can be detected with high efficiency.

  13. Insights into the genome evolution of Yersinia pestis through whole genome comparison with Yersinia pseudotuberculosis

    Energy Technology Data Exchange (ETDEWEB)

    Souza, B; Stoutland, P; Derbise, A; Georgescu, A; Elliott, J; Land, M; Marceau, M; Motin, V; Hinnebusch, J; Simonet, M; Medigue, C; Dacheux, D; Chenal-Francisque, V; Regala, W; Brubaker, R R; Carniel, E; Chain, P; Verguez, L; Fowler, J; Garcia, E; Lamerdin, J; Hauser, L; Larimer, F

    2004-01-24

    Yersinia pestis, the causative agent of plague, is a highly uniform clone that diverged recently from the enteric pathogen Yersinia pseudotuberculosis. Despite their close genetic relationship, they differ radically in their pathogenicity and transmission. Here we report the complete genomic sequence of Y. pseudotuberculosis IP32953 and its use for detailed genome comparisons to available Y. pestis sequences. Analyses of identified differences across a panel of Yersinia isolates from around the world reveals 32 Y. pestis chromosomal genes that, together with the two Y. pestis-specific plasmids, represent the only new genetic material in Y. pestis acquired since the divergence from Y. pseudotuberculosis. In contrast, 149 new pseudogenes (doubling the previous estimate) and 317 genes absent from Y. pestis were detected, indicating that as many as 13% of Y. pseudotuberculosis genes no longer function in Y. pestis. Extensive IS-mediated genome rearrangements and reductive evolution through massive gene loss, resulting in elimination and modification of pre-existing gene expression pathways appear to be more important than acquisition of new genes in the evolution of Y. pestis. These results provide a sobering example of how a highly virulent epidemic clone can suddenly emerge from a less virulent, closely related progenitor.

  14. Comparison of 61 Sequenced Escherichia coli Genomes

    DEFF Research Database (Denmark)

    Lukjancenko, Oksana; Wassenaar, T. M.; Ussery, David

    2010-01-01

    Escherichia coli is an important component of the biosphere and is an ideal model for studies of processes involved in bacterial genome evolution. Sixty-one publically available E. coli and Shigella spp. sequenced genomes are compared, using basic methods to produce phylogenetic and proteomics...... the pan-genome and about 80% of a typical genome; some of these variable genes tend to be co-localized on genomic islands. The diversity within the species E. coli, and the overlap in gene content between this and related species, suggests a continuum rather than sharp species borders in this group of...

  15. Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree.

    Science.gov (United States)

    Kuravadi, Nagesh A; Yenagi, Vijay; Rangiah, Kannan; Mahesh, H B; Rajamani, Anantharamanan; Shirke, Meghana D; Russiachand, Heikham; Loganathan, Ramya Malarini; Shankara Lingu, Chandana; Siddappa, Shilpa; Ramamurthy, Aishwarya; Sathyanarayana, B N; Gowda, Malali

    2015-01-01

    Neem (Azadirachta indica A. Juss) is one of the most versatile tropical evergreen tree species known in India since the Vedic period (1500 BC-600 BC). Neem tree is a rich source of limonoids, having a wide spectrum of activity against insect pests and microbial pathogens. Complex tetranortriterpenoids such as azadirachtin, salanin and nimbin are the major active principles isolated from neem seed. Absolutely nothing is known about the biochemical pathways of these metabolites in neem tree. To identify genes and pathways in neem, we sequenced neem genomes and transcriptomes using next generation sequencing technologies. Assembly of Illumina and 454 sequencing reads resulted in 267 Mb, which accounts for 70% of estimated size of neem genome. We predicted 44,495 genes in the neem genome, of which 32,278 genes were expressed in neem tissues. Neem genome consists about 32.5% (87 Mb) of repetitive DNA elements. Neem tree is phylogenetically related to citrus, Citrus sinensis. Comparative analysis anchored 62% (161 Mb) of assembled neem genomic contigs onto citrus chromomes. Ultrahigh performance liquid chromatography-mass spectrometry-selected reaction monitoring (UHPLC-MS/SRM) method was used to quantify azadirachtin, nimbin, and salanin from neem tissues. Weighted Correlation Network Analysis (WCGNA) of expressed genes and metabolites resulted in identification of possible candidate genes involved in azadirachtin biosynthesis pathway. This study provides genomic, transcriptomic and quantity of top three neem metabolites resource, which will accelerate basic research in neem to understand biochemical pathways. PMID:26290780

  16. BLAST Ring Image Generator (BRIG: simple prokaryote genome comparisons

    Directory of Open Access Journals (Sweden)

    Beatson Scott A

    2011-08-01

    Full Text Available Abstract Background Visualisation of genome comparisons is invaluable for helping to determine genotypic differences between closely related prokaryotes. New visualisation and abstraction methods are required in order to improve the validation, interpretation and communication of genome sequence information; especially with the increasing amount of data arising from next-generation sequencing projects. Visualising a prokaryote genome as a circular image has become a powerful means of displaying informative comparisons of one genome to a number of others. Several programs, imaging libraries and internet resources already exist for this purpose, however, most are either limited in the number of comparisons they can show, are unable to adequately utilise draft genome sequence data, or require a knowledge of command-line scripting for implementation. Currently, there is no freely available desktop application that enables users to rapidly visualise comparisons between hundreds of draft or complete genomes in a single image. Results BLAST Ring Image Generator (BRIG can generate images that show multiple prokaryote genome comparisons, without an arbitrary limit on the number of genomes compared. The output image shows similarity between a central reference sequence and other sequences as a set of concentric rings, where BLAST matches are coloured on a sliding scale indicating a defined percentage identity. Images can also include draft genome assembly information to show read coverage, assembly breakpoints and collapsed repeats. In addition, BRIG supports the mapping of unassembled sequencing reads against one or more central reference sequences. Many types of custom data and annotations can be shown using BRIG, making it a versatile approach for visualising a range of genomic comparison data. BRIG is readily accessible to any user, as it assumes no specialist computational knowledge and will perform all required file parsing and BLAST comparisons

  17. Comparative Genomic Analyses of the Human NPHP1 Locus Reveal Complex Genomic Architecture and Its Regional Evolution in Primates

    Science.gov (United States)

    Yuan, Bo; Liu, Pengfei; Gupta, Aditya; Beck, Christine R.; Tejomurtula, Anusha; Campbell, Ian M.; Gambin, Tomasz; Simmons, Alexandra D.; Withers, Marjorie A.; Harris, R. Alan; Rogers, Jeffrey; Schwartz, David C.; Lupski, James R.

    2015-01-01

    Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100) is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs) are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases—about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR) between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV) haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual’s susceptibility to acquiring disease-associated alleles. PMID:26641089

  18. Comparative Genomic Analyses of the Human NPHP1 Locus Reveal Complex Genomic Architecture and Its Regional Evolution in Primates.

    Directory of Open Access Journals (Sweden)

    Bo Yuan

    2015-12-01

    Full Text Available Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100 is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases-about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual's susceptibility to acquiring disease-associated alleles.

  19. Molecular Characterization of Five Potyviruses Infecting Korean Sweet Potatoes Based on Analyses of Complete Genome Sequences

    Directory of Open Access Journals (Sweden)

    Hae-Ryun Kwak

    2015-12-01

    Full Text Available Sweet potatoes (Ipomea batatas L. are grown extensively, in tropical and temperate regions, and are important food crops worldwide. In Korea, potyviruses, including Sweet potato feathery mottle virus (SPFMV, Sweet potato virus C (SPVC, Sweet potato virus G (SPVG, Sweet potato virus 2 (SPV2, and Sweet potato latent virus (SPLV, have been detected in sweet potato fields at a high (~95% incidence. In the present work, complete genome sequences of 18 isolates, representing the five potyviruses mentioned above, were compared with previously reported genome sequences. The complete genomes consisted of 10,081 to 10,830 nucleotides, excluding the poly-A tails. Their genomic organizations were typical of the Potyvirus genus, including one target open reading frame coding for a putative polyprotein. Based on phylogenetic analyses and sequence comparisons, the Korean SPFMV isolates belonged to the strains RC and O with >98% nucleotide sequence identity. Korean SPVC isolates had 99% identity to the Japanese isolate SPVC-Bungo and 70% identity to the SPFMV isolates. The Korean SPVG isolates showed 99% identity to the three previously reported SPVG isolates. Korean SPV2 isolates had 97% identity to the SPV2 GWB-2 isolate from the USA. Korean SPLV isolates had a relatively low (88% nucleotide sequence identity with the Taiwanese SPLV-TW isolates, and they were phylogenetically distantly related to SPFMV isolates. Recombination analysis revealed that possible recombination events occurred in the P1, HC-Pro and NIa-NIb regions of SPFMV and SPLV isolates and these regions were identified as hotspots for recombination in the sweet potato potyviruses.

  20. Mutation analyses of integrated HBV genome in hepatitis B patients

    Institute of Scientific and Technical Information of China (English)

    Peilin Wang; Xiuhai Wang; Shuying Cong; Hongming Ma; Xuecheng Zhang

    2008-01-01

    Little has been learnt in the last 30 years about detection of HBV genome as well as its mutation analysis between hepatitis B fathers (HBF) and their children. In this study, we used nest polymerase chain reaction (PCR), fluorescence in situ hybridization (FISH), and DNA sequencing analysis, to examine the integrated HBV genome in paraffin-embedded testis tissues, which were taken as samples from HBF, and in peripheral blood mononuclear cells (PBMC) from 74 cases of HBFs and their children who were born after their fathers' HBV infection (caHBF). We found that HBV DNA existed in testis tissues, mainly in the basilar parts of the seminiferous tubules, and also in PBMC of HBF. It was also documented that there were point mutations of poly-loci, insertions and deletions of nucleotides in integrated HBV genomes, and the types of gene mutations in the HBFs were similar to those in caHBF. This study addresses the major types of gene mutations in integrated HBV genome in human patients and also presents reliable evidence of possible genetic transmission of hepatitis B.

  1. Comparison of the genome of the oral pathogen Treponema denticola with other spirochete genomes

    OpenAIRE

    Seshadri, Rekha; Myers, Garry S A; Tettelin, Hervé; Eisen, Jonathan A.; Heidelberg, John F.; Robert J Dodson; Davidsen, Tanja M.; DeBoy, Robert T; Derrick E Fouts; Haft, Dan H.; Selengut, Jeremy; Ren, Qinghu; Brinkac, Lauren M.; Madupu, Ramana; Kolonay, Jamie

    2004-01-01

    We present the complete 2,843,201-bp genome sequence of Treponema denticola (ATCC 35405) an oral spirochete associated with periodontal disease. Analysis of the T. denticola genome reveals factors mediating coaggregation, cell signaling, stress protection, and other competitive and cooperative measures, consistent with its pathogenic nature and lifestyle within the mixed-species environment of subgingival dental plaque. Comparisons with previously sequenced spirochete genomes revealed specifi...

  2. Genome-wide transcription analyses in rice using tiling microarrays

    DEFF Research Database (Denmark)

    Li, Lei; Wang, Xiangfeng; Stolc, Viktor;

    2006-01-01

    Sequencing and computational annotation revealed several features, including high gene numbers, unusual composition of the predicted genes and a large number of genes lacking homology to known genes, that distinguish the rice (Oryza sativa) genome from that of other fully sequenced model species....... We report here a full-genome transcription analysis of the indica rice subspecies using high-density oligonucleotide tiling microarrays. Our results provided expression data support for the existence of 35,970 (81.9%) annotated gene models and identified 5,464 unique transcribed intergenic regions...... that share similar compositional properties with the annotated exons and have significant homology to other plant proteins. Elucidating and mapping of all transcribed regions revealed an association between global transcription and cytological chromosome features, and an overall similarity of...

  3. GEMBASSY: an EMBOSS associated software package for comprehensive genome analyses

    OpenAIRE

    Itaya, Hidetoshi; Oshita, Kazuki; Arakawa, Kazuharu; Tomita, Masaru

    2013-01-01

    The popular European Molecular Biology Open Software Suite (EMBOSS) currently contains over 400 tools used in various bioinformatics researches, equipped with sophisticated development frameworks for interoperability and tool discoverability as well as rich documentations and various user interfaces. In order to further strengthen EMBOSS in the fields of genomics, we here present a novel EMBOSS associated software (EMBASSY) package named GEMBASSY, which adds more than 50 analysis tools from t...

  4. Identification of Ohnolog Genes Originating from Whole Genome Duplication in Early Vertebrates, Based on Synteny Comparison across Multiple Genomes.

    Science.gov (United States)

    Singh, Param Priya; Arora, Jatin; Isambert, Hervé

    2015-07-01

    Whole genome duplications (WGD) have now been firmly established in all major eukaryotic kingdoms. In particular, all vertebrates descend from two rounds of WGDs, that occurred in their jawless ancestor some 500 MY ago. Paralogs retained from WGD, also coined 'ohnologs' after Susumu Ohno, have been shown to be typically associated with development, signaling and gene regulation. Ohnologs, which amount to about 20 to 35% of genes in the human genome, have also been shown to be prone to dominant deleterious mutations and frequently implicated in cancer and genetic diseases. Hence, identifying ohnologs is central to better understand the evolution of vertebrates and their susceptibility to genetic diseases. Early computational analyses to identify vertebrate ohnologs relied on content-based synteny comparisons between the human genome and a single invertebrate outgroup genome or within the human genome itself. These approaches are thus limited by lineage specific rearrangements in individual genomes. We report, in this study, the identification of vertebrate ohnologs based on the quantitative assessment and integration of synteny conservation between six amniote vertebrates and six invertebrate outgroups. Such a synteny comparison across multiple genomes is shown to enhance the statistical power of ohnolog identification in vertebrates compared to earlier approaches, by overcoming lineage specific genome rearrangements. Ohnolog gene families can be browsed and downloaded for three statistical confidence levels or recompiled for specific, user-defined, significance criteria at http://ohnologs.curie.fr/. In the light of the importance of WGD on the genetic makeup of vertebrates, our analysis provides a useful resource for researchers interested in gaining further insights on vertebrate evolution and genetic diseases. PMID:26181593

  5. Complete mitochondrial genome of the brown alga Sargassum hemiphyllum (Sargassaceae, Phaeophyceae): comparative analyses.

    Science.gov (United States)

    Liu, Feng; Pang, Shaojun; Chen, Weizhou

    2016-01-01

    Sargassum hemiphyllum (Turner) C. Agardh is a common low intertidal brown alga in the northwestern Pacific, and two varieties (var. chinense and var. hemiphyllum) of this alga have been determined based on morphological and molecular data. In this study, we present the complete mitochondrial genome of S. hemiphyllum var. chinense. The circular-mapping S. hemiphyllum mitogenome of 34,686 bp has an overall A+T content of 63.43%, and contains 65 densely packed genes. The total intergenic spacer regions are 1597 bp, constituting 4.60% of the mitogenome. The gene content and genome organization of S. hemiphyllum are identical to that of other reported Sargassum species. The identity comparison of overall mitogenome sequences and phylogenomic analyses indicate that S. hemiphyllum has a closer evolutionary relationship with Sargassum muticum than other Sargassum species analyzed. The present mitogenomic data provide a powerful tool for definition of varieties and studies of population structure in S. hemiphyllum. PMID:25162799

  6. Meta-Analyses of Genome-Wide Association Data Hold New Promise for Addiction Genetics.

    Science.gov (United States)

    Agrawal, Arpana; Edenberg, Howard J; Gelernter, Joel

    2016-09-01

    Meta-analyses of genome-wide association study data have begun to lead to promising new discoveries for behavioral and psychiatrically relevant phenotypes (e.g., schizophrenia, educational attainment). We outline how this methodology can similarly lead to novel discoveries in genomic studies of substance use disorders, and discuss challenges that will need to be overcome to accomplish this goal. We illustrate our approach with the work of the newly established Substance Use Disorders workgroup of the Psychiatric Genomics Consortium. PMID:27588522

  7. A comparison of rice chloroplast genomes

    DEFF Research Database (Denmark)

    Tang, Jiabin; Xia, Hong'ai; Cao, Mengliang; Zhang, Xiuqing; Zeng, Wanyong; Hu, Songnian; Tong, Wei; Wang, Jun; Wang, Jian; Yu, Jun; Yang, Huanming; Zhu, Lihuang

    2004-01-01

    ), which are both parental varieties of the super-hybrid rice, LYP9. Based on the patterns of high sequence coverage, we partitioned chloroplast sequence variations into two classes, intravarietal and intersubspecific polymorphisms. Intravarietal polymorphisms refer to variations within 93-11 or PA64S...... intersubspecific polymorphisms. In our study, we found that the intersubspecific variations of 93-11 (indica) and PA64S (japonica) chloroplast genomes consisted of 72 single nucleotide polymorphisms and 27 insertions or deletions. The intersubspecific polymorphism rates between 93-11 and PA64S were 0.05% for...... single nucleotide polymorphisms and 0.02% for insertions or deletions, nearly 8 and 10 times lower than their respective nuclear genomes. Based on the total number of nucleotide substitutions between the two chloroplast genomes, we dated the divergence of indica and japonica chloroplast genomes as...

  8. Quality control and conduct of genome-wide association meta-analyses

    DEFF Research Database (Denmark)

    Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C;

    2014-01-01

    Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for (i) organizational aspects of GWAMAs, and for (ii) Q...

  9. Comparative genome analyses reveal distinct structure in the saltwater crocodile MHC.

    Science.gov (United States)

    Jaratlerdsiri, Weerachai; Deakin, Janine; Godinez, Ricardo M; Shan, Xueyan; Peterson, Daniel G; Marthey, Sylvain; Lyons, Eric; McCarthy, Fiona M; Isberg, Sally R; Higgins, Damien P; Chong, Amanda Y; John, John St; Glenn, Travis C; Ray, David A; Gongora, Jaime

    2014-01-01

    The major histocompatibility complex (MHC) is a dynamic genome region with an essential role in the adaptive immunity of vertebrates, especially antigen presentation. The MHC is generally divided into subregions (classes I, II and III) containing genes of similar function across species, but with different gene number and organisation. Crocodylia (crocodilians) are widely distributed and represent an evolutionary distinct group among higher vertebrates, but the genomic organisation of MHC within this lineage has been largely unexplored. Here, we studied the MHC region of the saltwater crocodile (Crocodylus porosus) and compared it with that of other taxa. We characterised genomic clusters encompassing MHC class I and class II genes in the saltwater crocodile based on sequencing of bacterial artificial chromosomes. Six gene clusters spanning ∼452 kb were identified to contain nine MHC class I genes, six MHC class II genes, three TAP genes, and a TRIM gene. These MHC class I and class II genes were in separate scaffold regions and were greater in length (2-6 times longer) than their counterparts in well-studied fowl B loci, suggesting that the compaction of avian MHC occurred after the crocodilian-avian split. Comparative analyses between the saltwater crocodile MHC and that from the alligator and gharial showed large syntenic areas (>80% identity) with similar gene order. Comparisons with other vertebrates showed that the saltwater crocodile had MHC class I genes located along with TAP, consistent with birds studied. Linkage between MHC class I and TRIM39 observed in the saltwater crocodile resembled MHC in eutherians compared, but absent in avian MHC, suggesting that the saltwater crocodile MHC appears to have gene organisation intermediate between these two lineages. These observations suggest that the structure of the saltwater crocodile MHC, and other crocodilians, can help determine the MHC that was present in the ancestors of archosaurs. PMID:25503521

  10. Comparative genome analyses reveal distinct structure in the saltwater crocodile MHC.

    Directory of Open Access Journals (Sweden)

    Weerachai Jaratlerdsiri

    Full Text Available The major histocompatibility complex (MHC is a dynamic genome region with an essential role in the adaptive immunity of vertebrates, especially antigen presentation. The MHC is generally divided into subregions (classes I, II and III containing genes of similar function across species, but with different gene number and organisation. Crocodylia (crocodilians are widely distributed and represent an evolutionary distinct group among higher vertebrates, but the genomic organisation of MHC within this lineage has been largely unexplored. Here, we studied the MHC region of the saltwater crocodile (Crocodylus porosus and compared it with that of other taxa. We characterised genomic clusters encompassing MHC class I and class II genes in the saltwater crocodile based on sequencing of bacterial artificial chromosomes. Six gene clusters spanning ∼452 kb were identified to contain nine MHC class I genes, six MHC class II genes, three TAP genes, and a TRIM gene. These MHC class I and class II genes were in separate scaffold regions and were greater in length (2-6 times longer than their counterparts in well-studied fowl B loci, suggesting that the compaction of avian MHC occurred after the crocodilian-avian split. Comparative analyses between the saltwater crocodile MHC and that from the alligator and gharial showed large syntenic areas (>80% identity with similar gene order. Comparisons with other vertebrates showed that the saltwater crocodile had MHC class I genes located along with TAP, consistent with birds studied. Linkage between MHC class I and TRIM39 observed in the saltwater crocodile resembled MHC in eutherians compared, but absent in avian MHC, suggesting that the saltwater crocodile MHC appears to have gene organisation intermediate between these two lineages. These observations suggest that the structure of the saltwater crocodile MHC, and other crocodilians, can help determine the MHC that was present in the ancestors of archosaurs.

  11. Plant Ion Channels: Gene Families, Physiology, and Functional Genomics Analyses

    Science.gov (United States)

    Ward, John M.; Mäser, Pascal; Schroeder, Julian I.

    2016-01-01

    Distinct potassium, anion, and calcium channels in the plasma membrane and vacuolar membrane of plant cells have been identified and characterized by patch clamping. Primarily owing to advances in Arabidopsis genetics and genomics, and yeast functional complementation, many of the corresponding genes have been identified. Recent advances in our understanding of ion channel genes that mediate signal transduction and ion transport are discussed here. Some plant ion channels, for example, ALMT and SLAC anion channel subunits, are unique. The majority of plant ion channel families exhibit homology to animal genes; such families include both hyperpolarization-and depolarization-activated Shaker-type potassium channels, CLC chloride transporters/channels, cyclic nucleotide–gated channels, and ionotropic glutamate receptor homologs. These plant ion channels offer unique opportunities to analyze the structural mechanisms and functions of ion channels. Here we review gene families of selected plant ion channel classes and discuss unique structure-function aspects and their physiological roles in plant cell signaling and transport. PMID:18842100

  12. Genomic analyses identify molecular subtypes of pancreatic cancer.

    Science.gov (United States)

    Bailey, Peter; Chang, David K; Nones, Katia; Johns, Amber L; Patch, Ann-Marie; Gingras, Marie-Claude; Miller, David K; Christ, Angelika N; Bruxner, Tim J C; Quinn, Michael C; Nourse, Craig; Murtaugh, L Charles; Harliwong, Ivon; Idrisoglu, Senel; Manning, Suzanne; Nourbakhsh, Ehsan; Wani, Shivangi; Fink, Lynn; Holmes, Oliver; Chin, Venessa; Anderson, Matthew J; Kazakoff, Stephen; Leonard, Conrad; Newell, Felicity; Waddell, Nick; Wood, Scott; Xu, Qinying; Wilson, Peter J; Cloonan, Nicole; Kassahn, Karin S; Taylor, Darrin; Quek, Kelly; Robertson, Alan; Pantano, Lorena; Mincarelli, Laura; Sanchez, Luis N; Evers, Lisa; Wu, Jianmin; Pinese, Mark; Cowley, Mark J; Jones, Marc D; Colvin, Emily K; Nagrial, Adnan M; Humphrey, Emily S; Chantrill, Lorraine A; Mawson, Amanda; Humphris, Jeremy; Chou, Angela; Pajic, Marina; Scarlett, Christopher J; Pinho, Andreia V; Giry-Laterriere, Marc; Rooman, Ilse; Samra, Jaswinder S; Kench, James G; Lovell, Jessica A; Merrett, Neil D; Toon, Christopher W; Epari, Krishna; Nguyen, Nam Q; Barbour, Andrew; Zeps, Nikolajs; Moran-Jones, Kim; Jamieson, Nigel B; Graham, Janet S; Duthie, Fraser; Oien, Karin; Hair, Jane; Grützmann, Robert; Maitra, Anirban; Iacobuzio-Donahue, Christine A; Wolfgang, Christopher L; Morgan, Richard A; Lawlor, Rita T; Corbo, Vincenzo; Bassi, Claudio; Rusev, Borislav; Capelli, Paola; Salvia, Roberto; Tortora, Giampaolo; Mukhopadhyay, Debabrata; Petersen, Gloria M; Munzy, Donna M; Fisher, William E; Karim, Saadia A; Eshleman, James R; Hruban, Ralph H; Pilarsky, Christian; Morton, Jennifer P; Sansom, Owen J; Scarpa, Aldo; Musgrove, Elizabeth A; Bailey, Ulla-Maja Hagbo; Hofmann, Oliver; Sutherland, Robert L; Wheeler, David A; Gill, Anthony J; Gibbs, Richard A; Pearson, John V; Waddell, Nicola; Biankin, Andrew V; Grimmond, Sean M

    2016-03-01

    Integrated genomic analysis of 456 pancreatic ductal adenocarcinomas identified 32 recurrently mutated genes that aggregate into 10 pathways: KRAS, TGF-β, WNT, NOTCH, ROBO/SLIT signalling, G1/S transition, SWI-SNF, chromatin modification, DNA repair and RNA processing. Expression analysis defined 4 subtypes: (1) squamous; (2) pancreatic progenitor; (3) immunogenic; and (4) aberrantly differentiated endocrine exocrine (ADEX) that correlate with histopathological characteristics. Squamous tumours are enriched for TP53 and KDM6A mutations, upregulation of the TP63∆N transcriptional network, hypermethylation of pancreatic endodermal cell-fate determining genes and have a poor prognosis. Pancreatic progenitor tumours preferentially express genes involved in early pancreatic development (FOXA2/3, PDX1 and MNX1). ADEX tumours displayed upregulation of genes that regulate networks involved in KRAS activation, exocrine (NR5A2 and RBPJL), and endocrine differentiation (NEUROD1 and NKX2-2). Immunogenic tumours contained upregulated immune networks including pathways involved in acquired immune suppression. These data infer differences in the molecular evolution of pancreatic cancer subtypes and identify opportunities for therapeutic development. PMID:26909576

  13. Comparing thousands of circular genomes using the CGView Comparison Tool

    Directory of Open Access Journals (Sweden)

    Grant Jason R

    2012-05-01

    Full Text Available Abstract Background Continued sequencing efforts coupled with advances in sequencing technology will lead to the completion of a vast number of small genomes. Whole-genome comparisons represent an important part of the analysis of any new genome sequence, as they can provide a better understanding of the biology and evolution of the source organism. Visualization of the results is important, as it allows information from a variety of sources to be integrated and interpreted. However, existing graphical comparison tools lack features needed for efficiently comparing a new genome to hundreds or thousands of existing sequences. Moreover, existing tools are limited in terms of the types of comparisons that can be performed, the extent to which the output can be customized, and the ease with which the entire process can be automated. Results The CGView Comparison Tool (CCT is a package for visually comparing bacterial, plasmid, chloroplast, or mitochondrial sequences of interest to existing genomes or sequence collections. The comparisons are conducted using BLAST, and the BLAST results are presented in the form of graphical maps that can also show sequence features, gene and protein names, COG (Clusters of Orthologous Groups of proteins category assignments, and sequence composition characteristics. CCT can generate maps in a variety of sizes, including 400 Megapixel maps suitable for posters. Comparisons can be conducted within a particular species or genus, or all available genomes can be used. The entire map creation process, from downloading sequences to redrawing zoomed maps, can be completed easily using scripts included with the CCT. User-defined features or analysis results can be included on maps, and maps can be extensively customized. To simplify program setup, a CCT virtual machine that includes all dependencies preinstalled is available. Detailed tutorials illustrating the use of CCT are included with the CCT documentation. Conclusion

  14. Genome-based comparative analyses of Antarctic and temperate species of Paenibacillus.

    Directory of Open Access Journals (Sweden)

    Melissa Dsouza

    Full Text Available Antarctic soils represent a unique environment characterised by extremes of temperature, salinity, elevated UV radiation, low nutrient and low water content. Despite the harshness of this environment, members of 15 bacterial phyla have been identified in soils of the Ross Sea Region (RSR. However, the survival mechanisms and ecological roles of these phyla are largely unknown. The aim of this study was to investigate whether strains of Paenibacillus darwinianus owe their resilience to substantial genomic changes. For this, genome-based comparative analyses were performed on three P. darwinianus strains, isolated from gamma-irradiated RSR soils, together with nine temperate, soil-dwelling Paenibacillus spp. The genome of each strain was sequenced to over 1,000-fold coverage, then assembled into contigs totalling approximately 3 Mbp per genome. Based on the occurrence of essential, single-copy genes, genome completeness was estimated at approximately 88%. Genome analysis revealed between 3,043-3,091 protein-coding sequences (CDSs, primarily associated with two-component systems, sigma factors, transporters, sporulation and genes induced by cold-shock, oxidative and osmotic stresses. These comparative analyses provide an insight into the metabolic potential of P. darwinianus, revealing potential adaptive mechanisms for survival in Antarctic soils. However, a large proportion of these mechanisms were also identified in temperate Paenibacillus spp., suggesting that these mechanisms are beneficial for growth and survival in a range of soil environments. These analyses have also revealed that the P. darwinianus genomes contain significantly fewer CDSs and have a lower paralogous content. Notwithstanding the incompleteness of the assemblies, the large differences in genome sizes, determined by the number of genes in paralogous clusters and the CDS content, are indicative of genome content scaling. Finally, these sequences are a resource for further

  15. CompaGB: An open framework for genome browsers comparison

    Directory of Open Access Journals (Sweden)

    Chiapello Hélène

    2011-05-01

    Full Text Available Abstract Background Tools to visualize and explore genomes hold a central place in genomics and the diversity of genome browsers has increased dramatically over the last few years. It often turns out to be a daunting task to compare and choose a well-adapted genome browser, as multidisciplinary knowledge is required to carry out this task and the number of tools, functionalities and features are overwhelming. Findings To assist in this task, we propose a community-based framework based on two cornerstones: (i the implementation of industry promoted software qualification method (QSOS adapted for genome browser evaluations, and (ii a web resource providing numerous facilities either for visualizing comparisons or performing new evaluations. We formulated 60 criteria specifically for genome browsers, and incorporated another 65 directly from QSOS's generic section. Those criteria aim to answer versatile needs, ranging from a biologist whose interest primarily lies into user-friendly and informative functionalities, a bioinformatician who wants to integrate the genome browser into a wider framework, or a computer scientist who might choose a software according to more technical features. We developed a dedicated web application to enrich the existing QSOS functionalities (weighting of criteria, user profile with features of interest to a community-based framework: easy management of evolving data, user comments... Conclusions The framework is available at http://genome.jouy.inra.fr/CompaGB. It is open to anyone who wishes to participate in the evaluations. It helps the scientific community to (1 choose a genome browser that would better fit their particular project, (2 visualize features comparatively with easily accessible formats, such as tables or radar plots and (3 perform their own evaluation against the defined criteria. To illustrate the CompaGB functionalities, we have evaluated seven genome browsers according to the implemented methodology

  16. Molecular evolution of herpesviruses: genomic and protein sequence comparisons.

    OpenAIRE

    Karlin, S; Mocarski, E S; Schachtel, G A

    1994-01-01

    Phylogenetic reconstruction of herpesvirus evolution is generally founded on amino acid sequence comparisons of specific proteins. These are relevant to the evolution of the specific gene (or set of genes), but the resulting phylogeny may vary depending on the particular sequence chosen for analysis (or comparison). In the first part of this report, we compare 13 herpesvirus genomes by using a new multidimensional methodology based on distance measures and partial orderings of dinucleotide re...

  17. Development of an interactive genome browser to visualize and analyse large scale genomic data

    OpenAIRE

    Sinclair, Lucas

    2010-01-01

    Genomic bioinformatics is a growing and developing field. Indeed, data analysis is becoming an integrative and essential part of any quantitative biological experiment as the technologies evolve and the wet lab methods used generate larger and larger quantities of data. Yet few standards have emerged and a plethora of analytical tools exist, none of which are established as a standard. The difficulties arise early on, even before processing any genomic data, as one first needs to visualize it...

  18. Comparative analyses of chloroplast genome data representing nine green algae in Sphaeropleales (Chlorophyceae, Chlorophyta).

    Science.gov (United States)

    Fučíková, Karolina; Lewis, Louise A; Lewis, Paul O

    2016-06-01

    The chloroplast genomes of green algae are highly variable in their architecture. In this article we summarize gene content across newly obtained and published chloroplast genomes in Chlorophyceae, including new data from nine of species in Sphaeropleales (Chlorophyceae, Chlorophyta). We present genome architecture information, including genome synteny analysis across two groups of species. Also, we provide a phylogenetic tree obtained from analysis of gene order data for species in Chlorophyceae with fully sequenced chloroplast genomes. Further analyses and interpretation of the data can be found in "Chloroplast phylogenomic data from the green algal order Sphaeropleales (Chlorophyceae, Chlorophyta) reveal complex patterns of sequence evolution" (Fučíková et al., In review) [1]. PMID:27054159

  19. Comparative genomic and transcriptional analyses of CRISPR systems across the genus Pyrobaculum

    OpenAIRE

    Bernick, David L.; Cox, Courtney L.; Dennis, Patrick P.; Lowe, Todd M.

    2012-01-01

    Within the domain Archaea, the CRISPR immune system appears to be nearly ubiquitous based on computational genome analyses. Initial studies in bacteria demonstrated that the CRISPR system targets invading plasmid and viral DNA. Recent experiments in the model archaeon Pyrococcus furiosus have uncovered a novel RNA-targeting variant of the CRISPR system. Because our understanding of CRISPR system evolution in other archaea is limited, we have taken a comparative genomic and transcriptomic view...

  20. Comparative Genomic and Transcriptional Analyses of CRISPR Systems Across the Genus Pyrobaculum

    OpenAIRE

    Bernick, David L.; Cox, Courtney L.; Dennis, Patrick P.; Lowe, Todd M.

    2012-01-01

    Within the domain Archaea, the CRISPR immune system appears to be nearly ubiquitous based on computational genome analyses. Initial studies in bacteria demonstrated that the CRISPR system targets invading plasmid and viral DNA. Recent experiments in the model archaeon Pyrococcus furiosus uncovered a novel RNA-targeting variant of the CRISPR system potentially unique to archaea. Because our understanding of CRISPR system evolution in other archaea is limited, we have taken a comparative genom...

  1. Genetic basis for spontaneous hybrid genome doubling during allopolyploid speciation of common wheat shown by natural variation analyses of the paternal species.

    Directory of Open Access Journals (Sweden)

    Yoshihiro Matsuoka

    Full Text Available The complex process of allopolyploid speciation includes various mechanisms ranging from species crosses and hybrid genome doubling to genome alterations and the establishment of new allopolyploids as persisting natural entities. Currently, little is known about the genetic mechanisms that underlie hybrid genome doubling, despite the fact that natural allopolyploid formation is highly dependent on this phenomenon. We examined the genetic basis for the spontaneous genome doubling of triploid F1 hybrids between the direct ancestors of allohexaploid common wheat (Triticum aestivum L., AABBDD genome, namely Triticumturgidum L. (AABB genome and Aegilopstauschii Coss. (DD genome. An Ae. tauschii intraspecific lineage that is closely related to the D genome of common wheat was identified by population-based analysis. Two representative accessions, one that produces a high-genome-doubling-frequency hybrid when crossed with a T. turgidum cultivar and the other that produces a low-genome-doubling-frequency hybrid with the same cultivar, were chosen from that lineage for further analyses. A series of investigations including fertility analysis, immunostaining, and quantitative trait locus (QTL analysis showed that (1 production of functional unreduced gametes through nonreductional meiosis is an early step key to successful hybrid genome doubling, (2 first division restitution is one of the cytological mechanisms that cause meiotic nonreduction during the production of functional male unreduced gametes, and (3 six QTLs in the Ae. tauschii genome, most of which likely regulate nonreductional meiosis and its subsequent gamete production processes, are involved in hybrid genome doubling. Interlineage comparisons of Ae. tauschii's ability to cause hybrid genome doubling suggested an evolutionary model for the natural variation pattern of the trait in which non-deleterious mutations in six QTLs may have important roles. The findings of this study demonstrated

  2. Genomic and peptidomic analyses of the neuropeptides from the emerging pest, Drosophila suzukii.

    Science.gov (United States)

    Audsley, Neil; Down, Rachel E; Isaac, R Elwyn

    2015-06-01

    Drosophila suzukii is a highly polyphagous invasive pest which has been recently introduced into Europe and North America, where it is causing severe economic losses through larval infestations of stone and berry fruits. The peptidome of the selected nervous tissues of adult D. suzukii was investigated as a first step in identifying potential targets for the development of novel insecticides. Through in silico analyses of the D. suzukii genome databases 28 neuropeptide families, comprising more than 70 predicted peptides were identified. Using a combination of liquid chromatography and mass spectrometry of tissue extracts, 33 predicted peptides, representing 15 different peptide families were identified by their molecular masses and a total of 17 peptide sequences were confirmed by ion fragmentation. A comparison between the peptides and precursors of D. suzukii and D. melanogaster shows they are highly conserved, with differences only identified in the amino acid sequences of the peptides encoded in the FMRFamide, hugin and ecydysis triggering hormone precursors. All other peptides predicted and identified from D. suzukii appear to be identical to those previously characterized from D. melanogaster. Adipokinetic hormone was only identified in the corpus cardiacum, other peptides present included short neuropeptide F, a pyrokinin and myosuppressin, the latter of which was the only peptide identified from the crop nerve bundle. Peptides present in extracts of the brain and/or thoracico-abdominal ganglion included allatostatins, cardioacceleratory peptide 2b, corazonin, extended FMRFamides, pyrokinins, myoinihibitory peptides, neuropeptide-like precursor 1, SIFamide, short neuropeptide F, kinin, sulfakinins and tachykinin related peptides. PMID:25158078

  3. The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons.

    Science.gov (United States)

    Braasch, Ingo; Gehrke, Andrew R; Smith, Jeramiah J; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M; Campbell, Michael S; Barrell, Daniel; Martin, Kyle J; Mulley, John F; Ravi, Vydianathan; Lee, Alison P; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E G; Sun, Yi; Hertel, Jana; Beam, Michael J; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H; Litman, Gary W; Litman, Ronda T; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F; Wang, Han; Taylor, John S; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M J; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T; Venkatesh, Byrappa; Holland, Peter W H; Guiguen, Yann; Bobe, Julien; Shubin, Neil H; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H

    2016-04-01

    To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before teleost genome duplication (TGD). The slowly evolving gar genome has conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization and development (mediated, for example, by Hox, ParaHox and microRNA genes). Numerous conserved noncoding elements (CNEs; often cis regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles for such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses showed that the sums of expression domains and expression levels for duplicated teleost genes often approximate the patterns and levels of expression for gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes and the function of human regulatory sequences. PMID:26950095

  4. Intra-species sequence comparisons for annotating genomes

    Energy Technology Data Exchange (ETDEWEB)

    Boffelli, Dario; Weer, Claire V.; Weng, Li; Lewis, Keith D.; Shoukry, Malak I.; Pachter, Lior; Keys, David N.; Rubin, Edward M.

    2004-07-15

    Analysis of sequence variation among members of a single species offers a potential approach to identify functional DNA elements responsible for biological features unique to that species. Due to its high rate of allelic polymorphism and ease of genetic manipulability, we chose the sea squirt, Ciona intestinalis, to explore intra-species sequence comparisons for genome annotation. A large number of C. intestinalis specimens were collected from four continents and a set of genomic intervals amplified, resequenced and analyzed to determine the mutation rates at each nucleotide in the sequence. We found that regions with low mutation rates efficiently demarcated functionally constrained sequences: these include a set of noncoding elements, which we showed in C intestinalis transgenic assays to act as tissue-specific enhancers, as well as the location of coding sequences. This illustrates that comparisons of multiple members of a species can be used for genome annotation, suggesting a path for the annotation of the sequenced genomes of organisms occupying uncharacterized phylogenetic branches of the animal kingdom and raises the possibility that the resequencing of a large number of Homo sapiens individuals might be used to annotate the human genome and identify sequences defining traits unique to our species. The sequence data from this study has been submitted to GenBank under accession nos. AY667278-AY667407.

  5. Comparison of 2D and 3D gamma analyses

    Energy Technology Data Exchange (ETDEWEB)

    Pulliam, Kiley B.; Huang, Jessie Y.; Howell, Rebecca M.; Followill, David; Kry, Stephen F., E-mail: sfkry@mdanderson.org [Department of Radiation Physics, The University of Texas MD Anderson Cancer Center and The University of Texas Graduate School of Biomedical Sciences at Houston, Houston, Texas 77030 (United States); Bosca, Ryan [Department of Imaging Physics, The University of Texas MD Anderson Cancer Center and The University of Texas Graduate School of Biomedical Sciences at Houston, Houston, Texas 77030 (United States); O’Daniel, Jennifer [Department of Radiation Oncology, Duke University, Durham, North Carolina 27705 (United States)

    2014-02-15

    Purpose: As clinics begin to use 3D metrics for intensity-modulated radiation therapy (IMRT) quality assurance, it must be noted that these metrics will often produce results different from those produced by their 2D counterparts. 3D and 2D gamma analyses would be expected to produce different values, in part because of the different search space available. In the present investigation, the authors compared the results of 2D and 3D gamma analysis (where both datasets were generated in the same manner) for clinical treatment plans. Methods: Fifty IMRT plans were selected from the authors’ clinical database, and recalculated using Monte Carlo. Treatment planning system-calculated (“evaluated dose distributions”) and Monte Carlo-recalculated (“reference dose distributions”) dose distributions were compared using 2D and 3D gamma analysis. This analysis was performed using a variety of dose-difference (5%, 3%, 2%, and 1%) and distance-to-agreement (5, 3, 2, and 1 mm) acceptance criteria, low-dose thresholds (5%, 10%, and 15% of the prescription dose), and data grid sizes (1.0, 1.5, and 3.0 mm). Each comparison was evaluated to determine the average 2D and 3D gamma, lower 95th percentile gamma value, and percentage of pixels passing gamma. Results: The average gamma, lower 95th percentile gamma value, and percentage of passing pixels for each acceptance criterion demonstrated better agreement for 3D than for 2D analysis for every plan comparison. The average difference in the percentage of passing pixels between the 2D and 3D analyses with no low-dose threshold ranged from 0.9% to 2.1%. Similarly, using a low-dose threshold resulted in a difference between the mean 2D and 3D results, ranging from 0.8% to 1.5%. The authors observed no appreciable differences in gamma with changes in the data density (constant difference: 0.8% for 2D vs 3D). Conclusions: The authors found that 3D gamma analysis resulted in up to 2.9% more pixels passing than 2D analysis. It must

  6. Completing bacterial genome assemblies: strategy and performance comparisons.

    Science.gov (United States)

    Liao, Yu-Chieh; Lin, Shu-Hung; Lin, Hsin-Hung

    2015-01-01

    Determining the genomic sequences of microorganisms is the basis and prerequisite for understanding their biology and functional characterization. While the advent of low-cost, extremely high-throughput second-generation sequencing technologies and the parallel development of assembly algorithms have generated rapid and cost-effective genome assemblies, such assemblies are often unfinished, fragmented draft genomes as a result of short read lengths and long repeats present in multiple copies. Third-generation, PacBio sequencing technologies circumvented this problem by greatly increasing read length. Hybrid approaches including ALLPATHS-LG, PacBio corrected reads pipeline, SPAdes, and SSPACE-LongRead, and non-hybrid approaches--hierarchical genome-assembly process (HGAP) and PacBio corrected reads pipeline via self-correction--have therefore been proposed to utilize the PacBio long reads that can span many thousands of bases to facilitate the assembly of complete microbial genomes. However, standardized procedures that aim at evaluating and comparing these approaches are currently insufficient. To address the issue, we herein provide a comprehensive comparison by collecting datasets for the comparative assessment on the above-mentioned five assemblers. In addition to offering explicit and beneficial recommendations to practitioners, this study aims to aid in the design of a paradigm positioned to complete bacterial genome assembly. PMID:25735824

  7. RSIADB, a collective resource for genome and transcriptome analyses in Rhizoctonia solani AG1 IA.

    Science.gov (United States)

    Chen, Lei; Ai, Peng; Zhang, Jinfeng; Deng, Qiming; Wang, Shiquan; Li, Shuangcheng; Zhu, Jun; Li, Ping; Zheng, Aiping

    2016-01-01

    Rice [Oryza sativa (L.)] feeds more than half of the world's population. Rhizoctonia solaniis a major fungal pathogen of rice causing extreme crop losses in all rice-growing regions of the world. R. solani AG1 IA is a major cause of sheath blight in rice. In this study, we constructed a comprehensive and user-friendly web-based database, RSIADB, to analyse its draft genome and transcriptome. The database was built using the genome sequence (10,489 genes) and annotation information for R. solani AG1 IA. A total of six RNAseq samples of R. solani AG1 IA were also analysed, corresponding to 10, 18, 24, 32, 48 and 72 h after infection of rice leaves. The RSIADB database enables users to search, browse, and download gene sequences for R. solani AG1 IA, and mine the data using BLAST, Sequence Extractor, Browse and Construction Diagram tools that were integrated into the database. RSIADB is an important genomic resource for scientists working with R. solani AG1 IA and will assist researchers in analysing the annotated genome and transcriptome of this pathogen. This resource will facilitate studies on gene function, pathogenesis factors and secreted proteins, as well as provide an avenue for comparative analyses of genes expressed during different stages of infection. Database URL:http://genedenovoweb.ticp.net:81/rsia/index.php. PMID:27022158

  8. The complete mitochondrial genomes of four cockroaches (Insecta: Blattodea) and phylogenetic analyses within cockroaches.

    Science.gov (United States)

    Cheng, Xue-Fang; Zhang, Le-Ping; Yu, Dan-Na; Storey, Kenneth B; Zhang, Jia-Yong

    2016-07-15

    Three complete mitochondrial genomes of Blaberidae (Insecta: Blattodea) (Gromphadorhina portentosa, Panchlora nivea, Blaptica dubia) and one complete mt genome of Blattidae (Insecta: Blattodea) (Shelfordella lateralis) were sequenced to further understand the characteristics of cockroach mitogenomes and reconstruct the phylogenetic relationship of Blattodea. The gene order and orientation of these four cockroach genomes were similar to known cockroach mt genomes, and contained 13 protein-coding genes (PCGs), 2 ribosomal RNA (rRNA) genes, 22 transfer RNA (tRNA) genes and one control region. The mt genomes of Blattodea exhibited a characteristics of a high A+T composition (70.7%-74.3%) and dominant usage of the TAA stop codon. The AT content of the whole mt genome, PCGs and total tRNAs in G. portentosa was the lowest in known cockroaches. The presence of a 71-bp intergenic spacer region between trnQ and trnM was a unique feature in B. dubia, but absent in other cockroaches, which can be explained by the duplication/random loss model. Based on the nucleotide and amino acid datasets of the 13 PCGs genes, neighbor-joining (NJ), maximum parsimony (MP), maximum likelihood (ML) and bayesian inference (BI) analyses were used to rebuild the phylogenetic relationship of cockroaches. All phylogenetic analyses consistently placed Isoptera as the sister cluster to Cryptocercidae of Blattodea. Ectobiidae and Blaberidae (Blaberoidea) formed a sister clade to Blattidae. Corydiidae is a sister clade of all the remaining cockroach species with a high value in NJ and MP analyses of nucleotide and amino acid datasets, and ML and BI analyses of the amino acid dataset. PMID:27045773

  9. Genomic analyses of the microsporidian Nosema ceranae, an emergent pathogen of honey bees.

    Directory of Open Access Journals (Sweden)

    R Scott Cornman

    2009-06-01

    Full Text Available Recent steep declines in honey bee health have severely impacted the beekeeping industry, presenting new risks for agricultural commodities that depend on insect pollination. Honey bee declines could reflect increased pressures from parasites and pathogens. The incidence of the microsporidian pathogen Nosema ceranae has increased significantly in the past decade. Here we present a draft assembly (7.86 MB of the N. ceranae genome derived from pyrosequence data, including initial gene models and genomic comparisons with other members of this highly derived fungal lineage. N. ceranae has a strongly AT-biased genome (74% A+T and a diversity of repetitive elements, complicating the assembly. Of 2,614 predicted protein-coding sequences, we conservatively estimate that 1,366 have homologs in the microsporidian Encephalitozoon cuniculi, the most closely related published genome sequence. We identify genes conserved among microsporidia that lack clear homology outside this group, which are of special interest as potential virulence factors in this group of obligate parasites. A substantial fraction of the diminutive N. ceranae proteome consists of novel and transposable-element proteins. For a majority of well-supported gene models, a conserved sense-strand motif can be found within 15 bases upstream of the start codon; a previously uncharacterized version of this motif is also present in E. cuniculi. These comparisons provide insight into the architecture, regulation, and evolution of microsporidian genomes, and will drive investigations into honey bee-Nosema interactions.

  10. Functional and comparative genomics analyses of pmp22 in medaka fish

    Directory of Open Access Journals (Sweden)

    Kawarabayasi Yutaka

    2009-06-01

    Full Text Available Abstract Background Pmp22, a member of the junction protein family Claudin/EMP/PMP22, plays an important role in myelin formation. Increase of pmp22 transcription causes peripheral neuropathy, Charcot-Marie-Tooth disease type1A (CMT1A. The pathophysiological phenotype of CMT1A is aberrant axonal myelination which induces a reduction in nerve conduction velocity (NCV. Several CMT1A model rodents have been established by overexpressing pmp22. Thus, it is thought that pmp22 expression must be tightly regulated for correct myelin formation in mammals. Interestingly, the myelin sheath is also present in other jawed vertebrates. The purpose of this study is to analyze the evolutionary conservation of the association between pmp22 transcription level and vertebrate myelin formation, and to find the conserved non-coding sequences for pmp22 regulation by comparative genomics analyses between jawed fishes and mammals. Results A transgenic pmp22 over-expression medaka fish line was established. The transgenic fish had approximately one fifth the peripheral NCV values of controls, and aberrant myelination of transgenic fish in the peripheral nerve system (PNS was observed. We successfully confirmed that medaka fish pmp22 has the same exon-intron structure as mammals, and identified some known conserved regulatory motifs. Furthermore, we found novel conserved sequences in the first intron and 3'UTR. Conclusion Medaka fish undergo abnormalities in the PNS when pmp22 transcription increases. This result indicates that an adequate pmp22 transcription level is necessary for correct myelination of jawed vertebrates. Comparison of pmp22 orthologs between distantly related species identifies evolutionary conserved sequences that contribute to precise regulation of pmp22 expression.

  11. RSIADB, a collective resource for genome and transcriptome analyses in Rhizoctonia solani AG1 IA

    OpenAIRE

    Chen, Lei; Ai, Peng; Zhang, Jinfeng; Deng, Qiming; Wang, Shiquan; Li, Shuangcheng; Zhu, Jun; Li, Ping; Zheng, Aiping

    2016-01-01

    Rice [Oryza sativa (L.)] feeds more than half of the world’s population. Rhizoctonia solani is a major fungal pathogen of rice causing extreme crop losses in all rice-growing regions of the world. R. solani AG1 IA is a major cause of sheath blight in rice. In this study, we constructed a comprehensive and user-friendly web-based database, RSIADB, to analyse its draft genome and transcriptome. The database was built using the genome sequence (10 489 genes) and annotation information for R. sol...

  12. Post-genomic analyses of fungal lignocellulosic biomass degradation reveal the unexpected potential of the plant pathogen Ustilago maydis

    Directory of Open Access Journals (Sweden)

    Couturier Marie

    2012-02-01

    Full Text Available Abstract Background Filamentous fungi are potent biomass degraders due to their ability to thrive in ligno(hemicellulose-rich environments. During the last decade, fungal genome sequencing initiatives have yielded abundant information on the genes that are putatively involved in lignocellulose degradation. At present, additional experimental studies are essential to provide insights into the fungal secreted enzymatic pools involved in lignocellulose degradation. Results In this study, we performed a wide analysis of 20 filamentous fungi for which genomic data are available to investigate their biomass-hydrolysis potential. A comparison of fungal genomes and secretomes using enzyme activity profiling revealed discrepancies in carbohydrate active enzymes (CAZymes sets dedicated to plant cell wall. Investigation of the contribution made by each secretome to the saccharification of wheat straw demonstrated that most of them individually supplemented the industrial Trichoderma reesei CL847 enzymatic cocktail. Unexpectedly, the most striking effect was obtained with the phytopathogen Ustilago maydis that improved the release of total sugars by 57% and of glucose by 22%. Proteomic analyses of the best-performing secretomes indicated a specific enzymatic mechanism of U. maydis that is likely to involve oxido-reductases and hemicellulases. Conclusion This study provides insight into the lignocellulose-degradation mechanisms by filamentous fungi and allows for the identification of a number of enzymes that are potentially useful to further improve the industrial lignocellulose bioconversion process.

  13. Genome sequencing elucidates Sardinian genetic architecture and augments association analyses for lipid and blood inflammatory markers.

    Science.gov (United States)

    Sidore, Carlo; Busonero, Fabio; Maschio, Andrea; Porcu, Eleonora; Naitza, Silvia; Zoledziewska, Magdalena; Mulas, Antonella; Pistis, Giorgio; Steri, Maristella; Danjou, Fabrice; Kwong, Alan; Ortega Del Vecchyo, Vicente Diego; Chiang, Charleston W K; Bragg-Gresham, Jennifer; Pitzalis, Maristella; Nagaraja, Ramaiah; Tarrier, Brendan; Brennan, Christine; Uzzau, Sergio; Fuchsberger, Christian; Atzeni, Rossano; Reinier, Frederic; Berutti, Riccardo; Huang, Jie; Timpson, Nicholas J; Toniolo, Daniela; Gasparini, Paolo; Malerba, Giovanni; Dedoussis, George; Zeggini, Eleftheria; Soranzo, Nicole; Jones, Chris; Lyons, Robert; Angius, Andrea; Kang, Hyun M; Novembre, John; Sanna, Serena; Schlessinger, David; Cucca, Francesco; Abecasis, Gonçalo R

    2015-11-01

    We report ∼17.6 million genetic variants from whole-genome sequencing of 2,120 Sardinians; 22% are absent from previous sequencing-based compilations and are enriched for predicted functional consequences. Furthermore, ∼76,000 variants common in our sample (frequency >5%) are rare elsewhere (<0.5% in the 1000 Genomes Project). We assessed the impact of these variants on circulating lipid levels and five inflammatory biomarkers. We observe 14 signals, including 2 major new loci, for lipid levels and 19 signals, including 2 new loci, for inflammatory markers. The new associations would have been missed in analyses based on 1000 Genomes Project data, underlining the advantages of large-scale sequencing in this founder population. PMID:26366554

  14. Genome-wide analyses of aggressiveness in attention-deficit hyperactivity disorder.

    Science.gov (United States)

    Brevik, Erlend J; van Donkelaar, Marjolein M J; Weber, Heike; Sánchez-Mora, Cristina; Jacob, Christian; Rivero, Olga; Kittel-Schneider, Sarah; Garcia-Martínez, Iris; Aebi, Marcel; van Hulzen, Kimm; Cormand, Bru; Ramos-Quiroga, Josep A; Lesch, Klaus-Peter; Reif, Andreas; Ribasés, Marta; Franke, Barbara; Posserud, Maj-Britt; Johansson, Stefan; Lundervold, Astri J; Haavik, Jan; Zayats, Tetyana

    2016-07-01

    Aggressiveness is a behavioral trait that has the potential to be harmful to individuals and society. With an estimated heritability of about 40%, genetics is important in its development. We performed an exploratory genome-wide association (GWA) analysis of childhood aggressiveness in attention deficit hyperactivity disorder (ADHD) to gain insight into the underlying biological processes associated with this trait. Our primary sample consisted of 1,060 adult ADHD patients (aADHD). To further explore the genetic architecture of childhood aggressiveness, we performed enrichment analyses of suggestive genome-wide associations observed in aADHD among GWA signals of dimensions of oppositionality (defiant/vindictive and irritable dimensions) in childhood ADHD (cADHD). No single polymorphism reached genome-wide significance (P disorders. © 2016 The Authors. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics Published by Wiley Periodicals, Inc. PMID:27021288

  15. Comparative sequence analyses of genome and transcriptome reveal novel transcripts and variants in the Asian elephant Elephas maximus

    Indian Academy of Sciences (India)

    Puli Chandramouli Reddy; Ishani Sinha; Ashwin Kelkar; Farhat Habib; Saurabh J Pradhan; Raman Sukumar; Sanjeev Galande

    2015-12-01

    The Asian elephant Elephas maximus and the African elephant Loxodonta africana that diverged 5-7 million years ago exhibit differences in their physiology, behaviour and morphology. A comparative genomics approach would be useful and necessary for evolutionary and functional genetic studies of elephants. We performed sequencing of E. maximus and map to L. africana at ∼ 15X coverage. Through comparative sequence analyses, we have identified Asian elephant specific homozygous, non-synonymous single nucleotide variants (SNVs) that map to 1514 protein coding genes, many of which are involved in olfaction. We also present the first report of a high-coverage transcriptome sequence in E. maximus from peripheral blood lymphocytes. We have identified 103 novel protein coding transcripts and 66-long non-coding (Inc)RNAs. We also report the presence of 181 protein domains unique to elephants when compared to other Afrotheria species. Each of these findings can be further investigated to gain a better understanding of functional differences unique to elephant species, as well as those unique to elephantids in comparison with other mammals. This work therefore provides a valuable resource to explore the immense research potential of comparative analyses of transcriptome and genome sequences in the Asian elephant.

  16. Comparative genomic analyses identify the Vibrio harveyi genome sequenced strains BAA-1116 and HY01 as Vibrio campbellii

    Science.gov (United States)

    Lin, Baochuan; Wang, Zheng; Malanoski, Anthony P; O'Grady, Elizabeth A; Wimpee, Charles F; Vuddhakul, Varaporn; Alves Jr, Nelson; Thompson, Fabiano L; Gomez-Gil, Bruno; Vora, Gary J

    2010-01-01

    Three notable members of the Harveyi clade, Vibrio harveyi, Vibrio alginolyticus and Vibrio parahaemolyticus, are best known as marine pathogens of commercial and medical import. In spite of this fact, the discrimination of Harveyi clade members remains difficult due to genetic and phenotypic similarities, and this has led to misidentifications and inaccurate estimations of a species' involvement in certain environments. To begin to understand the underlying genetics that complicate species level discrimination, we compared the genomes of Harveyi clade members isolated from different environments (seawater, shrimp, corals, oysters, finfish, humans) using microarray-based comparative genomic hybridization (CGH) and multilocus sequence analyses (MLSA). Surprisingly, we found that the only two V. harveyi strains that have had their genomes sequenced (strains BAA-1116 and HY01) have themselves been misidentified. Instead of belonging to the species harveyi, they are actually members of the species campbellii. In total, 28% of the strains tested were found to be misidentified and 42% of these appear to comprise a novel species. Taken together, our findings correct a number of species misidentifications while validating the ability of both CGH and MLSA to distinguish closely related members of the Harveyi clade. PMID:20686623

  17. Phylogenetic Relationships of Tetraploid AB-Genome Avena Species Evaluated by Means of Cytogenetic (C-Banding and FISH and RAPD Analyses

    Directory of Open Access Journals (Sweden)

    E. D. Badaeva

    2010-01-01

    Full Text Available Tetraploid oat species Avena abyssinica, A. vaviloviana, A. barbata, and A. agadiriana were studied using C-banding technique, in situ hybridization with the 45S and 5S rDNA probes, and RAPD analysis in comparison with the diploid species carrying different types of the A-genome (A. wiestii, As; A. longiglumis, Al; A. canariensis, Ac; A. damascena, Ad, A. prostrata, Ap. The investigation confirmed that all four tetraploids belong to the same AB-genome group; however A. agadiriana occupies distinct position among others. The C-banding, FISH, and RAPD analyses showed that Avena abyssinica, A. vaviloviana, and A. barbata are very similar; most probably they originated from a common tetraploid ancestor as a result of minor translocations and alterations of C-banding polymorphism system. AB-genome species are closely related with the A-genome diploids, and an As-genome species may be regarded as the most probable donor of their A-genome. Although their second diploid progenitor has not been identified, it seems unlikely that it belongs to the As-genome group. The exact diploid progenitors of A. agadiriana have not been determined; however our results suggest that at least one of them could be related to A. damascena.

  18. Mitochondrial population genomic analyses reveal population structure and demography of Indian Plasmodium falciparum.

    Science.gov (United States)

    Tyagi, Suchi; Das, Aparup

    2015-09-01

    Inference on the genetic diversity of Plasmodium falciparum populations could help in better management of malaria. A very recent study with mitochondrial (mt) genomes in global P. falciparum had revealed interesting evolutionary genetic patterns of Indian isolates in comparison to global ones. However, no population genetic study using the whole mt genome sequences of P. falciparum isolates collected in the entire distribution range in India has yet been performed. We herewith have analyzed 85 whole mt genomes (48 already published and 37 entirely new) sampled from eight differentially endemic Indian locations to estimate genetic diversity and infer population structure and historical demography of Indian P. falciparum. We found 19 novel Indian-specific Single Nucleotide Polymorphisms (SNPs) and 22 novel haplotypes segregating in Indian P. falciparum. Accordingly, high haplotype and nucleotide diversities were detected in Indian P. falciparum in comparison to many other global isolates. Indian P. falciparum populations were found to be moderately sub-structured with four different genetic clusters. Interestingly, group of local populations aggregate to form each cluster; while samples from Jharkhand and Odisha formed a single cluster, P. falciparum isolates from Asom formed an independent one. Similarly, Surat, Bilaspur and Betul formed a single cluster and Goa and Mangalore formed another. Interestingly, P. falciparum isolates from the two later populations were significantly genetically differentiated from isolates collected in other six Indian locations. Signature of historical population expansion was evident in five population samples, and the onset of expansion event was found to be very similar to African P. falciparum. In agreement with the previous finding, the estimated Time to Most Recent Common Ancestor (TMRCA) and the effective population size were high in Indian P. falciparum. All these genetic features of Indian P. falciparum with high mt genome

  19. Comparative Chloroplast Genome Analyses of Streptophyte Green Algae Uncover Major Structural Alterations in the Klebsormidiophyceae, Coleochaetophyceae and Zygnematophyceae

    Science.gov (United States)

    Lemieux, Claude; Otis, Christian; Turmel, Monique

    2016-01-01

    The Streptophyta comprises all land plants and six main lineages of freshwater green algae: Mesostigmatophyceae, Chlorokybophyceae, Klebsormidiophyceae, Charophyceae, Coleochaetophyceae and Zygnematophyceae. Previous comparisons of the chloroplast genome from nine streptophyte algae (including four zygnematophyceans) revealed that, although land plant chloroplast DNAs (cpDNAs) inherited most of their highly conserved structural features from green algal ancestors, considerable cpDNA changes took place during the evolution of the Zygnematophyceae, the sister group of land plants. To gain deeper insights into the evolutionary dynamics of the chloroplast genome in streptophyte algae, we sequenced the cpDNAs of nine additional taxa: two klebsormidiophyceans (Entransia fimbriata and Klebsormidium sp. SAG 51.86), one coleocheatophycean (Coleochaete scutata) and six zygnematophyceans (Cylindrocystis brebissonii, Netrium digitus, Roya obtusa, Spirogyra maxima, Cosmarium botrytis and Closterium baillyanum). Our comparative analyses of these genomes with their streptophyte algal counterparts indicate that the large inverted repeat (IR) encoding the rDNA operon experienced loss or expansion/contraction in all three sampled classes and that genes were extensively shuffled in both the Klebsormidiophyceae and Zygnematophyceae. The klebsormidiophycean genomes boast greatly expanded IRs, with the Entransia 60,590-bp IR being the largest known among green algae. The 206,025-bp Entransia cpDNA, which is one of the largest genome among streptophytes, encodes 118 standard genes, i.e., four additional genes compared to its Klebsormidium flaccidum homolog. We inferred that seven of the 21 group II introns usually found in land plants were already present in the common ancestor of the Klebsormidiophyceae and its sister lineages. At 107,236 bp and with 117 standard genes, the Coleochaete IR-less genome is both the smallest and most compact among the streptophyte algal cpDNAs analyzed thus

  20. Cross genome comparisons of serine proteases in Arabidopsis and rice

    Directory of Open Access Journals (Sweden)

    Sowdhamini R

    2006-08-01

    Full Text Available Abstract Background Serine proteases are one of the largest groups of proteolytic enzymes found across all kingdoms of life and are associated with several essential physiological pathways. The availability of Arabidopsis thaliana and rice (Oryza sativa genome sequences has permitted the identification and comparison of the repertoire of serine protease-like proteins in the two plant species. Results Despite the differences in genome sizes between Arabidopsis and rice, we identified a very similar number of serine protease-like proteins in the two plant species (206 and 222, respectively. Nearly 40% of the above sequences were identified as potential orthologues. Atypical members could be identified in the plant genomes for Deg, Clp, Lon, rhomboid proteases and species-specific members were observed for the highly populated subtilisin and serine carboxypeptidase families suggesting multiple lateral gene transfers. DegP proteases, prolyl oligopeptidases, Clp proteases and rhomboids share a significantly higher percentage orthology between the two genomes indicating substantial evolutionary divergence was set prior to speciation. Single domain architectures and paralogues for several putative subtilisins, serine carboxypeptidases and rhomboids suggest they may have been recruited for additional roles in secondary metabolism with spatial and temporal regulation. The analysis reveals some domain architectures unique to either or both of the plant species and some inactive proteases, like in rhomboids and Clp proteases, which could be involved in chaperone function. Conclusion The systematic analysis of the serine protease-like proteins in the two plant species has provided some insight into the possible functional associations of previously uncharacterised serine protease-like proteins. Further investigation of these aspects may prove beneficial in our understanding of similar processes in commercially significant crop plant species.

  1. Genome-wide binding and mechanistic analyses of Smchd1-mediated epigenetic regulation.

    Science.gov (United States)

    Chen, Kelan; Hu, Jiang; Moore, Darcy L; Liu, Ruijie; Kessans, Sarah A; Breslin, Kelsey; Lucet, Isabelle S; Keniry, Andrew; Leong, Huei San; Parish, Clare L; Hilton, Douglas J; Lemmers, Richard J L F; van der Maarel, Silvère M; Czabotar, Peter E; Dobson, Renwick C J; Ritchie, Matthew E; Kay, Graham F; Murphy, James M; Blewitt, Marnie E

    2015-07-01

    Structural maintenance of chromosomes flexible hinge domain containing 1 (Smchd1) is an epigenetic repressor with described roles in X inactivation and genomic imprinting, but Smchd1 is also critically involved in the pathogenesis of facioscapulohumeral dystrophy. The underlying molecular mechanism by which Smchd1 functions in these instances remains unknown. Our genome-wide transcriptional and epigenetic analyses show that Smchd1 binds cis-regulatory elements, many of which coincide with CCCTC-binding factor (Ctcf) binding sites, for example, the clustered protocadherin (Pcdh) genes, where we show Smchd1 and Ctcf act in opposing ways. We provide biochemical and biophysical evidence that Smchd1-chromatin interactions are established through the homodimeric hinge domain of Smchd1 and, intriguingly, that the hinge domain also has the capacity to bind DNA and RNA. Our results suggest Smchd1 imparts epigenetic regulation via physical association with chromatin, which may antagonize Ctcf-facilitated chromatin interactions, resulting in coordinated transcriptional control. PMID:26091879

  2. Genome-wide analyses for dissecting gene regulatory networks in the shoot apical meristem.

    Science.gov (United States)

    Bustamante, Mariana; Matus, José Tomás; Riechmann, José Luis

    2016-04-01

    Shoot apical meristem activity is controlled by complex regulatory networks in which components such as transcription factors, miRNAs, small peptides, hormones, enzymes and epigenetic marks all participate. Many key genes that determine the inherent characteristics of the shoot apical meristem have been identified through genetic approaches. Recent advances in genome-wide studies generating extensive transcriptomic and DNA-binding datasets have increased our understanding of the interactions within the regulatory networks that control the activity of the meristem, identifying new regulators and uncovering connections between previously unlinked network components. In this review, we focus on recent studies that illustrate the contribution of whole genome analyses to understand meristem function. PMID:26956505

  3. Analysing earthquake slip models with the spatial prediction comparison test

    KAUST Repository

    Zhang, L.

    2014-11-10

    Earthquake rupture models inferred from inversions of geophysical and/or geodetic data exhibit remarkable variability due to uncertainties in modelling assumptions, the use of different inversion algorithms, or variations in data selection and data processing. A robust statistical comparison of different rupture models obtained for a single earthquake is needed to quantify the intra-event variability, both for benchmark exercises and for real earthquakes. The same approach may be useful to characterize (dis-)similarities in events that are typically grouped into a common class of events (e.g. moderate-size crustal strike-slip earthquakes or tsunamigenic large subduction earthquakes). For this purpose, we examine the performance of the spatial prediction comparison test (SPCT), a statistical test developed to compare spatial (random) fields by means of a chosen loss function that describes an error relation between a 2-D field (‘model’) and a reference model. We implement and calibrate the SPCT approach for a suite of synthetic 2-D slip distributions, generated as spatial random fields with various characteristics, and then apply the method to results of a benchmark inversion exercise with known solution. We find the SPCT to be sensitive to different spatial correlations lengths, and different heterogeneity levels of the slip distributions. The SPCT approach proves to be a simple and effective tool for ranking the slip models with respect to a reference model.

  4. UniPrimer: A Web-Based Primer Design Tool for Comparative Analyses of Primate Genomes

    Directory of Open Access Journals (Sweden)

    Nomin Batnyam

    2012-01-01

    Full Text Available Whole genome sequences of various primates have been released due to advanced DNA-sequencing technology. A combination of computational data mining and the polymerase chain reaction (PCR assay to validate the data is an excellent method for conducting comparative genomics. Thus, designing primers for PCR is an essential procedure for a comparative analysis of primate genomes. Here, we developed and introduced UniPrimer for use in those studies. UniPrimer is a web-based tool that designs PCR- and DNA-sequencing primers. It compares the sequences from six different primates (human, chimpanzee, gorilla, orangutan, gibbon, and rhesus macaque and designs primers on the conserved region across species. UniPrimer is linked to RepeatMasker, Primer3Plus, and OligoCalc softwares to produce primers with high accuracy and UCSC In-Silico PCR to confirm whether the designed primers work. To test the performance of UniPrimer, we designed primers on sample sequences using UniPrimer and manually designed primers for the same sequences. The comparison of the two processes showed that UniPrimer was more effective than manual work in terms of saving time and reducing errors.

  5. UniPrimer: A Web-Based Primer Design Tool for Comparative Analyses of Primate Genomes.

    Science.gov (United States)

    Batnyam, Nomin; Lee, Jimin; Lee, Jungnam; Hong, Seung Bok; Oh, Sejong; Han, Kyudong

    2012-01-01

    Whole genome sequences of various primates have been released due to advanced DNA-sequencing technology. A combination of computational data mining and the polymerase chain reaction (PCR) assay to validate the data is an excellent method for conducting comparative genomics. Thus, designing primers for PCR is an essential procedure for a comparative analysis of primate genomes. Here, we developed and introduced UniPrimer for use in those studies. UniPrimer is a web-based tool that designs PCR- and DNA-sequencing primers. It compares the sequences from six different primates (human, chimpanzee, gorilla, orangutan, gibbon, and rhesus macaque) and designs primers on the conserved region across species. UniPrimer is linked to RepeatMasker, Primer3Plus, and OligoCalc softwares to produce primers with high accuracy and UCSC In-Silico PCR to confirm whether the designed primers work. To test the performance of UniPrimer, we designed primers on sample sequences using UniPrimer and manually designed primers for the same sequences. The comparison of the two processes showed that UniPrimer was more effective than manual work in terms of saving time and reducing errors. PMID:22693428

  6. Integrative genomic analyses reveal an androgen-driven somatic alteration landscape in early-onset prostate cancer

    DEFF Research Database (Denmark)

    Weischenfeldt, Joachim; Simon, Ronald; Feuerbach, Lars; Schlangen, Karin; Weichenhan, Dieter; Minner, Sarah; Wuttig, Daniela; Warnatz, Hans-Jörg; Stehr, Henning; Rausch, Tobias; Jäger, Natalie; Gu, Lei; Bogatyrova, Olga; Stütz, Adrian M; Claus, Rainer; Eils, Jürgen; Eils, Roland; Gerhäuser, Clarissa; Huang, Po-Hsien; Hutter, Barbara; Kabbe, Rolf; Lawerenz, Christian; Radomski, Sylwester; Bartholomae, Cynthia C; Fälth, Maria; Gade, Stephan; Schmidt, Manfred; Amschler, Nina; Haß, Thomas; Galal, Rami; Gjoni, Jovisa; Kuner, Ruprecht; Baer, Constance; Masser, Sawinee; von Kalle, Christof; Zichner, Thomas; Benes, Vladimir; Raeder, Benjamin; Mader, Malte; Amstislavskiy, Vyacheslav; Avci, Meryem; Lehrach, Hans; Parkhomchuk, Dmitri; Sultan, Marc; Burkhardt, Lia; Graefen, Markus; Huland, Hartwig; Kluth, Martina; Krohn, Antje; Sirma, Hüseyin; Stumm, Laura; Steurer, Stefan; Grupp, Katharina; Sültmann, Holger; Sauter, Guido; Plass, Christoph; Brors, Benedikt; Yaspo, Marie-Laure; Korbel, Jan O; Schlomm, Thorsten

    2013-01-01

    Early-onset prostate cancer (EO-PCA) represents the earliest clinical manifestation of prostate cancer. To compare the genomic alteration landscapes of EO-PCA with "classical" (elderly-onset) PCA, we performed deep sequencing-based genomics analyses in 11 tumors diagnosed at young age, and pursue...

  7. Mitochondrial genome analyses suggest multiple Trichuris species in humans, baboons, and pigs from different geographical regions

    DEFF Research Database (Denmark)

    Hawash, Mohamed B. F.; Andersen, Lee O.; Gasser, Robin B.;

    2015-01-01

    primates. METHODS AND FINDINGS: We sequenced and annotated complete mitochondrial genomes of Trichuris recovered from a human in Uganda, an olive baboon in the US, a hamadryas baboon in Denmark, and two pigs from Denmark and Uganda. Comparative analyses using other published mitochondrial genomes of...

  8. Large-scale prokaryotic gene prediction and comparison to genome annotation

    DEFF Research Database (Denmark)

    Nielsen, Pernille; Krogh, Anders Stærmose

    2005-01-01

    Motivation: Prokaryotic genomes are sequenced and annotated at an increasing rate. The methods of annotation vary between sequencing groups. It makes genome comparison difficult and may lead to propagation of errors when questionable assignments are adapted from one genome to another. Genome...... comparison either on a large or small scale would be facilitated by using a single standard for annotation, which incorporates a transparency of why an open reading frame (ORF) is considered to be a gene. Results: A total of 143 prokaryotic genomes were scored with an updated version of the prokaryotic...

  9. Comparison of BEACON and COMPARE reactor cavity subcompartment analyses

    International Nuclear Information System (INIS)

    In this study, a more advanced best-estimate containment code, BEACON-MOD3A, was ued to calculate force and moment loads resulting from a high-energy blowdown for two reactor cavity geometries previously analyzed with the licensing computer code COMPARE-MOD1A. The BEACON force and moment loads were compared with the COMPARE results to determine the safety margins provided by the COMPARE code. The forces and moments calculated by the codes were found to be different, although not in any consistent manner, for the two reactor cavity geometries studied. Therefore, generic summary statements regarding margins cannot be made because of the effects of the detailed physical configuration. However, differences in the BEACON and COMPARE calculated forces and moments can be attributed to differences in the modeling assumptions used in the codes and the analyses

  10. Comparative Genomic and Transcriptional Analyses of CRISPR Systems Across the Genus Pyrobaculum

    Directory of Open Access Journals (Sweden)

    David L Bernick

    2012-07-01

    Full Text Available Within the domain Archaea, the CRISPR immune system appears to be nearly ubiquitous based on computational genome analyses. Initial studies in bacteria demonstrated that the CRISPR system targets invading plasmid and viral DNA. Recent experiments in the model archaeon Pyrococcus furiosus uncovered a novel RNA-targeting variant of the CRISPR system potentially unique to archaea. Because our understanding of CRISPR system evolution in other archaea is limited, we have taken a comparative genomic and transcriptomic view of the CRISPR arrays across six diverse species within the crenarchaeal genus Pyrobaculum. We present transcriptional data from each of four species in the genus (P. aerophilum, P. islandicum, P. calidifontis, P. arsenaticum, analyzing mature CRISPR-associated small RNA abundance from over 20 arrays. Within the genus, there is remarkable conservation of CRISPR array structure, as well as unique features that are have not been studied in other archaeal systems. These unique features include: a nearly invariant CRISPR promoter, conservation of direct repeat families, the 5' polarity of CRISPR-associated small RNA abundance, and a novel CRISPR-specific association with homologues of nurA and herA. These analyses provide a genus-level evolutionary perspective on archaeal CRISPR systems, broadening our understanding beyond existing non-comparative model systems.

  11. Phylogenetic analyses of complete mitochondrial genome sequences suggest a basal divergence of the enigmatic rodent Anomalurus

    Directory of Open Access Journals (Sweden)

    Gissi Carmela

    2007-02-01

    Full Text Available Abstract Background Phylogenetic relationships between Lagomorpha, Rodentia and Primates and their allies (Euarchontoglires have long been debated. While it is now generally agreed that Rodentia constitutes a monophyletic sister-group of Lagomorpha and that this clade (Glires is sister to Primates and Dermoptera, higher-level relationships within Rodentia remain contentious. Results We have sequenced and performed extensive evolutionary analyses on the mitochondrial genome of the scaly-tailed flying squirrel Anomalurus sp., an enigmatic rodent whose phylogenetic affinities have been obscure and extensively debated. Our phylogenetic analyses of the coding regions of available complete mitochondrial genome sequences from Euarchontoglires suggest that Anomalurus is a sister taxon to the Hystricognathi, and that this clade represents the most basal divergence among sampled Rodentia. Bayesian dating methods incorporating a relaxed molecular clock provide divergence-time estimates which are consistently in agreement with the fossil record and which indicate a rapid radiation within Glires around 60 million years ago. Conclusion Taken together, the data presented provide a working hypothesis as to the phylogenetic placement of Anomalurus, underline the utility of mitochondrial sequences in the resolution of even relatively deep divergences and go some way to explaining the difficulty of conclusively resolving higher-level relationships within Glires with available data and methodologies.

  12. Deciphering Clostridium tyrobutyricum Metabolism Based on the Whole-Genome Sequence and Proteome Analyses

    Science.gov (United States)

    Lee, Joungmin; Jang, Yu-Sin; Han, Mee-Jung; Kim, Jin Young

    2016-01-01

    ABSTRACT Clostridium tyrobutyricum is a Gram-positive anaerobic bacterium that efficiently produces butyric acid and is considered a promising host for anaerobic production of bulk chemicals. Due to limited knowledge on the genetic and metabolic characteristics of this strain, however, little progress has been made in metabolic engineering of this strain. Here we report the complete genome sequence of C. tyrobutyricum KCTC 5387 (ATCC 25755), which consists of a 3.07-Mbp chromosome and a 63-kbp plasmid. The results of genomic analyses suggested that C. tyrobutyricum produces butyrate from butyryl-coenzyme A (butyryl-CoA) through acetate reassimilation by CoA transferase, differently from Clostridium acetobutylicum, which uses the phosphotransbutyrylase-butyrate kinase pathway; this was validated by reverse transcription-PCR (RT-PCR) of related genes, protein expression levels, in vitro CoA transferase assay, and fed-batch fermentation. In addition, the changes in protein expression levels during the course of batch fermentations on glucose were examined by shotgun proteomics. Unlike C. acetobutylicum, the expression levels of proteins involved in glycolytic and fermentative pathways in C. tyrobutyricum did not decrease even at the stationary phase. Proteins related to energy conservation mechanisms, including Rnf complex, NfnAB, and pyruvate-phosphate dikinase that are absent in C. acetobutylicum, were identified. Such features explain why this organism can produce butyric acid to a much higher titer and better tolerate toxic metabolites. This study presenting the complete genome sequence, global protein expression profiles, and genome-based metabolic characteristics during the batch fermentation of C. tyrobutyricum will be valuable in designing strategies for metabolic engineering of this strain. PMID:27302759

  13. Application of the inter-line PCR for the analyse of genomic rearrangements in radiation-transformed mammalian cell lines

    International Nuclear Information System (INIS)

    Repetitive DNA sequences of the LINE-family (long interspersed elements) that are widely distributed among the mammalian genome can be activated or altered by the exposure to ionizing radiation [1]. By the integration at new sites in the genome alterations in the expression of genes that are involved in cell transformation and/or carcinogenesis may occur [2, 3]. A new technique -the inter-LINE PCR - has been developed in order to detect and analyse such genomic rearrangements in radiation-transformed cell lines. From the sites of transformation- or tumour-specific changes in the genome it might be possible to develop new tumour markers for diagnostic purpose. (orig.)

  14. Genome Sequence Analyses of Pseudomonas savastanoi pv. glycinea and Subtractive Hybridization-Based Comparative Genomics with Nine Pseudomonads

    OpenAIRE

    Qi, Mingsheng; Wang, Dongping; Bradley, Carl A.; Zhao, Youfu

    2011-01-01

    Bacterial blight, caused by Pseudomonas savastanoi pv. glycinea (Psg), is a common disease of soybean. In an effort to compare a current field isolate with one isolated in the early 1960s, the genomes of two Psg strains, race 4 and B076, were sequenced using 454 pyrosequencing. The genomes of both Psg strains share more than 4,900 highly conserved genes, indicating very low genetic diversity between Psg genomes. Though conserved, genome rearrangements and recombination events occur commonly w...

  15. Functional Analysis of Shewanella, a cross genome comparison.

    Energy Technology Data Exchange (ETDEWEB)

    Serres, Margrethe H.

    2009-05-15

    The bacterial genus Shewanella includes a group of highly versatile organisms that have successfully adapted to life in many environments ranging from aquatic (fresh and marine) to sedimentary (lake and marine sediments, subsurface sediments, sea vent). A unique respiratory capability of the Shewanellas, initially observed for Shewanella oneidensis MR-1, is the ability to use metals and metalloids, including radioactive compounds, as electron acceptors. Members of the Shewanella genus have also been shown to degrade environmental pollutants i.e. halogenated compounds, making this group highly applicable for the DOE mission. S. oneidensis MR-1 has in addition been found to utilize a diverse set of nutrients and to have a large set of genes dedicated to regulation and to sensing of the environment. The sequencing of the S. oneidensis MR-1 genome facilitated experimental and bioinformatics analyses by a group of collaborating researchers, the Shewanella Federation. Through the joint effort and with support from Department of Energy S. oneidensis MR-1 has become a model organism of study. Our work has been a functional analysis of S. oneidensis MR-1, both by itself and as part of a comparative study. We have improved the annotation of gene products, assigned metabolic functions, and analyzed protein families present in S. oneidensis MR-1. The data has been applied to analysis of experimental data (i.e. gene expression, proteome) generated for S. oneidensis MR-1. Further, this work has formed the basis for a comparative study of over 20 members of the Shewanella genus. The species and strains selected for genome sequencing represented an evolutionary gradient of DNA relatedness, ranging from close to intermediate, and to distant. The organisms selected have also adapted to a variety of ecological niches. Through our work we have been able to detect and interpret genome similarities and differences between members of the genus. We have in this way contributed to the

  16. CAGO: a software tool for dynamic visual comparison and correlation measurement of genome organization.

    Directory of Open Access Journals (Sweden)

    Yi-Feng Chang

    Full Text Available CAGO (Comparative Analysis of Genome Organization is developed to address two critical shortcomings of conventional genome atlas plotters: lack of dynamic exploratory functions and absence of signal analysis for genomic properties. With dynamic exploratory functions, users can directly manipulate chromosome tracks of a genome atlas and intuitively identify distinct genomic signals by visual comparison. Signal analysis of genomic properties can further detect inconspicuous patterns from noisy genomic properties and calculate correlations between genomic properties across various genomes. To implement dynamic exploratory functions, CAGO presents each genome atlas in Scalable Vector Graphics (SVG format and allows users to interact with it using a SVG viewer through JavaScript. Signal analysis functions are implemented using R statistical software and a discrete wavelet transformation package waveslim. CAGO is not only a plotter for generating complex genome atlases, but also a platform for exploring genome atlases with dynamic exploratory functions for visual comparison and with signal analysis for comparing genomic properties across multiple organisms. The web-based application of CAGO, its source code, user guides, video demos, and live examples are publicly available and can be accessed at http://cbs.ym.edu.tw/cago.

  17. CAGO: A Software Tool for Dynamic Visual Comparison and Correlation Measurement of Genome Organization

    Science.gov (United States)

    Chang, Yi-Feng; Chang, Chuan-Hsiung

    2011-01-01

    CAGO (Comparative Analysis of Genome Organization) is developed to address two critical shortcomings of conventional genome atlas plotters: lack of dynamic exploratory functions and absence of signal analysis for genomic properties. With dynamic exploratory functions, users can directly manipulate chromosome tracks of a genome atlas and intuitively identify distinct genomic signals by visual comparison. Signal analysis of genomic properties can further detect inconspicuous patterns from noisy genomic properties and calculate correlations between genomic properties across various genomes. To implement dynamic exploratory functions, CAGO presents each genome atlas in Scalable Vector Graphics (SVG) format and allows users to interact with it using a SVG viewer through JavaScript. Signal analysis functions are implemented using R statistical software and a discrete wavelet transformation package waveslim. CAGO is not only a plotter for generating complex genome atlases, but also a platform for exploring genome atlases with dynamic exploratory functions for visual comparison and with signal analysis for comparing genomic properties across multiple organisms. The web-based application of CAGO, its source code, user guides, video demos, and live examples are publicly available and can be accessed at http://cbs.ym.edu.tw/cago. PMID:22114666

  18. Genome-wide meta-analyses identify multiple loci associated with smoking behavior.

    LENUS (Irish Health Repository)

    2010-05-01

    Consistent but indirect evidence has implicated genetic factors in smoking behavior. We report meta-analyses of several smoking phenotypes within cohorts of the Tobacco and Genetics Consortium (n = 74,053). We also partnered with the European Network of Genetic and Genomic Epidemiology (ENGAGE) and Oxford-GlaxoSmithKline (Ox-GSK) consortia to follow up the 15 most significant regions (n > 140,000). We identified three loci associated with number of cigarettes smoked per day. The strongest association was a synonymous 15q25 SNP in the nicotinic receptor gene CHRNA3 (rs1051730[A], beta = 1.03, standard error (s.e.) = 0.053, P = 2.8 x 10(-73)). Two 10q25 SNPs (rs1329650[G], beta = 0.367, s.e. = 0.059, P = 5.7 x 10(-10); and rs1028936[A], beta = 0.446, s.e. = 0.074, P = 1.3 x 10(-9)) and one 9q13 SNP in EGLN2 (rs3733829[G], beta = 0.333, s.e. = 0.058, P = 1.0 x 10(-8)) also exceeded genome-wide significance for cigarettes per day. For smoking initiation, eight SNPs exceeded genome-wide significance, with the strongest association at a nonsynonymous SNP in BDNF on chromosome 11 (rs6265[C], odds ratio (OR) = 1.06, 95% confidence interval (Cl) 1.04-1.08, P = 1.8 x 10(-8)). One SNP located near DBH on chromosome 9 (rs3025343[G], OR = 1.12, 95% Cl 1.08-1.18, P = 3.6 x 10(-8)) was significantly associated with smoking cessation.

  19. Extending pathways and processes using molecular interaction networks to analyse cancer genome data

    Directory of Open Access Journals (Sweden)

    Krasnogor Natalio

    2010-12-01

    Full Text Available Abstract Background Cellular processes and pathways, whose deregulation may contribute to the development of cancers, are often represented as cascades of proteins transmitting a signal from the cell surface to the nucleus. However, recent functional genomic experiments have identified thousands of interactions for the signalling canonical proteins, challenging the traditional view of pathways as independent functional entities. Combining information from pathway databases and interaction networks obtained from functional genomic experiments is therefore a promising strategy to obtain more robust pathway and process representations, facilitating the study of cancer-related pathways. Results We present a methodology for extending pre-defined protein sets representing cellular pathways and processes by mapping them onto a protein-protein interaction network, and extending them to include densely interconnected interaction partners. The added proteins display distinctive network topological features and molecular function annotations, and can be proposed as putative new components, and/or as regulators of the communication between the different cellular processes. Finally, these extended pathways and processes are used to analyse their enrichment in pancreatic mutated genes. Significant associations between mutated genes and certain processes are identified, enabling an analysis of the influence of previously non-annotated cancer mutated genes. Conclusions The proposed method for extending cellular pathways helps to explain the functions of cancer mutated genes by exploiting the synergies of canonical knowledge and large-scale interaction data.

  20. Genome-wide binding and mechanistic analyses of Smchd1-mediated epigenetic regulation

    Science.gov (United States)

    Chen, Kelan; Hu, Jiang; Moore, Darcy L.; Liu, Ruijie; Kessans, Sarah A.; Breslin, Kelsey; Lucet, Isabelle S.; Keniry, Andrew; Leong, Huei San; Parish, Clare L.; Hilton, Douglas J.; Lemmers, Richard J. L. F.; van der Maarel, Silvère M.; Czabotar, Peter E.; Dobson, Renwick C. J.; Ritchie, Matthew E.; Kay, Graham F.; Murphy, James M.; Blewitt, Marnie E.

    2015-01-01

    Structural maintenance of chromosomes flexible hinge domain containing 1 (Smchd1) is an epigenetic repressor with described roles in X inactivation and genomic imprinting, but Smchd1 is also critically involved in the pathogenesis of facioscapulohumeral dystrophy. The underlying molecular mechanism by which Smchd1 functions in these instances remains unknown. Our genome-wide transcriptional and epigenetic analyses show that Smchd1 binds cis-regulatory elements, many of which coincide with CCCTC-binding factor (Ctcf) binding sites, for example, the clustered protocadherin (Pcdh) genes, where we show Smchd1 and Ctcf act in opposing ways. We provide biochemical and biophysical evidence that Smchd1–chromatin interactions are established through the homodimeric hinge domain of Smchd1 and, intriguingly, that the hinge domain also has the capacity to bind DNA and RNA. Our results suggest Smchd1 imparts epigenetic regulation via physical association with chromatin, which may antagonize Ctcf-facilitated chromatin interactions, resulting in coordinated transcriptional control. PMID:26091879

  1. Integrative genome analyses identify key somatic driver mutations of small cell lung cancer

    Science.gov (United States)

    Peifer, Martin; Fernández-Cuesta, Lynnette; Sos, Martin L; George, Julie; Seidel, Danila; Kasper, Lawryn H; Plenker, Dennis; Leenders, Frauke; Sun, Ruping; Zander, Thomas; Menon, Roopika; Koker, Mirjam; Dahmen, Ilona; Müller, Christian; Di Cerbo, Vincenzo; Schildhaus, Hans-Ulrich; Altmüller, Janine; Baessmann, Ingelore; Becker, Christian; de Wilde, Bram; Vandesompele, Jo; Böhm, Diana; Ansén, Sascha; Gabler, Franziska; Wilkening, Ines; Heynck, Stefanie; Heuckmann, Johannes M; Lu, Xin; Carter, Scott L; Cibulskis, Kristian; Banerji, Shantanu; Getz, Gad; Park, Kwon-Sik; Rauh, Daniel; Grütter, Christian; Fischer, Matthias; Pasqualucci, Laura; Wright, Gavin; Wainer, Zoe; Russell, Prudence; Petersen, Iver; Chen, Yuan; Stoelben, Erich; Ludwig, Corinna; Schnabel, Philipp; Hoffmann, Hans; Muley, Thomas; Brockmann, Michael; Engel-Riedel, Walburga; Muscarella, Lucia A; Fazio, Vito M; Groen, Harry; Timens, Wim; Sietsma, Hannie; Thunnissen, Erik; Smit, Egbert; Heideman, Daniëlle AM; Snijders, Peter JF; Cappuzzo, Federico; Ligorio, Claudia; Damiani, Stefania; Field, John; Solberg, Steinar; Brustugun, Odd Terje; Lund-Iversen, Marius; Sänger, Jörg; Clement, Joachim H; Soltermann, Alex; Moch, Holger; Weder, Walter; Solomon, Benjamin; Soria, Jean-Charles; Validire, Pierre; Besse, Benjamin; Brambilla, Elisabeth; Brambilla, Christian; Lantuejoul, Sylvie; Lorimier, Philippe; Schneider, Peter M; Hallek, Michael; Pao, William; Meyerson, Matthew; Sage, Julien; Shendure, Jay; Schneider, Robert; Büttner, Reinhard; Wolf, Jürgen; Nürnberg, Peter; Perner, Sven; Heukamp, Lukas C; Brindle, Paul K; Haas, Stefan; Thomas, Roman K

    2016-01-01

    Small-cell lung cancer (SCLC) is an aggressive lung tumor subtype with poor survival1–3. We sequenced 29 SCLC exomes, two genomes and 15 transcriptomes and found an extremely high mutation rate of 7.4±1 protein-changing mutations per million basepairs. Therefore, we conducted integrated analyses of the various data sets to identify pathogenetically relevant mutated genes. In all cases we found evidence for inactivation of TP53 and RB1 and identified recurrent mutations in histone-modifying genes, CREBBP, EP300, and MLL. Furthermore, we observed mutations in PTEN, in SLIT2, and EPHA7, as well as focal amplifications of the FGFR1 tyrosine kinase gene. Finally, we detected many of the alterations found in humans in SCLC tumors from p53/Rb1-deficient mice4. Our study implicates histone modification as a major feature of SCLC, reveals potentially therapeutically tractable genome alterations, and provides a generalizable framework for identification of biologically relevant genes in the context of high mutational background. PMID:22941188

  2. In silico phylogenetic and virulence gene profile analyses of avian pathogenic Escherichia coli genome sequences

    Directory of Open Access Journals (Sweden)

    Thaís C.G. Rojas

    2014-02-01

    Full Text Available Avian pathogenic Escherichia coli (APEC infections are responsible for significant losses in the poultry industry worldwide. A zoonotic risk has been attributed to APEC strains because they present similarities to extraintestinal pathogenic E. coli (ExPEC associated with illness in humans, mainly urinary tract infections and neonatal meningitis. Here, we present in silico analyses with pathogenic E. coli genome sequences, including recently available APEC genomes. The phylogenetic tree, based on multi-locus sequence typing (MLST of seven housekeeping genes, revealed high diversity in the allelic composition. Nevertheless, despite this diversity, the phylogenetic tree was able to cluster the different pathotypes together. An in silico virulence gene profile was also determined for each of these strains, through the presence or absence of 83 well-known virulence genes/traits described in pathogenic E. coli strains. The MLST phylogeny and the virulence gene profiles demonstrated a certain genetic similarity between Brazilian APEC strains, APEC isolated in the United States, UPEC (uropathogenic E. coli and diarrheagenic strains isolated from humans. This correlation corroborates and reinforces the zoonotic potential hypothesis proposed to APEC.

  3. Crowdsourcing genomic analyses of ash and ash dieback – power to the people

    Directory of Open Access Journals (Sweden)

    MacLean Dan

    2013-02-01

    Full Text Available Abstract Ash dieback is a devastating fungal disease of ash trees that has swept across Europe and recently reached the UK. This emergent pathogen has received little study in the past and its effect threatens to overwhelm the ash population. In response to this we have produced some initial genomics datasets and taken the unusual step of releasing them to the scientific community for analysis without first performing our own. In this manner we hope to ‘crowdsource’ analyses and bring the expertise of the community to bear on this problem as quickly as possible. Our data has been released through our website at oadb.tsl.ac.uk and a public GitHub repository.

  4. Comparison of genomic abnormalities between BRCAX and sporadic breast cancers studied by comparative genomic hybridization.

    Science.gov (United States)

    Gronwald, Jacek; Jauch, Anna; Cybulski, Cezary; Schoell, Brigitte; Böhm-Steuer, Barbara; Lener, Marcin; Grabowska, Ewa; Górski, Bohdan; Jakubowska, Anna; Domagała, Wenancjusz; Chosia, Maria; Scott, Rodney J; Lubiński, Jan

    2005-03-20

    Very little is known about the chromosomal regions harbouring genes involved in initiation and progression of BRCAX-associated breast cancers. We applied comparative genomic hybridization (CGH) to identify the most frequent genomic imbalances in 18 BRCAX hereditary breast cancers and compared them to chromosomal aberrations detected in a group of 27 sporadic breast cancers. The aberrations observed most frequently in BRCAX tumours were gains of 8q (83%), 19q (67%), 19p (61%), 20q (61%), 1q (56%), 17q (56%) and losses of 8p (56%), 11q (44%) and 13q (33%). The sporadic cases most frequently showed gains of 1q (67%), 8q (48%), 17q (37%), 16p (33%), 19q (33%) and losses of 11q (26%), 8p (22%) and 16q (19%). Losses of 8p and gains 8q, 19 as well as gains of 20q (with respect to ductal tumours only) were detected significantly more often in BRCAX than in sporadic breast cancers. Analysis of 8p-losses and 8q-gains showed that these aberrations are early events in the tumorigenesis of BRCAX tumors. The findings of this report indicate similarities between BRCAX and BRCA2 tumours, possibly suggesting a common pathway of disease. These findings need confirmation by more extensive studies because only a limited number of cases were analysed and there are relatively few reports published. PMID:15540206

  5. Genomic predictors of remission to antidepressant treatment in geriatric depression using genome-wide expression analyses: A pilot study

    OpenAIRE

    Eyre, HA; Eskin, A; Nelson, SF; St Cyr, NM; Siddarth, P.; Baune, BT; Lavretsky, H.

    2016-01-01

    © 2015 John Wiley & Sons, Ltd. Objective: This first pilot study of genome-wide expression as predictor of antidepressant response in late-life depression examined genome-wide transcriptional profiles in a randomized placebo-controlled trial of combined methylphenidate and citalopram. Methods: Genome-wide transcriptional profiles were examined in peripheral blood leukocytes sampled at baseline and 16weeks from 35 older adults with major depression, who were randomized to methylphenidate+cital...

  6. A Microarray Based Genomic Hybridization Method for Identification of New Genes in Plants: Case Analyses of Arabidopsis and Oryza

    Institute of Scientific and Technical Information of China (English)

    Chuanzhu Fan; Maria D. Vibranovski; Ying Chen; Manyuan Long

    2007-01-01

    To systematically estimate the gene duplication events in closely related species, we have to use comparative genomic approaches, either through genomic sequence comparison or comparative genomic hybridization (CGH). Given the scarcity of complete genomic sequences of plant species, in the present study we adopted an array based CGH to investigate gene duplications in the genus Arabidopsis. Fragment genomic DNA from four species, namely Arabidopsis thaliana, A. lyrata subsp. lyrata, A. lyrata subsp. petraea, and A. halleri, was hybridized to Affymetrix (Santa Clara, CA, USA) tiling arrays that are designed from the genomic sequences of A. thaliana. Pairwise comparisons of signal intensity were made to infer the potential duplicated candidates along each phylo-genetic branch. Ninety-four potential candidates of gene duplication along the genus were identified. Among them, the majority (69 of 94) were A. thaliana lineage specific. This result indicates that the array based CGH approach may be used to identify candidates of duplication in other plant genera containing closely related species, such as Oryza, particularly for the AA genome species. We compared the degree of gene duplication through retrotransposon between O. sativa and A. thaliana and found a strikingly higher number of chimera retroposed genes in rice. The higher rate of gene duplication through retroposition and other mechanisms may indicate that the grass species is able to adapt to more diverse environments.

  7. Genome Sequence of Azospirillum brasilense CBG497 and Comparative Analyses of Azospirillum Core and Accessory Genomes provide Insight into Niche Adaptation

    Directory of Open Access Journals (Sweden)

    Victor González

    2012-09-01

    Full Text Available Bacteria of the genus Azospirillum colonize roots of important cereals and grasses, and promote plant growth by several mechanisms, notably phytohormone synthesis. The genomes of several Azospirillum strains belonging to different species, isolated from various host plants and locations, were recently sequenced and published. In this study, an additional genome of an A. brasilense strain, isolated from maize grown on an alkaline soil in the northeast of Mexico, strain CBG497, was obtained. Comparative genomic analyses were performed on this new genome and three other genomes (A. brasilense Sp245, A. lipoferum 4B and Azospirillum sp. B510. The Azospirillum core genome was established and consists of 2,328 proteins, representing between 30% to 38% of the total encoded proteins within a genome. It is mainly chromosomally-encoded and contains 74% of genes of ancestral origin shared with some aquatic relatives. The non-ancestral part of the core genome is enriched in genes involved in signal transduction, in transport and in metabolism of carbohydrates and amino-acids, and in surface properties features linked to adaptation in fluctuating environments, such as soil and rhizosphere. Many genes involved in colonization of plant roots, plant-growth promotion (such as those involved in phytohormone biosynthesis, and properties involved in rhizosphere adaptation (such as catabolism of phenolic compounds, uptake of iron are restricted to a particular strain and/or species, strongly suggesting niche-specific adaptation.

  8. Secure distributed genome analysis for GWAS and sequence comparison computation

    OpenAIRE

    Zhang, Yihua; Blanton, Marina; Almashaqbeh, Ghada

    2015-01-01

    Background The rapid increase in the availability and volume of genomic data makes significant advances in biomedical research possible, but sharing of genomic data poses challenges due to the highly sensitive nature of such data. To address the challenges, a competition for secure distributed processing of genomic data was organized by the iDASH research center. Methods In this work we propose techniques for securing computation with real-life genomic data for minor allele frequency and chi-...

  9. Comparison of Numerical Analyses with a Static Load Test of a Continuous Flight Auger Pile

    Directory of Open Access Journals (Sweden)

    Hoľko Michal

    2014-12-01

    Full Text Available The article deals with numerical analyses of a Continuous Flight Auger (CFA pile. The analyses include a comparison of calculated and measured load-settlement curves as well as a comparison of the load distribution over a pile's length. The numerical analyses were executed using two types of software, i.e., Ansys and Plaxis, which are based on FEM calculations. Both types of software are different from each other in the way they create numerical models, model the interface between the pile and soil, and use constitutive material models. The analyses have been prepared in the form of a parametric study, where the method of modelling the interface and the material models of the soil are compared and analysed.

  10. Pan-genome analyses identify lineage- and niche-specific markers of evolution and adaptation in Epsilonproteobacteria

    Directory of Open Access Journals (Sweden)

    Ying eZhang

    2014-03-01

    Full Text Available The rapidly increasing availability of complete bacterial genomes has created new opportunities for reconstructing bacterial evolution, but it has also highlighted the difficulty to fully understand the genomic and functional variations occurring among different lineages. Using the class Epsilonproteobacteria as a case study, we investigated the composition, flexibility, and function of its pan-genomes. Models were constructed to extrapolate the expansion of pan-genomes at three different taxonomic levels. The results show that, for Epsilonproteobacteria the seemingly large genome variations among strains of the same species are less noticeable when compared with groups at higher taxonomic ranks, indicating that genome stability is imposed by the potential existence of taxonomic boundaries. The analyses of pan-genomes has also defined a set of universally conserved core genes, based on which a phylogenetic tree was constructed to confirm that thermophilic species from deep-sea hydrothermal vents represent the most ancient lineages of Epsilonproteobacteria. Moreover, by comparing the flexible genome of a chemoautotrophic deep-sea vent species to 1 genomes of species belonging to the same genus, but inhabiting different environments, and 2 genomes of other vent species, but belonging to different genera, we were able to delineate the relative importance of lineage-specific versus niche-specific genes. This result not only emphasizes the overall importance of phylogenetic proximity in shaping the variable part of the genome, but also highlights the adaptive functions of niche-specific genes. Overall, by modeling the expansion of pan-genomes and analyzing core and flexible genes, this study provides snapshots on how the complex processes of gene acquisition, conservation, and removal affect the evolution of different species, and contribute to the metabolic diversity and versatility of Epsilonproteobacteria.

  11. Genome and Phenotype Microarray Analyses of Rhodococcus sp. BCP1 and Rhodococcus opacus R7: Genetic Determinants and Metabolic Abilities with Environmental Relevance

    Science.gov (United States)

    D’Ursi, Pasqualina; Milanesi, Luciano; Di Canito, Alessandra; Zampolli, Jessica; Collina, Elena; Decorosi, Francesca; Viti, Carlo; Fedi, Stefano; Presentato, Alessandro; Zannoni, Davide; Di Gennaro, Patrizia

    2015-01-01

    In this paper comparative genome and phenotype microarray analyses of Rhodococcus sp. BCP1 and Rhodococcus opacus R7 were performed. Rhodococcus sp. BCP1 was selected for its ability to grow on short-chain n-alkanes and R. opacus R7 was isolated for its ability to grow on naphthalene and on o-xylene. Results of genome comparison, including BCP1, R7, along with other Rhodococcus reference strains, showed that at least 30% of the genome of each strain presented unique sequences and only 50% of the predicted proteome was shared. To associate genomic features with metabolic capabilities of BCP1 and R7 strains, hundreds of different growth conditions were tested through Phenotype Microarray, by using Biolog plates and plates manually prepared with additional xenobiotic compounds. Around one-third of the surveyed carbon sources was utilized by both strains although R7 generally showed higher metabolic activity values compared to BCP1. Moreover, R7 showed broader range of nitrogen and sulphur sources. Phenotype Microarray data were combined with genomic analysis to genetically support the metabolic features of the two strains. The genome analysis allowed to identify some gene clusters involved in the metabolism of the main tested xenobiotic compounds. Results show that R7 contains multiple genes for the degradation of a large set of aromatic and PAHs compounds, while a lower variability in terms of genes predicted to be involved in aromatic degradation was found in BCP1. This genetic feature can be related to the strong genetic pressure exerted by the two different environment from which the two strains were isolated. According to this, in the BCP1 genome the smo gene cluster involved in the short-chain n-alkanes degradation, is included in one of the unique regions and it is not conserved in the Rhodococcus strains compared in this work. Data obtained underline the great potential of these two Rhodococcus spp. strains for biodegradation and environmental decontamination

  12. SYSTEMS BIOLOGY ANALYSES OF GENE EXPRESSION AND GENOME WIDE ASSOCIATION STUDY DATA IN OBSTRUCTIVE SLEEP APNEA

    Science.gov (United States)

    LIU, YU; PATEL, SANJAY; NIBBE, ROD; MAXWELL, SEAN; CHOWDHURY, SALIM A.; KOYUTURK, MEHMET; ZHU, XIAOFENG; LARKIN, EMMA K.; BUXBAUM, SARAH G; PUNJABI, NARESH M.; GHARIB, SINA A.; REDLINE, SUSAN; CHANCE, MARK R.

    2015-01-01

    The precise molecular etiology of obstructive sleep apnea (OSA) is unknown; however recent research indicates that several interconnected aberrant pathways and molecular abnormalities are contributors to OSA. Identifying the genes and pathways associated with OSA can help to expand our understanding of the risk factors for the disease as well as provide new avenues for potential treatment. Towards these goals, we have integrated relevant high dimensional data from various sources, such as genome-wide expression data (microarray), protein-protein interaction (PPI) data and results from genome-wide association studies (GWAS) in order to define sub-network elements that connect some of the known pathways related to the disease as well as define novel regulatory modules related to OSA. Two distinct approaches are applied to identify sub-networks significantly associated with OSA. In the first case we used a biased approach based on sixty genes/proteins with known associations with sleep disorders and/or metabolic disease to seed a search using commercial software to discover networks associated with disease followed by information theoretic (mutual information) scoring of the sub-networks. In the second case we used an unbiased approach and generated an interactome constructed from publicly available gene expression profiles and PPI databases, followed by scoring of the network with p-values from GWAS data derived from OSA patients to uncover sub-networks significant for the disease phenotype. A comparison of the approaches reveals a number of proteins that have been previously known to be associated with OSA or sleep. In addition, our results indicate a novel association of Phosphoinositide 3-kinase, the STAT family of proteins and its related pathways with OSA. PMID:21121029

  13. CoCoNUT: an efficient system for the comparison and analysis of genomes

    Directory of Open Access Journals (Sweden)

    Kurtz Stefan

    2008-11-01

    Full Text Available Abstract Background Comparative genomics is the analysis and comparison of genomes from different species. This area of research is driven by the large number of sequenced genomes and heavily relies on efficient algorithms and software to perform pairwise and multiple genome comparisons. Results Most of the software tools available are tailored for one specific task. In contrast, we have developed a novel system CoCoNUT (Computational Comparative geNomics Utility Toolkit that allows solving several different tasks in a unified framework: (1 finding regions of high similarity among multiple genomic sequences and aligning them, (2 comparing two draft or multi-chromosomal genomes, (3 locating large segmental duplications in large genomic sequences, and (4 mapping cDNA/EST to genomic sequences. Conclusion CoCoNUT is competitive with other software tools w.r.t. the quality of the results. The use of state of the art algorithms and data structures allows CoCoNUT to solve comparative genomics tasks more efficiently than previous tools. With the improved user interface (including an interactive visualization component, CoCoNUT provides a unified, versatile, and easy-to-use software tool for large scale studies in comparative genomics.

  14. Comparative analyses of chloroplast genome data representing nine green algae in Sphaeropleales (Chlorophyceae, Chlorophyta)

    OpenAIRE

    Fučíková, Karolina; Lewis, Louise A.; Lewis, Paul O.

    2016-01-01

    The chloroplast genomes of green algae are highly variable in their architecture. In this article we summarize gene content across newly obtained and published chloroplast genomes in Chlorophyceae, including new data from nine of species in Sphaeropleales (Chlorophyceae, Chlorophyta). We present genome architecture information, including genome synteny analysis across two groups of species. Also, we provide a phylogenetic tree obtained from analysis of gene order data for species in Chlorophy...

  15. The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses

    OpenAIRE

    Zhang, Yanjun; Du, Liuwen; Liu, Ao; Chen, Jianjun; Wu, Li; Hu, Weiming; Zhang, Wei; Kim, Kyunghee; Lee, Sang-Choon; Yang, Tae-Jin; Wang, Ying

    2016-01-01

    Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly, which was also the first comprehensive cp genome analysis on Epimedium combining the cp genome sequence of E. koreanum previously reported. The five Epimedium cp genomes exhibited typical quadripartite and circular struct...

  16. X-ray CT analyses, models and numerical simulations: a comparison with petrophysical analyses in an experimental CO2 study

    Science.gov (United States)

    Henkel, Steven; Pudlo, Dieter; Enzmann, Frieder; Reitenbach, Viktor; Albrecht, Daniel; Ganzer, Leonhard; Gaupp, Reinhard

    2016-06-01

    An essential part of the collaborative research project H2STORE (hydrogen to store), which is funded by the German government, was a comparison of various analytical methods for characterizing reservoir sandstones from different stratigraphic units. In this context Permian, Triassic and Tertiary reservoir sandstones were analysed. Rock core materials, provided by RWE Gasspeicher GmbH (Dortmund, Germany), GDF Suez E&P Deutschland GmbH (Lingen, Germany), E.ON Gas Storage GmbH (Essen, Germany) and RAG Rohöl-Aufsuchungs Aktiengesellschaft (Vienna, Austria), were processed by different laboratory techniques; thin sections were prepared, rock fragments were crushed and cubes of 1 cm edge length and plugs 3 to 5 cm in length with a diameter of about 2.5 cm were sawn from macroscopic homogeneous cores. With this prepared sample material, polarized light microscopy and scanning electron microscopy, coupled with image analyses, specific surface area measurements (after Brunauer, Emmet and Teller, 1938; BET), He-porosity and N2-permeability measurements and high-resolution microcomputer tomography (μ-CT), which were used for numerical simulations, were applied. All these methods were practised on most of the same sample material, before and on selected Permian sandstones also after static CO2 experiments under reservoir conditions. A major concern in comparing the results of these methods is an appraisal of the reliability of the given porosity, permeability and mineral-specific reactive (inner) surface area data. The CO2 experiments modified the petrophysical as well as the mineralogical/geochemical rock properties. These changes are detectable by all applied analytical methods. Nevertheless, a major outcome of the high-resolution μ-CT analyses and following numerical data simulations was that quite similar data sets and data interpretations were maintained by the different petrophysical standard methods. Moreover, the μ-CT analyses are not only time saving, but also non

  17. Genomic and Haplotype Comparison of Butanol Producing Bacteria Based on 16S rDNA

    OpenAIRE

    Ekwan Nofa Wiratno; Suharjono; Agustin Krisna Wardani

    2016-01-01

    High butanol demand for transportation fuel triggers butanol production development. Exploration of butanolproducing bacteria using genomic comparison and biogeography will help to develop butanol industry. The objectives of this research were butanol production, genome comparison and haplotype analysis of butanolproducing bacteria from Ranu Pani Lake sediment using 16S rDNA sequences. The highest butanol concentrations were showed by Paenibacillus polymyxa RP 2.2 isolate (10.34 g...

  18. Comparative genomic characterization of three Streptococcus parauberis strains in fish pathogen, as assessed by wide-genome analyses.

    Directory of Open Access Journals (Sweden)

    Seong-Won Nho

    Full Text Available Streptococcus parauberis, which is the main causative agent of streptococcosis among olive flounder (Paralichthys olivaceus in northeast Asia, can be distinctly divided into two groups (type I and type II by an agglutination test. Here, the whole genome sequences of two Japanese strains (KRS-02083 and KRS-02109 were determined and compared with the previously determined genome of a Korean strain (KCTC 11537. The genomes of S. parauberis are intermediate in size and have lower GC contents than those of other streptococci. We annotated 2,236 and 2,048 genes in KRS-02083 and KRS-02109, respectively. Our results revealed that the three S. parauberis strains contain different genomic insertions and deletions. In particular, the genomes of Korean and Japanese strains encode different factors for sugar utilization; the former encodes the phosphotransferase system (PTS for sorbose, whereas the latter encodes proteins for lactose hydrolysis, respectively. And the KRS-02109 strain, specifically, was the type II strain found to be able to resist phage infection through the clustered regularly interspaced short palindromic repeats (CRISPR/Cas system and which might contribute valuably to serologically distribution. Thus, our genome-wide association study shows that polymorphisms can affect pathogen responses, providing insight into biological/biochemical pathways and phylogenetic diversity.

  19. Comparative genomic analyses of nickel, cobalt and vitamin B12 utilization

    Directory of Open Access Journals (Sweden)

    Gelfand Mikhail S

    2009-02-01

    Full Text Available Abstract Background Nickel (Ni and cobalt (Co are trace elements required for a variety of biological processes. Ni is directly coordinated by proteins, whereas Co is mainly used as a component of vitamin B12. Although a number of Ni and Co-dependent enzymes have been characterized, systematic evolutionary analyses of utilization of these metals are limited. Results We carried out comparative genomic analyses to examine occurrence and evolutionary dynamics of the use of Ni and Co at the level of (i transport systems, and (ii metalloproteomes. Our data show that both metals are widely used in bacteria and archaea. Cbi/NikMNQO is the most common prokaryotic Ni/Co transporter, while Ni-dependent urease and Ni-Fe hydrogenase, and B12-dependent methionine synthase (MetH, ribonucleotide reductase and methylmalonyl-CoA mutase are the most widespread metalloproteins for Ni and Co, respectively. Occurrence of other metalloenzymes showed a mosaic distribution and a new B12-dependent protein family was predicted. Deltaproteobacteria and Methanosarcina generally have larger Ni- and Co-dependent proteomes. On the other hand, utilization of these two metals is limited in eukaryotes, and very few of these organisms utilize both of them. The Ni-utilizing eukaryotes are mostly fungi (except saccharomycotina and plants, whereas most B12-utilizing organisms are animals. The NiCoT transporter family is the most widespread eukaryotic Ni transporter, and eukaryotic urease and MetH are the most common Ni- and B12-dependent enzymes, respectively. Finally, investigation of environmental and other conditions and identity of organisms that show dependence on Ni or Co revealed that host-associated organisms (particularly obligate intracellular parasites and endosymbionts have a tendency for loss of Ni/Co utilization. Conclusion Our data provide information on the evolutionary dynamics of Ni and Co utilization and highlight widespread use of these metals in the three

  20. The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses

    Science.gov (United States)

    Zhang, Yanjun; Du, Liuwen; Liu, Ao; Chen, Jianjun; Wu, Li; Hu, Weiming; Zhang, Wei; Kim, Kyunghee; Lee, Sang-Choon; Yang, Tae-Jin; Wang, Ying

    2016-01-01

    Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly, which was also the first comprehensive cp genome analysis on Epimedium combining the cp genome sequence of E. koreanum previously reported. The five Epimedium cp genomes exhibited typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. The rapidly evolving cp genome regions were detected among the five cp genomes, as well as the difference of simple sequence repeats (SSR) and repeat sequence were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes showed accordance with the updated system of the genus on the whole, but reminded that the evolutionary relationships and the divisions of the genus need further investigation applying more evidences. The availability of these cp genomes provided valuable genetic information for accurately identifying species, taxonomy and phylogenetic resolution and evolution of Epimedium, and assist in exploration and utilization of Epimedium plants. PMID:27014326

  1. Genome-wide expression analyses of the stationary phase model of ageing in yeast.

    Science.gov (United States)

    Wanichthanarak, Kwanjeera; Wongtosrad, Nutvadee; Petranovic, Dina

    2015-07-01

    Ageing processes involved in replicative lifespan (RLS) and chronological lifespan (CLS) have been found to be conserved among many organisms, including in unicellular Eukarya such as yeast Saccharomyces cerevisiae. Here we performed an integrated approach of genome wide expression profiles of yeast at different time points, during growth and starvation. The aim of the study was to identify transcriptional changes in those conditions by using several different computational analyses in order to propose transcription factors, biological networks and metabolic pathways that seem to be relevant during the process of chronological ageing in yeast. Specifically, we performed differential gene expression analysis, gene-set enrichment analysis and network-based analysis, and we identified pathways affected in the stationary phase and specific transcription factors driving transcriptional adaptations. The results indicate signal propagation from G protein-coupled receptors through signaling pathway components and other stress and nutrient-induced transcription factors resulting in adaptation of yeast cells to the lack of nutrients by activating metabolism associated with aerobic metabolism of carbon sources such as ethanol, glycerol and fatty acids. In addition, we found STE12, XBP1 and TOS8 as highly connected nodes in the subnetworks of ageing yeast. PMID:26079307

  2. The GeneCards Suite: From Gene Data Mining to Disease Genome Sequence Analyses.

    Science.gov (United States)

    Stelzer, Gil; Rosen, Naomi; Plaschkes, Inbar; Zimmerman, Shahar; Twik, Michal; Fishilevich, Simon; Stein, Tsippi Iny; Nudel, Ron; Lieder, Iris; Mazor, Yaron; Kaplan, Sergey; Dahary, Dvir; Warshawsky, David; Guan-Golan, Yaron; Kohn, Asher; Rappaport, Noa; Safran, Marilyn; Lancet, Doron

    2016-01-01

    GeneCards, the human gene compendium, enables researchers to effectively navigate and inter-relate the wide universe of human genes, diseases, variants, proteins, cells, and biological pathways. Our recently launched Version 4 has a revamped infrastructure facilitating faster data updates, better-targeted data queries, and friendlier user experience. It also provides a stronger foundation for the GeneCards suite of companion databases and analysis tools. Improved data unification includes gene-disease links via MalaCards and merged biological pathways via PathCards, as well as drug information and proteome expression. VarElect, another suite member, is a phenotype prioritizer for next-generation sequencing, leveraging the GeneCards and MalaCards knowledgebase. It automatically infers direct and indirect scored associations between hundreds or even thousands of variant-containing genes and disease phenotype terms. VarElect's capabilities, either independently or within TGex, our comprehensive variant analysis pipeline, help prepare for the challenge of clinical projects that involve thousands of exome/genome NGS analyses. © 2016 by John Wiley & Sons, Inc. PMID:27322403

  3. Genome-wide comparison and taxonomic relatedness of multiple Xylella fastidiosa strains reveal the occurrence of three subspecies and a new Xylella species.

    Science.gov (United States)

    Marcelletti, Simone; Scortichini, Marco

    2016-10-01

    A total of 21 Xylella fastidiosa strains were assessed by comparing their genomes to infer their taxonomic relationships. The whole-genome-based average nucleotide identity and tetranucleotide frequency correlation coefficient analyses were performed. In addition, a consensus tree based on comparisons of 956 core gene families, and a genome-wide phylogenetic tree and a Neighbor-net network were constructed with 820,088 nucleotides (i.e., approximately 30-33 % of the entire X. fastidiosa genome). All approaches revealed the occurrence of three well-demarcated genetic clusters that represent X. fastidiosa subspecies fastidiosa, multiplex and pauca, with the latter appeared to diverge. We suggest that the proposed but never formally described subspecies 'sandyi' and 'morus' are instead members of the subspecies fastidiosa. These analyses support the view that the Xylella strain isolated from Pyrus pyrifolia in Taiwan is likely to be a new species. A widely used multilocus sequence typing analysis yielded conflicting results. PMID:27209415

  4. Secure distributed genome analysis for GWAS and sequence comparison computation

    Science.gov (United States)

    2015-01-01

    Background The rapid increase in the availability and volume of genomic data makes significant advances in biomedical research possible, but sharing of genomic data poses challenges due to the highly sensitive nature of such data. To address the challenges, a competition for secure distributed processing of genomic data was organized by the iDASH research center. Methods In this work we propose techniques for securing computation with real-life genomic data for minor allele frequency and chi-squared statistics computation, as well as distance computation between two genomic sequences, as specified by the iDASH competition tasks. We put forward novel optimizations, including a generalization of a version of mergesort, which might be of independent interest. Results We provide implementation results of our techniques based on secret sharing that demonstrate practicality of the suggested protocols and also report on performance improvements due to our optimization techniques. Conclusions This work describes our techniques, findings, and experimental results developed and obtained as part of iDASH 2015 research competition to secure real-life genomic computations and shows feasibility of securely computing with genomic data in practice. PMID:26733307

  5. Aspekte der bioinformatischen Analyse und Annotation des Genoms von Rhodopirellula baltica

    OpenAIRE

    Teeling, Hanno

    2004-01-01

    This thesis focuses on the bioinformatic analysis and annotation of the genome of the marine planctomycete Rhodopirellula baltica. A comprehensive bioinformatic pipeline was set up and established that comprises gene prediction, annotation and visualization tools. Considerable effort was put into the manual annotation process.The annotation of the genome of Rhodopirellula baltica revealed that this organism is specialized on the aerobic degradation of complex carbohydrates. Its genome harbors...

  6. UniPrimer: A Web-Based Primer Design Tool for Comparative Analyses of Primate Genomes

    OpenAIRE

    Nomin Batnyam; Jimin Lee; Jungnam Lee; Seung Bok Hong; Sejong Oh; Kyudong Han

    2012-01-01

    Whole genome sequences of various primates have been released due to advanced DNA-sequencing technology. A combination of computational data mining and the polymerase chain reaction (PCR) assay to validate the data is an excellent method for conducting comparative genomics. Thus, designing primers for PCR is an essential procedure for a comparative analysis of primate genomes. Here, we developed and introduced UniPrimer for use in those studies. UniPrimer is a web-based tool that designs PCR-...

  7. Whole-genome analyses of the speciation events in the pathogenic Brucellae

    Energy Technology Data Exchange (ETDEWEB)

    Chain, P; Comerci, D; Tolmasky, M; Larimer, F; Malfatti, S; Vergez, L; Aguero, F; Land, M; Ugalde, R; Garcia, E

    2005-07-14

    Despite their high DNA identity and a proposal to group classical Brucella species as biovars of B. melitensis, the commonly recognized Brucella species can be distinguished by distinct biochemical and fatty acid characters as well as by a marked host range (e.g. B. suis for swine, B. melitensis for sheep and goats, B. abortus for cattle). Here we present the genome of B. abortus 2308, the virulent prototype biovar 1 strain, and its comparison to the two other human pathogenic Brucellae species and to the B. abortus field isolate 9-941. The global distribution of pseudogenes, deletions and insertions support previous indications that B. abortus and B. melitensis share a common ancestor that diverged from B. suis. With the exception of a dozen genes, the genetic complement of both B. abortus strains is identical, whereas the three species differ in gene content and pseudogenes. The pattern of species-specific gene inactivations affecting transcriptional regulators and outer membrane proteins suggest that these inactivations may play an important role in the establishment of host-specificity and may have been a primary driver of speciation in the Brucellae. Despite being non-motile, the Brucellae contain flagellum gene clusters and display species-specific flagellar gene inactivations, which lead to the putative generation of different versions of flagellum-derived structures, and may contribute to differences in host-specificity and virulence. Metabolic changes such as the lack of complete metabolic pathways for the synthesis of numerous compounds (e.g. glycogen, biotin, NAD, and choline) are consistent with adaptation of Brucellae to an intracellular lifestyle.

  8. Alignment-free comparison of genome sequences by a new numerical characterization.

    Science.gov (United States)

    Huang, Guohua; Zhou, Houqing; Li, Yongfan; Xu, Lixin

    2011-07-21

    In order to compare different genome sequences, an alignment-free method has proposed. First, we presented a new graphical representation of DNA sequences without degeneracy, which is conducive to intuitive comparison of sequences. Then, a new numerical characterization based on the representation was introduced to quantitatively depict the intrinsic nature of genome sequences, and considered as a 10-dimensional vector in the mathematical space. Alignment-free comparison of sequences was performed by computing the distances between vectors of the corresponding numerical characterizations, which define the evolutionary relationship. Two data sets of DNA sequences were constructed to assess the performance on sequence comparison. The results illustrate well validity of the method. The new numerical characterization provides a powerful tool for genome comparison. PMID:21536050

  9. Explorations in genome-wide association studies and network analyses with dairy cattle fertility traits.

    Science.gov (United States)

    Parker Gaddis, K L; Null, D J; Cole, J B

    2016-08-01

    The objective of this study was to identify single nucleotide polymorphisms and gene networks associated with 3 fertility traits in dairy cattle-daughter pregnancy rate, heifer conception rate, and cow conception rate-using different approaches. Deregressed predicted transmitting abilities were available for approximately 24,000 Holstein bulls and 36,000 Holstein cows sampled from the National Dairy Database with high-density genotypes. Of those, 1,732 bulls and 375 cows had been genotyped with the Illumina BovineHD Genotyping BeadChip (Illumina Inc., San Diego, CA). The remaining animals were genotyped with various chips of lower density that were imputed to high density. Univariate and trivariate genome-wide association studies (GWAS) with both medium- (60,671 markers) and high-density (312,614 markers) panels were performed for daughter pregnancy rate, heifer conception rate, and cow conception rate using GEMMA (version 0.94; http://www.xzlab.org/software.html). Analyses were conducted using bulls only, cows only, and a sample of both bulls and cows. The partial correlation and information theory algorithm was used to develop gene interaction networks. The most significant markers were further investigated to identify putatively associated genes. Little overlap in associated genes could be found between GWAS using different reference populations of bulls only, cows only, and combined bulls and cows. The partial correlation and information theory algorithm was able to identify several genes that were not identified by ordinary GWAS. The results obtained herein will aid in further dissecting the complex biology underlying fertility traits in dairy cattle, while also providing insight into the nuances of GWAS. PMID:27209127

  10. The plant ontology as a tool for comparative plant anatomy and genomic analyses

    Science.gov (United States)

    Plant science is now a major player in the fields of genomics, gene expression analysis, phenomics and metabolomics. Recent advances in sequencing technologies have led to a windfall of data, with new species being added rapidly to the list of species whose genomes have been decoded. The Plant Ontol...

  11. Comparative genomic and proteomic analyses of Clostridium acetobutylicum Rh8 and its parent strain DSM 1731 revealed new understandings on butanol tolerance

    Energy Technology Data Exchange (ETDEWEB)

    Bao, Guanhui [CAS Key Laboratory of Microbial Physiological and Metabolic Engineering, Institute of Microbiology, Chinese Academy of Sciences, Beijing (China); University of Chinese Academy of Sciences, Beijing (China); Dong, Hongjun; Zhu, Yan; Mao, Shaoming [CAS Key Laboratory of Microbial Physiological and Metabolic Engineering, Institute of Microbiology, Chinese Academy of Sciences, Beijing (China); Zhang, Tianrui [CAS Key Laboratory of Microbial Physiological and Metabolic Engineering, Institute of Microbiology, Chinese Academy of Sciences, Beijing (China); Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin (China); Zhang, Yanping [CAS Key Laboratory of Microbial Physiological and Metabolic Engineering, Institute of Microbiology, Chinese Academy of Sciences, Beijing (China); Chen, Zugen [Department of Human Genetics, School of Medicine, University of California, Los Angeles, CA 90095 (United States); Li, Yin, E-mail: yli@im.ac.cn [CAS Key Laboratory of Microbial Physiological and Metabolic Engineering, Institute of Microbiology, Chinese Academy of Sciences, Beijing (China)

    2014-08-08

    Highlights: • Genomes of a butanol tolerant strain and its parent strain were deciphered. • Comparative genomic and proteomic was applied to understand butanol tolerance. • None differentially expressed proteins have mutations in its corresponding genes. • Mutations in ribosome might be responsible for the global difference of proteomics. - Abstract: Clostridium acetobutylicum strain Rh8 is a butanol-tolerant mutant which can tolerate up to 19 g/L butanol, 46% higher than that of its parent strain DSM 1731. We previously performed comparative cytoplasm- and membrane-proteomic analyses to understand the mechanism underlying the improved butanol tolerance of strain Rh8. In this work, we further extended this comparison to the genomic level. Compared with the genome of the parent strain DSM 1731, two insertion sites, four deletion sites, and 67 single nucleotide variations (SNVs) are distributed throughout the genome of strain Rh8. Among the 67 SNVs, 16 SNVs are located in the predicted promoters and intergenic regions; while 29 SNVs are located in the coding sequence, affecting a total of 21 proteins involved in transport, cell structure, DNA replication, and protein translation. The remaining 22 SNVs are located in the ribosomal genes, affecting a total of 12 rRNA genes in different operons. Analysis of previous comparative proteomic data indicated that none of the differentially expressed proteins have mutations in its corresponding genes. Rchange Algorithms analysis indicated that the mutations occurred in the ribosomal genes might change the ribosome RNA thermodynamic characteristics, thus affect the translation strength of these proteins. Take together, the improved butanol tolerance of C. acetobutylicum strain Rh8 might be acquired through regulating the translational process to achieve different expression strength of genes involved in butanol tolerance.

  12. Comparative genomic and proteomic analyses of Clostridium acetobutylicum Rh8 and its parent strain DSM 1731 revealed new understandings on butanol tolerance

    International Nuclear Information System (INIS)

    Highlights: • Genomes of a butanol tolerant strain and its parent strain were deciphered. • Comparative genomic and proteomic was applied to understand butanol tolerance. • None differentially expressed proteins have mutations in its corresponding genes. • Mutations in ribosome might be responsible for the global difference of proteomics. - Abstract: Clostridium acetobutylicum strain Rh8 is a butanol-tolerant mutant which can tolerate up to 19 g/L butanol, 46% higher than that of its parent strain DSM 1731. We previously performed comparative cytoplasm- and membrane-proteomic analyses to understand the mechanism underlying the improved butanol tolerance of strain Rh8. In this work, we further extended this comparison to the genomic level. Compared with the genome of the parent strain DSM 1731, two insertion sites, four deletion sites, and 67 single nucleotide variations (SNVs) are distributed throughout the genome of strain Rh8. Among the 67 SNVs, 16 SNVs are located in the predicted promoters and intergenic regions; while 29 SNVs are located in the coding sequence, affecting a total of 21 proteins involved in transport, cell structure, DNA replication, and protein translation. The remaining 22 SNVs are located in the ribosomal genes, affecting a total of 12 rRNA genes in different operons. Analysis of previous comparative proteomic data indicated that none of the differentially expressed proteins have mutations in its corresponding genes. Rchange Algorithms analysis indicated that the mutations occurred in the ribosomal genes might change the ribosome RNA thermodynamic characteristics, thus affect the translation strength of these proteins. Take together, the improved butanol tolerance of C. acetobutylicum strain Rh8 might be acquired through regulating the translational process to achieve different expression strength of genes involved in butanol tolerance

  13. Genome-wide association analyses based on whole-genome sequencing in Sardinia provide insights into regulation of hemoglobin levels.

    Science.gov (United States)

    Danjou, Fabrice; Zoledziewska, Magdalena; Sidore, Carlo; Steri, Maristella; Busonero, Fabio; Maschio, Andrea; Mulas, Antonella; Perseu, Lucia; Barella, Susanna; Porcu, Eleonora; Pistis, Giorgio; Pitzalis, Maristella; Pala, Mauro; Menzel, Stephan; Metrustry, Sarah; Spector, Timothy D; Leoni, Lidia; Angius, Andrea; Uda, Manuela; Moi, Paolo; Thein, Swee Lay; Galanello, Renzo; Abecasis, Gonçalo R; Schlessinger, David; Sanna, Serena; Cucca, Francesco

    2015-11-01

    We report genome-wide association study results for the levels of A1, A2 and fetal hemoglobins, analyzed for the first time concurrently. Integrating high-density array genotyping and whole-genome sequencing in a large general population cohort from Sardinia, we detected 23 associations at 10 loci. Five signals are due to variants at previously undetected loci: MPHOSPH9, PLTP-PCIF1, ZFPM1 (FOG1), NFIX and CCND3. Among the signals at known loci, ten are new lead variants and four are new independent signals. Half of all variants also showed pleiotropic associations with different hemoglobins, which further corroborated some of the detected associations and identified features of coordinated hemoglobin species production. PMID:26366553

  14. Integrative genomic analyses reveal an androgen-driven somatic alteration landscape in early-onset prostate cancer.

    Science.gov (United States)

    Weischenfeldt, Joachim; Simon, Ronald; Feuerbach, Lars; Schlangen, Karin; Weichenhan, Dieter; Minner, Sarah; Wuttig, Daniela; Warnatz, Hans-Jörg; Stehr, Henning; Rausch, Tobias; Jäger, Natalie; Gu, Lei; Bogatyrova, Olga; Stütz, Adrian M; Claus, Rainer; Eils, Jürgen; Eils, Roland; Gerhäuser, Clarissa; Huang, Po-Hsien; Hutter, Barbara; Kabbe, Rolf; Lawerenz, Christian; Radomski, Sylwester; Bartholomae, Cynthia C; Fälth, Maria; Gade, Stephan; Schmidt, Manfred; Amschler, Nina; Haß, Thomas; Galal, Rami; Gjoni, Jovisa; Kuner, Ruprecht; Baer, Constance; Masser, Sawinee; von Kalle, Christof; Zichner, Thomas; Benes, Vladimir; Raeder, Benjamin; Mader, Malte; Amstislavskiy, Vyacheslav; Avci, Meryem; Lehrach, Hans; Parkhomchuk, Dmitri; Sultan, Marc; Burkhardt, Lia; Graefen, Markus; Huland, Hartwig; Kluth, Martina; Krohn, Antje; Sirma, Hüseyin; Stumm, Laura; Steurer, Stefan; Grupp, Katharina; Sültmann, Holger; Sauter, Guido; Plass, Christoph; Brors, Benedikt; Yaspo, Marie-Laure; Korbel, Jan O; Schlomm, Thorsten

    2013-02-11

    Early-onset prostate cancer (EO-PCA) represents the earliest clinical manifestation of prostate cancer. To compare the genomic alteration landscapes of EO-PCA with "classical" (elderly-onset) PCA, we performed deep sequencing-based genomics analyses in 11 tumors diagnosed at young age, and pursued comparative assessments with seven elderly-onset PCA genomes. Remarkable age-related differences in structural rearrangement (SR) formation became evident, suggesting distinct disease pathomechanisms. Whereas EO-PCAs harbored a prevalence of balanced SRs, with a specific abundance of androgen-regulated ETS gene fusions including TMPRSS2:ERG, elderly-onset PCAs displayed primarily non-androgen-associated SRs. Data from a validation cohort of > 10,000 patients showed age-dependent androgen receptor levels and a prevalence of SRs affecting androgen-regulated genes, further substantiating the activity of a characteristic "androgen-type" pathomechanism in EO-PCA. PMID:23410972

  15. Culture Independent Genomic Comparisons Reveal Environmental Adaptations for Altiarchaeales

    Science.gov (United States)

    Baker, Brett J.; Probst, Alexander J.; Podar, Mircea; Lloyd, Karen G.

    2016-01-01

    The recently proposed candidatus order Altiarchaeales remains an uncultured archaeal lineage composed of genetically diverse, globally widespread organisms frequently observed in anoxic subsurface environments. In spite of 15 years of studies on the psychrophilic biofilm-producing Candidatus Altiarchaeum hamiconexum and its close relatives, very little is known about the phylogenetic and functional diversity of the widespread free-living marine members of this taxon. From methanogenic sediments in the White Oak River Estuary, NC, USA, we sequenced a single cell amplified genome (SAG), WOR_SM1_SCG, and used it to identify and refine two high-quality genomes from metagenomes, WOR_SM1_79 and WOR_SM1_86-2, from the same site. These three genomic reconstructions form a monophyletic group, which also includes three previously published genomes from metagenomes from terrestrial springs and a SAG from Sakinaw Lake in a group previously designated as pMC2A384. A synapomorphic mutation in the Altiarchaeales tRNA synthetase β subunit, pheT, caused the protein to be encoded as two subunits at non-adjacent loci. Consistent with the terrestrial spring clades, our estuarine genomes contained a near-complete autotrophic metabolism, H2 or CO as potential electron donors, a reductive acetyl-CoA pathway for carbon fixation, and methylotroph-like NADP(H)-dependent dehydrogenase. Phylogenies based on 16S rRNA genes and concatenated conserved proteins identified two distinct sub-clades of Altiarchaeales, Alti-1 populated by organisms from actively flowing springs, and Alti-2 which was more widespread, diverse, and not associated with visible mats. The core Alti-1 genome suggested Alti-1 is adapted for the stream environment with lipopolysaccharide production capacity and extracellular hami structures. The core Alti-2 genome suggested members of this clade are free-living with distinct mechanisms for energy maintenance, motility, osmoregulation, and sulfur redox reactions. These data

  16. SBEAMS-Microarray: database software supporting genomic expression analyses for systems biology

    OpenAIRE

    Campbell David; Moss Patrick; Deutsch Eric W; Marzolf Bruz; Johnson Michael H; Galitski Timothy

    2006-01-01

    Abstract Background The biological information in genomic expression data can be understood, and computationally extracted, in the context of systems of interacting molecules. The automation of this information extraction requires high throughput management and analysis of genomic expression data, and integration of these data with other data types. Results SBEAMS-Microarray, a module of the open-source Systems Biology Experiment Analysis Management System (SBEAMS), enables MIAME-compliant st...

  17. Comparative Genetic Analyses of Human Rhinovirus C (HRV-C) Complete Genome from Malaysia.

    Science.gov (United States)

    Khaw, Yam Sim; Chan, Yoke Fun; Jafar, Faizatul Lela; Othman, Norlijah; Chee, Hui Yee

    2016-01-01

    Human rhinovirus-C (HRV-C) has been implicated in more severe illnesses than HRV-A and HRV-B, however, the limited number of HRV-C complete genomes (complete 5' and 3' non-coding region and open reading frame sequences) has hindered the in-depth genetic study of this virus. This study aimed to sequence seven complete HRV-C genomes from Malaysia and compare their genetic characteristics with the 18 published HRV-Cs. Seven Malaysian HRV-C complete genomes were obtained with newly redesigned primers. The seven genomes were classified as HRV-C6, C12, C22, C23, C26, C42, and pat16 based on the VP4/VP2 and VP1 pairwise distance threshold classification. Five of the seven Malaysian isolates, namely, 3430-MY-10/C22, 8713-MY-10/C23, 8097-MY-11/C26, 1570-MY-10/C42, and 7383-MY-10/pat16 are the first newly sequenced complete HRV-C genomes. All seven Malaysian isolates genomes displayed nucleotide similarity of 63-81% among themselves and 63-96% with other HRV-Cs. Malaysian HRV-Cs had similar putative immunogenic sites, putative receptor utilization and potential antiviral sites as other HRV-Cs. The genomic features of Malaysian isolates were similar to those of other HRV-Cs. Negative selections were frequently detected in HRV-Cs complete coding sequences indicating that these sequences were under functional constraint. The present study showed that HRV-Cs from Malaysia have diverse genetic sequences but share conserved genomic features with other HRV-Cs. This genetic information could provide further aid in the understanding of HRV-C infection. PMID:27199901

  18. Comparative Genetic Analyses of Human Rhinovirus C (HRV-C) Complete Genome from Malaysia

    Science.gov (United States)

    Khaw, Yam Sim; Chan, Yoke Fun; Jafar, Faizatul Lela; Othman, Norlijah; Chee, Hui Yee

    2016-01-01

    Human rhinovirus-C (HRV-C) has been implicated in more severe illnesses than HRV-A and HRV-B, however, the limited number of HRV-C complete genomes (complete 5′ and 3′ non-coding region and open reading frame sequences) has hindered the in-depth genetic study of this virus. This study aimed to sequence seven complete HRV-C genomes from Malaysia and compare their genetic characteristics with the 18 published HRV-Cs. Seven Malaysian HRV-C complete genomes were obtained with newly redesigned primers. The seven genomes were classified as HRV-C6, C12, C22, C23, C26, C42, and pat16 based on the VP4/VP2 and VP1 pairwise distance threshold classification. Five of the seven Malaysian isolates, namely, 3430-MY-10/C22, 8713-MY-10/C23, 8097-MY-11/C26, 1570-MY-10/C42, and 7383-MY-10/pat16 are the first newly sequenced complete HRV-C genomes. All seven Malaysian isolates genomes displayed nucleotide similarity of 63–81% among themselves and 63–96% with other HRV-Cs. Malaysian HRV-Cs had similar putative immunogenic sites, putative receptor utilization and potential antiviral sites as other HRV-Cs. The genomic features of Malaysian isolates were similar to those of other HRV-Cs. Negative selections were frequently detected in HRV-Cs complete coding sequences indicating that these sequences were under functional constraint. The present study showed that HRV-Cs from Malaysia have diverse genetic sequences but share conserved genomic features with other HRV-Cs. This genetic information could provide further aid in the understanding of HRV-C infection. PMID:27199901

  19. Genomic and genetic analyses of diversity and plant interactions of Pseudomonas fluorescens.

    OpenAIRE

    Silby, MW; Cerdeño-Tárraga, AM; Vernikos, GS; Giddens, SR; Jackson, RW; Preston, GM; Zhang, XX; Moon, CD; Gehrig, SM; Godfrey, SA; Knight, CG; Malone, JG; Robinson, Z; Spiers, AJ; Harris, S.

    2009-01-01

    BACKGROUND Pseudomonas fluorescens are common soil bacteria that can improve plant health through nutrient cycling, pathogen antagonism and induction of plant defenses. The genome sequences of strains SBW25 and Pf0-1 were determined and compared to each other and with P. fluorescens Pf-5. A functional genomic in vivo expression technology (IVET) screen provided insight into genes used by P. fluorescens in its natural environment and an improved understanding of the ecological significance of ...

  20. Comparative Genome Analyses Reveal Distinct Structure in the Saltwater Crocodile MHC

    OpenAIRE

    Jaratlerdsiri, Weerachai; Deakin, Janine; Ricardo M Godinez; Shan, Xueyan; Peterson, Daniel G.; Marthey, Sylvain; Lyons, Eric; McCarthy, Fiona M.; Isberg, Sally R.; Higgins, Damien P.; Chong, Amanda Y; St John, John; Glenn, Travis C.; Ray, David A.; Gongora, Jaime

    2014-01-01

    The major histocompatibility complex (MHC) is a dynamic genome region with an essential role in the adaptive immunity of vertebrates, especially antigen presentation. The MHC is generally divided into subregions (classes I, II and III) containing genes of similar function across species, but with different gene number and organisation. Crocodylia (crocodilians) are widely distributed and represent an evolutionary distinct group among higher vertebrates, but the genomic organisation of MHC wit...

  1. ANItools web: a web tool for fast genome comparison within multiple bacterial strains

    OpenAIRE

    Han, Na; Qiang, Yujun; Zhang, Wen

    2016-01-01

    Background: Early classification of prokaryotes was based solely on phenotypic similarities, but modern prokaryote characterization has been strongly influenced by advances in genetic methods. With the fast development of the sequencing technology, the ever increasing number of genomic sequences per species offers the possibility for developing distance determinations based on whole-genome information. The average nucleotide identity (ANI), calculated from pair-wise comparisons of all sequenc...

  2. Genome reorganization in Nicotiana asymmetric somatic hybrids analysed by in situ hybridization

    International Nuclear Information System (INIS)

    In situ hybridization was used to examine genome reorganization in asymmetric somatic hybrids between Nicotiana plumbaginifolia and Nicotiana sylvestris obtained by fusion of gamma-irradiated protoplasts from one of the parents (donor) with non-irradiated protoplasts from the other (recipient). Probing with biotinylated total genomic DNA from either the donor or the recipient species unequivocally identified genetic material from both parents in 31 regenerant plants, each originating from a different nuclear hybrid colony. This method, termed genomic in situ hybridization (GISH), allowed intergenomic translocations containing chromosome segments from both species to be recognized in four regenerants. A probe homologous to the consensus sequence of the Arabidopsis thaliana telomeric repeat (5'-TTTAGGG-3')n, identified telomeres on all chromosomes, including 'mini-chromosomes' originating from the irradiated donor genome. Genomic in situ hybridization to plant chromosomes provides a rapid and reliable means of screening for recombinant genotypes in asymmetric somatic hybrids. Used in combination with other DNA probes, it also contributes to a greater understanding of the events responsible for genomic recovery and restabilization following genetic manipulation in vitro

  3. Bluejay 1.0: genome browsing and comparison with rich customization provision and dynamic resource linking

    Directory of Open Access Journals (Sweden)

    Turinsky Andrei L

    2008-10-01

    Full Text Available Abstract Background The Bluejay genome browser has been developed over several years to address the challenges posed by the ever increasing number of data types as well as the increasing volume of data in genome research. Beginning with a browser capable of rendering views of XML-based genomic information and providing scalable vector graphics output, we have now completed version 1.0 of the system with many additional features. Our development efforts were guided by our observation that biologists who use both gene expression profiling and comparative genomics gain functional insights above and beyond those provided by traditional per-gene analyses. Results Bluejay 1.0 is a genome viewer integrating genome annotation with: (i gene expression information; and (ii comparative analysis with an unlimited number of other genomes in the same view. This allows the biologist to see a gene not just in the context of its genome, but also its regulation and its evolution. Bluejay now has rich provision for personalization by users: (i numerous display customization features; (ii the availability of waypoints for marking multiple points of interest on a genome and subsequently utilizing them; and (iii the ability to take user relevance feedback of annotated genes or textual items to offer personalized recommendations. Bluejay 1.0 also embeds the Seahawk browser for the Moby protocol, enabling users to seamlessly invoke hundreds of Web Services on genomic data of interest without any hard-coding. Conclusion Bluejay offers a unique set of customizable genome-browsing features, with the goal of allowing biologists to quickly focus on, analyze, compare, and retrieve related information on the parts of the genomic data they are most interested in. We expect these capabilities of Bluejay to benefit the many biologists who want to answer complex questions using the information available from completely sequenced genomes.

  4. Genome-wide analyses implicate 33 loci in heritable dog osteosarcoma, including regulatory variants near CDKN2A/B

    OpenAIRE

    Karlsson, Elinor K.; Sigurdsson, Snaevar; Ivansson, Emma; Thomas, Rachael; Elvers, Ingegerd; Wright, Jason; Howald, Cedric; Tonomura, Noriko; Perloski, Michele; Swofford, Ross; Biagi, Tara; Fryc, Sarah; Anderson, Nathan; Courtay-Cahen, Celine; Youell, Lisa

    2013-01-01

    Background: Canine osteosarcoma is clinically nearly identical to the human disease, but is common and highly heritable, making genetic dissection feasible. Results: Through genome-wide association analyses in three breeds (greyhounds, Rottweilers, and Irish wolfhounds), we identify 33 inherited risk loci explaining 55% to 85% of phenotype variance in each breed. The greyhound locus exhibiting the strongest association, located 150 kilobases upstream of the genes CDKN2A/B, is also the most re...

  5. Genome-wide analyses implicate 33 loci in heritable dog osteosarcoma, including regulatory variants near CDKN2A/B

    OpenAIRE

    Karlsson, Elinor K.; Sigurdsson, Snaevar; Ivansson, Emma; Thomas, Rachael; Elvers, Ingegerd; Wright, Jason; Howald, Cedric; Tonomura, Noriko; Perloski, Michele; Swofford, Ross; Biagi, Tara; Fryc, Sarah; Anderson, Nathan; Courtay-Cahen, Celine; Youell, Lisa

    2013-01-01

    BACKGROUND: Canine osteosarcoma is clinically nearly identical to the human disease, but is common and highly heritable, making genetic dissection feasible. RESULTS: Through genome-wide association analyses in three breeds (greyhounds, Rottweilers, and Irish wolfhounds), we identify 33 inherited risk loci explaining 55% to 85% of phenotype variance in each breed. The greyhound locus exhibiting the strongest association, located 150 kilobases upstream of the genes CDKN2A/B, is also the most re...

  6. Genome-wide analyses implicate 33 loci in heritable dog osteosarcoma, including regulatory variants near CDKN2A/B

    OpenAIRE

    Karlsson, Elinor K.; Sigurdsson, Snaevar; Ivansson, Emma; Thomas, Rachael; Elvers, Ingegerd; Wright, Jason; Howald, Cedric; Tonomura, Noriko; Perloski, Michele; Swofford, Ross; Biagi, Tara; Fryc, Sarah; Anderson, Nathan; Courtay-Cahen, Celine; Youell, Lisa

    2013-01-01

    Background: Canine osteosarcoma is clinically nearly identical to the human disease, but is common and highly heritable, making genetic dissection feasible. Results: Through genome-wide association analyses in three breeds (greyhounds, Rottweilers, and Irish wolfhounds), we identify 33 inherited risk loci explaining 55% to 85% of phenotype variance in each breed. The greyhound locus exhibiting the strongest association, located 150 kilobases upstream of the genes CDKN2A/B, is also th...

  7. Genome-wide comparison of medieval and modern Mycobacterium leprae

    DEFF Research Database (Denmark)

    Schuenemann, Verena J; Singh, Pushpendra; Mendum, Thomas A; Krause-Kyora, Ben; Jäger, Günter; Bos, Kirsten I; Herbig, Alexander; Economou, Christos; Benjak, Andrej; Busso, Philippe; Nebel, Almut; Boldsen, Jesper L; Kjellström, Anna; Wu, Huihai; Stewart, Graham R; Taylor, G Michael; Bauer, Peter; Lee, Oona Y-C; Wu, Houdini H T; Minnikin, David E; Besra, Gurdyal S; Tucker, Katie; Roffey, Simon; Sow, Samba O; Cole, Stewart T; Nieselt, Kay; Krause, Johannes

    2013-01-01

    Leprosy was endemic in Europe until the Middle Ages. Using DNA array capture, we have obtained genome sequences of Mycobacterium leprae from skeletons of five medieval leprosy cases from the United Kingdom, Sweden, and Denmark. In one case, the DNA was so well preserved that full de novo assembly...

  8. Genome and Phenotype Microarray Analyses of Rhodococcus sp. BCP1 and Rhodococcus opacus R7: Genetic Determinants and Metabolic Abilities with Environmental Relevance.

    Directory of Open Access Journals (Sweden)

    Alessandro Orro

    Full Text Available In this paper comparative genome and phenotype microarray analyses of Rhodococcus sp. BCP1 and Rhodococcus opacus R7 were performed. Rhodococcus sp. BCP1 was selected for its ability to grow on short-chain n-alkanes and R. opacus R7 was isolated for its ability to grow on naphthalene and on o-xylene. Results of genome comparison, including BCP1, R7, along with other Rhodococcus reference strains, showed that at least 30% of the genome of each strain presented unique sequences and only 50% of the predicted proteome was shared. To associate genomic features with metabolic capabilities of BCP1 and R7 strains, hundreds of different growth conditions were tested through Phenotype Microarray, by using Biolog plates and plates manually prepared with additional xenobiotic compounds. Around one-third of the surveyed carbon sources was utilized by both strains although R7 generally showed higher metabolic activity values compared to BCP1. Moreover, R7 showed broader range of nitrogen and sulphur sources. Phenotype Microarray data were combined with genomic analysis to genetically support the metabolic features of the two strains. The genome analysis allowed to identify some gene clusters involved in the metabolism of the main tested xenobiotic compounds. Results show that R7 contains multiple genes for the degradation of a large set of aromatic and PAHs compounds, while a lower variability in terms of genes predicted to be involved in aromatic degradation was found in BCP1. This genetic feature can be related to the strong genetic pressure exerted by the two different environment from which the two strains were isolated. According to this, in the BCP1 genome the smo gene cluster involved in the short-chain n-alkanes degradation, is included in one of the unique regions and it is not conserved in the Rhodococcus strains compared in this work. Data obtained underline the great potential of these two Rhodococcus spp. strains for biodegradation and

  9. The complete mitochondrial genome of Gastrothylax crumenifer (Gastrothylacidae, Trematoda) and comparative analyses with selected trematodes.

    Science.gov (United States)

    Yang, Xin; Wang, Lixia; Chen, Hongmei; Feng, Hanli; Shen, Bang; Hu, Min; Fang, Rui

    2016-06-01

    In the present study, we sequenced and analyzed the mitochondrial (mt) genome of Gastrothylax crumenifer and compared it with other selected trematodes. The full mt genome of G. crumenifer was amplified, sequenced, assembled, analyzed and then subjected to phylogenetic analysis. The complete mt genome of G. crumenifer is 14,801 bp in length and contains two rRNA genes, two non-coding regions (LNR and SNR), 12 protein-coding genes, and 22 transfer RNA genes. The gene organization of the G. crumenifer mt genome is the same as that of other trematodes, except for Schistosoma haematobium and Schistosoma spindale. All the genes are transcribed in the same direction and rich in "A + T", which is in accordance with other trematodes, such as Fasciola hepatica, Paramphistomum cervi, and Fischoederius elongatus. Phylogenetic analysis using concatenated amino acid sequences of the 12 protein-coding genes showed that G. crumenifer is closely related to F. elongatus. The availability of mt genome sequence of G. crumenifer can provide useful DNA markers for studying the molecular epidemiology and population genetics of this parasite and other paramphistomes. PMID:27021180

  10. A genomic comparison of two termites with different social complexity

    DEFF Research Database (Denmark)

    Korb, Judith; Thomas-Poulsen, Michael; Hu, Haofu;

    2015-01-01

    The termites evolved eusociality and complex societies before the ants, but have been studied much less. The recent publication of the first two termite genomes provides a unique comparative opportunity, particularly because the sequenced termites represent opposite ends of the social complexity...... spectrum. Zootermopsis nevadensis has simple colonies with totipotent workers that can develop into all castes (dispersing reproductives, nest-inheriting replacement reproductives, and soldiers). In contrast, the fungus-growing termite Macrotermes natalensis belongs to the higher termites and has very...... large and complex societies with morphologically distinct castes that are life-time sterile. Here we compare key characteristics of genomic architecture, focusing on genes involved in communication, immune defenses, mating biology and symbiosis that were likely important in termite social evolution. We...

  11. A genomic comparison of two termites with different social complexity

    Directory of Open Access Journals (Sweden)

    Judith eKorb

    2015-03-01

    Full Text Available The termites evolved eusociality and complex societies before the ants, but have been studied much less. The recent publication of the first two termite genomes provides a unique comparative opportunity, particularly because the sequenced termites represent opposite ends of the social complexity spectrum. Zootermopsis nevadensis has simple colonies with totipotent workers that can develop into all castes (dispersing reproductives, nest-inheriting replacement reproductives, and soldiers. In contrast, the fungus-growing termite Macrotermes natalensis belongs to the higher termites and has very large and complex societies with morphologically distinct castes that are life-time sterile. Here we compare key characteristics of genomic architecture, focusing on genes involved in communication, immune defenses, mating biology and symbiosis that were likely important in termite social evolution. We discuss these in relation to what is known about these genes in the ants and outline hypotheses for further testing.

  12. M-GCAT: interactively and efficiently constructing large-scale multiple genome comparison frameworks in closely related species

    Directory of Open Access Journals (Sweden)

    Messeguer Xavier

    2006-10-01

    Full Text Available Abstract Background Due to recent advances in whole genome shotgun sequencing and assembly technologies, the financial cost of decoding an organism's DNA has been drastically reduced, resulting in a recent explosion of genomic sequencing projects. This increase in related genomic data will allow for in depth studies of evolution in closely related species through multiple whole genome comparisons. Results To facilitate such comparisons, we present an interactive multiple genome comparison and alignment tool, M-GCAT, that can efficiently construct multiple genome comparison frameworks in closely related species. M-GCAT is able to compare and identify highly conserved regions in up to 20 closely related bacterial species in minutes on a standard computer, and as many as 90 (containing 75 cloned genomes from a set of 15 published enterobacterial genomes in an hour. M-GCAT also incorporates a novel comparative genomics data visualization interface allowing the user to globally and locally examine and inspect the conserved regions and gene annotations. Conclusion M-GCAT is an interactive comparative genomics tool well suited for quickly generating multiple genome comparisons frameworks and alignments among closely related species. M-GCAT is freely available for download for academic and non-commercial use at: http://alggen.lsi.upc.es/recerca/align/mgcat/intro-mgcat.html.

  13. A genomic comparison of two termites with different social complexity

    OpenAIRE

    Korb, Judith; Poulsen, Michael; Hu, Haofu; Li, Cai; Boomsma, Jacobus J.; Zhang, Guojie; Liebig, Jürgen

    2015-01-01

    The termites evolved eusociality and complex societies before the ants, but have been studied much less. The recent publication of the first two termite genomes provides a unique comparative opportunity, particularly because the sequenced termites represent opposite ends of the social complexity spectrum. Zootermopsis nevadensis has simple colonies with totipotent workers that can develop into all castes (dispersing reproductives, nest-inheriting replacement reproductives, and soldiers). In c...

  14. Comparison of methods for genomic localization of gene trap sequences

    Directory of Open Access Journals (Sweden)

    Ferrin Thomas E

    2006-09-01

    Full Text Available Abstract Background Gene knockouts in a model organism such as mouse provide a valuable resource for the study of basic biology and human disease. Determining which gene has been inactivated by an untargeted gene trapping event poses a challenging annotation problem because gene trap sequence tags, which represent sequence near the vector insertion site of a trapped gene, are typically short and often contain unresolved residues. To understand better the localization of these sequences on the mouse genome, we compared stand-alone versions of the alignment programs BLAT, SSAHA, and MegaBLAST. A set of 3,369 sequence tags was aligned to build 34 of the mouse genome using default parameters for each algorithm. Known genome coordinates for the cognate set of full-length genes (1,659 sequences were used to evaluate localization results. Results In general, all three programs performed well in terms of localizing sequences to a general region of the genome, with only relatively subtle errors identified for a small proportion of the sequence tags. However, large differences in performance were noted with regard to correctly identifying exon boundaries. BLAT correctly identified the vast majority of exon boundaries, while SSAHA and MegaBLAST missed the majority of exon boundaries. SSAHA consistently reported the fewest false positives and is the fastest algorithm. MegaBLAST was comparable to BLAT in speed, but was the most susceptible to localizing sequence tags incorrectly to pseudogenes. Conclusion The differences in performance for sequence tags and full-length reference sequences were surprisingly small. Characteristic variations in localization results for each program were noted that affect the localization of sequence at exon boundaries, in particular.

  15. Comparison and quantitative verification of mapping algorithms for whole-genome bisulfite sequencing.

    Science.gov (United States)

    Kunde-Ramamoorthy, Govindarajan; Coarfa, Cristian; Laritsky, Eleonora; Kessler, Noah J; Harris, R Alan; Xu, Mingchu; Chen, Rui; Shen, Lanlan; Milosavljevic, Aleksandar; Waterland, Robert A

    2014-04-01

    Coupling bisulfite conversion with next-generation sequencing (Bisulfite-seq) enables genome-wide measurement of DNA methylation, but poses unique challenges for mapping. However, despite a proliferation of Bisulfite-seq mapping tools, no systematic comparison of their genomic coverage and quantitative accuracy has been reported. We sequenced bisulfite-converted DNA from two tissues from each of two healthy human adults and systematically compared five widely used Bisulfite-seq mapping algorithms: Bismark, BSMAP, Pash, BatMeth and BS Seeker. We evaluated their computational speed and genomic coverage and verified their percentage methylation estimates. With the exception of BatMeth, all mappers covered >70% of CpG sites genome-wide and yielded highly concordant estimates of percentage methylation (r(2) ≥ 0.95). Fourfold variation in mapping time was found between BSMAP (fastest) and Pash (slowest). In each library, 8-12% of genomic regions covered by Bismark and Pash were not covered by BSMAP. An experiment using simulated reads confirmed that Pash has an exceptional ability to uniquely map reads in genomic regions of structural variation. Independent verification by bisulfite pyrosequencing generally confirmed the percentage methylation estimates by the mappers. Of these algorithms, Bismark provides an attractive combination of processing speed, genomic coverage and quantitative accuracy, whereas Pash offers considerably higher genomic coverage. PMID:24391148

  16. Comparison between CARIBIC aerosol samples analysed by accelerator-based methods and optical particle counter measurements

    Directory of Open Access Journals (Sweden)

    B. G. Martinsson

    2014-04-01

    Full Text Available Inter-comparison of results from two kinds of aerosol systems in the CARIBIC (Civil Aircraft for the Regular Investigation of the atmosphere Based on an Instrument Container passenger aircraft based observatory, operating during intercontinental flights at 9–12 km altitude, is presented. Aerosol from the lowermost stratosphere (LMS, the extra-tropical upper troposphere (UT and the tropical mid troposphere (MT were investigated. Aerosol particle volume concentration measured with an optical particle counter (OPC is compared with analytical results of the sum of masses of all major and several minor constituents from aerosol samples collected with an impactor. Analyses were undertaken with accelerator-based methods particle-induced X-ray emission (PIXE and particle elastic scattering analysis (PESA. Data from 48 flights during one year are used, leading to a total of 106 individual comparisons. The ratios of the particle volume from the OPC and the total mass from the analyses were in 84% within a relatively narrow interval. Data points outside this interval are connected with inlet-related effects in clouds, large variability in aerosol composition, particle size distribution effects and some cases of non-ideal sampling. Overall, the comparison of these two CARIBIC measurements based on vastly different methods show good agreement, implying that the chemical and size information can be combined in studies of the MT/UT/LMS aerosol.

  17. Genome comparisons reveal a dominant mechanism of chromosome number reduction in grasses and accelerated genome evolution in Triticeae

    Science.gov (United States)

    Luo, M. C.; Deal, K. R.; Akhunov, E. D.; Akhunova, A. R.; Anderson, O. D.; Anderson, J. A.; Blake, N.; Clegg, M. T.; Coleman-Derr, D.; Conley, E. J.; Crossman, C. C.; Dubcovsky, J.; Gill, B. S.; Gu, Y. Q.; Hadam, J.; Heo, H. Y.; Huo, N.; Lazo, G.; Ma, Y.; Matthews, D. E.; McGuire, P. E.; Morrell, P. L.; Qualset, C. O.; Renfro, J.; Tabanao, D.; Talbert, L. E.; Tian, C.; Toleno, D. M.; Warburton, M. L.; You, F. M.; Zhang, W.; Dvorak, J.

    2009-01-01

    Single-nucleotide polymorphism was used in the construction of an expressed sequence tag map of Aegilops tauschii, the diploid source of the wheat D genome. Comparisons of the map with the rice and sorghum genome sequences revealed 50 inversions and translocations; 2, 8, and 40 were assigned respectively to the rice, sorghum, and Ae. tauschii lineages, showing greatly accelerated genome evolution in the large Triticeae genomes. The reduction of the basic chromosome number from 12 to 7 in the Triticeae has taken place by a process during which an entire chromosome is inserted by its telomeres into a break in the centromeric region of another chromosome. The original centromere–telomere polarity of the chromosome arms is maintained in the new chromosome. An intrachromosomal telomere–telomere fusion resulting in a pericentric translocation of a chromosome segment or an entire arm accompanied or preceded the chromosome insertion in some instances. Insertional dysploidy has been recorded in three grass subfamilies and appears to be the dominant mechanism of basic chromosome number reduction in grasses. A total of 64% and 66% of Ae. tauschii genes were syntenic with sorghum and rice genes, respectively. Synteny was reduced in the vicinity of the termini of modern Ae. tauschii chromosomes but not in the vicinity of the ancient termini embedded in the Ae. tauschii chromosomes, suggesting that the dependence of synteny erosion on gene location along the centromere–telomere axis either evolved recently in the Triticeae phylogenetic lineage or its evolution was recently accelerated. PMID:19717446

  18. Genome-wide association analyses identify variants in developmental genes associated with hypospadias

    DEFF Research Database (Denmark)

    Geller, Frank; Feenstra, Bjarke; Carstensen, Lisbeth;

    2014-01-01

    Hypospadias is a common congenital condition in boys in which the urethra opens on the underside of the penis. We performed a genome-wide association study on 1,006 surgery-confirmed hypospadias cases and 5,486 controls from Denmark. After replication genotyping of an additional 1,972 cases and 1...

  19. Integrative genome analyses identify key somatic driver mutations of small-cell lung cancer

    NARCIS (Netherlands)

    Peifer, Martin; Fernandez-Cuesta, Lynnette; Sos, Martin L.; George, Julie; Seidel, Danila; Kasper, Lawryn H.; Plenker, Dennis; Leenders, Frauke; Sun, Ruping; Zander, Thomas; Menon, Roopika; Koker, Mirjam; Dahmen, Ilona; Mueller, Christian; Di Cerbo, Vincenzo; Schildhaus, Hans-Ulrich; Altmueller, Janine; Baessmann, Ingelore; Becker, Christian; de Wilde, Bram; Vandesompele, Jo; Boehm, Diana; Ansen, Sascha; Gabler, Franziska; Wilkening, Ines; Heynck, Stefanie; Heuckmann, Johannes M.; Lu, Xin; Carter, Scott L.; Cibulskis, Kristian; Banerji, Shantanu; Getz, Gad; Park, Kwon-Sik; Rauh, Daniel; Gruetter, Christian; Fischer, Matthias; Pasqualucci, Laura; Wright, Gavin; Wainer, Zoe; Russell, Prudence; Petersen, Iver; Chen, Yuan; Stoelben, Erich; Ludwig, Corinna; Schnabel, Philipp; Hoffmann, Hans; Muley, Thomas; Brockmann, Michael; Engel-Riedel, Walburga; Muscarella, Lucia A.; Fazio, Vito M.; Groen, Harry; Timens, Wim; Sietsma, Hannie; Thunnissen, Erik; Smit, Egbert; Heideman, Danielle A. M.; Snijders, Peter J. F.; Cappuzzo, Federico; Ligorio, Claudia; Damiani, Stefania; Field, John; Solberg, Steinar; Brustugun, Odd Terje; Lund-Iversen, Marius; Saenger, Joerg; Clement, Joachim H.; Soltermann, Alex; Moch, Holger; Weder, Walter; Solomon, Benjamin; Soria, Jean-Charles; Validire, Pierre; Besse, Benjamin; Brambilla, Elisabeth; Brambilla, Christian; Lantuejoul, Sylvie; Lorimier, Philippe; Schneider, Peter M.; Hallek, Michael; Pao, William; Meyerson, Matthew; Sage, Julien; Shendure, Jay; Schneider, Robert; Buettner, Reinhard; Wolf, Juergen; Nuernberg, Peter; Perner, Sven; Heukamp, Lukas C.; Brindle, Paul K.; Haas, Stefan; Thomas, Roman K.

    2012-01-01

    Small-cell lung cancer (SCLC) is an aggressive lung tumor subtype with poor prognosis(1-3). We sequenced 29 SCLC exomes, 2 genomes and 15 transcriptomes and found an extremely high mutation rate of 7.4 +/- 1 protein-changing mutations per million base pairs. Therefore, we conducted integrated analys

  20. Psychiatric genome-wide association study analyses implicate neuronal, immune and histone pathways

    NARCIS (Netherlands)

    O'Dushlaine, Colm; Rossin, Lizzy; Lee, Phil H.; Duncan, Laramie; Parikshak, Neelroop N.; Newhouse, Stephen; Ripke, Stephan; Neale, Benjamin M.; Purcell, Shaun M.; Posthuma, Danielle; Nurnberger, John I.; Lee, S. Hong; Faraone, Stephen V.; Perlis, Roy H.; Mowry, Bryan J.; Thapar, Anita; Goddard, Michael E.; Witte, John S.; Absher, Devin; Agartz, Ingrid; Akil, Huda; Amin, Farooq; Andreassen, Ole A.; Anjorin, Adebayo; Anney, Richard; Anttila, Verneri; Arking, Dan E.; Asherson, Philip; Azevedo, Maria H.; Backlund, Lena; Badner, Judith A.; Bailey, Anthony J.; Banaschewski, Tobias; Barchas, Jack D.; Barnes, Michael R.; Barrett, Thomas B.; Bass, Nicholas; Battaglia, Agatino; Bauer, Michael; Bayes, Monica; Bellivier, Frank; Bergen, Sarah E.; Berrettini, Wade; Betancur, Catalina; Bettecken, Thomas; Biederman, Joseph; Binder, Elisabeth B.; Black, Donald W.; Blackwood, Douglas H. R.; Bloss, Cinnamon S.; Boehnke, Michael; Boomsma, Dorret I.; Breuer, Rene; Bruggeman, Richard; Cormican, Paul; Buccola, Nancy G.; Buitelaar, Jan K.; Bunney, William E.; Buxbaum, Joseph D.; Byerley, William F.; Byrne, Enda M.; Caesar, Sian; Cahn, Wiepke; Cantor, Rita M.; Casas, Miguel; Chakravarti, Aravinda; Chambert, Kimberly; Choudhury, Khalid; Cichon, Sven; Mattheisen, Manuel; Cloninger, C. Robert; Collier, David A.; Cook, Edwin H.; Coon, Hilary; Cormand, Bru; Corvin, Aiden; Coryell, William H.; Craig, David W.; Craig, Ian W.; Crosbie, Jennifer; Cuccaro, Michael L.; Curtis, David; Czamara, Darina; Datta, Susmita; Dawson, Geraldine; Day, Richard; De Geus, Eco J.; Degenhardt, Franziska; Djurovic, Srdjan; Donohoe, Gary J.; Doyle, Alysa E.; Duan, Jubao; Dudbridge, Frank; Duketis, Eftichia; Ebstein, Richard P.; Edenberg, Howard J.; Elia, Josephine; Ennis, Sean; Etain, Bruno; Fanous, Ayman; Farmer, Anne E.; Ferrier, I. Nicol; Flicldnger, Matthew; Fombonne, Eric; Foroud, Tatiana; Frank, Josef; Franke, Barbara; Fraser, Christine; Freedman, Robert; Freimer, Nelson B.; Freitag, Christine M.; Friedl, Marion; Frisen, Louise; Gailagher, Louise; Gejman, Pablo V.; Georgieva, Lyudmila; Gershon, Elliot S.; Giegling, Ina; Gill, Michael; Gordon, Scott D.; Gordon-Smith, Katherine; Green, Elaine K.; Greenwood, Tiffany A.; Grice, Dorothy E.; Gross, Magdalena; Grozeva, Detelina; Guan, Weihua; Gurling, Hugh; De Haan, Lieuwe; Haines, Jonathan L.; Hakonarson, Hakon; Hallmayer, Joachim; Hamilton, Steven P.; Hamshere, Marian L.; Hansen, Thomas F.; Hartmann, Annette M.; Hautzinger, Martin; Heath, Andrew C.; Henders, Anjali K.; Herms, Stefan; Hickie, Ian B.; Hipolito, Maria; Hoefels, Susanne; Holsboer, Florian; Hoogendijk, Witte J.; Hottenga, Jouke-Jan; Hultman, Christina M.; Hus, Vanessa; Ingason, Andres; Ising, Marcus; Jamain, Stephane; Jones, Edward G.; Jones, Ian; Jones, Lisa; Tzeng, Jung-Ying; Kaehler, Anna K.; Kahn, Rene S.; Kandaswamy, Radhika; Keller, Matthew C.; Kennedy, James L.; Kenny, Elaine; Kent, Lindsey; Kim, Yunjung; Kirov, George K.; Klauck, Sabine M.; Klei, Lambertus; Knowles, James A.; Kohli, Martin A.; Koller, Daniel L.; Konte, Bettina; Korszun, Ania; Krabbendam, Lydia; Krasucki, Robert; Kuntsi, Jonna; Kwan, Phoenix; Landen, Mikael; Laengstroem, Niklas; Lathrop, Mark; Lawrence, Jacob; Lawson, William B.; Leboyer, Marion; Ledbetter, David H.; Lencz, Todd; Lesch, Klaus-Peter; Levinson, Douglas F.; Lewis, Cathryn M.; Li, Jun; Lichtenstein, Paul; Lieberman, Jeffrey A.; Lin, Dan-Yu; Linszen, Don H.; Liu, Chunyu; Lohoff, Falk W.; Loo, Sandra K.; Lord, Catherine; Lowe, Jennifer K.; Lucae, Susanne; MacIntyre, Donald J.; Madden, Pamela A. F.; Maestrini, Elena; Magnusson, Patrik K. E.; Mahon, Pamela B.; Maier, Wolfgang; Malhotra, Anil K.; Mane, Shrikant M.; Martin, Christa L.; Martin, Nicholas G.; Matthews, Keith; Mattingsdal, Morten; McCarroll, Steven A.; McGhee, Kevin A.; McGough, James J.; McGrath, Patrick J.; McGuffin, Peter; McInnis, Melvin G.; McIntosh, Andrew; McKinney, Rebecca; McLean, Alan W.; McMahon, Francis J.; McMahon, William M.; McQuillin, Andrew; Medeiros, Helena; Medland, Sarah E.; Meier, Sandra; Melle, Ingrid; Meng, Fan; Meyer, Jobst; Middeldorp, Christel M.; Middleton, Lefkos; Milanova, Vihra; Miranda, Ana; Monaco, Anthony P.; Montgomery, Grant W.; Moran, Jennifer L.; Moreno-De-Luca, Daniel; Morken, Gunnar; Morris, Derek W.; Morrow, Eric M.; Moskvina, Valentina; Muglia, Pierandrea; Muehleisen, Thomas W.; Muir, Walter J.; Mueller-Myhsok, Bertram; Murtha, Michael; Myers, Richard M.; Myin-Germeys, Inez; Neale, Michael C.; Nelson, Stan F.; Nievergelt, Caroline M.; Nikolov, Ivan; Nimgaonkar, Vishwajit; Nolen, Willem A.; Noethen, Markus M.; Nwulia, Evaristus A.; Nyholt, Dale R.; Oades, Robert D.; Olincy, Ann; Oliveira, Guiomar; Olsen, Line; Ophoff, Roel A.; Osby, Urban; Owen, Michael J.; Palotie, Aarno; Parr, Jeremy R.; Paterson, Andrew D.; Pato, Carlos N.; Pato, Michele T.; Penninx, Brenda W.; Pergadia, Michele L.; Pericak-Vance, Margaret A.; Pickard, Benjamin S.; Pimm, Jonathan; Piven, Joseph; Potash, James B.; Poustka, Fritz; Propping, Peter; Puri, Vinay; Quested, Digby J.; Quinn, Emma M.; Ramos-Quiroga, Josep Antoni; Rasmussen, Henrik B.; Raychaudhuri, Soumya; Rehnstroem, Karola; Reif, Andreas; Ribases, Marta; Rice, John P.; Rietschel, Marcella; Roeder, Kathryn; Roeyers, Herbert; Rothenberger, Aribert; Rouleau, Guy; Ruderfer, Douglas; Rujescu, Dan; Sanders, Alan R.; Sanders, Stephan J.; Santangelo, Susan L.; Sergeant, Joseph A.; Schachar, Russell; Schalling, Martin; Schatzberg, Alan F.; Scheftner, William A.; Schellenberg, Gerard D.; Scherer, Stephen W.; Schork, Nicholas J.; Schulze, Thomas G.; Schumacher, Johannes; Schwarz, Markus; Scolnick, Edward; Scott, Laura J.; Shi, Jianxin; Shilling, Paul D.; Shyn, Stanley I.; Silverman, Jeremy M.; Slager, Susan L.; Smalley, Susan L.; Smit, Johannes H.; Smith, Erin N.; Sonuga-Barke, Edmund J. S.; Cair, David St.; State, Matthew; Steffens, Michael; Steinhausen, Hans-Christoph; Strauss, John S.; Strohmaier, Jana; Stroup, T. Scott; Sutdiffe, James S.; Szatmari, Peter; Szelinger, Szabocls; Thirumalai, Srinivasa; Thompson, Robert C.; Todorov, Alexandre A.; Tozzi, Federica; Treutlein, Jens; Uhr, Manfred; Van den Oord, Edwin J. C. G.; Van Grootheest, Gerard; Van Os, Jim; Vicente, Astrid M.; Vieland, Veronica J.; Vincent, John B.; Visscher, Peter M.; Walsh, Christopher A.; Wassink, Thomas H.; Watson, Stanley J.; Weissman, Myrna M.; Werge, Thomas; Wienker, Thomas F.; Wijsman, Ellen M.; Willemsen, Gonneke; Williams, Nigel; Willsey, A. Jeremy; Witt, Stephanie H.; Xu, Wei; Young, Allan H.; Yu, Timothy W.; Zammit, Stanley; Zandi, Peter P.; Zhang, Peng; Zitman, Frans G.; Zoellner, Sebastian; Devlin, Bernie; Kelsoe, John R.; Sklar, Pamela; Daly, Mark J.; O'Donovan, Michael C.; Craddock, Nicholas; Kendler, Kenneth S.; Weiss, Lauren A.; Wray, Naomi R.; Zhao, Zhaoming; Geschwind, Daniel H.; Sullivan, Patrick F.; Smoller, Jordan W.; Holmans, Peter A.; Breen, Gerome

    2015-01-01

    Genome-wide association studies (GWAS) of psychiatric disorders have identified multiple genetic associations with such disorders, but better methods are needed to derive the underlying biological mechanisms that these signals indicate. We sought to identify biological pathways in GWAS data from ove

  1. Genome constitution of Narcissus variety, 'Tete-a-Tete', analysed through GISH and NBS profiling

    NARCIS (Netherlands)

    Wu, H.; Ramanna, M.S.; Arens, P.; Tuyl, van J.M.

    2011-01-01

    The Narcissus variety, ‘Tête-à-Tête’, has been the most popular variety since 1949, and a well known allotriploid (2n = 3x = 24 + B) of spontaneous origin. Because the identity of one of the parents of this variety was uncertain, the genome constitution of ‘Tête-à-Tête’ was investigated by using gen

  2. The chloroplast genome of the hexaploid Spartina maritima (Poaceae, Chloridoideae): Comparative analyses and molecular dating.

    Science.gov (United States)

    Rousseau-Gueutin, M; Bellot, S; Martin, G E; Boutte, J; Chelaifa, H; Lima, O; Michon-Coudouel, S; Naquin, D; Salmon, A; Ainouche, K; Ainouche, M

    2015-12-01

    The history of many plant lineages is complicated by reticulate evolution with cases of hybridization often followed by genome duplication (allopolyploidy). In such a context, the inference of phylogenetic relationships and biogeographic scenarios based on molecular data is easier using haploid markers like chloroplast genome sequences. Hybridization and polyploidization occurred recurrently in the genus Spartina (Poaceae, Chloridoideae), as illustrated by the recent formation of the invasive allododecaploid S. anglica during the 19th century in Europe. Until now, only a few plastid markers were available to explore the history of this genus and their low variability limited the resolution of species relationships. We sequenced the complete chloroplast genome (plastome) of S. maritima, the native European parent of S. anglica, and compared it to the plastomes of other Poaceae. Our analysis revealed the presence of fast-evolving regions of potential taxonomic, phylogeographic and phylogenetic utility at various levels within the Poaceae family. Using secondary calibrations, we show that the tetraploid and hexaploid lineages of Spartina diverged 6-10 my ago, and that the two parents of the invasive allopolyploid S. anglica separated 2-4 my ago via long distance dispersal of the ancestor of S. maritima over the Atlantic Ocean. Finally, we discuss the meaning of divergence times between chloroplast genomes in the context of reticulate evolution. PMID:26182838

  3. MRSA transmission on a neonatal intensive care unit: epidemiological and genome-based phylogenetic analyses.

    Directory of Open Access Journals (Sweden)

    Ulrich Nübel

    Full Text Available BACKGROUND: Methicillin-resistant Staphylococcus aureus (MRSA may cause prolonged outbreaks of infections in neonatal intensive care units (NICUs. While the specific factors favouring MRSA spread on neonatal wards are not well understood, colonized infants, their relatives, or health-care workers may all be sources for MRSA transmission. Whole-genome sequencing may provide a new tool for elucidating transmission pathways of MRSA at a local scale. METHODS AND FINDINGS: We applied whole-genome sequencing to trace MRSA spread in a NICU and performed a case-control study to identify risk factors for MRSA transmission. MRSA genomes had accumulated sequence variation sufficiently fast to reflect epidemiological linkage among individual patients, between infants and their mothers, and between infants and staff members, such that the relevance of individual nurses' nasal MRSA colonization for prolonged transmission could be evaluated. In addition to confirming previously reported risk factors, we identified an increased risk of transmission from infants with as yet unknown MRSA colonisation, in contrast to known MRSA-positive infants. CONCLUSIONS: The integration of epidemiological (temporal, spatial and genomic data enabled the phylogenetic testing of several hypotheses on specific MRSA transmission routes within a neonatal intensive-care unit. The pronounced risk of transmission emanating from undetected MRSA carriers suggested that increasing the frequency or speed of microbiological diagnostics could help to reduce transmission of MRSA.

  4. Metabolic model for the filamentous ‘Candidatus Microthrix parvicella’ based on genomic and metagenomic analyses

    DEFF Research Database (Denmark)

    McIlroy, Simon Jon; Kristiansen, Rikke; Albertsen, Mads;

    2013-01-01

    acids as triacylglycerols. Utilisation of trehalose and/or polyphosphate stores or partial oxidation of long-chain fatty acids may supply the energy required for anaerobic lipid uptake and storage. Comparing the genome sequence of this isolate with metagenomes from two full-scale wastewater treatment...

  5. Deciphering heterogeneity in pig genome assembly Sscrofa9 by isochore and isochore-like region analyses.

    Directory of Open Access Journals (Sweden)

    Wenqian Zhang

    Full Text Available BACKGROUND: The isochore, a large DNA sequence with relatively small GC variance, is one of the most important structures in eukaryotic genomes. Although the isochore has been widely studied in humans and other species, little is known about its distribution in pigs. PRINCIPAL FINDINGS: In this paper, we construct a map of long homogeneous genome regions (LHGRs, i.e., isochores and isochore-like regions, in pigs to provide an intuitive version of GC heterogeneity in each chromosome. The LHGR pattern study not only quantifies heterogeneities, but also reveals some primary characteristics of the chromatin organization, including the followings: (1 the majority of LHGRs belong to GC-poor families and are in long length; (2 a high gene density tends to occur with the appearance of GC-rich LHGRs; and (3 the density of LINE repeats decreases with an increase in the GC content of LHGRs. Furthermore, a portion of LHGRs with particular GC ranges (50%-51% and 54%-55% tend to have abnormally high gene densities, suggesting that biased gene conversion (BGC, as well as time- and energy-saving principles, could be of importance to the formation of genome organization. CONCLUSION: This study significantly improves our knowledge of chromatin organization in the pig genome. Correlations between the different biological features (e.g., gene density and repeat density and GC content of LHGRs provide a unique glimpse of in silico gene and repeats prediction.

  6. Meta-analyses of genome-wide association studies identify multiple loci associated with pulmonary function

    NARCIS (Netherlands)

    D.B. Hancock (Dana); M. Eijgelsheim (Mark); J.B. Wilk (Jemma); S.A. Gharib (Sina); L.R. Loehr (Laura); K. Marciante (Kristin); N. Franceschini (Nora); Y.M.T.A. van Durme; T.H. Chen; R.G. Barr (Graham); M.B. Schabath (Matthew); D.J. Couper (David); G.G. Brusselle (Guy); B.M. Psaty (Bruce); P. Tikka-Kleemola (Päivi); J.I. Rotter (Jerome); A.G. Uitterlinden (André); A. Hofman (Albert); N.M. Punjabi (Naresh); F. Rivadeneira Ramirez (Fernando); A.C. Morrison (Alanna); P.L. Enright (Paul); K.E. North (Kari); S.R. Heckbert (Susan); T. Lumley (Thomas); B.H.Ch. Stricker (Bruno); G.T. O'Connor (George); S.J. London (Stephanie)

    2010-01-01

    textabstractSpirometric measures of lung function are heritable traits that reflect respiratory health and predict morbidity and mortality. We meta-analyzed genome-wide association studies for two clinically important lung-function measures: forced expiratory volume in the first second (FEV1) and it

  7. Whole-genome analyses resolve early branches in the tree of life of modern birds

    DEFF Research Database (Denmark)

    Sicheritz-Pontén, Thomas; Li, Cai; Li, Bo;

    2014-01-01

    and confirm independent gains of vocal learning. Among Columbea, we identify pigeons and flamingoes as belonging to sister clades. Even with whole genomes, some of the earliest branches in Neoaves proved challenging to resolve, which was best explained by massive protein-coding sequence convergence...

  8. Psychiatric genome-wide association study analyses implicate neuronal, immune and histone pathways

    DEFF Research Database (Denmark)

    O'Dushlaine, Colm; Rossin, Lizzy; Lee, Phil H.;

    2015-01-01

    Genome-wide association studies (GWAS) of psychiatric disorders have identified multiple genetic associations with such disorders, but better methods are needed to derive the underlying biological mechanisms that these signals indicate. We sought to identify biological pathways in GWAS data from ...

  9. Classification and regression tree (CART analyses of genomic signatures reveal sets of tetramers that discriminate temperature optima of archaea and bacteria

    Directory of Open Access Journals (Sweden)

    Betsey Dexter Dyer

    2008-01-01

    Full Text Available Classification and regression tree (CART analysis was applied to genome-wide tetranucleotide frequencies (genomic signatures of 195 archaea and bacteria. Although genomic signatures have typically been used to classify evolutionary divergence, in this study, convergent evolution was the focus. Temperature optima for most of the organisms examined could be distinguished by CART analyses of tetranucleotide frequencies. This suggests that pervasive (nonlinear qualities of genomes may reflect certain environmental conditions (such as temperature in which those genomes evolved. The predominant use of GAGA and AGGA as the discriminating tetramers in CART models suggests that purine-loading and codon biases of thermophiles may explain some of the results.

  10. Genome-wide association analyses reveal complex genetic architecture underlying natural variation for flowering time in canola.

    Science.gov (United States)

    Raman, H; Raman, R; Coombes, N; Song, J; Prangnell, R; Bandaranayake, C; Tahira, R; Sundaramoorthi, V; Killian, A; Meng, J; Dennis, E S; Balasubramanian, S

    2016-06-01

    Optimum flowering time is the key to maximize canola production in order to meet global demand of vegetable oil, biodiesel and canola-meal. We reveal extensive variation in flowering time across diverse genotypes of canola under field, glasshouse and controlled environmental conditions. We conduct a genome-wide association study and identify 69 single nucleotide polymorphism (SNP) markers associated with flowering time, which are repeatedly detected across experiments. Several associated SNPs occur in clusters across the canola genome; seven of them were detected within 20 Kb regions of a priori candidate genes; FLOWERING LOCUS T, FRUITFUL, FLOWERING LOCUS C, CONSTANS, FRIGIDA, PHYTOCHROME B and an additional five SNPs were localized within 14 Kb of a previously identified quantitative trait loci for flowering time. Expression analyses showed that among FLC paralogs, BnFLC.A2 accounts for ~23% of natural variation in diverse accessions. Genome-wide association analysis for FLC expression levels mapped not only BnFLC.C2 but also other loci that contribute to variation in FLC expression. In addition to revealing the complex genetic architecture of flowering time variation, we demonstrate that the identified SNPs can be modelled to predict flowering time in diverse canola germplasm accurately and hence are suitable for genomic selection of adaptative traits in canola improvement programmes. PMID:26428711

  11. GenoMatrix: A Software Package for Pedigree-Based and Genomic Prediction Analyses on Complex Traits.

    Science.gov (United States)

    Nazarian, Alireza; Gezan, Salvador Alejandro

    2016-07-01

    Genomic and pedigree-based best linear unbiased prediction methodologies (G-BLUP and P-BLUP) have proven themselves efficient for partitioning the phenotypic variance of complex traits into its components, estimating the individuals' genetic merits, and predicting unobserved (or yet-to-be observed) phenotypes in many species and fields of study. The GenoMatrix software, presented here, is a user-friendly package to facilitate the process of using genome-wide marker data and parentage information for G-BLUP and P-BLUP analyses on complex traits. It provides users with a collection of applications which help them on a set of tasks from performing quality control on data to constructing and manipulating the genomic and pedigree-based relationship matrices and obtaining their inverses. Such matrices will be then used in downstream analyses by other statistical packages. The package also enables users to obtain predicted values for unobserved individuals based on the genetic values of observed related individuals. GenoMatrix is available to the research community as a Windows 64bit executable and can be downloaded free of charge at: http://compbio.ufl.edu/software/genomatrix/. PMID:27025440

  12. In silico genomic analyses reveal three distinct lineages of Escherichia coli O157:H7, one of which is associated with hyper-virulence

    Directory of Open Access Journals (Sweden)

    Karmali Mohamed A

    2009-06-01

    Full Text Available Abstract Background Many approaches have been used to study the evolution, population structure and genetic diversity of Escherichia coli O157:H7; however, observations made with different genotyping systems are not easily relatable to each other. Three genetic lineages of E. coli O157:H7 designated I, II and I/II have been identified using octamer-based genome scanning and microarray comparative genomic hybridization (mCGH. Each lineage contains significant phenotypic differences, with lineage I strains being the most commonly associated with human infections. Similarly, a clade of hyper-virulent O157:H7 strains implicated in the 2006 spinach and lettuce outbreaks has been defined using single-nucleotide polymorphism (SNP typing. In this study an in silico comparison of six different genotyping approaches was performed on 19 E. coli genome sequences from 17 O157:H7 strains and single O145:NM and K12 MG1655 strains to provide an overall picture of diversity of the E. coli O157:H7 population, and to compare genotyping methods for O157:H7 strains. Results In silico determination of lineage, Shiga-toxin bacteriophage integration site, comparative genomic fingerprint, mCGH profile, novel region distribution profile, SNP type and multi-locus variable number tandem repeat analysis type was performed and a supernetwork based on the combination of these methods was produced. This supernetwork showed three distinct clusters of strains that were O157:H7 lineage-specific, with the SNP-based hyper-virulent clade 8 synonymous with O157:H7 lineage I/II. Lineage I/II/clade 8 strains clustered closest on the supernetwork to E. coli K12 and E. coli O55:H7, O145:NM and sorbitol-fermenting O157 strains. Conclusion The results of this study highlight the similarities in relationships derived from multi-locus genome sampling methods and suggest a "common genotyping language" may be devised for population genetics and epidemiological studies. Future genotyping

  13. Genome-wide association analyses for growth and feed efficiency traits in beef cattle.

    Science.gov (United States)

    Lu, D; Miller, S; Sargolzaei, M; Kelly, M; Vander Voort, G; Caldwell, T; Wang, Z; Plastow, G; Moore, S

    2013-08-01

    A genome-wide association study using the Illumina 50K BeadChip included 38,745 SNP on 29 BTA analyzed on 751 animals, including 33 purebreds and 718 crossbred cattle. Genotypes and 6 production traits: birth weight (BWT), weaning weight (WWT), ADG, DMI, midtest metabolic BW (MMWT), and residual feed intake (RFI), were used to estimate effects of individual SNP on the traits. At the genome-wide level false discovery rate (FDR impact them in same direction. In terms of the size of SNP effect, the significant SNP (P information to further assist the identification of chromosome regions and subsequently genes affecting growth and feed efficiency traits in beef cattle. PMID:23851991

  14. Measuring alcohol consumption for genomic meta-analyses of alcohol intake: opportunities and challenges12345

    OpenAIRE

    Agrawal, Arpana; Neal D Freedman; Cheng, Yu-Ching; Lin, Peng; Shaffer, John R.; Sun, Qi; Taylor, Kira; Yaspan, Brian,; Cole, John W; Cornelis, Marilyn C.; Rebecca S. DeSensi; Fitzpatrick, Annette; Heiss, Gerardo; Kang, Jae H; O'Connell, Jeffrey

    2012-01-01

    Whereas moderate drinking may have health benefits, excessive alcohol consumption causes many important acute and chronic diseases and is the third leading contributor to preventable death in the United States. Twin studies suggest that alcohol-consumption patterns are heritable (50%); however, multiple genetic variants of modest effect size are likely to contribute to this heritable variation. Genome-wide association studies provide a tool for discovering genetic loci that contribute to vari...

  15. Comparative genomic analyses of the cyanobacterium, Lyngbya aestuarii BL J, a powerful hydrogen producer.

    Directory of Open Access Journals (Sweden)

    AnkitaKothari

    2013-12-01

    Full Text Available The filamentous, non-heterocystous cyanobacterium Lyngbya aestuarii is an important contributor to marine intertidal microbial mats system worldwide. The recent isolate L. aestuarii BL J, is an unusually powerful hydrogen producer. Here we report a morphological, ultrastructural and genomic characterization of this strain to set the basis for future systems studies and applications of this organism. The filaments contain circa 17 μm wide trichomes, composed of stacked disk-like short cells (2 μm long, encased in a prominent, laminated exopolysaccharide sheath. Cellular division occurs by transversal centripetal growth of cross-walls, where several rounds of division proceed simultaneously. Filament division occurs by cell self-immolation of one or groups of cells (necridial cells at the breakage point. Short, sheath-less, motile filaments (hormogonia are also formed. Morphologically and phylogenetically L. aestuarii belongs to a clade of important cyanobacteria that include members of the marine Trichodesmiun and Hydrocoleum genera, as well as terrestrial Microcoleus vaginatus strains, and alkalyphilic strains of Arthrospira. A draft genome of strain BL J was compared to those of other cyanobacteria in order to ascertain some of its ecological constraints and biotechnological potential. The genome had an average GC content of 41.1 %. Of the 6.87 Mb sequenced, 6.44 Mb was present as large contigs (>10,000 bp. It contained 6515 putative protein-encoding genes, of which, 43 % encode proteins of known functional role, 26 % corresponded to proteins with domain or family assignments, 19.6 % encode conserved hypothetical proteins, and 11.3 % encode apparently unique hypothetical proteins. The strain’s genome reveals its adaptations to a life of exposure to intense solar radiation and desiccation. It likely employs the storage compounds, glycogen and cyanophycin but no polyhydroxyalkanoates, and can produce the osmolytes, trehalose and glycine

  16. Genomic Analyses of the Microsporidian Nosema ceranae, an Emergent Pathogen of Honey Bees

    OpenAIRE

    Cornman, R. Scott; Chen, Yan Ping; Schatz, Michael C; Street, Craig; Yan ZHAO; Desany, Brian; Egholm, Michael; Hutchison, Stephen; Pettis, Jeffery S.; Lipkin, W. Ian; Evans, Jay D.

    2009-01-01

    Recent steep declines in honey bee health have severely impacted the beekeeping industry, presenting new risks for agricultural commodities that depend on insect pollination. Honey bee declines could reflect increased pressures from parasites and pathogens. The incidence of the microsporidian pathogen Nosema ceranae has increased significantly in the past decade. Here we present a draft assembly (7.86 MB) of the N. ceranae genome derived from pyrosequence data, including initial gene models a...

  17. Genomic analyses of the microsporidian Nosema ceranae, an emergent pathogen of honey bees.

    OpenAIRE

    R. Scott Cornman; Yan Ping Chen; Schatz, Michael C; Craig Street; Yan Zhao; Brian Desany; Michael Egholm; Stephen Hutchison; Pettis, Jeffery S.; W Ian Lipkin; Evans, Jay D.

    2009-01-01

    Recent steep declines in honey bee health have severely impacted the beekeeping industry, presenting new risks for agricultural commodities that depend on insect pollination. Honey bee declines could reflect increased pressures from parasites and pathogens. The incidence of the microsporidian pathogen Nosema ceranae has increased significantly in the past decade. Here we present a draft assembly (7.86 MB) of the N. ceranae genome derived from pyrosequence data, including initial gene models a...

  18. Genome wide evolutionary analyses reveal serotype specific patterns of positive selection in selected Salmonella serotypes

    OpenAIRE

    Soyer, Yeşim; Orsi, Renato H.; Rodriguez-Rivera, Lorraine D; Sun, Qi; Wiedmann, Martin

    2009-01-01

    Background The bacterium Salmonella enterica includes a diversity of serotypes that cause disease in humans and different animal species. Some Salmonella serotypes show a broad host range, some are host restricted and exclusively associated with one particular host, and some are associated with one particular host species, but able to cause disease in other host species and are thus considered "host adapted". Five Salmonella genome sequences, representing a broad host range serotype (Typhimur...

  19. Deciphering Clostridium tyrobutyricum Metabolism Based on the Whole-Genome Sequence and Proteome Analyses

    OpenAIRE

    Lee, Joungmin; Jang, Yu-Sin; Han, Mee-Jung; Kim, Jin Young; Lee, Sang Yup

    2016-01-01

    ABSTRACT Clostridium tyrobutyricum is a Gram-positive anaerobic bacterium that efficiently produces butyric acid and is considered a promising host for anaerobic production of bulk chemicals. Due to limited knowledge on the genetic and metabolic characteristics of this strain, however, little progress has been made in metabolic engineering of this strain. Here we report the complete genome sequence of C. tyrobutyricum KCTC 5387 (ATCC 25755), which consists of a 3.07-Mbp chromosome and a 63-kb...

  20. Leveraging wall-sized high-resolution displays for comparative genomics analyses of copy number variation

    OpenAIRE

    Ruddle, RA; Fateen, W; Treanor, D; Quirke, P.; Sondergeld, P

    2013-01-01

    The scale of comparative genomics data frequently overwhelms current data visualization methods on conventional (desktop) displays. This paper describes two types of solution that take advantage of wall-sized high-resolution displays (WHirDs), which have orders of magnitude more display real estate (i.e., pixels) than desktop displays. The first allows users to view detailed graphics of copy number variation (CNV) that were output by existing software. A WHirD's resolution allowed a 10× incre...

  1. Molecular Characterization of Five Potyviruses Infecting Korean Sweet Potatoes Based on Analyses of Complete Genome Sequences

    OpenAIRE

    Kwak, Hae-Ryun; Kim, Jaedeok; Kim, Mi-Kyeong; Seo, Jang-Kyun; Jung, Mi-Nam; Kim, Jeong-Soo; Lee, Sukchan; Choi, Hong-Soo

    2015-01-01

    Sweet potatoes (Ipomea batatas L.) are grown extensively, in tropical and temperate regions, and are important food crops worldwide. In Korea, potyviruses, including Sweet potato feathery mottle virus (SPFMV), Sweet potato virus C (SPVC), Sweet potato virus G (SPVG), Sweet potato virus 2 (SPV2), and Sweet potato latent virus (SPLV), have been detected in sweet potato fields at a high (~95%) incidence. In the present work, complete genome sequences of 18 isolates, representing the five potyvir...

  2. Functional and comparative genomic analyses of an operon involved in fructooligosaccharide utilization by Lactobacillus acidophilus

    OpenAIRE

    Barrangou, Rodolphe; Altermann, Eric; Hutkins, Robert; Cano, Raul; Klaenhammer, Todd R.

    2003-01-01

    Lactobacillus acidophilus is a probiotic organism that displays the ability to use prebiotic compounds such as fructooligosaccharides (FOS), which stimulate the growth of beneficial commensals in the gastrointestinal tract. However, little is known about the mechanisms and genes involved in FOS utilization by Lactobacillus species. Analysis of the L. acidophilus NCFM genome revealed an msm locus composed of a transcriptional regulator of the LacI family, a four-compone...

  3. Streptococcal taxonomy based on genome sequence analyses [v1; ref status: indexed, http://f1000r.es/o1

    Directory of Open Access Journals (Sweden)

    Cristiane C Thompson

    2013-03-01

    Full Text Available The identification of the clinically relevant viridans streptococci group, at species level, is still problematic. The aim of this study was to extract taxonomic information from the complete genome sequences of 67 streptococci, comprising 19 species, by means of genomic analyses, multilocus sequence analysis (MLSA, average amino acid identity (AAI, genomic signatures, genome-to-genome distances (GGD and codon usage bias. We then attempted to determine the usefulness of these genomic tools for species identification in streptococci. Our results showed that MLSA, AAI and GGD analyses are robust markers to identify streptococci at the species level, for instance, S. pneumoniae, S. mitis, and S. oralis. A Streptococcus species can be defined as a group of strains that share ≥ 95% DNA similarity in MLSA and AAI, and > 70% DNA identity in GGD. This approach allows an advanced understanding of bacterial diversity.

  4. Genome, Transcriptome, and Functional Analyses of Penicillium expansum Provide New Insights Into Secondary Metabolism and Pathogenicity.

    Science.gov (United States)

    Ballester, Ana-Rosa; Marcet-Houben, Marina; Levin, Elena; Sela, Noa; Selma-Lázaro, Cristina; Carmona, Lourdes; Wisniewski, Michael; Droby, Samir; González-Candelas, Luis; Gabaldón, Toni

    2015-03-01

    The relationship between secondary metabolism and infection in pathogenic fungi has remained largely elusive. The genus Penicillium comprises a group of plant pathogens with varying host specificities and with the ability to produce a wide array of secondary metabolites. The genomes of three Penicillium expansum strains, the main postharvest pathogen of pome fruit, and one Pencillium italicum strain, a postharvest pathogen of citrus fruit, were sequenced and compared with 24 other fungal species. A genomic analysis of gene clusters responsible for the production of secondary metabolites was performed. Putative virulence factors in P. expansum were identified by means of a transcriptomic analysis of apple fruits during the course of infection. Despite a major genome contraction, P. expansum is the Penicillium species with the largest potential for the production of secondary metabolites. Results using knockout mutants clearly demonstrated that neither patulin nor citrinin are required by P. expansum to successfully infect apples. Li et al. ( MPMI-12-14-0398-FI ) reported similar results and conclusions in their recently accepted paper. PMID:25338147

  5. Phylogenetic and Genomic Analyses Resolve the Origin of Important Plant Genes Derived from Transposable Elements.

    Science.gov (United States)

    Joly-Lopez, Zoé; Hoen, Douglas R; Blanchette, Mathieu; Bureau, Thomas E

    2016-08-01

    Once perceived as merely selfish, transposable elements (TEs) are now recognized as potent agents of adaptation. One way TEs contribute to evolution is through TE exaptation, a process whereby TEs, which persist by replicating in the genome, transform into novel host genes, which persist by conferring phenotypic benefits. Known exapted TEs (ETEs) contribute diverse and vital functions, and may facilitate punctuated equilibrium, yet little is known about this process. To better understand TE exaptation, we designed an approach to resolve the phylogenetic context and timing of exaptation events and subsequent patterns of ETE diversification. Starting with known ETEs, we search in diverse genomes for basal ETEs and closely related TEs, carefully curate the numerous candidate sequences, and infer detailed phylogenies. To distinguish TEs from ETEs, we also weigh several key genomic characteristics including repetitiveness, terminal repeats, pseudogenic features, and conserved domains. Applying this approach to the well-characterized plant ETEs MUG and FHY3, we show that each group is paraphyletic and we argue that this pattern demonstrates that each originated in not one but multiple exaptation events. These exaptations and subsequent ETE diversification occurred throughout angiosperm evolution including the crown group expansion, the angiosperm radiation, and the primitive evolution of angiosperms. In addition, we detect evidence of several putative novel ETE families. Our findings support the hypothesis that TE exaptation generates novel genes more frequently than is currently thought, often coinciding with key periods of evolution. PMID:27189548

  6. Comparative Genomic Analyses of Streptococcus pseudopneumoniae Provide Insight into Virulence and Commensalism Dynamics.

    Directory of Open Access Journals (Sweden)

    Dea Shahinas

    Full Text Available Streptococcus pseudopneumoniae (SPPN is a recently described species of the viridans group streptococci (VGS. Although the pathogenic potential of S. pseudopneumoniae remains uncertain, it is most commonly isolated from patients with underlying medical conditions, such as chronic obstructive pulmonary disease. S. pseudopneumoniae can be distinguished from the closely related species, S. pneumoniae and S. mitis, by phenotypic characteristics, including optochin resistance in the presence of 5% CO2, bile insolubility, and the lack of the pneumococcal capsule. Previously, we reported the draft genome sequence of S. pseudopneumoniae IS7493, a clinical isolate obtained from an immunocompromised patient with documented pneumonia. Here, we use comparative genomics approaches to identify similarities and key differences between S. pseudopneumoniae IS7493, S. pneumoniae and S. mitis. The genome structure of S. pseudopneumoniae IS7493 is most closely related to that of S. pneumoniae R6, but several recombination events are evident. Analysis of gene content reveals numerous unique features that distinguish S. pseudopneumoniae from other streptococci. The presence of loci for competence, iron transport, pneumolysin production and antimicrobial resistance reinforce the phylogenetic position of S. pseudopneumoniae as an intermediate species between S. pneumoniae and S. mitis. Additionally, the presence of several virulence factors and antibiotic resistance mechanisms suggest the potential of this commensal species to become pathogenic or to contribute to increasing antibiotic resistance levels seen among the VGS.

  7. A systematic comparison of genome-scale clustering algorithms

    OpenAIRE

    Jay, Jeremy J.; Eblen, John D; Zhang, Yun; Benson, Mikael; Perkins, Andy D.; Saxton, Arnold M.; Voy, Brynn H.; Elissa J Chesler; Langston, Michael A.

    2012-01-01

    Background: A wealth of clustering algorithms has been applied to gene co-expression experiments. These algorithms cover a broad range of approaches, from conventional techniques such as k-means and hierarchical clustering, to graphical approaches such as k-clique communities, weighted gene co-expression networks (WGCNA) and paraclique. Comparison of these methods to evaluate their relative effectiveness provides guidance to algorithm selection, development and implementation. Most prior work...

  8. A comparison of multivariate genome-wide association methods

    DEFF Research Database (Denmark)

    Galesloot, Tessel E; Van Steen, Kristel; Kiemeney, Lambertus A L M;

    2014-01-01

    Joint association analysis of multiple traits in a genome-wide association study (GWAS), i.e. a multivariate GWAS, offers several advantages over analyzing each trait in a separate GWAS. In this study we directly compared a number of multivariate GWAS methods using simulated data. We focused on six...... methods that are implemented in the software packages PLINK, SNPTEST, MultiPhen, BIMBAM, PCHAT and TATES, and also compared them to standard univariate GWAS, analysis of the first principal component of the traits, and meta-analysis of univariate results. We simulated data (N = 1000) for three...... correlation. We compared the power of the methods using empirically fixed significance thresholds (α = 0.05). Our results showed that the multivariate methods implemented in PLINK, SNPTEST, MultiPhen and BIMBAM performed best for the majority of the tested scenarios, with a notable increase in power...

  9. Genome-wide analyses of proliferation-important genes of Iridovirus-tiger frog virus by RNAi.

    Science.gov (United States)

    Xie, Jun-Feng; Lai, Yu-Xiong; Huang, Li-Jie; Huang, Run-Qing; Yang, Shao-Wei; Shi, Yan; Weng, Shao-Ping; Zhang, Yong; He, Jian-Guo

    2014-08-30

    Tiger frog virus (TFV), a species of genus Ranavirus in the family Iridoviridae, is a nuclear cytoplasmic large DNA virus that infects aquatic vertebrates such as tiger frog (Rana tigrina rugulosa) and Chinese soft-shelled turtle (Trionyx sinensis). Based on the available genome sequences of TFV, the well-developed RNA interference (RNAi) technique, and the reliable cell line for infection model, we decided to analyze the functional importance of all predicted genes. Firstly, a relative quantitative cytopathogenic effect (Q-CPE) assay was established to monitor the viral proliferation in fish cells. Then, genome-wide RNAi screens of 95 small interference (si) RNAs against TFV were performed to characterize the functional importance of nearly all (95%) predicted TFV genes by Q-CPE scaling system. We identified 32 (33.7%) genes as essential, 50 (52.6%) genes as semi-essential and 13 (13.7%) genes as nonessential for TFV proliferation. Quantitative RT-PCR and titer assays of selected genes were performed to verify the screen results. Furthermore, the screened essential genes were analyzed for their genome distribution and conservative comparison within Ranavirus. Such a systematic screen for viral functional genes by cell phenotypes should provide further insights into understanding of the information in antiviral targets, and in viral replication and pathogenesis of iridovirus. PMID:24886972

  10. Integrative genomic analyses of a novel cytokine, interleukin-34 and its potential role in cancer prediction

    OpenAIRE

    Wang, Bo; Xu, Wenming; TAN, MIAOLIAN; Xiao, Yan; Yang, Haiwei; Xia, Tian-Song

    2014-01-01

    Interleukin-34 (IL-34) is a novel cytokine, which is composed of 222 amino acids and forms homodimers. It binds to the macrophage colony-stimulating factor (M-CSF) receptor and plays an important role in innate immunity and inflammatory processes. In the present study, we identified the completed IL-34 gene in 25 various mammalian genomes and found that IL-34 existed in all types of vertebrates, including fish, amphibians, birds and mammals. These species have a similar 7 exon/6 intron gene o...

  11. Genome-wide meta-analyses identify three loci associated with primary biliary cirrhosis

    OpenAIRE

    Liu, Xiangdong; Invernizzi, Pietro; Lu, Yue,; Kosoy, Roman; Lu, Yan; Bianchi, Ilaria; Podda, Mauro; Chun XU; Xie, Gang; Macciardi, Fabio; Selmi, Carlo; Lupoli, Sara; Shigeta, Russell; Ransom, Michael; Lleo, Ana

    2010-01-01

    A genome-wide association screen for primary biliary cirrhosis risk alleles was performed in an Italian cohort. The results from the Italian cohort replicated IL12A and IL12RB associations, and a combined meta-analysis using a Canadian dataset identified newly associated loci at SPIB (P = 7.9 × 10–11, odds ratio (OR) = 1.46), IRF5-TNPO3 (P = 2.8 × 10–10, OR = 1.63) and 17q12-21 (P = 1.7 × 10–10, OR = 1.38).

  12. Genome-wide meta-analyses identify three loci associated with primary biliary cirrhosis

    Science.gov (United States)

    Liu, Xiangdong; Invernizzi, Pietro; Lu, Yue; Kosoy, Roman; Lu, Yan; Bianchi, Ilaria; Podda, Mauro; Xu, Chun; Xie, Gang; Macciardi, Fabio; Selmi, Carlo; Lupoli, Sara; Shigeta, Russell; Ransom, Michael; Lleo, Ana; Lee, Annette T; Mason, Andrew L; Myers, Robert P; Peltekian, Kevork M; Ghent, Cameron N; Bernuzzi, Francesca; Zuin, Massimo; Rosina, Floriano; Borghesio, Elisabetta; Floreani, Annarosa; Lazzari, Roberta; Niro, Grazia; Andriulli, Angelo; Muratori, Luigi; Muratori, Paolo; Almasio, Piero L; Andreone, Pietro; Margotti, Marzia; Brunetto, Maurizia; Coco, Barbara; Alvaro, Domenico; Bragazzi, Maria C; Marra, Fabio; Pisano, Alessandro; Rigamonti, Cristina; Colombo, Massimo; Marzioni, Marco; Benedetti, Antonio; Fabris, Luca; Strazzabosco, Mario; Portincasa, Piero; Palmieri, Vincenzo O; Tiribelli, Claudio; Croce, Lory; Bruno, Savino; Rossi, Sonia; Vinci, Maria; Prisco, Cleofe; Mattalia, Alberto; Toniutto, Pierluigi; Picciotto, Antonio; Galli, Andrea; Ferrari, Carlo; Colombo, Silvia; Casella, Giovanni; Morini, Lorenzo; Caporaso, Nicola; Colli, Agostino; Spinzi, Giancarlo; Montanari, Renzo; Gregersen, Peter K; Heathcote, E Jenny; Hirschfield, Gideon M; Siminovitch, Katherine A; Amos, Christopher I; Gershwin, M Eric; Seldin, Michael F

    2011-01-01

    A genome-wide association screen for primary biliary cirrhosis risk alleles was performed in an Italian cohort. The results from the Italian cohort replicated IL12A and IL12RB associations, and a combined meta-analysis using a Canadian dataset identified newly associated loci at SPIB (P = 7.9 × 10–11, odds ratio (OR) = 1.46), IRF5-TNPO3 (P = 2.8 × 10–10, OR = 1.63) and 17q12-21 (P = 1.7 × 10–10, OR = 1.38). PMID:20639880

  13. ANItools web: a web tool for fast genome comparison within multiple bacterial strains

    Science.gov (United States)

    Han, Na; Qiang, Yujun; Zhang, Wen

    2016-01-01

    Background: Early classification of prokaryotes was based solely on phenotypic similarities, but modern prokaryote characterization has been strongly influenced by advances in genetic methods. With the fast development of the sequencing technology, the ever increasing number of genomic sequences per species offers the possibility for developing distance determinations based on whole-genome information. The average nucleotide identity (ANI), calculated from pair-wise comparisons of all sequences shared between two given strains, has been proposed as the new metrics for bacterial species definition and classification. Results: In this study, we developed the web version of ANItools (http://ani.mypathogen.cn/), which helps users directly get ANI values from online sources. A database covering ANI values of any two strains in a genus was also included (2773 strains, 1487 species and 668 genera). Importantly, ANItools web can automatically run genome comparison between the input genomic sequence and data sequences (Genus and Species levels), and generate a graphical report for ANI calculation results. Conclusion: ANItools web is useful for defining the relationship between bacterial strains, further contributing to the classification and identification of bacterial species using genome data. Database URL: http://ani.mypathogen.cn/ PMID:27270714

  14. Complete mitochondrial genomes of Teinopalpus imperialis (Lepidoptera: Papilionidae) and phylogenetic relationships analyses.

    Science.gov (United States)

    Huang, Chao-Bin; Zeng, Ju-Ping; Zhou, Shan-Yi

    2016-07-01

    In this study, we sequenced the complete mitochondrial genome of Teinopalpus imperialis, which is a national butterfly of India, and a grade-II protected species in China. The complete mtDNA from T. imperialis was 15 299 base pairs in length and contained 13 protein-coding genes (PCGs), 22 tRNA genes, two rRNA genes, and 401 bp non-coding region. The T. imperialis genes were highly similar to those of sequenced mitogenomes of other lepidopteran species in the order and orientation. Twelve PCGs (ND2, ATP8, ND3, COII, ATP6, COIII, ND4, ND4L, CytB, ND1, ND5, and ND6) start with a typical ATN codon, only the COI gene starts with CGA codon. Eight PCGs (ND2, COI, ATP8, ATP6, COIII, ND5, ND6, and Cyt B) terminate in the common stop codon TAA, three PCGs (ND4L, ND3, and ND1) terminate in the stop codon TAG, and two PCGs (COII and ND4) terminate in a single T residue. The phylogenetic relationships were reconstructed with the concatenated sequences of the 13 PCGs of the mitochondrial genome, and phylogenetic results showed that Danaidae, Satyridae, Libytheidae, Nymphalidae, Acraeidae, Pieridae, Hesperiidae, Riodinidae, and Lycaenidae are monophyletic clades. PMID:26162054

  15. Integrated Genomic and Network-Based Analyses of Complex Diseases and Human Disease Network.

    Science.gov (United States)

    Al-Harazi, Olfat; Al Insaif, Sadiq; Al-Ajlan, Monirah A; Kaya, Namik; Dzimiri, Nduna; Colak, Dilek

    2016-06-20

    A disease phenotype generally reflects various pathobiological processes that interact in a complex network. The highly interconnected nature of the human protein interaction network (interactome) indicates that, at the molecular level, it is difficult to consider diseases as being independent of one another. Recently, genome-wide molecular measurements, data mining and bioinformatics approaches have provided the means to explore human diseases from a molecular basis. The exploration of diseases and a system of disease relationships based on the integration of genome-wide molecular data with the human interactome could offer a powerful perspective for understanding the molecular architecture of diseases. Recently, subnetwork markers have proven to be more robust and reliable than individual biomarker genes selected based on gene expression profiles alone, and achieve higher accuracy in disease classification. We have applied one of these methodologies to idiopathic dilated cardiomyopathy (IDCM) data that we have generated using a microarray and identified significant subnetworks associated with the disease. In this paper, we review the recent endeavours in this direction, and summarize the existing methodologies and computational tools for network-based analysis of complex diseases and molecular relationships among apparently different disorders and human disease network. We also discuss the future research trends and topics of this promising field. PMID:27318646

  16. Characterization of hemizygous deletions in Citrus using array-Comparative Genomic Hybridization and microsynteny comparisons with the poplar genome

    Directory of Open Access Journals (Sweden)

    Usach Antonio

    2008-08-01

    Full Text Available Abstract Background Many fruit-tree species, including relevant Citrus spp varieties exhibit a reproductive biology that impairs breeding and strongly constrains genetic improvements. In citrus, juvenility increases the generation time while sexual sterility, inbreeding depression and self-incompatibility prevent the production of homozygous cultivars. Genomic technology may provide citrus researchers with a new set of tools to address these various restrictions. In this work, we report a valuable genomics-based protocol for the structural analysis of deletion mutations on an heterozygous background. Results Two independent fast neutron mutants of self-incompatible clementine (Citrus clementina Hort. Ex Tan. cv. Clemenules were the subject of the study. Both mutants, named 39B3 and 39E7, were expected to carry DNA deletions in hemizygous dosage. Array-based Comparative Genomic Hybridization (array-CGH using a Citrus cDNA microarray allowed the identification of underrepresented genes in these two mutants. Subsequent comparison of citrus deleted genes with annotated plant genomes, especially poplar, made possible to predict the presence of a large deletion in 39B3 of about 700 kb and at least two deletions of approximately 100 and 500 kb in 39E7. The deletion in 39B3 was further characterized by PCR on available Citrus BACs, which helped us to build a partial physical map of the deletion. Among the deleted genes, ClpC-like gene coding for a putative subunit of a multifunctional chloroplastic protease involved in the regulation of chlorophyll b synthesis was directly related to the mutated phenotype since the mutant showed a reduced chlorophyll a/b ratio in green tissues. Conclusion In this work, we report the use of array-CGH for the successful identification of genes included in a hemizygous deletion induced by fast neutron irradiation on Citrus clementina. The study of gene content and order into the 39B3 deletion also led to the unexpected

  17. Genomic DNA restriction endonuclease from Pasteurella multocida isolated from Indonesia, katha strain and reference strains and analysed by PFGE

    Directory of Open Access Journals (Sweden)

    Supar

    2003-10-01

    Full Text Available Pasteurella multocida strains are the causative disease agents of wide range of domestic and wild animals in Indonesia. The most important serotypes are associated with Hemorrhagic septicaemic (HS diseases in cattle and buffaloes, cholera in ducks and chickens. The HS disease associated with P. multocia in large ruminants in Indonesia is controled by killed whole cell vaccines produced by the use of P. multocida Katha strains. There is no discriminatory data of the molecular biology technique has been applied to investigate P. multocida isolates from different geographic locations in Indonesia. The purpose of this studies were to observe the genetic diversity among P. multocida isolated from various geograpic locations and compared with Katha vaccine strain and other reference strains. A total samples of 38 isolates and strains of P. multocida were analysed by means of pulsed-field gel electrophoresis (PFGE. Each sample was grown in nutrient broth, cells were separeted by centrifugation. Whole cell pellet was mixed with agarose and then prepared agarose plugs. The genomic DNA of each sample was digested in situ (plug with either restriction endonuclease of ApaI and/or BamHI. The digested genomic DNA of each sample was analysed by PFGE, the genomic DNA restricted profile of each sample was compared with others. The use of ApaI restriction endonuclease digestion and analysed by PFGE, demonstrated that 34 out of 38 P. multocia samples could be differentiated into 16 ApaI types, whereas based on the BamHI digestion of these samples were differentiated into 20 BamHI types. Genomic DNA restriction pattern of Indonesian P. multocida isolates originated from cattle and buffaloes associated with haemorrhagic septicaemic diseases demonstrated different pattern to those of vaccine Katha strain, poultry strains as well as the reference strains currenly kept at Balitvet Culture Collection (BCC unit. Two P. multocida isolates derived from ducks with cholera

  18. The complete mitochondrial genome of Flustra foliacea (Ectoprocta, Cheilostomata - compositional bias affects phylogenetic analyses of lophotrochozoan relationships

    Directory of Open Access Journals (Sweden)

    Nesnidal Maximilian P

    2011-11-01

    Full Text Available Abstract Background The phylogenetic relationships of the lophophorate lineages, ectoprocts, brachiopods and phoronids, within Lophotrochozoa are still controversial. We sequenced an additional mitochondrial genome of the most species-rich lophophorate lineage, the ectoprocts. Although it is known that there are large differences in the nucleotide composition of mitochondrial sequences of different lineages as well as in the amino acid composition of the encoded proteins, this bias is often not considered in phylogenetic analyses. We applied several approaches for reducing compositional bias and saturation in the phylogenetic analyses of the mitochondrial sequences. Results The complete mitochondrial genome (16,089 bp of Flustra foliacea (Ectoprocta, Gymnolaemata, Cheilostomata was sequenced. All protein-encoding, rRNA and tRNA genes are transcribed from the same strand. Flustra shares long intergenic sequences with the cheilostomate ectoproct Bugula, which might be a synapomorphy of these taxa. Further synapomorphies might be the loss of the DHU arm of the tRNA L(UUR, the loss of the DHU arm of the tRNA S(UCN and the unique anticodon sequence GAG of the tRNA L(CUN. The gene order of the mitochondrial genome of Flustra differs strongly from that of the other known ectoprocts. Phylogenetic analyses of mitochondrial nucleotide and amino acid data sets show that the lophophorate lineages are more closely related to trochozoan phyla than to deuterostomes or ecdysozoans confirming the Lophotrochozoa hypothesis. Furthermore, they support the monophyly of Cheilostomata and Ectoprocta. However, the relationships of the lophophorate lineages within Lophotrochozoa differ strongly depending on the data set and the used method. Different approaches for reducing heterogeneity in nucleotide and amino acid data sets and saturation did not result in a more robust resolution of lophotrochozoan relationships. Conclusion The contradictory and usually weakly

  19. Genomic Analyses Reveal Demographic History and Temperate Adaptation of the Newly Discovered Honey Bee Subspecies Apis mellifera sinisxinyuan n. ssp.

    Science.gov (United States)

    Chen, Chao; Liu, Zhiguang; Pan, Qi; Chen, Xiao; Wang, Huihua; Guo, Haikun; Liu, Shidong; Lu, Hongfeng; Tian, Shilin; Li, Ruiqiang; Shi, Wei

    2016-05-01

    Studying the genetic signatures of climate-driven selection can produce insights into local adaptation and the potential impacts of climate change on populations. The honey bee (Apis mellifera) is an interesting species to study local adaptation because it originated in tropical/subtropical climatic regions and subsequently spread into temperate regions. However, little is known about the genetic basis of its adaptation to temperate climates. Here, we resequenced the whole genomes of ten individual bees from a newly discovered population in temperate China and downloaded resequenced data from 35 individuals from other populations. We found that the new population is an undescribed subspecies in the M-lineage of A. mellifera (Apis mellifera sinisxinyuan). Analyses of population history show that long-term global temperature has strongly influenced the demographic history of A. m. sinisxinyuan and its divergence from other subspecies. Further analyses comparing temperate and tropical populations identified several candidate genes related to fat body and the Hippo signaling pathway that are potentially involved in adaptation to temperate climates. Our results provide insights into the demographic history of the newly discovered A. m. sinisxinyuan, as well as the genetic basis of adaptation of A. mellifera to temperate climates at the genomic level. These findings will facilitate the selective breeding of A. mellifera to improve the survival of overwintering colonies. PMID:26823447

  20. Genome-wide linkage, exome sequencing and functional analyses identify ABCB6 as the pathogenic gene of dyschromatosis universalis hereditaria.

    Directory of Open Access Journals (Sweden)

    Hong Liu

    Full Text Available BACKGROUND: As a genetic disorder of abnormal pigmentation, the molecular basis of dyschromatosis universalis hereditaria (DUH had remained unclear until recently when ABCB6 was reported as a causative gene of DUH. METHODOLOGY: We performed genome-wide linkage scan using Illumina Human 660W-Quad BeadChip and exome sequencing analyses using Agilent SureSelect Human All Exon Kits in a multiplex Chinese DUH family to identify the pathogenic mutations and verified the candidate mutations using Sanger sequencing. Quantitative RT-PCR and Immunohistochemistry was performed to verify the expression of the pathogenic gene, Zebrafish was also used to confirm the functional role of ABCB6 in melanocytes and pigmentation. RESULTS: Genome-wide linkage (assuming autosomal dominant inheritance mode and exome sequencing analyses identified ABCB6 as the disease candidate gene by discovering a coding mutation (c.1358C>T; p.Ala453Val that co-segregates with the disease phenotype. Further mutation analysis of ABCB6 in four other DUH families and two sporadic cases by Sanger sequencing confirmed the mutation (c.1358C>T; p.Ala453Val and discovered a second, co-segregating coding mutation (c.964A>C; p.Ser322Lys in one of the four families. Both mutations were heterozygous in DUH patients and not present in the 1000 Genome Project and dbSNP database as well as 1,516 unrelated Chinese healthy controls. Expression analysis in human skin and mutagenesis interrogation in zebrafish confirmed the functional role of ABCB6 in melanocytes and pigmentation. Given the involvement of ABCB6 mutations in coloboma, we performed ophthalmological examination of the DUH carriers of ABCB6 mutations and found ocular abnormalities in them. CONCLUSION: Our study has advanced our understanding of DUH pathogenesis and revealed the shared pathological mechanism between pigmentary DUH and ocular coloboma.

  1. Clinical, polysomnographic and genome-wide association analyses of narcolepsy with cataplexy

    DEFF Research Database (Denmark)

    Luca, Gianina; Haba-Rubio, José; Dauvilliers, Yves;

    2013-01-01

    The aim of this study was to describe the clinical and PSG characteristics of narcolepsy with cataplexy and their genetic predisposition by using the retrospective patient database of the European Narcolepsy Network (EU-NN). We have analysed retrospective data of 1099 patients with narcolepsy dia...

  2. Insights from Genome-Wide Association Analyses of Nonalcoholic Fatty Liver Disease

    Science.gov (United States)

    Kahali, Bratati; Halligan, Brian; Speliotes, Elizabeth K.

    2016-01-01

    Nonalcoholic fatty liver disease (NAFLD) is caused by hepatic steatosis, which can progress to nonalcoholic steatohepatitis, fibrosis/cirrhosis, and hepatocellular carcinoma in the absence of excessive alcohol consumption. Nonalcoholic fatty liver disease will become the number one cause of liver disease worldwide by 2020. Nonalcoholic fatty liver disease is correlated albeit imperfectly with obesity and other metabolic diseases such as diabetes, hyperlipidemia, and cardiovascular disease, but exactly how having one of these diseases contributes to the development of other metabolic diseases is only now being elucidated. Development of NAFLD and related metabolic diseases is genetically influenced in the population, and recent genome-wide association studies (GWASs) have discovered genetic variants that associate with these diseases. These GWAS-associated variants cannot only help us to identify individuals at high risk of developing NAFLD, but also to better understand its pathophysiology so that we can develop more effective treatments for this disease and related metabolic diseases in the future. PMID:26676813

  3. Genome-wide identification and comprehensive analyses of the kinomes in four pathogenic microsporidia species.

    Directory of Open Access Journals (Sweden)

    Zhi Li

    Full Text Available Microsporidia have attracted considerable attention because they infect a wide range of hosts, from invertebrates to vertebrates, and cause serious human diseases and major economic losses in the livestock industry. There are no prospective drugs to counteract this pathogen. Eukaryotic protein kinases (ePKs play a central role in regulating many essential cellular processes and are therefore potential drug targets. In this study, a comprehensive summary and comparative analysis of the protein kinases in four microsporidia—Enterocytozoon bieneusi, Encephalitozoon cuniculi, Nosema bombycis and Nosema ceranae—was performed. The results show that there are 34 ePKs and 4 atypical protein kinases (aPKs in E. bieneusi, 29 ePKs and 6 aPKs in E. cuniculi, 41 ePKs and 5 aPKs in N. bombycis, and 27 ePKs and 4 aPKs in N. ceranae. These data support the previous conclusion that the microsporidian kinome is the smallest eukaryotic kinome. Microsporidian kinomes contain only serine-threonine kinases and do not contain receptor-like and tyrosine kinases. Many of the kinases related to nutrient and energy signaling and the stress response have been lost in microsporidian kinomes. However, cell cycle-, development- and growth-related kinases, which are important to parasites, are well conserved. This reduction of the microsporidian kinome is in good agreement with genome compaction, but kinome density is negatively correlated with proteome size. Furthermore, the protein kinases in each microsporidian genome are under strong purifying selection pressure. No remarkable differences in kinase family classification, domain features, gain and/or loss, and selective pressure were observed in these four species. Although microsporidia adapt to different host types, the coevolution of microsporidia and their hosts was not clearly reflected in the protein kinases. Overall, this study enriches and updates the microsporidian protein kinase database and may provide

  4. Genome-wide association analyses identify SPOCK as a key novel gene underlying age at menarche.

    Directory of Open Access Journals (Sweden)

    Yao-Zhong Liu

    2009-03-01

    Full Text Available For females, menarche is a most significant physiological event. Age at menarche (AAM is a trait with high genetic determination and is associated with major complex diseases in women. However, specific genes for AAM variation are largely unknown. To identify genetic factors underlying AAM variation, a genome-wide association study (GWAS examining about 380,000 SNPs was conducted in 477 Caucasian women. A follow-up replication study was performed to validate our major GWAS findings using two independent Caucasian cohorts with 854 siblings and 762 unrelated subjects, respectively, and one Chinese cohort of 1,387 unrelated subjects--all females. Our GWAS identified a novel gene, SPOCK (Sparc/Osteonectin, CWCV, and Kazal-like domains proteoglycan, which had seven SNPs associated with AAM with genome-wide false discovery rate (FDR q<0.05. Six most significant SNPs of the gene were selected for validation in three independent replication cohorts. All of the six SNPs were replicated in at least one cohort. In particular, SNPs rs13357391 and rs1859345 were replicated both within and across different ethnic groups in all three cohorts, with p values of 5.09 x 10(-3 and 4.37 x 10(-3, respectively, in the Chinese cohort and combined p values (obtained by Fisher's method of 5.19 x 10(-5 and 1.02 x 10(-4, respectively, in all three replication cohorts. Interestingly, SPOCK can inhibit activation of MMP-2 (matrix metalloproteinase-2, a key factor promoting endometrial menstrual breakdown and onset of menstrual bleeding. Our findings, together with the functional relevance, strongly supported that the SPOCK gene underlies variation of AAM.

  5. Complete mitochondrial genome DNA sequence for two ophiuroids and a holothuroid: the utility of protein gene sequence and gene maps in the analyses of deep deuterostome phylogeny.

    Science.gov (United States)

    Scouras, Andrea; Beckenbach, Karen; Arndt, Allan; Smith, Michael J

    2004-04-01

    The complete mitochondrial genome sequences have been determined for the holothuroid Cucumaria miniata and two ophiuroid species Ophiopholis aculeata and Ophiura lütkeni. In addition, the nucleotide sequence of the mitochondrial protein-coding genes for the asteroid Pisaster ochraceus has been completed. Maximum-likelihood and LogDet distance analyses of concatenated protein-coding sequences produced a series of trees that did not conclusively support generally accepted models of echinoderm phylogeny. The ophiuroid data consistently demonstrated accelerated nucleotide divergence rates and lack of stationarity. This confounds the phylogenetic analyses. Molecular investigations using individual protein-coding gene alignments demonstrated that the cytochrome b gene exhibits the least deviation in rate and stationarity and generated some trees consistent with proposed echinoderm phylogenies. Phylogenies based on echinoderm mitochondrial gene rearrangements also proved problematic because of extensive variation in gene order between and within classes. A comparison of the two distinctive ophiuroid mitochondrial gene orders supports the hypothesis that O. lütkeni has a more derived mitochondrial gene order versus O. aculeata. The variation in the echinoderm mitochondrial gene maps reinforces the limitations of the application of mitochondrial gene rearrangements as a global phylogenetic tool. PMID:15019608

  6. Genome comparison of the epiphytic bacteria Erwinia billingiae and E. tasmaniensis with the pear pathogen E. pyrifoliae

    Directory of Open Access Journals (Sweden)

    Kuhl Heiner

    2010-06-01

    Full Text Available Abstract Background The genus Erwinia includes plant-associated pathogenic and non-pathogenic Enterobacteria. Important pathogens such as Erwinia amylovora, the causative agent of fire blight and E. pyrifoliae causing bacterial shoot blight of pear in Asia belong to this genus. The species E. tasmaniensis and E. billingiae are epiphytic bacteria and may represent antagonists for biocontrol of fire blight. The presence of genes that are putatively involved in virulence in E. amylovora and E. pyrifoliae is of special interest for these species in consequence. Results Here we provide the complete genome sequences of the pathogenic E. pyrifoliae strain Ep1/96 with a size of 4.1 Mb and of the non-pathogenic species E. billingiae strain Eb661 with a size of 5.4 Mb, de novo determined by conventional Sanger sequencing and next generation sequencing techniques. Genome comparison reveals large inversions resulting from homologous recombination events. Furthermore, comparison of deduced proteins highlights a relation of E. billingiae strain Eb661 to E. tasmaniensis strain Et1/99 and a distance to E. pyrifoliae for the overall gene content as well as for the presence of encoded proteins representing virulence factors for the pathogenic species. Pathogenicity of E. pyrifoliae is supposed to have evolved by accumulation of potential virulence factors. E. pyrifoliae carries factors for type III secretion and cell invasion. Other genes described as virulence factors for E. amylovora are involved in the production of exopolysaccharides, the utilization of plant metabolites such as sorbitol and sucrose. Some virulence-associated genes of the pathogenic species are present in E. tasmaniensis but mostly absent in E. billingiae. Conclusion The data of the genome analyses correspond to the pathogenic lifestyle of E. pyrifoliae and underlines the epiphytic localization of E. tasmaniensis and E. billingiae as a saprophyte.

  7. Complete mitochondrial genome sequences of three Crocodylus species and their comparison within the Order Crocodylia.

    Science.gov (United States)

    Meganathan, P R; Dubey, Bhawna; Batzer, Mark A; Ray, David A; Haque, Ikramul

    2011-06-01

    Crocodylus is the largest genus within the Order Crocodylia consisting of eleven species. This paper reports the complete mitochondrial genome sequences of three Crocodylus species, Crocodylus moreletii, Crocodylus johnstoni and Crocodylus palustris, and compares the newly obtained mitochondrial DNA sequences with other crocodilians, available in the public databases. The mitochondrial genomes of C. moreletii, C. johnstoni and C. palustris are 16,827 bp, 16,851 bp and 16,852 bp in length, respectively. These mitochondrial genomes consist of 13 protein coding genes, two ribosomal RNA genes, 22 transfer RNA genes and a non-coding region. The mitochondrial genomes of all the Crocodylus species, studied herein show identical characteristics in terms of nucleotide composition and codon usage, suggestive of the existence of analogous evolutionary patterns within the genus, Crocodylus. The synonymous and non-synonymous substitution rates for all the protein coding genes of Crocodylus were observed in between 0.001 and 0.275 which reveal the prevalence of purifying selection in these genes. The phylogenetic analyses based on complete mitochondrial DNA data substantiate the previously established crocodilian phylogeny. This study provides a better understanding of the crocodilian mitochondrial genome and the data described herein will prove useful for future studies concerning crocodilian mitochondrial genome evolution. PMID:21310220

  8. A Comparison of the First Two Sequenced Chloroplast Genomes in Asteraceae: Lettuce and Sunflower

    Energy Technology Data Exchange (ETDEWEB)

    Timme, Ruth E.; Kuehl, Jennifer V.; Boore, Jeffrey L.; Jansen, Robert K.

    2006-01-20

    Asteraceae is the second largest family of plants, with over 20,000 species. For the past few decades, numerous phylogenetic studies have contributed to our understanding of the evolutionary relationships within this family, including comparisons of the fast evolving chloroplast gene, ndhF, rbcL, as well as non-coding DNA from the trnL intron plus the trnLtrnF intergenic spacer, matK, and, with lesser resolution, psbA-trnH. This culminated in a study by Panero and Funk in 2002 that used over 13,000 bp per taxon for the largest taxonomic revision of Asteraceae in over a hundred years. Still, some uncertainties remain, and it would be very useful to have more information on the relative rates of sequence evolution among various genes and on genome structure as a potential set of phylogenetic characters to help guide future phylogenetic structures. By way of contributing to this, we report the first two complete chloroplast genome sequences from members of the Asteraceae, those of Helianthus annuus and Lactuca sativa. These plants belong to two distantly related subfamilies, Asteroideae and Cichorioideae, respectively. In addition to these, there is only one other published chloroplast genome sequence for any plant within the larger group called Eusterids II, that of Panax ginseng (Araliaceae, 156,318 bps, AY582139). Early chloroplast genome mapping studies demonstrated that H. annuus and L. sativa share a 22 kb inversion relative to members of the subfamily Barnadesioideae. By comparison to outgroups, this inversion was shown to be derived, indicating that the Asteroideae and Cichorioideae are more closely related than either is to the Barnadesioideae. Later sequencing study found that taxa that share this 22 kb inversion also contain within this region a second, smaller, 3.3 kb inversion. These sequences also enable an analysis of patterns of shared repeats in the genomes at fine level and of RNA editing by comparison to available EST sequences. In addition, since

  9. Microarray analyses and comparisons of upper or lower flanks of rice shoot base preceding gravitropic bending.

    Directory of Open Access Journals (Sweden)

    Liwei Hu

    Full Text Available Gravitropism is a complex process involving a series of physiological pathways. Despite ongoing research, gravitropism sensing and response mechanisms are not well understood. To identify the key transcripts and corresponding pathways in gravitropism, a whole-genome microarray approach was used to analyze transcript abundance in the shoot base of rice (Oryza sativa sp. japonica at 0.5 h and 6 h after gravistimulation by horizontal reorientation. Between upper and lower flanks of the shoot base, 167 transcripts at 0.5 h and 1202 transcripts at 6 h were discovered to be significantly different in abundance by 2-fold. Among these transcripts, 48 were found to be changed both at 0.5 h and 6 h, while 119 transcripts were only changed at 0.5 h and 1154 transcripts were changed at 6 h in association with gravitropism. MapMan and PageMan analyses were used to identify transcripts significantly changed in abundance. The asymmetric regulation of transcripts related to phytohormones, signaling, RNA transcription, metabolism and cell wall-related categories between upper and lower flanks were demonstrated. Potential roles of the identified transcripts in gravitropism are discussed. Our results suggest that the induction of asymmetrical transcription, likely as a consequence of gravitropic reorientation, precedes gravitropic bending in the rice shoot base.

  10. Beyond Linear Sequence Comparisons: The use of genome-levelcharacters for phylogenetic reconstruction

    Energy Technology Data Exchange (ETDEWEB)

    Boore, Jeffrey L.

    2004-11-27

    Although the phylogenetic relationships of many organisms have been convincingly resolved by the comparisons of nucleotide or amino acid sequences, others have remained equivocal despite great effort. Now that large-scale genome sequencing projects are sampling many lineages, it is becoming feasible to compare large data sets of genome-level features and to develop this as a tool for phylogenetic reconstruction that has advantages over conventional sequence comparisons. Although it is unlikely that these will address a large number of evolutionary branch points across the broad tree of life due to the infeasibility of such sampling, they have great potential for convincingly resolving many critical, contested relationships for which no other data seems promising. However, it is important that we recognize potential pitfalls, establish reasonable standards for acceptance, and employ rigorous methodology to guard against a return to earlier days of scenario-driven evolutionary reconstructions.

  11. Developing fair tests for mathematics curriculum comparison studies: the role of content analyses

    Science.gov (United States)

    Chávez, Óscar; Papick, Ira; Ross, Daniel J.; Grouws, Douglas A.

    2011-12-01

    This article describes the process of development of assessment instruments for a three-year longitudinal comparative study that focused on evaluating American high school students' mathematics learning from two distinct approaches to content organization: curriculum built around a sequence of three full-year courses (Algebra 1, Geometry, and Algebra 2) and a sequence of integrated mathematics courses (algebra and geometry content, together with functions, data analysis, and discrete mathematics is integrated each year). The study was conducted in six school districts in five states involving over 4,000 students from schools that were using both curricular approaches but with different groups of students. In order to develop assessment instruments that were not biased towards either of the two curriculum programs (Fair Tests), an iterative process of content analyses, identification of common topics, internal and external reviews, pilot tests, and revisions was followed, resulting in five tests that were used in the three years of the study. Results indicate that these tests have solid discrimination properties and address adequately mathematics content common to both secondary curriculum programs. The corresponding scoring rubrics are highly reliable, with interrater reliability above 94% for all tests. Mathematics education researchers involved in curriculum comparison studies need to conduct content analyses of the curriculum materials under study in order to identify salient relationships between curriculum programs and student outcomes.

  12. On the Similarity of Sets of Permutations and its Applications to Genome Comparison

    OpenAIRE

    Bergeron, Anne; Stoye, Jens

    2003-01-01

    The comparison of genomes with the same gene content relies on our ability to compare permutations, either by measuring how much they differ, or by measuring how much they are alike. With the notable exception of the breakpoint distance, which is based on the concept of conserved adjacencies, measures of distance do not generalize easily to sets of more than two permutations. In this paper, we present a basic unifying notion, conserved intervals, as a powerful generalization of adjacencies, a...

  13. Genomic and transcriptomic analyses of the tangerine pathotype of Alternaria alternata in response to oxidative stress

    Science.gov (United States)

    Wang, Mingshuang; Sun, Xuepeng; Yu, Dongliang; Xu, Jianping; Chung, Kuangren; Li, Hongye

    2016-01-01

    The tangerine pathotype of Alternaria alternata produces the A. citri toxin (ACT) and is the causal agent of citrus brown spot that results in significant yield losses worldwide. Both the production of ACT and the ability to detoxify reactive oxygen species (ROS) are required for A. alternata pathogenicity in citrus. In this study, we report the 34.41 Mb genome sequence of strain Z7 of the tangerine pathotype of A. alternata. The host selective ACT gene cluster in strain Z7 was identified, which included 25 genes with 19 of them not reported previously. Of these, 10 genes were present only in the tangerine pathotype, representing the most likely candidate genes for this pathotype specialization. A transcriptome analysis of the global effects of H2O2 on gene expression revealed 1108 up-regulated and 498 down-regulated genes. Expressions of those genes encoding catalase, peroxiredoxin, thioredoxin and glutathione were highly induced. Genes encoding several protein families including kinases, transcription factors, transporters, cytochrome P450, ubiquitin and heat shock proteins were found associated with adaptation to oxidative stress. Our data not only revealed the molecular basis of ACT biosynthesis but also provided new insights into the potential pathways that the phytopathogen A. alternata copes with oxidative stress. PMID:27582273

  14. Genomic and transcriptomic analyses of the tangerine pathotype of Alternaria alternata in response to oxidative stress.

    Science.gov (United States)

    Wang, Mingshuang; Sun, Xuepeng; Yu, Dongliang; Xu, Jianping; Chung, Kuangren; Li, Hongye

    2016-01-01

    The tangerine pathotype of Alternaria alternata produces the A. citri toxin (ACT) and is the causal agent of citrus brown spot that results in significant yield losses worldwide. Both the production of ACT and the ability to detoxify reactive oxygen species (ROS) are required for A. alternata pathogenicity in citrus. In this study, we report the 34.41 Mb genome sequence of strain Z7 of the tangerine pathotype of A. alternata. The host selective ACT gene cluster in strain Z7 was identified, which included 25 genes with 19 of them not reported previously. Of these, 10 genes were present only in the tangerine pathotype, representing the most likely candidate genes for this pathotype specialization. A transcriptome analysis of the global effects of H2O2 on gene expression revealed 1108 up-regulated and 498 down-regulated genes. Expressions of those genes encoding catalase, peroxiredoxin, thioredoxin and glutathione were highly induced. Genes encoding several protein families including kinases, transcription factors, transporters, cytochrome P450, ubiquitin and heat shock proteins were found associated with adaptation to oxidative stress. Our data not only revealed the molecular basis of ACT biosynthesis but also provided new insights into the potential pathways that the phytopathogen A. alternata copes with oxidative stress. PMID:27582273

  15. Wavelet-based spatial comparison technique for analysing and evaluating two-dimensional geophysical model fields

    Directory of Open Access Journals (Sweden)

    S. Saux Picart

    2011-11-01

    Full Text Available Complex numerical models of the Earth's environment, based around 3-D or 4-D time and space domains are routinely used for applications including climate predictions, weather forecasts, fishery management and environmental impact assessments. Quantitatively assessing the ability of these models to accurately reproduce geographical patterns at a range of spatial and temporal scales has always been a difficult problem to address. However, this is crucial if we are to rely on these models for decision making. Satellite data are potentially the only observational dataset able to cover the large spatial domains analysed by many types of geophysical models. Consequently optical wavelength satellite data is beginning to be used to evaluate model hindcast fields of terrestrial and marine environments. However, these satellite data invariably contain regions of occluded or missing data due to clouds, further complicating or impacting on any comparisons with the model. A methodology has recently been developed to evaluate precipitation forecasts using radar observations. It allows model skill to be evaluated at a range of spatial scales and rain intensities. Here we extend the original method to allow its generic application to a range of continuous and discontinuous geophysical data fields, and therefore allowing its use with optical satellite data. This is achieved through two major improvements to the original method: (i all thresholds are determined based on the statistical distribution of the input data, so no a priori knowledge about the model fields being analysed is required and (ii occluded data can be analysed without impacting on the metric results. The method can be used to assess a model's ability to simulate geographical patterns over a range of spatial scales. We illustrate how the method provides a compact and concise way of visualising the degree of agreement between spatial features in two datasets. The application of the new method, its

  16. Mitochondrial Genome Analysis of Wild Rice (Oryza minuta) and Its Comparison with Other Related Species

    Science.gov (United States)

    Asaf, Sajjad; Khan, Abdul Latif; Khan, Abdur Rahim; Waqas, Muhammad; Kang, Sang-Mo; Khan, Muhammad Aaqil; Shahzad, Raheem; Seo, Chang-Woo; Shin, Jae-Ho; Lee, In-Jung

    2016-01-01

    Oryza minuta (Poaceae family) is a tetraploid wild relative of cultivated rice with a BBCC genome. O. minuta has the potential to resist against various pathogenic diseases such as bacterial blight (BB), white backed planthopper (WBPH) and brown plant hopper (BPH). Here, we sequenced and annotated the complete mitochondrial genome of O. minuta. The mtDNA genome is 515,022 bp, containing 60 protein coding genes, 31 tRNA genes and two rRNA genes. The mitochondrial genome organization and the gene content at the nucleotide level are highly similar (89%) to that of O. rufipogon. Comparison with other related species revealed that most of the genes with known function are conserved among the Poaceae members. Similarly, O. minuta mt genome shared 24 protein-coding genes, 15 tRNA genes and 1 ribosomal RNA gene with other rice species (indica and japonica). The evolutionary relationship and phylogenetic analysis revealed that O. minuta is more closely related to O. rufipogon than to any other related species. Such studies are essential to understand the evolutionary divergence among species and analyze common gene pools to combat risks in the current scenario of a changing environment. PMID:27045847

  17. Genome-wide identification and expression analyses of cytochrome P450 genes in mulberry (Morus notabilis)

    Institute of Scientific and Technical Information of China (English)

    Bi Ma; Yiwei Luo; Ling Jia; Xiwu Qi; Qiwei Zeng; Zhonghuai Xiang; Ningjia He

    2014-01-01

    Cytochrome P450s play critical roles in the biosyn-thesis of physiological y important compounds in plants. These compounds often act as defense toxins to prevent herbivory. In the present study, a total of 174 P450 genes of mulberry (Morus notabilis C.K.Schn) were identified based on bioinfor-matics analyses. These mulberry P450 genes were divided into nine clans and 47 families and were found to be expressed in a tissue-preferential manner. These genes were compared to the P450 genes in Arabidopsis thaliana. Families CYP80, CYP92, CYP728, CYP733, CYP736, and CYP749 were found to exist in mulberry, and they may play important roles in the biosynthesis of mulberry secondary metabolites. Analyses of the functional and metabolic pathways of these genes indicated that mulberry P450 genes may participate in the metabolism of lipids, other secondary metabolites, xenobiotics, amino acids, cofactors, vitamins, terpenoids, and polyketides. These results provide a foundation for understanding of the structures and biological functions of mulberry P450 genes.

  18. Integrative genomic analyses of a novel cytokine, interleukin-34 and its potential role in cancer prediction.

    Science.gov (United States)

    Wang, Bo; Xu, Wenming; Tan, Miaolian; Xiao, Yan; Yang, Haiwei; Xia, Tian-Song

    2015-01-01

    Interleukin-34 (IL-34) is a novel cytokine, which is composed of 222 amino acids and forms homodimers. It binds to the macrophage colony-stimulating factor (M-CSF) receptor and plays an important role in innate immunity and inflammatory processes. In the present study, we identified the completed IL-34 gene in 25 various mammalian genomes and found that IL-34 existed in all types of vertebrates, including fish, amphibians, birds and mammals. These species have a similar 7 exon/6 intron gene organization. The phylogenetic tree indicated that the IL-34 gene from the primate lineage, rodent lineage and teleost lineage form a species-specific cluster. It was found mammalian that IL-34 was under positive selection pressure with the identified positively selected site, 196Val. Fifty-five functionally relevant single nucleotide polymorphisms (SNPs), including 32 SNPs causing missense mutations, 3 exonic splicing enhancer SNPs and 20 SNPs causing nonsense mutations were identified from 2,141 available SNPs in the human IL-34 gene. IL-34 was expressed in various types of cancer, including blood, brain, breast, colorectal, eye, head and neck, lung, ovarian and skin cancer. A total of 5 out of 40 tests (1 blood cancer, 1 brain cancer, 1 colorectal cancer and 2 lung cancer) revealed an association between IL-34 gene expression and cancer prognosis. It was found that the association between the expression of IL-34 and cancer prognosis varied in different types of cancer, even in the same types of cancer from different databases. This suggests that the function of IL-34 in these tumors may be multidimensional. The upstream transcription factor 1 (USF1), regulatory factor X-1 (RFX1), the Sp1 transcription factor 1 , POU class 3 homeobox 2 (POU3F2) and forkhead box L1 (FOXL1) regulatory transcription factor binding sites were identified in the IL-34 gene upstream (promoter) region, which may be involved in the effects of IL-34 in tumors. PMID:25395235

  19. Comparative analyses of Campylobacter concisusstrains reveal the genome of the reference strain BAA-1457 is not representative of the species

    Directory of Open Access Journals (Sweden)

    Kaakoush Nadeem O

    2011-10-01

    Full Text Available Abstract Background Several studies have shown that significant genotypic heterogeneity exists among Campylobacter concisus strains. Recently, the genome of C. concisus UNSWCD, isolated from a patient with Crohn's disease, was sequenced. Results In this study, comparative analyses were performed between strain UNSWCD and BAA-1457, isolated from a patient with acute gastroenteritis. Searches between C. concisus UNSWCD and BAA-1457 showed that 76% of genes were homologues, whereas those between C. jejuni strains showed 90-91% to be homologues, indicating substantial variation exists within these two C. concisus genomes. More specific bidirectional homology searches identified 1593 genes that are shared between these strains, and 115 and 281 genes unique to UNSWCD and BAA-1457, respectively. Significantly, differences in the type of flagellin glycosylation pathways between the two strains were identified and confirmed by PCR. The protein profiles of UNSWCD, BAA-1457 and a further six strains of C. concisus were compared and analyzed bioinformatically, and this differentiated the strains into four clades. BAA-1457 was found to be highly divergent (average similarity: 56.8% from the other seven strains (mean average similarity ± standard deviation: 64.7 ± 1.7%. Furthermore, searches for homologues of the 1593 proteins found to be common between UNSWCD and BAA-1457 were conducted against all available bacterial genomes, and 18 proteins were found to be unique to C. concisus, of which 6 were predicted to be secreted, and may represent good markers for detection of this species. Conclusions This study has elucidated several features that may be responsible for the heterogeneity that exists among C. concisus strains, and has determined that the strain BAA-1457 is genetically atypical to other C. concisus strains and is not a good candidate reference strain.

  20. Genomic and transcriptomic analyses of the medicinal fungus Antrodia cinnamomea for its metabolite biosynthesis and sexual development.

    Science.gov (United States)

    Lu, Mei-Yeh Jade; Fan, Wen-Lang; Wang, Woei-Fuh; Chen, Tingchun; Tang, Yi-Ching; Chu, Fang-Hua; Chang, Tun-Tschu; Wang, Sheng-Yang; Li, Meng-yun; Chen, Yi-Hua; Lin, Ze-Shiang; Yang, Kai-Jung; Chen, Shih-May; Teng, Yu-Chuan; Lin, Yan-Liang; Shaw, Jei-Fu; Wang, Ting-Fang; Li, Wen-Hsiung

    2014-11-01

    Antrodia cinnamomea, a polyporus mushroom of Taiwan, has long been used as a remedy for cancer, hypertension, and hangover, with an annual market of over $100 million (US) in Taiwan. We obtained a 32.15-Mb genome draft containing 9,254 genes. Genome ontology enrichment and pathway analyses shed light on sexual development and the biosynthesis of sesquiterpenoids, triterpenoids, ergostanes, antroquinonol, and antrocamphin. We identified genes differentially expressed between mycelium and fruiting body and 242 proteins in the mevalonate pathway, terpenoid pathways, cytochrome P450s, and polyketide synthases, which may contribute to the production of medicinal secondary metabolites. Genes of secondary metabolite biosynthetic pathways showed expression enrichment for tissue-specific compounds, including 14-α-demethylase (CYP51F1) in fruiting body for converting lanostane to ergostane triterpenoids, coenzymes Q (COQ) for antroquinonol biosynthesis in mycelium, and polyketide synthase for antrocamphin biosynthesis in fruiting body. Our data will be useful for developing a strategy to increase the production of useful metabolites. PMID:25336756

  1. Genomic analyses of metal resistance genes in three plant growth promoting bacteria of legume plants in Northwest mine tailings, China

    Institute of Scientific and Technical Information of China (English)

    Pin Xie; Xiuli Hao; Martin Herzberg; Yantao Luo; Dietrich H.Nies; Gehong Wei

    2015-01-01

    To better understand the diversity of metal resistance genetic determinant from microbes that survived at metal tailings in northwest of China,a highly elevated level of heavy metal containing region,genomic analyses was conducted using genome sequence of three native metal-resistant plant growth promoting bacteria (PGPB).It shows that:Mesorhizobium amorphae CCNWGS0123 contains metal ~nsporters from P-type ATPase,CDF (Cation Diffusion Facilitator),HupE/UreJ and CHR (chromate ion transporter) family involved in copper,zinc,nickel as well as chromate resistance and homeostasis.Meanwhile,the putative CopA/CueO system is expected to mediate copper resistance in Sinorhizobium meliloti CCNWSX0020 while ZntA transporter,assisted with putative CzcD,determines zinc tolerance in Agrobacterium tumefaciens CCNWGS0286.The greenhouse experiment provides the consistent evidence of the plant growth promoting effects of these microbes on their hosts by nitrogen fixation and/or indoleacetic acid (IAA) secretion,indicating a potential in-site phytoremediation usage in the mining tailing regions of China.

  2. Whipworm genome and dual-species transcriptome analyses provide molecular insights into an intimate host-parasite interaction

    Science.gov (United States)

    Nichol, Sarah; Tracey, Alan; Holroyd, Nancy; Cotton, James A.; Stanley, Eleanor J.; Zarowiecki, Magdalena; Liu, Jimmy Z.; Huckvale, Thomas; Cooper, Philip J.; Grencis, Richard K.; Berriman, Matthew

    2014-01-01

    Whipworms are common soil-transmitted helminths that cause debilitating chronic infections in man. These nematodes are only distantly related to Caenorhabditis elegans and have evolved to occupy an unusual niche, tunneling through epithelial cells of the large intestine. Here we present the genome sequences of the human-infective Trichuris trichiura and the murine laboratory model T. muris. Based on whole transcriptome analyses we identify many genes that are expressed in a gender- or life stage-specific manner and characterise the transcriptional landscape of a morphological region with unique biological adaptations, namely bacillary band and stichosome, found only in whipworms and related parasites. Using RNAseq data from whipworm-infected mice we describe the regulated Th1-like immune response of the chronically infected cecum in unprecedented detail. In silico screening identifies numerous potential new drug targets against trichuriasis. Together, these genomes and associated functional data elucidate key aspects of the molecular host-parasite interactions that define chronic whipworm infection. PMID:24929830

  3. Comparison between heavy metal concentrations in sediments analysed by two methods: Analyses on detection limits and data quality

    International Nuclear Information System (INIS)

    Chemical components in geological samples are often measured by more than one method to be able to measure high concentrations in resistant minerals as well as low concentrations in other mineral phases. For quality control, it is necessary to evaluate if the results from these methods are consistent or not. A total of 1884 sediment samples taken from the Leinster area of Ireland were analysed by both instrumental neutron activation analysis (INAA) and atomic absorption spectrometry (AAS) for 5 chemical elements (Co, Cr, Fe, Ni, and Zn). It is well known that INAA detects the total concentrations of chemical elements, but it has much higher detection limits than AAS. In this study, descriptive statistics, scatter plots, statistical tests and GIS mapping techniques were employed to compare the differences between the results from the two analytical methods, and to evaluate the impact of detection limits on data quality. Descriptive statistics and scatter plots were effective in showing the discrepancies between values of the two methods. Scatter plots showed outliers and the poor accuracy of INAA for concentrations close to or below the detection limits. It is suggested that errors may have happened during both INAA and AAS processes for some samples. Statistical results of both nonparametric sign and Wilcoxon signed rank tests showed that the differences between concentrations measured with the two methods were significant at the level of p < 0.001, and the average differences were 20-30% for the 5 elements. Meanwhile, more than 25% of all samples showed differences greater than 40-60% for these elements. After removing values lower than 3 times detection limits of both methods, only a slight improvement was achieved for the consistency between the two methods. Besides detection limits, the incomplete digestion prior to AAS may have played an important role. The higher AAS values of Ni than its INAA values may have been caused by multiple detection limits in

  4. Hybridization Capture Using RAD Probes (hyRAD, a New Tool for Performing Genomic Analyses on Collection Specimens.

    Directory of Open Access Journals (Sweden)

    Tomasz Suchan

    Full Text Available In the recent years, many protocols aimed at reproducibly sequencing reduced-genome subsets in non-model organisms have been published. Among them, RAD-sequencing is one of the most widely used. It relies on digesting DNA with specific restriction enzymes and performing size selection on the resulting fragments. Despite its acknowledged utility, this method is of limited use with degraded DNA samples, such as those isolated from museum specimens, as these samples are less likely to harbor fragments long enough to comprise two restriction sites making possible ligation of the adapter sequences (in the case of double-digest RAD or performing size selection of the resulting fragments (in the case of single-digest RAD. Here, we address these limitations by presenting a novel method called hybridization RAD (hyRAD. In this approach, biotinylated RAD fragments, covering a random fraction of the genome, are used as baits for capturing homologous fragments from genomic shotgun sequencing libraries. This simple and cost-effective approach allows sequencing of orthologous loci even from highly degraded DNA samples, opening new avenues of research in the field of museum genomics. Not relying on the restriction site presence, it improves among-sample loci coverage. In a trial study, hyRAD allowed us to obtain a large set of orthologous loci from fresh and museum samples from a non-model butterfly species, with a high proportion of single nucleotide polymorphisms present in all eight analyzed specimens, including 58-year-old museum samples. The utility of the method was further validated using 49 museum and fresh samples of a Palearctic grasshopper species for which the spatial genetic structure was previously assessed using mtDNA amplicons. The application of the method is eventually discussed in a wider context. As it does not rely on the restriction site presence, it is therefore not sensitive to among-sample loci polymorphisms in the restriction sites

  5. Wavelet-based spatial comparison technique for analysing and evaluating two-dimensional geophysical model fields

    Directory of Open Access Journals (Sweden)

    S. Saux Picart

    2012-02-01

    Full Text Available Complex numerical models of the Earth's environment, based around 3-D or 4-D time and space domains are routinely used for applications including climate predictions, weather forecasts, fishery management and environmental impact assessments. Quantitatively assessing the ability of these models to accurately reproduce geographical patterns at a range of spatial and temporal scales has always been a difficult problem to address. However, this is crucial if we are to rely on these models for decision making. Satellite data are potentially the only observational dataset able to cover the large spatial domains analysed by many types of geophysical models. Consequently optical wavelength satellite data is beginning to be used to evaluate model hindcast fields of terrestrial and marine environments. However, these satellite data invariably contain regions of occluded or missing data due to clouds, further complicating or impacting on any comparisons with the model. This work builds on a published methodology, that evaluates precipitation forecast using radar observations based on predefined absolute thresholds. It allows model skill to be evaluated at a range of spatial scales and rain intensities. Here we extend the original method to allow its generic application to a range of continuous and discontinuous geophysical data fields, and therefore allowing its use with optical satellite data. This is achieved through two major improvements to the original method: (i all thresholds are determined based on the statistical distribution of the input data, so no a priori knowledge about the model fields being analysed is required and (ii occluded data can be analysed without impacting on the metric results. The method can be used to assess a model's ability to simulate geographical patterns over a range of spatial scales. We illustrate how the method provides a compact and concise way of visualising the degree of agreement between spatial features in two

  6. Comparison of COMPARE and BEACON subcompartment analyses of Battelle-Frankfurt containment tests

    International Nuclear Information System (INIS)

    This report presents the results of computations performed with the COMPARE/MOD1 and BEACON/MOD3 computer codes for selected Battelle-Frankfurt loss-of-coolant accident experiments. COMPARE is used widely to perform nuclear power plant containment subcompartment analyses, and BEACON is an advanced multiphase, multidimensional best-estimate code. The objective of this study was to evaluate the margins of COMPARE calculations by comparing them with BEACON calculations and test data. The calculations were performed for the Battelle-Frankfurt D3, D6, and C9 tests. Descriptions of the two codes and the Battelle-Frankfurt experiments are included. Comparisons of the codes' calculations and experimental data for absolute pressure, differential pressure, and temperature are presented for margin evaluation. Evaluations of the sensitivity of BEACON calculations to variations in model noding, form loss, and vent area modeling are prsesented. Conclusions summarizing the results of the COMPARE margin evaluation and BEACON sensitivity studies are given as well

  7. Comparison of variations detection between whole-genome amplification methods used in single-cell resequencing

    DEFF Research Database (Denmark)

    Hou, Yong; Wu, Kui; Shi, Xulian;

    2015-01-01

    -oligonucleotide-primed PCR (DOP-PCR) and multiple annealing and looping-based amplification cycles (MALBAC). However, a comprehensive comparison of variations detection performance between these WGA methods has not yet been performed. RESULTS: We systematically compared the advantages and disadvantages of different WGA...... methods, focusing particularly on variations detection. Low-coverage whole-genome sequencing revealed that DOP-PCR had the highest duplication ratio, but an even read distribution and the best reproducibility and accuracy for detection of copy-number variations (CNVs). However, MDA had significantly......BACKGROUND: Single-cell resequencing (SCRS) provides many biomedical advances in variations detection at the single-cell level, but it currently relies on whole genome amplification (WGA). Three methods are commonly used for WGA: multiple displacement amplification (MDA), degenerate...

  8. The complete mitochondrial genome of Oxyuris equi: Comparison with other closely related species and phylogenetic implications.

    Science.gov (United States)

    Zhang, Yan; Xu, Wen-Wen; Guo, Dong-Hui; Liu, Ze-Xuan; Duan, Hong; Su, Xin; Fu, Xue; Yue, Dong-Mei; Gao, Yuan; Wang, Chun-Ren

    2015-12-01

    The equine pinworm Oxyuris equi (Nematoda: Oxyuridomorpha) is the most common horse nematode, has a worldwide distribution, and causes major economic losses. In the present study, the complete O. equi mitochondrial (mt) genome was sequenced, and the mt genome structure and organization were compared with those of other closely related pinworm species, Enterobius vermicularis and Wellcomia siamensis. The O. equi mt genome is a 13,641-bp circular DNA molecule that encodes 36 genes (12 protein-coding genes, 22 tRNAs, and two rRNAs) and one non-coding region, which is slightly shorter than that of E. vermicularis and W. siamensis. The O. equi mt gene arrangement was consistent with that of GA13-type E. vermicularis but it differs from GA12-type W. siamensis. Phylogenetic analyses using concatenated amino acid sequences of the 12 protein-coding genes with three different computational algorithms (maximum parsimony, maximum likelihood, and Bayesian inference) revealed that there were two distinct clades in Chromadorea nematodes that reflected infraorder. Spiruromorpha formed one clade, whereas Rhabditomorpha, Ascaridomorpha, and Oxyuridomorpha formed another clade. O. equi, E. vermicularis, and W. siamensis represent distinct but closely related species, which indicated that Oxyuridomorpha is paraphyletic. Sequencing the O. equi mt genome provides novel genetic markers for studying the molecular epidemiology and population genetics of pinworms. PMID:26452611

  9. Cross-Disorder Genome-Wide Analyses Suggest a Complex Genetic Relationship Between Tourette Syndrome and Obsessive-Compulsive Disorder

    Science.gov (United States)

    Yu, Dongmei; Mathews, Carol A.; Scharf, Jeremiah M.; Neale, Benjamin M.; Davis, Lea K.; Gamazon, Eric R.; Derks, Eske M.; Evans, Patrick; Edlund, Christopher K.; Crane, Jacquelyn; Fagerness, Jesen A.; Osiecki, Lisa; Gallagher, Patience; Gerber, Gloria; Haddad, Stephen; Illmann, Cornelia; McGrath, Lauren M.; Mayerfeld, Catherine; Arepalli, Sampath; Barlassina, Cristina; Barr, Cathy L.; Bellodi, Laura; Benarroch, Fortu; Berrió, Gabriel Bedoya; Bienvenu, O. Joseph; Black, Donald; Bloch, Michael H.; Brentani, Helena; Bruun, Ruth D.; Budman, Cathy L.; Camarena, Beatriz; Campbell, Desmond D.; Cappi, Carolina; Cardona Silgado, Julio C.; Cavallini, Maria C.; Chavira, Denise A.; Chouinard, Sylvain; Cook, Edwin H.; Cookson, M. R.; Coric, Vladimir; Cullen, Bernadette; Cusi, Daniele; Delorme, Richard; Denys, Damiaan; Dion, Yves; Eapen, Valsama; Egberts, Karin; Falkai, Peter; Fernandez, Thomas; Fournier, Eduardo; Garrido, Helena; Geller, Daniel; Gilbert, Donald; Girard, Simon L.; Grabe, Hans J.; Grados, Marco A.; Greenberg, Benjamin D.; Gross-Tsur, Varda; Grünblatt, Edna; Hardy, John; Heiman, Gary A.; Hemmings, Sian M.J.; Herrera, Luis D.; Hezel, Dianne M.; Hoekstra, Pieter J.; Jankovic, Joseph; Kennedy, James L.; King, Robert A.; Konkashbaev, Anuar I.; Kremeyer, Barbara; Kurlan, Roger; Lanzagorta, Nuria; Leboyer, Marion; Leckman, James F.; Lennertz, Leonhard; Liu, Chunyu; Lochner, Christine; Lowe, Thomas L.; Lupoli, Sara; Macciardi, Fabio; Maier, Wolfgang; Manunta, Paolo; Marconi, Maurizio; McCracken, James T.; Mesa Restrepo, Sandra C.; Moessner, Rainald; Moorjani, Priya; Morgan, Jubel; Muller, Heike; Murphy, Dennis L.; Naarden, Allan L.; Ochoa, William Cornejo; Ophoff, Roel A.; Pakstis, Andrew J.; Pato, Michele T.; Pato, Carlos N.; Piacentini, John; Pittenger, Christopher; Pollak, Yehuda; Rauch, Scott L.; Renner, Tobias; Reus, Victor I.; Richter, Margaret A.; Riddle, Mark A.; Robertson, Mary M.; Romero, Roxana; Rosário, Maria C.; Rosenberg, David; Ruhrmann, Stephan; Sabatti, Chiara; Salvi, Erika; Sampaio, Aline S.; Samuels, Jack; Sandor, Paul; Service, Susan K.; Sheppard, Brooke; Singer, Harvey S.; Smit, Jan H.; Stein, Dan J.; Strengman, Eric; Tischfield, Jay A.; Turiel, Maurizio; Valencia Duarte, Ana V.; Vallada, Homero; Veenstra-VanderWeele, Jeremy; Walitza, Susanne; Walkup, John; Wang, Ying; Weale, Mike; Weiss, Robert; Wendland, Jens R.; Westenberg, Herman G.M.; Yao, Yin; Hounie, Ana G.; Miguel, Euripedes C.; Nicolini, Humberto; Wagner, Michael; Ruiz-Linares, Andres; Cath, Danielle C.; McMahon, William; Posthuma, Danielle; Oostra, Ben A.; Nestadt, Gerald; Rouleau, Guy A.; Purcell, Shaun; Jenike, Michael A.; Heutink, Peter; Hanna, Gregory L.; Conti, David V.; Arnold, Paul D.; Freimer, Nelson; Stewart, S. Evelyn; Knowles, James A.; Cox, Nancy J.; Pauls, David L.

    2014-01-01

    Obsessive-compulsive disorder (OCD) and Tourette Syndrome (TS) are highly heritable neurodevelopmental disorders that are thought to share genetic risk factors. However, the identification of definitive susceptibility genes for these etiologically complex disorders remains elusive. Here, we report a combined genome-wide association study (GWAS) of TS and OCD in 2723 cases (1310 with OCD, 834 with TS, 579 with OCD plus TS/chronic tics (CT)), 5667 ancestry-matched controls, and 290 OCD parent-child trios. Although no individual single nucleotide polymorphisms (SNPs) achieved genome-wide significance, the GWAS signals were enriched for SNPs strongly associated with variations in brain gene expression levels, i.e. expression quantitative loci (eQTLs), suggesting the presence of true functional variants that contribute to risk of these disorders. Polygenic score analyses identified a significant polygenic component for OCD (p=2×10−4), predicting 3.2% of the phenotypic variance in an independent data set. In contrast, TS had a smaller, non-significant polygenic component, predicting only 0.6% of the phenotypic variance (p=0.06). No significant polygenic signal was detected across the two disorders, although the sample is likely underpowered to detect a modest shared signal. Furthermore, the OCD polygenic signal was significantly attenuated when cases with both OCD and TS/CT were included in the analysis (p=0.01). Previous work has shown that TS and OCD have some degree of shared genetic variation. However, the data from this study suggest that there are also distinct components to the genetic architectures of TS and OCD. Furthermore, OCD with co-occurring TS/CT may have different underlying genetic susceptibility compared to OCD alone. PMID:25158072

  10. Large-scale genome-wide association studies and meta-analyses of longitudinal change in adult lung function.

    Directory of Open Access Journals (Sweden)

    Wenbo Tang

    Full Text Available Genome-wide association studies (GWAS have identified numerous loci influencing cross-sectional lung function, but less is known about genes influencing longitudinal change in lung function.We performed GWAS of the rate of change in forced expiratory volume in the first second (FEV1 in 14 longitudinal, population-based cohort studies comprising 27,249 adults of European ancestry using linear mixed effects model and combined cohort-specific results using fixed effect meta-analysis to identify novel genetic loci associated with longitudinal change in lung function. Gene expression analyses were subsequently performed for identified genetic loci. As a secondary aim, we estimated the mean rate of decline in FEV1 by smoking pattern, irrespective of genotypes, across these 14 studies using meta-analysis.The overall meta-analysis produced suggestive evidence for association at the novel IL16/STARD5/TMC3 locus on chromosome 15 (P  =  5.71 × 10(-7. In addition, meta-analysis using the five cohorts with ≥3 FEV1 measurements per participant identified the novel ME3 locus on chromosome 11 (P  =  2.18 × 10(-8 at genome-wide significance. Neither locus was associated with FEV1 decline in two additional cohort studies. We confirmed gene expression of IL16, STARD5, and ME3 in multiple lung tissues. Publicly available microarray data confirmed differential expression of all three genes in lung samples from COPD patients compared with controls. Irrespective of genotypes, the combined estimate for FEV1 decline was 26.9, 29.2 and 35.7 mL/year in never, former, and persistent smokers, respectively.In this large-scale GWAS, we identified two novel genetic loci in association with the rate of change in FEV1 that harbor candidate genes with biologically plausible functional links to lung function.

  11. Large-Scale Genome-Wide Association Studies and Meta-Analyses of Longitudinal Change in Adult Lung Function

    Science.gov (United States)

    Tang, Wenbo; Kowgier, Matthew; Loth, Daan W.; Soler Artigas, María; Joubert, Bonnie R.; Hodge, Emily; Gharib, Sina A.; Smith, Albert V.; Ruczinski, Ingo; Gudnason, Vilmundur; Mathias, Rasika A.; Harris, Tamara B.; Hansel, Nadia N.; Launer, Lenore J.; Barnes, Kathleen C.; Hansen, Joyanna G.; Albrecht, Eva; Aldrich, Melinda C.; Allerhand, Michael; Barr, R. Graham; Brusselle, Guy G.; Couper, David J.; Curjuric, Ivan; Davies, Gail; Deary, Ian J.; Dupuis, Josée; Fall, Tove; Foy, Millennia; Franceschini, Nora; Gao, Wei; Gläser, Sven; Gu, Xiangjun; Hancock, Dana B.; Heinrich, Joachim; Hofman, Albert; Imboden, Medea; Ingelsson, Erik; James, Alan; Karrasch, Stefan; Koch, Beate; Kritchevsky, Stephen B.; Kumar, Ashish; Lahousse, Lies; Li, Guo; Lind, Lars; Lindgren, Cecilia; Liu, Yongmei; Lohman, Kurt; Lumley, Thomas; McArdle, Wendy L.; Meibohm, Bernd; Morris, Andrew P.; Morrison, Alanna C.; Musk, Bill; North, Kari E.; Palmer, Lyle J.; Probst-Hensch, Nicole M.; Psaty, Bruce M.; Rivadeneira, Fernando; Rotter, Jerome I.; Schulz, Holger; Smith, Lewis J.; Sood, Akshay; Starr, John M.; Strachan, David P.; Teumer, Alexander; Uitterlinden, André G.; Völzke, Henry; Voorman, Arend; Wain, Louise V.; Wells, Martin T.; Wilk, Jemma B.; Williams, O. Dale; Heckbert, Susan R.; Stricker, Bruno H.; London, Stephanie J.; Fornage, Myriam; Tobin, Martin D.; O′Connor, George T.; Hall, Ian P.; Cassano, Patricia A.

    2014-01-01

    Background Genome-wide association studies (GWAS) have identified numerous loci influencing cross-sectional lung function, but less is known about genes influencing longitudinal change in lung function. Methods We performed GWAS of the rate of change in forced expiratory volume in the first second (FEV1) in 14 longitudinal, population-based cohort studies comprising 27,249 adults of European ancestry using linear mixed effects model and combined cohort-specific results using fixed effect meta-analysis to identify novel genetic loci associated with longitudinal change in lung function. Gene expression analyses were subsequently performed for identified genetic loci. As a secondary aim, we estimated the mean rate of decline in FEV1 by smoking pattern, irrespective of genotypes, across these 14 studies using meta-analysis. Results The overall meta-analysis produced suggestive evidence for association at the novel IL16/STARD5/TMC3 locus on chromosome 15 (P  =  5.71 × 10-7). In addition, meta-analysis using the five cohorts with ≥3 FEV1 measurements per participant identified the novel ME3 locus on chromosome 11 (P  =  2.18 × 10-8) at genome-wide significance. Neither locus was associated with FEV1 decline in two additional cohort studies. We confirmed gene expression of IL16, STARD5, and ME3 in multiple lung tissues. Publicly available microarray data confirmed differential expression of all three genes in lung samples from COPD patients compared with controls. Irrespective of genotypes, the combined estimate for FEV1 decline was 26.9, 29.2 and 35.7 mL/year in never, former, and persistent smokers, respectively. Conclusions In this large-scale GWAS, we identified two novel genetic loci in association with the rate of change in FEV1 that harbor candidate genes with biologically plausible functional links to lung function. PMID:24983941

  12. Genome sequence comparison of two United States live attenuated vaccines of infectious laryngotracheitis virus (ILTV).

    Science.gov (United States)

    Chandra, Yohanna Gita; Lee, Jeongyoon; Kong, Byung-Whi

    2012-06-01

    This study was conducted to identify unique nucleotide differences in two U.S. chicken embryo origin (CEO) vaccines [LT Blen (GenBank accession: JQ083493) designated as vaccine 1; Laryngo-Vac(®) (GenBank accession: JQ083494) designated as vaccine 2] of infectious laryngotracheitis virus (ILTV) genomes compared to an Australian Serva vaccine reference ILTV genome sequence [Gallid herpesvirus 1 (GaHV-1); GenBank accession number: HQ630064]. Genomes of the two vaccine ILTV strains were sequenced using Illumina Genome Analyzer 2X of 36 cycles of single-end reads. Results revealed that few nucleotide differences (23 in vaccine 1; 31 in vaccine 2) were found and indicate that the US CEO strains are practically identical to the Australian Serva CEO strain, which is a European-origin vaccine. The sequence differences demonstrated the spectrum of variability among vaccine strains. Only eight amino acid differences were found in ILTV proteins including UL54, UL27, UL28, UL20, UL1, ICP4, and US8 in vaccine 1. Similarly, in vaccine 2, eight amino acid differences were found in UL54, UL27, UL28, UL36, UL1, ICP4, US10, and US8. Further comparison of US CEO vaccines to several ILTV genome sequences revealed that US CEO vaccines are genetically close to both the Serva vaccine and 63140/C/08/BR (GenBank accession: HM188407) and are distinct from the two Australian-origin CEO vaccines, SA2 (GenBank accession: JN596962) and A20 (GenBank accession: JN596963), which showed close similarity to each other. These data demonstrate the potential of high-throughput sequencing technology to yield insight into the sequence variation of different ILTV strains. This information can be used to discriminate between vaccine ILTV strains and further, to identify newly emerging mutant strains of field isolates. PMID:22382591

  13. Genomic comparisons of Brucella spp. and closely related bacteria using base compositional and proteome based methods

    DEFF Research Database (Denmark)

    Bohlin, Jon; Snipen, Lars; Cloeckaert, Axel; Lagesen, Karin; Ussery, David; Kristoffersen, Anja B.; Godfroid, Jacques

    2010-01-01

    BACKGROUND: Classification of bacteria within the genus Brucella has been difficult due in part to considerable genomic homogeneity between the different species and biovars, in spite of clear differences in phenotypes. Therefore, many different methods have been used to assess Brucella taxonomy...... differences could be detected between all genera included in this study using the codon and amino acid frequencies based methods. Proteome comparisons were found to be in strong accordance with current Brucella taxonomy indicating a remarkable association between gene gain or loss on one hand and mutations in...

  14. Genomic and Phenotypic Analyses Reveal the Emergence of an Atypical Salmonella enterica Serovar Senftenberg Variant in China

    KAUST Repository

    Abd El Ghany, Moataz

    2016-05-25

    Human infections with Salmonella enterica subspecies enterica serovar Senftenberg are often associated with exposure to poultry flocks, farm environments, or contaminated food. The recent emergence of multidrug-resistant isolates has raised public health concerns. In this study, comparative genomics and phenotypic analysis were used to characterize 14 Salmonella Senftenberg clinical isolates recovered from multiple outbreaks in Shenzhen and Shanghai, China, between 2002 and 2011. Single-nucleotide polymorphism analyses identified two phylogenetically distinct clades of S. Senftenberg, designated SC1 and SC2, harboring variations in Salmonella pathogenicity island 1 (SPI-1) and SPI-2 and exhibiting distinct biochemical and phenotypic signatures. Although the two variants shared the same serotype, the SC2 isolates of sequence type 14 (ST14) harbored intact SPI-1 and -2 and hence were characterized by possessing efficient invasion capabilities. In contrast, the SC1 isolates had structural deletion patterns in both SPI-1 and -2 that correlated with an impaired capacity to invade cultured human cells and also the year of their isolation. These atypical SC1 isolates also lacked the capacity to produce hydrogen sulfide. These findings highlight the emergence of atypical Salmonella Senftenberg variants in China and provide genetic validation that variants lacking SPI-1 and regions of SPI-2, which leads to impaired invasion capacity, can still cause clinical disease. These data have identified an emerging public health concern and highlight the need to strengthen surveillance to detect the prevalence and transmission of nontyphoidal Salmonella species.

  15. Comparison of different sequencing and assembly strategies for a repeat-rich fungal genome, Ophiocordyceps sinensis.

    Science.gov (United States)

    Li, Yi; Hsiang, Tom; Yang, Rui-Heng; Hu, Xiao-Di; Wang, Ke; Wang, Wen-Jing; Wang, Xiao-Liang; Jiao, Lei; Yao, Yi-Jian

    2016-09-01

    Ophiocordyceps sinensis is one of the most expensive medicinal fungi world-wide, and has been used as a traditional Chinese medicine for centuries. In a recent report, the genome of this fungus was found to be expanded by extensive repetitive elements after assembly of Roche 454 (223Mb) and Illumina HiSeq (10.6Gb) sequencing data, producing a genome of 87.7Mb with an N50 scaffold length of 12kb and 6972 predicted genes. To test whether the assembly could be improved by deeper sequencing and to assess the amount of data needed for optimal assembly, genomic sequencing was run several times on genomic DNA extractions of a single ascospore isolate (strain 1229) on an Illumina HiSeq platform (25Gb total data). Assemblies were produced using different data types (raw vs. trimmed) and data amounts, and using three freely available assembly programs (ABySS, SOAP and Velvet). In nearly all cases, trimming the data for low quality base calls did not provide assemblies with higher N50 values compared to the non-trimmed data, and increasing the amount of input data (i.e. sequence reads) did not always lead to higher N50 values. Depending on the assembly program and data type, the maximal N50 was reached with between 50% to 90% of the total read data, equivalent to 100× to 200× coverage. The draft genome assembly was improved over the previously published version resulting in a 114Mb assembly, scaffold N50 of 70kb and 9610 predicted genes. Among the predicted genes, 9213 were validated by RNA-Seq analysis in this study, of which 8896 were found to be singletons. Evidence from genome and transcriptome analyses indicated that species assemblies could be improved with defined input material (e.g. haploid mono-ascospore isolate) without the requirement of multiple sequencing technologies, multiple library sizes or data trimming for low quality base calls, and with genome coverages between 100× and 200×. PMID:27343682

  16. The complete chloroplast genome sequence of Brachypodium distachyon: sequence comparison and phylogenetic analysis of eight grass plastomes

    Directory of Open Access Journals (Sweden)

    Anderson Olin D

    2008-07-01

    Full Text Available Abstract Background Wheat, barley, and rye, of tribe Triticeae in the Poaceae, are among the most important crops worldwide but they present many challenges to genomics-aided crop improvement. Brachypodium distachyon, a close relative of those cereals has recently emerged as a model for grass functional genomics. Sequencing of the nuclear and organelle genomes of Brachypodium is one of the first steps towards making this species available as a tool for researchers interested in cereals biology. Findings The chloroplast genome of Brachypodium distachyon was sequenced by a combinational approach using BAC end and shotgun sequences derived from a selected BAC containing the entire chloroplast genome. Comparative analysis indicated that the chloroplast genome is conserved in gene number and organization with respect to those of other cereals. However, several Brachypodium genes evolve at a faster rate than those in other grasses. Sequence analysis reveals that rice and wheat have a ~2.1 kb deletion in their plastid genomes and this deletion must have occurred independently in both species. Conclusion We demonstrate that BAC libraries can be used to sequence plastid, and likely other organellar, genomes. As expected, the Brachypodium chloroplast genome is very similar to those of other sequenced grasses. The phylogenetic analyses and the pattern of insertions and deletions in the chloroplast genome confirmed that Brachypodium is a close relative of the tribe Triticeae. Nevertheless, we show that some large indels can arise multiple times and may confound phylogenetic reconstruction.

  17. Multiple-genome comparison reveals new loci for Mycobacterium species identification.

    Science.gov (United States)

    Dai, Jianli; Chen, Yuansha; Dean, Susan; Morris, J Glenn; Salfinger, Max; Johnson, Judith A

    2011-01-01

    To identify loci useful for species identification and to enhance our understanding of the population structure and genetic variability of the genus Mycobacterium, we conducted a multiple-genome comparison of a total of 27 sequenced genomes in the suborder of Corynebacterineae (18 from the Mycobacterium genus, 7 from the Corynebacterium genus, 1 each from the Nocardia and Rhodococcus genera). Our study revealed 26 informative loci for species identification in Mycobacterium. The sequences from these loci were used in a phylogenetic analysis to infer the evolutionary relations of the 18 mycobacterial genomes. Among the loci that we identified, rpoBC, dnaK, and hsp65 were amplified from 29 ATCC reference strains and 17 clinical isolates and sequenced. The phylogenetic trees generated from these loci show similar topologies. The newly identified dnaK locus is more discriminatory and more robust than the widely used hsp65 locus. The length-variable rpoBC locus is the first intergenic locus between two protein-encoding genes being used for mycobacterial species identification. A multilocus sequence analysis system including the rpoBC, dnaK, and hsp65 loci is a robust tool for accurate identification of Mycobacterium species. PMID:21048007

  18. Genome-wide linkage analyses of two repetitive behavior phenotypes in Utah pedigrees with autism spectrum disorders

    Directory of Open Access Journals (Sweden)

    Cannon Dale S

    2010-02-01

    Full Text Available Abstract Background It has been suggested that efforts to identify genetic risk markers of autism spectrum disorder (ASD would benefit from the analysis of more narrowly defined ASD phenotypes. Previous research indicates that 'insistence on sameness' (IS and 'repetitive sensory-motor actions' (RSMA are two factors within the ASD 'repetitive and stereotyped behavior' domain. The primary aim of this study was to identify genetic risk markers of both factors to allow comparison of those markers with one another and with markers found in the same set of pedigrees using ASD diagnosis as the phenotype. Thus, we empirically addresses the possibilities that more narrowly defined phenotypes improve linkage analysis signals and that different narrowly defined phenotypes are associated with different loci. Secondary aims were to examine the correlates of IS and RSMA and to assess the heritability of both scales. Methods A genome-wide linkage analysis was conducted with a sample of 70 multiplex ASD pedigrees using IS and RSMA as phenotypes. Genotyping services were provided by the Center for Inherited Disease Research using the 6 K single nucleotide polymorphism linkage panel. Analysis was done using the multipoint linkage software program MCLINK, a Markov chain Monte Carlo (MCMC method that allows for multilocus linkage analysis on large extended pedigrees. Results Genome-wide significance was observed for IS at 2q37.1-q37.3 (dominant model heterogeneity lod score (hlod 3.42 and for RSMA at 15q13.1-q14 (recessive model hlod 3.93. We found some linkage signals that overlapped and others that were not observed in our previous linkage analysis of the ASD phenotype in the same pedigrees, and regions varied in the range of phenotypes with which they were linked. A new finding with respect to IS was that it is positively associated with IQ if the IS-RSMA correlation is statistically controlled. Conclusions The finding that IS and RSMA are linked to different

  19. Genome-wide and molecular evolution analyses of the phospholipase D gene family in Poplar and Grape

    Directory of Open Access Journals (Sweden)

    Yang Yongping

    2010-06-01

    Full Text Available Abstract Background The Phospholipase D (PLD family plays an important role in the regulation of cellular processes in plants, including abscisic acid signaling, programmed cell death, root hair patterning, root growth, freezing tolerance and other stress responses. PLD genes constitute an important gene family in higher plants. However, until now our knowledge concerning the PLD gene family members and their evolutionary relationship in woody plants such as Poplar and Grape has been limited. Results In this study, we have provided a genome-wide analysis of the PLD gene family in Poplar and Grape. Eighteen and eleven members of the PLD gene family were identified in Poplar and Grape respectively. Phylogenetic and gene structure analyses showed that the PLD gene family can be divided into 6 subgroups: α, β/γ, δ, ε, ζ, and φ, and that the 6 PLD subgroups originated from 4 original ancestors through a series of gene duplications. Interestingly, the majority of the PLD genes from both Poplar (76.5%, 13/17 and Grape (90.9%, 10/11 clustered closely together in the phylogenetic tree to the extent that their evolutionary relationship appears more tightly linked to each other, at least in terms of the PLD gene family, than it does to either Arabidopsis or rice. Five pairs of duplicated PLD genes were identified in Poplar, more than those in Grape, suggesting that frequent gene duplications occurred after these species diverged, resulting in a rapid expansion of the PLD gene family in Poplar. The majority of the gene duplications in Poplar were caused by segmental duplication and were distinct from those in Arabidopsis, rice and Grape. Additionally, the gene duplications in Poplar were estimated to have occurred from 11.31 to 13.76 million years ago, which are later than those that occurred in the other three plant species. Adaptive evolution analysis showed that positive selection contributed to the evolution of the PXPH- and SP-PLDs, whereas

  20. The complete mitochondrial genome sequence of Geisha distinctissima (Hemiptera: Flatidae) and comparison with other hemipteran insects.

    Science.gov (United States)

    Song, Nan; Liang, Aiping

    2009-03-01

    The complete nucleotide sequence of the mitochondrial genome (mitogenome) of Geisha distinctissima (Hemiptera: Flatidae) has been determined in this study. The genome is a circular molecule of 15,971 bp with a total A+T content of 75.1%. The gene content, order, and structure are consistent with the Drosophila yakuba genome structure and the hypothesized ancestral arthropod genome arrangement. All 13 protein-coding genes are observed to have a putative, inframe ATR methionine or ATT isoleucine codons as start signals. Canonical TAA and TAG termination codons are found in nine protein-coding genes, and the remaining four (cox1, atp6, cox3, and nad4) have incomplete termination codons. The anticodons of all transfer RNA (tRNAs) are identical to those observed in D. yakuba and Philaenus spumarius, and can be folded in the form of a typical clover-leaf structure except for tRNA(Ser(AGN)). The major non-coding region (the A+T-rich region or putative control region) between the small ribosomal subunit and the tRNA(Ile) gene includes two sets of repeat regions. The first repeat region consists of a direct 152-bp repetitive unit located near the srRNA gene end, and the second repeat region is composed of a direct repeat unit of 19 bp located toward tRNA(Ile) gene. Comparisons of gene variability across the order suggest that the gene content and arrangement of G. distinctissima mitogenome are similar to other hemipteran insects. PMID:19280059

  1. Phylogenetic Relationships of Tetraploid AB-Genome Avena Species Evaluated by Means of Cytogenetic (C-Banding and FISH) and RAPD Analyses

    OpenAIRE

    Badaeva, E. D.; O. Yu. Shelukhina; Goryunova, S. V.; Loskutov, I. G.; V. A. Pukhalskiy

    2010-01-01

    Tetraploid oat species Avena abyssinica, A. vaviloviana, A. barbata, and A. agadiriana were studied using C-banding technique, in situ hybridization with the 45S and 5S rDNA probes, and RAPD analysis in comparison with the diploid species carrying different types of the A-genome (A. wiestii, As; A. longiglumis, Al; A. canariensis, Ac; A. damascena, Ad, A. prostrata, Ap). The investigation confirmed that all four tetraploids belong to the same AB-genome group; however A. agadiriana occupies di...

  2. Genome-Wide Search Identifies 1.9 Mb from the Polar Bear Y Chromosome for Evolutionary Analyses

    OpenAIRE

    Bidon, Tobias; Schreck, Nancy; Hailer, Frank; Nilsson, Maria A.; Janke, Axel

    2015-01-01

    The male-inherited Y chromosome is the major haploid fraction of the mammalian genome, rendering Y-linked sequences an indispensable resource for evolutionary research. However, despite recent large-scale genome sequencing approaches, only a handful of Y chromosome sequences have been characterized to date, mainly in model organisms. Using polar bear (Ursus maritimus) genomes, we compare two different in silico approaches to identify Y-linked sequences: 1) Similarity to known Y-linked genes a...

  3. Time-series analyses of Monterey Bay coastal microbial picoplankton using a ‘genome proxy’ microarray

    OpenAIRE

    Rich, Virginia I.; Pham, Vinh D.; Eppley, John Marmaduke; Shi, Yanmei; DeLong, Edward

    2010-01-01

    To investigate the temporal, spatial and phylogenetic resolution of marine microbial community structure and variability, we designed and expanded a genome proxy array (an oligonucleotide microarray targeting marine microbial genome fragments and genomes), evaluated it against metagenomic sequencing, and applied it to time-series samples from the Monterey Bay. The expanded array targeted 268 microbial genotypes across much of the known diversity of cultured and uncultured marine microbes. The...

  4. Genomic and proteomic analyses of Prdm5 reveal interactions with insulator binding proteins in embryonic stem cells

    DEFF Research Database (Denmark)

    Galli, Giorgio Giacomo; Carrara, Matteo; Francavilla, Chiara;

    2013-01-01

    find that Prdm5 is highly expressed in mouse embryonic stem cells (mES) and exploit this cellular system to characterize molecular functions of Prdm5. By combining proteomics and next generation sequencing technologies we identify Prdm5 interaction partners and genomic occupancy. We demonstrate that......-occupies genomic loci. In summary, our data indicate how Prdm5 may modulate transcription by interacting with factors involved in genome organization in mouse embryonic stem cells....

  5. Estimation of the Whitefly Bemisia tabaci Genome Size Based on k-mer and Flow Cytometric Analyses

    OpenAIRE

    Wenbo Chen; Hasegawa, Daniel K.; Kathiravetpillai Arumuganathan; Simmons, Alvin M.; Wintermantel, William M.; Zhangjun Fei; Kai-Shu Ling

    2015-01-01

    Whiteflies of the Bemisia tabaci (Hemiptera: Aleyrodidae) cryptic species complex are among the most important agricultural insect pests in the world. These phloem-feeding insects can colonize over 1000 species of plants worldwide and inflict severe economic losses to crops, mainly through the transmission of pathogenic viruses. Surprisingly, there is very little genomic information about whiteflies. As a starting point to genome sequencing, we report a new estimation of the genome size of ...

  6. Analysis of a set of Australian northern brown bandicoot expressed sequence tags with comparison to the genome sequence of the South American grey short tailed opossum

    Directory of Open Access Journals (Sweden)

    Miller Robert D

    2007-02-01

    the first reported marsupial EST datasets with a significant inter-species genome comparison between marsupials, providing a valuable resource for transcriptional analyses in marsupials and for future annotation of marsupial whole genome sequences.

  7. Genome-Wide Evolutionary Analyses of G1P[8] Strains Isolated Before and After Rotavirus Vaccine Introduction

    Science.gov (United States)

    Zeller, Mark; Donato, Celeste; Trovão, Nídia Sequeira; Cowley, Daniel; Heylen, Elisabeth; Donker, Nicole C.; McAllen, John K.; Akopov, Asmik; Kirkness, Ewen F.; Lemey, Philippe; Van Ranst, Marc; Matthijnssens, Jelle; Kirkwood, Carl D.

    2015-01-01

    Rotaviruses are the most important etiological agent of acute gastroenteritis in young children worldwide. Among the first countries to introduce rotavirus vaccines into their national immunization programs were Belgium (November 2006) and Australia (July 2007). Surveillance programs in Belgium (since 1999) and Australia (since 1989) offer the opportunity to perform a detailed comparison of rotavirus strains circulating pre- and postvaccine introduction. G1P[8] rotaviruses are the most prominent genotype in humans, and a total of 157 G1P[8] rotaviruses isolated between 1999 and 2011 were selected from Belgium and Australia and their complete genomes were sequenced. Phylogenetic analysis showed evidence of frequent reassortment among Belgian and Australian G1P[8] rotaviruses. Although many different phylogenetic subclusters were present before and after vaccine introduction, some unique clusters were only identified after vaccine introduction, which could be due to natural fluctuation or the first signs of vaccine-driven evolution. The times to the most recent common ancestors for the Belgian and Australian G1P[8] rotaviruses ranged from 1846 to 1955 depending on the gene segment, with VP7 and NSP4 resulting in the most recent estimates. We found no evidence that rotavirus population size was affected after vaccine introduction and only six amino acid sites in VP2, VP3, VP7, and NSP1 were identified to be under positive selective pressure. Continued surveillance of G1P[8] strains is needed to determine long-term effects of vaccine introductions, particularly now rotavirus vaccines are implemented in the national immunization programs of an increasing number of countries worldwide. PMID:26254487

  8. Genome-Wide Evolutionary Analyses of G1P[8] Strains Isolated Before and After Rotavirus Vaccine Introduction.

    Science.gov (United States)

    Zeller, Mark; Donato, Celeste; Trovão, Nídia Sequeira; Cowley, Daniel; Heylen, Elisabeth; Donker, Nicole C; McAllen, John K; Akopov, Asmik; Kirkness, Ewen F; Lemey, Philippe; Van Ranst, Marc; Matthijnssens, Jelle; Kirkwood, Carl D

    2015-09-01

    Rotaviruses are the most important etiological agent of acute gastroenteritis in young children worldwide. Among the first countries to introduce rotavirus vaccines into their national immunization programs were Belgium (November 2006) and Australia (July 2007). Surveillance programs in Belgium (since 1999) and Australia (since 1989) offer the opportunity to perform a detailed comparison of rotavirus strains circulating pre- and postvaccine introduction. G1P[8] rotaviruses are the most prominent genotype in humans, and a total of 157 G1P[8] rotaviruses isolated between 1999 and 2011 were selected from Belgium and Australia and their complete genomes were sequenced. Phylogenetic analysis showed evidence of frequent reassortment among Belgian and Australian G1P[8] rotaviruses. Although many different phylogenetic subclusters were present before and after vaccine introduction, some unique clusters were only identified after vaccine introduction, which could be due to natural fluctuation or the first signs of vaccine-driven evolution. The times to the most recent common ancestors for the Belgian and Australian G1P[8] rotaviruses ranged from 1846 to 1955 depending on the gene segment, with VP7 and NSP4 resulting in the most recent estimates. We found no evidence that rotavirus population size was affected after vaccine introduction and only six amino acid sites in VP2, VP3, VP7, and NSP1 were identified to be under positive selective pressure. Continued surveillance of G1P[8] strains is needed to determine long-term effects of vaccine introductions, particularly now rotavirus vaccines are implemented in the national immunization programs of an increasing number of countries worldwide. PMID:26254487

  9. Estimation of the Whitefly Bemisia tabaci Genome Size Based on k-mer and Flow Cytometric Analyses.

    Science.gov (United States)

    Chen, Wenbo; Hasegawa, Daniel K; Arumuganathan, Kathiravetpillai; Simmons, Alvin M; Wintermantel, William M; Fei, Zhangjun; Ling, Kai-Shu

    2015-01-01

    Whiteflies of the Bemisia tabaci (Hemiptera: Aleyrodidae) cryptic species complex are among the most important agricultural insect pests in the world. These phloem-feeding insects can colonize over 1000 species of plants worldwide and inflict severe economic losses to crops, mainly through the transmission of pathogenic viruses. Surprisingly, there is very little genomic information about whiteflies. As a starting point to genome sequencing, we report a new estimation of the genome size of the B. tabaci B biotype or Middle East-Asia Minor 1 (MEAM1) population. Using an isogenic whitefly colony with over 6500 haploid male individuals for genomic DNA, three paired-end genomic libraries with insert sizes of ~300 bp, 500 bp and 1 Kb were constructed and sequenced on an Illumina HiSeq 2500 system. A total of ~50 billion base pairs of sequences were obtained from each library. K-mer analysis using these sequences revealed that the genome size of the whitefly was ~682.3 Mb. In addition, the flow cytometric analysis estimated the haploid genome size of the whitefly to be ~690 Mb. Considering the congruency between both estimation methods, we predict the haploid genome size of B. tabaci MEAM1 to be ~680-690 Mb. Our data provide a baseline for ongoing efforts to assemble and annotate the B. tabaci genome. PMID:26463411

  10. Estimation of the Whitefly Bemisia tabaci Genome Size Based on k-mer and Flow Cytometric Analyses

    Directory of Open Access Journals (Sweden)

    Wenbo Chen

    2015-07-01

    Full Text Available Whiteflies of the Bemisia tabaci (Hemiptera: Aleyrodidae cryptic species complex are among the most important agricultural insect pests in the world. These phloem-feeding insects can colonize over 1000 species of plants worldwide and inflict severe economic losses to crops, mainly through the transmission of pathogenic viruses. Surprisingly, there is very little genomic information about whiteflies. As a starting point to genome sequencing, we report a new estimation of the genome size of the B. tabaci B biotype or Middle East-Asia Minor 1 (MEAM1 population. Using an isogenic whitefly colony with over 6500 haploid male individuals for genomic DNA, three paired-end genomic libraries with insert sizes of ~300 bp, 500 bp and 1 Kb were constructed and sequenced on an Illumina HiSeq 2500 system. A total of ~50 billion base pairs of sequences were obtained from each library. K-mer analysis using these sequences revealed that the genome size of the whitefly was ~682.3 Mb. In addition, the flow cytometric analysis estimated the haploid genome size of the whitefly to be ~690 Mb. Considering the congruency between both estimation methods, we predict the haploid genome size of B. tabaci MEAM1 to be ~680–690 Mb. Our data provide a baseline for ongoing efforts to assemble and annotate the B. tabaci genome.

  11. Genomic Resources for Water Yam (Dioscorea alata L.): Analyses of EST-Sequences, De Novo Sequencing and GBS Libraries

    Science.gov (United States)

    Saski, Christopher A.; Bhattacharjee, Ranjana; Scheffler, Brian E.; Asiedu, Robert

    2015-01-01

    The reducing cost and rapid progress in next-generation sequencing techniques coupled with high performance computational approaches have resulted in large-scale discovery of advanced genomic resources in several model and non-model plant species. Yam (Dioscorea spp.) is a major food and cash crop in many countries but research efforts have been limited to understand the genetics and generate genomic information for the crop. The availability of a large number of genomic resources including genome-wide molecular markers will accelerate the breeding efforts and application of genomic selection in yams. In the present study, several methods including expressed sequence tags (EST)-sequencing, de novo sequencing, and genotyping-by-sequencing (GBS) profiles on two yam (Dioscorea alata L.) genotypes (TDa 95/00328 and TDa 95-310) was performed to generate genomic resources for use in its improvement programs. This includes a comprehensive set of EST-SSRs, genomic SSRs, whole genome SNPs, and reduced representation SNPs. A total of 1,152 EST-SSRs were developed from >40,000 EST-sequences generated from the two genotypes. A set of 388 EST-SSRs were validated as polymorphic showing a polymorphism rate of 34% when tested on two diverse parents targeted for anthracnose disease. In addition, approximately 40X de novo whole genome sequence coverage was generated for each of the two genotypes, and a total of 18,584 and 15,952 genomic SSRs were identified for TDa 95/00328 and TDa 95-310, respectively. A custom made pipeline resulted in the selection of 573 genomic SSRs common across the two genotypes, of which only eight failed, 478 being polymorphic and 62 monomorphic indicating a polymorphic rate of 83.5%. Additionally, 288,505 high quality SNPs were also identified between these two genotypes. Genotyping by sequencing reads on these two genotypes also revealed 36,790 overlapping SNP positions that are distributed throughout the genome. Our efforts in using different approaches

  12. Genomic Resources for Water Yam (Dioscorea alata L.: Analyses of EST-Sequences, De Novo Sequencing and GBS Libraries.

    Directory of Open Access Journals (Sweden)

    Christopher A Saski

    Full Text Available The reducing cost and rapid progress in next-generation sequencing techniques coupled with high performance computational approaches have resulted in large-scale discovery of advanced genomic resources in several model and non-model plant species. Yam (Dioscorea spp. is a major food and cash crop in many countries but research efforts have been limited to understand the genetics and generate genomic information for the crop. The availability of a large number of genomic resources including genome-wide molecular markers will accelerate the breeding efforts and application of genomic selection in yams. In the present study, several methods including expressed sequence tags (EST-sequencing, de novo sequencing, and genotyping-by-sequencing (GBS profiles on two yam (Dioscorea alata L. genotypes (TDa 95/00328 and TDa 95-310 was performed to generate genomic resources for use in its improvement programs. This includes a comprehensive set of EST-SSRs, genomic SSRs, whole genome SNPs, and reduced representation SNPs. A total of 1,152 EST-SSRs were developed from >40,000 EST-sequences generated from the two genotypes. A set of 388 EST-SSRs were validated as polymorphic showing a polymorphism rate of 34% when tested on two diverse parents targeted for anthracnose disease. In addition, approximately 40X de novo whole genome sequence coverage was generated for each of the two genotypes, and a total of 18,584 and 15,952 genomic SSRs were identified for TDa 95/00328 and TDa 95-310, respectively. A custom made pipeline resulted in the selection of 573 genomic SSRs common across the two genotypes, of which only eight failed, 478 being polymorphic and 62 monomorphic indicating a polymorphic rate of 83.5%. Additionally, 288,505 high quality SNPs were also identified between these two genotypes. Genotyping by sequencing reads on these two genotypes also revealed 36,790 overlapping SNP positions that are distributed throughout the genome. Our efforts in using

  13. Genome sequence of Candidatus Riesia pediculischaeffi, endosymbiont of chimpanzee lice, and genomic comparison of recently acquired endosymbionts from human and chimpanzee lice.

    Science.gov (United States)

    Boyd, Bret M; Allen, Julie M; de Crécy-Lagard, Valérie; Reed, David L

    2014-11-01

    The obligate-heritable endosymbionts of insects possess some of the smallest known bacterial genomes. This is likely due to loss of genomic material during symbiosis. The mode and rate of this erosion may change over evolutionary time: faster in newly formed associations and slower in long-established ones. The endosymbionts of human and anthropoid primate lice present a unique opportunity to study genome erosion in newly established (or young) symbionts. This is because we have a detailed phylogenetic history of these endosymbionts with divergence dates for closely related species. This allows for genome evolution to be studied in detail and rates of change to be estimated in a phylogenetic framework. Here, we sequenced the genome of the chimpanzee louse endosymbiont (Candidatus Riesia pediculischaeffi) and compared it with the closely related genome of the human body louse endosymbiont. From this comparison, we found evidence for recent genome erosion leading to gene loss in these endosymbionts. Although gene loss was detected, it was not significantly greater than in older endosymbionts from aphids and ants. Additionally, we searched for genes associated with B-vitamin synthesis in the two louse endosymbiont genomes because these endosymbionts are believed to synthesize essential B vitamins absent in the louse's diet. All of the expected genes were present, except those involved in thiamin synthesis. We failed to find genes encoding for proteins involved in the biosynthesis of thiamin or any complete exogenous means of salvaging thiamin, suggesting there is an undescribed mechanism for the salvage of thiamin. Finally, genes encoding for the pantothenate de novo biosynthesis pathway were located on a plasmid in both taxa along with a heat shock protein. Movement of these genes onto a plasmid may be functionally and evolutionarily significant, potentially increasing production and guarding against the deleterious effects of mutation. These data add to a growing

  14. Initial Evidence for Adaptive Selection on the NADH Subunit Two of Freshwater Dolphins by Analyses of Mitochondrial Genomes.

    Directory of Open Access Journals (Sweden)

    Susana Caballero

    Full Text Available A small number of cetaceans have adapted to an entirely freshwater environment, having colonized rivers in Asia and South America from an ancestral origin in the marine environment. This includes the 'river dolphins', early divergence from the odontocete lineage, and two species of true dolphins (Family Delphinidae. Successful adaptation to the freshwater environment may have required increased demands in energy involved in processes such as the mitochondrial osmotic balance. For this reason, riverine odontocetes provide a compelling natural experiment in adaptation of mammals from marine to freshwater habitats. Here we present initial evidence of positive selection in the NADH dehydrogenase subunit 2 of riverine odontocetes by analyses of full mitochondrial genomes, using tests of selection and protein structure modeling. The codon model with highest statistical support corresponds to three discrete categories for amino acid sites, those under positive, neutral, and purifying selection. With this model we found positive selection at site 297 of the NADH dehydrogenase subunit 2 (dN/dS>1.0, leading to a substitution of an Ala or Val from the ancestral state of Thr. A phylogenetic reconstruction of 27 cetacean mitogenomes showed that an Ala substitution has evolved at least four times in cetaceans, once or more in the three 'river dolphins' (Families Pontoporidae, Lipotidae and Inidae, once in the riverine Sotalia fluviatilis (but not in its marine sister taxa, once in the riverine Orcaella brevirostris from the Mekong River (but not in its marine sister taxa and once in two other related marine dolphins. We located the position of this amino acid substitution in an alpha-helix channel in the trans-membrane domain in both the E. coli structure and Sotalia fluviatilis model. In E. coli this position is located in a helix implicated in a proton translocation channel of respiratory complex 1 and may have a similar role in the NADH dehydrogenases of

  15. Genome-wide classification and evolutionary and expression analyses of citrus MYB transcription factor families in sweet orange.

    Directory of Open Access Journals (Sweden)

    Xiao-Jin Hou

    Full Text Available MYB family genes are widely distributed in plants and comprise one of the largest transcription factors involved in various developmental processes and defense responses of plants. To date, few MYB genes and little expression profiling have been reported for citrus. Here, we describe and classify 177 members of the sweet orange MYB gene (CsMYB family in terms of their genomic gene structures and similarity to their putative Arabidopsis orthologs. According to these analyses, these CsMYBs were categorized into four groups (4R-MYB, 3R-MYB, 2R-MYB and 1R-MYB. Gene structure analysis revealed that 1R-MYB genes possess relatively more introns as compared with 2R-MYB genes. Investigation of their chromosomal localizations revealed that these CsMYBs are distributed across nine chromosomes. Sweet orange includes a relatively small number of MYB genes compared with the 198 members in Arabidopsis, presumably due to a paralog reduction related to repetitive sequence insertion into promoter and non-coding transcribed region of the genes. Comparative studies of CsMYBs and Arabidopsis showed that CsMYBs had fewer gene duplication events. Expression analysis revealed that the MYB gene family has a wide expression profile in sweet orange development and plays important roles in development and stress responses. In addition, 337 new putative microsatellites with flanking sequences sufficient for primer design were also identified from the 177 CsMYBs. These results provide a useful reference for the selection of candidate MYB genes for cloning and further functional analysis forcitrus.

  16. Incorporating Concomitant Medications into Genome-Wide Analyses for the Study of Complex Disease and Drug Response

    Science.gov (United States)

    Graham, Hillary T.; Rotroff, Daniel M.; Marvel, Skylar W.; Buse, John B.; Havener, Tammy M.; Wilson, Alyson G.; Wagner, Michael J.; Motsinger-Reif, Alison A.; Friedewald, W.T.

    2016-01-01

    Given the high costs of conducting a drug-response trial, researchers are now aiming to use retrospective analyses to conduct genome-wide association studies (GWAS) to identify underlying genetic contributions to drug-response variation. To prevent confounding results from a GWAS to investigate drug response, it is necessary to account for concomitant medications, defined as any medication taken concurrently with the primary medication being investigated. We use data from the Action to Control Cardiovascular Disease (ACCORD) trial in order to implement a novel scoring procedure for incorporating concomitant medication information into a linear regression model in preparation for GWAS. In order to accomplish this, two primary medications were selected: thiazolidinediones and metformin because of the wide-spread use of these medications and large sample sizes available within the ACCORD trial. A third medication, fenofibrate, along with a known confounding medication, statin, were chosen as a proof-of-principle for the scoring procedure. Previous studies have identified SNP rs7412 as being associated with statin response. Here we hypothesize that including the score for statin as a covariate in the GWAS model will correct for confounding of statin and yield a change in association at rs7412. The response of the confounded signal was successfully diminished from p = 3.19 × 10−7 to p = 1.76 × 10−5, by accounting for statin using the scoring procedure presented here. This approach provides the ability for researchers to account for concomitant medications in complex trial designs where monotherapy treatment regimens are not available.

  17. Genome Sequence of Azospirillum brasilense CBG497 and Comparative Analyses of Azospirillum Core and Accessory Genomes provide Insight into Niche Adaptation

    OpenAIRE

    Victor González; Alberto Mendoza Herrera; Valérie Barbe; Zoé Rouy; Claire Prigent-Combaret; Benoît Drogue; Stéphanie Borland; Erika Acosta-Cruz; Luis Lozano; Florence Wisniewski-Dyé; Patrick Mavingui

    2012-01-01

    Bacteria of the genus Azospirillum colonize roots of important cereals and grasses, and promote plant growth by several mechanisms, notably phytohormone synthesis. The genomes of several Azospirillum strains belonging to different species, isolated from various host plants and locations, were recently sequenced and published. In this study, an additional genome of an A. brasilense strain, isolated from maize grown on an alkaline soil in the northeast of Mexico, strain CBG497, was obtained. Co...

  18. No evidence for genome-wide interactions on plasma fibrinogen by smoking, alcohol consumption and body mass index: results from meta-analyses of 80,607 subjects.

    Directory of Open Access Journals (Sweden)

    Jens Baumert

    Full Text Available Plasma fibrinogen is an acute phase protein playing an important role in the blood coagulation cascade having strong associations with smoking, alcohol consumption and body mass index (BMI. Genome-wide association studies (GWAS have identified a variety of gene regions associated with elevated plasma fibrinogen concentrations. However, little is yet known about how associations between environmental factors and fibrinogen might be modified by genetic variation. Therefore, we conducted large-scale meta-analyses of genome-wide interaction studies to identify possible interactions of genetic variants and smoking status, alcohol consumption or BMI on fibrinogen concentration. The present study included 80,607 subjects of European ancestry from 22 studies. Genome-wide interaction analyses were performed separately in each study for about 2.6 million single nucleotide polymorphisms (SNPs across the 22 autosomal chromosomes. For each SNP and risk factor, we performed a linear regression under an additive genetic model including an interaction term between SNP and risk factor. Interaction estimates were meta-analysed using a fixed-effects model. No genome-wide significant interaction with smoking status, alcohol consumption or BMI was observed in the meta-analyses. The most suggestive interaction was found for smoking and rs10519203, located in the LOC123688 region on chromosome 15, with a p value of 6.2 × 10(-8. This large genome-wide interaction study including 80,607 participants found no strong evidence of interaction between genetic variants and smoking status, alcohol consumption or BMI on fibrinogen concentrations. Further studies are needed to yield deeper insight in the interplay between environmental factors and gene variants on the regulation of fibrinogen concentrations.

  19. Complete genome sequence and comparative genomic analysis of an emerging human pathogen, serotype V Streptococcus agalactiae

    OpenAIRE

    Tettelin, Hervé; Masignani, Vega; Cieslewicz, Michael J.; Eisen, Jonathan A.; Peterson, Scott; Wessels, Michael R.; Paulsen, Ian T.; Nelson, Karen E.; Margarit, Immaculada; Read, Timothy D.; Madoff, Lawrence C.; Wolf, Alex M.; Beanan, Maureen J; Brinkac, Lauren M.; Sean C Daugherty

    2002-01-01

    The 2,160,267 bp genome sequence of Streptococcus agalactiae, the leading cause of bacterial sepsis, pneumonia, and meningitis in neonates in the U.S. and Europe, is predicted to encode 2,175 genes. Genome comparisons among S. agalactiae, Streptococcus pneumoniae, Streptococcus pyogenes, and the other completely sequenced genomes identified genes specific to the streptococci and to S. agalactiae. These in silico analyses, combined with comparative genome hybridization experiments between the ...

  20. Two low coverage bird genomes and a comparison of reference-guided versus de novo genome assemblies.

    Directory of Open Access Journals (Sweden)

    Daren C Card

    Full Text Available As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (∼3.5-5.5x Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana. We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies.

  1. Two low coverage bird genomes and a comparison of reference-guided versus de novo genome assemblies

    Science.gov (United States)

    Card, Daren C.; Schield, Drew R.; Reyes-Velasco, Jacobo; Fujita, Matthre K.; Andrew, Audra L.; Oyler-McCance, Sara J.; Fike, Jennifer A.; Tomback, Diana F.; Ruggiero, Robert P.; Castoe, Todd A.

    2014-01-01

    As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (~3.5–5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies.

  2. Comparisons of the M1 genome segments and encoded μ2 proteins of different reovirus isolates

    Directory of Open Access Journals (Sweden)

    Arnold Michelle M

    2004-09-01

    Full Text Available Abstract Background The reovirus M1 genome segment encodes the μ2 protein, a structurally minor component of the viral core, which has been identified as a transcriptase cofactor, nucleoside and RNA triphosphatase, and microtubule-binding protein. The μ2 protein is the most poorly understood of the reovirus structural proteins. Genome segment sequences have been reported for 9 of the 10 genome segments for the 3 prototypic reoviruses type 1 Lang (T1L, type 2 Jones (T2J, and type 3 Dearing (T3D, but the M1 genome segment sequences for only T1L and T3D have been previously reported. For this study, we determined the M1 nucleotide and deduced μ2 amino acid sequences for T2J, nine other reovirus field isolates, and various T3D plaque-isolated clones from different laboratories. Results Determination of the T2J M1 sequence completes the analysis of all ten genome segments of that prototype. The T2J M1 sequence contained a 1 base pair deletion in the 3' non-translated region, compared to the T1L and T3D M1 sequences. The T2J M1 gene showed ~80% nucleotide homology, and the encoded μ2 protein showed ~71% amino acid identity, with the T1L and T3D M1 and μ2 sequences, respectively, making the T2J M1 gene and μ2 proteins amongst the most divergent of all reovirus genes and proteins. Comparisons of these newly determined M1 and μ2 sequences with newly determined M1 and μ2 sequences from nine additional field isolates and a variety of laboratory T3D clones identified conserved features and/or regions that provide clues about μ2 structure and function. Conclusions The findings suggest a model for the domain organization of μ2 and provide further evidence for a role of μ2 in viral RNA synthesis. The new sequences were also used to explore the basis for M1/μ2-determined differences in the morphology of viral factories in infected cells. The findings confirm the key role of Ser/Pro208 as a prevalent determinant of differences in factory morphology

  3. Multiple Comparison Analysis of Two New Genomic Sequences of ILTV Strains from China with Other Strains from Different Geographic Regions.

    Science.gov (United States)

    Zhao, Yan; Kong, Congcong; Wang, Yunfeng

    2015-01-01

    To date, twenty complete genome sequences of ILTV strains have been published in GenBank, including one strain from China, and nineteen strains from Australian and the United States. To investigate the genomic information on ILTVs from different geographic regions, two additional individual complete genome sequences of WG and K317 strains from China were determined. The genomes of WG and K317 strains were 153,505 and 153,639 bp in length, respectively. Alignments performed on the amino acid sequences of the twelve glycoproteins showed that 13 out of 116 mutational sites were present only among the Chinese strain WG and the Australian strains SA2 and A20. The phylogenetic tree analysis suggested that the WG strain established close relationships with the Australian strain SA2. The recombination events were detected and confirmed in different subregions of the WG strain with the sequences of SA2 and K317 strains as parental. In this study, two new complete genome sequences of Chinese ILTV strains were used in comparative analysis with other complete genome sequences of ILTV strains from China, the United States, and Australia. The analysis of genome comparison, phylogenetic trees, and recombination events showed close relationships among the Chinese strain WG and the Australian strains SA2. The information of the two new complete genome sequences from China will help to facilitate the analysis of phylogenetic relationships and the molecular differences among ILTV strains from different geographic regions. PMID:26186451

  4. Comparison of Metal and Metal Oxide Media for Phosphopeptide Enrichment Prior to Mass Spectrometric Analyses

    OpenAIRE

    Gates, Matthew B.; Tomer, Kenneth B.; Deterding, Leesa J.

    2010-01-01

    Several affinity resins consisting of ionic metals or metal oxides were investigated for their phosphopeptide enrichment capabilities with subsequent mass spectrometric analyses. Commercially-available enrichment MOAC resins using manufacturer’s and/or published protocols were compared and evaluated for the most efficient and selective method that could be implemented as a standard enrichment procedure. From these comparative analyses using a tryptic digest of casein proteins, it was determin...

  5. Genomic comparison of multi-drug resistant invasive and colonizing Acinetobacter baumannii isolated from diverse human body sites reveals genomic plasticity

    Directory of Open Access Journals (Sweden)

    Hsiao William W

    2011-06-01

    Full Text Available Abstract Background Acinetobacter baumannii has recently emerged as a significant global pathogen, with a surprisingly rapid acquisition of antibiotic resistance and spread within hospitals and health care institutions. This study examines the genomic content of three A. baumannii strains isolated from distinct body sites. Isolates from blood, peri-anal, and wound sources were examined in an attempt to identify genetic features that could be correlated to each isolation source. Results Pulsed-field gel electrophoresis, multi-locus sequence typing and antibiotic resistance profiles demonstrated genotypic and phenotypic variation. Each isolate was sequenced to high-quality draft status, which allowed for comparative genomic analyses with existing A. baumannii genomes. A high resolution, whole genome alignment method detailed the phylogenetic relationships of sequenced A. baumannii and found no correlation between phylogeny and body site of isolation. This method identified genomic regions unique to both those isolates found on the surface of the skin or in wounds, termed colonization isolates, and those identified from body fluids, termed invasive isolates; these regions may play a role in the pathogenesis and spread of this important pathogen. A PCR-based screen of 74 A. baumanii isolates demonstrated that these unique genes are not exclusive to either phenotype or isolation source; however, a conserved genomic region exclusive to all sequenced A. baumannii was identified and verified. Conclusions The results of the comparative genome analysis and PCR assay show that A. baumannii is a diverse and genomically variable pathogen that appears to have the potential to cause a range of human disease regardless of the isolation source.

  6. Genome-wide comparison of ferritin family from Archaea, Bacteria, Eukarya, and Viruses: its distribution, characteristic motif, and phylogenetic relationship

    Science.gov (United States)

    Bai, Lina; Xie, Ting; Hu, Qingqing; Deng, Changyan; Zheng, Rong; Chen, Wanping

    2015-10-01

    Ferritins are highly conserved proteins that are widely distributed in various species from archaea to humans. The ubiquitous characteristic of these proteins reflects the pivotal contribution of ferritins to the safe storage and timely delivery of iron to achieve iron homeostasis. This study investigated the ferritin genes in 248 genomes from various species, including viruses, archaea, bacteria, and eukarya. The distribution comparison suggests that mammals and eudicots possess abundant ferritin genes, whereas fungi contain very few ferritin genes. Archaea and bacteria show considerable numbers of ferritin genes. Generally, prokaryotes possess three types of ferritin (the typical ferritin, bacterioferritin, and DNA-binding protein from starved cell), whereas eukaryotes have various subunit types of ferritin, thereby indicating the individuation of the ferritin family during evolution. The characteristic motif analysis of ferritins suggested that all key residues specifying the unique structural motifs of ferritin are highly conserved across three domains of life. Meanwhile, the characteristic motifs were also distinguishable between ferritin groups, especially phytoferritins, which show a plant-specific motif. The phylogenetic analyses show that ferritins within the same subfamily or subunits are generally clustered together. The phylogenetic relationships among ferritin members suggest that both gene duplication and horizontal transfer contribute to the wide variety of ferritins, and their possible evolutionary scenario was also proposed. The results contribute to a better understanding of the distribution, characteristic motif, and evolutionary relationship of the ferritin family.

  7. Analyse: The Comparison of Czech and Foreign ASP/SAAS Providers

    Directory of Open Access Journals (Sweden)

    Beer Daniela

    2008-10-01

    Full Text Available This report gives an overview of the current Application Service Provider´s (ASP market, especially the offered services. A questionnaire was sent to more than 20 companies which offer some kind of ASP / SaaS, but none of them replied. For this reason all comparisons were done only on the basis of their websites. Chapter three gives an overview of ASP and SaaS definitions, as well as describing the differences between the two. The analysis is focused on the comparison of Czech and foreign companies. Czech companies were compared in accordance with the survey. As the foreign companies offered all services inquired in the survey given to Czech companies, comparison was instead based on the classification of the kind of ASP models offered such as business ASP, enterprice ASP, functional oriented ASP, vertical market ASP and ASP aggregators. At the end of this report, some trends in the ASP market are mentioned.

  8. Reproducibility of oligonucleotide microarray transcriptome analyses - An interlaboratory comparison using chemostat cultures of Saccharomyces cerevisiae

    DEFF Research Database (Denmark)

    Piper, M.D.W.; Daran-Lapujade, P.; Bro, Christoffer;

    2002-01-01

    -microarray analysis in functional genomics and metabolic engineering, we designed a set of experiments to specifically address this issue. Saccharomyces cerevisiae CEN.PK113-7D was grown under defined,conditions in, glucose-limited chemostats, followed by transcriptome analysis with Affymetrix Gene-Chip arrays. In...... each of the laboratories, three independent replicate cultures were grown aerobically as well as anaerobically. Although variations introduced by in vitro handling steps were small and unbiased, greater variation from replicate cultures underscored that, to obtain reliable information, experimental...

  9. Accounting for multiple comparisons in a genome-wide association study (GWAS

    Directory of Open Access Journals (Sweden)

    Lautenberger James A

    2010-12-01

    Full Text Available Abstract Background As we enter an era when testing millions of SNPs in a single gene association study will become the standard, consideration of multiple comparisons is an essential part of determining statistical significance. Bonferroni adjustments can be made but are conservative due to the preponderance of linkage disequilibrium (LD between genetic markers, and permutation testing is not always a viable option. Three major classes of corrections have been proposed to correct the dependent nature of genetic data in Bonferroni adjustments: permutation testing and related alternatives, principal components analysis (PCA, and analysis of blocks of LD across the genome. We consider seven implementations of these commonly used methods using data from 1514 European American participants genotyped for 700,078 SNPs in a GWAS for AIDS. Results A Bonferroni correction using the number of LD blocks found by the three algorithms implemented by Haploview resulted in an insufficiently conservative threshold, corresponding to a genome-wide significance level of α = 0.15 - 0.20. We observed a moderate increase in power when using PRESTO, SLIDE, and simpleℳ when compared with traditional Bonferroni methods for population data genotyped on the Affymetrix 6.0 platform in European Americans (α = 0.05 thresholds between 1 × 10-7 and 7 × 10-8. Conclusions Correcting for the number of LD blocks resulted in an anti-conservative Bonferroni adjustment. SLIDE and simpleℳ are particularly useful when using a statistical test not handled in optimized permutation testing packages, and genome-wide corrected p-values using SLIDE, are much easier to interpret for consumers of GWAS studies.

  10. Systems genetics of obesity in an F2 pig model by genome-wide association, genetic network and pathway analyses

    DEFF Research Database (Denmark)

    Kogelman, Lisette; Pant, Sameer Dinkar; Fredholm, Merete;

    2014-01-01

    Obesity is a complex condition with world-wide exponentially rising prevalence rates, linked with severe diseases like Type 2 Diabetes. Economic and welfare consequences have led to a raised interest in a better understanding of the biological and genetic background. To date, whole genome...... of obesity-related phenotypes and genotyped using the 60K SNP chip. Firstly, Genome Wide Association (GWA) analysis was performed on the Obesity Index to locate candidate genomic regions that were further validated using combined Linkage Disequilibrium Linkage Analysis and investigated by evaluation...

  11. DO CANCER CLINICAL TRIAL POPULATIONS TRULY REPRESENT CANCER PATIENTS? A COMPARISON OF OPEN CLINICAL TRIALS TO THE CANCER GENOME ATLAS

    Science.gov (United States)

    Geifman, Nophar; Butte, Atul J.

    2016-01-01

    Open clinical trial data offer many opportunities for the scientific community to independently verify published results, evaluate new hypotheses and conduct meta-analyses. These data provide a springboard for scientific advances in precision medicine but the question arises as to how representative clinical trials data are of cancer patients overall. Here we present the integrative analysis of data from several cancer clinical trials and compare these to patient-level data from The Cancer Genome Atlas (TCGA). Comparison of cancer type-specific survival rates reveals that these are overall lower in trial subjects. This effect, at least to some extent, can be explained by the more advanced stages of cancer of trial subjects. This analysis also reveals that for stage IV cancer, colorectal cancer patients have a better chance of survival than breast cancer patients. On the other hand, for all other stages, breast cancer patients have better survival than colorectal cancer patients. Comparison of survival in different stages of disease between the two datasets reveals that subjects with stage IV cancer from the trials dataset have a lower chance of survival than matching stage IV subjects from TCGA. One likely explanation for this observation is that stage IV trial subjects have lower survival rates since their cancer is less likely to respond to treatment. To conclude, we present here a newly available clinical trials dataset which allowed for the integration of patient-level data from many cancer clinical trials. Our comprehensive analysis reveals that cancer-related clinical trials are not representative of general cancer patient populations, mostly due to their focus on the more advanced stages of the disease. These and other limitations of clinical trials data should, perhaps, be taken into consideration in medical research and in the field of precision medicine. PMID:26776196

  12. Genomic and Haplotype Comparison of Butanol Producing Bacteria Based on 16S rDNA

    Directory of Open Access Journals (Sweden)

    Ekwan Nofa Wiratno

    2016-04-01

    Full Text Available High butanol demand for transportation fuel triggers butanol production development. Exploration of butanolproducing bacteria using genomic comparison and biogeography will help to develop butanol industry. The objectives of this research were butanol production, genome comparison and haplotype analysis of butanolproducing bacteria from Ranu Pani Lake sediment using 16S rDNA sequences. The highest butanol concentrations were showed by Paenibacillus polymyxa RP 2.2 isolate (10.34 g.L-1, followed by Bacillus methylotrophicus RP 3.2 and B. methylotrophicus RP 7.2 isolate (10.11 g.L-1 and 9.63 g.L-1 respectively. Paenibacillus polymyxa RP 2.2 showed similarity in nucleotide composition (ATGC with B. methylotrophicus RP 3.2, B. methylotrophicus RP 7.2, P. polymyxa CR1, Bacillus amyloliquefaciens NELB-12, and Paenibacillus polymyxa WR-2. Clostridium acetobutylicum ATCC 824 showed similarity in nucleotide composition (ATGC with Clostridium saccharoperbutylacetonicum N1-4, and Clostridium saccharobutylicum Ox29. The lowest G+C content was C. saccharobutylicum Ox29 (51.35%, and the highest was B. methylotrophicus RP 7.2 (55.33%. Conserved region of 16S rDNA (1044 bp were consisted of 17 conserved sequences. The number of Parsimony Informative Site (PIS was 319 spot and single tone was 48 spot. We found in this study that all of butanolproducing bacterial DNA sequences have clustered to 8 haplotypes. Based on the origin of sample, there were three haplotype groups. Bacteria from group A were could produce butanol 8.9-10.34 g.L-1, group B 9.2-14.2 g.L-1 and group C was could produce butanol 0.47 g.L-1. The haplotype analysis of bacteria based on 16S rDNA sequences in this study could predict capability of butanol production.

  13. Caisson Movement Caused by Wave Slamming—a Comparison of ABAQUS and FLAC Analyses

    DEFF Research Database (Denmark)

    Andersen, Lars; Burcharth, Hans F.; Andersen, Thomas Lykke;

    2010-01-01

    During wave slamming, caisson movement may occur as a combination of sliding along the caisson–foundation interface and local failure in the foundation and seabed. The paper presents a comparison between different techniques applied to the analysis of this movement. Thus, a finite-difference...

  14. A Bacterial Analysis Platform: An Integrated System for Analysing Bacterial Whole Genome Sequencing Data for Clinical Diagnostics and Surveillance

    DEFF Research Database (Denmark)

    Thomsen, Martin Christen Frølund; Ahrenfeldt, Johanne; Bellod Cisneros, Jose Luis;

    2016-01-01

    and antimicrobial resistance genes. A short printable report for each sample will be provided and an Excel spreadsheet containing all the metadata and a summary of the results for all submitted samples can be downloaded. The pipeline was benchmarked using datasets previously used to test the...... web-based tools we developed a single pipeline for batch uploading of whole genome sequencing data from multiple bacterial isolates. The pipeline will automatically identify the bacterial species and, if applicable, assemble the genome, identify the multilocus sequence type, plasmids, virulence genes...... platform was developed and made publicly available, providing easy-to-use automated analysis of bacterial whole genome sequencing data. The platform may be of immediate relevance as a guide for investigators using whole genome sequencing for clinical diagnostics and surveillance. The platform is freely...

  15. Genome and metagenome analyses reveal adaptive evolution of the host and interaction with the gut microbiota in the goose

    Science.gov (United States)

    Gao, Guangliang; Zhao, Xianzhi; Li, Qin; He, Chuan; Zhao, Wenjing; Liu, Shuyun; Ding, Jinmei; Ye, Weixing; Wang, Jun; Chen, Ye; Wang, Haiwei; Li, Jing; Luo, Yi; Su, Jian; Huang, Yong; Liu, Zuohua; Dai, Ronghua; Shi, Yixiang; Meng, He; Wang, Qigui

    2016-01-01

    The goose is an economically important waterfowl that exhibits unique characteristics and abilities, such as liver fat deposition and fibre digestion. Here, we report de novo whole-genome assemblies for the goose and swan goose and describe the evolutionary relationships among 7 bird species, including domestic and wild geese, which diverged approximately 3.4~6.3 million years ago (Mya). In contrast to chickens as a proximal species, the expanded and rapidly evolving genes found in the goose genome are mainly involved in metabolism, including energy, amino acid and carbohydrate metabolism. Further integrated analysis of the host genome and gut metagenome indicated that the most widely shared functional enrichment of genes occurs for functions such as glycolysis/gluconeogenesis, starch and sucrose metabolism, propanoate metabolism and the citrate cycle. We speculate that the unique physiological abilities of geese benefit from the adaptive evolution of the host genome and symbiotic interactions with gut microbes. PMID:27608918

  16. Adapting functional genomic tools to metagenomic analyses: investigating the role of gut bacteria in relation to obesity

    OpenAIRE

    Liu, Yuanhua; Zhang, Chenhong; Zhao, Liping; Nardini, Christine

    2010-01-01

    With the expanding availability of sequencing technologies, research previously centered on the human genome can now afford to include the study of humans’ internal ecosystem (human microbiome). Given the scale of the data involved in this metagenomic research (two orders of magnitude larger than the human genome) and their importance in relation to human health, it is crucial to guarantee (along with the appropriate data collection and taxonomy) proper tools for data analysis. We propose to ...

  17. Genome-Wide Search Identifies 1.9 Mb from the Polar Bear Y Chromosome for Evolutionary Analyses.

    Science.gov (United States)

    Bidon, Tobias; Schreck, Nancy; Hailer, Frank; Nilsson, Maria A; Janke, Axel

    2015-07-01

    The male-inherited Y chromosome is the major haploid fraction of the mammalian genome, rendering Y-linked sequences an indispensable resource for evolutionary research. However, despite recent large-scale genome sequencing approaches, only a handful of Y chromosome sequences have been characterized to date, mainly in model organisms. Using polar bear (Ursus maritimus) genomes, we compare two different in silico approaches to identify Y-linked sequences: 1) Similarity to known Y-linked genes and 2) difference in the average read depth of autosomal versus sex chromosomal scaffolds. Specifically, we mapped available genomic sequencing short reads from a male and a female polar bear against the reference genome and identify 112 Y-chromosomal scaffolds with a combined length of 1.9 Mb. We verified the in silico findings for the longer polar bear scaffolds by male-specific in vitro amplification, demonstrating the reliability of the average read depth approach. The obtained Y chromosome sequences contain protein-coding sequences, single nucleotide polymorphisms, microsatellites, and transposable elements that are useful for evolutionary studies. A high-resolution phylogeny of the polar bear patriline shows two highly divergent Y chromosome lineages, obtained from analysis of the identified Y scaffolds in 12 previously published male polar bear genomes. Moreover, we find evidence of gene conversion among ZFX and ZFY sequences in the giant panda lineage and in the ancestor of ursine and tremarctine bears. Thus, the identification of Y-linked scaffold sequences from unordered genome sequences yields valuable data to infer phylogenomic and population-genomic patterns in bears. PMID:26019166

  18. Whole Genome and Global Gene Expression Analyses of the Model Mushroom Flammulina velutipes Reveal a High Capacity for Lignocellulose Degradation

    OpenAIRE

    Park, Young-Jin; Baek, Jeong Hun; Lee, Seonwook; Kim, Changhoon; Rhee, Hwanseok; Kim, Hyungtae; Seo, Jeong-Sun; Park, Hae-Ran; Yoon, Dae-Eun; Nam, Jae-Young; Kim, Hong-Il; Kim, Jong-Guk; Yoon, Hyeokjun; Kang, Hee-Wan; Cho, Jae-Yong

    2014-01-01

    Flammulina velutipes is a fungus with health and medicinal benefits that has been used for consumption and cultivation in East Asia. F. velutipes is also known to degrade lignocellulose and produce ethanol. The overlapping interests of mushroom production and wood bioconversion make F. velutipes an attractive new model for fungal wood related studies. Here, we present the complete sequence of the F. velutipes genome. This is the first sequenced genome for a commercially produced edible mushro...

  19. Genome-wide association analyses of child genotype effects and parent-of-origin effects in specific language impairment

    OpenAIRE

    Nudel, R; Simpson, N.; Baird, G; O’Hare, A; Conti-Ramsden, G; Bolton, P.; Hennessy, E.; The SLli consortium,; Ring, S; G. Smith; Francks, C.; Paracchini, S.; Monaco, A.; Fisher, S.; Newbury, D.

    2014-01-01

    Specific language impairment (SLI) is a neurodevelopmental disorder that affects linguistic abilities when development is otherwise normal. We report the results of a genome-wide association study of SLI which included parent-of-origin effects and child genotype effects and used 278 families of language-impaired children. The child genotype effects analysis did not identify significant associations. We found genome-wide significant paternal parent-of-origin effects on chromosome 14q12 (P = 3....

  20. Heavy neutral MSSM Higgs bosons at the photon linear collider - a comparison of two analyses

    Indian Academy of Sciences (India)

    M Spira; P Nieżurawski; M Krawczyk; A F Żarnecki

    2007-11-01

    Measurement of the heavy neutral MSSM Higgs bosons and production in the process → / → $b\\bar{b}$ at the Photon Linear Collider [1,2] has been considered in two independent analyses for the parameter range corresponding to the so-called `LHC wedge'. Significantly different conclusions were obtained; signal-to-background ratio 36 vs. 2. Here assumptions and results of these two analyses are compared. We have found that differences in the final results are mainly due to different assumptions on -luminosity spectra, jet definitions and selection cuts.

  1. Biochemical and full genome sequence analyses of clinical Vibrio cholerae isolates in Mexico reveals the presence of novel V. cholerae strains.

    Science.gov (United States)

    Díaz-Quiñonez, José Alberto; Hernández-Monroy, Irma; Montes-Colima, Norma Angélica; Moreno-Pérez, María Asunción; Galicia-Nicolás, Adriana Guadalupe; López-Martínez, Irma; Ruiz-Matus, Cuitláhuac; Kuri-Morales, Pablo; Ortíz-Alcántara, Joanna María; Garcés-Ayala, Fabiola; Ramírez-González, José Ernesto

    2016-05-01

    The first week of September 2013, the National Epidemiological Surveillance System identified two cases of cholera in Mexico City. The cultures of both samples were confirmed as Vibrio cholerae serogroup O1, serotype Ogawa, biotype El Tor. Initial analyses by PFGE and by PCR-amplification of the virulence genes, suggested that both strains were similar, but different from those previously reported in Mexico. The following week, four more cases were identified in a community in the state of Hidalgo, located 121 km northeast of Mexico City. Thereafter a cholera outbreak started in the region of La Huasteca. Genomic analyses of the four strains obtained in this study confirmed the presence of Pathogenicity Islands VPI-1 and -2, VSP-1 and -2, and of the integrative element SXT. The genomic structure of the 4 isolates was similar to that of V. cholerae strain 2010 EL-1786, identified during the epidemic in Haiti in 2010. PMID:26828665

  2. Genome-Wide Association Analyses in 128,266 Individuals Identifies New Morningness and Sleep Duration Loci

    Science.gov (United States)

    Jones, Samuel E.; Tyrrell, Jessica; Tuke, Marcus A.; Yaghootkar, Hanieh; Hu, Youna; Teder-Laving, Maris; Hayward, Caroline; Roenneberg, Till; Del Greco, Fabiola; Hicks, Andrew A.; Shin, Chol; Metspalu, Andres; Byrne, Enda M.; Gehrman, Philip R.; Tiemeier, Henning; Allebrandt, Karla V.; Murray, Anna; Hinds, David A.

    2016-01-01

    Disrupted circadian rhythms and reduced sleep duration are associated with several human diseases, particularly obesity and type 2 diabetes, but until recently, little was known about the genetic factors influencing these heritable traits. We performed genome-wide association studies of self-reported chronotype (morning/evening person) and self-reported sleep duration in 128,266 white British individuals from the UK Biobank study. Sixteen variants were associated with chronotype (P<5x10-8), including variants near the known circadian rhythm genes RGS16 (1.21 odds of morningness, 95% CI [1.15, 1.27], P = 3x10-12) and PER2 (1.09 odds of morningness, 95% CI [1.06, 1.12], P = 4x10-10). The PER2 signal has previously been associated with iris function. We sought replication using self-reported data from 89,283 23andMe participants; thirteen of the chronotype signals remained associated at P<5x10-8 on meta-analysis and eleven of these reached P<0.05 in the same direction in the 23andMe study. We also replicated 9 additional variants identified when the 23andMe study was used as a discovery GWAS of chronotype (all P<0.05 and meta-analysis P<5x10-8). For sleep duration, we replicated one known signal in PAX8 (2.6 minutes per allele, 95% CI [1.9, 3.2], P = 5.7x10-16) and identified and replicated two novel associations at VRK2 (2.0 minutes per allele, 95% CI [1.3, 2.7], P = 1.2x10-9; and 1.6 minutes per allele, 95% CI [1.1, 2.2], P = 7.6x10-9). Although we found genetic correlation between chronotype and BMI (rG = 0.056, P = 0.05); undersleeping and BMI (rG = 0.147, P = 1x10-5) and oversleeping and BMI (rG = 0.097, P = 0.04), Mendelian Randomisation analyses, with limited power, provided no consistent evidence of causal associations between BMI or type 2 diabetes and chronotype or sleep duration. Our study brings the total number of loci associated with chronotype to 22 and with sleep duration to three, and provides new insights into the biology of sleep and circadian

  3. Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae).

    Science.gov (United States)

    Hovde, Blake T; Deodato, Chloe R; Hunsperger, Heather M; Ryken, Scott A; Yost, Will; Jha, Ramesh K; Patterson, Johnathan; Monnat, Raymond J; Barlow, Steven B; Starkenburg, Shawn R; Cattolico, Rose Ann

    2015-01-01

    Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb), compact (∼ 40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two "red" RuBisCO activases that are shared across many algal lineages. The Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes. PMID:26397803

  4. Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae.

    Directory of Open Access Journals (Sweden)

    Blake T Hovde

    Full Text Available Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales, is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales, and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb, compact (∼ 40% of the genome is protein coding and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two "red" RuBisCO activases that are shared across many algal lineages. The Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes.

  5. A Phylogenomic Study of the Genus Alphavirus Employing Whole Genome Comparison

    Directory of Open Access Journals (Sweden)

    James J. Campanella

    2006-04-01

    Full Text Available The phylogenetics of the genus Alphavirus have historically been characterized using partial gene, single gene or partial proteomic data. We have mined cDNA and amino acid sequences from GenBank for all fully sequenced and some partially sequenced alphaviruses and generated phylogenomic analyses of the genus Alphavirus genus, employing capsid encoding structural regions, non-structural coding regions and complete viral genomes. Our studies support the presence of the previously reported recombination event that produced the Western Equine Encephalitis clade, and confirm many of the patterns of geographic radiation and divergence of the multiple species. Our data suggest that the Salmon Pancreatic Disease Virus and Sleeping Disease Virus are sufficiently divergent to form a separate clade from the other alphaviruses. Also, unlike previously reported studies employing limited sequence data for correlation of phylogeny, our results indicate that the Barmah Forest Virus and Middelburg Virus appear to be members of the Semliki Forest clade. Additionally, our analysis indicates that the Southern Elephant Seal Virus is part of the Semliki Forest clade, although still phylogenetically distant from all known members of the genus Alphavirus. Finally, we demonstrate that the whole Rubella viral genome provides an ideal outgroup for phylogenomic studies of the genus Alphavirus.

  6. Identification of Ohnolog Genes Originating from Whole Genome Duplication in Early Vertebrates, Based on Synteny Comparison across Multiple Genomes

    OpenAIRE

    Priya Singh, Param; Arora, Jatin; Isambert, Hervé

    2015-01-01

    Whole genome duplications (WGD) have now been firmly established in all major eukary-otic kingdoms. In particular, all vertebrates descend from two rounds of WGDs, that occurred in their jawless ancestor some 500 MY ago. Paralogs retained from WGD, also coined 'ohnologs' after Susumu Ohno, have been shown to be typically associated with development, signaling and gene regulation. Ohnologs, which amount to about 20 to 35% of genes in the human genome, have also been shown to be prone to domina...

  7. Comparison between CARIBIC Aerosol Samples Analysed by Accelerator-Based Methods and Optical Particle Counter Measurements

    OpenAIRE

    B. G. Martinsson; J. Friberg; Andersson, S M; Weigelt, A; Hermann, M.; D. Assmann; J. Voigtländer; C. A. M. Brenninkmeijer; Velthoven, P. J. F.; Zahn, A.

    2014-01-01

    Inter-comparison of results from two kinds of aerosol systems in the CARIBIC (Civil Aircraft for the Regular Investigation of the atmosphere Based on a Instrument Container) passenger aircraft based observatory, operating during intercontinental flights at 9–12 km altitude, is presented. Aerosol from the lowermost stratosphere (LMS), the extra-tropical upper troposphere (UT) and the tropical mid troposphere (MT) were investigated. Aerosol particle volume concentration measur...

  8. Genome-Wide Identification and Validation of Reference Genes in Infected Tomato Leaves for Quantitative RT-PCR Analyses.

    Science.gov (United States)

    Müller, Oliver A; Grau, Jan; Thieme, Sabine; Prochaska, Heike; Adlung, Norman; Sorgatz, Anika; Bonas, Ulla

    2015-01-01

    The Gram-negative bacterium Xanthomonas campestris pv. vesicatoria (Xcv) causes bacterial spot disease of pepper and tomato by direct translocation of type III effector proteins into the plant cell cytosol. Once in the plant cell the effectors interfere with host cell processes and manipulate the plant transcriptome. Quantitative RT-PCR (qRT-PCR) is usually the method of choice to analyze transcriptional changes of selected plant genes. Reliable results depend, however, on measuring stably expressed reference genes that serve as internal normalization controls. We identified the most stably expressed tomato genes based on microarray analyses of Xcv-infected tomato leaves and evaluated the reliability of 11 genes for qRT-PCR studies in comparison to four traditionally employed reference genes. Three different statistical algorithms, geNorm, NormFinder and BestKeeper, concordantly determined the superiority of the newly identified reference genes. The most suitable reference genes encode proteins with homology to PHD finger family proteins and the U6 snRNA-associated protein LSm7. In addition, we identified pepper orthologs and validated several genes as reliable normalization controls for qRT-PCR analysis of Xcv-infected pepper plants. The newly identified reference genes will be beneficial for future qRT-PCR studies of the Xcv-tomato and Xcv-pepper pathosystems, as well as for the identification of suitable normalization controls for qRT-PCR studies of other plant-pathogen interactions, especially, if related plant species are used in combination with bacterial pathogens. PMID:26313760

  9. Genome-Wide Identification and Validation of Reference Genes in Infected Tomato Leaves for Quantitative RT-PCR Analyses.

    Directory of Open Access Journals (Sweden)

    Oliver A Müller

    Full Text Available The Gram-negative bacterium Xanthomonas campestris pv. vesicatoria (Xcv causes bacterial spot disease of pepper and tomato by direct translocation of type III effector proteins into the plant cell cytosol. Once in the plant cell the effectors interfere with host cell processes and manipulate the plant transcriptome. Quantitative RT-PCR (qRT-PCR is usually the method of choice to analyze transcriptional changes of selected plant genes. Reliable results depend, however, on measuring stably expressed reference genes that serve as internal normalization controls. We identified the most stably expressed tomato genes based on microarray analyses of Xcv-infected tomato leaves and evaluated the reliability of 11 genes for qRT-PCR studies in comparison to four traditionally employed reference genes. Three different statistical algorithms, geNorm, NormFinder and BestKeeper, concordantly determined the superiority of the newly identified reference genes. The most suitable reference genes encode proteins with homology to PHD finger family proteins and the U6 snRNA-associated protein LSm7. In addition, we identified pepper orthologs and validated several genes as reliable normalization controls for qRT-PCR analysis of Xcv-infected pepper plants. The newly identified reference genes will be beneficial for future qRT-PCR studies of the Xcv-tomato and Xcv-pepper pathosystems, as well as for the identification of suitable normalization controls for qRT-PCR studies of other plant-pathogen interactions, especially, if related plant species are used in combination with bacterial pathogens.

  10. Beyond genomic variation - comparison and functional annotation of three Brassica rapa genomes: a turnip, a rapid cycling and a Chinese cabbage

    OpenAIRE

    Lin, Ke; Zhang, Ningwen; Severing, Edouard I.; Nijveen, Harm; Cheng, Feng; Visser, Richard GF; Wang, Xiaowu; de Ridder, Dick; Bonnema, Guusje

    2014-01-01

    Background Brassica rapa is an economically important crop species. During its long breeding history, a large number of morphotypes have been generated, including leafy vegetables such as Chinese cabbage and pakchoi, turnip tuber crops and oil crops. Results To investigate the genetic variation underlying this morphological variation, we re-sequenced, assembled and annotated the genomes of two B. rapa subspecies, turnip crops (turnip) and a rapid cycling. We then analysed the two resulting ge...

  11. Pion-Induced Fission of 209Bi and 119Sn:. Measurements, Calculations, Analyses and Comparison

    Science.gov (United States)

    Rana, Mukhtar Ahmed; Sher, Gul; Manzoor, Shahid; Shehzad, M. I.

    Cross-sections for the π--induced fission of 209Bi and 119Sn have been measured using the most sensitive CR-39 solid-state nuclear track detector. In experiments, target-detector stacks were exposed to negative pions of energy 500, 672, 1068, and 1665 MeV at the Brookhaven National Laboratory, USA. An important aspect of the present paper is the comparison of pion-induced fission fragment spectra of above mentioned nuclei with the spontaneous fission fragment spectra of 252Cf. This comparison is made in terms of fission fragment track lengths in the CR-39 detectors. Measurement results are compared with calculations of Monte Carlo and statistical weight functions methods using the computer code CEM95. Agreement between measurements and calculations is fairly good for 209Bi target nuclei whereas it is indigent for the case of 119Sn. The possibilities of the trustworthy calculations, using the computer code CEM95, comparable with measurements of pion-induced fission in intermediate and heavy nuclei are explored by employing various systematics available in the code. Energy dependence of pion-induced fission in 119Sn and 209Bi is analyzed employing a newly defined parameter geometric-size-normalized fission cross-section (χfg). It is found that the collective nuclear excitations, which may lead to fission, become more probable for both 209Bi and 119Sn nuclei with increasing energy of negative pions from 500 to 1665 MeV.

  12. A report on the inter comparison of isotopic analyses by mass spectrometry for the laser enrichment of carbon-13

    International Nuclear Information System (INIS)

    A method has been standardized for the mass spectral analysis of (13C/12C) ratio in the isotopically enriched C2F4 photoproduct obtained by the CO2 laser photolysis of natural CF2HCI sample. For improving the quality of the spectra as well as enchancing the detection level of the product at very low concentrations, a pre-concentration technique has been developed by gas chromatography. Inter comparison of the results for analyses carried out with two different mass spectrometers, viz., a commercial instrument available at the Land PT Division and an indigenously built one by MS and ES, BARC showed a very good agreement. (author)

  13. Research on the Comparison Analyses of Three-Phase Discrete and Integrated LC Filters in Three-Phase PV Inverter

    OpenAIRE

    Ying Jiang; Jianguo Li; Sanbo Pan; Xi Zhang; Peng Hu; Haiyan Zhang

    2013-01-01

    In three-phase photovoltaic (PV) system, three-phase filter inductors are important part for the output electrical power quality. The comparison analyses of three-phase discrete filter inductors and two kinds of three-phase integrated filter inductors in three-phase PV inverter are proposed. Firstly, the three-phase PV inverter operation with discrete filter inductors is analyzed, and the design of discrete filter inductors is given; then operation of the three-phase PV inverter with three-ph...

  14. Genome-Wide Analyses Suggest Mechanisms Involving Early B-Cell Development in Canine IgA Deficiency

    OpenAIRE

    Olsson, Mia; Tengvall, Katarina; Frankowiack, Marcel; Kierczak, Marcin; Bergvall, Kerstin; Axelsson, Erik; Tintle, Linda; Marti, Eliane; Roosje, Petra; Leeb, Tosso; Hedhammar, Åke; Hammarström, Lennart; Lindblad-Toh, Kerstin

    2015-01-01

    Immunoglobulin A deficiency (IgAD) is the most common primary immune deficiency disorder in both humans and dogs, characterized by recurrent mucosal tract infections and a predisposition for allergic and other immune mediated diseases. In several dog breeds, low IgA levels have been observed at a high frequency and with a clinical resemblance to human IgAD. In this study, we used genome-wide association studies (GWAS) to identify genomic regions associated with low IgA levels in dogs as a com...

  15. Genome-Wide Analyses Suggest Mechanisms Involving Early B-Cell Development in Canine IgA Deficiency.

    OpenAIRE

    Olsson, Mia; Tengvall, Katarina; Frankowiack, Marcel; Kierczak, Marcin; Bergvall, Kerstin; Axelsson, Erik; Tintle, Linda; Marti, Eliane Isabelle; Roosje, Petra; Leeb, Tosso; Hedhammar, Åke; Hammarström, Lennart; Lindblad-Toh, Kerstin

    2015-01-01

    Immunoglobulin A deficiency (IgAD) is the most common primary immune deficiency disorder in both humans and dogs, characterized by recurrent mucosal tract infections and a predisposition for allergic and other immune mediated diseases. In several dog breeds, low IgA levels have been observed at a high frequency and with a clinical resemblance to human IgAD. In this study, we used genome-wide association studies (GWAS) to identify genomic regions associated with low IgA levels in dogs as a com...

  16. Genomic and proteomic analyses of Mycobacterium bovis BCG Mexico 1931 reveal a diverse immunogenic repertoire against tuberculosis infection

    OpenAIRE

    López-Vidal Yolanda; Mendoza-Hernández Guillermo; Hernández-González Ismael L; Arvizu Adriana; Cevallos Miguel A; de León Samuel; Orduña Patricia

    2011-01-01

    Abstract Background Studies of Mycobacterium bovis BCG strains used in different countries and vaccination programs show clear variations in the genomes and immune protective properties of BCG strains. The aim of this study was to characterise the genomic and immune proteomic profile of the BCG 1931 strain used in Mexico. Results BCG Mexico 1931 has a circular chromosome of 4,350,386 bp with a G+C content and numbers of genes and pseudogenes similar to those of BCG Tokyo and BCG Pasteur. BCG ...

  17. The comparison of classification analyses on the public opinion focus of biofuels based on twitter

    Science.gov (United States)

    Wu, Shianghau; Guo, Jiannjong

    2016-02-01

    The study combined with the text mining and classification methodology to investigate the biofuels related tweets on the social network so as to understand the general public opinion focus. The contribution of the study enclosed the subsequent two points. First, the study utilized the text mining method to explore the content of biofuels connected tweets on the famed social network "twitter" and found the main keywords consistent with their frequencies. Second, the study applied the Back Propagation Neural Network (BPN), random forests model and two sorts of hybrid algorithmclassification analyses to compare the classification results.

  18. Genomic comparison of the endophyte Herbaspirillum seropedicae SmR1 and the phytopathogen Herbaspirillum rubrisubalbicans M1 by suppressive subtractive hybridization and partial genome sequencing.

    Science.gov (United States)

    Monteiro, Rose A; Balsanelli, Eduardo; Tuleski, Thalita; Faoro, Helison; Cruz, Leonardo M; Wassem, Roseli; de Baura, Valter A; Tadra-Sfeir, Michelle Z; Weiss, Vinícius; DaRocha, Wanderson D; Muller-Santos, Marcelo; Chubatsu, Leda S; Huergo, Luciano F; Pedrosa, Fábio O; de Souza, Emanuel M

    2012-05-01

    Herbaspirillum rubrisubalbicans M1 causes the mottled stripe disease in sugarcane cv. B-4362. Inoculation of this cultivar with Herbaspirillum seropedicae SmR1 does not produce disease symptoms. A comparison of the genomic sequences of these closely related species may permit a better understanding of contrasting phenotype such as endophytic association and pathogenic life style. To achieve this goal, we constructed suppressive subtractive hybridization (SSH) libraries to identify DNA fragments present in one species and absent in the other. In a parallel approach, partial genomic sequence from H. rubrisubalbicans M1 was directly compared in silico with the H. seropedicae SmR1 genome. The genomic differences between the two organisms revealed by SSH suggested that lipopolysaccharide and adhesins are potential molecular factors involved in the different phenotypic behavior. The cluster wss probably involved in cellulose biosynthesis was found in H. rubrisubalbicans M1. Expression of this gene cluster was increased in H. rubrisubalbicans M1 cells attached to the surface of maize root, and knockout of wssD gene led to decrease in maize root surface attachment and endophytic colonization. The production of cellulose could be responsible for the maize attachment pattern of H. rubrisubalbicans M1 that is capable of outcompeting H. seropedicae SmR1. PMID:22268687

  19. Whole genome and global gene expression analyses of the model mushroom Flammulina velutipes reveal a high capacity for lignocellulose degradation.

    Directory of Open Access Journals (Sweden)

    Young-Jin Park

    Full Text Available Flammulina velutipes is a fungus with health and medicinal benefits that has been used for consumption and cultivation in East Asia. F. velutipes is also known to degrade lignocellulose and produce ethanol. The overlapping interests of mushroom production and wood bioconversion make F. velutipes an attractive new model for fungal wood related studies. Here, we present the complete sequence of the F. velutipes genome. This is the first sequenced genome for a commercially produced edible mushroom that also degrades wood. The 35.6-Mb genome contained 12,218 predicted protein-encoding genes and 287 tRNA genes assembled into 11 scaffolds corresponding with the 11 chromosomes of strain KACC42780. The 88.4-kb mitochondrial genome contained 35 genes. Well-developed wood degrading machinery with strong potential for lignin degradation (69 auxiliary activities, formerly FOLymes and carbohydrate degradation (392 CAZymes, along with 58 alcohol dehydrogenase genes were highly expressed in the mycelium, demonstrating the potential application of this organism to bioethanol production. Thus, the newly uncovered wood degrading capacity and sequential nature of this process in F. velutipes, offer interesting possibilities for more detailed studies on either lignin or (hemi- cellulose degradation in complex wood substrates. The mutual interest in wood degradation by the mushroom industry and (ligno-cellulose biomass related industries further increase the significance of F. velutipes as a new model.

  20. Eicosapentaenoic acid prevents high fat diet-induced metabolic disorders: Genomic and metabolomic analyses of underlying mechanism

    Science.gov (United States)

    Previously our lab demonstrated eicosapenaenoic acid (EPA)'s ability to prevent high-fat (HF) diet-induced obesity by decreasing insulin resistance, glucose intolerance and inflammation. In the current study, we used genomic and metabolomic approaches to further investigate the molecular basis for t...

  1. Genome-wide linkage analyses of type 2 diabetes in Mexican Americans: the San Antonio Family Diabetes/Gallbladder Study.

    Science.gov (United States)

    Hunt, Kelly J; Lehman, Donna M; Arya, Rector; Fowler, Sharon; Leach, Robin J; Göring, Harald H H; Almasy, Laura; Blangero, John; Dyer, Tom D; Duggirala, Ravindranath; Stern, Michael P

    2005-09-01

    The San Antonio Family Diabetes/Gallbladder Study was initiated to identify susceptibility genes for type 2 diabetes. Evidence was previously reported of linkage to diabetes on 10q with suggestive evidence on 3p and 9p in a genome-wide scan of 440 individuals from 27 pedigrees ascertained through a single diabetic proband. Subsequently, the study was expanded to include 906 individuals from 39 extended Mexican-American pedigrees, two additional examination cycles approximately 5.3 and 7.6 years after baseline, and genotypes for a new set of genome-wide markers. Therefore, we completed a second genome-wide linkage scan. Using information from a participant's most recent exam, the prevalence of diabetes in nonprobands was 21.8%. We performed genome-wide variance components-based genetic analysis on the discrete trait diabetes using a liability model and on the quantitative Martingale residual obtained from modeling age of diabetes diagnosis using Cox proportional hazard models. Controlling for age and age(2), our strongest evidence for linkage to the trait diabetes and the quantitative Martingale residual was on chromosome 3p at marker D3S2406 with multipoint empirical logarithm of odds scores of 1.87 and 3.76, respectively. In summary, we report evidence for linkage to diabetes on chromosome 3p in a region previously identified in at least three independent populations. PMID:16123354

  2. Genome-wide association analyses identify new risk variants and the genetic architecture of amyotrophic lateral sclerosis

    NARCIS (Netherlands)

    van Rheenen, Wouter; Shatunov, Aleksey; Dekker, Annelot M; McLaughlin, Russell L; Diekstra, Frank P; Pulit, Sara L; van der Spek, Rick A A; Võsa, Urmo; de Jong, Simone; Robinson, Matthew R; Yang, Jian; Fogh, Isabella; van Doormaal, Perry Tc; Tazelaar, Gijs H P; Koppers, Max; Blokhuis, Anna M; Sproviero, William; Jones, Ashley R; Kenna, Kevin P; van Eijk, Kristel R; Harschnitz, Oliver; Schellevis, Raymond D; Brands, William J; Medic, Jelena; Menelaou, Androniki; Vajda, Alice; Ticozzi, Nicola; Lin, Kuang; Rogelj, Boris; Vrabec, Katarina; Ravnik-Glavač, Metka; Koritnik, Blaž; Zidar, Janez; Leonardis, Lea; Grošelj, Leja Dolenc; Millecamps, Stéphanie; Salachas, François; Meininger, Vincent; de Carvalho, Mamede; Pinto, Susana; Mora, Jesus S; Rojas-García, Ricardo; Polak, Meraida; Chandran, Siddharthan; Colville, Shuna; Swingler, Robert; Morrison, Karen E; Shaw, Pamela J; Hardy, John; Orrell, Richard W; Pittman, Alan; Sidle, Katie; Fratta, Pietro; Malaspina, Andrea; Topp, Simon; Petri, Susanne; Abdulla, Susanne; Drepper, Carsten; Sendtner, Michael; Meyer, Thomas; Ophoff, Roel A.; Staats, Kim A; Wiedau-Pazos, Martina; Lomen-Hoerth, Catherine; Van Deerlin, Vivianna M; Trojanowski, John Q; Elman, Lauren; McCluskey, Leo; Basak, A Nazli; Tunca, Ceren; Hamzeiy, Hamid; Parman, Yesim; Meitinger, Thomas; Lichtner, Peter; Radivojkov-Blagojevic, Milena; Andres, Christian R; Maurel, Cindy; Bensimon, Gilbert; Landwehrmeyer, Bernhard; Brice, Alexis; Payan, Christine A M; Saker-Delye, Safaa; Dürr, Alexandra; Wood, Nicholas W; Tittmann, Lukas; Lieb, Wolfgang; Franke, Andre; Rietschel, Marcella; Cichon, Sven; Nöthen, Markus M; Amouyel, Philippe; Tzourio, Christophe; Dartigues, Jean-François; Uitterlinden, Andre G; Rivadeneira, Fernando; Estrada, Karol; Hofman, Albert; Curtis, Charles; Blauw, Hylke M; van der Kooi, Anneke J; de Visser, Marianne; Goris, An; Weber, Markus; Shaw, Christopher E; Smith, Bradley N; Pansarasa, Orietta; Cereda, Cristina; Del Bo, Roberto; Comi, Giacomo P; D'Alfonso, Sandra; Bertolin, Cinzia; Sorarù, Gianni; Mazzini, Letizia; Pensato, Viviana; Gellera, Cinzia; Tiloca, Cinzia; Ratti, Antonia; Calvo, Andrea; Moglia, Cristina; Brunetti, Maura; Arcuti, Simona; Capozzo, Rosa; Zecca, Chiara; Lunetta, Christian; Penco, Silvana; Riva, Nilo; Padovani, Alessandro; Filosto, Massimiliano; Muller, Bernard; Stuit, Robbert Jan; Blair, Ian; Zhang, Katharine; McCann, Emily P; Fifita, Jennifer A; Nicholson, Garth A; Rowe, Dominic B; Pamphlett, Roger; Kiernan, Matthew C; Grosskreutz, Julian; Witte, Otto W; Ringer, Thomas; Prell, Tino; Stubendorff, Beatrice; Kurth, Ingo; Hübner, Christian A; Leigh, P Nigel; Casale, Federico; Chio, Adriano; Beghi, Ettore; Pupillo, Elisabetta; Tortelli, Rosanna; Logroscino, Giancarlo; Powell, John; Ludolph, Albert C; Weishaupt, Jochen H; Robberecht, Wim; Van Damme, Philip; Franke, Lude; Pers, Tune H; Brown, Robert H; Glass, Jonathan D; Landers, John E; Hardiman, Orla; Andersen, Peter M; Corcia, Philippe; Vourc'h, Patrick; Silani, Vincenzo; Wray, Naomi R; Visscher, Peter M; de Bakker, Paul I W; van Es, Michael A; Pasterkamp, R Jeroen; Lewis, Cathryn M; Breen, Gerome; Al-Chalabi, Ammar; van den Berg, Leonard H; Veldink, Jan H

    2016-01-01

    To elucidate the genetic architecture of amyotrophic lateral sclerosis (ALS) and find associated loci, we assembled a custom imputation reference panel from whole-genome-sequenced patients with ALS and matched controls (n = 1,861). Through imputation and mixed-model association analysis in 12,577 ca

  3. Genetic variants associated with subjective well-being, depressive symptoms, and neuroticism identified through genome-wide analyses

    DEFF Research Database (Denmark)

    Okbay, Aysu; Baselmans, Bart M L; De Neve, Jan-Emmanuel;

    2016-01-01

    Very few genetic variants have been associated with depression and neuroticism, likely because of limitations on sample size in previous studies. Subjective well-being, a phenotype that is genetically correlated with both of these traits, has not yet been studied with genome-wide data. We conduct...

  4. Comparative genome analyses of novel Mangrovimonas-like strains isolated from estuarine mangrove sediments reveal xylan and arabinan utilization genes.

    Science.gov (United States)

    Dinesh, Balachandra; Lau, Nyok-Sean; Furusawa, Go; Kim, Seok-Won; Taylor, Todd D; Foong, Swee Yeok; Shu-Chien, Alexander Chong

    2016-02-01

    To date, the genus Mangrovimonas consists of only one species, Mangrovimonas yunxiaonensis strain LY01 that is known to have algicidal effects against harmful algal blooms (HABs) of Alexandrium tamarense. In this study, the whole genome sequence of three Mangrovimonas-like strains, TPBH4(T)(=LMG 28913(T),=JCM 30882(T)), ST2L12(T)(=LMG 28914(T),=JCM 30880(T)) and ST2L15(T)(=LMG 28915(T),=JCM 30881(T)) isolated from estuarine mangrove sediments in Perak, Malaysia were described. The sequenced genomes had a range of assembly size ranging from 3.56Mb to 4.15Mb which are significantly larger than that of M. yunxiaonensis LY01 (2.67Mb). Xylan, xylose, L-arabinan and L-arabinose utilization genes were found in the genome sequences of the three Mangrovimonas-like strains described in this study. In contrast, these carbohydrate metabolism genes were not found in the genome sequence of LY01. In addition, TPBH4(T) and ST2L12(T) show capability to degrade xylan using qualitative plate assay method. PMID:26795059

  5. Genomic Analyses of Cladophialophora bantiana, a Major Cause of Cerebral Phaeohyphomycosis Provides Insight into Its Lifestyle, Virulence and Adaption in Host.

    Science.gov (United States)

    Kuan, Chee Sian; Cham, Chun Yoong; Singh, Gurmit; Yew, Su Mei; Tan, Yung-Chie; Chong, Pei-Sin; Toh, Yue Fen; Atiya, Nadia; Na, Shiang Ling; Lee, Kok Wei; Hoh, Chee-Choong; Yee, Wai-Yan; Ng, Kee Peng

    2016-01-01

    Cladophialophora bantiana is a dematiaceous fungus with a predilection for causing central nervous system (CNS) infection manifesting as brain abscess in both immunocompetent and immunocompromised patients. In this paper, we report comprehensive genomic analyses of C. bantiana isolated from the brain abscess of an immunocompetent man, the first reported case in Malaysia and Southeast Asia. The identity of the fungus was determined using combined morphological analysis and multilocus phylogeny. The draft genome sequence of a neurotrophic fungus, C. bantiana UM 956 was generated using Illumina sequencing technology to dissect its genetic fundamental and basic biology. The assembled 37.1 Mb genome encodes 12,155 putative coding genes, of which, 1.01% are predicted transposable elements. Its genomic features support its saprophytic lifestyle, renowned for its versatility in decomposing hemicellulose and pectin components. The C. bantiana UM 956 was also found to carry some important putative genes that engaged in pathogenicity, iron uptake and homeostasis as well as adaptation to various stresses to enable the organism to survive in hostile microenvironment. This wealth of resource will further catalyse more downstream functional studies to provide better understanding on how this fungus can be a successful and persistent pathogen in human. PMID:27570972

  6. Analyses of Methylomes Derived from Meso-American Common Bean (Phaseolus vulgaris L.) Using MeDIP-Seq and Whole Genome Sodium Bisulfite-Sequencing.

    Science.gov (United States)

    Crampton, Mollee; Sripathi, Venkateswara R; Hossain, Khwaja; Kalavacharla, Venu

    2016-01-01

    Common bean (Phaseolus vulgaris L.) is economically important for its high protein, fiber, and micronutrient contents, with a relatively small genome size of ∼587 Mb. Common bean is genetically diverse with two major gene pools, Meso-American and Andean. The phenotypic variability within common bean is partly attributed to the genetic diversity and epigenetic changes that are largely influenced by environmental factors. It is well established that an important epigenetic regulator of gene expression is DNA methylation. Here, we present results generated from two high-throughput sequencing technologies, methylated DNA immunoprecipitation-sequencing (MeDIP-seq) and whole genome bisulfite-sequencing (BS-Seq). Our analyses revealed that this Meso-American common bean displays similar methylation patterns as other previously published plant methylomes, with CG ∼50%, CHG ∼30%, and CHH ∼2.7% methylation, however, these differ from the common bean reference methylome of Andean origin. We identified higher CG methylation levels in both promoter and genic regions than CHG and CHH contexts. Moreover, we found relatively higher CG methylation levels in genes than in promoters. Conversely, the CHG and CHH methylation levels were highest in promoters than in genes. This is the first genome-wide DNA methylation profiling study in a Meso-American common bean cultivar ("Sierra") using NGS approaches. Our long-term goal is to generate genome-wide epigenomic maps in common bean focusing on chromatin accessibility, histone modifications, and DNA methylation. PMID:27199997

  7. Genus-Wide Comparative Genome Analyses of Colletotrichum Species Reveal Specific Gene Family Losses and Gains during Adaptation to Specific Infection Lifestyles.

    Science.gov (United States)

    Gan, Pamela; Narusaka, Mari; Kumakura, Naoyoshi; Tsushima, Ayako; Takano, Yoshitaka; Narusaka, Yoshihiro; Shirasu, Ken

    2016-01-01

    Members from Colletotrichum genus adopt a diverse range of lifestyles during infection of plants and represent a group of agriculturally devastating pathogens. In this study, we present the draft genome of Colletotrichum incanum from the spaethianum clade of Colletotrichum and the comparative analyses with five other Colletotrichum species from distinct lineages. We show that the C. incanum strain, originally isolated from Japanese daikon radish, is able to infect both eudicot plants, such as certain ecotypes of the eudicot Arabidopsis, and monocot plants, such as lily. Being closely related to Colletotrichum species both in the graminicola clade, whose members are restricted strictly to monocot hosts, and to the destructivum clade, whose members are mostly associated with dicot infections, C. incanum provides an interesting model system for comparative genomics to study how fungal pathogens adapt to monocot and dicot hosts. Genus-wide comparative genome analyses reveal that Colletotrichum species have tailored profiles of their carbohydrate-degrading enzymes according to their infection lifestyles. In addition, we show evidence that positive selection acting on secreted and nuclear localized proteins that are highly conserved may be important in adaptation to specific hosts or ecological niches. PMID:27189990

  8. MitoZoa 2.0: a database resource and search tools for comparative and evolutionary analyses of mitochondrial genomes in Metazoa

    Science.gov (United States)

    D'Onorio de Meo, Paolo; D'Antonio, Mattia; Griggio, Francesca; Lupi, Renato; Borsani, Massimiliano; Pavesi, Giulio; Castrignanò, Tiziana; Pesole, Graziano; Gissi, Carmela

    2012-01-01

    The MITOchondrial genome database of metaZOAns (MitoZoa) is a public resource for comparative analyses of metazoan mitochondrial genomes (mtDNA) at both the sequence and genomic organizational levels. The main characteristics of the MitoZoa database are the careful revision of mtDNA entry annotations and the possibility of retrieving gene order and non-coding region (NCR) data in appropriate formats. The MitoZoa retrieval system enables basic and complex queries at various taxonomic levels using different search menus. MitoZoa 2.0 has been enhanced in several aspects, including: a re-annotation pipeline to check the correctness of protein-coding gene predictions; a standardized annotation of introns and of precursor ORFs whose functionality is post-transcriptionally recovered by RNA editing or programmed translational frameshifting; updates of taxon-related fields and a BLAST sequence similarity search tool. Database novelties and the definition of standard mtDNA annotation rules, together with the user-friendly retrieval system and the BLAST service, make MitoZoa a valuable resource for comparative and evolutionary analyses as well as a reference database to assist in the annotation of novel mtDNA sequences. MitoZoa is freely accessible at http://www.caspur.it/mitozoa. PMID:22123747

  9. Laser and electron beams physical analyses applied to the comparison between two silver tetradrachm greek coins

    Science.gov (United States)

    Torrisi, L.; Mondio, G.; Mezzasalma, A. M.; Margarone, D.; Caridi, F.; Serafino, T.; Torrisi, A.

    2009-08-01

    Physical analyses by laser ablation coupled to mass quadrupole spectrometry (LAMQS), energy dispersive X-ray fluorescence (EDX) induced by electron beam, scanning electron microscopy (SEM) and surface profilometry analysis (SPA) are applied to the investigation of two silver tetradrachms from Messana, in order to compare their elemental composition and structure. Quantitative analysis of the elemental composition and of the silver isotopic ratios have been carried out analyzing the surface patina of the two samples. Significant differences in the sulfur, chlorine and copper content, in the isotopic ratios and in the morphological aspects have been measured. The obtained results are presented and discussed from the point of view of the physical techniques useful to establish the differences between apparently true and false coins.

  10. ECR Browser: A Tool For Visualizing And Accessing Data From Comparisons Of Multiple Vertebrate Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Loots, G G; Ovcharenko, I; Stubbs, L; Nobrega, M A

    2004-01-06

    The increasing number of vertebrate genomes being sequenced in draft or finished form provide a unique opportunity to study and decode the language of DNA sequence through comparative genome alignments. However, novel tools and strategies are required to accommodate this increasing volume of genomic information and to facilitate experimental annotation of genome function. Here we present the ECR Browser, a tool that provides an easy and dynamic access to whole genome alignments of human, mouse, rat and fish sequences. This web-based tool (http://ecrbrowser.dcode.org) provides the starting point for discovery of novel genes, identification of distant gene regulatory elements and prediction of transcription factor binding sites. The genome alignment portal of the ECR Browser also permits fast and automated alignment of any user-submitted sequence to the genome of choice. The interconnection of the ECR browser with other DNA sequence analysis tools creates a unique portal for studying and exploring vertebrate genomes.

  11. Comparison of the complete genome sequences of Pseudomonas syringae pv. syringae B728a and pv. tomato DC3000

    Energy Technology Data Exchange (ETDEWEB)

    Feil, Helene [University of California, Berkeley; Feil, William [University of California, Berkeley; Chain, Patrick S. G. [Lawrence Livermore National Laboratory (LLNL); Larimer, Frank W [ORNL; DiBartolo, Genevieve [U.S. Department of Energy, Joint Genome Institute; Copeland, A [U.S. Department of Energy, Joint Genome Institute; Lykidis, A [U.S. Department of Energy, Joint Genome Institute; Trong, Stephen [U.S. Department of Energy, Joint Genome Institute; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute; Goltsman, Eugene [U.S. Department of Energy, Joint Genome Institute; Thiel, James [U.S. Department of Energy, Joint Genome Institute; Malfatti, Stephanie [Lawrence Livermore National Laboratory (LLNL); Loper, Joyce E. [U.S. Department of Agriculture, Agricultural Research Service; Detter, J C [U.S. Department of Energy, Joint Genome Institute; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Richardson, P M [U.S. Department of Energy, Joint Genome Institute; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute; Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Lindow, Steven E. [University of California, Berkeley

    2005-01-01

    The complete genomic sequence of Pseudomonas syringae pv. syringae B728a (Pss B728a) has been determined and is compared with that of A syringae pv. tomato DC3000 (Pst DC3000). The two pathovars of this economically important species of plant pathogenic bacteria differ in host range and other interactions with plants, with Pss having a more pronounced epiphytic stage of growth and higher abiotic stress tolerance and Pst DC3000 having a more pronounced apoplastic growth habitat. The Pss B728a genome (6.1 Mb) contains a circular chromosome and no plasmid, whereas the Pst DC3000 genome is 6.5 mbp in size, composed of a circular chromosome and two plasmids. Although a high degree of similarity exists between the two sequenced Pseudomonads, 976 protein-encoding genes are unique to Pss B728a when compared with Pst DC3000, including large genomic islands likely to contribute to virulence and host specificity. Over 375 repetitive extragenic palindromic sequences unique to Pss B728a when compared with Pst DC3000 are widely distributed throughout the chromosome except in 14 genomic islands, which generally had lower GC content than the genome as a whole. Content of the genomic islands varies, with one containing a prophage and another the plasmid pKLC102 of Pseudomonas aeruginosa PAO1. Among the 976 genes of Pss B728a with no counterpart in Pst DC3000 are those encoding for syringopeptin, syringomycin, indole acetic acid biosynthesis, arginine degradation, and production of ice nuclei. The genomic comparison suggests that several unique genes for Pss B728a such as ectoine synthase, DNA repair, and antibiotic production may contribute to the epiphytic fitness and stress tolerance of this organism.

  12. Comparison of the complete genome sequences of Pseudomonas syringae pv. syringae B728a and pv. tomato DC3000

    Energy Technology Data Exchange (ETDEWEB)

    Feil, H; Feil, W S; Chain, P; Larimer, F; DiBartolo, G; Copeland, A; Lykidis, A; Trong, S; Nolan, M; Goltsman, E; Thiel, J; Malfatti, S; Loper, J E; Lapidus, A; Detter, J C; Land, M; Richardson, P M; Kyrpides, N C; Ivanova, N; Lindow, S E

    2005-07-14

    The complete genomic sequence of Pseudomonas syringae pathovar syringae B728a (Pss B728a), has been determined and is compared with that of Pseudomonas syringae pv. tomato DC3000 (Pst DC3000). The two pathovars of this economically important species of plant pathogenic bacteria differ in host range and other interactions with plants, with Pss having a more pronounced epiphytic stage of growth and higher abiotic stress tolerance and Pst DC3000 having a more pronounced apoplastic growth habitat. The Pss B728a genome (6.1 megabases) contains a circular chromosome and no plasmid, whereas the Pst DC3000 genome is 6.5 mbp in size, composed of a circular chromosome and two plasmids. While a high degree of similarity exists between the two sequenced Pseudomonads, 976 protein-encoding genes are unique to Pss B728a when compared to Pst DC3000, including large genomic islands likely to contribute to virulence and host specificity. Over 375 repetitive extragenic palindromic sequences (REPs) unique to Pss B728a when compared to Pst DC3000 are widely distributed throughout the chromosome except in 14 genomic islands, which generally had lower GC content than the genome as a whole. Content of the genomic islands vary, with one containing a prophage and another the plasmid pKLC102 of P. aeruginosa PAO1. Among the 976 genes of Pss B728a with no counterpart in Pst DC3000 are those encoding for syringopeptin (SP), syringomycin (SR), indole acetic acid biosynthesis, arginine degradation, and production of ice nuclei. The genomic comparison suggests that several unique genes for Pss B728a such as ectoine synthase, DNA repair, and antibiotic production may contribute to epiphytic fitness and stress tolerance of this organism.

  13. The smallest known genomes of multicellular and toxic cyanobacteria: comparison, minimal gene sets for linked traits and the evolutionary implications.

    Directory of Open Access Journals (Sweden)

    Karina Stucken

    Full Text Available Cyanobacterial morphology is diverse, ranging from unicellular spheres or rods to multicellular structures such as colonies and filaments. Multicellular species represent an evolutionary strategy to differentiate and compartmentalize certain metabolic functions for reproduction and nitrogen (N(2 fixation into specialized cell types (e.g. akinetes, heterocysts and diazocytes. Only a few filamentous, differentiated cyanobacterial species, with genome sizes over 5 Mb, have been sequenced. We sequenced the genomes of two strains of closely related filamentous cyanobacterial species to yield further insights into the molecular basis of the traits of N(2 fixation, filament formation and cell differentiation. Cylindrospermopsis raciborskii CS-505 is a cylindrospermopsin-producing strain from Australia, whereas Raphidiopsis brookii D9 from Brazil synthesizes neurotoxins associated with paralytic shellfish poisoning (PSP. Despite their different morphology, toxin composition and disjunct geographical distribution, these strains form a monophyletic group. With genome sizes of approximately 3.9 (CS-505 and 3.2 (D9 Mb, these are the smallest genomes described for free-living filamentous cyanobacteria. We observed remarkable gene order conservation (synteny between these genomes despite the difference in repetitive element content, which accounts for most of the genome size difference between them. We show here that the strains share a specific set of 2539 genes with >90% average nucleotide identity. The fact that the CS-505 and D9 genomes are small and streamlined compared to those of other filamentous cyanobacterial species and the lack of the ability for heterocyst formation in strain D9 allowed us to define a core set of genes responsible for each trait in filamentous species. We presume that in strain D9 the ability to form proper heterocysts was secondarily lost together with N(2 fixation capacity. Further comparisons to all available cyanobacterial

  14. The Smallest Known Genomes of Multicellular and Toxic Cyanobacteria: Comparison, Minimal Gene Sets for Linked Traits and the Evolutionary Implications

    Science.gov (United States)

    Stucken, Karina; John, Uwe; Cembella, Allan; Murillo, Alejandro A.; Soto-Liebe, Katia; Fuentes-Valdés, Juan J.; Friedel, Maik; Plominsky, Alvaro M.; Vásquez, Mónica; Glöckner, Gernot

    2010-01-01

    Cyanobacterial morphology is diverse, ranging from unicellular spheres or rods to multicellular structures such as colonies and filaments. Multicellular species represent an evolutionary strategy to differentiate and compartmentalize certain metabolic functions for reproduction and nitrogen (N2) fixation into specialized cell types (e.g. akinetes, heterocysts and diazocytes). Only a few filamentous, differentiated cyanobacterial species, with genome sizes over 5 Mb, have been sequenced. We sequenced the genomes of two strains of closely related filamentous cyanobacterial species to yield further insights into the molecular basis of the traits of N2 fixation, filament formation and cell differentiation. Cylindrospermopsis raciborskii CS-505 is a cylindrospermopsin-producing strain from Australia, whereas Raphidiopsis brookii D9 from Brazil synthesizes neurotoxins associated with paralytic shellfish poisoning (PSP). Despite their different morphology, toxin composition and disjunct geographical distribution, these strains form a monophyletic group. With genome sizes of approximately 3.9 (CS-505) and 3.2 (D9) Mb, these are the smallest genomes described for free-living filamentous cyanobacteria. We observed remarkable gene order conservation (synteny) between these genomes despite the difference in repetitive element content, which accounts for most of the genome size difference between them. We show here that the strains share a specific set of 2539 genes with >90% average nucleotide identity. The fact that the CS-505 and D9 genomes are small and streamlined compared to those of other filamentous cyanobacterial species and the lack of the ability for heterocyst formation in strain D9 allowed us to define a core set of genes responsible for each trait in filamentous species. We presume that in strain D9 the ability to form proper heterocysts was secondarily lost together with N2 fixation capacity. Further comparisons to all available cyanobacterial genomes covering

  15. Whole-genome analyses reveals the animal origin of a rotavirus G4P[6] detected in a child with severe diarrhea.

    Science.gov (United States)

    Martinez, Magaly; Galeano, Maria E; Akopov, Asmik; Palacios, Ruth; Russomando, Graciela; Kirkness, Ewen F; Parra, Gabriel I

    2014-10-01

    Group A rotaviruses are a major cause of severe gastroenteritis in children worldwide. Currently, two rotavirus vaccines are being used in vaccination programs, and one of the factors involved in lower vaccine efficacy is the mismatch among the circulating strains and the vaccine strains. Thus, the emergence of animal strains in the human population could affect the efficacy of vaccination programs. Here we report the presence of a G4P[6] strain in a Paraguayan child presenting acute gastroenteritis in 2009. Genomic analyses revealed that the strain presents a porcine-like genome (G4-P[6]-I1-R1-C1-M1-A8-N1-T7-E1-H1), suggesting a direct animal-to-human transmission. Continuous surveillance of rotaviruses in humans and animals will help us to better understand rotavirus epidemiology and evolution. PMID:25075468

  16. The complete mitochondrial genome of Pseudocellus pearsei (Chelicerata: Ricinulei and a comparison of mitochondrial gene rearrangements in Arachnida

    Directory of Open Access Journals (Sweden)

    Braband Anke

    2007-10-01

    Full Text Available Abstract Background Mitochondrial genomes are widely utilized for phylogenetic and population genetic analyses among animals. In addition to sequence data the mitochondrial gene order and RNA secondary structure data are used in phylogenetic analyses. Arachnid phylogeny is still highly debated and there is a lack of sufficient sequence data for many taxa. Ricinulei (hooded tickspiders are a morphologically distinct clade of arachnids with uncertain phylogenetic affinities. Results The first complete mitochondrial DNA genome of a member of the Ricinulei, Pseudocellus pearsei (Arachnida: Ricinulei was sequenced using a PCR-based approach. The mitochondrial genome is a typical circular duplex DNA molecule with a size of 15,099 bp, showing the complete set of genes usually present in bilaterian mitochondrial genomes. Five tRNA genes (trnW, trnY, trnN, trnL(CUN, trnV show different relative positions compared to other Chelicerata (e.g. Limulus polyphemus, Ixodes spp.. We propose that two events led to this derived gene order: (1 a tandem duplication followed by random deletion and (2 an independent translocation of trnN. Most of the inferred tRNA secondary structures show the common cloverleaf pattern except tRNA-Glu where the TψC-arm is missing. In phylogenetic analyses (maximum likelihood, maximum parsimony, Bayesian inference using concatenated amino acid and nucleotide sequences of protein-coding genes the basal relationships of arachnid orders remain unresolved. Conclusion Phylogenetic analyses (ML, MP, BI of arachnid mitochondrial genomes fail to resolve interordinal relationships of Arachnida and remain in a preliminary stage because there is still a lack of mitogenomic data from important taxa such as Opiliones and Pseudoscorpiones. Gene order varies considerably within Arachnida – only eight out of 23 species have retained the putative arthropod ground pattern. Some gene order changes are valuable characters in phylogenetic analysis of

  17. Genomic analyses and expression evaluation of thaumatin-like gene family in the cacao fungal pathogen Moniliophthora perniciosa.

    Science.gov (United States)

    Franco, Sulamita de Freitas; Baroni, Renata Moro; Carazzolle, Marcelo Falsarella; Teixeira, Paulo José Pereira Lima; Reis, Osvaldo; Pereira, Gonçalo Amarante Guimarães; Mondego, Jorge Maurício Costa

    2015-10-30

    Thaumatin-like proteins (TLPs) are found in diverse eukaryotes. Plant TLPs, known as Pathogenicity Related Protein (PR-5), are considered fungal inhibitors. However, genes encoding TLPs are frequently found in fungal genomes. In this work, we have identified that Moniliophthora perniciosa, a basidiomycete pathogen that causes the Witches' Broom Disease (WBD) of cacao, presents thirteen putative TLPs from which four are expressed during WBD progression. One of them is similar to small TLPs, which are present in phytopathogenic basidiomycete, such as wheat stem rust fungus Puccinia graminis. Fungi genomes annotation and phylogenetic data revealed a larger number of TLPs in basidiomycetes when comparing with ascomycetes, suggesting that these proteins could be involved in specific traits of mushroom-forming species. Based on the present data, we discuss the contribution of TLPs in the combat against fungal competitors and hypothesize a role of these proteins in M. perniciosa pathogenicity. PMID:26367180

  18. Analyses and Comparison of Bulk and Coil Surface Samples from the DWPF Slurry Mix Evaporator

    International Nuclear Information System (INIS)

    Sludge samples from the DWPF Slurry Mix Evaporator (SME) heating coil frame and coil surface were characterized to identify differences that might help identify heat transfer fouling materials. The SME steam coils have seen increased fouling leading to lower boil-up rates. Samples of the sludge were taken from the coil frame somewhat distant from the coil (bulk tank material) and from the coil surface (coil surface sample). The results of the analysis indicate the composition of the two SME samples are very similar with the exception that the coil surface sample shows ∼5-10X higher mercury concentration than the bulk tank sample. Elemental analyses and x-ray diffraction results did not indicate notable differences between the two samples. The ICP-MS and Cs-137 data indicate no significant differences in the radionuclide composition of the two SME samples. Semi-volatile organic analysis revealed numerous organic molecules, these likely result from antifoaming additives. The compositions of the two SME samples also match well with the analyzed composition of the SME batch with the exception of significantly higher silicon, lithium, and boron content in the batch sample indicating the coil samples are deficient in frit relative to the SME batch composition.

  19. Comparison of inoculant and indigenous rhizobial dinitrogen fixation in cowpeas by direct nitrogen-15 analyses

    International Nuclear Information System (INIS)

    Soil that contained 15N enriched organic matter (0.461 % 15N) was used to determine competitiveness of six strains at different logarithmic inoculum densities against indigenous rhizobia and against a previous surviving inoculant (strain P132). Analyses of N content of plant tissues by direct 15N technique showed that cowpeas (Vigna unguiculata L. Walp.) were capable of deriving 60 to 98% of shoot N from N2 fixation. The two fast-growing strains (176A26 and 176A28) were poorer competitors and fixed less N2 compared to the other slow-growing strains. Inoculum density had no effect upon yield response of cowpeas, but inoculation with strains P132, 401, and 22A1 effected greater seed yield, shoot dry matter, total N, and percentage of N derived from fixation (86-98%) than other strains and the uninoculated control (60-73%). By contrast, N2 fixation and yield parameters in inoculated cowpeas were not significantly different from inoculated controls that contained residual P132 from a previous inoculum study. The higher hydrogen uptake (Hup) efficiency of nodules containing residual P132 (98 ± 2%) facilitated presumptive identification of P132 (100% ± 0 Hup efficiency axenically) as the surviving and infecting inoculant strain since nodules infected by indigenous rhizobia had lower Hup efficiencies (88 ± 2%)

  20. ANALYSES AND COMPARISON OF BULK AND COIL SURFACE SAMPLES FROM THE DWPF SLURRY MIX EVAPORATOR

    Energy Technology Data Exchange (ETDEWEB)

    Hay, M.; Nash, C.; Stone, M.

    2012-02-17

    Sludge samples from the DWPF Slurry Mix Evaporator (SME) heating coil frame and coil surface were characterized to identify differences that might help identify heat transfer fouling materials. The SME steam coils have seen increased fouling leading to lower boil-up rates. Samples of the sludge were taken from the coil frame somewhat distant from the coil (bulk tank material) and from the coil surface (coil surface sample). The results of the analysis indicate the composition of the two SME samples are very similar with the exception that the coil surface sample shows {approx}5-10X higher mercury concentration than the bulk tank sample. Elemental analyses and x-ray diffraction results did not indicate notable differences between the two samples. The ICP-MS and Cs-137 data indicate no significant differences in the radionuclide composition of the two SME samples. Semi-volatile organic analysis revealed numerous organic molecules, these likely result from antifoaming additives. The compositions of the two SME samples also match well with the analyzed composition of the SME batch with the exception of significantly higher silicon, lithium, and boron content in the batch sample indicating the coil samples are deficient in frit relative to the SME batch composition.

  1. Genomic and Transcriptomic Analyses of the Facultative Methanotroph Methylocystis sp. Strain SB2 Grown on Methane or Ethanol

    OpenAIRE

    Vorobev, Alexey; Jagadevan, Sheeja; Jain, Sunit; Anantharaman, Karthik; Dick, Gregory J.; Vuilleumier, Stéphane; Semrau, Jeremy D.

    2014-01-01

    A minority of methanotrophs are able to utilize multicarbon compounds as growth substrates in addition to methane. The pathways utilized by these microorganisms for assimilation of multicarbon compounds, however, have not been explicitly examined. Here, we report the draft genome of the facultative methanotroph Methylocystis sp. strain SB2 and perform a detailed transcriptomic analysis of cultures grown with either methane or ethanol. Evidence for use of the canonical methane oxidation pathwa...

  2. Genome wide evolutionary analyses reveal serotype specific patterns of positive selection in selected Salmonella serotypes

    OpenAIRE

    Sun Qi; Rodriguez-Rivera Lorraine D; Orsi Renato H; Soyer Yeşim; Wiedmann Martin

    2009-01-01

    Abstract Background The bacterium Salmonella enterica includes a diversity of serotypes that cause disease in humans and different animal species. Some Salmonella serotypes show a broad host range, some are host restricted and exclusively associated with one particular host, and some are associated with one particular host species, but able to cause disease in other host species and are thus considered "host adapted". Five Salmonella genome sequences, representing a broad host range serotype ...

  3. Cross-Disorder Genome-Wide Analyses Suggest a Complex Genetic Relationship Between Tourette Syndrome and Obsessive-Compulsive Disorder

    OpenAIRE

    Yu, Dongmei; Mathews, Carol A.; Scharf, Jeremiah M.; Neale, Benjamin M; Davis, Lea K; Gamazon, Eric R.; Derks, Eske M; Evans, Patrick; Edlund, Christopher K.; Crane, Jacquelyn; Fagerness, Jesen A.; Osiecki, Lisa; Gallagher, Patience; Gerber, Gloria; Haddad, Stephen

    2014-01-01

    Obsessive-compulsive disorder (OCD) and Tourette Syndrome (TS) are highly heritable neurodevelopmental disorders that are thought to share genetic risk factors. However, the identification of definitive susceptibility genes for these etiologically complex disorders remains elusive. Here, we report a combined genome-wide association study (GWAS) of TS and OCD in 2723 cases (1310 with OCD, 834 with TS, 579 with OCD plus TS/chronic tics (CT)), 5667 ancestry-matched controls, and 290 OCD parent-c...

  4. The complete mitochondrial genome of Flustra foliacea (Ectoprocta, Cheilostomata) - compositional bias affects phylogenetic analyses of lophotrochozoan relationships

    OpenAIRE

    Nesnidal Maximilian P; Helmkampf Martin; Bruchhaus Iris; Hausdorf Bernhard

    2011-01-01

    Abstract Background The phylogenetic relationships of the lophophorate lineages, ectoprocts, brachiopods and phoronids, within Lophotrochozoa are still controversial. We sequenced an additional mitochondrial genome of the most species-rich lophophorate lineage, the ectoprocts. Although it is known that there are large differences in the nucleotide composition of mitochondrial sequences of different lineages as well as in the amino acid composition of the encoded proteins, this bias is often n...

  5. Integrative genomic analyses identify BRF2 as a novel lineage-specific oncogene in lung squamous cell carcinoma.

    OpenAIRE

    Lockwood, William W; Raj Chari; Coe, Bradley P.; Thu, Kelsie L.; Cathie Garnis; Malloff, Chad A.; Jennifer Campbell; Williams, Ariane C.; Dorothy Hwang; Chang-Qi Zhu; Buys, Timon P.H.; John Yee; English, John C.; Calum Macaulay; Ming-Sound Tsao

    2010-01-01

    BACKGROUND: Traditionally, non-small cell lung cancer is treated as a single disease entity in terms of systemic therapy. Emerging evidence suggests the major subtypes--adenocarcinoma (AC) and squamous cell carcinoma (SqCC)--respond differently to therapy. Identification of the molecular differences between these tumor types will have a significant impact in designing novel therapies that can improve the treatment outcome. METHODS AND FINDINGS: We used an integrative genomics approach, combin...

  6. Comparative genomic and functional analyses: unearthing the diversity and specificity of nematicidal factors in Pseudomonas putida strain 1A00316.

    Science.gov (United States)

    Guo, Jing; Jing, Xueping; Peng, Wen-Lei; Nie, Qiyu; Zhai, Yile; Shao, Zongze; Zheng, Longyu; Cai, Minmin; Li, Guangyu; Zuo, Huaiyu; Zhang, Zhitao; Wang, Rui-Ru; Huang, Dian; Cheng, Wanli; Yu, Ziniu; Chen, Ling-Ling; Zhang, Jibin

    2016-01-01

    We isolated Pseudomonas putida (P. putida) strain 1A00316 from Antarctica. This bacterium has a high efficiency against Meloidogyne incognita (M. incognita) in vitro and under greenhouse conditions. The complete genome of P. putida 1A00316 was sequenced using PacBio single molecule real-time (SMRT) technology. A comparative genomic analysis of 16 Pseudomonas strains revealed that although P. putida 1A00316 belonged to P. putida, it was phenotypically more similar to nematicidal Pseudomonas fluorescens (P. fluorescens) strains. We characterized the diversity and specificity of nematicidal factors in P. putida 1A00316 with comparative genomics and functional analysis, and found that P. putida 1A00316 has diverse nematicidal factors including protein alkaline metalloproteinase AprA and two secondary metabolites, hydrogen cyanide and cyclo-(l-isoleucyl-l-proline). We show for the first time that cyclo-(l-isoleucyl-l-proline) exhibit nematicidal activity in P. putida. Interestingly, our study had not detected common nematicidal factors such as 2,4-diacetylphloroglucinol (2,4-DAPG) and pyrrolnitrin in P. putida 1A00316. The results of the present study reveal the diversity and specificity of nematicidal factors in P. putida strain 1A00316. PMID:27384076

  7. Comparative genomic and functional analyses: unearthing the diversity and specificity of nematicidal factors in Pseudomonas putida strain 1A00316

    Science.gov (United States)

    Guo, Jing; Jing, Xueping; Peng, Wen-Lei; Nie, Qiyu; Zhai, Yile; Shao, Zongze; Zheng, Longyu; Cai, Minmin; Li, Guangyu; Zuo, Huaiyu; Zhang, Zhitao; Wang, Rui-Ru; Huang, Dian; Cheng, Wanli; Yu, Ziniu; Chen, Ling-Ling; Zhang, Jibin

    2016-01-01

    We isolated Pseudomonas putida (P. putida) strain 1A00316 from Antarctica. This bacterium has a high efficiency against Meloidogyne incognita (M. incognita) in vitro and under greenhouse conditions. The complete genome of P. putida 1A00316 was sequenced using PacBio single molecule real-time (SMRT) technology. A comparative genomic analysis of 16 Pseudomonas strains revealed that although P. putida 1A00316 belonged to P. putida, it was phenotypically more similar to nematicidal Pseudomonas fluorescens (P. fluorescens) strains. We characterized the diversity and specificity of nematicidal factors in P. putida 1A00316 with comparative genomics and functional analysis, and found that P. putida 1A00316 has diverse nematicidal factors including protein alkaline metalloproteinase AprA and two secondary metabolites, hydrogen cyanide and cyclo-(l-isoleucyl-l-proline). We show for the first time that cyclo-(l-isoleucyl-l-proline) exhibit nematicidal activity in P. putida. Interestingly, our study had not detected common nematicidal factors such as 2,4-diacetylphloroglucinol (2,4-DAPG) and pyrrolnitrin in P. putida 1A00316. The results of the present study reveal the diversity and specificity of nematicidal factors in P. putida strain 1A00316. PMID:27384076

  8. Genome structures and halophyte-specific gene expression of the extremophile thellungiella parvula in comparison with Thellungiella salsuginea (Thellungiella halophila) and arabidopsis

    KAUST Repository

    Oh, Dongha

    2010-09-10

    The genome of Thellungiella parvula, a halophytic relative of Arabidopsis (Arabidopsis thaliana), is being assembled using Roche-454 sequencing. Analyses of a 10-Mb scaffold revealed synteny with Arabidopsis, with recombination and inversion and an uneven distribution of repeat sequences. T. parvula genome structure and DNA sequences were compared with orthologous regions from Arabidopsis and publicly available bacterial artificial chromosome sequences from Thellungiella salsuginea (previously Thellungiella halophila). The three-way comparison of sequences, from one abiotic stress-sensitive species and two tolerant species, revealed extensive sequence conservation and microcolinearity, but grouping Thellungiella species separately from Arabidopsis. However, the T. parvula segments are distinguished from their T. salsuginea counterparts by a pronounced paucity of repeat sequences, resulting in a 30% shorter DNA segment with essentially the same gene content in T. parvula. Among the genes is SALT OVERLY SENSITIVE1 (SOS1), a sodium/proton antiporter, which represents an essential component of plant salinity stress tolerance. Although the SOS1 coding region is highly conserved among all three species, the promoter regions show conservation only between the two Thellungiella species. Comparative transcript analyses revealed higher levels of basal as well as salt-induced SOS1 expression in both Thellungiella species as compared with Arabidopsis. The Thellungiella species and other halophytes share conserved pyrimidine-rich 5\\' untranslated region proximal regions of SOS1 that are missing in Arabidopsis. Completion of the genome structure of T. parvula is expected to highlight distinctive genetic elements underlying the extremophile lifestyle of this species. © American Society of Plant Biologists.

  9. Genome analyses suggest the presence of polyploidy and recent human-driven expansions in eight global populations of the honeybee pathogen Nosema ceranae.

    Science.gov (United States)

    Pelin, Adrian; Selman, Mohammed; Aris-Brosou, Stéphane; Farinelli, Laurent; Corradi, Nicolas

    2015-11-01

    Nosema ceranae is a microsporidian pathogen whose infections have been associated with recent global declines in the populations of western honeybees (Apis mellifera). Despite the outstanding economic and ecological threat that N. ceranae may represent for honeybees worldwide, many aspects of its biology, including its mode of reproduction, propagation and ploidy, are either very unclear or unknown. In the present study, we set to gain knowledge in these biological aspects by re-sequencing the genome of eight isolates (i.e. a population of spores isolated from one single beehive) of this species harvested from eight geographically distant beehives, and by investigating their level of polymorphism. Consistent with previous analyses performed using single gene sequences, our analyses uncovered the presence of very high genetic diversity within each isolate, but also very little hive-specific polymorphism. Surprisingly, the nature, location and distribution of this genetic variation suggest that beehives around the globe are infected by a population of N. ceranae cells that may be polyploid (4n or more), and possibly clonal. Lastly, phylogenetic analyses based on genome-wide single-nucleotide polymorphism data extracted from these parasites and mitochondrial sequences from their hosts all failed to support the current geographical structure of our isolates. PMID:25914091

  10. SNP microarray analyses reveal copy number alterations and progressive genome reorganization during tumor development in SVT/t driven mice breast cancer

    International Nuclear Information System (INIS)

    Tumor development is known to be a stepwise process involving dynamic changes that affect cellular integrity and cellular behavior. This complex interaction between genomic organization and gene, as well as protein expression is not yet fully understood. Tumor characterization by gene expression analyses is not sufficient, since expression levels are only available as a snapshot of the cell status. So far, research has mainly focused on gene expression profiling or alterations in oncogenes, even though DNA microarray platforms would allow for high-throughput analyses of copy number alterations (CNAs). We analyzed DNA from mouse mammary gland epithelial cells using the Affymetrix Mouse Diversity Genotyping array (MOUSEDIVm520650) and calculated the CNAs. Segmental copy number alterations were computed based on the probeset CNAs using the circular binary segmentation algorithm. Motif search was performed in breakpoint regions (inter-segment regions) with the MEME suite to identify common motif sequences. Here we present a four stage mouse model addressing copy number alterations in tumorigenesis. No considerable changes in CNA were identified for non-transgenic mice, but a stepwise increase in CNA was found during tumor development. The segmental copy number alteration revealed informative chromosomal fragmentation patterns. In inter-segment regions (hypothetical breakpoint sides) unique motifs were found. Our analyses suggest genome reorganization as a stepwise process that involves amplifications and deletions of chromosomal regions. We conclude from distinctive fragmentation patterns that conserved as well as individual breakpoints exist which promote tumorigenesis

  11. Characterization of hemizygous deletions in Citrus using array-Comparative Genomic Hybridization and microsynteny comparisons with the poplar genome

    OpenAIRE

    Usach Antonio; Geraud Marion; Ruiz-Rivero Omar; Iglesias Domingo J; Naranjo Miguel A; Ríos Gabino; Talón Manuel

    2008-01-01

    Abstract Background Many fruit-tree species, including relevant Citrus spp varieties exhibit a reproductive biology that impairs breeding and strongly constrains genetic improvements. In citrus, juvenility increases the generation time while sexual sterility, inbreeding depression and self-incompatibility prevent the production of homozygous cultivars. Genomic technology may provide citrus researchers with a new set of tools to address these various restrictions. In this work, we report a val...

  12. Randomized comparison of next-generation sequencing and array comparative genomic hybridization for preimplantation genetic screening: a pilot study

    OpenAIRE

    Yang, Zhihong; Lin, James; Zhang, John; Fong, Wai Ieng; Li, Pei; Zhao, Rong; Liu, Xiaohong; Podevin, William; Kuang, Yanping; Liu, Jiaen

    2015-01-01

    Background Recent advances in next-generation sequencing (NGS) have provided new methods for preimplantation genetic screening (PGS) of human embryos from in vitro fertilization (IVF) cycles. However, there is still limited information about clinical applications of NGS in IVF and PGS (IVF-PGS) treatments. The present study aimed to investigate the effects of NGS screening on clinical pregnancy and implantation outcomes for PGS patients in comparison to array comparative genomic hybridization...

  13. Homo sapiens, Homo neanderthalensis and the Denisova specimen: New insights on their evolutionary histories using whole-genome comparisons

    OpenAIRE

    Vanessa Rodrigues Paixão-Côrtes; Lucas Henrique Viscardi; Francisco Mauro Salzano; Tábita Hünemeier; Maria Cátira Bortolini

    2012-01-01

    After a brief review of the most recent findings in the study of human evolution, an extensive comparison of the complete genomes of our nearest relative, the chimpanzee (Pan troglodytes), of extant Homo sapiens, archaic Homo neanderthalensis and the Denisova specimen were made. The focus was on non-synonymous mutations, which consequently had an impact on protein levels and these changes were classified according to degree of effect. A total of 10,447 non-synonymous substitutions were found ...

  14. Estimating Am-241 activity in the body: comparison of direct measurements and radiochemical analyses

    International Nuclear Information System (INIS)

    The assessment of dose and ultimately the health risk from intakes of radioactive materials begins with estimating the amount actually taken into the body. An accurate estimate provides the basis to best assess the distribution in the body, the resulting dose, and ultimately the health risk. This study continues the time-honored practice of evaluating the accuracy of results obtained using in vivo measurement methods and techniques. Results from the radiochemical analyses of the 241Am activity content of tissues and organs from four donors to the United States Transuranium and Uranium Registries were compared to the results from direct measurements of radioactive material in the body performed in vivo and post mortem. Two were whole body donations and two were partial body donations The skeleton was the organ with the highest deposition of 241Am activity in all four cases. The activities ranged from 30 Bq to 300 Bq. The skeletal estimates obtained from measurements over the forehead were within 20% of the radiochemistry results in three cases and differed by 78% in one case. The 241Am lung activity estimates ranged from 1 Bq to 30 Bq in the four cases. The results from the direct measurements were within 40% of the radiochemistry results in 3 cases and within a factor of 3 for the other case. The direct measurement estimates of liver activity ranged from 2 Bq to 60 Bq and were generally lower than the radiochemistry results. The results from this study suggest that the measurement methods and calibration techniques used at the In Vivo Radiobioassay and Research Facility to quantify the activity in the lungs, skeleton and liver are reasonable under the most challenging conditions where there is 241Am activity in multiple organs. These methods and techniques are comparable to those used at other Department of Energy sites. This suggests that the current in vivo methods and calibration techniques provide reasonable estimates of radioactive material in the body. Not

  15. Genomic comparison between pathogenic Streptococcus agalactiae isolated from Nile tilapia in Thailand and fish-derived ST7 strains.

    Science.gov (United States)

    Kayansamruaj, Pattanapon; Pirarat, Nopadon; Kondo, Hidehiro; Hirono, Ikuo; Rodkhum, Channarong

    2015-12-01

    Streptococcus agalactiae, or Group B streptococcus (GBS), is a highly virulent pathogen in aquatic animals, causing huge mortalities worldwide. In Thailand, the serotype Ia, β-hemolytic GBS, belonging to sequence type (ST) 7 of clonal complex (CC) 7, was found to be the major cause of streptococcosis outbreaks in fish farms. In this study, we performed an in silico genomic comparison, aiming to investigate the phylogenetic relationship between the pathogenic fish strains of Thai ST7 and other ST7 from different hosts and geographical origins. In general, the genomes of Thai ST7 strains are closely related to other fish ST7s, as the core genome is shared by 92-95% of any individual fish ST7 genome. Among the fish ST7 genomes, we observed only small dissimilarities, based on the analysis of clustered regularly interspaced short palindromic repeats (CRISPRs), surface protein markers, insertions sequence (IS) elements and putative virulence genes. The phylogenetic tree based on single nucleotide polymorphisms (SNPs) of the core genome sequences clearly categorized the ST7 strains according to their geographical and host origins, with the human ST7 being genetically distant from other fish ST7 strains. A pan-genome analysis of ST7 strains detected a 48-kb gene island specifically in the Thai ST7 isolates. The orientations and predicted amino acid sequences of the genes in the island closely matched those of Tn5252, a streptococcal conjugative transposon, in GBS 2603V/R serotype V, Streptococcus pneumoniae and Streptococcus suis. Thus, it was presumed that Thai ST7 acquired this Tn5252 homologue from related streptococci. The close phylogenetic relationship between the fish ST7 strains suggests that these strains were derived from a common ancestor and have diverged in different geographical regions and in different hosts. PMID:26455417

  16. Life on arginine for Mycoplasma hominis: clues from its minimal genome and comparison with other human urogenital mycoplasmas.

    Directory of Open Access Journals (Sweden)

    Sabine Pereyre

    2009-10-01

    Full Text Available Mycoplasma hominis is an opportunistic human mycoplasma. Two other pathogenic human species, M. genitalium and Ureaplasma parvum, reside within the same natural niche as M. hominis: the urogenital tract. These three species have overlapping, but distinct, pathogenic roles. They have minimal genomes and, thus, reduced metabolic capabilities characterized by distinct energy-generating pathways. Analysis of the M. hominis PG21 genome sequence revealed that it is the second smallest genome among self-replicating free living organisms (665,445 bp, 537 coding sequences (CDSs. Five clusters of genes were predicted to have undergone horizontal gene transfer (HGT between M. hominis and the phylogenetically distant U. parvum species. We reconstructed M. hominis metabolic pathways from the predicted genes, with particular emphasis on energy-generating pathways. The Embden-Meyerhoff-Parnas pathway was incomplete, with a single enzyme absent. We identified the three proteins constituting the arginine dihydrolase pathway. This pathway was found essential to promote growth in vivo. The predicted presence of dimethylarginine dimethylaminohydrolase suggested that arginine catabolism is more complex than initially described. This enzyme may have been acquired by HGT from non-mollicute bacteria. Comparison of the three minimal mollicute genomes showed that 247 CDSs were common to all three genomes, whereas 220 CDSs were specific to M. hominis, 172 CDSs were specific to M. genitalium, and 280 CDSs were specific to U. parvum. Within these species-specific genes, two major sets of genes could be identified: one including genes involved in various energy-generating pathways, depending on the energy source used (glucose, urea, or arginine and another involved in cytadherence and virulence. Therefore, a minimal mycoplasma cell, not including cytadherence and virulence-related genes, could be envisaged containing a core genome (247 genes, plus a set of genes required for

  17. Genome-wide association analyses identifies a susceptibility locus for tuberculosis on chromosome 18q11.2

    Science.gov (United States)

    Wong, Sunny H.; Owusu-Dabo, Ellis; Osei, Ivy; Gyapong, John; Sirugo, Giorgio; Sisay-Joof, Fatou; Enimil, Anthony; Chinbuah, Margaret A.; Floyd, Sian; Warndorff, David K.; Sichali, Lifted; Malema, Simon; Crampin, Amelia C.; Ngwira, Bagrey; Teo, Yik Y.; Small, Kerrin; Rockett, Kirk; Kwiatkowski, Dominic; Fine, Paul E.; Hill, Philip C.; Newport, Melanie; Lienhardt, Christian; Adegbola, Richard A.; Corrah, Tumani; Ziegler, Andreas; Morris, Andrew P.; Meyer, Christian G.

    2013-01-01

    We combined two tuberculosis (TB) genome-wide association studies (GWAS) from Ghana and The Gambia with subsequent replication totalling 11,425 participants. A significant association with disease was observed at SNP rs4331426 located in a gene-poor region on chromosome 18q11.2 (P=6.8×10−9, OR=1.19, 95%CI=1.13-1.27). Our finding shows that GWAS can identify novel loci for infectious causes of mortality even in Africa where levels of linkage disequilibrium are particularly low. PMID:20694014

  18. Genome-wide analyses of transcription factor GATA3-mediated gene regulation in distinct T cell types

    OpenAIRE

    Wei, Gang; Abraham, Brian J.; Yagi, Ryoji; Jothi, Raja; Cui, Kairong; Sharma, Suveena; Narlikar, Leelavati; Northrup, Daniel L.; Tang, Qingsong; Paul, William E.; Zhu, Jinfang; Zhao, Keji

    2011-01-01

    The transcription factor GATA3 plays an essential role during T cell development and T helper 2 (Th2) cell differentiation. To understand GATA3-mediated gene regulation, we identified genome-wide GATA3 binding sites in ten well-defined developmental and effector T lymphocyte lineages. In the thymus, GATA3 directly regulated many critical factors, including Th-POK, Notch1 and T cell receptor subunits. In the periphery, GATA3 induced a large number of Th2 cell-specific as well as Th2 cell non-s...

  19. Probing the HIV-1 genomic RNA trafficking pathway and dimerization by genetic recombination and single virion analyses.

    Directory of Open Access Journals (Sweden)

    Michael D Moore

    2009-10-01

    Full Text Available Once transcribed, the nascent full-length RNA of HIV-1 must travel to the appropriate host cell sites to be translated or to find a partner RNA for copackaging to form newly generated viruses. In this report, we sought to delineate the location where HIV-1 RNA initiates dimerization and the influence of the RNA transport pathway used by the virus on downstream events essential to viral replication. Using a cell-fusion-dependent recombination assay, we demonstrate that the two RNAs destined for copackaging into the same virion select each other mostly within the cytoplasm. Moreover, by manipulating the RNA export element in the viral genome, we show that the export pathway taken is important for the ability of RNA molecules derived from two viruses to interact and be copackaged. These results further illustrate that at the point of dimerization the two main cellular export pathways are partially distinct. Lastly, by providing Gag in trans, we have demonstrated that Gag is able to package RNA from either export pathway, irrespective of the transport pathway used by the gag mRNA. These findings provide unique insights into the process of RNA export in general, and more specifically, of HIV-1 genomic RNA trafficking.

  20. The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons

    OpenAIRE

    Braasch, Ingo; Gehrke, Andrew R; Smith, Jeramiah J.; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M.; Campbell, Michael S; Barrell, Daniel; Martin, Kyle J.; Mulley, John F

    2016-01-01

    To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before teleost genome duplication (TGD). The slowly evolving gar genome has conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization and development (mediated, for example, by Hox, ParaHox and microRNA genes). Numerous conse...

  1. Assembly-free genome comparison based on next-generation sequencing reads and variable length patterns

    OpenAIRE

    Comin, Matteo; Schimd, Michele

    2014-01-01

    Background With the advent of Next-Generation Sequencing technologies (NGS), a large amount of short read data has been generated. If a reference genome is not available, the assembly of a template sequence is usually challenging because of repeats and the short length of reads. When NGS reads cannot be mapped onto a reference genome alignment-based methods are not applicable. However it is still possible to study the evolutionary relationship of unassembled genomes based on NGS data. Results...

  2. Comparative genomic and functional analysis of 100 Lactobacillus rhamnosus strains and their comparison with strain GG

    OpenAIRE

    Douillard, F.P.; Ribbera, A.; Kant, R.; Pietilä, T.E.; Järvinen, H.M.; Messing, M.; Randazzo, C.L.; Paulin, L.; Laine, P.K.; Ritari, J.; Caggia, C.; Lähteinen, T.; Brouns, S.J.J.; Satokari, R.M.; Ossowski, von, I.

    2013-01-01

    Lactobacillus rhamnosus is a lactic acid bacterium that is found in a large variety of ecological habitats, including artisanal and industrial dairy products, the oral cavity, intestinal tract or vagina. To gain insights into the genetic complexity and ecological versatility of the species L. rhamnosus, we examined the genomes and phenotypes of 100 L. rhamnosus strains isolated from diverse sources. The genomes of 100 L. rhamnosus strains were mapped onto the L. rhamnosus GG reference genome....

  3. Comparative Genomic and Functional Analysis of 100 Lactobacillus rhamnosus Strains and Their Comparison with Strain GG

    OpenAIRE

    Douillard, François P.; Ribbera, Angela; Kant, Ravi; Pietilä, Taija E.; Järvinen, Hanna M.; Messing, Marcel; Randazzo, Cinzia L.; Paulin, Lars; Laine, Pia; Ritari, Jarmo; Caggia, Cinzia; Lähteinen, Tanja; Brouns, Stan J. J.; Satokari, Reetta; von Ossowski, Ingemar

    2013-01-01

    Lactobacillus rhamnosus is a lactic acid bacterium that is found in a large variety of ecological habitats, including artisanal and industrial dairy products, the oral cavity, intestinal tract or vagina. To gain insights into the genetic complexity and ecological versatility of the species L. rhamnosus, we examined the genomes and phenotypes of 100 L. rhamnosus strains isolated from diverse sources. The genomes of 100 L. rhamnosus strains were mapped onto the L. rhamnosus GG reference genome....

  4. Genomic Comparison of the Closely-Related Salmonella enterica Serovars Enteritidis, Dublin and Gallinarum

    OpenAIRE

    Matthews, T. David; Schmieder, Robert; Silva, Genivaldo G. Z.; Busch, Julia; Cassman, Noriko; Dutilh, Bas E.; Green, Dawn; Matlock, Brian; Heffernan, Brian; Olsen, Gary J.; Farris Hanna, Leigh; Schifferli, Dieter M; Maloy, Stanley; Dinsdale, Elizabeth A.; Edwards, Robert A.

    2015-01-01

    The Salmonella enterica serovars Enteritidis, Dublin, and Gallinarum are closely related but differ in virulence and host range. To identify the genetic elements responsible for these differences and to better understand how these serovars are evolving, we sequenced the genomes of Enteritidis strain LK5 and Dublin strain SARB12 and compared these genomes to the publicly available Enteritidis P125109, Dublin CT 02021853 and Dublin SD3246 genome sequences. We also compared the publicly availabl...

  5. Comparison of complete mitochondrial genome sequence between different ethnic groups from Southern Africa / by Jake Darby

    OpenAIRE

    Darby, Jake

    2004-01-01

    The human mitochondrial genome is a 16,569 base pair double-stranded deoxyribonucleic acid (DNA) molecule located in the mitochondrion. The mitochondria1 genome sequence is conserved, maternally inherited and undergoes no recombination, making its evaluation ideal for evolutionary studies. Certain alterations in the genome are unique to specific human populations and haplogroups. The genetic background, or haplogroup of an individual, may act in concert with disease associated ...

  6. Multi-platform whole-genome microarray analyses refine the epigenetic signature of breast cancer metastasis with gene expression and copy number.

    Directory of Open Access Journals (Sweden)

    Joseph Andrews

    Full Text Available BACKGROUND: We have previously identified genome-wide DNA methylation changes in a cell line model of breast cancer metastasis. These complex epigenetic changes that we observed, along with concurrent karyotype analyses, have led us to hypothesize that complex genomic alterations in cancer cells (deletions, translocations and ploidy are superimposed over promoter-specific methylation events that are responsible for gene-specific expression changes observed in breast cancer metastasis. METHODOLOGY/PRINCIPAL FINDINGS: We undertook simultaneous high-resolution, whole-genome analyses of MDA-MB-468GFP and MDA-MB-468GFP-LN human breast cancer cell lines (an isogenic, paired lymphatic metastasis cell line model using Affymetrix gene expression (U133, promoter (1.0R, and SNP/CNV (SNP 6.0 microarray platforms to correlate data from gene expression, epigenetic (DNA methylation, and combination copy number variant/single nucleotide polymorphism microarrays. Using Partek Software and Ingenuity Pathway Analysis we integrated datasets from these three platforms and detected multiple hypomethylation and hypermethylation events. Many of these epigenetic alterations correlated with gene expression changes. In addition, gene dosage events correlated with the karyotypic differences observed between the cell lines and were reflected in specific promoter methylation patterns. Gene subsets were identified that correlated hyper (and hypo methylation with the loss (or gain of gene expression and in parallel, with gene dosage losses and gains, respectively. Individual gene targets from these subsets were also validated for their methylation, expression and copy number status, and susceptible gene pathways were identified that may indicate how selective advantage drives the processes of tumourigenesis and metastasis. CONCLUSIONS/SIGNIFICANCE: Our approach allows more precisely profiling of functionally relevant epigenetic signatures that are associated with cancer

  7. Analysis of genomic alterations in neuroblastoma by multiplex ligation-dependent probe amplification and array comparative genomic hybridization: a comparison of results.

    Science.gov (United States)

    Combaret, Valérie; Iacono, Isabelle; Bréjon, Stéphanie; Schleiermacher, Gudrun; Pierron, Gäelle; Couturier, Jérôme; Bergeron, Christophe; Blay, Jean-Yves

    2012-12-01

    In cases of neuroblastoma, recurring genetic alterations--losses of the 1p, 3p, 4p, and 11q and/or gains of 1q, 2p, and 17q chromosome arms--are currently used to define the therapeutic strategy in therapeutic protocols for low- and intermediate-risk patients. Different genome-wide analysis techniques, such as array comparative genomic hybridization (aCGH) or multiplex ligation-dependent probe amplification (MLPA), have been suggested for detecting chromosome segmental abnormalities. In this study, we compared the results of the two technologies in the analyses of the DNA of tumor samples from 91 neuroblastoma patients. Similar results were obtained with the two techniques for 75 samples (82%). In five cases (5.5%), the MLPA results were not interpretable. Discrepancies between the aCGH and MLPA results were observed in 11 cases (12%). Among the discrepancies, a 18q21.2-qter gain and 16p11.2 and 11q14.1-q14.3 losses were detected only by aCGH. The MLPA results showed that the 7p, 7q, and 14q chromosome arms were affected in six cases, while in two cases, 2p and 17q gains were observed; these results were confirmed by neither aCGH nor fluorescence in situ hybridization (FISH) analysis. Because of the higher sensitivity and specificity of genome-wide information, reasonable cost, and shorter time of aCGH analysis, we recommend the aCGH procedure for the analysis of genomic alterations in neuroblastoma. PMID:23265803

  8. High-throughput generation of an activation-tagged mutant library for functional genomic analyses in tobacco.

    Science.gov (United States)

    Liu, Feng; Gong, Daping; Zhang, Qian; Wang, Dawei; Cui, Mengmeng; Zhang, Zhiguo; Liu, Guanshan; Wu, Jinxia; Wang, Yuanying

    2015-03-01

    Tobacco (Nicotiana tabacum L.) is an ideal model system for molecular biological and genetic studies. In this study, activation tagging was used to generate approximately 100,000 transgenic tobacco plants. Southern blot analysis indicated that there were 1.6 T-DNA inserts per line on average in our transformed population. The phenotypes observed include abnormalities in leaf and flower morphology, plant height, flowering time, branching, and fertility. Among 6,000 plants in the T0 generation, 57 displayed obvious phenotypes. Among 4,105 lines in the T1 generation, 311 displayed abnormal phenotypes. Fusion primer and nested integrated PCR was used to identify 963 independent genomic loci of T-DNA insertion sites in 1,257 T1 lines. The distribution of T-DNA insertions was non-uniform and correlated well with the predicted gene density along each chromosome. The insertions were biased toward genic regions and noncoding regions within 5 kb of a gene. Fifteen plants that showed the same phenotype as their parent with a dominant pattern in the T2 generation were chosen randomly to detect the expression levels of genes adjacent to the T-DNA integration sites by semi-quantitative RT-PCR. Fifteen candidate genes were identified. Activation was observed in 7 out of the 15 adjacent genes, including one that was located 13.1 kb away from the enhancer sequence. The activation-tagged population described in this paper will be a highly valuable resource for tobacco functional genomics research using both forward and reverse genetic approaches. PMID:25408504

  9. Genomic analyses and transcriptional profiles of the glycoside hydrolase family 18 genes of the entomopathogenic fungus Metarhizium anisopliae.

    Directory of Open Access Journals (Sweden)

    Ângela Junges

    Full Text Available Fungal chitin metabolism involves diverse processes such as metabolically active cell wall maintenance, basic nutrition, and different aspects of virulence. Chitinases are enzymes belonging to the glycoside hydrolase family 18 (GH18 and 19 (GH19 and are responsible for the hydrolysis of β-1,4-linkages in chitin. This linear homopolymer of N-acetyl-β-D-glucosamine is an essential constituent of fungal cell walls and arthropod exoskeletons. Several chitinases have been directly implicated in structural, morphogenetic, autolytic and nutritional activities of fungal cells. In the entomopathogen Metarhizium anisopliae, chitinases are also involved in virulence. Filamentous fungi genomes exhibit a higher number of chitinase-coding genes than bacteria or yeasts. The survey performed in the M. anisopliae genome has successfully identified 24 genes belonging to glycoside hydrolase family 18, including three previously experimentally determined chitinase-coding genes named chit1, chi2 and chi3. These putative chitinases were classified based on domain organization and phylogenetic analysis into the previously described A, B and C chitinase subgroups, and into a new subgroup D. Moreover, three GH18 proteins could be classified as putative endo-N-acetyl-β-D-glucosaminidases, enzymes that are associated with deglycosylation and were therefore assigned to a new subgroup E. The transcriptional profile of the GH18 genes was evaluated by qPCR with RNA extracted from eight culture conditions, representing different stages of development or different nutritional states. The transcripts from the GH18 genes were detected in at least one of the different M. anisopliae developmental stages, thus validating the proposed genes. Moreover, not all members from the same chitinase subgroup presented equal patterns of transcript expression under the eight distinct conditions studied. The determination of M. anisopliae chitinases and ENGases and a more detailed study

  10. Genomic, proteomic, and biochemical analyses of oleaginous Mucor circinelloides: evaluating its capability in utilizing cellulolytic substrates for lipid production.

    Directory of Open Access Journals (Sweden)

    Hui Wei

    Full Text Available Lipid production by oleaginous microorganisms is a promising route to produce raw material for the production of biodiesel. However, most of these organisms must be grown on sugars and agro-industrial wastes because they cannot directly utilize lignocellulosic substrates. We report the first comprehensive investigation of Mucor circinelloides, one of a few oleaginous fungi for which genome sequences are available, for its potential to assimilate cellulose and produce lipids. Our genomic analysis revealed the existence of genes encoding 13 endoglucanases (7 of them secretory, 3 β-D-glucosidases (2 of them secretory and 243 other glycoside hydrolase (GH proteins, but not genes for exoglucanases such as cellobiohydrolases (CBH that are required for breakdown of cellulose to cellobiose. Analysis of the major PAGE gel bands of secretome proteins confirmed expression of two secretory endoglucanases and one β-D-glucosidase, along with a set of accessory cell wall-degrading enzymes and 11 proteins of unknown function. We found that M. circinelloides can grow on CMC (carboxymethyl cellulose and cellobiose, confirming the enzymatic activities of endoglucanases and β-D-glucosidases, respectively. The data suggested that M. circinelloides could be made usable as a consolidated bioprocessing (CBP strain by introducing a CBH (e.g. CBHI into the microorganism. This proposal was validated by our demonstration that M. circinelloides growing on Avicel supplemented with CBHI produced about 33% of the lipid that was generated in glucose medium. Furthermore, fatty acid methyl ester (FAME analysis showed that when growing on pre-saccharified Avicel substrates, it produced a higher proportion of C14 fatty acids, which has an interesting implication in that shorter fatty acid chains have characteristics that are ideal for use in jet fuel. This substrate-specific shift in FAME profile warrants further investigation.

  11. The Complete Mitochondrial Genome Sequence of Bactericera cockerelli and Comparison with Three Other Psylloidea Species.

    Directory of Open Access Journals (Sweden)

    Fengnian Wu

    Full Text Available Potato psyllid (Bactericera cockerelli is an important pest of potato, tomato and pepper. Not only could a toxin secreted by nymphs results in serious phytotoxemia in some host plants, but also over the past few years B. cockerelli was shown to transmit "Candidatus Liberibacter solanacearum", the putative bacterial pathogen of potato zebra chip (ZC disease, to potato and tomato. ZC has caused devastating losses to potato production in the western U.S., Mexico, and elsewhere. New knowledge of the genetic diversity of the B. cockerelli is needed to develop improved strategies to manage pest populations. Mitochondrial genome (mitogenome sequencing provides important knowledge about insect evolution and diversity in and among populations. This report provides the first complete B. cockerelli mitogenome sequence as determined by next generation sequencing technology (Illumina MiSeq. The circular B. cockerelli mitogenome had a size of 15,220 bp with 13 protein-coding gene (PCGs, 2 ribosomal RNA genes (rRNAs, 22 transfer RNA genes (tRNAs, and a non-coding region of 975 bp. The overall gene order of the B. cockerelli mitogenome is identical to three other published Psylloidea mitogenomes: one species from the Triozidae, Paratrioza sinica; and two species from the Psyllidae, Cacopsylla coccinea and Pachypsylla venusta. This suggests all of these species share a common ancestral mitogenome. However, sequence analyses revealed differences between and among the insect families, in particular a unique region that can be folded into three stem-loop secondary structures present only within the B. cockerelli mitogenome. A phylogenetic tree based on the 13 PCGs matched an existing taxonomy scheme that was based on morphological characteristics. The available complete mitogenome sequence makes it accessible to all genes for future population diversity evaluation of B. cockerelli.

  12. The Complete Mitochondrial Genome Sequence of Bactericera cockerelli and Comparison with Three Other Psylloidea Species

    Science.gov (United States)

    Wu, Fengnian; Cen, Yijing; Wallis, Christopher M.; Trumble, John T.; Prager, Sean; Yokomi, Ray; Zheng, Zheng; Deng, Xiaoling; Chen, Jianchi; Liang, Guangwen

    2016-01-01

    Potato psyllid (Bactericera cockerelli) is an important pest of potato, tomato and pepper. Not only could a toxin secreted by nymphs results in serious phytotoxemia in some host plants, but also over the past few years B. cockerelli was shown to transmit “Candidatus Liberibacter solanacearum”, the putative bacterial pathogen of potato zebra chip (ZC) disease, to potato and tomato. ZC has caused devastating losses to potato production in the western U.S., Mexico, and elsewhere. New knowledge of the genetic diversity of the B. cockerelli is needed to develop improved strategies to manage pest populations. Mitochondrial genome (mitogenome) sequencing provides important knowledge about insect evolution and diversity in and among populations. This report provides the first complete B. cockerelli mitogenome sequence as determined by next generation sequencing technology (Illumina MiSeq). The circular B. cockerelli mitogenome had a size of 15,220 bp with 13 protein-coding gene (PCGs), 2 ribosomal RNA genes (rRNAs), 22 transfer RNA genes (tRNAs), and a non-coding region of 975 bp. The overall gene order of the B. cockerelli mitogenome is identical to three other published Psylloidea mitogenomes: one species from the Triozidae, Paratrioza sinica; and two species from the Psyllidae, Cacopsylla coccinea and Pachypsylla venusta. This suggests all of these species share a common ancestral mitogenome. However, sequence analyses revealed differences between and among the insect families, in particular a unique region that can be folded into three stem-loop secondary structures present only within the B. cockerelli mitogenome. A phylogenetic tree based on the 13 PCGs matched an existing taxonomy scheme that was based on morphological characteristics. The available complete mitogenome sequence makes it accessible to all genes for future population diversity evaluation of B. cockerelli. PMID:27227976

  13. The glutathione peroxidase gene family of Lotus japonicus. Characterization of genomic clones, expression analyses and immunolocalization in legumes

    OpenAIRE

    Ramos Escribano, Javier; Matamoros Galindo, Manuel Ángel; Naya, Loreto; James, Euan Kevin; Rouhier, Nicolas; Sato, Shusei; Tabata, Satoshi; Becana Ausejo, Manuel

    2009-01-01

    • Despite the multiple roles played by antioxidants in rhizobia–legume symbioses, little is known about glutathione peroxidases (GPXs) in legumes. Here the characterization of six GPX genes of Lotus japonicus is reported. • Expression of GPX genes was analysed by quantitative reverse transcription–polymerase chain reaction in L. japonicus and Lotus corniculatus plants exposed to various treatments known to generate reactive oxygen and/or nitrogen species. • LjGPX1 and LjGPX3 were the ...

  14. [Kohonen network study of the results of RAPD and ISSR analyses of genomic polymorphism in the genus Capsicum L].

    Science.gov (United States)

    Ruanet, V V; Kochieva, E Z; Ryzhova, N N

    2005-02-01

    The results of studies based on multilocus molecular analyses, including random amplified polymorphic DNA (RAPD), inter-simple sequence repeat (ISSR), and amplified fragment length polymorphism (AFLP) analyses, are usually presented in the form of images (electrophoregrams, photographs, etc.). The interpretation of this information is complicated, labor-consuming, and subjective. Artificial neural networks (ANNs), which are ideal "image processors," may be useful when solving such tasks. The possibility of using ANNs for the treatment of the results of RAPD and ISSR analyses has been studied. The RAPD and ISSR spectra have been studied in fragments of DNA of plants from the genus Capsicum L. (peppers). The results of clustering the accessions studied by means of the unweighted pair-group method with arithmetic averages (UPGMA), which is often used for phylogenetic constructions based on RAPD and ISSR data, serve as expert estimates. Fundamentally new methods of genetic polymorphism estimation using ANN technologies, namely, self-organizing feature maps (SOFMs) have been developed. The results show that the clusters obtained with the use of UPGMA and SOFM coincide by more than 90%; taking into account that ANNs can deal with high noise levels and incomplete or contradictory data, the approach proposed may prove to be efficient. PMID:15810617

  15. Powerful bivariate genome-wide association analyses suggest the SOX6 gene influencing both obesity and osteoporosis phenotypes in males.

    Directory of Open Access Journals (Sweden)

    Yao-Zhong Liu

    Full Text Available BACKGROUND: Current genome-wide association studies (GWAS are normally implemented in a univariate framework and analyze different phenotypes in isolation. This univariate approach ignores the potential genetic correlation between important disease traits. Hence this approach is difficult to detect pleiotropic genes, which may exist for obesity and osteoporosis, two common diseases of major public health importance that are closely correlated genetically. PRINCIPAL FINDINGS: To identify such pleiotropic genes and the key mechanistic links between the two diseases, we here performed the first bivariate GWAS of obesity and osteoporosis. We searched for genes underlying co-variation of the obesity phenotype, body mass index (BMI, with the osteoporosis risk phenotype, hip bone mineral density (BMD, scanning approximately 380,000 SNPs in 1,000 unrelated homogeneous Caucasians, including 499 males and 501 females. We identified in the male subjects two SNPs in intron 1 of the SOX6 (SRY-box 6 gene, rs297325 and rs4756846, which were bivariately associated with both BMI and hip BMD, achieving p values of 6.82x10(-7 and 1.47x10(-6, respectively. The two SNPs ranked at the top in significance for bivariate association with BMI and hip BMD in the male subjects among all the approximately 380,000 SNPs examined genome-wide. The two SNPs were replicated in a Framingham Heart Study (FHS cohort containing 3,355 Caucasians (1,370 males and 1,985 females from 975 families. In the FHS male subjects, the two SNPs achieved p values of 0.03 and 0.02, respectively, for bivariate association with BMI and femoral neck BMD. Interestingly, SOX6 was previously found to be essential to both cartilage formation/chondrogenesis and obesity-related insulin resistance, suggesting the gene's dual role in both bone and fat. CONCLUSIONS: Our findings, together with the prior biological evidence, suggest the SOX6 gene's importance in co-regulation of obesity and osteoporosis.

  16. The unique architecture and function of cellulose-interacting proteins in oomycetes revealed by genomic and structural analyses

    Directory of Open Access Journals (Sweden)

    Larroque Mathieu

    2012-11-01

    Full Text Available Abstract Background Oomycetes are fungal-like microorganisms evolutionary distinct from true fungi, belonging to the Stramenopile lineage and comprising major plant pathogens. Both oomycetes and fungi express proteins able to interact with cellulose, a major component of plant and oomycete cell walls, through the presence of carbohydrate-binding module belonging to the family 1 (CBM1. Fungal CBM1-containing proteins were implicated in cellulose degradation whereas in oomycetes, the Cellulose Binding Elicitor Lectin (CBEL, a well-characterized CBM1-protein from Phytophthora parasitica, was implicated in cell wall integrity, adhesion to cellulosic substrates and induction of plant immunity. Results To extend our knowledge on CBM1-containing proteins in oomycetes, we have conducted a comprehensive analysis on 60 fungi and 7 oomycetes genomes leading to the identification of 518 CBM1-containing proteins. In plant-interacting microorganisms, the larger number of CBM1-protein coding genes is expressed by necrotroph and hemibiotrophic pathogens, whereas a strong reduction of these genes is observed in symbionts and biotrophs. In fungi, more than 70% of CBM1-containing proteins correspond to enzymatic proteins in which CBM1 is associated with a catalytic unit involved in cellulose degradation. In oomycetes more than 90% of proteins are similar to CBEL in which CBM1 is associated with a non-catalytic PAN/Apple domain, known to interact with specific carbohydrates or proteins. Distinct Stramenopile genomes like diatoms and brown algae are devoid of CBM1 coding genes. A CBM1-PAN/Apple association 3D structural modeling was built allowing the identification of amino acid residues interacting with cellulose and suggesting the putative interaction of the PAN/Apple domain with another type of glucan. By Surface Plasmon Resonance experiments, we showed that CBEL binds to glycoproteins through galactose or N-acetyl-galactosamine motifs. Conclusions This study

  17. Integrative genomic analyses of the RNA-binding protein, RNPC1, and its potential role in cancer prediction.

    Science.gov (United States)

    Ding, Zhiming; Yang, Hai-Wei; Xia, Tian-Song; Wang, Bo; Ding, Qiang

    2015-08-01

    The RNA binding motif protein 38 (RBM38, also known as RNPC1) plays a pivotal role in regulating a wide range of biological processes, from cell proliferation and cell cycle arrest to cell myogenic differentiation. It was originally recognized as an oncogene, and was frequently found to be amplified in prostate, ovarian and colorectal cancer, chronic lymphocytic leukemia, colon carcinoma, esophageal cancer, dog lymphomas and breast cancer. In the present study, the complete RNPC1 gene was identified in a number of vertebrate genomes, suggesting that RNPC1 exists in all types of vertebrates, including fish, amphibians, birds and mammals. In the different genomes, the gene had a similar 4 exon/3 intron organization, and all the genetic loci were syntenically conserved. The phylogenetic tree demonstrated that the RNPC1 gene from the mammalian, bird, reptile and teleost lineage formed a species-specific cluster. A total of 34 functionally relevant single nucleotide polymorphisms (SNPs), including 14 SNPs causing missense mutations, 8 exonic splicing enhancer SNPs and 12 SNPs causing nonsense mutations, were identified in the human RNPC1 gene. RNPC1 was found to be expressed in bladder, blood, brain, breast, colorectal, eye, head and neck, lung, ovarian, skin and soft tissue cancer. In 14 of the 94 tests, an association between RNPC1 gene expression and cancer prognosis was observed. We found that the association between the expression of RNPC1 and prognosis varied in different types of cancer, and even in the same type of cancer from the different databases used. This suggests that the function of RNPC1 in these tumors may be multidimensional. The sex determining region Y (SRY)-box 5 (Sox5), runt-related transcription factor 3 (RUNX3), CCAAT displacement protein 1 (CUTL1), v-rel avian reticuloendotheliosis viral oncogene homolog (Rel)A, peroxisome proliferator-activated receptor γ isoform 2 (PPARγ2) and activating transcription factor 6 (ATF6) regulatory

  18. Integrative genomic analyses identify BRF2 as a novel lineage-specific oncogene in lung squamous cell carcinoma.

    Directory of Open Access Journals (Sweden)

    William W Lockwood

    2010-07-01

    Full Text Available BACKGROUND: Traditionally, non-small cell lung cancer is treated as a single disease entity in terms of systemic therapy. Emerging evidence suggests the major subtypes--adenocarcinoma (AC and squamous cell carcinoma (SqCC--respond differently to therapy. Identification of the molecular differences between these tumor types will have a significant impact in designing novel therapies that can improve the treatment outcome. METHODS AND FINDINGS: We used an integrative genomics approach, combing high-resolution comparative genomic hybridization and gene expression microarray profiles, to compare AC and SqCC tumors in order to uncover alterations at the DNA level, with corresponding gene transcription changes, which are selected for during development of lung cancer subtypes. Through the analysis of multiple independent cohorts of clinical tumor samples (>330, normal lung tissues and bronchial epithelial cells obtained by bronchial brushing in smokers without lung cancer, we identified the overexpression of BRF2, a gene on Chromosome 8p12, which is specific for development of SqCC of lung. Genetic activation of BRF2, which encodes a RNA polymerase III (Pol III transcription initiation factor, was found to be associated with increased expression of small nuclear RNAs (snRNAs that are involved in processes essential for cell growth, such as RNA splicing. Ectopic expression of BRF2 in human bronchial epithelial cells induced a transformed phenotype and demonstrates downstream oncogenic effects, whereas RNA interference (RNAi-mediated knockdown suppressed growth and colony formation of SqCC cells overexpressing BRF2, but not AC cells. Frequent activation of BRF2 in >35% preinvasive bronchial carcinoma in situ, as well as in dysplastic lesions, provides evidence that BRF2 expression is an early event in cancer development of this cell lineage. CONCLUSIONS: This is the first study, to our knowledge, to show that the focal amplification of a gene in

  19. Comparative genomic analyses identify common molecular pathways modulated upon exposure to low doses of arsenic and cadmium

    Directory of Open Access Journals (Sweden)

    Fry Rebecca C

    2011-04-01

    Full Text Available Abstract Background Exposure to the toxic metals arsenic and cadmium is associated with detrimental health effects including cancers of various organs. While arsenic and cadmium are well known to cause adverse health effects at high doses, the molecular impact resulting from exposure to environmentally relevant doses of these metals remains largely unexplored. Results In this study, we examined the effects of in vitro exposure to either arsenic or cadmium in human TK6 lymphoblastoid cells using genomics and systems level pathway mapping approaches. A total of 167 genes with differential expression were identified following exposure to either metal with surprisingly no overlap between the two. Real-time PCR was used to confirm target gene expression changes. The gene sets were overlaid onto protein-protein interaction maps to identify metal-induced transcriptional networks. Interestingly, both metal-induced networks were significantly enriched for proteins involved in common biological processes such as tumorigenesis, inflammation, and cell signaling. These findings were further supported by gene set enrichment analysis. Conclusions This study is the first to compare the transcriptional responses induced by low dose exposure to cadmium and arsenic in human lymphoblastoid cells. These results highlight that even at low levels of exposure both metals can dramatically influence the expression of important cellular pathways.

  20. Differential gene expression from genome-wide microarray analyses distinguishes Lohmann Selected Leghorn and Lohmann Brown layers.

    Directory of Open Access Journals (Sweden)

    Christin Habig

    Full Text Available The Lohmann Selected Leghorn (LSL and Lohmann Brown (LB layer lines have been selected for high egg production since more than 50 years and belong to the worldwide leading commercial layer lines. The objectives of the present study were to characterize the molecular processes that are different among these two layer lines using whole genome RNA expression profiles. The hens were kept in the newly developed small group housing system Eurovent German with two different group sizes. Differential expression was observed for 6,276 microarray probes (FDR adjusted P-value <0.05 among the two layer lines LSL and LB. A 2-fold or greater change in gene expression was identified on 151 probe sets. In LSL, 72 of the 151 probe sets were up- and 79 of them were down-regulated. Gene ontology (GO enrichment analysis accounting for biological processes evinced 18 GO-terms for the 72 probe sets with higher expression in LSL, especially those taking part in immune system processes and membrane organization. A total of 32 enriched GO-terms were determined among the 79 down-regulated probe sets of LSL. Particularly, these terms included phosphorus metabolic processes and signaling pathways. In conclusion, the phenotypic differences among the two layer lines LSL and LB are clearly reflected in their gene expression profiles of the cerebrum. These novel findings provide clues for genes involved in economically important line characteristics of commercial laying hens.

  1. Unique features of odorant-binding proteins of the parasitoid wasp Nasonia vitripennis revealed by genome annotation and comparative analyses.

    Directory of Open Access Journals (Sweden)

    Filipe G Vieira

    Full Text Available Insects are the most diverse group of animals on the planet, comprising over 90% of all metazoan life forms, and have adapted to a wide diversity of ecosystems in nearly all environments. They have evolved highly sensitive chemical senses that are central to their interaction with their environment and to communication between individuals. Understanding the molecular bases of insect olfaction is therefore of great importance from both a basic and applied perspective. Odorant binding proteins (OBPs are some of most abundant proteins found in insect olfactory organs, where they are the first component of the olfactory transduction cascade, carrying odorant molecules to the olfactory receptors. We carried out a search for OBPs in the genome of the parasitoid wasp Nasonia vitripennis and identified 90 sequences encoding putative OBPs. This is the largest OBP family so far reported in insects. We report unique features of the N. vitripennis OBPs, including the presence and evolutionary origin of a new subfamily of double-domain OBPs (consisting of two concatenated OBP domains, the loss of conserved cysteine residues and the expression of pseudogenes. This study also demonstrates the extremely dynamic evolution of the insect OBP family: (i the number of different OBPs can vary greatly between species; (ii the sequences are highly diverse, sometimes as a result of positive selection pressure with even the canonical cysteines being lost; (iii new lineage specific domain arrangements can arise, such as the double domain OBP subfamily of wasps and mosquitoes.

  2. Multigenic Control of Pod Shattering Resistance in Chinese Rapeseed Germplasm Revealed by Genome-Wide Association and Linkage Analyses.

    Science.gov (United States)

    Liu, Jia; Wang, Jun; Wang, Hui; Wang, Wenxiang; Zhou, Rijin; Mei, Desheng; Cheng, Hongtao; Yang, Juan; Raman, Harsh; Hu, Qiong

    2016-01-01

    The majority of rapeseed cultivars shatter seeds upon maturity especially under hot-dry and windy conditions, reducing yield and gross margin return to growers. Here, we identified quantitative trait loci (QTL) for resistance to pod shatter in an unstructured diverse panel of 143 rapeseed accessions, and two structured populations derived from bi-parental doubled haploid (DH) and inter-mated (IF2) crosses derived from R1 (resistant to pod shattering) and R2 (prone to pod shattering) accessions. Genome-wide association analysis identified six significant QTL for resistance to pod shatter located on chromosomes A01, A06, A07, A09, C02, and C05. Two of the QTL, qSRI.A09 delimited with the SNP marker Bn-A09-p30171993 (A09) and qSRI.A06 delimited with the SNP marker Bn-A06-p115948 (A06) could be repeatedly detected across environments in a diversity panel, DH and IF2 populations, suggesting that at least two loci on chromosomes A06 and A09 were the main contributors to pod shatter resistance in Chinese germplasm. Significant SNP markers identified in this study especially those that appeared repeatedly across environments provide a cost-effective and an efficient method for introgression and pyramiding of favorable alleles for pod shatter resistance via marker-assisted selection in rapeseed improvement programs. PMID:27493651

  3. Reconstruction of thermotolerant yeast by one-point mutation identified through whole-genome analyses of adaptively-evolved strains.

    Science.gov (United States)

    Satomura, Atsushi; Miura, Natsuko; Kuroda, Kouichi; Ueda, Mitsuyoshi

    2016-01-01

    Saccharomyces cerevisiae is used as a host strain in bioproduction, because of its rapid growth, ease of genetic manipulation, and high reducing capacity. However, the heat produced during the fermentation processes inhibits the biological activities and growth of the yeast cells. We performed whole-genome sequencing of 19 intermediate strains previously obtained during adaptation experiments under heat stress; 49 mutations were found in the adaptation steps. Phylogenetic tree revealed at least five events in which these strains had acquired mutations in the CDC25 gene. Reconstructed CDC25 point mutants based on a parental strain had acquired thermotolerance without any growth defects. These mutations led to the downregulation of the cAMP-dependent protein kinase (PKA) signaling pathway, which controls a variety of processes such as cell-cycle progression and stress tolerance. The one-point mutations in CDC25 were involved in the global transcriptional regulation through the cAMP/PKA pathway. Additionally, the mutations enabled efficient ethanol fermentation at 39 °C, suggesting that the one-point mutations in CDC25 may contribute to bioproduction. PMID:26984760

  4. Complete mitochondrial genome of Military Macaw (Ara militaris): its comparison with mitogenomes of two other Ara species.

    Science.gov (United States)

    Dawid Urantowka, Adam

    2016-09-01

    The Military Macaw is one of the eight species of the genus Ara. The genus is one of six genera, which form morphologically diverse group termed as Macaws. Parrots of this group differ in body size on demand of the genus and species. Six of Ara species are classified as large Macaws. Based on morphological similarities and differences, these species can be segregated into three pairs according to their plumage coloration. Representative mitochondrial genomes were sequenced only for A. glaucogularis (blue and yellow coloration) and A. macao (predominantly red/scarlet). Ara militaris is one of two predominantly green species and full mitochondrial genome of considered species was sequenced in this study. It's comparison with A. glaucogularis and A. macao mitogenomes revealed higher degree of identity between militaris and macao sequences than between militaris and glaucogularis mtDNAs. Ara militaris mitogenome will be indispensable to refine the phylogenetic relationships within Macaw group. PMID:25703844

  5. Cross-Genome Comparisons of Newly Identified Domains in Mycoplasma gallisepticum and Domain Architectures with Other Mycoplasma species

    Directory of Open Access Journals (Sweden)

    Chandra Sekhar Reddy Chilamakuri

    2011-01-01

    Full Text Available Accurate functional annotation of protein sequences is hampered by important factors such as the failure of sequence search methods to identify relationships and the inherent diversity in function of proteins related at low sequence similarities. Earlier, we had employed intermediate sequence search approach to establish new domain relationships in the unassigned regions of gene products at the whole genome level by taking Mycoplasma gallisepticum as a specific example and established new domain relationships. In this paper, we report a detailed comparison of the conservation status of the domain and domain architectures of the gene products that bear our newly predicted domains amongst 14 other Mycoplasma genomes and reported the probable implications for the organisms. Some of the domain associations, observed in Mycoplasma that afflict humans and other non-human primates, are involved in regulation of solute transport and DNA binding suggesting specific modes of host-pathogen interactions.

  6. Genomic analyses of cherry rusty mottle group and cherry twisted leaf-associated viruses reveal a possible new genus within the family betaflexiviridae.

    Science.gov (United States)

    Villamor, D E V; Susaimuthu, J; Eastwell, K C

    2015-03-01

    It is demonstrated that closely related viruses within the family Betaflexiviridae are associated with a number of diseases that affect sweet cherry (Prunus avium) and other Prunus spp. Cherry rusty mottle-associated virus (CRMaV) is correlated with the appearance of cherry rusty mottle disease (CRMD), and Cherry twisted leaf-associated virus (CTLaV) is linked to cherry twisted leaf disease (CTLD) and apricot ringpox disease (ARPD). Comprehensive analysis of previously reported full genomic sequences plus those determined in this study representing isolates of CTLaV, CRMaV, Cherry green ring mottle virus, and Cherry necrotic rusty mottle virus revealed segregation of sequences into four clades corresponding to distinct virus species. High-throughput sequencing of RNA from representative source trees for CRMD, CTLD, and ARPD did not reveal additional unique virus sequences that might be associated with these diseases, thereby further substantiating the association of CRMaV and CTLaV with CRMD and CTLD or ARPD, respectively. Based on comparison of the nucleotide and amino acid sequence identity values, phylogenetic relationships with other triple-gene block-coding viruses within the family Betaflexiviridae, genome organization, and natural host range, a new genus (Robigovirus) is suggested. PMID:25496302

  7. Whole genome transcript profiling from fingerstick blood samples: a comparison and feasibility study

    Directory of Open Access Journals (Sweden)

    Williams Adam R

    2009-12-01

    Full Text Available Abstract Background Whole genome gene expression profiling has revolutionized research in the past decade especially with the advent of microarrays. Recently, there have been significant improvements in whole blood RNA isolation techniques which, through stabilization of RNA at the time of sample collection, avoid bias and artifacts introduced during sample handling. Despite these improvements, current human whole blood RNA stabilization/isolation kits are limited by the requirement of a venous blood sample of at least 2.5 mL. While fingerstick blood collection has been used for many different assays, there has yet to be a kit developed to isolate high quality RNA for use in gene expression studies from such small human samples. The clinical and field testing advantages of obtaining reliable and reproducible gene expression data from a fingerstick are many; it is less invasive, time saving, more mobile, and eliminates the need of a trained phlebotomist. Furthermore, this method could also be employed in small animal studies, i.e. mice, where larger sample collections often require sacrificing the animal. In this study, we offer a rapid and simple method to extract sufficient amounts of high quality total RNA from approximately 70 μl of whole blood collected via a fingerstick using a modified protocol of the commercially available Qiagen PAXgene RNA Blood Kit. Results From two sets of fingerstick collections, about 70 uL whole blood collected via finger lancet and capillary tube, we recovered an average of 252.6 ng total RNA with an average RIN of 9.3. The post-amplification yields for 50 ng of total RNA averaged at 7.0 ug cDNA. The cDNA hybridized to Affymetrix HG-U133 Plus 2.0 GeneChips had an average % Present call of 52.5%. Both fingerstick collections were highly correlated with r2 values ranging from 0.94 to 0.97. Similarly both fingerstick collections were highly correlated to the venous collection with r2 values ranging from 0.88 to 0

  8. Integrated genomic and functional analyses of histone demethylases identify oncogenic KDM2A isoform in breast cancer.

    Science.gov (United States)

    Liu, Hui; Liu, Lanxin; Holowatyj, Andreana; Jiang, Yuanyuan; Yang, Zeng-Quan

    2016-05-01

    Histone lysine demethylases (KDMs) comprise a large class of enzymes that catalyze site-specific demethylation of lysine residues on histones and other proteins. They play critical roles in controlling transcription, chromatin architecture, and cellular differentiation. However, the genomic landscape and clinical significance of KDMs in breast cancer remain poorly characterized. Here, we conducted a meta-analysis of 24 KDMs in breast cancer and identified associations among recurrent copy number alterations, gene expression, breast cancer subtypes, and clinical outcome. Two KDMs, KDM2A and KDM5B, had the highest frequency of genetic amplification and overexpression. Furthermore, among the 24 KDM genes, KDM2A had the highest correlation between copy number and mRNA expression, and high mRNA levels of KDM2A were significantly associated with shorter survival of breast cancer patients. KDM2A has two isoforms: the long isoform is comprised of a JmjC domain, CXXC-zinc finger, PHD zinc finger, F-box, and the AMN1 protein domain; whereas the short isoform of KDM2A lacks the N-terminal JmjC domain but contains all other motifs. Detailed characterization of KDM2A in breast cancer revealed that the short isoform of KDM2A is more abundant than the long isoform at DNA, mRNA, and protein levels in a subset of breast cancers. Furthermore, our data indicate that the short isoform of KDM2A has oncogenic potential and functions as an oncogenic isoform in a subset of breast cancers. Taken together, our findings suggest that amplification and overexpression of the KDM2A short isoform is critical in breast cancer progression. © 2015 Wiley Periodicals, Inc. PMID:26207617

  9. Calcium-dependent protein kinase (CDPK) and CDPK-related kinase (CRK) gene families in tomato: genome-wide identification and functional analyses in disease resistance.

    Science.gov (United States)

    Wang, Ji-Peng; Xu, You-Ping; Munyampundu, Jean-Pierre; Liu, Tian-Yu; Cai, Xin-Zhong

    2016-04-01

    Calcium-dependent protein kinases (CDPKs) and CDPK-related kinases (CRKs) play multiple roles in plant. Nevertheless, genome-wide identification of these two families is limited to several plant species, and role of CRKs in disease resistance remains unclear. In this study, we identified the CDPK and CRK gene families in genome of the economically important crop tomato (Solanum lycopersicum L.) and analyzed their function in resistance to various pathogens. Twenty-nine CDPK and six CRK genes were identified in tomato genome. Both SlCDPK and SlCRK proteins harbored an STKc_CAMK type protein kinase domain, while only SlCDPKs contained EF-hand type Ca(2+) binding domain(s). Phylogenetic analysis revealed that plant CRK family diverged early from CDPKs, and shared a common ancestor gene with subgroup IV CDPKs. Subgroup IV SlCDPK proteins were basic and their genes contained 11 introns, which were distinguished from other subgroups but similar to CRKs. Subgroup I SlCDPKs generally did not carry an N-terminal myristoylation motif while those of the remaining subgroups and SlCRKs universally did. SlCDPK and SlCRK genes were differently responsive to pathogenic stimuli. Furthermore, silencing analyses demonstrated that SlCDPK18 and SlCDPK10 positively regulated nonhost resistance to Xanthomonas oryzae pv. oryzae and host resistance to Pseudomonas syringae pv. tomato (Pst) DC3000, respectively, while SlCRK6 positively regulated resistance to both Pst DC3000 and Sclerotinia sclerotiorum in tomato. In conclusion, CRKs apparently evolved from CDPK lineage, SlCDPK and SlCRK genes regulate a wide range of resistance and SlCRK6 is the first CRK gene proved to function in plant disease resistance. PMID:26520101

  10. Comparison of 26 sphingomonad genomes reveals diverse environmental adaptations and biodegradative capabilities

    DEFF Research Database (Denmark)

    Aylward, Frank O.; McDonald, Bradon R.; Adams, Sandra M.;

    2013-01-01

    Sphingomonads comprise a physiologically versatile group within the Alphaproteobacteria that includes strains of interest for biotechnology, human health, and environmental nutrient cycling. In this study, we compared 26 sphingomonad genome sequences to gain insight into their ecology, metabolic ...

  11. Whole genome sequence comparison of ten diagnostic brucellaphages propagated on two Brucella abortus hosts

    OpenAIRE

    Tevdoradze, Ekaterine; Farlow, Jason; Kotorashvili, Adam; Skhirtladze, Natia; Antadze, Irina; Gunia, Sophio; Balarjishvili, Nana; Kvachadze, Leila; Kutateladze, Mzia

    2015-01-01

    Background Recently the genome sequences of two brucellaphages, isolated in Georgia (Tb) and Mexico (Pr) were reported revealing pronounced sequence homogeneity and the presence of two major indels discriminating the two phages. Subsequent genome sequencing of six diagnostic brucellaphages: Tbilisi (Tb), Firenze (Fz), Weybridge (Wb), S708, Berkeley (Bk) and R/C phages identified three major genetic groups. However, the propensity for fine-scale genetic variability of diverse brucellaphages gr...

  12. Comparison of Genome Wide Variation between Malawians and African Ancestry HapMap Populations

    OpenAIRE

    Joubert, Bonnie R.; North, Kari E.; Wang, Yunfei; Mwapasa, Victor; Franceschini, Nora; Meshnick, Steven R; Lange, Ethan M.

    2010-01-01

    Understanding genetic variation between populations is important because it affects the portability of human genome wide analytical methods. We compared genetic variation and substructure between Malawians and other African and non-African HapMap populations. Allele frequencies and adjacent linkage disequilibrium (LD) were measured for 617,715 single nucleotide polymorphisms (SNPs) across subject genomes. Allele frequencies in the Malawian population (N = 226) were highly correlated with alle...

  13. The auxin response factor transcription factor family in soybean: genome-wide identification and expression analyses during development and water stress.

    Science.gov (United States)

    Ha, Chien Van; Le, Dung Tien; Nishiyama, Rie; Watanabe, Yasuko; Sulieman, Saad; Tran, Uyen Thi; Mochida, Keiichi; Dong, Nguyen Van; Yamaguchi-Shinozaki, Kazuko; Shinozaki, Kazuo; Tran, Lam-Son Phan

    2013-10-01

    In plants, the auxin response factor (ARF) transcription factors play important roles in regulating diverse biological processes, including development, growth, cell division and responses to environmental stimuli. An exhaustive search of soybean genome revealed 51 GmARFs, many of which were formed by genome duplications. The typical GmARFs (43 members) contain a DNA-binding domain, an ARF domain and an auxin/indole acetic acid (AUX/IAA) dimerization domain, whereas the remaining eight members lack the dimerization domain. Phylogenetic analysis of the ARFs from soybean and Arabidopsis revealed both similarity and divergence between the two ARF families, as well as enabled us to predict the functions of the GmARFs. Using quantitative real-time polymerase chain reaction (qRT-PCR) and available soybean Affymetrix array and Illumina transcriptome sequence data, a comprehensive expression atlas of GmARF genes was obtained in various organs and tissues, providing useful information about their involvement in defining the precise nature of individual tissues. Furthermore, expression profiling using qRT-PCR and microarray data revealed many water stress-responsive GmARFs in soybean, albeit with different patterns depending on types of tissues and/or developmental stages. Our systematic analysis has identified excellent tissue-specific and/or stress-responsive candidate GmARF genes for in-depth in planta functional analyses, which would lead to potential applications in the development of genetically modified soybean cultivars with enhanced drought tolerance. PMID:23810914

  14. Integration of Genome-Wide Computation DRE Search, AhR ChIP-chip and Gene Expression Analyses of TCDD-Elicited Responses in the Mouse Liver

    Directory of Open Access Journals (Sweden)

    Matthews Jason

    2011-07-01

    Full Text Available Abstract Background The aryl hydrocarbon receptor (AhR is a ligand-activated transcription factor (TF that mediates responses to 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD. Integration of TCDD-induced genome-wide AhR enrichment, differential gene expression and computational dioxin response element (DRE analyses further elucidate the hepatic AhR regulatory network. Results Global ChIP-chip and gene expression analyses were performed on hepatic tissue from immature ovariectomized mice orally gavaged with 30 μg/kg TCDD. ChIP-chip analysis identified 14,446 and 974 AhR enriched regions (1% false discovery rate at 2 and 24 hrs, respectively. Enrichment density was greatest in the proximal promoter, and more specifically, within ± 1.5 kb of a transcriptional start site (TSS. AhR enrichment also occurred distal to a TSS (e.g. intergenic DNA and 3' UTR, extending the potential gene expression regulatory roles of the AhR. Although TF binding site analyses identified over-represented DRE sequences within enriched regions, approximately 50% of all AhR enriched regions lacked a DRE core (5'-GCGTG-3'. Microarray analysis identified 1,896 number of TCDD-responsive genes (|fold change| ≥ 1.5, P1(t > 0.999. Integrating this gene expression data with our ChIP-chip and DRE analyses only identified 625 differentially expressed genes that involved an AhR interaction at a DRE. Functional annotation analysis of differentially regulated genes associated with AhR enrichment identified overrepresented processes related to fatty acid and lipid metabolism and transport, and xenobiotic metabolism, which are consistent with TCDD-elicited steatosis in the mouse liver. Conclusions Details of the AhR regulatory network have been expanded to include AhR-DNA interactions within intragenic and intergenic genomic regions. Moreover, the AhR can interact with DNA independent of a DRE core suggesting there are alternative mechanisms of AhR-mediated gene regulation.

  15. Genome wide signatures of positive selection: The comparison of independent samples and the identification of regions associated to traits

    Directory of Open Access Journals (Sweden)

    Thomas Merle B

    2009-04-01

    Full Text Available Abstract Background The goal of genome wide analyses of polymorphisms is to achieve a better understanding of the link between genotype and phenotype. Part of that goal is to understand the selective forces that have operated on a population. Results In this study we compared the signals of selection, identified through population divergence in the Bovine HapMap project, to those found in an independent sample of cattle from Australia. Evidence for population differentiation across the genome, as measured by FST, was highly correlated in the two data sets. Nevertheless, 40% of the variance in FST between the two studies was attributed to the differences in breed composition. Seventy six percent of the variance in FST was attributed to differences in SNP composition and density when the same breeds were compared. The difference between FST of adjacent loci increased rapidly with the increase in distance between SNP, reaching an asymptote after 20 kb. Using 129 SNP that have highly divergent FST values in both data sets, we identified 12 regions that had additive effects on the traits residual feed intake, beef yield or intramuscular fatness measured in the Australian sample. Four of these regions had effects on more than one trait. One of these regions includes the R3HDM1 gene, which is under selection in European humans. Conclusion Firstly, many different populations will be necessary for a full description of selective signatures across the genome, not just a small set of highly divergent populations. Secondly, it is necessary to use the same SNP when comparing the signatures of selection from one study to another. Thirdly, useful signatures of selection can be obtained where many of the groups have only minor genetic differences and may not be clearly separated in a principal component analysis. Fourthly, combining analyses of genome wide selection signatures and genome wide associations to traits helps to define the trait under selection or

  16. Homo sapiens, Homo neanderthalensis and the Denisova specimen: new insights on their evolutionary histories using whole-genome comparisons

    Directory of Open Access Journals (Sweden)

    Vanessa Rodrigues Paixão-Côrtes

    2012-01-01

    Full Text Available After a brief review of the most recent findings in the study of human evolution, an extensive comparison of the complete genomes of our nearest relative, the chimpanzee (Pan troglodytes, of extant Homo sapiens, archaic Homo neanderthalensis and the Denisova specimen were made. The focus was on non-synonymous mutations, which consequently had an impact on protein levels and these changes were classified according to degree of effect. A total of 10,447 non-synonymous substitutions were found in which the derived allele is fixed or nearly fixed in humans as compared to chimpanzee. Their most frequent location was on chromosome 21. Their presence was then searched in the two archaic genomes. Mutations in 381 genes would imply radical amino acid changes, with a fraction of these related to olfaction and other important physiological processes. Eight new alleles were identified in the Neanderthal and/or Denisova genetic pools. Four others, possibly affecting cognition, occured both in the sapiens and two other archaic genomes. The selective sweep that gave rise to Homo sapiens could, therefore, have initiated before the modern/archaic human divergence.

  17. Homo sapiens, Homo neanderthalensis and the Denisova specimen: New insights on their evolutionary histories using whole-genome comparisons.

    Science.gov (United States)

    Paixão-Côrtes, Vanessa Rodrigues; Viscardi, Lucas Henrique; Salzano, Francisco Mauro; Hünemeier, Tábita; Bortolini, Maria Cátira

    2012-12-01

    After a brief review of the most recent findings in the study of human evolution, an extensive comparison of the complete genomes of our nearest relative, the chimpanzee (Pan troglodytes), of extant Homo sapiens, archaic Homo neanderthalensis and the Denisova specimen were made. The focus was on non-synonymous mutations, which consequently had an impact on protein levels and these changes were classified according to degree of effect. A total of 10,447 non-synonymous substitutions were found in which the derived allele is fixed or nearly fixed in humans as compared to chimpanzee. Their most frequent location was on chromosome 21. Their presence was then searched in the two archaic genomes. Mutations in 381 genes would imply radical amino acid changes, with a fraction of these related to olfaction and other important physiological processes. Eight new alleles were identified in the Neanderthal and/or Denisova genetic pools. Four others, possibly affecting cognition, occured both in the sapiens and two other archaic genomes. The selective sweep that gave rise to Homo sapiens could, therefore, have initiated before the modern/archaic human divergence. PMID:23413113

  18. Genome-wide comparison of paired fresh frozen and formalin-fixed paraffin-embedded gliomas by custom BAC and oligonucleotide array comparative genomic hybridization: facilitating analysis of archival gliomas

    OpenAIRE

    Mohapatra, Gayatry; Engler, David A.; Starbuck, Kristen D.; Kim, James C.; Bernay, Derek C.; Scangas, George A.; Rousseau, Audrey; Batchelor, Tracy T.; Betensky, Rebecca A.; Louis, David N.

    2010-01-01

    Molecular genetic analysis of cancer is rapidly evolving as a result of improvement in genomic technologies and the growing applicability of such analyses to clinical oncology. Array based comparative genomic hybridization (aCGH) is a powerful tool for detecting DNA copy number alterations (CNA), particularly in solid tumors, and has been applied to the study of malignant gliomas. In the clinical setting, however, gliomas are often sampled by small biopsies and thus formalin-fixed paraffin-em...

  19. Genomic characterization and comparison of seven Myoviridae bacteriophage infecting Bacillus thuringiensis.

    Science.gov (United States)

    Sauder, Amber Brooke; Quinn, McKenzie Rea; Brouillette, Alexis; Caruso, Steven; Cresawn, Steven; Erill, Ivan; Lewis, Lynn; Loesser-Casey, Kathryn; Pate, Morgan; Scott, Crystal; Stockwell, Stephanie; Temple, Louise

    2016-02-01

    Bacillus thuringiensis Kurstaki, a bacterium that is a source of biopesticides and a safe simulant for pathogenic Bacillus species, was used to isolate seven unique bacteriophages. The phage genomes were sequenced and ranged in size from 158,100 to 163,019bp encoding 290-299 genes, and the GC content of ~38% was similar to that of the host bacterium. All phages had terminal repeats 2-3kb long. Three of the phages encoded tRNAs and three contained a self-splicing intron in the DNA polymerase gene. They were categorized as a single cluster (>60% nucleotide conservation) containing three subclusters (>80% nucleotide conservation), supported by genomic synteny and phylogenetic analysis. Considering the published genomes of phages that infect the genus Bacillus and noting the ability of many of the Bacillus cereus group phages to infect multiple species, a clustering system based on gene content is proposed. PMID:26773385

  20. Complete genome sequence of Shigella flexneri 5b and comparison with Shigella flexneri 2a

    Directory of Open Access Journals (Sweden)

    Xue Ying

    2006-07-01

    Full Text Available Abstract Background Shigella bacteria cause dysentery, which remains a significant threat to public health. Shigella flexneri is the most common species in both developing and developed countries. Five Shigella genomes have been sequenced, revealing dynamic and diverse features. To investigate the intra-species diversity of S. flexneri genomes further, we have sequenced the complete genome of S. flexneri 5b strain 8401 (abbreviated Sf8401 and compared it with S. flexneri 2a (Sf301. Results The Sf8401 chromosome is 4.5-Mb in size, a little smaller than that of Sf301, mainly because the former lacks the SHI-1 pathogenicity island (PAI. Compared with Sf301, there are 6 inversions and one translocation in Sf8401, which are probably mediated by insertion sequences (IS. There are clear differences in the known PAIs between these two genomes. The bacteriophage SfV segment remaining in SHI-O of Sf8401 is clearly larger than the remnants of bacteriophage SfII in Sf301. SHI-1 is absent from Sf8401 but a specific related protein is found next to the pheV locus. SHI-2 is involved in one intra-replichore inversion near the origin of replication, which may change the expression of iut/iuc genes. Moreover, genes related to the glycine-betaine biosynthesis pathway are present only in Sf8401 among the known Shigella genomes. Conclusion Our data show that the two S. flexneri genomes are very similar, which suggests a high level of structural and functional conservation between the two serotypes. The differences reflect different selection pressures during evolution. The ancestor of S. flexneri probably acquired SHI-1 and SHI-2 before SHI-O was integrated and the serotypes diverged. SHI-1 was subsequently deleted from the S. flexneri 5b genome by recombination, but stabilized in the S. flexneri 2a genome. These events may have contributed to the differences in pathogenicity and epidemicity between the two serotypes of S. flexneri.

  1. Distribution of recombination hotspots in the human genome--a comparison of computer simulations with real data.

    Directory of Open Access Journals (Sweden)

    Dorota Mackiewicz

    Full Text Available Recombination is the main cause of genetic diversity. Thus, errors in this process can lead to chromosomal abnormalities. Recombination events are confined to narrow chromosome regions called hotspots in which characteristic DNA motifs are found. Genomic analyses have shown that both recombination hotspots and DNA motifs are distributed unevenly along human chromosomes and are much more frequent in the subtelomeric regions of chromosomes than in their central parts. Clusters of motifs roughly follow the distribution of recombination hotspots whereas single motifs show a negative correlation with the hotspot distribution. To model the phenomena related to recombination, we carried out computer Monte Carlo simulations of genome evolution. Computer simulations generated uneven distribution of hotspots with their domination in the subtelomeric regions of chromosomes. They also revealed that purifying selection eliminating defective alleles is strong enough to cause such hotspot distribution. After sufficiently long time of simulations, the structure of chromosomes reached a dynamic equilibrium, in which number and global distribution of both hotspots and defective alleles remained statistically unchanged, while their precise positions were shifted. This resembles the dynamic structure of human and chimpanzee genomes, where hotspots change their exact locations but the global distributions of recombination events are very similar.

  2. Bioinformatic genome comparisons for taxonomic and phylogenic assignments using Aeromonas as a test case

    Science.gov (United States)

    Prokaryotic taxonomy is the underpinning of microbiology, providing a framework for the proper identification and naming of organisms. The 'gold standard' of bacterial species delineation is the overall genome similarity as determined by DNA-DNA hybridization (DDH), a technically rigorous yet someti...

  3. Comparison of genome stability in two pig breeds by using the sister chromatid exchange (SCE test

    Directory of Open Access Journals (Sweden)

    V. Barbieri

    2010-01-01

    Full Text Available The sister chromatid exchange (SCE test has been used to detect genome stability in humans (Chaganti, 1974 and the main livestock species (Ciotola et al., 2004; Di Meo et al., 2000; Di Berardino et al., 1979, and to discover DNA damage caused by a variety of natural and artificial chemical compounds (Iannuzzi et al., 1990.

  4. Genomic Comparison of the Closely-Related Salmonella enterica Serovars Enteritidis, Dublin and Gallinarum.

    Science.gov (United States)

    Matthews, T David; Schmieder, Robert; Silva, Genivaldo G Z; Busch, Julia; Cassman, Noriko; Dutilh, Bas E; Green, Dawn; Matlock, Brian; Heffernan, Brian; Olsen, Gary J; Farris Hanna, Leigh; Schifferli, Dieter M; Maloy, Stanley; Dinsdale, Elizabeth A; Edwards, Robert A

    2015-01-01

    The Salmonella enterica serovars Enteritidis, Dublin, and Gallinarum are closely related but differ in virulence and host range. To identify the genetic elements responsible for these differences and to better understand how these serovars are evolving, we sequenced the genomes of Enteritidis strain LK5 and Dublin strain SARB12 and compared these genomes to the publicly available Enteritidis P125109, Dublin CT 02021853 and Dublin SD3246 genome sequences. We also compared the publicly available Gallinarum genome sequences from biotype Gallinarum 287/91 and Pullorum RKS5078. Using bioinformatic approaches, we identified single nucleotide polymorphisms, insertions, deletions, and differences in prophage and pseudogene content between strains belonging to the same serovar. Through our analysis we also identified several prophage cargo genes and pseudogenes that affect virulence and may contribute to a host-specific, systemic lifestyle. These results strongly argue that the Enteritidis, Dublin and Gallinarum serovars of Salmonella enterica evolve by acquiring new genes through horizontal gene transfer, followed by the formation of pseudogenes. The loss of genes necessary for a gastrointestinal lifestyle ultimately leads to a systemic lifestyle and niche exclusion in the host-specific serovars. PMID:26039056

  5. Genomic Comparison of the Closely-Related Salmonella enterica Serovars Enteritidis, Dublin and Gallinarum.

    Directory of Open Access Journals (Sweden)

    T David Matthews

    Full Text Available The Salmonella enterica serovars Enteritidis, Dublin, and Gallinarum are closely related but differ in virulence and host range. To identify the genetic elements responsible for these differences and to better understand how these serovars are evolving, we sequenced the genomes of Enteritidis strain LK5 and Dublin strain SARB12 and compared these genomes to the publicly available Enteritidis P125109, Dublin CT 02021853 and Dublin SD3246 genome sequences. We also compared the publicly available Gallinarum genome sequences from biotype Gallinarum 287/91 and Pullorum RKS5078. Using bioinformatic approaches, we identified single nucleotide polymorphisms, insertions, deletions, and differences in prophage and pseudogene content between strains belonging to the same serovar. Through our analysis we also identified several prophage cargo genes and pseudogenes that affect virulence and may contribute to a host-specific, systemic lifestyle. These results strongly argue that the Enteritidis, Dublin and Gallinarum serovars of Salmonella enterica evolve by acquiring new genes through horizontal gene transfer, followed by the formation of pseudogenes. The loss of genes necessary for a gastrointestinal lifestyle ultimately leads to a systemic lifestyle and niche exclusion in the host-specific serovars.

  6. Multi-Species Genome Comparison Sheds New Light on Evolutionary Processes, Cancer Mutations

    Science.gov (United States)

    Journal of College Science Teaching, 2005

    2005-01-01

    An international team that includes researchers from the National Human Genome Research Institute (NHGRI), part of the National Institutes of Health (NIH), has discovered that mammalian chromosomes have evolved by breaking at specific sites rather than randomly as long thought--and that many of the breakage hot spots are also involved in human…

  7. Comparison of two Next Generation sequencing platforms for full genome sequencing of Classical Swine Fever Virus

    DEFF Research Database (Denmark)

    Fahnøe, Ulrik; Pedersen, Anders Gorm; Höper, Dirk;

    2013-01-01

    Next Generation Sequencing (NGS) is becoming more adopted into viral research and will be the preferred technology in the years to come. We have recently sequenced several strains of Classical Swine Fever Virus (CSFV) by NGS on both Genome Sequencer FLX (GS FLX) and Iontorrent PGM platforms. In...

  8. Complete mitochondrial genome analysis of Clinostomum complanatum and its comparison with selected digeneans.

    Science.gov (United States)

    Chen, Lu; Feng, Yan; Chen, Hong-Mei; Wang, Li-Xia; Feng, Han-Li; Yang, Xin; Mughal, Mudassar-Niaz; Fang, Rui

    2016-08-01

    Clinostomum complanatum is an important trematode in fishes, birds of the family Ardeidae, and humans. Until now, limited knowledge is available regarding its molecular epidemiology, ecology, and phylogenetic study. Knowledge of the full mitochondrial genome of C. complanatum will provide important information for the study of epidemiology, biology, and genetic diversity of this fluke. In the present study, the complete mitochondrial genome of C. complanatum was sequenced and analyzed. The complete C. complanatum mitochondrial genome is 13,796 bp in length and contains 12 protein-coding genes, 22 tRNA genes, two rRNA genes, and one non-coding control region. All the 12 protein-coding genes are transcribed in the same direction and are AT-rich. Phylogenetic analysis based on concatenated amino acid sequences of the 12 protein-coding genes from C. complanatum and other selective digeneans showed that C. complanatum is in a separate branch, indicating C. complanatum has no closer relationship with any of the selected families. The complete mtDNA sequence of C. complanatum will increase our knowledge of mitochondrial genomics data and will also provide an important resource for studies of inter- and intra-species variation of flukes belonging to Clinostomidae. PMID:27146900

  9. Single-molecule approach to bacterial genomic comparisons via optical mapping.

    Energy Technology Data Exchange (ETDEWEB)

    Zhou, Shiguo [Univ. Wisc.-Madison; Kile, A. [Univ. Wisc.-Madison; Bechner, M. [Univ. Wisc.-Madison; Kvikstad, E. [Univ. Wisc.-Madison; Deng, W. [Univ. Wisc.-Madison; Wei, J. [Univ. Wisc.-Madison; Severin, J. [Univ. Wisc.-Madison; Runnheim, R. [Univ. Wisc.-Madison; Churas, C. [Univ. Wisc.-Madison; Forrest, D. [Univ. Wisc.-Madison; Dimalanta, E. [Univ. Wisc.-Madison; Lamers, C. [Univ. Wisc.-Madison; Burland, V. [Univ. Wisc.-Madison; Blattner, F. R. [Univ. Wisc.-Madison; Schwartz, David C. [Univ. Wisc.-Madison

    2004-01-01

    Modern comparative genomics has been established, in part, by the sequencing and annotation of a broad range of microbial species. To gain further insights, new sequencing efforts are now dealing with the variety of strains or isolates that gives a species definition and range; however, this number vastly outstrips our ability to sequence them. Given the availability of a large number of microbial species, new whole genome approaches must be developed to fully leverage this information at the level of strain diversity that maximize discovery. Here, we describe how optical mapping, a single-molecule system, was used to identify and annotate chromosomal alterations between bacterial strains represented by several species. Since whole-genome optical maps are ordered restriction maps, sequenced strains of Shigella flexneri serotype 2a (2457T and 301), Yersinia pestis (CO 92 and KIM), and Escherichia coli were aligned as maps to identify regions of homology and to further characterize them as possible insertions, deletions, inversions, or translocations. Importantly, an unsequenced Shigella flexneri strain (serotype Y strain AMC[328Y]) was optically mapped and aligned with two sequenced ones to reveal one novel locus implicated in serotype conversion and several other loci containing insertion sequence elements or phage-related gene insertions. Our results suggest that genomic rearrangements and chromosomal breakpoints are readily identified and annotated against a prototypic sequenced strain by using the tools of optical mapping.

  10. Gene network analyses of first service conception in Brangus heifers: use of genome and trait associations, hypothalamic-transcriptome information, and transcription factors.

    Science.gov (United States)

    Fortes, M R S; Snelling, W M; Reverter, A; Nagaraj, S H; Lehnert, S A; Hawken, R J; DeAtley, K L; Peters, S O; Silver, G A; Rincon, G; Medrano, J F; Islas-Trejo, A; Thomas, M G

    2012-09-01

    Measures of heifer fertility are economically relevant traits for beef production systems and knowledge of candidate genes could be incorporated into future genomic selection strategies. Ten traits related to growth and fertility were measured in 890 Brangus heifers (3/8 Brahman × 5/8 Angus, from 67 sires). These traits were: BW and hip height adjusted to 205 and 365 d of age, postweaning ADG, yearling assessment of carcass traits (i.e., back fat thickness, intramuscular fat, and LM area), as well as heifer pregnancy and first service conception (FSC). These fertility traits were collected from controlled breeding seasons initiated with estrous synchronization and AI targeting heifers to calve by 24 mo of age. The BovineSNP50 BeadChip was used to ascertain 53,692 SNP genotypes for ∼802 heifers. Associations of genotypes and phenotypes were performed and SNP effects were estimated for each trait. Minimally associated SNP (P Brangus heifer. The remaining hypothalamic-influenced network contained 978 genes connected by 2,560 edges or predicted gene interactions. This hypothalamic gene network was enriched with genes involved in axon guidance, which is a pathway known to influence pulsatile release of LHRH. There were 5 transcription factors with 21 or more connections: ZMAT3, STAT6, RFX4, PLAGL1, and NR6A1 for FSC. The SNP that identified these genes were intragenic and were on chromosomes 1, 5, 9, and 11. Chromosome 5 harbored both STAT6 and RFX4. The large number of interactions and genes observed with network analyses of multiple sources of genomic data (i.e., GWAS and RNA-Seq) support the concept of FSC being a polygenic trait. PMID:22739780

  11. Community Genomic and Proteomic Analyses of Chemoautotrophic Iron-Oxidizing "Leptospirillum rubarum" (Group II) and "Leptospirillum ferrodiazotrophum" (Group III) Bacteria in Acid Mine Drainage Biofilms

    Energy Technology Data Exchange (ETDEWEB)

    Goltsman, Daniela [University of California, Berkeley; Denef, Vincent [University of California, Berkeley; Singer, Steven [Lawrence Livermore National Laboratory (LLNL); Verberkmoes, Nathan C [ORNL; Lefsrud, Mark G [ORNL; Mueller, Ryan [University of California, Berkeley; Dick, Gregory J. [University of California, Berkeley; Sun, Christine [University of California, Berkeley; Wheeler, Korin [Lawrence Livermore National Laboratory (LLNL); Zelma, Adam [Lawrence Livermore National Laboratory (LLNL); Baker, Brett J. [University of California, Berkeley; Hauser, Loren John [ORNL; Land, Miriam L [ORNL; Shah, Manesh B [ORNL; Thelen, Michael P. [University of California, Berkeley; Hettich, Robert {Bob} L [ORNL; Banfield, Jillian F. [University of California, Berkeley

    2009-01-01

    We analyzed near-complete population (composite) genomic sequences for coexisting acidophilic iron-oxidizing Leptospirillum group II and III bacteria (phylum Nitrospirae) and an extrachromosomal plasmid from a Richmond Mine, Iron Mountain, CA, acid mine drainage biofilm. Community proteomic analysis of the genomically characterized sample and two other biofilms identified 64.6% and 44.9% of the predicted proteins of Leptospirillum groups II and III, respectively, and 20% of the predicted plasmid proteins. The bacteria share 92% 16S rRNA gene sequence identity and >60% of their genes, including integrated plasmid-like regions. The extrachromosomal plasmid carries conjugation genes with detectable sequence similarity to genes in the integrated conjugative plasmid, but only those on the extrachromosomal element were identified by proteomics. Both bacterial groups have genes for community-essential functions, including carbon fixation and biosynthesis of vitamins, fatty acids, and biopolymers (including cellulose); proteomic analyses reveal these activities. Both Leptospirillum types have multiple pathways for osmotic protection. Although both are motile, signal transduction and methyl-accepting chemotaxis proteins are more abundant in Leptospirillum group III, consistent with its distribution in gradients within biofilms. Interestingly, Leptospirillum group II uses a methyl-dependent and Leptospirillum group III a methyl-independent response pathway. Although only Leptospirillum group III can fix nitrogen, these proteins were not identified by proteomics. The abundances of core proteins are similar in all communities, but the abundance levels of unique and shared proteins of unknown function vary. Some proteins unique to one organism were highly expressed and may be key to the functional and ecological differentiation of Leptospirillum groups II and III.

  12. Phylogenomic Analyses and Comparative Studies on Genomes of the Bifidobacteriales: Identification of Molecular Signatures Specific for the Order Bifidobacteriales and Its Different Subclades

    Science.gov (United States)

    Zhang, Grace; Gao, Beile; Adeolu, Mobolaji; Khadka, Bijendra; Gupta, Radhey S.

    2016-01-01

    The order Bifidobacteriales comprises a diverse variety of species found in the gastrointestinal tract of humans and other animals, some of which are opportunistic pathogens, whereas a number of others exhibit health-promoting effects. However, currently very few biochemical or molecular characteristics are known which are specific for the order Bifidobacteriales, or specific clades within this order, which distinguish them from other bacteria. This study reports the results of detailed comparative genomic and phylogenetic studies on 62 genome-sequenced species/strains from the order Bifidobacteriales. In a robust phylogenetic tree for the Bifidobacteriales constructed based on 614 core proteins, a number of well-resolved clades were observed including a clade separating the Scarodvia-related genera (Scardovia clade) from the genera Bifidobacterium and Gardnerella, as well as a number of previously reported clusters of Bifidobacterium spp. In parallel, our comparative analyses of protein sequences from the Bifidobacteriales genomes have identified numerous molecular markers that are specific for this group of bacteria. Of these markers, 32 conserved signature indels (CSIs) in widely distributed proteins and 10 signature proteins are distinctive characteristics of all sequenced Bifidobacteriales species and provide novel and highly specific means for distinguishing these bacteria. In addition, multiple other molecular signatures are specific for the following clades of Bifidobacteriales: (i) 5 CSIs specific for a clade comprising of the Scardovia-related genera; (ii) 3 CSIs and 2 CSPs specific for a clade consisting of the Bifidobacterium and Gardnerella spp.; (iii) multiple other signatures demarcating a number of clusters of the B. asteroides-and B. longum- related species. The described molecular markers provide novel and reliable means for distinguishing the Bifidobacteriales and a number of their clades in molecular terms and for the classification of these

  13. Genetic Basis for Spontaneous Hybrid Genome Doubling during Allopolyploid Speciation of Common Wheat Shown by Natural Variation Analyses of the Paternal Species

    OpenAIRE

    Yoshihiro Matsuoka; Shuhei Nasuda; Yasuyo Ashida; Miyuki Nitta; Hisashi Tsujimoto; Shigeo Takumi; Taihachi Kawahara

    2013-01-01

    The complex process of allopolyploid speciation includes various mechanisms ranging from species crosses and hybrid genome doubling to genome alterations and the establishment of new allopolyploids as persisting natural entities. Currently, little is known about the genetic mechanisms that underlie hybrid genome doubling, despite the fact that natural allopolyploid formation is highly dependent on this phenomenon. We examined the genetic basis for the spontaneous genome doubling of triploid F...

  14. Investigation of a genome wide association signal for obesity: synthetic association and haplotype analyses at the melanocortin 4 receptor gene locus.

    Directory of Open Access Journals (Sweden)

    André Scherag

    Full Text Available BACKGROUND: Independent genome-wide association studies (GWAS showed an obesogenic effect of two single nucleotide polymorphisms (SNP; rs12970134 and rs17782313 more than 150 kb downstream of the melanocortin 4 receptor gene (MC4R. It is unclear if the SNPs directly influence MC4R function or expression, or if the SNPs are on a haplotype that predisposes to obesity or includes functionally relevant genetic variation (synthetic association. As both exist, functionally relevant mutations and polymorphisms in the MC4R coding region and a robust association downstream of the gene, MC4R is an ideal model to explore synthetic association. METHODOLOGY/PRINCIPAL FINDINGS: We analyzed a genomic region (364.9 kb encompassing the MC4R in GWAS data of 424 obesity trios (extremely obese child/adolescent and both parents. SNP rs12970134 showed the lowest p-value (p = 0.004; relative risk for the obesity effect allele: 1.37; conditional analyses on this SNP revealed that 7 of 78 analyzed SNPs provided independent signals (p≤0.05. These 8 SNPs were used to derive two-marker haplotypes. The three best (according to p-value haplotype combinations were chosen for confirmation in 363 independent obesity trios. The confirmed obesity effect haplotype includes SNPs 3' and 5' of the MC4R. Including MC4R coding variants in a joint model had almost no impact on the effect size estimators expected under synthetic association. CONCLUSIONS/SIGNIFICANCE: A haplotype reaching from a region 5' of the MC4R to a region at least 150 kb from the 3' end of the gene showed a stronger association to obesity than single SNPs. Synthetic association analyses revealed that MC4R coding variants had almost no impact on the association signal. Carriers of the haplotype should be enriched for relevant mutations outside the MC4R coding region and could thus be used for re-sequencing approaches. Our data also underscore the problems underlying the identification of relevant mutations

  15. Comparison between genomic predictions using daughter yield deviation and conventional estimated breeding value as response variables

    DEFF Research Database (Denmark)

    Guo, Gang; Lund, Mogens Sandø; Zhang, Y;

    2010-01-01

    This study compared genomic predictions using conventional estimated breeding values (EBV) and daughter yield deviations (DYD) as response variables based on simulated data. Eight scenarios were simulated in regard to heritability (0.05 and 0.30), number of daughters per sire (30, 100, and unequal...... scenarios, the EBV approach was as effective as or slightly better than the DYD approach at predicting breeding value, dependent on simulated scenarios and statistical models. Applying a Bayesian common prior model (the same prior distribution of marker effect variance) and a linear mixed model (GBLUP), the...... index combining GEBV and PA in most scenarios. These results indicate that EBV can be used as an alternative response variable for genomic prediction....

  16. [Comparison of specific genomic DNA fragment between Microtus fortis calamorum and Microtus fortis fortis].

    Science.gov (United States)

    Xu, Bing; Hu, Wei-Xin; Yang, Rong; Yu, Yuan-Jing; Wang, Yong; Liu, Xin-Fa; Peng, Xing-Hua

    2003-06-01

    Microtus fortis(Taxonomy ID: 100897), also named as reed vole, is classified as Microtus, Micotinae, Cricetidae, Rodentia, Mammalia on taxonomy. Microtus fortis mainly distributes in China. Some areas of Russia, North Korea and Mongolia close to Northeast borderland of China also have a small number of Microtus fortis in distribution. Microtus fortis in China has principally 4 subspecies, and most of them live is the drainage area of Yangtse River. Schistosoma japonicum (one of commonly parasites in China) can infect about 40 kinds of mammalian animals, including the human being, but could not infect Microtus foris. It is known as the only animal in Dongting Lake region of China which has the ability of natural resistance to Schistosoma japonicum. The Microtus fortis domesticated in laboratory has the same biological characteristics as the wild one and these characteristics could be inherited to its progeny steadily. We got a specific DNA fragment from genomic library of Microtus fortis. This DNA fragment in genomic DNA of human beings, Kunming mice, Balb/c mice and C57BL/6J mice could not be detected by dot blot hybridization and PCR, apart from genomic DNA of Microtus fortis. In this report, the differences of genomic DNA in 34 Microtus fortis were compared between Microtus fortis calamorum(Dongting Lake region of southern China) and Microtus fortis fortis (Ningxia province of northern China). The residing localion of these two subspecies is far away about 1,200 kilometers from each other. The genomic DNA of Microtus fortis calamorum and Microtus fortis fortis were extracted and amplified by PCR according to the specific genomic DNAs sequence of Microtus fortis reported previously (Accession number in GenBank: AF277394). The amplified DNA fragments were inserted into pGEM-T easy vector and sequenced. The DNA fragment sequencing results from the two subspecies were compared to detect whether there was any difference. 19 alleles were found from Microtus fortis (20

  17. Whole Genome Sequencing and Comparisons of Different Chinese Rabies Virus Lineages Including the First Complete Genome of an Arctic-like Strain in China

    Institute of Scientific and Technical Information of China (English)

    LI Hao; GUO ZhenYang; ZHANG Jian; TAO XiaoYan; ZHU WuYang; TANG Qing; LIU HongTu

    2016-01-01

    ObjectiveTo learn the rabies genome molecular characteristics and compare the difference of China rabies lineages. MethodsThe complete genomes of 12 strains from different China rabies lineages were amplified and sequenced, and all the China street strain genomes (total 43), Arctic and Arctic-like genomes were aligned using ClustalX2, the genome homologies were analyzed using MegAlign software, and the phylogenetic trees were constructed by MEGA 5. ResultsFirst Arctic-like rabies genome in China (CQH1202D) was reported, and we supplemented the rabies genome data of China, ensuring at least one genome was available in each China lineage. The genome size of China V (11908nt) is obviously shorter than other lineages’ (11923-11925nt) for the difference of N-P non-coding regions. Among different lineages, the genome homologies are almost under 90%.CQH1202D (China IV lineage) has close relationship with strains from South Korea and they share about 95% genome similarities. ConclusionThe molecular characteristics of 6 different China rabies lineages werecompared and analyzed from genome level, which benefits for continued comprehensive rabies surveillance, rabies prevention and control in China.

  18. A Practical Comparison of De Novo Genome Assembly Software Tools for Next-Generation Sequencing Technologies

    OpenAIRE

    Zhang, Wenyu; Chen, Jiajia; Yang, Yang; Tang, Yifei; Shang, Jing; Shen, Bairong

    2011-01-01

    The advent of next-generation sequencing technologies is accompanied with the development of many whole-genome sequence assembly methods and software, especially for de novo fragment assembly. Due to the poor knowledge about the applicability and performance of these software tools, choosing a befitting assembler becomes a tough task. Here, we provide the information of adaptivity for each program, then above all, compare the performance of eight distinct tools against eight groups of simulat...

  19. Single nucleotide polymorphisms in the bovine Histophilus somni genome; a comparison of new and old isolates

    OpenAIRE

    Madampage, Claudia Avis; Rawlyk, Neil; Crockford, Gordon; Van Donkersgoed, Joyce; Dorin, Craig; Potter, Andrew

    2015-01-01

    Histophilus somni, a causative agent of the bovine respiratory disease complex, can also cause a variety of systemic disorders, including bronchopneumonia, myocarditis, pericarditis, arthritis, pleuritis, and infectious thrombotic meningoencephalitis. The purpose of this study was to determine if currently circulating strains differ from those of the 1980s by identifying genomic changes. Single nucleotide polymorphisms (SNPs) and insertion and deletion (INDEL) sites were examined by whole-gen...

  20. Comparative genomic and functional analysis of 100 Lactobacillus rhamnosus strains and their comparison with strain GG.

    Directory of Open Access Journals (Sweden)

    François P Douillard

    Full Text Available Lactobacillus rhamnosus is a lactic acid bacterium that is found in a large variety of ecological habitats, including artisanal and industrial dairy products, the oral cavity, intestinal tract or vagina. To gain insights into the genetic complexity and ecological versatility of the species L. rhamnosus, we examined the genomes and phenotypes of 100 L. rhamnosus strains isolated from diverse sources. The genomes of 100 L. rhamnosus strains were mapped onto the L. rhamnosus GG reference genome. These strains were phenotypically characterized for a wide range of metabolic, antagonistic, signalling and functional properties. Phylogenomic analysis showed multiple groupings of the species that could partly be associated with their ecological niches. We identified 17 highly variable regions that encode functions related to lifestyle, i.e. carbohydrate transport and metabolism, production of mucus-binding pili, bile salt resistance, prophages and CRISPR adaptive immunity. Integration of the phenotypic and genomic data revealed that some L. rhamnosus strains possibly resided in multiple niches, illustrating the dynamics of bacterial habitats. The present study showed two distinctive geno-phenotypes in the L. rhamnosus species. The geno-phenotype A suggests an adaptation to stable nutrient-rich niches, i.e. milk-derivative products, reflected by the alteration or loss of biological functions associated with antimicrobial activity spectrum, stress resistance, adaptability and fitness to a distinctive range of habitats. In contrast, the geno-phenotype B displays adequate traits to a variable environment, such as the intestinal tract, in terms of nutrient resources, bacterial population density and host effects.

  1. Sodium Ion Cycle in Bacterial Pathogens: Evidence from Cross-Genome Comparisons

    OpenAIRE

    Häse, Claudia C.; Fedorova, Natalie D.; Galperin, Michael Y.; Dibrov, Pavel A.

    2001-01-01

    Analysis of the bacterial genome sequences shows that many human and animal pathogens encode primary membrane Na+ pumps, Na+-transporting dicarboxylate decarboxylases or Na+-translocating NADH:ubiquinone oxidoreductase, and a number of Na+-dependent permeases. This indicates that these bacteria can utilize Na+ as a coupling ion instead of or in addition to the H+ cycle. This capability to use a Na+ cycle might be an important virulence factor for such pathogens as Vibrio cholerae, Neisseria m...

  2. Genomic Comparison of the Closely Related Salmonella enterica Serovars Enteritidis and Dublin

    OpenAIRE

    Betancor, Laura; Yim, Lucía; Martínez, Arací; Fookes, Maria; Sasias, Sebastian; Schelotto, Felipe; Thomson, Nicholas; Maskell, Duncan; Chabalgoity, José A.

    2012-01-01

    The Enteritidis and Dublin serovars of Salmonella enterica are closely related, yet they differ significantly in pathogenicity and epidemiology. S. Enteritidis is a broad host range serovar that commonly causes gastroenteritis and infrequently causes invasive disease in humans. S. Dublin mainly colonizes cattle but upon infecting humans often results in invasive disease.To gain a broader view of the extent of these differences we conducted microarray-based comparative genomics between several...

  3. Genomic prediction of disease occurrence using producer-recorded health data: a comparison of methods

    OpenAIRE

    Parker Gaddis, Kristen L; Tiezzi, Francesco; John B Cole; John S Clay; Maltecca, Christian

    2015-01-01

    International audience AbstractBackgroundGenetic selection has been successful in achieving increased production in dairy cattle; however, corresponding declines in fitness traits have been documented. Selection for fitness traits is more difficult, since they have low heritabilities and are influenced by various non-genetic factors. The objective of this paper was to investigate the predictive ability of two-stage and single-step genomic selection methods applied to health data collected ...

  4. Genome sequencing and comparison of two nonhuman primate animal models, the cynomolgus and Chinese rhesus macaques

    DEFF Research Database (Denmark)

    Yan, Guangmei; Zhang, Guojie; Fang, Xiaodong;

    2011-01-01

    The nonhuman primates most commonly used in medical research are from the genus Macaca. To better understand the genetic differences between these animal models, we present high-quality draft genome sequences from two macaque species, the cynomolgus/crab-eating macaque and the Chinese rhesus...... sequence similarity with human disease gene orthologs and drug targets. However, we identify several putatively dysfunctional genetic differences between the three macaque species, which may explain functional differences between them previously observed in clinical studies....

  5. Whole genome transcript profiling from fingerstick blood samples: a comparison and feasibility study

    OpenAIRE

    Williams Adam R; Mondala Tony S; Robison Elizabeth H; Head Steven R; Salomon Daniel R; Kurian Sunil M

    2009-01-01

    Abstract Background Whole genome gene expression profiling has revolutionized research in the past decade especially with the advent of microarrays. Recently, there have been significant improvements in whole blood RNA isolation techniques which, through stabilization of RNA at the time of sample collection, avoid bias and artifacts introduced during sample handling. Despite these improvements, current human whole blood RNA stabilization/isolation kits are limited by the requirement of a veno...

  6. Comparison of Whole-Genome Sequencing and Molecular-Epidemiological Techniques for Clostridium difficile Strain Typing.

    Science.gov (United States)

    Dominguez, Samuel R; Anderson, Lydia J; Kotter, Cassandra V; Littlehorn, Cynthia A; Arms, Lesley E; Dowell, Elaine; Todd, James K; Frank, Daniel N

    2016-09-01

    We analyzed in parallel 27 pediatric Clostridium difficile isolates by repetitive sequence-based polymerase chain reaction (RepPCR), pulsed-field gel electrophoresis (PFGE), and whole-genome next-generation sequencing. Next-generation sequencing distinguished 3 groups of isolates that were indistinguishable by RepPCR and 1 isolate that clustered in the same PFGE group as other isolates. PMID:26407257

  7. Genomic Diversity in Pig (Sus scrofa) and its Comparison with Human and other Livestock

    OpenAIRE

    Zhang, Chunyan; Plastow, Graham

    2011-01-01

    We have reviewed the current pig (Sus scrofa) genomic diversity within and between sites and compared them with human and other livestock. The current Porcine 60K single nucleotide polymorphism (SNP) panel has an average SNP distance in a range of 30 - 40 kb. Most of genetic variation was distributed within populations, and only a small proportion of them existed between populations. The average heterozygosity was lower in pig than in human and other livestock. Genetic inbreeding coefficient ...

  8. Single nucleotide polymorphisms in the bovine Histophilus somni genome; a comparison of new and old isolates.

    Science.gov (United States)

    Madampage, Claudia Avis; Rawlyk, Neil; Crockford, Gordon; Van Donkersgoed, Joyce; Dorin, Craig; Potter, Andrew

    2015-07-01

    Histophilus somni, a causative agent of the bovine respiratory disease complex, can also cause a variety of systemic disorders, including bronchopneumonia, myocarditis, pericarditis, arthritis, pleuritis, and infectious thrombotic meningoencephalitis. The purpose of this study was to determine if currently circulating strains differ from those of the 1980s by identifying genomic changes. Single nucleotide polymorphisms (SNPs) and insertion and deletion (INDEL) sites were examined by whole-genome sequencing in 12 samples, 6 old and 6 new. The 31 028 SNP/INDELs recorded were compared against the reference genome sequence of the pathogenic H. somni strain 2336. The distribution of about 75% of these SNPs within a specified gene differed between old and new isolates and did not follow any particular pattern. The other 25% clustered into 2 groups containing the same SNPs in various genes: group I included 5 old isolates and 1 new isolate; group II included 5 new isolates and 1 old isolate. For putative virulence genes there were more SNPs in group I compared with strain 2336, itself an older isolate, than in group II. Although only 25% of all the SNPs formed 2 clusters, the results suggest some genetic difference in various genes between old and new strains. PMID:26130851

  9. A comparison of complete mitochondrial genomes of silver carp hypophthalmichthys molitrix and bighead carp hypophthalmichthys nobilis: Implications for their taxonomic relationship and phylogeny

    Science.gov (United States)

    Li, S.-F.; Xu, J.-W.; Yang, Q.-L.; Wang, C.H.; Chen, Q.; Chapman, D.C.; Lu, G.

    2009-01-01

    Based upon morphological characters, Silver carp Hypophthalmichthys molitrix and bighead carp Hypophthalmichthys nobilis (or Aristichthys nobilis) have been classified into either the same genus or two distinct genera. Consequently, the taxonomic relationship of the two species at the generic level remains equivocal. This issue is addressed by sequencing complete mitochondrial genomes of H. molitrix and H. nobilis, comparing their mitogenome organization, structure and sequence similarity, and conducting a comprehensive phylogenetic analysis of cyprinid species. As with other cyprinid fishes, the mitogenomes of the two species were structurally conserved, containing 37 genes including 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA (tRNAs) genes and a putative control region (D-loop). Sequence similarity between the two mitogenomes varied in different genes or regions, being highest in the tRNA genes (98??8%), lowest in the control region (89??4%) and intermediate in the protein-coding genes (94??2%). Analyses of the sequence comparison and phylogeny using concatenated protein sequences support the view that the two species belong to the genus Hypophthalmichthys. Further studies using nuclear markers and involving more closely related species, and the systematic combination of traditional biology and molecular biology are needed in order to confirm this conclusion. ?? 2009 The Fisheries Society of the British Isles.

  10. Comparative genomic and transcriptional analyses of the carbohydrate-active enzymes and secretomes of phytopathogenic fungi reveal their significant roles during infection and development.

    Science.gov (United States)

    Lyu, Xueliang; Shen, Cuicui; Fu, Yanping; Xie, Jiatao; Jiang, Daohong; Li, Guoqing; Cheng, Jiasen

    2015-01-01

    Our comparative genomic analysis showed that the numbers of plant cell wall (PCW)- and fungal cell wall (FCW)-degradation-associated carbohydrate-active enzymes (CAZymes) in necrotrophic and hemibiotrophic fungi are significantly larger than that in most biotrophic fungi. However, our transcriptional analyses of CAZyme-encoding genes in Melampsora larici-populina, Puccinia graminis and Sclerotinia sclerotiorum showed that many genes encoding PCW- and FCW-degradation-associated CAZymes were significantly up-regulated during the infection of both necrotrophic fungi and biotrophic fungi, indicating an existence of a universal mechanism underlying PCW degradation and FCW reorganization or modification, which are both intimately involved in necrotrophic and biotrophic fungal infection. Furthermore, our results showed that the FCW reorganization or modification was also related to the fungal development. Additionally, our transcriptional analysis of the secretome of S. sclerotiorum showed that many secreted protein-encoding genes were dramatically induced during infection. Among them, a small, cysteine-rich protein SsCVNH was experimentally confirmed to be essential for the virulence and sclerotial development, indicating that the small secreted proteins might also play crucial roles as potential effectors in host-non-specific necrotrophic fungi. PMID:26531059

  11. Examination of Epigenetic and other Molecular Factors Associated with mda-9/Syntenin Dysregulation in Cancer Through Integrated Analyses of Public Genomic Datasets

    Science.gov (United States)

    Bacolod, Manny D.; Das, Swadesh K.; Sokhi, Upneet K.; Bradley, Steven; Fenstermacher, David A.; Pellecchia, Maurizio; Emdad, Luni; Sarkar, Devanand; Fisher, Paul B.

    2016-01-01

    mda-9/Syntenin (melanoma differentiation-associated gene 9) is a PDZ domain-containing, cancer invasion-related protein. In this study, we employed multiple integrated bioinformatic approaches to identify the probable epigenetic factors, molecular pathways, and functionalities associated with mda-9 dysregulation during cancer progression. Analyses of publicly available genomic data (e.g., expression, copy number, methylation) from TCGA, GEO, ENCODE, and Human Protein Atlas projects led to the following observations: a) mda-9 expression correlates with both copy number and methylation level of an intronic CpG site (cg17197774) located downstream of the CpG island, b) cg17197774 methylation is a likely prognostic marker in glioma, c) Among 22 cancer types, melanoma exhibits the highest mda-9 level, and lowest level of methylation at cg17197774, d) cg17197774 hypomehtylation is also associated with histone modifications (at the mda-9 locus) indicative of more active transcription, e) Using Gene Set Enrichment Analysis (GSEA), and the VIGOR (Virtual Gene Over-expression or Repression ) analytical scheme, we were able to predict mda-9’s association with extracellular matrix organization (e.g., MMPs, collagen, integrins), IGFBP2 and NF-κB signaling pathways, phospholipid metabolism, cytokines (e.g., interleukins), CTLA-4, and components of complement cascade pathways. Indeed, previous publications have shown that many of the aforementioned genes and pathways are associated with mda-9’s functionality. PMID:26093898

  12. Genomic-scale comparison of sequence- and structure-based methods of function prediction: Does structure provide additional insight?

    Science.gov (United States)

    Fetrow, Jacquelyn S.; Siew, Naomi; Di Gennaro, Jeannine A.; Martinez-Yamout, Maria; Dyson, H. Jane; Skolnick, Jeffrey

    2001-01-01

    A function annotation method using the sequence-to-structure-to-function paradigm is applied to the identification of all disulfide oxidoreductases in the Saccharomyces cerevisiae genome. The method identifies 27 sequences as potential disulfide oxidoreductases. All previously known thioredoxins, glutaredoxins, and disulfide isomerases are correctly identified. Three of the 27 predictions are probable false-positives. Three novel predictions, which subsequently have been experimentally validated, are presented. Two additional novel predictions suggest a disulfide oxidoreductase regulatory mechanism for two subunits (OST3 and OST6) of the yeast oligosaccharyltransferase complex. Based on homology, this prediction can be extended to a potential tumor suppressor gene, N33, in humans, whose biochemical function was not previously known. Attempts to obtain a folded, active N33 construct to test the prediction were unsuccessful. The results show that structure prediction coupled with biochemically relevant structural motifs is a powerful method for the function annotation of genome sequences and can provide more detailed, robust predictions than function prediction methods that rely on sequence comparison alone. PMID:11316881

  13. A comparison of tools for the simulation of genomic next-generation sequencing data.

    Science.gov (United States)

    Escalona, Merly; Rocha, Sara; Posada, David

    2016-08-01

    Computer simulation of genomic data has become increasingly popular for assessing and validating biological models or for gaining an understanding of specific data sets. Several computational tools for the simulation of next-generation sequencing (NGS) data have been developed in recent years, which could be used to compare existing and new NGS analytical pipelines. Here we review 23 of these tools, highlighting their distinct functionality, requirements and potential applications. We also provide a decision tree for the informed selection of an appropriate NGS simulation tool for the specific question at hand. PMID:27320129

  14. ProGenViZ: a novel interactive tool for prokaryotic genome visualization and comparison

    OpenAIRE

    Gonçalves, Bruno Filipe Ribeiro

    2014-01-01

    Tese de mestrado, Bioinformática e Biologia Computacional, Universidade de Lisboa, Faculdade de Ciências, 2014 Everyday new sequencing data and draft microbial genomes are obtained by high-throughput sequencing (HTS) and made publicly available at NCBI Sequence Read Archive (www.ncbi.nlm.nih.gov/sra) and EBI European Nucleotide Archive (http://www.ebi.ac.uk/ena). It is now perceived that the limiting factor is not obtaining the sequence data but the current capacity of the existing analysi...

  15. Analyses and interpretation of whole-genome gene expression from formalin-fixed paraffin-embedded tissue: an illustration with breast cancer tissues

    Directory of Open Access Journals (Sweden)

    Argos Maria

    2010-11-01

    Full Text Available Abstract Background We evaluated (a the feasibility of whole genome cDNA-mediated Annealing, Selection, extension and Ligation (DASL assay on formalin-fixed paraffin-embedded (FFPE tissue and (b whether similar conclusions can be drawn by examining FFPE samples as proxies for fresh frozen (FF tissues. We used a whole genome DASL assay (addressing 18,391 genes on a total of 72 samples from paired breast tumor and surrounding healthy tissues from both FF and FFPE samples. Results Gene detection was very good with comparable success between the FFPE and FF samples. Reproducibility was also high (r2 = 0.98; however, concordance between the two types of samples was low. Only one-third of the differentially expressed genes in tumor tissues (compared to corresponding normal from FF samples could be detected in FFPE samples and conversely only one-fourth of the differentially expressed genes from FFPE samples could be detected in FF samples. GO-enrichment analysis, gene set enrichment analysis (GSEA and GO-ANOVA analyses also suggested small overlap between the lead functional groups that were differentially expressed in tumor detectable by examining FFPE and FF samples. In other words, FFPE samples may not be ideal for picking individual target gene(s, but may be used to identify some of the lead functional group(s of genes that are differentially expressed in tumor. The differentially expressed genes in breast cancer found in our study were biologically meaningful. The "cell cycle" & "cell division" related genes were up-regulated and genes related to "regulation of epithelial cell proliferation" were down-regulated. Conclusions Gene expression experiments using the DASL assay can efficiently handle fragmentation issues in the FFPE tissues. However, formalin fixation seems to change RNA and consequently significantly alters gene expression in a number of genes which may not be uniform between tumor and normal tissues. Therefore, considerable caution

  16. Comparison of microbial DNA enrichment tools for metagenomic whole genome sequencing.

    Science.gov (United States)

    Thoendel, Matthew; Jeraldo, Patricio R; Greenwood-Quaintance, Kerryl E; Yao, Janet Z; Chia, Nicholas; Hanssen, Arlen D; Abdel, Matthew P; Patel, Robin

    2016-08-01

    Metagenomic whole genome sequencing for detection of pathogens in clinical samples is an exciting new area for discovery and clinical testing. A major barrier to this approach is the overwhelming ratio of human to pathogen DNA in samples with low pathogen abundance, which is typical of most clinical specimens. Microbial DNA enrichment methods offer the potential to relieve this limitation by improving this ratio. Two commercially available enrichment kits, the NEBNext Microbiome DNA Enrichment Kit and the Molzym MolYsis Basic kit, were tested for their ability to enrich for microbial DNA from resected arthroplasty component sonicate fluids from prosthetic joint infections or uninfected sonicate fluids spiked with Staphylococcus aureus. Using spiked uninfected sonicate fluid there was a 6-fold enrichment of bacterial DNA with the NEBNext kit and 76-fold enrichment with the MolYsis kit. Metagenomic whole genome sequencing of sonicate fluid revealed 13- to 85-fold enrichment of bacterial DNA using the NEBNext enrichment kit. The MolYsis approach achieved 481- to 9580-fold enrichment, resulting in 7 to 59% of sequencing reads being from the pathogens known to be present in the samples. These results demonstrate the usefulness of these tools when testing clinical samples with low microbial burden using next generation sequencing. PMID:27237775

  17. A Comparison of Variant Calling Pipelines Using Genome in a Bottle as a Reference

    Directory of Open Access Journals (Sweden)

    Adam Cornish

    2015-01-01

    Full Text Available High-throughput sequencing, especially of exomes, is a popular diagnostic tool, but it is difficult to determine which tools are the best at analyzing this data. In this study, we use the NIST Genome in a Bottle results as a novel resource for validation of our exome analysis pipeline. We use six different aligners and five different variant callers to determine which pipeline, of the 30 total, performs the best on a human exome that was used to help generate the list of variants detected by the Genome in a Bottle Consortium. Of these 30 pipelines, we found that Novoalign in conjunction with GATK UnifiedGenotyper exhibited the highest sensitivity while maintaining a low number of false positives for SNVs. However, it is apparent that indels are still difficult for any pipeline to handle with none of the tools achieving an average sensitivity higher than 33% or a Positive Predictive Value (PPV higher than 53%. Lastly, as expected, it was found that aligners can play as vital a role in variant detection as variant callers themselves.

  18. Comparison of genome-wide variation between Malawians and African ancestry HapMap populations.

    Science.gov (United States)

    Joubert, Bonnie R; North, Kari E; Wang, Yunfei; Mwapasa, Victor; Franceschini, Nora; Meshnick, Steven R; Lange, Ethan M

    2010-06-01

    Understanding genetic variation between populations is important because it affects the portability of human genome-wide analytical methods. We compared genetic variation and substructure between Malawians and other African and non-African HapMap populations. Allele frequencies and adjacent linkage disequilibrium (LD) were measured for 617 715 single nucleotide polymorphisms (SNPs) across subject genomes. Allele frequencies in the Malawian population (N=226) were highly correlated with allele frequencies in HapMap populations of African ancestry (AFA, N=376), namely Yoruban in Ibadan, Nigeria (Spearman's r(2)=0.97), Luhya in Webuye, Kenya (r(2)=0.97), African Americans in the southwest United States (r(2)=0.94) and Maasai in Kinyawa, Kenya (r(2)=0.91). This correlation was much lower between Malawians and other ancestry populations (r(2)0.82, other ancestries r(2)Maasai in Kenyawa, Kenya (rs3769013, rs730005, rs3769012, rs2304370; P-values <1 x 10(-33)). PMID:20485449

  19. Genome-Wide SNP Calling from Genotyping by Sequencing (GBS) Data: A Comparison of Seven Pipelines and Two Sequencing Technologies

    Science.gov (United States)

    Torkamaneh, Davoud; Laroche, Jérôme; Belzile, François

    2016-01-01

    Next-generation sequencing (NGS) has revolutionized plant and animal research in many ways including new methods of high throughput genotyping. Genotyping-by-sequencing (GBS) has been demonstrated to be a robust and cost-effective genotyping method capable of producing thousands to millions of SNPs across a wide range of species. Undoubtedly, the greatest barrier to its broader use is the challenge of data analysis. Herein we describe a comprehensive comparison of seven GBS bioinformatics pipelines developed to process raw GBS sequence data into SNP genotypes. We compared five pipelines requiring a reference genome (TASSEL-GBS v1& v2, Stacks, IGST, and Fast-GBS) and two de novo pipelines that do not require a reference genome (UNEAK and Stacks). Using Illumina sequence data from a set of 24 re-sequenced soybean lines, we performed SNP calling with these pipelines and compared the GBS SNP calls with the re-sequencing data to assess their accuracy. The number of SNPs called without a reference genome was lower (13k to 24k) than with a reference genome (25k to 54k SNPs) while accuracy was high (92.3 to 98.7%) for all but one pipeline (TASSEL-GBSv1, 76.1%). Among pipelines offering a high accuracy (>95%), Fast-GBS called the greatest number of polymorphisms (close to 35,000 SNPs + Indels) and yielded the highest accuracy (98.7%). Using Ion Torrent sequence data for the same 24 lines, we compared the performance of Fast-GBS with that of TASSEL-GBSv2. It again called more polymorphisms (25.8K vs 22.9K) and these proved more accurate (95.2 vs 91.1%). Typically, SNP catalogues called from the same sequencing data using different pipelines resulted in highly overlapping SNP catalogues (79–92% overlap). In contrast, overlap between SNP catalogues obtained using the same pipeline but different sequencing technologies was less extensive (~50–70%). PMID:27547936

  20. Genome-Wide SNP Calling from Genotyping by Sequencing (GBS) Data: A Comparison of Seven Pipelines and Two Sequencing Technologies.

    Science.gov (United States)

    Torkamaneh, Davoud; Laroche, Jérôme; Belzile, François

    2016-01-01

    Next-generation sequencing (NGS) has revolutionized plant and animal research in many ways including new methods of high throughput genotyping. Genotyping-by-sequencing (GBS) has been demonstrated to be a robust and cost-effective genotyping method capable of producing thousands to millions of SNPs across a wide range of species. Undoubtedly, the greatest barrier to its broader use is the challenge of data analysis. Herein we describe a comprehensive comparison of seven GBS bioinformatics pipelines developed to process raw GBS sequence data into SNP genotypes. We compared five pipelines requiring a reference genome (TASSEL-GBS v1& v2, Stacks, IGST, and Fast-GBS) and two de novo pipelines that do not require a reference genome (UNEAK and Stacks). Using Illumina sequence data from a set of 24 re-sequenced soybean lines, we performed SNP calling with these pipelines and compared the GBS SNP calls with the re-sequencing data to assess their accuracy. The number of SNPs called without a reference genome was lower (13k to 24k) than with a reference genome (25k to 54k SNPs) while accuracy was high (92.3 to 98.7%) for all but one pipeline (TASSEL-GBSv1, 76.1%). Among pipelines offering a high accuracy (>95%), Fast-GBS called the greatest number of polymorphisms (close to 35,000 SNPs + Indels) and yielded the highest accuracy (98.7%). Using Ion Torrent sequence data for the same 24 lines, we compared the performance of Fast-GBS with that of TASSEL-GBSv2. It again called more polymorphisms (25.8K vs 22.9K) and these proved more accurate (95.2 vs 91.1%). Typically, SNP catalogues called from the same sequencing data using different pipelines resulted in highly overlapping SNP catalogues (79-92% overlap). In contrast, overlap between SNP catalogues obtained using the same pipeline but different sequencing technologies was less extensive (~50-70%). PMID:27547936

  1. Comparison of the complete genome sequences of Pseudomonassyringae pv. syringae B728a and pv. tomato DC3000.

    Energy Technology Data Exchange (ETDEWEB)

    Feil, Helene; Feil, William S.; Chain, Patrick; Larimer, Frank; DiBartolo, Genevieve; Copeland, Alex; Lykidis, Athanasios; Trong,Stephen; Nolan, Matt; Goltsman, Eugene; Thiel, James; Malfatti,Stephanie; Loper, Joyce E.; Lapidus, Alla; Detter, John C.; Land, Miriam; Richardson, Paul M.; Kyrpides, Nikos C.; Ivanova, Natalia; Lindow, StevenE.

    2005-04-01

    The complete genomic sequence of Pseudomonas syringaepathovar syringae B728a (Pss B728a), has been determined and is comparedwith that of Pseudomonas syringae pv. tomato DC3000 (Pst DC3000). Thesetwo pathovars of this economically important species of plant pathogenicbacteria differ in host range and apparent patterns of interaction withplants, with Pss having a more pronounced epiphytic stage of growth andhigher abiotic stress tolerance and Pst DC3000 having a more pronouncedapoplastic growth habitat. The Pss B728a genome (6.1 megabases) containsa circular chromosome and no plasmid, whereas the Pst DC3000 genome is6.5 mbp in size, composed of a circular chromosome and two plasmids.While a high degree of similarity exists between the two sequencedPseudomonads, 976 protein-encoding genes are unique to Pss B728a whencompared to Pst DC3000, including large genomic islands likely tocontribute to virulence and host specificity. Over 375 repetitiveextragenic palindromic sequences (REPs) unique to Pss B728a when comparedto Pst DC3000 are widely distributed throughout the chromosome except in14 genomic islands, which generally had lower GC content than the genomeas a whole. Content of the genomic islands vary, with one containing aprophage and another the plasmid pKLC102 of P. aeruginosa PAO1. Among the976 genes of Pss B728a with no counterpart in Pst DC3000 are thoseencoding for syringopeptin (SP), syringomycin (SR), indole acetic acidbiosynthesis, arginine degradation, and production of ice nuclei. Thegenomic comparison suggests that several unique genes for Pss B728a suchas ectoine synthase, DNA repair, and antibiotic production may contributeto epiphytic fitness and stress tolerance of this organism. Pseudomonassyringae, a member of the gamma subgroup of the Proteobacteria, is awidespread bacterial pathogen of many plant species. The species P.syringae is subdivided into approximately 50 pathovars based onpathogenicity and host range. P. syringae is capable of

  2. Genetic, comparative genomic, and expression analyses of the Mc1r locus in the polychromatic Midas cichlid fish (Teleostei, Cichlidae Amphilophus sp.) species group.

    Science.gov (United States)

    Henning, Frederico; Renz, Adina Josepha; Fukamachi, Shoji; Meyer, Axel

    2010-05-01

    Natural populations of the Midas cichlid species in several different crater lakes in Nicaragua exhibit a conspicuous color polymorphism. Most individuals are dark and the remaining have a gold coloration. The color morphs mate assortatively and sympatric population differentiation has been shown based on neutral molecular data. We investigated the color polymorphism using segregation analysis and a candidate gene approach. The segregation patterns observed in a mapping cross between a gold and a dark individual were consistent with a single dominant gene as a cause of the gold phenotype. This suggests that a simple genetic architecture underlies some of the speciation events in the Midas cichlids. We compared the expression levels of several candidate color genes Mc1r, Ednrb1, Slc45a2, and Tfap1a between the color morphs. Mc1r was found to be up regulated in the gold morph. Given its widespread association in color evolution and role on melanin synthesis, the Mc1r locus was further investigated using sequences derived from a genomic library. Comparative analysis revealed conserved synteny in relation to the majority of teleosts and highlighted several previously unidentified conserved non-coding elements (CNEs) in the upstream and downstream regions in the vicinity of Mc1r. The identification of the CNEs regions allowed the comparison of sequences from gold and dark specimens of natural populations. No polymorphisms were found between in the population sample and Mc1r showed no linkage to the gold phenotype in the mapping cross, demonstrating that it is not causally related to the color polymorphism in the Midas cichlid. PMID:20449580

  3. Genome-wide DNA methylation analyses in the brain reveal four differentially methylated regions between humans and non-human primates

    Directory of Open Access Journals (Sweden)

    Wang Jinkai

    2012-08-01

    Full Text Available Abstract Background The highly improved cognitive function is the most significant change in human evolutionary history. Recently, several large-scale studies reported the evolutionary roles of DNA methylation; however, the role of DNA methylation on brain evolution is largely unknown. Results To test if DNA methylation has contributed to the evolution of human brain, with the use of MeDIP-Chip and SEQUENOM MassARRAY, we conducted a genome-wide analysis to identify differentially methylated regions (DMRs in the brain between humans and rhesus macaques. We first identified a total of 150 candidate DMRs by the MeDIP-Chip method, among which 4 DMRs were confirmed by the MassARRAY analysis. All 4 DMRs are within or close to the CpG islands, and a MIR3 repeat element was identified in one DMR, but no repeat sequence was observed in the other 3 DMRs. For the 4 DMR genes, their proteins tend to be conserved and two genes have neural related functions. Bisulfite sequencing and phylogenetic comparison among human, chimpanzee, rhesus macaque and rat suggested several regions of lineage specific DNA methylation, including a human specific hypomethylated region in the promoter of K6IRS2 gene. Conclusions Our study provides a new angle of studying human brain evolution and understanding the evolutionary role of DNA methylation in the central nervous system. The results suggest that the patterns of DNA methylation in the brain are in general similar between humans and non-human primates, and only a few DMRs were identified.

  4. Genome Sequence of Candidatus Riesia pediculischaeffi, Endosymbiont of Chimpanzee Lice, and Genomic Comparison of Recently Acquired Endosymbionts from Human and Chimpanzee Lice

    OpenAIRE

    Boyd, Bret M.; Allen, Julie M; de Crécy-Lagard, Valérie; Reed, David L

    2014-01-01

    The obligate-heritable endosymbionts of insects possess some of the smallest known bacterial genomes. This is likely due to loss of genomic material during symbiosis. The mode and rate of this erosion may change over evolutionary time: faster in newly formed associations and slower in long-established ones. The endosymbionts of human and anthropoid primate lice present a unique opportunity to study genome erosion in newly established (or young) symbionts. This is because we have a detailed ph...

  5. Comparison of Clustering Methods for Time Course Genomic Data: Applications to Aging Effects

    OpenAIRE

    Zhang, Y.; Horvath, S.; Ophoff, R; Telesca, D

    2014-01-01

    Time course microarray data provide insight about dynamic biological processes. While several clustering methods have been proposed for the analysis of these data structures, comparison and selection of appropriate clustering methods are seldom discussed. We compared $3$ probabilistic based clustering methods and $3$ distance based clustering methods for time course microarray data. Among probabilistic methods, we considered: smoothing spline clustering also known as model b...

  6. Comparison of surrogate reporter systems for enrichment of cells with mutations induced by genome editors.

    Science.gov (United States)

    He, Zuyong; Shi, Xuan; Liu, Meirui; Sun, Guangjie; Proudfoot, Chris; Whitelaw, C Bruce A; Lillico, Simon G; Chen, Yaosheng

    2016-03-10

    Genome editors are powerful tools that allow modification of the nuclear DNA in eukaryotic cells both in vitro and in vivo. In vitro modified cells are often phenotypically indistinguishable from unmodified cells, hampering their isolation for analysis. Episomal reporters encoding fluorescent proteins can be used for enrichment of modified cells by flow cytometry. Here we compare two surrogate reporters, RGS and SSA, for the enrichment of porcine embryonic fibroblasts containing mutations induced by ZFNs or CRISPR/Cas9. Both systems were effective for enrichment of edited porcine cells with the RGS reporter proving more effective than the SSA reporter. We noted a higher-fold enrichment when editing events were induced by Cas9 compared to those induced by ZFNs, allowing selection at frequencies as high as 70%. PMID:26778541

  7. Rapid high resolution genotyping of Francisella tularensis by whole genome sequence comparison of annotated genes ("MLST+".

    Directory of Open Access Journals (Sweden)

    Markus H Antwerpen

    Full Text Available The zoonotic disease tularemia is caused by the bacterium Francisella tularensis. This pathogen is considered as a category A select agent with potential to be misused in bioterrorism. Molecular typing based on DNA-sequence like canSNP-typing or MLVA has become the accepted standard for this organism. Due to the organism's highly clonal nature, the current typing methods have reached their limit of discrimination for classifying closely related subpopulations within the subspecies F. tularensis ssp. holarctica. We introduce a new gene-by-gene approach, MLST+, based on whole genome data of 15 sequenced F. tularensis ssp. holarctica strains and apply this approach to investigate an epidemic of lethal tularemia among non-human primates in two animal facilities in Germany. Due to the high resolution of MLST+ we are able to demonstrate that three independent clones of this highly infectious pathogen were responsible for these spatially and temporally restricted outbreaks.

  8. Development of a Rhizoctonia solani AG1-IB Specific Gene Model Enables Comparative Genome Analyses between Phytopathogenic R. solani AG1-IA, AG1-IB, AG3 and AG8 Isolates.

    Science.gov (United States)

    Wibberg, Daniel; Rupp, Oliver; Blom, Jochen; Jelonek, Lukas; Kröber, Magdalena; Verwaaijen, Bart; Goesmann, Alexander; Albaum, Stefan; Grosch, Rita; Pühler, Alfred; Schlüter, Andreas

    2015-01-01

    Rhizoctonia solani, a soil-born plant pathogenic basidiomycetous fungus, affects various economically important agricultural and horticultural crops. The draft genome sequence for the R. solani AG1-IB isolate 7/3/14 as well as a corresponding transcriptome dataset (Expressed Sequence Tags--ESTs) were established previously. Development of a specific R. solani AG1-IB gene model based on GMAP transcript mapping within the eukaryotic gene prediction platform AUGUSTUS allowed detection of new genes and provided insights into the gene structure of this fungus. In total, 12,616 genes were recognized in the genome of the AG1-IB isolate. Analysis of predicted genes by means of different bioinformatics tools revealed new genes whose products potentially are involved in degradation of plant cell wall components, melanin formation and synthesis of secondary metabolites. Comparative genome analyses between members of different R. solani anastomosis groups, namely AG1-IA, AG3 and AG8 and the newly annotated R. solani AG1-IB genome were performed within the comparative genomics platform EDGAR. It appeared that only 21 to 28% of all genes encoded in the draft genomes of the different strains were identified as core genes. Based on Average Nucleotide Identity (ANI) and Average Amino-acid Identity (AAI) analyses, considerable sequence differences between isolates representing different anastomosis groups were identified. However, R. solani isolates form a distinct cluster in relation to other fungi of the phylum Basidiomycota. The isolate representing AG1-IB encodes significant more genes featuring predictable functions in secondary metabolite production compared to other completely sequenced R. solani strains. The newly established R. solani AG1-IB 7/3/14 gene layout now provides a reliable basis for post-genomics studies. PMID:26690577

  9. Development of a Rhizoctonia solani AG1-IB Specific Gene Model Enables Comparative Genome Analyses between Phytopathogenic R. solani AG1-IA, AG1-IB, AG3 and AG8 Isolates.

    Directory of Open Access Journals (Sweden)

    Daniel Wibberg

    Full Text Available Rhizoctonia solani, a soil-born plant pathogenic basidiomycetous fungus, affects various economically important agricultural and horticultural crops. The draft genome sequence for the R. solani AG1-IB isolate 7/3/14 as well as a corresponding transcriptome dataset (Expressed Sequence Tags--ESTs were established previously. Development of a specific R. solani AG1-IB gene model based on GMAP transcript mapping within the eukaryotic gene prediction platform AUGUSTUS allowed detection of new genes and provided insights into the gene structure of this fungus. In total, 12,616 genes were recognized in the genome of the AG1-IB isolate. Analysis of predicted genes by means of different bioinformatics tools revealed new genes whose products potentially are involved in degradation of plant cell wall components, melanin formation and synthesis of secondary metabolites. Comparative genome analyses between members of different R. solani anastomosis groups, namely AG1-IA, AG3 and AG8 and the newly annotated R. solani AG1-IB genome were performed within the comparative genomics platform EDGAR. It appeared that only 21 to 28% of all genes encoded in the draft genomes of the different strains were identified as core genes. Based on Average Nucleotide Identity (ANI and Average Amino-acid Identity (AAI analyses, considerable sequence differences between isolates representing different anastomosis groups were identified. However, R. solani isolates form a distinct cluster in relation to other fungi of the phylum Basidiomycota. The isolate representing AG1-IB encodes significant more genes featuring predictable functions in secondary metabolite production compared to other completely sequenced R. solani strains. The newly established R. solani AG1-IB 7/3/14 gene layout now provides a reliable basis for post-genomics studies.

  10. Genome-wide transcriptomic and proteomic analyses of bollworm-infested developing cotton bolls revealed the genes and pathways involved in the insect pest defence mechanism.

    Science.gov (United States)

    Kumar, Saravanan; Kanakachari, Mogilicherla; Gurusamy, Dhandapani; Kumar, Krishan; Narayanasamy, Prabhakaran; Kethireddy Venkata, Padmalatha; Solanke, Amolkumar; Gamanagatti, Savita; Hiremath, Vamadevaiah; Katageri, Ishwarappa S; Leelavathi, Sadhu; Kumar, Polumetla Ananda; Reddy, Vanga Siva

    2016-06-01

    Cotton bollworm, Helicoverpa armigera, is a major insect pest that feeds on cotton bolls causing extensive damage leading to crop and productivity loss. In spite of such a major impact, cotton plant response to bollworm infection is yet to be witnessed. In this context, we have studied the genome-wide response of cotton bolls infested with bollworm using transcriptomic and proteomic approaches. Further, we have validated this data using semi-quantitative real-time PCR. Comparative analyses have revealed that 39% of the transcriptome and 35% of the proteome were differentially regulated during bollworm infestation. Around 36% of significantly regulated transcripts and 45% of differentially expressed proteins were found to be involved in signalling followed by redox regulation. Further analysis showed that defence-related stress hormones and their lipid precursors, transcription factors, signalling molecules, etc. were stimulated, whereas the growth-related counterparts were suppressed during bollworm infestation. Around 26% of the significantly up-regulated proteins were defence molecules, while >50% of the significantly down-regulated were related to photosynthesis and growth. Interestingly, the biosynthesis genes for synergistically regulated jasmonate, ethylene and suppressors of the antagonistic factor salicylate were found to be up-regulated, suggesting a choice among stress-responsive phytohormone regulation. Manual curation of the enzymes and TFs highlighted the components of retrograde signalling pathways. Our data suggest that a selective regulatory mechanism directs the reallocation of metabolic resources favouring defence over growth under bollworm infestation and these insights could be exploited to develop bollworm-resistant cotton varieties. PMID:26799171

  11. Comparison of BD Max Cdiff and GenomEra C. difficile molecular assays for detection of toxigenic Clostridium difficile from stools in conventional sample containers and in FecalSwabs.

    Science.gov (United States)

    Hirvonen, J J; Kaukoranta, S-S

    2015-05-01

    In this study, the usability and performance of GenomEra™ C. difficile and BD Max™ Cdiff nucleic acid amplification tests (NAATs) for the detection of toxigenic Clostridium difficile were investigated in comparison with toxigenic culture and C. difficile toxin A- and toxin B-detecting immunochromatographic antigen (IA) test, the Tox A/B QuikChek®. In total, 302 faecal specimens were collected, 113 of which were in parallel to conventional sample containers and FecalSwab liquid-based microbiology (LBM) tubes. Seventy-nine specimens were considered true-positives for toxigenic C. difficile. The sensitivity and specificity were 97.5 % and 99.6 % and 93.7 % and 98.7 % for the GenomEra and BD Max assays respectively. Toxigenic culture and Tox A/B QuikChek had sensitivity and specificity of 91.1 % and 100 % and 34.2 % and 100 % respectively. Hands-on time for analysing 1 to 24 specimens using NAATs was 1 to 15 min. The rate of PCR inhibition was 0 % for both NAATs with faeces in LBM tubes, while with faeces in conventional sample containers the respective inhibition rates were 5.3 % and 4.4 % for the GenomEra and the BD Max assays. The NAATs demonstrated an excellent analytical performance, reducing significantly the overall workload of laboratory personnel compared with culture and IA test. PMID:25616552

  12. Software comparison for evaluating genomic copy number variation for Affymetrix 6.0 SNP array platform

    Science.gov (United States)

    2011-01-01

    Background Copy number data are routinely being extracted from genome-wide association study chips using a variety of software. We empirically evaluated and compared four freely-available software packages designed for Affymetrix SNP chips to estimate copy number: Affymetrix Power Tools (APT), Aroma.Affymetrix, PennCNV and CRLMM. Our evaluation used 1,418 GENOA samples that were genotyped on the Affymetrix Genome-Wide Human SNP Array 6.0. We compared bias and variance in the locus-level copy number data, the concordance amongst regions of copy number gains/deletions and the false-positive rate amongst deleted segments. Results APT had median locus-level copy numbers closest to a value of two, whereas PennCNV and Aroma.Affymetrix had the smallest variability associated with the median copy number. Of those evaluated, only PennCNV provides copy number specific quality-control metrics and identified 136 poor CNV samples. Regions of copy number variation (CNV) were detected using the hidden Markov models provided within PennCNV and CRLMM/VanillaIce. PennCNV detected more CNVs than CRLMM/VanillaIce; the median number of CNVs detected per sample was 39 and 30, respectively. PennCNV detected most of the regions that CRLMM/VanillaIce did as well as additional CNV regions. The median concordance between PennCNV and CRLMM/VanillaIce was 47.9% for duplications and 51.5% for deletions. The estimated false-positive rate associated with deletions was similar for PennCNV and CRLMM/VanillaIce. Conclusions If the objective is to perform statistical tests on the locus-level copy number data, our empirical results suggest that PennCNV or Aroma.Affymetrix is optimal. If the objective is to perform statistical tests on the summarized segmented data then PennCNV would be preferred over CRLMM/VanillaIce. Specifically, PennCNV allows the analyst to estimate locus-level copy number, perform segmentation and evaluate CNV-specific quality-control metrics within a single software package

  13. The Complete Mitochondrial Genome of the Pink Stem Borer, Sesamia inferens, in Comparison with Four Other Noctuid Moths

    Science.gov (United States)

    Chai, Huan-Na; Du, Yu-Zhou

    2012-01-01

    The complete 15,413-bp mitochondrial genome (mitogenome) of Sesamia inferens (Walker) (Lepidoptera: Noctuidae) was sequenced and compared with those of four other noctuid moths. All of the mitogenomes analyzed displayed similar characteristics with respect to gene content, genome organization, nucleotide comparison, and codon usages. Twelve-one protein-coding genes (PCGs) utilized the standard ATN, but the cox1 gene used CGA as the initiation codon; cox1, cox2, and nad4 genes had the truncated termination codon T in the S. inferens mitogenome. All of the tRNA genes had typical cloverleaf secondary structures except for trnS1(AGN), in which the dihydrouridine (DHU) arm did not form a stable stem-loop structure. Both the secondary structures of rrnL and rrnS genes inferred from the S. inferens mitogenome closely resembled those of other noctuid moths. In the A+T-rich region, the conserved motif “ATAGA” followed by a long T-stretch was observed in all noctuid moths, but other specific tandem-repeat elements were more variable. Additionally, the S. inferens mitogenome contained a potential stem-loop structure, a duplicated 17-bp repeat element, a decuplicated segment, and a microsatellite “(AT)7”, without a poly-A element upstream of the trnM in the A+T-rich region. Finally, the phylogenetic relationships were reconstructed based on amino acid sequences of mitochondrial 13 PCGs, which support the traditional morphologically based view of relationships within the Noctuidae. PMID:22949858

  14. Comparison of Copy Number of HSF Genes in Two Buffalo Genomes.

    Science.gov (United States)

    Lal, Shardul Vikram; Mukherjee, Ayan; Brahma, Biswajit; Gohain, Moloya; Patra, Mahesh Chandra; Saini, Sushil Kumar; Mishra, Purushottam; Ahlawat, Sonika; Upadhyaya, Ramesh C; Datta, Tirtha K; De, Sachinandan

    2016-07-01

    The copy number variation (CNV) is the number of copies of a particular gene in the genotype of an individual. Recent evidences show that the CNVs can vary in frequency and occurrence between breeds. These variations reportedly allowed different breeds to adapt to different environments. As copy number variations follow Mendelian pattern of inheritance, identification and distribution of these variants between populations can be used to infer the evolutionary history of the species. In this study, we have examined the absolute copy number of four Heat shock factor genes viz. HSF-1, 2, 4, and 5 in two different breeds of buffalo species using real-time PCR. Here, we report that the absolute copy number of HSF2 varies between the two breeds. In contrast no significant difference was observed in the copy number for HSF-1, 4, and 5 between the two breeds. Our results provide evidence for the presence of breed specific differences in HSF2 genomic copy number. This seems to be the first step in delineating the genetic factors underlying environmental adaptation between the two breeds. Nevertheless, a more detailed study is needed to characterize the functional consequence of this variation. PMID:26953680

  15. Cancer Biomarkers from Genome-Scale DNA Methylation: Comparison of Evolutionary and Semantic Analysis Methods

    Directory of Open Access Journals (Sweden)

    Ioannis Valavanis

    2015-11-01

    Full Text Available DNA methylation profiling exploits microarray technologies, thus yielding a wealth of high-volume data. Here, an intelligent framework is applied, encompassing epidemiological genome-scale DNA methylation data produced from the Illumina’s Infinium Human Methylation 450K Bead Chip platform, in an effort to correlate interesting methylation patterns with cancer predisposition and, in particular, breast cancer and B-cell lymphoma. Feature selection and classification are employed in order to select, from an initial set of ~480,000 methylation measurements at CpG sites, predictive cancer epigenetic biomarkers and assess their classification power for discriminating healthy versus cancer related classes. Feature selection exploits evolutionary algorithms or a graph-theoretic methodology which makes use of the semantics information included in the Gene Ontology (GO tree. The selected features, corresponding to methylation of CpG sites, attained moderate-to-high classification accuracies when imported to a series of classifiers evaluated by resampling or blindfold validation. The semantics-driven selection revealed sets of CpG sites performing similarly with evolutionary selection in the classification tasks. However, gene enrichment and pathway analysis showed that it additionally provides more descriptive sets of GO terms and KEGG pathways regarding the cancer phenotypes studied here. Results support the expediency of this methodology regarding its application in epidemiological studies.

  16. Analysing latent constructs via a derived metric paired comparison approach: An application to students’ emotions in mathematics

    Directory of Open Access Journals (Sweden)

    Alexandra Grand

    2014-09-01

    Full Text Available In this article we suggest an approach for the analysis of sets of items using the method of paired comparisons. We applied the proposed approach to a students’ survey of emotions typically experienced while learning mathematics by focusing on the relative dominance of these emotions. The emotions of interest were: enjoyment, pride, anger, anxiety, boredom and shame which were each measured by a set of items and for which we want to obtain an ordering on a continuum. In a first step we evaluated the quality of items by using a method of non-parametric Rasch model tests. The item sets of the emotions enjoyment, anxiety and boredom met the properties of a Rasch model. As a result of fitting Rasch models, we obtained person “emotion” parameter estimates. We then derived for each individual metric paired comparison responses from the obtained person parameter estimates and directly modelled these derived relative responses by fitting a beta regression model. This model is similar to generalized linear models (GLMs. The proposed model accounts for bounded metric paired comparison data in (0,1 where subject covariates and object-specific covariates can also be incorporated. We found that there is a tendency, the higher the positive discrepancy between the self concept of maths ability and the averaged perceived maths ability of students the more enjoyment and the less anxiety is typically experienced while learning mathematics.

  17. On the analysis of the virulence nature of TIGR4 and R6 strains of Streptococcus pneumoniae using genome comparison tools

    Indian Academy of Sciences (India)

    R Jothi; K Manikandakumar; K Ganesan; S Parthasarathy

    2007-09-01

    Comparative genome sequence analysis is a powerful technique for gaining insights into any genome of interest. Streptococcus pneumoniae is a human pathogen, which causes life-threatening diseases, such as pneumoniae, bacteremia, meningitis, etc. After the whole genome of two strains of S. pneumoniae, the virulent TIGR4 and non-pathogenic R6 were sequenced; there is a hope that comparing the genomes will allow an identification of the genes responsible for its virulence and thus the development of treatment and control. Many antimicrobial drugs have diminished the risk from pneumococcal disease because of its multi-drug resistance nature. Several pneumococcal proteins are also being investigated, as virulence factors as potential vaccine or drug targets. Structural and biochemical studies of these pneumococcal virulence factors have facilitated the development of novel antibiotics or protein antigen-based vaccines for the treatment of pneumococcal disease. Here we describe the comparison between the genomes of two strains of S. pneumoniae with few existing genomics databases and tools available in the public domain websites. By comparing nucleotide and protein sequences of the two strains, we investigate the existing differences and similarities. Mainly we focus on the virulence factors and its encoding genes in TIGR4 and how do they differ from R6 strain.

  18. Unleashing the genome of Brassica rapa

    Directory of Open Access Journals (Sweden)

    Haibao eTang

    2012-07-01

    Full Text Available The completion and release of the Brassica rapa genome is of great benefit to researchers of the Brassicas, Arabidopsis, and genome evolution. While its lineage is closely related to the model organism Arabidopsis thaliana, the Brassicas experienced a whole genome triplication subsequent to their divergence. This event contemporaneously created three copies of its ancestral genome, which had diploidized through the process of homeologous gene loss known as fractionation. By the fractionation of homeologous gene content and genetic regulatory binding sites, Brassica’s genome is well placed to use comparative genomic techniques to identify syntenic regions, homeologous gene duplications, and putative regulatory sequences. Here, we use the comparative genomics platform CoGe to perform several different genomic analyses with which to study structural changes of its genome and dynamics of various genetic elements. Starting with whole genome comparisons, the Brassica paleohexaploidy is characterized, syntenic regions with Arabidopsis thaliana are identified, and the TOC1 gene in the circadian rhythm pathway from Arabidopsis thaliana is used to find duplicated orthologs in Brassica rapa. These TOC1 genes are further analyzed to identify conserved noncoding sequences that contain cis-acting regulatory elements and promoter sequences previously implicated in circadian rhythmicity. Each 'cookbook style' analysis includes a step-by-step walkthrough with links to CoGe to quickly reproduce each step of the analytical process.

  19. FALSIRE Phase II. CSNI project for Fracture Analyses of Large-Scale International Reference Experiments (Phase II). Comparison report

    International Nuclear Information System (INIS)

    A summary of Phase II of the Project for Fracture Analysis of Large-Scale International Reference Experiments (FALSIRE) is presented. A FALSIRE II Workshop focused on analyses of reference fracture experiments. More than 30 participants representing 22 organizations from 12 countries took part in the workshop. Final results for 45 analyses of the reference experiments were received from the participating analysts. For each experiment, analysis results provided estimates of variables that include temperature, crack-mouth-opening displacement, stress, strain, and applied K and J values. The data were sent electronically to the Organizing Committee, who assembled the results into a comparative data base using a special-purpose computer program. A comparative assessment and discussion of the analysis results are presented in the report. Generally, structural responses of the test specimens were predicted with tolerable scatter bands. (orig./DG)

  20. Whole genome and transcriptome analyses of environmental antibiotic sensitive and multi-resistant Pseudomonas aeruginosa isolates exposed to waste water and tap water

    OpenAIRE

    Schwartz, Thomas; Armant, Olivier; Bretschneider, Nancy; Hahn, Alexander; Kirchen, Silke; Seifert, Martin; Dötsch, Andreas

    2014-01-01

    The fitness of sensitive and resistant P seudomonas aeruginosa in different aquatic environments depends on genetic capacities and transcriptional regulation. Therefore, an antibiotic-sensitive isolate PA30 and a multi-resistant isolate PA49 originating from waste waters were compared via whole genome and transcriptome Illumina sequencing after exposure to municipal waste water and tap water. A number of different genomic islands (e.g. PAGIs, PAPIs) were identified in the two environmental is...

  1. Blood-based gene expression signatures of medication-free outpatients with major depressive disorder: integrative genome-wide and candidate gene analyses

    OpenAIRE

    Hiroaki Hori; Daimei Sasayama; Toshiya Teraishi; Noriko Yamamoto; Seiji Nakamura; Miho Ota; Kotaro Hattori; Yoshiharu Kim; Teruhiko Higuchi; Hiroshi Kunugi

    2016-01-01

    Several microarray-based studies have investigated gene expression profiles in major depressive disorder (MDD), yet with highly variable findings. We examined blood-based genome-wide expression signatures of MDD, focusing on molecular pathways and networks underlying differentially expressed genes (DEGs) and behaviours of hypothesis-driven, evidence-based candidate genes for depression. Agilent human whole-genome arrays were used to measure gene expression in 14 medication-free outpatients wi...

  2. Genome-Wide Comparative Analyses Reveal the Dynamic Evolution of Nucleotide-Binding Leucine-Rich Repeat Gene Family among Solanaceae Plants

    Science.gov (United States)

    Seo, Eunyoung; Kim, Seungill; Yeom, Seon-In; Choi, Doil

    2016-01-01

    Plants have evolved an elaborate innate immune system against invading pathogens. Within this system, intracellular nucleotide-binding leucine-rich repeat (NLR) immune receptors are known play critical roles in effector-triggered immunity (ETI) plant defense. We performed genome-wide identification and classification of NLR-coding sequences from the genomes of pepper, tomato, and potato using fixed criteria. We then compared genomic duplication and evolution features. We identified intact 267, 443, and 755 NLR-encoding genes in tomato, potato, and pepper genomes, respectively. Phylogenetic analysis and classification of Solanaceae NLRs revealed that the majority of NLR super family members fell into 14 subgroups, including a TIR-NLR (TNL) subgroup and 13 non-TNL subgroups. Specific subgroups have expanded in each genome, with the expansion in pepper showing subgroup-specific physical clusters. Comparative analysis of duplications showed distinct duplication patterns within pepper and among Solanaceae plants suggesting subgroup- or species-specific gene duplication events after speciation, resulting in divergent evolution. Taken together, genome-wide analysis of NLR family members provide insights into their evolutionary history in Solanaceae. These findings also provide important foundational knowledge for understanding NLR evolution and will empower broader characterization of disease resistance genes to be used for crop breeding.

  3. Genomic structure of metabotropic glutamate receptor 7 and comparison of genomic structures of extracellular domains of mGluR family

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    Metabotropic glutamate receptor 7, coupled with a chemical neurotransmitter L-glutamate, plays an important role in the development of many psychiatric and neurological disorders. To study the biological and genetic mechanism of the mGluR7-related diseases, a physical map covering the full-length mGluR7 genomic sequence has been constructed through seed clone screening and fingerprinting database searching. These BAC clones in the physical map have been sequenced with shotgun strategy and assembled by Phred-Phrap-Consed software; the error rate of the final genomic sequence is less than 0.01%. mGluR7 spans 880 kb genomic region, the GC content and repeat content of mGluR7 genomic sequence are 38% and 37.5% respectively. mGluR7 has a typical "house-keeping" promoter and consists of 11 exons, with introns ranging from 6 kb to 285 kb. mGluR7a and mGluR7b are two known alternatively splicing variants. Comparing the genomic structures of extracellular domains of mGluR family, their genomic structures can be subdivided into three groups, which are consistent with that of proteins. Although the genomic organization of mGluR7's group is conserved, the majority of introns in the extracellular segments vary dramatically. It is an obvious trend of the increasing intron size inverse proportion to phylogenetic time. Variation of genomic structure is higher than that of protein, which is attributed to the species characteristic regulation of gene expression.

  4. Comparison Based on Exergetic Analyses of Two Hot Air Engines: A Gamma Type Stirling Engine and an Open Joule Cycle Ericsson Engine

    OpenAIRE

    Houda Hachem; Marie Creyx; Ramla Gheith; Eric Delacourt; Céline Morin; Fethi Aloui; Sassi Ben Nasrallah

    2015-01-01

    In this paper, a comparison of exergetic models between two hot air engines (a Gamma type Stirling prototype having a maximum output mechanical power of 500 W and an Ericsson hot air engine with a maximum power of 300 W) is made. Referring to previous energetic analyses, exergetic models are set up in order to quantify the exergy destruction and efficiencies in each type of engine. The repartition of the exergy fluxes in each part of the two engines are determined and represented in Sankey di...

  5. Benchmark analyses of sodium convection in the upper plenum of the MONJU reactor vessel - Comparison between plant system analysis code CERES and CFD code -

    International Nuclear Information System (INIS)

    In the CRP of IAEA, the data of the upper plenum geometry of the prototype FBR“MONJU” and the boundary conditions of the plant trip test were provided by JAEA. A plant system analysis code CERES for FBRs was developed by CRIEPI. To verify the CERES code, analyses had been performed for the system test of the MONJU, the results of which showed good agreement with the test. However, the difficulty of accurately reproducing the temperature variation arising from a complex flow in the upper plenum was identified. By using the general-purpose analysis code STAR-CCM+, detailed analysis in the upper plenum was enabled. Based on comparison between analyses of the CERES and STAR-CCM+ codes, parameters that had to be considered to simulate the flow pattern appropriately for plant system analysis codes were discussed. And, the analysis capability of CERES code with appropriate parameter was able to be confirmed. (author)

  6. Development of a chemical structure comparison method for integrated analysis of chemical and genomic information in the metabolic pathways.

    Science.gov (United States)

    Hattori, Masahiro; Okuno, Yasushi; Goto, Susumu; Kanehisa, Minoru

    2003-10-01

    Cellular functions result from intricate networks of molecular interactions, which involve not only proteins and nucleic acids but also small chemical compounds. Here we present an efficient algorithm for comparing two chemical structures of compounds, where the chemical structure is treated as a graph consisting of atoms as nodes and covalent bonds as edges. On the basis of the concept of functional groups, 68 atom types (node types) are defined for carbon, nitrogen, oxygen, and other atomic species with different environments, which has enabled detection of biochemically meaningful features. Maximal common subgraphs of two graphs can be found by searching for maximal cliques in the association graph, and we have introduced heuristics to accelerate the clique finding and to detect optimal local matches (simply connected common subgraphs). Our procedure was applied to the comparison and clustering of 9383 compounds, mostly metabolic compounds, in the KEGG/LIGAND database. The largest clusters of similar compounds were related to carbohydrates, and the clusters corresponded well to the categorization of pathways as represented by the KEGG pathway map numbers. When each pathway map was examined in more detail, finer clusters could be identified corresponding to subpathways or pathway modules containing continuous sets of reaction steps. Furthermore, it was found that the pathway modules identified by similar compound structures sometimes overlap with the pathway modules identified by genomic contexts, namely, by operon structures of enzyme genes. PMID:14505407

  7. Phylogeny and Taxonomy of Archaea: A Comparison of the Whole-Genome-Based CVTree Approach with 16S rRNA Sequence Analysis

    Directory of Open Access Journals (Sweden)

    Guanghong Zuo

    2015-03-01

    Full Text Available A tripartite comparison of Archaea phylogeny and taxonomy at and above the rank order is reported: (1 the whole-genome-based and alignment-free CVTree using 179 genomes; (2 the 16S rRNA analysis exemplified by the All-Species Living Tree with 366 archaeal sequences; and (3 the Second Edition of Bergey’s Manual of Systematic Bacteriology complemented by some current literature. A high degree of agreement is reached at these ranks. From the newly proposed archaeal phyla, Korarchaeota, Thaumarchaeota, Nanoarchaeota and Aigarchaeota, to the recent suggestion to divide the class Halobacteria into three orders, all gain substantial support from CVTree. In addition, the CVTree helped to determine the taxonomic position of some newly sequenced genomes without proper lineage information. A few discrepancies between the CVTree and the 16S rRNA approaches call for further investigation.

  8. Statistical Analyses of Scatterplots to Identify Important Factors in Large-Scale Simulations, 1: Review and Comparison of Techniques

    Energy Technology Data Exchange (ETDEWEB)

    Kleijnen, J.P.C.; Helton, J.C.

    1999-03-24

    The robustness of procedures for identifying patterns in scatterplots generated in Monte Carlo sensitivity analyses is investigated. These procedures are based on attempts to detect increasingly complex patterns in the scatterplots under consideration and involve the identification of (i) linear relationships with correlation coefficients, (ii) monotonic relationships with rank correlation coefficients, (iii) trends in central tendency as defined by means, medians and the Kruskal-Wallis statistic, (iv) trends in variability as defined by variances and interquartile ranges, and (v) deviations from randomness as defined by the chi-square statistic. The following two topics related to the robustness of these procedures are considered for a sequence of example analyses with a large model for two-phase fluid flow: the presence of Type I and Type II errors, and the stability of results obtained with independent Latin hypercube samples. Observations from analysis include: (i) Type I errors are unavoidable, (ii) Type 11errors can occur when inappropriate analysis procedures are used, (iii) physical explanations should always be sought for why statistical procedures identify variables as being important, and (iv) the identification of important variables tends to be stable for independent Latin hypercube samples.

  9. The Complete Mitochondrial Genome of Mindarus keteleerifoliae (Insecta: Hemiptera: Aphididae and Comparison with Other Aphididae Insects

    Directory of Open Access Journals (Sweden)

    Yuan Wang

    2015-12-01

    Full Text Available The mitogenome of Mindarus keteleerifoliae Zhang (Hemiptera: Aphididae is a 15,199 bp circular molecule. The gene order and orientation of M. keteleerifoliae is similarly arranged to that of the ancestral insect of other aphid mitogenomes, and, a tRNA isomerism event maybe identified in the mitogenome of M. keteleerifoliae. The tRNA-Trp gene is coded in the J-strand and the same sequence in the N-strand codes for the tRNA-Ser gene. A similar phenomenon was also found in the mitogenome of Eriosoma lanigerum. However, whether tRNA isomers in aphids exist requires further study. Phylogenetic analyses, using all available protein-coding genes, support Mindarinae as the basal position of Aphididae. Two tribes of Aphidinae were recovered with high statistical significance. Characteristics of the M. keteleerifoliae mitogenome revealed distinct mitogenome structures and provided abundant phylogenetic signals, thus advancing our understanding of insect mitogenomic architecture and evolution. But, because only eight complete aphid mitogenomes, including M. keteleerifoliae, were published, future studies with larger taxon sampling sizes are necessary.

  10. Comparison of Salmonella enterica Serovar Typhimurium LT2 and Non-LT2 Salmonella Genomic Sequences, and Genotyping of Salmonellae by Using PCR†

    OpenAIRE

    Kim, Hyun-Joong; Park, Si-Hong; Kim, Hae-Yeong

    2006-01-01

    Genes of Salmonella enterica serovar Typhimurium LT2 expected to be specifically present in Salmonella were selected using the Basic Local Alignment Search Tool (BLAST) program. The 152 selected genes were compared with 11 genomic sequences of Salmonella serovars, including Salmonella enterica subsp. I and IIIb and Salmonella bongori (V), and were clustered into 17 groups by their comparison patterns. A total of 38 primer pairs were constructed to represent each of the 17 groups, and PCR was ...

  11. Comparison of whole genome sequencing typing results and epidemiological contact information from outbreaks of Salmonella Dublin in Swedish cattle herds

    Science.gov (United States)

    Ågren, Estelle C. C.; Wahlström, Helene; Vesterlund-Carlson, Catrin; Lahti, Elina; Melin, Lennart; Söderlund, Robert

    2016-01-01

    Background Whole genome sequencing (WGS) is becoming a routine tool for infectious disease outbreak investigations. The Swedish situation provides an excellent opportunity to test the usefulness of WGS for investigation of outbreaks with Salmonella Dublin (S. Dublin) as epidemiological investigations are always performed when Salmonella is detected in livestock production, and index isolates from all detected herds are stored and therefore available for analysis. This study was performed to evaluate WGS as a tool in forward and backward tracings from herds infected with S. Dublin. Material and methods In this study, 28 isolates from 26 cattle herds were analysed and the WGS results were compared with results from the epidemiological investigations, for example, information on contacts between herds. The isolates originated from herds in three different outbreaks separated geographically and to some extent also in time, and from the only region in Sweden where S. Dublin is endemic (Öland). Results The WGS results of isolates from the three non-endemic regions were reliably separated from each other and from the endemic isolates. Within the outbreaks, herds with known epidemiological contacts generally showed smaller differences between isolates as compared to when there were no known epidemiological contacts. Conclusion The results indicate that WGS can provide valuable supplemental information in S. Dublin outbreak investigations. The resolution of the WGS was sufficient to distinguish isolates from the different outbreaks and provided additional information to the investigations within an outbreak. PMID:27396609

  12. Pharmacological Analyses of Protein Kinases Regulating Egg Maturation in Marine Nemertean Worms: A Review and Comparison with Mammalian Eggs

    Directory of Open Access Journals (Sweden)

    Alicia Marquardt

    2010-08-01

    Full Text Available For development to proceed normally, animal eggs must undergo a maturation process that ultimately depends on phosphorylations of key regulatory proteins. To analyze the kinases that mediate these phosphorylations, eggs of marine nemertean worms have been treated with pharmacological modulators of intracellular signaling pathways and subsequently probed with immunoblots employing phospho-specific antibodies. This article both reviews such analyses and compares them with those conducted on mammals, while focusing on how egg maturation in nemerteans is affected by signaling pathways involving cAMP, mitogen-activated protein kinases, Src-family kinases, protein kinase C isotypes, AMP-activated kinase, and the Cdc2 kinase of maturation-promoting factor.

  13. A comparison between advanced time–frequency analyses of non-stationary magnetization dynamics in spin-torque oscillators

    Energy Technology Data Exchange (ETDEWEB)

    Siracusano, Giulio, E-mail: giuliosiracusano@gmail.com [Department of Electronic Engineering, Industrial Chemistry and Engineering, University of Messina, C. di Dio, S. Agata, 98166 Messina (Italy); Corte, Aurelio La [Department of Electrical, Electronic and Informatic Engineering, University of Catania, Viale Andrea Doria, 6-95125 Catania (Italy)

    2014-02-15

    We report re'sults of different time–frequency analyses (Wavelet and Hilbert–Huang Transform (HHT)) of voltage measurements related to a spin-torque oscillator working in a regime of non-stationary dynamics. Our results indicate that the Wavelet analysis identifies the non-stationary magnetization dynamics revealing the existence of intermittent and independent excited modes while the HHT is able to accurately extract the time domain traces of each independent mode. Overall performance indicates a route for a complete characterization of time–frequency domain data of a STO, pointing out that the combined Wavelet-HHT methodology developed is general and can be also used for a variety of other different scenarios.

  14. COUPLING EFFECTS FOR CELL-TRUSS SPAR PLATFORM: COMPARISON OF FREQUENCY- AND TIME-DOMAIN ANALYSES WITH MODEL TESTS

    Institute of Scientific and Technical Information of China (English)

    ZHANG Fan; YANG Jian-min; LI Run-pei; CHEN Gang

    2008-01-01

    For the floating structures in deepwater, the coupling effects of the mooring lines and risers on the motion responses of the structures become increasingly significant. Viscous damping, inertial mass, current loading and restoring, etc. from these slender structures should be carefully handled to accurately predict the motion responses and line tensions. For the spar platforms, coupling the mooring system and riser with the vessel motion typically results in a reduction in extreme motion responses. This article presents numerical simulations and model tests on a new cell-truss spar platform in the State Key Laboratory of Ocean Engineering in Shanghai Jiaotong University. Results from three calculation methods, including frequency-domain analysis, time-domain semi-coupled and fully-coupled analyses, were compared with the experimental data to find the applicability of different approaches. Proposals for the improvement of numerical calculations and experimental technique were tabled as well.

  15. Comparison of functional proteomic analyses of human breast cancer cell lines T47D and MCF7.

    Directory of Open Access Journals (Sweden)

    Juliette Adjo Aka

    Full Text Available T47D and MCF7 are two human hormone-dependent breast cancer cell lines which are widely used as experimental models for in vitro and in vivo (tumor xenografts breast cancer studies. Several proteins involved in cancer development were identified in these cell lines by proteomic analyses. Although these studies reported the proteomic profiles of each cell line, until now, their differential protein expression profiles have not been established. Here, we used two-dimensional gel and mass spectrometry analyses to compare the proteomic profiles of the two cell lines, T47D and MCF7. Our data revealed that more than 164 proteins are differentially expressed between them. According to their biological functions, the results showed that proteins involved in cell growth stimulation, anti-apoptosis mechanisms and cancerogenesis are more strongly expressed in T47D than in MCF7. These proteins include G1/S-specific cyclin-D3 and prohibitin. Proteins implicated in transcription repression and apoptosis regulation, including transcriptional repressor NF-X1, nitrilase homolog 2 and interleukin-10, are, on the contrary, more strongly expressed in MCF7 as compared to T47D. Five proteins that were previously described as breast cancer biomarkers, namely cathepsin D, cathepsin B, protein S100-A14, heat shock protein beta-1 (HSP27 and proliferating cell nuclear antigen (PCNA, are found to be differentially expressed in the two cell lines. A list of differentially expressed proteins between T47D and MCF7 was generated, providing useful information for further studies of breast cancer mechanisms with these cell lines as models.

  16. Analyses and interpretation of whole-genome gene expression from formalin-fixed paraffin-embedded tissue: an illustration with breast cancer tissues

    OpenAIRE

    Argos Maria; Paul-Brutus Rachelle M; Roy Shantanu; Jasmine Farzana; Kibriya Muhammad G; Ahsan Habibul

    2010-01-01

    Abstract Background We evaluated (a) the feasibility of whole genome cDNA-mediated Annealing, Selection, extension and Ligation (DASL) assay on formalin-fixed paraffin-embedded (FFPE) tissue and (b) whether similar conclusions can be drawn by examining FFPE samples as proxies for fresh frozen (FF) tissues. We used a whole genome DASL assay (addressing 18,391 genes) on a total of 72 samples from paired breast tumor and surrounding healthy tissues from both FF and FFPE samples. Results Gene det...

  17. Genome and secretome analyses provide insights into keratin decomposition by novel proteases from the non-pathogenic fungus Onygena corvina

    DEFF Research Database (Denmark)

    Huang, Yuhong; Kamp Busk, Peter; Herbst, Florian Alexander;

    2015-01-01

    proteases secreted by O. corvina are interesting in view of their potential relevance for industrial decomposition of keratinaceous wastes. We sequenced and assembled the genome of O. corvina and used a method called peptide pattern recognition to identify 73 different proteases. Comparative genome analysis...... of proteases in keratin-degrading and non-keratin-degrading fungi indicated that 18 putative secreted proteases from four protease families (M36, M35, M43, and S8) may be responsible for keratin decomposition. Twelve of the 18 predicted protease genes could be amplified from O. corvina grown on...

  18. The chromosome damage induced by x-ray radiation doses. Comparison between dicentric chromosomes, micronuclei and Sister Chromatid Exchanges analyses

    International Nuclear Information System (INIS)

    Exposure to ionizing radiations is a well-known source of chromosome damage. Here we present a comparison among three different methodologies employed to recognize cytogenetic damage, after an acute exposure of human lymphocytes to 3 Gy of X-rays (100kVp). Scoring of dicentric chromosomes, present in first mitosis ''in vitro'', was the method of preference as dicentrics increased 937.5 times with respect to background. Micronucleus scoring in binucleated-cytokinesis blocked cells showed an increase of 32.5 times, while it was only of 1.46 times when Sister Chromatid Exchanges (SCEs) were analyzed. The estimated probability of an acentric fragment becoming a micronucleus was around 0.25. Intercellular distribution of dicentrics agree with Poisson, while micronucleus were overdispersed. When analyzed at second cycle after damage induction, the dicentrics yield as well as the level of cells with unstable cromosome aberrations, decreased around a half. Finally, SCEs level was similar in cells with or without unstable structural chromosome aberrations. (Author)

  19. Genomic Comparative Study of Bovine Mastitis Escherichia coli

    OpenAIRE

    Kempf, Florent; Slugocki, Cindy; Blum, Shlomo E.; Leitner, Gabriel; Germon, Pierre

    2016-01-01

    Escherichia coli, one of the main causative agents of bovine mastitis, is responsible for significant losses on dairy farms. In order to better understand the pathogenicity of E. coli mastitis, an accurate characterization of E. coli strains isolated from mastitis cases is required. By using phylogenetic analyses and whole genome comparison of 5 currently available mastitis E. coli genome sequences, we searched for genotypic traits specific for mastitis isolates. Our data confirm that there i...

  20. Comparison of soil amendments to decrease high strength in SE USA Coastal Plain soils using fuzzy decision-making analyses

    Directory of Open Access Journals (Sweden)

    D. Kurtener

    2007-09-01

    Full Text Available Cemented subsurface layers restrict root growth in many southeastern USA Coastal Plain soils. Though cementa- tion is usually reduced by tillage, soil amendments can offer a more permanent solution if they develop aggregation. To increase aggregation, we amended 450 g of a Norfolk soil blend of 90% E horizon (the hard layer and 10% Ap horizon with 0 or 6.44 g kg-1 ground wheat (Triticum aestivum L. residue and 0, 30, or 120 mg kg-1 polyacrylamide (PAM, 12 x 106 Da anionic, linear, and 35% charge density. During a 60-d incubation, parameters measured included water added to maintain 10% soil moisture, soil strength, bulk density, and aggregation. Data were analyzed using a cost-benefit approach with normalized fuzzy logic indicators. Analyses included building normalized decision matrices, calculating weighting vectors, ranking alternatives, and defining the best alternatives. When only physical parameters were analyzed using fuzzy logic indicators, addition of wheat residue with 30 mg kg-1 PAM proved to be the best alternative whereas wheat residue with 120 mg kg-1 PAM had been selected as the best alternative with analysis of variance because it did not simultaneously analyze all variables. When both physical and economic parameters were included, the best alternative was the treatment with wheat residue and 120 mg kg-1 PAM. When using fuzzy logic, judgment of the user was needed to determine which parameters to include and how to weight them.

  1. Comparison of the bending performance of solid and cannulated spinal pedicle screws using finite element analyses and biomechanical tests.

    Science.gov (United States)

    Shih, Kao-Shang; Hsu, Ching-Chi; Hou, Sheng-Mou; Yu, Shan-Chuen; Liaw, Chen-Kun

    2015-09-01

    Spinal pedicle screw fixations have been used extensively to treat fracture, tumor, infection, or degeneration of the spine. Cannulated spinal pedicle screws with bone cement augmentation might be a useful method to ameliorate screw loosening. However, cannulated spinal pedicle screws might also increase the risk of screw breakage. Thus, the purpose of this study was to investigate the bending performance of different spinal pedicle screws with either solid design or cannulated design. Three-dimensional finite element models, which consisted of the spinal pedicle screw and the screw's hosting material, were first constructed. Next, monotonic and cyclic cantilever bending tests were both applied to validate the results of the finite element analyses. Finally, both the numerical and experimental approaches were evaluated and compared. The results indicated that the cylindrical spinal pedicle screws with a cannulated design had significantly poorer bending performance. In addition, conical spinal pedicle screws maintained the original bending performance, whether they were solid or of cannulated design. This study may provide useful recommendations to orthopedic surgeons before surgery, and it may also provide design rationales to biomechanical engineers during the development of spinal pedicle screws. PMID:26208430

  2. Falsire: CSNI project for fracture analyses of large-scale international reference experiments (Phase 1). Comparison report

    International Nuclear Information System (INIS)

    A summary of the recently completed Phase I of the Project for Fracture Analysis of Large-Scale International Reference Experiments (Project FALSIRE) is presented. Project FALSIRE was created by the Fracture Assessment Group (FAG) of Principal Working Group No. 3 (PWG/3) of the OECD/NEA Committee on the Safety of Nuclear Installations (CSNI), formed to evaluate fracture prediction capabilities currently used in safety assessments of nuclear vessel components. The aim of the Project FALSIRE was to assess various fracture methodologies through interpretive analyses of selected large-scale fracture experiments. The six experiments used in Project FALSIRE (performed in the Federal Republic of Germany, Japan, the United Kingdom, and the U.S.A.) were designed to examine various aspects of crack growth in reactor pressure vessel (RPV) steels under pressurized-thermal-shock (PTS) loading conditions. The analysis techniques employed by the participants included engineering and finite-element methods, which were combined with Jr fracture methodology and the French local approach. For each experiment, analysis results provided estimates of variables such as crack growth, crack-mouth-opening displacement, temperature, stress, strain, and applied J and K values. A comparative assessment and discussion of the analysis results are presented; also, the current status of the entire results data base is summarized. Some conclusions concerning predictive capabilities of selected ductile fracture methodologies, as applied to RPVs subjected to PTS loading, are given, and recommendations for future development of fracture methodologies are made

  3. Comparison of liquid-liquid extraction and solid phase extraction for manganese in water analysed by neutron activation analysis

    International Nuclear Information System (INIS)

    Full text: The concentrations of total manganese in most natural water systems are in the range of 0.001 to 1.0 mgl-1. The maximum contaminant level (MCL) of manganese in drinking water as recommended by US Environmental Protection Agency (EPA) is 0.05 mg/l. Analytical methods capable of measuring the low level of manganese are necessary for evaluating the quality of natural water. Neutron activation analysis (NAA) is one of the most sensitive techniques for the determination of trace elements. However, direct application of neutron activation for analysis of trace elements in a complex system such as natural waters is generally difficult because of matrix interference. Preconcentration and/or matrix separation procedures are often required before irradiation to eliminate such interferences. In this study two methods based on solid phase extraction (SPE) and liquid-liquid extraction (LLE) has been developed for the extraction of manganese in water prior to irradiation. Experimental parameters such as effect of pH, type and volume of the chelating agent and flow rate were studied and optimized. Analytical parameters such as linearity, precision, accuracy, detection and quantitation limits, and matrix effects for SPE and LLE methods were evaluated for comparison purposes with the aim of selecting the most appropriate depending on the high recoveries and lower detection capabilities required. Both methods can be applied to real samples and give the same results, but SPE allows the high recovery of 99.8 % of manganese with lower detection limit of 0.001 μgl-1 as compared to LLE (90.5 % of manganese recovery with lower detection limit of 0.73 μgl-1). Furthermore, the SPE is easily used compared with LLE and not time consuming which allows analysis of a large number of samples. (author)

  4. Genomic Analyses Reveal Global Functional Alterations That Promote Tumor Growth and Novel Tumor Suppressor Genes in Natural Killer-Cell Malignancies

    DEFF Research Database (Denmark)

    Kucuk, Can; Iqbal, Javeed; J. deLeeuw, Ronald;

    Background: Natural Killer (NK)-cell lymphomas/leukemias (NKL) account for 1-2 % of all non-Hodgkin lymphomas. Although the incidence of NKL is relatively low, the clinical course of these lymphomas is highly aggressive. To elucidate the recurrent genomic abnormalities and the associated changes ...

  5. Genome and transcriptome analyses reveal that MAPK- and phosphatidylinositol-signaling pathways mediate tolerance to 5-hydroxymethyl-2-furaldehyde for industrial yeast Saccharomyces cerevisiae

    Science.gov (United States)

    The industrial ethanologenic yeast Saccharomyces cerevisiae is a promising biocatalyst for next-generation advanced biofuels applications including lignocellulose-to-ethanol conversion. Here we present the first insight into the genomic background of NRRL Y-12632, a type strain from a worldwide coll...

  6. Inter-laboratory comparison of X-ray fluorescence analyses of eruptive products of El Chichón Volcano, Chiapas, Mexico

    Science.gov (United States)

    Tilling, Robert I.; Bornhorst, Theodore J.; Taggart, Joseph E., Jr.; Rose, William I., Jr.; McGee, James J.

    1987-01-01

    An inter-laboratory comparison has been made of X-ray fluorescence analyses of 10 samples of lava and pumices from El Chichón Volcano, Chiapas, Mexico. Some determinations of major-element constituents agree within analytical uncertainty, whereas others exchibit significant bias. Analyses carried out at the Michigan Technological University (MTU) laboratory are systematically lower in MgO (26–48%), Fetotal(5–18%), CaO (4–15%) and higher in K2O (0–15%) than analyses made at the U.S. Geological Survey (USGS) Denver laboratory. These differences are ascribed in part to a complex combination of calibration assumptionsand mineralogical and particle-size effects inherent in the use of pressed rock-powder pellets in the analytical procedure of the MTU laboratory. Other, but as yet unknown, differences in sample preparation and/or analytical technique may also be important; effects related to natural sample inhomogeneityare believed to be insignificant. The inter-laboratory differences in the analytical data complicated accurate assessment of whether El Chichón magmas have changed composition during the past 300 000 a. Knowledge of such change is needed for understanding petrogenetic history and for such related studies as evaluation of volcanic hazards.

  7. Beyond genomic variation - comparison and functional annotation in three Brassica rapa genotypes: a turnip, a rapid cycling and a Chinese cabbage

    OpenAIRE

    Lin, K.; Zhang, N.; Severing, E.I.; Nijveen, H.; Cheng, F; Visser, R.G.F.; Wang, X.; De, Ridder, G.; Bonnema, A.B.

    2014-01-01

    Background - Brassica rapa is an economically important crop species. During its long breeding history, a large number of morphotypes have been generated, including leafy vegetables such as Chinese cabbage and pakchoi, turnip tuber crops and oil crops. Results - To investigate the genetic variation underlying this morphological variation, we re-sequenced, assembled and annotated the genomes of two B. rapa subspecies, turnip crops (turnip) and a rapid cycling. We then analysed the two resultin...

  8. Characterization of breast lesion using double phase Tc-99m Tetrofosmin scintimammography: Comparison of visual and quantitative analyses

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Jung Sub [Medical Research Institute, Pusan National University, Busan (Korea, Republic of); Kim, Seong-Jang [Medical Research Institute, Pusan National University, Busan (Korea, Republic of) and Department of Nuclear Medicine, College of Medicine, Pusan National University, Busan (Korea, Republic of)]. E-mail: growthkim@daum.net; Kim, In-Ju [Medical Research Institute, Pusan National University, Busan (Korea, Republic of); Department of Nuclear Medicine, College of Medicine, Pusan National University, Busan (Korea, Republic of); Bae, Young-Tae [Medical Research Institute, Pusan National University, Busan (Korea, Republic of); Department of Surgery, College of Medicine, Pusan National University, Busan (Korea, Republic of); Kim, Yong-Ki [Medical Research Institute, Pusan National University, Busan (Korea, Republic of); Department of Nuclear Medicine, College of Medicine, Pusan National University, Busan (Korea, Republic of); Kang, Young-Seok [Medical Research Institute, Pusan National University, Busan (Korea, Republic of); Department of Plastic and Reconstructive Surgery, College of Medicine, Pusan National University, Busan (Korea, Republic of); Kim, Dong-Soo [Department of Nuclear Medicine, College of Medicine, Pusan National University, Busan (Korea, Republic of)

    2006-01-15

    To compare the diagnostic performances of visual and quantitative indices of double phase Tc-99m Tetrofosmin scintimammography (TF-SMM) for the detection of breast cancer. Methods: Double phase TF-SMM (early, 10 min; delayed, 3 h) were performed after injection of 925 MBq of Tc-99m Tetrofosmin in 75 highly suspected breast cancer patients (malignant: 49, benign: 26). For visual analysis, five scoring method was used. For quantitative analysis, early, delayed lesions to non-lesion ratios (L/Ns) and washout rate (%, WR) were calculated. Receiver operating characteristic curve (ROC) analyses were performed to determine the optimal visual grade, to calculate cut-off values of quantitative indices, and to compare visual and quantitative diagnostic performances. Results: When over grade 3 of visual grade was used as cut-off value in the defection of primary breast cancer, the sensitivity and specificity were 75.5 and 80.8%, respectively. The positive and negative predictive values were 88.1 and 63.6%, respectively. The area under curve was 0.824 (95% CI, 0.719-0.902) and standard error (S.E.) was 0.047. The optimal L/N ratios were 3.13 for early and 2.56 for delayed image. When early L/N 3.13 was used as cut-off point, the sensitivity and specificity of TF-SMM were 61.2 and 96.2%, respectively. The positive and negative predictive values were 96.8 and 56.8%, respectively. The AUC was 0.809 (95% CI, 0.702-0.890) and S.E. was 0.049. When delayed L/N 2.56 was used as cut-off value, the sensitivity and specificity were 46.9 and 96.2%, respectively. The positive and negative predictive values were 95.8 and 49%, respectively. The AUC was 0.741 (95% CI, 0.627-0.835) and S.E. was 0.057. No statistical differences between visual assessment and quantitative analysis of early image (difference between area, 0.015; S.E., 0.044; 95% CI, -0.072 to 0.102; p = 0.736) and delayed image (difference between area, 0.083; S.E., 0.054; 95% CI, -0.023 to 0.060; p = 0.189) was noted. However

  9. Understanding the differences between genome sequences of Escherichia coli B strains REL606 and BL21(DE3), and comparison of the closely related E. coli B and K-12 genomes

    Energy Technology Data Exchange (ETDEWEB)

    Studier, F.W.; Daegelen, P.; Lenski, R. E.; Maslov, S.; Kim, J. F.

    2009-12-01

    Each difference between the genome sequences of Escherichia coli B strains REL606 and BL21(DE3) can be interpreted in light of known laboratory manipulations plus a gene conversion between ribosomal RNA operons. Two treatments with 1-methyl-3-nitro-1-nitrosoguanidine in the REL606 lineage produced at least 93 single-base-pair mutations ({approx} 90% GC-to-AT transitions) and 3 single-base-pair GC deletions. Two UV treatments in the BL21(DE3) lineage produced only 4 single-base-pair mutations but 16 large deletions. P1 transductions from K-12 into the two B lineages produced 317 single-base-pair differences and 9 insertions or deletions, reflecting differences between B DNA in BL21(DE3) and integrated restriction fragments of K-12 DNA inherited by REL606. Two sites showed selective enrichment of spontaneous mutations. No unselected spontaneous single-base-pair mutations were evident. The genome sequences revealed that a progenitor of REL606 had been misidentified, explaining initially perplexing differences. Limited sequencing of other B strains defined characteristic properties of B and allowed assembly of the inferred genome of the ancestral B of Delbrueck and Luria. Comparison of the B and K-12 genomes shows that more than half of the 3793 proteins of their basic genomes are predicted to be identical, although {approx} 310 appear to be functional in either B or K-12 but not in both. The ancestral basic genome appears to have had {approx} 4039 coding sequences occupying {approx} 4.0 Mbp. Repeated horizontal transfer from diverged Escherichia coli genomes and homologous recombination may explain the observed variable distribution of single-base-pair differences. Fifteen sites are occupied by phage-related elements, but only six by comparable elements at the same site. More than 50 sites are occupied by IS elements in both B and K, 16 in common, and likely founding IS elements are identified. A signature of widespread cryptic phage P4-type mobile elements was

  10. Multiple Comparison Analysis of Two New Genomic Sequences of ILTV Strains from China with Other Strains from Different Geographic Regions

    OpenAIRE

    Zhao, Yan; Kong, Congcong; Wang, Yunfeng

    2015-01-01

    To date, twenty complete genome sequences of ILTV strains have been published in GenBank, including one strain from China, and nineteen strains from Australian and the United States. To investigate the genomic information on ILTVs from different geographic regions, two additional individual complete genome sequences of WG and K317 strains from China were determined. The genomes of WG and K317 strains were 153,505 and 153,639 bp in length, respectively. Alignments performed on the amino acid s...

  11. Profile of muscle tissue gene expression specific to water buffalo: Comparison with domestic cattle by genome array.

    Science.gov (United States)

    Zhang, Yingying; Wang, Hongbao; Gui, Linsheng; Wang, Hongcheng; Mei, Chugang; Zhang, Yaran; Xu, Huaichao; Jia, Cunlin; Zan, Linsen

    2016-02-10

    In contrast with the past, the water buffalo is now not only a draft animal, but also an important food source of milk and meat. It is increasingly apparent that the water buffalo have huge potential for meat production, but its breeding needs to be investigated. Regarding the molecular mechanisms involved in the meat quality difference between the buffalo (Bubalus bulabis) and yellow cattle (Bos taurus), 12 chemical-physical characteristics related to the meat quality of longissimus thoracis muscles (LTM) have been compared at the age of 36 months. Intramuscular lipid and b* (yellowness) were greater in cattle than the buffalo, whereas a* (redness) was greater in the buffalo. Gene expression profiles were constructed by bovine genome array. A total of 8884 and 10,960 probes were detected in buffalo and cattle, respectively, with 1580 genes being differentially expressed. Over 400 probes were upregulated and nearly 1200 were downregulated in LTM of the buffalo, most being involved in ribosomal RNA (rRNA) processing, cholesterol homeostasis, regulation of transcription, response to hypoxia, and glycolysis. Quantitative real-time PCR was used to validate the microarray data. Enriched GO analyses of highly expressed genes in LTM showed that protein biosynthesis, striated muscle contraction, iron homeostasis, iron transport, glycolysis and glucose metabolism were similar between the buffalo and cattle. High protein content, low fat content and deep meat color of buffalo LTM may be closely associated with the increased expression of genes involved in cholesterol and iron homeostasis, while also reducing the expression of genes involved in ubiquitin-mediated proteolysis and protein oxidative phosphorylation. These results establish the groundwork for further studies on buffalo meat quality and will be beneficial in improving water buffalo breeding by molecular biotechnology. PMID:26598327

  12. OrthoVenn: a web server for genome wide comparison and annotation of orthologous clusters across multiple species

    Science.gov (United States)

    Genome wide analysis of orthologous clusters is an important component of comparative genomics studies. Identifying the overlap among orthologous clusters can enable us to elucidate the function and evolution of proteins across multiple species. Here, we report a web platform named OrthoVenn that i...

  13. Detection of genetic variation affecting milk coagulation properties in Danish Holstein dairy cattle by analyses of pooled whole-genome sequences from phenotypically extreme samples (pool-seq)

    DEFF Research Database (Denmark)

    Bertelsen, H P; Gregersen, V R; Poulsen, Nina Aagaard;

    2016-01-01

    differences between pooled whole-genome sequences of phenotypically extreme samples (pool-seq).. Curd-firming rate and raw milk pH were measured for 415 Danish Holstein cows, and each animal was sequenced at low coverage. Pools were created containing whole genome sequence reads from samples with "extreme...... located on chromosome 6. A total of 9 significant SNP, which were selected based on the possible function of proximal candidate genes, were genotyped in the entire sample set ( = 415) to test for an association. The most significant SNP was located proximal to , explaining 33% of the phenotypic variance....... , coding for κ-casein, is the most studied in relation to milk coagulation due to its position on the surface of the casein micelles and the direct involvement in milk coagulation. Three additional SNP located on chromosome 6 showed significant associations explaining 7, 3.6, and 1.3% of the phenotypic...

  14. Analyses of Twelve New Whole Genome Sequences of Cassava Brown Streak Viruses and Ugandan Cassava Brown Streak Viruses from East Africa: Diversity, Supercomputing and Evidence for Further Speciation.

    Directory of Open Access Journals (Sweden)

    Joseph Ndunguru

    Full Text Available Cassava brown streak disease is caused by two devastating viruses, Cassava brown streak virus (CBSV and Ugandan cassava brown streak virus (UCBSV which are frequently found infecting cassava, one of sub-Saharan Africa's most important staple food crops. Each year these viruses cause losses of up to $100 million USD and can leave entire families without their primary food source, for an entire year. Twelve new whole genomes, including seven of CBSV and five of UCBSV were uncovered in this research, doubling the genomic sequences available in the public domain for these viruses. These new sequences disprove the assumption that the viruses are limited by agro-ecological zones, show that current diagnostic primers are insufficient to provide confident diagnosis of these viruses and give rise to the possibility that there may be as many as four distinct species of virus. Utilizing NGS sequencing technologies and proper phylogenetic practices will rapidly increase the solution to sustainable cassava production.

  15. Analyses of Twelve New Whole Genome Sequences of Cassava Brown Streak Viruses and Ugandan Cassava Brown Streak Viruses from East Africa: Diversity, Supercomputing and Evidence for Further Speciation.

    Science.gov (United States)

    Ndunguru, Joseph; Sseruwagi, Peter; Tairo, Fred; Stomeo, Francesca; Maina, Solomon; Djikeng, Appolinaire; Djinkeng, Appolinaire; Kehoe, Monica; Boykin, Laura M

    2015-01-01

    Cassava brown streak disease is caused by two devastating viruses, Cassava brown streak virus (CBSV) and Ugandan cassava brown streak virus (UCBSV) which are frequently found infecting cassava, one of sub-Saharan Africa's most important staple food crops. Each year these viruses cause losses of up to $100 million USD and can leave entire families without their primary food source, for an entire year. Twelve new whole genomes, including seven of CBSV and five of UCBSV were uncovered in this research, doubling the genomic sequences available in the public domain for these viruses. These new sequences disprove the assumption that the viruses are limited by agro-ecological zones, show that current diagnostic primers are insufficient to provide confident diagnosis of these viruses and give rise to the possibility that there may be as many as four distinct species of virus. Utilizing NGS sequencing technologies and proper phylogenetic practices will rapidly increase the solution to sustainable cassava production. PMID:26439260

  16. Sex-specific population structure, natural selection, and linkage disequilibrium in a wild bird population as revealed by genome-wide microsatellite analyses

    Directory of Open Access Journals (Sweden)

    Merilä Juha

    2010-03-01

    Full Text Available Abstract Background Sexual dimorphism in ecologically important traits is widespread, yet the differences in the genomic architecture between the two sexes are largely unexplored. We employed a genome-wide multilocus approach to examine the sexual differences in population subdivision, natural selection and linkage disequilibrium (LD in a wild Siberian jay (Perisoreus infaustus population, using genotypes at a total of 107 autosomal and Z-chromosomal microsatellites. Results Mean observed heterozygosity was significantly higher in females (HO = 0.567 than in males (HO = 0.532, and autosomal markers (HO = 0.561 were more variable than Z-chromosomal markers (HO = 0.512. Genetic differentiation (FST = 0.002, P P FSTvalue for Z-chromosomal (-0.014, 95% CI: -0.025 - -0.011 than for the autosomal loci (0.003, 95% CI: 0.001 - 0.004. Analysis of syntenic marker pairs revealed high levels of LD in both sexes but significantly (P Conclusion We conclude that there are many clear differences in genomic architecture between the sexes studied here which can be at least partly understood in the light of higher dispersal rate of females as compared to males and the unusual structure of the Z-chromosome of the species.

  17. Genomic comparison of multi-drug resistant invasive and colonizing Acinetobacter baumannii isolated from diverse human body sites reveals genomic plasticity

    OpenAIRE

    Hsiao William W; Phillippy Adam M; Harris Anthony D; Johnson J Kristie; Sahl Jason W; Thom Kerri A; Rasko David A

    2011-01-01

    Abstract Background Acinetobacter baumannii has recently emerged as a significant global pathogen, with a surprisingly rapid acquisition of antibiotic resistance and spread within hospitals and health care institutions. This study examines the genomic content of three A. baumannii strains isolated from distinct body sites. Isolates from blood, peri-anal, and wound sources were examined in an attempt to identify genetic features that could be correlated to each isolation source. Results Pulsed...

  18. Genome-scale comparison of expanded gene families in Plasmodium ovale wallikeri and Plasmodium ovale curtisi with Plasmodium malariae and with other Plasmodium species

    KAUST Repository

    Ansari, Hifzur Rahman

    2016-07-05

    Malaria in humans is caused by six species of Plasmodium parasites, of which the nuclear genome sequences for the two Plasmodium ovale spp., P. ovale curtisi and P. ovale wallikeri, and Plasmodium malariae have not yet been analyzed. Here we present an analysis of the nuclear genome sequences of these three parasites, and describe gene family expansions therein. Plasmodium ovale curtisi and P. ovale wallikeri are genetically distinct but morphologically indistinguishable and have sympatric ranges through the tropics of Africa, Asia and Oceania. Both P. ovale spp. show expansion of the surfin variant gene family, and an amplification of the Plasmodium interspersed repeat (pir) superfamily which results in an approximately 30% increase in genome size. For comparison, we have also analyzed the draft nuclear genome of P. malariae, a malaria parasite causing mild malaria symptoms with a quartan life cycle, long-term chronic infections, and wide geographic distribution. Plasmodium malariae shows only a moderate level of expansion of pir genes, and unique expansions of a highly diverged transmembrane protein family with over 550 members and the gamete P25/27 gene family. The observed diversity in the P. ovale wallikeri and P. ovale curtisi surface antigens, combined with their phylogenetic separation, supports consideration that the two parasites be given species status.

  19. Complete mitochondrial genome of Red-bellied Macaw (Orthopsittaca manilata): its comparison with mitogenome of Blue-throated Macaw (Ara glaucogularis).

    Science.gov (United States)

    Urantowka, Adam Dawid

    2016-05-01

    The Red-bellied Macaw (Orthopsittaca manilata) is a species of the monotypic genus Orthopsittaca. The genus is one of the six genera, which form morphologically diverse group termed as Macaws. Individuals of Orthopsittaca manilata species are found in extremely large Amazonian area of South America. In this study, full mitochondrial genome of considered species was sequenced. It is 16,985 bp long and contains 13 protein-coding genes, 2 rRNAs, 22 tRNAs and a control region. Its comparison with published Blue-throated Macaw (Ara glaucogularis) mitogenome revealed their high degree of identity. Presented Orthopsittaca manilata mitogenome is the first complete genomic sequence of this genus. It will enrich the resource of molecular markers for future examination of evolutionary diversification of Macaws. It will be also indispensable to refine the phylogenetic relationships within the tribe Arini. PMID:25391028

  20. Complete mitochondrial genome of Blue-headed Macaw (Primolius couloni): its comparison with mitogenome of Blue-throated Macaw (Ara glaucogularis).

    Science.gov (United States)

    Urantowka, Adam Dawid

    2016-05-01

    Primolius is a genus of midsized Macaws comprising three species. Blue-headed Macaws (Primolius couloni) are native to eastern Peru, extreme western Brazil and north-western Bolivia. In this study, full mitochondrial genome of considered species was sequenced. It is 16,995 bp long and contains 13 protein-coding genes, 2 rRNAs, 22 tRNAs and a control region. Its comparison with published Blue-throated Macaw (Ara glaucogularis) mitogenome revealed their high degree of identity. Primolius couloni mitogenome is the first complete genomic sequence of this genus. It will be indispensable to refine the phylogenetic relationships within the tribe Arini and will enrich the resource of markers for systematic, phylogenetic and population genetic studies. PMID:25391030

  1. DNA lesions induced by UV A1 and B radiation in human cells: Comparative analyses in the overall genome and in the p53 tumor suppressor gene

    OpenAIRE

    Besaratinia, Ahmad; Synold, Timothy W.; Chen, Hsiu-Hua; Chang, Cheng; Xi, Bixin; Riggs, Arthur D.; Pfeifer, Gerd P.

    2005-01-01

    The UV components of sunlight (UVA and UVB) are implicated in the etiology of human skin cancer. The underlying mechanism of action for UVB carcinogenicity is well defined; however, the mechanistic involvement of UVA in carcinogenesis is not fully delineated. We investigated the genotoxicity of UVA1 versus UVB in the overall genome and in the p53 tumor suppressor gene in normal human skin fibroblasts. Immuno-dot blot analysis identified the cis-syn cyclobutane pyrimidine-dimer (CPD) as a dist...

  2. Analyses of Twelve New Whole Genome Sequences of Cassava Brown Streak Viruses and Ugandan Cassava Brown Streak Viruses from East Africa: Diversity, Supercomputing and Evidence for Further Speciation

    OpenAIRE

    Ndunguru, Joseph; Sseruwagi, Peter; Tairo, Fred; Stomeo, Francesca; Maina, Solomon; Djinkeng, Appolinaire; Kehoe, Monica; Boykin, Laura M

    2015-01-01

    Cassava brown streak disease is caused by two devastating viruses, Cassava brown streak virus (CBSV) and Ugandan cassava brown streak virus (UCBSV) which are frequently found infecting cassava, one of sub-Saharan Africa’s most important staple food crops. Each year these viruses cause losses of up to $100 million USD and can leave entire families without their primary food source, for an entire year. Twelve new whole genomes, including seven of CBSV and five of UCBSV were uncovered in this re...

  3. Integrated analysis of whole genome and transcriptome sequencing reveals diverse transcriptomic aberrations driven by somatic genomic changes in liver cancers.

    Directory of Open Access Journals (Sweden)

    Yuichi Shiraishi

    Full Text Available Recent studies applying high-throughput sequencing technologies have identified several recurrently mutated genes and pathways in multiple cancer genomes. However, transcriptional consequences from these genomic alterations in cancer genome remain unclear. In this study, we performed integrated and comparative analyses of whole genomes and transcriptomes of 22 hepatitis B virus (HBV-related hepatocellular carcinomas (HCCs and their matched controls. Comparison of whole genome sequence (WGS and RNA-Seq revealed much evidence that various types of genomic mutations triggered diverse transcriptional changes. Not only splice-site mutations, but also silent mutations in coding regions, deep intronic mutations and structural changes caused splicing aberrations. HBV integrations generated diverse patterns of virus-human fusion transcripts depending on affected gene, such as TERT, CDK15, FN1 and MLL4. Structural variations could drive over-expression of genes such as WNT ligands, with/without creating gene fusions. Furthermore, by taking account of genomic mutations causing transcriptional aberrations, we could improve the sensitivity of deleterious mutation detection in known cancer driver genes (TP53, AXIN1, ARID2, RPS6KA3, and identified recurrent disruptions in putative cancer driver genes such as HNF4A, CPS1, TSC1 and THRAP3 in HCCs. These findings indicate genomic alterations in cancer genome have diverse transcriptomic effects, and integrated analysis of WGS and RNA-Seq can facilitate the interpretation of a large number of genomic alterations detected in cancer genome.

  4. SNUGB: a versatile genome browser supporting comparative and functional fungal genomics

    Directory of Open Access Journals (Sweden)

    Kim Seungill

    2008-12-01

    Full Text Available Abstract Background Since the full genome sequences of Saccharomyces cerevisiae were released in 1996, genome sequences of over 90 fungal species have become publicly available. The heterogeneous formats of genome sequences archived in different sequencing centers hampered the integration of the data for efficient and comprehensive comparative analyses. The Comparative Fungal Genomics Platform (CFGP was developed to archive these data via a single standardized format that can support multifaceted and integrated analyses of the data. To facilitate efficient data visualization and utilization within and across species based on the architecture of CFGP and associated databases, a new genome browser was needed. Results The Seoul National University Genome Browser (SNUGB integrates various types of genomic information derived from 98 fungal/oomycete (137 datasets and 34 plant and animal (38 datasets species, graphically presents germane features and properties of each genome, and supports comparison between genomes. The SNUGB provides three different forms of the data presentation interface, including diagram, table, and text, and six different display options to support visualization and utilization of the stored information. Information for individual species can be quickly accessed via a new tool named the taxonomy browser. In addition, SNUGB offers four useful data annotation/analysis functions, including 'BLAST annotation.' The modular design of SNUGB makes its adoption to support other comparative genomic platforms easy and facilitates continuous expansion. Conclusion The SNUGB serves as a powerful platform supporting comparative and functional genomics within the fungal kingdom and also across other kingdoms. All data and functions are available at the web site http://genomebrowser.snu.ac.kr/.

  5. BPGA- an ultra-fast pan-genome analysis pipeline.

    Science.gov (United States)

    Chaudhari, Narendrakumar M; Gupta, Vinod Kumar; Dutta, Chitra

    2016-01-01

    Recent advances in ultra-high-throughput sequencing technology and metagenomics have led to a paradigm shift in microbial genomics from few genome comparisons to large-scale pan-genome studies at different scales of phylogenetic resolution. Pan-genome studies provide a framework for estimating the genomic diversity of the dataset, determining core (conserved), accessory (dispensable) and unique (strain-specific) gene pool of a species, tracing horizontal gene-flux across strains and providing insight into species evolution. The existing pan genome software tools suffer from various limitations like limited datasets, difficult installation/requirements, inadequate functional features etc. Here we present an ultra-fast computational pipeline BPGA (Bacterial Pan Genome Analysis tool) with seven functional modules. In addition to the routine pan genome analyses, BPGA introduces a number of novel features for downstream analyses like core/pan/MLST (Multi Locus Sequence Typing) phylogeny, exclusive presence/absence of genes in specific strains, subset analysis, atypical G + C content analysis and KEGG &COG mapping of core, accessory and unique genes. Other notable features include minimum running prerequisites, freedom to select the gene clustering method, ultra-fast execution, user friendly command line interface and high-quality graphics outputs. The performance of BPGA has been evaluated using a dataset of complete genome sequences of 28 Streptococcus pyogenes strains. PMID:27071527

  6. Detection of genetic variation affecting milk coagulation properties in Danish Holstein dairy cattle by analyses of pooled whole-genome sequences from phenotypically extreme samples (pool-seq).

    Science.gov (United States)

    Bertelsen, H P; Gregersen, V R; Poulsen, N; Nielsen, R O; Das, A; Madsen, L B; Buitenhuis, A J; Holm, L-E; Panitz, F; Larsen, L B; Bendixen, C

    2016-04-01

    Rennet-induced milk coagulation is an important trait for cheese production. Recent studies have reported an alarming frequency of cows producing poorly coagulating milk unsuitable for cheese production. Several genetic factors are known to affect milk coagulation, including variation in the major milk proteins; however, recent association studies indicate genetic effects from other genomic regions as well. The aim of this study was to detect genetic variation affecting milk coagulation properties, measured as curd-firming rate (CFR) and milk pH. This was achieved by examining allele frequency differences between pooled whole-genome sequences of phenotypically extreme samples (pool-seq).. Curd-firming rate and raw milk pH were measured for 415 Danish Holstein cows, and each animal was sequenced at low coverage. Pools were created containing whole genome sequence reads from samples with "extreme" values (high or low) for both phenotypic traits. A total of 6,992,186 and 5,295,501 SNP were assessed in relation to CFR and milk pH, respectively. Allele frequency differences were calculated between pools and 32 significantly different SNP were detected, 1 for milk pH and 31 for CFR, of which 19 are located on chromosome 6. A total of 9 significant SNP, which were selected based on the possible function of proximal candidate genes, were genotyped in the entire sample set ( = 415) to test for an association. The most significant SNP was located proximal to , explaining 33% of the phenotypic variance. , coding for κ-casein, is the most studied in relation to milk coagulation due to its position on the surface of the casein micelles and the direct involvement in milk coagulation. Three additional SNP located on chromosome 6 showed significant associations explaining 7, 3.6, and 1.3% of the phenotypic variance of CFR. The significant SNP on chromosome 6 were shown to be in linkage disequilibrium with the SNP peaking proximal to ; however, after accounting for the genotype of

  7. Genome-wide identification of aquaporin encoding genes in Brassica oleracea and their phylogenetic sequence comparison to Brassica crops and Arabidopsis

    Directory of Open Access Journals (Sweden)

    Till Arvid Diehn

    2015-04-01

    Full Text Available Aquaporins (AQPs are essential channel proteins that regulate plant water homeostasis and the uptake and distribution of uncharged solutes such as metalloids, urea, ammonia and carbon dioxide. Despite their importance as crop plants, little is known about AQP gene and protein function in cabbage (Brassica oleracea and other Brassica species. The recent releases of the genome sequences of B. oleracea and B. rapa allow comparative genomic studies in these species to investigate the evolution and features of Brassica genes and proteins.In this study, we identified all AQP genes in B. oleracea by a genome-wide survey. In total, 67 genes of four plant AQP subfamilies were identified. Their full-length gene sequences and locations on chromosomes and scaffolds were manually curated. The identification of six additional full-length AQP sequences in the B. rapa genome added to the recently published AQP protein family of this species. A phylogenetic analysis of AQPs of A. thaliana, B. oleracea, B. rapa allowed us to follow AQP evolution in closely related species and to systematically classify and (re- name these isoforms. Thirty-three groups of AQP-orthologous genes were identified between B. oleracea and Arabidopsis and their expression was analyzed in different organs. The two selectivity filters, gene structure and coding sequences were highly conserved within each AQP subfamily while sequence variations in some introns and untranslated regions were frequent. These data suggest a similar substrate selectivity and function of Brassica AQPs compared to Arabidopsis orthologs. The comparative analyses of all AQP subfamilies in three Brassicaceae species give initial insights into AQP evolution in these taxa. Based on the genome-wide AQP identification in B. oleracea and the sequence analysis and reprocessing of Brassica AQP information, our dataset provides a sequence resource for further investigations of the physiological and molecular functions of

  8. The tRNATyr multigene family of Triticum aestivum: genome organization, sequence analyses and maturation of intron-containing pre-tRNAs in wheat germ extract.

    Science.gov (United States)

    Arends, S; Kraus, J; Beier, H

    1996-04-22

    Southern analysis of Triticum DNA has revealed that nuclear tRNATyr genes are dispersed at a minimum of 16 loci in the genome. We have isolated six independent tRNATyr genes from a Triticum aestivum library in addition to three known members of the Triticum tRNATyr family. Four of the sequenced tRNATyr genes code for Triticum tRNA Tyr and two code for tRNA2Tyr. Three genes encode tRNAsTyr which carry one or two nucleotide substitutions as compared to the conventional genes. The nine Triticum tRNATyr genes possess highly conserved intron sequences ranging in size from 12 to 14 nucleotides. A common secondary intron structure with the 5' and 3' splice site loops separated by five base pairs can be formed by all pre-tRNAs Tyr which are efficiently spliced in the homologous wheat germ extract. PMID:8617358

  9. PSAT: A web tool to compare genomic neighborhoods of multiple prokaryotic genomes

    Directory of Open Access Journals (Sweden)

    Wasnick Michael

    2008-03-01

    Full Text Available Abstract Background The conservation of gene order among prokaryotic genomes can provide valuable insight into gene function, protein interactions, or events by which genomes have evolved. Although some tools are available for visualizing and comparing the order of genes between genomes of study, few support an efficient and organized analysis between large numbers of genomes. The Prokaryotic Sequence homology Analysis Tool (PSAT is a web tool for comparing gene neighborhoods among multiple prokaryotic genomes. Results PSAT utilizes a database that is preloaded with gene annotation, BLAST hit results, and gene-clustering scores designed to help identify regions of conserved gene order. Researchers use the PSAT web interface to find a gene of interest in a reference genome and efficiently retrieve the sequence homologs found in other bacterial genomes. The tool generates a graphic of the genomic neighborhood surrounding the selected gene and the corresponding regions for its homologs in each comparison genome. Homologs in each region are color coded to assist users with analyzing gene order among various genomes. In contrast to common comparative analysis methods that filter sequence homolog data based on alignment score cutoffs, PSAT leverages gene context information for homologs, including those with weak alignment scores, enabling a more sensitive analysis. Features for constraining or ordering results are designed to help researchers browse results from large numbers of comparison genomes in an organized manner. PSAT has been demonstrated to be useful for helping to identify gene orthologs and potential functional gene clusters, and detecting genome modifications that may result in loss of function. Conclusion PSAT allows researchers to investigate the order of genes within local genomic neighborhoods of multiple genomes. A PSAT web server for public use is available for performing analyses on a growing set of reference genomes through any

  10. Electron microscopic comparison of the sequences of single-stranded genomes of mammalian parvoviruses by heteroduplex mapping

    Energy Technology Data Exchange (ETDEWEB)

    Banerjee, P.T.; Olson, W.H.; Allison, D.P.; Bates, R.C.; Snyder, C.E.; Mitra, S.

    1983-01-01

    The sequence homologies among the linear single-stranded genomes of severa