Joaquim Manoel da Silva
Full Text Available High density genotyping panels have been used in a wide range of applications. From population genetics to genome-wide association studies, this technology still offers the lowest cost and the most consistent solution for generating SNP data. However, in spite of the application, part of the generated data is always discarded from final datasets based on quality control criteria used to remove unreliable markers. Some discarded data consists of markers that failed to generate genotypes, labeled as missing genotypes. A subset of missing genotypes that occur in the whole population under study may be caused by technical issues but can also be explained by the presence of genomic variations that are in the vicinity of the assayed SNP and that prevent genotyping probes from annealing. The latter case may contain relevant information because these missing genotypes might be used to identify population-specific genomic variants. In order to assess which case is more prevalent, we used Illumina HD Bovine chip genotypes from 1,709 Nelore (Bos indicus samples. We found 3,200 missing genotypes among the whole population. NGS re-sequencing data from 8 sires were used to verify the presence of genomic variations within their flanking regions in 81.56% of these missing genotypes. Furthermore, we discovered 3,300 novel SNPs/Indels, 31% of which are located in genes that may affect traits of importance for the genetic improvement of cattle production.
Full Text Available Whole-genome sequencing provides a powerful tool to obtain more genetic variability that could produce a range of benefits for cattle breeding industry. Nanyang (Bos indicus and Qinchuan (Bos taurus are two important Chinese indigenous cattle breeds with distinct phenotypes. To identify the genetic characteristics responsible for variation in phenotypes between the two breeds, in the present study, we for the first time sequenced the genomes of four Nanyang and four Qinchuan cattle with 10 to 12 fold on average of 97.86% and 98.98% coverage of genomes, respectively. Comparison with the Bos_taurus_UMD_3.1 reference assembly yielded 9,010,096 SNPs for Nanyang, and 6,965,062 for Qinchuan cattle, 51% and 29% of which were novel SNPs, respectively. A total of 154,934 and 115,032 small indels (1 to 3 bp were found in the Nanyang and Qinchuan genomes, respectively. The SNP and indel distribution revealed that Nanyang showed a genetically high diversity as compared to Qinchuan cattle. Furthermore, a total of 2,907 putative cases of copy number variation (CNV were identified by aligning Nanyang to Qinchuan genome, 783 of which (27% encompassed the coding regions of 495 functional genes. The gene ontology (GO analysis revealed that many CNV genes were enriched in the immune system and environment adaptability. Among several CNV genes related to lipid transport and fat metabolism, Lepin receptor gene (LEPR overlapping with CNV_1815 showed remarkably higher copy number in Qinchuan than Nanyang (log2 (ratio = -2.34988; P value = 1.53E-102. Further qPCR and association analysis investigated that the copy number of the LEPR gene presented positive correlations with transcriptional expression and phenotypic traits, suggesting the LEPR CNV may contribute to the higher fat deposition in muscles of Qinchuan cattle. Our findings provide evidence that the distinct phenotypes of Nanyang and Qinchuan breeds may be due to the different genetic variations including SNPs
Xu, Yao; Jiang, Yu; Shi, Tao; Cai, Hanfang; Lan, Xianyong; Zhao, Xin; Plath, Martin; Chen, Hong
Whole-genome sequencing provides a powerful tool to obtain more genetic variability that could produce a range of benefits for cattle breeding industry. Nanyang (Bos indicus) and Qinchuan (Bos taurus) are two important Chinese indigenous cattle breeds with distinct phenotypes. To identify the genetic characteristics responsible for variation in phenotypes between the two breeds, in the present study, we for the first time sequenced the genomes of four Nanyang and four Qinchuan cattle with 10 ...
Taye, Mengistie; Lee, Wonseok; Caetano-Anolles, Kelsey; Dessie, Tadelle; Hanotte, Olivier; Mwai, Okeyo Ally; Kemp, Stephen; Cho, Seoae; Oh, Sung Jong; Lee, Hak-Kyo; Kim, Heebal
As African indigenous cattle evolved in a hot tropical climate, they have developed an inherent thermotolerance; survival mechanisms include a light-colored and shiny coat, increased sweating, and cellular and molecular mechanisms to cope with high environmental temperature. Here, we report the positive selection signature of genes in African cattle breeds which contribute for their heat tolerance mechanisms. We compared the genomes of five indigenous African cattle breeds with the genomes of four commercial cattle breeds using cross-population composite likelihood ratio (XP-CLR) and cross-population extended haplotype homozygosity (XP-EHH) statistical methods. We identified 296 (XP-EHH) and 327 (XP-CLR) positively selected genes. Gene ontology analysis resulted in 41 biological process terms and six Kyoto Encyclopedia of Genes and Genomes pathways. Several genes and pathways were found to be involved in oxidative stress response, osmotic stress response, heat shock response, hair and skin properties, sweat gland development and sweating, feed intake and metabolism, and reproduction functions. The genes and pathways identified directly or indirectly contribute to the superior heat tolerance mechanisms in African cattle populations. The result will improve our understanding of the biological mechanisms of heat tolerance in African cattle breeds and opens an avenue for further study. © 2017 Japanese Society of Animal Science.
Lasserre, Moira; Fresia, Pablo; Greif, Gonzalo; Iraola, Gregorio; Castro-Ramos, Miguel; Juambeltz, Arturo; Nuñez, Álvaro; Naya, Hugo; Robello, Carlos; Berná, Luisa
Bovine tuberculosis (bTB) poses serious risks to animal welfare and economy, as well as to public health as a zoonosis. Its etiological agent, Mycobacterium bovis, belongs to the Mycobacterium tuberculosis complex (MTBC), a group of genetically monomorphic organisms featured by a remarkably high overall nucleotide identity (99.9%). Indeed, this characteristic is of major concern for correct typing and determination of strain-specific traits based on sequence diversity. Due to its historical economic dependence on cattle production, Uruguay is deeply affected by the prevailing incidence of Mycobacterium bovis. With the world's highest number of cattle per human, and its intensive cattle production, Uruguay represents a particularly suited setting to evaluate genomic variability among isolates, and the diversity traits associated to this pathogen. We compared 186 genomes from MTBC strains isolated worldwide, and found a highly structured population in M. bovis. The analysis of 23 new M. bovis genomes, belonging to strains isolated in Uruguay evidenced three groups present in the country. Despite presenting an expected highly conserved genomic structure and sequence, these strains segregate into a clustered manner within the worldwide phylogeny. Analysis of the non-pe/ppe differential areas against a reference genome defined four main sources of variability, namely: regions of difference (RD), variable genes, duplications and novel genes. RDs and variant analysis segregated the strains into clusters that are concordant with their spoligotype identities. Due to its high homoplasy rate, spoligotyping failed to reflect the true genomic diversity among worldwide representative strains, however, it remains a good indicator for closely related populations. This study introduces a comprehensive population structure analysis of worldwide M. bovis isolates. The incorporation and analysis of 23 novel Uruguayan M. bovis genomes, sheds light onto the genomic diversity of this
Mark, Thomas; Sandøe, Peter
, unfavourable genetic trends for metabolic, reproductive, claw and leg diseases indicate that these attempts have been insufficient. Today, novel genome-wide sequencing techniques are revolutionising dairy cattle breeding; these enable genetic changes to occur at least twice as rapidly as previously. While......, a number of potential risks are discussed, including detrimental genetic trends for non-measured welfare traits, the increased chance of spreading unfavourable mutations, reduced sharing of information arising from concerns over patents, and an increased monopoly within dairy cattle breeding that may make...... negative effects on animal welfare and to invest in breeding for increased animal welfare. Researchers are encouraged to further investigate the long-term effects of various breeding schemes that rely on genomic breeding values....
Kim, Jaemin; Hanotte, Olivier; Mwai, Okeyo Ally; Dessie, Tadelle; Bashir, Salim; Diallo, Boubacar; Agaba, Morris; Kim, Kwondo; Kwak, Woori; Sung, Samsun; Seo, Minseok; Jeong, Hyeonsoo; Kwon, Taehyung; Taye, Mengistie; Song, Ki-Duk; Lim, Dajeong; Cho, Seoae; Lee, Hyun-Jeong; Yoon, Duhak; Oh, Sung Jong; Kemp, Stephen; Lee, Hak-Kyo; Kim, Heebal
The history of African indigenous cattle and their adaptation to environmental and human selection pressure is at the root of their remarkable diversity. Characterization of this diversity is an essential step towards understanding the genomic basis of productivity and adaptation to survival under African farming systems. We analyze patterns of African cattle genetic variation by sequencing 48 genomes from five indigenous populations and comparing them to the genomes of 53 commercial taurine breeds. We find the highest genetic diversity among African zebu and sanga cattle. Our search for genomic regions under selection reveals signatures of selection for environmental adaptive traits. In particular, we identify signatures of selection including genes and/or pathways controlling anemia and feeding behavior in the trypanotolerant N'Dama, coat color and horn development in Ankole, and heat tolerance and tick resistance across African cattle especially in zebu breeds. Our findings unravel at the genome-wide level, the unique adaptive diversity of African cattle while emphasizing the opportunities for sustainable improvement of livestock productivity on the continent.
Md. Rasel Uzzaman
Full Text Available In spite of variation in coat color, size, and production traits among indigenous Bangladeshi cattle populations, genetic differences among most of the populations have not been investigated or exploited. In this study, we used a high-density bovine single nucleotide polymorphism (SNP 80K Bead Chip derived from Bos indicus breeds to assess genetic diversity and population structure of 2 Bangladeshi zebu cattle populations (red Chittagong, n = 28 and non-descript deshi, n = 28 and a semi-domesticated population (gayal, n = 17. Overall, 95% and 58% of the total SNPs (69,804 showed polymorphisms in the zebu and gayal populations, respectively. Similarly, the average minor allele frequency value was as high 0.29 in zebu and as low as 0.09 in gayal. The mean expected heterozygosity varied from 0.42±0.14 in zebu to 0.148±0.14 in gayal with significant heterozygosity deficiency of 0.06 (FIS in the latter. Coancestry estimations revealed that the two zebu populations are weakly differentiated, with over 99% of the total genetic variation retained within populations and less than 1% accounted for between populations. Conversely, strong genetic differentiation (FST = 0.33 was observed between zebu and gayal populations. Results of population structure and principal component analyses suggest that gayal is distinct from Bos indicus and that the two zebu populations were weakly structured. This study provides basic information about the genetic diversity and structure of Bangladeshi cattle and the semi-domesticated gayal population that can be used for future appraisal of breed utilization and management strategies.
Background Genomic evaluations are rapidly replacing traditional evaluation systems used for dairy cattle selection. Economies of scale in genomics promote cooperation across country borders. Genomic information can be transferred across countries using simple conversion equations, by modifying mult...
Seo, S; Larkin, D M; Loor, J J
The recently sequenced cattle (Bos taurus) genome unraveled the unique genomic features of the species and provided the molecular basis for applying a systemic approach to systematically link genomic information to metabolic traits. Comparative analysis has identified a variety of evolutionary adaptive features in the cattle genome, such as an expansion of the gene families related to the rumen function, large number of chromosomal rearrangements affecting regulation of genes for lactation, and chromosomal rearrangements that are associated with segmental duplications and copy number variations. Metabolic reconstruction of the cattle genome has revealed that core metabolic pathways are highly conserved among mammals although five metabolic genes are deleted or highly diverged and seven metabolic genes are present in duplicate in the cattle genome compared to their human counter parts. The evolutionary loss and gain of metabolic genes in the cattle genome may reflect metabolic adaptations of cattle. Metabolic reconstruction also provides a platform for better understanding of metabolic regulation in cattle and ruminants. A substantial body of transcriptomics data from dairy and beef cattle under different nutritional management and across different stages of growth and lactation are already available and will aid in linking the genome with metabolism and nutritional physiology of cattle. Application of cattle genomics has great potential for future development of nutritional strategies to improve efficiency and sustainability of beef and milk production. One of the biggest challenges is to integrate genomic and phenotypic data and interpret them in a biological and practical platform. Systems biology, a holistic and systemic approach, will be very useful in overcoming this challenge.
Great strides in technological advancements have been made in the past decade in cattle genome engineering. First, the success of cloning cattle by somatic cell nuclear transfer (SCNT) or chromatin transfer (CT) is a significant advancement that has made obsolete the need for using embryonic stem (ES) cells to conduct cell-mediated genome engineering, whereby site-specific genetic modifications can be conducted in bovine somatic cells via DNA homologous recombination (HR) and whereby genetically engineered cattle can subsequently be produced by animal cloning from the genetically modified cells. With this approach, a chosen bovine genomic locus can be precisely modified in somatic cells, such as to knock out (KO) or knock in (KI) a gene via HR, a gene-targeting strategy that had almost exclusively been used in mouse ES cells. Furthermore, by the creative application of embryonic cloning to rejuvenate somatic cells, cattle genome can be sequentially modified in the same line of somatic cells and complex genetic modifications have been achieved in cattle. Very recently, the development of designer nucleases-such as zinc finger nucleases (ZFNs) and transcription activator-like effector nuclease (TALENs), and clustered regularly interspaced short palindromic repeats/CRISPR-associated protein 9 (CRISPR/Cas9)-has enabled highly efficient and more facile genome engineering in cattle. Most notably, by employing such designer nucleases, genomes can be engineered at single-nucleotide precision; this process is now often referred to as genome or gene editing. The above achievements are a drastic departure from the traditional methods of creating genetically modified cattle, where foreign DNAs are randomly integrated into the animal genome, most often along with the integrations of bacterial or viral DNAs. Here, I review the most recent technological developments in cattle genome engineering by highlighting some of the major achievements in creating genetically engineered
Lewin Harris A
Full Text Available Abstract Background Metabolic reconstruction of microbial, plant and animal genomes is a necessary step toward understanding the evolutionary origins of metabolism and species-specific adaptive traits. The aims of this study were to reconstruct conserved metabolic pathways in the cattle genome and to identify metabolic pathways with missing genes and proteins. The MetaCyc database and PathwayTools software suite were chosen for this work because they are widely used and easy to implement. Results An amalgamated cattle genome database was created using the NCBI and Ensembl cattle genome databases (based on build 3.1 as data sources. PathwayTools was used to create a cattle-specific pathway genome database, which was followed by comprehensive manual curation for the reconstruction of metabolic pathways. The curated database, CattleCyc 1.0, consists of 217 metabolic pathways. A total of 64 mammalian-specific metabolic pathways were modified from the reference pathways in MetaCyc, and two pathways previously identified but missing from MetaCyc were added. Comparative analysis of metabolic pathways revealed the absence of mammalian genes for 22 metabolic enzymes whose activity was reported in the literature. We also identified six human metabolic protein-coding genes for which the cattle ortholog is missing from the sequence assembly. Conclusion CattleCyc is a powerful tool for understanding the biology of ruminants and other cetartiodactyl species. In addition, the approach used to develop CattleCyc provides a framework for the metabolic reconstruction of other newly sequenced mammalian genomes. It is clear that metabolic pathway analysis strongly reflects the quality of the underlying genome annotations. Thus, having well-annotated genomes from many mammalian species hosted in BioCyc will facilitate the comparative analysis of metabolic pathways among different species and a systems approach to comparative physiology.
DNA base modification systems are common in bacteria and can modulate gene expression as well as act in defense against invading viruses. Recent advances in the direct identification of modified bases in the genome via Single Molecule Real Time (SMRT) sequencing supports an integrated analytical ap...
Killick Kate E
Full Text Available Abstract Background Mycobacterium bovis is the causative agent of bovine tuberculosis (BTB, a pathological infection with significant economic impact. Recent studies have highlighted the role of functional genomics to better understand the molecular mechanisms governing the host immune response to M. bovis infection. Furthermore, these studies may enable the identification of novel transcriptional markers of BTB that can augment current diagnostic tests and surveillance programmes. In the present study, we have analysed the transcriptome of peripheral blood leukocytes (PBL from eight M. bovis-infected and eight control non-infected age-matched and sex-matched Holstein-Friesian cattle using the Affymetrix® GeneChip® Bovine Genome Array with 24,072 gene probe sets representing more than 23,000 gene transcripts. Results Control and infected animals had similar mean white blood cell counts. However, the mean number of lymphocytes was significantly increased in the infected group relative to the control group (P = 0.001, while the mean number of monocytes was significantly decreased in the BTB group (P = 0.002. Hierarchical clustering analysis using gene expression data from all 5,388 detectable mRNA transcripts unambiguously partitioned the animals according to their disease status. In total, 2,960 gene transcripts were differentially expressed (DE between the infected and control animal groups (adjusted P-value threshold ≤ 0.05; with the number of gene transcripts showing decreased relative expression (1,563 exceeding those displaying increased relative expression (1,397. Systems analysis using the Ingenuity® Systems Pathway Analysis (IPA Knowledge Base revealed an over-representation of DE genes involved in the immune response functional category. More specifically, 64.5% of genes in the affects immune response subcategory displayed decreased relative expression levels in the infected animals compared to the control group. Conclusions This
Carroll, Laura M; Wiedmann, Martin; den Bakker, Henk; Siler, Julie; Warchocki, Steven; Kent, David; Lyalina, Svetlana; Davis, Margaret; Sischo, William; Besser, Thomas; Warnick, Lorin D; Pereira, Richard V
Multidrug-resistant (MDR) Salmonella enterica can be spread from cattle to humans through direct contact with animals shedding Salmonella as well as through the food chain, making MDR Salmonella a serious threat to human health. The objective of this study was to use whole-genome sequencing to compare antimicrobial-resistant (AMR) Salmonella enterica serovars Typhimurium, Newport, and Dublin isolated from dairy cattle and humans in Washington State and New York State at the genotypic and phenotypic levels. A total of 90 isolates were selected for the study (37 S Typhimurium, 32 S Newport, and 21 S Dublin isolates). All isolates were tested for phenotypic antibiotic resistance to 12 drugs using Kirby-Bauer disk diffusion. AMR genes were detected in the assembled genome of each isolate using nucleotide BLAST and ARG-ANNOT. Genotypic prediction of phenotypic resistance resulted in a mean sensitivity of 97.2 and specificity of 85.2. Sulfamethoxazole-trimethoprim resistance was observed only in human isolates ( P enterica in humans and farm animals in different regions. IMPORTANCE The use of antibiotics in food-producing animals has been hypothesized to select for AMR Salmonella enterica and associated AMR determinants, which can be transferred to humans through different routes. Previous studies have sought to assess the degree to which AMR livestock- and human-associated Salmonella strains overlap, as well as the spatial distribution of Salmonella 's associated AMR determinants, but have often been limited by the degree of resolution at which isolates can be compared. Here, a comparative genomics study of livestock- and human-associated Salmonella strains from different regions of the United States shows that while many AMR genes and phenotypes were confined to human isolates, overlaps between the resistomes of bovine and human-associated Salmonella isolates were observed on numerous occasions, particularly for S Newport. We have also shown that whole-genome
Shade Larry L
Full Text Available Abstract Background Approximately 11 Mb of finished high quality genomic sequences were sampled from cattle, dog and human to estimate genomic divergences and their regional variation among these lineages. Results Optimal three-way multi-species global sequence alignments for 84 cattle clones or loci (each >50 kb of genomic sequence were constructed using the human and dog genome assemblies as references. Genomic divergences and substitution rates were examined for each clone and for various sequence classes under different functional constraints. Analysis of these alignments revealed that the overall genomic divergences are relatively constant (0.32–0.37 change/site for pairwise comparisons among cattle, dog and human; however substitution rates vary across genomic regions and among different sequence classes. A neutral mutation rate (2.0–2.2 × 10(-9 change/site/year was derived from ancestral repetitive sequences, whereas the substitution rate in coding sequences (1.1 × 10(-9 change/site/year was approximately half of the overall rate (1.9–2.0 × 10(-9 change/site/year. Relative rate tests also indicated that cattle have a significantly faster rate of substitution as compared to dog and that this difference is about 6%. Conclusion This analysis provides a large-scale and unbiased assessment of genomic divergences and regional variation of substitution rates among cattle, dog and human. It is expected that these data will serve as a baseline for future mammalian molecular evolution studies.
Thomasen, Jørn Rind
Genomic selection provides more accurate estimation of genetic merit for breeding candidates without own recordings and is now an integrated part of most dairy breeding schemes. However, the method has turned out to be less efficient in the numerically smaler breeds. This thesis focuses on optimi......Genomic selection provides more accurate estimation of genetic merit for breeding candidates without own recordings and is now an integrated part of most dairy breeding schemes. However, the method has turned out to be less efficient in the numerically smaler breeds. This thesis focuses...... on optimization of genomc selction for a small dairy cattle breed such as Danish Jersey. Implementing genetic superior breeding schemes thus requires more accurate genomc predictions. Besides international collaboration, genotyping of cows is an efficient way to obtain more accurate genomic predictions...
Stature is affected by many polymorphisms of small effect in humans but in contrast variation in dogs, even within breeds is largely due to variants in six genes. Here we use data from cattle to compare genetic architecture of stature to that in humans and dogs. We conducted a meta-analysis for stat...
Full Text Available Abstract Background The Bovine HapMap Consortium has generated assay panels to genotype ~30,000 single nucleotide polymorphisms (SNPs from 501 animals sampled from 19 worldwide taurine and indicine breeds, plus two outgroup species (Anoa and Water Buffalo. Within the larger set of SNPs we targeted 101 high density regions spanning up to 7.6 Mb with an average density of approximately one SNP per 4 kb, and characterized the linkage disequilibrium (LD and haplotype block structure within individual breeds and groups of breeds in relation to their geographic origin and use. Results From the 101 targeted high-density regions on bovine chromosomes 6, 14, and 25, between 57 and 95% of the SNPs were informative in the individual breeds. The regions of high LD extend up to ~100 kb and the size of haplotype blocks ranges between 30 bases and 75 kb (10.3 kb average. On the scale from 1–100 kb the extent of LD and haplotype block structure in cattle has high similarity to humans. The estimation of effective population sizes over the previous 10,000 generations conforms to two main events in cattle history: the initiation of cattle domestication (~12,000 years ago, and the intensification of population isolation and current population bottleneck that breeds have experienced worldwide within the last ~700 years. Haplotype block density correlation, block boundary discordances, and haplotype sharing analyses were consistent in revealing unexpected similarities between some beef and dairy breeds, making them non-differentiable. Clustering techniques permitted grouping of breeds into different clades given their similarities and dissimilarities in genetic structure. Conclusion This work presents the first high-resolution analysis of haplotype block structure in worldwide cattle samples. Several novel results were obtained. First, cattle and human share a high similarity in LD and haplotype block structure on the scale of 1–100 kb. Second, unexpected
Full Text Available In this work, the distribution of different lengths ROH (runs of homozygosity in six cattle breeds was described. A total of 122 animals from six cattle breeds (Holstein, Simmental, Austrian Pinzgau, Ayrshire, MRI-Meuse Rhine Issel and Slovak Pinzgau were analysed. The ROH approach was used to distinguish Slovak Pinzgau population from other investigated breeds as well as to differentiate between ancient and recent inbreeding. The average number of ROH per animal ranged from 17.06 in Holstein to 159.22 in Ayrshire. The highest number of short ROH (ancient inbreeding was found in Simmental, followed by Ayrshire. The Ayrshire and MRI had a higher proportion of longer ROH distributed across the whole genome, revealing recent inbreeding. ROH were identified and used to estimate molecular inbreeding coefficients (FROH. The highest level of inbreeding from the investigated breeds was found out in Ayrshire with the same tendency for all length categories compared to Slovak Pinzgau with higher ancient inbreeding. Ancient inbreeding was only observed in Holstein population. A similar trend is becoming apparent even for Slovak Pinzgau, showing the second smallest recent inbreeding. Therefore, it is necessary to preserve the given population in the original phenotype and prevent further increase of inbreeding especially in endangered breeds.
Santana, M. H. A.; Freua, M. C.; Do, D. N.
Feed intake, feed efficiency, and weight gain are important economic traits of beef cattle in feed lots. In the present study, we investigated the physiological processes underlying such traits from the point of view of systems genetics. Firstly, using data from 1334 Nellore (Bos indicus) cattle......, were annotated and the biological processes underlying the traits were inferred from Database for Annotation, Visualization and Integrated Discovery (DAVID) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Our results indicated several putative genomic regions associated with the target...... genetics approach used in this study revealed novel pathways related to feed efficiency traits in beef cattle....
Transgenic cattle carrying multiple genomic modifications have been produced by serial rounds of somatic cell chromatin transfer (cloning) of sequentially genetically targeted somatic cells. However, cloning efficiency tends to decline with the increase of rounds of cloning. It is possible that mult...
Full Text Available Human driven selection during domestication and subsequent breed formation has likely left detectable signatures within the genome of modern cattle. The elucidation of these signatures of selection is of interest from the perspective of evolutionary biology, and for identifying domestication-related genes that ultimately may help to further genetically improve this economically important animal. To this end, we employed a panel of more than 15 million autosomal SNPs identified from re-sequencing of 43 Fleckvieh animals. We mainly applied two somewhat complementary statistics, the integrated Haplotype Homozygosity Score (iHS reflecting primarily ongoing selection, and the Composite of Likelihood Ratio (CLR having the most power to detect completed selection after fixation of the advantageous allele. We find 106 candidate selection regions, many of which are harboring genes related to phenotypes relevant in domestication, such as coat coloring pattern, neurobehavioral functioning and sensory perception including KIT, MITF, MC1R, NRG4, Erbb4, TMEM132D and TAS2R16, among others. To further investigate the relationship between genes with signatures of selection and genes identified in QTL mapping studies, we use a sample of 3062 animals to perform four genome-wide association analyses using appearance traits, body size and somatic cell count. We show that regions associated with coat coloring significantly (P<0.0001 overlap with the candidate selection regions, suggesting that the selection signals we identify are associated with traits known to be affected by selection during domestication. Results also provide further evidence regarding the complexity of the genetics underlying coat coloring in cattle. This study illustrates the potential of population genetic approaches for identifying genomic regions affecting domestication-related phenotypes and further helps to identify specific regions targeted by selection during speciation, domestication and
Tizioto, P C; Decker, J E; Taylor, J F; Schnabel, R D; Mudadu, M A; Silva, F L; Mourão, G B; Coutinho, L L; Tholon, P; Sonstegard, T S; Rosa, A N; Alencar, M M; Tullio, R R; Medeiros, S R; Nassu, R T; Feijó, G L D; Silva, L O C; Torres, R A; Siqueira, F; Higa, R H; Regitano, L C A
Meat quality traits are economically important because they affect consumers' acceptance, which, in turn, influences the demand for beef. However, selection to improve meat quality is limited by the small numbers of animals on which meat tenderness can be evaluated due to the cost of performing shear force analysis and the resultant damage to the carcass. Genome wide-association studies for Warner-Bratzler shear force measured at different times of meat aging, backfat thickness, ribeye muscle area, scanning parameters [lightness, redness (a*), and yellowness] to ascertain color characteristics of meat and fat, water-holding capacity, cooking loss (CL), and muscle pH were conducted using genotype data from the Illumina BovineHD BeadChip array to identify quantitative trait loci (QTL) in all phenotyped Nelore cattle. Phenotype count for these animals ranged from 430 to 536 across traits. Meat quality traits in Nelore are controlled by numerous QTL of small effect, except for a small number of large-effect QTL identified for a*fat, CL, and pH. Genomic regions harboring these QTL and the pathways in which the genes from these regions act appear to differ from those identified in taurine cattle for meat quality traits. These results will guide future QTL mapping studies and the development of models for the prediction of genetic merit to implement genomic selection for meat quality in Nelore cattle.
Elsik, Christine G.; Tellam, Ross L.; Worley, Kim C.; Gibbs, Richard A.; Abatepaulo, Antonio R. R.; Abbey, Colette A.; Adelson, David L.; Aerts, Jan; Ahola, Virpi; Alexander, Lee; Alioto, Tyler; Almeida, Iassudara G.; Amadio, Ariel F.; Anatriello, Elen; Antonarakis, Stylianos E.
To understand the biology and evolution of ruminants, the cattle genome was sequenced to about sevenfold coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1217 are absent or undetected in noneutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specifi...
To understand the biology and evolution of ruminants, the cattle genome was sequenced to about sevenfold coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1217 are absent or undetected in noneutherian (ma...
Park, Stephen D E; Magee, David A; McGettigan, Paul A; Teasdale, Matthew D; Edwards, Ceiridwen J; Lohan, Amanda J; Murphy, Alison; Braud, Martin; Donoghue, Mark T; Liu, Yuan; Chamberlain, Andrew T; Rue-Albrecht, Kévin; Schroeder, Steven; Spillane, Charles; Tai, Shuaishuai; Bradley, Daniel G; Sonstegard, Tad S; Loftus, Brendan J; MacHugh, David E
Domestication of the now-extinct wild aurochs, Bos primigenius, gave rise to the two major domestic extant cattle taxa, B. taurus and B. indicus. While previous genetic studies have shed some light on the evolutionary relationships between European aurochs and modern cattle, important questions remain unanswered, including the phylogenetic status of aurochs, whether gene flow from aurochs into early domestic populations occurred, and which genomic regions were subject to selection processes during and after domestication. Here, we address these questions using whole-genome sequencing data generated from an approximately 6,750-year-old British aurochs bone and genome sequence data from 81 additional cattle plus genome-wide single nucleotide polymorphism data from a diverse panel of 1,225 modern animals. Phylogenomic analyses place the aurochs as a distinct outgroup to the domestic B. taurus lineage, supporting the predominant Near Eastern origin of European cattle. Conversely, traditional British and Irish breeds share more genetic variants with this aurochs specimen than other European populations, supporting localized gene flow from aurochs into the ancestors of modern British and Irish cattle, perhaps through purposeful restocking by early herders in Britain. Finally, the functions of genes showing evidence for positive selection in B. taurus are enriched for neurobiology, growth, metabolism and immunobiology, suggesting that these biological processes have been important in the domestication of cattle. This work provides important new information regarding the origins and functional evolution of modern cattle, revealing that the interface between early European domestic populations and wild aurochs was significantly more complex than previously thought.
Machado, Henrique; Gram, Lone
Vibrionaceae is a large marine bacterial family, which can constitute up to 50% of the prokaryotic population in marine waters. Photobacterium is the second largest genus in the family and we used comparative genomics on 35 strains representing 16 of the 28 species described so far, to understand the genomic diversity present in the Photobacterium genus. Such understanding is important for ecophysiology studies of the genus. We used whole genome sequences to evaluate phylogenetic relationships using several analyses (16S rRNA, MLSA, fur , amino-acid usage, ANI), which allowed us to identify two misidentified strains. Genome analyses also revealed occurrence of higher and lower GC content clades, correlating with phylogenetic clusters. Pan- and core-genome analysis revealed the conservation of 25% of the genome throughout the genus, with a large and open pan-genome. The major source of genomic diversity could be traced to the smaller chromosome and plasmids. Several of the physiological traits studied in the genus did not correlate with phylogenetic data. Since horizontal gene transfer (HGT) is often suggested as a source of genetic diversity and a potential driver of genomic evolution in bacterial species, we looked into evidence of such in Photobacterium genomes. Genomic islands were the source of genomic differences between strains of the same species. Also, we found transposase genes and CRISPR arrays that suggest multiple encounters with foreign DNA. Presence of genomic exchange traits was widespread and abundant in the genus, suggesting a role in genomic evolution. The high genetic variability and indications of genetic exchange make it difficult to elucidate genome evolutionary paths and raise the awareness of the roles of foreign DNA in the genomic evolution of environmental organisms.
Nader M. Sobhy
Full Text Available Parainfluenza virus type 3 (PIV-3 can infect a wide variety of mammals including humans, domestic animals, and wild animals. In the present study, bovine parainfluenza virus type 3 (BPIV-3 was isolated from nasal swabs of Egyptian cattle presenting with clinical signs of mild pneumonia. The virus was isolated in Madin-Darby bovine kidney (MDBK cells and confirmed by reverse transcription-polymerase chain reaction (RT-PCR. The complete genome of Egyptian BPIV-3 strain was sequenced by using next generation (Illumina sequencing. The new isolate classified with genotype A of BPIV-3 and was closely related to the Chinese NM09 strain (JQ063064. Subsequently in 2015–16, a molecular surveillance study was undertaken by collecting and testing samples from cattle and buffaloes with respiratory tract infections. The survey revealed a higher rate of BPIV-3 infection in cattle than in buffaloes. The infection was inversely proportional to the age of the animals and to warm weather. This report should form a basis for further molecular studies on animal viruses in Egypt.
Zhang, Guojie; Li, Cai; Li, Qiye
Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size...... this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits....
Full Text Available The increasing prevalence of bovine tuberculosis (bTB in the UK and the limitations of the currently available diagnostic and control methods require the development of complementary approaches to assist in the sustainable control of the disease. One potential approach is the identification of animals that are genetically more resistant to bTB, to enable breeding of animals with enhanced resistance. This paper focuses on prediction of resistance to bTB. We explore estimation of direct genomic estimated breeding values (DGVs for bTB resistance in UK dairy cattle, using dense SNP chip data, and test these genomic predictions for situations when disease phenotypes are not available on selection candidates.We estimated DGVs using genomic best linear unbiased prediction methodology, and assessed their predictive accuracies with a cross validation procedure and receiver operator characteristic (ROC curves. Furthermore, these results were compared with theoretical expectations for prediction accuracy and area-under-the-ROC-curve (AUC. The dataset comprised 1151 Holstein-Friesian cows (bTB cases or controls. All individuals (592 cases and 559 controls were genotyped for 727,252 loci (Illumina Bead Chip. The estimated observed heritability of bTB resistance was 0.23±0.06 (0.34 on the liability scale and five-fold cross validation, replicated six times, provided a prediction accuracy of 0.33 (95% C.I.: 0.26, 0.40. ROC curves, and the resulting AUC, gave a probability of 0.58, averaged across six replicates, of correctly classifying cows as diseased or as healthy based on SNP chip genotype alone using these data.These results provide a first step in the investigation of the potential feasibility of genomic selection for bTB resistance using SNP data. Specifically, they demonstrate that genomic selection is possible, even in populations with no pedigree data and on animals lacking bTB phenotypes. However, a larger training population will be required to
Tsairidou, Smaragda; Woolliams, John A; Allen, Adrian R; Skuce, Robin A; McBride, Stewart H; Wright, David M; Bermingham, Mairead L; Pong-Wong, Ricardo; Matika, Oswald; McDowell, Stanley W J; Glass, Elizabeth J; Bishop, Stephen C
The increasing prevalence of bovine tuberculosis (bTB) in the UK and the limitations of the currently available diagnostic and control methods require the development of complementary approaches to assist in the sustainable control of the disease. One potential approach is the identification of animals that are genetically more resistant to bTB, to enable breeding of animals with enhanced resistance. This paper focuses on prediction of resistance to bTB. We explore estimation of direct genomic estimated breeding values (DGVs) for bTB resistance in UK dairy cattle, using dense SNP chip data, and test these genomic predictions for situations when disease phenotypes are not available on selection candidates. We estimated DGVs using genomic best linear unbiased prediction methodology, and assessed their predictive accuracies with a cross validation procedure and receiver operator characteristic (ROC) curves. Furthermore, these results were compared with theoretical expectations for prediction accuracy and area-under-the-ROC-curve (AUC). The dataset comprised 1151 Holstein-Friesian cows (bTB cases or controls). All individuals (592 cases and 559 controls) were genotyped for 727,252 loci (Illumina Bead Chip). The estimated observed heritability of bTB resistance was 0.23±0.06 (0.34 on the liability scale) and five-fold cross validation, replicated six times, provided a prediction accuracy of 0.33 (95% C.I.: 0.26, 0.40). ROC curves, and the resulting AUC, gave a probability of 0.58, averaged across six replicates, of correctly classifying cows as diseased or as healthy based on SNP chip genotype alone using these data. These results provide a first step in the investigation of the potential feasibility of genomic selection for bTB resistance using SNP data. Specifically, they demonstrate that genomic selection is possible, even in populations with no pedigree data and on animals lacking bTB phenotypes. However, a larger training population will be required to improve
Full Text Available The aim of this study was to identify the evidence of recent selection based on estimation of the integrated Haplotype Score (iHS, population differentiation index (FST and characterize affected regions near QTL associated with traits under strong selection in Pinzgau cattle. In total 21 Austrian and 19 Slovak purebreed bulls genotyped with Illumina bovineHD and bovineSNP50 BeadChip were used to identify genomic regions under selection. Only autosomal loci with call rate higher than 90%, minor allele frequency higher than 0.01 and Hardy-Weinberg equlibrium limit of 0.001 were included in the subsequent analyses of selection sweeps presence. The final dataset was consisted from 30538 SNPs with 81.86 kb average adjacent SNPs spacing. The iHS score were averaged into non-overlapping 500 kb segments across the genome. The FST values were also plotted against genome position based on sliding windows approach and averaged over 8 consecutive SNPs. Based on integrated Haplotype Score evaluation only 7 regions with iHS score higher than 1.7 was found. The average iHS score observed for each adjacent syntenic regions indicated slight effect of recent selection in analysed group of Pinzgau bulls. The level of genetic differentiation between Austrian and Slovak bulls estimated based on FST index was low. Only 24% of FST values calculated for each SNP was greather than 0.01. By using sliding windows approach was found that 5% of analysed windows had higher value than 0.01. Our results indicated use of similar selection scheme in breeding programs of Slovak and Austrian Pinzgau bulls. The evidence for genome-wide association between signatures of selection and regions affecting complex traits such as milk production was insignificant, because the loci in segments identified as affected by selection were very distant from each other. Identification of genomic regions that may be under pressure of selection for phenotypic traits to better understanding of the
Pryce, J E; Wales, W J; de Haas, Y; Veerkamp, R F; Hayes, B J
Feed is a major component of variable costs associated with dairy systems and is therefore an important consideration for breeding objectives. As a result, measures of feed efficiency are becoming popular traits for genetic analyses. Already, several countries account for feed efficiency in their breeding objectives by approximating the amount of energy required for milk production, maintenance, etc. However, variation in actual feed intake is currently not captured in dairy selection objectives, although this could be possible by evaluating traits such as residual feed intake (RFI), defined as the difference between actual and predicted feed (or energy) intake. As feed intake is expensive to accurately measure on large numbers of cows, phenotypes derived from it are obvious candidates for genomic selection provided that: (1) the trait is heritable; (2) the reliability of genomic predictions are acceptable to those using the breeding values; and (3) if breeding values are estimated for heifers, rather than cows then the heifer and cow traits need to be correlated. The accuracy of genomic prediction of dry matter intake (DMI) and RFI has been estimated to be around 0.4 in beef and dairy cattle studies. There are opportunities to increase the accuracy of prediction, for example, pooling data from three research herds (in Australia and Europe) has been shown to increase the accuracy of genomic prediction of DMI from 0.33 within country to 0.35 using a three-country reference population. Before including RFI as a selection objective, genetic correlations with other traits need to be estimated. Weak unfavourable genetic correlations between RFI and fertility have been published. This could be because RFI is mathematically similar to the calculation of energy balance and failure to account for mobilisation of body reserves correctly may result in selection for a trait that is similar to selecting for reduced (or negative) energy balance. So, if RFI is to become a
Genomics analysis of genes expressed reveals differential responses to low chronic nitrogen stress in maize. ... Most induced clones were largely involved in various metabolism processes including physiological process, organelle regulation of biological process, nutrient reservoir activity, transcription regulator activity and ...
Bachmann, Nathan L; Fraser, Tamieka A; Bertelli, Claire; Jelocnik, Martina; Gillett, Amber; Funnell, Oliver; Flanagan, Cheyne; Myers, Garry S A; Timms, Peter; Polkinghorne, Adam
Chlamydia pecorum is an important pathogen of domesticated livestock including sheep, cattle and pigs. This pathogen is also a key factor in the decline of the koala in Australia. We sequenced the genomes of three koala C. pecorum strains, isolated from the urogenital tracts and conjunctiva of diseased koalas. The genome of the C. pecorum VR629 (IPA) strain, isolated from a sheep with polyarthritis, was also sequenced. Comparisons of the draft C. pecorum genomes against the complete genomes of livestock C. pecorum isolates revealed that these strains have a conserved gene content and order, sharing a nucleotide sequence similarity > 98%. Single nucleotide polymorphisms (SNPs) appear to be key factors in understanding the adaptive process. Two regions of the chromosome were found to be accumulating a large number of SNPs within the koala strains. These regions include the Chlamydia plasticity zone, which contains two cytotoxin genes (toxA and toxB), and a 77 kbp region that codes for putative type III effector proteins. In one koala strain (MC/MarsBar), the toxB gene was truncated by a premature stop codon but is full-length in IPTaLE and DBDeUG. Another five pseudogenes were also identified, two unique to the urogenital strains C. pecorum MC/MarsBar and C. pecorum DBDeUG, respectively, while three were unique to the koala C. pecorum conjunctival isolate IPTaLE. An examination of the distribution of these pseudogenes in C. pecorum strains from a variety of koala populations, alongside a number of sheep and cattle C. pecorum positive samples from Australian livestock, confirmed the presence of four predicted pseudogenes in koala C. pecorum clinical samples. Consistent with our genomics analyses, none of these pseudogenes were observed in the livestock C. pecorum samples examined. Interestingly, three SNPs resulting in pseudogenes identified in the IPTaLE isolate were not found in any other C. pecorum strain analysed, raising questions over the origin of these
Daetwyler, Hans D; Capitan, Aurélien; Pausch, Hubert
The 1000 bull genomes project supports the goal of accelerating the rates of genetic gain in domestic cattle while at the same time considering animal health and welfare by providing the annotated sequence variants and genotypes of key ancestor bulls. In the first phase of the 1000 bull genomes p...
Mart?nez-Ocampo, Fernando; Rodr?guez-Camarillo, Sergio D.; Amaro-Estrada, Itzel; Quiroz-Casta?eda, Rosa Estela
We present here the draft genome sequence of the first ?Candidatus Mycoplasma haemobos? strain found in cattle in Mexico. This hemotropic mycoplasma causes acute and chronic disease in animals. This genome is a starting point for studying the role of this mycoplasma in coinfections and synergistic mechanisms associated with the disease.
van den Berg, Irene; Boichard, Didier; Lund, Mogens Sandø
The increasing number of sequenced individuals makes the use of sequences for genomic prediction feasible. With the current SNP chips, the accuracy of genomic selection is limited in populations with low linkage disequilibrium, as is the case in across-breed prediction in dairy cattle. Sequence d...
Zhang, Qianqian; Guldbrandtsen, Bernt; Bosse, Mirte
Runs of homozygosity (ROH) are identified in four dairy cattle breeds using NGS data. Cattle populations have been exposed to strong artificial selection for some generations. Genomic regions under selection will show increased levels of ROH. By investigating the relationship between ROH and dist......Runs of homozygosity (ROH) are identified in four dairy cattle breeds using NGS data. Cattle populations have been exposed to strong artificial selection for some generations. Genomic regions under selection will show increased levels of ROH. By investigating the relationship between ROH...... and distribution of predicted deleterious and tolerated variants, we can gain insight into how selection shapes the distribution of functional variants in inbred regions. We observe that predicted deleterious variants are more enriched in ROHs than predicted tolerated variants. Moreover, increase of enrichment...
Stothard, Paul; Chung, Won-Hyong; Jeon, Heoyn-Jeong; Miller, Stephen P.; Choi, So-Young; Lee, Jeong-Koo; Yang, Bokyoung; Lee, Kyung-Tai; Han, Kwang-Jin; Kim, Hyeong-Cheol; Jeong, Dongkee; Oh, Jae-Don; Kim, Namshin; Kim, Tae-Hun; Lee, Hak-Kyo; Lee, Sung-Jin
A main goal of cattle genomics is to identify DNA differences that account for variations in economically important traits. In this study, we performed whole-genome analyses of three important cattle breeds in Korea—Hanwoo, Jeju Heugu, and Korean Holstein—using the Illumina HiSeq 2000 sequencing platform. We achieved 25.5-, 29.6-, and 29.5-fold coverage of the Hanwoo, Jeju Heugu, and Korean Holstein genomes, respectively, and identified a total of 10.4 million single nucleotide polymorphisms (SNPs), of which 54.12% were found to be novel. We also detected 1,063,267 insertions–deletions (InDels) across the genomes (78.92% novel). Annotations of the datasets identified a total of 31,503 nonsynonymous SNPs and 859 frameshift InDels that could affect phenotypic variations in traits of interest. Furthermore, genome-wide copy number variation regions (CNVRs) were detected by comparing the Hanwoo, Jeju Heugu, and previously published Chikso genomes against that of Korean Holstein. A total of 992, 284, and 1881 CNVRs, respectively, were detected throughout the genome. Moreover, 53, 65, 45, and 82 putative regions of homozygosity (ROH) were identified in Hanwoo, Jeju Heugu, Chikso, and Korean Holstein respectively. The results of this study provide a valuable foundation for further investigations to dissect the molecular mechanisms underlying variation in economically important traits in cattle and to develop genetic markers for use in cattle breeding. PMID:24992012
Smaragdov, M G
Genomic selection is a method based on the use of single nucleotide polymorphisms (SNPs) as markers for detecting animal or plant genotype values. The review describes the genomic selection of milk cattle 5 years after the design of dense SNP chips. References to the application of genomic selection to other animal and plant species are given. The main principles of constructing linear and nonlinear mathematical models that allow one to determine genomic estimates in animals are briefly described. Particular attention is focused on the accuracy and the phenomenon of the additivity ofgenomic estimates, as well as to the prospective use of various genomic selection schemes that consider it over dozens of generations. Information including international organizations that provide the consolidation of genomic information from different countries aimed at designing global reference populations of milk cattle is reported. The results of the practical application of genomic selection to detecting of the breeding value of milk cattle over 5 years are demonstrated in the table, which makes it possible to visually assess the achievements of this highly technological field of cattle breeding.
Raluca G. Mateescu
Full Text Available Improvements in eating satisfaction will benefit consumers and should increase beef demand which is of interest to the beef industry. Tenderness, juiciness, and flavor are major determinants of the palatability of beef and are often used to reflect eating satisfaction. Carcass qualities are used as indicator traits for meat quality, with higher quality grade carcasses expected to relate to more tender and palatable meat. However, meat quality is a complex concept determined by many component traits making interpretation of genome-wide association studies (GWAS on any one component challenging to interpret. Recent approaches combining traditional GWAS with gene network interactions theory could be more efficient in dissecting the genetic architecture of complex traits. Phenotypic measures of 23 traits reflecting carcass characteristics, components of meat quality, along with mineral and peptide concentrations were used along with Illumina 54k bovine SNP genotypes to derive an annotated gene network associated with meat quality in 2,110 Angus beef cattle. The efficient mixed model association (EMMAX approach in combination with a genomic relationship matrix was used to directly estimate the associations between 54k SNP genotypes and each of the 23 component traits. Genomic correlated regions were identified by partial correlations which were further used along with an information theory algorithm to derive gene network clusters. Correlated SNP across 23 component traits were subjected to network scoring and visualization software to identify significant SNP. Significant pathways implicated in the meat quality complex through GO term enrichment analysis included angiogenesis, inflammation, transmembrane transporter activity, and receptor activity. These results suggest that network analysis using partial correlations and annotation of significant SNP can reveal the genetic architecture of complex traits and provide novel information regarding
Yum, Soo-Young; Youn, Ki-Young; Choi, Woo-Jae; Jang, Goo
The production of transgenic farm animals (e.g., cattle) via genome engineering for the gain or loss of gene functions is an important undertaking. In the initial stages of genome engineering, DNA micro-injection into one-cell stage embryos (zygotes) followed by embryo transfer into a recipient was performed because of the ease of the procedure. However, as this approach resulted in severe mosaicism and has a low efficiency, it is not typically employed in the cattle as priority, unlike in mice. To overcome the above issue with micro-injection in cattle, somatic cell nuclear transfer (SCNT) was introduced and successfully used to produce cloned livestock. The application of SCNT for the production of transgenic livestock represents a significant advancement, but its development speed is relatively slow because of abnormal reprogramming and low gene targeting efficiency. Recent genome editing technologies (e.g., ZFN, TALEN, and CRISPR-Cas9) have been rapidly adapted for applications in cattle and great results have been achieved in several fields such as disease models and bioreactors. In the future, genome engineering technologies will accelerate our understanding of genetic traits in bovine and will be readily adapted for bio-medical applications in cattle.
Yum, Soo-Young; Youn, Ki-Young; Choi, Woo-Jae; Jang, Goo
The production of transgenic farm animals (e.g., cattle) via genome engineering for the gain or loss of gene functions is an important undertaking. In the initial stages of genome engineering, DNA micro-injection into one-cell stage embryos (zygotes) followed by embryo transfer into a recipient was performed because of the ease of the procedure. However, as this approach resulted in severe mosaicism and has a low efficiency, it is not typically employed in the cattle as priority, unlike in mi...
Martínez, Rodrigo; Bejarano, Diego; Gómez, Yolanda; Dasoneville, Romain; Jiménez, Ariel; Even, Gael; Sölkner, Johann; Mészáros, Gabor
Genotypic and phenotypic data of 1,562 animals were analyzed to find genomic regions that potentially influence the birth weight (BW), weaning weight at seven months of age (WW) and yearling weight (YW) of Colombian Brahman cattle, with genotyping conducted using Illumina Bead chip array with 74,669 SNPs. A Single Step Genomic BLUP (ssGBLP), approach was used to estimate the proportion of variance explained by each marker. Multiple regions scattered across the genome were found to influence weights at different ages, also dependent on the trait component (direct or maternal). The most interesting regions were connected to previously identified QTLs and genes, such as ADAMTSL3, CAPN2, CAPN2, FABP6, ZEB2 influencing growth and weight traits. The identified regions will contribute to the development and refinement of genomic selection programs for Zebu Brahman cattle in Colombia.
Loor, Juan J; Bionaz, Massimo; Drackley, James K
Microarray development changed the way biologists approach the holistic study of cells and tissues. In dairy cattle biosciences, the application of omics technology, from spotted microarrays to next-generation sequencing and proteomics, has grown steadily during the past 10 years. Omics has found application in fields such as dairy cattle nutritional physiology, reproduction, and immunology. Generating biologically meaningful data from omics studies relies on bioinformatics tools. Both are key components of the systems physiology toolbox, which allows study of the interactions between a condition (e.g., nutrition, physiological state) with tissue gene/protein expression and the associated changes in biological functions. The nature of physiologic and metabolic adaptations in dairy cattle at any stage of the life cycle is multifaceted, involves multiple tissues, and is dynamic, e.g., the transition from late-pregnancy to lactation. Application of integrative systems physiology in periparturient dairy cattle has already advanced knowledge of the simultaneous functional adaptations in liver, adipose, and mammary tissue.
The development of molecular techniques has offered possibilities to identify quantitative trait loci (QTL). Studies in dairy cattle have mainly focused on milk production traits. This thesis first gives an overview of the main identified QTL for milk production traits. Subsequently, a study to
van den Berg, Irene; Guldbrandtsen, Bernt; Hozé, C
Our objective was to investigate the potential benefits of using sequence data to improve across breed genomic prediction, using data from five French and Danish dairy cattle breeds. First, QTL for protein yield were detected using high density genotypes. Part of the QTL detected within breed was...
Sahana, G; Guldbrandtsen, B; Bendixen, C
A genome-wide association study was conducted using a mixed model analysis for QTL for fertility traits in Danish and Swedish Holstein cattle. The analysis incorporated 2,531 progeny tested bulls, and a total of 36 387 SNP markers on 29 bovine autosomes were used. Eleven fertility traits were ana...
Sahana, Goutam; Guldbrandtsen, Bernt; Lund, Mogens Sandø
A total of 22 quantitative trait loci (QTL) were detected on 19 chromosomes for direct and maternal calving traits in cattle using a genome-wide association study. Calving performance is affected by the genotypes of both the calf (direct effect) and dam (maternal effect). To identify the QTL cont...
Genomic evaluations of dairy cattle in the United States have been available for Brown Swiss, Holsteins, and Jerseys since 2009 and for Ayrshires since 2013. As of February 2015, 2,281 Guernsey bulls and cows had genotypes from collaboration between the United States, Canada, England, and the island...
Brøndum, Rasmus Froberg; Rius-Vilarrasa, E; Strandén, I
This study investigated the possibility of increasing the reliability of direct genomic values (DGV) by combining reference opulations. The data were from 3,735 bulls from Danish, Swedish, and Finnish Red dairy cattle populations. Single nucleotide polymorphism markers were fitted as random...
van den Berg, Irene; Guldbrandtsen, Bernt; Hozé, C
Our objective was to investigate the potential benefits of using sequence data to improve across breed genomic prediction, using data from five French and Danish dairy cattle breeds. First, QTL for protein yield were detected using high density genotypes. Part of the QTL detected within breed...
The Korean Hanwoo cattle have been intensively selected for production traits, especially high intramuscular fat content. It is believed that ancient crossings between different breeds contributed to forming the Hanwoo, but little is known about the genomic differences and similarities between other...
Full Text Available Abstract Background Comparative mapping provides new insights into the evolutionary history of genomes. In particular, recent studies in mammals have suggested a role for segmental duplication in genome evolution. In some species such as Drosophila or maize, transposable elements (TEs have been shown to be involved in chromosomal rearrangements. In this work, we have explored the presence of interspersed repeats in regions of chromosomal rearrangements, using an updated high-resolution integrated comparative map among cattle, man and mouse. Results The bovine, human and mouse comparative autosomal map has been constructed using data from bovine genetic and physical maps and from FISH-mapping studies. We confirm most previous results but also reveal some discrepancies. A total of 211 conserved segments have been identified between cattle and man, of which 33 are new segments and 72 correspond to extended, previously known segments. The resulting map covers 91% and 90% of the human and bovine genomes, respectively. Analysis of breakpoint regions revealed a high density of species-specific interspersed repeats in the human and mouse genomes. Conclusion Analysis of the breakpoint regions has revealed specific repeat density patterns, suggesting that TEs may have played a significant role in chromosome evolution and genome plasticity. However, we cannot rule out that repeats and breakpoints accumulate independently in the few same regions where modifications are better tolerated. Likewise, we cannot ascertain whether increased TE density is the cause or the consequence of chromosome rearrangements. Nevertheless, the identification of high density repeat clusters combined with a well-documented repeat phylogeny should highlight probable breakpoints, and permit their precise dating. Combining new statistical models taking the present information into account should help reconstruct ancestral karyotypes.
Full Text Available Puberty is a complex physiological event by which animals mature into an adult capable of sexual reproduction. In order to enhance our understanding of the genes and regulatory pathways and networks involved in puberty, we characterized the transcriptome of five reproductive tissues (i.e. hypothalamus, pituitary gland, ovary, uterus, and endometrium as well as tissues known to be relevant to growth and metabolism needed to achieve puberty (i.e., longissimus dorsi muscle, adipose, and liver. These tissues were collected from pre- and post-pubertal Brangus heifers (3/8 Brahman; Bos indicus x 5/8 Angus; Bos taurus derived from a population of cattle used to identify quantitative trait loci associated with fertility traits (i.e., age of first observed corpus luteum (ACL, first service conception (FSC, and heifer pregnancy (HPG. In order to exploit the power of complementary omics analyses, pre- and post-puberty co-expression gene networks were constructed by combining the results from genome-wide association studies (GWAS, RNA-Seq, and bovine transcription factors. Eight tissues among pre-pubertal and post-pubertal Brangus heifers revealed 1,515 differentially expressed and 943 tissue-specific genes within the 17,832 genes confirmed by RNA-Seq analysis. The hypothalamus experienced the most notable up-regulation of genes via puberty (i.e., 204 out of 275 genes. Combining the results of GWAS and RNA-Seq, we identified 25 loci containing a single nucleotide polymorphism (SNP associated with ACL, FSC, and (or HPG. Seventeen of these SNP were within a gene and 13 of the genes were expressed in uterus or endometrium. Multi-tissue omics analyses revealed 2,450 co-expressed genes relative to puberty. The pre-pubertal network had 372,861 connections whereas the post-pubertal network had 328,357 connections. A sub-network from this process revealed key transcriptional regulators (i.e., PITX2, FOXA1, DACH2, PROP1, SIX6, etc.. Results from these multi
Guosheng, Su; Madsen, Per; Nielsen, Ulrik Sander
This study investigated the accuracy of direct genomic breeding values (DGV) using a genomic BLUP model, genomic enhanced breeding values (GEBV) using a one-step blending approach, and GEBV using a selection index blending approach for 15 traits of Nordic Red Cattle. The data comprised 6,631 bulls...... of which 4,408 bulls were genotyped using Illumina Bovine SNP50 BeadChip (Illumina, San Diego, CA). To validate reliability of genomic predictions, about 20% of the youngest genotyped bulls were taken as test data set. Deregressed proofs (DRP) were used as response variables for genomic predictions....... Reliabilities of genomic predictions in the validation analyses were measured as squared correlations between DRP and genomic predictions corrected for reliability of DRP, based on the bulls in the test data sets. A set of weighting (scaling) factors was used to construct the combined relationship matrix among...
Genomic prediction is successful in single breed genetic evaluation. However, there is no achievement in acoress breed prediction until now. This thesis investigated genomic prediction across populations and breeds using Chinese Holsterin, Nordic Holstein, Norwgian Red, and Nordic Red. Nordic Red...
Sandoval-Azuara, Sarai Estrella; Muñiz-Salazar, Raquel; Perea-Jacobo, Ricardo; Robbe-Austerman, Suelee; Perera-Ortiz, Alejandro; López-Valencia, Gilberto; Bravo, Doris M; Sanchez-Flores, Alejandro; Miranda-Guzmán, Daniela; Flores-López, Carlos Alberto; Zenteno-Cuevas, Roberto; Laniado-Laborín, Rafael; de la Cruz, Fabiola Lafarga; Stuber, Tod P
To determine genetic diversity by comparing the whole genome sequences of cattle and human Mycobacterium bovis isolates from Baja California. A whole genome sequencing strategy was used to obtain the molecular fingerprints of 172 isolates of M. bovis obtained from Baja California, Mexico; 155 isolates were from cattle and 17 isolates were from humans. Spoligotypes were characterized in silico and single nucleotide polymorphism (SNP) differences between the isolates were evaluated. A total of 12 M. bovis spoligotype patterns were identified in cattle and humans. Two predominant spoligotypes patterns were seen in both cattle and humans: SB0145 and SB1040. The SB0145 spoligotype represented 59% of cattle isolates (n=91) and 65% of human isolates (n=11), while the SB1040 spoligotype represented 30% of cattle isolates (n=47) and 30% of human isolates (n=5). When evaluating SNP differences, the human isolates were intimately intertwined with the cattle isolates. All isolates from humans had spoligotype patterns that matched those observed in the cattle isolates, and all human isolates shared common ancestors with cattle in Baja California based on SNP analysis. This suggests that most human tuberculosis caused by M. bovis in Baja California is derived from M. bovis circulating in Baja California cattle. These results reinforce the importance of bovine tuberculosis surveillance and control in this region. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
Lund, Mogens Sandø; Su, Guosheng; Janss, Luc
In order to obtain accurate genomic breeding values a large number of reference animals with both phenotype and genotype data are needed. This poses a challenge for breeds with small reference populations. One option to overcome this obstacle is to use a multi-breed reference population. However...... that the effect of multi-breed reference populations on the accuracy of genomic prediction is highly affected by the genetic distance between breeds. When combining populations of the same breeds from different countries, large increases in accuracy are seen, whereas for admixed populations with some exchange...... of sires, substantial but smaller gains are found. Little or no benefit is found when combining distantly related breeds such as Holstein and Jersey and using the widely used genomic BLUP model. By using more sophisticated Bayesian variable selection models that put more focus on genomic markers in strong...
Mudadu, Maurício A; Porto-Neto, Laercio R; Mokry, Fabiana B; Tizioto, Polyana C; Oliveira, Priscila S N; Tullio, Rymer R; Nassu, Renata T; Niciura, Simone C M; Tholon, Patrícia; Alencar, Maurício M; Higa, Roberto H; Rosa, Antônio N; Feijó, Gélson L D; Ferraz, André L J; Silva, Luiz O C; Medeiros, Sérgio R; Lanna, Dante P; Nascimento, Michele L; Chaves, Amália S; Souza, Andrea R D L; Packer, Irineu U; Torres, Roberto A A; Siqueira, Fabiane; Mourão, Gerson B; Coutinho, Luiz L; Reverter, Antonio; Regitano, Luciana C A
Nelore is the major beef cattle breed in Brazil with more than 130 million heads. Genome-wide association studies (GWAS) are often used to associate markers and genomic regions to growth and meat quality traits that can be used to assist selection programs. An alternative methodology to traditional GWAS that involves the construction of gene network interactions, derived from results of several GWAS is the AWM (Association Weight Matrices)/PCIT (Partial Correlation and Information Theory). With the aim of evaluating the genetic architecture of Brazilian Nelore cattle, we used high-density SNP genotyping data (~770,000 SNP) from 780 Nelore animals comprising 34 half-sibling families derived from highly disseminated and unrelated sires from across Brazil. The AWM/PCIT methodology was employed to evaluate the genes that participate in a series of eight phenotypes related to growth and meat quality obtained from this Nelore sample. Our results indicate a lack of structuring between the individuals studied since principal component analyses were not able to differentiate families by its sires or by its ancestral lineages. The application of the AWM/PCIT methodology revealed a trio of transcription factors (comprising VDR, LHX9 and ZEB1) which in combination connected 66 genes through 359 edges and whose biological functions were inspected, some revealing to participate in biological growth processes in literature searches. The diversity of the Nelore sample studied is not high enough to differentiate among families neither by sires nor by using the available ancestral lineage information. The gene networks constructed from the AWM/PCIT methodology were a useful alternative in characterizing genes and gene networks that were allegedly influential in growth and meat quality traits in Nelore cattle.
Castro Dias Cuyabano, Beatriz; Lund, Mogens Sandø; Rosa, G J M
SNP markers have been widely explored in genome based prediction. This study explored the use of haplotype blocks (haploblocks) to predict five milk production traits (fertility, mastitis, protein, fat and milk yield), using a mix of Nordic Red cattle as reference population for training. Predict......SNP markers have been widely explored in genome based prediction. This study explored the use of haplotype blocks (haploblocks) to predict five milk production traits (fertility, mastitis, protein, fat and milk yield), using a mix of Nordic Red cattle as reference population for training....... Predictions were performed under a Bayesian approach comparing a GBLUP and a mixture model. In general, predictions were more reliable when using haploblocks instead of individual SNPs as predictors. The Danish Red cattle presented the largest benefit in predictive ability from haploblocks, achieving 5.......1% higher reliability than with the individual SNP approach in mastitis. This work gives evidence that predictions using haploblocks along with a combined training population of dairy cattle, may improve prediction accuracy of important traits in the individual populations....
Utsunomiya, Yuri T; Carmo, Adriana S; Neves, Haroldo H R; Carvalheiro, Roberto; Matos, Márcia C; Zavarez, Ludmilla B; Ito, Pier K R K; Pérez O'Brien, Ana M; Sölkner, Johann; Porto-Neto, Laercio R; Schenkel, Flávio S; McEwan, John; Cole, John B; da Silva, Marcos V G B; Van Tassell, Curtis P; Sonstegard, Tad S; Garcia, José Fernando
The reproductive performance of bulls has a high impact on the beef cattle industry. Scrotal circumference (SC) is the most recorded reproductive trait in beef herds, and is used as a major selection criterion to improve precocity and fertility. The characterization of genomic regions affecting SC can contribute to the identification of diagnostic markers for reproductive performance and uncover molecular mechanisms underlying complex aspects of bovine reproductive biology. In this paper, we report a genome-wide scan for chromosome segments explaining differences in SC, using data of 861 Nellore bulls (Bos indicus) genotyped for over 777,000 single nucleotide polymorphisms. Loci that excel from the genome background were identified on chromosomes 4, 6, 7, 10, 14, 18 and 21. The majority of these regions were previously found to be associated with reproductive and body size traits in cattle. The signal on chromosome 14 replicates the pleiotropic quantitative trait locus encompassing PLAG1 that affects male fertility in cattle and stature in several species. Based on intensive literature mining, SP4, MAGEL2, SH3RF2, PDE5A and SNAI2 are proposed as novel candidate genes for SC, as they affect growth and testicular size in other animal models. These findings contribute to linking reproductive phenotypes to gene functions, and may offer new insights on the molecular biology of male fertility.
Yuri T Utsunomiya
Full Text Available The reproductive performance of bulls has a high impact on the beef cattle industry. Scrotal circumference (SC is the most recorded reproductive trait in beef herds, and is used as a major selection criterion to improve precocity and fertility. The characterization of genomic regions affecting SC can contribute to the identification of diagnostic markers for reproductive performance and uncover molecular mechanisms underlying complex aspects of bovine reproductive biology. In this paper, we report a genome-wide scan for chromosome segments explaining differences in SC, using data of 861 Nellore bulls (Bos indicus genotyped for over 777,000 single nucleotide polymorphisms. Loci that excel from the genome background were identified on chromosomes 4, 6, 7, 10, 14, 18 and 21. The majority of these regions were previously found to be associated with reproductive and body size traits in cattle. The signal on chromosome 14 replicates the pleiotropic quantitative trait locus encompassing PLAG1 that affects male fertility in cattle and stature in several species. Based on intensive literature mining, SP4, MAGEL2, SH3RF2, PDE5A and SNAI2 are proposed as novel candidate genes for SC, as they affect growth and testicular size in other animal models. These findings contribute to linking reproductive phenotypes to gene functions, and may offer new insights on the molecular biology of male fertility.
Sithembile Olga Makina
Full Text Available Information about genetic diversity and population structure among cattle breeds is essential for genetic improvement, understanding of environmental adaptation as well as utilization and conservation of cattle breeds. This study investigated genetic diversity and the population structure among six cattle breeds in South African (SA including Afrikaner (n=44, Nguni (n=54, Drakensberger (n=47, Bonsmara (n=44, Angus (n=31 and Holstein (n=29. Genetic diversity within cattle breeds was analyzed using three measures of genetic diversity namely allelic richness (AR, expected heterozygosity (He and inbreeding coefficient (f. Genetic distances between breed pairs were evaluated using Nei’s genetic distance. Population structure was assessed using model-based clustering (ADMIXTURE. Results of this study revealed that the allelic richness ranged from 1.88 (Afrikaner to 1.73 (Nguni. Afrikaner cattle had the lowest level of genetic diversity (He=0.24 and the Drakensberger cattle (He=0.30 had the highest level of genetic variation among indigenous and locally-developed cattle breeds. The level of inbreeding was lower across the studied cattle breeds. As expected the average genetic distance was the greatest between indigenous cattle breeds and Bos taurus cattle breeds but the lowest among indigenous and locally-developed breeds. Model-based clustering revealed some level of admixture among indigenous and locally-developed breeds and supported the clustering of the breeds according to their history of origin. The results of this study provided useful insight regarding genetic structure of South African cattle breeds.
Machado, Henrique; Gram, Lone
Vibrionaceae is a large marine bacterial family, which can constitute up to 50% of the prokaryotic population in marine waters. Photobacterium is the second largest genus in the family and we used comparative genomics on 35 strains representing 16 of the 28 species described so far, to understand...... the genomic diversity present in the Photobacterium genus. Such understanding is important for ecophysiology studies of the genus. We used whole genome sequences to evaluate phylogenetic relationships using several analyses (16S rRNA, MLSA, fur, amino-acid usage, ANI), which allowed us to identify two...
Full Text Available Yellowstone National Park is home to one of the only plains bison populations that have continuously existed on their present landscape since prehistoric times without evidence of domestic cattle introgression. Previous studies characterized the relatively high levels of nuclear genetic diversity in these bison, but little is known about their mitochondrial haplotype diversity. This study assessed mitochondrial genomes from 25 randomly selected Yellowstone bison and found 10 different mitochondrial haplotypes with a haplotype diversity of 0.78 (± 0.06. Spatial analysis of these mitochondrial DNA (mtDNA haplotypes did not detect geographic population subdivision (FST = -0.06, p = 0.76. However, we identified two independent and historically important lineages in Yellowstone bison by combining data from 65 bison (defined by 120 polymorphic sites from across North America representing a total of 30 different mitochondrial DNA haplotypes. Mitochondrial DNA haplotypes from one of the Yellowstone lineages represent descendants of the 22 indigenous bison remaining in central Yellowstone in 1902. The other mitochondrial DNA lineage represents descendants of the 18 females introduced from northern Montana in 1902 to supplement the indigenous bison population and develop a new breeding herd in the northern region of the park. Comparing modern and historical mitochondrial DNA diversity in Yellowstone bison helps uncover a historical context of park restoration efforts during the early 1900s, provides evidence against a hypothesized mitochondrial disease in bison, and reveals the signature of recent hybridization between American plains bison (Bison bison bison and Canadian wood bison (B. b. athabascae. Our study demonstrates how mitochondrial DNA can be applied to delineate the history of wildlife species and inform future conservation actions.
Full Text Available Development of body hair is an important physiological and cellular process that leads to better adaption in tropical environments for dairy cattle. Various studies suggested a major gene and, more recently, associated genes for hairy locus in dairy cattle. Main aim of this study was to i employ a variant of the discordant sib pair model, in which half sibs from the same sires are randomly sampled using their affection statues, ii use various single marker regression approaches, and iii use whole genome regression approaches to dissect genetic architecture of the hairy gene in the cattle. Whole and single genome regression approaches detected strong genomic signals from Chromosome 23. Although there is a major gene effect on hairy phenotype sourced from chromosome 23: whole genome regression approach also suggested polygenic component related with other parts of the genome. Such a result could not be obtained by any of the single marker approaches.
Full Text Available Abstract Background In future Best Linear Unbiased Prediction (BLUP evaluations of dairy cattle, genomic selection of young sires will cause evaluation biases and loss of accuracy once the selected ones get progeny. Methods To avoid such bias in the estimation of breeding values, we propose to include information on all genotyped bulls, including the culled ones, in BLUP evaluations. Estimated breeding values based on genomic information were converted into genomic pseudo-performances and then analyzed simultaneously with actual performances. Using simulations based on actual data from the French Holstein population, bias and accuracy of BLUP evaluations were computed for young sires undergoing progeny testing or genomic pre-selection. For bulls pre-selected based on their genomic profile, three different types of information can be included in the BLUP evaluations: (1 data from pre-selected genotyped candidate bulls with actual performances on their daughters, (2 data from bulls with both actual and genomic pseudo-performances, or (3 data from all the genotyped candidates with genomic pseudo-performances. The effects of different levels of heritability, genomic pre-selection intensity and accuracy of genomic evaluation were considered. Results Including information from all the genotyped candidates, i.e. genomic pseudo-performances for both selected and culled candidates, removed bias from genetic evaluation and increased accuracy. This approach was effective regardless of the magnitude of the initial bias and as long as the accuracy of the genomic evaluations was sufficiently high. Conclusions The proposed method can be easily and quickly implemented in BLUP evaluations at the national level, although some improvement is necessary to more accurately propagate genomic information from genotyped to non-genotyped animals. In addition, it is a convenient method to combine direct genomic, phenotypic and pedigree-based information in a multiple
Garrick Dorian J
Full Text Available Abstract Artificial selection has proven to be effective at altering the performance of animal production systems. Nevertheless, selection based on assessment of the genetic superiority of candidates is suboptimal as a result of errors in the prediction of genetic merit. Conventional breeding programs may extend phenotypic measurements on selection candidates to include correlated indicator traits, or delay selection decisions well beyond puberty so that phenotypic performance can be observed on progeny or other relatives. Extending the generation interval to increase the accuracy of selection reduces annual rates of gain compared to accurate selection and use of parents of the next generation at the immediate time they reach breeding age. Genomic prediction aims at reducing prediction errors at breeding age by exploiting information on the transmission of chromosome fragments from parents to selection candidates, in conjunction with knowledge on the value of every chromosome fragment. For genomic prediction to influence beef cattle breeding programs and the rate or cost of genetic gains, training analyses must be undertaken, and genomic prediction tools made available for breeders and other industry stakeholders. This paper reviews the nature or kind of studies currently underway, the scope or extent of some of those studies, and comments on the likely predictive value of genomic information for beef cattle improvement.
Bowler, Ch.; Allen, A. E.; Badger, J. H.; Grimwood, J.; Jabbari, K.; Kuo, A.; Maheswari, U.; Martens, C.; Maumus, F.; Otillar, R. P.; Rayko, E.; Salamov, A.; Vandepoele, K.; Beszteri, B.; Gruber, A.; Heijde, M.; Katinka, M.; Mock, T.; Valentin, K.; Verret, F.; Berges, J. A.; Brownlee, C.; Cadoret, J.-P.; Chiovitti, A.; Choi, Ch. J.; Coesel, S.; De Martino, A.; Detter, J. Ch.; Durkin, C.; Falciatore, A.; Fournet, J.; Haruta, M.; Huysman, M. J. J.; Jenkins, B. D.; Jiroutová, Kateřina; Jorgensen, R. E.; Joubert, Y.; Kaplan, A.; Kröger, N.; Kroth, P. G.; La Roche, J.; Lindquist, E.; Lommer, M.; Martin–Jézéquel, V.; Lopez, P. J.; Lucas, S.; Mangogna, M.; McGinnis, K.; Medlin, L. K.; Montsant, A.; Oudot–Le Secq, M.-P.; Napoli, C.; Oborník, Miroslav; Schnitzler Parker, M.; Petit, J.-L.; Porcel, B. M.; Poulsen, N.; Robison, M.; Rychlewski, L.; Rynearson, T. A.; Schmutz, J.; Shapiro, H.; Siaut, M.; Stanley, M.; Sussman, M. R.; Taylor, A. R.; Vardi, A.; von Dassow, P.; Vyverman, W.; Willis, A.; Wyrwicz, L. S.; Rokhsar, D. S.; Weissenbach, J.; Armbrust, E. V.; Green, B. R.; Van de Peer, Y.; Grigoriev, I. V.
Roč. 456, 13-11-2008 (2008), s. 239-244 ISSN 0028-0836 Institutional research plan: CEZ:AV0Z60220518 Keywords : Phaeodactylum * genome * evolution * diatom Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 31.434, year: 2008
Ribeiro, Teresa; Barrela, Ricardo M.; Bergès, Hélène; Marques, Cristina; Loureiro, João; Morais-Cecílio, Leonor; Paiva, Jorge A. P.
The genus Eucalyptus encloses several species with high ecological and economic value, being the subgenus Symphyomyrtus one of the most important. Species such as E. grandis and E. globulus are well characterized at the molecular level but knowledge regarding genome and chromosome organization is very scarce. Here we characterized and compared the karyotypes of three economically important species, E. grandis, E. globulus, and E. calmadulensis, and three with ecological relevance, E. pulverulenta, E. cornuta, and E. occidentalis, through an integrative approach including genome size estimation, fluorochrome banding, rDNA FISH, and BAC landing comprising genes involved in lignin biosynthesis. All karyotypes show a high degree of conservation with pericentromeric 35S and 5S rDNA loci in the first and third pairs, respectively. GC-rich heterochromatin was restricted to the 35S rDNA locus while the AT-rich heterochromatin pattern was species-specific. The slight differences in karyotype formulas and distribution of AT-rich heterochromatin, along with genome sizes estimations, support the idea of Eucalyptus genome evolution by local expansions of heterochromatin clusters. The unusual co-localization of both rDNA with AT-rich heterochromatin was attributed mainly to the presence of silent transposable elements in those loci. The cinnamoyl CoA reductase gene (CCR1) previously assessed to linkage group 10 (LG10) was clearly localized distally at the long arm of chromosome 9 establishing an unexpected correlation between the cytogenetic chromosome 9 and the LG10. Our work is novel and contributes to the understanding of Eucalyptus genome organization which is essential to develop successful advanced breeding strategies for this genus. PMID:27148332
Full Text Available Dairy cattle breeders have exploited technological advances that have emerged in the past in regards to reproduction and genomics. The implementation of such technologies in routine breeding programs has permitted genetic gains in traditional milk production traits as well as, more recently, in low-heritability traits like health and fertility. As demand for dairy products increases, it is important for dairy breeders to optimize the use of available technologies and to consider the many emerging technologies that are currently being investigated in various fields. Here we review a number of technologies that have helped shape dairy breeding programs in the past and present, along with those potentially forthcoming. These tools have materialized in the areas of reproduction, genotyping and sequencing, genetic modification, and epigenetics. Although many of these technologies bring encouraging opportunities for genetic improvement of dairy cattle populations, their applications and benefits need to be weighed with their impacts on economics, genetic diversity, and society.
Höglund, Johanna Karolina; Guldbrandtsen, Bernt; Lund, Mogens Sandø
Background There is often a pronounced disagreement between results obtained from different genome-wide association studies in cattle. There are multiple reasons for this disagreement. Particularly the presence of false positives leads to a need to validate detected QTL before they are optimally...... incorporated or weighted in selection decisions or further studied for causal gene. In dairy cattle progeny testing scheme new data is routinely accumulated which can be used to validate previously discovered associations. However, the data is not an independent sample and the sample size may not be sufficient...... on the aim of the study. If the aim is to discover novel QTL, analyses of the COMB dataset is recommended, while in case of identification of the causal mutation underlying a QTL, confirmation of the discovered SNPs are necessary to avoid following a false positive...
Full Text Available Abstract Background Bovine adenovirus type 3 (BAV-3 belongs to the Mastadenovirus genus of the family Adenoviridae and is involved in respiratory and enteric infections of calves. The isolation of BAV-3 has not been reported prior to this study in China. In 2009, there were many cases in cattle showing similar clinical signs to BAV-3 infection and a virus strain, showing cytopathic effect in Madin-Darby bovine kidney cells, was isolated from a bovine nasal swab collected from feedlot cattle in Heilongjiang Province, China. The isolate was confirmed as a bovine adenovirus type 3 by PCR and immunofluorescence assay, and named as HLJ0955. So far only the complete genome sequence of prototype of BAV-3 WBR-1 strain has been reported. In order to further characterize the Chinese isolate HLJ0955, the complete genome sequence of HLJ0955 was determined. Results The size of the genome of the Chinese isolate HLJ0955 is 34,132 nucleotides in length with a G+C content of 53.6%. The coding sequences for gene regions of HLJ0955 isolate were similar to the prototype of BAV-3 WBR-1 strain, with 80.0-98.6% nucleotide and 87.5-98.8% amino acid identities. The genome of HLJ0955 strain contains 16 regions and four deletions in inverted terminal repeats, E1B region and E4 region, respectively. The complete genome and DNA binding protein gene based phylogenetic analysis with other adenoviruses were performed and the results showed that HLJ0955 isolate belonged to BAV-3 and clustered within the Mastadenovirus genus of the family Adenoviridae. Conclusions This is the first study to report the isolation and molecular characterization of BAV-3 from cattle in China. The phylogenetic analysis performed in this study supported the use of the DNA binding protein gene of adenovirus as an appropriate subgenomic target for the classification of different genuses of the family Adenoviridae on the molecular basis. Meanwhile, a large-scale pathogen and serological epidemiological
Background Bovine adenovirus type 3 (BAV-3) belongs to the Mastadenovirus genus of the family Adenoviridae and is involved in respiratory and enteric infections of calves. The isolation of BAV-3 has not been reported prior to this study in China. In 2009, there were many cases in cattle showing similar clinical signs to BAV-3 infection and a virus strain, showing cytopathic effect in Madin-Darby bovine kidney cells, was isolated from a bovine nasal swab collected from feedlot cattle in Heilongjiang Province, China. The isolate was confirmed as a bovine adenovirus type 3 by PCR and immunofluorescence assay, and named as HLJ0955. So far only the complete genome sequence of prototype of BAV-3 WBR-1 strain has been reported. In order to further characterize the Chinese isolate HLJ0955, the complete genome sequence of HLJ0955 was determined. Results The size of the genome of the Chinese isolate HLJ0955 is 34,132 nucleotides in length with a G+C content of 53.6%. The coding sequences for gene regions of HLJ0955 isolate were similar to the prototype of BAV-3 WBR-1 strain, with 80.0-98.6% nucleotide and 87.5-98.8% amino acid identities. The genome of HLJ0955 strain contains 16 regions and four deletions in inverted terminal repeats, E1B region and E4 region, respectively. The complete genome and DNA binding protein gene based phylogenetic analysis with other adenoviruses were performed and the results showed that HLJ0955 isolate belonged to BAV-3 and clustered within the Mastadenovirus genus of the family Adenoviridae. Conclusions This is the first study to report the isolation and molecular characterization of BAV-3 from cattle in China. The phylogenetic analysis performed in this study supported the use of the DNA binding protein gene of adenovirus as an appropriate subgenomic target for the classification of different genuses of the family Adenoviridae on the molecular basis. Meanwhile, a large-scale pathogen and serological epidemiological investigations for BVA-3
Full Text Available Abstract Background The prediction of the outcomes from multistage breeding schemes is especially important for the introduction of genomic selection in dairy cattle. Decorrelated selection indices can be used for the optimisation of such breeding schemes. However, they decrease the accuracy of estimated breeding values and, therefore, the genetic gain to an unforeseeable extent and have not been applied to breeding schemes with different generation intervals and selection intensities in each selection path. Methods A grid search was applied in order to identify optimum breeding plans to maximise the genetic gain per year in a multistage, multipath dairy cattle breeding program. In this program, different values of the accuracy of estimated genomic breeding values and of their costs per individual were applied, whereby the total breeding costs were restricted. Both decorrelated indices and optimum selection indices were used together with fast multidimensional integration algorithms to produce results. Results In comparison to optimum indices, the genetic gain with decorrelated indices was up to 40% less and the proportion of individuals undergoing genomic selection was different. Additionally, the interaction between selection paths was counter-intuitive and difficult to interpret. Independent of using decorrelated or optimum selection indices, genomic selection replaced traditional progeny testing when maximising the genetic gain per year, as long as the accuracy of estimated genomic breeding values was ≥ 0.45. Overall breeding costs were mainly generated in the path "dam-sire". Selecting males was still the main source of genetic gain per year. Conclusion Decorrelated selection indices should not be used because of misleading results and the availability of accurate and fast algorithms for exact multidimensional integration. Genomic selection is the method of choice when maximising the genetic gain per year but genotyping females may not
Zhang, Qianqian; Guldbrandtsen, Bernt; Thomasen, Jørn Rind; Lund, Mogens Sandø; Sahana, Goutam
Longevity is an important economic trait in dairy production. Improvements in longevity could increase the average number of lactations per cow, thereby affecting the profitability of the dairy cattle industry. Improved longevity for cows reduces the replacement cost of stock and enables animals
Somavilla, A L; Sonstegard, T S; Higa, R H; Rosa, A N; Siqueira, F; Silva, L O C; Torres Júnior, R A A; Coutinho, L L; Mudadu, M A; Alencar, M M; Regitano, L C A
Brazilian Nellore cattle (Bos indicus) have been selected for growth traits for over more than four decades. In recent years, reproductive and meat quality traits have become more important because of increasing consumption, exports and consumer demand. The identification of genome regions altered by artificial selection can potentially permit a better understanding of the biology of specific phenotypes that are useful for the development of tools designed to increase selection efficiency. Therefore, the aims of this study were to detect evidence of recent selection signatures in Nellore cattle using extended haplotype homozygosity methodology and BovineHD marker genotypes (>777,000 single nucleotide polymorphisms) as well as to identify corresponding genes underlying these signals. Thirty-one significant regions (P meat quality, fatty acid profiles and immunity. In addition, 545 genes were identified in regions harboring selection signatures. Within this group, 58 genes were associated with growth, muscle and adipose tissue metabolism, reproductive traits or the immune system. Using relative extended haplotype homozygosity to analyze high-density single nucleotide polymorphism marker data allowed for the identification of regions potentially under artificial selection pressure in the Nellore genome, which might be used to better understand autozygosity and the effects of selection on the Nellore genome. © 2014 Stichting International Foundation for Animal Genetics.
Full Text Available BACKGROUND: The population structure and diversity of Lactococcus lactis subsp. lactis, a major industrial bacterium involved in milk fermentation, was determined at both gene and genome level. Seventy-six lactococcal isolates of various origins were studied by different genotyping methods and thirty-six strains displaying unique macrorestriction fingerprints were analyzed by a new multilocus sequence typing (MLST scheme. This gene-based analysis was compared to genomic characteristics determined by pulsed-field gel electrophoresis (PFGE. METHODOLOGY/PRINCIPAL FINDINGS: The MLST analysis revealed that L. lactis subsp. lactis is essentially clonal with infrequent intra- and intergenic recombination; also, despite its taxonomical classification as a subspecies, it displays a genetic diversity as substantial as that within several other bacterial species. Genome-based analysis revealed a genome size variability of 20%, a value typical of bacteria inhabiting different ecological niches, and that suggests a large pan-genome for this subspecies. However, the genomic characteristics (macrorestriction pattern, genome or chromosome size, plasmid content did not correlate to the MLST-based phylogeny, with strains from the same sequence type (ST differing by up to 230 kb in genome size. CONCLUSION/SIGNIFICANCE: The gene-based phylogeny was not fully consistent with the traditional classification into dairy and non-dairy strains but supported a new classification based on ecological separation between "environmental" strains, the main contributors to the genetic diversity within the subspecies, and "domesticated" strains, subject to recent genetic bottlenecks. Comparison between gene- and genome-based analyses revealed little relationship between core and dispensable genome phylogenies, indicating that clonal diversification and phenotypic variability of the "domesticated" strains essentially arose through substantial genomic flux within the dispensable
Full Text Available The Cervidae family comprises more than fifty species divided into three subfamilies: Capreolinae, Cervinae and Hydropotinae. A characteristic attribute for the species included in this family is the great karyotype diversity, with the chromosomal numbers ranging from 2n = 6 observed in female Muntiacus muntjak vaginalis to 2n = 70 found in Mazama gouazoubira as a result of numerous Robertsonian and tandem fusions. This work reports chromosomal homologies between cattle (Bos taurus, 2n = 60 and nine cervid species using a combination of whole chromosome and region-specific paints and BAC clones derived from cattle. We show that despite the great diversity of karyotypes in the studied species, the number of conserved chromosomal segments detected by 29 cattle whole chromosome painting probes was 35 for all Cervidae samples. The detailed analysis of the X chromosomes revealed two different morphological types within Cervidae. The first one, present in the Capreolinae is a sub/metacentric X with the structure more similar to the bovine X. The second type found in Cervini and Muntiacini is an acrocentric X which shows rearrangements in the proximal part that have not yet been identified within Ruminantia. Moreover, we characterised four repetitive sequences organized in heterochromatic blocks on sex chromosomes of the reindeer (Rangifer tarandus. We show that these repeats gave no hybridization signals to the chromosomes of the closely related moose (Alces alces and are therefore specific to the reindeer.
Frohlich, Jan; Kubickova, Svatava; Musilova, Petra; Cernohorska, Halina; Muskova, Helena; Vodicka, Roman; Rubes, Jiri
The Cervidae family comprises more than fifty species divided into three subfamilies: Capreolinae, Cervinae and Hydropotinae. A characteristic attribute for the species included in this family is the great karyotype diversity, with the chromosomal numbers ranging from 2n = 6 observed in female Muntiacus muntjak vaginalis to 2n = 70 found in Mazama gouazoubira as a result of numerous Robertsonian and tandem fusions. This work reports chromosomal homologies between cattle (Bos taurus, 2n = 60) and nine cervid species using a combination of whole chromosome and region-specific paints and BAC clones derived from cattle. We show that despite the great diversity of karyotypes in the studied species, the number of conserved chromosomal segments detected by 29 cattle whole chromosome painting probes was 35 for all Cervidae samples. The detailed analysis of the X chromosomes revealed two different morphological types within Cervidae. The first one, present in the Capreolinae is a sub/metacentric X with the structure more similar to the bovine X. The second type found in Cervini and Muntiacini is an acrocentric X which shows rearrangements in the proximal part that have not yet been identified within Ruminantia. Moreover, we characterised four repetitive sequences organized in heterochromatic blocks on sex chromosomes of the reindeer (Rangifer tarandus). We show that these repeats gave no hybridization signals to the chromosomes of the closely related moose (Alces alces) and are therefore specific to the reindeer.
Full Text Available Recent advances in high-throughput genotyping technologies have provided the opportunity to map genes using associations between complex traits and markers. Genome-wide association studies (GWAS based on either a single marker or haplotype have identified genetic variants and underlying genetic mechanisms of quantitative traits. Prompted by the achievements of studies examining economic traits in cattle and to verify the consistency of these two methods using real data, the current study was conducted to construct the haplotype structure in the bovine genome and to detect relevant genes genuinely affecting a carcass trait and a meat quality trait. Using the Illumina BovineHD BeadChip, 942 young bulls with genotyping data were introduced as a reference population to identify the genes in the beef cattle genome significantly associated with foreshank weight and triglyceride levels. In total, 92,553 haplotype blocks were detected in the genome. The regions of high linkage disequilibrium extended up to approximately 200 kb, and the size of haplotype blocks ranged from 22 bp to 199,266 bp. Additionally, the individual SNP analysis and the haplotype-based analysis detected similar regions and common SNPs for these two representative traits. A total of 12 and 7 SNPs in the bovine genome were significantly associated with foreshank weight and triglyceride levels, respectively. By comparison, 4 and 5 haplotype blocks containing the majority of significant SNPs were strongly associated with foreshank weight and triglyceride levels, respectively. In addition, 36 SNPs with high linkage disequilibrium were detected in the GNAQ gene, a potential hotspot that may play a crucial role for regulating carcass trait components.
Truchet, Laura; Walland, Julia; Wüthrich, Daniel; Boujon, Céline L; Posthaus, Horst; Bruggmann, Rémy; Schüpbach-Regula, Gertraud; Oevermann, Anna; Seuberlich, Torsten
Neuroinfectious diseases in livestock represent a severe threat to animal health, but their prevalence is not well documented and the etiology of disease often remains unidentified. The aims of this study were to generate baseline data on the prevalence of neuroinfectious diseases in cattle in Switzerland by neuropathological survey, and to identify disease-associated pathogens. The survey was performed over a 1-year period using a representative number of brainstem samples (n=1816) from fallen cattle. In total, 4% (n=73) of the animals had significant lesions, the most frequent types of which were indicative of viral (n=27) and bacterial (n=31) etiologies. Follow-up diagnostics by immunohistochemistry, PCR protocols and next-generation sequencing identified infection with Listeria monocytogenes (n=6), ovine herpesvirus 2 (n=7), bovine astrovirus CH13 (n=2), bovine herpesvirus 6 (n=6), bovine retrovirus CH15 (n=2), posavirus 1 (n=2), and porcine astroviruses (n=2). A retrospective questionnaire-based investigation indicated that animals' owners observed clinical signs of neurological disease in about one-third of cases with lesions, which was estimated to correspond to approximately 85 cases per year in the adult fallen cattle population in Switzerland. This estimate stands in sharp contrast to the number of cases reported to the authorities and reveals a gap in disease surveillance. Systematic neuropathological examination and follow-up molecular testing of neurologically diseased cattle could significantly enhance the efficiency of disease detection for the purposes of estimating the prevalence of endemic diseases, identifying new or re-emerging pathogens, and providing "early warnings" of disease outbreaks. Copyright © 2017 Elsevier B.V. All rights reserved.
Jenko, J; Wiggans, G R; Cooper, T A; Eaglen, S A E; Luff, W G de L; Bichard, M; Pong-Wong, R; Woolliams, J A
This study compares how different cow genotyping strategies increase the accuracy of genomic estimated breeding values (EBV) in dairy cattle breeds with low numbers. In these breeds, few sires have progeny records, and genotyping cows can improve the accuracy of genomic EBV. The Guernsey breed is a small dairy cattle breed with approximately 14,000 recorded individuals worldwide. Predictions of phenotypes of milk yield, fat yield, protein yield, and calving interval were made for Guernsey cows from England and Guernsey Island using genomic EBV, with training sets including 197 de-regressed proofs of genotyped bulls, with cows selected from among 1,440 genotyped cows using different genotyping strategies. Accuracies of predictions were tested using 10-fold cross-validation among the cows. Genomic EBV were predicted using 4 different methods: (1) pedigree BLUP, (2) genomic BLUP using only bulls, (3) univariate genomic BLUP using bulls and cows, and (4) bivariate genomic BLUP. Genotyping cows with phenotypes and using their data for the prediction of single nucleotide polymorphism effects increased the correlation between genomic EBV and phenotypes compared with using only bulls by 0.163±0.022 for milk yield, 0.111±0.021 for fat yield, and 0.113±0.018 for protein yield; a decrease of 0.014±0.010 for calving interval from a low base was the only exception. Genetic correlation between phenotypes from bulls and cows were approximately 0.6 for all yield traits and significantly different from 1. Only a very small change occurred in correlation between genomic EBV and phenotypes when using the bivariate model. It was always better to genotype all the cows, but when only half of the cows were genotyped, a divergent selection strategy was better compared with the random or directional selection approach. Divergent selection of 30% of the cows remained superior for the yield traits in 8 of 10 folds. Copyright © 2017 American Dairy Science Association. Published by
Choi, Taejeong; Lim, Dajeong; Park, Byoungho; Sharma, Aditi; Kim, Jong-Joo; Kim, Sidong; Lee, Seung Hwan
Intramuscular fat is one of the meat quality traits that is considered in the selection strategies for Hanwoo (Korean cattle). Different methods are used to estimate the breeding value of selection candidates. In the present work we focused on accuracy of different genotype relationship matrices as described by forni and pedigree based relationship matrix. The data set included a total of 778 animals that were genotyped for BovineSNP50 BeadChip. Among these 778 animals, 72 animals were sires for 706 reference animals and were used as a validation dataset. Single trait animal model (best linear unbiased prediction and genomic best linear unbiased prediction) was used to estimate the breeding values from genomic and pedigree information. The diagonal elements for the pedigree based coefficients were slightly higher for the genomic relationship matrices (GRM) based coefficients while off diagonal elements were considerably low for GRM based coefficients. The accuracy of breeding value for the pedigree based relationship matrix (A) was 13% while for GRM (GOF, G05, and Yang) it was 0.37, 0.45, and 0.38, respectively. Accuracy of GRM was 1.5 times higher than A in this study. Therefore, genomic information will be more beneficial than pedigree information in the Hanwoo breeding program.
Thomasen, Jørn Rind; Sørensen, Anders Christian
A simulation study was performed for testing the effect of using reproductive technologies in a genomic dairy cattle young bull breeding scheme. The breeding scheme parameters: 1) number of donors, 2) number of progeny per donor, 3) age of the donor, 4) number of sires, and 5) reliability...... of genomic breeding values. The breeding schemes were evaluated according to genetic gain and rate of inbreeding. The relative gain by use of reproductive technologies is 11 to 84 percent points depending on the choice of other breeding scheme parameters. A large donor program with high selection intensity...... of sires provides the highest genetic gain. A relatively higher genetic gain is obtained for higher reliability of GEBV. Extending the donor program and number of selected bulls has a major effect of reducing the rate of inbreeding without compromising genetic gain....
Full Text Available Crossbreeding is a common strategy used in tropical and subtropical regions to enhance beef production, and having accurate knowledge of breed composition is essential for the success of a crossbreeding program. Although pedigree records have been traditionally used to obtain the breed composition of crossbred cattle, the accuracy of pedigree-based breed composition can be reduced by inaccurate and/or incomplete records and Mendelian sampling. Breed composition estimation from genomic data has multiple advantages including higher accuracy without being affected by missing, incomplete, or inaccurate records and the ability to be used as independent authentication of breed in breed-labeled beef products. The present study was conducted with 676 Angus–Brahman crossbred cattle with genotype and pedigree information to evaluate the feasibility and accuracy of using genomic data to determine breed composition. We used genomic data in parametric and non-parametric methods to detect population structure due to differences in breed composition while accounting for the confounding effect of close familial relationships. By applying principal component analysis (PCA and the maximum likelihood method of ADMIXTURE to genomic data, it was possible to successfully characterize population structure resulting from heterogeneous breed ancestry, while accounting for close familial relationships. PCA results offered additional insight into the different hierarchies of genetic variation structuring. The first principal component was strongly correlated with Angus–Brahman proportions, and the second represented variation within animals that have a relatively more extended Brangus lineage—indicating the presence of a distinct pattern of genetic variation in these cattle. Although there was strong agreement between breed proportions estimated from pedigree and genetic information, there were significant discrepancies between these two methods for certain animals
Bouwman, A.C.; Daetwyler, H.D.; Chamberlain, Amanda J.; Ponce, Carla Hurtado; Sargolzaei, Mehdi; Schenkel, Flavio S.; Sahana, Goutam; Govignon-Gion, Armelle; Boitard, Simon; Dolezal, Marlies; Pausch, Hubert; Brøndum, Rasmus F.; Bowman, Phil J.; Thomsen, Bo; Guldbrandtsen, Bernt; Lund, Mogens S.; Servin, Bertrand; Garrick, Dorian J.; Reecy, James M.; Vilkki, Johanna; Bagnato, Alessandro; Wang, Min; Hoff, Jesse L.; Schnabel, Robert D.; Taylor, Jeremy F.; Vinkhuyzen, Anna A.E.; Panitz, Frank; Bendixen, Christian; Holm, Lars-Erik; Gredler, Birgit; Hozé, Chris; Boussaha, Mekki; Sanchez, Marie Pierre; Rocha, Dominique; Capitan, Aurelien; Tribout, Thierry; Barbat, Anne; Croiseau, Pascal; Drögemüller, Cord; Jagannathan, Vidhya; Vander Jagt, Christy; Crowley, John J.; Bieber, Anna; Purfield, Deirdre C.; Berry, Donagh P.; Emmerling, Reiner; Götz, Kay Uwe; Frischknecht, Mirjam; Russ, Ingolf; Sölkner, Johann; Tassell, van Curtis P.; Fries, Ruedi; Stothard, Paul; Veerkamp, R.F.; Boichard, Didier; Goddard, Mike E.; Hayes, Ben J.
Whole genome sequence data (BAM format) of 234 bovine individuals aligned to UMD3.1. The aim of the study was to identify genetic variants (SNPs and indels) for downstream analysis such as imputation, GWAS, and detection of lethal recessives. Additional sequences for later 1000 bull genomes runs can
Zhan, Bujie; Fadista, João; Thomsen, Bo
sequence of a single Holstein Friesian bull with data from single nucleotide polymorphism (SNP) and comparative genomic hybridization (CGH) array technologies to determine a comprehensive spectrum of genomic variation. The performance of resequencing SNP detection was assessed by combining SNPs that were...... of split-read and read-pair approaches proved to be complementary in finding different signatures. CNVs were identified on the basis of the depth of sequenced reads, and by using SNP and CGH arrays. Conclusions Our results provide high resolution mapping of diverse classes of genomic variation...... in an individual bovine genome and demonstrate that structural variation surpasses sequence variation as the main component of genomic variability. Better accuracy of SNP detection was achieved with little loss of sensitivity when algorithms that implemented mapping quality were used. IBD regions were found...
Zhang, Wei; Zhang, Mingyi; Zhu, Xianwen; Cao, Yaping; Sun, Qing; Ma, Guojia; Chao, Shiaoman; Yan, Changhui; Xu, Steven S; Cai, Xiwen
This work pinpointed the goatgrass chromosomal segment in the wheat B genome using modern cytogenetic and genomic technologies, and provided novel insights into the origin of the wheat B genome. Wheat is a typical allopolyploid with three homoeologous subgenomes (A, B, and D). The donors of the subgenomes A and D had been identified, but not for the subgenome B. The goatgrass Aegilops speltoides (genome SS) has been controversially considered a possible candidate for the donor of the wheat B genome. However, the relationship of the Ae. speltoides S genome with the wheat B genome remains largely obscure. The present study assessed the homology of the B and S genomes using an integrative cytogenetic and genomic approach, and revealed the contribution of Ae. speltoides to the origin of the wheat B genome. We discovered noticeable homology between wheat chromosome 1B and Ae. speltoides chromosome 1S, but not between other chromosomes in the B and S genomes. An Ae. speltoides-originated segment spanning a genomic region of approximately 10.46 Mb was detected on the long arm of wheat chromosome 1B (1BL). The Ae. speltoides-originated segment on 1BL was found to co-evolve with the rest of the B genome. Evidently, Ae. speltoides had been involved in the origin of the wheat B genome, but should not be considered an exclusive donor of this genome. The wheat B genome might have a polyphyletic origin with multiple ancestors involved, including Ae. speltoides. These novel findings will facilitate genome studies in wheat and other polyploids.
Full Text Available Abstract Missing genotypes are a common feature of high density SNP datasets obtained using SNP chip technology and this is likely to decrease the accuracy of genomic selection. This problem can be circumvented by imputing the missing genotypes with estimated genotypes. When implementing imputation, the criteria used for SNP data quality control and whether to perform imputation before or after data quality control need to consider. In this paper, we compared six strategies of imputation and quality control using different imputation methods, different quality control criteria and by changing the order of imputation and quality control, against a real dataset of milk production traits in Chinese Holstein cattle. The results demonstrated that, no matter what imputation method and quality control criteria were used, strategies with imputation before quality control performed better than strategies with imputation after quality control in terms of accuracy of genomic selection. The different imputation methods and quality control criteria did not significantly influence the accuracy of genomic selection. We concluded that performing imputation before quality control could increase the accuracy of genomic selection, especially when the rate of missing genotypes is high and the reference population is small.
Shashikanth; Yadav, B.R.
Genomic DNA of 15 randomly selected unrelated animals and from two sire families (11 animals) of the Sahiwal breed of Zebu cattle were investigated. Four oligonucleotide probes - (GTG) 5 , (TCC) 5 , (GT) 8 and (GT) 12 - were used on genomic DNA digested with restriction enzymes AluI, HinfI, MboI, EcoRI and HaeIII in different combinations. All four probes produced multiloci fingerprints with differing levels of polymorphisms. Total bands and shared bands in the fingerprints of each individual were in the range of 2.5 to 23.0 KB. Band number ranged from 9 to 17, with 0.48 average band sharing. Probes (GT) 8 , (GT) 12 and (TCC) 5 produced fingerprinting patterns of medium to low polymorphism, whereas probe (GTG) 5 produced highly polymorphic patterns. Probe (GTG) 5 in combination with the HaeIII enzyme was highly polymorphic with a heterozygosity level of 0.85, followed by (GT) 8 , (TCC) 5 and (GT) 12 with heterozygosity levels of 0.70, 0.65 and 0.30, respectively. Probe GTG 5 or its complementary sequence CAC 5 produced highly polymorphic fingerprints, indicating that the probe can be used for analysing population structure, parentage verification and identifying loci controlling quantitative traits and fertility status. (author)
Full Text Available Korean Hanwoo cattle have been subjected to intensive artificial selection over the past four decades to improve meat production traits. Another three cattle varieties very closely related to Hanwoo reside in Korea (Jeju Black and Brindle and in China (Yanbian. These breeds have not been part of a breeding scheme to improve production traits. Here, we compare the selected Hanwoo against these similar but presumed to be unselected populations to identify genomic regions that have been under recent selection pressure due to the breeding program. Rsb statistics were used to contrast the genomes of Hanwoo versus a pooled sample of the three unselected population (UN. We identified 37 significant SNPs (FDR corrected in the HW/UN comparison and 21 known protein coding genes were within 1 MB to the identified SNPs. These genes were previously reported to affect traits important for meat production (14 genes, reproduction including mammary gland development (3 genes, coat color (2 genes, and genes affecting behavioral traits in a broader sense (2 genes. We subsequently sequenced (Illumina HiSeq 2000 platform 10 individuals of the brown Hanwoo and the Chinese Yanbian to identify SNPs within the candidate genomic regions. Based on allele frequency differences, haplotype structures, and literature research, we singled out one non-synonymous SNP in the APP gene (APP: c.569C>T, Ala199Val and predicted the mutational effect on the protein structure. We found that protein-protein interactions might be impaired due to increased exposed hydrophobic surfaces of the mutated protein. The APP gene has also been reported to affect meat tenderness in pigs and obesity in humans. Meat tenderness has been linked to intramuscular fat content, which is one of the main breeding goals for brown Hanwoo, potentially supporting a causal influence of the herein described nsSNP in the APP gene.
Lim, Dajeong; Strucken, Eva M; Choi, Bong Hwan; Chai, Han Ha; Cho, Yong Min; Jang, Gul Won; Kim, Tae-Hun; Gondro, Cedric; Lee, Seung Hwan
Korean Hanwoo cattle have been subjected to intensive artificial selection over the past four decades to improve meat production traits. Another three cattle varieties very closely related to Hanwoo reside in Korea (Jeju Black and Brindle) and in China (Yanbian). These breeds have not been part of a breeding scheme to improve production traits. Here, we compare the selected Hanwoo against these similar but presumed to be unselected populations to identify genomic regions that have been under recent selection pressure due to the breeding program. Rsb statistics were used to contrast the genomes of Hanwoo versus a pooled sample of the three unselected population (UN). We identified 37 significant SNPs (FDR corrected) in the HW/UN comparison and 21 known protein coding genes were within 1 MB to the identified SNPs. These genes were previously reported to affect traits important for meat production (14 genes), reproduction including mammary gland development (3 genes), coat color (2 genes), and genes affecting behavioral traits in a broader sense (2 genes). We subsequently sequenced (Illumina HiSeq 2000 platform) 10 individuals of the brown Hanwoo and the Chinese Yanbian to identify SNPs within the candidate genomic regions. Based on allele frequency differences, haplotype structures, and literature research, we singled out one non-synonymous SNP in the APP gene (APP: c.569C>T, Ala199Val) and predicted the mutational effect on the protein structure. We found that protein-protein interactions might be impaired due to increased exposed hydrophobic surfaces of the mutated protein. The APP gene has also been reported to affect meat tenderness in pigs and obesity in humans. Meat tenderness has been linked to intramuscular fat content, which is one of the main breeding goals for brown Hanwoo, potentially supporting a causal influence of the herein described nsSNP in the APP gene.
Bárcenas-Reyes, I; Loza-Rubio, E; Cantó-Alarcón, G J; Luna-Cozar, J; Enríquez-Vázquez, A; Barrón-Rodríguez, R J; Milián-Suazo, F
Phylogenetic analysis of the rabies virus in molecular epidemiology has been traditionally performed on partial sequences of the genome, such as the N, G, and P genes; however, that approach raises concerns about the discriminatory power compared to whole genome sequencing. In this study we characterized four strains of the rabies virus isolated from cattle in Querétaro, Mexico by comparing the whole genome sequence to that of strains from the American, European and Asian continents. Four cattle brain samples positive to rabies and characterized as AgV11, genotype 1, were used in the study. A cDNA sequence was generated by reverse transcription PCR (RT-PCR) using oligo dT. cDNA samples were sequenced in an Illumina NextSeq 500 platform. The phylogenetic analysis was performed with MEGA 6.0. Minimum evolution phylogenetic trees were constructed with the Neighbor-Joining method and bootstrapped with 1000 replicates. Three large and seven small clusters were formed with the 26 sequences used. The largest cluster grouped strains from different species in South America: Brazil, and the French Guyana. The second cluster grouped five strains from Mexico. A Mexican strain reported in a different study was highly related to our four strains, suggesting common source of infection. The phylogenetic analysis shows that the type of host is different for the different regions in the American Continent; rabies is more related to bats. It was concluded that the rabies virus in central Mexico is genetically stable and that it is transmitted by the vampire bat Desmodus rotundus. Copyright © 2017 Elsevier Ltd. All rights reserved.
Sahana, G; Guldbrandtsen, B; Thomsen, B; Holm, L-E; Panitz, F; Brøndum, R F; Bendixen, C; Lund, M S
Mastitis is a mammary disease that frequently affects dairy cattle. Despite considerable research on the development of effective prevention and treatment strategies, mastitis continues to be a significant issue in bovine veterinary medicine. To identify major genes that affect mastitis in dairy cattle, 6 chromosomal regions on Bos taurus autosome (BTA) 6, 13, 16, 19, and 20 were selected from a genome scan for 9 mastitis phenotypes using imputed high-density single nucleotide polymorphism arrays. Association analyses using sequence-level variants for the 6 targeted regions were carried out to map causal variants using whole-genome sequence data from 3 breeds. The quantitative trait loci (QTL) discovery population comprised 4,992 progeny-tested Holstein bulls, and QTL were confirmed in 4,442 Nordic Red and 1,126 Jersey cattle. The targeted regions were imputed to the sequence level. The highest association signal for clinical mastitis was observed on BTA 6 at 88.97 Mb in Holstein cattle and was confirmed in Nordic Red cattle. The peak association region on BTA 6 contained 2 genes: vitamin D-binding protein precursor (GC) and neuropeptide FF receptor 2 (NPFFR2), which, based on known biological functions, are good candidates for affecting mastitis. However, strong linkage disequilibrium in this region prevented conclusive determination of the causal gene. A different QTL on BTA 6 located at 88.32 Mb in Holstein cattle affected mastitis. In addition, QTL on BTA 13 and 19 were confirmed to segregate in Nordic Red cattle and QTL on BTA 16 and 20 were confirmed in Jersey cattle. Although several candidate genes were identified in these targeted regions, it was not possible to identify a gene or polymorphism as the causal factor for any of these regions. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Sahana, Goutam; Guldbrandtsen, Bernt; Bendixen, Christian
component in a linear mixed model. A total of 90 bulls’ whole genomes were sequenced with a coverage > 10X. Sequence reads were aligned to the cattle reference genome and polymorphisms in candidate regions were identified when one or more samples differed from the reference sequence. The polymorphisms...... Factor Receptor Alpha (LIFR) emerged as a strong candidate gene for mastitis resistance. The LIFR gene is involved in acute phase response and is expressed in saliva and mammary gland....
Speidel, S E; Buckley, B A; Boldt, R J; Enns, R M; Lee, J; Spangler, M L; Thomas, M G
Reproductive performance is the most important component of cattle production from the standpoint of economic sustainability of commercial beef enterprises. Heifer Pregnancy (HPG) and Stayability (STAY) genetic predictions are two selection tools published by the Red Angus Association of America (RAAA) to assist with improvements in reproductive performance. Given the importance of HPG and STAY to the profitability of commercial beef enterprises, the objective of this study was to identify QTL associated with both HPG and STAY in Red Angus cattle. A genome wide association study (GWAS) was performed using deregressed HPG and STAY EBV, calculated using a single trait animal model and a three generation pedigree with data from the Spring 2015 RAAA national cattle evaluation. Each individual animal possessed 74,659 SNP genotypes. Individual animals with a deregressed EBV reliability >0.05 were merged with the genotype file and marker quality control was performed. Criteria for sifting genotypes consisted of removing those markers where any of the following were found: average call rate less than 0.85, minor allele frequency 0.99). These criteria resulted in 2,664 animals with 62,807 SNP available for GWAS. Association studies were performed using a Bayes Cπ model in the BOLT software package. Marker significance was calculated as the posterior probability of inclusion (PPI), or the number of instances a specific marker was sampled divided by the total number of samples retained from the Markov chain Monte Carlo chains. Nine markers, with a PPI ≥ 3% were identified as QTL associated with HPG on BTA 1, 11, 13, 23 and 29. Twelve markers, with a PPI ≥ 75% were identified as QTL associated with STAY on BTA 6, 8, 9, 12, 15, 18, 22 and 23.
Eriksson, S; Jonas, E; Rydhmer, L; Röcklinsberg, H
The hot topic of genetic modification and genome editing is sometimes presented as a rapid solution to various problems in the field of animal breeding and genetics. These technologies hold potential for future use in agriculture but we need to be aware of difficulties in large-scale application and integration in breeding schemes. In this review, we discuss applications of both classical genetic modifications (GM) using vectors and genome editing in dairy cattle breeding. We use an interdisciplinary approach considering both ethical and animal breeding perspectives. Decisions on how to make use of these techniques need to be made based not only on what is possible, but on what is reasonable to do. Principles of animal integrity, naturalness, risk perception, and animal welfare issues are examples of ethically relevant factors to consider. These factors also influence public perception and decisions about regulations by authorities. We need to acknowledge that we lack complete understanding of the genetic background of complex traits. It may be difficult, therefore, to predict the full effect of certain modifications in large-scale breeding programs. We present 2 potential applications: genome editing to dispense with dehorning, and insertion of human genes in bovine genomes to improve udder health as an example of classical GM. Both of these cases could be seen as beneficial for animal welfare but they differ in other aspects. In the former case, a genetic variant already present within the species is introduced, whereas in the latter case, transgenic animals are generated-this difference may influence how society regards the applications. We underline that the use of GM, as well as genome editing, of farm animals such as cattle is not independent of the context, and should be considered as part of an entire process, including, for example, the assisted reproduction technology that needs to be used. We propose that breeding organizations and breeding companies
Full Text Available Abstract Background One of the goals of livestock genomics research is to identify the genetic differences responsible for variation in phenotypic traits, particularly those of economic importance. Characterizing the genetic variation in livestock species is an important step towards linking genes or genomic regions with phenotypes. The completion of the bovine genome sequence and recent advances in DNA sequencing technology allow for in-depth characterization of the genetic variations present in cattle. Here we describe the whole-genome resequencing of two Bos taurus bulls from distinct breeds for the purpose of identifying and annotating novel forms of genetic variation in cattle. Results The genomes of a Black Angus bull and a Holstein bull were sequenced to 22-fold and 19-fold coverage, respectively, using the ABI SOLiD system. Comparisons of the sequences with the Btau4.0 reference assembly yielded 7 million single nucleotide polymorphisms (SNPs, 24% of which were identified in both animals. Of the total SNPs found in Holstein, Black Angus, and in both animals, 81%, 81%, and 75% respectively are novel. In-depth annotations of the data identified more than 16 thousand distinct non-synonymous SNPs (85% novel between the two datasets. Alignments between the SNP-altered proteins and orthologues from numerous species indicate that many of the SNPs alter well-conserved amino acids. Several SNPs predicted to create or remove stop codons were also found. A comparison between the sequencing SNPs and genotyping results from the BovineHD high-density genotyping chip indicates a detection rate of 91% for homozygous SNPs and 81% for heterozygous SNPs. The false positive rate is estimated to be about 2% for both the Black Angus and Holstein SNP sets, based on follow-up genotyping of 422 and 427 SNPs, respectively. Comparisons of read depth between the two bulls along the reference assembly identified 790 putative copy-number variations (CNVs. Ten
de Rezende Neves, Haroldo Henrique; Carvalheiro, Roberto; de Queiroz, Sandra Aidar
Simulation studies allow addressing consequences of selection schemes, helping to identify effective strategies to enable genetic gain and maintain genetic diversity. The aim of this study was to evaluate the long-term impact of genomic selection (GS) in genetic progress and genetic diversity of beef cattle. Forward-in-time simulation generated a population with pattern of linkage disequilibrium close to that previously reported for real beef cattle populations. Different scenarios of GS and traditional pedigree-based BLUP (PBLUP) selection were simulated for 15 generations, mimicking selection for female reproduction and meat quality. For GS scenarios, an alternative selection criterion was simulated (wGBLUP), intended to enhance long-term gains by attributing more weight to favorable alleles with low frequency. GS allowed genetic progress up to 40% greater than PBLUP, for female reproduction and meat quality. The alternative criterion wGBLUP did not increase long-term response, although allowed reducing inbreeding rates and loss of favorable alleles. The results suggest that GS outperforms PBLUP when the selected trait is under less polygenic background and that attributing more weight to low-frequency favorable alleles can reduce inbreeding rates and loss of favorable alleles in GS.
Full Text Available Genomic structural variations represent an important source of genetic variation in mammal genomes, thus, they are commonly related to phenotypic expressions. In this work, ∼ 770,000 single nucleotide polymorphism genotypes from 506 animals from 19 cattle breeds were analyzed. A simple LD-based structural variation was defined, and a genome-wide analysis was performed. After applying some quality control filters, for each breed and each chromosome we calculated the linkage disequilibrium (r2 of short range (≤ 100 Kb. We sorted SNP pairs by distance and obtained a set of LD means (called the expected means using bins of 5 Kb. We identified 15,246 segments of at least 1 Kb, among the 19 breeds, consisting of sets of at least 3 adjacent SNPs so that, for each SNP, r2 within its neighbors in a 100 Kb range, to the right side of that SNP, were all bigger than, or all smaller than, the corresponding expected mean, and their P-value were significant after a Benjamini-Hochberg multiple testing correction. In addition, to account just for homogeneously distributed regions we considered only SNPs having at least 15 SNP neighbors within 100 Kb. We defined such segments as structural variations. By grouping all variations across all animals in the sample we defined 9,146 regions, involving a total of 53,137 SNPs; representing the 6.40% (160.98 Mb from the bovine genome. The identified structural variations covered 3,109 genes. Clustering analysis showed the relatedness of breeds given the geographic region in which they are evolving. In summary, we present an analysis of structural variations based on the deviation of the expected short range LD between SNPs in the bovine genome. With an intuitive and simple definition based only on SNPs data it was possible to discern closeness of breeds due to grouping by geographic region in which they are evolving.
Vallée, A; Daures, J; van Arendonk, J A M; Bovenhuis, H
Behavior, type traits, and muscular development are of interest for beef cattle breeding. Genome-wide association studies (GWAS) enable the identification of candidate genes, which enables gene-based selection and provides insight in the genetic architecture of these traits. The objective of the current study was to perform a GWAS for 3 behavior traits, 12 type traits, and muscular development in Charolais cattle. Behavior traits, including aggressiveness at parturition, aggressiveness during gestation period, and maternal care, were scored by farmers. Type traits, including udder conformation, teat, feet and legs, and locomotion, were scored by trained classifiers. Data used in the GWAS consisted of 3,274 cows with phenotypic records and genotyping information for 44,930 SNP. When SNP had a false discovery rate (FDR) smaller than 0.05, they were referred to as significant. When SNP had a FDR between 0.05 and 0.20, they were referred to as suggestive. Four significant and 12 suggestive regions were detected for aggressiveness during gestation, maternal care, udder balance, teat thinness, teat length, foot angle, foot depth, and locomotion. These 4 significant and 12 suggestive regions were not supported by other significant SNP in close proximity. No SNP with major effects were detected for behavior and type traits, and SNP associations for these traits were spread across the genome, suggesting that behavior and type traits were influenced by many genes, each explaining a small part of genetic variance. The GWAS identified 1 region on chromosome 2 significantly associated with muscular development, which included the myostatin gene (), which is known to affect muscularity. No other regions associated with muscular development were found. Results showed that the myostatin region associated with muscular development had pleiotropic effects on udder volume, teat thinness, rear leg, and leg angle.
Brown, A; Ojango, J; Gibson, J; Coffey, M; Okeyo, M; Mrode, R
Due to the absence of accurate pedigree information, it has not been possible to implement genetic evaluations for crossbred cattle in African small-holder systems. Genomic selection techniques that do not rely on pedigree information could, therefore, be a useful alternative. The objective of this study was to examine the feasibility of using genomic selection techniques in a crossbred cattle population using data from Kenya provided by the Dairy Genetics East Africa Project. Genomic estimated breeding values for milk yield were estimated using 2 prediction methods, GBLUP and BayesC, and accuracies were calculated as the correlation between yield deviations and genomic breeding values included in the estimation process, mimicking the situation for young bulls. The accuracy of evaluation ranged from 0.28 to 0.41, depending on the validation population and prediction method used. No significant differences were found in accuracy between the 2 prediction methods. The results suggest that there is potential for implementing genomic selection for young bulls in crossbred small-holder cattle populations, and targeted genotyping and phenotyping should be pursued to facilitate this. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Carl A. Gagnon
Full Text Available Bovine gammaherpesvirus 4 (BoHV-4 is a herpesvirus widespread in cattle populations, and with no clear disease association. Its genome contains a long unique coding region (LUR flanked by polyrepetitive DNA and 79 open reading frames (ORFs, with unique 17 ORFs, named Bo1 to Bo17. In 2009, a BoHV-4 strain was isolated (FMV09-1180503: BoHV-4-FMV from cattle with respiratory disease from Quebec, Canada, and its LUR was sequenced. Despite the overall high similarity, BoHV-4-FMV had the most divergent LUR sequence compared to the two known BoHV-4 reference strain genomes; most of the divergences were in the Bo genes and in the repeat regions. Our phylogenetic analysis based on DNA polymerase and thymidine kinase genes revealed that virus isolate was BoHV-4 gammaherpesvirus and clustered it together with European BoHV-4 strains. Because BoHV-4-FMV was isolated from animals presenting respiratory signs, we have updated the BoHV-4 Canadian cattle seroprevalence data and tried to find out whether there is a link between clinical manifestation and BoHV-4 seropositivity. An indirect immunofluorescence assay (IFA was performed with nearly 200 randomized sera of dairy cattle from two Canadian provinces, Quebec (n = 100 and Ontario (n = 91. An additional set of sera obtained from Quebec, from the healthy (n = 48 cows or from the animals experiencing respiratory or reproductive problems (n = 75, was also analyzed by IFA. BoHV-4 seroprevalence in Canadian dairy cattle was 7.9% (Quebec: 6% and Ontario: 9.9%. Among animals from the Quebec-based farms, diseased animals showed higher BoHV-4 seropositivity than healthy animals (P < 0.05, with a significant 2.494 odds ratio of being seropositive in sick compared to healthy animals. Although there is no established direct link between BoHV-4 and specific diseases, these seroprevalence data suggest the possible involvement of BoHV-4 in dairy cattle diseases.
Ma, Li-Jun; van der Does, H. Charlotte; Borkovich, Katherine A.; Coleman, Jeffrey J.; Daboussi, Marie-Josée; Di Pietro, Antonio; Dufresne, Marie; Freitag, Michael; Grabherr, Manfred; Henrissat, Bernard; Houterman, Petra M.; Kang, Seogchan; Shim, Won-Bo; Woloshuk, Charles; Xie, Xiaohui; Xu, Jin-Rong; Antoniw, John; Baker, Scott E.; Bluhm, Burton H.; Breakspear, Andrew; Brown, Daren W.; Butchko, Robert A. E.; Chapman, Sinead; Coulson, Richard; Coutinho, Pedro M.; Danchin, Etienne G. J.; Diener, Andrew; Gale, Liane R.; Gardiner, Donald M.; Goff, Stephen; Hammond-Kosack, Kim E.; Hilburn, Karen; Hua-Van, Aurélie; Jonkers, Wilfried; Kazan, Kemal; Kodira, Chinnappa D.; Koehrsen, Michael; Kumar, Lokesh; Lee, Yong-Hwan; Li, Liande; Manners, John M.; Miranda-Saavedra, Diego; Mukherjee, Mala; Park, Gyungsoon; Park, Jongsun; Park, Sook-Young; Proctor, Robert H.; Regev, Aviv; Ruiz-Roldan, M. Carmen; Sain, Divya; Sakthikumar, Sharadha; Sykes, Sean; Schwartz, David C.; Turgeon, B. Gillian; Wapinski, Ilan; Yoder, Olen; Young, Sarah; Zeng, Qiandong; Zhou, Shiguo; Galagan, James; Cuomo, Christina A.; Kistler, H. Corby; Rep, Martijn
Fusarium species are among the most important phytopathogenic and toxigenic fungi. To understand the molecular underpinnings of pathogenicity in the genus Fusarium, we compared the genomes of three phenotypically diverse species: Fusarium graminearum, Fusarium verticillioides and Fusarium oxysporum f. sp. lycopersici. Our analysis revealed lineage-specific (LS) genomic regions in F. oxysporum that include four entire chromosomes and account for more than one-quarter of the genome. LS regions are rich in transposons and genes with distinct evolutionary profiles but related to pathogenicity, indicative of horizontal acquisition. Experimentally, we demonstrate the transfer of two LS chromosomes between strains of F. oxysporum, converting a non-pathogenic strain into a pathogen. Transfer of LS chromosomes between otherwise genetically isolated strains explains the polyphyletic origin of host specificity and the emergence of new pathogenic lineages in F. oxysporum. These findings put the evolution of fungal pathogenicity into a new perspective. PMID:20237561
Raghavan, Maanasa; Skoglund, Pontus; Graf, Kelly E.
,000-year-old individual (MA-1), from Mal'ta in south-central Siberia, to an average depth of 1×. To our knowledge this is the oldest anatomically modern human genome reported to date. The MA-1 mitochondrial genome belongs to haplogroup U, which has also been found at high frequency among Upper Palaeolithic......The origins of the First Americans remain contentious. Although Native Americans seem to be genetically most closely related to east Asians, there is no consensus with regard to which specific Old World populations they are closest to. Here we sequence the draft genome of an approximately 24...... that the region was continuously occupied by humans throughout the Last Glacial Maximum. Our findings reveal that western Eurasian genetic signatures in modern-day Native Americans derive not only from post-Columbian admixture, as commonly thought, but also from a mixed ancestry of the First Americans....
Adriana L. Somavilla
Full Text Available Nelore is the most economically important cattle breed in Brazil, and the use of genetically improved animals has contributed to increased beef production efficiency. The Brazilian beef feedlot industry has grown considerably in the last decade, so the selection of animals with higher growth rates on feedlot has become quite important. Genomic selection (GS could be used to reduce generation intervals and improve the rate of genetic gains. The aim of this study was to evaluate the prediction of genomic-estimated breeding values (GEBV for average daily weight gain (ADG in 718 feedlot-finished Nelore steers. Analyses of three Bayesian model specifications [Bayesian GBLUP (BGBLUP, BayesA, and BayesCπ] were performed with four genotype panels [Illumina BovineHD BeadChip, TagSNPs, and GeneSeek High- and Low-density indicus (HDi and LDi, respectively]. Estimates of Pearson correlations, regression coefficients, and mean squared errors were used to assess accuracy and bias of predictions. Overall, the BayesCπ model resulted in less biased predictions. Accuracies ranged from 0.18 to 0.27, which are reasonable values given the heritability estimates (from 0.40 to 0.44 and sample size (568 animals in the training population. Furthermore, results from Bos taurus indicus panels were as informative as those from Illumina BovineHD, indicating that they could be used to implement GS at lower costs.
Howard, Jeremy T.; Kachman, Stephen D.; Snelling, Warren M.; Pollak, E. John; Ciobanu, Daniel C.; Kuehn, Larry A.; Spangler, Matthew L.
Cattle are reared in diverse environments and collecting phenotypic body temperature (BT) measurements to characterize BT variation across diverse environments is difficult and expensive. To better understand the genetic basis of BT regulation, a genome-wide association study was conducted utilizing crossbred steers and heifers totaling 239 animals of unknown pedigree and breed fraction. During predicted extreme heat and cold stress events, hourly tympanic and vaginal BT devices were placed in steers and heifers, respectively. Individuals were genotyped with the BovineSNP50K_v2 assay and data analyzed using Bayesian models for area under the curve (AUC), a measure of BT over time, using hourly BT observations summed across 5-days (AUC summer 5-day (AUCS5D) and AUC winter 5-day (AUCW5D)). Posterior heritability estimates were moderate to high and were estimated to be 0.68 and 0.21 for AUCS5D and AUCW5D, respectively. Moderately positive correlations between direct genomic values for AUCS5D and AUCW5D (0.40) were found, although a small percentage of the top 5 % 1-Mb windows were in common. Different sets of genes were associated with BT during winter and summer, thus simultaneous selection for animals tolerant to both heat and cold appears possible.
Liu, Shengyi; Liu, Yumei; Yang, Xinhua; Tong, Chaobo; Edwards, David; Parkin, Isobel A. P.; Zhao, Meixia; Ma, Jianxin; Yu, Jingyin; Huang, Shunmou; Wang, Xiyin; Wang, Junyi; Lu, Kun; Fang, Zhiyuan; Bancroft, Ian; Yang, Tae-Jin; Hu, Qiong; Wang, Xinfa; Yue, Zhen; Li, Haojie; Yang, Linfeng; Wu, Jian; Zhou, Qing; Wang, Wanxin; King, Graham J; Pires, J. Chris; Lu, Changxin; Wu, Zhangyan; Sampath, Perumal; Wang, Zhuo; Guo, Hui; Pan, Shengkai; Yang, Limei; Min, Jiumeng; Zhang, Dong; Jin, Dianchuan; Li, Wanshun; Belcram, Harry; Tu, Jinxing; Guan, Mei; Qi, Cunkou; Du, Dezhi; Li, Jiana; Jiang, Liangcai; Batley, Jacqueline; Sharpe, Andrew G; Park, Beom-Seok; Ruperao, Pradeep; Cheng, Feng; Waminal, Nomar Espinosa; Huang, Yin; Dong, Caihua; Wang, Li; Li, Jingping; Hu, Zhiyong; Zhuang, Mu; Huang, Yi; Huang, Junyan; Shi, Jiaqin; Mei, Desheng; Liu, Jing; Lee, Tae-Ho; Wang, Jinpeng; Jin, Huizhe; Li, Zaiyun; Li, Xun; Zhang, Jiefu; Xiao, Lu; Zhou, Yongming; Liu, Zhongsong; Liu, Xuequn; Qin, Rui; Tang, Xu; Liu, Wenbin; Wang, Yupeng; Zhang, Yangyong; Lee, Jonghoon; Kim, Hyun Hee; Denoeud, France; Xu, Xun; Liang, Xinming; Hua, Wei; Wang, Xiaowu; Wang, Jun; Chalhoub, Boulos; Paterson, Andrew H
Polyploidization has provided much genetic variation for plant adaptive evolution, but the mechanisms by which the molecular evolution of polyploid genomes establishes genetic architecture underlying species differentiation are unclear. Brassica is an ideal model to increase knowledge of polyploid evolution. Here we describe a draft genome sequence of Brassica oleracea, comparing it with that of its sister species B. rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks, asymmetrical amplification of transposable elements, differential gene co-retention for specific pathways and variation in gene expression, including alternative splicing, among a large number of paralogous and orthologous genes. Genes related to the production of anticancer phytochemicals and morphological variations illustrate consequences of genome duplication and gene divergence, imparting biochemical and morphological variation to B. oleracea. This study provides insights into Brassica genome evolution and will underpin research into the many important crops in this genus. PMID:24852848
Full Text Available Background:Relapsing fever borreliae are vector-borne bacteria responsible for febrile infection in humans in North America, Africa, Asia and in the Iberian Peninsula in Europe. Relapsing fever borreliae are phylogenetically closely related, yet they differ in pathogenicity and vectors. Their long-term taxonomy, based on geography and vector grouping, needs a re-appraisal in the genomic area. We therefore embarked into genomic analyses of relapsing fever borreliae, focusing on species found in Africa. Results:Genome-wide phylogenetic analyses group Old World Borrelia crocidurae, Borrelia hispanica, B. duttonii and B. recurrentis in one clade, and New World Borrelia turicatae and Borrelia hermsii in a second clade. Accordingly, average nucleotide identity is 99% among B. duttonii, B. recurrentis and B. crocidurae and 96% between latter borreliae and B. hispanica while the similarity is 86% between Old World and New World borreliae. Comparative genomics indicates that the Old World relapsing fever B. duttonii, B. recurrentis, B. crocidurae and B. hispanica have a 2,514-gene pan-genome and a 933-gene core genome that includes 788 chromosomal and 145 plasmidic genes. Analysing the role that natural selection has played in the evolution of Old World borreliae species revealed that 55 loci were under positive diversifying selection, including loci coding for membrane, flagellar and chemotaxis proteins, three categories associated with adaption to specific niches. Conclusions:Genomic analyses led to a reappraisal of the taxonomy of relapsing fever borreliae in Africa. These analyses suggest that B. crocidurae, B. duttonii and B. recurrentis are ecotypes of a unique genomospecies, while B. hispanica is a distinct species.
Ushiki, Norisuke; Fujitani, Hirotsugu; Shimada, Yu; Morohoshi, Tomohiro; Sekiguchi, Yuji; Tsuneda, Satoshi
The genus Nitrospira represents a dominant group of nitrite-oxidizing bacteria in natural and engineered ecosystems. This genus is phylogenetically divided into six lineages, for which vast phylogenetic and functional diversity has been revealed by recent molecular ecophysiological analyses. However, the genetic basis underlying these phenotypic differences remains largely unknown because of the lack of genome sequences representing their diversity. To gain a more comprehensive understanding of Nitrospira , we performed genomic comparisons between two Nitrospira strains (ND1 and NJ1 belonging to lineages I and II, respectively) previously isolated from activated sludge. In addition, the genomes of these strains were systematically compared with previously reported six Nitrospira genomes to reveal their similarity and presence/absence of several functional genes/operons. Comparisons of Nitrospira genomes indicated that their genomic diversity reflects phenotypic differences and versatile nitrogen metabolisms. Although most genes involved in key metabolic pathways were conserved between strains ND1 and NJ1, assimilatory nitrite reduction pathways of the two Nitrospira strains were different. In addition, the genomes of both strains contain a phylogenetically different urease locus and we confirmed their ureolytic activity. During gene annotation of strain NJ1, we found a gene cluster encoding a quorum-sensing system. From the enriched supernatant of strain NJ1, we successfully identified seven types of acyl-homoserine lactones with a range of C10-C14. In addition, the genome of strain NJ1 lacks genes relevant to flagella and the clustered regularly interspaced short palindromic repeat (CRISPR)-Cas (CRISPR-associated genes) systems, whereas most nitrifying bacteria including strain ND1 possess these genomic elements. These findings enhance our understanding of genomic plasticity and functional diversity among members of the genus Nitrospira .
Bjelland, D W; Weigel, K A; Vukasinovic, N; Nkrumah, J D
The effects of increased pedigree inbreeding in dairy cattle populations have been well documented and result in a negative impact on profitability. Recent advances in genotyping technology have allowed researchers to move beyond pedigree analysis and study inbreeding at a molecular level. In this study, 5,853 animals were genotyped for 54,001 single nucleotide polymorphisms (SNP); 2,913 cows had phenotypic records including a single lactation for milk yield (from either lactation 1, 2, 3, or 4), reproductive performance, and linear type conformation. After removing SNP with poor call rates, low minor allele frequencies, and departure from Hardy-Weinberg equilibrium, 33,025 SNP remained for analyses. Three measures of genomic inbreeding were evaluated: percent homozygosity (FPH), inbreeding calculated from runs of homozygosity (FROH), and inbreeding derived from a genomic relationship matrix (FGRM). Average FPH was 60.5±1.1%, average FROH was 3.8±2.1%, and average FGRM was 20.8±2.3%, where animals with larger values for each of the genomic inbreeding indices were considered more inbred. Decreases in total milk yield to 205d postpartum of 53, 20, and 47kg per 1% increase in FPH, FROH, and FGRM, respectively, were observed. Increases in days open per 1% increase in FPH (1.76 d), FROH (1.72 d), and FGRM (1.06 d) were also noted, as well as increases in maternal calving difficulty (0.09, 0.03, and 0.04 on a 5-point scale for FPH, FROH, and FGRM, respectively). Several linear type traits, such as strength (-0.40, -0.11, and -0.19), rear legs rear view (-0.35, -0.16, and -0.14), front teat placement (0.35, 0.25, 0.18), and teat length (-0.24, -0.14, and -0.13) were also affected by increases in FPH, FROH, and FGRM, respectively. Overall, increases in each measure of genomic inbreeding in this study were associated with negative effects on production and reproductive ability in dairy cows. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc
Full Text Available A Neolithic domestication of taurine cattle in the Fertile Crescent from local aurochsen (Bos primigenius is generally accepted, but a genetic contribution from European aurochsen has been proposed. Here we performed a survey of a large number of taurine cattle mitochondrial DNA (mtDNA control regions from numerous European breeds confirming the overall clustering within haplogroups (T1, T2 and T3 of Near Eastern ancestry, but also identifying eight mtDNAs (1.3% that did not fit in haplogroup T. Sequencing of the entire mitochondrial genome showed that four mtDNAs formed a novel branch (haplogroup R which, after the deep bifurcation that gave rise to the taurine and zebuine lineages, constitutes the earliest known split in the mtDNA phylogeny of B. primigenius. The remaining four mtDNAs were members of the recently discovered haplogroup Q. Phylogeographic data indicate that R mtDNAs were derived from female European aurochsen, possibly in the Italian Peninsula, and sporadically included in domestic herds. In contrast, the available data suggest that Q mtDNAs and T subclades were involved in the same Neolithic event of domestication in the Near East. Thus, the existence of novel (and rare taurine haplogroups highlights a multifaceted genetic legacy from distinct B. primigenius populations. Taking into account that the maternally transmitted mtDNA tends to underestimate the extent of gene flow from European aurochsen, the detection of the R mtDNAs in autochthonous breeds, some of which are endangered, identifies an unexpected reservoir of genetic variation that should be carefully preserved.
Castanera, Raúl; Pérez, Gúmer; López-Varas, Leticia; Amselem, Joëlle; LaButti, Kurt; Singan, Vasanth; Lipzen, Anna; Haridas, Sajeet; Barry, Kerrie; Grigoriev, Igor V; Pisabarro, Antonio G; Ramírez, Lucía
Coniophora olivacea is a basidiomycete fungus belonging to the order Boletales that produces brown-rot decay on dead wood of conifers. The Boletales order comprises a diverse group of species including saprotrophs and ectomycorrhizal fungi that show important differences in genome size. In this study we report the 39.07-megabase (Mb) draft genome assembly and annotation of C. olivacea. A total of 14,928 genes were annotated, including 470 putatively secreted proteins enriched in functions involved in lignocellulose degradation. Using similarity clustering and protein structure prediction we identified a new family of 10 putative lytic polysaccharide monooxygenase genes. This family is conserved in basidiomycota and lacks of previous functional annotation. Further analyses showed that C. olivacea has a low repetitive genome, with 2.91% of repeats and a restrained content of transposable elements (TEs). The annotation of TEs in four related Boletales yielded important differences in repeat content, ranging from 3.94 to 41.17% of the genome size. The distribution of insertion ages of LTR-retrotransposons showed that differential expansions of these repetitive elements have shaped the genome architecture of Boletales over the last 60 million years. Coniophora olivacea has a small, compact genome that shows macrosynteny with Coniophora puteana. The functional annotation revealed the enzymatic signature of a canonical brown-rot. The annotation and comparative genomics of transposable elements uncovered their particular contraction in the Coniophora genera, highlighting their role in the differential genome expansions found in Boletales species.
On rangelands worldwide, cattle interact with many forms of biodiversity, most obviously with vegetation and other large herbivores. Since 1995, we have been manipulating the presence of cattle, medium-sized herbivores, and mega-herbivores (elephants and giraffes) in a series of eighteen 4ha (10-acr...
Nepal, Chirag; O'Rourke, Colm J; Oliveira, Douglas Vnp
-mutation profiles and enriched pathways. High-throughput drug repositioning in seven patient-matched cell lines, chosen to reflect the genetic alterations specific for each patient group, confirmed in silico predictions of subgroup-specific vulnerabilities linked to enriched pathways. Intriguingly, patients lacking...... genes (IDH, KRAS, TP53) revealed unique oncogenic programs (mutational, structural, epi-mutational) that influence pharmacologic response in drug repositioning protocols. This genome dissection approach highlights the potential of individual mutations to induce extensive molecular heterogeneity...
Friedrich, Juliane; Brand, Bodo; Ponsuksili, Siriluck; Graunke, Katharina L; Langbein, Jan; Knaust, Jacqueline; Kühn, Christa; Schwerin, Manfred
Behaviour traits of cattle have been reported to affect important production traits, such as meat quality and milk performance as well as reproduction and health. Genetic predisposition is, together with environmental stimuli, undoubtedly involved in the development of behaviour phenotypes. Underlying molecular mechanisms affecting behaviour in general and behaviour and productions traits in particular still have to be studied in detail. Therefore, we performed a genome-wide association study in an F2 Charolais × German Holstein cross-breed population to identify genetic variants that affect behaviour-related traits assessed in an open-field and novel-object test and analysed their putative impact on milk performance. Of 37,201 tested single nucleotide polymorphism (SNPs), four showed a genome-wide and 37 a chromosome-wide significant association with behaviour traits assessed in both tests. Nine of the SNPs that were associated with behaviour traits likewise showed a nominal significant association with milk performance traits. On chromosomes 14 and 29, six SNPs were identified to be associated with exploratory behaviour and inactivity during the novel-object test as well as with milk yield traits. Least squares means for behaviour and milk performance traits for these SNPs revealed that genotypes associated with higher inactivity and less exploratory behaviour promote higher milk yields. Whether these results are due to molecular mechanisms simultaneously affecting behaviour and milk performance or due to a behaviour predisposition, which causes indirect effects on milk performance by influencing individual reactivity, needs further investigation. © 2015 Stichting International Foundation for Animal Genetics.
Gay, Laurie M; Kim, Sungeun; Fedorchak, Kyle; Kundranda, Madappa; Odia, Yazmin; Nangia, Chaitali; Battiste, James; Colon-Otero, Gerardo; Powell, Steven; Russell, Jeffery; Elvin, Julia A; Vergilio, Jo-Anne; Suh, James; Ali, Siraj M; Stephens, Philip J; Miller, Vincent A; Ross, Jeffrey S
Esthesioneuroblastoma (ENB), also known as olfactory neuroblastoma, is a rare malignant neoplasm of the olfactory mucosa. Despite surgical resection combined with radiotherapy and adjuvant chemotherapy, ENB often relapses with rapid progression. Current multimodality, nontargeted therapy for relapsed ENB is of limited clinical benefit. We queried whether comprehensive genomic profiling (CGP) of relapsed or refractory ENB can uncover genomic alterations (GA) that could identify potential targeted therapies for these patients. CGP was performed on formalin-fixed, paraffin-embedded sections from 41 consecutive clinical cases of ENBs using a hybrid-capture, adaptor ligation based next-generation sequencing assay to a mean coverage depth of 593X. The results were analyzed for base substitutions, insertions and deletions, select rearrangements, and copy number changes (amplifications and homozygous deletions). Clinically relevant GA (CRGA) were defined as GA linked to drugs on the market or under evaluation in clinical trials. A total of 28 ENBs harbored GA, with a mean of 1.5 GA per sample. Approximately half of the ENBs (21, 51%) featured at least one CRGA, with an average of 1 CRGA per sample. The most commonly altered gene was TP53 (17%), with GA in PIK3CA , NF1 , CDKN2A , and CDKN2C occurring in 7% of samples. We report comprehensive genomic profiles for 41 ENB tumors. CGP revealed potential new therapeutic targets, including targetable GA in the mTOR, CDK and growth factor signaling pathways, highlighting the clinical value of genomic profiling in ENB. Comprehensive genomic profiling of 41 relapsed or refractory ENBs reveals recurrent alterations or classes of mutation, including amplification of tyrosine kinases encoded on chromosome 5q and mutations affecting genes in the mTOR/PI3K pathway. Approximately half of the ENBs (21, 51%) featured at least one clinically relevant genomic alteration (CRGA), with an average of 1 CRGA per sample. The most commonly altered
Full Text Available The human and bovine bacterial pathogen Streptococcus agalactiae (Group B Streptococcus, GBS expresses a thick polysaccharide capsule that constitutes a major virulence factor and vaccine target. GBS can be classified into ten distinct serotypes differing in the chemical composition of their capsular polysaccharide. However, non-typeable strains that do not react with anti-capsular sera are frequently isolated from colonized and infected humans and cattle. To gain a comprehensive insight into the molecular basis for the loss of capsule expression in GBS, a collection of well-characterized non-typeable strains was investigated by genome sequencing. Genome based phylogenetic analysis extended to a wide population of sequenced strains confirmed the recently observed high clonality among GBS lineages mainly containing human strains, and revealed a much higher degree of diversity in the bovine population. Remarkably, non-typeable strains were equally distributed in all lineages. A number of distinct mutations in the cps operon were identified that were apparently responsible for inactivation of capsule synthesis. The most frequent genetic alterations were point mutations leading to stop codons in the cps genes, and the main target was found to be cpsE encoding the portal glycosyl transferase of capsule biosynthesis. Complementation of strains carrying missense mutations in cpsE with a wild-type gene restored capsule expression allowing the identification of amino acid residues essential for enzyme activity.
Fabricio B.M. Arraes
Full Text Available The annotation and comparative analyses of the genomes of Mycoplasma synoviae and Mycoplasma hyopneumonie, as well as of other Mollicutes (a group of bacteria devoid of a rigid cell wall, has set the grounds for a global understanding of their metabolism and infection mechanisms. According to the annotation data, M. synoviae and M. hyopneumoniae are able to perform glycolytic metabolism, but do not possess the enzymatic machinery for citrate and glyoxylate cycles, gluconeogenesis and the pentose phosphate pathway. Both can synthesize ATP by lactic fermentation, but only M. synoviae can convert acetaldehyde to acetate. Also, our genome analysis revealed that M. synoviae and M. hyopneumoniae are not expected to synthesize polysaccharides, but they can take up a variety of carbohydrates via the phosphoenolpyruvate-dependent phosphotransferase system (PEP-PTS. Our data showed that these two organisms are unable to synthesize purine and pyrimidine de novo, since they only possess the sequences which encode salvage pathway enzymes. Comparative analyses of M. synoviae and M. hyopneumoniae with other Mollicutes have revealed differential genes in the former two genomes coding for enzymes that participate in carbohydrate, amino acid and nucleotide metabolism and host-pathogen interaction. The identification of these metabolic pathways will provide a better understanding of the biology and pathogenicity of these organisms.
Stafuzza, Nedenia Bonvino; Zerlotini, Adhemar; Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto
Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs.
Nedenia Bonvino Stafuzza
Full Text Available Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose, Gyr, Girolando and Holstein (dairy production. A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs and 3,828,041 insertions/deletions (InDels were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs.
Akanno, Everestus C; Plastow, Graham; Fitzsimmons, Carolyn; Miller, Stephen P; Baron, Vern; Ominski, Kimberly; Basarab, John A
The aim of this study was to identify SNP markers that associate with variation in beef heifer reproduction and performance of their calves. A genome-wide association study was performed by means of the generalized quasi-likelihood score (GQLS) method using heifer genotypes from the BovineSNP50 BeadChip and estimated breeding values for pre-breeding body weight (PBW), pregnancy rate (PR), calving difficulty (CD), age at first calving (AFC), calf birth weight (BWT), calf weaning weight (WWT), and calf pre-weaning average daily gain (ADG). Data consisted of 785 replacement heifers from three Canadian research herds, namely Brandon Research Centre, Brandon, Manitoba, University of Alberta Roy Berg Kinsella Ranch, Kinsella, Alberta, and Lacombe Research Centre, Lacombe, Alberta. After applying a false discovery rate correction at a 5% significance level, a total of 4, 3, 3, 9, 6, 2, and 1 SNPs were significantly associated with PBW, PR, CD, AFC, BWT, WWT, and ADG, respectively. These SNPs were located on chromosomes 1, 5-7, 9, 13-16, 19-21, 24, 25, and 27-29. Chromosomes 1, 5, and 24 had SNPs with pleiotropic effects. New significant SNPs that impact functional traits were detected, many of which have not been previously reported. The results of this study support quantitative genetic studies related to the inheritance of these traits, and provides new knowledge regarding beef cattle quantitative trait loci effects. The identification of these SNPs provides a starting point to identify genes affecting heifer reproduction traits and performance of their calves (BWT, WWT, and ADG). They also contribute to a better understanding of the biology underlying these traits and will be potentially useful in marker- and genome-assisted selection and management.
He, Hua; Chen, Si; Liang, Wei; Liu, Xiaolin
To gain further insight into the molecular mechanism of bovine muscle development, we combined mass spectrometry characterization of proteins with Illumina deep sequencing of RNAs obtained from bovine longissimus muscle (LD) at prenatal and postnatal stages. For the proteomic study, each group of LD proteins was extracted and labeled using isobaric tags for relative and absolute quantitation (iTRAQ) method. Among the 1321 proteins identified from six samples, 390 proteins were differentially expressed in embryos at day 135 post-fertilization (Emb135d) vs. 30-month-old adult cattle (Emb135d vs. 30M) samples. Gene Ontology, Cluster of Orthologous Groups and Kyoto Encyclopedia of Genes and Genomes analyses were further conducted to better understand the different functions. Furthermore, we analyzed the relationship between transcript and protein regulation between samples by direct comparison of expression levels from transcriptomic and iTRAQ-based proteomics. Association results indicated that 1295 of 1321 proteins could be mapped to transcriptome sequencing data. This study provides the most comprehensive, targeted survey of bovine LD proteins to date and has shown the power of combining transcriptomic and proteomic approaches to provide molecular insights for understanding the developmental characteristics in bovine muscle, and even in other mammals. © 2016 Stichting International Foundation for Animal Genetics.
Abstract Background Bacterial spot of tomato and pepper is caused by four Xanthomonas species and is a major plant disease in warm humid climates. The four species are distinct from each other based on physiological and molecular characteristics. The genome sequence of strain 85-10, a member of one of the species, Xanthomonas euvesicatoria (Xcv) has been previously reported. To determine the relationship of the four species at the genome level and to investigate the molecular basis of their virulence and differing host ranges, draft genomic sequences of members of the other three species were determined and compared to strain 85-10. Results We sequenced the genomes of X. vesicatoria (Xv) strain 1111 (ATCC 35937), X. perforans (Xp) strain 91-118 and X. gardneri (Xg) strain 101 (ATCC 19865). The genomes were compared with each other and with the previously sequenced Xcv strain 85-10. In addition, the molecular features were predicted that may be required for pathogenicity including the type III secretion apparatus, type III effectors, other secretion systems, quorum sensing systems, adhesins, extracellular polysaccharide, and lipopolysaccharide determinants. Several novel type III effectors from Xg strain 101 and Xv strain 1111 genomes were computationally identified and their translocation was validated using a reporter gene assay. A homolog to Ax21, the elicitor of XA21-mediated resistance in rice, and a functional Ax21 sulfation system were identified in Xcv. Genes encoding proteins with functions mediated by type II and type IV secretion systems have also been compared, including enzymes involved in cell wall deconstruction, as contributors to pathogenicity. Conclusions Comparative genomic analyses revealed considerable diversity among bacterial spot pathogens, providing new insights into differences and similarities that may explain the diverse nature of these strains. Genes specific to pepper pathogens, such as the O-antigen of the lipopolysaccharide cluster
Full Text Available Abstract Background Marbling (intramuscular fat is a valuable trait that impacts on meat quality and an important factor determining price of beef in the Korean beef market. Animals that are destined for this high marbling market are fed a high concentrate ration for approximately 30 months in the Korean finishing farms. However, this feeding strategy leads to inefficiencies and excessive fat production. This study aimed to identify candidate genes and pathways associated with intramuscular fat deposition on highly divergent marbling phenotypes in adult Hanwoo cattle. Results Bovine genome array analysis was conducted to detect differentially expressed genes (DEGs in m. longissimus with divergent marbling phenotype (marbling score 2 to 7. Three data-processing methods (MAS5.0, GCRMA and RMA were used to test for differential expression (DE. Statistical analysis identified 21 significant transcripts from at least two data-processing methods (P . All 21 differentially expressed genes were validated by real-time PCR. Results showed a high concordance in the gene expression fold change between the microarrays and the real time PCR data. Gene Ontology (GO and pathway analysis demonstrated that some genes (ADAMTS4, CYP51A and SQLE over expressed in high marbled animals are involved in a protein catabolic process and a cholesterol biosynthesis process. In addition, pathway analysis also revealed that ADAMTS4 is activated by three regulators (IL-17A, TNFα and TGFβ1. QRT-PCR was used to investigate gene expression of these regulators in muscle with divergent intramuscular fat contents. The results demonstrate that ADAMTS4 and TGFβ1 are associated with increasing marbling fat. An ADAMTS4/TGFβ1 pathway seems to be associated with the phenotypic differences between high and low marbled groups. Conclusions Marbling differences are possibly a function of complex signaling pathway interactions between muscle and fat. These results suggest that ADAMTS4
Piccoli, Mario L; Brito, Luiz F; Braccini, José; Cardoso, Fernando F; Sargolzaei, Mehdi; Schenkel, Flávio S
Genomic selection (GS) has played an important role in cattle breeding programs. However, genotyping prices are still a challenge for implementation of GS in beef cattle and there is still a lack of information about the use of low-density Single Nucleotide Polymorphisms (SNP) chip panels for genomic predictions in breeds such as Brazilian Braford and Hereford. Therefore, this study investigated the effect of using imputed genotypes in the accuracy of genomic predictions for twenty economically important traits in Brazilian Braford and Hereford beef cattle. Various scenarios composed by different percentages of animals with imputed genotypes and different sizes of the training population were compared. De-regressed EBVs (estimated breeding values) were used as pseudo-phenotypes in a Genomic Best Linear Unbiased Prediction (GBLUP) model using two different mimicked panels derived from the 50 K (8 K and 15 K SNP panels), which were subsequently imputed to the 50 K panel. In addition, genomic prediction accuracies generated from a 777 K SNP (imputed from the 50 K SNP) were presented as another alternate scenario. The accuracy of genomic breeding values averaged over the twenty traits ranged from 0.38 to 0.40 across the different scenarios. The average losses in expected genomic estimated breeding values (GEBV) accuracy (accuracy obtained from the inverse of the mixed model equations) relative to the true 50 K genotypes ranged from -0.0007 to -0.0012 and from -0.0002 to -0.0005 when using the 50 K imputed from the 8 K or 15 K, respectively. When using the imputed 777 K panel the average losses in expected GEBV accuracy was -0.0021. The average gain in expected EBVs accuracy by including genomic information when compared to simple BLUP was between 0.02 and 0.03 across scenarios and traits. The percentage of animals with imputed genotypes in the training population did not significantly influence the validation accuracy. However, the size of the training
Full Text Available To re-evaluate virulence of Mycobacterium tuberculosis (M. tb in cattle, we experimentally infected calves with M. tb andMycobacterium bovisvia intratracheal injection at a dose of 2.0×10(7 CFU and observed the animals for 33 weeks. The intradermal tuberculin test and IFN-γin vitro release assay showed that both M. tb and M. bovis induced similar responses. Immunohistochemical staining of pulmonary lymph nodes indicated that the antigen MPB83 of both M. tb and M. bovis were similarly distributed in the tissue samples. Histological examinations showed all of the infected groups exhibited neutrophil infiltration to similar extents. Although the infected cattle did not develop granulomatous inflammation, the metabolic profiles changed significantly, which were characterized by a change in energy production pathways and increased concentrations of N-acetyl glycoproteins. Glycolysis was induced in the infected cattle by decreased glucose and increased lactate content, and enhanced fatty acid β-oxidation was induced by decreased TG content, and decreased gluconeogenesis indicated by the decreased concentration of glucogenic and ketogenic amino acids promoted utilization of substances other than glucose as energy sources. In addition, an increase in acute phase reactive serum glycoproteins, together with neutrophil infiltration and increased of IL-1β production indicated an early inflammatory response before granuloma formation. In conclusion, this study indicated that both M. tb and M.bovis were virulent to cattle. Therefore, it is likely that cattle with M. tb infections would be critical to tuberculosis transmission from cattle to humans. Nuclear magnetic resonance was demonstrated to be an efficient method to systematically evaluate M. tb and M. bovi sinfection in cattle.
Cao, Peili; Guo, Dongchun; Liu, Jiasen; Jiang, Qian; Xu, Zhuofei; Qu, Liandong
Pasteurella multocida, a Gram-negative opportunistic pathogen, has led to a broad range of diseases in mammals and birds, including fowl cholera in poultry, pneumonia and atrophic rhinitis in swine and rabbit, hemorrhagic septicemia in cattle, and bite infections in humans. In order to better interpret the genetic diversity and adaptation evolution of this pathogen, seven genomes of P. multocida strains isolated from fowls, rabbit and pigs were determined by using high-throughput sequencing approach. Together with publicly available P. multocida genomes, evolutionary features were systematically analyzed in this study. Clustering of 70,565 protein-coding genes showed that the pangenome of 33 P. multocida strains was composed of 1,602 core genes, 1,364 dispensable genes, and 1,070 strain-specific genes. Of these, we identified a full spectrum of genes related to virulence factors and revealed genetic diversity of these potential virulence markers across P. multocida strains, e.g., bcbAB, fcbC, lipA, bexDCA, ctrCD, lgtA, lgtC, lic2A involved in biogenesis of surface polysaccharides, hsf encoding autotransporter adhesin, and fhaB encoding filamentous haemagglutinin. Furthermore, based on genome-wide positive selection scanning, a total of 35 genes were subject to strong selection pressure. Extensive analyses of protein subcellular location indicated that membrane-associated genes were highly abundant among all positively selected genes. The detected amino acid sites undergoing adaptive selection were preferably located in extracellular space, perhaps associated with bacterial evasion of host immune responses. Our findings shed more light on conservation and distribution of virulence-associated genes across P. multocida strains. Meanwhile, this study provides a genetic context for future researches on the mechanism of adaptive evolution in P. multocida. PMID:28611758
Natural selection has molded the evolution across all taxa. At an arguable date of around 330,000 years ago there were already at least two different types of cattle that became ancestors of nearly all modern cattle, the Bos primigenius taurus more adapted to temperate climates and the tropically ad...
At an arguable date of around 330,000 years ago there were already at least two different types of cattle that became ancestors of nearly all modern cattle, the Bos primigenius taurus more adapted to temperate climates and the tropically adapted Bos primigenius indicus. Human selection exponentially...
Investigators for The Cancer Genome Atlas (TCGA) Research Network have detailed and broadly classified the genomic alterations that frequently underlie the development of acute myeloid leukemia (AML).
Mai, M D; Sahana, G; Christiansen, F B
QTL for milk production traits in Danish Jersey cattle were mapped by a genome-wide association analysis using a mixed model. The analysis incorporated 1,039 bulls and 33,090 SNP, and resulted in 98 detected combinations of QTL and traits on 27 BTA. These QTL comprised 30 for milk index, 50 for fat...... index and 18 for protein index. The evidence presents 33 genome-wide QTL on 14 BTA. Of these, seven had effects on milk index, 21 on fat index and five on protein index. Among the genome-wide QTL, 26 have been previously reported, two on BTA4 and -5 were new for milk index, and five on BTA4, -5, -13....... By a chromosome wide threshold 65 additional QTL were detected. Many of them are likely to represent QTL. The results are interesting from a breeding perspective and contribute to the search for the genes causing the polymorphisms important for milk production traits....
Lopes, F B; Wu, X-L; Li, H; Xu, J; Perkins, T; Genho, J; Ferretti, R; Tait, R G; Bauck, S; Rosa, G J M
Reliable genomic prediction of breeding values for quantitative traits requires the availability of sufficient number of animals with genotypes and phenotypes in the training set. As of 31 October 2016, there were 3,797 Brangus animals with genotypes and phenotypes. These Brangus animals were genotyped using different commercial SNP chips. Of them, the largest group consisted of 1,535 animals genotyped by the GGP-LDV4 SNP chip. The remaining 2,262 genotypes were imputed to the SNP content of the GGP-LDV4 chip, so that the number of animals available for training the genomic prediction models was more than doubled. The present study showed that the pooling of animals with both original or imputed 40K SNP genotypes substantially increased genomic prediction accuracies on the ten traits. By supplementing imputed genotypes, the relative gains in genomic prediction accuracies on estimated breeding values (EBV) were from 12.60% to 31.27%, and the relative gain in genomic prediction accuracies on de-regressed EBV was slightly small (i.e. 0.87%-18.75%). The present study also compared the performance of five genomic prediction models and two cross-validation methods. The five genomic models predicted EBV and de-regressed EBV of the ten traits similarly well. Of the two cross-validation methods, leave-one-out cross-validation maximized the number of animals at the stage of training for genomic prediction. Genomic prediction accuracy (GPA) on the ten quantitative traits was validated in 1,106 newly genotyped Brangus animals based on the SNP effects estimated in the previous set of 3,797 Brangus animals, and they were slightly lower than GPA in the original data. The present study was the first to leverage currently available genotype and phenotype resources in order to harness genomic prediction in Brangus beef cattle. © 2018 Blackwell Verlag GmbH.
Beverly E. Flood
Full Text Available The genus Thiomargarita includes the world’s largest bacteria. But as uncultured organisms, their physiology, metabolism, and basis for their gigantism are not well understood. Thus a genomics approach, applied to a single Candidatus Thiomargarita nelsonii cell was employed to explore the genetic potential of one of these enigmatic giant bacteria. The Thiomargarita cell was obtained from an assemblage of budding Ca. T. nelsonii attached to a provannid gastropod shell from Hydrate Ridge, a methane seep offshore of Oregon, USA. Here we present a manually curated genome of Bud S10 resulting from a hybrid assembly of long Pacific Biosciences and short Illumina sequencing reads. With respect to inorganic carbon fixation and sulfur oxidation pathways, the Ca. T. nelsonii Hydrate Ridge Bud S10 genome was similar to marine sister taxa within the family Beggiatoaceae. However, the Bud S10 genome contains genes suggestive of the genetic potential for lithotrophic growth on arsenite and perhaps hydrogen. The genome also revealed that Bud S10 likely respires nitrate via two pathways: a complete denitrification pathway and a dissimilatory nitrate reduction to ammonia pathway. Both pathways have been predicted, but not previously fully elucidated, in the genomes of other large, vacuolated, sulfur-oxidizing bacteria.Surprisingly, the genome also had a high number of unusual features for a bacterium to include the largest number of metacaspases and introns ever reported in a bacterium. Also present, are a large number of other mobile genetic elements, such as insertion sequence transposable elements and miniature inverted-repeat transposable elements (MITEs. In some cases, mobile genetic elements disrupted key genes in metabolic pathways. For example, a MITE interrupts hupL, which encodes the large subunit of the hydrogenase in hydrogen oxidation. Moreover, we detected a group I intron in one of the most critical genes in the sulfur oxidation pathway, dsr
Full Text Available Abstract Background In national evaluations, direct genomic breeding values can be considered as correlated traits to those for which phenotypes are available for traditional estimation of breeding values. For this purpose, estimates of the accuracy of direct genomic breeding values expressed as genetic correlations between traits and their respective direct genomic breeding values are required. Methods We derived direct genomic breeding values for 2239 registered Limousin and 2703 registered Simmental beef cattle genotyped with either the Illumina BovineSNP50 BeadChip or the Illumina BovineHD BeadChip. For the 264 Simmental animals that were genotyped with the BovineHD BeadChip, genotypes for markers present on the BovineSNP50 BeadChip were extracted. Deregressed estimated breeding values were used as observations in weighted analyses that estimated marker effects to derive direct genomic breeding values for each breed. For each breed, genotyped individuals were clustered into five groups using K-means clustering, with the aim of increasing within-group and decreasing between-group pedigree relationships. Cross-validation was performed five times for each breed, using four groups for training and the fifth group for validation. For each trait, we then applied a weighted bivariate analysis of the direct genomic breeding values of genotyped animals from all five validation sets and their corresponding deregressed estimated breeding values to estimate variance and covariance components. Results After minimizing relationships between training and validation groups, estimated genetic correlations between each trait and its direct genomic breeding values ranged from 0.39 to 0.76 in Limousin and from 0.29 to 0.65 in Simmental. The efficiency of selection based on direct genomic breeding values relative to selection based on parent average information ranged from 0.68 to 1.28 in genotyped Limousin and from 0.51 to 1.44 in genotyped Simmental animals
Saatchi, Mahdi; McClure, Mathew C; McKay, Stephanie D; Rolf, Megan M; Kim, JaeWoo; Decker, Jared E; Taxis, Tasia M; Chapple, Richard H; Ramey, Holly R; Northcutt, Sally L; Bauck, Stewart; Woodward, Brent; Dekkers, Jack C M; Fernando, Rohan L; Schnabel, Robert D; Garrick, Dorian J; Taylor, Jeremy F
Genomic selection is a recently developed technology that is beginning to revolutionize animal breeding. The objective of this study was to estimate marker effects to derive prediction equations for direct genomic values for 16 routinely recorded traits of American Angus beef cattle and quantify corresponding accuracies of prediction. Deregressed estimated breeding values were used as observations in a weighted analysis to derive direct genomic values for 3570 sires genotyped using the Illumina BovineSNP50 BeadChip. These bulls were clustered into five groups using K-means clustering on pedigree estimates of additive genetic relationships between animals, with the aim of increasing within-group and decreasing between-group relationships. All five combinations of four groups were used for model training, with cross-validation performed in the group not used in training. Bivariate animal models were used for each trait to estimate the genetic correlation between deregressed estimated breeding values and direct genomic values. Accuracies of direct genomic values ranged from 0.22 to 0.69 for the studied traits, with an average of 0.44. Predictions were more accurate when animals within the validation group were more closely related to animals in the training set. When training and validation sets were formed by random allocation, the accuracies of direct genomic values ranged from 0.38 to 0.85, with an average of 0.65, reflecting the greater relationship between animals in training and validation. The accuracies of direct genomic values obtained from training on older animals and validating in younger animals were intermediate to the accuracies obtained from K-means clustering and random clustering for most traits. The genetic correlation between deregressed estimated breeding values and direct genomic values ranged from 0.15 to 0.80 for the traits studied. These results suggest that genomic estimates of genetic merit can be produced in beef cattle at a young age but
Full Text Available Abstract Background Genomic selection is a recently developed technology that is beginning to revolutionize animal breeding. The objective of this study was to estimate marker effects to derive prediction equations for direct genomic values for 16 routinely recorded traits of American Angus beef cattle and quantify corresponding accuracies of prediction. Methods Deregressed estimated breeding values were used as observations in a weighted analysis to derive direct genomic values for 3570 sires genotyped using the Illumina BovineSNP50 BeadChip. These bulls were clustered into five groups using K-means clustering on pedigree estimates of additive genetic relationships between animals, with the aim of increasing within-group and decreasing between-group relationships. All five combinations of four groups were used for model training, with cross-validation performed in the group not used in training. Bivariate animal models were used for each trait to estimate the genetic correlation between deregressed estimated breeding values and direct genomic values. Results Accuracies of direct genomic values ranged from 0.22 to 0.69 for the studied traits, with an average of 0.44. Predictions were more accurate when animals within the validation group were more closely related to animals in the training set. When training and validation sets were formed by random allocation, the accuracies of direct genomic values ranged from 0.38 to 0.85, with an average of 0.65, reflecting the greater relationship between animals in training and validation. The accuracies of direct genomic values obtained from training on older animals and validating in younger animals were intermediate to the accuracies obtained from K-means clustering and random clustering for most traits. The genetic correlation between deregressed estimated breeding values and direct genomic values ranged from 0.15 to 0.80 for the traits studied. Conclusions These results suggest that genomic estimates
Huang, Yong-Zhen; Zhan, Zhao-Yang; Li, Xin-Yi; Wu, Sheng-Ru; Sun, Yu-Jia; Xue, Jing; Lan, Xian-Yong; Lei, Chu-Zhao; Zhang, Chun-Lei; Jia, Yu-Tang; Chen, Hong
Insulin-like growth factor 2 (IGF2) is a potent cell growth and differentiation factor and is implicated in mammals' growth and development. The objective of this study was to evaluate the effects of the mutations in the bovine IGF2 with growth traits in Chinese Qinchuan cattle. Four single nucleotide polymorphisms (SNPs) were detected of the bovine IGF2 by DNA pool sequencing and forced polymerase chain reaction-restriction fragment length polymorphism (forced PCR-RFLP) methods. We also investigated haplotype structure and linkage disequilibrium (LD) coefficients for four SNPs in 817 individuals representing two main cattle breeds from China. The result of haplotype analysis showed eight different haplotypes and 27 combined genotypes within the study population. The statistical analyses indicated that the four SNPs, combined genotypes and haplotypes are associated with the withers height, body length, chest breadth, chest depth and body weight in Qinchuan cattle population (P growth traits; the heterozygote diplotype was associated with higher growth traits compared to wild-type homozygote. Our results provide evidence that polymorphisms in the IGF2 gene are associated with growth traits, and may be used for marker-assisted selection in beef cattle breeding program.
Curtis, Bruce A.; Tanifuji, Goro; Burki, Fabien; Gruber, Ansgar; Irimia, Manuuel; Maruyama, Shinichiro; Arias, Maria C.; Ball, Steven G.; Gile, Gillian H.; Hirakawa, Yoshihisa; Hopkins, Julia F.; Kuo, Alan; Rensing, Stefan A.; Schmutz, Jeremy; Symeonidi, Aikaterini; Elias, Marek; Eveleigh, Robert J. M.; Herman, Emily K.; Klute, Mary J.; Nakayama, Takuro; Obornik, Miroslav; Reyes-Prieto, Adrian; Armbrust, E. Virginia; Aves, Stephen J.; Beiko, Robert G.; Coutinho, Pedro; Dacks, Joel B.; Durnford, Dion G.; Fast, Naomi M.; Green, Beverley R.; Grisdale, Cameron J.; Hempel, Franziska; Henrissat, Bernard; Hoppner, Marc P.; Ishida, Ken-Ichiro; Kim, Eunsoo; Koreny, Ludek; Kroth, Peter G.; Liu, Yuan; Malik, Shehre-Banoo; Maier, Uwe G.; McRose, Darcy; Mock, Thomas; Neilson, Jonathan A. D.; Onodera, Naoko T.; Poole, Anthony M.; Pritham, Ellen J.; Richards, Thomas A.; Rocap, Gabrielle; Roy, Scott W.; Sarai, Chihiro; Schaack, Sarah; Shirato, Shu; Slamovits, Claudio H.; Spencer, Davie F.; Suzuki, Shigekatsu; Worden, Alexandra Z.; Zauner, Stefan; Barry, Kerrie; Bell, Callum; Bharti, Arvind K.; Crow, John A.; Grimwood, Jane; Kramer, Robin; Lindquist, Erika; Lucas, Susan; Salamov, Asaf; McFadden, Geoffrey I.; Lane, Christopher E.; Keeling, Patrick J.; Gray, Michael W.; Grigoriev, Igor V.; Archibald, John M.
Cryptophyte and chlorarachniophyte algae are transitional forms in the widespread secondary endosymbiotic acquisition of photosynthesis by engulfment of eukaryotic algae. Unlike most secondary plastid-bearing algae, miniaturized versions of the endosymbiont nuclei (nucleomorphs) persist in cryptophytes and chlorarachniophytes. To determine why, and to address other fundamental questions about eukaryote eukaryote endosymbiosis, we sequenced the nuclear genomes of the cryptophyte Guillardia theta and the chlorarachniophyte Bigelowiella natans. Both genomes have 21,000 protein genes and are intron rich, and B. natans exhibits unprecedented alternative splicing for a single-celled organism. Phylogenomic analyses and subcellular targeting predictions reveal extensive genetic and biochemical mosaicism, with both host- and endosymbiont-derived genes servicing the mitochondrion, the host cell cytosol, the plastid and the remnant endosymbiont cytosol of both algae. Mitochondrion-to-nucleus gene transfer still occurs in both organisms but plastid-to-nucleus and nucleomorph-to-nucleus transfers do not, which explains why a small residue of essential genes remains locked in each nucleomorph.
Evans, J.A.; Tatham, S.; Chenery, S.R.; Chenery, C.A.
The Sr concentration and isotope composition of tooth enamel from domesticated animals from two neighbouring Anglo-Saxon settlements, at Empingham (6-7th century) and Ketton (10-12th century) in Rutland, central England, are compared both with each other, and with associated human populations. Data from the Empingham II site form discrete fields in Sr concentration and isotope composition space for cattle, pig and sheep with a partial overlap of the human and pig fields. By contrast there is significant overlap in all the animal and human data fields from the Ketton site. The differences in data distribution between the two sites are attributed to animal husbandry techniques, as the surface geology of the two areas is very similar, implying geological factors are an unlikely cause of the difference. It is suggested that the grazing and feeding patterns of animals at the Empingham II site were controlled and restricted, whereas at the Ketton site the animals grazed and foraged freely over a common area. Strontium isotope variation within cattle molars from the two settlements show marked differences that reflect the nature of their feeding and rearing. The enamel from a cattle molar from the Empingham II site has a well-defined, systematic variation of Sr isotope composition with Sr concentration, whereas no such patterns exist in a comparable cattle molar from Ketton. Chemical and O isotope variations in the cattle tooth from Empingham II show sympathetic variation of Sr and Ba concentrations with Sr isotope composition from cusp to cervix. The cusp has higher Sr and Ba concentrations log (Sr/Ca) ratio of -3.1 and 87 Sr/ 86 Sr ratio of 0.71151 where as the cervical region of the enamel has log (Sr/Ca) = -3.3 and 87 Sr/ 86 Sr = 0.71061
Schwager, Evelyn E; Sharma, Prashant P; Clarke, Thomas; Leite, Daniel J; Wierschin, Torsten; Pechmann, Matthias; Akiyama-Oda, Yasuko; Esposito, Lauren; Bechsgaard, Jesper; Bilde, Trine; Buffry, Alexandra D; Chao, Hsu; Dinh, Huyen; Doddapaneni, HarshaVardhan; Dugan, Shannon; Eibner, Cornelius; Extavour, Cassandra G; Funch, Peter; Garb, Jessica; Gonzalez, Luis B; Gonzalez, Vanessa L; Griffiths-Jones, Sam; Han, Yi; Hayashi, Cheryl; Hilbrant, Maarten; Hughes, Daniel S T; Janssen, Ralf; Lee, Sandra L; Maeso, Ignacio; Murali, Shwetha C; Muzny, Donna M; Nunes da Fonseca, Rodrigo; Paese, Christian L B; Qu, Jiaxin; Ronshaugen, Matthew; Schomburg, Christoph; Schönauer, Anna; Stollewerk, Angelika; Torres-Oliva, Montserrat; Turetzek, Natascha; Vanthournout, Bram; Werren, John H; Wolff, Carsten; Worley, Kim C; Bucher, Gregor; Gibbs, Richard A; Coddington, Jonathan; Oda, Hiroki; Stanke, Mario; Ayoub, Nadia A; Prpic, Nikola-Michael; Flot, Jean-François; Posnien, Nico; Richards, Stephen; McGregor, Alistair P
The duplication of genes can occur through various mechanisms and is thought to make a major contribution to the evolutionary diversification of organisms. There is increasing evidence for a large-scale duplication of genes in some chelicerate lineages including two rounds of whole genome duplication (WGD) in horseshoe crabs. To investigate this further, we sequenced and analyzed the genome of the common house spider Parasteatoda tepidariorum. We found pervasive duplication of both coding and non-coding genes in this spider, including two clusters of Hox genes. Analysis of synteny conservation across the P. tepidariorum genome suggests that there has been an ancient WGD in spiders. Comparison with the genomes of other chelicerates, including that of the newly sequenced bark scorpion Centruroides sculpturatus, suggests that this event occurred in the common ancestor of spiders and scorpions, and is probably independent of the WGDs in horseshoe crabs. Furthermore, characterization of the sequence and expression of the Hox paralogs in P. tepidariorum suggests that many have been subject to neo-functionalization and/or sub-functionalization since their duplication. Our results reveal that spiders and scorpions are likely the descendants of a polyploid ancestor that lived more than 450 MYA. Given the extensive morphological diversity and ecological adaptations found among these animals, rivaling those of vertebrates, our study of the ancient WGD event in Arachnopulmonata provides a new comparative platform to explore common and divergent evolutionary outcomes of polyploidization events across eukaryotes.
Kyrpides, Nikos; Anderson, Iain; Rodriguez, Jason; Susanti, Dwi; Porat, Iris; Reich, Claudia; Ulrich, Luke E.; Elkins, James G.; Mavromatis, Kostas; Lykidis, Athanasios; Kim, Edwin; Thompson, Linda S.; Nolan, Matt; Land, Miriam; Copeland, Alex; Lapidus, Alla; Lucas, Susan; Detter, Chris; Zhulin, Igor B.; Olsen, Gary J.; Whitman, William; Mukhopadhyay, Biswarup; Bristow, James; Kyrpides, Nikos
We report the complete genome of Thermofilum pendens, a deep-branching, hyperthermophilic member of the order Thermoproteales within the archaeal kingdom Crenarchaeota. T. pendens is a sulfur-dependent, anaerobic heterotroph isolated from a solfatara in Iceland. It is an extracellular commensal, requiring an extract of Thermoproteus tenax for growth, and the genome sequence reveals that biosynthetic pathways for purines, most amino acids, and most cofactors are absent. In fact T. pendens has fewer biosynthetic enzymes than obligate intracellular parasites, although it does not display other features common among obligate parasites and thus does not appear to be in the process of becoming a parasite. It appears that T. pendens has adapted to life in an environment rich in nutrients. T. pendens was known to utilize peptides as an energy source, but the genome reveals substantial ability to grow on carbohydrates. T. pendens is the first crenarchaeote and only the second archaeon found to have a transporter of the phosphotransferase system. In addition to fermentation, T. pendens may gain energy from sulfur reduction with hydrogen and formate as electron donors. It may also be capable of sulfur-independent growth on formate with formate hydrogenlyase. Additional novel features are the presence of a monomethylamine:corrinoid methyltransferase, the first time this enzyme has been found outside of Methanosarcinales, and a presenilin-related protein. Predicted highly expressed proteins do not include housekeeping genes, and instead include ABC transporters for carbohydrates and peptides, and CRISPR-associated proteins.
Barth, Stefanie A; Menge, Christian; Eichhorn, Inga; Semmler, Torsten; Wieler, Lothar H; Pickard, Derek; Belka, Ariane; Berens, Christian; Geue, Lutz
Shiga toxin-producing Escherichia coli (STEC) strains can colonize cattle for several months and may, thus, serve as gene reservoirs for the genesis of highly virulent zoonotic enterohemorrhagic E. coli (EHEC). Attempts to reduce the human risk for acquiring EHEC infections should include strategies to control such STEC strains persisting in cattle. We therefore aimed to identify genetic patterns associated with the STEC colonization type in the bovine host. We included 88 persistent colonizing STEC (STEC(per)) (shedding for ≥4 months) and 74 sporadically colonizing STEC (STEC(spo)) (shedding for ≤2 months) isolates from cattle and 16 bovine STEC isolates with unknown colonization types. Genoserotypes and multilocus sequence types (MLSTs) were determined, and the isolates were probed with a DNA microarray for virulence-associated genes (VAGs). All STEC(per) isolates belonged to only four genoserotypes (O26:H11, O156:H25, O165:H25, O182:H25), which formed three genetic clusters (ST21/396/1705, ST300/688, ST119). In contrast, STEC(spo) isolates were scattered among 28 genoserotypes and 30 MLSTs, with O157:H7 (ST11) and O6:H49 (ST1079) being the most prevalent. The microarray analysis identified 139 unique gene patterns that clustered with the genoserotypes and MLSTs of the strains. While the STEC(per) isolates possessed heterogeneous phylogenetic backgrounds, the accessory genome clustered these isolates together, separating them from the STEC(spo) isolates. Given the vast genetic heterogeneity of bovine STEC strains, defining the genetic patterns distinguishing STEC(per) from STEC(spo) isolates will facilitate the targeted design of new intervention strategies to counteract these zoonotic pathogens at the farm level. Ruminants, especially cattle, are sources of food-borne infections by Shiga toxin-producing Escherichia coli (STEC) in humans. Some STEC strains persist in cattle for longer periods of time, while others are detected only sporadically. Persisting
Ma, Li-Jun; Ibrahim, Ashraf S.; Skory, Christopher; Grabherr, Manfred G.; Burger, Gertraud; Butler, Margi; Elias, Marek; Idnurm, Alexander; Lang, B. Franz; Sone, Teruo; Abe, Ayumi; Calvo, Sarah E.; Corrochano, Luis M.; Engels, Reinhard; Fu, Jianmin; Hansberg, Wilhelm; Kim, Jung-Mi; Kodira, Chinnappa D.; Koehrsen, Michael J.; Liu, Bo; Miranda-Saavedra, Diego; O'Leary, Sinead; Ortiz-Castellanos, Lucila; Poulter, Russell; Rodriguez-Romero, Julio; Ruiz-Herrera, José; Shen, Yao-Qing; Zeng, Qiandong; Galagan, James; Birren, Bruce W.
Rhizopus oryzae is the primary cause of mucormycosis, an emerging, life-threatening infection characterized by rapid angioinvasive growth with an overall mortality rate that exceeds 50%. As a representative of the paraphyletic basal group of the fungal kingdom called “zygomycetes,” R. oryzae is also used as a model to study fungal evolution. Here we report the genome sequence of R. oryzae strain 99–880, isolated from a fatal case of mucormycosis. The highly repetitive 45.3 Mb genome assembly contains abundant transposable elements (TEs), comprising approximately 20% of the genome. We predicted 13,895 protein-coding genes not overlapping TEs, many of which are paralogous gene pairs. The order and genomic arrangement of the duplicated gene pairs and their common phylogenetic origin provide evidence for an ancestral whole-genome duplication (WGD) event. The WGD resulted in the duplication of nearly all subunits of the protein complexes associated with respiratory electron transport chains, the V-ATPase, and the ubiquitin–proteasome systems. The WGD, together with recent gene duplications, resulted in the expansion of multiple gene families related to cell growth and signal transduction, as well as secreted aspartic protease and subtilase protein families, which are known fungal virulence factors. The duplication of the ergosterol biosynthetic pathway, especially the major azole target, lanosterol 14α-demethylase (ERG11), could contribute to the variable responses of R. oryzae to different azole drugs, including voriconazole and posaconazole. Expanded families of cell-wall synthesis enzymes, essential for fungal cell integrity but absent in mammalian hosts, reveal potential targets for novel and R. oryzae-specific diagnostic and therapeutic treatments. PMID:19578406
Haitjema, Charles H.; Gilmore, Sean P.; Henske, John K.; Solomon, Kevin V.; de Groot, Randall; Kuo, Alan; Mondo, Stephen J.; Salamov, Asaf A.; LaButti, Kurt; Zhao, Zhiying; Chiniquy, Jennifer; Barry, Kerrie; Brewer, Heather M.; Purvine, Samuel O.; Wright, Aaron T.; Hainaut, Matthieu; Boxma, Brigitte; van Alen, Theo; Hackstein, Johannes H. P.; Henrissat, Bernard; Baker, Scott E.; Grigoriev, Igor V.; O' Malley, Michelle A.
Cellulosomes are large, multi-protein complexes that tether plant biomass degrading enzymes together for improved hydrolysis1. These complexes were first described in anaerobic bacteria where species specific dockerin domains mediate assembly of enzymes onto complementary cohesin motifs interspersed within non-catalytic protein scaffolds1. The versatile protein assembly mechanism conferred by the bacterial cohesin-dockerin interaction is now a standard design principle for synthetic protein-scale pathways2,3. For decades, analogous structures have been reported in the early branching anaerobic fungi, which are known to assemble by sequence divergent non-catalytic dockerin domains (NCDD)4. However, the enzyme components, modular assembly mechanism, and functional role of fungal cellulosomes remain unknown5,6. Here, we describe the comprehensive set of proteins critical to fungal cellulosome assembly, including novel, conserved scaffolding proteins unique to the Neocallimastigomycota. High quality genomes of the anaerobic fungi Anaeromyces robustus, Neocallimastix californiae and Piromyces finnis were assembled with long-read, single molecule technology to overcome their repeat-richness and extremely low GC content. Genomic analysis coupled with proteomic validation revealed an average 320 NCDD-containing proteins per fungal strain that were overwhelmingly carbohydrate active enzymes (CAZymes), with 95 large fungal scaffoldins identified across 4 genera that contain a conserved amino acid sequence repeat that binds to NCDDs. Fungal dockerin and scaffoldin domains have no similarity to their bacterial counterparts, yet several catalytic domains originated via horizontal gene transfer with gut bacteria. Though many catalytic domains are shared with bacteria, the biocatalytic activity of anaerobic fungi is expanded by the inclusion of GH3, GH6, and GH45 enzymes in the enzyme complexes. Collectively, these findings suggest that the fungal cellulosome is an evolutionarily
Full Text Available Abstract Background Actinobacteria represent a consistent component of most marine bacterial communities yet little is known about the mechanisms by which these Gram-positive bacteria adapt to life in the marine environment. Here we employed a phylogenomic approach to identify marine adaptation genes in marine Actinobacteria. The focus was on the obligate marine actinomycete genus Salinispora and the identification of marine adaptation genes that have been acquired from other marine bacteria. Results Functional annotation, comparative genomics, and evidence of a shared evolutionary history with bacteria from hyperosmotic environments were used to identify a pool of more than 50 marine adaptation genes. An Actinobacterial species tree was used to infer the likelihood of gene gain or loss in accounting for the distribution of each gene. Acquired marine adaptation genes were associated with electron transport, sodium and ABC transporters, and channels and pores. In addition, the loss of a mechanosensitive channel gene appears to have played a major role in the inability of Salinispora strains to grow following transfer to low osmotic strength media. Conclusions The marine Actinobacteria for which genome sequences are available are broadly distributed throughout the Actinobacterial phylogenetic tree and closely related to non-marine forms suggesting they have been independently introduced relatively recently into the marine environment. It appears that the acquisition of transporters in Salinispora spp. represents a major marine adaptation while gene loss is proposed to play a role in the inability of this genus to survive outside of the marine environment. This study reveals fundamental differences between marine adaptations in Gram-positive and Gram-negative bacteria and no common genetic basis for marine adaptation among the Actinobacteria analyzed.
Mota, R R; Guimarães, S E F; Fortes, M R S; Hayes, B; Silva, F F; Verardo, L L; Kelly, M J; de Campos, C F; Guimarães, J D; Wenceslau, R R; Penitente-Filho, J M; Garcia, J F; Moore, S
We performed a genome-wide mapping for the age at first calving (AFC) with the goal of annotating candidate genes that regulate fertility in Nellore cattle. Phenotypic data from 762 cows and 777k SNP genotypes from 2,992 bulls and cows were used. Single nucleotide polymorphism (SNP) effects based on the single-step GBLUP methodology were blocked into adjacent windows of 1 Megabase (Mb) to explain the genetic variance. SNP windows explaining more than 0.40% of the AFC genetic variance were identified on chromosomes 2, 8, 9, 14, 16 and 17. From these windows, we identified 123 coding protein genes that were used to build gene networks. From the association study and derived gene networks, putative candidate genes (e.g., PAPPA, PREP, FER1L6, TPR, NMNAT1, ACAD10, PCMTD1, CRH, OPKR1, NPBWR1 and NCOA2) and transcription factors (TF) (STAT1, STAT3, RELA, E2F1 and EGR1) were strongly associated with female fertility (e.g., negative regulation of luteinizing hormone secretion, folliculogenesis and establishment of uterine receptivity). Evidence suggests that AFC inheritance is complex and controlled by multiple loci across the genome. As several windows explaining higher proportion of the genetic variance were identified on chromosome 14, further studies investigating the interaction across haplotypes to better understand the molecular architecture behind AFC in Nellore cattle should be undertaken. © 2017 Blackwell Verlag GmbH.
Georgi, Benjamin; Craig, David; Kember, Rachel L.; Liu, Wencheng; Lindquist, Ingrid; Nasser, Sara; Brown, Christopher; Egeland, Janice A.; Paul, Steven M.; Bućan, Maja
Bipolar disorder is a common, heritable mental illness characterized by recurrent episodes of mania and depression. Despite considerable effort to elucidate the genetic underpinnings of bipolar disorder, causative genetic risk factors remain elusive. We conducted a comprehensive genomic analysis of bipolar disorder in a large Old Order Amish pedigree. Microsatellite genotypes and high-density SNP-array genotypes of 388 family members were combined with whole genome sequence data for 50 of these subjects, comprising 18 parent-child trios. This study design permitted evaluation of candidate variants within the context of haplotype structure by resolving the phase in sequenced parent-child trios and by imputation of variants into multiple unsequenced siblings. Non-parametric and parametric linkage analysis of the entire pedigree as well as on smaller clusters of families identified several nominally significant linkage peaks, each of which included dozens of predicted deleterious variants. Close inspection of exonic and regulatory variants in genes under the linkage peaks using family-based association tests revealed additional credible candidate genes for functional studies and further replication in population-based cohorts. However, despite the in-depth genomic characterization of this unique, large and multigenerational pedigree from a genetic isolate, there was no convergence of evidence implicating a particular set of risk loci or common pathways. The striking haplotype and locus heterogeneity we observed has profound implications for the design of studies of bipolar and other related disorders. PMID:24625924
Full Text Available Bipolar disorder is a common, heritable mental illness characterized by recurrent episodes of mania and depression. Despite considerable effort to elucidate the genetic underpinnings of bipolar disorder, causative genetic risk factors remain elusive. We conducted a comprehensive genomic analysis of bipolar disorder in a large Old Order Amish pedigree. Microsatellite genotypes and high-density SNP-array genotypes of 388 family members were combined with whole genome sequence data for 50 of these subjects, comprising 18 parent-child trios. This study design permitted evaluation of candidate variants within the context of haplotype structure by resolving the phase in sequenced parent-child trios and by imputation of variants into multiple unsequenced siblings. Non-parametric and parametric linkage analysis of the entire pedigree as well as on smaller clusters of families identified several nominally significant linkage peaks, each of which included dozens of predicted deleterious variants. Close inspection of exonic and regulatory variants in genes under the linkage peaks using family-based association tests revealed additional credible candidate genes for functional studies and further replication in population-based cohorts. However, despite the in-depth genomic characterization of this unique, large and multigenerational pedigree from a genetic isolate, there was no convergence of evidence implicating a particular set of risk loci or common pathways. The striking haplotype and locus heterogeneity we observed has profound implications for the design of studies of bipolar and other related disorders.
Höglund, Johanna; Buitenhuis, Albert Johannes; Guldbrandtsen, Bernt
they produce less milk, fat and protein. Female fertility is an important trait for the dairy cattle farmer. Selection decisions in female fertilty in NRC are based on the female fertility index (FTI). FTI is a composite index including a number of sub-indices describing aspects of female fertility in dairy......Background The Nordic Red Cattle (NRC) consists of animls belonging to the Danish Red, Finnish Ayrshire, and Swedish Red breeds. Compared to the Holstein breed, NRC animals are smaller, have a shorter calving interval, lower mastitis incidence and lower rates of stillborn calves, however...... cattle. The sub-traits of FTI are: number of inseminations per conception (AIS) in cows (C) and heifers (H), the length in days of the interval from calving to first insemination (ICF) in cows, days from first to last insemination (IFL) in cows and heifers, and 56-day non-return rate (NRR) in cows...
Symbiosis between dinoflagellates of the genus Symbiodinium and reef-building corals forms the trophic foundation of the world\\'s coral reef ecosystems. Here we present the first draft genome of Symbiodinium goreaui (Clade C, type C1: 1.03 Gbp), one of the most ubiquitous endosymbionts associated with corals, and an improved draft genome of Symbiodinium kawagutii (Clade F, strain CS-156: 1.05 Gbp), previously sequenced as strain CCMP2468, to further elucidate genomic signatures of this symbiosis. Comparative analysis of four available Symbiodinium genomes against other dinoflagellate genomes led to the identification of 2460 nuclear gene families that show evidence of positive selection, including genes involved in photosynthesis, transmembrane ion transport, synthesis and modification of amino acids and glycoproteins, and stress response. Further, we identified extensive sets of genes for meiosis and response to light stress. These draft genomes provide a foundational resource for advancing our understanding Symbiodinium biology and the coral-algal symbiosis.
Fang, Lingzhao; Sahana, Goutam; Ma, Peipei
sequence variants in Holstein (HOL) and Jersey (JER) cattle were analysed. We first carried out a post-GWAS analysis in a HOL training population to assess the degree of enrichment of the association signals in the gene regions defined by each GO term. We then extended the genomic best linear unbiased......BACKGROUND: A better understanding of the genetic architecture underlying complex traits (e.g., the distribution of causal variants and their effects) may aid in the genomic prediction. Here, we hypothesized that the genomic variants of complex traits might be enriched in a subset of genomic...
Mesbah-Uddin, Md; Guldbrandtsen, Bernt; Iso-Touru, Terhi
-regions, such as ABCA12, TTC1, VWA3B, TSHR, DST/BPAG1, and CD1D. The genes overlapping deletions are on average evolutionarily less conserved compared with known mouse lethal genes (P-value = 2.3 × 10-6). We report 167 natural gene knockouts in cattle that are apparently nonessential as live homozygote individuals...
Schrooten, C.; Bovenhuis, H.; Coppieters, W.; Arendonk, van J.A.M.
A granddaughter design was used to locate quantitative trait loci determining conformation and functional traits in dairy cattle. In this granddaughter design, consisting of 20 Holstein Friesian grandsires and 833 sons, genotypes were determined for 277 microsatellite markers covering the whole
The introduction of humped zebu to the African continent has led to genetic introgression with the local African humpless taurine. A mosaic of zebu x taurine cattle populations adapted to the local environments (e.g., semi-dry desert, humid and sub-humid forested areas) has arisen as a result of thi...
Grbić, M.; Van Leeuwen, T.; Clark, R.M.; Rombauts, S.; Grbić, V.; Osborne, E.J.; Dermauw, W.; Phuong, C.T.N.; Ortego, F.; Hernández-Crespo, P.; Diaz, I.; Martinez, M.; Navajas, M.; Sucena, E.; Magalhães, S.; Nagy, L.; Pace, R.M.; Djuranović, S.; Smagghe, G.; Iga, M.; Christiaens, O.; Veenstra, J.A.; Ewer, J.; Villalobos, R.M.; Hutter, J.L.; Hudson, S.D.; Velez, M.; Yi, S.V.; Zeng, J.; Pires-dasilva, A.; Roch, F.; Cazaux, M.; Navarro, M.; Zhurov, V.; Acevedo, G.; Bjelica, A.; Fawcett, J.A.; Bonnet, E.; Martens, C.; Baele, G.; Wissler, L.; Sanchez-Rodriguez, A.; Tirry, L.; Blais, C.; Demeestere, K.; Henz, S.R.; Gregory, T.R.; Mathieu, J.; Verdon, L.; Farinelli, L.; Schmutz, J.; Lindquist, E.; Feyereisen, R.; Van de Peer, Y.
The spider mite Tetranychus urticae is a cosmopolitan agricultural pest with an extensive host plant range and an extreme record of pesticide resistance. Here we present the completely sequenced and annotated spider mite genome, representing the first complete chelicerate genome. At 90 megabases T.
Suga, Hiroshi; Chen, Zehua; de Mendoza, Alex; Sebé-Pedrós, Arnau; Brown, Matthew W; Kramer, Eric; Carr, Martin; Kerner, Pierre; Vervoort, Michel; Sánchez-Pons, Núria; Torruella, Guifré; Derelle, Romain; Manning, Gerard; Lang, B Franz; Russ, Carsten; Haas, Brian J; Roger, Andrew J; Nusbaum, Chad; Ruiz-Trillo, Iñaki
To reconstruct the evolutionary origin of multicellular animals from their unicellular ancestors, the genome sequences of diverse unicellular relatives are essential. However, only the genome of the choanoflagellate Monosiga brevicollis has been reported to date. Here we completely sequence the genome of the filasterean Capsaspora owczarzaki, the closest known unicellular relative of metazoans besides choanoflagellates. Analyses of this genome alter our understanding of the molecular complexity of metazoans' unicellular ancestors showing that they had a richer repertoire of proteins involved in cell adhesion and transcriptional regulation than previously inferred only with the choanoflagellate genome. Some of these proteins were secondarily lost in choanoflagellates. In contrast, most intercellular signalling systems controlling development evolved later concomitant with the emergence of the first metazoans. We propose that the acquisition of these metazoan-specific developmental systems and the co-option of pre-existing genes drove the evolutionary transition from unicellular protists to metazoans.
Pedersen, Louise Dybdahl; Kargo, Morten; Berg, Peer
The aim of this study was to test whether the use of X-semen in a dairy cattle population using genomic selection (GS) and multiple ovulation and embryo transfer (MOET) increases the selection intensity on cow dams and thereby the genetic gain in the entire population. Also, the dynamics of using...
The availability of whole genome sequence (WGS) data has made it possible to discover protein variants in silico. However, existing bovine WGS databases do not show data in a form conducive to protein variant analysis, and tend to under represent the breadth of genetic diversity in U.S. beef cattle...
The availability of whole genome sequence (WGS) data has made it possible to discover protein variants in silico. However, bovine WGS databases comprised of related influential sires from relatively few breeds tend to under represent the breadth of genetic diversity in U.S. beef cattle. Thus, our ...
Amigo, Natalia; Puebla, Andrea Fabiana; Farber, Marisa Diana
Escherichia coli O157:H7 is a major etiologic agent of diseases in humans that cause diarrhea, hemorrhagic colitis, and hemolytic-uremic syndrome. Here, we report the draft genome sequences of two strains isolated from cattle that had high levels of Shiga toxin 2 and high lethality in mice. PMID:26067964
Hu, Jianqiang; Han, Danxiang; Wang, Hui; Zeng, Xiaowei; Jing, Xiaoyan; Zhou, Qian; Su, Xiaoquan; Chang, Xingzhi; Wang, Anhui; Wang, Wei; Jia, Jing; Wei, Li; Xin, Yi; Qiao, Yinghe; Huang, Ranran; Chen, Jie; Han, Bo; Yoon, Kangsup; Hill, Russell T.; Zohar, Yonathan; Chen, Feng; Hu, Qiang; Xu, Jian
Oleaginous microalgae are promising feedstock for biofuels, yet the genetic diversity, origin and evolution of oleaginous traits remain largely unknown. Here we present a detailed phylogenomic analysis of five oleaginous Nannochloropsis species (a total of six strains) and one time-series transcriptome dataset for triacylglycerol (TAG) synthesis on one representative strain. Despite small genome sizes, high coding potential and relative paucity of mobile elements, the genomes feature small cores of ca. 2,700 protein-coding genes and a large pan-genome of >38,000 genes. The six genomes share key oleaginous traits, such as the enrichment of selected lipid biosynthesis genes and certain glycoside hydrolase genes that potentially shift carbon flux from chrysolaminaran to TAG synthesis. The eleven type II diacylglycerol acyltransferase genes (DGAT-2) in every strain, each expressed during TAG synthesis, likely originated from three ancient genomes, including the secondary endosymbiosis host and the engulfed green and red algae. Horizontal gene transfers were inferred in most lipid synthesis nodes with expanded gene doses and many glycoside hydrolase genes. Thus multiple genome pooling and horizontal genetic exchange, together with selective inheritance of lipid synthesis genes and species-specific gene loss, have led to the enormous genetic apparatus for oleaginousness and the wide genomic divergence among present-day Nannochloropsis. These findings have important implications in the screening and genetic engineering of microalgae for biofuels. PMID:24415958
Full Text Available Oleaginous microalgae are promising feedstock for biofuels, yet the genetic diversity, origin and evolution of oleaginous traits remain largely unknown. Here we present a detailed phylogenomic analysis of five oleaginous Nannochloropsis species (a total of six strains and one time-series transcriptome dataset for triacylglycerol (TAG synthesis on one representative strain. Despite small genome sizes, high coding potential and relative paucity of mobile elements, the genomes feature small cores of ca. 2,700 protein-coding genes and a large pan-genome of >38,000 genes. The six genomes share key oleaginous traits, such as the enrichment of selected lipid biosynthesis genes and certain glycoside hydrolase genes that potentially shift carbon flux from chrysolaminaran to TAG synthesis. The eleven type II diacylglycerol acyltransferase genes (DGAT-2 in every strain, each expressed during TAG synthesis, likely originated from three ancient genomes, including the secondary endosymbiosis host and the engulfed green and red algae. Horizontal gene transfers were inferred in most lipid synthesis nodes with expanded gene doses and many glycoside hydrolase genes. Thus multiple genome pooling and horizontal genetic exchange, together with selective inheritance of lipid synthesis genes and species-specific gene loss, have led to the enormous genetic apparatus for oleaginousness and the wide genomic divergence among present-day Nannochloropsis. These findings have important implications in the screening and genetic engineering of microalgae for biofuels.
Full Text Available Recent studies applying high-throughput sequencing technologies have identified several recurrently mutated genes and pathways in multiple cancer genomes. However, transcriptional consequences from these genomic alterations in cancer genome remain unclear. In this study, we performed integrated and comparative analyses of whole genomes and transcriptomes of 22 hepatitis B virus (HBV-related hepatocellular carcinomas (HCCs and their matched controls. Comparison of whole genome sequence (WGS and RNA-Seq revealed much evidence that various types of genomic mutations triggered diverse transcriptional changes. Not only splice-site mutations, but also silent mutations in coding regions, deep intronic mutations and structural changes caused splicing aberrations. HBV integrations generated diverse patterns of virus-human fusion transcripts depending on affected gene, such as TERT, CDK15, FN1 and MLL4. Structural variations could drive over-expression of genes such as WNT ligands, with/without creating gene fusions. Furthermore, by taking account of genomic mutations causing transcriptional aberrations, we could improve the sensitivity of deleterious mutation detection in known cancer driver genes (TP53, AXIN1, ARID2, RPS6KA3, and identified recurrent disruptions in putative cancer driver genes such as HNF4A, CPS1, TSC1 and THRAP3 in HCCs. These findings indicate genomic alterations in cancer genome have diverse transcriptomic effects, and integrated analysis of WGS and RNA-Seq can facilitate the interpretation of a large number of genomic alterations detected in cancer genome.
Full Text Available Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100 is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases-about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual's susceptibility to acquiring disease-associated alleles.
Dasmahapatra, Kanchon K; Walters, James R.; Briscoe, Adriana D.; Davey, John W.; Whibley, Annabel; Nadeau, Nicola J.; Zimin, Aleksey V.; Hughes, Daniel S. T.; Ferguson, Laura C.; Martin, Simon H.; Salazar, Camilo; Lewis, James J.; Adler, Sebastian; Ahn, Seung-Joon; Baker, Dean A.; Baxter, Simon W.; Chamberlain, Nicola L.; Chauhan, Ritika; Counterman, Brian A.; Dalmay, Tamas; Gilbert, Lawrence E.; Gordon, Karl; Heckel, David G.; Hines, Heather M.; Hoff, Katharina J.; Holland, Peter W.H.; Jacquin-Joly, Emmanuelle; Jiggins, Francis M.; Jones, Robert T.; Kapan, Durrell D.; Kersey, Paul; Lamas, Gerardo; Lawson, Daniel; Mapleson, Daniel; Maroja, Luana S.; Martin, Arnaud; Moxon, Simon; Palmer, William J.; Papa, Riccardo; Papanicolaou, Alexie; Pauchet, Yannick; Ray, David A.; Rosser, Neil; Salzberg, Steven L.; Supple, Megan A.; Surridge, Alison; Tenger-Trolander, Ayse; Vogel, Heiko; Wilkinson, Paul A.; Wilson, Derek; Yorke, James A.; Yuan, Furong; Balmuth, Alexi L.; Eland, Cathlene; Gharbi, Karim; Thomson, Marian; Gibbs, Richard A.; Han, Yi; Jayaseelan, Joy C.; Kovar, Christie; Mathew, Tittu; Muzny, Donna M.; Ongeri, Fiona; Pu, Ling-Ling; Qu, Jiaxin; Thornton, Rebecca L.; Worley, Kim C.; Wu, Yuan-Qing; Linares, Mauricio; Blaxter, Mark L.; Constant, Richard H. ffrench; Joron, Mathieu; Kronforst, Marcus R.; Mullen, Sean P.; Reed, Robert D.; Scherer, Steven E.; Richards, Stephen; Mallet, James; McMillan, W. Owen; Jiggins, Chris D.
The evolutionary importance of hybridization and introgression has long been debated1. We used genomic tools to investigate introgression in Heliconius, a rapidly radiating genus of neotropical butterflies widely used in studies of ecology, behaviour, mimicry and speciation2-5 . We sequenced the genome of Heliconius melpomene and compared it with other taxa to investigate chromosomal evolution in Lepidoptera and gene flow among multiple Heliconius species and races. Among 12,657 predicted genes for Heliconius, biologically important expansions of families of chemosensory and Hox genes are particularly noteworthy. Chromosomal organisation has remained broadly conserved since the Cretaceous, when butterflies split from the silkmoth lineage. Using genomic resequencing, we show hybrid exchange of genes between three co-mimics, H. melpomene, H. timareta, and H. elevatus, especially at two genomic regions that control mimicry pattern. Closely related Heliconius species clearly exchange protective colour pattern genes promiscuously, implying a major role for hybridization in adaptive radiation. PMID:22722851
Rudman, Seth M; Barbour, Matthew A; Csilléry, Katalin; Gienapp, Phillip; Guillaume, Frederic; Hairston, Nelson G; Hendry, Andrew P; Lasky, Jesse R; Rafajlović, Marina; Räsänen, Katja; Schmidt, Paul S; Seehausen, Ole; Therkildsen, Nina O; Turcotte, Martin M; Levine, Jonathan M
Recognition that evolution operates on the same timescale as ecological processes has motivated growing interest in eco-evolutionary dynamics. Nonetheless, generating sufficient data to test predictions about eco-evolutionary dynamics has proved challenging, particularly in natural contexts. Here we argue that genomic data can be integrated into the study of eco-evolutionary dynamics in ways that deepen our understanding of the interplay between ecology and evolution. Specifically, we outline five major questions in the study of eco-evolutionary dynamics for which genomic data may provide answers. Although genomic data alone will not be sufficient to resolve these challenges, integrating genomic data can provide a more mechanistic understanding of the causes of phenotypic change, help elucidate the mechanisms driving eco-evolutionary dynamics, and lead to more accurate evolutionary predictions of eco-evolutionary dynamics in nature.
Full Text Available Bovine tuberculosis (bTB was discovered in a Minnesota cow through routine slaughter surveillance in 2005 and the resulting epidemiological investigation led to the discovery of infection in both cattle and white-tailed deer in the state. From 2005 through 2009, a total of 12 beef cattle herds and 27 free-ranging white-tailed deer (Odocoileus virginianus were found infected in a small geographic region of northwestern Minnesota. Genotyping of isolates determined both cattle and deer shared the same strain of bTB, and it was similar to types found in cattle in the southwestern United States and Mexico. Whole genomic sequencing confirmed the introduction of this infection into Minnesota was recent, with little genetic divergence. Aggressive surveillance and management efforts in both cattle and deer continued from 2010-2012; no additional infections were discovered. Over 10,000 deer were tested and 705 whole herd cattle tests performed in the investigation of this outbreak.
Full Text Available Methanopyrus spp. are usually isolated from harsh niches, such as high osmotic pressure and extreme temperature. However, the molecular mechanisms for their environmental adaption are poorly understood. Archaeal species is commonly considered as primitive organism. The evolutional placement of archaea is a fundamental and intriguing scientific question. We sequenced the genomes of Methanopyrus strains SNP6 and KOL6 isolated from the Atlantic and Iceland, respectively. Comparative genomic analysis revealed genetic diversity and instability implicated in niche adaption, including a number of transporter- and integrase/transposase-related genes. Pan-genome analysis also defined the gene pool of Methanopyrus spp., in addition of ~120-Kb genomic region of plasticity impacting cognate genomic architecture. We believe that Methanopyrus genomics could facilitate efficient investigation/recognition of archaeal phylogenetic diverse patterns, as well as improve understanding of biological roles and significance of these versatile microbes.
Doran, Anthony G; Berry, Donagh P; Creevey, Christopher J
Four traits related to carcass performance have been identified as economically important in beef production: carcass weight, carcass fat, carcass conformation of progeny and cull cow carcass weight. Although Holstein-Friesian cattle are primarily utilized for milk production, they are also an important source of meat for beef production and export. Because of this, there is great interest in understanding the underlying genomic structure influencing these traits. Several genome-wide association studies have identified regions of the bovine genome associated with growth or carcass traits, however, little is known about the mechanisms or underlying biological pathways involved. This study aims to detect regions of the bovine genome associated with carcass performance traits (employing a panel of 54,001 SNPs) using measures of genetic merit (as predicted transmitting abilities) for 5,705 Irish Holstein-Friesian animals. Candidate genes and biological pathways were then identified for each trait under investigation. Following adjustment for false discovery (q-value 0.5) with at least one of the four traits. In total, 557 unique bovine genes, which mapped to 426 human orthologs, were within 500kbs of QTL found associated with a trait using the Bayesian approach. Using this information, 24 significantly over-represented pathways were identified across all traits. The most significantly over-represented biological pathway was the peroxisome proliferator-activated receptor (PPAR) signaling pathway. A large number of genomic regions putatively associated with bovine carcass traits were detected using two different statistical approaches. Notably, several significant associations were detected in close proximity to genes with a known role in animal growth such as glucagon and leptin. Several biological pathways, including PPAR signaling, were shown to be involved in various aspects of bovine carcass performance. These core genes and biological processes may form the
Green, Richard E; Braun, Edward L; Armstrong, Joel; Earl, Dent; Nguyen, Ngan; Hickey, Glenn; Vandewege, Michael W; St John, John A; Capella-Gutiérrez, Salvador; Castoe, Todd A; Kern, Colin; Fujita, Matthew K; Opazo, Juan C; Jurka, Jerzy; Kojima, Kenji K; Caballero, Juan; Hubley, Robert M; Smit, Arian F; Platt, Roy N; Lavoie, Christine A; Ramakodi, Meganathan P; Finger, John W; Suh, Alexander; Isberg, Sally R; Miles, Lee; Chong, Amanda Y; Jaratlerdsiri, Weerachai; Gongora, Jaime; Moran, Christopher; Iriarte, Andrés; McCormack, John; Burgess, Shane C; Edwards, Scott V; Lyons, Eric; Williams, Christina; Breen, Matthew; Howard, Jason T; Gresham, Cathy R; Peterson, Daniel G; Schmitz, Jürgen; Pollock, David D; Haussler, David; Triplett, Eric W; Zhang, Guojie; Irie, Naoki; Jarvis, Erich D; Brochu, Christopher A; Schmidt, Carl J; McCarthy, Fiona M; Faircloth, Brant C; Hoffmann, Federico G; Glenn, Travis C; Gabaldón, Toni; Paten, Benedict; Ray, David A
To provide context for the diversification of archosaurs--the group that includes crocodilians, dinosaurs, and birds--we generated draft genomes of three crocodilians: Alligator mississippiensis (the American alligator), Crocodylus porosus (the saltwater crocodile), and Gavialis gangeticus (the Indian gharial). We observed an exceptionally slow rate of genome evolution within crocodilians at all levels, including nucleotide substitutions, indels, transposable element content and movement, gene family evolution, and chromosomal synteny. When placed within the context of related taxa including birds and turtles, this suggests that the common ancestor of all of these taxa also exhibited slow genome evolution and that the comparatively rapid evolution is derived in birds. The data also provided the opportunity to analyze heterozygosity in crocodilians, which indicates a likely reduction in population size for all three taxa through the Pleistocene. Finally, these data combined with newly published bird genomes allowed us to reconstruct the partial genome of the common ancestor of archosaurs, thereby providing a tool to investigate the genetic starting material of crocodilians, birds, and dinosaurs. Copyright © 2014, American Association for the Advancement of Science.
Xia, Jun Hong; Bai, Zhiyi; Meng, Zining; Zhang, Yong; Wang, Le; Liu, Feng; Jing, Wu; Wan, Zi Yi; Li, Jiale; Lin, Haoran; Yue, Gen Hua
Natural selection and selective breeding for genetic improvement have left detectable signatures within the genome of a species. Identification of selection signatures is important in evolutionary biology and for detecting genes that facilitate to accelerate genetic improvement. However, selection signatures, including artificial selection and natural selection, have only been identified at the whole genome level in several genetically improved fish species. Tilapia is one of the most important genetically improved fish species in the world. Using next-generation sequencing, we sequenced the genomes of 47 tilapia individuals. We identified a total of 1.43 million high-quality SNPs and found that the LD block sizes ranged from 10-100 kb in tilapia. We detected over a hundred putative selective sweep regions in each line of tilapia. Most selection signatures were located in non-coding regions of the tilapia genome. The Wnt signaling, gonadotropin-releasing hormone receptor and integrin signaling pathways were under positive selection in all improved tilapia lines. Our study provides a genome-wide map of genetic variation and selection footprints in tilapia, which could be important for genetic studies and accelerating genetic improvement of tilapia.
Full Text Available Abstract Background The size and repetitive nature of the Rhipicephalus microplus genome makes obtaining a full genome sequence fiscally and technically problematic. To selectively obtain gene-enriched regions of this tick's genome, Cot filtration was performed, and Cot-filtered DNA was sequenced via 454 FLX pyrosequencing. Results The sequenced Cot-filtered genomic DNA was assembled with an EST-based gene index of 14,586 unique entries where each EST served as a potential "seed" for scaffold formation. The new sequence assembly extended the lengths of 3,913 of the 14,586 gene index entries. Over half of the extensions corresponded to extensions of over 30 amino acids. To survey the repetitive elements in the tick genome, the complete sequences of five BAC clones were determined. Both Class I and II transposable elements were found. Comparison of the BAC and Cot filtration data indicates that Cot filtration was highly successful in filtering repetitive DNA out of the genomic DNA used in 454 sequencing. Conclusion Cot filtration is a very useful strategy to incorporate into genome sequencing projects on organisms with large genome sizes and which contain high percentages of repetitive, difficult to assemble, genomic DNA. Combining the Cot selection approach with 454 sequencing and assembly with a pre-existing EST database as seeds resulted in extensions of 27% of the members of the EST database.
Warren, Wesley C; Hillier, LaDeana W; Marshall Graves, Jennifer A; Birney, Ewan; Ponting, Chris P; Grützner, Frank; Belov, Katherine; Miller, Webb; Clarke, Laura; Chinwalla, Asif T; Yang, Shiaw-Pyng; Heger, Andreas; Locke, Devin P; Miethke, Pat; Waters, Paul D; Veyrunes, Frédéric; Fulton, Lucinda; Fulton, Bob; Graves, Tina; Wallis, John; Puente, Xose S; López-Otín, Carlos; Ordóñez, Gonzalo R; Eichler, Evan E; Chen, Lin; Cheng, Ze; Deakin, Janine E; Alsop, Amber; Thompson, Katherine; Kirby, Patrick; Papenfuss, Anthony T; Wakefield, Matthew J; Olender, Tsviya; Lancet, Doron; Huttley, Gavin A; Smit, Arian F A; Pask, Andrew; Temple-Smith, Peter; Batzer, Mark A; Walker, Jerilyn A; Konkel, Miriam K; Harris, Robert S; Whittington, Camilla M; Wong, Emily S W; Gemmell, Neil J; Buschiazzo, Emmanuel; Vargas Jentzsch, Iris M; Merkel, Angelika; Schmitz, Juergen; Zemann, Anja; Churakov, Gennady; Kriegs, Jan Ole; Brosius, Juergen; Murchison, Elizabeth P; Sachidanandam, Ravi; Smith, Carly; Hannon, Gregory J; Tsend-Ayush, Enkhjargal; McMillan, Daniel; Attenborough, Rosalind; Rens, Willem; Ferguson-Smith, Malcolm; Lefèvre, Christophe M; Sharp, Julie A; Nicholas, Kevin R; Ray, David A; Kube, Michael; Reinhardt, Richard; Pringle, Thomas H; Taylor, James; Jones, Russell C; Nixon, Brett; Dacheux, Jean-Louis; Niwa, Hitoshi; Sekita, Yoko; Huang, Xiaoqiu; Stark, Alexander; Kheradpour, Pouya; Kellis, Manolis; Flicek, Paul; Chen, Yuan; Webber, Caleb; Hardison, Ross; Nelson, Joanne; Hallsworth-Pepin, Kym; Delehaunty, Kim; Markovic, Chris; Minx, Pat; Feng, Yucheng; Kremitzki, Colin; Mitreva, Makedonka; Glasscock, Jarret; Wylie, Todd; Wohldmann, Patricia; Thiru, Prathapan; Nhan, Michael N; Pohl, Craig S; Smith, Scott M; Hou, Shunfeng; Nefedov, Mikhail; de Jong, Pieter J; Renfree, Marilyn B; Mardis, Elaine R; Wilson, Richard K
We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation.
Hori, Koichi; Maruyama, Fumito; Fujisawa, Takatomo; Togashi, Tomoaki; Yamamoto, Nozomi; Seo, Mitsunori; Sato, Syusei; Yamada, Takuji; Mori, Hiroshi; Tajima, Naoyuki; Moriyama, Takashi; Ikeuchi, Masahiko; Watanabe, Mai; Wada, Hajime; Kobayashi, Koichi; Saito, Masakazu; Masuda, Tatsuru; Sasaki-Sekimoto, Yuko; Mashiguchi, Kiyoshi; Awai, Koichiro; Shimojima, Mie; Masuda, Shinji; Iwai, Masako; Nobusawa, Takashi; Narise, Takafumi; Kondo, Satoshi; Saito, Hikaru; Sato, Ryoichi; Murakawa, Masato; Ihara, Yuta; Oshima-Yamada, Yui; Ohtaka, Kinuka; Satoh, Masanori; Sonobe, Kohei; Ishii, Midori; Ohtani, Ryosuke; Kanamori-Sato, Miyu; Honoki, Rina; Miyazaki, Daichi; Mochizuki, Hitoshi; Umetsu, Jumpei; Higashi, Kouichi; Shibata, Daisuke; Kamiya, Yuji; Sato, Naoki; Nakamura, Yasukazu; Tabata, Satoshi; Ida, Shigeru; Kurokawa, Ken; Ohta, Hiroyuki
The colonization of land by plants was a key event in the evolution of life. Here we report the draft genome sequence of the filamentous terrestrial alga Klebsormidium flaccidum (Division Charophyta, Order Klebsormidiales) to elucidate the early transition step from aquatic algae to land plants. Comparison of the genome sequence with that of other algae and land plants demonstrate that K. flaccidum acquired many genes specific to land plants. We demonstrate that K. flaccidum indeed produces several plant hormones and homologues of some of the signalling intermediates required for hormone actions in higher plants. The K. flaccidum genome also encodes a primitive system to protect against the harmful effects of high-intensity light. The presence of these plant-related systems in K. flaccidum suggests that, during evolution, this alga acquired the fundamental machinery required for adaptation to terrestrial environments.
Warren, Wesley C.; Hillier, LaDeana W.; Marshall Graves, Jennifer A.; Birney, Ewan; Ponting, Chris P.; Grützner, Frank; Belov, Katherine; Miller, Webb; Clarke, Laura; Chinwalla, Asif T.; Yang, Shiaw-Pyng; Heger, Andreas; Locke, Devin P.; Miethke, Pat; Waters, Paul D.; Veyrunes, Frédéric; Fulton, Lucinda; Fulton, Bob; Graves, Tina; Wallis, John; Puente, Xose S.; López-Otín, Carlos; Ordóñez, Gonzalo R.; Eichler, Evan E.; Chen, Lin; Cheng, Ze; Deakin, Janine E.; Alsop, Amber; Thompson, Katherine; Kirby, Patrick; Papenfuss, Anthony T.; Wakefield, Matthew J.; Olender, Tsviya; Lancet, Doron; Huttley, Gavin A.; Smit, Arian F. A.; Pask, Andrew; Temple-Smith, Peter; Batzer, Mark A.; Walker, Jerilyn A.; Konkel, Miriam K.; Harris, Robert S.; Whittington, Camilla M.; Wong, Emily S. W.; Gemmell, Neil J.; Buschiazzo, Emmanuel; Vargas Jentzsch, Iris M.; Merkel, Angelika; Schmitz, Juergen; Zemann, Anja; Churakov, Gennady; Kriegs, Jan Ole; Brosius, Juergen; Murchison, Elizabeth P.; Sachidanandam, Ravi; Smith, Carly; Hannon, Gregory J.; Tsend-Ayush, Enkhjargal; McMillan, Daniel; Attenborough, Rosalind; Rens, Willem; Ferguson-Smith, Malcolm; Lefèvre, Christophe M.; Sharp, Julie A.; Nicholas, Kevin R.; Ray, David A.; Kube, Michael; Reinhardt, Richard; Pringle, Thomas H.; Taylor, James; Jones, Russell C.; Nixon, Brett; Dacheux, Jean-Louis; Niwa, Hitoshi; Sekita, Yoko; Huang, Xiaoqiu; Stark, Alexander; Kheradpour, Pouya; Kellis, Manolis; Flicek, Paul; Chen, Yuan; Webber, Caleb; Hardison, Ross; Nelson, Joanne; Hallsworth-Pepin, Kym; Delehaunty, Kim; Markovic, Chris; Minx, Pat; Feng, Yucheng; Kremitzki, Colin; Mitreva, Makedonka; Glasscock, Jarret; Wylie, Todd; Wohldmann, Patricia; Thiru, Prathapan; Nhan, Michael N.; Pohl, Craig S.; Smith, Scott M.; Hou, Shunfeng; Renfree, Marilyn B.; Mardis, Elaine R.; Wilson, Richard K.
We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation. PMID:18464734
Hori, Koichi; Maruyama, Fumito; Fujisawa, Takatomo; Togashi, Tomoaki; Yamamoto, Nozomi; Seo, Mitsunori; Sato, Syusei; Yamada, Takuji; Mori, Hiroshi; Tajima, Naoyuki; Moriyama, Takashi; Ikeuchi, Masahiko; Watanabe, Mai; Wada, Hajime; Kobayashi, Koichi; Saito, Masakazu; Masuda, Tatsuru; Sasaki-Sekimoto, Yuko; Mashiguchi, Kiyoshi; Awai, Koichiro; Shimojima, Mie; Masuda, Shinji; Iwai, Masako; Nobusawa, Takashi; Narise, Takafumi; Kondo, Satoshi; Saito, Hikaru; Sato, Ryoichi; Murakawa, Masato; Ihara, Yuta; Oshima-Yamada, Yui; Ohtaka, Kinuka; Satoh, Masanori; Sonobe, Kohei; Ishii, Midori; Ohtani, Ryosuke; Kanamori-Sato, Miyu; Honoki, Rina; Miyazaki, Daichi; Mochizuki, Hitoshi; Umetsu, Jumpei; Higashi, Kouichi; Shibata, Daisuke; Kamiya, Yuji; Sato, Naoki; Nakamura, Yasukazu; Tabata, Satoshi; Ida, Shigeru; Kurokawa, Ken; Ohta, Hiroyuki
The colonization of land by plants was a key event in the evolution of life. Here we report the draft genome sequence of the filamentous terrestrial alga Klebsormidium flaccidum (Division Charophyta, Order Klebsormidiales) to elucidate the early transition step from aquatic algae to land plants. Comparison of the genome sequence with that of other algae and land plants demonstrate that K. flaccidum acquired many genes specific to land plants. We demonstrate that K. flaccidum indeed produces several plant hormones and homologues of some of the signalling intermediates required for hormone actions in higher plants. The K. flaccidum genome also encodes a primitive system to protect against the harmful effects of high-intensity light. The presence of these plant-related systems in K. flaccidum suggests that, during evolution, this alga acquired the fundamental machinery required for adaptation to terrestrial environments. PMID:24865297
Giang T. H. Vu
Full Text Available The C-value paradox remains incompletely resolved after >40 yr and is exemplified by 2,350-fold variation in genome sizes of flowering plants. The carnivorous Lentibulariaceae genus , displaying a 25-fold range of genome sizes, is a promising subject to study mechanisms and consequences of evolutionary genome size variation. Applying genomic, phylogenetic, and cytogenetic approaches, we uncovered bidirectional genome size evolution within the genus . The Steyerm. genome (86 Mbp has probably shrunk by retroelement silencing and deletion-biased double-strand break (DSB repair, from an ancestral size of 400 to 800 Mbp to become one of the smallest among flowering plants. The Stapf genome has expanded by whole-genome duplication (WGD and retrotransposition to 1550 Mbp. became allotetraploid after the split from the clade ∼29 Ma. A. St.-Hil. (179 Mbp, a close relative of , proved to be a recent (autotetraploid. Our analyses suggest a common ancestor of the genus a with an intermediate 1C value (400–800 Mbp and subsequent rapid genome size evolution in opposite directions. Many abundant repeats of the larger genome are absent in the smaller, casting doubt on their functionality for the organism, while recurrent WGD seems to safeguard against the loss of essential elements in the face of genome shrinkage. We cannot identify any consistent differences in habitat or life strategy that correlate with genome size changes, raising the possibility that these changes may be selectively neutral.
Jordan T Bird
Full Text Available The recently proposed candidatus order Altiarchaeales remains an uncultured archaeal lineage composed of genetically diverse, globally widespread organisms frequently observed in anoxic subsurface environments. In spite of 15 years of studies on the psychrophilic biofilm-producing Candidatus (Ca. Altiarchaeum hamiconexum and its close relatives, very little is known about the phylogenetic and functional diversity of the widespread free-living marine members of this taxon. From methanogenic sediments in the White Oak River Estuary, NC, we sequenced a single cell amplified genome (SAG, WOR_SCG_SM1, and used it to identify and refine two high-quality genomes from metagenomes, WOR_79 and WOR_86-2, from the same site in a different year. These three genomic reconstructions form a monophyletic group which also includes three previously published genomes from metagenomes from terrestrial springs and a SAG from Sakinaw Lake in a group previously designated as pMC2A384. A synapomorphic mutation in the Altiarchaeales tRNA synthetase β subunit, pheT, causes the protein to be encoded as two subunits at distant loci. Consistent with the terrestrial spring clades, our estuarine genomes contain a near-complete autotrophic metabolism, H2 or CO as potential electron donors, a reductive acetyl-CoA pathway for carbon fixation, and methylotroph-like NADP(H-dependent dehydrogenase. Phylogenies based on 16S rRNA genes and concatenated conserved proteins identify two distinct sub-clades of Altiarchaeales, Alti-1 populated by organisms from actively flowing springs, and Alti-2 which is more widespread, diverse, and not associated with visible mats. The core Alti-1 genome supports Alti-1 as adapted for the stream environment, with lipopolysaccharide production capacity, extracellular hami structures. The core Alti-2 genome members of this clade are free-living, with distinct mechanisms for energy maintenance, motility, osmoregulation, and sulfur redox reactions. These
de Manuel, Marc; Kuhlwilm, Martin; Frandsen, Peter
Our closest living relatives, chimpanzees and bonobos, have a complex demographic history. We analyzed the high-coverage whole genomes of 75 wild-born chimpanzees and bonobos from 10 countries in Africa. We found that chimpanzee population substructure makes genetic information a good predictor o...
Westers, Helga; Dorenbos, Ronald; Dijl, Jan Maarten van; Kabel, Jorrit; Flanagan, Tony; Devine, Kevin M.; Jude, Florence; Séror, Simone J.; Beekman, Aäron C.; Darmon, Elise; Eschevins, Caroline; Jong, Anne de; Bron, Sierd; Kuipers, Oscar P.; Albertini, Alessandra M.; Antelmann, Haike; Hecker, Michael; Zamboni, Nicola; Sauer, Uwe; Bruand, Claude; Ehrlich, Dusko S.; Alonso, Juan C.; Salas, Margarita; Quax, Wim J.
Bacterial genomes contain 250 to 500 essential genes, as suggested by single gene disruptions and theoretical considerations. If this view is correct, the remaining nonessential genes of an organism, such as Bacillus subtilis, have been acquired during evolution in its perpetually changing
Rasmussen, Morten; Guo, Xiaosen; Wang, Yong
We present an Aboriginal Australian genomic sequence obtained from a 100-year-old lock of hair donated by an Aboriginal man from southern Western Australia in the early 20th century. We detect no evidence of European admixture and estimate contamination levels to be below 0.5%. We show that Abori...
Domain organization of all the predicted RTKs was verified using SMART and found to be similar to the general RTK domain organization, which includes extracellular domain (ECD)/TM domain ... magnipapillata genome, showed that HTK30 lies in a ..... Gu J and Gu X 2003 Natural history and functional divergence of.
Höglund, Johanna Karolina; Guldbrandtsen, B; Su, G
Data from the joint Nordic breeding value prediction for Danish and Swedish Holstein grandsire families were used to locate quantitative trait loci (QTL) for female fertility traits in Danish and Swedish Holstein cattle. Up to 36 Holstein grandsires with over 2,000 sons were genotyped for 416 mic...... for QTL segregating on Bos taurus chromosome (BTA)1, BTA7, BTA10, and BTA26. On each of these chromosomes, several QTL were detected affecting more than one of the fertility traits investigated in this study. Evidence for segregation of additional QTL on BTA2, BTA9, and BTA24 was found...
Full Text Available Tenderness is one of the most important properties of meat quality, which is influenced by genetic and environmental factors. As an intensively studied epigenetic marker, histone methylation, occurring on arginine and lysine residues, has pivotal regulatory functions on gene expression. To examine whether histone methylation involves in beef tenderness variation, we analyzed the transcriptome and H3K4me3 enrichment profiles of muscle strips obtained from the longissimus dorsi (LD of Angus steers previously classify to the tender or tough group. We first plotted a global bovine H3K4me3 map on chromosomes and called peak-enriched regions and genes. We found that majorities of H3K4me3 on genes were occupying the first intron and intergenic regions and its maps displayed similar patterns in tender and tough groups, with high H3K4me3 enrichment surrounding the transcription start site (TSS. We also explored the relationship of H3K4me3 and gene expression. The results showed that H3K4me3 enrichment is highly positively correlated with gene expression across the whole genome. Cluster analysis results confirmed the relationship of H3K4me3 enrichment and gene expression. By using a pathway-based approach in genes with H3K4me3 enrichment in promoter regions from the tender cluster, we revealed that those genes involved in the development of different tissues-connective tissue, skeletal and muscular system and functional tissues-; while in tough group those genes engaged in cell death, lipid metabolism and small molecule biochemistry. The results from this study provide a deep insight into understanding of the mechanisms of epigenetic regulations in meat quality and beef tenderness.
Full Text Available Abstract Background The genome-wide association study (GWAS is a useful approach to identify genes affecting economically important traits in dairy cattle. Here, we report the results from a GWAS based on high-density SNP genotype data and estimated breeding values for nine production, fertility, body conformation, udder health and workability traits in the Brown Swiss cattle population that is part of the international genomic evaluation program. Result GWASs were performed using 50 k SNP chip data and deregressed estimated breeding values (DEBVs for nine traits from between 2061 and 5043 bulls that were part of the international genomic evaluation program coordinated by Interbull Center. The nine traits were milk yield (MY, fat yield (FY, protein yield (PY, lactating cow’s ability to recycle after calving (CRC, angularity (ANG, body depth (BDE, stature (STA, milk somatic cell score (SCS and milk speed (MSP. Analyses were performed using a linear mixed model correcting for population confounding. A total of 74 SNPs were detected to be genome-wide significantly associated with one or several of the nine analyzed traits. The strongest signal was identified on chromosome 25 for milk production traits, stature and body depth. Other signals were on chromosome 11 for angularity, chromosome 24 for somatic cell score, and chromosome 6 for milking speed. Some signals overlapped with earlier reported QTL for similar traits in other cattle populations and were located close to interesting candidate genes worthy of further investigations. Conclusions Our study shows that international genetic evaluation data is a useful resource for identifying genetic factors influencing complex traits in livestock. Several genome wide significant association signals could be identified in the Brown Swiss population, including a major signal on BTA25. Our findings report several associations and plausible candidate genes that deserve further exploration in other
Gallego Llorente, M.
Characterizing genetic diversity in Africa is a crucial step for most analyses reconstructing the evolutionary history of anatomically modern humans. However, historic migrations from Eurasia into Africa have affected many contemporary populations, confounding inferences. Here, we present a 12.5×coverage ancient genome of an Ethiopian male ("Mota") who lived approximately 4500 years ago. We use this genome to demonstrate that the Eurasian backflow into Africa came from a population closely related to Early Neolithic farmers, who had colonized Europe 4000 years earlier. The extent of this backflow was much greater than previously reported, reaching all the way to Central, West, and Southern Africa, affecting even populations such as Yoruba and Mbuti, previously thought to be relatively unadmixed, who harbor 6 to 7% Eurasian ancestry.
Kamath, Pauline L.; Foster, Jeffrey T.; Drees, Kevin P.; Luikart, Gordon; Quance, Christine; Anderson, Neil J.; Clarke, P. Ryan; Cole, Eric K.; Drew, Mark L.; Edwards, William H.; Rhyan, Jack C.; Treanor, John J.; Wallen, Rick L.; White, Patrick J.; Robbe-Austerman, Suelee; Cross, Paul C.
Whole-genome sequencing has provided fundamental insights into infectious disease epidemiology, but has rarely been used for examining transmission dynamics of a bacterial pathogen in wildlife. In the Greater Yellowstone Ecosystem (GYE), outbreaks of brucellosis have increased in cattle along with rising seroprevalence in elk. Here we use a genomic approach to examine Brucella abortus evolution, cross-species transmission and spatial spread in the GYE. We find that brucellosis was introduced into wildlife in this region at least five times. The diffusion rate varies among Brucella lineages (B3 to 8 km per year) and over time. We also estimate 12 host transitions from bison to elk, and 5 from elk to bison. Our results support the notion that free-ranging elk are currently a self-sustaining brucellosis reservoir and the source of livestock infections, and that control measures in bison are unlikely to affect the dynamics of unrelated strains circulating in nearby elk populations.
Full Text Available Wolbachia are common endosymbionts of terrestrial arthropods, and are also found in nematodes: the animal-parasitic filaria, and the plant-parasite Radopholus similis. Lateral transfer of Wolbachia DNA to the host genome is common. We generated a draft genome sequence for the strongyloidean nematode parasite Dictyocaulus viviparus, the cattle lungworm. In the assembly, we identified nearly 1 Mb of sequence with similarity to Wolbachia. The fragments were unlikely to derive from a live Wolbachia infection: most were short, and the genes were disabled through inactivating mutations. Many fragments were co-assembled with definitively nematode-derived sequence. We found limited evidence of expression of the Wolbachia-derived genes. The D. viviparus Wolbachia genes were most similar to filarial strains and strains from the host-promiscuous clade F. We conclude that D. viviparus was infected by Wolbachia in the past, and that clade F-like symbionts may have been the source of filarial Wolbachia infections.
Oborník, Miroslav; Kořený, Luděk
Roč. 492, č. 7427 (2012), s. 59-65 ISSN 0028-0836 Institutional support: RVO:60077344 Keywords : GENE-TRANSFER * BIGELOWIELLA-NATANS * EUKARYOTIC GENOMES * GUILLARDIA-THETA * NUCLEUS * CHLORARACHNIOPHYTE * PROTEINS * SEQUENCE * ORIGIN * CRYPTOPHYTES Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 38.597, year: 2012 http://www.nature.com/nature/journal/v492/n7427/full/nature11681.html
Authors: Denise Chroscinski, Darryl Sampey, Alex Hewitt, The Reproducibility Project: Cancer Biology† ### Abstract The [Reproducibility Project: Cancer Biology](https://osf.io/e81xl/wiki/home/) seeks to address growing concerns about reproducibility in scientific research by conducting replications of 50 papers in the field of cancer biology published between 2010 and 2012. This Registered Report describes the proposed replication plan of key experiments from “Melanoma genome sequenci...
Jones, Eppie R.
We extend the scope of European palaeogenomics by sequencing the genomes of Late Upper Palaeolithic (13,300 years old, 1.4-fold coverage) and Mesolithic (9,700 years old, 15.4-fold) males from western Georgia in the Caucasus and a Late Upper Palaeolithic (13,700 years old, 9.5-fold) male from Switzerland. While we detect Late Palaeolithic–Mesolithic genomic continuity in both regions, we find that Caucasus hunter-gatherers (CHG) belong to a distinct ancient clade that split from western hunter-gatherers ~45 kya, shortly after the expansion of anatomically modern humans into Europe and from the ancestors of Neolithic farmers ~25 kya, around the Last Glacial Maximum. CHG genomes significantly contributed to the Yamnaya steppe herders who migrated into Europe ~3,000 BC, supporting a formative Caucasus influence on this important Early Bronze age culture. CHG left their imprint on modern populations from the Caucasus and also central and south Asia possibly marking the arrival of Indo-Aryan languages.
Jones, Eppie R; Gonzalez-Fortes, Gloria; Connell, Sarah; Siska, Veronika; Eriksson, Anders; Martiniano, Rui; McLaughlin, Russell L; Gallego Llorente, Marcos; Cassidy, Lara M; Gamba, Cristina; Meshveliani, Tengiz; Bar-Yosef, Ofer; Müller, Werner; Belfer-Cohen, Anna; Matskevich, Zinovi; Jakeli, Nino; Higham, Thomas F G; Currat, Mathias; Lordkipanidze, David; Hofreiter, Michael; Manica, Andrea; Pinhasi, Ron; Bradley, Daniel G
We extend the scope of European palaeogenomics by sequencing the genomes of Late Upper Palaeolithic (13,300 years old, 1.4-fold coverage) and Mesolithic (9,700 years old, 15.4-fold) males from western Georgia in the Caucasus and a Late Upper Palaeolithic (13,700 years old, 9.5-fold) male from Switzerland. While we detect Late Palaeolithic-Mesolithic genomic continuity in both regions, we find that Caucasus hunter-gatherers (CHG) belong to a distinct ancient clade that split from western hunter-gatherers ∼45 kya, shortly after the expansion of anatomically modern humans into Europe and from the ancestors of Neolithic farmers ∼25 kya, around the Last Glacial Maximum. CHG genomes significantly contributed to the Yamnaya steppe herders who migrated into Europe ∼3,000 BC, supporting a formative Caucasus influence on this important Early Bronze age culture. CHG left their imprint on modern populations from the Caucasus and also central and south Asia possibly marking the arrival of Indo-Aryan languages.
Yoshida, Masa-aki; Ishikura, Yukiko; Moritaki, Takeya; Shoguchi, Eiichi; Shimizu, Kentaro K; Sese, Jun; Ogura, Atsushi
Comparative genome structure analysis allows us to identify novel genes, repetitive sequences and gene duplications. To explore lineage-specific genomic changes of the molluscs that is good model for development of nervous system in invertebrate, we conducted comparative genome structure analyses of three molluscs, pygmy squid, nautilus and scallops using partial genome shotgun sequencing. Most effective elements on the genome structural changes are repetitive elements (REs) causing expansion of genome size and whole genome duplication producing large amount of novel functional genes. Therefore, we investigated variation and proportion of REs and whole genome duplication. We, first, identified variations of REs in the three molluscan genomes by homology-based and de novo RE detection. Proportion of REs were 9.2%, 4.0%, and 3.8% in the pygmy squid, nautilus and scallop, respectively. We, then, estimated genome size of the species as 2.1, 4.2 and 1.8 Gb, respectively, with 2× coverage frequency and DNA sequencing theory. We also performed a gene duplication assay based on coding genes, and found that large-scale duplication events occurred after divergence from the limpet Lottia, an out-group of the three molluscan species. Comparison of all the results suggested that RE expansion did not relate to the increase in genome size of nautilus. Despite close relationships to nautilus, the squid has the largest portion of REs and smaller genome size than nautilus. We also identified lineage-specific RE and gene-family expansions, possibly relate to acquisition of the most complicated eye and brain systems in the three species. Copyright © 2011 Elsevier B.V. All rights reserved.
Kaas, Christian Schrøder; Kristensen, Claus; Betenbaugh, Michael J.
in eight additional analyzed CHO genomes (15-20% haploidy) but not in the genome of the Chinese hamster. The dhfr gene is confirmed to be haploid in CHO DXB11; transcriptionally active and the remaining allele contains a G410C point mutation causing a Thr137Arg missense mutation. We find similar to 2.......5 million single nucleotide polymorphisms (SNP's), 44 gene deletions in the CHO DXB11 genome and 9357 SNP's, which interfere with the coding regions of 3458 genes. Copy number variations for nine CHO genomes were mapped to the chromosomes of the Chinese hamster showing unique signatures for each chromosome...
Derzelle, Sylviane; Girault, Guillaume; Kokotovic, Branko
of such lineage in Europe is demonstrated for the first time, filling an historical gap within the phylogeography of the lineage. Comparative genome analyses of these strains with 41 isolates from other parts of the world revealed that the two Danish A.Br.008/011 strains were related to the heroin...
Full Text Available Analysis of genomic data is increasingly becoming part of the livestock industry. Therefore the routine collection of genomic information would be an invaluable resource for management of breeding programs in small, endangered populations. The objectives of this project were to analyse 1. linkage disequlibrium decay and the effective population size; 2. Inbreeding level and effective population size (NeROH based on runs of homozygosity (ROH; 3. Prediction of genomic breeding values (GEBV within and across breeds. In addition, the use of genomic information for breed management is discussed. The study was based on all available genotypes of Tyrol Grey AI bulls. ROHs were derived based on regions covering at least 4 Mb, 8 Mb and 16 Mb regions, with the corresponding mean inbreeding coefficients 4.0%, 2.9% and 1.6%, respectively. The NeROH was 125 (NeROH>16Mb, 186 (NeROH>8Mb and 370 (NeROH>4Mb, indicating strict avoidance of close inbreeding in the population.The genomic selection was developed for and is working well in large breeds. Contrary to the expectations, the accuracy of GEBVs with very small within breed reference populations were very high, between 0.13-0.91 and 0.12-0.63, when EBVs and dEBVs were used as pseudo-phenotypes, respectively. Subsequent analyses confirmed the high accuracies being heavily influenced by parent averages. Multi-breed and across breed reference sets gave inconsistent and lower accuracies. Genomic information may have a crucial role in management of small breeds. It allows to assess relatedness between individuals, trends in inbreeding and to take decisions accordingly. These decisions would be based on the real genome architecture, rather than conventional pedigree information, which can be missing or incomplete. We strongly suggest the routine genotyping of all individuals that belong to a small breed in order to facilitate the effective management of endangered livestock populations.
Xu, Shuqing; Brockmöller, Thomas; Navarro-Quezada, Aura; Kuhl, Heiner; Gase, Klaus; Ling, Zhihao; Zhou, Wenwu; Kreitzer, Christoph; Stanke, Mario; Tang, Haibao; Lyons, Eric; Pandey, Priyanka; Pandey, Shree P; Timmermann, Bernd; Gaquerel, Emmanuel; Baldwin, Ian T
Nicotine, the signature alkaloid of Nicotiana species responsible for the addictive properties of human tobacco smoking, functions as a defensive neurotoxin against attacking herbivores. However, the evolution of the genetic features that contributed to the assembly of the nicotine biosynthetic pathway remains unknown. We sequenced and assembled genomes of two wild tobaccos, Nicotiana attenuata (2.5 Gb) and Nicotiana obtusifolia (1.5 Gb), two ecological models for investigating adaptive traits in nature. We show that after the Solanaceae whole-genome triplication event, a repertoire of rapidly expanding transposable elements (TEs) bloated these Nicotiana genomes, promoted expression divergences among duplicated genes, and contributed to the evolution of herbivory-induced signaling and defenses, including nicotine biosynthesis. The biosynthetic machinery that allows for nicotine synthesis in the roots evolved from the stepwise duplications of two ancient primary metabolic pathways: the polyamine and nicotinamide adenine dinucleotide (NAD) pathways. In contrast to the duplication of the polyamine pathway that is shared among several solanaceous genera producing polyamine-derived tropane alkaloids, we found that lineage-specific duplications within the NAD pathway and the evolution of root-specific expression of the duplicated Solanaceae-specific ethylene response factor that activates the expression of all nicotine biosynthetic genes resulted in the innovative and efficient production of nicotine in the genus Nicotiana Transcription factor binding motifs derived from TEs may have contributed to the coexpression of nicotine biosynthetic pathway genes and coordinated the metabolic flux. Together, these results provide evidence that TEs and gene duplications facilitated the emergence of a key metabolic innovation relevant to plant fitness.
Keeling Patrick J
Full Text Available Abstract Background Dinoflagellates comprise an ecologically significant and diverse eukaryotic phylum that is sister to the phylum containing apicomplexan endoparasites. The mitochondrial genome of apicomplexans is uniquely reduced in gene content and size, encoding only three proteins and two ribosomal RNAs (rRNAs within a highly compacted 6 kb DNA. Dinoflagellate mitochondrial genomes have been comparatively poorly studied: limited available data suggest some similarities with apicomplexan mitochondrial genomes but an even more radical type of genomic organization. Here, we investigate structure, content and expression of dinoflagellate mitochondrial genomes. Results From two dinoflagellates, Crypthecodinium cohnii and Karlodinium micrum, we generated over 42 kb of mitochondrial genomic data that indicate a reduced gene content paralleling that of mitochondrial genomes in apicomplexans, i.e., only three protein-encoding genes and at least eight conserved components of the highly fragmented large and small subunit rRNAs. Unlike in apicomplexans, dinoflagellate mitochondrial genes occur in multiple copies, often as gene fragments, and in numerous genomic contexts. Analysis of cDNAs suggests several novel aspects of dinoflagellate mitochondrial gene expression. Polycistronic transcripts were found, standard start codons are absent, and oligoadenylation occurs upstream of stop codons, resulting in the absence of termination codons. Transcripts of at least one gene, cox3, are apparently trans-spliced to generate full-length mRNAs. RNA substitutional editing, a process previously identified for mRNAs in dinoflagellate mitochondria, is also implicated in rRNA expression. Conclusion The dinoflagellate mitochondrial genome shares the same gene complement and fragmentation of rRNA genes with its apicomplexan counterpart. However, it also exhibits several unique characteristics. Most notable are the expansion of gene copy numbers and their arrangements
Full Text Available Dominance may be an important source of non-additive genetic variance for many traits of dairy cattle. However, nearly all prediction models for dairy cattle have included only additive effects because of the limited number of cows with both genotypes and phenotypes. The role of dominance in the Holstein and Jersey breeds was investigated for eight traits: milk, fat, and protein yields; productive life; daughter pregnancy rate; somatic cell score; fat percent and protein percent. Additive and dominance variance components were estimated and then used to estimate additive and dominance effects of single nucleotide polymorphisms (SNPs. The predictive abilities of three models with both additive and dominance effects and a model with additive effects only were assessed using ten-fold cross-validation. One procedure estimated dominance values, and another estimated dominance deviations; calculation of the dominance relationship matrix was different for the two methods. The third approach enlarged the dataset by including cows with genotype probabilities derived using genotyped ancestors. For yield traits, dominance variance accounted for 5 and 7% of total variance for Holsteins and Jerseys, respectively; using dominance deviations resulted in smaller dominance and larger additive variance estimates. For non-yield traits, dominance variances were very small for both breeds. For yield traits, including additive and dominance effects fit the data better than including only additive effects; average correlations between estimated genetic effects and phenotypes showed that prediction accuracy increased when both effects rather than just additive effects were included. No corresponding gains in prediction ability were found for non-yield traits. Including cows with derived genotype probabilities from genotyped ancestors did not improve prediction accuracy. The largest additive effects were located on chromosome 14 near DGAT1 for yield traits for both
The largest and most comprehensive genomic analysis of individuals with Ewing sarcoma performed to date reveals that some patients are genetically predisposed to developing the cancer. Learn more...
Anderson, Iain; Ulrich, Luke E.; Lupa, Boguslaw; Susanti, Dwi; Porat, Iris; Hooper, Sean D.; Lykidis, Athanasios; Sieprawska-Lupa, Magdalena; Dharmarajan, Lakshmi; Goltsman, Eugene; Lapidus, Alla; Saunders, Elizabeth; Han, Cliff; Land, Miriam; Lucas, Susan; Mukhopadhyay, Biswarup; Whitman, William B.; Woese, Carl; Bristow, James; Kyrpides, Nikos
Methanomicrobiales is the least studied order of methanogens. While these organisms appear to be more closely related to the Methanosarcinales in ribosomal-based phylogenetic analyses, they are metabolically more similar to Class I methanogens. In order to improve our understanding of this lineage, we have completely sequenced the genomes of two members of this order, Methanocorpusculum labreanum Z and Methanoculleus marisnigri JR1, and compared them with the genome of a third, Methanospirillum hungatei JF-1. Similar to Class I methanogens, Methanomicrobiales use a partial reductive citric acid cycle for 2-oxoglutarate biosynthesis, and they have the Eha energy-converting hydrogenase. In common with Methanosarcinales, Methanomicrobiales possess the Ech hydrogenase and at least some of them may couple formylmethanofuran formation and heterodisulfide reduction to transmembrane ion gradients. Uniquely, M. labreanum and M. hungatei contain hydrogenases similar to the Pyrococcus furiosus Mbh hydrogenase, and all three Methanomicrobiales have anti-sigma factor and anti-anti-sigma factor regulatory proteins not found in other methanogens. Phylogenetic analysis based on seven core proteins of methanogenesis and cofactor biosynthesis places the Methanomicrobiales equidistant from Class I methanogens and Methanosarcinales. Our results indicate that Methanomicrobiales, rather than being similar to Class I methanogens or Methanomicrobiales, share some features of both and have some unique properties. We find that there are three distinct classes of methanogens: the Class I methanogens, the Methanomicrobiales (Class II), and the Methanosarcinales (Class III).
Using reduced representation bisulphite sequencing (RRBS), we obtained the first single-base-resolution maps of bovine DNA methylation in ten somatic tissues. In total, we observed 1,868,049 cytosines in the CG-enriched regions. Similar to the methylation patterns in other species, the CG context wa...
Thomasen, Jørn Rind; Sørensen, Anders Christian; Su, Guosheng
The main purpose of this study is to evaluate whether the population structure in Danish Jersey known from the history of the breed also is reflected in the markers. This is done by comparing the linkage disequilibrium and persistence of phase for subgroups of Jersey animals with high proportions...... of Danish or US origin. Furthermore, it is investigated whether a model explicitly incorporating breed origin of animals, inferred either through the known pedigree or from SNP marker data, leads to improved genomic predictions compared to a model ignoring breed origin. The study of the population structure...... origin were analyzed and compared to a basic genomic model that assumes a homogeneous breed structure. The main finding in this study is that the importation of germ plasma from the US Jersey population is readily reflected in the genomes of modern Danish Jersey animals. Firstly, linkage disequilibrium...
Tian, Xinpeng; Zhang, Zhewen; Yang, Tingting; Chen, Meili; Li, Jie; Chen, Fei; Yang, Jin; Li, Wenjie; Zhang, Bing; Zhang, Zhang; Wu, Jiayan; Zhang, Changsheng; Long, Lijuan; Xiao, Jingfa
Over 200 genomes of streptomycete strains that were isolated from various environments are available from the NCBI. However, little is known about the characteristics that are linked to marine adaptation in marine-derived streptomycetes. The particularity and complexity of the marine environment suggest that marine streptomycetes are genetically diverse. Here, we sequenced nine strains from the Streptomyces genus that were isolated from different longitudes, latitudes, and depths of the South China Sea. Then we compared these strains to 22 NCBI downloaded streptomycete strains. Thirty-one streptomycete strains are clearly grouped into a marine-derived subgroup and multiple source subgroup-based phylogenetic tree. The phylogenetic analyses have revealed the dynamic process underlying streptomycete genome evolution, and lateral gene transfer is an important driving force during the process. Pan-genomics analyses have revealed that streptomycetes have an open pan-genome, which reflects the diversity of these streptomycetes and guarantees the species a quick and economical response to diverse environments. Functional and comparative genomics analyses indicate that the marine-derived streptomycetes subgroup possesses some common characteristics of marine adaptation. Our findings have expanded our knowledge of how ocean isolates of streptomycete strains adapt to marine environments. The availability of streptomycete genomes from the South China Sea will be beneficial for further analysis on marine streptomycetes and will enrich the South China Sea’s genetic data sources. PMID:27446038
Kijas, James W.; Lenstra, Johannes A.; Hayes, Ben; Boitard, Simon; Porto Neto, Laercio R.; San Cristobal, Magali; Servin, Bertrand; McCulloch, Russell; Whan, Vicki; Gietzen, Kimberly; Paiva, Samuel; Barendse, William; Ciani, Elena; Raadsma, Herman; McEwan, John; Dalrymple, Brian
Through their domestication and subsequent selection, sheep have been adapted to thrive in a diverse range of environments. To characterise the genetic consequence of both domestication and selection, we genotyped 49,034 SNP in 2,819 animals from a diverse collection of 74 sheep breeds. We find the majority of sheep populations contain high SNP diversity and have retained an effective population size much higher than most cattle or dog breeds, suggesting domestication occurred from a broad genetic base. Extensive haplotype sharing and generally low divergence time between breeds reveal frequent genetic exchange has occurred during the development of modern breeds. A scan of the genome for selection signals revealed 31 regions containing genes for coat pigmentation, skeletal morphology, body size, growth, and reproduction. We demonstrate the strongest selection signal has occurred in response to breeding for the absence of horns. The high density map of genetic variability provides an in-depth view of the genetic history for this important livestock species. PMID:22346734
James W Kijas
Full Text Available Through their domestication and subsequent selection, sheep have been adapted to thrive in a diverse range of environments. To characterise the genetic consequence of both domestication and selection, we genotyped 49,034 SNP in 2,819 animals from a diverse collection of 74 sheep breeds. We find the majority of sheep populations contain high SNP diversity and have retained an effective population size much higher than most cattle or dog breeds, suggesting domestication occurred from a broad genetic base. Extensive haplotype sharing and generally low divergence time between breeds reveal frequent genetic exchange has occurred during the development of modern breeds. A scan of the genome for selection signals revealed 31 regions containing genes for coat pigmentation, skeletal morphology, body size, growth, and reproduction. We demonstrate the strongest selection signal has occurred in response to breeding for the absence of horns. The high density map of genetic variability provides an in-depth view of the genetic history for this important livestock species.
Botigué, Laura R.; Song, Shiya; Scheu, Amelie; Gopalan, Shyamalika; Pendleton, Amanda L.; Oetjens, Matthew; Taravella, Angela M.; Seregély, Timo; Zeeb-Lanz, Andrea; Arbogast, Rose-Marie; Bobo, Dean; Daly, Kevin; Unterländer, Martina; Burger, Joachim; Kidd, Jeffrey M.; Veeramah, Krishna R.
Europe has played a major role in dog evolution, harbouring the oldest uncontested Palaeolithic remains and having been the centre of modern dog breed creation. Here we sequence the genomes of an Early and End Neolithic dog from Germany, including a sample associated with an early European farming community. Both dogs demonstrate continuity with each other and predominantly share ancestry with modern European dogs, contradicting a previously suggested Late Neolithic population replacement. We find no genetic evidence to support the recent hypothesis proposing dual origins of dog domestication. By calibrating the mutation rate using our oldest dog, we narrow the timing of dog domestication to 20,000–40,000 years ago. Interestingly, we do not observe the extreme copy number expansion of the AMY2B gene characteristic of modern dogs that has previously been proposed as an adaptation to a starch-rich diet driven by the widespread adoption of agriculture in the Neolithic. PMID:28719574
Stafuzza, Nedenia Bonvino; Zerlotini, Adhemar; Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Dan?sio Prado; Garrick, Dorian J.; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto
Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of whi...
Spek, van der D.; Arendonk, van J.A.M.; Bovenhuis, H.
Performing a genome-wide association study (GWAS) might add to a better understanding of the development of claw disorders and the need for trimming. Therefore, the aim of the current study was to perform a GWAS on claw disorders and trimming status and to validate the results for claw disorders
Salmans, Michael Lee; Chaw, Shu-Miaw; Lin, Ching-Ping; Shih, Arthur Chun-Chieh; Wu, Yu-Wei; Mulligan, R. Michael
Sequence analysis of organelle genomes and comprehensive analysis of C-to-U editing sites from flowering and non-flowering plants have provided extensive sequence information from diverse taxa. This study includes the first comprehensive analysis of RNA editing sites from a gymnosperm mitochondrial genome, and utilizes informatics analyses to determine conserved features in the RNA sequence context around editing sites. We have identified 565 editing sites in 21 full-length and 4 partial cDNA...
Jun, Se-Ran; Wassenaar, Trudy M; Nookaew, Intawat; Hauser, Loren; Wanchai, Visanu; Land, Miriam; Timm, Collin M; Lu, Tse-Yuan S; Schadt, Christopher W; Doktycz, Mitchel J; Pelletier, Dale A; Ussery, David W
The Pseudomonas genus contains a metabolically versatile group of organisms that are known to occupy numerous ecological niches, including the rhizosphere and endosphere of many plants. Their diversity influences the phylogenetic diversity and heterogeneity of these communities. On the basis of average amino acid identity, comparative genome analysis of >1,000 Pseudomonas genomes, including 21 Pseudomonas strains isolated from the roots of native Populus deltoides (eastern cottonwood) trees resulted in consistent and robust genomic clusters with phylogenetic homogeneity. All Pseudomonas aeruginosa genomes clustered together, and these were clearly distinct from other Pseudomonas species groups on the basis of pangenome and core genome analyses. In contrast, the genomes of Pseudomonas fluorescens were organized into 20 distinct genomic clusters, representing enormous diversity and heterogeneity. Most of our 21 Populus-associated isolates formed three distinct subgroups within the major P. fluorescens group, supported by pathway profile analysis, while two isolates were more closely related to Pseudomonas chlororaphis and Pseudomonas putida. Genes specific to Populus-associated subgroups were identified. Genes specific to subgroup 1 include several sensory systems that act in two-component signal transduction, a TonB-dependent receptor, and a phosphorelay sensor. Genes specific to subgroup 2 contain hypothetical genes, and genes specific to subgroup 3 were annotated with hydrolase activity. This study justifies the need to sequence multiple isolates, especially from P. fluorescens, which displays the most genetic variation, in order to study functional capabilities from a pangenomic perspective. This information will prove useful when choosing Pseudomonas strains for use to promote growth and increase disease resistance in plants. Copyright © 2015 Jun et al.
C. G. Dang
Full Text Available Significant SNPs associated with Warner-Bratzler (WB shear force and sensory traits were confirmed for Hanwoo beef (Korean cattle. A Bonferroni-corrected genome-wide significant association (p<1.3×10−6 was detected with only one single nucleotide polymorphism (SNP on chromosome 5 for WB shear force. A slightly higher number of SNPs was significantly (p<0.001 associated with WB shear force than with other sensory traits. Further, 50, 25, 29, and 34 SNPs were significantly associated with WB shear force, tenderness, juiciness, and flavor likeness, respectively. The SNPs between p = 0.001 and p = 0.0001 thresholds explained 3% to 9% of the phenotypic variance, while the most significant SNPs accounted for 7% to 12% of the phenotypic variance. In conclusion, because WB shear force and sensory evaluation were moderately affected by a few loci and minimally affected by other loci, further studies are required by using a large sample size and high marker density.
Athanasiadis, Georgios; Cheng, Jade Y; Vilhjálmsson, Bjarni J; Jørgensen, Frank G; Als, Thomas D; Le Hellard, Stephanie; Espeseth, Thomas; Sullivan, Patrick F; Hultman, Christina M; Kjærgaard, Peter C; Schierup, Mikkel H; Mailund, Thomas
Denmark has played a substantial role in the history of Northern Europe. Through a nationwide scientific outreach initiative, we collected genetic and anthropometrical data from ∼800 high school students and used them to elucidate the genetic makeup of the Danish population, as well as to assess polygenic predictions of phenotypic traits in adolescents. We observed remarkable homogeneity across different geographic regions, although we could still detect weak signals of genetic structure reflecting the history of the country. Denmark presented genomic affinity with primarily neighboring countries with overall resemblance of decreasing weight from Britain, Sweden, Norway, Germany, and France. A Polish admixture signal was detected in Zealand and Funen, and our date estimates coincided with historical evidence of Wend settlements in the south of Denmark. We also observed considerably diverse demographic histories among Scandinavian countries, with Denmark having the smallest current effective population size compared to Norway and Sweden. Finally, we found that polygenic prediction of self-reported adolescent height in the population was remarkably accurate (R 2 = 0.639 ± 0.015). The high homogeneity of the Danish population could render population structure a lesser concern for the upcoming large-scale gene-mapping studies in the country. Copyright © 2016 by the Genetics Society of America.
Biankin, Andrew V; Waddell, Nicola; Kassahn, Karin S; Gingras, Marie-Claude; Muthuswamy, Lakshmi B; Johns, Amber L; Miller, David K; Wilson, Peter J; Patch, Ann-Marie; Wu, Jianmin; Chang, David K; Cowley, Mark J; Gardiner, Brooke B; Song, Sarah; Harliwong, Ivon; Idrisoglu, Senel; Nourse, Craig; Nourbakhsh, Ehsan; Manning, Suzanne; Wani, Shivangi; Gongora, Milena; Pajic, Marina; Scarlett, Christopher J; Gill, Anthony J; Pinho, Andreia V; Rooman, Ilse; Anderson, Matthew; Holmes, Oliver; Leonard, Conrad; Taylor, Darrin; Wood, Scott; Xu, Qinying; Nones, Katia; Fink, J Lynn; Christ, Angelika; Bruxner, Tim; Cloonan, Nicole; Kolle, Gabriel; Newell, Felicity; Pinese, Mark; Mead, R Scott; Humphris, Jeremy L; Kaplan, Warren; Jones, Marc D; Colvin, Emily K; Nagrial, Adnan M; Humphrey, Emily S; Chou, Angela; Chin, Venessa T; Chantrill, Lorraine A; Mawson, Amanda; Samra, Jaswinder S; Kench, James G; Lovell, Jessica A; Daly, Roger J; Merrett, Neil D; Toon, Christopher; Epari, Krishna; Nguyen, Nam Q; Barbour, Andrew; Zeps, Nikolajs; Kakkar, Nipun; Zhao, Fengmei; Wu, Yuan Qing; Wang, Min; Muzny, Donna M; Fisher, William E; Brunicardi, F Charles; Hodges, Sally E; Reid, Jeffrey G; Drummond, Jennifer; Chang, Kyle; Han, Yi; Lewis, Lora R; Dinh, Huyen; Buhay, Christian J; Beck, Timothy; Timms, Lee; Sam, Michelle; Begley, Kimberly; Brown, Andrew; Pai, Deepa; Panchal, Ami; Buchner, Nicholas; De Borja, Richard; Denroche, Robert E; Yung, Christina K; Serra, Stefano; Onetto, Nicole; Mukhopadhyay, Debabrata; Tsao, Ming-Sound; Shaw, Patricia A; Petersen, Gloria M; Gallinger, Steven; Hruban, Ralph H; Maitra, Anirban; Iacobuzio-Donahue, Christine A; Schulick, Richard D; Wolfgang, Christopher L; Morgan, Richard A; Lawlor, Rita T; Capelli, Paola; Corbo, Vincenzo; Scardoni, Maria; Tortora, Giampaolo; Tempero, Margaret A; Mann, Karen M; Jenkins, Nancy A; Perez-Mancera, Pedro A; Adams, David J; Largaespada, David A; Wessels, Lodewyk F A; Rust, Alistair G; Stein, Lincoln D; Tuveson, David A; Copeland, Neal G; Musgrove, Elizabeth A; Scarpa, Aldo; Eshleman, James R; Hudson, Thomas J; Sutherland, Robert L; Wheeler, David A; Pearson, John V; McPherson, John D; Gibbs, Richard A; Grimmond, Sean M
Pancreatic cancer is a highly lethal malignancy with few effective therapies. We performed exome sequencing and copy number analysis to define genomic aberrations in a prospectively accrued clinical cohort (n = 142) of early (stage I and II) sporadic pancreatic ductal adenocarcinoma. Detailed analysis of 99 informative tumours identified substantial heterogeneity with 2,016 non-silent mutations and 1,628 copy-number variations. We define 16 significantly mutated genes, reaffirming known mutations (KRAS, TP53, CDKN2A, SMAD4, MLL3, TGFBR2, ARID1A and SF3B1), and uncover novel mutated genes including additional genes involved in chromatin modification (EPC1 and ARID2), DNA damage repair (ATM) and other mechanisms (ZIM2, MAP2K4, NALCN, SLC16A4 and MAGEA6). Integrative analysis with in vitro functional data and animal models provided supportive evidence for potential roles for these genetic aberrations in carcinogenesis. Pathway-based analysis of recurrently mutated genes recapitulated clustering in core signalling pathways in pancreatic ductal adenocarcinoma, and identified new mutated genes in each pathway. We also identified frequent and diverse somatic aberrations in genes described traditionally as embryonic regulators of axon guidance, particularly SLIT/ROBO signalling, which was also evident in murine Sleeping Beauty transposon-mediated somatic mutagenesis models of pancreatic cancer, providing further supportive evidence for the potential involvement of axon guidance genes in pancreatic carcinogenesis.
Tuskan, Gerald A [ORNL; Yin, Tongming [ORNL; Gunter, Lee E [ORNL; Blaudez, D [UMR, France
We constructed a comprehensive genetic map for Populus and ordered 332 Mb of sequence scaffolds along the 19 haploid chromosomes in order to compare chromosomal regions among diverse members of the genus. These efforts lead us to conclude that chromosome XIX in Populus is evolving into a sex chromosome. Consistent segregation distortion in favor of the sub-genera Tacamahaca alleles provided evidence of divergent selection among species, particularly at the proximal end of chromosome XIX. A large microsatellite marker (SSR) cluster was detected in the distorted region even though the genome-wide distribute SSR sites was uniform across the physical map. The differences between the genetic map and physical sequence data suggested recombination suppression was occurring in the distorted region. A gender-determination locus and an overabundance of NBS-LRR genes were also co-located to the distorted region and were put forth as the cause for divergent selection and recombination suppression. This hypothesis was verified by using fine-scale mapping of an integrated scaffold in the vicinity of the gender-determination locus. As such it appears that chromosome XIX in Populus is in the process of evolving from an autosome into a sex chromosome and that NBS-LRR genes may play important role in the chromosomal diversification process in Populus.
Weigel, K A; Pralle, R S; Adams, H; Cho, K; Do, C; White, H M
Hyperketonemia (HYK), a common early postpartum health disorder characterized by elevated blood concentrations of β-hydroxybutyrate (BHB), affects millions of dairy cows worldwide and leads to significant economic losses and animal welfare concerns. In this study, blood concentrations of BHB were assessed for 1,453 Holstein cows using electronic handheld meters at four time points between 5 and 18 days postpartum. Incidence rates of subclinical (1.2 ≤ maximum BHB ≤ 2.9 mmol/L) and clinical ketosis (maximum BHB ≥ 3.0 mmol/L) were 24.0 and 2.4%, respectively. Variance components, estimated breeding values, and predicted HYK phenotypes were computed on the original, square-root, and binary scales. Heritability estimates for HYK ranged from 0.058 to 0.072 in pedigree-based analyses, as compared to estimates that ranged from 0.071 to 0.093 when pedigrees were augmented with 60,671 single nucleotide polymorphism genotypes of 959 cows and 801 male ancestors. On average, predicted HYK phenotypes from the genome-enhanced analysis ranged from 0.55 mmol/L for first-parity cows in the best contemporary group to 1.40 mmol/L for fourth-parity cows in the worst contemporary group. Genome-enhanced predictions of HYK phenotypes were more closely associated with actual phenotypes than pedigree-based predictions in five-fold cross-validation, and transforming phenotypes to reduce skewness and kurtosis also improved predictive ability. This study demonstrates the feasibility of using repeated cowside measurement of blood BHB concentration in early lactation to construct a reference population that can be used to estimate HYK breeding values for genomic selection programmes and predict HYK phenotypes for genome-guided management decisions. © 2017 Blackwell Verlag GmbH.
Abstract Background Whole genome sequencing has revolutionised the interrogation of mycobacterial genomes. Recent studies have reported conflicting findings on the genomic stability of Mycobacterium tuberculosis during the evolution of drug resistance. In an age where whole genome sequencing is increasingly relied upon for defining the structure of bacterial genomes, it is important to investigate the reliability of next generation sequencing to identify clonal variants present in a minor percentage of the population. This study aimed to define a reliable cut-off for identification of low frequency sequence variants and to subsequently investigate genetic heterogeneity and the evolution of drug resistance in M. tuberculosis. Methods Genomic DNA was isolated from single colonies from 14 rifampicin mono-resistant M. tuberculosis isolates, as well as the primary cultures and follow up MDR cultures from two of these patients. The whole genomes of the M. tuberculosis isolates were sequenced using either the Illumina MiSeq or Illumina HiSeq platforms. Sequences were analysed with an in-house pipeline. Results Using next-generation sequencing in combination with Sanger sequencing and statistical analysis we defined a read frequency cut-off of 30Â % to identify low frequency M. tuberculosis variants with high confidence. Using this cut-off we demonstrated a high rate of genetic diversity between single colonies isolated from one population, showing that by using the current sequencing technology, single colonies are not a true reflection of the genetic diversity within a whole population and vice versa. We further showed that numerous heterogeneous variants emerge and then disappear during the evolution of isoniazid resistance within individual patients. Our findings allowed us to formulate a model for the selective bottleneck which occurs during the course of infection, acting as a genomic purification event. Conclusions Our study demonstrated true levels of genetic
Background Whole genome sequencing has revolutionised the interrogation of mycobacterial genomes. Recent studies have reported conflicting findings on the genomic stability of Mycobacterium tuberculosis during the evolution of drug resistance. In an age where whole genome sequencing is increasingly relied upon for defining the structure of bacterial genomes, it is important to investigate the reliability of next generation sequencing to identify clonal variants present in a minor percentage of the population. This study aimed to define a reliable cut-off for identification of low frequency sequence variants and to subsequently investigate genetic heterogeneity and the evolution of drug resistance in M. tuberculosis. Methods Genomic DNA was isolated from single colonies from 14 rifampicin mono-resistant M. tuberculosis isolates, as well as the primary cultures and follow up MDR cultures from two of these patients. The whole genomes of the M. tuberculosis isolates were sequenced using either the Illumina MiSeq or Illumina HiSeq platforms. Sequences were analysed with an in-house pipeline. Results Using next-generation sequencing in combination with Sanger sequencing and statistical analysis we defined a read frequency cut-off of 30 % to identify low frequency M. tuberculosis variants with high confidence. Using this cut-off we demonstrated a high rate of genetic diversity between single colonies isolated from one population, showing that by using the current sequencing technology, single colonies are not a true reflection of the genetic diversity within a whole population and vice versa. We further showed that numerous heterogeneous variants emerge and then disappear during the evolution of isoniazid resistance within individual patients. Our findings allowed us to formulate a model for the selective bottleneck which occurs during the course of infection, acting as a genomic purification event. Conclusions Our study demonstrated true levels of genetic diversity
Sarai Estrella Sandoval-Azuara
Conclusions: All isolates from humans had spoligotype patterns that matched those observed in the cattle isolates, and all human isolates shared common ancestors with cattle in Baja California based on SNP analysis. This suggests that most human tuberculosis caused by M. bovis in Baja California is derived from M. bovis circulating in Baja California cattle. These results reinforce the importance of bovine tuberculosis surveillance and control in this region.
Lewis, Nathan E; Liu, Xin; Li, Yuxiang
Chinese hamster ovary (CHO) cells, first isolated in 1957, are the preferred production host for many therapeutic proteins. Although genetic heterogeneity among CHO cell lines has been well documented, a systematic, nucleotide-resolution characterization of their genotypic differences has been...... stymied by the lack of a unifying genomic resource for CHO cells. Here we report a 2.4-Gb draft genome sequence of a female Chinese hamster, Cricetulus griseus, harboring 24,044 genes. We also resequenced and analyzed the genomes of six CHO cell lines from the CHO-K1, DG44 and CHO-S lineages....... This analysis identified hamster genes missing in different CHO cell lines, and detected >3.7 million single-nucleotide polymorphisms (SNPs), 551,240 indels and 7,063 copy number variations. Many mutations are located in genes with functions relevant to bioprocessing, such as apoptosis. The details...
Hahn, Christoph; Fromm, Bastian; Bachmann, Lutz
The ectoparasitic Monogenea comprise a major part of the obligate parasitic flatworm diversity. Although genomic adaptations to parasitism have been studied in the endoparasitic tapeworms (Cestoda) and flukes (Trematoda), no representative of the Monogenea has been investigated yet. We present the high-quality draft genome of Gyrodactylus salaris, an economically important monogenean ectoparasite of wild Atlantic salmon (Salmo salar). A total of 15,488 gene models were identified, of which 7,102 were functionally annotated. The controversial phylogenetic relationships within the obligate parasitic Neodermata were resolved in a phylogenomic analysis using 1,719 gene models (alignment length of >500,000 amino acids) for a set of 16 metazoan taxa. The Monogenea were found basal to the Cestoda and Trematoda, which implies ectoparasitism being plesiomorphic within the Neodermata and strongly supports a common origin of complex life cycles. Comparative analysis of seven parasitic flatworm genomes identified shared genomic features for the ecto- and endoparasitic lineages, such as a substantial reduction of the core bilaterian gene complement, including the homeodomain-containing genes, and a loss of the piwi and vasa genes, which are considered essential for animal development. Furthermore, the shared loss of functional fatty acid biosynthesis pathways and the absence of peroxisomes, the latter organelles presumed ubiquitous in eukaryotes except for parasitic protozoans, were inferred. The draft genome of G. salaris opens for future in-depth analyses of pathogenicity and host specificity of poorly characterized G. salaris strains, and will enhance studies addressing the genomics of host–parasite interactions and speciation in the highly diverse monogenean flatworms. PMID:24732282
Fulton, R W; Neill, J D; Saliki, J T; Landis, C; Burge, L J; Payton, M E
This study investigated the genetic and antigenic characterization of parainfluenza-3 virus (PI3V) of cattle. Using molecular tests including real time PCR and viral genome sequencing, PI3V strains could be separated into PI3V types, including PI3V A, PI3V B, and PI3V C. Isolates from cattle with bovine respiratory disease clinical signs and commercial vaccines in the U.S. with MLV PI3V were typed using these molecular tests. All the MLV vaccine strains tested were PI3V A. In most cases PI3V field strains from calves receiving MLV vaccines were types heterologous to the vaccine type A. Also antigenic differences were noted as PI3V C strains had lower antibody levels than PI3V A in serums from cattle receiving MLV PI3V A vaccines. This study further demonstrates there is genetic variability of U.S. PI3V strains and also antigenic variability. In addition, isolates from cattle with BRD signs and receiving MLV vaccines may have heterologous types to the vaccines, and molecular tests should be performed to differentiate field from vaccine strains. Potentially the efficacy of current PI3V A vaccines should be evaluated with other types such a PI3V B and PI3V C. Copyright © 2017 Elsevier B.V. All rights reserved.
Full Text Available Bovine tuberculosis (bTB is a chronic disease of cattle caused by Mycobacterium bovis, a member of the Mycobacterium tuberculosis complex group of bacteria. Vaccination of cattle might offer a long-term solution for controlling the disease and priority has been given to the development of a cattle vaccine against bTB. Identification of biomarkers in tuberculosis research remains elusive and the goal is to identify host correlates of protection. We hypothesized that by studying global gene expression we could identify in vitro predictors of protection that could help to facilitate vaccine development. Calves were vaccinated with BCG or with a heterologous BCG prime adenovirally vectored subunit boosting protocol. Protective efficacy was determined after M. bovis challenge. RNA was prepared from PPD-stimulated PBMC prepared from vaccinated-protected, vaccinated-unprotected and unvaccinated control cattle prior to M. bovis challenge and global gene expression determined by RNA-seq. 668 genes were differentially expressed in vaccinated-protected cattle compared with vaccinated-unprotected and unvaccinated control cattle. Cytokine-cytokine receptor interaction was the most significant pathway related to this dataset with IL-22 expression identified as the dominant surrogate of protection besides INF-γ. Finally, the expression of these candidate genes identified by RNA-seq was evaluated by RT-qPCR in an independent set of PBMC samples from BCG vaccinated and unvaccinated calves. This experiment confirmed the importance of IL-22 as predictor of vaccine efficacy.
Ma, Peipei; Lund, Mogens Sandø; Nielsen, Ulrik Sander
population. In simulated data, there was no bias when the test animals were unselected cows. When the G matrix was derived from genotypes of causal genes, the bias was reduced. The results suggest that the main reasons for causing the bias of the prediction trends are the selection of bulls and bull dams......This study investigated the phenomenon of bias in the trend of genomic predictions and attempted to find the reason and solution for this bias. The data used in this study include Danish Jersey data and simulation data. In Jersey data, the bias was reduced when cows were included in the reference...
Ma, Peipei; Lund, Mogens Sandø; Nielsen, Ulrik Sander
This study investigated the phenomenon of bias in the trend of genomic predictions and attempted to find the reason and solution for this bias. The data used in this study include Danish Jersey data and simulation data. In Jersey data, the bias was reduced when cows were included in the reference...... population. In simulated data, there was no bias when the test animals were unselected cows. When the G matrix was derived from genotypes of causal genes, the bias was reduced. The results suggest that the main reasons for causing the bias of the prediction trends are the selection of bulls and bull dams...
Dutilh, Bas E; Thompson, Cristiane C; Vicente, Ana C P; Marin, Michel A; Lee, Clarence; Silva, Genivaldo G Z; Schmieder, Robert; Andrade, Bruno G N; Chimetto, Luciane; Cuevas, Daniel; Garza, Daniel R; Okeke, Iruka N; Aboderin, Aaron Oladipo; Spangler, Jessica; Ross, Tristen; Dinsdale, Elizabeth A; Thompson, Fabiano L; Harkins, Timothy T; Edwards, Robert A
BACKGROUND: Vibrio cholerae is a globally dispersed pathogen that has evolved with humans for centuries, but also includes non-pathogenic environmental strains. Here, we identify the genomic variability underlying this remarkable persistence across the three major niche dimensions space, time, and
Vu, G.T.H.; Schmutzer, T.; Bull, F.; Cao, H.X.; Fuchs, J.; Tran, T.D.; Jovtchev, G.; Pistrick, K.; Stein, N.; Pečinka, A.; Neumann, Pavel; Novák, Petr; Macas, Jiří; Dear, P.H.; Blattner, F.R.; Scholz, U.; Schubert, I.
Roč. 8, č. 3 (2015) ISSN 1940-3372 R&D Projects: GA ČR GBP501/12/G090 Institutional support: RVO:60077344 Keywords : Genlisea * genome * repetitive sequences Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 3.509, year: 2015
Full Text Available Neuroglobin (Ngb is a hexacoordinated globin expressed mainly in the central and peripheral nervous system of vertebrates. Although several hypotheses have been put forward regarding the role of neuroglobin, its definite function remains uncertain. Ngb appears to have a neuro-protective role enhancing cell viability under hypoxia and other types of oxidative stress. Ngb is phylogenetically ancient and has a substitution rate nearly four times lower than that of other vertebrate globins, e.g. hemoglobin. Despite its high sequence conservation among vertebrates Ngb seems to be elusive in invertebrates.We determined candidate orthologs in invertebrates and identified a globin of the placozoan Trichoplax adhaerens that is most likely orthologous to vertebrate Ngb and confirmed the orthologous relationship of the polymeric globin of the sea urchin Strongylocentrotus purpuratus to Ngb. The putative orthologous globin genes are located next to genes orthologous to vertebrate POMT2 similarly to localization of vertebrate Ngb. The shared syntenic position of the globins from Trichoplax, the sea urchin and of vertebrate Ngb strongly suggests that they are orthologous. A search for conserved transcription factor binding sites (TFBSs in the promoter regions of the Ngb genes of different vertebrates via phylogenetic footprinting revealed several TFBSs, which may contribute to the specific expression of Ngb, whereas a comparative analysis with myoglobin revealed several common TFBSs, suggestive of regulatory mechanisms common to globin genes.Identification of the placozoan and echinoderm genes orthologous to vertebrate neuroglobin strongly supports the hypothesis of the early evolutionary origin of this globin, as it shows that neuroglobin was already present in the placozoan-bilaterian last common ancestor. Computational determination of the transcription factor binding sites repertoire provides on the one hand a set of transcriptional factors that are
Havt, Alexandre; Nayak, Uma; Pinkerton, Relana; Farber, Emily; Concannon, Patrick; Lima, Aldo A.; Guerrant, Richard L.
Despite its population, geographic size, and emerging economic importance, disproportionately little genome-scale research exists into genetic factors that predispose Brazilians to disease, or the population genetics of risk. After identification of suitable proxy populations and careful analysis of tri-continental admixture in 1,538 North-Eastern Brazilians to estimate individual ancestry and ancestral allele frequencies, we computed 400,000 genome-wide locus-specific branch length (LSBL) Fst statistics of Brazilian Amerindian ancestry compared to European and African; and a similar set of differentiation statistics for their Amerindian component compared with the closest Asian 1000 Genomes population (surprisingly, Bengalis in Bangladesh). After ranking SNPs by these statistics, we identified the top 10 highly differentiated SNPs in five genome regions in the LSBL tests of Brazilian Amerindian ancestry compared to European and African; and the top 10 SNPs in eight regions comparing their Amerindian component to the closest Asian 1000 Genomes population. We found SNPs within or proximal to the genes CIITA (rs6498115), SMC6 (rs1834619), and KLHL29 (rs2288697) were most differentiated in the Amerindian-specific branch, while SNPs in the genes ADAMTS9 (rs7631391), DOCK2 (rs77594147), SLC28A1 (rs28649017), ARHGAP5 (rs7151991), and CIITA (rs45601437) were most highly differentiated in the Asian comparison. These genes are known to influence immune function, metabolic and anthropometry traits, and embryonic development. These analyses have identified candidate genes for selection within Amerindian ancestry, and by comparison of the two analyses, those for which the differentiation may have arisen during the migration from Asia to the Americas. PMID:28100790
Noll, Lance W; Worley, Jay N; Yang, Xun; Shridhar, Pragathi B; Ludwig, Justin B; Shi, Xiaorong; Bai, Jianfa; Caragea, Doina; Meng, Jianghong; Nagaraja, T G
Escherichia coli O103, harbored in the hindgut and shed in the feces of cattle, can be enterohemorrhagic (EHEC), enteropathogenic (EPEC), or putative non-pathotype. The genetic diversity particularly that of virulence gene profiles within O103 serogroup is likely to be broad, considering the wide range in severity of illness. However, virulence descriptions of the E. coli O103 strains isolated from cattle feces have been primarily limited to major genes, such as Shiga toxin and intimin genes. Less is known about the frequency at which other virulence genes exist or about genes associated with the mobile genetic elements of E. coli O103 pathotypes. Our objective was to utilize whole genome sequencing (WGS) to identify and compare major and putative virulence genes of EHEC O103 (positive for Shiga toxin gene, stx1, and intimin gene, eae; n = 43), EPEC O103 (negative for stx1 and positive for eae; n = 13) and putative non-pathotype O103 strains (negative for stx and eae; n = 13) isolated from cattle feces. Six strains of EHEC O103 from human clinical cases were also included. All bovine EHEC strains (43/43) and a majority of EPEC (12/13) and putative non-pathotype strains (12/13) were O103:H2 serotype. Both bovine and human EHEC strains had significantly larger average genome sizes (P coli O103 pathotypes shed in cattle feces is likely reflective of the acquisition or loss of virulence genes carried on mobile genetic elements.
Chao Xu; Huan Chen; Mark L. Gleason; Jin-Rong Xu; Huiquan Liu; Rong Zhang; Guangyu Sun
Sooty blotch and flyspeck (SBFS) fungi are unconventional plant pathogens that cause economic losses by blemishing the surface appearance of infected fruit. Here, we introduce the 18.14-Mb genome of Peltaster fructicola, one of the most prevalent SBFS species on apple. This undersized assembly contains only 8,334 predicted protein-coding genes and a very small repertoire of repetitive elements. Phylogenomics and comparative genomics revealed that P. fructicola had undergone a reductive evolut...
Lim, Shu Yong; Yap, Kien-Pong; Thong, Kwai Lin
Listeria monocytogenes is an important foodborne pathogen that causes considerable morbidity in humans with high mortality rates. In this study, we have sequenced the genomes and performed comparative genomics analyses on two strains, LM115 and LM41, isolated from ready-to-eat food in Malaysia. The genome size of LM115 and LM41 was 2,959,041 and 2,963,111 bp, respectively. These two strains shared approximately 90% homologous genes. Comparative genomics and phylogenomic analyses revealed that LM115 and LM41 were more closely related to the reference strains F2365 and EGD-e, respectively. Our virulence profiling indicated a total of 31 virulence genes shared by both analysed strains. These shared genes included those that encode for internalins and L. monocytogenes pathogenicity island 1 (LIPI-1). Both the Malaysian L. monocytogenes strains also harboured several genes associated with stress tolerance to counter the adverse conditions. Seven antibiotic and efflux pump related genes which may confer resistance against lincomycin, erythromycin, fosfomycin, quinolone, tetracycline, and penicillin, and macrolides were identified in the genomes of both strains. Whole genome sequencing and comparative genomics analyses revealed two virulent L. monocytogenes strains isolated from ready-to-eat foods in Malaysia. The identification of strains with pathogenic, persistent, and antibiotic resistant potentials from minimally processed food warrant close attention from both healthcare and food industry.
van der Spek, D; van Arendonk, J A M; Bovenhuis, H
Performing a genome-wide association study (GWAS) might add to a better understanding of the development of claw disorders and the need for trimming. Therefore, the aim of the current study was to perform a GWAS on claw disorders and trimming status and to validate the results for claw disorders based on an independent data set. Data consisted of 20,474 cows with phenotypes for claw disorders and 50,238 cows with phenotypes for trimming status. Recorded claw disorders used in the current study were double sole (DS), interdigital hyperplasia (IH), sole hemorrhage (SH), sole ulcer (SU), white line separation (WLS), a combination of infectious claw disorders consisting of (inter-)digital dermatitis and heel erosion, and a combination of laminitis-related claw disorders (DS, SH, SU, and WLS). Of the cows with phenotypes for claw disorders, 1,771 cows were genotyped and these cow data were used for the GWAS on claw disorders. A SNP was considered significant when the false discovery rate≤0.05 and suggestive when the false discovery rate≤0.20. An independent data set of 185 genotyped bulls having at least 5 daughters with phenotypes (6,824 daughters in total) for claw disorders was used to validate significant and suggestive SNP detected based on the cow data. To analyze the trait "trimming status" (i.e., the need for claw trimming), a data set with 327 genotyped bulls having at least 5 daughters with phenotypes (18,525 daughters in total) was used. Based on the cow data, in total 10 significant and 45 suggestive SNP were detected for claw disorders. The 10 significant SNP were associated with SU, and mainly located on BTA8. The suggestive SNP were associated with DS, IH, SU, and laminitis-related claw disorders. Three of the suggestive SNP were validated in the data set of 185 bulls, and were located on BTA13, BTA14, and BTA17. For infectious claw disorders, SH, and WLS, no significant or suggestive SNP associations were detected. For trimming status, 1 significant
The slick hair coat (SLICK) is a dominantly inherited trait typically associated with tropically adapted cattle that are from Criollo descent through Spanish colonization of cattle into the New World. The trait is of interest relative to climate change, due to its association with improved thermo-t...
Serra, Fabienne; Müller, Joachim; Gray, John; Lüthi, Ramona; Dudas, Sandor; Czub, Stefanie; Seuberlich, Torsten
Three different types of bovine spongiform encephalopathy (BSE) are known and supposedly caused by distinct prion strains: the classical (C-) BSE type that was typically found during the BSE epidemic, and two relatively rare atypical BSE types, termed H-BSE and L-BSE. The three BSE types differ in the molecular phenotype of the disease associated prion protein, namely the N-terminally truncated proteinase K (PK) resistant prion protein fragment (PrP res ). In this study, we report and analyze yet another PrP res type (PrP res-2011 ), which was found in severely autolytic brain samples of two cows in the framework of disease surveillance in Switzerland in 2011. Analysis of brain tissues from these animals by PK titration and PK inhibitor assays ruled out the process of autolysis as the cause for the aberrant PrP res profile. Immunochemical characterization of the PrP fragments present in the 2011 cases by epitope mapping indicated that PrP res-2011 corresponds in its primary sequence to the physiologically occurring PrP-C1 fragment. However, high speed centrifugation, sucrose gradient assay and NaPTA precipitation revealed biochemical similarities between PrP res-2011 and the disease-associated prion protein found in BSE affected cattle in terms of detergent insolubility, PK resistance and PrP aggregation. Although it remains to be established whether PrP res-2011 is associated with a transmissible disease, our results point out the need of further research on the role the PrP-C1 aggregation and misfolding in health and disease. Copyright Â© 2017. Published by Elsevier B.V.
T-lineage acute lymphoblastic leukemia (T-ALL) comprises 15-20% of childhood ALL and has historically been associated with inferior outcome to B-cell ALL (B-ALL). Recent studies have used genome-wide sequencing approaches to identify new subtypes and targets of mutation in B-ALL, but comprehensive sequencing studies of large cohorts of T-ALL have not been performed.
Full Text Available Pigmentation patterns allow for the differentiation of cattle breeds. A dominantly inherited white head is characteristic for animals of the Fleckvieh (FV breed. However, a minority of the FV animals exhibits peculiar pigmentation surrounding the eyes (ambilateral circumocular pigmentation, ACOP. In areas where animals are exposed to increased solar ultraviolet radiation, ACOP is associated with a reduced susceptibility to bovine ocular squamous cell carcinoma (BOSCC, eye cancer. Eye cancer is the most prevalent malignant tumour affecting cattle. Selection for animals with ACOP rapidly reduces the incidence of BOSCC. To identify quantitative trait loci (QTL underlying ACOP, we performed a genome-wide association study using 658,385 single nucleotide polymorphisms (SNPs. The study population consisted of 3579 bulls of the FV breed with a total of 320,186 progeny with phenotypes for ACOP. The proportion of progeny with ACOP was used as a quantitative trait with high heritability (h(2 = 0.79. A variance component based approach to account for population stratification uncovered twelve QTL regions on seven chromosomes. The identified QTL point to MCM6, PAX3, ERBB3, KITLG, LEF1, DKK2, KIT, CRIM1, ATRN, GSDMC, MITF and NBEAL2 as underlying genes for eye area pigmentation in cattle. The twelve QTL regions explain 44.96% of the phenotypic variance of the proportion of daughters with ACOP. The chromosomes harbouring significantly associated SNPs account for 54.13% of the phenotypic variance, while another 19.51% of the phenotypic variance is attributable to chromosomes without identified QTL. Thus, the missing heritability amounts to 7% only. Our results support a polygenic inheritance pattern of ACOP in cattle and provide the basis for efficient genomic selection of animals that are less susceptible to serious eye diseases.
Full Text Available Because it is suspected that gene content may partly explain host adaptation and ecology of pathogenic bacteria, it is important to study factors affecting genome composition and its evolution. While recent genomic advances have revealed extremely large pan-genomes for some bacterial species, it remains difficult to predict to what extent gene pool is accessible within or transferable between populations. As genomes bear imprints of the history of the organisms, gene distribution pattern analyses should provide insights into the forces and factors at play in the shaping and maintaining of bacterial genomes. In this study, we revisited the data obtained from a previous CGH microarrays analysis in order to assess the genomic plasticity of the R. solanacearum species complex. Gene distribution analyses demonstrated the remarkably dispersed genome of R. solanacearum with more than half of the genes being accessory. From the reconstruction of the ancestral genomes compositions, we were able to infer the number of gene gain and loss events along the phylogeny. Analyses of gene movement patterns reveal that factors associated with gene function, genomic localization and ecology delineate gene flow patterns. While the chromosome displayed lower rates of movement, the megaplasmid was clearly associated with hot-spots of gene gain and loss. Gene function was also confirmed to be an essential factor in gene gain and loss dynamics with significant differences in movement patterns between different COG categories. Finally, analyses of gene distribution highlighted possible highways of horizontal gene transfer. Due to sampling and design bias, we can only speculate on factors at play in this gene movement dynamic. Further studies examining precise conditions that favor gene transfer would provide invaluable insights in the fate of bacteria, species delineation and the emergence of successful pathogens.
Loren H. Rieseberg
Full Text Available Few studies have investigated the causes of evolutionary rate variation among plant nuclear genes, especially in recently diverged species still capable of hybridizing in the wild. The recent advent of Next Generation Sequencing (NGS permits investigation of genome wide rates of protein evolution and the role of selection in generating and maintaining divergence. Here, we use individual whole-transcriptome sequencing (RNAseq to refine our understanding of the population genomics of wild species of sunflowers (Helianthus spp. and the factors that affect rates of protein evolution. We aligned 35 GB of transcriptome sequencing data and identified 433,257 polymorphic sites (SNPs in a reference transcriptome comprising 16,312 genes. Using SNP markers, we identified strong population clustering largely corresponding to the three species analyzed here (Helianthus annuus, H. petiolaris, H. debilis, with one distinct early generation hybrid. Then, we calculated the proportions of adaptive substitution fixed by selection (alpha and identified gene ontology categories with elevated values of alpha. The “response to biotic stimulus” category had the highest mean alpha across the three interspecific comparisons, implying that natural selection imposed by other organisms plays an important role in driving protein evolution in wild sunflowers. Finally, we examined the relationship between protein evolution (dN/dS ratio and several genomic factors predicted to co-vary with protein evolution (gene expression level, divergence and specificity, genetic divergence [FST], and nucleotide diversity pi. We find that variation in rates of protein divergence was correlated with gene expression level and specificity, consistent with results from a broad range of taxa and timescales. This would in turn imply that these factors govern protein evolution both at a microevolutionary and macroevolutionary timescale. Our results contribute to a general understanding of the
Man, Tsz-Kwong; Rao, Pulivarthi H; Lu, Xin-Yan; Jaeweon, Kim; Perlaky, Laszlo; Harris, Charles P; Shah, Shishir; Ladanyi, Marc; Gorlick, Richard; Lau, Ching C
Osteosarcoma is a highly malignant bone neoplasm of children and young adults. It is characterized by extremely complex karyotypes and high frequency of chromosomal amplifications. Currently, only the histological response (degree of necrosis) to therapy represent gold standard for predicting the outcome in a patient with non-metastatic osteosarcoma at the time of definitive surgery. Patients with lower degree of necrosis have a higher risk of relapse and poor outcome even after chemotherapy and complete resection of the primary tumor. Therefore, a better understanding of the underlying molecular genetic events leading to tumor initiation and progression could result in the identification of potential diagnostic and therapeutic targets. We used a genome-wide screening method – array based comparative genomic hybridization (array-CGH) to identify DNA copy number changes in 48 patients with osteosarcoma. We applied fluorescence in situ hybridization (FISH) to validate some of amplified clones in this study. Clones showing gains (79%) were more frequent than losses (66%). High-level amplifications and homozygous deletions constitute 28.6% and 3.8% of tumor genome respectively. High-level amplifications were present in 238 clones, of which about 37% of them showed recurrent amplification. Most frequently amplified clones were mapped to 1p36.32 (PRDM16), 6p21.1 (CDC5L, HSPCB, NFKBIE), 8q24, 12q14.3 (IFNG), 16p13 (MGRN1), and 17p11.2 (PMP22 MYCD, SOX1,ELAC27). We validated some of the amplified clones by FISH from 6p12-p21, 8q23-q24, and 17p11.2 amplicons. Homozygous deletions were noted for 32 clones and only 7 clones showed in more than one case. These 7 clones were mapped to 1q25.1 (4 cases), 3p14.1 (4 cases), 13q12.2 (2 cases), 4p15.1 (2 cases), 6q12 (2 cases), 6q12 (2 cases) and 6q16.3 (2 cases). This study clearly demonstrates the utility of array CGH in defining high-resolution DNA copy number changes and refining amplifications. The resolution of array CGH
Full Text Available Subterranean mammals spend their lives in dark, unventilated environments that are rich in carbon dioxide and ammonia and low in oxygen. Many of these animals are also long-lived and exhibit reduced aging-associated diseases, such as neurodegenerative disorders and cancer. We sequenced the genome of the Damaraland mole rat (DMR, Fukomys damarensis and improved the genome assembly of the naked mole rat (NMR, Heterocephalus glaber. Comparative genome analyses, along with the transcriptomes of related subterranean rodents, revealed candidate molecular adaptations for subterranean life and longevity, including a divergent insulin peptide, expression of oxygen-carrying globins in the brain, prevention of high CO2-induced pain perception, and enhanced ammonia detoxification. Juxtaposition of the genomes of DMR and other more conventional animals with the genome of NMR revealed several truly exceptional NMR features: unusual thermogenesis, an aberrant melatonin system, pain insensitivity, and unique processing of 28S rRNA. Together, these genomes and transcriptomes extend our understanding of subterranean adaptations, stress resistance, and longevity.
Pope, Welkin H; Bowman, Charles A; Russell, Daniel A; Jacobs-Sera, Deborah; Asai, David J; Cresawn, Steven G; Jacobs, William R; Hendrix, Roger W; Lawrence, Jeffrey G; Hatfull, Graham F
The bacteriophage population is large, dynamic, ancient, and genetically diverse. Limited genomic information shows that phage genomes are mosaic, and the genetic architecture of phage populations remains ill-defined. To understand the population structure of phages infecting a single host strain, we isolated, sequenced, and compared 627 phages of Mycobacterium smegmatis. Their genetic diversity is considerable, and there are 28 distinct genomic types (clusters) with related nucleotide sequences. However, amino acid sequence comparisons show pervasive genomic mosaicism, and quantification of inter-cluster and intra-cluster relatedness reveals a continuum of genetic diversity, albeit with uneven representation of different phages. Furthermore, rarefaction analysis shows that the mycobacteriophage population is not closed, and there is a constant influx of genes from other sources. Phage isolation and analysis was performed by a large consortium of academic institutions, illustrating the substantial benefits of a disseminated, structured program involving large numbers of freshman undergraduates in scientific discovery.
Lance W Noll
Full Text Available Escherichia coli O103, harbored in the hindgut and shed in the feces of cattle, can be enterohemorrhagic (EHEC, enteropathogenic (EPEC, or putative non-pathotype. The genetic diversity particularly that of virulence gene profiles within O103 serogroup is likely to be broad, considering the wide range in severity of illness. However, virulence descriptions of the E. coli O103 strains isolated from cattle feces have been primarily limited to major genes, such as Shiga toxin and intimin genes. Less is known about the frequency at which other virulence genes exist or about genes associated with the mobile genetic elements of E. coli O103 pathotypes. Our objective was to utilize whole genome sequencing (WGS to identify and compare major and putative virulence genes of EHEC O103 (positive for Shiga toxin gene, stx1, and intimin gene, eae; n = 43, EPEC O103 (negative for stx1 and positive for eae; n = 13 and putative non-pathotype O103 strains (negative for stx and eae; n = 13 isolated from cattle feces. Six strains of EHEC O103 from human clinical cases were also included. All bovine EHEC strains (43/43 and a majority of EPEC (12/13 and putative non-pathotype strains (12/13 were O103:H2 serotype. Both bovine and human EHEC strains had significantly larger average genome sizes (P < 0.0001 and were positive for a higher number of adherence and toxin-based virulence genes and genes on mobile elements (prophages, transposable elements, and plasmids than EPEC or putative non-pathotype strains. The genome size of the three pathotypes positively correlated (R2 = 0.7 with the number of genes carried on mobile genetic elements. Bovine strains clustered phylogenetically by pathotypes, which differed in several key virulence genes. The diversity of E. coli O103 pathotypes shed in cattle feces is likely reflective of the acquisition or loss of virulence genes carried on mobile genetic elements.
Full Text Available In 1996 a common-source outbreak of severe soft tissue and bloodstream infections erupted among Israeli fish farmers and fish consumers due to changes in fish marketing policies. The causative pathogen was a new strain of Vibrio vulnificus, named biotype 3, which displayed a unique biochemical and genotypic profile. Initial observations suggested that the pathogen erupted as a result of genetic recombination between two distinct populations. We applied a whole genome shotgun sequencing approach using several V. vulnificus strains from Israel in order to study the pan genome of V. vulnificus and determine the phylogenetic relationship of biotype 3 with existing populations. The core genome of V. vulnificus based on 16 draft and complete genomes consisted of 3068 genes, representing between 59% and 78% of the whole genome of 16 strains. The accessory genome varied in size from 781 kbp to 2044 kbp. Phylogenetic analysis based on whole, core, and accessory genomes displayed similar clustering patterns with two main clusters, clinical (C and environmental (E, all biotype 3 strains formed a distinct group within the E cluster. Annotation of accessory genomic regions found in biotype 3 strains and absent from the core genome yielded 1732 genes, of which the vast majority encoded hypothetical proteins, phage-related proteins, and mobile element proteins. A total of 1916 proteins (including 713 hypothetical proteins were present in all human pathogenic strains (both biotype 3 and non-biotype 3 and absent from the environmental strains. Clustering analysis of the non-hypothetical proteins revealed 148 protein clusters shared by all human pathogenic strains; these included transcriptional regulators, arylsulfatases, methyl-accepting chemotaxis proteins, acetyltransferases, GGDEF family proteins, transposases, type IV secretory system (T4SS proteins, and integrases. Our study showed that V. vulnificus biotype 3 evolved from environmental populations and
Olsen, Jeanine; Rouzé, Pierre; Verhelst, Bram; Lin, Yao-Cheng; Bayer, Till; Collen, Jonas; Dattolo, Emanuela; De Paoli, Emanuele; Dittami, Simon; Maumus, Florian; Michel, Gurvan; Kersting, Anna; Lauritano, Chiara; Lohaus, Rolf; Töpel, Mats; Tonon, Thierry; Vanneste, Kevin; Amirebrahimi, Mojgan; Brakel, Janina; Boström, Christoffer; Chovatia, Mansi; Grimwood, Jane; Jenkins, Jerry W; Jueterbock, Alexander; Mraz, Amy; Stam, Wytze T; Tice, Hope; Bornberg-Bauer, Erich; Green, Pamela J; Pearson, Gareth A; Procaccini, Gabriele; Duarte, Carlos M; Schmutz, Jeremy; Reusch, Thorsten B H; Van de Peer, Yves
Seagrasses colonized the sea on at least three independent occasions to form the basis of one of the most productive and widespread coastal ecosystems on the planet. Here we report the genome of Zostera marina (L.), the first, to our knowledge, marine angiosperm to be fully sequenced. This reveals
SNP effects estimated in genomic selection programs allow for the prediction of direct genomic values (DGV) both at genome-wide and chromosomal level. As a consequence, genome-wide (G_GW) or chromosomal (G_CHR) correlation matrices between genomic predictions for different traits can be calculated. ...
Kang, Yu; Gu, Chaohao; Yuan, Lina; Wang, Yue; Zhu, Yanmin; Li, Xinna; Luo, Qibin; Xiao, Jingfa; Jiang, Daquan; Qian, Minping; Ahmed Khan, Aftab; Chen, Fei; Zhang, Zhang; Yu, Jun
The prokaryotic pangenome partitions genes into core and dispensable genes. The order of core genes, albeit assumed to be stable under selection in general, is frequently interrupted by horizontal gene transfer and rearrangement, but how a core-gene-defined genome maintains its stability or flexibility remains to be investigated. Based on data from 30 species, including 425 genomes from six phyla, we grouped core genes into syntenic blocks in the context of a pangenome according to their stability across multiple isolates. A subset of the core genes, often species specific and lineage associated, formed a core-gene-defined genome organizational framework (cGOF). Such cGOFs are either single segmental (one-third of the species analyzed) or multisegmental (the rest). Multisegment cGOFs were further classified into symmetric or asymmetric according to segment orientations toward the origin-terminus axis. The cGOFs in Gram-positive species are exclusively symmetric and often reversible in orientation, as opposed to those of the Gram-negative bacteria, which are all asymmetric and irreversible. Meanwhile, all species showing strong strand-biased gene distribution contain symmetric cGOFs and often specific DnaE (α subunit of DNA polymerase III) isoforms. Furthermore, functional evaluations revealed that cGOF genes are hub associated with regard to cellular activities, and the stability of cGOF provides efficient indexes for scaffold orientation as demonstrated by assembling virtual and empirical genome drafts. cGOFs show species specificity, and the symmetry of multisegmental cGOFs is conserved among taxa and constrained by DNA polymerase-centric strand-biased gene distribution. The definition of species-specific cGOFs provides powerful guidance for genome assembly and other structure-based analysis. Prokaryotic genomes are frequently interrupted by horizontal gene transfer (HGT) and rearrangement. To know whether there is a set of genes not only conserved in position
The cattle fever tick, Rhipicephalus (Boophilus) microplus is one of the most significant medical veterinary pests in the world, vectoring several serious livestock diseases negatively impacting agricultural economies of tropical and subtropical countries around the world. We assembled the complete ...
Maeder, Dennis L.; Anderson, Iain; Brettin, Thomas S.; Bruce,David C.; Gilna, Paul; Han, Cliff S.; Lapidus, Alla; Metcalf, William W.; Saunders, Elizabeth; Tapia, Roxanne; Sowers, Kevin R.
We report here a comparative analysis of the genome sequence of Methanosarcina barkeri with those of Methanosarcina acetivorans and Methanosarcina mazei. All three genomes share a conserved double origin of replication and many gene clusters. M. barkeri is distinguished by having an organization that is well conserved with respect to the other Methanosarcinae in the region proximal to the origin of replication with interspecies gene similarities as high as 95%. However it is disordered and marked by increased transposase frequency and decreased gene synteny and gene density in the proximal semi-genome. Of the 3680 open reading frames in M. barkeri, 678 had paralogs with better than 80% similarity to both M. acetivorans and M. mazei while 128 nonhypothetical orfs were unique (non-paralogous) amongst these species including a complete formate dehydrogenase operon, two genes required for N-acetylmuramic acid synthesis, a 14 gene gas vesicle cluster and a bacterial P450-specific ferredoxin reductase cluster not previously observed or characterized in this genus. A cryptic 36 kbp plasmid sequence was detected in M. barkeri that contains an orc1 gene flanked by a presumptive origin of replication consisting of 38 tandem repeats of a 143 nt motif. Three-way comparison of these genomes reveals differing mechanisms for the accrual of changes. Elongation of the large M. acetivorans is the result of multiple gene-scale insertions and duplications uniformly distributed in that genome, while M. barkeri is characterized by localized inversions associated with the loss of gene content. In contrast, the relatively short M. mazei most closely approximates the ancestral organizational state.
Hirose, Yusuke; Onuki, Mamiko; Tenjimbayashi, Yuri; Mori, Seiichiro; Ishii, Yoshiyuki; Takeuchi, Takamasa; Tasaka, Nobutaka; Satoh, Toyomi; Morisada, Tohru; Iwata, Takashi; Miyamoto, Shingo; Matsumoto, Koji; Sekizawa, Akihiko; Kukimoto, Iwao
Persistent infection with oncogenic human papillomaviruses (HPVs) causes cervical cancer, accompanied with the accumulation of somatic mutations into the host genome. There are concomitant genetic changes in the HPV genome during viral infection; however, their relevance to cervical carcinogenesis is poorly understood. Here we explored within-host genetic diversity of HPV by performing deep sequencing analyses of viral whole-genome sequences in clinical specimens. The whole genomes of HPV types 16, 52 and 58 were amplified by type-specific PCR from total cellular DNA of cervical exfoliated cells collected from patients with cervical intraepithelial neoplasia (CIN) and invasive cervical cancer (ICC), and were deep-sequenced. After constructing a reference vial genome sequence for each specimen, nucleotide positions showing changes with > 0.5% frequencies compared to the reference sequence were determined for individual samples. In total, 1,052 positions of nucleotide variations were detected in HPV genomes from 151 samples (CIN1, n = 56; CIN2/3, n = 68; ICC, n = 27), with varying numbers per sample. Overall, C-to-T and C-to-A substitutions were the dominant changes observed across all histological grades. While C-to-T transitions were predominantly detected in CIN1, their prevalence was decreased in CIN2/3 and fell below that of C-to-A transversions in ICC. Analysis of the tri-nucleotides context encompassing substituted bases revealed that Tp C pN, a preferred target sequence for cellular APOBEC cytosine deaminases, was a primary site for C-to-T substitutions in the HPV genome. These results strongly imply that the APOBEC proteins are drivers of HPV genome mutation, particularly in CIN1 lesions. IMPORTANCE HPVs exhibit surprisingly high levels of genetic diversity, including a large repertoire of minor genomic variants in each viral genotype. Here, by conducting deep sequencing analyses, we show for the first time a comprehensive snapshot of the "within
Xu, Yinghua; Zhu, Yongzhang; Wang, Yuezhu; Chang, Yung-Fu; Zhang, Ying; Jiang, Xiugao; Zhuang, Xuran; Zhu, Yongqiang; Zhang, Jinlong; Zeng, Lingbing; Yang, Minjun; Li, Shijun; Wang, Shengyue; Ye, Qiang; Xin, Xiaofang; Zhao, Guoping; Zheng, Huajun; Guo, Xiaokui; Wang, Junzhi
Leptospirosis, caused by pathogenic Leptospira spp., has recently been recognized as an emerging infectious disease worldwide. Despite its severity and global importance, knowledge about the molecular pathogenesis and virulence evolution of Leptospira spp. remains limited. Here we sequenced and analyzed 102 isolates representing global sources. A high genomic variability were observed among different Leptospira species, which was attributed to massive gene gain and loss events allowing for adaptation to specific niche conditions and changing host environments. Horizontal gene transfer and gene duplication allowed the stepwise acquisition of virulence factors in pathogenic Leptospira evolved from a recent common ancestor. More importantly, the abundant expansion of specific virulence-related protein families, such as metalloproteases-associated paralogs, were exclusively identified in pathogenic species, reflecting the importance of these protein families in the pathogenesis of leptospirosis. Our observations also indicated that positive selection played a crucial role on this bacteria adaptation to hosts. These novel findings may lead to greater understanding of the global diversity and virulence evolution of Leptospira spp. PMID:26833181
Anna De Grassi
Full Text Available Early detection of cancer-associated genomic instability is crucial, particularly in tumour types in which this instability represents the essential underlying mechanism of tumourigenesis. Currently used methods require the presence of already established neoplastic cells because they only detect clonal mutations. In principle, parallel sequencing of single DNA filaments could reveal the early phases of tumour initiation by detecting low-frequency mutations, provided an adequate depth of coverage and an effective control of the experimental error. We applied ultradeep sequencing to estimate the genomic instability of individuals with hereditary non-polyposis colorectal cancer (HNPCC. To overcome the experimental error, we used an ultraconserved region (UCR of the human genome as an internal control. By comparing the mutability outside and inside the UCR, we observed a tendency of the ultraconserved element to accumulate significantly fewer mutations than the flanking segments in both neoplastic and nonneoplastic HNPCC samples. No difference between the two regions was detectable in cells from healthy donors, indicating that all three HNPCC samples have mutation rates higher than the healthy genome. This is the first, to our knowledge, direct evidence of an intrinsic genomic instability of individuals with heterozygous mutations in mismatch repair genes, and constitutes the proof of principle for the development of a more sensitive molecular assay of genomic instability.
Li, Juan; Chen, Fen; Sugiyama, Hiromu; Blair, David; Lin, Rui-Qing; Zhu, Xing-Quan
In the present study, near-complete mitochondrial (mt) genome sequences for Schistosoma japonicum from different regions in the Philippines and Japan were amplified and sequenced. Comparisons among S. japonicum from the Philippines, Japan, and China revealed a geographically based length difference in mt genomes, but the mt genomic organization and gene arrangement were the same. Sequence differences among samples from the Philippines and all samples from the three endemic areas were 0.57-2.12 and 0.76-3.85 %, respectively. The most variable part of the mt genome was the non-coding region. In the coding portion of the genome, protein-coding genes varied more than rRNA genes and tRNAs. The near-complete mt genome sequences for Philippine specimens were identical in length (14,091 bp) which was 4 bp longer than those of S. japonicum samples from Japan and China. This indel provides a unique genetic marker for S. japonicum samples from the Philippines. Phylogenetic analyses based on the concatenated amino acids of 12 protein-coding genes showed that samples of S. japonicum clustered according to their geographical origins. The identified mitochondrial indel marker will be useful for tracing the source of S. japonicum infection in humans and animals in Southeast Asia.
Full Text Available Predicting phenotype from genotype represents the epitome of biological questions. Comparative genomics of appropriate model organisms holds the promise of making it possible. However, the high heterozygosity of many Eukaryotes currently prohibits assembling their genomes. Here, we report the 376 Mb genome sequence of Papilio glaucus (Pgl, the first sequenced genome from the Papilionidae family. We obtained the genome from a wild-caught specimen using a cost-effective strategy that overcomes the high (2% heterozygosity problem. Comparative analyses suggest the molecular bases of various phenotypic traits, including terpene production in the Papilionidae-specific organ, osmeterium. Comparison of Pgl and Papilio canadensis transcriptomes reveals mutation hotspots (4% genes associated with their divergence: four key circadian clock proteins are enriched in inter-species mutations and likely responsible for the difference in pupal diapause. Finally, the Pgl genome confirms Papilio appalachiensis as a hybrid of Pgl and Pca, but suggests it inherited 3/4 of its genes from Pca.
Cong, Qian; Borek, Dominika; Otwinowski, Zbyszek; Grishin, Nick V
Predicting phenotype from genotype represents the epitome of biological questions. Comparative genomics of appropriate model organisms holds the promise of making it possible. However, the high heterozygosity of many Eukaryotes currently prohibits assembling their genomes. Here, we report the 376 Mb genome sequence of Papilio glaucus (Pgl), the first sequenced genome from the Papilionidae family. We obtained the genome from a wild-caught specimen using a cost-effective strategy that overcomes the high (2%) heterozygosity problem. Comparative analyses suggest the molecular bases of various phenotypic traits, including terpene production in the Papilionidae-specific organ, osmeterium. Comparison of Pgl and Papilio canadensis transcriptomes reveals mutation hotspots (4% genes) associated with their divergence: four key circadian clock proteins are enriched in inter-species mutations and likely responsible for the difference in pupal diapause. Finally, the Pgl genome confirms Papilio appalachiensis as a hybrid of Pgl and Pca, but suggests it inherited 3/4 of its genes from Pca. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Cohrs, Randall J; Lee, Katherine S; Beach, Addilynn; Sanford, Bridget; Baird, Nicholas L; Como, Christina; Graybill, Chiharu; Jones, Dallas; Tekeste, Eden; Ballard, Mitchell; Chen, Xiaomi; Yalacki, David; Frietze, Seth; Jones, Kenneth; Lenac Rovis, Tihana; Jonjić, Stipan; Haas, Jürgen; Gilden, Don
The neurotropic herpesvirus varicella-zoster virus (VZV) establishes a lifelong latent infection in humans following primary infection. The low abundance of VZV nucleic acids in human neurons has hindered an understanding of the mechanisms that regulate viral gene transcription during latency. To overcome this critical barrier, we optimized a targeted capture protocol to enrich VZV DNA and cDNA prior to whole-genome/transcriptome sequence analysis. Since the VZV genome is remarkably stable, it was surprising to detect that VZV32, a VZV laboratory strain with no discernible growth defect in tissue culture, contained a 2,158-bp deletion in open reading frame (ORF) 12. Consequently, ORF 12 and 13 protein expression was abolished and Akt phosphorylation was inhibited. The discovery of the ORF 12 deletion, revealed through targeted genome sequencing analysis, points to the need to authenticate the VZV genome when the virus is propagated in tissue culture. IMPORTANCE Viruses isolated from clinical samples often undergo genetic modifications when cultured in the laboratory. Historically, VZV is among the most genetically stable herpesviruses, a notion supported by more than 60 complete genome sequences from multiple isolates and following multiple in vitro passages. However, application of enrichment protocols to targeted genome sequencing revealed the unexpected deletion of a significant portion of VZV ORF 12 following propagation in cultured human fibroblast cells. While the enrichment protocol did not introduce bias in either the virus genome or transcriptome, the findings indicate the need for authentication of VZV by sequencing when the virus is propagated in tissue culture. Copyright © 2017 American Society for Microbiology.
Adam O Whelan
Full Text Available Gaining a better understanding of the T cell mechanisms underlying natural immunity to bovine tuberculosis would help to identify immune correlates of disease progression and facilitate the rational design of improved vaccine and diagnostic strategies. CD4 T cells play an established central role in immunity to TB, and recent interest has focussed on the potential role of multifunctional CD4 T cells expressing IFN-γ, IL-2 and TNF-α. Until now, it has not been possible to assess the contribution of these multifunctional CD4 T cells in cattle due to the lack of reagents to detect bovine IL-2 (bIL-2. Using recombinant phage display technology, we have identified an antibody that recognises biologically active bIL-2. Using this antibody, we have developed a polychromatic flow cytometric staining panel that has allowed the investigation of multifunctional CD4 T-cells responses in cattle naturally infected with M. bovis. Assessment of the frequency of antigen specific CD4 T cell subsets reveals a dominant IFN-γ(+IL-2(+TNF-α(+ and IFN-γ(+ TNF-α(+ response in naturally infected cattle. These multifunctional CD4 T cells express a CD44(hiCD45RO(+CD62L(lo T-effector memory (T(EM phenotype and display higher cytokine median fluorescence intensities than single cytokine producers, consistent with an enhanced 'quality of response' as reported for multifunctional cells in human and murine systems. Through our development of these novel immunological bovine tools, we provide the first description of multifunctional T(EM cells in cattle. Application of these tools will improve our understanding of protective immunity in bovine TB and allow more direct comparisons of the complex T cell mediated immune responses between murine models, human clinical studies and bovine TB models in the future.
Driscoll, Timothy; Gillespie, Joseph J; Nordberg, Eric K; Azad, Abdu F; Sobral, Bruno W
Eukaryotic genome sequencing projects often yield bacterial DNA sequences, data typically considered as microbial contamination. However, these sequences may also indicate either symbiont genes or lateral gene transfer (LGT) to host genomes. These bacterial sequences can provide clues about eukaryote-microbe interactions. Here, we used the genome of the primitive animal Trichoplax adhaerens (Metazoa: Placozoa), which is known to harbor an uncharacterized Gram-negative endosymbiont, to search for the presence of bacterial DNA sequences. Bioinformatic and phylogenomic analyses of extracted data from the genome assembly (181 bacterial coding sequences [CDS]) and trace read archive (16S rDNA) revealed a dominant proteobacterial profile strongly skewed to Rickettsiales (Alphaproteobacteria) genomes. By way of phylogenetic analysis of 16S rDNA and 113 proteins conserved across proteobacterial genomes, as well as identification of 27 rickettsial signature genes, we propose a Rickettsiales endosymbiont of T. adhaerens (RETA). The majority (93%) of the identified bacterial CDS belongs to small scaffolds containing prokaryotic-like genes; however, 12 CDS were identified on large scaffolds comprised of eukaryotic-like genes, suggesting that T. adhaerens might have recently acquired bacterial genes. These putative LGTs may coincide with the placozoan's aquatic niche and symbiosis with RETA. This work underscores the rich, and relatively untapped, resource of eukaryotic genome projects for harboring data pertinent to host-microbial interactions. The nature of unknown (or poorly characterized) bacterial species may only emerge via analysis of host genome sequencing projects, particularly if these species are resistant to cell culturing, as are many obligate intracellular microbes. Our work provides methodological insight for such an approach.
David H. Lunt
Full Text Available Root knot nematodes (RKN can infect most of the world’s agricultural crop species and are among the most important of all plant pathogens. As yet however we have little understanding of their origins or the genomic basis of their extreme polyphagy. The most damaging pathogens reproduce by obligatory mitotic parthenogenesis and it has been suggested that these species originated from interspecific hybridizations between unknown parental taxa. We have sequenced the genome of the diploid meiotic parthenogen Meloidogyne floridensis, and use a comparative genomic approach to test the hypothesis that this species was involved in the hybrid origin of the tropical mitotic parthenogen Meloidogyne incognita. Phylogenomic analysis of gene families from M. floridensis, M. incognita and an outgroup species Meloidogyne hapla was carried out to trace the evolutionary history of these species’ genomes, and we demonstrate that M. floridensis was one of the parental species in the hybrid origins of M. incognita. Analysis of the M. floridensis genome itself revealed many gene loci present in divergent copies, as they are in M. incognita, indicating that it too had a hybrid origin. The triploid M. incognita is shown to be a complex double-hybrid between M. floridensis and a third, unidentified, parent. The agriculturally important RKN have very complex origins involving the mixing of several parental genomes by hybridization and their extreme polyphagy and success in agricultural environments may be related to this hybridization, producing transgressive variation on which natural selection can act. It is now clear that studying RKN variation via individual marker loci may fail due to the species’ convoluted origins, and multi-species population genomics is essential to understand the hybrid diversity and adaptive variation of this important species complex. This comparative genomic analysis provides a compelling example of the importance and complexity of
Rensing, Stefan A.; Lang, Daniel; Zimmer, Andreas D.; Terry, Astrid; Salamov, Asaf; Shapiro, Harris; Nishiyama, Tomaoki; Perroud, Pierre-Francois; Lindquist, Erika A.; Kamisugi, Yasuko; Tanahashi, Takako; Sakakibara, Keiko; Fujita, Tomomichi; Oishi, Kazuko; Shin, Tadasu; Kuroki, Yoko; Toyoda, Atsushi; Suzuki, Yutaka; Hashimoto, Shin-ichi; Yamaguchi, Kazuo; Sugano, Sumio; Kohara, Yuji; Fujiyama, Asao; Anterola, Aldwin; Aoki, Setsuyuki; Ashton, Neil; Barbazuk, W. Brad; Barker, Elizabeth; Bennetzen, Jeffrey L.; Blankenship, Robert; Cho, Sung Hyun; Dutcher, Susan K.; Estelle, Mark; Fawcett, Jeffrey A.; Gundlach, Heidrum; Hanada, Kousuke; Melkozernov, Alexander; Murata, Takashi; Nelson, David R.; Pils, Birgit; Prigge, Michael; Reiss, Bernd; Renner, Tanya; Rombauts, Stephane; Rushton, Paul J.; Sanderfoot, Anton; Schween, Gabriele; Shiu, Shin-Han; Stueber, Kurt; Theodoulou, Frederica L.; Tu, Hank; Van de Peer, Yves; Verrier, Paul J.; Waters, Elizabeth; Wood, Andrew; Yang, Lixing; Cove, David; Cuming, Andrew C.; Hasebe, Mitsayasu; Lucas, Susan; Mishler, Brent D.; Reski, Ralf; Grigoriev, Igor V.; Quatrano, Rakph S.; Boore, Jeffrey L.
We report the draft genome sequence of the model moss Physcomitrella patens and compare its features with those of flowering plants, from which it is separated by more than 400 million years, and unicellular aquatic algae. This comparison reveals genomic changes concomitant with the evolutionary movement to land, including a general increase in gene family complexity; loss of genes associated with aquatic environments (e.g., flagellar arms); acquisition of genes for tolerating terrestrial stresses (e.g., variation in temperature and water availability); and the development of the auxin and abscisic acid signaling pathways for coordinating multicellular growth and dehydration response. The Physcomitrella genome provides a resource for phylogenetic inferences about gene function and for experimental analysis of plant processes through this plant's unique facility for reverse genetics.
Wen, Yong-Jun; Shi, Xin-Chuan; Wang, Feng-Xue; Wang, Wei; Zhang, Shu-Qin; Li, Guo; Song, Ni; Chen, Li-Zhi; Cheng, Shi-Peng; Wu, Hua
In 2009, a bovine parainfluenza virus (BPIV3), named as NM09, was isolated using MDBK cell culture from the nasal swabs of normal cattle in China. The NM09 isolate was characterized by RT-PCR and nucleotide sequence analysis. Its complete genome was 15,456 nucleotides in length. Similar to other sequenced PIV strains, the NM09 virus consisted of six non-overlapping genes, which were predicted to encode nine proteins with conserved and complementary 3' leader and 5' trailer regions, conserved gene starts, gene stops, and trinucleotide intergenic sequences. Nucleotide phylogenetic analysis of matrix and hemagglutinin-neuraminidase gene demonstrated that this NM09 isolate belonged to BPIV3 genotype A instead of the previously reported BPIV3 genotype C in China. It is implicated that the different genotypes A and C might coexist infection for a long time in China.
Deptula, Paulina; Laine, Pia K; Roberts, Richard J; Smolander, Olli-Pekka; Vihinen, Helena; Piironen, Vieno; Paulin, Lars; Jokitalo, Eija; Savijoki, Kirsi; Auvinen, Petri; Varmanen, Pekka
Propionibacterium freudenreichii is an industrially important bacterium granted the Generally Recognized as Safe (the GRAS) status, due to its long safe use in food bioprocesses. Despite the recognized role in the food industry and in the production of vitamin B12, as well as its documented health-promoting potential, P. freudenreichii remained poorly characterised at the genomic level. At present, only three complete genome sequences are available for the species. We used the PacBio RS II sequencing platform to generate complete genomes of 20 P. freudenreichii strains and compared them in detail. Comparative analyses revealed both sequence conservation and genome organisational diversity among the strains. Assembly from long reads resulted in the discovery of additional circular elements: two putative conjugative plasmids and three active, lysogenic bacteriophages. It also permitted characterisation of the CRISPR-Cas systems. The use of the PacBio sequencing platform allowed identification of DNA modifications, which in turn allowed characterisation of the restriction-modification systems together with their recognition motifs. The observed genomic differences suggested strain variation in surface piliation and specific mucus binding, which were validated by experimental studies. The phenotypic characterisation displayed large diversity between the strains in ability to utilise a range of carbohydrates, to grow at unfavourable conditions and to form a biofilm. The complete genome sequencing allowed detailed characterisation of the industrially important species, P. freudenreichii by facilitating the discovery of previously unknown features. The results presented here lay a solid foundation for future genetic and functional genomic investigations of this actinobacterial species.
Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming
Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10–56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa. PMID:26695430
Full Text Available Dense genetic maps, along with quantitative trait loci (QTLs detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR, expressed sequence tag (EST derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS, and diversity arrays technology (DArT markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus and with the E. grandis genome sequence. Fifty-three QTLs for growth (10-56 months of age and wood density (56 months were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa.
Li, Fagen; Zhou, Changpin; Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming
Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10-56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa.
Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue
Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean.
Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue
Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884
Valenzuela Jesus G
Full Text Available Abstract Background The completion of the Plasmodium falciparum genome represents a milestone in malaria research. The genome sequence allows for the development of genome-wide approaches such as microarray and proteomics that will greatly facilitate our understanding of the parasite biology and accelerate new drug and vaccine development. Designing and application of these genome-wide assays, however, requires accurate information on gene prediction and genome annotation. Unfortunately, the genes in the parasite genome databases were mostly identified using computer software that could make some erroneous predictions. Results We aimed to obtain cDNA sequences to examine the accuracy of gene prediction in silico. We constructed cDNA libraries from mixed blood stages of P. falciparum parasite using the SMART cDNA library construction technique and generated 17332 high-quality expressed sequence tags (EST, including 2198 from primer-walking experiments. Assembly of our sequence tags produced 2548 contigs and 2671 singletons versus 5220 contigs and 5910 singletons when our EST were assembled with EST in public databases. Comparison of all the assembled EST/contigs with predicted CDS and genomic sequences in the PlasmoDB database identified 356 genes with predicted coding sequences fully covered by EST, including 85 genes (23.6% with introns incorrectly predicted. Careful automatic software and manual alignments found an additional 308 genes that have introns different from those predicted, with 152 new introns discovered and 182 introns with sizes or locations different from those predicted. Alternative spliced and antisense transcripts were also detected. Matching cDNA to predicted genes also revealed silent chromosomal regions, mostly at subtelomere regions. Conclusion Our data indicated that approximately 24% of the genes in the current databases were predicted incorrectly, although some of these inaccuracies could represent alternatively
Lai, Yiling; Liu, Keke; Zhang, Xinyu; Zhang, Xiaoling; Li, Kuan; Wang, Niuniu; Shu, Chi; Wu, Yunpeng; Wang, Chengshu; Bushley, Kathryn E.; Xiang, Meichun; Liu, Xingzhong
Hirsutella minnesotensis [Ophiocordycipitaceae (Hypocreales, Ascomycota)] is a dominant endoparasitic fungus by using conidia that adhere to and penetrate the secondary stage juveniles of soybean cyst nematode. Its genome was de novo sequenced and compared with five entomopathogenic fungi in the Hypocreales and three nematode-trapping fungi in the Orbiliales (Ascomycota). The genome of H. minnesotensis is 51.4 Mb and encodes 12,702 genes enriched with transposable elements up to 32%. Phylogenomic analysis revealed that H. minnesotensis was diverged from entomopathogenic fungi in Hypocreales. Genome of H. minnesotensis is similar to those of entomopathogenic fungi to have fewer genes encoding lectins for adhesion and glycoside hydrolases for cellulose degradation, but is different from those of nematode-trapping fungi to possess more genes for protein degradation, signal transduction, and secondary metabolism. Those results indicate that H. minnesotensis has evolved different mechanism for nematode endoparasitism compared with nematode-trapping fungi. Transcriptomics analyses for the time-scale parasitism revealed the upregulations of lectins, secreted proteases and the genes for biosynthesis of secondary metabolites that could be putatively involved in host surface adhesion, cuticle degradation, and host manipulation. Genome and transcriptome analyses provided comprehensive understanding of the evolution and lifestyle of nematode endoparasitism. PMID:25359922
Multiple introductions of Zika virus into the United States revealed through genomic epidemiology Nathan D Grubaugh1*, Jason T Ladner2,*, Moritz UG...042 DISTRIBUTION STATEMENT A: Approved for public release; distribution is unlimited. UNCLASSIFIED Zika virus (ZIKV) is currently causing an... virus case data Weekly reports of international travel-associated Zika fever cases in Florida and ZIKV infected cases acquired in Florida were obtained
Campylobacter sputorum is a non-thermotolerant campylobacter that is primarily isolated from food animals such as cattle and sheep. C. sputorum is also infrequently associated with human illness. Based on catalase and urease activity, three biovars are currently recognized within C. sputorum: bv. sp...
Full Text Available Bacterial surveys of the vaginal and bladder human microbiota have revealed an abundance of many similar bacterial taxa. As the bladder was once thought to be sterile, the complex interactions between microbes within the bladder have yet to be characterized. To initiate this process, we have begun sequencing isolates, including the clinically relevant genus Gardnerella. Herein, we present the genomic sequences of four Gardnerella strains isolated from the bladders of women with symptoms of urgency urinary incontinence; these are the first Gardnerella genomes produced from this niche. Congruent to genomic characterization of Gardnerella isolates from the reproductive tract, isolates from the bladder reveal a large pangenome, as well as evidence of high frequency horizontal gene transfer. Prophage gene sequences were found to be abundant amongst the strains isolated from the bladder, as well as amongst publicly available Gardnerella genomes from the vagina and endometrium, motivating an in depth examination of these sequences. Amongst the 39 Gardnerella strains examined here, there were more than 400 annotated prophage gene sequences that we could cluster into 95 homologous groups; 49 of these groups were unique to a single strain. While many of these prophages exhibited no sequence similarity to any lytic phage genome, estimation of the rate of phage acquisition suggests both vertical and horizontal acquisition. Furthermore, bioinformatic evidence indicates that prophage acquisition is ongoing within both vaginal and bladder Gardnerella populations. The abundance of prophage sequences within the strains examined here suggests that phages could play an important role in the species' evolutionary history and in its interactions within the complex communities found in the female urinary and reproductive tracts.
Full Text Available The high risk of metabolic disease traits in Polynesians may be partly explained by elevated prevalence of genetic variants involved in energy metabolism. The genetics of Polynesian populations has been shaped by island hoping migration events which have possibly favoured thrifty genes. The aim of this study was to sequence the mitochondrial genome in a group of Maoris in an effort to characterise genome variation in this Polynesian population for use in future disease association studies. We sequenced the complete mitochondrial genomes of 20 non-admixed Maori subjects using Affymetrix technology. DNA diversity analyses showed the Maori group exhibited reduced mitochondrial genome diversity compared to other worldwide populations, which is consistent with historical bottleneck and founder effects. Global phylogenetic analysis positioned these Maori subjects specifically within mitochondrial haplogroup--B4a1a1. Interestingly, we identified several novel variants that collectively form new and unique Maori motifs--B4a1a1c, B4a1a1a3 and B4a1a1a5. Compared to ancestral populations we observed an increased frequency of non-synonymous coding variants of several mitochondrial genes in the Maori group, which may be a result of positive selection and/or genetic drift effects. In conclusion, this study reports the first complete mitochondrial genome sequence data for a Maori population. Overall, these new data reveal novel mitochondrial genome signatures in this Polynesian population and enhance the phylogenetic picture of maternal ancestry in Oceania. The increased frequency of several mitochondrial coding variants makes them good candidates for future studies aimed at assessment of metabolic disease risk in Polynesian populations.
Yang, Tianquan; Dong, Xue; Li, De-Zhu
Investigations of genomic DNA methylation in seeds have been restricted to a few model plants. The endosperm genomic DNA hypomethylation has been identified in angiosperm, but it is difficult to dissect the mechanism of how this hypomethylation is established and maintained because endosperm is ephemeral and disappears with seed development in most dicots. Castor bean (Ricinus communis), unlike Arabidopsis (Arabidopsis thaliana), endosperm is persistent throughout seed development, providing an excellent model in which to dissect the mechanism of endosperm genomic hypomethylation in dicots. We characterized the DNA methylation-related genes encoding DNA methyltransferases and demethylases and analyzed their expression profiles in different tissues. We examined genomic methylation including CG, CHG, and CHH contexts in endosperm and embryo tissues using bisulfite sequencing and revealed that the CHH methylation extent in endosperm and embryo was, unexpectedly, substantially higher than in previously studied plants, irrespective of the CHH percentage in their genomes. In particular, we found that the endosperm exhibited a global reduction in CG and CHG methylation extents relative to the embryo, markedly switching global gene expression. However, CHH methylation occurring in endosperm did not exhibit a significant reduction. Combining with the expression of 24-nucleotide small interfering RNAs (siRNAs) mapped within transposable element (TE) regions and genes involved in the RNA-directed DNA methylation pathway, we demonstrate that the 24-nucleotide siRNAs played a critical role in maintaining CHH methylation and repressing the activation of TEs in persistent endosperm development. This study discovered a novel genomic DNA methylation pattern and proposes the potential mechanism occurring in dicot seeds with persistent endosperm. PMID:27208275
Schäfers, Christian; Blank, Saskia; Wiebusch, Sigrid; Elleuche, Skander; Antranikian, Garabed
Thermus brockianus strain GE-1 is a thermophilic, Gram-negative, rod-shaped and non-motile bacterium that was isolated from the Geysir geothermal area, Iceland. Like other thermophiles, Thermus species are often used as model organisms to understand the mechanism of action of extremozymes, especially focusing on their heat-activity and thermostability. Genome-specific features of T. brockianus GE-1 and their properties further help to explain processes of the adaption of extremophiles at elevated temperatures. Here we analyze the first whole genome sequence of T. brockianus strain GE-1. Insights of the genome sequence and the methodologies that were applied during de novo assembly and annotation are given in detail. The finished genome shows a phred quality value of QV50. The complete genome size is 2.38 Mb, comprising the chromosome (2,035,182 bp), the megaplasmid pTB1 (342,792 bp) and the smaller plasmid pTB2 (10,299 bp). Gene prediction revealed 2,511 genes in total, including 2,458 protein-encoding genes, 53 RNA and 66 pseudo genes. A unique genomic region on megaplasmid pTB1 was identified encoding key enzymes for xylan depolymerization and xylose metabolism. This is in agreement with the growth experiments in which xylan is utilized as sole source of carbon. Accordingly, we identified sequences encoding the xylanase Xyn10, an endoglucanase, the membrane ABC sugar transporter XylH, the xylose-binding protein XylF, the xylose isomerase XylA catalyzing the first step of xylose metabolism and the xylulokinase XylB, responsible for the second step of xylose metabolism. Our data indicate that an ancestor of T. brockianus obtained the ability to use xylose as alternative carbon source by horizontal gene transfer.
Aerophobetes (or CD12) is a recently defined bacterial phylum, of which the metabolic processes and ecological importance remain unclear. In the present study, we obtained the draft genome of an Aerophobetes bacterium TCS1 from saline sediment near the Thuwal cold seep in the Red Sea using a genome binning method. Analysis of 16S rRNA genes of TCS1 and close relatives revealed wide distribution of Aerophobetes in deep-sea sediments. Phylogenetic relationships showed affinity between Aerophobetes TCS1 and some thermophilic bacterial phyla. The genome of TCS1 (at least 1.27 Mbp) contains a full set of genes encoding core metabolic pathways, including glycolysis and pyruvate fermentation to produce acetyl-CoA and acetate. The identification of cross-membrane sugar transporter genes further indicates its potential ability to consume carbohydrates preserved in the sediment under the microbial mat. Aerophobetes bacterium TCS1 therefore probably carried out saccharolytic and fermentative metabolism. The genes responsible for autotrophic synthesis of acetyl-CoA via the Wood–Ljungdahl pathway were also found in the genome. Phylogenetic study of the essential genes for the Wood–Ljungdahl pathway implied relative independence of Aerophobetes bacterium from the known acetogens and methanogens. Compared with genomes of acetogenic bacteria, Aerophobetes bacterium TCS1 genome lacks the genes involved in nitrogen metabolism, sulfur metabolism, signal transduction and cell motility. The metabolic activities of TCS1 might depend on geochemical conditions such as supplies of CO2, hydrogen and sugars, and therefore the TCS1 might be a facultative bacterium in anaerobic saline sediments near cold seeps. © 2016, Science China Press and Springer-Verlag Berlin Heidelberg.
Gopala Krishnan S
Full Text Available BACKGROUND: Rice is a major source of human food with a predominantly Asian production base. Domestication involved selection of traits that are desirable for agriculture and to human consumers. Wild relatives of crop plants are a source of useful variation which is of immense value for crop improvement. Australian wild rices have been isolated from the impacts of domestication in Asia and represents a source of novel diversity for global rice improvement. Oryza rufipogon is a perennial wild progenitor of cultivated rice. Oryza meridionalis is a related annual species in Australia. RESULTS: We have examined the sequence of the genomes of AA genome wild rices from Australia that are close relatives of cultivated rice through whole genome re-sequencing. Assembly of the resequencing data to the O. sativa ssp. japonica cv. Nipponbare shows that Australian wild rices possess 2.5 times more single nucleotide polymorphisms than in the Asian wild rice and cultivated O. sativa ssp. indica. Analysis of the genome of domesticated rice reveals regions of low diversity that show very little variation (polymorphism deserts. Both the perennial and annual wild rice from Australia show a high degree of conservation of sequence with that found in cultivated rice in the same 4.58 Mbp region on chromosome 5, which suggests that some of the 'polymorphism deserts' in this and other parts of the rice genome may have originated prior to domestication due to natural selection. CONCLUSIONS: Analysis of genes in the 'polymorphism deserts' indicates that this selection may have been due to biotic or abiotic stress in the environment of early rice relatives. Despite having closely related sequences in these genome regions, the Australian wild populations represent an invaluable source of diversity supporting rice food security.
Riley, Robert; Salamov, Asaf; Otillar, Robert; Fagnan, Kirsten; Boussau, Bastien; Brown, Daren; Henrissat, Bernard; Levasseur, Anthony; Held, Benjamin; Nagy, Laszlo; Floudas, Dimitris; Morin, Emmanuelle; Manning, Gerard; Baker, Scott; Martin, Francis; Blanchette, Robert; Hibbett, David; Grigoriev, Igor V.
Fungi of the phylum Basidiomycota (basidiomycetes), make up some 37percent of the described fungi, and are important in forestry, agriculture, medicine, and bioenergy. This diverse phylum includes symbionts, pathogens, and saprobes including wood decaying fungi. To better understand the diversity of this phylum we compared the genomes of 35 basidiomycete fungi including 6 newly sequenced genomes. The genomes of basidiomycetes span extremes of genome size, gene number, and repeat content. A phylogenetic tree of Basidiomycota was generated using the Phyldog software, which uses all available protein sequence data to simultaneously infer gene and species trees. Analysis of core genes reveals that some 48percent of basidiomycete proteins are unique to the phylum with nearly half of those (22percent) comprising proteins found in only one organism. Phylogenetic patterns of plant biomass-degrading genes suggest a continuum rather than a sharp dichotomy between the white rot and brown rot modes of wood decay among the members of Agaricomycotina subphylum. There is a correlation of the profile of certain gene families to nutritional mode in Agaricomycotina. Based on phylogenetically-informed PCA analysis of such profiles, we predict that that Botryobasidium botryosum and Jaapia argillacea have properties similar to white rot species, although neither has liginolytic class II fungal peroxidases. Furthermore, we find that both fungi exhibit wood decay with white rot-like characteristics in growth assays. Analysis of the rate of discovery of proteins with no or few homologs suggests the high value of continued sequencing of basidiomycete fungi.
Full Text Available Abstract Background Several mutations were present in the genome of Streptococcus pneumoniae linezolid-resistant strains but the role of several of these mutations had not been experimentally tested. To analyze the role of these mutations, we reconstituted resistance by serial whole genome transformation of a novel resistant isolate into two strains with sensitive background. We sequenced the parent mutant and two independent transformants exhibiting similar minimum inhibitory concentration to linezolid. Results Comparative genomic analyses revealed that transformants acquired G2576T transversions in every gene copy of 23S rRNA and that the number of altered copies correlated with the level of linezolid resistance and cross-resistance to florfenicol and chloramphenicol. One of the transformants also acquired a mutation present in the parent mutant leading to the overexpression of an ABC transporter (spr1021. The acquisition of these mutations conferred a fitness cost however, which was further enhanced by the acquisition of a mutation in a RNA methyltransferase implicated in resistance. Interestingly, the fitness of the transformants could be restored in part by the acquisition of altered copies of the L3 and L16 ribosomal proteins and by mutations leading to the overexpression of the spr1887 ABC transporter that were present in the original linezolid-resistant mutant. Conclusions Our results demonstrate the usefulness of whole genome approaches at detecting major determinants of resistance as well as compensatory mutations that alleviate the fitness cost associated with resistance.
Tollis, Marc; Hutchins, Elizabeth D; Stapley, Jessica; Rupp, Shawn M; Eckalbar, Walter L; Maayan, Inbar; Lasku, Eris; Infante, Carlos R; Dennis, Stuart R; Robertson, Joel A; May, Catherine M; Bermingham, Eldredge; DeNardo, Dale F; Hsieh, Shi-Tong Tonia; Kulathinal, Rob J; McMillan, William Owen; Menke, Douglas B; Pratt, Stephen C; Rawls, Jeffery Alan; Sanjur, Oris; Wilson-Rawls, Jeanne; Wilson Sayres, Melissa A; Fisher, Rebecca E
Abstract Squamates include all lizards and snakes, and display some of the most diverse and extreme morphological adaptations among vertebrates. However, compared with birds and mammals, relatively few resources exist for comparative genomic analyses of squamates, hampering efforts to understand the molecular bases of phenotypic diversification in such a speciose clade. In particular, the ∼400 species of anole lizard represent an extensive squamate radiation. Here, we sequence and assemble the draft genomes of three anole species—Anolis frenatus, Anolis auratus, and Anolis apletophallus—for comparison with the available reference genome of Anolis carolinensis. Comparative analyses reveal a rapid background rate of molecular evolution consistent with a model of punctuated equilibrium, and strong purifying selection on functional genomic elements in anoles. We find evidence for accelerated evolution in genes involved in behavior, sensory perception, and reproduction, as well as in genes regulating limb bud development and hindlimb specification. Morphometric analyses of anole fore and hindlimbs corroborated these findings. We detect signatures of positive selection across several genes related to the development and regulation of the forebrain, hormones, and the iguanian lizard dewlap, suggesting molecular changes underlying behavioral adaptations known to reinforce species boundaries were a key component in the diversification of anole lizards. PMID:29360978
Anderson, Iain J; Elkins, James G.; Podar, Mircea; Graham, David E.; Makarova, Kira S.; Wolf, Yuri; Randau, Lennart; Hedlund, Brian P.; Brochier-Armanet, Celine; Kunin, Victor; Anderson, Iain; Lapidus, Alla; Goltsman, Eugene; Barry, Kerrie; Koonin, Eugene V.; Hugenholtz, Phil; Kyrpides, Nikos; Wanner, Gerhard; Richardson, Paul; Keller, Martin; Stetter, Karl O.
The candidate division Korarchaeota comprises a group of uncultivated microorganisms that, by their small subunit rRNA phylogeny, may have diverged early from the major archaeal phyla Crenarchaeota and Euryarchaeota. Here, we report the initial characterization of a member of the Korarchaeota with the proposed name,"Candidatus Korarchaeum cryptofilum," which exhibits an ultrathin filamentous morphology. To investigate possible ancestral relationships between deep-branching Korarchaeota and other phyla, we used whole-genome shotgun sequencing to construct a complete composite korarchaeal genome from enriched cells. The genome was assembled into a single contig 1.59 Mb in length with a G + C content of 49percent. Of the 1,617 predicted protein-coding genes, 1,382 (85percent) could be assigned to a revised set of archaeal Clusters of Orthologous Groups (COGs). The predicted gene functions suggest that the organism relies on a simple mode of peptide fermentation for carbon and energy and lacks the ability to synthesize de novo purines, CoA, and several other cofactors. Phylogenetic analyses based on conserved single genes and concatenated protein sequences positioned the korarchaeote as a deep archaeal lineage with an apparent affinity to the Crenarchaeota. However, the predicted gene content revealed that several conserved cellular systems, such as cell division, DNA replication, and tRNA maturation, resemble the counterparts in the Euryarchaeota. In light of the known composition of archaeal genomes, the Korarchaeota might have retained a set of cellular features that represents the ancestral archaeal form.
Lapidus, Alla; Elkins, James G.; Podar, Mircea; Graham, David E.; Makarova, Kira S.; Wolf, Yuri; Randau, Lennart; Hedlund, Brian P.; Brochier-Armanet, Celine; Kunin, Victor; Anderson, Iain; Lapidus, Alla; Goltsman, Eugene; Barry, Kerrie; Koonin, Eugene V.; Hugenholtz, Phil; Kyrpides, Nikos; Wanner, Gerhard; Richardson, Paul; Keller, Martin; Stetter, Karl O.
The candidate division Korarchaeota comprises a group of uncultivated microorganisms that, by their small subunit rRNA phylogeny, may have diverged early from the major archaeal phyla Crenarchaeota and Euryarchaeota. Here, we report the initial characterization of a member of the Korarchaeota with the proposed name, ?Candidatus Korarchaeum cryptofilum,? which exhibits an ultrathin filamentous morphology. To investigate possible ancestral relationships between deep-branching Korarchaeota and other phyla, we used whole-genome shotgun sequencing to construct a complete composite korarchaeal genome from enriched cells. The genome was assembled into a single contig 1.59 Mb in length with a G + C content of 49percent. Of the 1,617 predicted protein-coding genes, 1,382 (85percent) could be assigned to a revised set of archaeal Clusters of Orthologous Groups (COGs). The predicted gene functions suggest that the organism relies on a simple mode of peptide fermentation for carbon and energy and lacks the ability to synthesize de novo purines, CoA, and several other cofactors. Phylogenetic analyses based on conserved single genes and concatenated protein sequences positioned the korarchaeote as a deep archaeal lineage with an apparent affinity to the Crenarchaeota. However, the predicted gene content revealed that several conserved cellular systems, such as cell division, DNA replication, and tRNA maturation, resemble the counterparts in the Euryarchaeota. In light of the known composition of archaeal genomes, the Korarchaeota might have retained a set of cellular features that represents the ancestral archaeal form.
Bhardwaj, Tulika; Somvanshi, Pallavi
Clostridium botulinum, a formidable pathogen is responsible for the emerging cause of food poisoning cases on the global canvas. The endemicity of bacterium Clostridium botulinum is reflected by the sudden hospital outbreaks and increased resistance towards multiple drugs. Therefore, a combined approach of in-silico comparative genomic analysis with statistical analysis was applied to overcome the limitation of bench-top technologies. Owing to the paucity of genomic data available by the advent of third generation sequencing technologies, several 'omics' technologies were applied to understand the underlying evolutionary pattern and lifestyle of the bacterial pathogen using phylogenomics. The calculation of pan-genome, core genome and singletons provides view of genetic repertoire of the bacterial pathogen lineage at the successive level, orthology shared and specific gene subsets. In addition, assessment of pathogenomic potential, resistome, toxin/antitoxin family in successive pathogenic strains of Clostridium botulinum aids in revealing more specific targets for drug design and development. Copyright © 2017 Elsevier B.V. All rights reserved.
Zheng, Jinshui; Zhao, Xin; Lin, Xiaoxi B; Gänzle, Michael
Lactobacillus reuteri is a dominant member of intestinal microbiota of vertebrates, and occurs in food fermentations. The stable presence of L. reuteri in sourdough provides the opportunity to study the adaptation of vertebrate symbionts to an extra-intestinal habitat. This study evaluated this adaptation by comparative genomics of 16 strains of L. reuteri. A core genome phylogenetic tree grouped L. reuteri into 5 clusters corresponding to the host-adapted lineages. The topology of a gene content tree, which includes accessory genes, differed from the core genome phylogenetic tree, suggesting that the differentiation of L. reuteri is shaped by gene loss or acquisition. About 10% of the core genome (124 core genes) were under positive selection. In lineage III sourdough isolates, 177 genes were under positive selection, mainly related to energy conversion and carbohydrate metabolism. The analysis of the competitiveness of L. reuteri in sourdough revealed that the competitivess of sourdough isolates was equal or higher when compared to rodent isolates. This study provides new insights into the adaptation of L. reuteri to food and intestinal habitats, suggesting that these two habitats exert different selective pressure related to growth rate and energy (carbohydrate) metabolism.
Matsumoto, Akira; Ukai-Tadenuma, Maki; Yamada, Rikuhiro G; Houl, Jerry; Uno, Kenichiro D; Kasukawa, Takeya; Dauwalder, Brigitte; Itoh, Taichi Q; Takahashi, Kuniaki; Ueda, Ryu; Hardin, Paul E; Tanimura, Teiichi; Ueda, Hiroki R
The Drosophila circadian clock consists of integrated autoregulatory feedback loops, making the clock difficult to elucidate without comprehensively identifying the network components in vivo. Previous studies have adopted genome-wide screening for clock-controlled genes using high-density oligonucleotide arrays that identified hundreds of clock-controlled genes. In an attempt to identify the core clock genes among these candidates, we applied genome-wide functional screening using an RNA interference (RNAi) system in vivo. Here we report the identification of novel clock gene candidates including clockwork orange (cwo), a transcriptional repressor belonging to the basic helix-loop-helix ORANGE family. cwo is rhythmically expressed and directly regulated by CLK-CYC through canonical E-box sequences. A genome-wide search for its target genes using the Drosophila genome tiling array revealed that cwo forms its own negative feedback loop and directly suppresses the expression of other clock genes through the E-box sequence. Furthermore, this negative transcriptional feedback loop contributes to sustaining a high-amplitude circadian oscillation in vivo. Based on these results, we propose that the competition between cyclic CLK-CYC activity and the adjustable threshold imposed by CWO keeps E-box-mediated transcription within the controllable range of its activity, thereby rendering a Drosophila circadian clock capable of generating high-amplitude oscillation.
Wragg, David; Mason, Andrew S; Yu, Le; Kuo, Richard; Lawal, Raman A; Desta, Takele Taye; Mwacharo, Joram M; Cho, Chang-Yeon; Kemp, Steve; Burt, David W; Hanotte, Olivier
EAV-HP is an ancient retrovirus pre-dating Gallus speciation, which continues to circulate in modern chicken populations, and led to the emergence of avian leukosis virus subgroup J causing significant economic losses to the poultry industry. We mapped EAV-HP integration sites in Ethiopian village chickens, a Silkie, Taiwan Country chicken, red junglefowl Gallus gallus and several inbred experimental lines using whole-genome sequence data. An average of 75.22 ± 9.52 integration sites per bird were identified, which collectively group into 279 intervals of which 5 % are common to 90 % of the genomes analysed and are suggestive of pre-domestication integration events. More than a third of intervals are specific to individual genomes, supporting active circulation of EAV-HP in modern chickens. Interval density is correlated with chromosome length (P < 2.31(-6)), and 27 % of intervals are located within 5 kb of a transcript. Functional annotation clustering of genes reveals enrichment for immune-related functions (P < 0.05). Our results illustrate a non-random distribution of EAV-HP in the genome, emphasising the importance it may have played in the adaptation of the species, and provide a platform from which to extend investigations on the co-evolutionary significance of endogenous retroviral genera with their hosts.
Full Text Available Nuclei of arbuscular endomycorrhizal fungi have been described as highly diverse due to their asexual nature and absence of a single cell stage with only one nucleus. This has raised fundamental questions concerning speciation, selection and transmission of the genetic make-up to next generations. Although this concept has become textbook knowledge, it is only based on studying a few loci, including 45S rDNA. To provide a more comprehensive insight into the genetic makeup of arbuscular endomycorrhizal fungi, we applied de novo genome sequencing of individual nuclei of Rhizophagus irregularis. This revealed a surprisingly low level of polymorphism between nuclei. In contrast, within a nucleus, the 45S rDNA repeat unit turned out to be highly diverged. This finding demystifies a long-lasting hypothesis on the complex genetic makeup of arbuscular endomycorrhizal fungi. Subsequent genome assembly resulted in the first draft reference genome sequence of an arbuscular endomycorrhizal fungus. Its length is 141 Mbps, representing over 27,000 protein-coding gene models. We used the genomic sequence to reinvestigate the phylogenetic relationships of Rhizophagus irregularis with other fungal phyla. This unambiguously demonstrated that Glomeromycota are more closely related to Mucoromycotina than to its postulated sister Dikarya.
Mtambo, Madundo M. A.; Muhairwa, Amandus P.; Lupindu, Athumani M.; Olsen, John E.
While pathogenic types of Escherichia coli are well characterized, relatively little is known about the commensal E. coli flora. In the current study, antimicrobial resistance in commensal E. coli and distribution of ERIC-PCR genotypes among isolates of such bacteria from cattle and cattle attendants on cattle farms in Tanzania were investigated. Seventeen E. coli genomes representing different ERIC-PCR types of commensal E. coli were sequenced in order to determine their possible importance as a reservoir for both antimicrobial resistance genes and virulence factors. Both human and cattle isolates were highly resistant to tetracycline (40.8% and 33.1%), sulphamethazole-trimethoprim (49.0% and 8.8%) and ampicillin (44.9% and 21.3%). However, higher proportion of resistant E. coli and higher frequency of resistance to more than two antimicrobials was found in isolates from cattle attendants than isolates from cattle. Sixteen out of 66 ERIC-PCR genotypes were shared between the two hosts, and among these ones, seven types contained isolates from cattle and cattle attendants from the same farm, suggesting transfer of strains between hosts. Genome-wide analysis showed that the majority of the sequenced cattle isolates were assigned to phylogroups B1, while human isolates represented phylogroups A, C, D and E. In general, in silico resistome and virulence factor identification did not reveal differences between hosts or phylogroups, except for lpfA and iss found to be cattle and B1 phylogroup specific. The most frequent plasmids replicon genes found in strains from both hosts were of IncF type, which are commonly associated with carriage of antimicrobial and virulence genes. Commensal E. coli from cattle and attendants were found to share same genotypes and to carry antimicrobial resistance and virulence genes associated with both intra and extraintestinal E. coli pathotypes. PMID:27977751
Baroukh, Nadine; Ahituv, Nadav; Chang, Jessie; Shoukry, Malak; Afzal, Veena; Rubin, Edward M.; Pennacchio, Len A.
COUP-TFII is a central nuclear hormone receptor that tightly regulates the expression of numerous target lipid metabolism genes in vertebrates. However, it remains unclear how COUP-TFII itself is transcriptionally controlled since studies with its promoter and upstream region fail to recapitulate the genes liver expression. In an attempt to identify liver enhancers in the vicinity of COUP-TFII, we employed a comparative genomic approach. Initial comparisons between humans and mice of the 3,470kb gene poor region surrounding COUP-TFII revealed 2,023 conserved non-coding elements. To prioritize a subset of these elements for functional studies, we performed further genomic comparisons with the orthologous pufferfish (Fugu rubripes) locus and uncovered two anciently conserved non-coding sequences (CNS) upstream of COUP-TFII (CNS-62kb and CNS-66kb). Testing these two elements using reporter constructs in liver (HepG2) cells revealed that CNS-66kb, but not CNS-62kb, yielded robust in vitro enhancer activity. In addition, an in vivo reporter assay using naked DNA transfer with CNS-66kb linked to luciferase displayed strong reproducible liver expression in adult mice, further supporting its role as a liver enhancer. Together, these studies further support the utility of comparative genomics to uncover gene regulatory sequences based on evolutionary conservation and provide the substrates to better understand the regulation and expression of COUP-TFII.
Teng, Huajing; Zhang, Yaohua; Shi, Chengmin; Mao, Fengbiao; Cai, Wanshi; Lu, Liang; Zhao, Fangqing; Sun, Zhongsheng; Zhang, Jianxu
Murine rodents are excellent models for study of adaptive radiations and speciation. Brown Norway rats (Rattus norvegicus) are successful global colonizers and the contributions of their domesticated laboratory strains to biomedical research are well established. To identify nucleotide-based speciation timing of the rat and genomic information contributing to its colonization capabilities, we analyzed 51 whole-genome sequences of wild-derived Brown Norway rats and their sibling species, R. nitidus, and identified over 20 million genetic variants in the wild Brown Norway rats that were absent in the laboratory strains, which substantially expand the reservoir of rat genetic diversity. We showed that divergence of the rat and its siblings coincided with drastic climatic changes that occurred during the Middle Pleistocene. Further, we revealed that there was a geographically widespread influx of genes between Brown Norway rats and the sibling species following the divergence, resulting in numerous introgressed regions in the genomes of admixed Brown Norway rats. Intriguing, genes related to chemical communications among these introgressed regions appeared to contribute to the population-specific adaptations of the admixed Brown Norway rats. Our data reveals evolutionary history of the Brown Norway rat, and offers new insights into the role of climatic changes in speciation of animals and the effect of interspecies introgression on animal adaptation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Abstract The version of this article published in BMC Genomics 2013, 14: 274, contains 9 unpublished genomes (Botryobasidium botryosum, Gymnopus luxurians, Hypholoma sublateritium, Jaapia argillacea, Hebeloma cylindrosporum, Conidiobolus coronatus, Laccaria amethystina, Paxillus involutus, and P. rubicundulus) downloaded from JGI website. In this correction, we removed these genomes after discussion with editors and data producers whom we should have contacted before downloading these genomes. Removing these data did not alter the principle results and conclusions of our original work. The relevant Figures 1, 2, 3, 4 and 6; and Table 1 have been revised. Additional files 1, 3, 4, and 5 were also revised. We would like to apologize for any confusion or inconvenience this may have caused. Background Fungi produce a variety of carbohydrate activity enzymes (CAZymes) for the degradation of plant polysaccharide materials to facilitate infection and/or gain nutrition. Identifying and comparing CAZymes from fungi with different nutritional modes or infection mechanisms may provide information for better understanding of their life styles and infection models. To date, over hundreds of fungal genomes are publicly available. However, a systematic comparative analysis of fungal CAZymes across the entire fungal kingdom has not been reported. Results In this study, we systemically identified glycoside hydrolases (GHs), polysaccharide lyases (PLs), carbohydrate esterases (CEs), and glycosyltransferases (GTs) as well as carbohydrate-binding modules (CBMs) in the predicted proteomes of 94 representative fungi from Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota. Comparative analysis of these CAZymes that play major roles in plant polysaccharide degradation revealed that fungi exhibit tremendous diversity in the number and variety of CAZymes. Among them, some families of GHs and CEs are the most prevalent CAZymes that are distributed in all of the fungi analyzed
Zhao, Zhongtao; Liu, Huiquan; Wang, Chenfang; Xu, Jin-Rong
The version of this article published in BMC Genomics 2013, 14: 274, contains 9 unpublished genomes (Botryobasidium botryosum, Gymnopus luxurians, Hypholoma sublateritium, Jaapia argillacea, Hebeloma cylindrosporum, Conidiobolus coronatus, Laccaria amethystina, Paxillus involutus, and P. rubicundulus) downloaded from JGI website. In this correction, we removed these genomes after discussion with editors and data producers whom we should have contacted before downloading these genomes. Removing these data did not alter the principle results and conclusions of our original work. The relevant Figures 1, 2, 3, 4 and 6; and Table 1 have been revised. Additional files 1, 3, 4, and 5 were also revised. We would like to apologize for any confusion or inconvenience this may have caused. Fungi produce a variety of carbohydrate activity enzymes (CAZymes) for the degradation of plant polysaccharide materials to facilitate infection and/or gain nutrition. Identifying and comparing CAZymes from fungi with different nutritional modes or infection mechanisms may provide information for better understanding of their life styles and infection models. To date, over hundreds of fungal genomes are publicly available. However, a systematic comparative analysis of fungal CAZymes across the entire fungal kingdom has not been reported. In this study, we systemically identified glycoside hydrolases (GHs), polysaccharide lyases (PLs), carbohydrate esterases (CEs), and glycosyltransferases (GTs) as well as carbohydrate-binding modules (CBMs) in the predicted proteomes of 94 representative fungi from Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota. Comparative analysis of these CAZymes that play major roles in plant polysaccharide degradation revealed that fungi exhibit tremendous diversity in the number and variety of CAZymes. Among them, some families of GHs and CEs are the most prevalent CAZymes that are distributed in all of the fungi analyzed. Importantly, cellulases of some GH
Mai, Duy Minh; Sahana, Goutam; Christiansen, Freddy
Quantitative trait loci for milk production traits in Danish Jersey cattle were mapped by a genome-wide association analysis using a mixed model. The analysis incorporated 1,039 bulls and 33,090 SNP and resulted in 98 detected combinations of QTL and traits on 27 BTA. These QTL comprised 30...... for milk index, 50 for fat index, and 18 for protein index. The evidence presents 33 genome-wide QTL on 14 BTA. Of these, 7 had effects on milk index, 21 on fat index, and 5 on protein index. Among the genome-wide QTL, 26 have been previously reported, 2 on BTA4 and BTA5 were new for milk index, and 5......-like kinase 4. By a chromosome-wide threshold, 65 additional QTL were detected. Many of them are likely to represent QTL. The results are interesting from a breeding perspective and contribute to the search for the genes causing the polymorphisms important for milk production traits....
Full Text Available In the present study, the near complete mitochondrial genome (mitogenome of Junonia iphita (Lepidoptera: Nymphalidae: Nymphalinae was determined to be 14,892 bp. The gene order and orientation are identical to those in other butterfly species. The phylogenetic tree constructed from the whole mitogenomes using the 13 protein coding genes (PCGs defines the genetic relatedness of the two J. iphita species collected from two different regions. All the Junonia species clustered together, and were further subdivided into clade one consisting of J. almana and J. orithya and clade two comprising of the two J. iphita which were collected from Indo and Indochinese subregions separated by river barrier. Comparison between the two J. iphita sequences revealed minor variations and Single Nucleotide Polymorphisms were identified at 51 sites amounting to 0.4% of the entire mitochondrial genome.
Merchant, Sabeeha S
Chlamydomonas reinhardtii is a unicellular green alga whose lineage diverged from land plants over 1 billion years ago. It is a model system for studying chloroplast-based photosynthesis, as well as the structure, assembly, and function of eukaryotic flagella (cilia), which were inherited from the common ancestor of plants and animals, but lost in land plants. We sequenced the 120-megabase nuclear genome of Chlamydomonas and performed comparative phylogenomic analyses, identifying genes encoding uncharacterized proteins that are likely associated with the function and biogenesis of chloroplasts or eukaryotic flagella. Analyses of the Chlamydomonas genome advance our understanding of the ancestral eukaryotic cell, reveal previously unknown genes associated with photosynthetic and flagellar functions, and establish links between ciliopathy and the composition and function of flagella.
Bottacini, Francesca; Milani, Christian; Turroni, Francesca; Sánchez, Borja; Foroni, Elena; Duranti, Sabrina; Serafini, Fausta; Viappiani, Alice; Strati, Francesco; Ferrarini, Alberto; Delledonne, Massimo; Henrissat, Bernard; Coutinho, Pedro; Fitzgerald, Gerald F; Margolles, Abelardo; van Sinderen, Douwe; Ventura, Marco
Bifidobacteria are known as anaerobic/microaerophilic and fermentative microorganisms, which commonly inhabit the gastrointestinal tract of various animals and insects. Analysis of the 2,167,301 bp genome of Bifidobacterium asteroides PRL2011, a strain isolated from the hindgut of Apis mellifera var. ligustica, commonly known as the honey bee, revealed its predicted capability for respiratory metabolism. Conservation of the latter gene clusters in various B. asteroides strains enforces the notion that respiration is a common metabolic feature of this ancient bifidobacterial species, which has been lost in currently known mammal-derived Bifidobacterium species. In fact, phylogenomic based analyses suggested an ancient origin of B. asteroides and indicates it as an ancestor of the genus Bifidobacterium. Furthermore, the B. asteroides PRL2011 genome encodes various enzymes for coping with toxic products that arise as a result of oxygen-mediated respiration.
Sun, Jiajie; Zhang, Bowen; Lan, Xianyong; Zhang, Chunlei; Lei, Chuzhao; Chen, Hong
The posttranscriptional gene regulation mediated by microRNAs (miRNAs) plays an important role in various species. However, to date limited miRNAs have been reported between fat and muscle tissues in beef cattle. In this paper, 412 known and 22 novel miRNAs in backfat as well as 334 known and 10 novel miRNAs in longissimus thoracis were identified in the Chinese Qinchuan beef cattle. Bta-miR-199a-3p, -154c, -320a and -432 were expressed at higher levels in backfat tissue, while bta-miR-1, -133a, -206, and -378 were also significantly enriched in muscle tissue. Functional analysis revealed that fat-enriched miRNAs targeted PRKAA1/2, PPARA and PPARG genes to modulate lipid and fatty acid metabolism, and muscle-enriched miRNAs targeted CSRP3 gene to present function involved in skeletal and muscular system development. The results obtained may help in the design of new selection strategies to improve beef quality.
David A Baltrus
Full Text Available Closely related pathogens may differ dramatically in host range, but the molecular, genetic, and evolutionary basis for these differences remains unclear. In many Gram- negative bacteria, including the phytopathogen Pseudomonas syringae, type III effectors (TTEs are essential for pathogenicity, instrumental in structuring host range, and exhibit wide diversity between strains. To capture the dynamic nature of virulence gene repertoires across P. syringae, we screened 11 diverse strains for novel TTE families and coupled this nearly saturating screen with the sequencing and assembly of 14 phylogenetically diverse isolates from a broad collection of diseased host plants. TTE repertoires vary dramatically in size and content across all P. syringae clades; surprisingly few TTEs are conserved and present in all strains. Those that are likely provide basal requirements for pathogenicity. We demonstrate that functional divergence within one conserved locus, hopM1, leads to dramatic differences in pathogenicity, and we demonstrate that phylogenetics-informed mutagenesis can be used to identify functionally critical residues of TTEs. The dynamism of the TTE repertoire is mirrored by diversity in pathways affecting the synthesis of secreted phytotoxins, highlighting the likely role of both types of virulence factors in determination of host range. We used these 14 draft genome sequences, plus five additional genome sequences previously reported, to identify the core genome for P. syringae and we compared this core to that of two closely related non-pathogenic pseudomonad species. These data revealed the recent acquisition of a 1 Mb megaplasmid by a sub-clade of cucumber pathogens. This megaplasmid encodes a type IV secretion system and a diverse set of unknown proteins, which dramatically increases both the genomic content of these strains and the pan-genome of the species.
Baltrus, David A; Nishimura, Marc T; Romanchuk, Artur; Chang, Jeff H; Mukhtar, M Shahid; Cherkis, Karen; Roach, Jeff; Grant, Sarah R; Jones, Corbin D; Dangl, Jeffery L
Closely related pathogens may differ dramatically in host range, but the molecular, genetic, and evolutionary basis for these differences remains unclear. In many Gram- negative bacteria, including the phytopathogen Pseudomonas syringae, type III effectors (TTEs) are essential for pathogenicity, instrumental in structuring host range, and exhibit wide diversity between strains. To capture the dynamic nature of virulence gene repertoires across P. syringae, we screened 11 diverse strains for novel TTE families and coupled this nearly saturating screen with the sequencing and assembly of 14 phylogenetically diverse isolates from a broad collection of diseased host plants. TTE repertoires vary dramatically in size and content across all P. syringae clades; surprisingly few TTEs are conserved and present in all strains. Those that are likely provide basal requirements for pathogenicity. We demonstrate that functional divergence within one conserved locus, hopM1, leads to dramatic differences in pathogenicity, and we demonstrate that phylogenetics-informed mutagenesis can be used to identify functionally critical residues of TTEs. The dynamism of the TTE repertoire is mirrored by diversity in pathways affecting the synthesis of secreted phytotoxins, highlighting the likely role of both types of virulence factors in determination of host range. We used these 14 draft genome sequences, plus five additional genome sequences previously reported, to identify the core genome for P. syringae and we compared this core to that of two closely related non-pathogenic pseudomonad species. These data revealed the recent acquisition of a 1 Mb megaplasmid by a sub-clade of cucumber pathogens. This megaplasmid encodes a type IV secretion system and a diverse set of unknown proteins, which dramatically increases both the genomic content of these strains and the pan-genome of the species. © 2011 Baltrus et al.
Full Text Available The genus Sulfobacillus is a cohort of mildly thermophilic or thermotolerant acidophiles within the phylum Firmicutes and requires extremely acidic environments and hypersalinity for optimal growth. However, our understanding of them is still preliminary partly because few genome sequences are available. Here, the draft genome of Sulfobacillus thermosulfidooxidans strain ST was deciphered to obtain a comprehensive insight into the genetic content and to understand the cellular mechanisms necessary for its survival. Furthermore, the expressions of key genes related with iron and sulfur oxidation were verified by semi-quantitative RT-PCR analysis. The draft genome sequence of Sulfobacillus thermosulfidooxidans strain ST, which encodes 3225 predicted coding genes on a total length of 3,333,554 bp and a 48.35% G+C, revealed the high degree of heterogeneity with other Sulfobacillus species. The presence of numerous transposases, genomic islands and complete CRISPR/Cas defence systems testifies to its dynamic evolution consistent with the genome heterogeneity. As expected, S. thermosulfidooxidans encodes a suit of conserved enzymes required for the oxidation of inorganic sulfur compounds (ISCs. The model of sulfur oxidation in S. thermosulfidooxidans was proposed, which showed some different characteristics from the sulfur oxidation of Gram-negative A. ferrooxidans. Sulfur oxygenase reductase and heterodisulfide reductase were suggested to play important roles in the sulfur oxidation. Although the iron oxidation ability was observed, some key proteins cannot be identified in S. thermosulfidooxidans. Unexpectedly, a predicted sulfocyanin is proposed to transfer electrons in the iron oxidation. Furthermore, its carbon metabolism is rather flexible, can perform the transformation of pentose through the oxidative and non-oxidative pentose phosphate pathways and has the ability to take up small organic compounds. It encodes a multitude of heavy metal
van Baren, Marijke J; Bachy, Charles; Reistetter, Emily Nahas; Purvine, Samuel O; Grimwood, Jane; Sudek, Sebastian; Yu, Hang; Poirier, Camille; Deerinck, Thomas J; Kuo, Alan; Grigoriev, Igor V; Wong, Chee-Hong; Smith, Richard D; Callister, Stephen J; Wei, Chia-Lin; Schmutz, Jeremy; Worden, Alexandra Z
Prasinophytes are widespread marine green algae that are related to plants. Cellular abundance of the prasinophyte Micromonas has reportedly increased in the Arctic due to climate-induced changes. Thus, studies of these unicellular eukaryotes are important for marine ecology and for understanding Viridiplantae evolution and diversification. We generated evidence-based Micromonas gene models using proteomics and RNA-Seq to improve prasinophyte genomic resources. First, sequences of four chromosomes in the 22 Mb Micromonas pusilla (CCMP1545) genome were finished. Comparison with the finished 21 Mb genome of Micromonas commoda (RCC299; named herein) shows they share ≤8,141 of ~10,000 protein-encoding genes, depending on the analysis method. Unlike RCC299 and other sequenced eukaryotes, CCMP1545 has two abundant repetitive intron types and a high percent (26 %) GC splice donors. Micromonas has more genus-specific protein families (19 %) than other genome sequenced prasinophytes (11 %). Comparative analyses using predicted proteomes from other prasinophytes reveal proteins likely related to scale formation and ancestral photosynthesis. Our studies also indicate that peptidoglycan (PG) biosynthesis enzymes have been lost in multiple independent events in select prasinophytes and plants. However, CCMP1545, polar Micromonas CCMP2099 and prasinophytes from other classes retain the entire PG pathway, like moss and glaucophyte algae. Surprisingly, multiple vascular plants also have the PG pathway, except the Penicillin-Binding Protein, and share a unique bi-domain protein potentially associated with the pathway. Alongside Micromonas experiments using antibiotics that halt bacterial PG biosynthesis, the findings highlight unrecognized phylogenetic complexity in PG-pathway retention and implicate a role in chloroplast structure or division in several extant Viridiplantae lineages. Extensive differences in gene loss and architecture between related prasinophytes underscore
Palkopoulou, Eleftheria; Mallick, Swapan; Skoglund, Pontus; Enk, Jacob; Rohland, Nadin; Li, Heng; Omrak, Ayça; Vartanyan, Sergey; Poinar, Hendrik; Götherström, Anders; Reich, David; Dalén, Love
Summary The processes leading up to species extinctions are typically characterized by prolonged declines in population size and geographic distribution, followed by a phase in which populations are very small and may be subject to intrinsic threats, including loss of genetic diversity and inbreeding . However, whether such genetic factors have had an impact on species prior to their extinction is unclear [2, 3]; examining this would require a detailed reconstruction of a species’ demographic history as well as changes in genome-wide diversity leading up to its extinction. Here, we present high-quality complete genome sequences from two woolly mammoths (Mammuthus primigenius). The first mammoth was sequenced at 17.1-fold coverage, and dates to ~4,300 years before present, constituting one of the last surviving individuals on Wrangel Island. The second mammoth, sequenced at 11.2-fold coverage, was obtained from a ~44,800 year old specimen from the Late Pleistocene population in northeastern Siberia. The demographic trajectories inferred from the two genomes are qualitatively similar and reveal a population bottleneck during the Middle or Early Pleistocene, and a more recent severe decline in the ancestors of the Wrangel mammoth at the end of the last glaciation. A comparison of the two genomes shows that the Wrangel mammoth has a 20% reduction in heterozygosity as well as a 28-fold increase in the fraction of the genome that is comprised of runs of homozygosity. We conclude that the population on Wrangel Island, which was the last surviving woolly mammoth population, was subject to reduced genetic diversity shortly before it became extinct. PMID:25913407
Full Text Available The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57% of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%. Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.
Full Text Available Petroleum pollution is a severe environmental issue. Comprehensively revealing the genetic backgrounds of hydrocarbon-degrading microorganisms contributes to developing effective methods for bioremediation of crude oil-polluted environments. Marine bacterium Achromobacter sp. HZ01 is capable of degrading hydrocarbons and producing biosurfactants. In this study, the draft genome (5.5 Mbp of strain HZ01 has been obtained by Illumina sequencing, containing 5,162 predicted genes. Genome annotation shows that “amino acid metabolism” is the most abundant metabolic pathway. Strain HZ01 is not capable of using some common carbohydrates as the sole carbon sources, which is due to that it contains few genes associated with carbohydrate transport and lacks some important enzymes related to glycometabolism. It contains abundant proteins directly related to petroleum hydrocarbon degradation. AlkB hydroxylase and its homologs were not identified. It harbors a complete enzyme system of terminal oxidation pathway for n-alkane degradation, which may be initiated by cytochrome P450. The enzymes involved in the catechol pathway are relatively complete for the degradation of aromatic compounds. This bacterium lacks several essential enzymes for methane oxidation, and Baeyer-Villiger monooxygenase involved in the subterminal oxidation pathway and cycloalkane degradation was not identified. These results suggest that strain HZ01 degrades n-alkanes via the terminal oxidation pathway, degrades aromatic compounds primarily via the catechol pathway and cannot perform methane oxidation or cycloalkane degradation. Additionally, strain HZ01 possesses abundant genes related to the metabolism of secondary metabolites, including some genes involved in biosurfactant (such as glycolipids and lipopeptides synthesis. The genome analysis also reveals its genetic basis for nitrogen metabolism, antibiotic resistance, regulatory responses to environmental changes, cell motility
Full Text Available Fibrobacter succinogenes is an important member of the rumen microbial community that converts plant biomass into nutrients usable by its host. This bacterium, which is also one of only two cultivated species in its phylum, is an efficient and prolific degrader of cellulose. Specifically, it has a particularly high activity against crystalline cellulose that requires close physical contact with this substrate. However, unlike other known cellulolytic microbes, it does not degrade cellulose using a cellulosome or by producing high extracellular titers of cellulase enzymes. To better understand the biology of F. succinogenes, we sequenced the genome of the type strain S85 to completion. A total of 3,085 open reading frames were predicted from its 3.84 Mbp genome. Analysis of sequences predicted to encode for carbohydrate-degrading enzymes revealed an unusually high number of genes that were classified into 49 different families of glycoside hydrolases, carbohydrate binding modules (CBMs, carbohydrate esterases, and polysaccharide lyases. Of the 31 identified cellulases, none contain CBMs in families 1, 2, and 3, typically associated with crystalline cellulose degradation. Polysaccharide hydrolysis and utilization assays showed that F. succinogenes was able to hydrolyze a number of polysaccharides, but could only utilize the hydrolytic products of cellulose. This suggests that F. succinogenes uses its array of hemicellulose-degrading enzymes to remove hemicelluloses to gain access to cellulose. This is reflected in its genome, as F. succinogenes lacks many of the genes necessary to transport and metabolize the hydrolytic products of non-cellulose polysaccharides. The F. succinogenes genome reveals a bacterium that specializes in cellulose as its sole energy source, and provides insight into a novel strategy for cellulose degradation.
Ma, Peipei; Brøndum, Rasmus Froberg; Qin, Zahng
This study investigated the imputation accuracy of different methods, considering both the minor allele frequency and relatedness between individuals in the reference and test data sets. Two data sets from the combined population of Swedish and Finnish Red Cattle were used to test the influence...... coefficient was lower when the minor allele frequency was lower. The results indicate that Beagle and IMPUTE2 provide the most robust and accurate imputation accuracies, but considering computing time and memory usage, FImpute is another alternative method....
Background: Biotechnological uses of bacteriophage gene products as alternatives to conventional antibiotics will require a thorough understanding of their genomic context. We sequenced and analyzed the genomes of four closely related phages isolated from Clostridium perfringens, an important agricu...
Full Text Available Genomics will improve the efficiency of beef cattle genetic improvement programs through the incorporation of genomic predictions into traditional genetic evaluations. The global dairy cattle breeding industry has been changed considerably in the last year through the implementation of genomic selection. Now proven to work in dairy cattle breeding, the challenge remains for the beef industry to successfully implement this technology. The primary challenge in beef cattle is the required resource population that relates genomic profile to phenotypic performance, which is quite large and its establishment will require collaboration or a significant investment by any one enterprise. Another challenge in beef cattle is the requirement for genomic predictions to function across breeds, which will require denser marker panels. Opportunities to increase genetic progress include increased accuracy of selection, reduced generation interval, increased selection intensity and better utilization of limited recording capacity, such as individual feed intake, along with opportunities to genetically change novel traits. Implementation of a low density panel at the commercial level will allow informative decisions based on genetic potential at all levels of the production chain. This reduced panel will include predictive SNP based on fine QTL mapping efforts, combined with additional SNP to enable imputation of genotypes from a high density SNP panel, when combined with high density genotypes of key ancestors, such as sires. With electronic recording in cattle, a single genotyping event on each animal would provide information throughout the beef production chain, which will create the incentive for genetic change. Genomics will create new opportunities for reproductive technologies such as embryo transfer as elite females will be identified with increased accuracy. Potential changes to the structure of the breeding industry are discussed including changes to
Chapman, Carol; Henry, Matthew; Bishop-Lilly, Kimberly A; Awosika, Joy; Briska, Adam; Ptashkin, Ryan N; Wagner, Trevor; Rajanna, Chythanya; Tsang, Hsinyi; Johnson, Shannon L; Mokashi, Vishwesh P; Chain, Patrick S G; Sozhamannan, Shanmuga
Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haiti cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera and the lack of genomic sequences of such strains from around the world. Here we address these gaps by scanning a global collection of V. cholerae strains as a first step towards understanding the population genetic diversity and epidemic potential of non O1/non-O139 strains. Whole Genome Mapping (Optical Mapping) based bar coding produces a high resolution, ordered restriction map, depicting a complete view of the unique chromosomal architecture of an organism. To assess the genomic diversity of non-O1/non-O139 V. cholerae, we applied a Whole Genome Mapping strategy on a well-defined and geographically and temporally diverse strain collection, the Sakazaki serogroup type strains. Whole Genome Map data on 91 of the 206 serogroup type strains support the hypothesis that V. cholerae has an unprecedented genetic and genomic structural diversity. Interestingly, we discovered chromosomal fusions in two unusual strains that possess a single chromosome instead of the two chromosomes usually found in V. cholerae. We also found pervasive chromosomal rearrangements such as duplications and indels in many strains. The majority of Vibrio genome sequences currently in public databases are unfinished draft sequences. The Whole Genome Mapping approach presented here enables rapid screening of large strain collections to capture genomic complexities that would not have been otherwise revealed by unfinished draft genome sequencing and thus aids in assembling and finishing draft sequences of complex genomes. Furthermore, Whole Genome Mapping allows for prediction of novel V. cholerae non-O1/non-O139 strains that may have the potential to cause future cholera outbreaks.
Selection Signatures (SS) assessed through analysis of genomic data are being widely studied to discover population specific regions selected via artificial or natural selection. Different methodologies have been proposed for these analyses, each having specific limitations as to the age of the sele...
Starkenburg, Shawn R; Kwon, Kyungyoon J; Jha, Ramesh K; McKay, Cedar; Jacobs, Michael; Chertkov, Olga; Twary, Scott; Rocap, Gabrielle; Cattolico, Rose Ann
Microalgae in the genus Nannochloropsis are photosynthetic marine Eustigmatophytes of significant interest to the bioenergy and aquaculture sectors due to their ability to efficiently accumulate biomass and lipids for utilization in renewable transportation fuels, aquaculture feed, and other useful bioproducts. To better understand the genetic complement that drives the metabolic processes of these organisms, we present the assembly and comparative pangenomic analysis of the chloroplast and mitochondrial genomes from Nannochloropsis salina CCMP1776. The chloroplast and mitochondrial genomes of N. salina are 98.4% and 97% identical to their counterparts in Nannochloropsis gaditana. Comparison of the Nannochloropsis pangenome to other algae within and outside of the same phyla revealed regions of significant genetic divergence in key genes that encode proteins needed for regulation of branched chain amino synthesis (acetohydroxyacid synthase), carbon fixation (RuBisCO activase), energy conservation (ATP synthase), protein synthesis and homeostasis (Clp protease, ribosome). Many organellar gene modifications in Nannochloropsis are unique and deviate from conserved orthologs found across the tree of life. Implementation of secondary and tertiary structure prediction was crucial to functionally characterize many proteins and therefore should be implemented in automated annotation pipelines. The exceptional similarity of the N. salina and N. gaditana organellar genomes suggests that N. gaditana be reclassified as a strain of N. salina.
Liu, Mingshan; Liu, Yang; Di, Jiabo; Su, Zhe; Yang, Hong; Jiang, Beihai; Wang, Zaozao; Zhuang, Meng; Bai, Fan; Su, Xiangqian
Colorectal cancer is a heterogeneous group of malignancies with complex molecular subtypes. While colon cancer has been widely investigated, studies on rectal cancer are very limited. Here, we performed multi-region whole-exome sequencing and single-cell whole-genome sequencing to examine the genomic intratumor heterogeneity (ITH) of rectal tumors. We sequenced nine tumor regions and 88 single cells from two rectal cancer patients with tumors of the same molecular classification and characterized their mutation profiles and somatic copy number alterations (SCNAs) at the multi-region and the single-cell levels. A variable extent of genomic heterogeneity was observed between the two patients, and the degree of ITH increased when analyzed on the single-cell level. We found that major SCNAs were early events in cancer development and inherited steadily. Single-cell sequencing revealed mutations and SCNAs which were hidden in bulk sequencing. In summary, we studied the ITH of rectal cancer at regional and single-cell resolution and demonstrated that variable heterogeneity existed in two patients. The mutational scenarios and SCNA profiles of two patients with treatment naïve from the same molecular subtype are quite different. Our results suggest each tumor possesses its own architecture, which may result in different diagnosis, prognosis, and drug responses. Remarkable ITH exists in the two patients we have studied, providing a preliminary impression of ITH in rectal cancer.
Zeng, Xingquan; Long, Hai; Wang, Zhuo; Zhao, Shancen; Tang, Yawei; Huang, Zhiyong; Wang, Yulin; Xu, Qijun; Mao, Likai; Deng, Guangbing; Yao, Xiaoming; Li, Xiangfeng; Bai, Lijun; Yuan, Hongjun; Pan, Zhifen; Liu, Renjian; Chen, Xin; WangMu, QiMei; Chen, Ming; Yu, Lili; Liang, Junjun; DunZhu, DaWa; Zheng, Yuan; Yu, Shuiyang; LuoBu, ZhaXi; Guang, Xuanmin; Li, Jiang; Deng, Cao; Hu, Wushu; Chen, Chunhai; TaBa, XiongNu; Gao, Liyun; Lv, Xiaodan; Abu, Yuval Ben; Fang, Xiaodong; Nevo, Eviatar; Yu, Maoqun; Wang, Jun; Tashi, Nyima
The Tibetan hulless barley (Hordeum vulgare L. var. nudum), also called "Qingke" in Chinese and "Ne" in Tibetan, is the staple food for Tibetans and an important livestock feed in the Tibetan Plateau. The diploid nature and adaptation to diverse environments of the highland give it unique resources for genetic research and crop improvement. Here we produced a 3.89-Gb draft assembly of Tibetan hulless barley with 36,151 predicted protein-coding genes. Comparative analyses revealed the divergence times and synteny between barley and other representative Poaceae genomes. The expansion of the gene family related to stress responses was found in Tibetan hulless barley. Resequencing of 10 barley accessions uncovered high levels of genetic variation in Tibetan wild barley and genetic divergence between Tibetan and non-Tibetan barley genomes. Selective sweep analyses demonstrate adaptive correlations of genes under selection with extensive environmental variables. Our results not only construct a genomic framework for crop improvement but also provide evolutionary insights of highland adaptation of Tibetan hulless barley.
Xu, Chao; Chen, Huan; Gleason, Mark L; Xu, Jin-Rong; Liu, Huiquan; Zhang, Rong; Sun, Guangyu
Sooty blotch and flyspeck (SBFS) fungi are unconventional plant pathogens that cause economic losses by blemishing the surface appearance of infected fruit. Here, we introduce the 18.14-Mb genome of Peltaster fructicola, one of the most prevalent SBFS species on apple. This undersized assembly contains only 8,334 predicted protein-coding genes and a very small repertoire of repetitive elements. Phylogenomics and comparative genomics revealed that P. fructicola had undergone a reductive evolution, during which the numbers of orphan genes and genes involved in plant cell wall degradation, secondary metabolism, and secreted peptidases and effectors were drastically reduced. In contrast, the genes controlling 1,8-dihydroxynaphthalene (DHN)-melanin biosynthesis and appressorium-mediated penetration were retained substantially. Additionally, microscopic examination of the surfaces of infected apple indicated for the first time that P. fructicola can not only dissolve epicuticular waxes but also partially penetrate the cuticle proper. Our findings indicate that genome contraction, characterized mainly by the massive loss of pathogenicity-related genes, has played an important role in the evolution of P. fructicola (and by implication other SBFS species) from a plant-penetrating ancestor to a non-invasive ectophyte, displaying a novel form of trophic interaction between plants and fungi.
Vonk, Freek J.; Casewell, Nicholas R.; Henkel, Christiaan V.; Heimberg, Alysha M.; Jansen, Hans J.; McCleary, Ryan J. R.; Kerkkamp, Harald M. E.; Vos, Rutger A.; Guerreiro, Isabel; Calvete, Juan J.; Wüster, Wolfgang; Woods, Anthony E.; Logan, Jessica M.; Harrison, Robert A.; Castoe, Todd A.; de Koning, A. P. Jason; Pollock, David D.; Yandell, Mark; Calderon, Diego; Renjifo, Camila; Currier, Rachel B.; Salgado, David; Pla, Davinia; Sanz, Libia; Hyder, Asad S.; Ribeiro, José M. C.; Arntzen, Jan W.; van den Thillart, Guido E. E. J. M.; Boetzer, Marten; Pirovano, Walter; Dirks, Ron P.; Spaink, Herman P.; Duboule, Denis; McGlinn, Edwina; Kini, R. Manjunatha; Richardson, Michael K.
Snakes are limbless predators, and many species use venom to help overpower relatively large, agile prey. Snake venoms are complex protein mixtures encoded by several multilocus gene families that function synergistically to cause incapacitation. To examine venom evolution, we sequenced and interrogated the genome of a venomous snake, the king cobra (Ophiophagus hannah), and compared it, together with our unique transcriptome, microRNA, and proteome datasets from this species, with data from other vertebrates. In contrast to the platypus, the only other venomous vertebrate with a sequenced genome, we find that snake toxin genes evolve through several distinct co-option mechanisms and exhibit surprisingly variable levels of gene duplication and directional selection that correlate with their functional importance in prey capture. The enigmatic accessory venom gland shows a very different pattern of toxin gene expression from the main venom gland and seems to have recruited toxin-like lectin genes repeatedly for new nontoxic functions. In addition, tissue-specific microRNA analyses suggested the co-option of core genetic regulatory components of the venom secretory system from a pancreatic origin. Although the king cobra is limbless, we recovered coding sequences for all Hox genes involved in amniote limb development, with the exception of Hoxd12. Our results provide a unique view of the origin and evolution of snake venom and reveal multiple genome-level adaptive responses to natural selection in this complex biological weapon system. More generally, they provide insight into mechanisms of protein evolution under strong selection. PMID:24297900
Olsen, Jeanine L.
Seagrasses colonized the sea1 on at least three independent occasions to form the basis of one of the most productive and widespread coastal ecosystems on the planet2. Here we report the genome of Zostera marina (L.), the first, to our knowledge, marine angiosperm to be fully sequenced. This reveals unique insights into the genomic losses and gains involved in achieving the structural and physiological adaptations required for its marine lifestyle, arguably the most severe habitat shift ever accomplished by flowering plants. Key angiosperm innovations that were lost include the entire repertoire of stomatal genes3, genes involved in the synthesis of terpenoids and ethylene signalling, and genes for ultraviolet protection and phytochromes for far-red sensing. Seagrasses have also regained functions enabling them to adjust to full salinity. Their cell walls contain all of the polysaccharides typical of land plants, but also contain polyanionic, low-methylated pectins and sulfated galactans, a feature shared with the cell walls of all macroalgae4 and that is important for ion homoeostasis, nutrient uptake and O2/CO2 exchange through leaf epidermal cells. The Z. marina genome resource will markedly advance a wide range of functional ecological studies from adaptation of marine ecosystems under climate warming5, 6, to unravelling the mechanisms of osmoregulation under high salinities that may further inform our understanding of the evolution of salt tolerance in crop plants7.
Guo, Ximing; Ford, Susan E
More and more infectious diseases affect marine molluscs. Some diseases have impacted commercial species including MSX and Dermo of the eastern oyster, QPX of hard clams, withering syndrome of abalone and ostreid herpesvirus 1 (OsHV-1) infections of many molluscs. Although the exact transmission mechanisms are not well understood, human activities and associated environmental changes often correlate with increased disease prevalence. For instance, hatcheries and large-scale aquaculture create high host densities, which, along with increasing ocean temperature, might have contributed to OsHV-1 epizootics in scallops and oysters. A key to understanding linkages between the environment and disease is to understand how the environment affects the host immune system. Although we might be tempted to downplay the role of immunity in invertebrates, recent advances in genomics have provided insights into host and parasite genomes and revealed surprisingly sophisticated innate immune systems in molluscs. All major innate immune pathways are found in molluscs with many immune receptors, regulators and effectors expanded. The expanded gene families provide great diversity and complexity in innate immune response, which may be key to mollusc's defence against diverse pathogens in the absence of adaptive immunity. Further advances in host and parasite genomics should improve our understanding of genetic variation in parasite virulence and host disease resistance. © 2016 The Author(s).
Pryszcz, Leszek P; Németh, Tibor; Gácser, Attila; Gabaldón, Toni
The Candida parapsilosis species complex comprises a group of emerging human pathogens of varying virulence. This complex was recently subdivided into three different species: C. parapsilosis sensu stricto, C. metapsilosis, and C. orthopsilosis. Within the latter, at least two clearly distinct subspecies seem to be present among clinical isolates (Type 1 and Type 2). To gain insight into the genomic differences between these subspecies, we undertook the sequencing of a clinical isolate classified as Type 1 and compared it with the available sequence of a Type 2 clinical strain. Unexpectedly, the analysis of the newly sequenced strain revealed a highly heterozygous genome, which we show to be the consequence of a hybridization event between both identified subspecies. This implicitly suggests that C. orthopsilosis is able to mate, a so-far unanswered question. The resulting hybrid shows a chimeric genome that maintains a similar gene dosage from both parental lineages and displays ongoing loss of heterozygosity. Several of the differences found between the gene content in both strains relate to virulent-related families, with the hybrid strain presenting a higher copy number of genes coding for efflux pumps or secreted lipases. Remarkably, two clinical strains isolated from distant geographical locations (Texas and Singapore) are descendants of the same hybrid line, raising the intriguing possibility of a relationship between the hybridization event and the global spread of a virulent clone.
Bertonati, Claudia; Punta, Marco; Fischer, Markus; Yachdav, Guy; Forouhar, Farhad; Zhou, Weihong; Kuzin, Alexander P.; Seetharaman, Jayaraman; Abashidze, Mariam; Ramelot, Theresa A.; Kennedy, Michael A.; Cort, John R.; Belachew, Adam; Hunt, John F.; Tong, Liang; Montelione, Gaetano T.; Rost, Burkhard
Summary We report on several proteins recently solved by structural genomics consortia, in particular by the Northeast Structural Genomics consortium (NESG). The proteins considered in this study differ substantially in their sequences but they share a similar structural core, characterized by a pseudobarrel five-stranded beta sheet. This core corresponds to the PUA domain-like architecture in the SCOP database. By connecting sequence information with structural knowledge, we characterize a new subgroup of these proteins that we propose to be distinctly different from previously described PUA domain-like domains such as PUA proper or ASCH. We refer to these newly defined domains as EVE. Although EVE may have retained the ability of PUA domains to bind RNA, the available experimental and computational data suggests that both the details of its molecular function and its cellular function differ from those of other PUA domain-like domains. This study of EVE and its relatives illustrates how the combination of structure and genomics creates new insights by connecting a cornucopia of structures that map to the same evolutionary potential. Primary sequence information alone would have not been sufficient to reveal these evolutionary links. PMID:19191354
Siragusa Gregory R
Full Text Available Abstract Background Because biotechnological uses of bacteriophage gene products as alternatives to conventional antibiotics will require a thorough understanding of their genomic context, we sequenced and analyzed the genomes of four closely related phages isolated from Clostridium perfringens, an important agricultural and human pathogen. Results Phage whole-genome tetra-nucleotide signatures and proteomic tree topologies correlated closely with host phylogeny. Comparisons of our phage genomes to 26 others revealed three shared COGs; of particular interest within this core genome was an endolysin (PF01520, an N-acetylmuramoyl-L-alanine amidase and a holin (PF04531. Comparative analyses of the evolutionary history and genomic context of these common phage proteins revealed two important results: 1 strongly significant host-specific sequence variation within the endolysin, and 2 a protein domain architecture apparently unique to our phage genomes in which the endolysin is located upstream of its associated holin. Endolysin sequences from our phages were one of two very distinct genotypes distinguished by variability within the putative enzymatically-active domain. The shared or core genome was comprised of genes with multiple sequence types belonging to five pfam families, and genes belonging to 12 pfam families, including the holin genes, which were nearly identical. Conclusions Significant genomic diversity exists even among closely-related bacteriophages. Holins and endolysins represent conserved functions across divergent phage genomes and, as we demonstrate here, endolysins can have significant variability and host-specificity even among closely-related genomes. Endolysins in our phage genomes may be subject to different selective pressures than the rest of the genome. These findings may have important implications for potential biotechnological applications of phage gene products.
Full Text Available Determining mitochondrial genomes is important for elucidating vital activities of seed plants. Mitochondrial genomes are specific to each plant species because of their variable size, complex structures and patterns of gene losses and gains during evolution. This complexity has made research on the soybean mitochondrial genome difficult compared with its nuclear and chloroplast genomes. The present study helps to solve a 30-year mystery regarding the most complex mitochondrial genome structure, showing that pairwise rearrangements among the many large repeats may produce an enriched molecular pool of 760 circles in seed plants. The soybean mitochondrial genome harbors 58 genes of known function in addition to 52 predicted open reading frames of unknown function. The genome contains sequences of multiple identifiable origins, including 6.8 kb and 7.1 kb DNA fragments that have been transferred from the nuclear and chloroplast genomes, respectively, and some horizontal DNA transfers. The soybean mitochondrial genome has lost 16 genes, including nine protein-coding genes and seven tRNA genes; however, it has acquired five chloroplast-derived genes during evolution. Four tRNA genes, common among the three genomes, are derived from the chloroplast. Sizeable DNA transfers to the nucleus, with pericentromeric regions as hotspots, are observed, including DNA transfers of 125.0 kb and 151.6 kb identified unambiguously from the soybean mitochondrial and chloroplast genomes, respectively. The soybean nuclear genome has acquired five genes from its mitochondrial genome. These results provide biological insights into the mitochondrial genome of seed plants, and are especially helpful for deciphering vital activities in soybean.
Kumar, Roshan; Verma, Helianthous; Haider, Shazia; Bajaj, Abhay; Sood, Utkarsh; Ponnusamy, Kalaiarasan; Nagar, Shekhar; Shakarad, Mallikarjun N; Negi, Ram Krishan; Singh, Yogendra; Khurana, J P; Gilbert, Jack A; Lal, Rup
Species belonging to the genus Novosphingobium are found in many different habitats and have been identified as metabolically versatile. Through comparative genomic analysis, we identified habitat-specific genes and regulatory hubs that could determine habitat selection for Novosphingobium spp. Genomes from 27 Novosphingobium strains isolated from diverse habitats such as rhizosphere soil, plant surfaces, heavily contaminated soils, and marine and freshwater environments were analyzed. Genome size and coding potential were widely variable, differing significantly between habitats. Phylogenetic relationships between strains were less likely to describe functional genotype similarity than the habitat from which they were isolated. In this study, strains (19 out of 27) with a recorded habitat of isolation, and at least 3 representative strains per habitat, comprised four ecological groups-rhizosphere, contaminated soil, marine, and freshwater. Sulfur acquisition and metabolism were the only core genomic traits to differ significantly in proportion between these ecological groups; for example, alkane sulfonate ( ssuABCD ) assimilation was found exclusively in all of the rhizospheric isolates. When we examined osmolytic regulation in Novosphingobium spp. through ectoine biosynthesis, which was assumed to be marine habitat specific, we found that it was also present in isolates from contaminated soil, suggesting its relevance beyond the marine system. Novosphingobium strains were also found to harbor a wide variety of mono- and dioxygenases, responsible for the metabolism of several aromatic compounds, suggesting their potential to act as degraders of a variety of xenobiotic compounds. Protein-protein interaction analysis revealed β-barrel outer membrane proteins as habitat-specific hubs in each of the four habitats-freshwater (Saro_1868), marine water (PP1Y_AT17644), rhizosphere (PMI02_00367), and soil (V474_17210). These outer membrane proteins could play a key role
the archaea, the extension of comparative genomic pathway profiling to broader metabolic and homeostasis networks should be useful in revealing characteristics from metagenomic datasets related to adaptations to diverse environments.
Iurlaro, Mario; McInroy, Gordon R; Burgess, Heather E; Dean, Wendy; Raiber, Eun-Ang; Bachman, Martin; Beraldi, Dario; Balasubramanian, Shankar; Reik, Wolf
Genome-wide methylation of cytosine can be modulated in the presence of TET and thymine DNA glycosylase (TDG) enzymes. TET is able to oxidise 5-methylcytosine (5mC) to 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). TDG can excise the oxidative products 5fC and 5caC, initiating base excision repair. These modified bases are stable and detectable in the genome, suggesting that they could have epigenetic functions in their own right. However, functional investigation of the genome-wide distribution of 5fC has been restricted to cell culture-based systems, while its in vivo profile remains unknown. Here, we describe the first analysis of the in vivo genome-wide profile of 5fC across a range of tissues from both wild-type and Tdg-deficient E11.5 mouse embryos. Changes in the formylation profile of cytosine upon depletion of TDG suggest TET/TDG-mediated active demethylation occurs preferentially at intron-exon boundaries and reveals a major role for TDG in shaping 5fC distribution at CpG islands. Moreover, we find that active enhancer regions specifically exhibit high levels of 5fC, resulting in characteristic tissue-diagnostic patterns, which suggest a role in embryonic development. The tissue-specific distribution of 5fC can be regulated by the collective contribution of TET-mediated oxidation and excision by TDG. The in vivo profile of 5fC during embryonic development resembles that of embryonic stem cells, sharing key features including enrichment of 5fC in enhancer and intragenic regions. Additionally, by investigating mouse embryo 5fC profiles in a tissue-specific manner, we identify targeted enrichment at active enhancers involved in tissue development.
Full Text Available Copy number variation (CNV, which is characterized by large-scale losses or gains of DNA fragments, contributes significantly to genetic and phenotypic variation. Assessing CNV across different European cattle populations might reveal genetic changes responsible for phenotypic differences, which have accumulated throughout the domestication history of cattle as consequences of evolutionary forces that act upon them. To explore pattern of CNVs across European cattle, we genotyped 149 individuals, that represent different European regions, using the Illumina Bovine HD Genotyping array. A total of 9,944 autosomal CNVs were identified in 149 samples using a Hidden Markov Model (HMM as employed in PennCNV. Animals originating from several breeds of British Isles, and Balkan and Italian regions, on average, displayed higher abundance of CNV counts than Dutch or Alpine animals. A total of 923 CNV regions (CNVRs were identified by aggregating CNVs overlapping in at least two animals. The hierarchical clustering of CNVRs indicated low differentiation and sharing of high-frequency CNVRs between European cattle populations. Various CNVRs identified in the present study overlapped with olfactory receptor genes and genes related to immune system. In addition, we also detected a CNV overlapping the Kit gene in English longhorn cattle which has previously been associated with color-sidedness. To conclude, we provide a comprehensive overview of CNV distribution in genome of European cattle. Our results indicate an important role of purifying selection and genomic drift in shaping CNV diversity that exists between different European cattle populations.
Rappoport, Nadav; Linial, Michal
Insects belong to a class that accounts for the majority of animals on earth. With over one million identified species, insects display a huge diversity and occupy extreme environments. At present, there are dozens of fully sequenced insect genomes that cover a range of habitats, social behavior and morphologies. In view of such diverse collection of genomes, revealing evolutionary trends and charting functional relationships of proteins remain challenging. We analyzed the relatedness of 17 complete proteomes representative of proteomes from insects including louse, bee, beetle, ants, flies and mosquitoes, as well as an out-group from the crustaceans. The analyzed proteomes mostly represented the orders of Hymenoptera and Diptera. The 287,405 protein sequences from the 18 proteomes were automatically clustered into 20,933 families, including 799 singletons. A comprehensive analysis based on statistical considerations identified the families that were significantly expanded or reduced in any of the studied organisms. Among all the tested species, ants are characterized by an exceptionally high rate of family gain and loss. By assigning annotations to hundreds of species-specific families, the functional diversity among species and between the major clades (Diptera and Hymenoptera) is revealed. We found that many species-specific families are associated with receptor signaling, stress-related functions and proteases. The highest variability among insects associates with the function of transposition and nucleic acids processes (collectively coined TNAP). Specifically, the wasp and ants have an order of magnitude more TNAP families and proteins relative to species that belong to Diptera (mosquitoes and flies). An unsupervised clustering methodology combined with a comparative functional analysis unveiled proteomic signatures in the major clades of winged insects. We propose that the expansion of TNAP families in Hymenoptera potentially contributes to the accelerated
James J Collins
Full Text Available Bioactive peptides (i.e., neuropeptides or peptide hormones represent the largest class of cell-cell signaling molecules in metazoans and are potent regulators of neural and physiological function. In vertebrates, peptide hormones play an integral role in endocrine signaling between the brain and the gonads that controls reproductive development, yet few of these molecules have been shown to influence reproductive development in invertebrates. Here, we define a role for peptide hormones in controlling reproductive physiology of the model flatworm, the planarian Schmidtea mediterranea. Based on our observation that defective neuropeptide processing results in defects in reproductive system development, we employed peptidomic and functional genomic approaches to characterize the planarian peptide hormone complement, identifying 51 prohormone genes and validating 142 peptides biochemically. Comprehensive in situ hybridization analyses of prohormone gene expression revealed the unanticipated complexity of the flatworm nervous system and identified a prohormone specifically expressed in the nervous system of sexually reproducing planarians. We show that this member of the neuropeptide Y superfamily is required for the maintenance of mature reproductive organs and differentiated germ cells in the testes. Additionally, comparative analyses of our biochemically validated prohormones with the genomes of the parasitic flatworms Schistosoma mansoni and Schistosoma japonicum identified new schistosome prohormones and validated half of all predicted peptide-encoding genes in these parasites. These studies describe the peptide hormone complement of a flatworm on a genome-wide scale and reveal a previously uncharacterized role for peptide hormones in flatworm reproduction. Furthermore, they suggest new opportunities for using planarians as free-living models for understanding the reproductive biology of flatworm parasites.
Radhika, G; Aravindakshan, T V; Jinty, S; Ramya, K
The legendary Vechur cattle of Kerala, described as a very short breed, and the crossbred (CB) Sunandini cattle population exhibited great phenotypic variation; hence, the present study attempted to analyze the genetic diversity existing between them. A set of 14 polymorphic microsatellites were chosen from FAO-ISAG panel and amplified from genomic DNA isolated from blood samples of 30 Vechur and 64 unrelated crossbred cattle, using fluorescent labeled primers. Both populations revealed high genetic diversity as evidenced from high observed number of alleles, Polymorphic Information Content and expected heterozygosity. Observed heterozygosity was lesser (0.699) than expected (0.752) in Vechur population which was further supported by positive F IS value of 0.1149, indicating slight level of inbreeding in Vechur population. Overall, F ST value was 0.065, which means genetic differentiation between crossbred and Vechur population was 6.5%, indicating that the crossbred cattle must have differentiated into a definite population that is different from the indigenous Vechur cows. Structure analysis indicated that the two populations showed distinct differences, with two underlying clusters. The present study supports the separation between Taurine and Zebu cattle and throws light onto the genetic diversity and relationship between native Vechur and crossbred cattle populations in Kerala state.
Researchers from British Columbia Cancer Agency used whole genome sequencing to analyze 40 DLBCL cases and 13 cell lines in order to fill in the gaps of the complex landscape of DLBCL genomes. Their analysis, “Mutational and structural analysis of diffuse large B-cell lymphoma using whole genome sequencing,” was published online in Blood on May 22. The authors are Ryan Morin, Marco Marra, and colleagues.
Ehret, Anita; Hochstuhl, David; Gianola, Daniel; Thaller, Georg
Recently, artificial neural networks (ANN) have been proposed as promising machines for marker-based genomic predictions of complex traits in animal and plant breeding. ANN are universal approximators of complex functions, that can capture cryptic relationships between SNPs (single nucleotide polymorphisms) and phenotypic values without the need of explicitly defining a genetic model. This concept is attractive for high-dimensional and noisy data, especially when the genetic architecture of the trait is unknown. However, the properties of ANN for the prediction of future outcomes of genomic selection using real data are not well characterized and, due to high computational costs, using whole-genome marker sets is difficult. We examined different non-linear network architectures, as well as several genomic covariate structures as network inputs in order to assess their ability to predict milk traits in three dairy cattle data sets using large-scale SNP data. For training, a regularized back propagation algorithm was used. The average correlation between the observed and predicted phenotypes in a 20 times 5-fold cross-validation was used to assess predictive ability. A linear network model served as benchmark. Predictive abilities of different ANN models varied markedly, whereas differences between data sets were small. Dimension reduction methods enhanced prediction performance in all data sets, while at the same time computational cost decreased. For the Holstein-Friesian bull data set, an ANN with 10 neurons in the hidden layer achieved a predictive correlation of r=0.47 for milk yield when the entire marker matrix was used. Predictive ability increased when the genomic relationship matrix (r=0.64) was used as input and was best (r=0.67) when principal component scores of the marker genotypes were used. Similar results were found for the other traits in all data sets. Artificial neural networks are powerful machines for non-linear genome-enabled predictions in
Meganathan, P R; Pagan, Heidi J T; McCulloch, Eve S; Stevens, Richard D; Ray, David A
Order Chiroptera is a unique group of mammals whose members have attained self-powered flight as their main mode of locomotion. Much speculation persists regarding bat evolution; however, lack of sufficient molecular data hampers evolutionary and conservation studies. Of ~1200 species, complete mitochondrial genome sequences are available for only eleven. Additional sequences should be generated if we are to resolve many questions concerning these fascinating mammals. Herein, we describe the complete mitochondrial genomes of three bats: Corynorhinus rafinesquii, Lasiurus borealis and Artibeus lituratus. We also compare the currently available mitochondrial genomes and analyze codon usage in Chiroptera. C. rafinesquii, L. borealis and A. lituratus mitochondrial genomes are 16438 bp, 17048 bp and 16709 bp, respectively. Genome organization and gene arrangements are similar to other bats. Phylogenetic analyses using complete mitochondrial genome sequences support previously established phylogenetic relationships and suggest utility in future studies focusing on the evolutionary aspects of these species. Comprehensive analyses of available bat mitochondrial genomes reveal distinct nucleotide patterns and synonymous codon preferences corresponding to different chiropteran families. These patterns suggest that mutational and selection forces are acting to different extents within Chiroptera and shape their mitochondrial genomes. Copyright © 2011 Elsevier B.V. All rights reserved.
Metzger, Julia; Gast, Alana Christina; Schrimpf, Rahel; Rau, Janina; Eikelberg, Deborah; Beineke, Andreas; Hellige, Maren; Distl, Ottmar
The Miniature Shetland pony represents a horse breed with an extremely small body size. Clinical examination of a dwarf Miniature Shetland pony revealed a lowered size at the withers, malformed skull and brachygnathia superior. Computed tomography (CT) showed a shortened maxilla and a cleft of the hard and soft palate which protruded into the nasal passage leading to breathing difficulties. Pathological examination confirmed these findings but did not reveal histopathological signs of premature ossification in limbs or cranial sutures. Whole-genome sequencing of this dwarf Miniature Shetland pony and comparative sequence analysis using 26 reference equids from NCBI Sequence Read Archive revealed three probably damaging missense variants which could be exclusively found in the affected foal. Validation of these three missense mutations in 159 control horses from different horse breeds and five donkeys revealed only the aggrecan (ACAN)-associated g.94370258G>C variant as homozygous wild-type in all control samples. The dwarf Miniature Shetland pony had the homozygous mutant genotype C/C of the ACAN:g.94370258G>C variant and the normal parents were heterozygous G/C. An unaffected full sib and 3/5 unaffected half-sibs were heterozygous G/C for the ACAN:g.94370258G>C variant. In summary, we could demonstrate a dwarf phenotype in a miniature pony breed perfectly associated with a missense mutation within the ACAN gene.
Full Text Available Two major transitions in animal evolution--the origins of multicellularity and bilaterality--correlate with major changes in mitochondrial DNA (mtDNA organization. Demosponges, the largest class in the phylum Porifera, underwent only the first of these transitions and their mitochondrial genomes display a peculiar combination of ancestral and animal-specific features. To get an insight into the evolution of mitochondrial genomes within the Demospongiae, we determined 17 new mtDNA sequences from this group and analyzing them with five previously published sequences. Our analysis revealed that all demosponge mtDNAs are 16- to 25-kbp circular molecules, containing 13-15 protein genes, 2 rRNA genes, and 2-27 tRNA genes. All but four pairs of sampled genomes had unique gene orders, with the number of shared gene boundaries ranging from 1 to 41. Although most demosponge species displayed low rates of mitochondrial sequence evolution, a significant acceleration in evolutionary rates occurred in the G1 group (orders Dendroceratida, Dictyoceratida, and Verticillitida. Large variation in mtDNA organization was also observed within the G0 group (order Homosclerophorida including gene rearrangements, loss of tRNA genes, and the presence of two introns in Plakortis angulospiculatus. While introns are rare in modern-day demosponge mtDNA, we inferred that at least one intron was present in cox1 of the common ancestor of all demosponges. Our study uncovered an extensive mitochondrial genomic diversity within the Demospongiae. Although all sampled mitochondrial genomes retained some ancestral features, including a minimally modified genetic code, conserved structures of tRNA genes, and presence of multiple non-coding regions, they vary considerably in their size, gene content, gene order, and the rates of sequence evolution. Some of the changes in demosponge mtDNA, such as the loss of tRNA genes and the appearance of hairpin-containing repetitive elements
Pausch, Hubert; Emmerling, Reiner; Gredler-Grandl, Birgit; Fries, Ruedi; Daetwyler, Hans D; Goddard, Michael E
Genotyping and whole-genome sequencing data have been generated for hundreds of thousands of cattle. International consortia used these data to compile imputation reference panels that facilitate the imputation of sequence variant genotypes for animals that have been genotyped using dense microarrays. Association studies with imputed sequence variant genotypes allow for the characterization of quantitative trait loci (QTL) at nucleotide resolution particularly when individuals from several breeds are included in the mapping populations. We imputed genotypes for 28 million sequence variants in 17,229 cattle of the Braunvieh, Fleckvieh and Holstein breeds in order to compile large mapping populations that provide high power to identify QTL for milk production traits. Association tests between imputed sequence variant genotypes and fat and protein percentages in milk uncovered between six and thirteen QTL (P < 1e-8) per breed. Eight of the detected QTL were significant in more than one breed. We combined the results across breeds using meta-analysis and identified a total of 25 QTL including six that were not significant in the within-breed association studies. Two missense mutations in the ABCG2 (p.Y581S, rs43702337, P = 4.3e-34) and GHR (p.F279Y, rs385640152, P = 1.6e-74) genes were the top variants at QTL on chromosomes 6 and 20. Another known causal missense mutation in the DGAT1 gene (p.A232K, rs109326954, P = 8.4e-1436) was the second top variant at a QTL on chromosome 14 but its allelic substitution effects were inconsistent across breeds. It turned out that the conflicting allelic substitution effects resulted from flaws in the imputed genotypes due to the use of a multi-breed reference population for genotype imputation. Many QTL for milk production traits segregate across breeds and across-breed meta-analysis has greater power to detect such QTL than within-breed association testing. Association testing between imputed sequence variant genotypes and
Veeramah, Krishna R; Rott, Andreas; Groß, Melanie; van Dorp, Lucy; López, Saioa; Kirsanow, Karola; Sell, Christian; Blöcher, Jens; Wegmann, Daniel; Link, Vivian; Hofmanová, Zuzana; Peters, Joris; Trautmann, Bernd; Gairhos, Anja; Haberstroh, Jochen; Päffgen, Bernd; Hellenthal, Garrett; Haas-Gebhard, Brigitte; Harbeck, Michaela; Burger, Joachim
Modern European genetic structure demonstrates strong correlations with geography, while genetic analysis of prehistoric humans has indicated at least two major waves of immigration from outside the continent during periods of cultural change. However, population-level genome data that could shed light on the demographic processes occurring during the intervening periods have been absent. Therefore, we generated genomic data from 41 individuals dating mostly to the late 5th/early 6th century AD from present-day Bavaria in southern Germany, including 11 whole genomes (mean depth 5.56×). In addition we developed a capture array to sequence neutral regions spanning a total of 5 Mb and 486 functional polymorphic sites to high depth (mean 72×) in all individuals. Our data indicate that while men generally had ancestry that closely resembles modern northern and central Europeans, women exhibit a very high genetic heterogeneity; this includes signals of genetic ancestry ranging from western Europe to East Asia. Particularly striking are women with artificial skull deformations; the analysis of their collective genetic ancestry suggests an origin in southeastern Europe. In addition, functional variants indicate that they also differed in visible characteristics. This example of female-biased migration indicates that complex demographic processes during the Early Medieval period may have contributed in an unexpected way to shape the modern European genetic landscape. Examination of the panel of functional loci also revealed that many alleles associated with recent positive selection were already at modern-like frequencies in European populations ∼1,500 years ago. Copyright © 2018 the Author(s). Published by PNAS.
Sibley L David
Full Text Available Abstract Background The phylum Apicomplexa is an early-branching eukaryotic lineage that contains a number of important human and animal pathogens. Their complex life cycles and unique cytoskeletal features distinguish them from other model eukaryotes. Apicomplexans rely on actin-based motility for cell invasion, yet the regulation of this system remains largely unknown. Consequently, we focused our efforts on identifying actin-related proteins in the recently completed genomes of Toxoplasma gondii, Plasmodium spp., Cryptosporidium spp., and Theileria spp. Results Comparative genomic and phylogenetic studies of apicomplexan genomes reveals that most contain only a single conventional actin and yet they each have 8–10 additional actin-related proteins. Among these are a highly conserved Arp1 protein (likely part of a conserved dynactin complex, and Arp4 and Arp6 homologues (subunits of the chromatin-remodeling machinery. In contrast, apicomplexans lack canonical Arp2 or Arp3 proteins, suggesting they lost the Arp2/3 actin polymerization complex on their evolutionary path towards intracellular parasitism. Seven of these actin-like proteins (ALPs are novel to apicomplexans. They show no phylogenetic associations to the known Arp groups and likely serve functions specific to this important group of intracellular parasites. Conclusion The large diversity of actin-like proteins in apicomplexans suggests that the actin protein family has diverged to fulfill various roles in the unique biology of intracellular parasites. Conserved Arps likely participate in vesicular transport and gene expression, while apicomplexan-specific ALPs may control unique biological traits such as actin-based gliding motility.
Tedeschi, Luis O.
Cattle body composition is difficult to model because several factors affect the composition of the average daily gain (ADG) of growing animals. The objective of this study was to identify commercial single nucleotide polymorphism (SNP) panels that could improve the predictability of days on feed (DOF) to reach a target United States Department of Agriculture (USDA) grade given animal, diet, and environmental information under feedyard conditions. The data for this study was comprised of crossbred heifers (n = 681) and steers (n = 836) from commercial feedyards. Eleven molecular breeding value (MBV) scores derived from SNP panels of candidate gene polymorphisms and two-leptin gene SNP (UASMS2 and E2FB) were evaluated. The empty body fat (EBF) and the shrunk body weight (SBW) at 28% EBF (AFSBW) were computed by the Cattle Value Discovery System (CVDS) model using hip height (EBFHH and AFSBWHH) or carcass traits (EBFCT and AFSBWCT) of the animals. The DOFHH was calculated when AFSBWHH and ADGHH were used and DOFCT was calculated when AFSBWCT and ADGCT were used. The CVDS estimates dry matter required (DMR) by individuals fed in groups when observed ADG and AFSBW are provided. The AFSBWCT was assumed more accurate than the AFSBWHH because it was computed using carcass traits. The difference between AFSBWCT and AFSBWHH, DOFCT and DOFHH, and DMR and dry matter intake (DMI) were regressed on the MBV scores and leptin gene SNP to explain the variation. Our results indicate quite a large range of correlations among MBV scores and model input and output variables, but MBV ribeye area was the most strongly correlated with the differences in DOF, AFSBW, and DMI by explaining 8, 13.2 and 6.5%, respectively, of the variation. This suggests that specific MBV scores might explain additional variation of input and output variables used by nutritional models in predicting individual animal performance. PMID:26599759
Luis O Tedeschi
Full Text Available Cattle body composition is difficult to model because several factors affect the composition of the average daily gain (ADG of growing animals. The objective of this study was to identify commercial single nucleotide polymorphism (SNP panels that could improve the predictability of days on feed (DOF to reach a target United States Department of Agriculture (USDA grade given animal, diet, and environmental information under feedyard conditions. The data for this study was comprised of crossbred heifers (n = 681 and steers (n = 836 from commercial feedyards. Eleven molecular breeding value (MBV scores derived from SNP panels of candidate gene polymorphisms and two-leptin gene SNP (UASMS2 and E2FB were evaluated. The empty body fat (EBF and the shrunk body weight (SBW at 28% EBF (AFSBW were computed by the Cattle Value Discovery System (CVDS model using hip height (EBFHH and AFSBWHH or carcass traits (EBFCT and AFSBWCT of the animals. The DOFHH was calculated when AFSBWHH and ADGHH were used and DOFCT was calculated when AFSBWCT and ADGCT were used. The CVDS estimates dry matter required (DMR by individuals fed in groups when observed ADG and AFSBW are provided. The AFSBWCT was assumed more accurate than the AFSBWHH because it was computed using carcass traits. The difference between AFSBWCT and AFSBWHH, DOFCT and DOFHH, and DMR and dry matter intake (DMI were regressed on the MBV scores and leptin gene SNP to explain the variation. Our results indicate quite a large range of correlations among MBV scores and model input and output variables, but MBV ribeye area was the most strongly correlated with the differences in DOF, AFSBW, and DMI by explaining 8, 13.2 and 6.5%, respectively, of the variation. This suggests that specific MBV scores might explain additional variation of input and output variables used by nutritional models in predicting individual animal performance.
Bos, Kirsten I.; Harkins, Kelly M.; Herbig, Alexander; Coscolla, Mireia; Weber, Nico; Comas, Iñaki; Forrest, Stephen A.; Bryant, Josephine M.; Harris, Simon R.; Schuenemann, Verena J.; Campbell, Tessa J.; Majander, Kerrtu; Wilbur, Alicia K.; Guichon, Ricardo A.; Wolfe Steadman, Dawnie L.; Cook, Della Collins; Niemann, Stefan; Behr, Marcel A.; Zumarraga, Martin; Bastida, Ricardo; Huson, Daniel; Nieselt, Kay; Young, Douglas; Parkhill, Julian; Buikstra, Jane E.; Gagneux, Sebastien; Stone, Anne C.; Krause, Johannes
Modern strains of Mycobacterium tuberculosis from the Americas are closely related to those from Europe, supporting the assumption that human tuberculosis was introduced post-contact1. This notion, however, is incompatible with archaeological evidence of pre-contact tuberculosis in the New World2. Comparative genomics of modern isolates suggests that M. tuberculosis attained its worldwide distribution following human dispersals out of Africa during the Pleistocene epoch3, although this has yet to be confirmed with ancient calibration points. Here we present three 1,000-year-old mycobacterial genomes from Peruvian human skeletons, revealing that a member of the M. tuberculosis complex caused human disease before contact. The ancient strains are distinct from known human-adapted forms and are most closely related to those adapted to seals and sea lions. Two independent dating approaches suggest a most recent common ancestor for the M. tuberculosis complex less than 6,000 years ago, which supports a Holocene dispersal of the disease. Our results implicate sea mammals as having played a role in transmitting the disease to humans across the ocean. PMID:25141181
Denise E. Costich
Full Text Available Switchgrass ( L., a native perennial dominant of the prairies of North America, has been targeted as a model herbaceous species for biofeedstock development. A flow-cytometric survey of a core set of 11 primarily upland polyploid switchgrass accessions indicated that there was considerable variation in genome size within each accession, particularly at the octoploid (2 = 8 = 72 chromosome ploidy level. Highly variable chromosome counts in mitotic cell preparations indicated that aneuploidy was more common in octoploids (86.3% than tetraploids (23.2%. Furthermore, the incidence of hyper- versus hypoaneuploidy is equivalent in tetraploids. This is clearly not the case in octoploids, where close to 90% of the aneuploid counts are lower than the euploid number. Cytogenetic investigation using fluorescent in situ hybridization (FISH revealed an unexpected degree of variation in chromosome structure underlying the apparent genomic instability of this species. These results indicate that rapid advances in the breeding of polyploid biofuel feedstocks, based on the molecular-genetic dissection of biomass characteristics and yield, will be predicated on the continual improvement of our understanding of the cytogenetics of these species.
Full Text Available Sequencing and annotation was performed for two giant double stranded DNA bacteriophages, φGrn1 and φSt2 of the Myoviridae family, considered to be of great interest for phage therapy against Vibrios in aquaculture live feeds. In addition, phage-host metabolic interactions and exploitation was studied by transcript profiling of selected viral and host genes. Comparative genomic analysis with other giant Vibrio phages was also performed to establish the presence and location of homing endonucleases highlighting distinct features for both phages. Phylogenetic analysis revealed that they belong to the schizoT4like clade. Although many reports of newly sequenced viruses have provided a large set of information, basic research related to the shift of the bacterial metabolism during infection remains stagnant. The function of many viral protein products in the process of infection is still unknown. Genome annotation identified the presence of several viral ORFs participating in metabolism, including a Sir2/cobB (sirtuin protein and a number of genes involved in auxiliary NAD+ and nucleotide biosynthesis, necessary for phage DNA replication. Key genes were subsequently selected for detail study of their expression levels during infection. This work suggests a complex metabolic interaction and exploitation of the host metabolic pathways and biochemical processes, including a possible post-translational protein modification, by the virus during infection.
Full Text Available Genetic analyses have recently been carried out on present-day Tuscans (Central Italy in order to investigate their presumable recent Near East ancestry in connection with the long-standing debate on the origins of the Etruscan civilization. We retrieved mitogenomes and genome-wide SNP data from 110 Tuscans analyzed within the context of The 1000 Genome Project. For phylogeographic and evolutionary analysis we made use of a large worldwide database of entire mitogenomes (>26,000 and partial control region sequences (>180,000.Different analyses reveal the presence of typical Near East haplotypes in Tuscans representing isolated members of various mtDNA phylogenetic branches. As a whole, the Near East component in Tuscan mitogenomes can be estimated at about 8%; a proportion that is comparable to previous estimates but significantly lower than admixture estimates obtained from autosomal SNP data (21%. Phylogeographic and evolutionary inter-population comparisons indicate that the main signal of Near Eastern Tuscan mitogenomes comes from Iran.Mitogenomes of recent Near East origin in present-day Tuscans do not show local or regional variation. This points to a demographic scenario that is compatible with a recent arrival of Near Easterners to this region in Italy with no founder events or bottlenecks.
Full Text Available Lactococcus garvieae is a major pathogen for fish. Two complete (ATCC 49156 and Lg2 and three draft (UNIUD074, 8831, and 21881 genome sequences of L. garvieae have recently been released. We here present the results of a comparative genomic analysis of these fish and human isolates of L. garvieae. The pangenome comprised 1,542 core and 1,378 dispensable genes. The sequenced L. garvieae strains shared most of the possible virulence genes, but the capsule gene cluster was found only in fish-pathogenic strain Lg2. The absence of the capsule gene cluster in other nonpathogenic strains isolated from mastitis and vegetable was also confirmed by PCR. The fish and human isolates of L. garvieae contained the specific two and four adhesin genes, respectively, indicating that these adhesion proteins may be involved in the host specificity differences of L. garvieae. The discoveries revealed by the pangenomic analysis may provide significant insights into the biology of L. garvieae.
Wilkins, Michael J.; Kennedy, David W.; Castelle, Cindy; Field, Erin; Stepanauskas, Ramunas; Fredrickson, Jim K.; Konopka, Allan
Bacteria from the genus Pedobacter are a major component of microbial assemblages at Hanford Site and have been shown to significantly change in abundance in response to the subsurface intrusion of Columbia River water. Here we employed single cell genomics techniques to shed light on the physiological niche of these microorganisms. Analysis of four Pedobacter single amplified genomes (SAGs) from Hanford Site sediments revealed a chemoheterotrophic lifestyle, with the potential to exist under both aerobic and microaerophilic conditions via expression of both aa3-type and cbb3-type cytochrome c oxidases. These SAGs encoded a wide-range of both intra-and extra-cellular carbohydrate-active enzymes, potentially enabling the degradation of recalcitrant substrates such as xylan and chitin, and the utilization of more labile sugars such as mannose and fucose. Coupled to these enzymes, a diversity of transporters and sugar-binding molecules were involved in the uptake of carbon from the extracellular local environment. The SAGs were enriched in TonB-dependent receptors (TBDRs), which play a key role in uptake of substrates resulting from degradation of recalcitrant carbon. CRISPR-Cas mechanisms for resisting viral infections were identified in all SAGs. These data demonstrate the potential mechanisms utilized for persistence by heterotrophic microorganisms in a carbon-limited aquifer, and hint at potential linkages between observed Pedobacter abundance shifts within the 300 Area subsurface and biogeochemical shifts associated with Columbia River water intrusion.
Fogarty, Norah M E; McCarthy, Afshan; Snijders, Kirsten E; Powell, Benjamin E; Kubikova, Nada; Blakeley, Paul; Lea, Rebecca; Elder, Kay; Wamaitha, Sissy E; Kim, Daesik; Maciulyte, Valdone; Kleinjung, Jens; Kim, Jin-Soo; Wells, Dagan; Vallier, Ludovic; Bertero, Alessandro; Turner, James M A; Niakan, Kathy K
Despite their fundamental biological and clinical importance, the molecular mechanisms that regulate the first cell fate decisions in the human embryo are not well understood. Here we use CRISPR-Cas9-mediated genome editing to investigate the function of the pluripotency transcription factor OCT4 during human embryogenesis. We identified an efficient OCT4-targeting guide RNA using an inducible human embryonic stem cell-based system and microinjection of mouse zygotes. Using these refined methods, we efficiently and specifically targeted the gene encoding OCT4 (POU5F1) in diploid human zygotes and found that blastocyst development was compromised. Transcriptomics analysis revealed that, in POU5F1-null cells, gene expression was downregulated not only for extra-embryonic trophectoderm genes, such as CDX2, but also for regulators of the pluripotent epiblast, including NANOG. By contrast, Pou5f1-null mouse embryos maintained the expression of orthologous genes, and blastocyst development was established, but maintenance was compromised. We conclude that CRISPR-Cas9-mediated genome editing is a powerful method for investigating gene function in the context of human development.
Yao, Chen; Zhu, Xiaojin; Weigel, Kent A
Genomic prediction for novel traits, which can be costly and labor-intensive to measure, is often hampered by low accuracy due to the limited size of the reference population. As an option to improve prediction accuracy, we introduced a semi-supervised learning strategy known as the self-training model, and applied this method to genomic prediction of residual feed intake (RFI) in dairy cattle. We describe a self-training model that is wrapped around a support vector machine (SVM) algorithm, which enables it to use data from animals with and without measured phenotypes. Initially, a SVM model was trained using data from 792 animals with measured RFI phenotypes. Then, the resulting SVM was used to generate self-trained phenotypes for 3000 animals for which RFI measurements were not available. Finally, the SVM model was re-trained using data from up to 3792 animals, including those with measured and self-trained RFI phenotypes. Incorporation of additional animals with self-trained phenotypes enhanced the accuracy of genomic predictions compared to that of predictions that were derived from the subset of animals with measured phenotypes. The optimal ratio of animals with self-trained phenotypes to animals with measured phenotypes (2.5, 2.0, and 1.8) and the maximum increase achieved in prediction accuracy measured as the correlation between predicted and actual RFI phenotypes (5.9, 4.1, and 2.4%) decreased as the size of the initial training set (300, 400, and 500 animals with measured phenotypes) increased. The optimal number of animals with self-trained phenotypes may be smaller when prediction accuracy is measured as the mean squared error rather than the correlation between predicted and actual RFI phenotypes. Our results demonstrate that semi-supervised learning models that incorporate self-trained phenotypes can achieve genomic prediction accuracies that are comparable to those obtained with models using larger training sets that include only animals with
In this study, we identified and compared nucleotide-binding site (NBS) domain-containing genes from three Citrus genomes (C. clementina, C. sinensis from USA and C. sinensis from China). Phylogenetic analysis of all Citrus NBS genes across these three genomes revealed that there are three approxima...
Dudas, Gytis; Carvalho, Luiz Max; Bedford, Trevor; Tatem, Andrew J.; Baele, Guy; Faria, Nuno R.; Park, Daniel J.; Ladner, Jason T.; Arias, Armando; Asogun, Danny; Bielejec, Filip; Caddy, Sarah L.; Cotten, Matthew; D’Ambrozio, Jonathan; Dellicour, Simon; Di Caro, Antonino; Diclaro, JosephW.; Duraffour, Sophie; Elmore, Michael J.; Fakoli, Lawrence S.; Faye, Ousmane; Gilbert, Merle L.; Gevao, Sahr M.; Gire, Stephen; Gladden-Young, Adrianne; Gnirke, Andreas; Goba, Augustine; Grant, Donald S.; Haagmans, Bart L.; Hiscox, Julian A.; Jah, Umaru; Kargbo, Brima; Kugelman, Jeffrey R.; Liu, Di; Lu, Jia; Malboeuf, Christine M.; Mate, Suzanne; Matthews, David A.; Matranga, Christian B.; Meredith, Luke W.; Qu, James; Quick, Joshua; Pas, Suzan D.; Phan, My VT; Pollakis, Georgios; Reusken, Chantal B.; Sanchez-Lockhart, Mariano; Schaffner, Stephen F.; Schieffelin, John S.; Sealfon, Rachel S.; Simon-Loriere, Etienne; Smits, Saskia L.; Stoecker, Kilian; Thorne, Lucy; Tobin, Ekaete Alice; Vandi, Mohamed A.; Watson, Simon J.; West, Kendra; Whitmer, Shannon; Wiley, Michael R.; Winnicki, Sarah M.; Wohl, Shirlee; Wölfel, Roman; Yozwiak, Nathan L.; Andersen, Kristian G.; Blyden, Sylvia O.; Bolay, Fatorma; Carroll, MilesW.; Dahn, Bernice; Diallo, Boubacar; Formenty, Pierre; Fraser, Christophe; Gao, George F.; Garry, Robert F.; Goodfellow, Ian; Günther, Stephan; Happi, Christian T.; Holmes, Edward C.; Kargbo, Brima; Keïta, Sakoba; Kellam, Paul; Koopmans, Marion P. G.; Kuhn, Jens H.; Loman, Nicholas J.; Magassouba, N’Faly; Naidoo, Dhamari; Nichol, Stuart T.; Nyenswah, Tolbert; Palacios, Gustavo; Pybus, Oliver G.; Sabeti, Pardis C.; Sall, Amadou; Ströher, Ute; Wurie, Isatta; Suchard, Marc A.; Lemey, Philippe; Rambaut, Andrew
The 2013–2016 epidemic of Ebola virus disease was of unprecedented magnitude, duration and impact. Analysing 1610 Ebola virus genomes, representing over 5% of known cases, we reconstruct the dispersal, proliferation and decline of Ebola virus throughout the region. We test the association of geography, climate and demography with viral movement among administrative regions, inferring a classic ‘gravity’ model, with intense dispersal between larger and closer populations. Despite attenuation of international dispersal after border closures, cross-border transmission had already set the seeds for an international epidemic, rendering these measures ineffective in curbing the epidemic. We address why the epidemic did not spread into neighbouring countries, showing they were susceptible to significant outbreaks but at lower risk of introductions. Finally, we reveal this large epidemic to be a heterogeneous and spatially dissociated collection of transmission clusters of varying size, duration and connectivity. These insights will help inform interventions in future epidemics. PMID:28405027
Full Text Available Chronic kidney disease (CKD is an important public health problem with a genetic component. We performed genome-wide association studies in up to 130,600 European ancestry participants overall, and stratified for key CKD risk factors. We uncovered 6 new loci in association with estimated glomerular filtration rate (eGFR, the primary clinical measure of CKD, in or near MPPED2, DDX1, SLC47A1, CDK12, CASP9, and INO80. Morpholino knockdown of mpped2 and casp9 in zebrafish embryos revealed podocyte and tubular abnormalities with altered dextran clearance, suggesting a role for these genes in renal function. By providing new insights into genes that regulate renal function, these results could further our understanding of the pathogenesis of CKD.
Wang, Shuai; Wang, Sen; Luo, Yingfeng; Xiao, Lihua; Luo, Xuenong; Gao, Shenghan; Dou, Yongxi; Zhang, Huangkai; Guo, Aijiang; Meng, Qingshu; Hou, Junling; Zhang, Bing; Zhang, Shaohua; Yang, Meng; Meng, Xuelian; Mei, Hailiang; Li, Hui; He, Zilong; Zhu, Xueliang; Tan, Xinyu; Zhu, Xing-quan; Yu, Jun; Cai, Jianping; Zhu, Guan; Hu, Songnian; Cai, Xuepeng
Taenia saginata, Taenia solium and Taenia asiatica (beef, pork and Asian tapeworms, respectively) are parasitic flatworms of major public health and food safety importance. Among them, T. asiatica is a newly recognized species that split from T. saginata via an intermediate host switch ∼1.14 Myr ago. Here we report the 169- and 168-Mb draft genomes of T. saginata and T. asiatica. Comparative analysis reveals that high rates of gene duplications and functional diversifications might have partially driven the divergence between T. asiatica and T. saginata. We observe accelerated evolutionary rates, adaptive evolutions in homeostasis regulation, tegument maintenance and lipid uptakes, and differential/specialized gene family expansions in T. asiatica that may favour its hepatotropism in the new intermediate host. We also identify potential targets for developing diagnostic or intervention tools against human tapeworms. These data provide new insights into the evolution of Taenia parasites, particularly the recent speciation of T. asiatica. PMID:27653464
Mason, Olivia U.; Hazen, Terry C.; Borglin, Sharon; Chain, Patrick S. G.; Dubinsky, Eric A.; Fortney, Julian L.; Han, James; Holman, Hoi-Ying N.; Hultman, Jenni; Lamendella, Regina; Mackelprang, Rachel; Malfatti, Stephanie; Tom, Lauren M.; Tringe, Susannah G.; Woyke, Tanja; Zhou, Jizhong; Rubin, Edward M.; Jansson, Janet K.
The Deepwater Horizon oil spill in the Gulf of Mexico resulted in a deep-sea hydrocarbon plume that caused a shift in the indigenous microbial community composition with unknown ecological consequences. Early in the spill history, a bloom of uncultured, thus uncharacterized, members of the Oceanospirillales was previously detected, but their role in oil disposition was unknown. Here our aim was to determine the functional role of the Oceanospirillales and other active members of the indigenous microbial community using deep sequencing of community DNA and RNA, as well as single-cell genomics. Shotgun metagenomic and metatranscriptomic sequencing revealed that genes for motility, chemotaxis and aliphatic hydrocarbon degradation were significantly enriched and expressed in the hydrocarbon plume samples compared with uncontaminated seawater collected from plume depth. In contrast, although genes coding for degradation of more recalcitrant compounds, such as benzene, toluene, ethylbenzene, total xylenes and polycyclic aromatic hydrocarbons, were identified in the metagenomes, they were expressed at low levels, or not at all based on analysis of the metatranscriptomes. Isolation and sequencing of two Oceanospirillales single cells revealed that both cells possessed genes coding for n-alkane and cycloalkane degradation. Specifically, the near-complete pathway for cyclohexane oxidation in the Oceanospirillales single cells was elucidated and supported by both metagenome and metatranscriptome data. The draft genome also included genes for chemotaxis, motility and nutrient acquisition strategies that were also identified in the metagenomes and metatranscriptomes. These data point towards a rapid response of members of the Oceanospirillales to aliphatic hydrocarbons in the deep sea.
Hoeppner Marc P
Full Text Available Abstract Background Small nucleolar (snoRNAs are required for posttranscriptional processing and modification of ribosomal, spliceosomal and messenger RNAs. Their presence in both eukaryotes and archaea indicates that snoRNAs are evolutionarily ancient. The location of some snoRNAs within the introns of ribosomal protein genes has been suggested to belie an RNA world origin, with the exons of the earliest protein-coding genes having evolved around snoRNAs after the advent of templated protein synthesis. Alternatively, this intronic location may reflect more recent selection for coexpression of snoRNAs and ribosomal components, ensuring rRNA modification by snoRNAs during ribosome synthesis. To gain insight into the evolutionary origins of this genetic organization, we examined the antiquity of snoRNA families and the stability of their genomic location across 44 eukaryote genomes. Results We report that dozens of snoRNA families are traceable to the Last Eukaryotic Common Ancestor (LECA, but find only weak similarities between the oldest eukaryotic snoRNAs and archaeal snoRNA-like genes. Moreover, many of these LECA snoRNAs are located within the introns of host genes independently traceable to the LECA. Comparative genomic analyses reveal the intronic location of LECA snoRNAs is not ancestral however, suggesting the pattern we observe is the result of ongoing intragenomic mobility. Analysis of human transcriptome data indicates that the primary requirement for hosting intronic snoRNAs is a broad expression profile. Consistent with ongoing mobility across broadly-expressed genes, we report a case of recent migration of a non-LECA snoRNA from the intron of a ubiquitously expressed non-LECA host gene into the introns of two LECA genes during the evolution of primates. Conclusions Our analyses show that snoRNAs were a well-established family of RNAs at the time when eukaryotes began to diversify. While many are intronic, this association is not
Zheng, Jinshui; Gao, Qiuling; Liu, Linlin; Liu, Hualin; Wang, Yueying; Peng, Donghai; Ruan, Lifang; Raymond, Ben; Sun, Ming
Understanding the genetic basis of host shifts is a key genomic question for pathogen and parasite biology. The Bacillus cereus group, which encompasses Bacillus thuringiensis and Bacillus anthracis , contains pathogens that can infect insects, nematodes, and vertebrates. Since the target range of the essential virulence factors (Cry toxins) and many isolates is well known, this group presents a powerful system for investigating how pathogens can diversify and adapt to phylogenetically distant hosts. Specialization to exploit insects occurs at the level of the major clade and is associated with substantial changes in the core genome, and host switching between insect orders has occurred repeatedly within subclades. The transfer of plasmids with linked cry genes may account for much of the adaptation to particular insect orders, and network analysis implies that host specialization has produced strong associations between key toxin genes with similar targets. Analysis of the distribution of plasmid minireplicons shows that plasmids with orf156 and orf157 , which carry genes encoding toxins against Lepidoptera or Diptera, were contained only by B. thuringiensis in the specialized insect clade (clade 2), indicating that tight genome/plasmid associations have been important in adaptation to invertebrate hosts. Moreover, the accumulation of multiple virulence factors on transposable elements suggests that cotransfer of diverse virulence factors is advantageous in terms of expanding the insecticidal spectrum, overcoming insect resistance, or through gains in pathogenicity via synergistic interactions between toxins. IMPORTANCE Population genomics have provided many new insights into the formation, evolution, and dynamics of bacterial pathogens of humans and other higher animals, but these pathogens usually have very narrow host ranges. As a pathogen of insects and nematodes, Bacillus thuringiensis , which produces toxins showing toxicity to many orders of insects and
Buitenhuis, Bart; Janss, Luc L G; Poulsen, Nina Aagaard
The milk fat profile of the Danish Holstein (DH) and Danish Jersey (DJ) show clear differences. Identification of the genomic regions, genes and biological pathways underlying the milk fat biosynthesis will improve the understanding of the biology underlying bovine milk fat production and may...... provide new possibilities to change the milk fat composition by selective breeding. In this study a genome wide association scan (GWAS) in the DH and DJ was performed for a detailed milk fatty acid (FA) profile using the HD bovine SNP array and subsequently a biological pathway analysis based on the SNP...
Zhang, D; Cupp, M S; Cupp, E W
Thrombostasin (TS) is a newly described thrombin-inhibiting protein isolated from the saliva of the horn fly (Haematobia irritans), a blood-sucking ectoparasite of cattle. This report provides a detailed characterization of the TS gene and the first analysis of the allelic complexity of a gene for an anti-hemostatic protein from a blood-feeding insect. Multiple point mutations at fixed positions in the TS gene were identified in a cDNA library prepared from mRNA isolated from horn fly salivary glands. When translated, the variant mRNAs would specify five biochemically active peptides that differ in molecular weight, isoelectric point and predicted secondary structure. Allelic variation with the same mutation pattern was revealed in the genomes of individual flies collected in the field and sampled from a long-standing laboratory colony. Approximately 60% of flies examined carried heterozygous alleles, including five additional alleles not found in the cDNA library. Comparative analysis of the allelic mutations and the predicted effects on secondary structures of the active proteins produced suggest that the TS gene may be undergoing evolutionary selection.
Kateryna V Zhalnina
Full Text Available The activity of ammonia-oxidizing archaea (AOA leads to the loss of nitrogen from soil, pollution of water sources and elevated emissions of greenhouse gas. To date, eight AOA genomes are available in the public databases, seven are from the group I.1a of the Thaumarchaeota and only one is from the group I.1b, isolated from hot springs. Many soils are dominated by AOA from the group I.1b, but the genomes of soil representatives of this group have not been sequenced and functionally characterized. The lack of knowledge of metabolic pathways of soil AOA presents a critical gap in understanding their role in biogeochemical cycles. Here, we describe the first complete genome of soil archaeon Candidatus Nitrososphaera evergladensis, which has been reconstructed from metagenomic sequencing of a highly enriched culture obtained from an agricultural soil. The AOA enrichment was sequenced with the high throughput next generation sequencing platforms from Pacific Biosciences and Ion Torrent. The de novo assembly of sequences resulted in one 2.95 Mb contig. Annotation of the reconstructed genome revealed many similarities of the basic metabolism with the rest of sequenced AOA. Ca. N. evergladensis belongs to the group I.1b and shares only 40% of whole-genome homology with the closest sequenced relative Ca. N. gargensis. Detailed analysis of the genome revealed coding sequences that were completely absent from the group I.1a. These unique sequences code for proteins involved in control of DNA integrity, transporters, two-component systems and versatile CRISPR defense system. Notably, genomes from the group I.1b have more gene duplications compared to the genomes from the group I.1a. We suggest that the presence of these unique genes and gene duplications may be associated with the environmental versatility of this group.
Vital, Marius; Howe, Adina Chuang; Tiedje, James M
Butyrate-producing bacteria have recently gained attention, since they are important for a healthy colon and when altered contribute to emerging diseases, such as ulcerative colitis and type II diabetes. This guild is polyphyletic and cannot be accurately detected by 16S rRNA gene sequencing. Consequently, approaches targeting the terminal genes of the main butyrate-producing pathway have been developed. However, since additional pathways exist and alternative, newly recognized enzymes catalyzing the terminal reaction have been described, previous investigations are often incomplete. We undertook a broad analysis of butyrate-producing pathways and individual genes by screening 3,184 sequenced bacterial genomes from the Integrated Microbial Genome database. Genomes of 225 bacteria with a potential to produce butyrate were identified, including many previously unknown candidates. The majority of candidates belong to distinct families within the Firmicutes, but members of nine other phyla, especially from Actinobacteria, Bacteroidetes, Fusobacteria, Proteobacteria, Spirochaetes, and Thermotogae, were also identified as potential butyrate producers. The established gene catalogue (3,055 entries) was used to screen for butyrate synthesis pathways in 15 metagenomes derived from stool samples of healthy individuals provided by the HMP (Human Microbiome Project) consortium. A high percentage of total genomes exhibited a butyrate-producing pathway (mean, 19.1%; range, 3.2% to 39.4%), where the acetyl-coenzyme A (CoA) pathway was the most prevalent (mean, 79.7% of all pathways), followed by the lysine pathway (mean, 11.2%). Diversity analysis for the acetyl-CoA pathway showed that the same few firmicute groups associated with several Lachnospiraceae and Ruminococcaceae were dominating in most individuals, whereas the other pathways were associated primarily with Bacteroidetes. IMPORTANCE Microbiome research has revealed new, important roles of our gut microbiota for
Full Text Available Aim: To analyze the promoter sequence of toll-like receptor (TLR genes in Vechur cattle, an indigenous breed of Kerala with the sequence of Bos taurus and access the differences that could be attributed to innate immune responses against bovine mastitis. Materials and Methods: Blood samples were collected from Jugular vein of Vechur cattle, maintained at Vechur cattle conservation center of Kerala Veterinary and Animal Sciences University, using an acid-citrate-dextrose anticoagulant. The genomic DNA was extracted, and polymerase chain reaction was carried out to amplify the promoter region of TLRs. The amplified product of TLR2, 4, and 9 promoter regions was sequenced by Sanger enzymatic DNA sequencing technique. Results: The sequence of promoter region of TLR2 of Vechur cattle with the B. taurus sequence present in GenBank showed 98% similarity and revealed variants for four sequence motifs. The sequence of the promoter region of TLR4 of Vechur cattle revealed 99% similarity with that of B. taurus sequence but not reveals significant variant in motifregions. However, two heterozygous loci were observed from the chromatogram. Promoter sequence of TLR9 gene also showed 99% similarity to B. taurus sequence and revealed variants for four sequence motifs. Conclusion: The results of this study indicate that significant variation in the promoter of TLR2 and 9 genes in Vechur cattle breed and may potentially link the influence the innate immunity response against mastitis diseases.
Saunders, Neil F.W.; Thomas, Torsten; Curmi, Paul M.G.; Mattick, John S.; Kuczek, Elizabeth; Slade, Rob; Davis, John; Franzmann, Peter D.; Boone, David; Rusterholtz, Karl; Feldman, Robert; Gates, Chris; Bench, Shellie; Sowers, Kevin; Kadner, Kristen
We generated draft genome sequences for two cold-adapted Archaea, Methanogenium frigidum and Methanococcoides burtonii, to identify genotypic characteristics that distinguish them from Archaea with a higher optimal growth temperature (OGT). Comparative genomics revealed trends in amino acid and tRNA composition, and structural features of proteins. Proteins from the cold-adapted Archaea are characterized by a higher content of noncharged polar amino acids, particularly Gl...
Saunders, Neil F.W.; Thomas, Torsten; Curmi, Paul M.G.; Mattick, John S.; Kuczek, Elizabeth; Slade, Rob; Davis, John; Franzmann, Peter; Boone, David; Rusterholtz, Karl; Feldman, Robert; Gates, Chris; Bench, Shellie; Sowers, Kevin; Kadner, Kristen
We generated draft genome sequences for two cold-adapted Archaea, Methanogenium frigidum and Methanococcoides burtonii, to identify genotypic characteristics that distinguish them from Archaea with a higher optimal growth temperature (OGT). Comparative genomics revealed trends in amino acid and tRNA composition, and structural features of proteins. Proteins from the cold-adapted Archaea are characterized by a higher content of non-charged polar amino acids, particularly Gln and Thr and a lowe...
Full Text Available Abstract Background Bird genomes have very different compositional structure compared with other warm-blooded animals. The variation in the base skew rules in the vertebrate genomes remains puzzling, but it must relate somehow to large-scale genome evolution. Current research is inclined to relate base skew with mutations and their fixation. Here we wish to explore base skew correlations in bird genomes, to develop methods for displaying and quantifying such correlations at different scales, and to discuss possible explanations for the peculiarities of the bird genomes in skew correlation. Results We have developed a method called Base Skew Double Triangle (BSDT for exhibiting the genome-scale change of AT/CG skew as a two-dimensional square picture, showing base skews at many scales simultaneously in a single image. By this method we found that most chicken chromosomes have high AT/CG skew correlation (symmetry in 2D picture, except for some microchromosomes. No other organisms studied (18 species show such high skew correlations. This visualized high correlation was validated by three kinds of quantitative calculations with overlapping and non-overlapping windows, all indicating that chicken and birds in general have a special genome structure. Similar features were also found in some of the mammal genomes, but clearly much weaker than in chickens. We presume that the skew correlation feature evolved near the time that birds separated from other vertebrate lineages. When we eliminated the repeat sequences from the genomes, the AT and CG skews correlation increased for some mammal genomes, but were still clearly lower than in chickens. Conclusion Our results suggest that BSDT is an expressive visualization method for AT and CG skew and enabled the discovery of the very high skew correlation in bird genomes; this peculiarity is worth further study. Computational analysis indicated that this correlation might be a compositional characteristic
Complete Genome Sequences of Five Bluetongue Virus (BTV) Vaccine Strains from a Commercial Live Attenuated Vaccine, a BTV-4 Field Strain from South Africa, and a Reassortant Strain Isolated from Experimentally Vaccinated Cattle
Coetzee, Peter; le Grange, Misha; Venter, Estelle H.
This is a report of the complete genome sequences of plaque-selected isolates of each of the five virus strains included in a South African commercial trivalent bluetongue virus (BTV) attenuated live virus vaccine, a BTV-4 field strain isolated from Rustenburg, South Africa, in 2011, and a bluetongue reassortant (bluetongue virus 4 strain 4/O. aries-tc/ZAF/11/OBP-115) isolated from experimentally vaccinated cattle. Full-genome sequencing and phylogenetic analyses show that the bluetongue virus 9 strain 9/B. taurus-tc/ZAF/15/Onderstepoort_B02b is a reassortant virus containing segments from both BTV-9 and BTV-8. PMID:27340051
Antunes, S.; Galindo, R. C.; Almazán, C.; Rudenko, Natalia; Golovchenko, Maryna; Grubhoffer, Libor; Shkap, V.; do Rosário, A.; de la Fuente, J.; Domingos, A.
Roč. 42, č. 2 (2012), s. 187-195 ISSN 0020-7519 Institutional research plan: CEZ:AV0Z60220518 Keywords : Tick * Genomics * Babesia * Rhipicephalus * Boophilus * RNA interference * Vaccine Subject RIV: EC - Immunology Impact factor: 3.637, year: 2012 http://www.sciencedirect.com/science/article/pii/S0020751912000033
Pszczola, M.J.; Mulder, H.A.; Calus, M.P.L.
Genomic selection (GS) permits accurate breeding values to be obtained for young animals, shortening the generation interval and accelerating the genetic gain, thereby leading to reduced costs for proven bulls. Genotyping a large number of animals using high-density single nucleotide polymorphism
As the methodologies available for the detection of positive selection from genomic data vary in terms of assumptions and execution, weak correlations are expected among them. However, if there is any given signal that is consistently supported across different tests, it might be a strong evidence o...
Zhao, Zhongtao; Liu, Huiquan; Wang, Chenfang; Xu, Jin-Rong
Fungi produce a variety of carbohydrate activity enzymes (CAZymes) for the degradation of plant polysaccharide materials to facilitate infection and/or gain nutrition. Identifying and comparing CAZymes from fungi with different nutritional modes or infection mechanisms may provide information for better understanding of their life styles and infection models. To date, over hundreds of fungal genomes are publicly available. However, a systematic comparative analysis of fungal CAZymes across the entire fungal kingdom has not been reported. In this study, we systemically identified glycoside hydrolases (GHs), polysaccharide lyases (PLs), carbohydrate esterases (CEs), and glycosyltransferases (GTs) as well as carbohydrate-binding modules (CBMs) in the predicted proteomes of 103 representative fungi from Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota. Comparative analysis of these CAZymes that play major roles in plant polysaccharide degradation revealed that fungi exhibit tremendous diversity in the number and variety of CAZymes. Among them, some families of GHs and CEs are the most prevalent CAZymes that are distributed in all of the fungi analyzed. Importantly, cellulases of some GH families are present in fungi that are not known to have cellulose-degrading ability. In addition, our results also showed that in general, plant pathogenic fungi have the highest number of CAZymes. Biotrophic fungi tend to have fewer CAZymes than necrotrophic and hemibiotrophic fungi. Pathogens of dicots often contain more pectinases than fungi infecting monocots. Interestingly, besides yeasts, many saprophytic fungi that are highly active in degrading plant biomass contain fewer CAZymes than plant pathogenic fungi. Furthermore, analysis of the gene expression profile of the wheat scab fungus Fusarium graminearum revealed that most of the CAZyme genes related to cell wall degradation were up-regulated during plant infection. Phylogenetic analysis also revealed a complex
Background Fungi produce a variety of carbohydrate activity enzymes (CAZymes) for the degradation of plant polysaccharide materials to facilitate infection and/or gain nutrition. Identifying and comparing CAZymes from fungi with different nutritional modes or infection mechanisms may provide information for better understanding of their life styles and infection models. To date, over hundreds of fungal genomes are publicly available. However, a systematic comparative analysis of fungal CAZymes across the entire fungal kingdom has not been reported. Results In this study, we systemically identified glycoside hydrolases (GHs), polysaccharide lyases (PLs), carbohydrate esterases (CEs), and glycosyltransferases (GTs) as well as carbohydrate-binding modules (CBMs) in the predicted proteomes of 103 representative fungi from Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota. Comparative analysis of these CAZymes that play major roles in plant polysaccharide degradation revealed that fungi exhibit tremendous diversity in the number and variety of CAZymes. Among them, some families of GHs and CEs are the most prevalent CAZymes that are distributed in all of the fungi analyzed. Importantly, cellulases of some GH families are present in fungi that are not known to have cellulose-degrading ability. In addition, our results also showed that in general, plant pathogenic fungi have the highest number of CAZymes. Biotrophic fungi tend to have fewer CAZymes than necrotrophic and hemibiotrophic fungi. Pathogens of dicots often contain more pectinases than fungi infecting monocots. Interestingly, besides yeasts, many saprophytic fungi that are highly active in degrading plant biomass contain fewer CAZymes than plant pathogenic fungi. Furthermore, analysis of the gene expression profile of the wheat scab fungus Fusarium graminearum revealed that most of the CAZyme genes related to cell wall degradation were up-regulated during plant infection. Phylogenetic analysis also
Mohammed, Manal; Le Hello, Simon; Leekitcharoenphon, Pimlapas
Salmonella enterica serovar Dublin is a zoonotic infection that can be transmitted from cattle to humans through consumption of contaminated milk and milk products. Outbreaks of human infections by S. Dublin have been reported in several countries including high-income countries. A high proportio...
Full Text Available Adipocytes mainly function as energy storage and endocrine cells. Adipose tissues showed the biological and genetic difference based on their depots. The difference of adipocytes between depots might be influenced by the inherent genetic programing for adipogenesis. We used RNA-seq technique to investigate the transcriptomes in 3 adipose tissues of omental (O, subcutaneous (S and intramuscular (I fats in cattle. Sequence reads were obtained from Illumina HiSeq2000 and mapped to the bovine genome using Tophat2. Differentially expressed genes (DEG between adipose tissues were detected by EdgeR. We identified 5797, 2156, and 5455 DEGs in the comparison between OI, OS, and IS respectively and also found 5657 DEGs in the comparison between the intramuscular and the combined omental and subcutaneous fats (C (FDR<0.01. Depot specifically up- and down- regulated DEGs were 853 in S, 48 in I, and 979 in O. The numbers of DEGs and functional annotation studies suggested that I had the different genetic profile compared to other two adipose tissues. In I, DEGs involved in the developmental process (eg. EGR2, FAS, and KLF7 were up-regulated and those in the immune system process were down-regulated. Many DEGs from the adipose tissues were enriched in the various GO terms of developmental process and KEGG pathway analysis showed that the ECM-receptor interaction was one of commonly enriched pathways in all of the 3 adipose tissues and also functioned as a sub-pathway of other enriched pathways. However, genes involved in the ECM-receptor interaction were differentially regulated depending on the depots. Collagens, main ECM constituents, were significantly up-regulated in S and integrins, transmembrane receptors, were up-regulated in I. Different laminins were up-regulated in the different depots. This comparative transcriptome analysis of three adipose tissues suggested that the interactions between ECM components and transmembrane receptors of fat cells
González-Rodríguez, A; Munilla, S; Mouresan, E F; Cañas-Álvarez, J J; Baro, J A; Molina, A; Díaz, C; Altarriba, J; Piedrafita, J; Varona, L
The Spanish local beef cattle breeds have most likely common origin followed by a process of differentiation. This particular historical evolution has most probably left detectable signatures in the genome. The objective of this study was to identify genomic regions associated with differentiation processes in seven Spanish autochthonous populations (Asturiana de los Valles (AV), Avileña-Negra Ibérica (ANI), Bruna dels Pirineus (BP), Morucha (Mo), Pirenaica (Pi), Retinta (Re) and Rubia Gallega (RG)). The BovineHD 777K BeadChip was used on 342 individuals (AV, n=50; ANI, n=48; BP, n=50; Mo, n=50; Pi, n=48; Re, n=48; RG, n=48) chosen to be as unrelated as possible. We calculated the fixation index (F ST ) and performed a Bayesian analysis named SelEstim. The output of both procedures was very similar, although the Bayesian analysis provided a richer inference and allowed us to calculate significance thresholds by generating a pseudo-observed data set from the estimated posterior distributions. We identified a very large number of genomic regions, but when a very restrictive significance threshold was applied these regions were reduced to only 10. Among them, four regions can be highlighted because they comprised a large number of single nucleotide polymorphisms and showed extremely high signals (Kullback-Leiber divergence (KLD)>6). They are located in BTA 2 (5 575 950 to 10 152 228 base pairs (bp)), BTA 5 (17 596 734 to 18 850 702 bp), BTA 6 (37 853 912 to 39 441 548 bp) and BTA 18 (13 345 515 to 15 243 838 bp) and harbor, among others, the MSTN (Myostatin), KIT-LG (KIT Ligand), LAP3 (leucine aminopeptidase 3), NAPCG (non-SMC condensing I complex, subunit G), LCORL (ligand dependent nuclear receptor corepressor-like) and MC1R (Melanocortin 1 receptor) genes. Knowledge on these genomic regions allows to identify potential targets of recent selection and helps to define potential candidate genes associated with traits of interest, such as coat color, muscle
Susanna K. P. Lau
Full Text Available Talaromyces marneffei is a thermally dimorphic fungus causing systemic infections in patients positive for HIV or other immunocompromised statuses. Analysis of its ~28.9 Mb draft genome and additional transcriptomic, proteomic and metabolomic studies revealed mechanisms for environmental adaptations and virulence. Meiotic genes and genes for pheromone receptors, enzymes which process pheromones, and proteins involved in pheromone response pathway are present, indicating its possibility as a heterothallic fungus. Among the 14 Mp1p homologs, only Mp1p is a virulence factor binding a variety of host proteins, fatty acids and lipids. There are 23 polyketide synthase genes, one for melanin and two for mitorubrinic acid/mitorubrinol biosynthesis, which are virulence factors. Another polyketide synthase is for biogenesis of the diffusible red pigment, which consists of amino acid conjugates of monascorubin and rubropunctatin. Novel microRNA-like RNAs (milRNAs and processing proteins are present. The dicer protein, dcl-2, is required for biogenesis of two milRNAs, PM-milR-M1 and PM-milR-M2, which are more highly expressed in hyphal cells. Comparative transcriptomics showed that tandem repeat-containing genes were overexpressed in yeast phase, generating protein polymorphism among cells, evading host’s immunity. Comparative proteomics between yeast and hyphal cells revealed that glyceraldehyde-3-phosphate dehydrogenase, up-regulated in hyphal cells, is an adhesion factor for conidial attachment.
Full Text Available A genome-wide association study was performed for milk yield per lactation (MY, initial yield (IY, peak yield (PY, persistency (PS and age at first calving (AFC in a Thai multibreed dairy cattle population. The dataset contained 1305 first-lactation cows raised on 188 farms located in Central, Northeastern and Southern Thailand. Cows were genotyped with GeneSeek Genomic Profiler low-density bead chips (8810 single nucleotide polymorphism [SNP]; n = 1255 and with high-density bead chips (76,883 SNP; n = 50. The single SNP association analyses utilized 8096 SNPs in common between the low and high density GeneSeek chips. The mixed model contained the fixed effects of contemporary group, fraction of non-Holstein breeds, age at first calving and gene content, and the random effects of animal and residual. Computations were done with the QXPAK.5 software. The number of SNPs associated with MY, IY, PY, PS and AFC at the significant threshold level of p < 0.00001 were 75, 102, 145, 74 and 24, respectively. Of the 366 SNP markers significantly associated with the studied traits, 54 (14.75% were associated with two traits and 312 (85.25% with only one trait, and all but one of the 54 SNPs associated with two traits affected MY and lactation characteristics. Genetic improvement of Thai dairy cows for lactation characteristics, milk yield and age at first calving could be aided by selecting animals with the SNP markers found to be highly associated with genes influencing these traits.
David B. Neale; Patrick E. McGuire; Nicholas C. Wheeler; Kristian A. Stevens; Marc W. Crepeau; Charis Cardeno; Aleksey V. Zimin; Daniela Puiu; Geo M. Pertea; U. Uzay Sezen; Claudio Casola; Tomasz E. Koralewski; Robin Paul; Daniel Gonzalez-Ibeas; Sumaira Zaman; Richard Cronn; Mark Yandell; Carson Holt; Charles H. Langley; James A. Yorke; Steven L. Salzberg; Jill L. Wegrzyn
A reference genome sequence for Pseudotsuga menziesii var. menziesii (Mirb.) Franco (Coastal Douglas-fir) is reported, thus providing a reference sequence for a third genus of the family Pinaceae. The contiguity and quality of the genome assembly far exceeds that of other conifer reference genome sequences (contig N50 = 44,136 bp and scaffold N50...
Xue, Yali; Prado-Martinez, Javier; Sudmant, Peter H
Mountain gorillas are an endangered great ape subspecies and a prominent focus for conservation, yet we know little about their genomic diversity and evolutionary past. We sequenced whole genomes from multiple wild individuals and compared the genomes of all four Gorilla subspecies. We found that...
The small East African Shorthorn Zebu is the main indigenous cattle across East Africa. A recent genome wide SNPs analysis has revealed their ancient stable African taurine x Asian zebu admixture. Here, we assess the presence of candidate signature of positive selection in their genome, with the aim...
Full Text Available Mycobacterium ulcerans is the causative agent of Buruli ulcer, the third most common mycobacterial disease after tuberculosis and leprosy. It is an emerging infectious disease that afflicts mainly children and youths in West Africa. Little is known about the evolution and transmission mode of M. ulcerans, partially due to the lack of known genetic polymorphisms among isolates, limiting the application of genetic epidemiology. To systematically profile single nucleotide polymorphisms (SNPs, we sequenced the genomes of three M. ulcerans strains using 454 and Solexa technologies. Comparison with the reference genome of the Ghanaian classical lineage isolate Agy99 revealed 26,564 SNPs in a Japanese strain representing the ancestral lineage. Only 173 SNPs were found when comparing Agy99 with two other Ghanaian isolates, which belong to the two other types previously distinguished in Ghana by variable number tandem repeat typing. We further analyzed a collection of Ghanaian strains using the SNPs discovered. With 68 SNP loci, we were able to differentiate 54 strains into 13 distinct SNP haplotypes. The average SNP nucleotide diversity was low (average 0.06-0.09 across 68 SNP loci, and 96% of the SNP locus pairs were in complete linkage disequilibrium. We estimated that the divergence of the M. ulcerans Ghanaian clade from the Japanese strain occurred 394 to 529 thousand years ago. The Ghanaian subtypes diverged about 1000 to 3000 years ago, or even much more recently, because we found evidence that they evolved significantly faster than average. Our results offer significant insight into the evolution of M. ulcerans and provide a comprehensive report on genetic diversity within a highly clonal M. ulcerans population from a Buruli ulcer endemic region, which can facilitate further epidemiological studies of this pathogen through the development of high-resolution tools.
García-Huertas, Paola; Mejía-Jaramillo, Ana María; González, Laura; Triana-Chávez, Omar
Currently, the only available treatments for Trypanosoma cruzi are benznidazole (Bz) and nifurtimox (Nfx). The mechanisms of action and resistance to these drugs in this parasite are not complete known. In order to identify differentially expressed transcripts between sensitive and resistant parasites, a massive pyrosequencing of the T. cruzi transcriptome was carried out. Additionally, the 2D gel electrophoresis profile of sensitive and resistant parasites was analyzed and the data were supported with functional genomics. The results showed 133 differentially expressed genes in resistant parasites. The transcriptome analysis revealed the regulation of different genes with several functions and metabolic pathways, which could suggest that resistance in T. cruzi is a multigenic process. Additionally, using transcriptomics, one gene, adenine phosphoribosyltransferase (APRT), was found to be down-regulated in the resistant parasites and its expression profile was confirmed by 2D electrophoresis analysis. The role of this gene in the resistance to Bz was confirmed overexpressing it in sensitive and resistant parasites. Interestingly, both parasites became more sensitive to Bz and H 2 O 2 . This is the first RNA-seq study to identify regulated genes in T. cruzi associated with Bz resistance and to show the role of APRT in T. cruzi resistance. Although T. cruzi regulation is mainly post-transcriptional, the transcriptome analysis, supported by 2D gel analysis and functional genomic, provides an overall idea of the expression profiles of genes under resistance conditions. These results contribute essential information to further the understanding of the mechanisms of action and resistance to Bz in T. cruzi. J. Cell. Biochem. 118: 1936-1945, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Full Text Available Abstract Background Members of the Anopheles punctulatus group (AP group are the primary vectors of human malaria in Papua New Guinea. The AP group includes 13 sibling species, most of them morphologically indistinguishable. Understanding why only certain species are able to transmit malaria requires a better comprehension of their evolutionary history. In particular, understanding relationships and divergence times among Anopheles species may enable assessing how malaria-related traits (e.g. blood feeding behaviours, vector competence have evolved. Methods DNA sequences of 14 mitochondrial (mt genomes from five AP sibling species and two species of the Anopheles dirus complex of Southeast Asia were sequenced. DNA sequences from all concatenated protein coding genes (10,770 bp were then analysed using a Bayesian approach to reconstruct phylogenetic relationships and date the divergence of the AP sibling species. Results Phylogenetic reconstruction using the concatenated DNA sequence of all mitochondrial protein coding genes indicates that the ancestors of the AP group arrived in Papua New Guinea 25 to 54 million years ago and rapidly diverged to form the current sibling species. Conclusion Through evaluation of newly described mt genome sequences, this study has revealed a divergence among members of the AP group in Papua New Guinea that would significantly predate the arrival of humans in this region, 50 thousand years ago. The divergence observed among the mtDNA sequences studied here may have resulted from reproductive isolation during historical changes in sea-level through glacial minima and maxima. This leads to a hypothesis that the AP sibling species have evolved independently for potentially thousands of generations. This suggests that the evolution of many phenotypes, such as insecticide resistance will arise independently in each of the AP sibling species studied here.
Pinchuk, Grigoriy E.; Rodionov, Dmitry A.; Yang, Chen; Li, Xiaoqing; Osterman, Andrei L.; Dervyn, Etienne; Geydebrekht, Oleg V.; Reed, Samantha B.; Romine, Margaret F.; Collart, Frank R.; Scott, J.; Fredrickson, Jim K.; Beliaev, Alex S.
The ability to utilize lactate as a sole source of carbon and energy is one of the key metabolic signatures of Shewanellae, a diverse group of dissimilatory metal reducing bacteria commonly found in aquatic and sedimentary environments. Nonetheless, homology searches failed to recognize orthologs of previously described bacterial D- or L-lactate oxidizing enzymes (Escherichia coli genes dld and lldD) in any of the 13 analyzed genomes of Shewanella spp. Using comparative genomic techniques, we identified a conserved chromosomal gene cluster in Shewanella oneidensis MR-1 (locus tag: SO1522-SO1518) containing lactate permease and candidate genes for both D- and L-lactate dehydrogenase enzymes. The predicted D-LDH gene (dldD, SO1521) is a distant homolog of FAD-dependent lactate dehydrogenase from yeast, whereas the predicted L-LDH is encoded by three genes with previously unknown functions (lldEGF, SO1520-19-18). Through a combination of genetic and biochemical techniques, we experimentally confirmed the predicted physiological role of these novel genes in S. oneidensis MR-1 and carried out successful functional validation studies in Escherichia coli and Bacillus subtilis. We conclusively showed that dldD and lldEFG encode fully functional D-and L-LDH enzymes, which catalyze the oxidation of the respective lactate stereoisomers to pyruvate. Notably, the S. oneidensis MR-1 LldEFG enzyme is the first described example of a multi-subunit lactate oxidase. Comparative analysis of >400 bacterial species revealed the presence of LldEFG and Dld in a broad range of diverse species accentuating the potential importance of these previously unknown proteins in microbial metabolism.
Full Text Available Investigations on the influence of nature vs. nurture on Alcoholism (Alcohol Use Disorder in human have yet to provide a clear view on potential genomic etiologies. To address this issue, we sequenced a replicated animal model system bidirectionally-selected for alcohol preference (AP. This model is uniquely suited to map genetic effects with high reproducibility, and resolution. The origin of the rat lines (an 8-way cross resulted in small haplotype blocks (HB with a corresponding high level of resolution. We sequenced DNAs from 40 samples (10 per line of each replicate to determine allele frequencies and HB. We achieved ~46X coverage per line and replicate. Excessive differentiation in the genomic architecture between lines, across replicates, termed signatures of selection (SS, were classified according to gene and region. We identified SS in 930 genes associated with AP. The majority (50% of the SS were confined to single gene regions, the greatest numbers of which were in promoters (284 and intronic regions (169 with the least in exon's (4, suggesting that differences in AP were primarily due to alterations in regulatory regions. We confirmed previously identified genes and found many new genes associated with AP. Of those newly identified genes, several demonstrated neuronal function involved in synaptic memory and reward behavior, e.g. ion channels (Kcnf1, Kcnn3, Scn5a, excitatory receptors (Grin2a, Gria3, Grip1, neurotransmitters (Pomc, and synapses (Snap29. This study not only reveals the polygenic architecture of AP, but also emphasizes the importance of regulatory elements, consistent with other complex traits.
Lo, Chiao-Ling; Lossie, Amy C; Liang, Tiebing; Liu, Yunlong; Xuei, Xiaoling; Lumeng, Lawrence; Zhou, Feng C; Muir, William M
Investigations on the influence of nature vs. nurture on Alcoholism (Alcohol Use Disorder) in human have yet to provide a clear view on potential genomic etiologies. To address this issue, we sequenced a replicated animal model system bidirectionally-selected for alcohol preference (AP). This model is uniquely suited to map genetic effects with high reproducibility, and resolution. The origin of the rat lines (an 8-way cross) resulted in small haplotype blocks (HB) with a corresponding high level of resolution. We sequenced DNAs from 40 samples (10 per line of each replicate) to determine allele frequencies and HB. We achieved ~46X coverage per line and replicate. Excessive differentiation in the genomic architecture between lines, across replicates, termed signatures of selection (SS), were classified according to gene and region. We identified SS in 930 genes associated with AP. The majority (50%) of the SS were confined to single gene regions, the greatest numbers of which were in promoters (284) and intronic regions (169) with the least in exon's (4), suggesting that differences in AP were primarily due to alterations in regulatory regions. We confirmed previously identified genes and found many new genes associated with AP. Of those newly identified genes, several demonstrated neuronal function involved in synaptic memory and reward behavior, e.g. ion channels (Kcnf1, Kcnn3, Scn5a), excitatory receptors (Grin2a, Gria3, Grip1), neurotransmitters (Pomc), and synapses (Snap29). This study not only reveals the polygenic architecture of AP, but also emphasizes the importance of regulatory elements, consistent with other complex traits.
Vecsey, Christopher G; Peixoto, Lucia; Choi, Jennifer H K; Wimmer, Mathieu; Jaganath, Devan; Hernandez, Pepe J; Blackwell, Jennifer; Meda, Karuna; Park, Alan J; Hannenhalli, Sridhar; Abel, Ted
Sleep deprivation is a common problem of considerable health and economic impact in today's society. Sleep loss is associated with deleterious effects on cognitive functions such as memory and has a high comorbidity with many neurodegenerative and neuropsychiatric disorders. Therefore, it is crucial to understand the molecular basis of the effect of sleep deprivation in the brain. In this study, we combined genome-wide and traditional molecular biological approaches to determine the cellular and molecular impacts of sleep deprivation in the mouse hippocampus, a brain area crucial for many forms of memory. Microarray analysis examining the effects of 5 h of sleep deprivation on gene expression in the mouse hippocampus found 533 genes with altered expression. Bioinformatic analysis revealed that a prominent effect of sleep deprivation was to downregulate translation, potentially mediated through components of the insulin signaling pathway such as the mammalian target of rapamycin (mTOR), a key regulator of protein synthesis. Consistent with this analysis, sleep deprivation reduced levels of total and phosphorylated mTOR, and levels returned to baseline after 2.5 h of recovery sleep. Our findings represent the first genome-wide analysis of the effects of sleep deprivation on the mouse hippocampus, and they suggest that the detrimental effects of sleep deprivation may be mediated by reductions in protein synthesis via downregulation of mTOR. Because protein synthesis and mTOR activation are required for long-term memory formation, our study improves our understanding of the molecular mechanisms underlying the memory impairments induced by sleep deprivation.
Yu, Chengjie; Sizhu, Suolang; Luo, Qingping; Xu, Xuewen; Fu, Lei; Zhang, Anding
Pasteurella multocida (P. multocida) was first shown to be the causative agent of fowl cholera by Louis Pasteur in 1881. First genomic study was performed on an avirulent avian strain Pm70, and until 2013, two genomes of virulent avian strains X73 and P1059 were sequenced. Comparative genome study supplied important information for further study on the pathogenesis of fowl cholera. In the previous study, a capsular serotype A strain GX-Pm was isolated from the liver of a chicken, which died during an outbreak of fowl cholera in 2011. The strain showed multiple drug resistance and was highly virulent to chickens. Therefore, the present study performed the genome sequencing and a comparative genomic analysis to reveal the candidate genes involved in virulence of P. multocida. Sequenced draft genome sequence of GX-Pm was 2,292,886 bp, contained 2941 protein-coding genes, 5 genomic islands, 4 IS elements and 2 prophage regions. Notability, all the predicted drug-resistance genes were included in predicted genomic islands. A comparative genome study on virulent avian strains P1059, X73 and GX-Pm with the avirulent avian strain Pm 70 indicated that 475 unique genes were only identified in either of virulent strains but absent in the avirulent strain. Among these genes, 20 genes were contained within genomes of all three virulent strains, including a few of putative virulence genes. Further characterization of the pathogenic functions of these genes would benefit the understanding of pathogenesis of fowl cholera. Copyright © 2016 Elsevier Ltd. All rights reserved.
Do, Duy Ngoc; Strathe, Anders Bjerring; Ostersen, Tage
This study was aimed at identifying genomic regions controlling feeding behaviors inDanish Duroc boars and its potential implications for eating behaviors in humans.Individual daily feed intake (DFI), total daily time spent in feeder (TPD), number of dailyvisits to feeder (NVD), time spent to eat...... for geneticimprovement of pig feed efficiency. The results of pig-human comparative genemapping revealed some important genomic regions and/or genes on the humangenome that may influence eating behavior in human and consequently affect thedevelopment of obesity and metabolic syndromes. This is the first...... such translationalgenomics study to report potential candidate genes for eating behavior in humans...
Belbahri, Lassaad; Chenari Bouket, Ali; Rekik, Imen; Alenezi, Faizah N.; Vallat, Armelle; Luptakova, Lenka; Petrovova, Eva; Oszako, Tomasz; Cherrad, Semcheddine; Vacher, Sébastien; Rateb, Mostafa E.
The Gram positive, non-pathogenic endospore-forming soil inhabiting prokaryote Bacillus amyloliquefaciens is a plant growth-promoting rhizobacterium. Bacillus amyloliquefaciens processes wide biocontrol abilities and numerous strains have been reported to suppress diverse bacterial, fungal and fungal-like pathogens. Knowledge about strain level biocontrol abilities is warranted to translate this knowledge into developing more efficient biocontrol agents and bio-fertilizers. Ever-expanding genome studies of B. amyloliquefaciens are showing tremendous increase in strain-specific new secondary metabolite clusters which play key roles in the suppression of pathogens and plant growth promotion. In this report, we have used genome mining of all sequenced B. amyloliquefaciens genomes to highlight species boundaries, the diverse strategies used by different strains to promote plant growth and the diversity of their secondary metabolites. Genome composition of the targeted strains suggest regions of genomic plasticity that shape the structure and function of these genomes and govern strain adaptation to different niches. Our results indicated that B. amyloliquefaciens: (i) suffer taxonomic imprecision that blurs the debate over inter-strain genome diversity and dynamics, (ii) have diverse strategies to promote plant growth and development, (iii) have an unlocked, yet to be delimited impressive arsenal of secondary metabolites and products, (iv) have large number of so-called orphan gene clusters, i.e., biosynthetic clusters for which the corresponding metabolites are yet unknown, and (v) have a dynamic pan genome with a secondary metabolite rich accessory genome. PMID:28824571
Belbahri, Lassaad; Chenari Bouket, Ali; Rekik, Imen; Alenezi, Faizah N; Vallat, Armelle; Luptakova, Lenka; Petrovova, Eva; Oszako, Tomasz; Cherrad, Semcheddine; Vacher, Sébastien; Rateb, Mostafa E
The Gram positive, non-pathogenic endospore-forming soil inhabiting prokaryote Bacillus amyloliquefaciens is a plant growth-promoting rhizobacterium. Bacillus amyloliquefaciens processes wide biocontrol abilities and numerous strains have been reported to suppress diverse bacterial, fungal and fungal-like pathogens. Knowledge about strain level biocontrol abilities is warranted to translate this knowledge into developing more efficient biocontrol agents and bio-fertilizers. Ever-expanding genome studies of B. amyloliquefaciens are showing tremendous increase in strain-specific new secondary metabolite clusters which play key roles in the suppression of pathogens and plant growth promotion. In this report, we have used genome mining of all sequenced B. amyloliquefaciens genomes to highlight species boundaries, the diverse strategies used by different strains to promote plant growth and the diversity of their secondary metabolites. Genome composition of the targeted strains suggest regions of genomic plasticity that shape the structure and function of these genomes and govern strain adaptation to different niches. Our results indicated that B. amyloliquefaciens : (i) suffer taxonomic imprecision that blurs the debate over inter-strain genome diversity and dynamics, (ii) have diverse strategies to promote plant growth and development, (iii) have an unlocked, yet to be delimited impressive arsenal of secondary metabolites and products, (iv) have large number of so-called orphan gene clusters, i.e., biosynthetic clusters for which the corresponding metabolites are yet unknown, and (v) have a dynamic pan genome with a secondary metabolite rich accessory genome.
Jaillon, Olivier; Aury, Jean-Marc; Brunet, Frédéric; Petit, Jean-Louis; Stange-Thomann, Nicole; Mauceli, Evan; Bouneau, Laurence; Fischer, Cécile; Ozouf-Costaz, Catherine; Bernot, Alain; Nicaud, Sophie; Jaffe, David; Fisher, Sheila; Lutfalla, Georges; Dossat, Carole; Segurens, Béatrice; Dasilva, Corinne; Salanoubat, Marcel; Levy, Michael; Boudet, Nathalie; Castellano, Sergi; Anthouard, Véronique; Jubin, Claire; Castelli, Vanina; Katinka, Michael; Vacherie, Benoît; Biémont, Christian; Skalli, Zineb; Cattolico, Laurence; Poulain, Julie; De Berardinis, Véronique; Cruaud, Corinne; Duprat, Simone; Brottier, Philippe; Coutanceau, Jean-Pierre; Gouzy, Jérôme; Parra, Genis; Lardier, Guillaume; Chapple, Charles; McKernan, Kevin J; McEwan, Paul; Bosak, Stephanie; Kellis, Manolis; Volff, Jean-Nicolas; Guigó, Roderic; Zody, Michael C; Mesirov, Jill; Lindblad-Toh, Kerstin; Birren, Bruce; Nusbaum, Chad; Kahn, Daniel; Robinson-Rechavi, Marc; Laudet, Vincent; Schachter, Vincent; Quétier, Francis; Saurin, William; Scarpelli, Claude; Wincker, Patrick; Lander, Eric S; Weissenbach, Jean; Roest Crollius, Hugues
Tetraodon nigroviridis is a freshwater puffer fish with the smallest known vertebrate genome. Here, we report a draft genome sequence with long-range linkage and substantial anchoring to the 21 Tetraodon chromosomes. Genome analysis provides a greatly improved fish gene catalogue, including identifying key genes previously thought to be absent in fish. Comparison with other vertebrates and a urochordate indicates that fish proteins have diverged markedly faster than their mammalian homologues. Comparison with the human genome suggests approximately 900 previously unannotated human genes. Analysis of the Tetraodon and human genomes shows that whole-genome duplication occurred in the teleost fish lineage, subsequent to its divergence from mammals. The analysis also makes it possible to infer the basic structure of the ancestral bony vertebrate genome, which was composed of 12 chromosomes, and to reconstruct much of the evolutionary history of ancient and recent chromosome rearrangements leading to the modern human karyotype.
Sharma, Rekha; Kishore, Amit; Mukesh, Manishi; Ahlawat, Sonika; Maitra, Avishek; Pandey, Ashwni Kumar; Tantia, Madhu Sudan
Indian agriculture is an economic symbiosis of crop and livestock production with cattle as the foundation. Sadly, the population of indigenous cattle (Bos indicus) is declining (8.94% in last decade) and needs immediate scientific management. Genetic characterization is the first step in the development of proper management strategies for preserving genetic diversity and preventing undesirable loss of alleles. Thus, in this study we investigated genetic diversity and relationship among eleven Indian cattle breeds using 21 microsatellite markers and mitochondrial D loop sequence. The analysis of autosomal DNA was performed on 508 cattle which exhibited sufficient genetic diversity across all the breeds. Estimates of mean allele number and observed heterozygosity across all loci and population were 8.784 ± 0.25 and 0.653 ± 0.014, respectively. Differences among breeds accounted for 13.3% of total genetic variability. Despite high genetic diversity, significant inbreeding was also observed within eight populations. Genetic distances and cluster analysis showed a close relationship between breeds according to proximity in geographic distribution. The genetic distance, STRUCTURE and Principal Coordinate Analysis concluded that the Southern Indian Ongole cattle are the most distinct among the investigated cattle populations. Sequencing of hypervariable mitochondrial DNA region on a subset of 170 cattle revealed sixty haplotypes with haplotypic diversity of 0.90240, nucleotide diversity of 0.02688 and average number of nucleotide differences as 6.07407. Two major star clusters for haplotypes indicated population expansion for Indian cattle. Nuclear and mitochondrial genomes show a similar pattern of genetic variability and genetic differentiation. Various analyses concluded that the Southern breed 'Ongole' was distinct from breeds of Northern/ Central India. Overall these results provide basic information about genetic diversity and structure of Indian cattle which
Pseudomonas aeruginosa ATCC 27853 was isolated from a hospital blood specimen in 1971 and has been widely used as a model strain to survey antibiotics susceptibilities, biofilm development, and metabolic activities of Pseudomonas spp.. Although four draft genomes of P. aeruginosa ATCC 27853 have been sequenced, the complete genome of this strain is still lacking, hindering a comprehensive understanding of its physiology and functional genome.Here we sequenced and assembled the complete genome of P. aeruginosa ATCC 27853 using the Pacific Biosciences SMRT (PacBio) technology and Illumina sequencing platform. We found that accessory genes of ATCC 27853 including prophages and genomic islands (GIs) mainly contribute to the difference between P. aeruginosa ATCC 27853 and other P. aeruginosa strains. Seven prophages were identified within the genome of P. aeruginosa ATCC 27853. Of the predicted 25 GIs, three contain genes that encode monoxoygenases, dioxygenases and hydrolases that could be involved in the metabolism of aromatic compounds. Surveying virulence-related genes revealed that a series of genes that encode the B-band O-antigen of LPS are lacking in ATCC 27853. Distinctive SNPs in genes of cellular adhesion proteins such as type IV pili and flagella biosynthesis were also observed in this strain. Colony morphology analysis confirmed an enhanced biofilm formation capability of ATCC 27853 on solid agar surface compared to Pseudomonas aeruginosa PAO1. We then performed transcriptome analysis of ATCC 27853 and PAO1 using RNA-seq and compared the expression of orthologous genes to understand the functional genome and the genomic details underlying the distinctive colony morphogenesis. These analyses revealed an increased expression of genes involved in cellular adhesion and biofilm maturation such as type IV pili, exopolysaccharide and electron transport chain components in ATCC 27853 compared with PAO1. In addition, distinctive expression profiles of the
Full Text Available The domestication of Asian rice (Oryza sativa was a complex process punctuated by episodes of introgressive hybridization among and between subpopulations. Deep genetic divergence between the two main varietal groups (Indica and Japonica suggests domestication from at least two distinct wild populations. However, genetic uniformity surrounding key domestication genes across divergent subpopulations suggests cultural exchange of genetic material among ancient farmers.In this study, we utilize a novel 1,536 SNP panel genotyped across 395 diverse accessions of O. sativa to study genome-wide patterns of polymorphism, to characterize population structure, and to infer the introgression history of domesticated Asian rice. Our population structure analyses support the existence of five major subpopulations (indica, aus, tropical japonica, temperate japonica and GroupV consistent with previous analyses. Our introgression analysis shows that most accessions exhibit some degree of admixture, with many individuals within a population sharing the same introgressed segment due to artificial selection. Admixture mapping and association analysis of amylose content and grain length illustrate the potential for dissecting the genetic basis of complex traits in domesticated plant populations.Genes in these regions control a myriad of traits including plant stature, blast resistance, and amylose content. These analyses highlight the power of population genomics in agricultural systems to identify functionally important regions of the genome and to decipher the role of human-directed breeding in refashioning the genomes of a domesticated species.
Chipman, Ariel D; Ferrier, David E K; Brena, Carlo; Qu, Jiaxin; Hughes, Daniel S T; Schröder, Reinhard; Torres-Oliva, Montserrat; Znassi, Nadia; Jiang, Huaiyang; Almeida, Francisca C; Alonso, Claudio R; Apostolou, Zivkos; Aqrawi, Peshtewani; Arthur, Wallace; Barna, Jennifer C J; Blankenburg, Kerstin P; Brites, Daniela; Capella-Gutiérrez, Salvador; Coyle, Marcus; Dearden, Peter K; Du Pasquier, Louis; Duncan, Elizabeth J; Ebert, Dieter; Eibner, Cornelius; Erikson, Galina; Evans, Peter D; Extavour, Cassandra G; Francisco, Liezl; Gabaldón, Toni; Gillis, William J; Goodwin-Horn, Elizabeth A; Green, Jack E; Griffiths-Jones, Sam; Grimmelikhuijzen, Cornelis J P; Gubbala, Sai; Guigó, Roderic; Han, Yi; Hauser, Frank; Havlak, Paul; Hayden, Luke; Helbing, Sophie; Holder, Michael; Hui, Jerome H L; Hunn, Julia P; Hunnekuhl, Vera S; Jackson, LaRonda; Javaid, Mehwish; Jhangiani, Shalini N; Jiggins, Francis M; Jones, Tamsin E; Kaiser, Tobias S; Kalra, Divya; Kenny, Nathan J; Korchina, Viktoriya; Kovar, Christie L; Kraus, F Bernhard; Lapraz, François; Lee, Sandra L; Lv, Jie; Mandapat, Christigale; Manning, Gerard; Mariotti, Marco; Mata, Robert; Mathew, Tittu; Neumann, Tobias; Newsham, Irene; Ngo, Dinh N; Ninova, Maria; Okwuonu, Geoffrey; Ongeri, Fiona; Palmer, William J; Patil, Shobha; Patraquim, Pedro; Pham, Christopher; Pu, Ling-Ling; Putman, Nicholas H; Rabouille, Catherine; Ramos, Olivia Mendivil; Rhodes, Adelaide C; Robertson, Helen E; Robertson, Hugh M; Ronshaugen, Matthew; Rozas, Julio; Saada, Nehad; Sánchez-Gracia, Alejandro; Scherer, Steven E; Schurko, Andrew M; Siggens, Kenneth W; Simmons, DeNard; Stief, Anna; Stolle, Eckart; Telford, Maximilian J; Tessmar-Raible, Kristin; Thornton, Rebecca; van der Zee, Maurijn; von Haeseler, Arndt; Williams, James M; Willis, Judith H; Wu, Yuanqing; Zou, Xiaoyan; Lawson, Daniel; Muzny, Donna M; Worley, Kim C; Gibbs, Richard A; Akam, Michael; Richards, Stephen
Myriapods (e.g., centipedes and millipedes) display a simple homonomous body plan relative to other arthropods. All members of the class are terrestrial, but they attained terrestriality independently of insects. Myriapoda is the only arthropod class not represented by a sequenced genome. We present
Aslam, M.L.; Bastiaansen, J.W.M.; Crooijmans, R.P.M.A.; Vereijken, A.; Groenen, M.A.M.; Megens, H.J.W.C.
Background The turkey (Meleagris gallopavo) is an important agricultural species that is the second largest contributor to the world's poultry meat production. The genomic resources of turkey provide turkey breeders with tools needed for the genetic improvement of commercial breeds of turkey for
Sandesh K Shrestha
Full Text Available Corynespora cassiicola attackes diverse agriculturally important plants, including soybean and cotton, in the US. It is a reemerge pathogen on cotton in southeastern US. Whole genome sequences of four cotton and one soybean isolate from Tennessee were used to develop single nucleotide polymorphism markers for cotton isolates. Cotton isolates had little diversity at the genome level and very little differentiation from the soybean isolate. Analysis of 75 isolates from cotton and soybean, using targeted-sequencing of 22 polymorphic SNP sites, revealed eight multi-locus genotypes and it appears a single clonal lineage predominates across the southeastern region. The cotton and soybean genome sequences were significantly different from the public reference genome derived from a rubber isolate and the utility of these novel resources will be discussed.
Ariel D Chipman
Full Text Available Myriapods (e.g., centipedes and millipedes display a simple homonomous body plan relative to other arthropods. All members of the class are terrestrial, but they attained terrestriality independently of insects. Myriapoda is the only arthropod class not represented by a sequenced genome. We present an analysis of the genome of the centipede Strigamia maritima. It retains a compact genome that has undergone less gene loss and shuffling than previously sequenced arthropods, and many orthologues of genes conserved from the bilaterian ancestor that have been lost in insects. Our analysis locates many genes in conserved macro-synteny contexts, and many small-scale examples of gene clustering. We describe several examples where S. maritima shows different solutions from insects to similar problems. The insect olfactory receptor gene family is absent from S. maritima, and olfaction in air is likely effected by expansion of other receptor gene families. For some genes S. maritima has evolved paralogues to generate coding sequence diversity, where insects use alternate splicing. This is most striking for the Dscam gene, which in Drosophila generates more than 100,000 alternate splice forms, but in S. maritima is encoded by over 100 paralogues. We see an intriguing linkage between the absence of any known photosensory proteins in a blind organism and the additional absence of canonical circadian clock genes. The phylogenetic position of myriapods allows us to identify where in arthropod phylogeny several particular molecular mechanisms and traits emerged. For example, we conclude that juvenile hormone signalling evolved with the emergence of the exoskeleton in the arthropods and that RR-1 containing cuticle proteins evolved in the lineage leading to Mandibulata. We also identify when various gene expansions and losses occurred. The genome of S. maritima offers us a unique glimpse into the ancestral arthropod genome, while also displaying many adaptations
Chipman, Ariel D.; Ferrier, David E. K.; Brena, Carlo; Qu, Jiaxin; Hughes, Daniel S. T.; Schröder, Reinhard; Torres-Oliva, Montserrat; Znassi, Nadia; Jiang, Huaiyang; Almeida, Francisca C.; Alonso, Claudio R.; Apostolou, Zivkos; Aqrawi, Peshtewani; Arthur, Wallace; Barna, Jennifer C. J.; Blankenburg, Kerstin P.; Brites, Daniela; Capella-Gutiérrez, Salvador; Coyle, Marcus; Dearden, Peter K.; Du Pasquier, Louis; Duncan, Elizabeth J.; Ebert, Dieter; Eibner, Cornelius; Erikson, Galina; Evans, Peter D.; Extavour, Cassandra G.; Francisco, Liezl; Gabaldón, Toni; Gillis, William J.; Goodwin-Horn, Elizabeth A.; Green, Jack E.; Griffiths-Jones, Sam; Grimmelikhuijzen, Cornelis J. P.; Gubbala, Sai; Guigó, Roderic; Han, Yi; Hauser, Frank; Havlak, Paul; Hayden, Luke; Helbing, Sophie; Holder, Michael; Hui, Jerome H. L.; Hunn, Julia P.; Hunnekuhl, Vera S.; Jackson, LaRonda; Javaid, Mehwish; Jhangiani, Shalini N.; Jiggins, Francis M.; Jones, Tamsin E.; Kaiser, Tobias S.; Kalra, Divya; Kenny, Nathan J.; Korchina, Viktoriya; Kovar, Christie L.; Kraus, F. Bernhard; Lapraz, François; Lee, Sandra L.; Lv, Jie; Mandapat, Christigale; Manning, Gerard; Mariotti, Marco; Mata, Robert; Mathew, Tittu; Neumann, Tobias; Newsham, Irene; Ngo, Dinh N.; Ninova, Maria; Okwuonu, Geoffrey; Ongeri, Fiona; Palmer, William J.; Patil, Shobha; Patraquim, Pedro; Pham, Christopher; Pu, Ling-Ling; Putman, Nicholas H.; Rabouille, Catherine; Ramos, Olivia Mendivil; Rhodes, Adelaide C.; Robertson, Helen E.; Robertson, Hugh M.; Ronshaugen, Matthew; Rozas, Julio; Saada, Nehad; Sánchez-Gracia, Alejandro; Scherer, Steven E.; Schurko, Andrew M.; Siggens, Kenneth W.; Simmons, DeNard; Stief, Anna; Stolle, Eckart; Telford, Maximilian J.; Tessmar-Raible, Kristin; Thornton, Rebecca; van der Zee, Maurijn; von Haeseler, Arndt; Williams, James M.; Willis, Judith H.; Wu, Yuanqing; Zou, Xiaoyan; Lawson, Daniel; Muzny, Donna M.; Worley, Kim C.; Gibbs, Richard A.; Akam, Michael; Richards, Stephen
Myriapods (e.g., centipedes and millipedes) display a simple homonomous body plan relative to other arthropods. All members of the class are terrestrial, but they attained terrestriality independently of insects. Myriapoda is the only arthropod class not represented by a sequenced genome. We present an analysis of the genome of the centipede Strigamia maritima. It retains a compact genome that has undergone less gene loss and shuffling than previously sequenced arthropods, and many orthologues of genes conserved from the bilaterian ancestor that have been lost in insects. Our analysis locates many genes in conserved macro-synteny contexts, and many small-scale examples of gene clustering. We describe several examples where S. maritima shows different solutions from insects to similar problems. The insect olfactory receptor gene family is absent from S. maritima, and olfaction in air is likely effected by expansion of other receptor gene families. For some genes S. maritima has evolved paralogues to generate coding sequence diversity, where insects use alternate splicing. This is most striking for the Dscam gene, which in Drosophila generates more than 100,000 alternate splice forms, but in S. maritima is encoded by over 100 paralogues. We see an intriguing linkage between the absence of any known photosensory proteins in a blind organism and the additional absence of canonical circadian clock genes. The phylogenetic position of myriapods allows us to identify where in arthropod phylogeny several particular molecular mechanisms and traits emerged. For example, we conclude that juvenile hormone signalling evolved with the emergence of the exoskeleton in the arthropods and that RR-1 containing cuticle proteins evolved in the lineage leading to Mandibulata. We also identify when various gene expansions and losses occurred. The genome of S. maritima offers us a unique glimpse into the ancestral arthropod genome, while also displaying many adaptations to its specific
Soares, Siomar C; Silva, Artur; Trost, Eva
, and singleton analyses revealed close relationships among pathogenic corynebacteria, the clonal-like behavior of C. pseudotuberculosis and slow increases in the sizes of pan-genomes. According to extrapolations based on the pan-genomes, core genomes and singletons, the C. pseudotuberculosis biovar ovis shows...... a more clonal-like behavior than the C. pseudotuberculosis biovar equi. Most of the variable genes of the biovar ovis strains were acquired in a block through horizontal gene transfer and are highly conserved, whereas the biovar equi strains contain great variability, both intra- and inter...... are expected to have less contact with other organisms than the biovar equi strains, thereby explaining the significant clonal-like behavior of the biovar ovis strains....
Decker, Jared E.; McKay, Stephanie D.; Rolf, Megan M.; Kim, JaeWoo; Molina Alcalá, Antonio; Sonstegard, Tad S.; Hanotte, Olivier; Götherström, Anders; Seabury, Christopher M.; Praharani, Lisa; Babar, Masroor Ellahi; Correia de Almeida Regitano, Luciana; Yildiz, Mehmet Ali; Heaton, Michael P.; Liu, Wan-Sheng; Lei, Chu-Zhao; Reecy, James M.; Saif-Ur-Rehman, Muhammad; Schnabel, Robert D.; Taylor, Jeremy F.
The domestication and development of cattle has considerably impacted human societies, but the histories of cattle breeds and populations have been poorly understood especially for African, Asian, and American breeds. Using genotypes from 43,043 autosomal single nucleotide polymorphism markers scored in 1,543 animals, we evaluate the population structure of 134 domesticated bovid breeds. Regardless of the analytical method or sample subset, the three major groups of Asian indicine, Eurasian taurine, and African taurine were consistently observed. Patterns of geographic dispersal resulting from co-migration with humans and exportation are recognizable in phylogenetic networks. All analytical methods reveal patterns of hybridization which occurred after divergence. Using 19 breeds, we map the cline of indicine introgression into Africa. We infer that African taurine possess a large portion of wild African auroch ancestry, causing their divergence from Eurasian taurine. We detect exportation patterns in Asia and identify a cline of Eurasian taurine/indicine hybridization in Asia. We also identify the influence of species other than Bos taurus taurus and B. t. indicus in the formation of Asian breeds. We detect the pronounced influence of Shorthorn cattle in the formation of European breeds. Iberian and Italian cattle possess introgression from African taurine. American Criollo cattle originate from Iberia, and not directly from Africa with African ancestry inherited via Iberian ancestors. Indicine introgression into American cattle occurred in the Americas, and not Europe. We argue that cattle migration, movement and trading followed by admixture have been important forces in shaping modern bovine genomic variation. PMID:24675901
Yvette A Halley
Full Text Available Herein, we evaluated the concordance of population inferences and conclusions resulting from the analysis of short mitochondrial fragments (i.e., partial or complete D-Loop nucleotide sequences versus complete mitogenome sequences for 53 bobwhites representing six ecoregions across TX and OK (USA. Median joining (MJ haplotype networks demonstrated that analyses performed using small mitochondrial fragments were insufficient for estimating the true (i.e., complete mitogenome haplotype structure, corresponding levels of divergence, and maternal population history of our samples. Notably, discordant demographic inferences were observed when mismatch distributions of partial (i.e., partial D-Loop versus complete mitogenome sequences were compared, with the reduction in mitochondrial genomic information content observed to encourage spurious inferences in our samples. A probabilistic approach to variant prediction for the complete bobwhite mitogenomes revealed 344 segregating sites corresponding to 347 total mutations, including 49 putative nonsynonymous single nucleotide variants (SNVs distributed across 12 protein coding genes. Evidence of gross heteroplasmy was observed for 13 bobwhites, with 10 of the 13 heteroplasmies involving one moderate to high frequency SNV. Haplotype network and phylogenetic analyses for the complete bobwhite mitogenome sequences revealed two divergent maternal lineages (dXY = 0.00731; FST = 0.849; P < 0.05, thereby supporting the potential for two putative subspecies. However, the diverged lineage (n = 103 variants almost exclusively involved bobwhites geographically classified as Colinus virginianus texanus, which is discordant with the expectations of previous geographic subspecies designations. Tests of adaptive evolution for functional divergence (MKT, frequency distribution tests (D, FS and phylogenetic analyses (RAxML provide no evidence for positive selection or hybridization with the sympatric scaled quail
Full Text Available Efforts to unravel the mechanisms underlying taste sensation (gustation have largely focused on rodents. Here we present the first comprehensive characterization of gene expression in primate taste buds. Our findings reveal unique new insights into the biology of taste buds. We generated a taste bud gene expression database using laser capture microdissection (LCM procured fungiform (FG and circumvallate (CV taste buds from primates. We also used LCM to collect the top and bottom portions of CV taste buds. Affymetrix genome wide arrays were used to analyze gene expression in all samples. Known taste receptors are preferentially expressed in the top portion of taste buds. Genes associated with the cell cycle and stem cells are preferentially expressed in the bottom portion of taste buds, suggesting that precursor cells are located there. Several chemokines including CXCL14 and CXCL8 are among the highest expressed genes in taste buds, indicating that immune system related processes are active in taste buds. Several genes expressed specifically in endocrine glands including growth hormone releasing hormone and its receptor are also strongly expressed in taste buds, suggesting a link between metabolism and taste. Cell type-specific expression of transcription factors and signaling molecules involved in cell fate, including KIT, reveals the taste bud as an active site of cell regeneration, differentiation, and development. IKBKAP, a gene mutated in familial dysautonomia, a disease that results in loss of taste buds, is expressed in taste cells that communicate with afferent nerve fibers via synaptic transmission. This database highlights the power of LCM coupled with transcriptional profiling to dissect the molecular composition of normal tissues, represents the most comprehensive molecular analysis of primate taste buds to date, and provides a foundation for further studies in diverse aspects of taste biology.
Rovelet-Lecrux, Anne; Legallic, Solenn; Wallon, David; Flaman, Jean-Michel; Martinaud, Olivier; Bombois, Stéphanie; Rollin-Sillaire, Adeline; Michon, Agnès; Le Ber, Isabelle; Pariente, Jérémie; Puel, Michèle; Paquet, Claire; Croisile, Bernard; Thomas-Antérion, Catherine; Vercelletto, Martine; Lévy, Richard; Frébourg, Thierry; Hannequin, Didier; Campion, Dominique
Studying rare extreme forms of Alzheimer disease (AD) may prove to be a useful strategy in identifying new genes involved in monogenic determinism of AD. Amyloid precursor protein (APP), PSEN1, and PSEN2 mutations account for only 85% of autosomal dominant early-onset AD (ADEOAD) families. We hypothesised that rare copy number variants (CNVs) could be involved in ADEOAD families without mutations in known genes, as well as in rare sporadic young-onset AD cases. Using high-resolution array comparative genomic hybridisation, we assessed the presence of rare CNVs in 21 unrelated ADEOAD cases, having no alteration on known genes, and 12 sporadic AD cases, with an age of onset younger than 55 years. The analysis revealed the presence of 7 singleton CNVs (4 in ADEOAD and 3 in sporadic cases) absent in 1078 controls and 912 late-onset AD cases. Strikingly, 4 out of 7 rearrangements target genes (KLK6, SLC30A3, MEOX2, and FPR2) encoding proteins that are tightly related to amyloid-β peptide metabolism or signalling. Although these variants are individually rare and restricted to particular subgroups of patients, these findings support the causal role, in human pathology, of a set of genes coding for molecules suspected for a long time to modify Aβ metabolism or signalling, and for which animal or cellular models have already been developed.
Der Sarkissian, Clio; Vilstrup, Julia T.; Schubert, Mikkel; Seguin-Orlando, Andaine; Eme, David; Weinstock, Jacobo; Alberdi, Maria Teresa; Martin, Fabiana; Lopez, Patricio M.; Prado, Jose L.; Prieto, Alfredo; Douady, Christophe J.; Stafford, Tom W.; Willerslev, Eske; Orlando, Ludovic
Hippidions were equids with very distinctive anatomical features. They lived in South America 2.5 million years ago (Ma) until their extinction approximately 10 000 years ago. The evolutionary origin of the three known Hippidion morphospecies is still disputed. Based on palaeontological data, Hippidion could have diverged from the lineage leading to modern equids before 10 Ma. In contrast, a much later divergence date, with Hippidion nesting within modern equids, was indicated by partial ancient mitochondrial DNA sequences. Here, we characterized eight Hippidion complete mitochondrial genomes at 3.4–386.3-fold coverage using target-enrichment capture and next-generation sequencing. Our dataset reveals that the two morphospecies sequenced (H. saldiasi and H. principale) formed a monophyletic clade, basal to extant and extinct Equus lineages. This contrasts with previous genetic analyses and supports Hippidion as a distinct genus, in agreement with palaeontological models. We date the Hippidion split from Equus at 5.6–6.5 Ma, suggesting an early divergence in North America prior to the colonization of South America, after the formation of the Panamanian Isthmus 3.5 Ma and the Great American Biotic Interchange. PMID:25762573
Porath, Hagit T.; Carmi, Shai; Levanon, Erez Y.
Adenosine-to-inosine editing is one of the most frequent post-transcriptional modifications, manifested as A-to-G mismatches when comparing RNA sequences with their source DNA. Recently, a number of RNA-seq data sets have been screened for the presence of A-to-G editing, and hundreds of thousands of editing sites identified. Here we show that existing screens missed the majority of sites by ignoring reads with excessive (‘hyper’) editing that do not easily align to the genome. We show that careful alignment and examination of the unmapped reads in RNA-seq studies reveal numerous new sites, usually many more than originally discovered, and in precisely those regions that are most heavily edited. Specifically, we discover 327,096 new editing sites in the heavily studied Illumina Human BodyMap data and more than double the number of detected sites in several published screens. We also identify thousands of new sites in mouse, rat, opossum and fly. Our results establish that hyper-editing events account for the majority of editing sites. PMID:25158696
Krebs, Arnaud R; Imanci, Dilek; Hoerner, Leslie; Gaidatzis, Dimos; Burger, Lukas; Schübeler, Dirk
Transcription initiation entails chromatin opening followed by pre-initiation complex formation and RNA polymerase II recruitment. Subsequent polymerase elongation requires additional signals, resulting in increased residence time downstream of the start site, a phenomenon referred to as pausing. Here, we harnessed single-molecule footprinting to quantify distinct steps of initiation in vivo throughout the Drosophila genome. This identifies the impact of promoter structure on initiation dynamics in relation to nucleosomal occupancy. Additionally, perturbation of transcriptional initiation reveals an unexpectedly high turnover of polymerases at paused promoters-an observation confirmed at the level of nascent RNAs. These observations argue that absence of elongation is largely caused by premature termination rather than by stable polymerase stalling. In support of this non-processive model, we observe that induction of the paused heat shock promoter depends on continuous initiation. Our study provides a framework to quantify protein binding at single-molecule resolution and refines concepts of transcriptional pausing. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
Dudas, Gytis; Carvalho, Luiz Max; Bedford, Trevor; Tatem, Andrew J; Baele, Guy; Faria, Nuno R; Park, Daniel J; Ladner, Jason T; Arias, Armando; Asogun, Danny; Bielejec, Filip; Caddy, Sarah L; Cotten, Matthew; D'Ambrozio, Jonathan; Dellicour, Simon; Di Caro, Antonino; Diclaro, Joseph W; Duraffour, Sophie; Elmore, Michael J; Fakoli, Lawrence S; Faye, Ousmane; Gilbert, Merle L; Gevao, Sahr M; Gire, Stephen; Gladden-Young, Adrianne; Gnirke, Andreas; Goba, Augustine; Grant, Donald S; Haagmans, Bart L; Hiscox, Julian A; Jah, Umaru; Kugelman, Jeffrey R; Liu, Di; Lu, Jia; Malboeuf, Christine M; Mate, Suzanne; Matthews, David A; Matranga, Christian B; Meredith, Luke W; Qu, James; Quick, Joshua; Pas, Suzan D; Phan, My V T; Pollakis, Georgios; Reusken, Chantal B; Sanchez-Lockhart, Mariano; Schaffner, Stephen F; Schieffelin, John S; Sealfon, Rachel S; Simon-Loriere, Etienne; Smits, Saskia L; Stoecker, Kilian; Thorne, Lucy; Tobin, Ekaete Alice; Vandi, Mohamed A; Watson, Simon J; West, Kendra; Whitmer, Shannon; Wiley, Michael R; Winnicki, Sarah M; Wohl, Shirlee; Wölfel, Roman; Yozwiak, Nathan L; Andersen, Kristian G; Blyden, Sylvia O; Bolay, Fatorma; Carroll, Miles W; Dahn, Bernice; Diallo, Boubacar; Formenty, Pierre; Fraser, Christophe; Gao, George F; Garry, Robert F; Goodfellow, Ian; Günther, Stephan; Happi, Christian T; Holmes, Edward C; Kargbo, Brima; Keïta, Sakoba; Kellam, Paul; Koopmans, Marion P G; Kuhn, Jens H; Loman, Nicholas J; Magassouba, N'Faly; Naidoo, Dhamari; Nichol, Stuart T; Nyenswah, Tolbert; Palacios, Gustavo; Pybus, Oliver G; Sabeti, Pardis C; Sall, Amadou; Ströher, Ute; Wurie, Isatta; Suchard, Marc A; Lemey, Philippe; Rambaut, Andrew
The 2013-2016 West African epidemic caused by the Ebola virus was of unprecedented magnitude, duration and impact. Here we reconstruct the dispersal, proliferation and decline of Ebola virus throughout the region by analysing 1,610 Ebola virus genomes, which represent over 5% of the known cases. We test the association of geography, climate and demography with viral movement among administrative regions, inferring a classic 'gravity' model, with intense dispersal between larger and closer populations. Despite attenuation of international dispersal after border closures, cross-border transmission had already sown the seeds for an international epidemic, rendering these measures ineffective at curbing the epidemic. We address why the epidemic did not spread into neighbouring countries, showing that these countries were susceptible to substantial outbreaks but at lower risk of introductions. Finally, we reveal that this large epidemic was a heterogeneous and spatially dissociated collection of transmission clusters of varying size, duration and connectivity. These insights will help to inform interventions in future epidemics.
Nordlund Henri R
Full Text Available Abstract Background A chicken egg contains several biotin-binding proteins (BBPs, whose complete DNA and amino acid sequences are not known. In order to identify and characterise these genes and proteins we studied chicken cDNAs and genes available in the NCBI database and chicken genome database using the reported N-terminal amino acid sequences of chicken egg-yolk BBPs as search strings. Results Two separate hits showing significant homology for these N-terminal sequences were discovered. For one of these hits, the chromosomal location in the immediate proximity of the avidin gene family was found. Both of these hits encode proteins having high sequence similarity with avidin suggesting that chicken BBPs are paralogous to avidin family. In particular, almost all residues corresponding to biotin binding in avidin are conserved in these putative BBP proteins. One of the found DNA sequences, however, seems to encode a carboxy-terminal extension not present in avidin. Conclusion We describe here the predicted properties of the putative BBP genes and proteins. Our present observations link BBP genes together with avidin gene family and shed more light on the genetic arrangement and variability of this family. In addition, comparative modelling revealed the potential structural elements important for the functional and structural properties of the putative BBP proteins.
Full Text Available Mitochondrial disorders have the highest incidence among congenital metabolic disorders characterized by biochemical respiratory chain complex deficiencies. It occurs at a rate of 1 in 5,000 births, and has phenotypic and genetic heterogeneity. Mutations in about 1,500 nuclear encoded mitochondrial proteins may cause mitochondrial dysfunction of energy production and mitochondrial disorders. More than 250 genes that cause mitochondrial disorders have been reported to date. However exact genetic diagnosis for patients still remained largely unknown. To reveal this heterogeneity, we performed comprehensive genomic analyses for 142 patients with childhood-onset mitochondrial respiratory chain complex deficiencies. The approach includes whole mtDNA and exome analyses using high-throughput sequencing, and chromosomal aberration analyses using high-density oligonucleotide arrays. We identified 37 novel mutations in known mitochondrial disease genes and 3 mitochondria-related genes (MRPS23, QRSL1, and PNPLA4 as novel causative genes. We also identified 2 genes known to cause monogenic diseases (MECP2 and TNNI3 and 3 chromosomal aberrations (6q24.3-q25.1, 17p12, and 22q11.21 as causes in this cohort. Our approaches enhance the ability to identify pathogenic gene mutations in patients with biochemically defined mitochondrial respiratory chain complex deficiencies in clinical settings. They also underscore clinical and genetic heterogeneity and will improve patient care of this complex disorder.
Eckerle, Lance D.; Becker, Michelle M.; Halpin, Rebecca A.; Li, Kelvin; Venter, Eli; Lu, Xiaotao; Scherbakova, Sana; Graham, Rachel L.; Baric, Ralph S.; Stockwell, Timothy B.; Spiro, David J.; Denison, Mark R.
Most RNA viruses lack the mechanisms to recognize and correct mutations that arise during genome replication, resulting in quasispecies diversity that is required for pathogenesis and adaptation. However, it is not known how viruses encoding large viral RNA genomes such as the Coronaviridae (26 to 32 kb) balance the requirements for genome stability and quasispecies diversity. Further, the limits of replication infidelity during replication of large RNA genomes and how decreased fidelity impacts virus fitness over time are not known. Our previous work demonstrated that genetic inactivation of the coronavirus exoribonuclease (ExoN) in nonstructural protein 14 (nsp14) of murine hepatitis virus results in a 15-fold decrease in replication fidelity. However, it is not known whether nsp14-ExoN is required for replication fidelity of all coronaviruses, nor the impact of decreased fidelity on genome diversity and fitness during replication and passage. We report here the engineering and recovery of nsp14-ExoN mutant viruses of severe acute respiratory syndrome coronavirus (SARS-CoV) that have stable growth defects and demonstrate a 21-fold increase in mutation frequency during replication in culture. Analysis of complete genome sequences from SARS-ExoN mutant viral clones revealed unique mutation sets in every genome examined from the same round of replication and a total of 100 unique mutations across the genome. Using novel bioinformatic tools and deep sequencing across the full-length genome following 10 population passages in vitro, we demonstrate retention of ExoN mutations and continued increased diversity and mutational load compared to wild-type SARS-CoV. The results define a novel genetic and bioinformatics model for introduction and identification of multi-allelic mutations in replication competent viruses that will be powerful tools for testing the effects of decreased fidelity and increased quasispecies diversity on viral replication, pathogenesis, and
Kevin A Wilkinson
Full Text Available Replication and pathogenesis of the human immunodeficiency virus (HIV is tightly linked to the structure of its RNA genome, but genome structure in infectious virions is poorly understood. We invent high-throughput SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension technology, which uses many of the same tools as DNA sequencing, to quantify RNA backbone flexibility at single-nucleotide resolution and from which robust structural information can be immediately derived. We analyze the structure of HIV-1 genomic RNA in four biologically instructive states, including the authentic viral genome inside native particles. Remarkably, given the large number of plausible local structures, the first 10% of the HIV-1 genome exists in a single, predominant conformation in all four states. We also discover that noncoding regions functioning in a regulatory role have significantly lower (p-value < 0.0001 SHAPE reactivities, and hence more structure, than do viral coding regions that function as the template for protein synthesis. By directly monitoring protein binding inside virions, we identify the RNA recognition motif for the viral nucleocapsid protein. Seven structurally homologous binding sites occur in a well-defined domain in the genome, consistent with a role in directing specific packaging of genomic RNA into nascent virions. In addition, we identify two distinct motifs that are targets for the duplex destabilizing activity of this same protein. The nucleocapsid protein destabilizes local HIV-1 RNA structure in ways likely to facilitate initial movement both of the retroviral reverse transcriptase from its tRNA primer and of the ribosome in coding regions. Each of the three nucleocapsid interaction motifs falls in a specific genome domain, indicating that local protein interactions can be organized by the long-range architecture of an RNA. High-throughput SHAPE reveals a comprehensive view of HIV-1 RNA genome structure, and further
Zhang, Guofan; Fang, Xiaodong; Guo, Ximing
The Pacific oyster Crassostrea gigas belongs to one of the most species-rich but genomically poorly explored phyla, the Mollusca. Here we report the sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy, along with transcriptomes of development and stress re...... and their exosomes. The oyster genome sequence fills a void in our understanding of the Lophotrochozoa....
Thieme, Frank; Koebnik, Ralf; Bekel, Thomas; Berger, Carolin; Boch, Jens; Büttner, Daniela; Caldana, Camila; Gaigalat, Lars; Goesmann, Alexander; Kay, Sabine; Kirchner, Oliver; Lanz, Christa; Linke, Burkhard; McHardy, Alice C.; Meyer, Folker; Mittenhuber, Gerhard; Nies, Dietrich H.; Niesbach-Klösgen, Ulla; Patschkowski, Thomas; Rückert, Christian; Rupp, Oliver; Schneiker, Susanne; Schuster, Stephan C.; Vorhölter, Frank-Jörg; Weber, Ernst; Pühler, Alfred; Bonas, Ulla; Bartels, Daniela; Kaiser, Olaf
The gram-negative plant-pathogenic bacterium Xanthomonas campestris pv. vesicatoria is the causative agent of bacterial spot disease in pepper and tomato plants, which leads to economically important yield losses. This pathosystem has become a well-established model for studying bacterial infection strategies. Here, we present the whole-genome sequence of the pepper-pathogenic Xanthomonas campestris pv. vesicatoria strain 85-10, which comprises a 5.17-Mb circular chromosome and four plasmids. The genome has a high G+C content (64.75%) and signatures of extensive genome plasticity. Whole-genome comparisons revealed a gene order similar to both Xanthomonas axonopodis pv. citri and Xanthomonas campestris pv. campestris and a structure completely different from Xanthomonas oryzae pv. oryzae. A total of 548 coding sequences (12.2%) are unique to X. campestris pv. vesicatoria. In addition to a type III secretion system, which is essential for pathogenicity, the genome of strain 85-10 encodes all other types of protein secretion systems described so far in gram-negative bacteria. Remarkably, one of the putative type IV secretion systems encoded on the largest plasmid is similar to the Icm/Dot systems of the human pathogens Legionella pneumophila and Coxiella burnetii. Comparisons with other completely sequenced plant pathogens predicted six novel type III effector proteins and several other virulence factors, including adhesins, cell wall-degrading enzymes, and extracellular polysaccharides. PMID:16237009
Verwaaijen, Bart; Wibberg, Daniel; Nelkner, Johanna; Gordin, Miriam; Rupp, Oliver; Winkler, Anika; Bremges, Andreas; Blom, Jochen; Grosch, Rita; Pühler, Alfred; Schlüter, Andreas
Lettuce (Lactuca sativa, L.) is an important annual plant of the family Asteraceae (Compositae). The commercial lettuce cultivar Tizian has been used in various scientific studies investigating the interaction of the plant with phytopathogens or biological control agents. Here, we present the de novo draft genome sequencing and gene prediction for this specific cultivar derived from transcriptome sequence data. The assembled scaffolds amount to a size of 2.22 Gb. Based on RNAseq data, 31,112 transcript isoforms were identified. Functional predictions for these transcripts were determined within the GenDBE annotation platform. Comparison with the cv. Salinas reference genome revealed a high degree of sequence similarity on genome and transcriptome levels, with an average amino acid identity of 99%. Furthermore, it was observed that two large regions are either missing or are highly divergent within the cv. Tizian genome compared to cv. Salinas. One of these regions covers the major resistance complex 1 region of cv. Salinas. The cv. Tizian draft genome sequence provides a valuable resource for future functional and transcriptome analyses focused on this lettuce cultivar. Copyright © 2017 Elsevier B.V. All rights reserved.
Joachim W Bargsten
Full Text Available As part of large protein complexes, Snf2 family ATPases are responsible for energy supply during chromatin remodeling, but the precise mechanism of action of many of these proteins is largely unknown. They influence many processes in plants, such as the response to environmental stress. This analysis is the first comprehensive study of Snf2 family ATPases in plants. We here present a comparative analysis of 1159 candidate plant Snf2 genes in 33 complete and annotated plant genomes, including two green algae. The number of Snf2 ATPases shows considerable variation across plant genomes (17-63 genes. The DRD1, Rad5/16 and Snf2 subfamily members occur most often. Detailed analysis of the plant-specific DRD1 subfamily in related plant genomes shows the occurrence of a complex series of evolutionary events. Notably tomato carries unexpected gene expansions of DRD1 gene members. Most of these genes are expressed in tomato, although at low levels and with distinct tissue or organ specificity. In contrast, the Snf2 subfamily genes tend to be expressed constitutively in tomato. The results underpin and extend the Snf2 subfamily classification, which could help to determine the various functional roles of Snf2 ATPases and to target environmental stress tolerance and yield in future breeding.
Bargsten, Joachim W; Folta, Adam; Mlynárová, Ludmila; Nap, Jan-Peter
As part of large protein complexes, Snf2 family ATPases are responsible for energy supply during chromatin remodeling, but the precise mechanism of action of many of these proteins is largely unknown. They influence many processes in plants, such as the response to environmental stress. This analysis is the first comprehensive study of Snf2 family ATPases in plants. We here present a comparative analysis of 1159 candidate plant Snf2 genes in 33 complete and annotated plant genomes, including two green algae. The number of Snf2 ATPases shows considerable variation across plant genomes (17-63 genes). The DRD1, Rad5/16 and Snf2 subfamily members occur most often. Detailed analysis of the plant-specific DRD1 subfamily in related plant genomes shows the occurrence of a complex series of evolutionary events. Notably tomato carries unexpected gene expansions of DRD1 gene members. Most of these genes are expressed in tomato, although at low levels and with distinct tissue or organ specificity. In contrast, the Snf2 subfamily genes tend to be expressed constitutively in tomato. The results underpin and extend the Snf2 subfamily classification, which could help to determine the various functional roles of Snf2 ATPases and to target environmental stress tolerance and yield in future breeding.
Full Text Available Triatomine insects are vectors of Trypanosoma cruzi, a protozoan parasite that is the causative agent of Chagas' disease. This is a neglected disease affecting approximately 8 million people in Latin America. The existence of diverse pyrethroid resistant populations of at least two species demonstrates the potential of triatomines to develop high levels of insecticide resistance. Therefore, the incorporation of strategies for resistance management is a main concern for vector control programs. Three enzymatic superfamilies are thought to mediate xenobiotic detoxification and resistance: Glutathione Transferases (GSTs, Cytochromes P450 (CYPs and Carboxyl/Cholinesterases (CCEs. Improving our knowledge of key triatomine detoxification enzymes will strengthen our understanding of insecticide resistance processes in vectors of Chagas' disease.The discovery and description of detoxification gene superfamilies in normalized transcriptomes of three triatomine species: Triatoma dimidiata, Triatoma infestans and Triatoma pallidipennis is presented. Furthermore, a comparative analysis of these superfamilies among the triatomine transcriptomes and the genome of Rhodnius prolixus, also a triatomine vector of Chagas' disease, and other well-studied insect genomes was performed. The expression pattern of detoxification genes in R. prolixus transcriptomes from key organs was analyzed. The comparisons reveal gene expansions in Sigma class GSTs, CYP3 in CYP superfamily and clade E in CCE superfamily. Moreover, several CYP families identified in these triatomines have not yet been described in other insects. Conversely, several groups of insecticide resistance related enzymes within each enzyme superfamily are reduced or lacking in triatomines. Furthermore, our qRT-PCR results showed an increase in the expression of a CYP4 gene in a T. infestans population resistant to pyrethroids. These results could point to an involvement of metabolic detoxification mechanisms
Weingarten, Rebecca A; Johnson, Ryan C; Conlan, Sean; Ramsburg, Amanda M; Dekker, John P; Lau, Anna F; Khil, Pavel; Odom, Robin T; Deming, Clay; Park, Morgan; Thomas, Pamela J; Henderson, David K; Palmore, Tara N; Segre, Julia A; Frank, Karen M
mortality associated with these resistant Gram-negative bacteria. Horizontal plasmid transfer spreads the resistance mechanism to new bacteria, and understanding the plasmid ecology of the hospital environment can assist in the design of control strategies to prevent nosocomial infections. A 5-year genomic and epidemiological survey was undertaken to study the CPOs in the patient-accessible environment, as well as in the plumbing system removed from the patient. This comprehensive survey revealed a vast, unappreciated reservoir of CPOs in wastewater, which was in contrast to the low positivity rate in both the patient population and the patient-accessible environment. While there were few patient-environmental isolate associations, there were plasmid backbones common to both populations. These results are relevant to all hospitals for which CPO colonization may not yet be defined through extensive surveillance.
Wei, Caihong; Wang, Huihua; Liu, Gang; Wu, Mingming; Cao, Jiaxve; Liu, Zhen; Liu, Ruizao; Zhao, Fuping; Zhang, Li; Lu, Jian; Liu, Chousheng; Du, Lixin
Traditionally, Chinese indigenous sheep were classified geographically and morphologically into three groups: Mongolian, Kazakh and Tibetan. Herein, we aimed to evaluate the population structure and genome selection among 140 individuals from ten representative Chinese indigenous sheep breeds: Ujimqin, Hu, Tong, Large-Tailed Han and Lop breed (Mongolian group); Duolang and Kazakh (Kazakh group); and Diqing, Plateau-type Tibetan, and Valley-type Tibetan breed (Tibetan group). We analyzed the population using principal component analysis (PCA), STRUCTURE and a Neighbor-Joining (NJ)-tree. In PCA plot, the Tibetan and Mongolian groups were clustered as expected; however, Duolang and Kazakh (Kazakh group) were segregated. STRUCTURE analyses suggested two subpopulations: one from North China (Kazakh and Mongolian groups) and the other from the Southwest (Tibetan group). In the NJ-tree, the Tibetan group formed an independent branch and the Kazakh and Mongolian groups were mixed. We then used the d i statistic approach to reveal selection in Chinese indigenous sheep breeds. Among the 599 genome sequence windows analyzed, sixteen (2.7%) exhibited signatures of selection in four or more breeds. We detected three strong selection windows involving three functional genes: RXFP2, PPP1CC and PDGFD. PDGFD, one of the four subfamilies of PDGF, which promotes proliferation and inhibits differentiation of preadipocytes, was significantly selected in fat type breeds by the Rsb (across pairs of populations) approach. Two consecutive selection regions in Duolang sheep were obviously different to other breeds. One region was in OAR2 including three genes (NPR2, SPAG8 and HINT2) the influence growth traits. The other region was in OAR 6 including four genes (PKD2, SPP1, MEPE, and IBSP) associated with a milk production quantitative trait locus. We also identified known candidate genes such as BMPR1B, MSRB3, and three genes (KIT, MC1R, and FRY) that influence lambing percentage, ear size
Annotated Draft Genome Assemblies for the Northern Bobwhite (Colinus virginianus) and the Scaled Quail (Callipepla squamata) Reveal Disparate Estimates of Modern Genome Diversity and Historic Effective Population Size.
Oldeschulte, David L; Halley, Yvette A; Wilson, Miranda L; Bhattarai, Eric K; Brashear, Wesley; Hill, Joshua; Metz, Richard P; Johnson, Charles D; Rollins, Dale; Peterson, Markus J; Bickhart, Derek M; Decker, Jared E; Sewell, John F; Seabury, Christopher M
Northern bobwhite ( Colinus virginianus ; hereafter bobwhite) and scaled quail ( Callipepla squamata ) populations have suffered precipitous declines across most of their US ranges. Illumina-based first- (v1.0) and second- (v2.0) generation draft genome assemblies for the scaled quail and the bobwhite produced N50 scaffold sizes of 1.035 and 2.042 Mb, thereby producing a 45-fold improvement in contiguity over the existing bobwhite assembly, and ≥90% of the assembled genomes were captured within 1313 and 8990 scaffolds, respectively. The scaled quail assembly (v1.0 = 1.045 Gb) was ∼20% smaller than the bobwhite (v2.0 = 1.254 Gb), which was supported by kmer-based estimates of genome size. Nevertheless, estimates of GC content (41.72%; 42.66%), genome-wide repetitive content (10.40%; 10.43%), and MAKER-predicted protein coding genes (17,131; 17,165) were similar for the scaled quail (v1.0) and bobwhite (v2.0) assemblies, respectively. BUSCO analyses utilizing 3023 single-copy orthologs revealed a high level of assembly completeness for the scaled quail (v1.0; 84.8%) and the bobwhite (v2.0; 82.5%), as verified by comparison with well-established avian genomes. We also detected 273 putative segmental duplications in the scaled quail genome (v1.0), and 711 in the bobwhite genome (v2.0), including some that were shared among both species. Autosomal variant prediction revealed ∼2.48 and 4.17 heterozygous variants per kilobase within the scaled quail (v1.0) and bobwhite (v2.0) genomes, respectively, and estimates of historic effective population size were uniformly higher for the bobwhite across all time points in a coalescent model. However, large-scale declines were predicted for both species beginning ∼15-20 KYA. Copyright © 2017 Oldeschulte et al.
Van den Bogert, Bartholomeus; Boekhorst, Jos; Herrmann, Ruth; Smid, Eddy J; Zoetendal, Erwin G; Kleerebezem, Michiel
The human small-intestinal microbiota is characterised by relatively large and dynamic Streptococcus populations. In this study, genome sequences of small-intestinal streptococci from S. mitis, S. bovis, and S. salivarius species-groups were determined and compared with those from 58 Streptococcus strains in public databases. The Streptococcus pangenome consists of 12,403 orthologous groups of which 574 are shared among all sequenced streptococci and are defined as the Streptococcus core genome. Genome mining of the small-intestinal streptococci focused on functions playing an important role in the interaction of these streptococci in the small-intestinal ecosystem, including natural competence and nutrient-transport and metabolism. Analysis of the small-intestinal Streptococcus genomes predicts a high capacity to synthesize amino acids and various vitamins as well as substantial divergence in their carbohydrate transport and metabolic capacities, which is in agreement with observed physiological differences between these Streptococcus strains. Gene-specific PCR-strategies enabled evaluation of conservation of Streptococcus populations in intestinal samples from different human individuals, revealing that the S. salivarius strains were frequently detected in the small-intestine microbiota, supporting the representative value of the genomes provided in this study. Finally, the Streptococcus genomes allow prediction of the effect of dietary substances on Streptococcus population dynamics in the human small-intestine.
Bartholomeus Van den Bogert
Full Text Available The human small-intestinal microbiota is characterised by relatively large and dynamic Streptococcus populations. In this study, genome sequences of small-intestinal streptococci from S. mitis, S. bovis, and S. salivarius species-groups were determined and compared with those from 58 Streptococcus strains in public databases. The Streptococcus pangenome consists of 12,403 orthologous groups of which 574 are shared among all sequenced streptococci and are defined as the Streptococcus core genome. Genome mining of the small-intestinal streptococci focused on functions playing an important role in the interaction of these streptococci in the small-intestinal ecosystem, including natural competence and nutrient-transport and metabolism. Analysis of the small-intestinal Streptococcus genomes predicts a high capacity to synthesize amino acids and various vitamins as well as substantial divergence in their carbohydrate transport and metabolic capacities, which is in agreement with observed physiological differences between these Streptococcus strains. Gene-specific PCR-strategies enabled evaluation of conservation of Streptococcus populations in intestinal samples from different human individuals, revealing that the S. salivarius strains were frequently detected in the small-intestine microbiota, supporting the representative value of the genomes provided in this study. Finally, the Streptococcus genomes allow prediction of the effect of dietary substances on Streptococcus population dynamics in the human small-intestine.
Van den Bogert, Bartholomeus; Boekhorst, Jos; Herrmann, Ruth; Smid, Eddy J.; Zoetendal, Erwin G.; Kleerebezem, Michiel
The human small-intestinal microbiota is characterised by relatively large and dynamic Streptococcus populations. In this study, genome sequences of small-intestinal streptococci from S. mitis, S. bovis, and S. salivarius species-groups were determined and compared with those from 58 Streptococcus strains in public databases. The Streptococcus pangenome consists of 12,403 orthologous groups of which 574 are shared among all sequenced streptococci and are defined as the Streptococcus core genome. Genome mining of the small-intestinal streptococci focused on functions playing an important role in the interaction of these streptococci in the small-intestinal ecosystem, including natural competence and nutrient-transport and metabolism. Analysis of the small-intestinal Streptococcus genomes predicts a high capacity to synthesize amino acids and various vitamins as well as substantial divergence in their carbohydrate transport and metabolic capacities, which is in agreement with observed physiological differences between these Streptococcus strains. Gene-specific PCR-strategies enabled evaluation of conservation of Streptococcus populations in intestinal samples from different human individuals, revealing that the S. salivarius strains were frequently detected in the small-intestine microbiota, supporting the representative value of the genomes provided in this study. Finally, the Streptococcus genomes allow prediction of the effect of dietary substances on Streptococcus population dynamics in the human small-intestine. PMID:24386196
Malaspinas, Anna-Sapfo; Lao, Oscar; Schroeder, Hannes
find that their genomic ancestry is Polynesian, with no detectable Native American component. Radiocarbon analysis of the skulls shows that the individuals had died prior to the beginning of the 19th century. Our findings could either represent genomic evidence of Polynesians reaching South America...
Neave, Matthew J.
Endozoicomonas bacteria are globally distributed and often abundantly associated with diverse marine hosts including reef-building corals, yet their function remains unknown. In this study we generated novel Endozoicomonas genomes from single cells and metagenomes obtained directly from the corals Stylophora pistillata, Pocillopora verrucosa, and Acropora humilis. We then compared these culture-independent genomes to existing genomes of bacterial isolates acquired from a sponge, sea slug, and coral to examine the functional landscape of this enigmatic genus. Sequencing and analysis of single cells and metagenomes resulted in four novel genomes with 60–76% and 81–90% genome completeness, respectively. These data also confirmed that Endozoicomonas genomes are large and are not streamlined for an obligate endosymbiotic lifestyle, implying that they have free-living stages. All genomes show an enrichment of genes associated with carbon sugar transport and utilization and protein secretion, potentially indicating that Endozoicomonas contribute to the cycling of carbohydrates and the provision of proteins to their respective hosts. Importantly, besides these commonalities, the genomes showed evidence for differential functional specificity and diversification, including genes for the production of amino acids. Given this metabolic diversity of Endozoicomonas we propose that different genotypes play disparate roles and have diversified in concert with their hosts.
In this study we sequenced the genomes of 60 Fusarium graminearum, the major fungal pathogen responsible for Fusarium head blight (FHB) in cereal crops world-wide. To investigate adaptive evolution of FHB pathogens, we performed population-level analyses to characterize genomic structure, signatures...
Sükösd, Zsuzsanna; Andersen, Ebbe Sloth; Seemann, Ernst Stefan
of the HIV-1 genome is highly variable in most regions, with a limited number of stable and conserved RNA secondary structures. Most interesting, a set of long distance interactions form a core organizing structure (COS) that organize the genome into three major structural domains. Despite overlapping...
The American cranberry (Vaccinium macrocarpon Ait.) mitochondrial genome was assembled and reconstructed from whole genome 454 Roche GS-FLX and Illumina shotgun sequences. Compared with other Asterids, the reconstruction of the genome revealed an average size mitochondrion (459,678 nt) with comparat...
Salomé, Patrice A; Weigel, Detlef
The circadian oscillator is astonishingly robust to changes in the environment but also to genomic changes that alter the copy number of its components through genome duplication, gene duplication, and homeologous gene loss. While studying the potential effect of aneuploidy on the Arabidopsis thaliana circadian clock, we discovered that a line thought to be trisomic for chromosome 3 also bears the gi-1 mutation, resulting in a short period and late flowering. With the help of whole-genome sequencing, we uncovered the unexpected complexity of this trisomic stock's history, as its genome shows evidence of past outcrossing with another A. thaliana accession. Our study indicates that although historical aneuploidy lines exist and are available, it might be safer to generate new individuals and confirm their genomes and karyotypes by sequencing. Copyright © 2015 Salomé and Weigel.
Pathak, Ashish; Chauhan, Ashvini; Blom, Jochen; Indest, Karl J; Jung, Carina M; Stothard, Paul; Bera, Gopal; Green, Stefan J; Ogram, Andrew
The genome of Rhodococcus opacus strain M213, isolated from a fuel-oil contaminated soil, was sequenced and annotated which revealed a genome size of 9,194,165 bp encoding 8680 putative genes and a G+C content of 66.72%. Among the protein coding genes, 71.77% were annotated as clusters of orthologous groups of proteins (COGs); 55% of the COGs were present as paralog clusters. Pulsed field gel electrophoresis (PFGE) analysis of M213 revealed the presence of three different sized replicons- a circular chromosome and two megaplasmids (pNUO1 and pNUO2) estimated to be of 750Kb 350Kb in size, respectively. Conversely, using an alternative approach of optical mapping, the plasmid replicons appeared as a circular ~1.2 Mb megaplasmid and a linear, ~0.7 Mb megaplasmid. Genome-wide comparative analysis of M213 with a cohort of sequenced Rhodococcus species revealed low syntenic affiliation with other R. opacus species including strains B4 and PD630. Conversely, a closer affiliation of M213, at the functional (COG) level, was observed with the catabolically versatile R. jostii strain RHA1 and other Rhodococcii such as R. wratislaviensis strain IFP 2016, R. imtechensis strain RKJ300, Rhodococcus sp. strain JVH1, and Rhodococcus sp. strain DK17, respectively. An in-depth, genome-wide comparison between these functional relatives revealed 971 unique genes in M213 representing 11% of its total genome; many associating with catabolic functions. Of major interest was the identification of as many as 154 genomic islands (GEIs), many with duplicated catabolic genes, in particular for PAHs; a trait that was confirmed by PCR-based identification of naphthalene dioxygenase (NDO) as a representative gene, across PFGE-resolved replicons of strain M213. Interestingly, several plasmid/GEI-encoded genes, that likely participate in degrading naphthalene (NAP) via a peculiar pathway, were also identified in strain M213 using a combination of bioinformatics, metabolic analysis and gene
Wu Harry X
Full Text Available Abstract Background Wood is a valuable natural resource and a major carbon sink. Wood formation is an important developmental process in vascular plants which played a crucial role in plant evolution. Although genes involved in xylem formation have been investigated, the molecular mechanisms of xylem evolution are not well understood. We use comparative genomics to examine evolution of the xylem transcriptome to gain insights into xylem evolution. Results The xylem transcriptome is highly conserved in conifers, but considerably divergent in angiosperms. The functional domains of genes in the xylem transcriptome are moderately to highly conserved in vascular plants, suggesting the existence of a common ancestral xylem transcriptome. Compared to the total transcriptome derived from a range of tissues, the xylem transcriptome is relatively conserved in vascular plants. Of the xylem transcriptome, cell wall genes, ancestral xylem genes, known proteins and transcription factors are relatively more conserved in vascular plants. A total of 527 putative xylem orthologs were identified, which are unevenly distributed across the Arabidopsis chromosomes with eight hot spots observed. Phylogenetic analysis revealed that evolution of the xylem transcriptome has paralleled plant evolution. We also identified 274 conifer-specific xylem unigenes, all of which are of unknown function. These xylem orthologs and conifer-specific unigenes are likely to have played a crucial role in xylem evolution. Conclusions Conifers have highly conserved xylem transcriptomes, while angiosperm xylem transcriptomes are relatively diversified. Vascular plants share a common ancestral xylem transcriptome. The xylem transcriptomes of vascular plants are more conserved than the total transcriptomes. Evolution of the xylem transcriptome has largely followed the trend of plant evolution.
Allison L Creason
Full Text Available Members of Gram-positive Actinobacteria cause economically important diseases to plants. Within the Rhodococcus genus, some members can cause growth deformities and persist as pathogens on a wide range of host plants. The current model predicts that phytopathogenic isolates require a cluster of three loci present on a linear plasmid, with the fas operon central to virulence. The Fas proteins synthesize, modify, and activate a mixture of growth regulating cytokinins, which cause a hormonal imbalance in plants, resulting in abnormal growth. We sequenced and compared the genomes of 20 isolates of Rhodococcus to gain insights into the mechanisms and evolution of virulence in these bacteria. Horizontal gene transfer was identified as critical but limited in the scale of virulence evolution, as few loci are conserved and exclusive to phytopathogenic isolates. Although the fas operon is present in most phytopathogenic isolates, it is absent from phytopathogenic isolate A21d2. Instead, this isolate has a horizontally acquired gene chimera that encodes a novel fusion protein with isopentyltransferase and phosphoribohydrolase domains, predicted to be capable of catalyzing and activating cytokinins, respectively. Cytokinin profiling of the archetypal D188 isolate revealed only one activate cytokinin type that was specifically synthesized in a fas-dependent manner. These results suggest that only the isopentenyladenine cytokinin type is synthesized and necessary for Rhodococcus phytopathogenicity, which is not consistent with the extant model stating that a mixture of cytokinins is necessary for Rhodococcus to cause leafy gall symptoms. In all, data indicate that only four horizontally acquired functions are sufficient to confer the trait of phytopathogenicity to members of the genetically diverse clade of Rhodococcus.
Wu, Chung-Shien; Huang, Ya-Yi; Chaw, Shu-Miaw
We determined the complete chloroplast genome (cpDNA) of Ginkgo biloba (common name: ginkgo), the only relict of ginkgophytes from the Triassic Period. The cpDNA molecule of ginkgo is quadripartite and circular, with a length of 156,945 bp, which is 6,458 bp shorter than that of Cycas taitungensis. In ginkgo cpDNA, rpl23 becomes pseudo, only one copy of ycf2 is retained, and there are at least five editing sites. We propose that the retained ycf2 is a duplicate of the ancestral ycf2, and the ancestral one has been lost from the inverted repeat A (IRA). This loss event should have occurred and led to the contraction of IRs after ginkgos diverged from other gymnosperms. A novel cluster of three transfer RNA (tRNA) genes, trnY-AUA, trnC-ACA, and trnSeC-UCA, was predicted to be located between trnC-GCA and rpoB of the large single-copy region. Our phylogenetic analysis strongly suggests that the three predicted tRNA genes are duplicates of trnC-GCA. Interestingly, in ginkgo cpDNA, the loss of one ycf2 copy does not significantly elevate the synonymous rate (Ks) of the retained copy, which disagrees with the view of Perry and Wolfe (2002) that one of the two-copy genes is subjected to elevated Ks when its counterpart has been lost. We hypothesize that the loss of one ycf2 is likely recent, and therefore, the acquired Ks of the retained copy is low. Our data reveal that ginkgo possesses several unique features that contribute to our understanding of the cpDNA evolution in seed plants. PMID:22403032
Kristoffer T Bæk
Full Text Available Staphylococcus aureus strains of the 8325 lineage, especially 8325-4 and derivatives lacking prophage, have been used extensively for decades of research. We report herein the results of our deep sequence analysis of strain 8325-4. Assignment of sequence variants compared with the reference strain 8325 (NRS77/PS47 required correction of errors in the 8325 reference genome, and reassessment of variation previously attributed to chemical mutagenesis of the restriction-defective RN4220. Using an extensive strain pedigree analysis, we discovered that 8325-4 contains 16 single nucleotide polymorphisms (SNP arising prior to the construction of RN4220. We identified 5 indels in 8325-4 compared with 8325. Three indels correspond to expected Φ11, 12, 13 excisions, one indel is explained by a sequence assembly artifact, and the final indel (Δ63bp in the spa-sarS intergenic region is common to only a sub-lineage of 8325-4 strains including SH1000. This deletion was found to significantly decrease (75% steady state sarS but not spa transcript levels in post-exponential phase. The sub-lineage 8325-4 was also found to harbor 4 additional SNPs. We also found large sequence variation between 8325, 8325-4 and RN4220 in a cluster of repetitive hypothetical proteins (SA0282 homologs near the Ess secretion cluster. The overall 8325-4 SNP set results in 17 alterations within coding sequences. Remarkably, we discovered that all tested strains of the 8325-4 lineage lack phenol soluble modulin α3 (PSMα3, a virulence determinant implicated in neutrophil chemotaxis, biofilm architecture and surface spreading. Collectively, our results clarify and define the 8325-4 pedigree and reveal clear evidence that mutations existing throughout all branches of this lineage, including the widely used RN6390 and SH1000 strains, could conceivably impact virulence regulation.
Creason, Allison L; Vandeputte, Olivier M; Savory, Elizabeth A; Davis, Edward W; Putnam, Melodie L; Hu, Erdong; Swader-Hines, David; Mol, Adeline; Baucher, Marie; Prinsen, Els; Zdanowska, Magdalena; Givan, Scott A; El Jaziri, Mondher; Loper, Joyce E; Mahmud, Taifo; Chang, Jeff H
Members of Gram-positive Actinobacteria cause economically important diseases to plants. Within the Rhodococcus genus, some members can cause growth deformities and persist as pathogens on a wide range of host plants. The current model predicts that phytopathogenic isolates require a cluster of three loci present on a linear plasmid, with the fas operon central to virulence. The Fas proteins synthesize, modify, and activate a mixture of growth regulating cytokinins, which cause a hormonal imbalance in plants, resulting in abnormal growth. We sequenced and compared the genomes of 20 isolates of Rhodococcus to gain insights into the mechanisms and evolution of virulence in these bacteria. Horizontal gene transfer was identified as critical but limited in the scale of virulence evolution, as few loci are conserved and exclusive to phytopathogenic isolates. Although the fas operon is present in most phytopathogenic isolates, it is absent from phytopathogenic isolate A21d2. Instead, this isolate has a horizontally acquired gene chimera that encodes a novel fusion protein with isopentyltransferase and phosphoribohydrolase domains, predicted to be capable of catalyzing and activating cytokinins, respectively. Cytokinin profiling of the archetypal D188 isolate revealed only one activate cytokinin type that was specifically synthesized in a fas-dependent manner. These results suggest that only the isopentenyladenine cytokinin type is synthesized and necessary for Rhodococcus phytopathogenicity, which is not consistent with the extant model stating that a mixture of cytokinins is necessary for Rhodococcus to cause leafy gall symptoms. In all, data indicate that only four horizontally acquired functions are sufficient to confer the trait of phytopathogenicity to members of the genetically diverse clade of Rhodococcus.
Full Text Available Heme is a cofactor in proteins that function in almost all sub-cellular compartments and in many diverse biological processes. Heme is produced by a conserved biosynthetic pathway that is highly regulated to prevent the accumulation of heme--a cytotoxic, hydrophobic tetrapyrrole. Caenorhabditis elegans and related parasitic nematodes do not synthesize heme, but instead require environmental heme to grow and develop. Heme homeostasis in these auxotrophs is, therefore, regulated in accordance with available dietary heme. We have capitalized on this auxotrophy in C. elegans to study gene expression changes associated with precisely controlled dietary heme concentrations. RNA was isolated from cultures containing 4, 20, or 500 microM heme; derived cDNA probes were hybridized to Affymetrix C. elegans expression arrays. We identified 288 heme-responsive genes (hrgs that were differentially expressed under these conditions. Of these genes, 42% had putative homologs in humans, while genomes of medically relevant heme auxotrophs revealed homologs for 12% in both Trypanosoma and Leishmania and 24% in parasitic nematodes. Depletion of each of the 288 hrgs by RNA-mediated interference (RNAi in a transgenic heme-sensor worm strain identified six genes that regulated heme homeostasis. In addition, seven membrane-spanning transporters involved in heme uptake were identified by RNAi knockdown studies using a toxic heme analog. Comparison of genes that were positive in both of the RNAi screens resulted in the identification of three genes in common that were vital for organismal heme homeostasis in C. elegans. Collectively, our results provide a catalog of genes that are essential for metazoan heme homeostasis and demonstrate the power of C. elegans as a genetic animal model to dissect the regulatory circuits which mediate heme trafficking in both vertebrate hosts and their parasites, which depend on environmental heme for survival.
Grossmann, Patrick; Gutman, David A.; Dunn, William D. Jr; Holder, Chad A.; Aerts, Hugo J. W. L.
Glioblastoma (GBM) tumors exhibit strong phenotypic differences that can be quantified using magnetic resonance imaging (MRI), but the underlying biological drivers of these imaging phenotypes remain largely unknown. An Imaging-Genomics analysis was performed to reveal the mechanistic associations between MRI derived quantitative volumetric tumor phenotype features and molecular pathways. One hundred fourty one patients with presurgery MRI and survival data were included in our analysis. Volumetric features were defined, including the necrotic core (NE), contrast-enhancement (CE), abnormal tumor volume assessed by post-contrast T1w (tumor bulk or TB), tumor-associated edema based on T2-FLAIR (ED), and total tumor volume (TV), as well as ratios of these tumor components. Based on gene expression where available (n = 91), pathway associations were assessed using a preranked gene set enrichment analysis. These results were put into context of molecular subtypes in GBM and prognostication. Volumetric features were significantly associated with diverse sets of biological processes (FDR < 0.05). While NE and TB were enriched for immune response pathways and apoptosis, CE was associated with signal transduction and protein folding processes. ED was mainly enriched for homeostasis and cell cycling pathways. ED was also the strongest predictor of molecular GBM subtypes (AUC = 0.61). CE was the strongest predictor of overall survival (C-index = 0.6; Noether test, p = 4x10 −4 ). GBM volumetric features extracted from MRI are significantly enriched for information about the biological state of a tumor that impacts patient outcomes. Clinical decision-support systems could exploit this information to develop personalized treatment strategies on the basis of noninvasive imaging. The online version of this article (doi:10.1186/s12885-016-2659-5) contains supplementary material, which is available to authorized users
Full Text Available Vibrio cholerae is commonly found in estuarine water systems. Toxigenic O1 and O139 V. cholerae strains have caused cholera epidemics and pandemics, whereas the nontoxigenic strains within these serogroups only occasionally lead to disease. To understand the differences in the genome and clonality between the toxigenic and nontoxigenic strains of V. cholerae serogroups O1 and O139, we employed a whole genome PCR scanning (WGPScanning method, an rrn operon-mediated fragment rearrangement analysis and comparative genomic hybridization (CGH to analyze the genome structure of different strains. WGPScanning in conjunction with CGH revealed that the genomic contents of the toxigenic strains were conservative, except for a few indels located mainly in mobile elements. Minor nucleotide variation in orthologous genes appeared to be the major difference between the toxigenic strains. rrn operon-mediated rearrangements were infrequent in El Tor toxigenic strains tested using I-CeuI digested pulsed-field gel electrophoresis (PFGE analysis and PCR analysis based on flanking sequence of rrn operons. Using these methods, we found that the genomic structures of toxigenic El Tor and O139 strains were syntenic. The nontoxigenic strains exhibited more extensive sequence variations, but toxin coregulated pilus positive (TCP+ strains had a similar structure. TCP+ nontoxigenic strains could be subdivided into multiple lineages according to the TCP type, suggesting the existence of complex intermediates in the evolution of toxigenic strains. The data indicate that toxigenic O1 El Tor and O139 strains were derived from a single lineage of intermediates from complex clones in the environment. The nontoxigenic strains with non-El Tor type TCP may yet evolve into new epidemic clones after attaining toxigenic attributes.