WorldWideScience

Sample records for genome size variation

  1. Genome size variation in Begonia.

    Science.gov (United States)

    Dewitte, Angelo; Leus, Leen; Eeckhaut, Tom; Vanstechelman, Ives; Van Huylenbroeck, Johan; Van Bockstaele, Erik

    2009-10-01

    The genome sizes of a Begonia collection comprising 37 species and 23 hybrids of African, Asiatic, Middle American, and South American origin were screened using flow cytometry. Within the collection, 1C values varied between 0.23 and 1.46 pg DNA. Genome sizes were, in most cases, not positively correlated with chromosome number, but with pollen size. A 12-fold difference in mean chromosome size was found between the genotypes with the largest and smallest chromosomes. In general, chromosomes from South American genotypes were smaller than chromosomes of African, Asian, or Middle American genotypes, except for B. boliviensis and B. pearcei. Cytological chromosome studies in different genotypes showed variable chromosome numbers, length, width, and total chromosome volume, which confirmed the diversity in genome size. Large secondary constrictions were present in several investigated genotypes. These data show that chromosome number and structure exhibit a great deal of variation within the genus Begonia, and likely help to explain the large number of taxa found within the genus.

  2. Genome size variation in the genus Avena.

    Science.gov (United States)

    Yan, Honghai; Martin, Sara L; Bekele, Wubishet A; Latta, Robert G; Diederichsen, Axel; Peng, Yuanying; Tinker, Nicholas A

    2016-03-01

    Genome size is an indicator of evolutionary distance and a metric for genome characterization. Here, we report accurate estimates of genome size in 99 accessions from 26 species of Avena. We demonstrate that the average genome size of C genome diploid species (2C = 10.26 pg) is 15% larger than that of A genome species (2C = 8.95 pg), and that this difference likely accounts for a progression of size among tetraploid species, where AB genome configuration had similar genome sizes (average 2C = 25.74 pg). Genome size was mostly consistent within species and in general agreement with current information about evolutionary distance among species. Results also suggest that most of the polyploid species in Avena have experienced genome downsizing in relation to their diploid progenitors. Genome size measurements could provide additional quality control for species identification in germplasm collections, especially in cases where diploid and polyploid species have similar morphology.

  3. Patterns of genome size variation in snapping shrimp.

    Science.gov (United States)

    Jeffery, Nicholas W; Hultgren, Kristin; Chak, Solomon Tin Chi; Gregory, T Ryan; Rubenstein, Dustin R

    2016-06-01

    Although crustaceans vary extensively in genome size, little is known about how genome size may affect the ecology and evolution of species in this diverse group, in part due to the lack of large genome size datasets. Here we investigate interspecific, intraspecific, and intracolony variation in genome size in 39 species of Synalpheus shrimps, representing one of the largest genome size datasets for a single genus within crustaceans. We find that genome size ranges approximately 4-fold across Synalpheus with little phylogenetic signal, and is not related to body size. In a subset of these species, genome size is related to chromosome size, but not to chromosome number, suggesting that despite large genomes, these species are not polyploid. Interestingly, there appears to be 35% intraspecific genome size variation in Synalpheus idios among geographic regions, and up to 30% variation in Synalpheus duffyi genome size within the same colony.

  4. Intrapopulation Genome Size Variation in D. melanogaster Reflects Life History Variation and Plasticity

    Science.gov (United States)

    Ellis, Lisa L.; Huang, Wen; Quinn, Andrew M.; Ahuja, Astha; Alfrejd, Ben; Gomez, Francisco E.; Hjelmen, Carl E.; Moore, Kristi L.; Mackay, Trudy F. C.; Johnston, J. Spencer; Tarone, Aaron M.

    2014-01-01

    We determined female genome sizes using flow cytometry for 211 Drosophila melanogaster sequenced inbred strains from the Drosophila Genetic Reference Panel, and found significant conspecific and intrapopulation variation in genome size. We also compared several life history traits for 25 lines with large and 25 lines with small genomes in three thermal environments, and found that genome size as well as genome size by temperature interactions significantly correlated with survival to pupation and adulthood, time to pupation, female pupal mass, and female eclosion rates. Genome size accounted for up to 23% of the variation in developmental phenotypes, but the contribution of genome size to variation in life history traits was plastic and varied according to the thermal environment. Expression data implicate differences in metabolism that correspond to genome size variation. These results indicate that significant genome size variation exists within D. melanogaster and this variation may impact the evolutionary ecology of the species. Genome size variation accounts for a significant portion of life history variation in an environmentally dependent manner, suggesting that potential fitness effects associated with genome size variation also depend on environmental conditions. PMID:25057905

  5. Intrapopulation genome size variation in D. melanogaster reflects life history variation and plasticity.

    Directory of Open Access Journals (Sweden)

    Lisa L Ellis

    2014-07-01

    Full Text Available We determined female genome sizes using flow cytometry for 211 Drosophila melanogaster sequenced inbred strains from the Drosophila Genetic Reference Panel, and found significant conspecific and intrapopulation variation in genome size. We also compared several life history traits for 25 lines with large and 25 lines with small genomes in three thermal environments, and found that genome size as well as genome size by temperature interactions significantly correlated with survival to pupation and adulthood, time to pupation, female pupal mass, and female eclosion rates. Genome size accounted for up to 23% of the variation in developmental phenotypes, but the contribution of genome size to variation in life history traits was plastic and varied according to the thermal environment. Expression data implicate differences in metabolism that correspond to genome size variation. These results indicate that significant genome size variation exists within D. melanogaster and this variation may impact the evolutionary ecology of the species. Genome size variation accounts for a significant portion of life history variation in an environmentally dependent manner, suggesting that potential fitness effects associated with genome size variation also depend on environmental conditions.

  6. Nuclear DNA content in Sinningia (Gesneriaceae); intraspecific genome size variation and genome characterization in S. speciosa.

    Science.gov (United States)

    Zaitlin, David; Pierce, Andrew J

    2010-12-01

    The Gesneriaceae (Lamiales) is a family of flowering plants comprising >3000 species of mainly tropical origin, the most familiar of which is the cultivated African violet (Saintpaulia spp.). Species of Gesneriaceae are poorly represented in the lists of taxa sampled for genome size estimation; measurements are available for three species of Ramonda and one each of Haberlea, Saintpaulia, and Streptocarpus, all species of Old World origin. We report here nuclear genome size estimates for 10 species of Sinningia, a neotropical genus largely restricted to Brazil. Flow cytometry of leaf cell nuclei showed that holoploid genome size in Sinningia is very small (approximately two times the size of the Arabidopsis genome), and is small compared to the other six species of Gesneriaceae with genome size estimates. We also documented intraspecific genome size variation of 21%-26% within a group of wild Sinningia speciosa (Lodd.) Hiern collections. In addition, we analyzed 1210 genome survey sequences from S. speciosa to characterize basic features of the nuclear genome such as guanine-cytosine content, types of repetitive elements, numbers of protein-coding sequences, and sequences unique to S. speciosa. We included several other angiosperm species as genome size standards, one of which was the snapdragon (Antirrhinum majus L.; Veronicaceae, Lamiales). Multiple measurements on three accessions indicated that the genome size of A. majus is ~633 × 10⁶ base pairs, which is approximately 40% of the previously published estimate.

  7. Hawaiian Drosophila genomes: size variation and evolutionary expansions.

    Science.gov (United States)

    Craddock, Elysse M; Gall, Joseph G; Jonas, Mark

    2016-02-01

    This paper reports genome sizes of one Hawaiian Scaptomyza and 16 endemic Hawaiian Drosophila species that include five members of the antopocerus species group, one member of the modified mouthpart group, and ten members of the picture wing clade. Genome size expansions have occurred independently multiple times among Hawaiian Drosophila lineages, and have resulted in an over 2.3-fold range of genome sizes among species, with the largest observed in Drosophila cyrtoloma (1C = 0.41 pg). We find evidence that these repeated genome size expansions were likely driven by the addition of significant amounts of heterochromatin and satellite DNA. For example, our data reveal that the addition of seven heterochromatic chromosome arms to the ancestral haploid karyotype, and a remarkable proportion of ~70 % satellite DNA, account for the greatly expanded size of the D. cyrtoloma genome. Moreover, the genomes of 13/17 Hawaiian picture wing species are composed of substantial proportions (22-70 %) of detectable satellites (all but one of which are AT-rich). Our results suggest that in this tightly knit group of recently evolved species, genomes have expanded, in large part, via evolutionary amplifications of satellite DNA sequences in centric and pericentric domains (especially of the X and dot chromosomes), which have resulted in longer acrocentric chromosomes or metacentrics with an added heterochromatic chromosome arm. We discuss possible evolutionary mechanisms that may have shaped these patterns, including rapid fixation of novel expanded genomes during founder-effect speciation.

  8. Genome Size in North American Fireflies: Substantial Variation Likely Driven by Neutral Processes

    Science.gov (United States)

    Johnston, J. Spencer; Stanger-Hall, Kathrin F.; Hjelmen, Carl E.; Hanrahan, Shawn J.; Korunes, Katharine; Hall, David

    2017-01-01

    Abstract Eukaryotic genomes show tremendous size variation across taxa. Proximate explanations for genome size variation include differences in ploidy and amounts of noncoding DNA, especially repetitive DNA. Ultimate explanations include selection on physiological correlates of genome size such as cell size, which in turn influence body size, resulting in the often-observed correlation between body size and genome size. In this study, we examined body size and repetitive DNA elements in relationship to the evolution of genome size in North American representatives of a single beetle family, the Lampyridae (fireflies). The 23 species considered represent an excellent study system because of the greater than 5-fold range of genome sizes, documented here using flow cytometry, and the 3-fold range in body size, measured using pronotum width. We also identified common genomic repetitive elements using low-coverage sequencing. We found a positive relationship between genome size and repetitive DNA, particularly retrotransposons. Both genome size and these elements were evolving as expected given phylogenetic relatedness. We also tested whether genome size varied with body size and found no relationship. Together, our results suggest that genome size is evolving neutrally in fireflies. PMID:28541478

  9. Genome downsizing and karyotype constancy in diploid and polyploid congeners: a model of genome size variation.

    Science.gov (United States)

    Poggio, Lidia; Realini, María Florencia; Fourastié, María Florencia; García, Ana María; González, Graciela Esther

    2014-06-26

    Evolutionary chromosome change involves significant variation in DNA amount in diploids and genome downsizing in polyploids. Genome size and karyotype parameters of Hippeastrum species with different ploidy level were analysed. In Hippeastrum, polyploid species show less DNA content per basic genome than diploid species. The rate of variation is lower at higher ploidy levels. All the species have a basic number x = 11 and bimodal karyotypes. The basic karyotypes consist of four short metacentric chromosomes and seven large chromosomes (submetacentric and subtelocentric). The bimodal karyotype is preserved maintaining the relative proportions of members of the haploid chromosome set, even in the presence of genome downsizing. The constancy of the karyotype is maintained because changes in DNA amount are proportional to the length of the whole-chromosome complement and vary independently in the long and short sets of chromosomes. This karyotype constancy in taxa of Hippeastrum with different genome size and ploidy level indicates that the distribution of extra DNA within the complement is not at random and suggests the presence of mechanisms selecting for constancy, or against changes, in karyotype morphology.

  10. Phylogeny, rate variation, and genome size evolution of Pelargonium (Geraniaceae).

    Science.gov (United States)

    Weng, Mao-Lun; Ruhlman, Tracey A; Gibby, Mary; Jansen, Robert K

    2012-09-01

    The phylogeny of 58 Pelargonium species was estimated using five plastid markers (rbcL, matK, ndhF, rpoC1, trnL-F) and one mitochondrial gene (nad5). The results confirmed the monophyly of three major clades and four subclades within Pelargonium but also indicate the need to revise some sectional classifications. This phylogeny was used to examine karyotype evolution in the genus: plotting chromosome sizes, numbers and 2C-values indicates that genome size is significantly correlated with chromosome size but not number. Accelerated rates of nucleotide substitution have been previously detected in both plastid and mitochondrial genes in Pelargonium, but sparse taxon sampling did not enable identification of the phylogenetic distribution of these elevated rates. Using the multigene phylogeny as a constraint, we investigated lineage- and locus-specific heterogeneity of substitution rates in Pelargonium for an expanded number of taxa and demonstrated that both plastid and mitochondrial genes have had accelerated substitution rates but with markedly disparate patterns. In the plastid, the exons of rpoC1 have significantly accelerated substitution rates compared to its intron and the acceleration was mainly due to nonsynonymous substitutions. In contrast, the mitochondrial gene, nad5, experienced substantial acceleration of synonymous substitution rates in three internal branches of Pelargonium, but this acceleration ceased in all terminal branches. Several lineages also have dN/dS ratios significantly greater than one for rpoC1, indicating that positive selection is acting on this gene, whereas the accelerated synonymous substitutions in the mitochondrial gene are the result of elevated mutation rates.

  11. Transposable element distribution, abundance and role in genome size variation in the genus Oryza.

    Science.gov (United States)

    Zuccolo, Andrea; Sebastian, Aswathy; Talag, Jayson; Yu, Yeisoo; Kim, HyeRan; Collura, Kristi; Kudrna, Dave; Wing, Rod A

    2007-08-29

    The genus Oryza is composed of 10 distinct genome types, 6 diploid and 4 polyploid, and includes the world's most important food crop - rice (Oryza sativa [AA]). Genome size variation in the Oryza is more than 3-fold and ranges from 357 Mbp in Oryza glaberrima [AA] to 1283 Mbp in the polyploid Oryza ridleyi [HHJJ]. Because repetitive elements are known to play a significant role in genome size variation, we constructed random sheared small insert genomic libraries from 12 representative Oryza species and conducted a comprehensive study of the repetitive element composition, distribution and phylogeny in this genus. Particular attention was paid to the role played by the most important classes of transposable elements (Long Terminal Repeats Retrotransposons, Long interspersed Nuclear Elements, helitrons, DNA transposable elements) in shaping these genomes and in their contributing to genome size variation. We identified the elements primarily responsible for the most strikingly genome size variation in Oryza. We demonstrated how Long Terminal Repeat retrotransposons belonging to the same families have proliferated to very different extents in various species. We also showed that the pool of Long Terminal Repeat Retrotransposons is substantially conserved and ubiquitous throughout the Oryza and so its origin is ancient and its existence predates the speciation events that originated the genus. Finally we described the peculiar behavior of repeats in the species Oryza coarctata [HHKK] whose placement in the Oryza genus is controversial. Long Terminal Repeat retrotransposons are the major component of the Oryza genomes analyzed and, along with polyploidization, are the most important contributors to the genome size variation across the Oryza genus. Two families of Ty3-gypsy elements (RIRE2 and Atlantys) account for a significant portion of the genome size variations present in the Oryza genus.

  12. Transposable element distribution, abundance and role in genome size variation in the genus Oryza

    Directory of Open Access Journals (Sweden)

    Collura Kristi

    2007-08-01

    Full Text Available Abstract Background The genus Oryza is composed of 10 distinct genome types, 6 diploid and 4 polyploid, and includes the world's most important food crop – rice (Oryza sativa [AA]. Genome size variation in the Oryza is more than 3-fold and ranges from 357 Mbp in Oryza glaberrima [AA] to 1283 Mbp in the polyploid Oryza ridleyi [HHJJ]. Because repetitive elements are known to play a significant role in genome size variation, we constructed random sheared small insert genomic libraries from 12 representative Oryza species and conducted a comprehensive study of the repetitive element composition, distribution and phylogeny in this genus. Particular attention was paid to the role played by the most important classes of transposable elements (Long Terminal Repeats Retrotransposons, Long interspersed Nuclear Elements, helitrons, DNA transposable elements in shaping these genomes and in their contributing to genome size variation. Results We identified the elements primarily responsible for the most strikingly genome size variation in Oryza. We demonstrated how Long Terminal Repeat retrotransposons belonging to the same families have proliferated to very different extents in various species. We also showed that the pool of Long Terminal Repeat Retrotransposons is substantially conserved and ubiquitous throughout the Oryza and so its origin is ancient and its existence predates the speciation events that originated the genus. Finally we described the peculiar behavior of repeats in the species Oryza coarctata [HHKK] whose placement in the Oryza genus is controversial. Conclusion Long Terminal Repeat retrotransposons are the major component of the Oryza genomes analyzed and, along with polyploidization, are the most important contributors to the genome size variation across the Oryza genus. Two families of Ty3-gypsy elements (RIRE2 and Atlantys account for a significant portion of the genome size variations present in the Oryza genus.

  13. Intra-specific variation in genome size in maize: cytological and phenotypic correlates

    Science.gov (United States)

    Realini, María Florencia; Poggio, Lidia; Cámara-Hernández, Julián; González, Graciela Esther

    2016-01-01

    Genome size variation accompanies the diversification and evolution of many plant species. Relationships between DNA amount and phenotypic and cytological characteristics form the basis of most hypotheses that ascribe a biological role to genome size. The goal of the present research was to investigate the intra-specific variation in the DNA content in maize populations from Northeastern Argentina and further explore the relationship between genome size and the phenotypic traits seed weight and length of the vegetative cycle. Moreover, cytological parameters such as the percentage of heterochromatin as well as the number, position and sequence composition of knobs were analysed and their relationships with 2C DNA values were explored. The populations analysed presented significant differences in 2C DNA amount, from 4.62 to 6.29 pg, representing 36.15 % of the inter-populational variation. Moreover, intra-populational genome size variation was found, varying from 1.08 to 1.63-fold. The variation in the percentage of knob heterochromatin as well as in the number, chromosome position and sequence composition of the knobs was detected among and within the populations. Although a positive relationship between genome size and the percentage of heterochromatin was observed, a significant correlation was not found. This confirms that other non-coding repetitive DNA sequences are contributing to the genome size variation. A positive relationship between DNA amount and the seed weight has been reported in a large number of species, this relationship was not found in the populations studied here. The length of the vegetative cycle showed a positive correlation with the percentage of heterochromatin. This result allowed attributing an adaptive effect to heterochromatin since the length of this cycle would be optimized via selection for an appropriate percentage of heterochromatin. PMID:26644343

  14. Chromosome Numbers and Genome Size Variation in Indian Species of Curcuma (Zingiberaceae)

    Science.gov (United States)

    Leong-Škorničková, Jana; Šída, Otakar; Jarolímová, Vlasta; Sabu, Mamyil; Fér, Tomáš; Trávníček, Pavel; Suda, Jan

    2007-01-01

    Background and Aims Genome size and chromosome numbers are important cytological characters that significantly influence various organismal traits. However, geographical representation of these data is seriously unbalanced, with tropical and subtropical regions being largely neglected. In the present study, an investigation was made of chromosomal and genome size variation in the majority of Curcuma species from the Indian subcontinent, and an assessment was made of the value of these data for taxonomic purposes. Methods Genome size of 161 homogeneously cultivated plant samples classified into 51 taxonomic entities was determined by propidium iodide flow cytometry. Chromosome numbers were counted in actively growing root tips using conventional rapid squash techniques. Key Results Six different chromosome counts (2n = 22, 42, 63, >70, 77 and 105) were found, the last two representing new generic records. The 2C-values varied from 1·66 pg in C. vamana to 4·76 pg in C. oligantha, representing a 2·87-fold range. Three groups of taxa with significantly different homoploid genome sizes (Cx-values) and distinct geographical distribution were identified. Five species exhibited intraspecific variation in nuclear DNA content, reaching up to 15·1 % in cultivated C. longa. Chromosome counts and genome sizes of three Curcuma-like species (Hitchenia caulina, Kaempferia scaposa and Paracautleya bhatii) corresponded well with typical hexaploid (2n = 6x = 42) Curcuma spp. Conclusions The basic chromosome number in the majority of Indian taxa (belonging to subgenus Curcuma) is x = 7; published counts correspond to 6x, 9x, 11x, 12x and 15x ploidy levels. Only a few species-specific C-values were found, but karyological and/or flow cytometric data may support taxonomic decisions in some species alliances with morphological similarities. Close evolutionary relationships among some cytotypes are suggested based on the similarity in homoploid genome sizes and geographical grouping

  15. Genome size and phenotypic variation of Nymphaea (Nymphaeaceae species from Eastern Europe and temperate Asia

    Directory of Open Access Journals (Sweden)

    Magdalena Anna Dąbrowska

    2015-07-01

    Full Text Available Despite long-term research, the aquatic genus Nymphaea still possesses major taxonomic challenges. High phenotypic plasticity and possible interspecific hybridization often make it impossible to identify individual specimens. The main aim of this study was to assess phenotypic variation in Nymphaea taxa sampled over a wide area of Eastern Europe and temperate Asia. Samples were identified based on species-specific genome sizes and diagnostic morphological characters for each taxon were then selected. A total of 353 specimens from 32 populations in Poland, Russia and Ukraine were studied, with nine biometric traits being examined. Although some specimens morphologically matched N. ×borealis (a hybrid between N. alba and N. candida according to published determination keys, only one hybrid individual was revealed based on genome size data. Other specimens with intermediate morphology possessed genome size corresponding to N. alba, N. candida or N. tetragona. This indicates that natural hybridization between N. alba and N. candida is not as frequent as previously suggested. Our results also revealed a considerably higher variation in the studied morphological traits (especially the quantitative ones in N. alba and N. candida than reported in the literature. A determination key for the investigated Nymphaea species is provided, based on taxonomically-informative morphological characters identified in our study.

  16. Genome-size Variation in Switchgrass (Panicum virgatum: Flow Cytometry and Cytology Reveal Rampant Aneuploidy

    Directory of Open Access Journals (Sweden)

    Denise E. Costich

    2010-11-01

    Full Text Available Switchgrass ( L., a native perennial dominant of the prairies of North America, has been targeted as a model herbaceous species for biofeedstock development. A flow-cytometric survey of a core set of 11 primarily upland polyploid switchgrass accessions indicated that there was considerable variation in genome size within each accession, particularly at the octoploid (2 = 8 = 72 chromosome ploidy level. Highly variable chromosome counts in mitotic cell preparations indicated that aneuploidy was more common in octoploids (86.3% than tetraploids (23.2%. Furthermore, the incidence of hyper- versus hypoaneuploidy is equivalent in tetraploids. This is clearly not the case in octoploids, where close to 90% of the aneuploid counts are lower than the euploid number. Cytogenetic investigation using fluorescent in situ hybridization (FISH revealed an unexpected degree of variation in chromosome structure underlying the apparent genomic instability of this species. These results indicate that rapid advances in the breeding of polyploid biofuel feedstocks, based on the molecular-genetic dissection of biomass characteristics and yield, will be predicated on the continual improvement of our understanding of the cytogenetics of these species.

  17. Evolutionary and Taxonomic Implications of Variation in Nuclear Genome Size: Lesson from the Grass Genus Anthoxanthum (Poaceae).

    Science.gov (United States)

    Chumová, Zuzana; Krejčíková, Jana; Mandáková, Terezie; Suda, Jan; Trávníček, Pavel

    2015-01-01

    The genus Anthoxanthum (sweet vernal grass, Poaceae) represents a taxonomically intricate polyploid complex with large phenotypic variation and its evolutionary relationships still poorly resolved. In order to get insight into the geographic distribution of ploidy levels and assess the taxonomic value of genome size data, we determined C- and Cx-values in 628 plants representing all currently recognized European species collected from 197 populations in 29 European countries. The flow cytometric estimates were supplemented by conventional chromosome counts. In addition to diploids, we found two low (rare 3x and common 4x) and one high (~16x-18x) polyploid levels. Mean holoploid genome sizes ranged from 5.52 pg in diploid A. alpinum to 44.75 pg in highly polyploid A. amarum, while the size of monoploid genomes ranged from 2.75 pg in tetraploid A. alpinum to 9.19 pg in diploid A. gracile. In contrast to Central and Northern Europe, which harboured only limited cytological variation, a much more complex pattern of genome sizes was revealed in the Mediterranean, particularly in Corsica. Eight taxonomic groups that partly corresponded to traditionally recognized species were delimited based on genome size values and phenotypic variation. Whereas our data supported the merger of A. aristatum and A. ovatum, eastern Mediterranean populations traditionally referred to as diploid A. odoratum were shown to be cytologically distinct, and may represent a new taxon. Autopolyploid origin was suggested for 4x A. alpinum. In contrast, 4x A. odoratum seems to be an allopolyploid, based on the amounts of nuclear DNA. Intraspecific variation in genome size was observed in all recognized species, the most striking example being the A. aristatum/ovatum complex. Altogether, our study showed that genome size can be a useful taxonomic marker in Anthoxathum to not only guide taxonomic decisions but also help resolve evolutionary relationships in this challenging grass genus.

  18. Genome size variation in Corchorus olitorius (Malvaceae s.l.) and its correlation with elevation and phenotypic traits.

    Science.gov (United States)

    Benor, Solomon; Fuchs, Jörg; Blattner, Frank R

    2011-07-01

    In this study, we report genome size variations in Corchorus olitorius L. (Malvaceae s.l.), a crop species known for its morphological plasticity and broad geographical distribution, and Corchorus capsularis L., the second widely cultivated species in the genus. Flow cytometric analyses were conducted with several tissues and nuclei isolation buffers using 69 accessions of C. olitorius and 4 accessions of C. capsularis, representing different habitats and geographical origins. The mean 2C nuclear DNA content (± SD) of C. olitorius was estimated to be 0.918 ± 0.011 pg, with a minimum of 0.882 ± 0.004 pg, and a maximum of 0.942 ± 0.004 pg. All studied plant materials were found to be diploid with 2n = 14. The genome size is negatively correlated with days to flowering (r = -0.29, p genome size and growing elevation (r = 0.59, p genome sizes of C. olitorius and C. capsularis are much smaller, and therewith closer to that of rice. The relatively small genome sizes will be of general advantage for any efforts into genomics or sequencing approaches of these species.

  19. Variation, evolution, and correlation analysis of C+G content and genome or chromosome size in different kingdoms and phyla.

    Science.gov (United States)

    Li, Xiu-Qing; Du, Donglei

    2014-01-01

    C+G content (GC content or G+C content) is known to be correlated with genome/chromosome size in bacteria but the relationship for other kingdoms remains unclear. This study analyzed genome size, chromosome size, and base composition in most of the available sequenced genomes in various kingdoms. Genome size tends to increase during evolution in plants and animals, and the same is likely true for bacteria. The genomic C+G contents were found to vary greatly in microorganisms but were quite similar within each animal or plant subkingdom. In animals and plants, the C+G contents are ranked as follows: monocot plants>mammals>non-mammalian animals>dicot plants. The variation in C+G content between chromosomes within species is greater in animals than in plants. The correlation between average chromosome C+G content and chromosome length was found to be positive in Proteobacteria, Actinobacteria (but not in other analyzed bacterial phyla), Ascomycota fungi, and likely also in some plants; negative in some animals, insignificant in two protist phyla, and likely very weak in Archaea. Clearly, correlations between C+G content and chromosome size can be positive, negative, or not significant depending on the kingdoms/groups or species. Different phyla or species exhibit different patterns of correlation between chromosome-size and C+G content. Most chromosomes within a species have a similar pattern of variation in C+G content but outliers are common. The data presented in this study suggest that the C+G content is under genetic control by both trans- and cis- factors and that the correlation between C+G content and chromosome length can be positive, negative, or not significant in different phyla.

  20. Morphological and genome size variations within populations of Edraianthus graminifolius “Jugoslavicus” (Campanulaceae from the central Balkan peninsula

    Directory of Open Access Journals (Sweden)

    Rakić Tamara

    2014-01-01

    Full Text Available The E. graminifolius complex is widely distributed in the continental part of the central and western Balkan Peninsula and is characterized by pronounced morphological variability. Plants grow on different geological substrates, span a wide altitudinal range and inhabit heterogeneous microclimatic conditions. The aim of this study was to compare morpho-anatomical and genome size variations among 31 populations of E. graminifolius, and to correlate morphoanatomical characteristics of plants with the geomorphologic and bioclimatic characteristics of their habitats. For these purposes, multivariate statistical analyses were performed. Results showed that most of morphological variability could be explained as the adaptive responses of plants to diverse environmental conditions that accompany life at different altitudes. Populations from SE Serbia had larger genome size in respect to other investigated populations. Genome size was bigger in sympatric populations of Edraianthus then in allopatric ones. Apart from the general morphological variability, plants from the Ovčar-Kablar Gorge are particularly morphologically specific. [Projekat Ministarstva nauke Republike Srbije, br. 173030

  1. Geographical distribution of cytotypes in the Chrysanthemum indicum complex as evidenced by ploidy level and genome-size variation

    Institute of Scientific and Technical Information of China (English)

    Jing LI; Qian WAN; Richard J.ABBOTT; Guang-Yuan RAO

    2013-01-01

    A detailed knowledge of the geographical distribution ofcytotypes within and between species comprising a polyploid complex is critical to our understanding of the history and evolution of such complexes.In the present study we examined the geographical distributions ofcytotypes within six tentatively delimited species comprising the Chrysanthemum indicum complex in China.We determined the ploidy of 188 individuals sampled from 47 populations,based on DNA content using flow cytometry.In addition,chromosome counts were made on samples of each taxon.We confirmed that all samples of C.rhombifolium and C.lavandulifolium were diploid (2n =18),those of C.hypargyrum and C.potentilloides were tetraploid (2n--36),and those of C.vestitum were hexaploid (2n =54).In contrast,we confirmed that C.indicum contained both diploid and tetraploid cytotypes.We found that in addition to marked differences in genome size between ploidy levels,there was a variation in genome size between species of the same ploidy level.Although the diploid,tetraploid,and hexaploid taxa of the complex,as well as the diploid form of C.indicum,occurred only in central and northem China,the tetraploid form of C.indicum was widespread both north and south of the Yangtze River.We suggest that the tetraploid form of C.indicum may have expanded its range southward during recent Quatemary glacial periods when forests retreated in south China as conditions became drier.

  2. Genome Size Dynamics and Evolution in Monocots

    Directory of Open Access Journals (Sweden)

    Ilia J. Leitch

    2010-01-01

    Full Text Available Monocot genomic diversity includes striking variation at many levels. This paper compares various genomic characters (e.g., range of chromosome numbers and ploidy levels, occurrence of endopolyploidy, GC content, chromosome packaging and organization, genome size between monocots and the remaining angiosperms to discern just how distinctive monocot genomes are. One of the most notable features of monocots is their wide range and diversity of genome sizes, including the species with the largest genome so far reported in plants. This genomic character is analysed in greater detail, within a phylogenetic context. By surveying available genome size and chromosome data it is apparent that different monocot orders follow distinctive modes of genome size and chromosome evolution. Further insights into genome size-evolution and dynamics were obtained using statistical modelling approaches to reconstruct the ancestral genome size at key nodes across the monocot phylogenetic tree. Such approaches reveal that while the ancestral genome size of all monocots was small (1C=1.9 pg, there have been several major increases and decreases during monocot evolution. In addition, notable increases in the rates of genome size-evolution were found in Asparagales and Poales compared with other monocot lineages.

  3. Sauropod dinosaurs evolved moderately sized genomes unrelated to body size.

    Science.gov (United States)

    Organ, Chris L; Brusatte, Stephen L; Stein, Koen

    2009-12-22

    Sauropodomorph dinosaurs include the largest land animals to have ever lived, some reaching up to 10 times the mass of an African elephant. Despite their status defining the upper range for body size in land animals, it remains unknown whether sauropodomorphs evolved larger-sized genomes than non-avian theropods, their sister taxon, or whether a relationship exists between genome size and body size in dinosaurs, two questions critical for understanding broad patterns of genome evolution in dinosaurs. Here we report inferences of genome size for 10 sauropodomorph taxa. The estimates are derived from a Bayesian phylogenetic generalized least squares approach that generates posterior distributions of regression models relating genome size to osteocyte lacunae volume in extant tetrapods. We estimate that the average genome size of sauropodomorphs was 2.02 pg (range of species means: 1.77-2.21 pg), a value in the upper range of extant birds (mean = 1.42 pg, range: 0.97-2.16 pg) and near the average for extant non-avian reptiles (mean = 2.24 pg, range: 1.05-5.44 pg). The results suggest that the variation in size and architecture of genomes in extinct dinosaurs was lower than the variation found in mammals. A substantial difference in genome size separates the two major clades within dinosaurs, Ornithischia (large genomes) and Saurischia (moderate to small genomes). We find no relationship between body size and estimated genome size in extinct dinosaurs, which suggests that neutral forces did not dominate the evolution of genome size in this group.

  4. Identification of PLCL1 gene for hip bone size variation in females in a genome-wide association study.

    Directory of Open Access Journals (Sweden)

    Yao-Zhong Liu

    Full Text Available Osteoporosis, the most prevalent metabolic bone disease among older people, increases risk for low trauma hip fractures (HF that are associated with high morbidity and mortality. Hip bone size (BS has been identified as one of the key measurable risk factors for HF. Although hip BS is highly genetically determined, genetic factors underlying the trait are still poorly defined. Here, we performed the first genome-wide association study (GWAS of hip BS interrogating approximately 380,000 SNPs on the Affymetrix platform in 1,000 homogeneous unrelated Caucasian subjects, including 501 females and 499 males. We identified a gene, PLCL1 (phospholipase c-like 1, that had four SNPs associated with hip BS at, or approaching, a genome-wide significance level in our female subjects; the most significant SNP, rs7595412, achieved a p value of 3.72x10(-7. The gene's importance to hip BS was replicated using the Illumina genotyping platform in an independent UK cohort containing 1,216 Caucasian females. Two SNPs of the PLCL1 gene, rs892515 and rs9789480, surrounded by the four SNPs identified in our GWAS, achieved p values of 8.62x10(-3 and 2.44x10(-3, respectively, for association with hip BS. Imputation analyses on our GWAS and the UK samples further confirmed the replication signals; eight SNPs of the gene achieved combined imputed p values<10(-5 in the two samples. The PLCL1 gene's relevance to HF was also observed in a Chinese sample containing 403 females, including 266 with HF and 177 control subjects. A SNP of the PLCL1 gene, rs3771362 that is only approximately 0.6 kb apart from the most significant SNP detected in our GWAS (rs7595412, achieved a p value of 7.66x10(-3 (odds ratio = 0.26 for association with HF. Additional biological support for the role of PLCL1 in BS comes from previous demonstrations that the PLCL1 protein inhibits IP3 (inositol 1,4,5-trisphosphate-mediated calcium signaling, an important pathway regulating mechanical sensing of

  5. A multivariate analysis of variation in genome size and endoreduplication in angiosperms reveals strong phylogenetic signal and association with phenotypic traits.

    Science.gov (United States)

    Bainard, Jillian D; Bainard, Luke D; Henry, Thomas A; Fazekas, Aron J; Newmaster, Steven G

    2012-12-01

    Genome size (C-value) and endopolyploidy (endoreduplication index, EI) are known to correlate with various morphological and ecological traits, in addition to phylogenetic placement. A phylogenetically controlled multivariate analysis was used to explore the relationships between DNA content and phenotype in angiosperms. Seeds from 41 angiosperm species (17 families) were grown in a common glasshouse experiment. Genome size (2C-value and 1Cx-value) and EI (in four tissues: leaf, stem, root, petal) were determined using flow cytometry. The phylogenetic signal was calculated for each measure of DNA content, and phylogenetic canonical correlation analysis (PCCA) explored how the variation in genome size and EI was correlated with 18 morphological and ecological traits. Phylogenetic signal (λ) was strongest for EI in all tissues, and λ was stronger for the 2C-value than the 1Cx-value. PCCA revealed that EI was correlated with pollen length, stem height, seed mass, dispersal mechanism, arbuscular mycorrhizal association, life history and flowering time, and EI and genome size were both correlated with stem height and life history. PCCA provided an effective way to explore multiple factors of DNA content variation and phenotypic traits in a phylogenetic context. Traits that were correlated significantly with DNA content were linked to plant competitive ability. © 2012 The Authors. New Phytologist © 2012 New Phytologist Trust.

  6. Structural variations in pig genomes

    NARCIS (Netherlands)

    Paudel, Y.

    2015-01-01

    Abstract Paudel, Y. (2015). Structural variations in pig genomes. PhD thesis, Wageningen University, the Netherlands Structural variations are chromosomal rearrangements such as insertions-deletions (INDELs), duplications, inversions, translocations, and copy number variations (CNVs

  7. Genome size and genome evolution in diploid Triticeae species.

    Science.gov (United States)

    Eilam, T; Anikster, Y; Millet, E; Manisterski, J; Sagi-Assif, O; Feldman, M

    2007-11-01

    One of the intriguing issues concerning the dynamics of plant genomes is the occurrence of intraspecific variation in nuclear DNA amount. The aim of this work was to assess the ranges of intraspecific, interspecific, and intergeneric variation in nuclear DNA content of diploid species of the tribe Triticeae (Poaceae) and to examine the relation between life form or habitat and genome size. Altogether, 438 plants representing 272 lines that belong to 22 species were analyzed. Nuclear DNA content was estimated by flow cytometry. Very small intraspecific variation in DNA amount was found between lines of Triticeae diploid species collected from different habitats or between different morphs. In contrast to the constancy in nuclear DNA amount at the intraspecific level, there are significant differences in genome size between the various diploid species. Within the genus Aegilops, the 1C DNA amount ranged from 4.84 pg in A. caudata to 7.52 pg in A. sharonensis; among genera, the 1C DNA amount ranged from 4.18 pg in Heteranthelium piliferum to 9.45 pg in Secale montanum. No evidence was found for a smaller genome size in annual, self-pollinating species relative to perennial, cross-pollinating ones. Diploids that grow in the southern part of the group's distribution have larger genomes than those growing in other parts of the distribution. The contrast between the low variation at the intraspecific level and the high variation at the interspecific one suggests that changes in genome size originated in close temporal proximity to the speciation event, i.e., before, during, or immediately after it. The possible effects of sudden changes in genome size on speciation processes are discussed.

  8. Copy number variation in the bovine genome

    DEFF Research Database (Denmark)

    Fadista, João; Thomsen, Bo; Holm, Lars-Erik;

    2010-01-01

    to genetic variation in cattle. Results We designed and used a set of NimbleGen CGH arrays that tile across the assayable portion of the cattle genome with approximately 6.3 million probes, at a median probe spacing of 301 bp. This study reports the highest resolution map of copy number variation...... in the cattle genome, with 304 CNV regions (CNVRs) being identified among the genomes of 20 bovine samples from 4 dairy and beef breeds. The CNVRs identified covered 0.68% (22 Mb) of the genome, and ranged in size from 1.7 to 2,031 kb (median size 16.7 kb). About 20% of the CNVs co-localized with segmental...

  9. Evaluating the role of genome downsizing and size thresholds from genome size distributions in angiosperms.

    Science.gov (United States)

    Zenil-Ferguson, Rosana; Ponciano, José M; Burleigh, J Gordon

    2016-07-01

    Whole-genome duplications (WGDs) can rapidly increase genome size in angiosperms. Yet their mean genome size is not correlated with ploidy. We compared three hypotheses to explain the constancy of genome size means across ploidies. The genome downsizing hypothesis suggests that genome size will decrease by a given percentage after a WGD. The genome size threshold hypothesis assumes that taxa with large genomes or large monoploid numbers will fail to undergo or survive WGDs. Finally, the genome downsizing and threshold hypothesis suggests that both genome downsizing and thresholds affect the relationship between genome size means and ploidy. We performed nonparametric bootstrap simulations to compare observed angiosperm genome size means among species or genera against simulated genome sizes under the three different hypotheses. We evaluated the hypotheses using a decision theory approach and estimated the expected percentage of genome downsizing. The threshold hypothesis improves the approximations between mean genome size and simulated genome size. At the species level, the genome downsizing with thresholds hypothesis best explains the genome size means with a 15% genome downsizing percentage. In the genus level simulations, the monoploid number threshold hypothesis best explains the data. Thresholds of genome size and monoploid number added to genome downsizing at species level simulations explain the observed means of angiosperm genome sizes, and monoploid number is important for determining the genome size mean at the genus level. © 2016 Botanical Society of America.

  10. FROG - Fingerprinting Genomic Variation Ontology.

    Directory of Open Access Journals (Sweden)

    E Abinaya

    Full Text Available Genetic variations play a crucial role in differential phenotypic outcomes. Given the complexity in establishing this correlation and the enormous data available today, it is imperative to design machine-readable, efficient methods to store, label, search and analyze this data. A semantic approach, FROG: "FingeRprinting Ontology of Genomic variations" is implemented to label variation data, based on its location, function and interactions. FROG has six levels to describe the variation annotation, namely, chromosome, DNA, RNA, protein, variations and interactions. Each level is a conceptual aggregation of logically connected attributes each of which comprises of various properties for the variant. For example, in chromosome level, one of the attributes is location of variation and which has two properties, allosomes or autosomes. Another attribute is variation kind which has four properties, namely, indel, deletion, insertion, substitution. Likewise, there are 48 attributes and 278 properties to capture the variation annotation across six levels. Each property is then assigned a bit score which in turn leads to generation of a binary fingerprint based on the combination of these properties (mostly taken from existing variation ontologies. FROG is a novel and unique method designed for the purpose of labeling the entire variation data generated till date for efficient storage, search and analysis. A web-based platform is designed as a test case for users to navigate sample datasets and generate fingerprints. The platform is available at http://ab-openlab.csir.res.in/frog.

  11. Clinical Interpretation of Genomic Variations.

    Science.gov (United States)

    Sayitoğlu, Müge

    2016-09-05

    Novel high-throughput sequencing technologies generate large-scale genomic data and are used extensively for disease mapping of monogenic and/or complex disorders, personalized treatment, and pharmacogenomics. Next-generation sequencing is rapidly becoming routine tool for diagnosis and molecular monitoring of patients to evaluate therapeutic efficiency. The next-generation sequencing platforms generate huge amounts of genetic variation data and it remains a challenge to interpret the variations that are identified. Such data interpretation needs close collaboration among bioinformaticians, clinicians, and geneticists. There are several problems that must be addressed, such as the generation of new algorithms for mapping and annotation, harmonization of the terminology, correct use of nomenclature, reference genomes for different populations, rare disease variant databases, and clinical reports.

  12. Child Development and Structural Variation in the Human Genome

    Science.gov (United States)

    Zhang, Ying; Haraksingh, Rajini; Grubert, Fabian; Abyzov, Alexej; Gerstein, Mark; Weissman, Sherman; Urban, Alexander E.

    2013-01-01

    Structural variation of the human genome sequence is the insertion, deletion, or rearrangement of stretches of DNA sequence sized from around 1,000 to millions of base pairs. Over the past few years, structural variation has been shown to be far more common in human genomes than previously thought. Very little is currently known about the effects…

  13. Child Development and Structural Variation in the Human Genome

    Science.gov (United States)

    Zhang, Ying; Haraksingh, Rajini; Grubert, Fabian; Abyzov, Alexej; Gerstein, Mark; Weissman, Sherman; Urban, Alexander E.

    2013-01-01

    Structural variation of the human genome sequence is the insertion, deletion, or rearrangement of stretches of DNA sequence sized from around 1,000 to millions of base pairs. Over the past few years, structural variation has been shown to be far more common in human genomes than previously thought. Very little is currently known about the effects…

  14. Genome size estimation: a new methodology

    Science.gov (United States)

    Álvarez-Borrego, Josué; Gallardo-Escárate, Crisitian; Kober, Vitaly; López-Bonilla, Oscar

    2007-03-01

    Recently, within the cytogenetic analysis, the evolutionary relations implied in the content of nuclear DNA in plants and animals have received a great attention. The first detailed measurements of the nuclear DNA content were made in the early 40's, several years before Watson and Crick proposed the molecular structure of the DNA. In the following years Hewson Swift developed the concept of "C-value" in reference to the haploid phase of DNA in plants. Later Mirsky and Ris carried out the first systematic study of genomic size in animals, including representatives of the five super classes of vertebrates as well as of some invertebrates. From these preliminary results it became evident that the DNA content varies enormously between the species and that this variation does not bear relation to the intuitive notion from the complexity of the organism. Later, this observation was reaffirmed in the following years as the studies increased on genomic size, thus denominating to this characteristic of the organisms like the "Paradox of the C-value". Few years later along with the no-codification discovery of DNA the paradox was solved, nevertheless, numerous questions remain until nowadays unfinished, taking to denominate this type of studies like the "C-value enigma". In this study, we reported a new method for genome size estimation by quantification of fluorescence fading. We measured the fluorescence intensity each 1600 milliseconds in DAPI-stained nuclei. The estimation of the area under the graph (integral fading) during fading period was related with the genome size.

  15. Dynamics of genome size evolution in birds and mammals

    Science.gov (United States)

    Feschotte, Cédric

    2017-01-01

    Genome size in mammals and birds shows remarkably little interspecific variation compared with other taxa. However, genome sequencing has revealed that many mammal and bird lineages have experienced differential rates of transposable element (TE) accumulation, which would be predicted to cause substantial variation in genome size between species. Thus, we hypothesize that there has been covariation between the amount of DNA gained by transposition and lost by deletion during mammal and avian evolution, resulting in genome size equilibrium. To test this model, we develop computational methods to quantify the amount of DNA gained by TE expansion and lost by deletion over the last 100 My in the lineages of 10 species of eutherian mammals and 24 species of birds. The results reveal extensive variation in the amount of DNA gained via lineage-specific transposition, but that DNA loss counteracted this expansion to various extents across lineages. Our analysis of the rate and size spectrum of deletion events implies that DNA removal in both mammals and birds has proceeded mostly through large segmental deletions (>10 kb). These findings support a unified “accordion” model of genome size evolution in eukaryotes whereby DNA loss counteracting TE expansion is a major determinant of genome size. Furthermore, we propose that extensive DNA loss, and not necessarily a dearth of TE activity, has been the primary force maintaining the greater genomic compaction of flying birds and bats relative to their flightless relatives. PMID:28179571

  16. Genome size increases in recently diverged hornwort clades.

    Science.gov (United States)

    Bainard, Jillian D; Villarreal, Juan Carlos

    2013-08-01

    As our knowledge of plant genome size estimates continues to grow, one group has continually been neglected: the hornworts. Hornworts (Anthocerotophyta) have been traditionally grouped with liverworts and mosses because they share a haploid dominant life cycle; however, recent molecular studies place hornworts as the sister lineage to extant tracheophytes. Given the scarcity of information regarding the DNA content of hornworts, our objective was to estimate the 1C-value for a range of hornwort species within a phylogenetic context. Using flow cytometry, we estimated genome size for 36 samples representing 24 species. This accounts for roughly 10% of known hornwort species. Haploid genome sizes (1C-value) ranged from 160 Mbp or 0.16 pg (Leiosporoceros dussii) to 719 Mbp or 0.73 pg (Nothoceros endiviifolius). The average 1C-value was 261 ± 104 Mbp (0.27 ± 0.11 pg). Ancestral reconstruction of genome size on a hornwort phylogeny suggests a small ancestral genome size and revealed increases in genome size in the most recently divergent clades. Much more work is needed to understand DNA content variation in this phylogenetically important group, but this work has significantly increased our knowledge of genome size variation in hornworts.

  17. Genome size evolution in pufferfish: an insight from BAC clone-based Diodon holocanthus genome sequencing

    Directory of Open Access Journals (Sweden)

    Gan Xiaoni

    2010-06-01

    Full Text Available Abstract Background Variations in genome size within and between species have been observed since the 1950 s in diverse taxonomic groups. Serving as model organisms, smooth pufferfish possess the smallest vertebrate genomes. Interestingly, spiny pufferfish from its sister family have genome twice as large as smooth pufferfish. Therefore, comparative genomic analysis between smooth pufferfish and spiny pufferfish is useful for our understanding of genome size evolution in pufferfish. Results Ten BAC clones of a spiny pufferfish Diodon holocanthus were randomly selected and shotgun sequenced. In total, 776 kb of non-redundant sequences without gap representing 0.1% of the D. holocanthus genome were identified, and 77 distinct genes were predicted. In the sequenced D. holocanthus genome, 364 kb is homologous with 265 kb of the Takifugu rubripes genome, and 223 kb is homologous with 148 kb of the Tetraodon nigroviridis genome. The repetitive DNA accounts for 8% of the sequenced D. holocanthus genome, which is higher than that in the T. rubripes genome (6.89% and that in the Te. nigroviridis genome (4.66%. In the repetitive DNA, 76% is retroelements which account for 6% of the sequenced D. holocanthus genome and belong to known families of transposable elements. More than half of retroelements were distributed within genes. In the non-homologous regions, repeat element proportion in D. holocanthus genome increased to 10.6% compared with T. rubripes and increased to 9.19% compared with Te. nigroviridis. A comparison of 10 well-defined orthologous genes showed that the average intron size (566 bp in D. holocanthus genome is significantly longer than that in the smooth pufferfish genome (435 bp. Conclusion Compared with the smooth pufferfish, D. holocanthus has a low gene density and repeat elements rich genome. Genome size variation between D. holocanthus and the smooth pufferfish exhibits as length variation between homologous region and different

  18. Genome size and longevity in fish.

    Science.gov (United States)

    Griffith, O L; Moodie, G E E; Civetta, A

    2003-03-01

    The wide variety of genome sizes (measured as C-value) observed across taxa is not related to organismal complexity or number of coding genes. Partial answers to this C-value enigma have been found by establishing associations between C-value and particular phenotypic characteristics. One such controversial association has been recently suggested between genome size and longevity in birds. In order to determine whether genome size is a general predictor of longevity, we have extended the analysis to the Actinoptergyian fish, a widely divergent group in terms of both longevity and genome size. We collected data on genome size, longevity and body mass for species covering fourteen orders of bony fish. Analysis of covariance using order as a cofactor shows a significant effect of genome size on longevity (corrected for body mass), with lifespan increasing as a function of genome size. Analysis of phylogenetically independent contrasts for orders with a large number of species with a well resolved phylogenetic relationship (Acipenseriformes, Cypriniformes, and Salmoniformes) found the same trend of longer lifespan with increases in genome size but the relationship was not significant. Our results consistently show an increase in lifespan for fish with larger genomes.

  19. From genomic variation to personalized medicine

    DEFF Research Database (Denmark)

    Wesolowska, Agata; Schmiegelow, Kjeld

    Genomic variation is the basis of interindividual differences in observable traits and disease susceptibility. Genetic studies are the driving force of personalized medicine, as many of the differences in treatment efficacy can be attributed to our genomic background. The rapid development of nex...... alternative to data-driven genome-wide association studies. Finally, the findings of the presented studies set new directions for future pharmacognenetic investigations and provide a framework for future implementation of personalized medicine.......Genomic variation is the basis of interindividual differences in observable traits and disease susceptibility. Genetic studies are the driving force of personalized medicine, as many of the differences in treatment efficacy can be attributed to our genomic background. The rapid development...... the thesis and includes some final remarks on the perspectives of genomic variation research and personalized medicine. In summary, this thesis demonstrates the feasibility of integrative analyses of genomic variations and introduces large-scale hypothesis-driven SNP exploration studies as an emerging...

  20. Genome size analyses of Pucciniales reveal the largest fungal genomes

    Directory of Open Access Journals (Sweden)

    Silvia eTavares

    2014-08-01

    Full Text Available Rust fungi (Basidiomycota, Pucciniales are biotrophic plant pathogens which exhibit diverse complexities in their life cycles and host ranges. The completion of genome sequencing of a few rust fungi has revealed the occurrence of large genomes. Sequencing efforts for other rust fungi have been hampered by uncertainty concerning their genome sizes. Flow cytometry was recently applied to estimate the genome size of a few rust fungi, and confirmed the occurrence of large genomes in this order (averaging 151.5 Mbp, while the average for Basidiomycota was 49.9 Mbp and was 37.7 Mbp for all fungi. In this work, we have used an innovative and simple approach to simultaneously isolate nuclei from the rust and its host plant in order to estimate the genome size of 30 rust species by flow cytometry. Genome sizes varied over 10-fold, from 70 to 893 Mbp, with an average genome size value of 380.2 Mbp. Compared to the genome sizes of over 1,800 fungi, Gymnosporangium confusum possesses the largest fungal genome ever reported (893.2 Mbp. Moreover, even the smallest rust genome determined in this study is larger than the vast majority of fungal genomes (94 %. The average genome size of the Pucciniales is now of 305.5 Mbp, while the average Basidiomycota genome size has shifted to 70.4 Mbp and the average for all fungi reached 44.2 Mbp. Despite the fact that no correlation could be drawn between the genome sizes, the phylogenomics or the life cycle of rust fungi, it is interesting to note that rusts with Fabaceae hosts present genomes clearly larger than those with Poaceae hosts. Although this study comprises only a small fraction of the more than 7,000 rust species described, it seems already evident that the Pucciniales represent a group where genome size expansion could be a common characteristic. This is in sharp contrast to sister taxa, placing this order in a relevant position in fungal genomics research.

  1. Evolution of genome size and complexity in Pinus.

    Directory of Open Access Journals (Sweden)

    Alison M Morse

    Full Text Available BACKGROUND: Genome evolution in the gymnosperm lineage of seed plants has given rise to many of the most complex and largest plant genomes, however the elements involved are poorly understood. METHODOLOGY/PRINCIPAL FINDINGS: Gymny is a previously undescribed retrotransposon family in Pinus that is related to Athila elements in Arabidopsis. Gymny elements are dispersed throughout the modern Pinus genome and occupy a physical space at least the size of the Arabidopsis thaliana genome. In contrast to previously described retroelements in Pinus, the Gymny family was amplified or introduced after the divergence of pine and spruce (Picea. If retrotransposon expansions are responsible for genome size differences within the Pinaceae, as they are in angiosperms, then they have yet to be identified. In contrast, molecular divergence of Gymny retrotransposons together with other families of retrotransposons can account for the large genome complexity of pines along with protein-coding genic DNA, as revealed by massively parallel DNA sequence analysis of Cot fractionated genomic DNA. CONCLUSIONS/SIGNIFICANCE: Most of the enormous genome complexity of pines can be explained by divergence of retrotransposons, however the elements responsible for genome size variation are yet to be identified. Genomic resources for Pinus including those reported here should assist in further defining whether and how the roles of retrotransposons differ in the evolution of angiosperm and gymnosperm genomes.

  2. Metabolic 'engines' of flight drive genome size reduction in birds.

    Science.gov (United States)

    Wright, Natalie A; Gregory, T Ryan; Witt, Christopher C

    2014-03-22

    The tendency for flying organisms to possess small genomes has been interpreted as evidence of natural selection acting on the physical size of the genome. Nonetheless, the flight-genome link and its mechanistic basis have yet to be well established by comparative studies within a volant clade. Is there a particular functional aspect of flight such as brisk metabolism, lift production or maneuverability that impinges on the physical genome? We measured genome sizes, wing dimensions and heart, flight muscle and body masses from a phylogenetically diverse set of bird species. In phylogenetically controlled analyses, we found that genome size was negatively correlated with relative flight muscle size and heart index (i.e. ratio of heart to body mass), but positively correlated with body mass and wing loading. The proportional masses of the flight muscles and heart were the most important parameters explaining variation in genome size in multivariate models. Hence, the metabolic intensity of powered flight appears to have driven genome size reduction in birds.

  3. On the Relationship between Pollen Size and Genome Size

    Directory of Open Access Journals (Sweden)

    Charles A. Knight

    2010-01-01

    Full Text Available Here we test whether genome size is a predictor of pollen size. If it were, inferences of ancient genome size would be possible using the abundant paleo-palynolgical record. We performed regression analyses across 464 species of pollen width and genome size. We found a significant positive trend. However, regression analysis using phylogentically independent contrasts did not support the correlated evolution of these traits. Instead, a large split between angiosperms and gymnosperms for both pollen width and genome size was revealed. Sister taxa were not more likely to show a positive contrast when compared to deeper nodes. However, significantly more congeneric species had a positive trend than expected by chance. These results may reflect the strong selection pressure for pollen to be small. Also, because pollen grains are not metabolically active when measured, their biology is different than other cells which have been shown to be strongly related to genome size, such as guard cells. Our findings contrast with previously published research. It was our hope that pollen size could be used as a proxy for inferring the genome size of ancient species. However, our results suggest pollen is not a good candidate for such endeavors.

  4. Genome evolution of ferns: evidence for relative stasis of genome size across the fern phylogeny.

    Science.gov (United States)

    Clark, James; Hidalgo, Oriane; Pellicer, Jaume; Liu, Hongmei; Marquardt, Jeannine; Robert, Yannis; Christenhusz, Maarten; Zhang, Shouzhou; Gibby, Mary; Leitch, Ilia J; Schneider, Harald

    2016-05-01

    The genome evolution of ferns has been considered to be relatively static compared with angiosperms. In this study, we analyse genome size data and chromosome numbers in a phylogenetic framework to explore three hypotheses: the correlation of genome size and chromosome number, the origin of modern ferns from ancestors with high chromosome numbers, and the occurrence of several whole-genome duplications during the evolution of ferns. To achieve this, we generated new genome size data, increasing the percentage of fern species with genome sizes estimated to 2.8% of extant diversity, and ensuring a comprehensive phylogenetic coverage including at least three species from each fern order. Genome size was correlated with chromosome number across all ferns despite some substantial variation in both traits. We observed a trend towards conservation of the amount of DNA per chromosome, although Osmundaceae and Psilotaceae have substantially larger chromosomes. Reconstruction of the ancestral genome traits suggested that the earliest ferns were already characterized by possessing high chromosome numbers and that the earliest divergences in ferns were correlated with substantial karyological changes. Evidence for repeated whole-genome duplications was found across the phylogeny. Fern genomes tend to evolve slowly, albeit genome rearrangements occur in some clades. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.

  5. Genome size evolution in Ontario ferns (Polypodiidae): evolutionary correlations with cell size, spore size, and habitat type and an absence of genome downsizing.

    Science.gov (United States)

    Henry, Thomas A; Bainard, Jillian D; Newmaster, Steven G

    2014-10-01

    Genome size is known to correlate with a number of traits in angiosperms, but less is known about the phenotypic correlates of genome size in ferns. We explored genome size variation in relation to a suite of morphological and ecological traits in ferns. Thirty-six fern taxa were collected from wild populations in Ontario, Canada. 2C DNA content was measured using flow cytometry. We tested for genome downsizing following polyploidy using a phylogenetic comparative analysis to explore the correlation between 1Cx DNA content and ploidy. There was no compelling evidence for the occurrence of widespread genome downsizing during the evolution of Ontario ferns. The relationship between genome size and 11 morphological and ecological traits was explored using a phylogenetic principal component regression analysis. Genome size was found to be significantly associated with cell size, spore size, spore type, and habitat type. These results are timely as past and recent studies have found conflicting support for the association between ploidy/genome size and spore size in fern polyploid complexes; this study represents the first comparative analysis of the trend across a broad taxonomic group of ferns.

  6. Adaptive and nonadaptive genome size evolution in Karst endemic flora of China.

    Science.gov (United States)

    Kang, Ming; Tao, Junjie; Wang, Jing; Ren, Chen; Qi, Qingwen; Xiang, Qiu-Yun; Huang, Hongwen

    2014-06-01

    Genome size variation is of fundamental biological importance and has been a longstanding puzzle in evolutionary biology. Several hypotheses for genome size evolution including neutral, maladaptive, and adaptive models have been proposed, but the relative importance of these models remains controversial. Primulina is a genus that is highly diversified in the Karst region of southern China, where genome size variation and the underlying evolutionary mechanisms are poorly understood. We reconstructed the phylogeny of Primulina using DNA sequences for 104 species and determined the genome sizes of 101 species. We examined the phylogenetic signal in genome size variation, and tested the fit to different evolutionary models and for correlations with variation in latitude and specific leaf area (SLA). The results showed that genome size, SLA and latitudinal variation all displayed strong phylogenetic signals, but were best explained by different evolutionary models. Furthermore, significant positive relationships were detected between genome size and SLA and between genome size and latitude. Our study is the first to investigate genome size evolution on such a comprehensive scale and in the Karst region flora. We conclude that genome size in Primulina is phylogenetically conserved but its variation among species is a combined outcome of both neutral and adaptive evolution.

  7. The evolution of genome size in ants

    Directory of Open Access Journals (Sweden)

    Spagna Joseph C

    2008-02-01

    Full Text Available Abstract Background Despite the economic and ecological importance of ants, genomic tools for this family (Formicidae remain woefully scarce. Knowledge of genome size, for example, is a useful and necessary prerequisite for the development of many genomic resources, yet it has been reported for only one ant species (Solenopsis invicta, and the two published estimates for this species differ by 146.7 Mb (0.15 pg. Results Here, we report the genome size for 40 species of ants distributed across 10 of the 20 currently recognized subfamilies, thus making Formicidae the 4th most surveyed insect family and elevating the Hymenoptera to the 5th most surveyed insect order. Our analysis spans much of the ant phylogeny, from the less derived Amblyoponinae and Ponerinae to the more derived Myrmicinae, Formicinae and Dolichoderinae. We include a number of interesting and important taxa, including the invasive Argentine ant (Linepithema humile, Neotropical army ants (genera Eciton and Labidus, trapjaw ants (Odontomachus, fungus-growing ants (Apterostigma, Atta and Sericomyrmex, harvester ants (Messor, Pheidole and Pogonomyrmex, carpenter ants (Camponotus, a fire ant (Solenopsis, and a bulldog ant (Myrmecia. Our results show that ants possess small genomes relative to most other insects, yet genome size varies three-fold across this insect family. Moreover, our data suggest that two whole-genome duplications may have occurred in the ancestors of the modern Ectatomma and Apterostigma. Although some previous studies of other taxa have revealed a relationship between genome size and body size, our phylogenetically-controlled analysis of this correlation did not reveal a significant relationship. Conclusion This is the first analysis of genome size in ants (Formicidae and the first across multiple species of social insects. We show that genome size is a variable trait that can evolve gradually over long time spans, as well as rapidly, through processes that may

  8. Cell size, genome size and the dominance of Angiosperms

    Science.gov (United States)

    Simonin, K. A.; Roddy, A. B.

    2016-12-01

    Angiosperms are capable of maintaining the highest rates of photosynthetic gas exchange of all land plants. High rates of photosynthesis depends mechanistically both on efficiently transporting water to the sites of evaporation in the leaf and on regulating the loss of that water to the atmosphere as CO2 diffuses into the leaf. Angiosperm leaves are unique in their ability to sustain high fluxes of liquid and vapor phase water transport due to high vein densities and numerous, small stomata. Despite the ubiquity of studies characterizing the anatomical and physiological adaptations that enable angiosperms to maintain high rates of photosynthesis, the underlying mechanism explaining why they have been able to develop such high leaf vein densities, and such small and abundant stomata, is still incomplete. Here we ask whether the scaling of genome size and cell size places a fundamental constraint on the photosynthetic metabolism of land plants, and whether genome downsizing among the angiosperms directly contributed to their greater potential and realized primary productivity relative to the other major groups of terrestrial plants. Using previously published data we show that a single relationship can predict guard cell size from genome size across the major groups of terrestrial land plants (e.g. angiosperms, conifers, cycads and ferns). Similarly, a strong positive correlation exists between genome size and both stomatal density and vein density that together ultimately constrains maximum potential (gs, max) and operational stomatal conductance (gs, op). Further the difference in the slopes describing the covariation between genome size and both gs, max and gs, op suggests that genome downsizing brings gs, op closer to gs, max. Taken together the data presented here suggests that the smaller genomes of angiosperms allow their final cell sizes to vary more widely and respond more directly to environmental conditions and in doing so bring operational photosynthetic

  9. Genome sizes for all genera of Cycadales.

    Science.gov (United States)

    Zonneveld, B J M

    2012-01-01

    Nuclear DNA content (2C) is reported for all genera of the Cycadales, using flow cytometry with propidium iodide. Nuclear DNA content ranges from 24 to 64 pg in cycads. This implies that the largest genome contains roughly 40 × 10(9) more base pairs than the smallest genome. The narrow range in nuclear DNA content within a genus is remarkable for such an old group. Furthermore, 42 of the 58 plants measured, covering five genera, have 18 chromosomes. They vary from 36.1 to 64.7 pg, covering the whole range of genome sizes (excluding the genome of Cycas). Hence, their does not seem to be a correlation between genome size and the number of chromosomes.

  10. Genomics technologies to study structural variations in the grapevine genome

    Directory of Open Access Journals (Sweden)

    Cardone Maria Francesca

    2016-01-01

    Full Text Available Grapevine is one of the most important crop plants in the world. Recently there was great expansion of genomics resources about grapevine genome, thus providing increasing efforts for molecular breeding. Current cultivars display a great level of inter-specific differentiation that needs to be investigated to reach a comprehensive understanding of the genetic basis of phenotypic differences, and to find responsible genes selected by cross breeding programs. While there have been significant advances in resolving the pattern and nature of single nucleotide polymorphisms (SNPs on plant genomes, few data are available on copy number variation (CNV. Furthermore association between structural variations and phenotypes has been described in only a few cases. We combined high throughput biotechnologies and bioinformatics tools, to reveal the first inter-varietal atlas of structural variation (SV for the grapevine genome. We sequenced and compared four table grape cultivars with the Pinot noir inbred line PN40024 genome as the reference. We detected roughly 8% of the grapevine genome affected by genomic variations. Taken into account phenotypic differences existing among the studied varieties we performed comparison of SVs among them and the reference and next we performed an in-depth analysis of gene content of polymorphic regions. This allowed us to identify genes showing differences in copy number as putative functional candidates for important traits in grapevine cultivation.

  11. Size Matters: Individual Variation in Ectotherm Growth and Asymptotic Size

    Science.gov (United States)

    King, Richard B.

    2016-01-01

    Body size, and, by extension, growth has impacts on physiology, survival, attainment of sexual maturity, fecundity, generation time, and population dynamics, especially in ectotherm animals that often exhibit extensive growth following attainment of sexual maturity. Frequently, growth is analyzed at the population level, providing useful population mean growth parameters but ignoring individual variation that is also of ecological and evolutionary significance. Our long-term study of Lake Erie Watersnakes, Nerodia sipedon insularum, provides data sufficient for a detailed analysis of population and individual growth. We describe population mean growth separately for males and females based on size of known age individuals (847 captures of 769 males, 748 captures of 684 females) and annual growth increments of individuals of unknown age (1,152 males, 730 females). We characterize individual variation in asymptotic size based on repeated measurements of 69 males and 71 females that were each captured in five to nine different years. The most striking result of our analyses is that asymptotic size varies dramatically among individuals, ranging from 631–820 mm snout-vent length in males and from 835–1125 mm in females. Because female fecundity increases with increasing body size, we explore the impact of individual variation in asymptotic size on lifetime reproductive success using a range of realistic estimates of annual survival. When all females commence reproduction at the same age, lifetime reproductive success is greatest for females with greater asymptotic size regardless of annual survival. But when reproduction is delayed in females with greater asymptotic size, lifetime reproductive success is greatest for females with lower asymptotic size when annual survival is low. Possible causes of individual variation in asymptotic size, including individual- and cohort-specific variation in size at birth and early growth, warrant further investigation. PMID

  12. Nitrogen limitation as a driver of genome size evolution in a group of karst plants

    OpenAIRE

    Kang, Ming; Wang, Jing; Huang, Hongwen

    2015-01-01

    Genome size is of fundamental biological importance with significance in predicting structural and functional attributes of organisms. Although abundant evidence has shown that the genome size can be largely explained by differential proliferation and removal of non-coding DNA of the genome, the evolutionary and ecological basis of genome size variation remains poorly understood. Nitrogen (N) and phosphorus (P) are essential elements of DNA and protein building blocks, yet often subject to en...

  13. Size variation in Middle Pleistocene humans.

    Science.gov (United States)

    Arsuaga, J L; Carretero, J M; Lorenzo, C; Gracia, A; Martínez, I; Bermúdez de Castro, J M; Carbonell, E

    1997-08-22

    It has been suggested that European Middle Pleistocene humans, Neandertals, and prehistoric modern humans had a greater sexual dimorphism than modern humans. Analysis of body size variation and cranial capacity variation in the large sample from the Sima de los Huesos site in Spain showed instead that the sexual dimorphism is comparable in Middle Pleistocene and modern populations.

  14. Genome Size and Variation Analysis of Mango (Mangifera indica L.) Germplasms in Yunnan by Flow Cytometry%云南芒果种质基因组大小测定与变异分析

    Institute of Scientific and Technical Information of China (English)

    柳觐; 李开雄; 孔广红; 倪书邦

    2015-01-01

    为了解云南芒果(Mangifera indica L.)种质资源的基因组的变异情况,采用流式细胞术对35份云南芒果种质资源的基因组大小进行了测定和变异分析。结果表明,云南芒果种质资源的基因组大小存在一定差异,基因组的平均C值是0.445110 pg,0.4353177×109 bp,最小的是采自景洪的半栽培种YSM-44(0.434567 pg,0.4250060×109 bp),最大的是采自红河的野生种YSM-25(0.458679 pg,0.4485881×109 bp)。基因组C值变异程度最大的是野生种(CV=1.65%),其次为半野生种(CV=1.26%)、半栽培种(CV=1.21%)和栽培种(CV=0.11%)。与芒果具有相近基因组大小的多为苔藓植物,与“C值悖论”观点相一致。因此,应用流式细胞术能准确、快捷地测定芒果基因组大小,而且云南野生、半野生及半栽培芒果种质资源遗传变异类型丰富,有较大的挖掘利用潜力。%In order to understand the variation of mango (Mangifera indica L.) germplasms in Yunnan, the genome size of 35 germplasms was determined by lfow cytometry and their variation was analyzed. The results showed that the mean genome size among the 35 germplasms was 0.445110 pg and 0.4353177×109 bp, which the minimum one (0.434567 pg, 0.4250060×109 bp) was YSM-44 from Jinghong, and the maximum one (0.458679 pg, 0.44485881×109 bp) was YSM-25 from Honghe. The genome size variation of wild germplasms was the largest (CV=1.65%), followed by semi-wild germplasms (CV=1.26%), semi-cultivated germplasms (CV=1.21%) and cultivated germplasms (CV=0.11%). The bryophytes had similar genome size to mango, which is consistent with the“C-value paradox”theory. Therefore, lfow cytometry method could accurately and fastly measure genome size of mango, and the genetic variation in wild, semi-wild and semi-cultivated germplasms was rich, these could be used for mango breeding.

  15. Insights into structural variations and genome rearrangements in prokaryotic genomes.

    Science.gov (United States)

    Periwal, Vinita; Scaria, Vinod

    2015-01-01

    Structural variations (SVs) are genomic rearrangements that affect fairly large fragments of DNA. Most of the SVs such as inversions, deletions and translocations have been largely studied in context of genetic diseases in eukaryotes. However, recent studies demonstrate that genome rearrangements can also have profound impact on prokaryotic genomes, leading to altered cell phenotype. In contrast to single-nucleotide variations, SVs provide a much deeper insight into organization of bacterial genomes at a much better resolution. SVs can confer change in gene copy number, creation of new genes, altered gene expression and many other functional consequences. High-throughput technologies have now made it possible to explore SVs at a much refined resolution in bacterial genomes. Through this review, we aim to highlight the importance of the less explored field of SVs in prokaryotic genomes and their impact. We also discuss its potential applicability in the emerging fields of synthetic biology and genome engineering where targeted SVs could serve to create sophisticated and accurate genome editing.

  16. Genomic Sequence Variation Markup Language (GSVML).

    Science.gov (United States)

    Nakaya, Jun; Kimura, Michio; Hiroi, Kaei; Ido, Keisuke; Yang, Woosung; Tanaka, Hiroshi

    2010-02-01

    With the aim of making good use of internationally accumulated genomic sequence variation data, which is increasing rapidly due to the explosive amount of genomic research at present, the development of an interoperable data exchange format and its international standardization are necessary. Genomic Sequence Variation Markup Language (GSVML) will focus on genomic sequence variation data and human health applications, such as gene based medicine or pharmacogenomics. We developed GSVML through eight steps, based on case analysis and domain investigations. By focusing on the design scope to human health applications and genomic sequence variation, we attempted to eliminate ambiguity and to ensure practicability. We intended to satisfy the requirements derived from the use case analysis of human-based clinical genomic applications. Based on database investigations, we attempted to minimize the redundancy of the data format, while maximizing the data covering range. We also attempted to ensure communication and interface ability with other Markup Languages, for exchange of omics data among various omics researchers or facilities. The interface ability with developing clinical standards, such as the Health Level Seven Genotype Information model, was analyzed. We developed the human health-oriented GSVML comprising variation data, direct annotation, and indirect annotation categories; the variation data category is required, while the direct and indirect annotation categories are optional. The annotation categories contain omics and clinical information, and have internal relationships. For designing, we examined 6 cases for three criteria as human health application and 15 data elements for three criteria as data formats for genomic sequence variation data exchange. The data format of five international SNP databases and six Markup Languages and the interface ability to the Health Level Seven Genotype Model in terms of 317 items were investigated. GSVML was developed as

  17. Big Data Analysis of Human Genome Variations

    KAUST Repository

    Gojobori, Takashi

    2016-01-25

    Since the human genome draft sequence was in public for the first time in 2000, genomic analyses have been intensively extended to the population level. The following three international projects are good examples for large-scale studies of human genome variations: 1) HapMap Data (1,417 individuals) (http://hapmap.ncbi.nlm.nih.gov/downloads/genotypes/2010-08_phaseII+III/forward/), 2) HGDP (Human Genome Diversity Project) Data (940 individuals) (http://www.hagsc.org/hgdp/files.html), 3) 1000 genomes Data (2,504 individuals) http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/ If we can integrate all three data into a single volume of data, we should be able to conduct a more detailed analysis of human genome variations for a total number of 4,861 individuals (= 1,417+940+2,504 individuals). In fact, we successfully integrated these three data sets by use of information on the reference human genome sequence, and we conducted the big data analysis. In particular, we constructed a phylogenetic tree of about 5,000 human individuals at the genome level. As a result, we were able to identify clusters of ethnic groups, with detectable admixture, that were not possible by an analysis of each of the three data sets. Here, we report the outcome of this kind of big data analyses and discuss evolutionary significance of human genomic variations. Note that the present study was conducted in collaboration with Katsuhiko Mineta and Kosuke Goto at KAUST.

  18. Structural variation in two human genomes mapped at single-nucleotide resolution by whole genome de novo assembly

    DEFF Research Database (Denmark)

    Li, Yingrui; Zheng, Hancheng; Luo, Ruibang

    2011-01-01

    Here we use whole-genome de novo assembly of second-generation sequencing reads to map structural variation (SV) in an Asian genome and an African genome. Our approach identifies small- and intermediate-size homozygous variants (1-50 kb) including insertions, deletions, inversions and their precise...

  19. Comparative genomics of brain size evolution

    OpenAIRE

    Enard, Wolfgang

    2014-01-01

    Which genetic changes took place during mammalian, primate and human evolution to build a larger brain? To answer this question, one has to correlate genetic changes with brain size changes across a phylogeny. Such a comparative genomics approach provides unique information to better understand brain evolution and brain development. However, its statistical power is limited for example due to the limited number of species, the presumably complex genetics of brain size evolution and the large ...

  20. Comparative genomics of brain size evolution

    OpenAIRE

    2014-01-01

    Which genetic changes took place during mammalian, primate and human evolution to build a larger brain? To answer this question, one has to correlate genetic changes with brain size changes across a phylogeny. Such a comparative genomics approach provides unique information to better understand brain evolution and brain development. However, its statistical power is limited for example due to the limited number of species, the presumably complex genetics of brain size evolution and the large ...

  1. Four loci explain 83% of size variation in the horse.

    Directory of Open Access Journals (Sweden)

    Shokouh Makvandi-Nejad

    Full Text Available Horse body size varies greatly due to intense selection within each breed. American Miniatures are less than one meter tall at the withers while Shires and Percherons can exceed two meters. The genetic basis for this variation is not known. We hypothesize that the breed population structure of the horse should simplify efforts to identify genes controlling size. In support of this, here we show with genome-wide association scans (GWAS that genetic variation at just four loci can explain the great majority of horse size variation. Unlike humans, which are naturally reproducing and possess many genetic variants with weak effects on size, we show that horses, like other domestic mammals, carry just a small number of size loci with alleles of large effect. Furthermore, three of our horse size loci contain the LCORL, HMGA2 and ZFAT genes that have previously been found to control human height. The LCORL/NCAPG locus is also implicated in cattle growth and HMGA2 is associated with dog size. Extreme size diversification is a hallmark of domestication. Our results in the horse, complemented by the prior work in cattle and dog, serve to pinpoint those very few genes that have played major roles in the rapid evolution of size during domestication.

  2. Genome size, karyotype polymorphism and chromosomal evolution in Trypanosoma cruzi.

    Directory of Open Access Journals (Sweden)

    Renata T Souza

    Full Text Available BACKGROUND: The Trypanosoma cruzi genome was sequenced from a hybrid strain (CL Brener. However, high allelic variation and the repetitive nature of the genome have prevented the complete linear sequence of chromosomes being determined. Determining the full complement of chromosomes and establishing syntenic groups will be important in defining the structure of T. cruzi chromosomes. A large amount of information is now available for T. cruzi and Trypanosoma brucei, providing the opportunity to compare and describe the overall patterns of chromosomal evolution in these parasites. METHODOLOGY/PRINCIPAL FINDINGS: The genome sizes, repetitive DNA contents, and the numbers and sizes of chromosomes of nine strains of T. cruzi from four lineages (TcI, TcII, TcV and TcVI were determined. The genome of the TcI group was statistically smaller than other lineages, with the exception of the TcI isolate Tc1161 (José-IMT. Satellite DNA content was correlated with genome size for all isolates, but this was not accompanied by simultaneous amplification of retrotransposons. Regardless of chromosomal polymorphism, large syntenic groups are conserved among T. cruzi lineages. Duplicated chromosome-sized regions were identified and could be retained as paralogous loci, increasing the dosage of several genes. By comparing T. cruzi and T. brucei chromosomes, homologous chromosomal regions in T. brucei were identified. Chromosomes Tb9 and Tb11 of T. brucei share regions of syntenic homology with three and six T. cruzi chromosomal bands, respectively. CONCLUSIONS: Despite genome size variation and karyotype polymorphism, T. cruzi lineages exhibit conservation of chromosome structure. Several syntenic groups are conserved among all isolates analyzed in this study. The syntenic regions are larger than expected if rearrangements occur randomly, suggesting that they are conserved owing to positive selection. Mapping of the syntenic regions on T. cruzi chromosomal bands

  3. Genome Size in Diploids, Allopolyploids, and Autopolyploids of Mediterranean Triticeae

    Directory of Open Access Journals (Sweden)

    T. Eilam

    2010-01-01

    Full Text Available Nuclear DNA amount, determined by the flow cytometry method, in diploids, natural and synthetic allopolyploids, and natural and synthetic autopolyploids of the tribe Triticeae (Poaceae is reviewed here and discussed. In contrast to the very small and nonsignificant variation in nuclear DNA amount that was found at the intraspecific level, the variation at the interspecific level is very large. Evidently changes in genome size are either the cause or the result of speciation. Typical autopolyploids had the expected additive DNA amount of their diploid parents, whereas natural and synthetic cytologically diploidized autopolyploids and natural and synthetic allopolyploids had significantly less DNA than the sum of their parents. Thus, genome downsizing, occurring during or immediately after the formation of these polyploids, provides the physical basis for their cytological diploidization, that is, diploid-like meiotic behavior. Possible mechanisms that are involved in genome downsizing and the biological significance of this phenomenon are discussed.

  4. GFVO: the Genomic Feature and Variation Ontology.

    Science.gov (United States)

    Baran, Joachim; Durgahee, Bibi Sehnaaz Begum; Eilbeck, Karen; Antezana, Erick; Hoehndorf, Robert; Dumontier, Michel

    2015-01-01

    Falling costs in genomic laboratory experiments have led to a steady increase of genomic feature and variation data. Multiple genomic data formats exist for sharing these data, and whilst they are similar, they are addressing slightly different data viewpoints and are consequently not fully compatible with each other. The fragmentation of data format specifications makes it hard to integrate and interpret data for further analysis with information from multiple data providers. As a solution, a new ontology is presented here for annotating and representing genomic feature and variation dataset contents. The Genomic Feature and Variation Ontology (GFVO) specifically addresses genomic data as it is regularly shared using the GFF3 (incl. FASTA), GTF, GVF and VCF file formats. GFVO simplifies data integration and enables linking of genomic annotations across datasets through common semantics of genomic types and relations. Availability and implementation. The latest stable release of the ontology is available via its base URI; previous and development versions are available at the ontology's GitHub repository: https://github.com/BioInterchange/Ontologies; versions of the ontology are indexed through BioPortal (without external class-/property-equivalences due to BioPortal release 4.10 limitations); examples and reference documentation is provided on a separate web-page: http://www.biointerchange.org/ontologies.html. GFVO version 1.0.2 is licensed under the CC0 1.0 Universal license (https://creativecommons.org/publicdomain/zero/1.0) and therefore de facto within the public domain; the ontology can be appropriated without attribution for commercial and non-commercial use.

  5. GFVO: the Genomic Feature and Variation Ontology

    KAUST Repository

    Baran, Joachim

    2015-05-05

    Falling costs in genomic laboratory experiments have led to a steady increase of genomic feature and variation data. Multiple genomic data formats exist for sharing these data, and whilst they are similar, they are addressing slightly different data viewpoints and are consequently not fully compatible with each other. The fragmentation of data format specifications makes it hard to integrate and interpret data for further analysis with information from multiple data providers. As a solution, a new ontology is presented here for annotating and representing genomic feature and variation dataset contents. The Genomic Feature and Variation Ontology (GFVO) specifically addresses genomic data as it is regularly shared using the GFF3 (incl. FASTA), GTF, GVF and VCF file formats. GFVO simplifies data integration and enables linking of genomic annotations across datasets through common semantics of genomic types and relations. Availability and implementation. The latest stable release of the ontology is available via its base URI; previous and development versions are available at the ontology’s GitHub repository: https://github.com/BioInterchange/Ontologies; versions of the ontology are indexed through BioPortal (without external class-/property-equivalences due to BioPortal release 4.10 limitations); examples and reference documentation is provided on a separate web-page: http://www.biointerchange.org/ontologies.html. GFVO version 1.0.2 is licensed under the CC0 1.0 Universal license (https://creativecommons.org/publicdomain/zero/1.0) and therefore de facto within the public domain; the ontology can be appropriated without attribution for commercial and non-commercial use.

  6. GFVO: the Genomic Feature and Variation Ontology

    Directory of Open Access Journals (Sweden)

    Joachim Baran

    2015-05-01

    Full Text Available Falling costs in genomic laboratory experiments have led to a steady increase of genomic feature and variation data. Multiple genomic data formats exist for sharing these data, and whilst they are similar, they are addressing slightly different data viewpoints and are consequently not fully compatible with each other. The fragmentation of data format specifications makes it hard to integrate and interpret data for further analysis with information from multiple data providers. As a solution, a new ontology is presented here for annotating and representing genomic feature and variation dataset contents. The Genomic Feature and Variation Ontology (GFVO specifically addresses genomic data as it is regularly shared using the GFF3 (incl. FASTA, GTF, GVF and VCF file formats. GFVO simplifies data integration and enables linking of genomic annotations across datasets through common semantics of genomic types and relations.Availability and implementation. The latest stable release of the ontology is available via its base URI; previous and development versions are available at the ontology’s GitHub repository: https://github.com/BioInterchange/Ontologies; versions of the ontology are indexed through BioPortal (without external class-/property-equivalences due to BioPortal release 4.10 limitations; examples and reference documentation is provided on a separate web-page: http://www.biointerchange.org/ontologies.html. GFVO version 1.0.2 is licensed under the CC0 1.0 Universal license (https://creativecommons.org/publicdomain/zero/1.0 and therefore de facto within the public domain; the ontology can be appropriated without attribution for commercial and non-commercial use.

  7. The dynamic evolutionary history of genome size in North American woodland salamanders.

    Science.gov (United States)

    Newman, Catherine E; Gregory, T Ryan; Austin, Christopher C

    2017-04-01

    The genus Plethodon is the most species-rich salamander genus in North America, and nearly half of its species face an uncertain future. It is also one of the most diverse families in terms of genome sizes, which range from 1C = 18.2 to 69.3 pg, or 5-20 times larger than the human genome. Large genome size in salamanders results in part from accumulation of transposable elements and is associated with various developmental and physiological traits. However, genome sizes have been reported for only 25% of the species of Plethodon (14 of 55). We collected genome size data for Plethodon serratus to supplement an ongoing phylogeographic study, reconstructed the evolutionary history of genome size in Plethodontidae, and inferred probable genome sizes for the 41 species missing empirical data. Results revealed multiple genome size changes in Plethodon: genomes of western Plethodon increased, whereas genomes of eastern Plethodon decreased, followed by additional decreases or subsequent increases. The estimated genome size of P. serratus was 21 pg. New understanding of variation in genome size evolution, along with genome size inferences for previously unstudied taxa, provide a foundation for future studies on the biology of plethodontid salamanders.

  8. Comparative genomics of brain size evolution

    Directory of Open Access Journals (Sweden)

    Wolfgang eEnard

    2014-05-01

    Full Text Available Which genetic changes took place during mammalian, primate and human evolution to build a larger brain? To answer this question, one has to correlate genetic changes with brain size changes across a phylogeny. Such a comparative genomics approach provides unique information to better understand brain evolution and brain development. However, its statistical power is limited for example due to the limited number of species, the presumably complex genetics of brain size evolution and the large search space of mammalian genomes. Hence, it is crucial to add functional information, for example by limiting the search space to genes and regulatory elements known to play a role in the relevant cell types during brain development. Similarly, it is crucial to experimentally follow up on hypotheses generated by such a comparative approach. Recent progress in understanding the molecular and cellular mechanisms of mammalian brain development, in genome sequencing and in genome editing, promises to make a close integration of evolutionary and experimental methods a fruitful approach to better understand the genetics of mammalian brain size evolution.

  9. Reductive genome evolution at both ends of the bacterial population size spectrum.

    Science.gov (United States)

    Batut, Bérénice; Knibbe, Carole; Marais, Gabriel; Daubin, Vincent

    2014-12-01

    Bacterial genomes show substantial variations in size. The smallest bacterial genomes are those of endocellular symbionts of eukaryotic hosts, which have undergone massive genome reduction and show patterns that are consistent with the degenerative processes that are predicted to occur in species with small effective population sizes. However, similar genome reduction is found in some free-living marine cyanobacteria that are characterized by extremely large populations. In this Opinion article, we discuss the different hypotheses that have been proposed to account for this reductive genome evolution at both ends of the bacterial population size spectrum.

  10. Size did not matter: An evolutionary account of the variation in penis size and size anxiety

    OpenAIRE

    Menelaos Apostolou

    2016-01-01

    The human penis exhibits considerable variation in size, while a substantial proportion of the adult male population experiences size anxiety. This paper employs an evolutionary framework in order to understand this variation, as well as the concern men exhibit about the adequacy of the size of their penis. It is argued that female choice has been one important sexual selection force, responsible for shaping the size of the penis. However, this force has been relatively weak, because women do...

  11. Genome size determination in peronosporales (Oomycota) by Feulgen image analysis.

    Science.gov (United States)

    Voglmayr, H; Greilhuber, J

    1998-12-01

    Genome size was determined, by nuclear Feulgen staining and image analysis, in 46 accessions of 31 species of Peronosporales (Oomycota), including important plant pathogens such as Bremia lactucae, Plasmopara viticola, Pseudoperonospora cubensis, and Pseudoperonospora humuli. The 1C DNA contents ranged from 0.046 (45. 6 Mb) to 0.163 pg (159.9 Mb). This is 0.041- to 0.144-fold that of Glycine max (soybean, 1C = 1.134 pg), which was used as an internal standard for genome size determination. The linearity of Feulgen absorbance photometry method over this range was demonstrated by calibration of Aspergillus species (1C = 31-38 Mb) against Glycine, which revealed differences of less than 6% compared to the published CHEF data. The low coefficients of variation (usually between 5 and 10%), repeatability of the results, and compatibility with CHEF data prove the resolution power of Feulgen image analysis. The applicability and limitations of Feulgen photometry are discussed in relation to other methods of genome size determination (CHEF gel electrophoresis, reassociation kinetics, genomic reconstruction) that have been previously applied to Oomycota. Copyright 1998 Academic Press.

  12. Online resources for genomic structural variation.

    Science.gov (United States)

    Sneddon, Tam P; Church, Deanna M

    2012-01-01

    Genomic structural variation (SV) can be thought of on a continuum from a single base pair insertion/deletion (INDEL) to large megabase-scale rearrangements involving insertions, deletions, duplications, inversions, or translocations of whole chromosomes or chromosome arms. These variants can occur in coding or noncoding DNA, they can be inherited or arise sporadically in the germline or somatic cells. Many of these events are segregating in the population and can be considered common alleles while others are new alleles and thus rare events. All species studied to date harbor structural variants and these may be benign, contributing to phenotypes such as sensory perception and immunity, or pathogenic resulting in genomic disorders including DiGeorge/velocardiofacial, Smith-Margenis, Williams-Beuren, and Prader-Willi syndromes. As structural variants are identified, validated, and their significance, origin, and prevalence are elucidated, it is of critical importance that these data be collected and collated in a way that can be easily accessed and analyzed. This chapter describes current structural variation online resources (see Fig. 1 and Table 1), highlights the challenges in capturing, storing, and displaying SV data, and discusses how dbVar and DGVa, the genomic structural variation databases developed at NCBI and EBI, respectively, were designed to address these issues.

  13. Burkholderia pseudomallei genome plasticity associated with genomic island variation

    Directory of Open Access Journals (Sweden)

    Currie Bart J

    2008-04-01

    Full Text Available Abstract Background Burkholderia pseudomallei is a soil-dwelling saprophyte and the cause of melioidosis. Horizontal gene transfer contributes to the genetic diversity of this pathogen and may be an important determinant of virulence potential. The genome contains genomic island (GI regions that encode a broad array of functions. Although there is some evidence for the variable distribution of genomic islands in B. pseudomallei isolates, little is known about the extent of variation between related strains or their association with disease or environmental survival. Results Five islands from B. pseudomallei strain K96243 were chosen as representatives of different types of genomic islands present in this strain, and their presence investigated in other B. pseudomallei. In silico analysis of 10 B. pseudomallei genome sequences provided evidence for the variable presence of these regions, together with micro-evolutionary changes that generate GI diversity. The diversity of GIs in 186 isolates from NE Thailand (83 environmental and 103 clinical isolates was investigated using multiplex PCR screening. The proportion of all isolates positive by PCR ranged from 12% for a prophage-like island (GI 9, to 76% for a metabolic island (GI 16. The presence of each of the five GIs did not differ between environmental and disease-associated isolates (p > 0.05 for all five islands. The cumulative number of GIs per isolate for the 186 isolates ranged from 0 to 5 (median 2, IQR 1 to 3. The distribution of cumulative GI number did not differ between environmental and disease-associated isolates (p = 0.27. The presence of GIs was defined for the three largest clones in this collection (each defined as a single sequence type, ST, by multilocus sequence typing; these were ST 70 (n = 15 isolates, ST 54 (n = 11, and ST 167 (n = 9. The rapid loss and/or acquisition of gene islands was observed within individual clones. Comparisons were drawn between isolates obtained

  14. The mode and tempo of genome size evolution in the subgenus Sophophora

    Science.gov (United States)

    Johnston, J. Spencer

    2017-01-01

    Genome size varies widely across organisms, with no apparent tie to organismal complexity. While genome size is inherited, there is no established evolutionary model for this trait. Hypotheses have been postulated for the observed variation in genome sizes across species, most notably the effective population size hypothesis, the mutational equilibrium hypothesis, and the adaptive hypothesis. While much data has been collected on genome size, the above hypotheses have largely ignored impacts from phylogenetic relationships. In order to test these competing hypotheses, genome sizes of 87 Sophophora species were analyzed in a comparative phylogenetic approach using Pagel’s parameters of evolution, Blomberg’s K, Abouheif’s Cmean and Moran’s I. In addition to testing the mode and rate of genome size evolution in Sophophora species, the effect of number of taxa on detection of phylogenetic signal was analyzed for each of these comparative phylogenetic methods. Sophophora genome size was found to be dependent on the phylogeny, indicating that evolutionary time was important for predicting the variation among species. Genome size was found to evolve gradually on branches of the tree, with a rapid burst of change early in the phylogeny. These results suggest that Sophophora genome size has experienced gradual changes, which support the largely theoretical mutational equilibrium hypothesis. While some methods (Abouheif’s Cmean and Moran’s I) were found to be affected by increasing taxa numbers, more commonly used methods (λ and Blomberg’s K) were found to have increasing reliability with increasing taxa number, with significantly more support with fifteen or more taxa. Our results suggest that these comparative phylogenetic methods, with adequate taxon sampling, can be a powerful way to uncover the enigma that is genome size variation through incorporation of phylogenetic relationships. PMID:28267812

  15. Personal and population genomics of human regulatory variation.

    Science.gov (United States)

    Vernot, Benjamin; Stergachis, Andrew B; Maurano, Matthew T; Vierstra, Jeff; Neph, Shane; Thurman, Robert E; Stamatoyannopoulos, John A; Akey, Joshua M

    2012-09-01

    The characteristics and evolutionary forces acting on regulatory variation in humans remains elusive because of the difficulty in defining functionally important noncoding DNA. Here, we combine genome-scale maps of regulatory DNA marked by DNase I hypersensitive sites (DHSs) from 138 cell and tissue types with whole-genome sequences of 53 geographically diverse individuals in order to better delimit the patterns of regulatory variation in humans. We estimate that individuals likely harbor many more functionally important variants in regulatory DNA compared with protein-coding regions, although they are likely to have, on average, smaller effect sizes. Moreover, we demonstrate that there is significant heterogeneity in the level of functional constraint in regulatory DNA among different cell types. We also find marked variability in functional constraint among transcription factor motifs in regulatory DNA, with sequence motifs for major developmental regulators, such as HOX proteins, exhibiting levels of constraint comparable to protein-coding regions. Finally, we perform a genome-wide scan of recent positive selection and identify hundreds of novel substrates of adaptive regulatory evolution that are enriched for biologically interesting pathways such as melanogenesis and adipocytokine signaling. These data and results provide new insights into patterns of regulatory variation in individuals and populations and demonstrate that a large proportion of functionally important variation lies beyond the exome.

  16. Coevolution between simple sequence repeats (SSRs and virus genome size

    Directory of Open Access Journals (Sweden)

    Zhao Xiangyan

    2012-08-01

    Full Text Available Abstract Background Relationship between the level of repetitiveness in genomic sequence and genome size has been investigated by making use of complete prokaryotic and eukaryotic genomes, but relevant studies have been rarely made in virus genomes. Results In this study, a total of 257 viruses were examined, which cover 90% of genera. The results showed that simple sequence repeats (SSRs is strongly, positively and significantly correlated with genome size. Certain repeat class is distributed in a certain range of genome sequence length. Mono-, di- and tri- repeats are widely distributed in all virus genomes, tetra- SSRs as a common component consist in genomes which more than 100 kb in size; in the range of genome  Conclusions We conducted this research standing on the height of the whole virus. We concluded that genome size is an important factor in affecting the occurrence of SSRs; hosts are also responsible for the variances of SSRs content to a certain degree.

  17. Paleogenomic data suggest mammal-like genome size in the ancestral amniote and derived large genome size in amphibians.

    Science.gov (United States)

    Organ, C L; Canoville, A; Reisz, R R; Laurin, M

    2011-02-01

    An unsolved question in evolutionary genomics is whether amniote genomes have been expanding or contracting since the common ancestor of this diverse group. Here, we report on the polarity of amniote genome size evolution using genome size estimates for 14 extinct tetrapod genera from the Paleozoic and early Mesozoic Eras using osteocyte lacunae size as a correlate. We find substantial support for a phylogenetically controlled regression model relating genome size to osteocyte lacunae size (P of slopes amphibians, contractions along the diapsid lineage, and no directional change within the synapsid lineage leading to mammals.

  18. Nitrogen limitation as a driver of genome size evolution in a group of karst plants

    Science.gov (United States)

    Kang, Ming; Wang, Jing; Huang, Hongwen

    2015-06-01

    Genome size is of fundamental biological importance with significance in predicting structural and functional attributes of organisms. Although abundant evidence has shown that the genome size can be largely explained by differential proliferation and removal of non-coding DNA of the genome, the evolutionary and ecological basis of genome size variation remains poorly understood. Nitrogen (N) and phosphorus (P) are essential elements of DNA and protein building blocks, yet often subject to environmental limitation in natural ecosystems. Using phylogenetic comparative methods, we test this hypothesis by determining whether leaf N and P availability affects genome sizes in 99 species of Primulina (Gesneriaceae), a group of soil specialists adapted to limestone karst environment in south China. We find that genome sizes in Primulina are strongly positively correlated with plant N content, but the correlation with plant P content is not significant when phylogeny history was taken into account. This study shows for the first time that N limitation might have been a plausible driver of genome size variation in a group of plants. We propose that competition for nitrogen nutrient between DNA synthesis and cellular functions is a possible mechanism for genome size evolution in Primulina under N-limitation.

  19. The Small Nuclear Genomes of Selaginella Are Associated with a Low Rate of Genome Size Evolution.

    Science.gov (United States)

    Baniaga, Anthony E; Arrigo, Nils; Barker, Michael S

    2016-06-03

    The haploid nuclear genome size (1C DNA) of vascular land plants varies over several orders of magnitude. Much of this observed diversity in genome size is due to the proliferation and deletion of transposable elements. To date, all vascular land plant lineages with extremely small nuclear genomes represent recently derived states, having ancestors with much larger genome sizes. The Selaginellaceae represent an ancient lineage with extremely small genomes. It is unclear how small nuclear genomes evolved in Selaginella We compared the rates of nuclear genome size evolution in Selaginella and major vascular plant clades in a comparative phylogenetic framework. For the analyses, we collected 29 new flow cytometry estimates of haploid genome size in Selaginella to augment publicly available data. Selaginella possess some of the smallest known haploid nuclear genome sizes, as well as the lowest rate of genome size evolution observed across all vascular land plants included in our analyses. Additionally, our analyses provide strong support for a history of haploid nuclear genome size stasis in Selaginella Our results indicate that Selaginella, similar to other early diverging lineages of vascular land plants, has relatively low rates of genome size evolution. Further, our analyses highlight that a rapid transition to a small genome size is only one route to an extremely small genome.

  20. A first exploration of genome size diversity in sponges.

    Science.gov (United States)

    Jeffery, Nicholas W; Jardine, Catherine B; Gregory, T Ryan

    2013-08-01

    The phyla known as early-branching lineages of animals have become the subject of increasing interest from the perspectives of genomics and evolutionary biology. Unfortunately, data on even the most fundamental properties of their genomes, such as genome size, remain very scarce. In this study, genome size estimates are reported for 75 species of sponges (phylum Porifera) representing 33 families and 12 orders, marking the first large survey of genome size diversity for an early-branching phylum. Sponge genome sizes averaged around 0.2 pg but exhibited a 17-fold range overall (0.04-0.63 pg). In addition, the results of comparisons of two methods of genome size quantification (flow cytometry and Feulgen image analysis densitometry) are presented, thereby facilitating future work on these animals. Some particularly promising avenues for future investigation are highlighted.

  1. Size did not matter: An evolutionary account of the variation in penis size and size anxiety

    Directory of Open Access Journals (Sweden)

    Menelaos Apostolou

    2016-12-01

    Full Text Available The human penis exhibits considerable variation in size, while a substantial proportion of the adult male population experiences size anxiety. This paper employs an evolutionary framework in order to understand this variation, as well as the concern men exhibit about the adequacy of the size of their penis. It is argued that female choice has been one important sexual selection force, responsible for shaping the size of the penis. However, this force has been relatively weak, because women do not consider the size of their partners’ penis to be the most important determinant of their sexual satisfaction. Also, in ancestral human societies, sexual satisfaction was a secondary concern, while women had limited space to exercise mate choice. The mismatch between ancestral and modern conditions, with female choice being stronger in the present than in the past, causes anxiety in men about their ability to satisfy their partners, which is also manifested in their concerns about size.

  2. Genomic variation in Salmonella enterica core genes for epidemiological typing

    DEFF Research Database (Denmark)

    Leekitcharoenphon, Pimlapas; Lukjancenko, Oksana; Rundsten, Carsten Friis

    2012-01-01

    Background: Technological advances in high throughput genome sequencing are making whole genome sequencing (WGS) available as a routine tool for bacterial typing. Standardized procedures for identification of relevant genes and of variation are needed to enable comparison between studies and over...... genomes and evaluate their value as typing targets, comparing whole genome typing and traditional methods such as 16S and MLST. A consensus tree based on variation of core genes gives much better resolution than 16S and MLST; the pan-genome family tree is similar to the consensus tree, but with higher...... that there is a positive selection towards mutations leading to amino acid changes. Conclusions: Genomic variation within the core genome is useful for investigating molecular evolution and providing candidate genes for bacterial genome typing. Identification of genes with different degrees of variation is important...

  3. 49 CFR 231.20 - Variation in size permitted.

    Science.gov (United States)

    2010-10-01

    ... 49 Transportation 4 2010-10-01 2010-10-01 false Variation in size permitted. 231.20 Section 231.20..., DEPARTMENT OF TRANSPORTATION RAILROAD SAFETY APPLIANCE STANDARDS § 231.20 Variation in size permitted. To... total variation of 5 percent below size given is permitted....

  4. Genetic variation and the de novo assembly of human genomes.

    Science.gov (United States)

    Chaisson, Mark J P; Wilson, Richard K; Eichler, Evan E

    2015-11-01

    The discovery of genetic variation and the assembly of genome sequences are both inextricably linked to advances in DNA-sequencing technology. Short-read massively parallel sequencing has revolutionized our ability to discover genetic variation but is insufficient to generate high-quality genome assemblies or resolve most structural variation. Full resolution of variation is only guaranteed by complete de novo assembly of a genome. Here, we review approaches to genome assembly, the nature of gaps or missing sequences, and biases in the assembly process. We describe the challenges of generating a complete de novo genome assembly using current technologies and the impact that being able to perfectly sequence the genome would have on understanding human disease and evolution. Finally, we summarize recent technological advances that improve both contiguity and accuracy and emphasize the importance of complete de novo assembly as opposed to read mapping as the primary means to understanding the full range of human genetic variation.

  5. Reptiles: a group of transition in the evolution of genome size and of the nucleotypic effect.

    Science.gov (United States)

    Olmo, E

    2003-01-01

    A comparison between genome size and some phenotypic parameters, such as developmental length and metabolic rate, showed in reptiles a nucleotypic correlation similar to the one observed in birds and mammals. Indeed, like homeotherms, reptiles exhibit a highly significant, inverse correlation of genome size with metabolic rate but unlike amphibians, no relationship with developmental length. Several lines of evidence suggest that these nucleotypic correlations are influenced by body temperature, which also affects the guanine + cytosine nuclear percentage, and that they play an important role in the adaptation of these amniotes. However, the reptilian suborders exhibit differences in the quantitative and compositional characters of the genome that do not completely correspond to differences in the phenotypic parameters commonly involved in the nucleotypic effect. Thus, additional factors could have influenced genome size in this class. These data could be explained with the model of Hartl and Petrov, who observed an inverse correlation between genome size, non-coding portion of the genome and rate of DNA loss and hypothesized a strong role for different spectra of spontaneous insertions and deletions (indels) in the variations of genome size. It is thus reasonable to surmise that variations in the reptilian genome were initially influenced by different indels spectra typical of the diverse lineages, possibly related to different chromosome compartmentalizations. The consequent size increases or decreases would have influenced various morphological and functional cell parameters, and through these some phenotypic characteristics of the whole organism, especially the metabolic rate, very important for environmental adaptation and thus subject to natural selection. Through this "nucleotypic" bond, natural selection would also have controlled genome size variations.

  6. Copy number variation in the horse genome.

    Directory of Open Access Journals (Sweden)

    Sharmila Ghosh

    2014-10-01

    Full Text Available We constructed a 400K WG tiling oligoarray for the horse and applied it for the discovery of copy number variations (CNVs in 38 normal horses of 16 diverse breeds, and the Przewalski horse. Probes on the array represented 18,763 autosomal and X-linked genes, and intergenic, sub-telomeric and chrY sequences. We identified 258 CNV regions (CNVRs across all autosomes, chrX and chrUn, but not in chrY. CNVs comprised 1.3% of the horse genome with chr12 being most enriched. American Miniature horses had the highest and American Quarter Horses the lowest number of CNVs in relation to Thoroughbred reference. The Przewalski horse was similar to native ponies and draft breeds. The majority of CNVRs involved genes, while 20% were located in intergenic regions. Similar to previous studies in horses and other mammals, molecular functions of CNV-associated genes were predominantly in sensory perception, immunity and reproduction. The findings were integrated with previous studies to generate a composite genome-wide dataset of 1476 CNVRs. Of these, 301 CNVRs were shared between studies, while 1174 were novel and require further validation. Integrated data revealed that to date, 41 out of over 400 breeds of the domestic horse have been analyzed for CNVs, of which 11 new breeds were added in this study. Finally, the composite CNV dataset was applied in a pilot study for the discovery of CNVs in 6 horses with XY disorders of sexual development. A homozygous deletion involving AKR1C gene cluster in chr29 in two affected horses was considered possibly causative because of the known role of AKR1C genes in testicular androgen synthesis and sexual development. While the findings improve and integrate the knowledge of CNVs in horses, they also show that for effective discovery of variants of biomedical importance, more breeds and individuals need to be analyzed using comparable methodological approaches.

  7. Copy number variation in the horse genome.

    Science.gov (United States)

    Ghosh, Sharmila; Qu, Zhipeng; Das, Pranab J; Fang, Erica; Juras, Rytis; Cothran, E Gus; McDonell, Sue; Kenney, Daniel G; Lear, Teri L; Adelson, David L; Chowdhary, Bhanu P; Raudsepp, Terje

    2014-10-01

    We constructed a 400K WG tiling oligoarray for the horse and applied it for the discovery of copy number variations (CNVs) in 38 normal horses of 16 diverse breeds, and the Przewalski horse. Probes on the array represented 18,763 autosomal and X-linked genes, and intergenic, sub-telomeric and chrY sequences. We identified 258 CNV regions (CNVRs) across all autosomes, chrX and chrUn, but not in chrY. CNVs comprised 1.3% of the horse genome with chr12 being most enriched. American Miniature horses had the highest and American Quarter Horses the lowest number of CNVs in relation to Thoroughbred reference. The Przewalski horse was similar to native ponies and draft breeds. The majority of CNVRs involved genes, while 20% were located in intergenic regions. Similar to previous studies in horses and other mammals, molecular functions of CNV-associated genes were predominantly in sensory perception, immunity and reproduction. The findings were integrated with previous studies to generate a composite genome-wide dataset of 1476 CNVRs. Of these, 301 CNVRs were shared between studies, while 1174 were novel and require further validation. Integrated data revealed that to date, 41 out of over 400 breeds of the domestic horse have been analyzed for CNVs, of which 11 new breeds were added in this study. Finally, the composite CNV dataset was applied in a pilot study for the discovery of CNVs in 6 horses with XY disorders of sexual development. A homozygous deletion involving AKR1C gene cluster in chr29 in two affected horses was considered possibly causative because of the known role of AKR1C genes in testicular androgen synthesis and sexual development. While the findings improve and integrate the knowledge of CNVs in horses, they also show that for effective discovery of variants of biomedical importance, more breeds and individuals need to be analyzed using comparable methodological approaches.

  8. Comparative genomic analysis of sixty mycobacteriophage genomes: Genome clustering, gene acquisition and gene size

    Science.gov (United States)

    Hatfull, Graham F.; Jacobs-Sera, Deborah; Lawrence, Jeffrey G.; Pope, Welkin H.; Russell, Daniel A.; Ko, Ching-Chung; Weber, Rebecca J.; Patel, Manisha C.; Germane, Katherine L.; Edgar, Robert H.; Hoyte, Natasha N.; Bowman, Charles A.; Tantoco, Anthony T.; Paladin, Elizabeth C.; Myers, Marlana S.; Smith, Alexis L.; Grace, Molly S.; Pham, Thuy T.; O'Brien, Matthew B.; Vogelsberger, Amy M.; Hryckowian, Andrew J.; Wynalek, Jessica L.; Donis-Keller, Helen; Bogel, Matt W.; Peebles, Craig L.; Cresawn, Steve G.; Hendrix, Roger W.

    2010-01-01

    Mycobacteriophages are viruses that infect mycobacterial hosts. Expansion of a collection of sequenced phage genomes to a total of sixty – all infecting a common bacterial host – provides further insight into their diversity and evolution. Of the sixty phage genomes, 55 can be grouped into nine clusters according to their nucleotide sequence similarities, five of which can be further divided into subclusters; five genomes do not cluster with other phages. The sequence diversity between genomes within a cluster varies greatly; for example, the six genomes in cluster D share more than 97.5% average nucleotide similarity with each other. In contrast, similarity between the two genomes in Cluster I is barely detectable by diagonal plot analysis. The total of 6,858 predicted ORFs have been grouped into 1523 phamilies (phams) of related sequences, 46% of which possess only a single member. Only 18.8% of the phams have sequence similarity to non-mycobacteriophage database entries and fewer than 10% of all phams can be assigned functions based on database searching or synteny. Genome clustering facilitates the identification of genes that are in greatest genetic flux and are more likely to have been exchanged horizontally in relatively recent evolutionary time. Although mycobacteriophage genes exhibit smaller average size than genes of their host (205 residues compared to 315), phage genes in higher flux average only ∼100 amino acids, suggesting that the primary units of genetic exchange correspond to single protein domains. PMID:20064525

  9. Size-selected genomic libraries: the distribution and size-fractionation of restricted genomic DNA fragments by gel electrophoresis.

    Science.gov (United States)

    Gondo, Y

    1995-02-01

    By using one-dimensional genome scanning, it is possible to directly identify the restricted genomic DNA fragment that reflects the site of genetic change. The subsequent strategies to obtain the molecular clones of the corresponding restriction fragment are usually as follows: (i) the restriction of a mass quantity of an appropriate genomic DNA, (ii) the size-fractionation of the restricted DNA on a preparative electrophoresis gel in order to enrich the corresponding restriction fragment, (iii) the construction of the size-selected libraries from the fractionated genomic DNA, and (iv) the screening of the library to obtain an objective clone which is identified on the analytical genome scanning gel. A knowledge of the size distribution pattern of restriction fragments of the genomic DNA makes it possible to calculate the heterogeneity or complexity of the restriction fragment in each size-fraction. This manuscript first describes the distribution of the restriction fragments with respect to their length. Some examples of the practical application of this theory to genome scanning is then discussed using presumptive genome scanning gels. The way to calculate such DNA complexities in the prepared size-fractionated samples is also demonstrated. Such information should greatly facilitate the design of experimental strategies for the cloning of a certain size of genomic DNA after digestion with restriction enzyme(s) as is the case with genome scanning.

  10. Survey of genome size in 28 hydrothermal vent species covering 10 families.

    Science.gov (United States)

    Bonnivard, Eric; Catrice, Olivier; Ravaux, Juliette; Brown, Spencer C; Higuet, Dominique

    2009-06-01

    Knowledge of genome size is a useful and necessary prerequisite for the development of many genomic resources. To better understand the origins and effects of DNA gains and losses among species, it is important to collect data from a broad taxonomic base, but also from particular ecosystems. Oceanic thermal vents are an interesting model to investigate genome size in very unstable environments. Here we provide data estimated by flow cytometry for 28 vent-living species among the most representative from different hydrothermal vents. We also report the genome size of closely related coastal decapods. Haploid C-values were compared with those previously reported for species from corresponding orders or infraorders. This is the first broad survey of 2C values in vent organisms. Contrary to expectations, it shows that certain hydrothermal vent species have particularly large genomes. The vent squat lobster Munidopsis recta has the largest genome yet reported for any anomuran: 2C=31.1 pg=30.4x10(9) bp. In several groups, such as Brachyura, Phyllodocida, and Veneroida, vent species have genomes that clearly rank at the high end of published values for each group. We also describe the highest DNA content yet recorded for the Brachyura (coastal crabs Xantho pilipes and Necora puber). Finally, analysis of genome size variation across populations revealed unexpected intraspecific variation in the vent shrimp Mirocaris fortunata that could not be attributed simply to ploidy changes.

  11. Exploring functional elements and genomic variation in the noncoding genome

    NARCIS (Netherlands)

    van Heesch, S.A.A.C.

    2014-01-01

    Gene expression regulation is a delicate process that depends on multiple aspects including genome structure and transcription factor binding to DNA elements. The majority of our genome consists of noncoding DNA, which was shown to be crucial in providing the correct context for genome function. Alt

  12. Exploring functional elements and genomic variation in the noncoding genome

    NARCIS (Netherlands)

    van Heesch, S.A.A.C.|info:eu-repo/dai/nl/336463286

    2014-01-01

    Gene expression regulation is a delicate process that depends on multiple aspects including genome structure and transcription factor binding to DNA elements. The majority of our genome consists of noncoding DNA, which was shown to be crucial in providing the correct context for genome function. Alt

  13. Characterization of copy number variation in genomic regions containing STR loci using array comparative genomic hybridization.

    Science.gov (United States)

    Repnikova, Elena A; Rosenfeld, Jill A; Bailes, Andrea; Weber, Cecilia; Erdman, Linda; McKinney, Aimee; Ramsey, Sarah; Hashimoto, Sayaka; Lamb Thrush, Devon; Astbury, Caroline; Reshmi, Shalini C; Shaffer, Lisa G; Gastier-Foster, Julie M; Pyatt, Robert E

    2013-09-01

    Short tandem repeat (STR) loci are commonly used in forensic casework, familial analysis for human identification, and for monitoring hematopoietic cell engraftment after bone marrow transplant. Unexpected genetic variation leading to sequence and length differences in STR loci can complicate STR typing, and presents challenges in casework interpretation. Copy number variation (CNV) is a relatively recently identified form of genetic variation consisting of genomic regions present at variable copy numbers within an individual compared to a reference genome. Large scale population studies have demonstrated that likely all individuals carry multiple regions with CNV of 1kb in size or greater in their genome. To date, no study correlating genomic regions containing STR loci with CNV has been conducted. In this study, we analyzed results from 32,850 samples sent for clinical array comparative genomic hybridization (CGH) analysis for the presence of CNV at regions containing the 13 CODIS (Combined DNA Index System) STR, and the Amelogenin X (AMELX) and Amelogenin Y (AMELY) loci. Thirty-two individuals with CNV involving STR loci on chromosomes 2, 4, 7, 11, 12, 13, 16, and 21, and twelve with CNV involving the AMELX/AMELY loci were identified. These results were correlated with data from publicly available databases housing information on CNV identified in normal populations and additional clinical cases. These collective results demonstrate the presence of CNV in regions containing 9 of the 13 CODIS STR and AMELX/Y loci. Further characterization of STR profiles within regions of CNV, additional cataloging of these variants in multiple populations, and contributing such examples to the public domain will provide valuable information for reliable use of these loci.

  14. Size variation of fossil rodent populations

    NARCIS (Netherlands)

    Freudenthal, M.; Cuenca Bescos, G.

    1984-01-01

    Pearson's coefficient of variation is in general not applicable in palaeontology, due to the heterogeneity of samples. The heterogeneity may be due to the mixing of two species, mixture of material from various biotopes, or from a relatively large time span. A new coefficient of variation is propose

  15. Size variation of fossil rodent populations

    NARCIS (Netherlands)

    Freudenthal, M.; Cuenca Bescos, G.

    1984-01-01

    Pearson's coefficient of variation is in general not applicable in palaeontology, due to the heterogeneity of samples. The heterogeneity may be due to the mixing of two species, mixture of material from various biotopes, or from a relatively large time span. A new coefficient of variation is

  16. Microeconomic principles explain an optimal genome size in bacteria.

    Science.gov (United States)

    Ranea, Juan A G; Grant, Alastair; Thornton, Janet M; Orengo, Christine A

    2005-01-01

    Bacteria can clearly enhance their survival by expanding their genetic repertoire. However, the tight packing of the bacterial genome and the fact that the most evolved species do not necessarily have the biggest genomes suggest there are other evolutionary factors limiting their genome expansion. To clarify these restrictions on size, we studied those protein families contributing most significantly to bacterial-genome complexity. We found that all bacteria apply the same basic and ancestral 'molecular technology' to optimize their reproductive efficiency. The same microeconomics principles that define the optimum size in a factory can also explain the existence of a statistical optimum in bacterial genome size. This optimum is reached when the bacterial genome obtains the maximum metabolic complexity (revenue) for minimal regulatory genes (logistic cost).

  17. Size and complexity of the nuclear genome of Colletotrichum graminicola.

    Science.gov (United States)

    Randhir, R J; Hanau, R M

    1997-10-01

    DNA reassociation was used to estimate GC content, size, and complexity of the nuclear genomes of Colletotrichum from maize and sorghum. Melting-temperature analysis indicated that the GC content of the maize pathotype DNA was 51% and that the GC content of the sorghum pathotype was 52%. DNA reassociation kinetics employing S1 nuclease digestion and an appropriately modified second-order equation indicated that the genome sizes of the maize and sorghum pathotypes were 4.8 x 10(7) bp, and 5.0 x 10(7) bp, respectively. Genomic reconstruction experiments based on Southern blot hybridization between a cloned single-copy gene, PYR1 (orotate phosphoribosyl transferase), and maize-pathotype DNA confirmed the size of the nuclear genome. The single-copy component of the genomes of both pathotypes was estimated at about 90%. For both pathotypes, ca. 7% of the genome represented repetitive DNA, and 2 to 3% was foldback DNA.

  18. Genome size correlates with reproductive fitness in seed beetles.

    Science.gov (United States)

    Arnqvist, Göran; Sayadi, Ahmed; Immonen, Elina; Hotzy, Cosima; Rankin, Daniel; Tuda, Midori; Hjelmen, Carl E; Johnston, J Spencer

    2015-09-22

    The ultimate cause of genome size (GS) evolution in eukaryotes remains a major and unresolved puzzle in evolutionary biology. Large-scale comparative studies have failed to find consistent correlations between GS and organismal properties, resulting in the 'C-value paradox'. Current hypotheses for the evolution of GS are based either on the balance between mutational events and drift or on natural selection acting upon standing genetic variation in GS. It is, however, currently very difficult to evaluate the role of selection because within-species studies that relate variation in life-history traits to variation in GS are very rare. Here, we report phylogenetic comparative analyses of GS evolution in seed beetles at two distinct taxonomic scales, which combines replicated estimation of GS with experimental assays of life-history traits and reproductive fitness. GS showed rapid and bidirectional evolution across species, but did not show correlated evolution with any of several indices of the relative importance of genetic drift. Within a single species, GS varied by 4-5% across populations and showed positive correlated evolution with independent estimates of male and female reproductive fitness. Collectively, the phylogenetic pattern of GS diversification across and within species in conjunction with the pattern of correlated evolution between GS and fitness provide novel support for the tenet that natural selection plays a key role in shaping GS evolution.

  19. Genome size correlates with reproductive fitness in seed beetles

    Science.gov (United States)

    Arnqvist, Göran; Sayadi, Ahmed; Immonen, Elina; Hotzy, Cosima; Rankin, Daniel; Tuda, Midori; Hjelmen, Carl E.; Johnston, J. Spencer

    2015-01-01

    The ultimate cause of genome size (GS) evolution in eukaryotes remains a major and unresolved puzzle in evolutionary biology. Large-scale comparative studies have failed to find consistent correlations between GS and organismal properties, resulting in the ‘C-value paradox’. Current hypotheses for the evolution of GS are based either on the balance between mutational events and drift or on natural selection acting upon standing genetic variation in GS. It is, however, currently very difficult to evaluate the role of selection because within-species studies that relate variation in life-history traits to variation in GS are very rare. Here, we report phylogenetic comparative analyses of GS evolution in seed beetles at two distinct taxonomic scales, which combines replicated estimation of GS with experimental assays of life-history traits and reproductive fitness. GS showed rapid and bidirectional evolution across species, but did not show correlated evolution with any of several indices of the relative importance of genetic drift. Within a single species, GS varied by 4–5% across populations and showed positive correlated evolution with independent estimates of male and female reproductive fitness. Collectively, the phylogenetic pattern of GS diversification across and within species in conjunction with the pattern of correlated evolution between GS and fitness provide novel support for the tenet that natural selection plays a key role in shaping GS evolution. PMID:26354938

  20. Recent updates and developments to plant genome size databases

    Science.gov (United States)

    Garcia, Sònia; Leitch, Ilia J.; Anadon-Rosell, Alba; Canela, Miguel Á.; Gálvez, Francisco; Garnatje, Teresa; Gras, Airy; Hidalgo, Oriane; Johnston, Emmeline; Mas de Xaxars, Gemma; Pellicer, Jaume; Siljak-Yakovlev, Sonja; Vallès, Joan; Vitales, Daniel; Bennett, Michael D.

    2014-01-01

    Two plant genome size databases have been recently updated and/or extended: the Plant DNA C-values database (http://data.kew.org/cvalues), and GSAD, the Genome Size in Asteraceae database (http://www.asteraceaegenomesize.com). While the first provides information on nuclear DNA contents across land plants and some algal groups, the second is focused on one of the largest and most economically important angiosperm families, Asteraceae. Genome size data have numerous applications: they can be used in comparative studies on genome evolution, or as a tool to appraise the cost of whole-genome sequencing programs. The growing interest in genome size and increasing rate of data accumulation has necessitated the continued update of these databases. Currently, the Plant DNA C-values database (Release 6.0, Dec. 2012) contains data for 8510 species, while GSAD has 1219 species (Release 2.0, June 2013), representing increases of 17 and 51%, respectively, in the number of species with genome size data, compared with previous releases. Here we provide overviews of the most recent releases of each database, and outline new features of GSAD. The latter include (i) a tool to visually compare genome size data between species, (ii) the option to export data and (iii) a webpage containing information about flow cytometry protocols. PMID:24288377

  1. Identification of Sesame Genomic Variations from Genome Comparison of Landrace and Variety.

    Science.gov (United States)

    Wei, Xin; Zhu, Xiaodong; Yu, Jingyin; Wang, Linhai; Zhang, Yanxin; Li, Donghua; Zhou, Rong; Zhang, Xiurong

    2016-01-01

    Sesame (Sesamum indicum L.) is one of the main oilseed crops, providing vegetable oil and protein to human. Landrace is the gene source of variety, carrying many desire alleles for genetic improvement. Despite the importance of sesame landrace, genome of sesame landrace remains unexplored and genomic variations between landrace and variety still is not clear. To identify the genomic variations between sesame landrace and variety, two representative sesame landrace accessions, "Baizhima" and "Mishuozhima," were selected and re-sequenced. The genome sequencing and de novo assembling of the two sesame landraces resulted in draft genomes of 267 Mb and 254 Mb, respectively, with the contig N50 more than 47 kb. Totally, 1,332,025 SNPs and 506,245 InDels were identified from the genome of "Baizhima" and "Mishuozhima" by comparison of the genome of a variety "Zhongzhi13." Among the genomic variations, 70,018 SNPs and 8311 InDels were located in the coding regions of genes. Genomic variations may contribute to variation of sesame agronomic traits such as flowering time, plant height, and oil content. The identified genomic variations were successfully used in the QTL mapping and the black pigment synthesis gene, PPO, was found to be the candidate gene of sesame seed coat color. The comprehensively compared genomes of sesame landrace and modern variety produced massive useful genomic information, constituting a powerful tool to support genetic research, and molecular breeding of sesame.

  2. Identification of Sesame Genomic Variations from Genome Comparison of Landrace and Variety

    Science.gov (United States)

    Wei, Xin; Zhu, Xiaodong; Yu, Jingyin; Wang, Linhai; Zhang, Yanxin; Li, Donghua; Zhou, Rong; Zhang, Xiurong

    2016-01-01

    Sesame (Sesamum indicum L.) is one of the main oilseed crops, providing vegetable oil and protein to human. Landrace is the gene source of variety, carrying many desire alleles for genetic improvement. Despite the importance of sesame landrace, genome of sesame landrace remains unexplored and genomic variations between landrace and variety still is not clear. To identify the genomic variations between sesame landrace and variety, two representative sesame landrace accessions, “Baizhima” and “Mishuozhima,” were selected and re-sequenced. The genome sequencing and de novo assembling of the two sesame landraces resulted in draft genomes of 267 Mb and 254 Mb, respectively, with the contig N50 more than 47 kb. Totally, 1,332,025 SNPs and 506,245 InDels were identified from the genome of “Baizhima” and “Mishuozhima” by comparison of the genome of a variety “Zhongzhi13.” Among the genomic variations, 70,018 SNPs and 8311 InDels were located in the coding regions of genes. Genomic variations may contribute to variation of sesame agronomic traits such as flowering time, plant height, and oil content. The identified genomic variations were successfully used in the QTL mapping and the black pigment synthesis gene, PPO, was found to be the candidate gene of sesame seed coat color. The comprehensively compared genomes of sesame landrace and modern variety produced massive useful genomic information, constituting a powerful tool to support genetic research, and molecular breeding of sesame. PMID:27536315

  3. Genomic variation in Salmonella enterica core genes for epidemiological typing

    Directory of Open Access Journals (Sweden)

    Leekitcharoenphon Pimlapas

    2012-03-01

    Full Text Available Abstract Background Technological advances in high throughput genome sequencing are making whole genome sequencing (WGS available as a routine tool for bacterial typing. Standardized procedures for identification of relevant genes and of variation are needed to enable comparison between studies and over time. The core genes--the genes that are conserved in all (or most members of a genus or species--are potentially good candidates for investigating genomic variation in phylogeny and epidemiology. Results We identify a set of 2,882 core genes clusters based on 73 publicly available Salmonella enterica genomes and evaluate their value as typing targets, comparing whole genome typing and traditional methods such as 16S and MLST. A consensus tree based on variation of core genes gives much better resolution than 16S and MLST; the pan-genome family tree is similar to the consensus tree, but with higher confidence. The core genes can be divided into two categories: a few highly variable genes and a larger set of conserved core genes, with low variance. For the most variable core genes, the variance in amino acid sequences is higher than for the corresponding nucleotide sequences, suggesting that there is a positive selection towards mutations leading to amino acid changes. Conclusions Genomic variation within the core genome is useful for investigating molecular evolution and providing candidate genes for bacterial genome typing. Identification of genes with different degrees of variation is important especially in trend analysis.

  4. Evolution of genome size and genomic GC content in carnivorous holokinetics (Droseraceae).

    Science.gov (United States)

    Veleba, Adam; Šmarda, Petr; Zedek, František; Horová, Lucie; Šmerda, Jakub; Bureš, Petr

    2017-02-01

    Studies in the carnivorous family Lentibulariaceae in the last years resulted in the discovery of the smallest plant genomes and an unusual pattern of genomic GC content evolution. However, scarcity of genomic data in other carnivorous clades still prevents a generalization of the observed patterns. Here the aim was to fill this gap by mapping genome evolution in the second largest carnivorous family, Droseraceae, where this evolution may be affected by chromosomal holokinetism in Drosera METHODS: The genome size and genomic GC content of 71 Droseraceae species were measured by flow cytometry. A dated phylogeny was constructed, and the evolution of both genomic parameters and their relationship to species climatic niches were tested using phylogeny-based statistics. The 2C genome size of Droseraceae varied between 488 and 10 927 Mbp, and the GC content ranged between 37·1 and 44·7 %. The genome sizes and genomic GC content of carnivorous and holocentric species did not differ from those of their non-carnivorous and monocentric relatives. The genomic GC content positively correlated with genome size and annual temperature fluctuations. The genome size and chromosome numbers were inversely correlated in the Australian clade of Drosera CONCLUSIONS: Our results indicate that neither carnivory (nutrient scarcity) nor the holokinetism have a prominent effect on size and DNA base composition of Droseraceae genomes. However, the holokinetic drive seems to affect karyotype evolution in one of the major clades of Drosera Our survey confirmed that the evolution of GC content is tightly connected with the evolution of genome size and also with environmental conditions. © The Author 2016. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  5. Genome-wide patterns of large-size presence/absence variants in sorghum

    Institute of Scientific and Technical Information of China (English)

    LiMin Zhang; Hong Luo; ZhiQuan Liu; Yi Zhao; JingChu Luo; DongYun Hao; HaiChun Jing

    2014-01-01

    The presence/absence variants (PAVs) are a major source of genome structural variation and have profound effects on phenotypic and genomic variation in animals and humans. However, little is understood about PAVs in plant genomes. Our previous resequencing effort on three sorghum (Sorghum bicolour L.) genomes, each 12? coverage, uncovered 5 364 PAVs. Here, we report a detailed characterization of 51 large-size (>30 kb) PAVs. These PAVs spanned a total size of 2.92 Mb of the sorghum genome containing 202 known and predicted genes, including 38 genes annotated to encode celldeath and stress response genes. The PAVs varied considerably for repeat sequences and mobile elements with DNA trans-posons as the major components. The frequency and distribution of these PAVs differed substantial y across 96 sorghum inbred lines, and the low-and high frequency PAVs differed in their gene categories. This report shed new light on the occurrence and diversity of PAVs in sorghum genomes. Our research exemplifies a new perspective to explore genome structural variation for genetic improvement in plant breeding.

  6. Genome Size Is a Strong Predictor of Root Meristem Growth Rate

    Directory of Open Access Journals (Sweden)

    Adam Gruner

    2010-01-01

    Full Text Available Variation in genome size (GS has been linked to several facets of the plant phenotype. Recently it was shown that GS is significantly correlated with cell size and the duration of the cell cycle. Here we test the hypothesis that GS might also be a predictor of apical root meristem growth rate (RMGR. We studied eight species of eudicots with varying GS using time-lapse microscopic image analysis. A significant negative exponential relationship was observed between GS and RMGR. Our results show significantly decreased RMGR for large genome species. This relationship represents a significant consequence of GS expansion in plants and may partly explain why genome sizes tend to be small in eudicots. Interestingly, parasitic plants, which do not rely on root growth as much, often have large genomes.

  7. Genomic variation at the tips of the adaptive radiation of Darwin's finches.

    Science.gov (United States)

    Chaves, Jaime A; Cooper, Elizabeth A; Hendry, Andrew P; Podos, Jeffrey; De León, Luis F; Raeymaekers, Joost A M; MacMillan, W Owen; Uy, J Albert C

    2016-11-01

    Adaptive radiation unfolds as selection acts on the genetic variation underlying functional traits. The nature of this variation can be revealed by studying the tips of an ongoing adaptive radiation. We studied genomic variation at the tips of the Darwin's finch radiation; specifically focusing on polymorphism within, and variation among, three sympatric species of the genus Geospiza. Using restriction site-associated DNA (RAD-seq), we characterized 32 569 single-nucleotide polymorphisms (SNPs), from which 11 outlier SNPs for beak and body size were uncovered by a genomewide association study (GWAS). Principal component analysis revealed that these 11 SNPs formed four statistically linked groups. Stepwise regression then revealed that the first PC score, which included 6 of the 11 top SNPs, explained over 80% of the variation in beak size, suggesting that selection on these traits influences multiple correlated loci. The two SNPs most strongly associated with beak size were near genes associated with beak morphology across deeper branches of the radiation: delta-like 1 homologue (DLK1) and high-mobility group AT-hook 2 (HMGA2). Our results suggest that (i) key adaptive traits are associated with a small fraction of the genome (11 of 32 569 SNPs), (ii) SNPs linked to the candidate genes are dispersed throughout the genome (on several chromosomes), and (iii) micro- and macro-evolutionary variation (roots and tips of the radiation) involve some shared and some unique genomic regions. © 2016 John Wiley & Sons Ltd.

  8. Genomic and karyotypic variation in Drosophila parasitoids (Hymenoptera, Cynipoidea, Figitidae

    Directory of Open Access Journals (Sweden)

    Vladimir Gokhman

    2011-08-01

    Full Text Available Drosophila melanogaster Meigen, 1830 has served as a model insect for over a century. Sequencing of the 11 additional Drosophila Fallen, 1823 species marks substantial progress in comparative genomics of this genus. By comparison, practically nothing is known about the genome size or genome sequences of parasitic wasps of Drosophila. Here, we present the first comparative analysis of genome size and karyotype structures of Drosophila parasitoids of the Leptopilina Förster, 1869 and Ganaspis Förster, 1869 species. The gametic genome size of Ganaspis xanthopoda (Ashmead, 1896 is larger than those of the three Leptopilina species studied. The genome sizes of all parasitic wasps studied here are also larger than those known for all Drosophila species. Surprisingly, genome sizes of these Drosophila parasitoids exceed the average value known for all previously studied Hymenoptera. The haploid chromosome number of both Leptopilina heterotoma (Thomson, 1862 and L. victoriae Nordlander, 1980 is ten. A chromosomal fusion appears to have produced a distinct karyotype for L. boulardi (Barbotin, Carton et Keiner-Pillault, 1979 (n = 9, whose genome size is smaller than that of wasps of the L. heterotoma clade. Like L. boulardi, the haploid chromosome number for G. xanthopoda is also nine. Our studies reveal a positive, but non linear, correlation between the genome size and total chromosome length in Drosophila parasitoids. These Drosophila parasitoids differ widely in their host range, and utilize different infection strategies to overcome host defense. Their comparative genomics, in relation to their exceptionally well-characterized hosts, will prove to be valuable for understanding the molecular basis of the host-parasite arms race and how such mechanisms shape the genetic structures of insect communities.

  9. Genome size is inversely correlated with relative brain size in parrots and cockatoos.

    Science.gov (United States)

    Andrews, Chandler B; Gregory, T Ryan

    2009-03-01

    Genome size (haploid nuclear DNA content) has been found to correlate positively with cell size and negatively with cell division rate in a variety of taxa. These cytological relationships manifest in various ways at the organism level, for example, in terms of body size, metabolic rate, or developmental rate, depending on the biology of the organisms. In birds, it has been suggested that high metabolic rate and strong flight ability are linked to small genome size. However, it was also hypothesized that the exceptional cognitive abilities of birds may impose additional constraints on genome size through effects on neuron size and differentiation, as has been observed in amphibians. To test this hypothesis, a comparative analysis was made between genome size, cell (erythrocyte) size, and brain size in 54 species of parrots and cockatoos (order Psittaciformes, family Psittacidae). Relative brain volume, which is taken as an indicator of investment in brain tissue and is widely correlated with behavioural and ecological traits, was found to correlate inversely with genome size. Several possible and mutually compatible explanations for this relationship are described.

  10. Methodology significantly affects genome size estimates: quantitative evidence using bryophytes.

    Science.gov (United States)

    Bainard, Jillian D; Fazekas, Aron J; Newmaster, Steven G

    2010-08-01

    Flow cytometry (FCM) is commonly used to determine plant genome size estimates. Methodology has improved and changed during the past three decades, and researchers are encouraged to optimize protocols for their specific application. However, this step is typically omitted or undescribed in the current plant genome size literature, and this omission could have serious consequences for the genome size estimates obtained. Using four bryophyte species (Brachythecium velutinum, Fissidens taxifolius, Hedwigia ciliata, and Thuidium minutulum), three methodological approaches to the use of FCM in plant genome size estimation were tested. These included nine different buffers (Baranyi's, de Laat's, Galbraith's, General Purpose, LB01, MgSO(4), Otto's, Tris.MgCl(2), and Woody Plant), seven propidium iodide (PI) staining periods (5, 10, 15, 20, 45, 60, and 120 min), and six PI concentrations (10, 25, 50, 100, 150, and 200 microg ml(-1)). Buffer, staining period and staining concentration all had a statistically significant effect (P = 0.05) on the genome size estimates obtained for all four species. Buffer choice and PI concentration had the greatest effect, altering the 1C-values by as much as 8% and 14%, respectively. As well, the quality of the data varied with the different methodology used. Using the methodology determined to be the most accurate in this study (LB01 buffer and PI staining for 20 min at 150 microg ml(-1)), three new genome size estimates were obtained: B. velutinum: 0.46 pg, H. ciliata: 0.30 pg, and T. minutulum: 0.46 pg. While the peak quality of flow cytometry histograms is important, researchers must consider that changes in methodology can also affect the relative peak positions and therefore the genome size estimates obtained for plants using FCM.

  11. Genome Size Is a Strong Predictor of Root Meristem Growth Rate

    OpenAIRE

    Adam Gruner; Nathan Hoverter; Tylia Smith; Charles A. Knight

    2010-01-01

    Variation in genome size (GS) has been linked to several facets of the plant phenotype. Recently it was shown that GS is significantly correlated with cell size and the duration of the cell cycle. Here we test the hypothesis that GS might also be a predictor of apical root meristem growth rate (RMGR). We studied eight species of eudicots with varying GS using time-lapse microscopic image analysis. A significant negative exponential relationship was observed between GS and RMGR. Our results sh...

  12. Progress in the detection of human genome structural variations

    Institute of Scientific and Technical Information of China (English)

    WU XueMei; XIAO HuaSheng

    2009-01-01

    The emerging of high.throughput and high-resolution genomic technologies led to the detection of submicroscopic variants ranging from 1 kb to 3 Mb in the human genome. These variants include copy number variations (CNVs), inversions, insertions, deletions and other complex rearrangements of DNA sequences. This paper briefly reviews the commonly used technologies to discover both genomic structural variants and their potential influences. Particularly, we highlight the array-based, PCR-based and sequencing-based assays, including array-based comparative genomic hybridization (aCGH),representational oligonucleotide microarray analysis (ROMA), multiplex amplifiable probe hybridization (MAPH), multiplex ligation-dependent probe amplification (MLPA), paired-end mapping (PEM), and next-generation DNA sequencing technologies. Furthermore, we discuss the limitations and challenges of current assays and give advices on how to make the database of genomic variations more reliable.

  13. Progress in the detection of human genome structural variations

    Institute of Scientific and Technical Information of China (English)

    2009-01-01

    The emerging of high-throughput and high-resolution genomic technologies led to the detection of submicroscopic variants ranging from 1 kb to 3 Mb in the human genome.These variants include copy number variations(CNVs),inversions,insertions,deletions and other complex rearrangements of DNA sequences.This paper briefly reviews the commonly used technologies to discover both genomic structural variants and their potential influences.Particularly,we highlight the array-based,PCR-based and sequencing-based assays,including array-based comparative genomic hybridization(aCGH),representational oligonucleotide microarray analysis(ROMA),multiplex amplifiable probe hybridization(MAPH),multiplex ligation-dependent probe amplification(MLPA),paired-end mapping(PEM),and next-generation DNA sequencing technologies.Furthermore,we discuss the limitations and challenges of current assays and give advices on how to make the database of genomic variations more reliable.

  14. Cyanobacteria Maintain Constant Protein Concentration despite Genome Copy-Number Variation.

    Science.gov (United States)

    Zheng, Xiao-Yu; O'Shea, Erin K

    2017-04-18

    The cyanobacterium Synechococcus elongatus PCC 7942 has multiple copies of its single chromosome, and the copy number varies in individual cells, providing an ideal system to study the effect of genome copy-number variation on cell size and gene expression. Using single-cell fluorescence imaging, we found that protein concentration remained constant across individual cells regardless of genome copy number. Cell volume and the total protein amount from a single gene were both positively, linearly correlated with genome copy number, suggesting that changes in cell volume play an important role in buffering genome copy-number variance. This study provides a quantitative examination of gene expression regulation in cells with variable genome copies and sheds light on the compensation mechanisms for variance in genome copy number. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  15. Total centromere size and genome size are strongly correlated in ten grass species.

    Science.gov (United States)

    Zhang, Han; Dawe, R Kelly

    2012-05-01

    It has been known for decades that centromere size varies across species, but the factors involved in setting centromere boundaries are unknown. As a means to address this question, we estimated centromere sizes in ten species of the grass family including rice, maize, and wheat, which diverged 60~80 million years ago and vary by 40-fold in genome size. Measurements were made using a broadly reactive antibody to rice centromeric histone H3 (CENH3). In species-wide comparisons, we found a clear linear relationship between total centromere size and genome size. Species with large genomes and few chromosomes tend to have the largest centromeres (e.g., rye) while species with small genomes and many chromosomes have the smallest centromeres (e.g., rice). However, within a species, centromere size is surprisingly uniform. We present evidence from three oat-maize addition lines that support this claim, indicating that each of three maize centromeres propagated in oat are not measurably different from each other. In the context of previously published data, our results suggest that the apparent correlation between chromosome and centromere size is incidental to a larger trend that reflects genome size. Centromere size may be determined by a limiting component mechanism similar to that described for Caenorhabditis elegans centrosomes.

  16. Variation in salamanders: an essay on genomes, development, and evolution.

    Science.gov (United States)

    Brockes, Jeremy P

    2015-01-01

    Regeneration is studied in a few model species of salamanders, but the ten families of salamanders show considerable variation, and this has implications for our understanding of salamander biology. The most recent classification of the families identifies the cryptobranchoidea as the basal group which diverged in the early Jurassic. Variation in the sizes of genomes is particularly obvious, and reflects a major contribution from transposable elements which is already present in the basal group.Limb development has been a focus for evodevo studies, in part because of the variable property of pre-axial dominance which distinguishes salamanders from other tetrapods. This is thought to reflect the selective pressures that operate on a free-living aquatic larva, and might also be relevant for the evolution of limb regeneration. Recent fossil evidence suggests that both pre-axial dominance and limb regeneration were present 300 million years ago in larval temnospondyl amphibians that lived in mountain lakes. A satisfying account of regeneration in salamanders may need to address all these different aspects in the future.

  17. Correlated evolution of LTR retrotransposons and genome size in the genus Eleocharis.

    Science.gov (United States)

    Zedek, František; Smerda, Jakub; Smarda, Petr; Bureš, Petr

    2010-11-30

    Transposable elements (TEs) are considered to be an important source of genome size variation and genetic and phenotypic plasticity in eukaryotes. Most of our knowledge about TEs comes from large genomic projects and studies focused on model organisms. However, TE dynamics among related taxa from natural populations and the role of TEs at the species or supra-species level, where genome size and karyotype evolution are modulated in concert with polyploidy and chromosomal rearrangements, remain poorly understood. We focused on the holokinetic genus Eleocharis (Cyperaceae), which displays large variation in genome size and the occurrence of polyploidy and agmatoploidy/symploidy. We analyzed and quantified the long terminal repeat (LTR) retrotransposons Ty1-copia and Ty3-gypsy in relation to changes in both genome size and karyotype in Eleocharis. We also examined how this relationship is reflected in the phylogeny of Eleocharis. Using flow cytometry, we measured the genome sizes of members of the genus Eleocharis (Cyperaceae). We found positive correlation between the independent phylogenetic contrasts of genome size and chromosome number in Eleocharis. We analyzed PCR-amplified sequences of various reverse transcriptases of the LTR retrotransposons Ty1-copia and Ty3-gypsy (762 sequences in total). Using real-time PCR and dot blot approaches, we quantified the densities of Ty1-copia and Ty3-gypsy within the genomes of the analyzed species. We detected an increasing density of Ty1-copia elements in evolutionarily younger Eleocharis species and found a positive correlation between Ty1-copia densities and C/n-values (an alternative measure of monoploid genome size) in the genus phylogeny. In addition, our analysis of Ty1-copia sequences identified a novel retrotransposon family named Helos1, which is responsible for the increasing density of Ty1-copia. The transition:transversion ratio of Helos1 sequences suggests that Helos1 recently transposed in later

  18. Phenotypic impact of genomic structural variation

    DEFF Research Database (Denmark)

    Weischenfeldt, Joachim; Symmons, Orsolya; Spitz, François;

    2013-01-01

    Genomic structural variants have long been implicated in phenotypic diversity and human disease, but dissecting the mechanisms by which they exert their functional impact has proven elusive. Recently however, developments in high-throughput DNA sequencing and chromosomal engineering technology have...... facilitated the analysis of structural variants in human populations and model systems in unprecedented detail. In this Review, we describe how structural variants can affect molecular and cellular processes, leading to complex organismal phenotypes, including human disease. We further present advances...

  19. Regulatory change at Physalis Organ Size 1 correlates to natural variation in tomatillo reproductive organ size.

    Science.gov (United States)

    Wang, Li; He, Lingli; Li, Jing; Zhao, Jing; Li, Zhichao; He, Chaoying

    2014-07-01

    The genetic basis of size variation in the reproductive organs of tomatillo (Physalis philadelphica) is unknown. Here we report that the expression levels of the gene Physalis Organ Size 1 (POS1) are positively associated with size variation in P. philadelphica reproductive organs such flowers, berries and seeds. POS1 knockdown results in smaller flowers and berries with smaller cells as compared with their wild-type counterparts. Conversely, POS1 overexpression promotes organ size without increasing the cell number. The first introns of the POS1 alleles from the large, intermediate and small tomatillo groups contain one, two and three 37-bp repeats, respectively. Furthermore, our results show that copy variation of repeats in the first intron of POS1 alleles results in differential expression of this gene. Thus, co-variation in tomatillo reproductive organ sizes can be attributed to the novel regulatory variation in POS1.

  20. ENGINES: exploring single nucleotide variation in entire human genomes

    Directory of Open Access Journals (Sweden)

    Salas Antonio

    2011-04-01

    Full Text Available Abstract Background Next generation ultra-sequencing technologies are starting to produce extensive quantities of data from entire human genome or exome sequences, and therefore new software is needed to present and analyse this vast amount of information. The 1000 Genomes project has recently released raw data for 629 complete genomes representing several human populations through their Phase I interim analysis and, although there are certain public tools available that allow exploration of these genomes, to date there is no tool that permits comprehensive population analysis of the variation catalogued by such data. Description We have developed a genetic variant site explorer able to retrieve data for Single Nucleotide Variation (SNVs, population by population, from entire genomes without compromising future scalability and agility. ENGINES (ENtire Genome INterface for Exploring SNVs uses data from the 1000 Genomes Phase I to demonstrate its capacity to handle large amounts of genetic variation (>7.3 billion genotypes and 28 million SNVs, as well as deriving summary statistics of interest for medical and population genetics applications. The whole dataset is pre-processed and summarized into a data mart accessible through a web interface. The query system allows the combination and comparison of each available population sample, while searching by rs-number list, chromosome region, or genes of interest. Frequency and FST filters are available to further refine queries, while results can be visually compared with other large-scale Single Nucleotide Polymorphism (SNP repositories such as HapMap or Perlegen. Conclusions ENGINES is capable of accessing large-scale variation data repositories in a fast and comprehensive manner. It allows quick browsing of whole genome variation, while providing statistical information for each variant site such as allele frequency, heterozygosity or FST values for genetic differentiation. Access to the data mart

  1. Genome Architecture and Its Roles in Human Copy Number Variation

    Directory of Open Access Journals (Sweden)

    Lu Chen

    2014-12-01

    Full Text Available Besides single-nucleotide variants in the human genome, large-scale genomic variants, such as copy number variations (CNVs, are being increasingly discovered as a genetic source of human diversity and the pathogenic factors of diseases. Recent experimental findings have shed light on the links between different genome architectures and CNV mutagenesis. In this review, we summarize various genomic features and discuss their contributions to CNV formation. Genomic repeats, including both low-copy and high-copy repeats, play important roles in CNV instability, which was initially known as DNA recombination events. Furthermore, it has been found that human genomic repeats can also induce DNA replication errors and consequently result in CNV mutations. Some recent studies showed that DNA replication timing, which reflects the high-order information of genomic organization, is involved in human CNV mutations. Our review highlights that genome architecture, from DNA sequence to high-order genomic organization, is an important molecular factor in CNV mutagenesis and human genomic instability.

  2. Intron Derived Size Polymorphism in the Mitochondrial Genomes of Closely Related Chrysoporthe Species.

    Science.gov (United States)

    Kanzi, Aquillah Mumo; Wingfield, Brenda Diana; Steenkamp, Emma Theodora; Naidoo, Sanushka; van der Merwe, Nicolaas Albertus

    2016-01-01

    In this study, the complete mitochondrial (mt) genomes of Chrysoporthe austroafricana (190,834 bp), C. cubensis (89,084 bp) and C. deuterocubensis (124,412 bp) were determined. Additionally, the mitochondrial genome of another member of the Cryphonectriaceae, namely Cryphonectria parasitica (158,902 bp), was retrieved and annotated for comparative purposes. These genomes showed high levels of synteny, especially in regions including genes involved in oxidative phosphorylation and electron transfer, unique open reading frames (uORFs), ribosomal RNAs (rRNAs) and transfer RNAs (tRNAs), as well as intron positions. Comparative analyses revealed signatures of duplication events, intron number and length variation, and varying intronic ORFs which highlighted the genetic diversity of mt genomes among the Cryphonectriaceae. These mt genomes showed remarkable size polymorphism. The size polymorphism in the mt genomes of these closely related Chrysoporthe species was attributed to the varying number and length of introns, coding sequences and to a lesser extent, intergenic sequences. Compared to publicly available fungal mt genomes, the C. austroafricana mt genome is the second largest in the Ascomycetes thus far.

  3. Determination of sample size in genome-scale RNAi screens.

    Science.gov (United States)

    Zhang, Xiaohua Douglas; Heyse, Joseph F

    2009-04-01

    For genome-scale RNAi research, it is critical to investigate sample size required for the achievement of reasonably low false negative rate (FNR) and false positive rate. The analysis in this article reveals that current design of sample size contributes to the occurrence of low signal-to-noise ratio in genome-scale RNAi projects. The analysis suggests that (i) an arrangement of 16 wells per plate is acceptable and an arrangement of 20-24 wells per plate is preferable for a negative control to be used for hit selection in a primary screen without replicates; (ii) in a confirmatory screen or a primary screen with replicates, a sample size of 3 is not large enough, and there is a large reduction in FNRs when sample size increases from 3 to 4. To search a tradeoff between benefit and cost, any sample size between 4 and 11 is a reasonable choice. If the main focus is the selection of siRNAs with strong effects, a sample size of 4 or 5 is a good choice. If we want to have enough power to detect siRNAs with moderate effects, sample size needs to be 8, 9, 10 or 11. These discoveries about sample size bring insight to the design of a genome-scale RNAi screen experiment.

  4. Exploring variation in active network size : Constraints and ego characteristics

    NARCIS (Netherlands)

    Roberts, Sam G. B.; Dunbar, Robin I. M.; Pollet, Thomas V.; Kuppens, Toon

    2009-01-01

    Studies of active personal networks have primarily focused on providing reliable estimates of the size of the network. In this study, we examine how compositional properties of the network and ego characteristics are related to Variation in network size. There was a negative relationship between mea

  5. The Arabidopsis lyrata genome sequence and the basis of rapid genome size change

    Energy Technology Data Exchange (ETDEWEB)

    Hu, Tina T.; Pattyn, Pedro; Bakker, Erica G.; Cao, Jun; Cheng, Jan-Fang; Clark, Richard M.; Fahlgren, Noah; Fawcett, Jeffrey A.; Grimwood, Jane; Gundlach, Heidrun; Haberer, Georg; Hollister, Jesse D.; Ossowski, Stephan; Ottilar, Robert P.; Salamov, Asaf A.; Schneeberger, Korbinian; Spannagl, Manuel; Wang, Xi; Yang, Liang; Nasrallah, Mikhail E.; Bergelson, Joy; Carrington, James C.; Gaut, Brandon S.; Schmutz, Jeremy; Mayer, Klaus F. X.; Van de Peer, Yves; Grigoriev, Igor V.; Nordborg, Magnus; Weigel, Detlef; Guo, Ya-Long

    2011-04-29

    In our manuscript, we present a high-quality genome sequence of the Arabidopsis thaliana relative, Arabidopsis lyrata, produced by dideoxy sequencing. We have performed the usual types of genome analysis (gene annotation, dN/dS studies etc. etc.), but this is relegated to the Supporting Information. Instead, we focus on what was a major motivation for sequencing this genome, namely to understand how A. thaliana lost half its genome in a few million years and lived to tell the tale. The rather surprising conclusion is that there is not a single genomic feature that accounts for the reduced genome, but that every aspect centromeres, intergenic regions, transposable elements, gene family number is affected through hundreds of thousands of cuts. This strongly suggests that overall genome size in itself is what has been under selection, a suggestion that is strongly supported by our demonstration (using population genetics data from A. thaliana) that new deletions seem to be driven to fixation.

  6. Geographic variation in body size and sexual size dimorphism of a seed-feeding beetle.

    Science.gov (United States)

    Stillwell, R Craig; Morse, Geoffrey E; Fox, Charles W

    2007-09-01

    Body size of many animals varies with latitude: body size is either larger at higher latitudes (Bergmann's rule) or smaller at higher latitudes (converse Bergmann's rule). However, the causes underlying these patterns are poorly understood. Also, studies rarely explore how sexual size dimorphism varies with latitude. Here we investigate geographic variation in body size and sexual size dimorphism of the seed-feeding beetle Stator limbatus, collected from 95 locations along a 38 degrees range in latitude. We examine 14 variables to test whether clines in environmental factors are adequate to explain geographic patterns of body size. We found that body size and sexual size dimorphism of S. limbatus varied considerably with latitude; beetles were smaller but more dimorphic at lower latitudes. Body size was not correlated with a gradient in mean temperature, contrary to the commonly accepted hypothesis that clines are produced by latitudinal gradients in temperature. Instead, we found that three factors were adequate to explain the cline in body size: clinal variation in host plant seed size, moisture (humidity), and seasonality (variance in humidity, precipitation, and temperature). We also found that the cline in sexual size dimorphism was partially explainable by a gradient in moisture, though moisture alone was not sufficient to explain the cline. Other ecological or environmental variables must necessarily contribute to differences in selection on male versus female body size. The main implications of our study are that the sexes differ in the magnitude of clinal variation in body size, creating latitudinal variation in sexual size dimorphism, and that clines in body size of seed beetles are likely influenced by variation in host seed size, water availability, and seasonality.

  7. Stomatal vs. genome size in angiosperms: the somatic tail wagging the genomic dog?

    Science.gov (United States)

    Hodgson, J G; Sharafi, M; Jalili, A; Díaz, S; Montserrat-Martí, G; Palmer, C; Cerabolini, B; Pierce, S; Hamzehee, B; Asri, Y; Jamzad, Z; Wilson, P; Raven, J A; Band, S R; Basconcelo, S; Bogard, A; Carter, G; Charles, M; Castro-Díez, P; Cornelissen, J H C; Funes, G; Jones, G; Khoshnevis, M; Pérez-Harguindeguy, N; Pérez-Rontomé, M C; Shirvany, F A; Vendramini, F; Yazdani, S; Abbas-Azimi, R; Boustani, S; Dehghan, M; Guerrero-Campo, J; Hynd, A; Kowsary, E; Kazemi-Saeed, F; Siavash, B; Villar-Salvador, P; Craigie, R; Naqinezhad, A; Romo-Díez, A; de Torres Espuny, L; Simmons, E

    2010-04-01

    Genome size is a function, and the product, of cell volume. As such it is contingent on ecological circumstance. The nature of 'this ecological circumstance' is, however, hotly debated. Here, we investigate for angiosperms whether stomatal size may be this 'missing link': the primary determinant of genome size. Stomata are crucial for photosynthesis and their size affects functional efficiency. Stomatal and leaf characteristics were measured for 1442 species from Argentina, Iran, Spain and the UK and, using PCA, some emergent ecological and taxonomic patterns identified. Subsequently, an assessment of the relationship between genome-size values obtained from the Plant DNA C-values database and measurements of stomatal size was carried out. Stomatal size is an ecologically important attribute. It varies with life-history (woody species angiosperms. Correlation is not, however, proof of causality and here our interpretation is hampered by unexpected deficiencies in the scientific literature. Firstly, there are discrepancies between our own observations and established ideas about the ecological significance of stomatal size; very large stomata, theoretically facilitating photosynthesis in deep shade, were, in this study (and in other studies), primarily associated with vernal geophytes of unshaded habitats. Secondly, the lower size limit at which stomata can function efficiently, and the ecological circumstances under which these minute stomata might occur, have not been satisfactorally resolved. Thus, our hypothesis, that the optimization of stomatal size for functional efficiency is a major ecological determinant of genome size, remains unproven.

  8. Insights into the evolution of mitochondrial genome size from complete sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae).

    Science.gov (United States)

    Alverson, Andrew J; Wei, XiaoXin; Rice, Danny W; Stern, David B; Barry, Kerrie; Palmer, Jeffrey D

    2010-06-01

    The mitochondrial genomes of seed plants are unusually large and vary in size by at least an order of magnitude. Much of this variation occurs within a single family, the Cucurbitaceae, whose genomes range from an estimated 390 to 2,900 kb in size. We sequenced the mitochondrial genomes of Citrullus lanatus (watermelon: 379,236 nt) and Cucurbita pepo (zucchini: 982,833 nt)--the two smallest characterized cucurbit mitochondrial genomes--and determined their RNA editing content. The relatively compact Citrullus mitochondrial genome actually contains more and longer genes and introns, longer segmental duplications, and more discernibly nuclear-derived DNA. The large size of the Cucurbita mitochondrial genome reflects the accumulation of unprecedented amounts of both chloroplast sequences (>113 kb) and short repeated sequences (>370 kb). A low mutation rate has been hypothesized to underlie increases in both genome size and RNA editing frequency in plant mitochondria. However, despite its much larger genome, Cucurbita has a significantly higher synonymous substitution rate (and presumably mutation rate) than Citrullus but comparable levels of RNA editing. The evolution of mutation rate, genome size, and RNA editing are apparently decoupled in Cucurbitaceae, reflecting either simple stochastic variation or governance by different factors.

  9. Salmon and steelhead genetics and genomics - Epigenetic and genomic variation in salmon and steelhead

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — Conduct analyses of epigenetic and genomic variation in Chinook salmon and steelhead to determine influence on phenotypic expression of life history traits. Genetic,...

  10. Patterns of genome size diversity in bats (order Chiroptera).

    Science.gov (United States)

    Smith, Jillian D L; Bickham, John W; Gregory, T Ryan

    2013-08-01

    Despite being a group of particular interest in considering relationships between genome size and metabolic parameters, bats have not been well studied from this perspective. This study presents new estimates for 121 "microbat" species from 12 families and complements a previous study on members of the family Pteropodidae ("megabats"). The results confirm that diversity in genome size in bats is very limited even compared with other mammals, varying approximately 2-fold from 1.63 pg in Lophostoma carrikeri to 3.17 pg in Rhinopoma hardwickii and averaging only 2.35 pg ± 0.02 SE (versus 3.5 pg overall for mammals). However, contrary to some other vertebrate groups, and perhaps owing to the narrow range observed, genome size correlations were not apparent with any chromosomal, physiological, flight-related, developmental, or ecological characteristics within the order Chiroptera. Genome size is positively correlated with measures of body size in bats, though the strength of the relationships differs between pteropodids ("megabats") and nonpteropodids ("microbats").

  11. Genome-wide profiling of genetic variation in Agrobacterium-transformed rice plants*#

    Science.gov (United States)

    Li, Wen-xu; Wu, San-ling; Liu, Yan-hua; Jin, Gu-lei; Zhao, Hai-jun; Fan, Long-jiang; Shu, Qing-yao

    2016-01-01

    Agrobacterium-mediated transformation has been widely used in producing transgenic plants, and was recently used to generate “transgene-clean” targeted genomic modifications coupled with the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas9) system. Although tremendous variation in morphological and agronomic traits, such as plant height, seed fertility, and grain size, was observed in transgenic plants, the underlying mechanisms are not yet well understood, and the types and frequency of genetic variation in transformed plants have not been fully disclosed. To reveal the genome-wide variation in transformed plants, we sequenced the genomes of five independent T0 rice plants using next-generation sequencing (NGS) techniques. Bioinformatics analyses followed by experimental validation revealed the following: (1) in addition to transfer-DNA (T-DNA) insertions, three transformed plants carried heritable plasmid backbone DNA of variable sizes (855–5216 bp) and in different configurations with the T-DNA insertions (linked or apart); (2) each transgenic plant contained an estimated 338–1774 independent genetic variations (single nucleotide variations (SNVs) or small insertion/deletions); and (3) 2–6 new Tos17 insertions were detected in each transformed plant, but no other transposable elements or bacterial genomic DNA. PMID:27921404

  12. Sizing ocean giants: patterns of intraspecific size variation in marine megafauna

    Directory of Open Access Journals (Sweden)

    Craig R. McClain

    2015-01-01

    Full Text Available What are the greatest sizes that the largest marine megafauna obtain? This is a simple question with a difficult and complex answer. Many of the largest-sized species occur in the world’s oceans. For many of these, rarity, remoteness, and quite simply the logistics of measuring these giants has made obtaining accurate size measurements difficult. Inaccurate reports of maximum sizes run rampant through the scientific literature and popular media. Moreover, how intraspecific variation in the body sizes of these animals relates to sex, population structure, the environment, and interactions with humans remains underappreciated. Here, we review and analyze body size for 25 ocean giants ranging across the animal kingdom. For each taxon we document body size for the largest known marine species of several clades. We also analyze intraspecific variation and identify the largest known individuals for each species. Where data allows, we analyze spatial and temporal intraspecific size variation. We also provide allometric scaling equations between different size measurements as resources to other researchers. In some cases, the lack of data prevents us from fully examining these topics and instead we specifically highlight these deficiencies and the barriers that exist for data collection. Overall, we found considerable variability in intraspecific size distributions from strongly left- to strongly right-skewed. We provide several allometric equations that allow for estimation of total lengths and weights from more easily obtained measurements. In several cases, we also quantify considerable geographic variation and decreases in size likely attributed to humans.

  13. Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping

    DEFF Research Database (Denmark)

    Zhan, Bujie; Fadista, João; Thomsen, Bo

    2011-01-01

    sequence of a single Holstein Friesian bull with data from single nucleotide polymorphism (SNP) and comparative genomic hybridization (CGH) array technologies to determine a comprehensive spectrum of genomic variation. The performance of resequencing SNP detection was assessed by combining SNPs that were...... of split-read and read-pair approaches proved to be complementary in finding different signatures. CNVs were identified on the basis of the depth of sequenced reads, and by using SNP and CGH arrays. Conclusions Our results provide high resolution mapping of diverse classes of genomic variation...

  14. The bat genome: GC-biased small chromosomes associated with reduction in genome size.

    Science.gov (United States)

    Kasai, Fumio; O'Brien, Patricia C M; Ferguson-Smith, Malcolm A

    2013-12-01

    Bats are distinct from other mammals in their small genome size as well as their high metabolic rate, possibly related to flight ability. Although the genome sequence has been published in two species, the data lack cytogenetic information. In this study, the size and GC content of each chromosome are measured from the flow karyotype of the mouse-eared bat, Myotis myotis (MMY). The smaller chromosomes are GC-rich compared to the larger chromosomes, and the relative proportions of homologous segments between MMY and human differ among the MMY chromosomes. The MMY genome size calculated from the sum of the chromosome sizes is 2.25 Gb, and the total GC content is 42.3%, compared to human and dog with 41.0 and 41.2%, respectively. The GC-rich small MMY genome is characterised by GC-biased smaller chromosomes resulting from preferential loss of AT-rich sequences. Although the association between GC-rich small chromosomes and small genome size has been reported only in birds so far, we show in this paper, for the first time, that the same phenomenon is observed in at least one group of mammals, implying that this may be a mechanism common to genome evolution in general.

  15. Population genetic inference from personal genome data: impact of ancestry and admixture on human genomic variation.

    Science.gov (United States)

    Kidd, Jeffrey M; Gravel, Simon; Byrnes, Jake; Moreno-Estrada, Andres; Musharoff, Shaila; Bryc, Katarzyna; Degenhardt, Jeremiah D; Brisbin, Abra; Sheth, Vrunda; Chen, Rong; McLaughlin, Stephen F; Peckham, Heather E; Omberg, Larsson; Bormann Chung, Christina A; Stanley, Sarah; Pearlstein, Kevin; Levandowsky, Elizabeth; Acevedo-Acevedo, Suehelay; Auton, Adam; Keinan, Alon; Acuña-Alonzo, Victor; Barquera-Lozano, Rodrigo; Canizales-Quinteros, Samuel; Eng, Celeste; Burchard, Esteban G; Russell, Archie; Reynolds, Andy; Clark, Andrew G; Reese, Martin G; Lincoln, Stephen E; Butte, Atul J; De La Vega, Francisco M; Bustamante, Carlos D

    2012-10-05

    Full sequencing of individual human genomes has greatly expanded our understanding of human genetic variation and population history. Here, we present a systematic analysis of 50 human genomes from 11 diverse global populations sequenced at high coverage. Our sample includes 12 individuals who have admixed ancestry and who have varying degrees of recent (within the last 500 years) African, Native American, and European ancestry. We found over 21 million single-nucleotide variants that contribute to a 1.75-fold range in nucleotide heterozygosity across diverse human genomes. This heterozygosity ranged from a high of one heterozygous site per kilobase in west African genomes to a low of 0.57 heterozygous sites per kilobase in segments inferred to have diploid Native American ancestry from the genomes of Mexican and Puerto Rican individuals. We show evidence of all three continental ancestries in the genomes of Mexican, Puerto Rican, and African American populations, and the genome-wide statistics are highly consistent across individuals from a population once ancestry proportions have been accounted for. Using a generalized linear model, we identified subtle variations across populations in the proportion of neutral versus deleterious variation and found that genome-wide statistics vary in admixed populations even once ancestry proportions have been factored in. We further infer that multiple periods of gene flow shaped the diversity of admixed populations in the Americas-70% of the European ancestry in today's African Americans dates back to European gene flow happening only 7-8 generations ago.

  16. Genome-wide sequence variations among Mycobacterium avium subspecies paratuberculosis.

    Directory of Open Access Journals (Sweden)

    Chung-Yi eHsu

    2011-12-01

    Full Text Available Mycobacterium avium subspecies paratuberculosis (M. ap, the causative agent of Johne’s disease (JD, infects many farmed ruminants, wildlife animals and humans. To better understand the molecular pathogenesis of these infections, we analyzed the whole genome sequences of several M. ap and M. avium subspecies avium (M. avium strains isolated from various hosts and environments. Using Next-generation sequencing technology, all 6 M. ap isolates showed a high percentage of homology (98% to the reference genome sequence of M. ap K-10 isolated from cattle. However, 2 M. avium isolates (DT 78 and Env 77 showed significant sequence diversity from the reference strain M. avium 104. The genomes of M. avium isolates DT 78 and Env 77 exhibited only 87% and 40% homology, respectively, to the M. avium 104 reference genome. Within the M. ap isolates, genomic rearrangements (insertions/deletions, Indels were not detected, and only unique single nucleotide polymorphisms (SNPs were observed among the 6 M. ap strains. While most of the SNPs (~100 in M. ap genomes were non-synonymous, a total of ~ 6000 SNPs were detected among M. avium genomes, most of them were synonymous suggesting a differential selective pressure between M. ap and M. avium isolates. In addition, SNPs-based phylo-genomic analysis showed that isolates from goat and Oryx are closely related to the cattle (K-10 strain while the human isolate (M. ap 4B is closely related to the environmental strains, indicating environmental source to human infections. Overall, SNPs were the most common variations among M. ap isolates while SNPs in addition to Indels were prevalent among M. avium isolates. Genomic variations will be useful in designing host-specific markers for the analysis of mycobacterial evolution and for developing novel diagnostics directed against Johne’s disease in animals.

  17. The African Genome Variation Project shapes medical genetics in Africa.

    Science.gov (United States)

    Gurdasani, Deepti; Carstensen, Tommy; Tekola-Ayele, Fasil; Pagani, Luca; Tachmazidou, Ioanna; Hatzikotoulas, Konstantinos; Karthikeyan, Savita; Iles, Louise; Pollard, Martin O; Choudhury, Ananyo; Ritchie, Graham R S; Xue, Yali; Asimit, Jennifer; Nsubuga, Rebecca N; Young, Elizabeth H; Pomilla, Cristina; Kivinen, Katja; Rockett, Kirk; Kamali, Anatoli; Doumatey, Ayo P; Asiki, Gershim; Seeley, Janet; Sisay-Joof, Fatoumatta; Jallow, Muminatou; Tollman, Stephen; Mekonnen, Ephrem; Ekong, Rosemary; Oljira, Tamiru; Bradman, Neil; Bojang, Kalifa; Ramsay, Michele; Adeyemo, Adebowale; Bekele, Endashaw; Motala, Ayesha; Norris, Shane A; Pirie, Fraser; Kaleebu, Pontiano; Kwiatkowski, Dominic; Tyler-Smith, Chris; Rotimi, Charles; Zeggini, Eleftheria; Sandhu, Manjinder S

    2015-01-15

    Given the importance of Africa to studies of human origins and disease susceptibility, detailed characterization of African genetic diversity is needed. The African Genome Variation Project provides a resource with which to design, implement and interpret genomic studies in sub-Saharan Africa and worldwide. The African Genome Variation Project represents dense genotypes from 1,481 individuals and whole-genome sequences from 320 individuals across sub-Saharan Africa. Using this resource, we find novel evidence of complex, regionally distinct hunter-gatherer and Eurasian admixture across sub-Saharan Africa. We identify new loci under selection, including loci related to malaria susceptibility and hypertension. We show that modern imputation panels (sets of reference genotypes from which unobserved or missing genotypes in study sets can be inferred) can identify association signals at highly differentiated loci across populations in sub-Saharan Africa. Using whole-genome sequencing, we demonstrate further improvements in imputation accuracy, strengthening the case for large-scale sequencing efforts of diverse African haplotypes. Finally, we present an efficient genotype array design capturing common genetic variation in Africa.

  18. Massive genomic variation and strong selection in Arabidopsis thaliana lines from Sweden

    Science.gov (United States)

    Platzer, Alexander; Zhang, Qingrun; Vilhjálmsson, Bjarni J; Korte, Arthur; Nizhynska, Viktoria; Voronin, Viktor; Korte, Pamela; Sedman, Laura; Mandáková, Terezie; Lysak, Martin A; Seren, Ümit; Hellmann, Ines; Nordborg, Magnus

    2013-01-01

    Despite advances in sequencing, the goal of obtaining a comprehensive view of genetic variation in populations is still far from reached. We sequenced 180 lines of A. thaliana from Sweden to obtain as complete a picture as possible of variation in a single region. Whereas simple polymorphisms in the unique portion of the genome are readily identified, other polymorphisms are not. The massive variation in genome size identified by flow cytometry seems largely to be due to 45S rDNA copy number variation, with lines from northern Sweden having particularly large numbers of copies. Strong selection is evident in the form of long-range linkage disequilibrium (LD), as well as in LD between nearby compensatory mutations. Many footprints of selective sweeps were found in lines from northern Sweden, and a massive global sweep was shown to have involved a 700-kb transposition. PMID:23793030

  19. Genomic Copy Number Variation in Disorders of Cognitive Development

    Science.gov (United States)

    Morrow, Eric M.

    2010-01-01

    Objective: To highlight recent discoveries in the area of genomic copy number variation in neuropsychiatric disorders including intellectual disability, autism, and schizophrenia. To emphasize new principles emerging from this area, involving the genetic architecture of disease, pathophysiology, and diagnosis. Method: Review of studies published…

  20. Mapping copy number variation by population-scale genome sequencing

    DEFF Research Database (Denmark)

    Mills, Ryan E.; Walter, Klaudia; Stewart, Chip;

    2011-01-01

    Genomic structural variants (SVs) are abundant in humans, differing from other forms of variation in extent, origin and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (that is, ...

  1. Repetitive elements, architects of genomic variation in Verticillium

    Science.gov (United States)

    Vascular wilt pathogens in the genus Verticillium show considerable variation with respect to their host ranges, genomic organization, and the variety and number of transposable elements (TEs) that they carry. These families of TE sequences were first documented in the wide host range, plant pathog...

  2. Global DNA cytosine methylation as an evolving trait: phylogenetic signal and correlated evolution with genome size in Angiosperms

    Directory of Open Access Journals (Sweden)

    Conchita eAlonso

    2015-01-01

    Full Text Available DNA cytosine methylation is a widespread epigenetic mechanism in eukaryotes, and plant genomes commonly are densely methylated. Genomic methylation can be associated with functional consequences such as mutational events, genomic instability or altered gene expression, but little is known on interspecific variation in global cytosine methylation in plants. In this paper, we compare global cytosine methylation estimates obtained by HPLC and use a phylogenetically-informed analytical approach to test for significance of evolutionary signatures of this trait across 54 angiosperm species in 25 families. We evaluate whether interspecific variation in global cytosine methylation is statistically related to phylogenetic distance and also whether it is evolutionarily correlated with genome size (C-value. Global cytosine methylation varied widely between species, ranging between 5.3% (Arabidopsis and 39.2% (Narcissus. Differences between species were related to their evolutionary trajectories, as denoted by the strong phylogenetic signal underlying interspecific variation. Global cytosine methylation and genome size were evolutionarily correlated, as revealed by the significant relationship between the corresponding phylogenetically independent contrasts. On average, a ten-fold increase in genome size entailed an increase of about 10% in global cytosine methylation. Results show that global cytosine methylation is an evolving trait in angiosperms whose evolutionary trajectory is significantly linked to changes in genome size, and suggest that the evolutionary implications of epigenetic mechanisms are likely to vary between plant lineages.

  3. Genome-wide detection of copy number variations among diverse horse breeds by array CGH.

    Science.gov (United States)

    Wang, Wei; Wang, Shenyuan; Hou, Chenglin; Xing, Yanping; Cao, Junwei; Wu, Kaifeng; Liu, Chunxia; Zhang, Dong; Zhang, Li; Zhang, Yanru; Zhou, Huanmin

    2014-01-01

    Recent studies have found that copy number variations (CNVs) are widespread in human and animal genomes. CNVs are a significant source of genetic variation, and have been shown to be associated with phenotypic diversity. However, the effect of CNVs on genetic variation in horses is not well understood. In the present study, CNVs in 6 different breeds of mare horses, Mongolia horse, Abaga horse, Hequ horse and Kazakh horse (all plateau breeds) and Debao pony and Thoroughbred, were determined using aCGH. In total, seven hundred CNVs were identified ranging in size from 6.1 Kb to 0.57 Mb across all autosomes, with an average size of 43.08 Kb and a median size of 15.11 Kb. By merging overlapping CNVs, we found a total of three hundred and fifty-three CNV regions (CNVRs). The length of the CNVRs ranged from 6.1 Kb to 1.45 Mb with average and median sizes of 38.49 Kb and 13.1 Kb. Collectively, 13.59 Mb of copy number variation was identified among the horses investigated and accounted for approximately 0.61% of the horse genome sequence. Five hundred and eighteen annotated genes were affected by CNVs, which corresponded to about 2.26% of all horse genes. Through the gene ontology (GO), genetic pathway analysis and comparison of CNV genes among different breeds, we found evidence that CNVs involving 7 genes may be related to the adaptation to severe environment of these plateau horses. This study is the first report of copy number variations in Chinese horses, which indicates that CNVs are ubiquitous in the horse genome and influence many biological processes of the horse. These results will be helpful not only in mapping the horse whole-genome CNVs, but also to further research for the adaption to the high altitude severe environment for plateau horses.

  4. Quantitative metagenomic analyses based on average genome size normalization

    DEFF Research Database (Denmark)

    Frank, Jeremy Alexander; Sørensen, Søren Johannes

    2011-01-01

    Over the past quarter-century, microbiologists have used DNA sequence information to aid in the characterization of microbial communities. During the last decade, this has expanded from single genes to microbial community genomics, or metagenomics, in which the gene content of an environment can...... by estimating average genome sizes. This normalization can relieve comparative biases introduced by differences in community structure, number of sequencing reads, and sequencing read lengths between different metagenomes. We demonstrate the utility of this approach by comparing metagenomes from two different...... marine sources using both conventional small-subunit (SSU) rRNA gene analyses and our quantitative method to calculate the proportion of genomes in each sample that are capable of a particular metabolic trait. With both environments, to determine what proportion of each community they make up and how...

  5. Genomic variation landscape of the human gut microbiome

    DEFF Research Database (Denmark)

    Schloissnig, Siegfried; Arumugam, Manimozhiyan; Sunagawa, Shinichi

    2013-01-01

    Whereas large-scale efforts have rapidly advanced the understanding and practical impact of human genomic variation, the practical impact of variation is largely unexplored in the human microbiome. We therefore developed a framework for metagenomic variation analysis and applied it to 252 faecal...... metagenomes of 207 individuals from Europe and North America. Using 7.4 billion reads aligned to 101 reference species, we detected 10.3 million single nucleotide polymorphisms (SNPs), 107,991 short insertions/deletions, and 1,051 structural variants. The average ratio of non-synonymous to synonymous...... polymorphism rates of 0.11 was more variable between gut microbial species than across human hosts. Subjects sampled at varying time intervals exhibited individuality and temporal stability of SNP variation patterns, despite considerable composition changes of their gut microbiota. This indicates...

  6. Genome-Wide Association Study on Male Genital Shape and Size in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Baku Takahara

    Full Text Available Male genital morphology of animals with internal fertilization and promiscuous mating systems have been one of the most diverse and rapidly evolving morphological traits. The male genital morphology in general is known to have low phenotypic and genetic variations, but the genetic basis of the male genital variation remains unclear. Drosophila melanogaster and its closely related species are morphologically very similar, but the shapes of the posterior lobe, a cuticular projection on the male genital arch are distinct from each other, representing a model system for studying the genetic basis of male genital morphology. In this study, we used highly inbred whole genome sequenced strains of D. melanogaster to perform genome wide association analysis on posterior lobe morphology. We quantified the outline shape of posterior lobes with Fourier coefficients obtained from elliptic Fourier analysis and performed principal component analysis, and posterior lobe size. The first and second principal components (PC1 and PC2 explained approximately 88% of the total variation of the posterior lobe shape. We then examined the association between the principal component scores and posterior lobe size and 1902142 single nucleotide polymorphisms (SNPs. As a result, we obtained 15, 14 and 15 SNPs for PC1, PC2 and posterior lobe size with P-values smaller than 10(-5. Based on the location of the SNPs, 13, 13 and six protein coding genes were identified as potential candidates for PC1, PC2 and posterior lobe size, respectively. In addition to the previous findings showing that the intraspecific posterior shape variation are regulated by multiple QTL with strong effects, the present study suggests that the intraspecific variation may be under polygenic regulation with a number of loci with small effects. Further studies are required for investigating whether these candidate genes are responsible for the intraspecific posterior lobe shape variation.

  7. Detecting microsatellites within genomes: significant variation among algorithms

    Directory of Open Access Journals (Sweden)

    Rivals Eric

    2007-04-01

    Full Text Available Abstract Background Microsatellites are short, tandemly-repeated DNA sequences which are widely distributed among genomes. Their structure, role and evolution can be analyzed based on exhaustive extraction from sequenced genomes. Several dedicated algorithms have been developed for this purpose. Here, we compared the detection efficiency of five of them (TRF, Mreps, Sputnik, STAR, and RepeatMasker. Results Our analysis was first conducted on the human X chromosome, and microsatellite distributions were characterized by microsatellite number, length, and divergence from a pure motif. The algorithms work with user-defined parameters, and we demonstrate that the parameter values chosen can strongly influence microsatellite distributions. The five algorithms were then compared by fixing parameters settings, and the analysis was extended to three other genomes (Saccharomyces cerevisiae, Neurospora crassa and Drosophila melanogaster spanning a wide range of size and structure. Significant differences for all characteristics of microsatellites were observed among algorithms, but not among genomes, for both perfect and imperfect microsatellites. Striking differences were detected for short microsatellites (below 20 bp, regardless of motif. Conclusion Since the algorithm used strongly influences empirical distributions, studies analyzing microsatellite evolution based on a comparison between empirical and theoretical size distributions should therefore be considered with caution. We also discuss why a typological definition of microsatellites limits our capacity to capture their genomic distributions.

  8. Genome-Wide Associations of Gene Expression Variation in Humans.

    Directory of Open Access Journals (Sweden)

    2005-12-01

    Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis- to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.

  9. Genome-wide associations of gene expression variation in humans.

    Directory of Open Access Journals (Sweden)

    Barbara E Stranger

    2005-12-01

    Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis- to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.

  10. Ultra Deep Sequencing of a Baculovirus Population Reveals Widespread Genomic Variations

    Directory of Open Access Journals (Sweden)

    Aurélien Chateigner

    2015-07-01

    Full Text Available Viruses rely on widespread genetic variation and large population size for adaptation. Large DNA virus populations are thought to harbor little variation though natural populations may be polymorphic. To measure the genetic variation present in a dsDNA virus population, we deep sequenced a natural strain of the baculovirus Autographa californica multiple nucleopolyhedrovirus. With 124,221X average genome coverage of our 133,926 bp long consensus, we could detect low frequency mutations (0.025%. K-means clustering was used to classify the mutations in four categories according to their frequency in the population. We found 60 high frequency non-synonymous mutations under balancing selection distributed in all functional classes. These mutants could alter viral adaptation dynamics, either through competitive or synergistic processes. Lastly, we developed a technique for the delimitation of large deletions in next generation sequencing data. We found that large deletions occur along the entire viral genome, with hotspots located in homologous repeat regions (hrs. Present in 25.4% of the genomes, these deletion mutants presumably require functional complementation to complete their infection cycle. They might thus have a large impact on the fitness of the baculovirus population. Altogether, we found a wide breadth of genomic variation in the baculovirus population, suggesting it has high adaptive potential.

  11. Variations in serving sizes of Australian snack foods and confectionery.

    Science.gov (United States)

    Watson, Wendy L; Kury, Alexandra; Wellard, Lyndal; Hughes, Clare; Dunford, Elizabeth; Chapman, Kathy

    2016-01-01

    This study examined the serving size and energy content per serving of Australian packaged snack foods and confectionery products. Nutrition Information Panel data for 23 sub-categories of packaged snack foods (n = 3481) were extracted from The George Institute for Global Health's 2013 branded food composition database. Variations in serving size and energy content per serving were examined. Energy contents per serving were compared to recommendations in the Australian Dietary Guidelines. Serving sizes varied within and between snack food categories. Mean energy content per serving varied from 320 kJ to 899 kJ. More energy per serving than the recommended 600 kJ was displayed by 22% (n = 539) of snack foods classified in the Australian Dietary Guidelines as discretionary foods. The recommendation for energy content per serving was exceeded in 60% (n = 635) of snack foods from the Five Food Groups. Only 37% (n = 377) of confectionery products displayed the industry-agreed serving size of 25 g. Energy content per serving of many packaged snack foods do not align with the Australian Dietary Guidelines and the industry agreed serving size has not been taken up widely within the confectionery category. Given the inconsistencies in serving sizes, featuring serving size in front-of-pack information may hinder the objective of a clear and simple nutrition message. Messaging to help consumers make healthier choices should consider the variation in serving sizes on packaged snack foods. Copyright © 2015 Elsevier Ltd. All rights reserved.

  12. Genome Size Diversity in Lilium (Liliaceae Is Correlated with Karyotype and Environmental Traits

    Directory of Open Access Journals (Sweden)

    Yun-peng Du

    2017-07-01

    Full Text Available Genome size (GS diversity is of fundamental biological importance. The occurrence of giant genomes in angiosperms is restricted to just a few lineages in the analyzed genome size of plant species so far. It is still an open question whether GS diversity is shaped by neutral or natural selection. The genus Lilium, with giant genomes, is phylogenetically and horticulturally important and is distributed throughout the northern hemisphere. GS diversity in Lilium and the underlying evolutionary mechanisms are poorly understood. We performed a comprehensive study involving phylogenetically independent analysis on 71 species to explore the diversity and evolution of GS and its correlation with karyological and environmental traits within Lilium (including Nomocharis. The strong phylogenetic signal detected for GS in the genus provides evidence consistent with that the repetitive DNA may be the primary contributors to the GS diversity, while the significant positive relationships detected between GS and the haploid chromosome length (HCL provide insights into patterns of genome evolution. The relationships between GS and karyotypes indicate that ancestral karyotypes of Lilium are likely to have exhibited small genomes, low diversity in centromeric index (CVCI values and relatively high relative variation in chromosome length (CVCL values. Significant relationships identified between GS and annual temperature and between GS and annual precipitation suggest that adaptation to habitat strongly influences GS diversity. We conclude that GS in Lilium is shaped by both neutral (genetic drift and adaptive evolution. These findings will have important consequences for understanding the evolution of giant plant genomes, and exploring the role of repetitive DNA fraction and chromosome changes in a plant group with large genomes and conservation of chromosome number.

  13. Flow cytometric analysis using SYBR Green I for genome size estimation in coffee.

    Science.gov (United States)

    Ronildo Clarindo, Wellington; Roberto Carvalho, Carlos

    2011-02-01

    Plant genome size has been measured by flow cytometry using propidium iodide as a dye for nuclear DNA staining. However, some authors have reported the occurrence of genome size estimation errors, especially in plants rich in secondary metabolites, such as the coffee tree. In this context, we tested an alternative cytometric protocol using the SYBR Green I as a fluorochrome for stoichiometrically staining nuclear double-stranded DNA in Coffea canephora (2x) and Coffea arabica (4x). The results showed that the respective mean genome size measured from nuclei stained with SYBR Green I and propidium iodide was statistically identical. However, the G(0)/G(1) peaks of nuclei stained with SYBR Green I exhibited lower coefficient variations (1.57-2.85%) compared to those stained with propidium iodide (2.75-4.80%). Coefficient variation statistical data suggest that SYBR Green I is adequate for stoichiometric nuclei staining using this methodology. Our results provide evidence that SYBR Green I can be used in flow cytometry measurements of plants, with the advantages of minimizing errors in nuclear DNA content quantification, staining relatively quicker, with high affinity, and being less mutagenic than propidium iodide.

  14. Transcriptome and genome size analysis of the Venus flytrap.

    Science.gov (United States)

    Jensen, Michael Krogh; Vogt, Josef Korbinian; Bressendorff, Simon; Seguin-Orlando, Andaine; Petersen, Morten; Sicheritz-Pontén, Thomas; Mundy, John

    2015-01-01

    The insectivorous Venus flytrap (Dionaea muscipula) is renowned from Darwin's studies of plant carnivory and the origins of species. To provide tools to analyze the evolution and functional genomics of D. muscipula, we sequenced a normalized cDNA library synthesized from mRNA isolated from D. muscipula flowers and traps. Using the Oases transcriptome assembler 79,165,657 quality trimmed reads were assembled into 80,806 cDNA contigs, with an average length of 679 bp and an N50 length of 1,051 bp. A total of 17,047 unique proteins were identified, and assigned to Gene Ontology (GO) and classified into functional categories. A total of 15,547 full-length cDNA sequences were identified, from which open reading frames were detected in 10,941. Comparative GO analyses revealed that D. muscipula is highly represented in molecular functions related to catalytic, antioxidant, and electron carrier activities. Also, using a single copy sequence PCR-based method, we estimated that the genome size of D. muscipula is approx. 3 Gb. Our genome size estimate and transcriptome analyses will contribute to future research on this fascinating, monotypic species and its heterotrophic adaptations.

  15. Transcriptome and genome size analysis of the Venus flytrap.

    Directory of Open Access Journals (Sweden)

    Michael Krogh Jensen

    Full Text Available The insectivorous Venus flytrap (Dionaea muscipula is renowned from Darwin's studies of plant carnivory and the origins of species. To provide tools to analyze the evolution and functional genomics of D. muscipula, we sequenced a normalized cDNA library synthesized from mRNA isolated from D. muscipula flowers and traps. Using the Oases transcriptome assembler 79,165,657 quality trimmed reads were assembled into 80,806 cDNA contigs, with an average length of 679 bp and an N50 length of 1,051 bp. A total of 17,047 unique proteins were identified, and assigned to Gene Ontology (GO and classified into functional categories. A total of 15,547 full-length cDNA sequences were identified, from which open reading frames were detected in 10,941. Comparative GO analyses revealed that D. muscipula is highly represented in molecular functions related to catalytic, antioxidant, and electron carrier activities. Also, using a single copy sequence PCR-based method, we estimated that the genome size of D. muscipula is approx. 3 Gb. Our genome size estimate and transcriptome analyses will contribute to future research on this fascinating, monotypic species and its heterotrophic adaptations.

  16. Genetic integration of molar cusp size variation in baboons

    Science.gov (United States)

    Koh, Christina; Bates, Elizabeth; Broughton, Elizabeth; Do, Nicholas T.; Fletcher, Zachary; Mahaney, Michael C.; Hlusko, Leslea J.

    2010-01-01

    Many studies of primate diversity and evolution rely on dental morphology for insight into diet, behavior, and phylogenetic relationships. Consequently, variation in molar cusp size has increasingly become a phenotype of interest. In 2007 we published a quantitative genetic analysis of mandibular molar cusp size variation in baboons. Those results provided more questions than answers, as the pattern of genetic integration did not fit predictions from odontogenesis. To follow up, we expanded our study to include data from the maxillary molar cusps. Here we report on these later analyses, as well as inter-arch comparisons with the mandibular data. We analyzed variation in two-dimensional maxillary molar cusp size using data collected from a captive pedigreed breeding colony of baboons, Papio hamadryas, housed at the Southwest National Primate Research Center. These analyses show that variation in maxillary molar cusp size is heritable and sexually dimorphic. We also estimated additive genetic correlations between cusps on the same crown, homologous cusps along the tooth row, and maxillary and mandibular cusps. The pattern for maxillary molars yields genetic correlations of one between the paracone-metacone and protocone-hypocone. Bivariate analyses of cuspal homologues on adjacent teeth yield correlations that are high or not significantly different from one. Between dental arcades, the non-occluding cusps consistently yield high genetic correlations, especially the metaconid-paracone and metaconid-metacone. This pattern of genetic correlation does not immediately accord with the pattern of development and/or calcification, however these results do follow predictions that can be made from the evolutionary history of the tribosphenic molar. PMID:20034010

  17. Size is not everything: rates of genome size evolution, not C-value, correlate with speciation in angiosperms.

    Science.gov (United States)

    Puttick, Mark N; Clark, James; Donoghue, Philip C J

    2015-12-07

    Angiosperms represent one of the key examples of evolutionary success, and their diversity dwarfs other land plants; this success has been linked, in part, to genome size and phenomena such as whole genome duplication events. However, while angiosperms exhibit a remarkable breadth of genome size, evidence linking overall genome size to diversity is equivocal, at best. Here, we show that the rates of speciation and genome size evolution are tightly correlated across land plants, and angiosperms show the highest rates for both, whereas very slow rates are seen in their comparatively species-poor sister group, the gymnosperms. No evidence is found linking overall genome size and rates of speciation. Within angiosperms, both the monocots and eudicots show the highest rates of speciation and genome size evolution, and these data suggest a potential explanation for the megadiversity of angiosperms. It is difficult to associate high rates of diversification with different types of polyploidy, but it is likely that high rates of evolution correlate with a smaller genome size after genome duplications. The diversity of angiosperms may, in part, be due to an ability to increase evolvability by benefiting from whole genome duplications, transposable elements and general genome plasticity. © 2015 The Authors.

  18. Natural variation in genome architecture among 205 Drosophila melanogaster Genetic Reference Panel lines.

    Science.gov (United States)

    Huang, Wen; Massouras, Andreas; Inoue, Yutaka; Peiffer, Jason; Ràmia, Miquel; Tarone, Aaron M; Turlapati, Lavanya; Zichner, Thomas; Zhu, Dianhui; Lyman, Richard F; Magwire, Michael M; Blankenburg, Kerstin; Carbone, Mary Anna; Chang, Kyle; Ellis, Lisa L; Fernandez, Sonia; Han, Yi; Highnam, Gareth; Hjelmen, Carl E; Jack, John R; Javaid, Mehwish; Jayaseelan, Joy; Kalra, Divya; Lee, Sandy; Lewis, Lora; Munidasa, Mala; Ongeri, Fiona; Patel, Shohba; Perales, Lora; Perez, Agapito; Pu, LingLing; Rollmann, Stephanie M; Ruth, Robert; Saada, Nehad; Warner, Crystal; Williams, Aneisa; Wu, Yuan-Qing; Yamamoto, Akihiko; Zhang, Yiqing; Zhu, Yiming; Anholt, Robert R H; Korbel, Jan O; Mittelman, David; Muzny, Donna M; Gibbs, Richard A; Barbadilla, Antonio; Johnston, J Spencer; Stone, Eric A; Richards, Stephen; Deplancke, Bart; Mackay, Trudy F C

    2014-07-01

    The Drosophila melanogaster Genetic Reference Panel (DGRP) is a community resource of 205 sequenced inbred lines, derived to improve our understanding of the effects of naturally occurring genetic variation on molecular and organismal phenotypes. We used an integrated genotyping strategy to identify 4,853,802 single nucleotide polymorphisms (SNPs) and 1,296,080 non-SNP variants. Our molecular population genomic analyses show higher deletion than insertion mutation rates and stronger purifying selection on deletions. Weaker selection on insertions than deletions is consistent with our observed distribution of genome size determined by flow cytometry, which is skewed toward larger genomes. Insertion/deletion and single nucleotide polymorphisms are positively correlated with each other and with local recombination, suggesting that their nonrandom distributions are due to hitchhiking and background selection. Our cytogenetic analysis identified 16 polymorphic inversions in the DGRP. Common inverted and standard karyotypes are genetically divergent and account for most of the variation in relatedness among the DGRP lines. Intriguingly, variation in genome size and many quantitative traits are significantly associated with inversions. Approximately 50% of the DGRP lines are infected with Wolbachia, and four lines have germline insertions of Wolbachia sequences, but effects of Wolbachia infection on quantitative traits are rarely significant. The DGRP complements ongoing efforts to functionally annotate the Drosophila genome. Indeed, 15% of all D. melanogaster genes segregate for potentially damaged proteins in the DGRP, and genome-wide analyses of quantitative traits identify novel candidate genes. The DGRP lines, sequence data, genotypes, quality scores, phenotypes, and analysis and visualization tools are publicly available.

  19. Coconut genome size determined by flow cytometry: Tall versus Dwarf types.

    Science.gov (United States)

    Freitas Neto, M; Pereira, T N S; Geronimo, I G C; Azevedo, A O N; Ramos, S R R; Pereira, M G

    2016-02-11

    Coconuts (Cocos nucifera L.) are tropical palm trees that are classified into Tall and Dwarf types based on height, and both types are diploid (2n = 2x = 32 chromosomes). The reproduction mode is autogamous for Dwarf types and allogamous for Tall types. One hypothesis for the origin of the Dwarf coconut suggests that it is a Tall variant that resulted from either mutation or inbreeding, and differences in genome size between the two types would support this hypothesis. In this study, we estimated the genome sizes of 14 coconut accessions (eight Tall and six Dwarf types) using flow cytometry. Nuclei were extracted from leaf discs and stained with propidium iodide, and Pisum sativum (2C = 9.07 pg DNA) was used as an internal standard. Histograms with good resolution and low coefficients of variation (2.5 to 3.2%) were obtained. The 2C DNA content ranged from 5.72 to 5.48 pg for Tall accessions and from 5.58 to 5.52 pg for Dwarf accessions. The mean genome sizes for Tall and Dwarf specimens were 5.59 and 5.55 pg, respectively. Among all accessions, Rennel Island Tall had the highest mean DNA content (5.72 pg), whereas West African Tall had the lowest (5.48 pg). The mean coconut genome size (2C = 5.57 pg, corresponding to 2723.73 Mbp/haploid set) was classified as small. Only small differences in genome size existed among the coconut accessions, suggesting that the Dwarf type did not evolve from the Tall type.

  20. Discrepancy variation of dinucleotide microsatellite repeats in eukaryotic genomes.

    Science.gov (United States)

    Gao, Huan; Cai, Shengli; Yan, Binlun; Chen, Baiyao; Yu, Fei

    2009-01-01

    To address whether there are differences of variation among repeat motif types and among taxonomic groups, we present here an analysis of variation and correlation of dinucleotide microsatellite repeats in eukaryotic genomes. Ten taxonomic groups were compared, those being primates, mammalia (excluding primates and rodentia), rodentia, birds, fish, amphibians and reptiles, insects, molluscs, plants and fungi, respectively. The data used in the analysis is from the literature published in the Journal of Molecular Ecology Notes. Analysis of variation reveals that there are no significant differences between AC and AG repeat motif types. Moreover, the number of alleles correlates positively with the copy number in both AG and AC repeats. Similar conclusions can be obtained from each taxonomic group. These results strongly suggest that the increase of SSR variation is almost linear with the increase of the copy number of each repeat motif. As well, the results suggest that the variability of SSR in the genomes of low-ranking species seem to be more than that of high-ranking species, excluding primates and fungi.

  1. Genomic Variation in Natural Populations of Drosophila melanogaster

    Science.gov (United States)

    Langley, Charles H.; Stevens, Kristian; Cardeno, Charis; Lee, Yuh Chwen G.; Schrider, Daniel R.; Pool, John E.; Langley, Sasha A.; Suarez, Charlyn; Corbett-Detig, Russell B.; Kolaczkowski, Bryan; Fang, Shu; Nista, Phillip M.; Holloway, Alisha K.; Kern, Andrew D.; Dewey, Colin N.; Song, Yun S.; Hahn, Matthew W.; Begun, David J.

    2012-01-01

    This report of independent genome sequences of two natural populations of Drosophila melanogaster (37 from North America and 6 from Africa) provides unique insight into forces shaping genomic polymorphism and divergence. Evidence of interactions between natural selection and genetic linkage is abundant not only in centromere- and telomere-proximal regions, but also throughout the euchromatic arms. Linkage disequilibrium, which decays within 1 kbp, exhibits a strong bias toward coupling of the more frequent alleles and provides a high-resolution map of recombination rate. The juxtaposition of population genetics statistics in small genomic windows with gene structures and chromatin states yields a rich, high-resolution annotation, including the following: (1) 5′- and 3′-UTRs are enriched for regions of reduced polymorphism relative to lineage-specific divergence; (2) exons overlap with windows of excess relative polymorphism; (3) epigenetic marks associated with active transcription initiation sites overlap with regions of reduced relative polymorphism and relatively reduced estimates of the rate of recombination; (4) the rate of adaptive nonsynonymous fixation increases with the rate of crossing over per base pair; and (5) both duplications and deletions are enriched near origins of replication and their density correlates negatively with the rate of crossing over. Available demographic models of X and autosome descent cannot account for the increased divergence on the X and loss of diversity associated with the out-of-Africa migration. Comparison of the variation among these genomes to variation among genomes from D. simulans suggests that many targets of directional selection are shared between these species. PMID:22673804

  2. Potential Value of Genomic Copy Number Variations in Schizophrenia

    Directory of Open Access Journals (Sweden)

    Chuanjun Zhuo

    2017-06-01

    Full Text Available Schizophrenia is a devastating neuropsychiatric disorder affecting approximately 1% of the global population, and the disease has imposed a considerable burden on families and society. Although, the exact cause of schizophrenia remains unknown, several lines of scientific evidence have revealed that genetic variants are strongly correlated with the development and early onset of the disease. In fact, the heritability among patients suffering from schizophrenia is as high as 80%. Genomic copy number variations (CNVs are one of the main forms of genomic variations, ubiquitously occurring in the human genome. An increasing number of studies have shown that CNVs account for population diversity and genetically related diseases, including schizophrenia. The last decade has witnessed rapid advances in the development of novel genomic technologies, which have led to the identification of schizophrenia-associated CNVs, insight into the roles of the affected genes in their intervals in schizophrenia, and successful manipulation of the target CNVs. In this review, we focus on the recent discoveries of important CNVs that are associated with schizophrenia and outline the potential values that the study of CNVs will bring to the areas of schizophrenia research, diagnosis, and therapy. Furthermore, with the help of the novel genetic tool known as the Clustered Regularly Interspaced Short Palindromic Repeats-associated nuclease 9 (CRISPR/Cas9 system, the pathogenic CNVs as genomic defects could be corrected. In conclusion, the recent novel findings of schizophrenia-associated CNVs offer an exciting opportunity for schizophrenia research to decipher the pathological mechanisms underlying the onset and development of schizophrenia as well as to provide potential clinical applications in genetic counseling, diagnosis, and therapy for this complex mental disease.

  3. Intra-genomic variation in the ribosomal repeats of nematodes.

    Directory of Open Access Journals (Sweden)

    Holly M Bik

    Full Text Available Ribosomal loci represent a major tool for investigating environmental diversity and community structure via high-throughput marker gene studies of eukaryotes (e.g. 18S rRNA. Since the estimation of species' abundance is a major goal of environmental studies (by counting numbers of sequences, understanding the patterns of rRNA copy number across species will be critical for informing such high-throughput approaches. Such knowledge is critical, given that ribosomal RNA genes exist within multi-copy repeated arrays in a genome. Here we measured the repeat copy number for six nematode species by mapping the sequences from whole genome shotgun libraries against reference sequences for their rRNA repeat. This revealed a 6-fold variation in repeat copy number amongst taxa investigated, with levels of intragenomic variation ranging from 56 to 323 copies of the rRNA array. By applying the same approach to four C. elegans mutation accumulation lines propagated by repeated bottlenecking for an average of ~400 generations, we find on average a 2-fold increase in repeat copy number (rate of increase in rRNA estimated at 0.0285-0.3414 copies per generation, suggesting that rRNA repeat copy number is subject to selection. Within each Caenorhabditis species, the majority of intragenomic variation found across the rRNA repeat was observed within gene regions (18S, 28S, 5.8S, suggesting that such intragenomic variation is not a product of selection for rRNA coding function. We find that the dramatic variation in repeat copy number among these six nematode genomes would limit the use of rRNA in estimates of organismal abundance. In addition, the unique pattern of variation within a single genome was uncorrelated with patterns of divergence between species, reflecting a strong signature of natural selection for rRNA function. A better understanding of the factors that control or affect copy number in these arrays, as well as their rates and patterns of evolution

  4. Genome size as a key to evolutionary complex aquatic plants: polyploidy and hybridization in Callitriche (Plantaginaceae.

    Directory of Open Access Journals (Sweden)

    Jan Prančl

    Full Text Available Despite their complex evolutionary histories, aquatic plants are highly underrepresented in contemporary biosystematic studies. Of them, the genus Callitriche is particularly interesting because of such evolutionary features as wide variation in chromosome numbers and pollination systems. However, taxonomic difficulties have prevented broader investigation of this genus. In this study we applied flow cytometry to Callitriche for the first time in order to gain an insight into evolutionary processes and genome size differentiation in the genus. Flow cytometry complemented by confirmation of chromosome counts was applied to an extensive dataset of 1077 Callitriche individuals from 495 localities in 11 European countries and the USA. Genome size was determined for 12 taxa. The results suggest that many important processes have interacted in the evolution of the genus, including polyploidization and hybridization. Incongruence between genome size and ploidy level, intraspecific variation in genome size, formation of autotriploid and hybridization between species with different pollination systems were also detected. Hybridization takes place particularly in the diploid-tetraploid complex C. cophocarpa-C. platycarpa, for which the triploid hybrids were frequently recorded in the area of co-occurrence of its parents. A hitherto unknown hybrid (probably C. hamulata × C. cophocarpa with a unique chromosome number was discovered in the Czech Republic. However, hybridization occurs very rarely among most of the studied species. The main ecological preferences were also compared among the taxa collected. Although Callitriche taxa often grow in mixed populations, the ecological preferences of individual species are distinctly different in some cases. Anyway, flow cytometry is a very efficient method for taxonomic delimitation, determination and investigation of Callitriche species, and is even able to distinguish homoploid taxa and identify introduced

  5. Reassessment of the Genome Size in Elaeis guineensis and Elaeis oleifera, and Its Interspecific Hybrid.

    Science.gov (United States)

    Camillo, Julceia; Leão, André P; Alves, Alexandre A; Formighieri, Eduardo F; Azevedo, Ana Ls; Nunes, Juliana D; de Capdeville, Guy; de A Mattos, Jean K; Souza, Manoel T

    2014-01-01

    Aiming at generating a comprehensive genomic database on Elaeis spp., our group is leading several R&D initiatives with Elaeis guineensis (African oil palm) and Elaeis oleifera (American oil palm), including the whole-genome sequencing of the last. Genome size estimates currently available for this genus are controversial, as they indicate that American oil palm genome is about half the size of the African oil palm genome and that the genome of the interspecific hybrid is bigger than both the parental species genomes. We estimated the genome size of three E. guineensis genotypes, five E. oleifera genotypes, and two interspecific hybrids genotypes. On average, the genome size of E. guineensis is 4.32 ± 0.173 pg, while that of E. oleifera is 4.43 ± 0.018 pg. This indicates that both genomes are similar in size, even though E. oleifera is in fact bigger. As expected, the hybrid genome size is around the average of the two genomes, 4.40 ± 0.016 pg. Additionally, we demonstrate that both species present around 38% of GC content. As our results contradict the currently available data on Elaeis spp. genome sizes, we propose that the actual genome size of the Elaeis species is around 4 pg and that American oil palm possesses a larger genome than African oil palm.

  6. Estimating variable effective population sizes from multiple genomes: a sequentially markov conditional sampling distribution approach.

    Science.gov (United States)

    Sheehan, Sara; Harris, Kelley; Song, Yun S

    2013-07-01

    Throughout history, the population size of modern humans has varied considerably due to changes in environment, culture, and technology. More accurate estimates of population size changes, and when they occurred, should provide a clearer picture of human colonization history and help remove confounding effects from natural selection inference. Demography influences the pattern of genetic variation in a population, and thus genomic data of multiple individuals sampled from one or more present-day populations contain valuable information about the past demographic history. Recently, Li and Durbin developed a coalescent-based hidden Markov model, called the pairwise sequentially Markovian coalescent (PSMC), for a pair of chromosomes (or one diploid individual) to estimate past population sizes. This is an efficient, useful approach, but its accuracy in the very recent past is hampered by the fact that, because of the small sample size, only few coalescence events occur in that period. Multiple genomes from the same population contain more information about the recent past, but are also more computationally challenging to study jointly in a coalescent framework. Here, we present a new coalescent-based method that can efficiently infer population size changes from multiple genomes, providing access to a new store of information about the recent past. Our work generalizes the recently developed sequentially Markov conditional sampling distribution framework, which provides an accurate approximation of the probability of observing a newly sampled haplotype given a set of previously sampled haplotypes. Simulation results demonstrate that we can accurately reconstruct the true population histories, with a significant improvement over the PSMC in the recent past. We apply our method, called diCal, to the genomes of multiple human individuals of European and African ancestry to obtain a detailed population size change history during recent times.

  7. Genomic variability in Mexican chicken population using Copy Number Variation

    Directory of Open Access Journals (Sweden)

    Erica Gorla

    2017-05-01

    Full Text Available Copy number variants (CNVs are polymorphisms which influence phenotypic variation and are an important source of genetic variability [1]. In Mexico the backyard poultry population is a unique widespread Creole chicken (Gallus gallus domesticus population, an undefined cross among different breeds brought to Mexico from Europe and under natural selection for almost 500 years [2-3]. The aim of this study was to investigate genomic variation in the Mexican chicken population using CNVs. A total of 256 DNA samples genotyped with Axiom® Genome-Wide Chicken Genotyping Array were used in the analyses. The individual CNV calling, based on log-R ratio and B-allele frequency values, was performed using the Hidden Markov Model (HMM of PennCNV software on the autosomes [4-5]. CNVs were summarized to CNV regions (CNVRs at a population level (i.e. overlapping CNVs, using BEDTools. The HMM detected a total of 1924 CNVs in the genome of 256 samples resulting, at population level, in 1216 CNV regions, of which 959 gains, 226 losses and 31 complex CNVRs (i.e. containing both losses and gains, covering a total of 47 Mb of sequence length corresponding to 5,12 % of the chicken galGal4 assembly autosome. A comparison among this study and 7 previous reports about CNVs in chicken was performed, finding that the 1,216 CNVRs detected in this study overlap with 617 regions (51% mapped by others studies.   This study allowed a deep insight into the structural variation in the genome of unselected Mexican chicken population, which up to now has not been never genetically characterized with SNP markers. Based on a cluster analysis (pvclust – R package on CNV markers the population, even if presenting extreme morphological variation, does not resulted divided in differentiated genetic subpopulations. Finally this study provides a CNV map based on the 600K SNP chip array jointly with a genome-wide gene copy number estimates in Mexican chicken population.

  8. Meta-basic estimates the size of druggable human genome.

    Science.gov (United States)

    Plewczynski, Dariusz; Rychlewski, Leszek

    2009-06-01

    We present here the estimation of the upper limit of the number of molecular targets in the human genome that represent an opportunity for further therapeutic treatment. We select around approximately 6300 human proteins that are similar to sequences of known protein targets collected from DrugBank database. Our bioinformatics study estimates the size of 'druggable' human genome to be around 20% of human proteome, i.e. the number of the possible protein targets for small-molecule drug design in medicinal chemistry. We do not take into account any toxicity prediction, the three-dimensional characteristics of the active site in the predicted 'druggable' protein families, or detailed chemical analysis of known inhibitors/drugs. Instead we rely on remote homology detection method Meta-BASIC, which is based on sequence and structural similarity. The prepared dataset of all predicted protein targets from human genome presents the unique opportunity for developing and benchmarking various in silico chemo/bio-informatics methods in the context of the virtual high throughput screening.

  9. Variation in genome organization of the plant pathogenic fungus Colletotrichum lindemuthianum.

    Science.gov (United States)

    O'Sullivan, D; Tosi, P; Creusot, F; Cooke, B M; Phan, T H; Dron, M; Langin, T

    1998-04-01

    The genome structure of Colletotrichum lindemuthianum in a set of diverse isolates was investigated using a combination of physical and molecular approaches. Flow cytometric measurement of genome size revealed significant variation between strains, with the smallest genome representing 59% of the largest. Southern-blot profiles of a cloned fungal telomere revealed a total chromosome number varying from 9 to 12. Chromosome separations using pulsed-field gel electrophoresis (PFGE) showed that these chromosomes belong to two distinct size classes: a variable number of small (< 2.5 Mb) polymorphic chromosomes and a set of unresolved chromosomes larger than 7 Mb. Two dispersed repeat elements were shown to cluster on distinct polymorphic minichromosomes. Single-copy flanking sequences from these repeat-containing clones specifically marked distinct small chromosomes. These markers were absent in some strains, indicating that part of the observed variability in genome organization may be explained by the presence or absence, in a given strain, of dispensable genomic regions and/or chromosomes.

  10. Regulatory hotspots in the malaria parasite genome dictate transcriptional variation.

    Directory of Open Access Journals (Sweden)

    Joseph M Gonzales

    2008-09-01

    Full Text Available The determinants of transcriptional regulation in malaria parasites remain elusive. The presence of a well-characterized gene expression cascade shared by different Plasmodium falciparum strains could imply that transcriptional regulation and its natural variation do not contribute significantly to the evolution of parasite drug resistance. To clarify the role of transcriptional variation as a source of stain-specific diversity in the most deadly malaria species and to find genetic loci that dictate variations in gene expression, we examined genome-wide expression level polymorphisms (ELPs in a genetic cross between phenotypically distinct parasite clones. Significant variation in gene expression is observed through direct co-hybridizations of RNA from different P. falciparum clones. Nearly 18% of genes were regulated by a significant expression quantitative trait locus. The genetic determinants of most of these ELPs resided in hotspots that are physically distant from their targets. The most prominent regulatory locus, influencing 269 transcripts, coincided with a Chromosome 5 amplification event carrying the drug resistance gene, pfmdr1, and 13 other genes. Drug selection pressure in the Dd2 parental clone lineage led not only to a copy number change in the pfmdr1 gene but also to an increased copy number of putative neighboring regulatory factors that, in turn, broadly influence the transcriptional network. Previously unrecognized transcriptional variation, controlled by polymorphic regulatory genes and possibly master regulators within large copy number variants, contributes to sweeping phenotypic evolution in drug-resistant malaria parasites.

  11. Genome-wide profiling of structural genomic variations in Korean HapMap individuals.

    Directory of Open Access Journals (Sweden)

    Joon Seol Bae

    Full Text Available BACKGROUND: Structural genomic variation study, along with microarray technology development has provided many genomic resources related with architecture of human genome, and led to the fact that human genome structure is a lot more complicated than previously thought. METHODOLOGY/PRINCIPAL FINDINGS: In the case of International HapMap Project, Epstein-Barr various immortalized cell lines were preferably used over blood in order to get a larger number of genomic DNA. However, genomic aberration stemming from immortalization process, biased representation of the donor tissue, and culture process may influence the accuracy of SNP genotypes. In order to identify chromosome aberrations including loss of heterozygosity (LOH, large-scale and small-scale copy number variations, we used Illumina HumanHap500 BeadChip (555,352 markers on Korean HapMap individuals (n = 90 to obtain Log R ratio and B allele frequency information, and then utilized the data with various programs including Illumina ChromoZone, cnvParition and PennCNV. As a result, we identified 28 LOHs (>3 mb and 35 large-scale CNVs (>1 mb, with 4 samples having completely duplicated chromosome. In addition, after checking the sample quality (standard deviation of log R ratio <0.30, we selected 79 samples and used both signal intensity and B allele frequency simultaneously for identification of small-scale CNVs (<1 mb to discover 4,989 small-scale CNVs. Identified CNVs in this study were successfully validated using visual examination of the genoplot images, overlapping analysis with previously reported CNVs in DGV, and quantitative PCR. CONCLUSION/SIGNIFICANCE: In this study, we describe the result of the identified chromosome aberrations in Korean HapMap individuals, and expect that these findings will provide more meaningful information on the human genome.

  12. Rare and common regulatory variation in population-scale sequenced human genomes.

    Directory of Open Access Journals (Sweden)

    Stephen B Montgomery

    2011-07-01

    Full Text Available Population-scale genome sequencing allows the characterization of functional effects of a broad spectrum of genetic variants underlying human phenotypic variation. Here, we investigate the influence of rare and common genetic variants on gene expression patterns, using variants identified from sequencing data from the 1000 genomes project in an African and European population sample and gene expression data from lymphoblastoid cell lines. We detect comparable numbers of expression quantitative trait loci (eQTLs when compared to genotypes obtained from HapMap 3, but as many as 80% of the top expression quantitative trait variants (eQTVs discovered from 1000 genomes data are novel. The properties of the newly discovered variants suggest that mapping common causal regulatory variants is challenging even with full resequencing data; however, we observe significant enrichment of regulatory effects in splice-site and nonsense variants. Using RNA sequencing data, we show that 46.2% of nonsynonymous variants are differentially expressed in at least one individual in our sample, creating widespread potential for interactions between functional protein-coding and regulatory variants. We also use allele-specific expression to identify putative rare causal regulatory variants. Furthermore, we demonstrate that outlier expression values can be due to rare variant effects, and we approximate the number of such effects harboured in an individual by effect size. Our results demonstrate that integration of genomic and RNA sequencing analyses allows for the joint assessment of genome sequence and genome function.

  13. Genomic Sequencing of Orientia tsutsugamushi Strain Karp, an Assembly Comparable to the Genome Size of the Strain Ikeda.

    Science.gov (United States)

    Liao, Hsiao-Mei; Chao, Chien-Chung; Lei, Haiyan; Li, Bingjie; Tsai, Shien; Hung, Guo-Chiuan; Ching, Wei-Mei; Lo, Shyh-Ching

    2016-08-18

    Orientia tsutsugamushi, an intracellular bacterium, belongs to the family Rickettsiaceae This study presents the draft genome sequence of strain Karp, with 2.0 Mb as the size of the completed genome. This nearly finished draft genome sequence was annotated with the RAST server and the contents compared to those of the other strains.

  14. Genomic Sequencing of Orientia tsutsugamushi Strain Karp, an Assembly Comparable to the Genome Size of the Strain Ikeda

    Science.gov (United States)

    Liao, Hsiao-Mei; Chao, Chien-Chung; Lei, Haiyan; Li, Bingjie; Tsai, Shien; Hung, Guo-Chiuan

    2016-01-01

    Orientia tsutsugamushi, an intracellular bacterium, belongs to the family Rickettsiaceae. This study presents the draft genome sequence of strain Karp, with 2.0 Mb as the size of the completed genome. This nearly finished draft genome sequence was annotated with the RAST server and the contents compared to those of the other strains. PMID:27540052

  15. Intrinsic Pixel Size Variation in an LSST Prototype Sensor

    CERN Document Server

    Baumer, Michael

    2015-01-01

    The ambitious science goals of the Large Synoptic Survey Telescope (LSST) have motivated a search for new and unexpected sources of systematic error in the LSST camera. Flat-field images are a rich source of data on sensor anomalies, although such effects are typically dwarfed by shot noise in a single flat field. After combining many ($\\sim 500$) such images into `ultraflats' to reduce the impact of shot noise, we perform photon transfer analysis on a pixel-by-pixel basis and observe no spatial structure in pixel linearity or gain at light levels of 100 ke$^-$ and below. At 125 ke$^-$, a columnar structure is observed in the gain map--we attribute this to a flux-dependent charge transfer inefficiency. We also probe small-scale variations in effective pixel size by analyzing pixel-neighbor correlations in ultraflat images, where we observe clear evidence of intrinsic variation in effective pixel size in an LSST prototype sensor near the $\\sim .3\\%$ level.

  16. Population-based resequencing of experimentally evolved populations reveals the genetic basis of body size variation in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Thomas L Turner

    2011-03-01

    Full Text Available Body size is a classic quantitative trait with evolutionarily significant variation within many species. Locating the alleles responsible for this variation would help understand the maintenance of variation in body size in particular, as well as quantitative traits in general. However, successful genome-wide association of genotype and phenotype may require very large sample sizes if alleles have low population frequencies or modest effects. As a complementary approach, we propose that population-based resequencing of experimentally evolved populations allows for considerable power to map functional variation. Here, we use this technique to investigate the genetic basis of natural variation in body size in Drosophila melanogaster. Significant differentiation of hundreds of loci in replicate selection populations supports the hypothesis that the genetic basis of body size variation is very polygenic in D. melanogaster. Significantly differentiated variants are limited to single genes at some loci, allowing precise hypotheses to be formed regarding causal polymorphisms, while other significant regions are large and contain many genes. By using significantly associated polymorphisms as a priori candidates in follow-up studies, these data are expected to provide considerable power to determine the genetic basis of natural variation in body size.

  17. Rapid detection of structural variation in a human genome using nanochannel-based genome mapping technology

    DEFF Research Database (Denmark)

    Cao, Hongzhi; Hastie, Alex R.; Cao, Dandan

    2014-01-01

    mutations; however, none of the current detection methods are comprehensive, and currently available methodologies are incapable of providing sufficient resolution and unambiguous information across complex regions in the human genome. To address these challenges, we applied a high-throughput, cost......BACKGROUND: Structural variants (SVs) are less common than single nucleotide polymorphisms and indels in the population, but collectively account for a significant fraction of genetic polymorphism and diseases. Base pair differences arising from SVs are on a much higher order (>100 fold) than point...... mapping technology as a comprehensive and cost-effective method for detecting structural variation and studying complex regions in the human genome, as well as deciphering viral integration into the host genome....

  18. First genome size estimations for some eudicot families and genera

    Directory of Open Access Journals (Sweden)

    Garcia, S.

    2010-12-01

    Full Text Available Genome size diversity in angiosperms varies roughly 2400-fold, although approximately 45% of angiosperm families lack a single genome size estimation, and therefore, this range could be enlarged. To contribute completing family and genera representation, DNA C-Values are here provided for 19 species from 16 eudicot families, including first values for 6 families, 14 genera and 17 species. The sample of species studied is very diverse, including herbs, weeds, vines, shrubs and trees. Data are discussed regarding previous genome size estimates of closely related species or genera, if any, their chromosome number, growth form or invasive behaviour. The present research contributes approximately 1.5% new values for previously unreported angiosperm families, being the current coverage around 55% of angiosperm families, according to the Plant DNA C-Values Database.

    La diversidad del tamaño del genoma en angiospermas es muy amplia, siendo el valor más elevado aproximadamente unas 2400 veces superior al más pequeño. Sin embargo, cerca del 45% de las familias no presentan ni una sola estimación, por lo que el rango real podría ser ampliado. Para contribuir a completar la representación de familias y géneros de angiospermas, este estudio contribuye con valores C para 19 especies de 16 familias de eudicoticotiledóneas, incluyendo los primeros valores para 6 familias, 14 géneros y 17 especies. La muestra estudiada es muy diversa, e incluye hierbas, malezas, enredaderas, arbustos y árboles. Se discuten los resultados en función de estimaciones previas del tamaño del genoma de especies o géneros estrechamente relacionados, del número de cromosomas, la forma de crecimiento o el comportamiento invasor de las especies analizadas. El presente estudio contribuye aproximadamente en un 1,5% de nuevos valores para familias de angiospermas no estudiadas previamente, de las que actualmente existe información para el 55%, según la base de datos

  19. Genome size diversity in angiosperms and its influence on gene space.

    Science.gov (United States)

    Dodsworth, Steven; Leitch, Andrew R; Leitch, Ilia J

    2015-12-01

    Genome size varies c. 2400-fold in angiosperms (flowering plants), although the range of genome size is skewed towards small genomes, with a mean genome size of 1C=5.7Gb. One of the most crucial factors governing genome size in angiosperms is the relative amount and activity of repetitive elements. Recently, there have been new insights into how these repeats, previously discarded as 'junk' DNA, can have a significant impact on gene space (i.e. the part of the genome comprising all the genes and gene-related DNA). Here we review these new findings and explore in what ways genome size itself plays a role in influencing how repeats impact genome dynamics and gene space, including gene expression. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.

  20. Increased brain size in mammals is associated with size variations in gene families with cell signalling, chemotaxis and immune-related functions.

    Science.gov (United States)

    Castillo-Morales, Atahualpa; Monzón-Sandoval, Jimena; Urrutia, Araxi O; Gutiérrez, Humberto

    2014-01-22

    Genomic determinants underlying increased encephalization across mammalian lineages are unknown. Whole genome comparisons have revealed large and frequent changes in the size of gene families, and it has been proposed that these variations could play a major role in shaping morphological and physiological differences among species. Using a genome-wide comparative approach, we examined changes in gene family size (GFS) and degree of encephalization in 39 fully sequenced mammalian species and found a significant over-representation of GFS variations in line with increased encephalization in mammals. We found that this relationship is not accounted for by known correlates of brain size such as maximum lifespan or body size and is not explained by phylogenetic relatedness. Genes involved in chemotaxis, immune regulation and cell signalling-related functions are significantly over-represented among those gene families most highly correlated with encephalization. Genes within these families are prominently expressed in the human brain, particularly the cortex, and organized in co-expression modules that display distinct temporal patterns of expression in the developing cortex. Our results suggest that changes in GFS associated with encephalization represent an evolutionary response to the specific functional requirements underlying increased brain size in mammals.

  1. PolyTB: A genomic variation map for Mycobacterium tuberculosis

    KAUST Repository

    Coll, Francesc

    2014-02-15

    Tuberculosis (TB) caused by Mycobacterium tuberculosis (Mtb) is the second major cause of death from an infectious disease worldwide. Recent advances in DNA sequencing are leading to the ability to generate whole genome information in clinical isolates of M. tuberculosis complex (MTBC). The identification of informative genetic variants such as phylogenetic markers and those associated with drug resistance or virulence will help barcode Mtb in the context of epidemiological, diagnostic and clinical studies. Mtb genomic datasets are increasingly available as raw sequences, which are potentially difficult and computer intensive to process, and compare across studies. Here we have processed the raw sequence data (>1500 isolates, eight studies) to compile a catalogue of SNPs (n = 74,039, 63% non-synonymous, 51.1% in more than one isolate, i.e. non-private), small indels (n = 4810) and larger structural variants (n = 800). We have developed the PolyTB web-based tool (http://pathogenseq.lshtm.ac.uk/polytb) to visualise the resulting variation and important meta-data (e.g. in silico inferred strain-types, location) within geographical map and phylogenetic views. This resource will allow researchers to identify polymorphisms within candidate genes of interest, as well as examine the genomic diversity and distribution of strains. PolyTB source code is freely available to researchers wishing to develop similar tools for their pathogen of interest. 2014 Elsevier Ltd. All rights reserved.

  2. Structural variation in the chicken genome identified by paired-end next-generation DNA sequencing of reduced representation libraries

    NARCIS (Netherlands)

    Kerstens, H.H.D.; Crooijmans, R.P.M.A.; Dibbits, B.W.; Vereijken, A.; Okimoto, R.; Groenen, M.A.M.

    2011-01-01

    Background Variation within individual genomes ranges from single nucleotide polymorphisms (SNPs) to kilobase, and even megabase, sized structural variants (SVs), such as deletions, insertions, inversions, and more complex rearrangements. Although much is known about the extent of SVs in humans and

  3. Integration of genomic approaches to uncover sources of variation in age at puberty and reproductive longevity in sows

    Science.gov (United States)

    Genetic variants associated with traits such as age at puberty and litter size could provide insight into the underlying genetic sources of variation impacting sow reproductive longevity and productivity. Genome wide characterization and gene expression profiling were employed using gilts from the U...

  4. Genome Sizes of Nine Insect Species Determined by Flow Cytometry and k-mer Analysis

    Science.gov (United States)

    He, Kang; Lin, Kejian; Wang, Guirong; Li, Fei

    2016-01-01

    The flow cytometry method was used to estimate the genome sizes of nine agriculturally important insects, including two coleopterans, five Hemipterans, and two hymenopterans. Among which, the coleopteran Lissorhoptrus oryzophilus (Kuschel) had the largest genome of 981 Mb. The average genome size was 504 Mb, suggesting that insects have a moderate-size genome. Compared with the insects in other orders, hymenopterans had small genomes, which were averagely about ~200 Mb. We found that the genome sizes of four insect species were different between male and female, showing the organismal complexity of insects. The largest difference occurred in the coconut leaf beetle Brontispa longissima (Gestro). The male coconut leaf beetle had a 111 Mb larger genome than females, which might be due to the chromosome number difference between the sexes. The results indicated that insect invasiveness was not related to genome size. We also determined the genome sizes of the small brown planthopper Laodelphax striatellus (Fallén) and the parasitic wasp Macrocentrus cingulum (Brischke) using k-mer analysis with Illunima Solexa sequencing data. There were slight differences in the results from the two methods. k-mer analysis indicated that the genome size of L. striatellus was 500–700 Mb and that of M. cingulum was ~150 Mb. In all, the genome sizes information presented here should be helpful for designing the genome sequencing strategy when necessary. PMID:27932995

  5. Transcriptome, methylome and genomic variations analysis of ectopic thyroid glands.

    Directory of Open Access Journals (Sweden)

    Rasha Abu-Khudir

    Full Text Available BACKGROUND: Congenital hypothyroidism from thyroid dysgenesis (CHTD is predominantly a sporadic disease characterized by defects in the differentiation, migration or growth of thyroid tissue. Of these defects, incomplete migration resulting in ectopic thyroid tissue is the most common (up to 80%. Germinal mutations in the thyroid-related transcription factors NKX2.1, FOXE1, PAX-8, and NKX2.5 have been identified in only 3% of patients with sporadic CHTD. Moreover, a survey of monozygotic twins yielded a discordance rate of 92%, suggesting that somatic events, genetic or epigenetic, probably play an important role in the etiology of CHTD. METHODOLOGY/PRINCIPAL FINDINGS: To assess the role of somatic genetic or epigenetic processes in CHTD, we analyzed gene expression, genome-wide methylation, and structural genome variations in normal versus ectopic thyroid tissue. In total, 1011 genes were more than two-fold induced or repressed. Expression array was validated by quantitative real-time RT-PCR for 100 genes. After correction for differences in thyroid activation state, 19 genes were exclusively associated with thyroid ectopy, among which genes involved in embryonic development (e.g. TXNIP and in the Wnt pathway (e.g. SFRP2 and FRZB were observed. None of the thyroid related transcription factors (FOXE1, HHEX, NKX2.1, NKX2.5 showed decreased expression, whereas PAX8 expression was associated with thyroid activation state. Finally, the expression profile was independent of promoter and CpG island methylation and of structural genome variations. CONCLUSIONS/SIGNIFICANCE: This is the first integrative molecular analysis of ectopic thyroid tissue. Ectopic thyroids show a differential gene expression compared to that of normal thyroids, although molecular basis could not be defined. Replication of this pilot study on a larger cohort could lead to unraveling the elusive cause of defective thyroid migration during embryogenesis.

  6. Litter size variation in Polish selected small dog breeds

    Directory of Open Access Journals (Sweden)

    Małgorzata Goleman

    2015-08-01

    Full Text Available In breeders’ general opinion small breed females produce less numerous litters. The aim of the study was to analyse the litter size and the frequency of the gender ratio in selected small dog breeds in view of their popularity in Poland. The data set comprised information on 639 litters (in total 2578 puppies of eight breeds, which were born between January 2003 and end December 2014. The results were statistically analysed using statistical program SPSS 20.0. Medium-size litters were observed in the analysed small dog breeds (4.034±0.1. Comparison of the selected breeds of the Fédération Cynologique Internationale (FCI Groups showed that the mean litter size in Group IX was higher (4.36±0.08 than that in Group III (3.87±0.14 and the differences were statistically significant. The study has confirmed the hypothesis that larger females produce more numerous litters, but there are large intra-individual variations in the number of pups born in individual breeds. Additionally, the gender ratio in the puppies born in the analysed breeds was equal, despite the fluctuations in the individual breeds.

  7. Karyotype and genome size in Euterpe Mart. (Arecaceae) species

    Science.gov (United States)

    Oliveira, Ludmila Cristina; de Oliveira, Maria do Socorro Padilha; Davide, Lisete Chamma; Torres, Giovana Augusta

    2016-01-01

    Abstract Euterpe (Martius, 1823), a genus from Central and South America, has species with high economic importance in Brazil, because of their palm heart and fruits, known as açaí berries. Breeding programs have been conducted to increase yield and establish cultivation systems to replace the extraction of wild material. These programs need basic information about the genome of these species to better explore the available genetic variability. The aim of this study was to compare Euterpe edulis (Martius, 1824), Euterpe oleracea (Martius, 1824) and Euterpe precatoria (Martius, 1842), with regard to karyotype, type of interphase nucleus and nuclear DNA amount. Metaphase chromosomes and interphase nuclei from root tip meristematic cells were obtained by the squashing technique and solid stained for microscope analysis. The DNA amount was estimated by flow cytometry. There were previous reports on the chromosome number of Euterpe edulis and Euterpe oleracea, but chromosome morphology of these two species and the whole karyotype of Euterpe precatoria are reported for the first time. The species have 2n=36, a number considered as a pleisomorphic feature in Arecoideae since the modern species, according to floral morphology, have the lowest chromosome number (2n=28 and 2n=30). The three Euterpe species also have the same type of interphase nuclei, classified as semi-reticulate. The species differed on karyotypic formulas, on localization of secondary constriction and genome size. The data suggest that the main forces driving Euterpe karyotype evolution were structural rearrangements, such as inversions and translocations that alter chromosome morphology, and either deletion or amplification that led to changes in chromosome size. PMID:27186334

  8. Effective Normalization for Copy Number Variation Detection from Whole Genome Sequencing

    NARCIS (Netherlands)

    Janevski, A.; Varadan, V.; Kamalakaran, S.; Banerjee, N.; Dimitrova, D.

    2012-01-01

    Background Whole genome sequencing enables a high resolution view ofthe human genome and provides unique insights into genome structureat an unprecedented scale. There have been a number of tools to infer copy number variation in the genome. These tools while validatedalso include a number of parame

  9. Queen Size Variation in the Ponerine Ant Ponera coarctata (Hymenoptera: Formicidae

    Directory of Open Access Journals (Sweden)

    J. Liebig

    1995-01-01

    Full Text Available Queens of Ponera coarctata show a pronounced variation in size as measured by ommatidia number and Weber's alitrunk length. Isometric size variation and the normal distribution of size categories indicate that, despite these differences, only one queen morph exists. Queen size varies less within colonies than between colonies, and thus appears to be colony specific. Ovary length apparently varies with queen size. Similar size variations as in queens also occured in males, but not in workers.

  10. The Genome of the Trinidadian Guppy, Poecilia reticulata, and Variation in the Guanapo Population

    Science.gov (United States)

    Künstner, Axel; Hoffmann, Margarete; Fraser, Bonnie A.; Kottler, Verena A.; Sharma, Eshita; Weigel, Detlef; Dreyer, Christine

    2016-01-01

    For over a century, the live bearing guppy, Poecilia reticulata, has been used to study sexual selection as well as local adaptation. Natural guppy populations differ in many traits that are of intuitively adaptive significance such as ornamentation, age at maturity, brood size and body shape. Water depth, light supply, food resources and predation regime shape these traits, and barrier waterfalls often separate contrasting environments in the same river. We have assembled and annotated the genome of an inbred single female from a high-predation site in the Guanapo drainage. The final assembly comprises 731.6 Mb with a scaffold N50 of 5.3 MB. Scaffolds were mapped to linkage groups, placing 95% of the genome assembly on the 22 autosomes and the X-chromosome. To investigate genetic variation in the population used for the genome assembly, we sequenced 10 wild caught male individuals. The identified 5 million SNPs correspond to an average nucleotide diversity (π) of 0.0025. The genome assembly and SNP map provide a rich resource for investigating adaptation to different predation regimes. In addition, comparisons with the genomes of other Poeciliid species, which differ greatly in mechanisms of sex determination and maternal resource allocation, as well as comparisons to other teleost genera can begin to reveal how live bearing evolved in teleost fish. PMID:28033408

  11. Identification of genomic regions associated with phenotypic variation between dog breeds using selection mapping.

    Directory of Open Access Journals (Sweden)

    Amaury Vaysse

    2011-10-01

    Full Text Available The extraordinary phenotypic diversity of dog breeds has been sculpted by a unique population history accompanied by selection for novel and desirable traits. Here we perform a comprehensive analysis using multiple test statistics to identify regions under selection in 509 dogs from 46 diverse breeds using a newly developed high-density genotyping array consisting of >170,000 evenly spaced SNPs. We first identify 44 genomic regions exhibiting extreme differentiation across multiple breeds. Genetic variation in these regions correlates with variation in several phenotypic traits that vary between breeds, and we identify novel associations with both morphological and behavioral traits. We next scan the genome for signatures of selective sweeps in single breeds, characterized by long regions of reduced heterozygosity and fixation of extended haplotypes. These scans identify hundreds of regions, including 22 blocks of homozygosity longer than one megabase in certain breeds. Candidate selection loci are strongly enriched for developmental genes. We chose one highly differentiated region, associated with body size and ear morphology, and characterized it using high-throughput sequencing to provide a list of variants that may directly affect these traits. This study provides a catalogue of genomic regions showing extreme reduction in genetic variation or population differentiation in dogs, including many linked to phenotypic variation. The many blocks of reduced haplotype diversity observed across the genome in dog breeds are the result of both selection and genetic drift, but extended blocks of homozygosity on a megabase scale appear to be best explained by selection. Further elucidation of the variants under selection will help to uncover the genetic basis of complex traits and disease.

  12. Evolution of genome size and chromosome number in the carnivorous plant genus Genlisea (Lentibulariaceae), with a new estimate of the minimum genome size in angiosperms.

    Science.gov (United States)

    Fleischmann, Andreas; Michael, Todd P; Rivadavia, Fernando; Sousa, Aretuza; Wang, Wenqin; Temsch, Eva M; Greilhuber, Johann; Müller, Kai F; Heubl, Günther

    2014-12-01

    Some species of Genlisea possess ultrasmall nuclear genomes, the smallest known among angiosperms, and some have been found to have chromosomes of diminutive size, which may explain why chromosome numbers and karyotypes are not known for the majority of species of the genus. However, other members of the genus do not possess ultrasmall genomes, nor do most taxa studied in related genera of the family or order. This study therefore examined the evolution of genome sizes and chromosome numbers in Genlisea in a phylogenetic context. The correlations of genome size with chromosome number and size, with the phylogeny of the group and with growth forms and habitats were also examined. Nuclear genome sizes were measured from cultivated plant material for a comprehensive sampling of taxa, including nearly half of all species of Genlisea and representing all major lineages. Flow cytometric measurements were conducted in parallel in two laboratories in order to compare the consistency of different methods and controls. Chromosome counts were performed for the majority of taxa, comparing different staining techniques for the ultrasmall chromosomes. Genome sizes of 15 taxa of Genlisea are presented and interpreted in a phylogenetic context. A high degree of congruence was found between genome size distribution and the major phylogenetic lineages. Ultrasmall genomes with 1C values of <100 Mbp were almost exclusively found in a derived lineage of South American species. The ancestral haploid chromosome number was inferred to be n = 8. Chromosome numbers in Genlisea ranged from 2n = 2x = 16 to 2n = 4x = 32. Ascendant dysploid series (2n = 36, 38) are documented for three derived taxa. The different ploidy levels corresponded to the two subgenera, but were not directly correlated to differences in genome size; the three different karyotype ranges mirrored the different sections of the genus. The smallest known plant genomes were not found in G. margaretae, as previously reported

  13. Phenotypic consequences of polyploidy and genome size at the microevolutionary scale: a multivariate morphological approach.

    Science.gov (United States)

    Balao, Francisco; Herrera, Javier; Talavera, Salvador

    2011-10-01

    • Chromosomal duplications and increases in DNA amount have the potential to alter quantitative plant traits like flower number, plant stature or stomata size. This has been documented often across species, but information on whether such effects also occur within species (i.e. at the microevolutionary or population scale) is scarce. • We studied trait covariation associated with polyploidy and genome size (both monoploid and total) in 22 populations of Dianthus broteri s.l., a perennial herb with several cytotypes (2x, 4x, 6x and 12x) that do not coexist spatially. Principal component scores of organ size/number variations were assessed as correlates of polyploidy, and phylogenetic relatedness among populations was controlled using phylogenetic generalized least squares. • Polyploidy covaried with organ dimensions, causing multivariate characters to increase, remain unchanged, or decrease with DNA amount. Variations in monoploid DNA amount had detectable consequences on some phenotypic traits. According to the analyses, some traits would experience phenotypic selection, while others would not. • We show that polyploidy contributes to decouple variation among traits in D. broteri, and hypothesize that polyploids may experience an evolutionary advantage in this plant lineage, for example, if it helps to overcome the constraints imposed by trait integration.

  14. Genomic and gene variation in Mycoplasma hominis strains

    DEFF Research Database (Denmark)

    Christiansen, Gunna; Andersen, H; Birkelund, Svend

    1987-01-01

    DNAs from 14 strains of Mycoplasma hominis isolated from various habitats, including strain PG21, were analyzed for genomic heterogeneity. DNA-DNA filter hybridization values were from 51 to 91%. Restriction endonuclease digestion patterns, analyzed by agarose gel electrophoresis, revealed...... no identity or cluster formation between strains. Variation within M. hominis rRNA genes was analyzed by Southern hybridization of EcoRI-cleaved DNA hybridized with a cloned fragment of the rRNA gene from the mycoplasma strain PG50. Five of the M. hominis strains showed identical hybridization patterns....... These hybridization patterns were compared with those of 12 other mycoplasma species, which showed a much more complex band pattern. Cloned nonribosomal RNA gene fragments of M. hominis PG21 DNA were analyzed, and the fragments were used to demonstrate heterogeneity among the strains. A monoclonal antibody against...

  15. Identification of genomic indels and structural variations using split reads

    Directory of Open Access Journals (Sweden)

    Urban Alexander E

    2011-07-01

    Full Text Available Abstract Background Recent studies have demonstrated the genetic significance of insertions, deletions, and other more complex structural variants (SVs in the human population. With the development of the next-generation sequencing technologies, high-throughput surveys of SVs on the whole-genome level have become possible. Here we present split-read identification, calibrated (SRiC, a sequence-based method for SV detection. Results We start by mapping each read to the reference genome in standard fashion using gapped alignment. Then to identify SVs, we score each of the many initial mappings with an assessment strategy designed to take into account both sequencing and alignment errors (e.g. scoring more highly events gapped in the center of a read. All current SV calling methods have multilevel biases in their identifications due to both experimental and computational limitations (e.g. calling more deletions than insertions. A key aspect of our approach is that we calibrate all our calls against synthetic data sets generated from simulations of high-throughput sequencing (with realistic error models. This allows us to calculate sensitivity and the positive predictive value under different parameter-value scenarios and for different classes of events (e.g. long deletions vs. short insertions. We run our calculations on representative data from the 1000 Genomes Project. Coupling the observed numbers of events on chromosome 1 with the calibrations gleaned from the simulations (for different length events allows us to construct a relatively unbiased estimate for the total number of SVs in the human genome across a wide range of length scales. We estimate in particular that an individual genome contains ~670,000 indels/SVs. Conclusions Compared with the existing read-depth and read-pair approaches for SV identification, our method can pinpoint the exact breakpoints of SV events, reveal the actual sequence content of insertions, and cover the whole

  16. Variation in the fitness effects of mutations with population density and size in Escherichia coli.

    Directory of Open Access Journals (Sweden)

    Huansheng Cao

    Full Text Available The fitness effects of mutations are context specific and depend on both external (e.g., environment and internal (e.g., cellular stress, genetic background factors. The influence of population size and density on fitness effects are unknown, despite the central role population size plays in the supply and fixation of mutations. We addressed this issue by comparing the fitness of 92 Keio strains (Escherichia coli K12 single gene knockouts at comparatively high (1.2×10(7 CFUs/mL and low (2.5×10(2 CFUs/mL densities, which also differed in population size (high: 1.2×10(8; low: 1.25×10(3. Twenty-eight gene deletions (30% exhibited a fitness difference, ranging from 5 to 174% (median: 35%, between the high and low densities. Our analyses suggest this variation among gene deletions in fitness responses reflected in part both gene orientation and function, of the gene properties we examined (genomic position, length, orientation, and function. Although we could not determine the relative effects of population density and size, our results suggest fitness effects of mutations vary with these two factors, and this variation is gene-specific. Besides being a mechanism for density-dependent selection (r-K selection, the dependence of fitness effects on population density and size has implications for any population that varies in size over time, including populations undergoing evolutionary rescue, species invasions into novel habitats, and cancer progression and metastasis. Further, combined with recent advances in understanding the roles of other context-specific factors in the fitness effects of mutations, our results will help address theoretical and applied biological questions more realistically.

  17. Structural variation in the chicken genome identified by paired-end next-generation DNA sequencing of reduced representation libraries

    Directory of Open Access Journals (Sweden)

    Okimoto Ron

    2011-02-01

    Full Text Available Abstract Background Variation within individual genomes ranges from single nucleotide polymorphisms (SNPs to kilobase, and even megabase, sized structural variants (SVs, such as deletions, insertions, inversions, and more complex rearrangements. Although much is known about the extent of SVs in humans and mice, species in which they exert significant effects on phenotypes, very little is known about the extent of SVs in the 2.5-times smaller and less repetitive genome of the chicken. Results We identified hundreds of shared and divergent SVs in four commercial chicken lines relative to the reference chicken genome. The majority of SVs were found in intronic and intergenic regions, and we also found SVs in the coding regions. To identify the SVs, we combined high-throughput short read paired-end sequencing of genomic reduced representation libraries (RRLs of pooled samples from 25 individuals and computational mapping of DNA sequences from a reference genome. Conclusion We provide a first glimpse of the high abundance of small structural genomic variations in the chicken. Extrapolating our results, we estimate that there are thousands of rearrangements in the chicken genome, the majority of which are located in non-coding regions. We observed that structural variation contributes to genetic differentiation among current domesticated chicken breeds and the Red Jungle Fowl. We expect that, because of their high abundance, SVs might explain phenotypic differences and play a role in the evolution of the chicken genome. Finally, our study exemplifies an efficient and cost-effective approach for identifying structural variation in sequenced genomes.

  18. PLCL1 rs7595412 variation is not associated with hip bone size variation in postmenopausal Danish women

    Directory of Open Access Journals (Sweden)

    Karsdal Morten A

    2009-12-01

    Full Text Available Abstract Background Bone size (BS variation is under strong genetic control and plays an important role in determining bone strength and fracture risk. Recently, a genome-wide association study identified polymorphisms associated with hip BS variation in the PLCL1 (phospholipase c-like 1 locus. Carriers of the major A allele of the most significant polymorphism, rs7595412, have around 17% larger hip BS than non-carriers. We therefore hypothesized that this polymorphism may also influence postmenopausal complications. Methods The effects of rs7595412 on hip BS, bone mineral density (BMD, vertebral fractures, serum Crosslaps and osteocalcin levels were analyzed in 1,191 postmenopausal Danish women. Results This polymorphism had no influence on hip and spine BS as well as on femur and spine BMD. Women carrying at least one copy of the A allele had lower levels of serum osteocalcin as compared with those homozygous for the G allele (p = 0.03 whereas no effect on serum Crosslaps was detected. Furthermore, women homozygous for the A allele were more affected by vertebral fractures than those carrying at least one copy of the G allele (p = 0.04. Conclusions In postmenopausal women, our results suggest that the PLCL1 rs7595412 polymorphism has no obvious effect on hip BS or BMD but may be nominally associated with increased proportion of vertebral fracture and increased levels of osteocalcin.

  19. Sequencing of mitochondrial genomes of nine Aspergillus and Penicillium species identifies mobile introns and accessory genes as main sources of genome size variability

    Directory of Open Access Journals (Sweden)

    Joardar Vinita

    2012-12-01

    Full Text Available Abstract Background The genera Aspergillus and Penicillium include some of the most beneficial as well as the most harmful fungal species such as the penicillin-producer Penicillium chrysogenum and the human pathogen Aspergillus fumigatus, respectively. Their mitochondrial genomic sequences may hold vital clues into the mechanisms of their evolution, population genetics, and biology, yet only a handful of these genomes have been fully sequenced and annotated. Results Here we report the complete sequence and annotation of the mitochondrial genomes of six Aspergillus and three Penicillium species: A. fumigatus, A. clavatus, A. oryzae, A. flavus, Neosartorya fischeri (A. fischerianus, A. terreus, P. chrysogenum, P. marneffei, and Talaromyces stipitatus (P. stipitatum. The accompanying comparative analysis of these and related publicly available mitochondrial genomes reveals wide variation in size (25–36 Kb among these closely related fungi. The sources of genome expansion include group I introns and accessory genes encoding putative homing endonucleases, DNA and RNA polymerases (presumed to be of plasmid origin and hypothetical proteins. The two smallest sequenced genomes (A. terreus and P. chrysogenum do not contain introns in protein-coding genes, whereas the largest genome (T. stipitatus, contains a total of eleven introns. All of the sequenced genomes have a group I intron in the large ribosomal subunit RNA gene, suggesting that this intron is fixed in these species. Subsequent analysis of several A. fumigatus strains showed low intraspecies variation. This study also includes a phylogenetic analysis based on 14 concatenated core mitochondrial proteins. The phylogenetic tree has a different topology from published multilocus trees, highlighting the challenges still facing the Aspergillus systematics. Conclusions The study expands the genomic resources available to fungal biologists by providing mitochondrial genomes with consistent

  20. Antigen-presenting genes and genomic copy number variations in the Tasmanian devil MHC

    Directory of Open Access Journals (Sweden)

    Cheng Yuanyuan

    2012-03-01

    Full Text Available Abstract Background The Tasmanian devil (Sarcophilus harrisii is currently under threat of extinction due to an unusual fatal contagious cancer called Devil Facial Tumour Disease (DFTD. DFTD is caused by a clonal tumour cell line that is transmitted between unrelated individuals as an allograft without triggering immune rejection due to low levels of Major Histocompatibility Complex (MHC diversity in Tasmanian devils. Results Here we report the characterization of the genomic regions encompassing MHC Class I and Class II genes in the Tasmanian devil. Four genomic regions approximately 960 kb in length were assembled and annotated using BAC contigs and physically mapped to devil Chromosome 4q. 34 genes and pseudogenes were identified, including five Class I and four Class II loci. Interestingly, when two haplotypes from two individuals were compared, three genomic copy number variants with sizes ranging from 1.6 to 17 kb were observed within the classical Class I gene region. One deletion is particularly important as it turns a Class Ia gene into a pseudogene in one of the haplotypes. This deletion explains the previously observed variation in the Class I allelic number between individuals. The frequency of this deletion is highest in the northwestern devil population and lowest in southeastern areas. Conclusions The third sequenced marsupial MHC provides insights into the evolution of this dynamic genomic region among the diverse marsupial species. The two sequenced devil MHC haplotypes revealed three copy number variations that are likely to significantly affect immune response and suggest that future work should focus on the role of copy number variations in disease susceptibility in this species.

  1. Sample size for cluster randomized trials: effect of coefficient of variation of cluster size and analysis method.

    Science.gov (United States)

    Eldridge, Sandra M; Ashby, Deborah; Kerry, Sally

    2006-10-01

    Cluster randomized trials are increasingly popular. In many of these trials, cluster sizes are unequal. This can affect trial power, but standard sample size formulae for these trials ignore this. Previous studies addressing this issue have mostly focused on continuous outcomes or methods that are sometimes difficult to use in practice. We show how a simple formula can be used to judge the possible effect of unequal cluster sizes for various types of analyses and both continuous and binary outcomes. We explore the practical estimation of the coefficient of variation of cluster size required in this formula and demonstrate the formula's performance for a hypothetical but typical trial randomizing UK general practices. The simple formula provides a good estimate of sample size requirements for trials analysed using cluster-level analyses weighting by cluster size and a conservative estimate for other types of analyses. For trials randomizing UK general practices the coefficient of variation of cluster size depends on variation in practice list size, variation in incidence or prevalence of the medical condition under examination, and practice and patient recruitment strategies, and for many trials is expected to be approximately 0.65. Individual-level analyses can be noticeably more efficient than some cluster-level analyses in this context. When the coefficient of variation is <0.23, the effect of adjustment for variable cluster size on sample size is negligible. Most trials randomizing UK general practices and many other cluster randomized trials should account for variable cluster size in their sample size calculations.

  2. DESCARTES' RULE OF SIGNS AND THE IDENTIFIABILITY OF POPULATION DEMOGRAPHIC MODELS FROM GENOMIC VARIATION DATA.

    Science.gov (United States)

    Bhaskar, Anand; Song, Yun S

    2014-01-01

    The sample frequency spectrum (SFS) is a widely-used summary statistic of genomic variation in a sample of homologous DNA sequences. It provides a highly efficient dimensional reduction of large-scale population genomic data and its mathematical dependence on the underlying population demography is well understood, thus enabling the development of efficient inference algorithms. However, it has been recently shown that very different population demographies can actually generate the same SFS for arbitrarily large sample sizes. Although in principle this nonidentifiability issue poses a thorny challenge to statistical inference, the population size functions involved in the counterexamples are arguably not so biologically realistic. Here, we revisit this problem and examine the identifiability of demographic models under the restriction that the population sizes are piecewise-defined where each piece belongs to some family of biologically-motivated functions. Under this assumption, we prove that the expected SFS of a sample uniquely determines the underlying demographic model, provided that the sample is sufficiently large. We obtain a general bound on the sample size sufficient for identifiability; the bound depends on the number of pieces in the demographic model and also on the type of population size function in each piece. In the cases of piecewise-constant, piecewise-exponential and piecewise-generalized-exponential models, which are often assumed in population genomic inferences, we provide explicit formulas for the bounds as simple functions of the number of pieces. Lastly, we obtain analogous results for the "folded" SFS, which is often used when there is ambiguity as to which allelic type is ancestral. Our results are proved using a generalization of Descartes' rule of signs for polynomials to the Laplace transform of piecewise continuous functions.

  3. Testing the link between genome size and growth rate in maize

    Directory of Open Access Journals (Sweden)

    Maud I. Tenaillon

    2016-09-01

    Full Text Available Little is known about the factors driving within species Genome Size (GS variation. GS may be shaped indirectly by natural selection on development and adaptative traits. Because GS variation is particularly pronounced in maize, we have sampled 83 maize inbred lines from three well described genetic groups adapted to contrasted climate conditions: inbreds of tropical origin, Flint inbreds grown in temperate climates, and Dent inbreds distributed in the Corn Belt. As a proxy for growth rate, we measured the Leaf Elongation Rate maximum during nighttime (LERmax as well as GS in all inbred lines. In addition we combined available and new nucleotide polymorphism data at 29,090 sites to characterize the genetic structure of our panel. We found significant variation for both LERmax and GS among groups defined by our genetic structuring. Tropicals displayed larger GS than Flints while Dents exhibited intermediate values. LERmax followed the opposite trend with greater growth rate in Flints than in Tropicals. In other words, LERmax and GS exhibited a significantly negative correlation (r = − 0.27. However, this correlation was driven by among-group variation rather than within-group variation—it was no longer significant after controlling for structure and kinship among inbreds. Our results indicate that selection on GS may have accompanied ancient maize diffusion from its center of origin, with large DNA content excluded from temperate areas. Whether GS has been targeted by more intense selection during modern breeding within groups remains an open question.

  4. One size fits all? Direct evidence for the heterogeneity of genetic drift throughout the genome.

    Science.gov (United States)

    Jiménez-Mena, Belén; Tataru, Paula; Brøndum, Rasmus F; Sahana, Goutam; Guldbrandtsen, Bernt; Bataillon, Thomas

    2016-07-01

    Effective population size (Ne) is a central parameter in population and conservation genetics. It measures the magnitude of genetic drift, rates of accumulation of inbreeding in a population, and it conditions the efficacy of selection. It is often assumed that a single Ne can account for the evolution of genomes. However, recent work provides indirect evidence for heterogeneity in Ne throughout the genome. We study this by examining genome-wide diversity in the Danish Holstein cattle breed. Using the differences in allele frequencies over a single generation, we directly estimated Ne among autosomes and smaller windows within autosomes. We found statistically significant variation in Ne at both scales. However, no correlation was found between the detected regional variability in Ne, and proxies for the intensity of linked selection (local recombination rate, gene density), or the presence of either past strong selection or current artificial selection on traits of economic value. Our findings call for further caution regarding the wide applicability of the Ne concept for understanding quantitatively processes such as genetic drift and accumulation of consanguinity in both natural and managed populations.

  5. Whole-genome sequence variation, population structure and demographic history of the Dutch population

    NARCIS (Netherlands)

    Francioli, Laurent C.; Menelaou, Andronild; Pulit, Sara L.; Van Dijk, Freerk; Palamara, Pier Francesco; Elbers, Clara C.; Neerincx, Pieter B. T.; Ye, Kai; Guryev, Victor; Kloosterman, Wigard P.; Deelen, Patrick; Abdellaoui, Abdel; Van Leeuwen, Elisabeth M.; Van Oven, Mannis; Vermaat, Martijn; Li, Mingkun; Laros, Jeroen F. J.; Karssen, Lennart C.; Kanterakis, Alexandros; Amin, Najaf; Hottenga, Jouke Jan; Lameijer, Eric-Wubbo; Kattenberg, Mathijs; Dijkstra, Martijn; Byelas, Heorhiy; Van Settenl, Jessica; Van Schaik, Barbera D. C.; Bot, Jan; Nijman, Isaac J.; Renkens, Ivo; Marscha, Tobias; Schonhuth, Alexander; Hehir-Kwa, Jayne Y.; Handsaker, Robert E.; Polak, Paz; Sohail, Mashaal; Vuzman, Dana; Hormozdiari, Fereydoun; Van Enckevort, David; Mei, Hailiang; Koval, Vyacheslav; Moed, Ma-Tthijs H.; Van der Velde, K. Joeri; Rivadeneira, Fernando; Estrada, Karol; Medina-Gomez, Carolina; Isaacs, Aaron; McCarroll, Steven A.; Beekrnan, Marian; De Craen, Anton J. M.; Suchiman, H. Eka D.; Hofman, Albert; Oostra, Ben; Uitterlinden, Andre G.; Willemsen, Gonneke; Platteel, Mathieu; Veldink, Jan H.; Van den Berg, Leonard H.; Pitts, Steven J.; Potluri, Shobha; Sundar, Purnima; Cox, David R.; Sunyaev, Shamil R.; Den Dunnen, Johan T.; Stoneking, Mark; De Knijff, Peter; Kayser, Manfred; Li, Qibin; Li, Yingrui; Du, Yuanping; Chen, Ruoyan; Cao, Hongzhi; Li, Ning; Cao, Sujie; Wang, Jun; Bovenberg, Jasper A.; Peer, Itsik; Slagboom, P. Eline; Van Duijn, Cornelia M.; Boomsma, Dorret I.; Van Ommen, Gert-Jan B.; De Bakker, Paul I. W.; Swertz, Morris A.; Wijmenga, Cisca

    2014-01-01

    Whole-genome sequencing enables complete characterization of genetic variation, but geographic clustering of rare alleles demands many diverse populations be studied. Here we describe the Genome of the Netherlands (GoNL) Project, in which we sequenced the whole genomes of 250 Dutch parent-offspring

  6. Natural selection affects multiple aspects of genetic variation at putatively peutral sites across the human genome

    DEFF Research Database (Denmark)

    Lohmueller, Kirk E; Albrechtsen, Anders; Li, Yingrui

    2011-01-01

    A major question in evolutionary biology is how natural selection has shaped patterns of genetic variation across the human genome. Previous work has documented a reduction in genetic diversity in regions of the genome with low recombination rates. However, it is unclear whether other summaries...... affected multiple aspects of linked neutral variation throughout the human genome and that positive selection is not required to explain these observations....... these questions by analyzing three different genome-wide resequencing datasets from European individuals. We document several significant correlations between different genomic features. In particular, we find that average minor allele frequency and diversity are reduced in regions of low recombination...

  7. Copy Number Variation Analysis by Array Analysis of Single Cells Following Whole Genome Amplification.

    Science.gov (United States)

    Dimitriadou, Eftychia; Zamani Esteki, Masoud; Vermeesch, Joris Robert

    2015-01-01

    Whole genome amplification is required to ensure the availability of sufficient material for copy number variation analysis of a genome deriving from an individual cell. Here, we describe the protocols we use for copy number variation analysis of non-fixed single cells by array-based approaches following single-cell isolation and whole genome amplification. We are focusing on two alternative protocols, an isothermal and a PCR-based whole genome amplification method, followed by either comparative genome hybridization (aCGH) or SNP array analysis, respectively.

  8. Genome size in arthropods; different roles of phylogeny, habitat and life history in insects and crustaceans.

    Science.gov (United States)

    Alfsnes, Kristian; Leinaas, Hans Petter; Hessen, Dag Olav

    2017-08-01

    Despite the major role of genome size for physiology, ecology, and evolution, there is still mixed evidence with regard to proximate and ultimate drivers. The main causes of large genome size are proliferation of noncoding elements and/or duplication events. The relative role and interplay between these proximate causes and the evolutionary patterns shaped by phylogeny, life history traits or environment are largely unknown for the arthropods. Genome size shows a tremendous variability in this group, and it has a major impact on a range of fitness-related parameters such as growth, metabolism, life history traits, and for many species also body size. In this study, we compared genome size in two major arthropod groups, insects and crustaceans, and related this to phylogenetic patterns and parameters affecting ambient temperature (latitude, depth, or altitude), insect developmental mode, as well as crustacean body size and habitat, for species where data were available. For the insects, the genome size is clearly phylogeny-dependent, reflecting primarily their life history and mode of development, while for crustaceans there was a weaker association between genome size and phylogeny, suggesting life cycle strategies and habitat as more important determinants. Maximum observed latitude and depth, and their combined effect, showed positive, and possibly phylogenetic independent, correlations with genome size for crustaceans. This study illustrate the striking difference in genome sizes both between and within these two major groups of arthropods, and that while living in the cold with low developmental rates may promote large genomes in marine crustaceans, there is a multitude of proximate and ultimate drivers of genome size.

  9. Comprehensive long-span paired-end-tag mapping reveals characteristic patterns of structural variations in epithelial cancer genomes.

    Science.gov (United States)

    Hillmer, Axel M; Yao, Fei; Inaki, Koichiro; Lee, Wah Heng; Ariyaratne, Pramila N; Teo, Audrey S M; Woo, Xing Yi; Zhang, Zhenshui; Zhao, Hao; Ukil, Leena; Chen, Jieqi P; Zhu, Feng; So, Jimmy B Y; Salto-Tellez, Manuel; Poh, Wan Ting; Zawack, Kelson F B; Nagarajan, Niranjan; Gao, Song; Li, Guoliang; Kumar, Vikrant; Lim, Hui Ping J; Sia, Yee Yen; Chan, Chee Seng; Leong, See Ting; Neo, Say Chuan; Choi, Poh Sum D; Thoreau, Hervé; Tan, Patrick B O; Shahab, Atif; Ruan, Xiaoan; Bergh, Jonas; Hall, Per; Cacheux-Rataboul, Valère; Wei, Chia-Lin; Yeoh, Khay Guan; Sung, Wing-Kin; Bourque, Guillaume; Liu, Edison T; Ruan, Yijun

    2011-05-01

    Somatic genome rearrangements are thought to play important roles in cancer development. We optimized a long-span paired-end-tag (PET) sequencing approach using 10-Kb genomic DNA inserts to study human genome structural variations (SVs). The use of a 10-Kb insert size allows the identification of breakpoints within repetitive or homology-containing regions of a few kilobases in size and results in a higher physical coverage compared with small insert libraries with the same sequencing effort. We have applied this approach to comprehensively characterize the SVs of 15 cancer and two noncancer genomes and used a filtering approach to strongly enrich for somatic SVs in the cancer genomes. Our analyses revealed that most inversions, deletions, and insertions are germ-line SVs, whereas tandem duplications, unpaired inversions, interchromosomal translocations, and complex rearrangements are over-represented among somatic rearrangements in cancer genomes. We demonstrate that the quantitative and connective nature of DNA-PET data is precise in delineating the genealogy of complex rearrangement events, we observe signatures that are compatible with breakage-fusion-bridge cycles, and we discover that large duplications are among the initial rearrangements that trigger genome instability for extensive amplification in epithelial cancers.

  10. Temporal variation in genetic diversity and effective population size of Mediterranean and subalpine Arabidopsis thaliana populations.

    Science.gov (United States)

    Gomaa, Nasr H; Montesinos-Navarro, Alicia; Alonso-Blanco, Carlos; Picó, F Xavier

    2011-09-01

    Currently, there exists a limited knowledge on the extent of temporal variation in population genetic parameters of natural populations. Here, we study the extent of temporal variation in population genetics by genotyping 151 genome-wide SNP markers polymorphic in 466 individuals collected from nine populations of the annual plant Arabidopsis thaliana during 4 years. Populations are located along an altitudinal climatic gradient from Mediterranean to subalpine environments in NE Spain, which has been shown to influence key demographic attributes and life cycle adaptations. Genetically, A. thaliana populations were more variable across space than over time. Common multilocus genotypes were detected several years in the same population, whereas low-frequency multilocus genotypes appeared only 1 year. High-elevation populations were genetically poorer and more variable over time than low-elevation populations, which might be caused by a higher overall demographic instability at higher altitudes. Estimated effective population sizes were low but also showed a significant decreasing trend with increasing altitude, suggesting a deeper impact of genetic drift at high-elevation populations. In comparison with single-year samplings, repeated genotyping over time captured substantially higher amount of genetic variation contained in A. thaliana populations. Furthermore, repeated genotyping of populations provided novel information on the genetic properties of A. thaliana populations and allowed hypothesizing on their underlying mechanisms. Therefore, including temporal genotyping programmes into traditional population genetic studies can significantly increase our understanding of the dynamics of natural populations.

  11. Variations in CYP78A13 coding region influence grain size and yield in rice.

    Science.gov (United States)

    Xu, Fan; Fang, Jun; Ou, Shujun; Gao, Shaopei; Zhang, Fengxia; Du, Lin; Xiao, Yunhua; Wang, Hongru; Sun, Xiaohong; Chu, Jinfang; Wang, Guodong; Chu, Chengcai

    2015-04-01

    Grain size is one of the most important determinants of crop yield in cereals. Here, we identified a dominant mutant, big grain2 (bg2-D) from our enhancer-trapping population. Genetic analysis and SiteFinding PCR (polymerase chain reaction) revealed that BG2 encodes a cytochrome P450, OsCYP78A13. Sequence search revealed that CYP78A13 has a paralogue Grain Length 3.2 (GL3.2, LOC_Os03g30420) in rice with distinct expression patterns, analysis of transgenic plants harbouring either CYP78A13 or GL3.2 showed that both can promote grain growth. Sequence polymorphism analysis with 1529 rice varieties showed that the nucleotide diversity at CYP78A13 gene body and the 20 kb flanking region in the indica varieties were markedly higher than those in japonica varieties. Further, comparison of the genomic sequence of CYP78A13 in the japonica cultivar Nipponbare and the indica cultivar 9311 showed that there were three InDels in the promoter region and eight SNPs (single nucleotide polymorphism) in its coding sequence. Detailed examination of the transgenic plants with chimaeric constructs suggested that variation in CYP78A13 coding region is responsible for the variation of grain yield. Taken together, our results suggest that the variations in CYP78A13 in the indica varieties hold potential in rice breeding for application of grain yield improvement.

  12. Somatic genomic variations in extra-embryonic tissues

    Energy Technology Data Exchange (ETDEWEB)

    Weier, Jingly F.; Ferlatte, Christy; Weier, Heinz-Ulli G.

    2010-05-21

    In the mature chorion, one of the membranes that exist during pregnancy between the developing fetus and mother, human placental cells form highly specialized tissues composed of mesenchyme and floating or anchoring villi. Using fluorescence in situ hybridization, we found that human invasive cytotrophoblasts isolated from anchoring villi or the uterine wall had gained individual chromosomes; however, chromosome losses were detected infrequently. With chromosomes gained in what appeared to be a chromosome-specific manner, more than half of the invasive cytotrophoblasts in normal pregnancies were found to be hyperdiploid. Interestingly, the rates of hyperdiploid cells depended not only on gestational age, but were strongly associated with the extraembryonic compartment at the fetal-maternal interface from which they were isolated. Since hyperdiploid cells showed drastically reduced DNA replication as measured by bromodeoxyuridine incorporation, we conclude that aneuploidy is a part of the normal process of placentation potentially limiting the proliferative capabilities of invasive cytotrophoblasts. Thus, under the special circumstances of human reproduction, somatic genomic variations may exert a beneficial, anti-neoplastic effect on the organism.

  13. From genomes to pangenomes: understanding variation among individuals and species

    OpenAIRE

    Contreras-Moreira, Bruno; Vinuesa, Pablo

    2017-01-01

    This tutorial illustrates how to analyze pan-genomes using GET_HOMOLOGUES and GET_HOMOLOGUES-EST. After a short introduction, where the main concepts are illustrated, the remaining sections cover the installation and typical operations required to analyze and annotate genomes and transcriptomes from a pan-genome perspective, in which individuals or species contribute genetic material to a pool.

  14. Identification of genome-wide copy number variations among diverse pig breeds by array CGH

    Directory of Open Access Journals (Sweden)

    Li Yan

    2012-12-01

    Full Text Available Abstract Background Recent studies have shown that copy number variation (CNV in mammalian genomes contributes to phenotypic diversity, including health and disease status. In domestic pigs, CNV has been catalogued by several reports, but the extent of CNV and the phenotypic effects are far from clear. The goal of this study was to identify CNV regions (CNVRs in pigs based on array comparative genome hybridization (aCGH. Results Here a custom-made tiling oligo-nucleotide array was used with a median probe spacing of 2506 bp for screening 12 pigs including 3 Chinese native pigs (one Chinese Erhualian, one Tongcheng and one Yangxin pig, 5 European pigs (one Large White, one Pietrain, one White Duroc and two Landrace pigs, 2 synthetic pigs (Chinese new line DIV pigs and 2 crossbred pigs (Landrace × DIV pigs with a Duroc pig as the reference. Two hundred and fifty-nine CNVRs across chromosomes 1–18 and X were identified, with an average size of 65.07 kb and a median size of 98.74 kb, covering 16.85 Mb or 0.74% of the whole genome. Concerning copy number status, 93 (35.91% CNVRs were called as gains, 140 (54.05% were called as losses and the remaining 26 (10.04% were called as both gains and losses. Of all detected CNVRs, 171 (66.02% and 34 (13.13% CNVRs directly overlapped with Sus scrofa duplicated sequences and pig QTLs, respectively. The CNVRs encompassed 372 full length Ensembl transcripts. Two CNVRs identified by aCGH were validated using real-time quantitative PCR (qPCR. Conclusions Using 720 K array CGH (aCGH we described a map of porcine CNVs which facilitated the identification of structural variations for important phenotypes and the assessment of the genetic diversity of pigs.

  15. A genome-to-genome analysis of associations between human genetic variation, HIV-1 sequence diversity, and viral control.

    Science.gov (United States)

    Bartha, István; Carlson, Jonathan M; Brumme, Chanson J; McLaren, Paul J; Brumme, Zabrina L; John, Mina; Haas, David W; Martinez-Picado, Javier; Dalmau, Judith; López-Galíndez, Cecilio; Casado, Concepción; Rauch, Andri; Günthard, Huldrych F; Bernasconi, Enos; Vernazza, Pietro; Klimkait, Thomas; Yerly, Sabine; O'Brien, Stephen J; Listgarten, Jennifer; Pfeifer, Nico; Lippert, Christoph; Fusi, Nicolo; Kutalik, Zoltán; Allen, Todd M; Müller, Viktor; Harrigan, P Richard; Heckerman, David; Telenti, Amalio; Fellay, Jacques

    2013-10-29

    HIV-1 sequence diversity is affected by selection pressures arising from host genomic factors. Using paired human and viral data from 1071 individuals, we ran >3000 genome-wide scans, testing for associations between host DNA polymorphisms, HIV-1 sequence variation and plasma viral load (VL), while considering human and viral population structure. We observed significant human SNP associations to a total of 48 HIV-1 amino acid variants (pgenome-to-genome approach highlights sites of genomic conflict and is a strategy generally applicable to studies of host-pathogen interaction. DOI:http://dx.doi.org/10.7554/eLife.01123.001.

  16. Genome size and metabolic intensity in tetrapods: a tale of two lines.

    Science.gov (United States)

    Vinogradov, Alexander E; Anatskaya, Olga V

    2006-01-07

    We show the negative link between genome size and metabolic intensity in tetrapods, using the heart index (relative heart mass) as a unified indicator of metabolic intensity in poikilothermal and homeothermal animals. We found two separate regression lines of heart index on genome size for reptiles-birds and amphibians-mammals (the slope of regression is steeper in reptiles-birds). We also show a negative correlation between GC content and nucleosome formation potential in vertebrate DNA, and, consistent with this relationship, a positive correlation between genome GC content and nuclear size (independent of genome size). It is known that there are two separate regression lines of genome GC content on genome size for reptiles-birds and amphibians-mammals: reptiles-birds have the relatively higher GC content (for their genome sizes) compared to amphibians-mammals. Our results suggest uniting all these data into one concept. The slope of negative regression between GC content and nucleosome formation potential is steeper in exons than in non-coding DNA (where nucleosome formation potential is generally higher), which indicates a special role of non-coding DNA for orderly chromatin organization. The chromatin condensation and nuclear size are supposed to be key parameters that accommodate the effects of both genome size and GC content and connect them with metabolic intensity. Our data suggest that the reptilian-birds clade evolved special relationships among these parameters, whereas mammals preserved the amphibian-like relationships. Surprisingly, mammals, although acquiring a more complex general organization, seem to retain certain genome-related properties that are similar to amphibians. At the same time, the slope of regression between nucleosome formation potential and GC content is steeper in poikilothermal than in homeothermal genomes, which suggests that mammals and birds acquired certain common features of genomic organization.

  17. Genome-wide mapping of copy number variation in humans: comparative analysis of high resolution array platforms.

    Directory of Open Access Journals (Sweden)

    Rajini R Haraksingh

    Full Text Available Accurate and efficient genome-wide detection of copy number variants (CNVs is essential for understanding human genomic variation, genome-wide CNV association type studies, cytogenetics research and diagnostics, and independent validation of CNVs identified from sequencing based technologies. Numerous, array-based platforms for CNV detection exist utilizing array Comparative Genome Hybridization (aCGH, Single Nucleotide Polymorphism (SNP genotyping or both. We have quantitatively assessed the abilities of twelve leading genome-wide CNV detection platforms to accurately detect Gold Standard sets of CNVs in the genome of HapMap CEU sample NA12878, and found significant differences in performance. The technologies analyzed were the NimbleGen 4.2 M, 2.1 M and 3×720 K Whole Genome and CNV focused arrays, the Agilent 1×1 M CGH and High Resolution and 2×400 K CNV and SNP+CGH arrays, the Illumina Human Omni1Quad array and the Affymetrix SNP 6.0 array. The Gold Standards used were a 1000 Genomes Project sequencing-based set of 3997 validated CNVs and an ultra high-resolution aCGH-based set of 756 validated CNVs. We found that sensitivity, total number, size range and breakpoint resolution of CNV calls were highest for CNV focused arrays. Our results are important for cost effective CNV detection and validation for both basic and clinical applications.

  18. Variations in the composition of house dust by particle size.

    Science.gov (United States)

    Lanzerstorfer, Christof

    2017-07-03

    In this study, the distribution of heavy metals and other components in the various size fractions of house dust is investigated. A house dust sample collected from a vacuum cleaner was separated into size fractions by sieving and air classification. The analysis of the size fractions showed that the heavy metals and other components are not uniformly distributed in the various size fractions. The highest total carbon concentrations were found in the size fractions with a mass median diameter of 18-95 µm, while in the coarser size fractions and in the finest size fraction, the total carbon concentration was lower. In contrast, for many heavy metals and other metals (Al, Fe, Ca, S, Mn, Ti, Ba, Sr, As, Co, and V), the maximum concentrations were found in the finest size fraction. With increasing size of the dust fractions, the concentrations decreased. For several of these components, the dependence of the concentration on the particle size can be approximately assessed well using a power function. The distribution of Zn, Cu, Mg and Na was different. While the concentration of Na and Mg was higher in the coarser size fractions, no distinct trend was found for the concentrations of Cu and Zn.

  19. Small genomes and large seeds: chromosome numbers, genome size and seed mass in diploid Aesculus species (Sapindaceae).

    Science.gov (United States)

    Krahulcová, Anna; Trávnícek, Pavel; Krahulec, František; Rejmánek, Marcel

    2017-04-01

    Aesculus L. (horse chestnut, buckeye) is a genus of 12-19 extant woody species native to the temperate Northern Hemisphere. This genus is known for unusually large seeds among angiosperms. While chromosome counts are available for many Aesculus species, only one has had its genome size measured. The aim of this study is to provide more genome size data and analyse the relationship between genome size and seed mass in this genus. Chromosome numbers in root tip cuttings were confirmed for four species and reported for the first time for three additional species. Flow cytometric measurements of 2C nuclear DNA values were conducted on eight species, and mean seed mass values were estimated for the same taxa. The same chromosome number, 2 n = 40, was determined in all investigated taxa. Original measurements of 2C values for seven Aesculus species (eight taxa), added to just one reliable datum for A. hippocastanum , confirmed the notion that the genome size in this genus with relatively large seeds is surprisingly low, ranging from 0·955 pg 2C -1 in A. parviflora to 1·275 pg 2C -1 in A. glabra var. glabra. The chromosome number of 2 n = 40 seems to be conclusively the universal 2 n number for non-hybrid species in this genus. Aesculus genome sizes are relatively small, not only within its own family, Sapindaceae, but also within woody angiosperms. The genome sizes seem to be distinct and non-overlapping among the four major Aesculus clades. These results provide an extra support for the most recent reconstruction of Aesculus phylogeny. The correlation between the 2C values and seed masses in examined Aesculus species is slightly negative and not significant. However, when the four major clades are treated separately, there is consistent positive association between larger genome size and larger seed mass within individual lineages.

  20. Life course variations in the heritability of body size

    DEFF Research Database (Denmark)

    Zhao, J.; Luan, J.A.; Sharp, S.J.

    Background: It has been shown recently that whole genome data can facilitate estimation of genetic contributions to a variety of traits via a mixed model framework as implemented in GCTA and R/SAS (Yang et al. Nat Genet 2010, 42:565-9; Zhao & Luan. J Prob Stat 2012, doi 10.1155/ 2012.485174). Our...

  1. Genomic instability is associated with natural life span variation in Saccharomyces cerevisiae.

    Directory of Open Access Journals (Sweden)

    Hong Qin

    Full Text Available Increasing genomic instability is associated with aging in eukaryotes, but the connection between genomic instability and natural variation in life span is unknown. We have quantified chronological life span and loss-of-heterozygosity (LOH in 11 natural isolates of Saccharomyces cerevisiae. We show that genomic instability increases and mitotic asymmetry breaks down during chronological aging. The age-dependent increase of genomic instability generally lags behind the drop of viability and this delay accounts for approximately 50% of the observed natural variation of replicative life span in these yeast isolates. We conclude that the abilities of yeast strains to tolerate genomic instability co-vary with their replicative life spans. To the best of our knowledge, this is the first quantitative evidence that demonstrates a link between genomic instability and natural variation in life span.

  2. Size variation in small-bodied humans from palau, micronesia.

    Directory of Open Access Journals (Sweden)

    Andrew Gallagher

    Full Text Available BACKGROUND: Recent discoveries on Palau are claimed to represent the remains of small-bodied humans that may display evidence insular size reduction. This claim has yet to be statistically validated METHODOLOGY/PRINCIPAL FINDINGS: Published postcranial specimens (n = 16 from Palau were assessed relative to recent small-bodied comparative samples. Resampling statistical approaches were employed to test specific hypotheses relating to body size in the Palau sample. Results confirm that the Palau postcranial sample is indisputably small-bodied. CONCLUSIONS/SIGNIFICANCE: A single, homogenous body size morph is represented in early prehistoric postcrania from Palau. Small body size in early Palauans is an ancestral characteristic and was likely not a consequence of in-situ size reduction. Specimens from Palau have little bearing upon hypothesised insular size reduction in the ancestral lineage of Homo floresiensis.

  3. Genomic variation across the Yellow-rumped Warbler species complex

    OpenAIRE

    Toews, David P.L.; Brelsford, Alan; Grossen, Christine; Milá, Borja; Irwin, Darren E.

    2016-01-01

    Populations that have experienced long periods of geographic isolation will diverge over time. The application of highthroughput sequencing technologies to study the genomes of related taxa now allows us to quantify, at a fine scale, the consequences of this divergence across the genome. Throughout a number of studies, a notable pattern has emerged. In many cases, estimates of differentiation across the genome are strongly heterogeneous; however, the evolutionary processes driving this striki...

  4. Genome size and transposable element content as determined by high-throughput sequencing in maize and Zea luxurians.

    Science.gov (United States)

    Tenaillon, Maud I; Hufford, Matthew B; Gaut, Brandon S; Ross-Ibarra, Jeffrey

    2011-01-01

    The genome of maize (Zea mays ssp. mays) consists mostly of transposable elements (TEs) and varies in size among lines. This variation extends to other species in the genus Zea: although maize and Zea luxurians diverged only ∼140,000 years ago, their genomes differ in size by ∼50%. We used paired-end Illumina sequencing to evaluate the potential contribution of TEs to the genome size difference between these two species. We aligned the reads both to a filtered gene set and to an exemplar database of unique repeats representing 1,514 TE families; ∼85% of reads mapped against TE repeats in both species. The relative contribution of TE families to the B73 genome was highly correlated with previous estimates, suggesting that reliable estimates of TE content can be obtained from short high-throughput sequencing reads, even at low coverage. Because we used paired-end reads, we could assess whether a TE was near a gene by determining if one paired read mapped to a TE and the second read mapped to a gene. Using this method, Class 2 DNA elements were found significantly more often in genic regions than Class 1 RNA elements, but Class 1 elements were found more often near other TEs. Overall, we found that both Class 1 and 2 TE families account for ∼70% of the genome size difference between B73 and luxurians. Interestingly, the relative abundance of TE families was conserved between species (r = 0.97), suggesting genome-wide control of TE content rather than family-specific effects.

  5. A novel statistical method to estimate the effective SNP size in vertebrate genomes and categorized genomic regions

    Directory of Open Access Journals (Sweden)

    Zhao Zhongming

    2006-12-01

    Full Text Available Abstract Background The local environment of single nucleotide polymorphisms (SNPs contains abundant genetic information for the study of mechanisms of mutation, genome evolution, and causes of diseases. Recent studies revealed that neighboring-nucleotide biases on SNPs were strong and the genome-wide bias patterns could be represented by a small subset of the total SNPs. It remains unsolved for the estimation of the effective SNP size, the number of SNPs that are sufficient to represent the bias patterns observed from the whole SNP data. Results To estimate the effective SNP size, we developed a novel statistical method, SNPKS, which considers both the statistical and biological significances. SNPKS consists of two major steps: to obtain an initial effective size by the Kolmogorov-Smirnov test (KS test and to find an intermediate effective size by interval evaluation. The SNPKS algorithm was implemented in computer programs and applied to the real SNP data. The effective SNP size was estimated to be 38,200, 39,300, 38,000, and 38,700 in the human, chimpanzee, dog, and mouse genomes, respectively, and 39,100, 39,600, 39,200, and 42,200 in human intergenic, genic, intronic, and CpG island regions, respectively. Conclusion SNPKS is the first statistical method to estimate the effective SNP size. It runs efficiently and greatly outperforms the algorithm implemented in SNPNB. The application of SNPKS to the real SNP data revealed the similar small effective SNP size (38,000 – 42,200 in the human, chimpanzee, dog, and mouse genomes as well as in human genomic regions. The findings suggest strong influence of genetic factors across vertebrate genomes.

  6. Variation in body size and metamorphic traits of Iberian spadefoot toads over a short geographic distance

    OpenAIRE

    2008-01-01

    Determinants of geographic variation in body size are often poorly understood, especially in organisms with complex life cycles. We examined patterns of adult body size and metamorphic traits variation in Iberian spadefoot toad (Pelobates cultripes) populations, which exhibit an extreme reduction in adult body size, 71.6% reduction in body mass, within just about 30 km at south-western Spain. We hypothesized that size at and time to metamorphosis would be predictive of the spatial pattern obs...

  7. Quantitative testing of the methodology for genome size estimation in plants using flow cytometry: a case study of the Primulina genus

    Directory of Open Access Journals (Sweden)

    Jing eWang

    2015-05-01

    Full Text Available Flow cytometry (FCM is a commonly used method for estimating genome size in many organisms. The use of flow cytometry in plants is influenced by endogenous fluorescence inhibitors and may cause an inaccurate estimation of genome size; thus, falsifying the relationship between genome size and phenotypic traits/ecological performance. Quantitative optimization of FCM methodology minimizes such errors, yet there are few studies detailing this methodology. We selected the genus Primulina, one of the most representative and diverse genera of the Old World Gesneriaceae, to evaluate the methodology effect on determining genome size. Our results showed that buffer choice significantly affected genome size estimation in six out of the eight species examined and altered the 2C-value (DNA content by as much as 21.4%. The staining duration and propidium iodide (PI concentration slightly affected the 2C-value. Our experiments showed better histogram quality when the samples were stained for 40 minutes at a PI concentration of 100 µg ml-1. The quality of the estimates was not improved by one-day incubation in the dark at 4 °C or by centrifugation. Thus, our study determined an optimum protocol for genome size measurement in Primulina: LB01 buffer supplemented with 100 µg ml-1 PI and stained for 40 minutes. This protocol also demonstrated a high universality in other Gesneriaceae genera. We report the genome size of nine Gesneriaceae species for the first time. The results showed substantial genome size variation both within and among the species, with the 2C-value ranging between 1.62 and 2.71 pg. Our study highlights the necessity of optimizing the FCM methodology prior to obtaining reliable genome size estimates in a given taxon.

  8. Quantitative testing of the methodology for genome size estimation in plants using flow cytometry: a case study of the Primulina genus.

    Science.gov (United States)

    Wang, Jing; Liu, Juan; Kang, Ming

    2015-01-01

    Flow cytometry (FCM) is a commonly used method for estimating genome size in many organisms. The use of FCM in plants is influenced by endogenous fluorescence inhibitors and may cause an inaccurate estimation of genome size; thus, falsifying the relationship between genome size and phenotypic traits/ecological performance. Quantitative optimization of FCM methodology minimizes such errors, yet there are few studies detailing this methodology. We selected the genus Primulina, one of the most representative and diverse genera of the Old World Gesneriaceae, to evaluate the methodology effect on determining genome size. Our results showed that buffer choice significantly affected genome size estimation in six out of the eight species examined and altered the 2C-value (DNA content) by as much as 21.4%. The staining duration and propidium iodide (PI) concentration slightly affected the 2C-value. Our experiments showed better histogram quality when the samples were stained for 40 min at a PI concentration of 100 μg ml(-1). The quality of the estimates was not improved by 1-day incubation in the dark at 4°C or by centrifugation. Thus, our study determined an optimum protocol for genome size measurement in Primulina: LB01 buffer supplemented with 100 μg ml(-1) PI and stained for 40 min. This protocol also demonstrated a high universality in other Gesneriaceae genera. We report the genome size of nine Gesneriaceae species for the first time. The results showed substantial genome size variation both within and among the species, with the 2C-value ranging between 1.62 and 2.71 pg. Our study highlights the necessity of optimizing the FCM methodology prior to obtaining reliable genome size estimates in a given taxon.

  9. Distinct gene number-genome size relationships for eukaryotes and non-eukaryotes: gene content estimation for dinoflagellate genomes.

    Directory of Open Access Journals (Sweden)

    Yubo Hou

    Full Text Available The ability to predict gene content is highly desirable for characterization of not-yet sequenced genomes like those of dinoflagellates. Using data from completely sequenced and annotated genomes from phylogenetically diverse lineages, we investigated the relationship between gene content and genome size using regression analyses. Distinct relationships between log(10-transformed protein-coding gene number (Y' versus log(10-transformed genome size (X', genome size in kbp were found for eukaryotes and non-eukaryotes. Eukaryotes best fit a logarithmic model, Y' = ln(-46.200+22.678X', whereas non-eukaryotes a linear model, Y' = 0.045+0.977X', both with high significance (p0.91. Total gene number shows similar trends in both groups to their respective protein coding regressions. The distinct correlations reflect lower and decreasing gene-coding percentages as genome size increases in eukaryotes (82%-1% compared to higher and relatively stable percentages in prokaryotes and viruses (97%-47%. The eukaryotic regression models project that the smallest dinoflagellate genome (3x10(6 kbp contains 38,188 protein-coding (40,086 total genes and the largest (245x10(6 kbp 87,688 protein-coding (92,013 total genes, corresponding to 1.8% and 0.05% gene-coding percentages. These estimates do not likely represent extraordinarily high functional diversity of the encoded proteome but rather highly redundant genomes as evidenced by high gene copy numbers documented for various dinoflagellate species.

  10. Copy number variation in CNP267 region may be associated with hip bone size.

    Directory of Open Access Journals (Sweden)

    Shan-Lin Liu

    Full Text Available Osteoporotic hip fracture (HF is a serious global public health problem associated with high morbidity and mortality. Hip bone size (BS has been identified as one of key measurable risk factors for HF, independent of bone mineral density (BMD. Hip BS is highly genetically determined, but genetic factors underlying BS variation are still poorly defined. Here, we performed an initial genome-wide copy number variation (CNV association analysis for hip BS in 1,627 Chinese Han subjects using Affymetrix GeneChip Human Mapping SNP 6.0 Array and a follow-up replicate study in 2,286 unrelated US Caucasians sample. We found that a copy number polymorphism (CNP267 located at chromosome 2q12.2 was significantly associated with hip BS in both initial Chinese and replicate Caucasian samples with p values of 4.73E-03 and 5.66E-03, respectively. An important candidate gene, four and a half LIM domains 2 (FHL2, was detected at the downstream of CNP267, which plays important roles in bone metabolism by binding to several bone formation regulator, such as insulin-like growth factor-binding protein 5 (IGFBP-5 and androgen receptor (AR. Our findings suggest that CNP267 region may be associated with hip BS which might influence the FHL2 gene downstream.

  11. Transcriptome and genome size analysis of the venus flytrap

    DEFF Research Database (Denmark)

    Jensen, Michael Krogh; Vogt, Josef Korbinian; Bressendorff, Simon

    2015-01-01

    The insectivorous Venus flytrap (Dionaea muscipula) is renowned from Darwin's studies of plant carnivory and the origins of species. To provide tools to analyze the evolution and functional genomics of D. muscipula, we sequenced a normalized cDNA library synthesized from mRNA isolated from D...

  12. Inexpensive multiplexed library preparation for megabase-sized genomes.

    Directory of Open Access Journals (Sweden)

    Michael Baym

    Full Text Available Whole-genome sequencing has become an indispensible tool of modern biology. However, the cost of sample preparation relative to the cost of sequencing remains high, especially for small genomes where the former is dominant. Here we present a protocol for rapid and inexpensive preparation of hundreds of multiplexed genomic libraries for Illumina sequencing. By carrying out the Nextera tagmentation reaction in small volumes, replacing costly reagents with cheaper equivalents, and omitting unnecessary steps, we achieve a cost of library preparation of $8 per sample, approximately 6 times cheaper than the standard Nextera XT protocol. Furthermore, our procedure takes less than 5 hours for 96 samples. Several hundred samples can then be pooled on the same HiSeq lane via custom barcodes. Our method will be useful for re-sequencing of microbial or viral genomes, including those from evolution experiments, genetic screens, and environmental samples, as well as for other sequencing applications including large amplicon, open chromosome, artificial chromosomes, and RNA sequencing.

  13. Overview of the creative genome: effects of genome structure and sequence on the generation of variation and evolution.

    Science.gov (United States)

    Caporale, Lynn Helena

    2012-09-01

    This overview of a special issue of Annals of the New York Academy of Sciences discusses uneven distribution of distinct types of variation across the genome, the dependence of specific types of variation upon distinct classes of DNA sequences and/or the induction of specific proteins, the circumstances in which distinct variation-generating systems are activated, and the implications of this work for our understanding of evolution and of cancer. Also discussed is the value of non text-based computational methods for analyzing information carried by DNA, early insights into organizational frameworks that affect genome behavior, and implications of this work for comparative genomics. © 2012 New York Academy of Sciences.

  14. Genomic analysis reveals major determinants of cis-regulatory variation in Capsella grandiflora.

    Science.gov (United States)

    Steige, Kim A; Laenen, Benjamin; Reimegård, Johan; Scofield, Douglas G; Slotte, Tanja

    2017-01-31

    Understanding the causes of cis-regulatory variation is a long-standing aim in evolutionary biology. Although cis-regulatory variation has long been considered important for adaptation, we still have a limited understanding of the selective importance and genomic determinants of standing cis-regulatory variation. To address these questions, we studied the prevalence, genomic determinants, and selective forces shaping cis-regulatory variation in the outcrossing plant Capsella grandiflora We first identified a set of 1,010 genes with common cis-regulatory variation using analyses of allele-specific expression (ASE). Population genomic analyses of whole-genome sequences from 32 individuals showed that genes with common cis-regulatory variation (i) are under weaker purifying selection and (ii) undergo less frequent positive selection than other genes. We further identified genomic determinants of cis-regulatory variation. Gene body methylation (gbM) was a major factor constraining cis-regulatory variation, whereas presence of nearby transposable elements (TEs) and tissue specificity of expression increased the odds of ASE. Our results suggest that most common cis-regulatory variation in C. grandiflora is under weak purifying selection, and that gene-specific functional constraints are more important for the maintenance of cis-regulatory variation than genome-scale variation in the intensity of selection. Our results agree with previous findings that suggest TE silencing affects nearby gene expression, and provide evidence for a link between gbM and cis-regulatory constraint, possibly reflecting greater dosage sensitivity of body-methylated genes. Given the extensive conservation of gbM in flowering plants, this suggests that gbM could be an important predictor of cis-regulatory variation in a wide range of plant species.

  15. VIGoR: Variational Bayesian Inference for Genome-Wide Regression

    Directory of Open Access Journals (Sweden)

    Akio Onogi

    2016-04-01

    Full Text Available Genome-wide regression using a number of genome-wide markers as predictors is now widely used for genome-wide association mapping and genomic prediction. We developed novel software for genome-wide regression which we named VIGoR (variational Bayesian inference for genome-wide regression. Variational Bayesian inference is computationally much faster than widely used Markov chain Monte Carlo algorithms. VIGoR implements seven regression methods, and is provided as a command line program package for Linux/Mac, and as a cross-platform R package. In addition to model fitting, cross-validation and hyperparameter tuning using cross-validation can be automatically performed by modifying a single argument. VIGoR is available at https://github.com/Onogi/VIGoR. The R package is also available at https://cran.r-project.org/web/packages/VIGoR/index.html.

  16. Are we Genomic Mosaics? Variations of the Genome of Somatic Cells can Contribute to Diversify our Phenotypes.

    Science.gov (United States)

    Astolfi, P A; Salamini, F; Sgaramella, V

    2010-09-01

    Theoretical and experimental evidences support the hypothesis that the genomes and the epigenomes may be different in the somatic cells of complex organisms. In the genome, the differences range from single base substitutions to chromosome number; in the epigenome, they entail multiple postsynthetic modifications of the chromatin. Somatic genome variations (SGV) may accumulate during development in response both to genetic programs, which may differ from tissue to tissue, and to environmental stimuli, which are often undetected and generally irreproducible. SGV may jeopardize physiological cellular functions, but also create novel coding and regulatory sequences, to be exposed to intraorganismal Darwinian selection. Genomes acknowledged as comparatively poor in genes, such as humans', could thus increase their pristine informational endowment. A better understanding of SGV will contribute to basic issues such as the "nature vs nurture" dualism and the inheritance of acquired characters. On the applied side, they may explain the low yield of cloning via somatic cell nuclear transfer, provide clues to some of the problems associated with transdifferentiation, and interfere with individual DNA analysis. SGV may be unique in the different cells types and in the different developmental stages, and thus explain the several hundred gaps persisting in the human genomes "completed" so far. They may compound the variations associated to our epigenomes and make of each of us an "(epi)genomic" mosaic. An ensuing paradigm is the possibility that a single genome (the ephemeral one assembled at fertilization) has the capacity to generate several different brains in response to different environments.

  17. G-protein genomic association with normal variation in gray matter density

    NARCIS (Netherlands)

    Chen, J.; Calhoun, V.D.; Arias-Vasquez, A.; Zwiers, M.P.; Hulzen, K. van; Fernandez, G.S.E.; Fisher, S.E.; Franke, B.; Turner, J.A.; Liu, J.

    2015-01-01

    While detecting genetic variations underlying brain structures helps reveal mechanisms of neural disorders, high data dimensionality poses a major challenge for imaging genomic association studies. In this work, we present the application of a recently proposed approach, parallel independent

  18. Body weight in relation to variation in body size of Oystercatchers Haematopus ostralegus

    NARCIS (Netherlands)

    Zwarts, L; Hulscher, JB; Koopman, K; Zegers, PM

    1996-01-01

    This paper analyses the relationships between body weight in the Oystercatcher and two measures of its body size, bill length and wing length. The weight variation between individuals due to differences in body size is nearly as large as the seasonal variation in body weight within individuals. Wing

  19. Genome size and sequence composition of moso bamboo: A comparative study

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    Moso bamboo (Phyllostachys pubescens) is one of the world's most important bamboo species. It has the largest area of all planted bamboo―over two-thirds of the total bamboo forest area―and the highest economic value in China. Moso bamboo is a tetraploid (4x=48) and a special member of the grasses family. Although several genomes have been sequenced or are being sequenced in the grasses family, we know little about the genome of the bambusoids (bamboos). In this study, the moso bamboo genome size was estimated to be about 2034 Mb by flow cytometry (FCM), using maize (cv. B73) and rice (cv. Nipponbare) as internal references. The rice genome has been sequenced and the maize genome is being sequenced. We found that the size of the moso bamboo genome was similar to that of maize but significantly larger than that of rice. To determine whether the bamboo genome had a high proportion of repeat elements, similar to that of the maize genome, approximately 1000 genome survey sequences (GSS) were generated. Sequence analysis showed that the proportion of repeat elements was 23.3% for the bamboo genome, which is significantly lower than that of the maize genome (65.7%). The bamboo repeat elements were mainly Gypsy/DIRS1 and Ty1/Copia LTR retrotransposons (14.7%), with a few DNA transposons. However, more genomic sequences are needed to confirm the above results due to several factors, such as the limitation of our GSS data. This study is the first to investigate sequence composition of the bamboo genome. Our results are valuable for future genome research of moso and other bamboos.

  20. A high-definition view of functional genetic variation from natural yeast genomes.

    Science.gov (United States)

    Bergström, Anders; Simpson, Jared T; Salinas, Francisco; Barré, Benjamin; Parts, Leopold; Zia, Amin; Nguyen Ba, Alex N; Moses, Alan M; Louis, Edward J; Mustonen, Ville; Warringer, Jonas; Durbin, Richard; Liti, Gianni

    2014-04-01

    The question of how genetic variation in a population influences phenotypic variation and evolution is of major importance in modern biology. Yet much is still unknown about the relative functional importance of different forms of genome variation and how they are shaped by evolutionary processes. Here we address these questions by population level sequencing of 42 strains from the budding yeast Saccharomyces cerevisiae and its closest relative S. paradoxus. We find that genome content variation, in the form of presence or absence as well as copy number of genetic material, is higher within S. cerevisiae than within S. paradoxus, despite genetic distances as measured in single-nucleotide polymorphisms being vastly smaller within the former species. This genome content variation, as well as loss-of-function variation in the form of premature stop codons and frameshifting indels, is heavily enriched in the subtelomeres, strongly reinforcing the relevance of these regions to functional evolution. Genes affected by these likely functional forms of variation are enriched for functions mediating interaction with the external environment (sugar transport and metabolism, flocculation, metal transport, and metabolism). Our results and analyses provide a comprehensive view of genomic diversity in budding yeast and expose surprising and pronounced differences between the variation within S. cerevisiae and that within S. paradoxus. We also believe that the sequence data and de novo assemblies will constitute a useful resource for further evolutionary and population genomics studies.

  1. The GAAS metagenomic tool and its estimations of viral and microbial average genome size in four major biomes.

    Science.gov (United States)

    Angly, Florent E; Willner, Dana; Prieto-Davó, Alejandra; Edwards, Robert A; Schmieder, Robert; Vega-Thurber, Rebecca; Antonopoulos, Dionysios A; Barott, Katie; Cottrell, Matthew T; Desnues, Christelle; Dinsdale, Elizabeth A; Furlan, Mike; Haynes, Matthew; Henn, Matthew R; Hu, Yongfei; Kirchman, David L; McDole, Tracey; McPherson, John D; Meyer, Folker; Miller, R Michael; Mundt, Egbert; Naviaux, Robert K; Rodriguez-Mueller, Beltran; Stevens, Rick; Wegley, Linda; Zhang, Lixin; Zhu, Baoli; Rohwer, Forest

    2009-12-01

    Metagenomic studies characterize both the composition and diversity of uncultured viral and microbial communities. BLAST-based comparisons have typically been used for such analyses; however, sampling biases, high percentages of unknown sequences, and the use of arbitrary thresholds to find significant similarities can decrease the accuracy and validity of estimates. Here, we present Genome relative Abundance and Average Size (GAAS), a complete software package that provides improved estimates of community composition and average genome length for metagenomes in both textual and graphical formats. GAAS implements a novel methodology to control for sampling bias via length normalization, to adjust for multiple BLAST similarities by similarity weighting, and to select significant similarities using relative alignment lengths. In benchmark tests, the GAAS method was robust to both high percentages of unknown sequences and to variations in metagenomic sequence read lengths. Re-analysis of the Sargasso Sea virome using GAAS indicated that standard methodologies for metagenomic analysis may dramatically underestimate the abundance and importance of organisms with small genomes in environmental systems. Using GAAS, we conducted a meta-analysis of microbial and viral average genome lengths in over 150 metagenomes from four biomes to determine whether genome lengths vary consistently between and within biomes, and between microbial and viral communities from the same environment. Significant differences between biomes and within aquatic sub-biomes (oceans, hypersaline systems, freshwater, and microbialites) suggested that average genome length is a fundamental property of environments driven by factors at the sub-biome level. The behavior of paired viral and microbial metagenomes from the same environment indicated that microbial and viral average genome sizes are independent of each other, but indicative of community responses to stressors and environmental conditions.

  2. The GAAS metagenomic tool and its estimations of viral and microbial average genome size in four major biomes.

    Directory of Open Access Journals (Sweden)

    Florent E Angly

    2009-12-01

    Full Text Available Metagenomic studies characterize both the composition and diversity of uncultured viral and microbial communities. BLAST-based comparisons have typically been used for such analyses; however, sampling biases, high percentages of unknown sequences, and the use of arbitrary thresholds to find significant similarities can decrease the accuracy and validity of estimates. Here, we present Genome relative Abundance and Average Size (GAAS, a complete software package that provides improved estimates of community composition and average genome length for metagenomes in both textual and graphical formats. GAAS implements a novel methodology to control for sampling bias via length normalization, to adjust for multiple BLAST similarities by similarity weighting, and to select significant similarities using relative alignment lengths. In benchmark tests, the GAAS method was robust to both high percentages of unknown sequences and to variations in metagenomic sequence read lengths. Re-analysis of the Sargasso Sea virome using GAAS indicated that standard methodologies for metagenomic analysis may dramatically underestimate the abundance and importance of organisms with small genomes in environmental systems. Using GAAS, we conducted a meta-analysis of microbial and viral average genome lengths in over 150 metagenomes from four biomes to determine whether genome lengths vary consistently between and within biomes, and between microbial and viral communities from the same environment. Significant differences between biomes and within aquatic sub-biomes (oceans, hypersaline systems, freshwater, and microbialites suggested that average genome length is a fundamental property of environments driven by factors at the sub-biome level. The behavior of paired viral and microbial metagenomes from the same environment indicated that microbial and viral average genome sizes are independent of each other, but indicative of community responses to stressors and

  3. A comparison of cataloged variation between International HapMap Consortium and 1000 Genomes Project data

    OpenAIRE

    2012-01-01

    Background Since publication of the human genome in 2003, geneticists have been interested in risk variant associations to resolve the etiology of traits and complex diseases. The International HapMap Consortium undertook an effort to catalog all common variation across the genome (variants with a minor allele frequency (MAF) of at least 5% in one or more ethnic groups). HapMap along with advances in genotyping technology led to genome-wide association studies which have identified common var...

  4. Economy, Speed and Size Matter: Evolutionary Forces Driving Nuclear Genome Miniaturization and Expansion

    OpenAIRE

    CAVALIER-SMITH, THOMAS

    2005-01-01

    • Background Nuclear genome size varies 300 000-fold, whereas transcriptome size varies merely 17-fold. In the largest genomes nearly all DNA is non-genic secondary DNA, mostly intergenic but also within introns. There is now compelling evidence that secondary DNA is functional, i.e. positively selected by organismal selection, not the purely neutral or ‘selfish’ outcome of mutation pressure. The skeletal DNA theory argued that nuclear volumes are genetically determined primarily by nuclear D...

  5. CREST maps somatic structural variation in cancer genomes with base-pair resolution.

    Science.gov (United States)

    Wang, Jianmin; Mullighan, Charles G; Easton, John; Roberts, Stefan; Heatley, Sue L; Ma, Jing; Rusch, Michael C; Chen, Ken; Harris, Christopher C; Ding, Li; Holmfeldt, Linda; Payne-Turner, Debbie; Fan, Xian; Wei, Lei; Zhao, David; Obenauer, John C; Naeve, Clayton; Mardis, Elaine R; Wilson, Richard K; Downing, James R; Zhang, Jinghui

    2011-06-12

    We developed 'clipping reveals structure' (CREST), an algorithm that uses next-generation sequencing reads with partial alignments to a reference genome to directly map structural variations at the nucleotide level of resolution. Application of CREST to whole-genome sequencing data from five pediatric T-lineage acute lymphoblastic leukemias (T-ALLs) and a human melanoma cell line, COLO-829, identified 160 somatic structural variations. Experimental validation exceeded 80%, demonstrating that CREST had a high predictive accuracy.

  6. CREST maps somatic structural variation in cancer genomes with base-pair resolution

    OpenAIRE

    2011-01-01

    We developed CREST (Clipping REveals STructure), an algorithm that uses next-generation sequencing reads with partial alignments to a reference genome to directly map structural variations at the nucleotide level of resolution. Application of CREST to whole-genome sequencing data from five pediatric T-lineage acute lymphoblastic leukemias (T-ALLs) and a human melanoma cell line, COLO-829, identified 160 somatic structural variations. Experimental validation exceeded 80% demonstrating that CRE...

  7. FECUNDITY AND EGG SIZE VARIATION IN TILAPIA ZILLII ...

    African Journals Online (AJOL)

    Relative Fecundity (RF), were made by standardizing fish to common weight. ANOVA result ... advanced as a possible reason for its higher population than T. mariae in the lagoon. A maximum ... these characters and implication for the species ... Sampling and Laboratory Procedure: ... Egg size measurement was carried.

  8. Internal size variations in Tn1546-like elements due to the presence of IS1216V

    DEFF Research Database (Denmark)

    Jensen, Lars Bogø

    1998-01-01

    In this study, internal size variations in the VanA gene cluster Tn1546, encoding resistance to glycopeptides, is described. Studies of previously uncharacterized size variations of an internal region, encoding the vanX and vanY genes of Tn1546, revealed that these variations were due to the pres......-essential for vancomycin resistance. (C) 1998 Federation of European Microbiological Societies....

  9. Larger Daphnia at lower temperature: a role for cell size and genome configuration?

    Science.gov (United States)

    Jalal, Marwa; Wojewodzic, Marcin W; Laane, Carl Morten M; Hessen, Dag O

    2013-09-01

    Experiments with Daphnia magna and Daphnia pulex raised at 10 and 20 °C yielded larger adult size at the lower temperature. This must reflect increased cell size, increased cell numbers, or a combination of both. As it is difficult to achieve good estimates on cell size in crustaceans, we, therefore, measured nucleus and genome size using flow cytometry at 10 and 20 °C. DNA was stained with propidium iodide, ethidium bromide, and DAPI. Both nucleus and genome size estimates were elevated at 10 °C compared with 20 °C, suggesting that larger body size at low temperature could partly be accredited to an enlarged nucleus and thus cell size. Confocal microscopy observations confirmed the staining properties of fluorochromes. As differences in nucleotide numbers in response of growth temperature within a life span is unlikely, these results seem accredited to changed DNA-fluorochrome binding properties, presumably reflecting increased DNA condensation at low temperature. This implies that genome size comparisons may be impacted by ambient temperature in ectotherms. It also suggests that temperature-induced structural changes in the genome could affect cell size and for some species even body size.

  10. Characterizing genomic variation of Arabidopsis thaliana: the roles of geography and climate.

    Science.gov (United States)

    Lasky, Jesse R; Des Marais, David L; McKay, John K; Richards, James H; Juenger, Thomas E; Keitt, Timothy H

    2012-11-01

    Arabidopsis thaliana inhabits diverse climates and exhibits varied phenology across its range. Although A. thaliana is an extremely well-studied model species, the relationship between geography, growing season climate and its genetic variation is poorly characterized. We used redundancy analysis (RDA) to quantify the association of genomic variation [214 051 single nucleotide polymorphisms (SNPs)] with geography and climate among 1003 accessions collected from 447 locations in Eurasia. We identified climate variables most correlated with genomic variation, which may be important selective gradients related to local adaptation across the species range. Climate variation among sites of origin explained slightly more genomic variation than geographical distance. Large-scale spatial gradients and early spring temperatures explained the most genomic variation, while growing season and summer conditions explained the most after controlling for spatial structure. SNP variation in Scandinavia showed the greatest climate structure among regions, possibly because of relatively consistent phenology and life history of populations in this region. Climate variation explained more variation among nonsynonymous SNPs than expected by chance, suggesting that much of the climatic structure of SNP correlations is due to changes in coding sequence that may underlie local adaptation.

  11. Variation in clutch size in relation to nest size in birds

    Science.gov (United States)

    Møller, Anders P; Adriaensen, Frank; Artemyev, Alexandr; Bańbura, Jerzy; Barba, Emilio; Biard, Clotilde; Blondel, Jacques; Bouslama, Zihad; Bouvier, Jean-Charles; Camprodon, Jordi; Cecere, Francesco; Charmantier, Anne; Charter, Motti; Cichoń, Mariusz; Cusimano, Camillo; Czeszczewik, Dorota; Demeyrier, Virginie; Doligez, Blandine; Doutrelant, Claire; Dubiec, Anna; Eens, Marcel; Eeva, Tapio; Faivre, Bruno; Ferns, Peter N; Forsman, Jukka T; García-Del-Rey, Eduardo; Goldshtein, Aya; Goodenough, Anne E; Gosler, Andrew G; Góźdź, Iga; Grégoire, Arnaud; Gustafsson, Lars; Hartley, Ian R; Heeb, Philipp; Hinsley, Shelley A; Isenmann, Paul; Jacob, Staffan; Järvinen, Antero; Juškaitis, Rimvydas; Korpimäki, Erkki; Krams, Indrikis; Laaksonen, Toni; Leclercq, Bernard; Lehikoinen, Esa; Loukola, Olli; Lundberg, Arne; Mainwaring, Mark C; Mänd, Raivo; Massa, Bruno; Mazgajski, Tomasz D; Merino, Santiago; Mitrus, Cezary; Mönkkönen, Mikko; Morales-Fernaz, Judith; Morin, Xavier; Nager, Ruedi G; Nilsson, Jan-Åke; Nilsson, Sven G; Norte, Ana C; Orell, Markku; Perret, Philippe; Pimentel, Carla S; Pinxten, Rianne; Priedniece, Ilze; Quidoz, Marie-Claude; Remeš, Vladimir; Richner, Heinz; Robles, Hugo; Rytkönen, Seppo; Senar, Juan Carlos; Seppänen, Janne T; da Silva, Luís P; Slagsvold, Tore; Solonen, Tapio; Sorace, Alberto; Stenning, Martyn J; Török, János; Tryjanowski, Piotr; van Noordwijk, Arie J; von Numers, Mikael; Walankiewicz, Wiesław; Lambrechts, Marcel M

    2014-01-01

    Nests are structures built to support and protect eggs and/or offspring from predators, parasites, and adverse weather conditions. Nests are mainly constructed prior to egg laying, meaning that parent birds must make decisions about nest site choice and nest building behavior before the start of egg-laying. Parent birds should be selected to choose nest sites and to build optimally sized nests, yet our current understanding of clutch size-nest size relationships is limited to small-scale studies performed over short time periods. Here, we quantified the relationship between clutch size and nest size, using an exhaustive database of 116 slope estimates based on 17,472 nests of 21 species of hole and non-hole-nesting birds. There was a significant, positive relationship between clutch size and the base area of the nest box or the nest, and this relationship did not differ significantly between open nesting and hole-nesting species. The slope of the relationship showed significant intraspecific and interspecific heterogeneity among four species of secondary hole-nesting species, but also among all 116 slope estimates. The estimated relationship between clutch size and nest box base area in study sites with more than a single size of nest box was not significantly different from the relationship using studies with only a single size of nest box. The slope of the relationship between clutch size and nest base area in different species of birds was significantly negatively related to minimum base area, and less so to maximum base area in a given study. These findings are consistent with the hypothesis that bird species have a general reaction norm reflecting the relationship between nest size and clutch size. Further, they suggest that scientists may influence the clutch size decisions of hole-nesting birds through the provisioning of nest boxes of varying sizes. PMID:25478150

  12. Intramale variation in sperm size: functional significance in a polygynous mammal

    OpenAIRE

    2015-01-01

    Studies concerning the relationships between sperm size and velocity at the intraspecific level are quite limited and often yielded contradictory results across the animal kingdom. Intramale variation in sperm size may represent a meaningful factor to predict sperm velocity, due to its relationship with the level of sperm competition among related taxa. Because sperm phenotype is under post-copulatory sexual selection, we hypothesized that a reduced intramale variation in sperm size is associ...

  13. Interactions of photosynthesis with genome size and function.

    Science.gov (United States)

    Raven, John A; Beardall, John; Larkum, Anthony W D; Sánchez-Baracaldo, Patricia

    2013-07-19

    Photolithotrophs are divided between those that use water as their electron donor (Cyanobacteria and the photosynthetic eukaryotes) and those that use a different electron donor (the anoxygenic photolithotrophs, all of them Bacteria). Photolithotrophs with the most reduced genomes have more genes than do the corresponding chemoorganotrophs, and the fastest-growing photolithotrophs have significantly lower specific growth rates than the fastest-growing chemoorganotrophs. Slower growth results from diversion of resources into the photosynthetic apparatus, which accounts for about half of the cell protein. There are inherent dangers in (especially oxygenic) photosynthesis, including the formation of reactive oxygen species (ROS) and blue light sensitivity of the water spitting apparatus. The extent to which photolithotrophs incur greater DNA damage and repair, and faster protein turnover with increased rRNA requirement, needs further investigation. A related source of environmental damage is ultraviolet B (UVB) radiation (280-320 nm), whose flux at the Earth's surface decreased as oxygen (and ozone) increased in the atmosphere. This oxygenation led to the requirements of defence against ROS, and decreasing availability to organisms of combined (non-dinitrogen) nitrogen and ferrous iron, and (indirectly) phosphorus, in the oxygenated biosphere. Differential codon usage in the genome and, especially, the proteome can lead to economies in the use of potentially growth-limiting elements.

  14. Evolution and Variation of the SARS-CoV Genome

    Institute of Scientific and Technical Information of China (English)

    Jianfei Hu; Zizhang Zhang; Wei Wei; Songgang Li; Jun Wang; Jian Wang; Jun Yu; Huanming Yang; Jing Wang; Jing Xu; Wei Li; Yujun Han; Yan Li; Jia Ji; Jia Ye; Zhao Xu

    2003-01-01

    Knowledge of the evolution of pathogens is of great medical and biological significance to the prevention, diagnosis, and therapy of infectious diseases. In order to understand the origin and evolution of the SARS-CoV (severe acute respiratory syndrome-associated coronavirus), we collected complete genome sequences of all viruses available in GenBank, and made comparative analyses with the SARSCoV. Genomic signature analysis demonstrates that the coronaviruses all take the TGTT as their richest tetranucleotide except the SARS-CoV. A detailed analysis of the forty-two complete SARS-CoV genome sequences revealed the existence of two distinct genotypes, and showed that these isolates could be classified into four groups. Our manual analysis of the BLASTN results demonstrates that the HE (hemagglutinin-esterase) gene exists in the SARS-CoV, and many mutations made it unfamiliar to us.

  15. Structural genomic variation as risk factor for idiopathic recurrent miscarriage

    DEFF Research Database (Denmark)

    Nagirnaja, Liina; Palta, Priit; Kasak, Laura;

    2014-01-01

    within RM study group revealed significant enrichment of loci related to innate immunity and immunoregulatory pathways essential for immune tolerance at fetomaternal interface. As a major finding, we report a multicopy duplication (61.6 kb) at 5p13.3 conferring increased maternal risk to RM in Estonia...... and identify common rearrangements modulating risk to RM. Genome-wide screening of Estonian RM patients and fertile controls identified excessive cumulative burden of CNVs (5.4 and 6.1 Mb per genome) in two RM cases possibly increasing their individual disease risk. Functional profiling of all rearranged genes...... and Denmark (meta-analysis, n = 309/205, odds ratio = 4.82, P = 0.012). Comparison to Estonian population-based cohort (total, n = 1000) confirmed the risk for Estonian female cases (P = 7.9 × 10(-4) ). Datasets of four cohorts from the Database of Genomic Variants (total, n = 5,846 subjects) exhibited...

  16. The Cambrian explosion triggered by critical turning point in genome size evolution.

    Science.gov (United States)

    Li, Dirson Jian; Zhang, Shengli

    2010-02-05

    The Cambrian explosion is a grand challenge to science today and involves multidisciplinary study. This event is generally believed as a result of genetic innovations, environmental factors and ecological interactions, even though there are many conflicts on nature and timing of metazoan origins. The crux of the matter is that an entire roadmap of the evolution is missing to discern the biological complexity transition and to evaluate the critical role of the Cambrian explosion in the overall evolutionary context. Here, we calculate the time of the Cambrian explosion by a "C-value clock"; our result quite fits the fossil records. We clarify that the intrinsic reason of genome evolution determined the Cambrian explosion. A general formula for evaluating genome size of different species has been found, by which the genome size evolution can be illustrated. The Cambrian explosion, as a major transition of biological complexity, essentially corresponds to a critical turning point in genome size evolution.

  17. Comparative Whole-Genome Mapping To Determine Staphylococcus aureus Genome Size, Virulence Motifs, and Clonality

    Science.gov (United States)

    Pantrang, Madhulatha; Stahl, Buffy; Briska, Adam M.; Stemper, Mary E.; Wagner, Trevor K.; Zentz, Emily B.; Callister, Steven M.; Lovrich, Steven D.; Henkhaus, John K.; Dykes, Colin W.

    2012-01-01

    Despite being a clonal pathogen, Staphylococcus aureus continues to acquire virulence and antibiotic-resistant genes located on mobile genetic elements such as genomic islands, prophages, pathogenicity islands, and the staphylococcal chromosomal cassette mec (SCCmec) by horizontal gene transfer from other staphylococci. The potential virulence of a S. aureus strain is often determined by comparing its pulsed-field gel electrophoresis (PFGE) or multilocus sequence typing profiles to that of known epidemic or virulent clones and by PCR of the toxin genes. Whole-genome mapping (formerly optical mapping), which is a high-resolution ordered restriction mapping of a bacterial genome, is a relatively new genomic tool that allows comparative analysis across entire bacterial genomes to identify regions of genomic similarities and dissimilarities, including small and large insertions and deletions. We explored whether whole-genome maps (WGMs) of methicillin-resistant S. aureus (MRSA) could be used to predict the presence of methicillin resistance, SCCmec type, and Panton-Valentine leukocidin (PVL)-producing genes on an S. aureus genome. We determined the WGMs of 47 diverse clinical isolates of S. aureus, including well-characterized reference MRSA strains, and annotated the signature restriction pattern in SCCmec types, arginine catabolic mobile element (ACME), and PVL-carrying prophage, PhiSa2 or PhiSa2-like regions on the genome. WGMs of these isolates accurately characterized them as MRSA or methicillin-sensitive S. aureus based on the presence or absence of the SCCmec motif, ACME and the unique signature pattern for the prophage insertion that harbored the PVL genes. Susceptibility to methicillin resistance and the presence of mecA, SCCmec types, and PVL genes were confirmed by PCR. A WGM clustering approach was further able to discriminate isolates within the same PFGE clonal group. These results showed that WGMs could be used not only to genotype S. aureus but also to

  18. Novel nuclei isolation buffer for flow cytometric genome size estimation of Zingiberaceae: a comparison with common isolation buffers.

    Science.gov (United States)

    Sadhu, Abhishek; Bhadra, Sreetama; Bandyopadhyay, Maumita

    2016-11-01

    Cytological parameters such as chromosome numbers and genome sizes of plants are used routinely for studying evolutionary aspects of polyploid plants. Members of Zingiberaceae show a wide range of inter- and intrageneric variation in their reproductive habits and ploidy levels. Conventional cytological study in this group of plants is severely hampered by the presence of diverse secondary metabolites, which also affect their genome size estimation using flow cytometry. None of the several nuclei isolation buffers used in flow cytometry could be used very successfully for members of Zingiberaceae to isolate good quality nuclei from both shoot and root tissues. The competency of eight nuclei isolation buffers was compared with a newly formulated buffer, MB01, in six different genera of Zingiberaceae based on the fluorescence intensity of propidium iodide-stained nuclei using flow cytometric parameters, namely coefficient of variation of the G0/G1 peak, debris factor and nuclei yield factor. Isolated nuclei were studied using fluorescence microscopy and bio-scanning electron microscopy to analyse stain-nuclei interaction and nuclei topology, respectively. Genome contents of 21 species belonging to these six genera were determined using MB01. Flow cytometric parameters showed significant differences among the analysed buffers. MB01 exhibited the best combination of analysed parameters; photomicrographs obtained from fluorescence and electron microscopy supported the superiority of MB01 buffer over other buffers. Among the 21 species studied, nuclear DNA contents of 14 species are reported for the first time. Results of the present study substantiate the enhanced efficacy of MB01, compared to other buffers tested, in the generation of acceptable cytograms from all species of Zingiberaceae studied. Our study facilitates new ways of sample preparation for further flow cytometric analysis of genome size of other members belonging to this highly complex polyploid family.

  19. Random distribution pattern and non-adaptivity of genome size in a highly variable population of Festuca pallens.

    Science.gov (United States)

    Smarda, Petr; Bures, Petr; Horová, Lucie

    2007-07-01

    The spatial and statistical distribution of genome sizes and the adaptivity of genome size to some types of habitat, vegetation or microclimatic conditions were investigated in a tetraploid population of Festuca pallens. The population was previously documented to vary highly in genome size and is assumed as a model for the study of the initial stages of genome size differentiation. Using DAPI flow cytometry, samples were measured repeatedly with diploid Festuca pallens as the internal standard. Altogether 172 plants from 57 plots (2.25 m(2)), distributed in contrasting habitats over the whole locality in South Moravia, Czech Republic, were sampled. The differences in DNA content were confirmed by the double peaks of simultaneously measured samples. At maximum, a 1.115-fold difference in genome size was observed. The statistical distribution of genome sizes was found to be continuous and best fits the extreme (Gumbel) distribution with rare occurrences of extremely large genomes (positive-skewed), as it is similar for the log-normal distribution of the whole Angiosperms. Even plants from the same plot frequently varied considerably in genome size and the spatial distribution of genome sizes was generally random and unautocorrelated (P > 0.05). The observed spatial pattern and the overall lack of correlations of genome size with recognized vegetation types or microclimatic conditions indicate the absence of ecological adaptivity of genome size in the studied population. These experimental data on intraspecific genome size variability in Festuca pallens argue for the absence of natural selection and the selective non-significance of genome size in the initial stages of genome size differentiation, and corroborate the current hypothetical model of genome size evolution in Angiosperms (Bennetzen et al., 2005, Annals of Botany 95: 127-132).

  20. Size distribution and seasonal variation of atmospheric cellulose

    Science.gov (United States)

    Puxbaum, Hans; Tenze-Kunit, Monika

    Atmospheric cellulose is a main constituent of the insoluble organic aerosol and a "macrotracer" for plant debris. A time series of the cellulose concentration at a downtown site in Vienna showed a maximum concentration during fall and a secondary maximum during spring. The fall maximum appears to be associated with leaf litter production, the spring maximum with increased biological activity involving repulsion of cellulose-containing particles, e.g. seed production. The grand average of the time series over 9 months was 0.374 μg m -3 cellulose, respectively, 0.75 μg m -3 plant debris. Compared to an annual average of 5.7 μg m -3 organic carbon as observed at a Vienna downtown site it becomes clear that plant debris is a major contributor to the organic aerosol and has to be considered in source attribution studies. Simultaneous measurements at the downtown and a suburban site indicated that particulate cellulose is obviously not produced within the city in notable amounts, at least during the campaign in December. Size distribution measurements with impactors showed the unexpected result that "fine aerosol" size particles (0.1- 1.6 μm aerodynamic diameter) contained 0.7% "free cellulose" on a mass basis, forming a wettable, but insoluble part of the accumulation mode aerosol.

  1. Vertical Variation of Ice Particle Size in Convective Cloud Tops

    Science.gov (United States)

    Van Diedenhoven, Bastiaan; Fridlind, Ann M.; Cairns, Brian; Ackerman, Andrew S.; Yorks, John E.

    2016-01-01

    A novel technique is used to estimate derivatives of ice effective radius with respect to height near convective cloud tops (dr(sub e)/dz) from airborne shortwave reflectance measurements and lidar. Values of dr(sub e)/dz are about -6 micrometer/km for cloud tops below the homogeneous freezing level, increasing to near 0 micrometer/km above the estimated level of neutral buoyancy. Retrieved dr(sub e)/dz compares well with previously documented remote sensing and in situ estimates. Effective radii decrease with increasing cloud top height, while cloud top extinction increases. This is consistent with weaker size sorting in high, dense cloud tops above the level of neutral buoyancy where fewer large particles are present and with stronger size sorting in lower cloud tops that are less dense. The results also confirm that cloud top trends of effective radius can generally be used as surrogates for trends with height within convective cloud tops. These results provide valuable observational targets for model evaluation.

  2. Global spectrum of copy number variations reveals genome organizational plasticity and proposes new migration routes.

    Science.gov (United States)

    Veerappa, Avinash M; Vishweswaraiah, Sangeetha; Lingaiah, Kusuma; Murthy, Megha; Suresh, Raviraj V; Manjegowda, Dinesh S; Ramachandra, Nallur B

    2015-01-01

    Global spectrum of CNVs is required to catalog variations to provide a high-resolution on the dynamics of genome-organization and human migration. In this study, we performed genome-wide genotyping using high-resolution arrays and identified 44,109 CNVs from 1,715 genomes across 12 populations. The study unraveled the force of independent evolutionary dynamics on genome-organizational plasticity across populations. We demonstrated the use of CNV tool to study human migration and identified a second major settlement establishing new migration routes in addition to existing ones.

  3. Genome Variation Within Triticale in Comparison to its Wheat and Rye Progenitors

    Science.gov (United States)

    Genome variation in the intergeneric wheat-rye hybrid triticale (X Triticosecale Wittmack) has been a puzzle to scientists and plant breeders since the first triticale was synthesized. The existence of unexplained genetic variation in triticale as compared to the parents has been a hindrance to bre...

  4. Copy number variation in Fayoumi and Leghorn chickens analyzed using array comparative genomic hybridization

    NARCIS (Netherlands)

    Abernathy, J.; Li, X.; Jia, X.; Chou, W.; Lamont, S.J.; Crooijmans, R.P.M.A.; Zhou, H.

    2014-01-01

    Copy number variation refers to regions along chromosomes that harbor a type of structural variation, such as duplications or deletions. Copy number variants (CNVs) play a role in many important traits as well as in genetic diversity. Previous analyses of chickens using array comparative genomic hyb

  5. ChickVD: a sequence variation database for the chicken genome

    DEFF Research Database (Denmark)

    Wang, Jing; He, Ximiao; Ruan, Jue

    2005-01-01

    Working in parallel with the efforts to sequence the chicken (Gallus gallus) genome, the Beijing Genomics Institute led an international team of scientists from China, USA, UK, Sweden, The Netherlands and Germany to map extensive DNA sequence variation throughout the chicken genome by sampling DNA...... from domestic breeds. Using the Red Jungle Fowl genome sequence as a reference, we identified 3.1 million non-redundant DNA sequence variants. To facilitate the application of our data to avian genetics and to provide a foundation for functional and evolutionary studies, we created the 'Chicken...... Variation Database' (ChickVD). A graphical MapView shows variants mapped onto the chicken genome in the context of gene annotations and other features, including genetic markers, trait loci, cDNAs, chicken orthologs of human disease genes and raw sequence traces. ChickVD also stores information...

  6. Using multilocus sequence typing to study bacterial variation: prospects in the genomic era.

    Science.gov (United States)

    Jolley, Keith A; Maiden, Martin C J

    2014-01-01

    Multilocus sequence typing (MLST) indexes the sequence variation present in a small number (usually seven) of housekeeping gene fragments located around the bacterial genome. Unique alleles at these loci are assigned arbitrary integer identifiers, which effectively summarizes the variation present in several thousand base pairs of genome sequence information as a series of numbers. Comparing bacterial isolates using allele-based methods efficiently corrects for the effects of lateral gene transfer present in many bacterial populations and is computationally efficient. This 'gene-by-gene' approach can be applied to larger collections of loci, such as the ribosomal protein genes used in ribosomal MLST (rMLST), up to and including the complete set of coding sequences present in a genome, whole-genome MLST (wgMLST), providing scalable, efficient and readily interpreted genome analysis.

  7. Theories of Population Variation in Genes and Genomes

    DEFF Research Database (Denmark)

    Christiansen, Freddy

    as biologists, molecular biologists, breeders, biomathematicians, and biostatisticians. •    Up-to-date treatment of key areas in classical and modern theoretical population genetics •    In-depth coverage of coalescent theory •    Timely discussion of genomic effects of selection •    Inspired by...

  8. Genomic and gene variation in Mycoplasma hominis strains

    DEFF Research Database (Denmark)

    Christiansen, Gunna; Andersen, H; Birkelund, Svend;

    1987-01-01

    DNAs from 14 strains of Mycoplasma hominis isolated from various habitats, including strain PG21, were analyzed for genomic heterogeneity. DNA-DNA filter hybridization values were from 51 to 91%. Restriction endonuclease digestion patterns, analyzed by agarose gel electrophoresis, revealed no ide...

  9. Structural genomic variation as risk factor for idiopathic recurrent miscarriage

    DEFF Research Database (Denmark)

    Nagirnaja, Liina; Palta, Priit; Kasak, Laura

    2014-01-01

    Recurrent miscarriage (RM) is a multifactorial disorder with acknowledged genetic heritability that affects ∼3% of couples aiming at childbirth. As copy number variants (CNVs) have been shown to contribute to reproductive disease susceptibility, we aimed to describe genome-wide profile of CNVs...

  10. Size variation of polyaniline nanoparticles dispersed in polyvinyl alcohol matrix

    Indian Academy of Sciences (India)

    J Bhadra; D Sarkar

    2010-10-01

    We report the preparation of polyaniline (PANI) nanoparticles dispersed in polyvinyl alcohol (PVA) matrix. From SEM picture it is seen that the particle sizes vary from 100–20 nm. Also with increase in PVA content the stability of dispersion is found to increase. Apart from SEM, spin cast films of PANI in PVA are also characterized through XRD and FTIR. XRD shows increase in crystallinity with PVA content and FTIR gives evidence of crosslinking between PANI and PVA molecules. In plane electrical conductivity (in the range of 102 Scm-1) and the exponent of nonlinear – are found to decrease with increase of PVA content. There is a good correlation between SEM, XRD, FTIR and electrical properties.

  11. The Influence of Genome and Cell Size on Brain Morphology in Amphibians.

    Science.gov (United States)

    Roth, Gerhard; Walkowiak, Wolfgang

    2015-08-10

    In amphibians, nerve cell size is highly correlated with genome size, and increases in genome and cell size cause a retardation of the rate of development of nervous (as well as nonnervous) tissue leading to secondary simplification. This yields an inverse relationship between genome and cell size on the one hand and morphological complexity of the tectum mesencephali as the main visual center, the size of the torus semicircularis as the main auditory center, the size of the amphibian papilla as an important peripheral auditory structure, and the size of the cerebellum as a major sensorimotor center. Nervous structures developing later (e.g., torus and cerebellum) are more affected by secondary simplification than those that develop earlier (e.g., the tectum). This effect is more prominent in salamanders and caecilians than in frogs owing to larger genome and cells sizes in the former two taxa. We hypothesize that because of intragenomic evolutionary processes, important differences in brain morphology can arise independently of specific environmental selection.

  12. Spatial variation in egg size of a top predator: Interplay of body size and environmental factors?

    Science.gov (United States)

    Louzao, Maite; Igual, José M.; Genovart, Meritxell; Forero, Manuela G.; Hobson, Keith A.; Oro, Daniel

    2008-09-01

    It is expected that nearby populations are constrained by the same ecological features shaping in turn similarity in their ecological traits. Here, we studied the spatio-temporal variability in egg size among local populations of the critically endangered Balearic shearwater Puffinus mauretanicus, a top marine predator endemic to the western Mediterranean region. Specifically we assessed whether this trait was influenced by maternal body size, as an indicator of a genetic component, and feeding ecology (through stable-carbon and nitrogen-isotope measurements), as an indicator of environmental factors. We found that egg size varied among local populations, an unexpected result at such a small spatial scale. Body size differences at the local population level only partially explained such differences. Blood isotope measurements also differed among local populations. Values of δ 15N suggested inter-population differences in trophic level, showing a similar general pattern with egg size, and suggesting a nutritional link between them whereby egg size was affected by differences in feeding resources and/or behaviour. Values of δ 13C suggested that local populations did not differ in foraging habits with respect to benthic- vs. pelagic-based food-webs. Egg size did not vary among years as did breeding performance, suggesting that a differential temporal window could affect both breeding parameters in relation to food availability. The absence of a relationship between breeding performance and egg size suggested that larger eggs might only confer an advantage during harsh conditions. Alternatively parental quality could greatly affect breeding performance. We showed that inter-population differences in egg size could be influenced by both body size and environmental factors.

  13. Mandibular size and shape variation in the hominins at Dmanisi, Republic of Georgia.

    Science.gov (United States)

    Skinner, Matthew M; Gordon, Adam D; Collard, Nicole J

    2006-07-01

    The hominin fossils of Dmanisi, Republic of Georgia, present an ideal means of assessing levels of skeletal size and shape variation in a fossil hypodigm belonging to the genus Homo because they have been recovered from a spatially and temporally restricted context. We compare variation in mandible size and shape at Dmanisi to that of extant hominoids and extinct hominins. We use height and breadth measurements of the mandibular corpus at the first molar and the symphysis to assess size, and analyze shape based on size-adjusted (using a geometric mean) versions of these four variables. We compare size and shape variation at Dmanisi relative to all possible pairs of individuals within each comparative taxon using an exact resampling procedure of the ratio of D2600 to D211 and the average Euclidean distance (AED) between D2600 and D211, respectively. Comparisons to extant hominoids were conducted at both the specific and subspecific taxonomic levels and to extinct hominins by adopting both a more, and less speciose, hominin taxonomy. Results indicate that the pattern of variation for the Dmanisi hominins does not resemble that of any living species: they exhibit significantly more size variation when compared to modern humans, and they have significantly more corpus shape variation and size variation in corpus heights and overall mandible size than any extant ape species. When compared to fossil hominins they are also more dimorphic in size (although this result is influenced by the taxonomic hypothesis applied to the hominin fossil record). These results highlight the need to re-examine expectations of levels of sexual dimorphism in members of the genus Homo and to account for marked size and shape variation between D2600 and D211 under the prevailing view of a single hominin species at Dmanisi.

  14. Theories of Population Variation in Genes and Genomes

    DEFF Research Database (Denmark)

    Christiansen, Freddy

    genetics, while emphasizing the close interplay between theory and empiricism. Traditional topics such as genetic and phenotypic variation, mutation, migration, and linkage are covered and advanced by contemporary coalescent theory, which describes the genealogy of genes in a population, ultimately...

  15. Identification of genomic regions associated with phenotypic variation between dog breeds using selection mapping

    DEFF Research Database (Denmark)

    Vaysse, Amaury; Ratnakumar, Abhirami; Derrien, Thomas;

    2011-01-01

    across the genome in dog breeds are the result of both selection and genetic drift, but extended blocks of homozygosity on a megabase scale appear to be best explained by selection. Further elucidation of the variants under selection will help to uncover the genetic basis of complex traits and disease....... breeds using a newly developed high-density genotyping array consisting of >170,000 evenly spaced SNPs. We first identify 44 genomic regions exhibiting extreme differentiation across multiple breeds. Genetic variation in these regions correlates with variation in several phenotypic traits that vary...... to provide a list of variants that may directly affect these traits. This study provides a catalogue of genomic regions showing extreme reduction in genetic variation or population differentiation in dogs, including many linked to phenotypic variation. The many blocks of reduced haplotype diversity observed...

  16. Maintenance of phenotypic variation: repeatibility, heritability, and size-dependent processes in a wild brook trout population

    Science.gov (United States)

    Benjamin H. Letcher; Jason A Coombs; Keith H. Nislow

    2011-01-01

    Phenotypic variation in body size can result from within-cohort variation in birth dates, among-individual growth variation and size-selective processes. We explore the relative effects of these processes on the maintenance of wide observed body size variation in stream-dwelling brook trout (Salvelinus fontinalis). Based on the analyses of multiple...

  17. Genome size of Alexandrium catenella and Gracilariopsis lemaneiformis estimated by flow cytometry

    Science.gov (United States)

    Du, Qingwei; Sui, Zhenghong; Chang, Lianpeng; Wei, Huihui; Liu, Yuan; Mi, Ping; Shang, Erlei; Zeeshan, Niaz; Que, Zhou

    2016-08-01

    Flow cytometry (FCM) technique has been widely applied to estimating the genome size of various higher plants. However, there is few report about its application in algae. In this study, an optimized procedure of FCM was exploited to estimate the genome size of two eukaryotic algae. For analyzing Alexandrium catenella, an important red tide species, the whole cell instead of isolated nucleus was studied, and chicken erythrocytes were used as an internal reference. The genome size of A. catenella was estimated to be 56.48 ± 4.14 Gb (1C), approximately nineteen times larger than that of human genome. For analyzing Gracilariopsis lemaneiformis, an important economical red alga, the purified nucleus was employed, and Arabidopsis thaliana and Chondrus crispus were used as internal references, respectively. The genome size of Gp. lemaneiformis was 97.35 ± 2.58 Mb (1C) and 112.73 ± 14.00 Mb (1C), respectively, depending on the different internal references. The results of this research will promote the related studies on the genomics and evolution of these two species.

  18. Viral genome size distribution does not correlate with the antiquity of the host lineages

    Directory of Open Access Journals (Sweden)

    José Alberto Campillo-Balderas

    2015-12-01

    Full Text Available It has been suggested that RNA viruses and other subcellular entities endowed with RNA genomes are relicts from an ancient RNA/protein World which is believed to have preceded extant DNA/RNA/protein-based cells. According to their proponents, this possibility is supported by the small-genome sizes of RNA viruses and their manifold replication strategies, which have been interpreted as the result of an evolutionary exploration of different alternative genome organizations and replication strategies during early evolutionary stages. At the other extreme are the giant DNA viruses, whose genome sizes can be as large as those of some prokaryotes, and which have been grouped by some authors into a fourth domain of life. As argued here, the comparative analysis of the chemical nature and sizes of the viral genomes reported in GenBank does not reveal any obvious correlation with the phylogenetic history of their hosts. Accordingly, it is somewhat difficult to reconcile the proposal of the putative pre-DNA antiquity of RNA viruses, with their extraordinary diversity in plant hosts and their apparent absence among the Archaea. Other issues related to the genome size of all known viruses and subviral agents and the relationship with their hosts are discussed.

  19. Using large-scale genome variation cohorts to decipher the molecular mechanism of cancer.

    Science.gov (United States)

    Habermann, Nina; Mardin, Balca R; Yakneen, Sergei; Korbel, Jan O

    2016-01-01

    Characterizing genomic structural variations (SVs) in the human genome remains challenging, and there is a growing interest to understand somatic SVs occurring in cancer, a disease of the genome. A havoc-causing SV process known as chromothripsis scars the genome when localized chromosome shattering and repair occur in a one-off catastrophe. Recent efforts led to the development of a set of conceptual criteria for the inference of chromothripsis events in cancer genomes and to the development of experimental model systems for studying this striking DNA alteration process in vitro. We discuss these approaches, and additionally touch upon current "Big Data" efforts that employ hybrid cloud computing to enable studies of numerous cancer genomes in an effort to search for commonalities and differences in molecular DNA alteration processes in cancer.

  20. Genomic variation in the porcine immunoglobulin lambda variable region.

    Science.gov (United States)

    Guo, Xi; Schwartz, John C; Murtaugh, Michael P

    2016-04-01

    Production of a vast antibody repertoire is essential for the protection against pathogens. Variable region germline complexity contributes to repertoire diversity and is a standard feature of mammalian immunoglobulin loci, but functional V region genes are limited in swine. For example, the porcine lambda light chain locus is composed of 23 variable (V) genes and 4 joining (J) genes, but only 10 or 11 V and 2 J genes are functional. Allelic variation in V and J may increase overall diversity within a population, yet lead to repertoire holes in individuals lacking key alleles. Previous studies focused on heavy chain genetic variation, thus light chain allelic diversity is not known. We characterized allelic variation of the porcine immunoglobulin lambda variable (IGLV) region genes. All intact IGLV genes in 81 pigs were amplified, sequenced, and analyzed to determine their allelic variation and functionality. We observed mutational variation across the entire length of the IGLV genes, in both framework and complementarity determining regions (CDRs). Three recombination hotspot motifs were also identified suggesting that non-allelic homologous recombination is an evolutionarily alternative mechanism for generating germline antibody diversity. Functional alleles were greatest in the most highly expressed families, IGLV3 and IGLV8. At the population level, allelic variation appears to help maintain the potential for broad antibody repertoire diversity in spite of reduced gene segment choices and limited germline sequence modification. The trade-off may be a reduction in repertoire diversity within individuals that could result in an increased variation in immunity to infectious disease and response to vaccination.

  1. Variations in the size of focal nodular hyperplasia on magnetic resonance imaging.

    Science.gov (United States)

    Ramírez-Fuentes, C; Martí-Bonmatí, L; Torregrosa, A; Del Val, A; Martínez, C

    2013-01-01

    To evaluate the changes in the size of focal nodular hyperplasia (FNH) during long-term magnetic resonance imaging (MRI) follow-up. We reviewed 44 FNHs in 30 patients studied with MRI with at least two MRI studies at least 12 months apart. We measured the largest diameter of the lesion (inmm) in contrast-enhanced axial images and calculated the percentage of variation as the difference between the maximum diameter in the follow-up and the maximum diameter in the initial study. We defined significant variation in size as variation greater than 20%. We also analyzed predisposing hormonal factors. The mean interval between the two imaging studies was 35±2 months (range: 12-94). Most lesions (80%) remained stable during follow-up. Only 9 of the 44 lesions (20%) showed a significant variation in diameter: 7 (16%) decreased in size and 2 (4%) increased, with variations that reached the double of the initial size. The change in size was not related to pregnancy, menopause, or the use of birth control pills or corticoids. Changes in the size of FNHs during follow-up are relatively common and should not lead to a change in the diagnosis. These variations in size seem to be independent of hormonal factors that are considered to predispose. Copyright © 2011 SERAM. Published by Elsevier Espana. All rights reserved.

  2. Geographical variation in body size and sexual size dimorphism in an Australian lizard, Boulenger's Skink (Morethia boulengeri).

    Science.gov (United States)

    Michael, Damian R; Banks, Sam C; Piggott, Maxine P; Cunningham, Ross B; Crane, Mason; MacGregor, Christopher; McBurney, Lachlan; Lindenmayer, David B

    2014-01-01

    Ecogeographical rules help explain spatial and temporal patterns in intraspecific body size. However, many of these rules, when applied to ectothermic organisms such as reptiles, are controversial and require further investigation. To explore factors that influence body size in reptiles, we performed a heuristic study to examine body size variation in an Australian lizard, Boulenger's Skink Morethia boulengeri from agricultural landscapes in southern New South Wales, south-eastern Australia. We collected tissue and morphological data on 337 adult lizards across a broad elevation and climate gradient. We used a model-selection procedure to determine if environmental or ecological variables best explained body size variation. We explored the relationship between morphology and phylogenetic structure before modeling candidate variables from four broad domains: (1) geography (latitude, longitude and elevation), (2) climate (temperature and rainfall), (3) habitat (vegetation type, number of logs and ground cover attributes), and (4) management (land use and grazing history). Broad phylogenetic structure was evident, but on a scale larger than our study area. Lizards were sexually dimorphic, whereby females had longer snout-vent length than males, providing support for the fecundity selection hypothesis. Body size variation in M. boulengeri was correlated with temperature and rainfall, a pattern consistent with larger individuals occupying cooler and more productive parts of the landscape. Climate change forecasts, which predict warmer temperature and increased aridity, may result in reduced lizard biomass and decoupling of trophic interactions with potential implications for community organization and ecosystem function.

  3. Changes in pollinator fauna affect altitudinal variation of floral size in a bumblebee-pollinated herb.

    Science.gov (United States)

    Nagano, Yusuke; Abe, Kota; Kitazawa, Tomoaki; Hattori, Mitsuru; Hirao, Akira S; Itino, Takao

    2014-09-01

    Geographic trait variations are often caused by locally different selection regimes. As a steep environmental cline along altitude strongly influences adaptive traits, mountain ecosystems are ideal for exploring adaptive differentiation over short distances. We investigated altitudinal floral size variation of Campanula punctata var. hondoensis in 12 populations in three mountain regions of central Japan to test whether the altitudinal floral size variation was correlated with the size of the local bumblebee pollinator and to assess whether floral size was selected for by pollinator size. We found apparent geographic variations in pollinator assemblages along altitude, which consequently produced a geographic change in pollinator size. Similarly, we found altitudinal changes in floral size, which proved to be correlated with the local pollinator size, but not with altitude itself. Furthermore, pollen removal from flower styles onto bees (plant's male fitness) was strongly influenced by the size match between flower style length and pollinator mouthpart length. These results strongly suggest that C. punctata floral size is under pollinator-mediated selection and that a geographic mosaic of locally adapted C. punctata exists at fine spatial scale.

  4. [Phylogenetic relationships and intraspecific variation of D-genome Aegilops L. as revealed by RAPD analysis].

    Science.gov (United States)

    Goriunova, S V; Kochieva, E Z; Chikida, N N; Pukhal'skiĭ, V A

    2004-05-01

    RAPD analysis was carried out to study the genetic variation and phylogenetic relationships of polyploid Aegilops species, which contain the D genome as a component of the alloploid genome, and diploid Aegilops tauschii, which is a putative donor of the D genome for common wheat. In total, 74 accessions of six D-genome Aegilops species were examined. The highest intraspecific variation (0.03-0.21) was observed for Ae. tauschii. Intraspecific distances between accessions ranged 0.007-0.067 in Ae. cylindrica, 0.017-0.047 in Ae. vavilovii, and 0.00-0.053 in Ae. juvenalis. Likewise, Ae. ventricosa and Ae. crassa showed low intraspecific polymorphism. The among-accession difference in alloploid Ae. ventricosa (genome DvNv) was similar to that of one parental species, Ae. uniaristata (N), and substantially lower than in the other parent, Ae. tauschii (D). The among-accession difference in Ae. cylindrica (CcDc) was considerably lower than in either parent, Ae. tauschii (D) or Ae. caudata (C). With the exception of Ae. cylindrica, all D-genome species--Ae. tauschii (D), Ae. ventricosa (DvNv), Ae. crassa (XcrDcrl and XcrDcrlDcr2), Ae. juvenalis (XjDjUj), and Ae. vavilovii (XvaDvaSva)--formed a single polymorphic cluster, which was distinct from clusters of other species. The only exception, Ae. cylindrica, did not group with the other D-genome species, but clustered with Ae. caudata (C), a donor of the C genome. The cluster of these two species was clearly distinct from the cluster of the other D-genome species and close to a cluster of Ae. umbellulata (genome U) and Ae. ovata (genome UgMg). Thus, RAPD analysis for the first time was used to estimate and to compare the interpopulation polymorphism and to establish the phylogenetic relationships of all diploid and alloploid D-genome Aegilops species.

  5. Genome Size and GC Content Evolution of Festuca: Ancestral Expansion and Subsequent Reduction

    Science.gov (United States)

    Šmarda, Petr; Bureš, Petr; Horová, Lucie; Foggi, Bruno; Rossi, Graziano

    2008-01-01

    Background and Aims Plant evolution is well known to be frequently associated with remarkable changes in genome size and composition; however, the knowledge of long-term evolutionary dynamics of these processes still remains very limited. Here a study is made of the fine dynamics of quantitative genome evolution in Festuca (fescue), the largest genus in Poaceae (grasses). Methods Using flow cytometry (PI, DAPI), measurements were made of DNA content (2C-value), monoploid genome size (Cx-value), average chromosome size (C/n-value) and cytosine + guanine (GC) content of 101 Festuca taxa and 14 of their close relatives. The results were compared with the existing phylogeny based on ITS and trnL-F sequences. Key Results The divergence of the fescue lineage from related Poeae was predated by about a 2-fold monoploid genome and chromosome size enlargement, and apparent GC content enrichment. The backward reduction of these parameters, running parallel in both main evolutionary lineages of fine-leaved and broad-leaved fescues, appears to diverge among the existing species groups. The most dramatic reductions are associated with the most recently and rapidly evolving groups which, in combination with recent intraspecific genome size variability, indicate that the reduction process is probably ongoing and evolutionarily young. This dynamics may be a consequence of GC-rich retrotransposon proliferation and removal. Polyploids derived from parents with a large genome size and high GC content (mostly allopolyploids) had smaller Cx- and C/n-values and only slightly deviated from parental GC content, whereas polyploids derived from parents with small genome and low GC content (mostly autopolyploids) generally had a markedly increased GC content and slightly higher Cx- and C/n-values. Conclusions The present study indicates the high potential of general quantitative characters of the genome for understanding the long-term processes of genome evolution, testing evolutionary

  6. A Variational Bayes Genomic-Enabled Prediction Model with Genotype × Environment Interaction

    Directory of Open Access Journals (Sweden)

    Osval A. Montesinos-López

    2017-06-01

    Full Text Available There are Bayesian and non-Bayesian genomic models that take into account G×E interactions. However, the computational cost of implementing Bayesian models is high, and becomes almost impossible when the number of genotypes, environments, and traits is very large, while, in non-Bayesian models, there are often important and unsolved convergence problems. The variational Bayes method is popular in machine learning, and, by approximating the probability distributions through optimization, it tends to be faster than Markov Chain Monte Carlo methods. For this reason, in this paper, we propose a new genomic variational Bayes version of the Bayesian genomic model with G×E using half-t priors on each standard deviation (SD term to guarantee highly noninformative and posterior inferences that are not sensitive to the choice of hyper-parameters. We show the complete theoretical derivation of the full conditional and the variational posterior distributions, and their implementations. We used eight experimental genomic maize and wheat data sets to illustrate the new proposed variational Bayes approximation, and compared its predictions and implementation time with a standard Bayesian genomic model with G×E. Results indicated that prediction accuracies are slightly higher in the standard Bayesian model with G×E than in its variational counterpart, but, in terms of computation time, the variational Bayes genomic model with G×E is, in general, 10 times faster than the conventional Bayesian genomic model with G×E. For this reason, the proposed model may be a useful tool for researchers who need to predict and select genotypes in several environments.

  7. Efficiency of genomic DNA extraction dependent on the size of magnetic nanoclusters

    Science.gov (United States)

    Cho, Hyun Ah; Hyun Min, Ji; Hua Wu, Jun; Woo Jang, Jin; Lim, Chae-Seung; Keun Kim, Young

    2014-05-01

    We report the efficiency of genomic DNA extraction as a function of particle size and quantity. For DNA extraction, we synthesized magnetic nanoclusters of various sizes and coated the surface of these magnetic nanoclusters with meso-2,3-dimercaptosuccinic acid. We showed that the nanoclusters had a tight particle size distribution and high crystallinity. Furthermore, we observed that the three types of magnetic nanoclusters studied exhibited ferrimagnetic behavior and that larger nanoclusters showed larger saturation magnetization values. The resultant efficiency of DNA extraction is inversely proportional to particle size in the range of nanoclusters tested, due to the fact that the surface-to-volume ratio decreases as particle size increases.

  8. Variation in size, morphology and chemical composition of polymetallic nodules from the Central Indian Ocean Basin

    Digital Repository Service at National Institute of Oceanography (India)

    Valsangkar, A.B.; Karisiddaiah, S.M.; Parthiban, G.

    Chemical composition of 613 polymetallic nodules from 150 stations in the Central Indian Ocean Basin (CIOB) are determined and variations in Mn, Fe, Cu, Ni, Co, Zn and moisture content are studied with respect to their size and surface texture...

  9. GENOME SIZE DETERMINATION AND RAPD ANALYSIS OF FOUR EDIBLE AROIDS OF NORTH EAST INDIA

    Directory of Open Access Journals (Sweden)

    Jyoti P. Saikia1*, Bolin K. Konwar 2 and Susmita Singh3

    2010-10-01

    Full Text Available Four edible aroid species were selected for the study. The genomic DNA of the plants was isolated and estimated. A part of the genomic DNA was used for analysis using six different primers from Operon Technologies, USA. The genome size determined for the aroids is in the order of Colocasia esculenta> Xanthosoma caracu> Xanthosoma sagittifolium > Amorphophallus paeonifolius. Amorphophallus species was found to be 50% similar to both Xanthosoma caracu and Colocasia esculenta. The analysis will provide a ground for exploring the vast diversified aroid population of the region.

  10. Consequences of intraspecific seed-size variation in Sparganium emersum for dispersal by fish

    NARCIS (Netherlands)

    Pollux, B.J.A.; Ouborg, J.; Van Groenendael, J.M.; Klaassen, M.R.J.

    2007-01-01

    The potential for seed dispersal by fish (ichthyochory) is likely to vary within aquatic plant species, depending on intraspecific variation in phenotypic seed traits. 2. We studied the effect of seed size variation within the unbranched burreed (Sparganium emersum) on the potential for internal dis

  11. Consequences of intraspecific seed-size variation in Sparganium emersum for dispersal by fish

    NARCIS (Netherlands)

    Pollux, B.J.A.; Ouborg, J.; Van Groenendael, J.M.; Klaassen, M.R.J.

    2007-01-01

    The potential for seed dispersal by fish (ichthyochory) is likely to vary within aquatic plant species, depending on intraspecific variation in phenotypic seed traits. 2. We studied the effect of seed size variation within the unbranched burreed (Sparganium emersum) on the potential for internal

  12. Does litter size variation affect models of terrestrial carnivore extinction risk and management?

    Directory of Open Access Journals (Sweden)

    Eleanor S Devenish-Nelson

    Full Text Available BACKGROUND: Individual variation in both survival and reproduction has the potential to influence extinction risk. Especially for rare or threatened species, reliable population models should adequately incorporate demographic uncertainty. Here, we focus on an important form of demographic stochasticity: variation in litter sizes. We use terrestrial carnivores as an example taxon, as they are frequently threatened or of economic importance. Since data on intraspecific litter size variation are often sparse, it is unclear what probability distribution should be used to describe the pattern of litter size variation for multiparous carnivores. METHODOLOGY/PRINCIPAL FINDINGS: We used litter size data on 32 terrestrial carnivore species to test the fit of 12 probability distributions. The influence of these distributions on quasi-extinction probabilities and the probability of successful disease control was then examined for three canid species - the island fox Urocyon littoralis, the red fox Vulpes vulpes, and the African wild dog Lycaon pictus. Best fitting probability distributions differed among the carnivores examined. However, the discretised normal distribution provided the best fit for the majority of species, because variation among litter-sizes was often small. Importantly, however, the outcomes of demographic models were generally robust to the distribution used. CONCLUSION/SIGNIFICANCE: These results provide reassurance for those using demographic modelling for the management of less studied carnivores in which litter size variation is estimated using data from species with similar reproductive attributes.

  13. Length of activity season drives geographic variation in body size of a widely distributed lizard

    OpenAIRE

    Horváthová, Terézia; Cooney, Christopher R.; Fitze, Patrick S; Oksanen, Tuula; Jelic, Dusan; Ghira, Ioan; Uller, Tobias; Jandzik, David

    2013-01-01

    Understanding the factors that drive geographic variation in life history is an important challenge in evolutionary ecology. Here, we analyze what predicts geographic variation in life-history traits of the common lizard, Zootoca vivipara, which has the globally largest distribution range of all terrestrial reptile species. Variation in body size was predicted by differences in the length of activity season, while we found no effects of environmental temperature per se. Females experiencing r...

  14. Rapid detection of structural variation in a human genome using nanochannel-based genome mapping technology

    DEFF Research Database (Denmark)

    Cao, Hongzhi; Hastie, Alex R.; Cao, Dandan;

    2014-01-01

    than 1 kb. Excluding the 59 SVs (54 insertions/deletions, 5 inversions) that overlap with N-base gaps in the reference assembly hg19, 666 non-gap SVs remained, and 396 of them (60%) were verified by paired-end data from whole-genome sequencing-based re-sequencing or de novo assembly sequence from...... fosmid data. Of the remaining 270 SVs, 260 are insertions and 213 overlap known SVs in the Database of Genomic Variants. Overall, 609 out of 666 (90%) variants were supported by experimental orthogonal methods or historical evidence in public databases. At the same time, genome mapping also provides...

  15. Worldwide patterns of genomic variation and admixture in gray wolves.

    Science.gov (United States)

    Fan, Zhenxin; Silva, Pedro; Gronau, Ilan; Wang, Shuoguo; Armero, Aitor Serres; Schweizer, Rena M; Ramirez, Oscar; Pollinger, John; Galaverni, Marco; Ortega Del-Vecchyo, Diego; Du, Lianming; Zhang, Wenping; Zhang, Zhihe; Xing, Jinchuan; Vilà, Carles; Marques-Bonet, Tomas; Godinho, Raquel; Yue, Bisong; Wayne, Robert K

    2016-02-01

    The gray wolf (Canis lupus) is a widely distributed top predator and ancestor of the domestic dog. To address questions about wolf relationships to each other and dogs, we assembled and analyzed a data set of 34 canine genomes. The divergence between New and Old World wolves is the earliest branching event and is followed by the divergence of Old World wolves and dogs, confirming that the dog was domesticated in the Old World. However, no single wolf population is more closely related to dogs, supporting the hypothesis that dogs were derived from an extinct wolf population. All extant wolves have a surprisingly recent common ancestry and experienced a dramatic population decline beginning at least ∼30 thousand years ago (kya). We suggest this crisis was related to the colonization of Eurasia by modern human hunter-gatherers, who competed with wolves for limited prey but also domesticated them, leading to a compensatory population expansion of dogs. We found extensive admixture between dogs and wolves, with up to 25% of Eurasian wolf genomes showing signs of dog ancestry. Dogs have influenced the recent history of wolves through admixture and vice versa, potentially enhancing adaptation. Simple scenarios of dog domestication are confounded by admixture, and studies that do not take admixture into account with specific demographic models are problematic. © 2016 Fan et al.; Published by Cold Spring Harbor Laboratory Press.

  16. Genomic variation in recently collected maize landraces from Mexico

    Science.gov (United States)

    Arteaga, María Clara; Moreno-Letelier, Alejandra; Mastretta-Yanes, Alicia; Vázquez-Lobo, Alejandra; Breña-Ochoa, Alejandra; Moreno-Estrada, Andrés; Eguiarte, Luis E.; Piñero, Daniel

    2015-01-01

    The present dataset comprises 36,931 SNPs genotyped in 46 maize landraces native to Mexico as well as the teosinte subspecies Zea maiz ssp. parviglumis and ssp. mexicana. These landraces were collected directly from farmers mostly between 2006 and 2010. We accompany these data with a short description of the variation within each landrace, as well as maps, principal component analyses and neighbor joining trees showing the distribution of the genetic diversity relative to landrace, geographical features and maize biogeography. High levels of genetic variation were detected for the maize landraces (HE = 0.234 to 0.318 (mean 0.311), while slightly lower levels were detected in Zea m. mexicana and Zea m. parviglumis (HE = 0.262 and 0.234, respectively). The distribution of genetic variation was better explained by environmental variables given by the interaction of altitude and latitude than by landrace identity. This dataset is a follow up product of the Global Native Maize Project, an initiative to update the data on Mexican maize landraces and their wild relatives, and to generate information that is necessary for implementing the Mexican Biosafety Law. PMID:26981357

  17. Genomic variation in recently collected maize landraces from Mexico

    Directory of Open Access Journals (Sweden)

    María Clara Arteaga

    2016-03-01

    Full Text Available The present dataset comprises 36,931 SNPs genotyped in 46 maize landraces native to Mexico as well as the teosinte subspecies Zea maiz ssp. parviglumis and ssp. mexicana. These landraces were collected directly from farmers mostly between 2006 and 2010. We accompany these data with a short description of the variation within each landrace, as well as maps, principal component analyses and neighbor joining trees showing the distribution of the genetic diversity relative to landrace, geographical features and maize biogeography. High levels of genetic variation were detected for the maize landraces (HE = 0.234 to 0.318 (mean 0.311, while slightly lower levels were detected in Zea m. mexicana and Zea m. parviglumis (HE = 0.262 and 0.234, respectively. The distribution of genetic variation was better explained by environmental variables given by the interaction of altitude and latitude than by landrace identity. This dataset is a follow up product of the Global Native Maize Project, an initiative to update the data on Mexican maize landraces and their wild relatives, and to generate information that is necessary for implementing the Mexican Biosafety Law.

  18. Genomic variation in recently collected maize landraces from Mexico.

    Science.gov (United States)

    Arteaga, María Clara; Moreno-Letelier, Alejandra; Mastretta-Yanes, Alicia; Vázquez-Lobo, Alejandra; Breña-Ochoa, Alejandra; Moreno-Estrada, Andrés; Eguiarte, Luis E; Piñero, Daniel

    2016-03-01

    The present dataset comprises 36,931 SNPs genotyped in 46 maize landraces native to Mexico as well as the teosinte subspecies Zea maiz ssp. parviglumis and ssp. mexicana. These landraces were collected directly from farmers mostly between 2006 and 2010. We accompany these data with a short description of the variation within each landrace, as well as maps, principal component analyses and neighbor joining trees showing the distribution of the genetic diversity relative to landrace, geographical features and maize biogeography. High levels of genetic variation were detected for the maize landraces (H E = 0.234 to 0.318 (mean 0.311), while slightly lower levels were detected in Zea m. mexicana and Zea m. parviglumis (H E = 0.262 and 0.234, respectively). The distribution of genetic variation was better explained by environmental variables given by the interaction of altitude and latitude than by landrace identity. This dataset is a follow up product of the Global Native Maize Project, an initiative to update the data on Mexican maize landraces and their wild relatives, and to generate information that is necessary for implementing the Mexican Biosafety Law.

  19. Habitat area and climate stability determine geographical variation in plant species range sizes

    DEFF Research Database (Denmark)

    Morueta-Holme, Naia; Enquist, Brian J.; McGill, Brian J.

    2013-01-01

    Despite being a fundamental aspect of biodiversity, little is known about what controls species range sizes. This is especially the case for hyperdiverse organisms such as plants. We use the largest botanical data set assembled to date to quantify geographical variation in range size for ~85...

  20. Copy number variation detection in whole-genome sequencing data using the Bayesian information criterion.

    Science.gov (United States)

    Xi, Ruibin; Hadjipanayis, Angela G; Luquette, Lovelace J; Kim, Tae-Min; Lee, Eunjung; Zhang, Jianhua; Johnson, Mark D; Muzny, Donna M; Wheeler, David A; Gibbs, Richard A; Kucherlapati, Raju; Park, Peter J

    2011-11-15

    DNA copy number variations (CNVs) play an important role in the pathogenesis and progression of cancer and confer susceptibility to a variety of human disorders. Array comparative genomic hybridization has been used widely to identify CNVs genome wide, but the next-generation sequencing technology provides an opportunity to characterize CNVs genome wide with unprecedented resolution. In this study, we developed an algorithm to detect CNVs from whole-genome sequencing data and applied it to a newly sequenced glioblastoma genome with a matched control. This read-depth algorithm, called BIC-seq, can accurately and efficiently identify CNVs via minimizing the Bayesian information criterion. Using BIC-seq, we identified hundreds of CNVs as small as 40 bp in the cancer genome sequenced at 10× coverage, whereas we could only detect large CNVs (> 15 kb) in the array comparative genomic hybridization profiles for the same genome. Eighty percent (14/16) of the small variants tested (110 bp to 14 kb) were experimentally validated by quantitative PCR, demonstrating high sensitivity and true positive rate of the algorithm. We also extended the algorithm to detect recurrent CNVs in multiple samples as well as deriving error bars for breakpoints using a Gibbs sampling approach. We propose this statistical approach as a principled yet practical and efficient method to estimate CNVs in whole-genome sequencing data.

  1. Genome resequencing in Populus: Revealing large-scale genome variation and implications on specialized-trait genomics

    Energy Technology Data Exchange (ETDEWEB)

    Muchero, Wellington [ORNL; Labbe, Jessy L [ORNL; Priya, Ranjan [University of Tennessee, Knoxville (UTK); DiFazio, Steven P [West Virginia University, Morgantown; Tuskan, Gerald A [ORNL

    2014-01-01

    To date, Populus ranks among a few plant species with a complete genome sequence and other highly developed genomic resources. With the first genome sequence among all tree species, Populus has been adopted as a suitable model organism for genomic studies in trees. However, far from being just a model species, Populus is a key renewable economic resource that plays a significant role in providing raw materials for the biofuel and pulp and paper industries. Therefore, aside from leading frontiers of basic tree molecular biology and ecological research, Populus leads frontiers in addressing global economic challenges related to fuel and fiber production. The latter fact suggests that research aimed at improving quality and quantity of Populus as a raw material will likely drive the pursuit of more targeted and deeper research in order to unlock the economic potential tied in molecular biology processes that drive this tree species. Advances in genome sequence-driven technologies, such as resequencing individual genotypes, which in turn facilitates large scale SNP discovery and identification of large scale polymorphisms are key determinants of future success in these initiatives. In this treatise we discuss implications of genome sequence-enable technologies on Populus genomic and genetic studies of complex and specialized-traits.

  2. Intramale variation in sperm size: functional significance in a polygynous mammal

    Directory of Open Access Journals (Sweden)

    José Luis Ros-Santaella

    2015-12-01

    Full Text Available Studies concerning the relationships between sperm size and velocity at the intraspecific level are quite limited and often yielded contradictory results across the animal kingdom. Intramale variation in sperm size may represent a meaningful factor to predict sperm velocity, due to its relationship with the level of sperm competition among related taxa. Because sperm phenotype is under post-copulatory sexual selection, we hypothesized that a reduced intramale variation in sperm size is associated with sperm competitiveness in red deer. Our results show that low variation in sperm size is strongly related to high sperm velocity and normal sperm morphology, which in turn are good predictors of male fertility in this species. Furthermore, it is well known that the red deer show high variability in testicular mass but there is limited knowledge concerning the significance of this phenomenon at intraspecific level, even though it may reveal interesting processes of sexual selection. Thereby, as a preliminary result, we found that absolute testes mass is negatively associated with intramale variation in sperm size. Our findings suggest that sperm size variation in red deer is under a strong selective force leading to increase sperm function efficiency, and reveal new insights into sexual selection mechanisms.

  3. Grading Standards of the Coefficient of Variation in the Electronic Raw Silk Size Testing

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    In order to shorten the difference between the raw silk size grading standards of the world and that of China,to quicken the step of the electronic raw silk testing process,the distribution of the coefficient of variation (CV50m%) of the raw silk size in the electronic testing and the development of the new standards are studied according to the sampling and grading theory. By the theoretical deduction and the simulating experiments, the distribution of the coefficient of variation of the raw silk size is given,and the grading scheme whose quality index is the coefficient of variation(CV50m%)of the raw silk size and the grading precisions of all grades are proposed. Moreover,the rightness and the feasibility of the grading scheme are testified by the sampling and grading simulation.

  4. A Model of Genome Size Evolution for Prokaryotes in Stable and Fluctuating Environments.

    Science.gov (United States)

    Bentkowski, Piotr; Van Oosterhout, Cock; Mock, Thomas

    2015-08-04

    Temporal variability in ecosystems significantly impacts species diversity and ecosystem productivity and therefore the evolution of organisms. Different levels of environmental perturbations such as seasonal fluctuations, natural disasters, and global change have different impacts on organisms and therefore their ability to acclimatize and adapt. Thus, to understand how organisms evolve under different perturbations is a key for predicting how environmental change will impact species diversity and ecosystem productivity. Here, we developed a computer simulation utilizing the individual-based model approach to investigate genome size evolution of a haploid, clonal and free-living prokaryotic population across different levels of environmental perturbations. Our results show that a greater variability of the environment resulted in genomes with a larger number of genes. Environmental perturbations were more effectively buffered by populations of individuals with relatively large genomes. Unpredictable changes of the environment led to a series of population bottlenecks followed by adaptive radiations. Our model shows that the evolution of genome size is indirectly driven by the temporal variability of the environment. This complements the effects of natural selection directly acting on genome optimization. Furthermore, species that have evolved in relatively stable environments may face the greatest risk of extinction under global change as genome streamlining genetically constrains their ability to acclimatize to the new environmental conditions, unless mechanisms of genetic diversification such as horizontal gene transfer will enrich their gene pool and therefore their potential to adapt.

  5. High variation in manufacturer-declared serving size of packaged discretionary foods in Australia.

    Science.gov (United States)

    Haskelberg, Hila; Neal, Bruce; Dunford, Elizabeth; Flood, Victoria; Rangan, Anna; Thomas, Beth; Cleanthous, Xenia; Trevena, Helen; Zheng, Jazzmin Miaobing; Louie, Jimmy Chun Yu; Gill, Timothy; Wu, Jason H Y

    2016-05-28

    Despite the potential of declared serving size to encourage appropriate portion size consumption, most countries including Australia have not developed clear reference guidelines for serving size. The present study evaluated variability in manufacturer-declared serving size of discretionary food and beverage products in Australia, and how declared serving size compared with the 2013 Australian Dietary Guideline (ADG) standard serve (600 kJ). Serving sizes were obtained from the Nutrition Information Panel for 4466 packaged, discretionary products in 2013 at four large supermarkets in Sydney, Australia, and categorised into fifteen categories in line with the 2013 ADG. For unique products that were sold in multiple package sizes, the percentage difference between the minimum and the maximum serving size across different package sizes was calculated. A high variation in serving size was found within the majority of food and beverage categories - for example, among 347 non-alcoholic beverages (e.g. soft drinks), the median for serving size was 250 (interquartile range (IQR) 250, 355) ml (range 100-750 ml). Declared serving size for unique products that are available in multiple package sizes also showed high variation, particularly for chocolate-based confectionery, with median percentage difference between minimum and maximum serving size of 183 (IQR 150) %. Categories with a high proportion of products that exceeded the 600 kJ ADG standard serve included cakes and muffins, pastries and desserts (≥74 % for each). High variability in declared serving size may confound interpretation and understanding of consumers interested in standardising and controlling their portion selection. Future research is needed to assess if and how standardising declared serving size might affect consumer behaviour.

  6. Natural variation in rosette size under salt stress conditions corresponds to developmental differences between Arabidopsis accessions and allelic variation in the LRR-KISS gene

    KAUST Repository

    Julkowska, Magdalena M.

    2016-02-11

    Natural variation among Arabidopsis accessions is an important genetic resource to identify mechanisms underlying plant development and stress tolerance. To evaluate the natural variation in salinity stress tolerance, two large-scale experiments were performed on two populations consisting of 160 Arabidopsis accessions each. Multiple traits, including projected rosette area, and fresh and dry weight were collected as an estimate for salinity tolerance. Our results reveal a correlation between rosette size under salt stress conditions and developmental differences between the accessions grown in control conditions, suggesting that in general larger plants were more salt tolerant. This correlation was less pronounced when plants were grown under severe salt stress conditions. Subsequent genome wide association study (GWAS) revealed associations with novel candidate genes for salinity tolerance such as LRR-KISS (At4g08850), flowering locus KH-domain containing protein and a DUF1639-containing protein. Accessions with high LRR-KISS expression developed larger rosettes under salt stress conditions. Further characterization of allelic variation in candidate genes identified in this study will provide more insight into mechanisms of salt stress tolerance due to enhanced shoot growth.

  7. Exploring Diversification and Genome Size Evolution in Extant Gymnosperms through Phylogenetic Synthesis

    Directory of Open Access Journals (Sweden)

    J. Gordon Burleigh

    2012-01-01

    Full Text Available Gymnosperms, comprising cycads, Ginkgo, Gnetales, and conifers, represent one of the major groups of extant seed plants. Yet compared to angiosperms, little is known about the patterns of diversification and genome evolution in gymnosperms. We assembled a phylogenetic supermatrix containing over 4.5 million nucleotides from 739 gymnosperm taxa. Although 93.6% of the cells in the supermatrix are empty, the data reveal many strongly supported nodes that are generally consistent with previous phylogenetic analyses, including weak support for Gnetales sister to Pinaceae. A lineage through time plot suggests elevated rates of diversification within the last 100 million years, and there is evidence of shifts in diversification rates in several clades within cycads and conifers. A likelihood-based analysis of the evolution of genome size in 165 gymnosperms finds evidence for heterogeneous rates of genome size evolution due to an elevated rate in Pinus.

  8. Structural genomic variation in childhood epilepsies with complex phenotypes

    DEFF Research Database (Denmark)

    Helbig, Ingo; Swinkels, Marielle E M; Aten, Emmelien

    2014-01-01

    A genetic contribution to a broad range of epilepsies has been postulated, and particularly copy number variations (CNVs) have emerged as significant genetic risk factors. However, the role of CNVs in patients with epilepsies with complex phenotypes is not known. Therefore, we investigated the role...... of CNVs in patients with unclassified epilepsies and complex phenotypes. A total of 222 patients from three European countries, including patients with structural lesions on magnetic resonance imaging (MRI), dysmorphic features, and multiple congenital anomalies, were clinically evaluated and screened...... for CNVs. MRI findings including acquired or developmental lesions and patient characteristics were subdivided and analyzed in subgroups. MRI data were available for 88.3% of patients, of whom 41.6% had abnormal MRI findings. Eighty-eight rare CNVs were discovered in 71 out of 222 patients (31...

  9. Genome-wide estimates of coancestry, inbreeding and effective population size in the Spanish Holstein population.

    Directory of Open Access Journals (Sweden)

    Silvia Teresa Rodríguez-Ramilo

    Full Text Available Estimates of effective population size in the Holstein cattle breed have usually been low despite the large number of animals that constitute this breed. Effective population size is inversely related to the rates at which coancestry and inbreeding increase and these rates have been high as a consequence of intense and accurate selection. Traditionally, coancestry and inbreeding coefficients have been calculated from pedigree data. However, the development of genome-wide single nucleotide polymorphisms has increased the interest of calculating these coefficients from molecular data in order to improve their accuracy. In this study, genomic estimates of coancestry, inbreeding and effective population size were obtained in the Spanish Holstein population and then compared with pedigree-based estimates. A total of 11,135 animals genotyped with the Illumina BovineSNP50 BeadChip were available for the study. After applying filtering criteria, the final genomic dataset included 36,693 autosomal SNPs and 10,569 animals. Pedigree data from those genotyped animals included 31,203 animals. These individuals represented only the last five generations in order to homogenise the amount of pedigree information across animals. Genomic estimates of coancestry and inbreeding were obtained from identity by descent segments (coancestry or runs of homozygosity (inbreeding. The results indicate that the percentage of variance of pedigree-based coancestry estimates explained by genomic coancestry estimates was higher than that for inbreeding. Estimates of effective population size obtained from genome-wide and pedigree information were consistent and ranged from about 66 to 79. These low values emphasize the need of controlling the rate of increase of coancestry and inbreeding in Holstein selection programmes.

  10. Genome-wide estimates of coancestry, inbreeding and effective population size in the Spanish Holstein population.

    Science.gov (United States)

    Rodríguez-Ramilo, Silvia Teresa; Fernández, Jesús; Toro, Miguel Angel; Hernández, Delfino; Villanueva, Beatriz

    2015-01-01

    Estimates of effective population size in the Holstein cattle breed have usually been low despite the large number of animals that constitute this breed. Effective population size is inversely related to the rates at which coancestry and inbreeding increase and these rates have been high as a consequence of intense and accurate selection. Traditionally, coancestry and inbreeding coefficients have been calculated from pedigree data. However, the development of genome-wide single nucleotide polymorphisms has increased the interest of calculating these coefficients from molecular data in order to improve their accuracy. In this study, genomic estimates of coancestry, inbreeding and effective population size were obtained in the Spanish Holstein population and then compared with pedigree-based estimates. A total of 11,135 animals genotyped with the Illumina BovineSNP50 BeadChip were available for the study. After applying filtering criteria, the final genomic dataset included 36,693 autosomal SNPs and 10,569 animals. Pedigree data from those genotyped animals included 31,203 animals. These individuals represented only the last five generations in order to homogenise the amount of pedigree information across animals. Genomic estimates of coancestry and inbreeding were obtained from identity by descent segments (coancestry) or runs of homozygosity (inbreeding). The results indicate that the percentage of variance of pedigree-based coancestry estimates explained by genomic coancestry estimates was higher than that for inbreeding. Estimates of effective population size obtained from genome-wide and pedigree information were consistent and ranged from about 66 to 79. These low values emphasize the need of controlling the rate of increase of coancestry and inbreeding in Holstein selection programmes.

  11. Estimating variation within the genes and inferring the phylogeny of 186 sequenced diverse Escherichia coli genomes

    Directory of Open Access Journals (Sweden)

    Kaas Rolf S

    2012-10-01

    Full Text Available Abstract Background Escherichia coli exists in commensal and pathogenic forms. By measuring the variation of individual genes across more than a hundred sequenced genomes, gene variation can be studied in detail, including the number of mutations found for any given gene. This knowledge will be useful for creating better phylogenies, for determination of molecular clocks and for improved typing techniques. Results We find 3,051 gene clusters/families present in at least 95% of the genomes and 1,702 gene clusters present in 100% of the genomes. The former 'soft core' of about 3,000 gene families is perhaps more biologically relevant, especially considering that many of these genome sequences are draft quality. The E. coli pan-genome for this set of isolates contains 16,373 gene clusters. A core-gene tree, based on alignment and a pan-genome tree based on gene presence/absence, maps the relatedness of the 186 sequenced E. coli genomes. The core-gene tree displays high confidence and divides the E. coli strains into the observed MLST type clades and also separates defined phylotypes. Conclusion The results of comparing a large and diverse E. coli dataset support the theory that reliable and good resolution phylogenies can be inferred from the core-genome. The results further suggest that the resolution at the isolate level may, subsequently be improved by targeting more variable genes. The use of whole genome sequencing will make it possible to eliminate, or at least reduce, the need for several typing steps used in traditional epidemiology.

  12. Sequencing the CHO DXB11 genome reveals regional variations in genomic stability and haploidy

    DEFF Research Database (Denmark)

    Kaas, Christian Schrøder; Kristensen, Claus; Betenbaugh, Michael J.

    2015-01-01

    Background: The DHFR negative CHO DXB11 cell line (also known as DUX-B11 and DUKX) was historically the first CHO cell line to be used for large scale production of heterologous proteins and is still used for production of a number of complex proteins.  Results: Here we present the genomic sequen...

  13. European sea bass genome and its variation provide insights into adaptation to euryhalinity and speciation

    Science.gov (United States)

    Tine, Mbaye; Kuhl, Heiner; Gagnaire, Pierre-Alexandre; Louro, Bruno; Desmarais, Erick; Martins, Rute S.T.; Hecht, Jochen; Knaust, Florian; Belkhir, Khalid; Klages, Sven; Dieterich, Roland; Stueber, Kurt; Piferrer, Francesc; Guinand, Bruno; Bierne, Nicolas; Volckaert, Filip A. M.; Bargelloni, Luca; Power, Deborah M.; Bonhomme, François; Canario, Adelino V. M.; Reinhardt, Richard

    2014-01-01

    The European sea bass (Dicentrarchus labrax) is a temperate-zone euryhaline teleost of prime importance for aquaculture and fisheries. This species is subdivided into two naturally hybridizing lineages, one inhabiting the north-eastern Atlantic Ocean and the other the Mediterranean and Black seas. Here we provide a high-quality chromosome-scale assembly of its genome that shows a high degree of synteny with the more highly derived teleosts. We find expansions of gene families specifically associated with ion and water regulation, highlighting adaptation to variation in salinity. We further generate a genome-wide variation map through RAD-sequencing of Atlantic and Mediterranean populations. We show that variation in local recombination rates strongly influences the genomic landscape of diversity within and differentiation between lineages. Comparing predictions of alternative demographic models to the joint allele-frequency spectrum indicates that genomic islands of differentiation between sea bass lineages were generated by varying rates of introgression across the genome following a period of geographical isolation. PMID:25534655

  14. Singapore Genome Variation Project: a haplotype map of three Southeast Asian populations.

    Science.gov (United States)

    Teo, Yik-Ying; Sim, Xueling; Ong, Rick T H; Tan, Adrian K S; Chen, Jieming; Tantoso, Erwin; Small, Kerrin S; Ku, Chee-Seng; Lee, Edmund J D; Seielstad, Mark; Chia, Kee-Seng

    2009-11-01

    The Singapore Genome Variation Project (SGVP) provides a publicly available resource of 1.6 million single nucleotide polymorphisms (SNPs) genotyped in 268 individuals from the Chinese, Malay, and Indian population groups in Southeast Asia. This online database catalogs information and summaries on genotype and phased haplotype data, including allele frequencies, assessment of linkage disequilibrium (LD), and recombination rates in a format similar to the International HapMap Project. Here, we introduce this resource and describe the analysis of human genomic variation upon agglomerating data from the HapMap and the Human Genome Diversity Project, providing useful insights into the population structure of the three major population groups in Asia. In addition, this resource also surveyed across the genome for variation in regional patterns of LD between the HapMap and SGVP populations, and for signatures of positive natural selection using two well-established metrics: iHS and XP-EHH. The raw and processed genetic data, together with all population genetic summaries, are publicly available for download and browsing through a web browser modeled with the Generic Genome Browser.

  15. Effect of population size on genetic variation levels in Capparis spinosa (Capparaceae detected by RAPDs

    Directory of Open Access Journals (Sweden)

    Houshang Nosrati

    2012-07-01

    Full Text Available Background: The population size of plants affects on population genetic variation. Materials and Methods: We studied the impact of population size on genetic variation in populations of Capparis spinosa (caper, Capparaceae using RAPDs in East Azerbaijan (Iran. Within-population genetic diversity was estimated based on Nei`s and Shanonn`s diversity using Popgen, and genetic similarity among the populations was studied from a UPGMA dendrogram based the matrix of Nei’s distances obtained through SHAN. Difference in the level genetic variation between small-sized and large-sized populations was tested using Mann-Whitney U test, and correlation between geographical and genetic distances among populations was examined by Pearson test (SPSS, 11.3. Total genetic variation was partitioned into within and among populations based on AMOVA using Arlequin. Results: The polymorphism levels of RAPDs bands among the populations ranged from 48.8% to 81.4%, and within-population Nei’s diversity varied from 0.1667 to 0.2630. Genetic variation in small-sized populations (0.1667 to 0.1809 was significantly lower than the variations in large-sized populations (0.2158 -0.2630 (N= 7, P0.674, Pearson correlation test. Conclusions: Population size has a dramatic impact on its genetic diversity. The results revealed that fragmentation of caper population in the study region has most likely occurred recently. The low genetic diversity revealed within caper populations indicates high risk of extinction and suggests that urgent conservation action is needed to recover diversity in these populations.

  16. Body Size Adaptations to Altitudinal Climatic Variation in Neotropical Grasshoppers of the Genus Sphenarium (Orthoptera: Pyrgomorphidae)

    Science.gov (United States)

    2015-01-01

    Altitudinal clines in body size can result from the effects of natural and sexual selection on growth rates and developing times in seasonal environments. Short growing and reproductive seasons constrain the body size that adults can attain and their reproductive success. Little is known about the effects of altitudinal climatic variation on the diversification of Neotropical insects. In central Mexico, in addition to altitude, highly heterogeneous topography generates diverse climates that can occur even at the same latitude. Altitudinal variation and heterogeneous topography open an opportunity to test the relative impact of climatic variation on body size adaptations. In this study, we investigated the relationship between altitudinal climatic variation and body size, and the divergence rates of sexual size dimorphism (SSD) in Neotropical grasshoppers of the genus Sphenarium using a phylogenetic comparative approach. In order to distinguish the relative impact of natural and sexual selection on the diversification of the group, we also tracked the altitudinal distribution of the species and trends of both body size and SSD on the phylogeny of Sphenarium. The correlative evidence suggests no relationship between altitude and body size. However, larger species were associated with places having a warmer winter season in which the temporal window for development and reproduction can be longer. Nonetheless, the largest species were also associated with highly seasonal environments. Moreover, large body size and high levels of SSD have evolved independently several times throughout the history of the group and male body size has experienced a greater evolutionary divergence than females. These lines of evidence suggest that natural selection, associated with seasonality and sexual selection, on maturation time and body size could have enhanced the diversification of this insect group. PMID:26684616

  17. Body Size Adaptations to Altitudinal Climatic Variation in Neotropical Grasshoppers of the Genus Sphenarium (Orthoptera: Pyrgomorphidae.

    Directory of Open Access Journals (Sweden)

    Salomón Sanabria-Urbán

    Full Text Available Altitudinal clines in body size can result from the effects of natural and sexual selection on growth rates and developing times in seasonal environments. Short growing and reproductive seasons constrain the body size that adults can attain and their reproductive success. Little is known about the effects of altitudinal climatic variation on the diversification of Neotropical insects. In central Mexico, in addition to altitude, highly heterogeneous topography generates diverse climates that can occur even at the same latitude. Altitudinal variation and heterogeneous topography open an opportunity to test the relative impact of climatic variation on body size adaptations. In this study, we investigated the relationship between altitudinal climatic variation and body size, and the divergence rates of sexual size dimorphism (SSD in Neotropical grasshoppers of the genus Sphenarium using a phylogenetic comparative approach. In order to distinguish the relative impact of natural and sexual selection on the diversification of the group, we also tracked the altitudinal distribution of the species and trends of both body size and SSD on the phylogeny of Sphenarium. The correlative evidence suggests no relationship between altitude and body size. However, larger species were associated with places having a warmer winter season in which the temporal window for development and reproduction can be longer. Nonetheless, the largest species were also associated with highly seasonal environments. Moreover, large body size and high levels of SSD have evolved independently several times throughout the history of the group and male body size has experienced a greater evolutionary divergence than females. These lines of evidence suggest that natural selection, associated with seasonality and sexual selection, on maturation time and body size could have enhanced the diversification of this insect group.

  18. Genomic variation in rice: genesis of highly polymorphic linkage blocks during domestication.

    Directory of Open Access Journals (Sweden)

    Tian Tang

    2006-11-01

    Full Text Available Genomic regions that are unusually divergent between closely related species or racial groups can be particularly informative about the process of speciation or the operation of natural selection. The two sequenced genomes of cultivated Asian rice, Oryza sativa, reveal that at least 6% of the genomes are unusually divergent. Sequencing of ten unlinked loci from the highly divergent regions consistently identified two highly divergent haplotypes with each locus in nearly complete linkage disequilibrium among 25 O. sativa cultivars and 35 lines from six wild species. The existence of two highly divergent haplotypes in high divergence regions in species from all geographical areas (Africa, Asia, and Oceania was in contrast to the low polymorphism and low linkage disequilibrium that were observed in other parts of the genome, represented by ten reference loci. While several natural processes are likely to contribute to this pattern of genomic variation, domestication may have greatly exaggerated the trend. In this hypothesis, divergent haplotypes that were adapted to different geographical and ecological environments migrated along with humans during the development of domesticated varieties. If true, these high divergence regions of the genome would be enriched for loci that contribute to the enormous range of phenotypic variation observed among domesticated breeds.

  19. Experimental evidence for ecological selection on genome variation in the wild.

    Science.gov (United States)

    Gompert, Zachariah; Comeault, Aaron A; Farkas, Timothy E; Feder, Jeffrey L; Parchman, Thomas L; Buerkle, C Alex; Nosil, Patrik

    2014-03-01

    Understanding natural selection's effect on genetic variation is a major goal in biology, but the genome-scale consequences of contemporary selection are not well known. In a release and recapture field experiment we transplanted stick insects to native and novel host plants and directly measured allele frequency changes within a generation at 186,576 genetic loci. We observed substantial, genome-wide allele frequency changes during the experiment, most of which could be attributed to random mortality (genetic drift). However, we also documented that selection affected multiple genetic loci distributed across the genome, particularly in transplants to the novel host. Host-associated selection affecting the genome acted on both a known colour-pattern trait as well as other (unmeasured) phenotypes. We also found evidence that selection associated with elevation affected genome variation, although our experiment was not designed to test this. Our results illustrate how genomic data can identify previously underappreciated ecological sources and phenotypic targets of selection. © 2013 The Authors. Ecology Letters published by John Wiley & Sons Ltd and CNRS.

  20. Whole-genome sequencing reveals the diversity of cattle copy number variations and multicopy genes

    Science.gov (United States)

    Structural and functional impacts of copy number variations (CNVs) on livestock genomes are not yet well understood. We identified 1853 CNV regions using population-scale sequencing data generated from 75 cattle representing 8 breeds (Angus, Brahman, Gir, Holstein, Jersey, Limousin, Nelore, Romagnol...

  1. Comparison of variations detection between whole-genome amplification methods used in single-cell resequencing

    DEFF Research Database (Denmark)

    Hou, Yong; Wu, Kui; Shi, Xulian;

    2015-01-01

    BACKGROUND: Single-cell resequencing (SCRS) provides many biomedical advances in variations detection at the single-cell level, but it currently relies on whole genome amplification (WGA). Three methods are commonly used for WGA: multiple displacement amplification (MDA), degenerate-oligonucleoti...

  2. Identification of Nucleotide Variation in Genomes Using Next-Generation Sequencing

    NARCIS (Netherlands)

    Megens, H.J.W.C.; Groenen, M.A.M.

    2012-01-01

    Discovery of genome-wide variation has taken a huge leap forward with the introduction of next-generation sequencing (NGS) technology. Variant discovery requires sampling of a number of haplotypes. This can be either the two haplotypes of a diploid organism or multiple haplotypes in a population. Va

  3. Variation and Sexual Dimorphism of Body Size in the Plateau Brown Frog along an Altitudinal Gradient

    Institute of Scientific and Technical Information of China (English)

    Xueyun FENG; Wei CHEN; Junhua HU; Jianping JIANG

    2015-01-01

    Variation in body size and sexual size dimorphism (SSD) can have important consequences for animal ecology, behavior, population dynamics and the evolution of life-history traits. Organisms are expected to be larger in colder climate (i.e., Bergmann’s rule) and SSD varies with body size (i.e., Rensch’s rule). However, the underlying mechanisms are still elusive. The plateau brown frog (Rana kukunoris), a medium-sized anuran species with female-biased SSD, is endemic to the Qinghai-Tibetan Plateau (QTP). From 1797 m (Maoxiang’ping) to 3453 m (Heihe’qiao) in the eastern margin of the QTP, we surveyed 10 populations of R. kukunoris and collected phalanges and snout vent length (SVL) data for 258 adult individuals (199 males versus 59 females). Based on these data, we explored how body size and SSD varying along the altitudinal gradient and examined the corresponding effects of temperature. We found body size to be larger at higher altitude for males but not for females, with likely effects from the temperature on the variation in male body size. Sex differences in growth rates may be the main cause of the variation in SSD. Our results suggested that only males follow the Bergmann’s rule and variation in SSD of R. kukunoris do not support the Rensch’s rule and its inverse. Therefore, the variations of body size can be different between sexes and the applicability of both Bergmann’s rule and Rensch’s rule should depend on species and environment where they live.

  4. Genomic analysis of local variation and recent evolution in Plasmodium vivax.

    Science.gov (United States)

    Pearson, Richard D; Amato, Roberto; Auburn, Sarah; Miotto, Olivo; Almagro-Garcia, Jacob; Amaratunga, Chanaki; Suon, Seila; Mao, Sivanna; Noviyanti, Rintis; Trimarsanto, Hidayat; Marfurt, Jutta; Anstey, Nicholas M; William, Timothy; Boni, Maciej F; Dolecek, Christiane; Tran, Hien Tinh; White, Nicholas J; Michon, Pascal; Siba, Peter; Tavul, Livingstone; Harrison, Gabrielle; Barry, Alyssa; Mueller, Ivo; Ferreira, Marcelo U; Karunaweera, Nadira; Randrianarivelojosia, Milijaona; Gao, Qi; Hubbart, Christina; Hart, Lee; Jeffery, Ben; Drury, Eleanor; Mead, Daniel; Kekre, Mihir; Campino, Susana; Manske, Magnus; Cornelius, Victoria J; MacInnis, Bronwyn; Rockett, Kirk A; Miles, Alistair; Rayner, Julian C; Fairhurst, Rick M; Nosten, Francois; Price, Ric N; Kwiatkowski, Dominic P

    2016-08-01

    The widespread distribution and relapsing nature of Plasmodium vivax infection present major challenges for the elimination of malaria. To characterize the genetic diversity of this parasite in individual infections and across the population, we performed deep genome sequencing of >200 clinical samples collected across the Asia-Pacific region and analyzed data on >300,000 SNPs and nine regions of the genome with large copy number variations. Individual infections showed complex patterns of genetic structure, with variation not only in the number of dominant clones but also in their level of relatedness and inbreeding. At the population level, we observed strong signals of recent evolutionary selection both in known drug resistance genes and at new loci, and these varied markedly between geographical locations. These findings demonstrate a dynamic landscape of local evolutionary adaptation in the parasite population and provide a foundation for genomic surveillance to guide effective strategies for control and elimination of P. vivax.

  5. Presentation of the intrasubject coefficient of variation for sample size planning in bioequivalence studies.

    Science.gov (United States)

    Hauschke, D; Steinijans, W V; Diletti, E; Schall, R; Luus, H G; Elze, M; Blume, H

    1994-07-01

    Bioequivalence studies are generally performed as crossover studies and, therefore, information on the intrasubject coefficient of variation is needed for sample size planning. Unfortunately, this information is usually not presented in publications on bioequivalence studies, and only the pooled inter- and intrasubject coefficient of variation for either test or reference formulation is reported. Thus, the essential information for sample size planning of future studies is not made available to other researchers. In order to overcome such shortcomings, the presentation of results from bioequivalence studies should routinely include the intrasubject coefficient of variation. For the relevant coefficients of variation, theoretical background together with modes of calculation and presentation are given in this communication with particular emphasis on the multiplicative model.

  6. Genomic Variation of Inbreeding and Ancestry in the Remaining Two Isle Royale Wolves.

    Science.gov (United States)

    Hedrick, Philip W; Kardos, Marty; Peterson, Rolf O; Vucetich, John A

    2017-03-01

    Inbreeding, relatedness, and ancestry have traditionally been estimated with pedigree information, however, molecular genomic data can provide more detailed examination of these properties. For example, pedigree information provides estimation of the expected value of these measures but molecular genomic data can estimate the realized values of these measures in individuals. Here, we generate the theoretical distribution of inbreeding, relatedness, and ancestry for the individuals in the pedigree of the Isle Royale wolves, the first examination of such variation in a wild population with a known pedigree. We use the 38 autosomes of the dog genome and their estimated map lengths in our genomic analysis. Although it is known that the remaining wolves are highly inbred, closely related, and descend from only 3 ancestors, our analyses suggest that there is significant variation in the realized inbreeding and relatedness around pedigree expectations. For example, the expected inbreeding in a hypothetical offspring from the 2 remaining wolves is 0.438 but the realized 95% genomic confidence interval is from 0.311 to 0.565. For individual chromosomes, a substantial proportion of the whole chromosomes are completely identical by descent. This examination provides a background to use when analyzing molecular genomic data for individual levels of inbreeding, relatedness, and ancestry. The level of variation in these measures is a function of the time to the common ancestor(s), the number of chromosomes, and the rate of recombination. In the Isle Royale wolf population, the few generations to a common ancestor results in the high variance in genomic inbreeding. © The American Genetic Association 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  7. AluScan: a method for genome-wide scanning of sequence and structure variations in the human genome

    Directory of Open Access Journals (Sweden)

    Mei Lingling

    2011-11-01

    Full Text Available Abstract Background To complement next-generation sequencing technologies, there is a pressing need for efficient pre-sequencing capture methods with reduced costs and DNA requirement. The Alu family of short interspersed nucleotide elements is the most abundant type of transposable elements in the human genome and a recognized source of genome instability. With over one million Alu elements distributed throughout the genome, they are well positioned to facilitate genome-wide sequence amplification and capture of regions likely to harbor genetic variation hotspots of biological relevance. Results Here we report on the use of inter-Alu PCR with an enhanced range of amplicons in conjunction with next-generation sequencing to generate an Alu-anchored scan, or 'AluScan', of DNA sequences between Alu transposons, where Alu consensus sequence-based 'H-type' PCR primers that elongate outward from the head of an Alu element are combined with 'T-type' primers elongating from the poly-A containing tail to achieve huge amplicon range. To illustrate the method, glioma DNA was compared with white blood cell control DNA of the same patient by means of AluScan. The over 10 Mb sequences obtained, derived from more than 8,000 genes spread over all the chromosomes, revealed a highly reproducible capture of genomic sequences enriched in genic sequences and cancer candidate gene regions. Requiring only sub-micrograms of sample DNA, the power of AluScan as a discovery tool for genetic variations was demonstrated by the identification of 357 instances of loss of heterozygosity, 341 somatic indels, 274 somatic SNVs, and seven potential somatic SNV hotspots between control and glioma DNA. Conclusions AluScan, implemented with just a small number of H-type and T-type inter-Alu PCR primers, provides an effective capture of a diversity of genome-wide sequences for analysis. The method, by enabling an examination of gene-enriched regions containing exons, introns, and

  8. An Empirical Bayes Mixture Model for Effect Size Distributions in Genome-Wide Association Studies

    DEFF Research Database (Denmark)

    Thompson, Wesley K.; Wang, Yunpeng; Schork, Andrew J.

    2015-01-01

    Characterizing the distribution of effects from genome-wide genotyping data is crucial for understanding important aspects of the genetic architecture of complex traits, such as number or proportion of non-null loci, average proportion of phenotypic variance explained per non-null effect, power...... for discovery, and polygenic risk prediction. To this end, previous work has used effect-size models based on various distributions, including the normal and normal mixture distributions, among others. In this paper we propose a scale mixture of two normals model for effect size distributions of genome...... of variance explained by genotyped SNPs, CD and SZ have a broadly dissimilar genetic architecture, due to differing mean effect size and proportion of non-null loci....

  9. Genome size, GC percentage and 5mC level in the Indonesian coelacanth Latimeria menadoensis.

    Science.gov (United States)

    Makapedua, Daisy Monica; Barucca, Marco; Forconi, Mariko; Antonucci, Niki; Bizzaro, Davide; Amici, Adolfo; Carradori, Maria Rita; Olmo, Ettore; Canapa, Adriana

    2011-09-01

    The living fossil Latimeria menadoensis is important to understand sarcopterygian evolution. To gain further insights into this fish species we studied its genome size, GC% and 5mC level. The genome size and the GC% of the Indonesian coelacanth seem to be very similar to those of the African coelacanth. Moreover the GC%, the CpG frequency and the 5mC level of L. menadoensis are more similar to those of fish and amphibians than to those of mammals, birds and reptiles and this is in line with the hypothesis that two different DNA methylation and CpG shortage equilibria arose during vertebrate evolution. Our results suggest that the genome of L. menadoensis has remained unchanged for several million years, maybe since the origin of the lineage which from lobe-finned fish led to tetrapods. These data fit a conservative evolutionary landscape and suggest that the genome of the extant crossopterygians may be a sort of evolutionarily frozen genome.

  10. Copy number variation in the genomes of twelve natural isolates of Caenorhabditis elegans

    Directory of Open Access Journals (Sweden)

    Flibotte Stephane

    2010-01-01

    Full Text Available Abstract Background Copy number variation is an important component of genetic variation in higher eukaryotes. The extent of natural copy number variation in C. elegans is unknown outside of 2 highly divergent wild isolates and the canonical N2 Bristol strain. Results We have used array comparative genomic hybridization (aCGH to detect copy number variation in the genomes of 12 natural isolates of Caenorhabditis elegans. Deletions relative to the canonical N2 strain are more common in these isolates than duplications, and indels are enriched in multigene families on the autosome arms. Among the strains in our study, the Hawaiian and Madeiran strains (CB4856 and JU258 carry the largest number of deletions, followed by the Vancouver strain (KR314. Overall we detected 510 different deletions affecting 1136 genes, or over 5% of the genes in the canonical N2 genome. The indels we identified had a median length of 2.7 kb. Since many deletions are found in multiple isolates, deletion loci were used as markers to derive an unrooted tree to estimate genetic relatedness among the strains. Conclusion Copy number variation is extensive in C. elegans, affecting over 5% of the genes in the genome. The deletions we have detected in natural isolates of C. elegans contribute significantly to the number of deletion alleles available to researchers. The relationships between strains are complex and different regions of the genome possess different genealogies due to recombination throughout the natural history of the species, which may not be apparent in studies utilizing smaller numbers of genetic markers.

  11. Genomic analysis of QTLs and genes altering natural variation in stochastic noise.

    Science.gov (United States)

    Jimenez-Gomez, Jose M; Corwin, Jason A; Joseph, Bindu; Maloof, Julin N; Kliebenstein, Daniel J

    2011-09-01

    Quantitative genetic analysis has long been used to study how natural variation of genotype can influence an organism's phenotype. While most studies have focused on genetic determinants of phenotypic average, it is rapidly becoming understood that stochastic noise is genetically determined. However, it is not known how many traits display genetic control of stochastic noise nor how broadly these stochastic loci are distributed within the genome. Understanding these questions is critical to our understanding of quantitative traits and how they relate to the underlying causal loci, especially since stochastic noise may be directly influenced by underlying changes in the wiring of regulatory networks. We identified QTLs controlling natural variation in stochastic noise of glucosinolates, plant defense metabolites, as well as QTLs for stochastic noise of related transcripts. These loci included stochastic noise QTLs unique for either transcript or metabolite variation. Validation of these loci showed that genetic polymorphism within the regulatory network alters stochastic noise independent of effects on corresponding average levels. We examined this phenomenon more globally, using transcriptomic datasets, and found that the Arabidopsis transcriptome exhibits significant, heritable differences in stochastic noise. Further analysis allowed us to identify QTLs that control genomic stochastic noise. Some genomic QTL were in common with those altering average transcript abundance, while others were unique to stochastic noise. Using a single isogenic population, we confirmed that natural variation at ELF3 alters stochastic noise in the circadian clock and metabolism. Since polymorphisms controlling stochastic noise in genomic phenotypes exist within wild germplasm for naturally selected phenotypes, this suggests that analysis of Arabidopsis evolution should account for genetic control of stochastic variance and average phenotypes. It remains to be determined if natural

  12. The next evolutionary synthesis: from Lamarck and Darwin to genomic variation and systems biology

    Directory of Open Access Journals (Sweden)

    Bard Jonathan BL

    2011-11-01

    Full Text Available Abstract The evolutionary synthesis, the standard 20th century view of how evolutionary change occurs, is based on selection, heritable phenotypic variation and a very simple view of genes. It is therefore unable to incorporate two key aspects of modern molecular knowledge: first is the richness of genomic variation, so much more complicated than simple mutation, and second is the opaque relationship between the genotype and its resulting phenotype. Two new and important books shed some light on how we should view evolutionary change now. Evolution: a view from the 21st century by J.A. Shapiro (2011, FT Press Science, New Jersey, USA. pp. 246. $34.99. examines the richness of genomic variation and its implications. Transformations of Lamarckism: from Subtle Fluids to Molecular Biology edited by S.B. Gissis & E. Jablonka (2011, MIT Press, Cambridge, USA. pp. 457 includes some 40 papers that anyone with an interest in the history of evolutionary thought and the relationship between the environment and the genome will want to read. This review discusses both books within the context of contemporary evolutionary thinking and points out that neither really comes to terms with today's key systems-biology question: how does mutation-induced variation in a molecular network generate variation in the resulting phenotype?

  13. Genic intolerance to functional variation and the interpretation of personal genomes.

    Directory of Open Access Journals (Sweden)

    Slavé Petrovski

    Full Text Available A central challenge in interpreting personal genomes is determining which mutations most likely influence disease. Although progress has been made in scoring the functional impact of individual mutations, the characteristics of the genes in which those mutations are found remain largely unexplored. For example, genes known to carry few common functional variants in healthy individuals may be judged more likely to cause certain kinds of disease than genes known to carry many such variants. Until now, however, it has not been possible to develop a quantitative assessment of how well genes tolerate functional genetic variation on a genome-wide scale. Here we describe an effort that uses sequence data from 6503 whole exome sequences made available by the NHLBI Exome Sequencing Project (ESP. Specifically, we develop an intolerance scoring system that assesses whether genes have relatively more or less functional genetic variation than expected based on the apparently neutral variation found in the gene. To illustrate the utility of this intolerance score, we show that genes responsible for Mendelian diseases are significantly more intolerant to functional genetic variation than genes that do not cause any known disease, but with striking variation in intolerance among genes causing different classes of genetic disease. We conclude by showing that use of an intolerance ranking system can aid in interpreting personal genomes and identifying pathogenic mutations.

  14. Adaptive potential of genomic structural variation in human and mammalian evolution.

    Science.gov (United States)

    Radke, David W; Lee, Charles

    2015-09-01

    Because phenotypic innovations must be genetically heritable for biological evolution to proceed, it is natural to consider new mutation events as well as standing genetic variation as sources for their birth. Previous research has identified a number of single-nucleotide polymorphisms that underlie a subset of adaptive traits in organisms. However, another well-known class of variation, genomic structural variation, could have even greater potential to produce adaptive phenotypes, due to the variety of possible types of alterations (deletions, insertions, duplications, among others) at different genomic positions and with variable lengths. It is from these dramatic genomic alterations, and selection on their phenotypic consequences, that adaptations leading to biological diversification could be derived. In this review, using studies in humans and other mammals, we highlight examples of how phenotypic variation from structural variants might become adaptive in populations and potentially enable biological diversification. Phenotypic change arising from structural variants will be described according to their immediate effect on organismal metabolic processes, immunological response and physical features. Study of population dynamics of segregating structural variation can therefore provide a window into understanding current and historical biological diversification.

  15. Localising loci underlying complex trait variation using Regional Genomic Relationship Mapping.

    Directory of Open Access Journals (Sweden)

    Yoshitaka Nagamine

    Full Text Available The limited proportion of complex trait variance identified in genome-wide association studies may reflect the limited power of single SNP analyses to detect either rare causative alleles or those of small effect. Motivated by studies that demonstrate that loci contributing to trait variation may contain a number of different alleles, we have developed an analytical approach termed Regional Genomic Relationship Mapping that, like linkage-based family methods, integrates variance contributed by founder gametes within a pedigree. This approach takes advantage of very distant (and unrecorded relationships, and this greatly increases the power of the method, compared with traditional pedigree-based linkage analyses. By integrating variance contributed by founder gametes in the population, our approach provides an estimate of the Regional Heritability attributable to a small genomic region (e.g. 100 SNP window covering ca. 1 Mb of DNA in a 300000 SNP GWAS and has the power to detect regions containing multiple alleles that individually contribute too little variance to be detectable by GWAS as well as regions with single common GWAS-detectable SNPs. We use genome-wide SNP array data to obtain both a genome-wide relationship matrix and regional relationship ("identity by state" or IBS matrices for sequential regions across the genome. We then estimate a heritability for each region sequentially in our genome-wide scan. We demonstrate by simulation and with real data that, when compared to traditional ("individual SNP" GWAS, our method uncovers new loci that explain additional trait variation. We analysed data from three Southern European populations and from Orkney for exemplar traits - serum uric acid concentration and height. We show that regional heritability estimates are correlated with results from genome-wide association analysis but can capture more of the genetic variance segregating in the population and identify additional trait loci.

  16. Extreme recombination frequencies shape genome variation and evolution in the honeybee, Apis mellifera.

    Directory of Open Access Journals (Sweden)

    Andreas Wallberg

    2015-04-01

    Full Text Available Meiotic recombination is a fundamental cellular process, with important consequences for evolution and genome integrity. However, we know little about how recombination rates vary across the genomes of most species and the molecular and evolutionary determinants of this variation. The honeybee, Apis mellifera, has extremely high rates of meiotic recombination, although the evolutionary causes and consequences of this are unclear. Here we use patterns of linkage disequilibrium in whole genome resequencing data from 30 diploid honeybees to construct a fine-scale map of rates of crossing over in the genome. We find that, in contrast to vertebrate genomes, the recombination landscape is not strongly punctate. Crossover rates strongly correlate with levels of genetic variation, but not divergence, which indicates a pervasive impact of selection on the genome. Germ-line methylated genes have reduced crossover rate, which could indicate a role of methylation in suppressing recombination. Controlling for the effects of methylation, we do not infer a strong association between gene expression patterns and recombination. The site frequency spectrum is strongly skewed from neutral expectations in honeybees: rare variants are dominated by AT-biased mutations, whereas GC-biased mutations are found at higher frequencies, indicative of a major influence of GC-biased gene conversion (gBGC, which we infer to generate an allele fixation bias 5 - 50 times the genomic average estimated in humans. We uncover further evidence that this repair bias specifically affects transitions and favours fixation of CpG sites. Recombination, via gBGC, therefore appears to have profound consequences on genome evolution in honeybees and interferes with the process of natural selection. These findings have important implications for our understanding of the forces driving molecular evolution.

  17. Viral small RNAs reveal the genomic variations of three grapevine vein clearing virus quasispecies populations.

    Science.gov (United States)

    Howard, Susanne; Qiu, Wenping

    2017-02-02

    Viral small RNAs (vsRNAs) include viral small interfering RNAs (vsiRNAs) that are initiators and products of RNA silencing, and small RNAs that are derived from viral RNAs with function still unknown. Sequencing of vsRNAs allows assembling of viral genomes and revelation of viral population variations at genomic levels. Grapevine vein clearing virus (GVCV) is a new member of the family Caulimoviridae whose DNA genome is replicated by reverse transcription of pre-genomic RNA molecules. In this short report, three genomic sequences of GVCV were assembled from vsRNAs that were isolated and sequenced from three individual grapevines in commercial vineyards and compared to the GVCV-CHA reference genome. Profiles of single nucleotide polymorphism among three viral populations indicated a closer relatedness between two populations in different grape cultivars at the same location than those in the same grape cultivar at different locations, suggesting the spread of GVCV populations among vineyards of close proximity. Classic types of vsiRNAs (21-nt, 22-nt, and 24-nt) were found in the three GVCV vsiRNA populations, but these did not produce alignment hotspots on the GVCV-CHA reference genome. The number of 36-nt reads is the highest among vsRNAs, the role of these vsRNAs remains unclear. The analysis of vsRNAs provides a first holistic picture of genomic variations among GVCV viral quasispecies populations that help monitor epidemics and evolution of GVCV populations, an emerging virus that is becoming a threat to grape production in the Midwest region of the USA.

  18. Icelandic Birch Polyploids—The Case of a Perfect Fit in Genome Size

    Directory of Open Access Journals (Sweden)

    K. Anamthawat-Jónsson

    2010-01-01

    Full Text Available Two birch species coexist in Iceland, dwarf birch Betula nana and tree birch B. pubescens. Both species are variable morphologically, which has been shown to be due to introgressive hybridization via interspecific hybrids. The aim of this study was to examine if the introgression could be related to genome size. We characterized 42 plants from Bifröst woodland morphologically and cytogenetically. The population consisted of diploid B. nana (38%, tetraploid B. pubescens (55%, and triploid hybrids (7%. Genome size was measured from 12 plants, using Feulgen DNA image densitometry (FDM on spring leaf buds and flow cytometry (FCM with dormant winter twigs. The use of winter twigs for FCM is novel. The average 1C-values for diploid, triploid, and tetraploid plants were 448, 666, and 882 Mbp, respectively. Monoploid genome sizes were found to be statistically constant among ploidy levels. This stability is in contrast to the different taxonomic positions of the di- and tetraploids and also contrasts with the frequent occurrence of genome downsizing in polyploids.

  19. Mitochondrial genome evolution in Alismatales: Size reduction and extensive loss of ribosomal protein genes

    DEFF Research Database (Denmark)

    Petersen, Gitte; Cuenca, Argelia; Zervas, Athanasios

    2017-01-01

    The order Alismatales is a hotspot for evolution of plant mitochondrial genomes characterized by remarkable differences in genome size, substitution rates, RNA editing, retrotranscription, gene loss and intron loss. Here we have sequenced the complete mitogenomes of Zostera marina and Stratiotes ...... mitogenome from a non-parasitic plant. Using a broad sample of the Alismatales, the evolutionary history of ribosomal protein gene loss is analyzed. In Zostera almost all ribosomal protein genes are lost from the mitogenome, but only some can be found in the nucleus....

  20. Intraspecific Variation in Maximum Ingested Food Size and Body Mass in Varecia rubra and Propithecus coquereli

    Directory of Open Access Journals (Sweden)

    Adam Hartstone-Rose

    2011-01-01

    Full Text Available In a recent study, we quantified the scaling of ingested food size (Vb—the maximum size at which an animal consistently ingests food whole—and found that Vb scaled isometrically between species of captive strepsirrhines. The current study examines the relationship between Vb and body size within species with a focus on the frugivorous Varecia rubra and the folivorous Propithecus coquereli. We found no overlap in Vb between the species (all V. rubra ingested larger pieces of food relative to those eaten by P. coquereli, and least-squares regression of Vb and three different measures of body mass showed no scaling relationship within each species. We believe that this lack of relationship results from the relatively narrow intraspecific body size variation and seemingly patternless individual variation in Vb within species and take this study as further evidence that general scaling questions are best examined interspecifically rather than intraspecifically.

  1. Genomic profiling of plastid DNA variation in the Mediterranean olive tree

    Directory of Open Access Journals (Sweden)

    Dorado Gabriel

    2011-05-01

    Full Text Available Abstract Background Characterisation of plastid genome (or cpDNA polymorphisms is commonly used for phylogeographic, population genetic and forensic analyses in plants, but detecting cpDNA variation is sometimes challenging, limiting the applications of such an approach. In the present study, we screened cpDNA polymorphism in the olive tree (Olea europaea L. by sequencing the complete plastid genome of trees with a distinct cpDNA lineage. Our objective was to develop new markers for a rapid genomic profiling (by Multiplex PCRs of cpDNA haplotypes in the Mediterranean olive tree. Results Eight complete cpDNA genomes of Olea were sequenced de novo. The nucleotide divergence between olive cpDNA lineages was low and not exceeding 0.07%. Based on these sequences, markers were developed for studying two single nucleotide substitutions and length polymorphism of 62 regions (with variable microsatellite motifs or other indels. They were then used to genotype the cpDNA variation in cultivated and wild Mediterranean olive trees (315 individuals. Forty polymorphic loci were detected on this sample, allowing the distinction of 22 haplotypes belonging to the three Mediterranean cpDNA lineages known as E1, E2 and E3. The discriminating power of cpDNA variation was particularly low for the cultivated olive tree with one predominating haplotype, but more diversity was detected in wild populations. Conclusions We propose a method for a rapid characterisation of the Mediterranean olive germplasm. The low variation in the cultivated olive tree indicated that the utility of cpDNA variation for forensic analyses is limited to rare haplotypes. In contrast, the high cpDNA variation in wild populations demonstrated that our markers may be useful for phylogeographic and populations genetic studies in O. europaea.

  2. Molecular subdivision of the marine diatom Thalassiosira rotula in relation to geographic distribution, genome size, and physiology

    Directory of Open Access Journals (Sweden)

    Whittaker Kerry A

    2012-10-01

    Full Text Available Abstract Background Marine phytoplankton drift passively with currents, have high dispersal potentials and can be comprised of morphologically cryptic species. To examine molecular subdivision in the marine diatom Thalassiosira rotula, variations in rDNA sequence, genome size, and growth rate were examined among isolates collected from the Atlantic and Pacific Ocean basins. Analyses of rDNA included T. gravida because morphological studies have argued that T. rotula and T. gravida are conspecific. Results Culture collection isolates of T. gravida and T. rotula diverged by 7.0 ± 0.3% at the ITS1 and by 0.8 ± 0.03% at the 28S. Within T. rotula, field and culture collection isolates were subdivided into three lineages that diverged by 0.6 ± 0.3% at the ITS1 and 0% at the 28S. The predicted ITS1 secondary structure revealed no compensatory base pair changes among lineages. Differences in genome size were observed among isolates, but were not correlated with ITS1 lineages. Maximum acclimated growth rates of isolates revealed genotype by environment effects, but these were also not correlated with ITS1 lineages. In contrast, intra-individual variation in the multi-copy ITS1 revealed no evidence of recombination amongst lineages, and molecular clock estimates indicated that lineages diverged 0.68 Mya. The three lineages exhibited different geographic distributions and, with one exception, each field sample was dominated by a single lineage. Conclusions The degree of inter- and intra-specific divergence between T. gravida and T. rotula suggests they should continue to be treated as separate species. The phylogenetic distinction of the three closely-related T. rotula lineages was unclear. On the one hand, the lineages showed no physiological differences, no consistent genome size differences and no significant changes in the ITS1 secondary structure, suggesting there are no barriers to interbreeding among lineages. In contrast, analysis of intra

  3. Evaluating variations of genotype calling: a potential source of spurious associations in genome-wide association studies

    Indian Academy of Sciences (India)

    Xuixiao Hong; Zhenqiang Su; Weigong Ge; Leming Shi; Roger Perkins; Hong Fang; Donna Mendrick; Weida Tong

    2010-04-01

    Genome-wide association studies (GWAS) examine the entire human genome with the goal of identifying genetic variants (usually single nucleotide polymorphisms (SNPs)) that are associated with phenotypic traits such as disease status and drug response. The discordance of significantly associated SNPs for the same disease identified from different GWAS indicates that false associations exist in such results. In addition to the possible sources of spurious associations that have been investigated and discussed intensively, such as sample size and population stratification, an accurate and reproducible genotype calling algorithm is required for concordant GWAS results from different studies. However, variations of genotype calling of an algorithm and their effects on significantly associated SNPs identified in downstream association analyses have not been systematically investigated. In this paper, the variations of genotype calling using the Bayesian Robust Linear Model with Mahalanobis distance classifier (BRLMM) algorithm and the resulting influence on the lists of significantly associated SNPs were evaluated using the raw data of 270 HapMap samples analysed with the Affymetrix Human Mapping 500K Array Set (Affy500K) by changing algorithmic parameters. Modified were the Dynamic Model (DM) call confidence threshold (threshold) and the number of randomly selected SNPs (size). Comparative analysis of the calling results and the corresponding lists of significantly associated SNPs identified through association analysis revealed that algorithmic parameters used in BRLMM affected the genotype calls and the significantly associated SNPs. Both the threshold and the size affected the called genotypes and the lists of significantly associated SNPs in association analysis. The effect of the threshold was much larger than the effect of the size. Moreover, the heterozygous calls had lower consistency compared to the homozygous calls.

  4. Identification of genome-wide copy number variations among diverse pig breeds using SNP genotyping arrays.

    Directory of Open Access Journals (Sweden)

    Jiying Wang

    Full Text Available Copy number variations (CNVs are important forms of genetic variation complementary to SNPs, and can be considered as promising markers for some phenotypic and economically important traits or diseases susceptibility in domestic animals. In the present study, we performed a genome-wide CNV identification in 14 individuals selected from diverse populations, including six types of Chinese indigenous breeds, one Asian wild boar population, as well as three modern commercial foreign breeds. We identified 63 CNVRs in total, which covered 9.98 Mb of polymorphic sequence and corresponded to 0.36% of the genome sequence. The length of these CNVRs ranged from 3.20 to 827.21 kb, with an average of 158.37 kb and a median of 97.85 kb. Functional annotation revealed these identified CNVR have important molecular function, and may play an important role in exploring the genetic basis of phenotypic variability and disease susceptibility among pigs. Additionally, to confirm these potential CNVRs, we performed qPCR for 12 randomly selected CNVRs and 8 of them (66.67% were confirmed successfully. CNVs detected in diverse populations herein are essential complementary to the CNV map in the pig genome, which provide an important resource for studies of genomic variation and the association between various economically important traits and CNVs.

  5. Variation in genomic methylation in natural populations of chinese white poplar.

    Science.gov (United States)

    Ma, Kaifeng; Song, Yuepeng; Yang, Xiaohui; Zhang, Zhiyi; Zhang, Deqiang

    2013-01-01

    It is thought that methylcytosine can be inherited through meiosis and mitosis, and that epigenetic variation may be under genetic control or correlation may be caused by neutral drift. However, DNA methylation also varies with tissue, developmental stage, and environmental factors. Eliminating these factors, we analyzed the levels and patterns, diversity and structure of genomic methylcytosine in the xylem of nine natural populations of Chinese white poplar. On average, the relative total methylation and non-methylation levels were approximately 26.567% and 42.708% (Pdifferentiation (GST  = 0.159) were assessed by Shannon's diversity index. Co-inertia analysis indicated that methylation-sensitive polymorphism (MSP) and genomic methylation pattern (CG-CNG) profiles gave similar distributions. Using a between-group eigen analysis, we found that the Hebei and Shanxi populations were independent of each other, but the Henan population intersected with the other populations, to some degree. Genome methylation in Populus tomentosa presented tissue-specific characteristics and the relative 5'-CCGG methylation level was higher in xylem than in leaves. Meanwhile, the genome methylation in the xylem shows great epigenetic variation and could be fixed and inherited though mitosis. Compared to genetic structure, data suggest that epigenetic and genetic variation do not completely match.

  6. Genetic variation architecture of mitochondrial genome reveals the differentiation in Korean landrace and weedy rice.

    Science.gov (United States)

    Tong, Wei; He, Qiang; Park, Yong-Jin

    2017-03-03

    Mitochondrial genome variations have been detected despite the overall conservation of this gene content, which has been valuable for plant population genetics and evolutionary studies. Here, we describe mitochondrial variation architecture and our performance of a phylogenetic dissection of Korean landrace and weedy rice. A total of 4,717 variations across the mitochondrial genome were identified adjunct with 10 wild rice. Genetic diversity assessment revealed that wild rice has higher nucleotide diversity than landrace and/or weedy, and landrace rice has higher diversity than weedy rice. Genetic distance was suggestive of a high level of breeding between landrace and weedy rice, and the landrace showing a closer association with wild rice than weedy rice. Population structure and principal component analyses showed no obvious difference in the genetic backgrounds of landrace and weedy rice in mitochondrial genome level. Phylogenetic, population split, and haplotype network evaluations were suggestive of independent origins of the indica and japonica varieties. The origin of weedy rice is supposed to be more likely from cultivated rice rather than from wild rice in mitochondrial genome level.

  7. A genome-wide, fine-scale map of natural pigmentation variation in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Héloïse Bastide

    2013-06-01

    Full Text Available Various approaches can be applied to uncover the genetic basis of natural phenotypic variation, each with their specific strengths and limitations. Here, we use a replicated genome-wide association approach (Pool-GWAS to fine-scale map genomic regions contributing to natural variation in female abdominal pigmentation in Drosophila melanogaster, a trait that is highly variable in natural populations and highly heritable in the laboratory. We examined abdominal pigmentation phenotypes in approximately 8000 female European D. melanogaster, isolating 1000 individuals with extreme phenotypes. We then used whole-genome Illumina sequencing to identify single nucleotide polymorphisms (SNPs segregating in our sample, and tested these for associations with pigmentation by contrasting allele frequencies between replicate pools of light and dark individuals. We identify two small regions near the pigmentation genes tan and bric-à-brac 1, both corresponding to known cis-regulatory regions, which contain SNPs showing significant associations with pigmentation variation. While the Pool-GWAS approach suffers some limitations, its cost advantage facilitates replication and it can be applied to any non-model system with an available reference genome.

  8. Genetic variation architecture of mitochondrial genome reveals the differentiation in Korean landrace and weedy rice

    Science.gov (United States)

    Tong, Wei; He, Qiang; Park, Yong-Jin

    2017-01-01

    Mitochondrial genome variations have been detected despite the overall conservation of this gene content, which has been valuable for plant population genetics and evolutionary studies. Here, we describe mitochondrial variation architecture and our performance of a phylogenetic dissection of Korean landrace and weedy rice. A total of 4,717 variations across the mitochondrial genome were identified adjunct with 10 wild rice. Genetic diversity assessment revealed that wild rice has higher nucleotide diversity than landrace and/or weedy, and landrace rice has higher diversity than weedy rice. Genetic distance was suggestive of a high level of breeding between landrace and weedy rice, and the landrace showing a closer association with wild rice than weedy rice. Population structure and principal component analyses showed no obvious difference in the genetic backgrounds of landrace and weedy rice in mitochondrial genome level. Phylogenetic, population split, and haplotype network evaluations were suggestive of independent origins of the indica and japonica varieties. The origin of weedy rice is supposed to be more likely from cultivated rice rather than from wild rice in mitochondrial genome level. PMID:28256554

  9. Read clouds uncover variation in complex regions of the human genome.

    Science.gov (United States)

    Bishara, Alex; Liu, Yuling; Weng, Ziming; Kashef-Haghighi, Dorna; Newburger, Daniel E; West, Robert; Sidow, Arend; Batzoglou, Serafim

    2015-10-01

    Although an increasing amount of human genetic variation is being identified and recorded, determining variants within repeated sequences of the human genome remains a challenge. Most population and genome-wide association studies have therefore been unable to consider variation in these regions. Core to the problem is the lack of a sequencing technology that produces reads with sufficient length and accuracy to enable unique mapping. Here, we present a novel methodology of using read clouds, obtained by accurate short-read sequencing of DNA derived from long fragment libraries, to confidently align short reads within repeat regions and enable accurate variant discovery. Our novel algorithm, Random Field Aligner (RFA), captures the relationships among the short reads governed by the long read process via a Markov Random Field. We utilized a modified version of the Illumina TruSeq synthetic long-read protocol, which yielded shallow-sequenced read clouds. We test RFA through extensive simulations and apply it to discover variants on the NA12878 human sample, for which shallow TruSeq read cloud sequencing data are available, and on an invasive breast carcinoma genome that we sequenced using the same method. We demonstrate that RFA facilitates accurate recovery of variation in 155 Mb of the human genome, including 94% of 67 Mb of segmental duplication sequence and 96% of 11 Mb of transcribed sequence, that are currently hidden from short-read technologies.

  10. Detection of breed specific copy number variations in domestic chicken genome.

    Science.gov (United States)

    Sohrabi, Saeed S; Mohammadabadi, Mohammadreza; Wu, Dong-Dong; Esmailizadeh, Ali

    2017-09-29

    Copy number variations (CNVs) are important large scale variants that are widespread in the genome and may contribute to phenotypic variation. Detection and characterization of CNVs can provide new insights into the genetic basis of important traits. Here, we performed whole genome short read sequence analysis to identify CNVs in two indigenous and commercial chicken breeds and evaluate the impact of the identified CNVs on breed specific traits. After filtration, a total of 12955 CNVs spanning (on average) about 9.42% of the chicken genome were found that made up 5467 CNV regions (CNVRs). Chicken quantitative trait loci (QTL) datasets and Ensembl gene annotations were used as resources for the estimation of potential phenotypic effects of our CNVRs on breed specific traits. In total, 34% of our detected CNVRs were also detected in earlier CNV studies. These CNVRs partly overlap with several previously reported QTL and gene ontology terms associated with some important traits, including shank length QTL in Creeper specific CNVRs and body weight and egg production characteristics as well as growth of muscles and body organs gene terms in the Arian commercial breed. Our findings provide new insights into the genomic structure of the chicken genome for an improved understanding of the potential roles of CNVRs in differentiating between breeds or lines.

  11. Marked variation in predicted and observed variability of tandem repeat loci across the human genome

    Directory of Open Access Journals (Sweden)

    Shields Denis C

    2008-04-01

    Full Text Available Abstract Background Tandem repeat (TR variants in the human genome play key roles in a number of diseases. However, current models predicting variability are based on limited training sets. We conducted a systematic analysis of TRs of unit lengths 2–12 nucleotides in Whole Genome Shotgun (WGS sequences to define the extent of variation of 209,214 unique repeat loci throughout the genome. Results We applied a multivariate statistical model to predict TR variability. Predicted heterozygosity correlated with heterozygosity in the CEPH polymorphism database (correlation ρ = 0.29, p Conclusion Variability among 2–12-mer TRs in the genome can be modeled by a few parameters, which do not markedly differ according to unit length, consistent with a common mechanism for the generation of variability among such TRs. Analysis of the distributions of observed and predicted variants across the genome showed a general concordance, indicating that the repeat variation dataset does not exhibit strong regional ascertainment biases. This revealed a deficit of variant repeats in chromosomes 19 and Y – likely to reflect a reduction in 2-mer repeats in the former and a reduced level of recombination in the latter – and excesses in chromosomes 6, 13, 20 and 21.

  12. A genome-wide study of recombination rate variation in Bartonella henselae

    Directory of Open Access Journals (Sweden)

    Guy Lionel

    2012-05-01

    Full Text Available Abstract Background Rates of recombination vary by three orders of magnitude in bacteria but the reasons for this variation is unclear. We performed a genome-wide study of recombination rate variation among genes in the intracellular bacterium Bartonella henselae, which has among the lowest estimated ratio of recombination relative to mutation in prokaryotes. Results The 1.9 Mb genomes of B. henselae strains IC11, UGA10 and Houston-1 genomes showed only minor gene content variation. Nucleotide sequence divergence levels were less than 1% and the relative rate of recombination to mutation was estimated to 1.1 for the genome overall. Four to eight segments per genome presented significantly enhanced divergences, the most pronounced of which were the virB and trw gene clusters for type IV secretion systems that play essential roles in the infection process. Consistently, multiple recombination events were identified inside these gene clusters. High recombination frequencies were also observed for a gene putatively involved in iron metabolism. A phylogenetic study of this gene in 80 strains of Bartonella quintana, B. henselae and B. grahamii indicated different population structures for each species and revealed horizontal gene transfers across Bartonella species with different host preferences. Conclusions Our analysis has shown little novel gene acquisition in B. henselae, indicative of a closed pan-genome, but higher recombination frequencies within the population than previously estimated. We propose that the dramatically increased fixation rate for recombination events at gene clusters for type IV secretion systems is driven by selection for sequence variability.

  13. VarB Plus: An Integrated Tool for Visualization of Genome Variation Datasets

    KAUST Repository

    Hidayah, Lailatul

    2012-07-01

    Research on genomic sequences has been improving significantly as more advanced technology for sequencing has been developed. This opens enormous opportunities for sequence analysis. Various analytical tools have been built for purposes such as sequence assembly, read alignments, genome browsing, comparative genomics, and visualization. From the visualization perspective, there is an increasing trend towards use of large-scale computation. However, more than power is required to produce an informative image. This is a challenge that we address by providing several ways of representing biological data in order to advance the inference endeavors of biologists. This thesis focuses on visualization of variations found in genomic sequences. We develop several visualization functions and embed them in an existing variation visualization tool as extensions. The tool we improved is named VarB, hence the nomenclature for our enhancement is VarB Plus. To the best of our knowledge, besides VarB, there is no tool that provides the capability of dynamic visualization of genome variation datasets as well as statistical analysis. Dynamic visualization allows users to toggle different parameters on and off and see the results on the fly. The statistical analysis includes Fixation Index, Relative Variant Density, and Tajima’s D. Hence we focused our efforts on this tool. The scope of our work includes plots of per-base genome coverage, Principal Coordinate Analysis (PCoA), integration with a read alignment viewer named LookSeq, and visualization of geo-biological data. In addition to description of embedded functionalities, significance, and limitations, future improvements are discussed. The result is four extensions embedded successfully in the original tool, which is built on the Qt framework in C++. Hence it is portable to numerous platforms. Our extensions have shown acceptable execution time in a beta testing with various high-volume published datasets, as well as positive

  14. Background selection as baseline for nucleotide variation across the Drosophila genome.

    Science.gov (United States)

    Comeron, Josep M

    2014-06-01

    The constant removal of deleterious mutations by natural selection causes a reduction in neutral diversity and efficacy of selection at genetically linked sites (a process called Background Selection, BGS). Population genetic studies, however, often ignore BGS effects when investigating demographic events or the presence of other types of selection. To obtain a more realistic evolutionary expectation that incorporates the unavoidable consequences of deleterious mutations, we generated high-resolution landscapes of variation across the Drosophila melanogaster genome under a BGS scenario independent of polymorphism data. We find that BGS plays a significant role in shaping levels of variation across the entire genome, including long introns and intergenic regions distant from annotated genes. We also find that a very large percentage of the observed variation in diversity across autosomes can be explained by BGS alone, up to 70% across individual chromosome arms at 100-kb scale, thus indicating that BGS predictions can be used as baseline to infer additional types of selection and demographic events. This approach allows detecting several outlier regions with signal of recent adaptive events and selective sweeps. The use of a BGS baseline, however, is particularly appropriate to investigate the presence of balancing selection and our study exposes numerous genomic regions with the predicted signature of higher polymorphism than expected when a BGS context is taken into account. Importantly, we show that these conclusions are robust to the mutation and selection parameters of the BGS model. Finally, analyses of protein evolution together with previous comparisons of genetic maps between Drosophila species, suggest temporally variable recombination landscapes and, thus, local BGS effects that may differ between extant and past phases. Because genome-wide BGS and temporal changes in linkage effects can skew approaches to estimate demographic and selective events, future

  15. Background selection as baseline for nucleotide variation across the Drosophila genome.

    Directory of Open Access Journals (Sweden)

    Josep M Comeron

    2014-06-01

    Full Text Available The constant removal of deleterious mutations by natural selection causes a reduction in neutral diversity and efficacy of selection at genetically linked sites (a process called Background Selection, BGS. Population genetic studies, however, often ignore BGS effects when investigating demographic events or the presence of other types of selection. To obtain a more realistic evolutionary expectation that incorporates the unavoidable consequences of deleterious mutations, we generated high-resolution landscapes of variation across the Drosophila melanogaster genome under a BGS scenario independent of polymorphism data. We find that BGS plays a significant role in shaping levels of variation across the entire genome, including long introns and intergenic regions distant from annotated genes. We also find that a very large percentage of the observed variation in diversity across autosomes can be explained by BGS alone, up to 70% across individual chromosome arms at 100-kb scale, thus indicating that BGS predictions can be used as baseline to infer additional types of selection and demographic events. This approach allows detecting several outlier regions with signal of recent adaptive events and selective sweeps. The use of a BGS baseline, however, is particularly appropriate to investigate the presence of balancing selection and our study exposes numerous genomic regions with the predicted signature of higher polymorphism than expected when a BGS context is taken into account. Importantly, we show that these conclusions are robust to the mutation and selection parameters of the BGS model. Finally, analyses of protein evolution together with previous comparisons of genetic maps between Drosophila species, suggest temporally variable recombination landscapes and, thus, local BGS effects that may differ between extant and past phases. Because genome-wide BGS and temporal changes in linkage effects can skew approaches to estimate demographic and

  16. Genomic copy number variation associated with clinical outcome in canine cutaneous mast cell tumors

    DEFF Research Database (Denmark)

    Jark, Paulo C; Mundin, Deborah B P; de Carvalho, Marcio

    2017-01-01

    from Group ST>12 and six from Group STGenomic DNA was extracted, and aCGH was performed using Agilent Canine Genome CGH Microarray 4×180 (ID-252 552 - Agilent, USA). Data analysis was carried out using Nexus program version 5.0 (Biodiscovery, USA). The group ST>12 presented 11±3.3 CNVs, while...... in DNA isolated from tumor cells by array comparative genomic hybridization (aCGH). The aim of this study was to compare copy number variations (CNVs) in cutaneous mast cell tumors of dogs that survived less than six (ST12months (ST>12) from the date of diagnosis. Ten animals were used: four...

  17. Variations in Desired Family Size and Excess Fertility in East Africa

    NARCIS (Netherlands)

    Muhoza, D.N.; Broekhuis, A.; Hooimeijer, P.

    2014-01-01

    This contribution studies the variation in desired family size and excess fertility in four East African countries by analyzing the combined impact of wealth, education, religious affiliation, and place of residence. The findings show an enormous heterogeneity in Kenya. Wealthy and higher educated

  18. Exploring patterns of variation in clutch size-density reaction norms in a wild passerine bird

    NARCIS (Netherlands)

    Nicolaus, M.; Brommer, J. E.; Ubels, R.; Tinbergen, J. M.; Dingemanse, N. J.

    2013-01-01

    Negative density dependence of clutch size is a ubiquitous characteristic of avian populations and is partly due to within-individual phenotypic plasticity. Yet, very little is known about the extent to which individuals differ in their degree of phenotypic plasticity, whether such variation has a g

  19. Variation in male body size and reproductive allocation in the leafcutter ant Atta colombica

    DEFF Research Database (Denmark)

    Stürup, M.; den Boer, S. P. A.; Nash, David Richard

    2011-01-01

    species. In 2008 and 2009, we revisited a Panamanian population of Atta colombica leafcutter ants to partially repeat and complement a study of more than 15 years ago. We compared within- and between-colony variation in male body size (mass and width of head, mesosoma and gaster) and sperm characteristics...

  20. Variations in Desired Family Size and Excess Fertility in East Africa

    NARCIS (Netherlands)

    Muhoza, D.N.; Broekhuis, A.; Hooimeijer, P.

    2014-01-01

    This contribution studies the variation in desired family size and excess fertility in four East African countries by analyzing the combined impact of wealth, education, religious affiliation, and place of residence. The findings show an enormous heterogeneity in Kenya. Wealthy and higher educated p

  1. Intraspecific egg size variation and sperm limitation in the broadcast spawning bivalve

    NARCIS (Netherlands)

    Luttikhuizen, P.C.; Honkoop, P.J.C.; Drent, J.

    2011-01-01

    Broadcast spawners are exceptionally suited and simple models for studying parental investment in offspring, because direct post-spawning investment is nonexistent. However, a comprehensive understanding of the large variation that exists in their egg sizes is still lacking. One of the main hypothes

  2. Improving the Process-Variation Tolerance of Digital Circuits Using Gate Sizing and Statistical Techniques

    CERN Document Server

    Neiroukh, Osama

    2011-01-01

    A new approach for enhancing the process-variation tolerance of digital circuits is described. We extend recent advances in statistical timing analysis into an optimization framework. Our objective is to reduce the performance variance of a technology-mapped circuit where delays across elements are represented by random variables which capture the manufacturing variations. We introduce the notion of statistical critical paths, which account for both means and variances of performance variation. An optimization engine is used to size gates with a goal of reducing the timing variance along the statistical critical paths. We apply a pair of nested statistical analysis methods deploying a slower more accurate approach for tracking statistical critical paths and a fast engine for evaluation of gate size assignments. We derive a new approximation for the max operation on random variables which is deployed for the faster inner engine. Circuit optimization is carried out using a gain-based algorithm that terminates w...

  3. Genomic analysis of natural selection and phenotypic variation in high-altitude mongolians.

    Directory of Open Access Journals (Sweden)

    Jinchuan Xing

    Full Text Available Deedu (DU Mongolians, who migrated from the Mongolian steppes to the Qinghai-Tibetan Plateau approximately 500 years ago, are challenged by environmental conditions similar to native Tibetan highlanders. Identification of adaptive genetic factors in this population could provide insight into coordinated physiological responses to this environment. Here we examine genomic and phenotypic variation in this unique population and present the first complete analysis of a Mongolian whole-genome sequence. High-density SNP array data demonstrate that DU Mongolians share genetic ancestry with other Mongolian as well as Tibetan populations, specifically in genomic regions related with adaptation to high altitude. Several selection candidate genes identified in DU Mongolians are shared with other Asian groups (e.g., EDAR, neighboring Tibetan populations (including high-altitude candidates EPAS1, PKLR, and CYP2E1, as well as genes previously hypothesized to be associated with metabolic adaptation (e.g., PPARG. Hemoglobin concentration, a trait associated with high-altitude adaptation in Tibetans, is at an intermediate level in DU Mongolians compared to Tibetans and Han Chinese at comparable altitude. Whole-genome sequence from a DU Mongolian (Tianjiao1 shows that about 2% of the genomic variants, including more than 300 protein-coding changes, are specific to this individual. Our analyses of DU Mongolians and the first Mongolian genome provide valuable insight into genetic adaptation to extreme environments.

  4. Whole-Genome Sequencing Reveals Genetic Variation in the Asian House Rat

    Directory of Open Access Journals (Sweden)

    Huajing Teng

    2016-07-01

    Full Text Available Whole-genome sequencing of wild-derived rat species can provide novel genomic resources, which may help decipher the genetics underlying complex phenotypes. As a notorious pest, reservoir of human pathogens, and colonizer, the Asian house rat, Rattus tanezumi, is successfully adapted to its habitat. However, little is known regarding genetic variation in this species. In this study, we identified over 41,000,000 single-nucleotide polymorphisms, plus insertions and deletions, through whole-genome sequencing and bioinformatics analyses. Moreover, we identified over 12,000 structural variants, including 143 chromosomal inversions. Further functional analyses revealed several fixed nonsense mutations associated with infection and immunity-related adaptations, and a number of fixed missense mutations that may be related to anticoagulant resistance. A genome-wide scan for loci under selection identified various genes related to neural activity. Our whole-genome sequencing data provide a genomic resource for future genetic studies of the Asian house rat species and have the potential to facilitate understanding of the molecular adaptations of rats to their ecological niches.

  5. Whole-Genome Sequencing Reveals Genetic Variation in the Asian House Rat.

    Science.gov (United States)

    Teng, Huajing; Zhang, Yaohua; Shi, Chengmin; Mao, Fengbiao; Hou, Lingling; Guo, Hongling; Sun, Zhongsheng; Zhang, Jianxu

    2016-07-07

    Whole-genome sequencing of wild-derived rat species can provide novel genomic resources, which may help decipher the genetics underlying complex phenotypes. As a notorious pest, reservoir of human pathogens, and colonizer, the Asian house rat, Rattus tanezumi, is successfully adapted to its habitat. However, little is known regarding genetic variation in this species. In this study, we identified over 41,000,000 single-nucleotide polymorphisms, plus insertions and deletions, through whole-genome sequencing and bioinformatics analyses. Moreover, we identified over 12,000 structural variants, including 143 chromosomal inversions. Further functional analyses revealed several fixed nonsense mutations associated with infection and immunity-related adaptations, and a number of fixed missense mutations that may be related to anticoagulant resistance. A genome-wide scan for loci under selection identified various genes related to neural activity. Our whole-genome sequencing data provide a genomic resource for future genetic studies of the Asian house rat species and have the potential to facilitate understanding of the molecular adaptations of rats to their ecological niches.

  6. [Egg size variation in egrets and herons (Aves: Ardeidae) nesting in Birama's swamp, Cuba].

    Science.gov (United States)

    Denis Avila, Dennis

    2015-03-01

    Intraclutch egg size variation in birds depends on many ecological factors and on the evolutive history of each species. In wading birds, a trend to smaller eggs with laying order has been described, but comparative reports are scarce. In this study, egg size variation patterns were described for nine Egrets and Heron species nesting in Birama' Swamp, Cuba. The patterns were described using external dimensions of 3142 eggs from 1875 nests of Butorides virescens, Bubulcus ibis, Ardea alba, Nycticorax nycticorax, Nyctanassa violacea and four Egretta species, taken in the field between 1998 and 2006. Results showed that eggs were 4.9-10% of adult weight and had volume variation coefficients between 6-9%. There were no general and consistent interspecies relationship between clutch size and egg sizes. Average volumes tend to get smaller with laying order, but it is not statistically detectable in Butorides and Bubulcus. Last egg was between 0.2% and 15% smaller than the first, showing an inverse relationship with it. Intraclutch asymmetry is light in E. thula and fluctuating around null in Bubulcus. Size only predicted laying or hatching order for the last egg, in nests with more than two eggs, with 72.4% of confidence.

  7. Nucleotide diversity maps reveal variation in diversity among wheat genomes and chromosomes

    Directory of Open Access Journals (Sweden)

    McGuire Patrick E

    2010-12-01

    chromosomal regions. The net effect of these factors in T. aestivum is large variation in diversity among genomes and chromosomes, which impacts the development of SNP markers and their practical utility. Accumulation of new mutations in older polyploid species, such as wild emmer, results in increased diversity and its more uniform distribution across the genome.

  8. Estimation of the Whitefly Bemisia tabaci Genome Size Based on k-mer and Flow Cytometric Analyses

    Directory of Open Access Journals (Sweden)

    Wenbo Chen

    2015-07-01

    Full Text Available Whiteflies of the Bemisia tabaci (Hemiptera: Aleyrodidae cryptic species complex are among the most important agricultural insect pests in the world. These phloem-feeding insects can colonize over 1000 species of plants worldwide and inflict severe economic losses to crops, mainly through the transmission of pathogenic viruses. Surprisingly, there is very little genomic information about whiteflies. As a starting point to genome sequencing, we report a new estimation of the genome size of the B. tabaci B biotype or Middle East-Asia Minor 1 (MEAM1 population. Using an isogenic whitefly colony with over 6500 haploid male individuals for genomic DNA, three paired-end genomic libraries with insert sizes of ~300 bp, 500 bp and 1 Kb were constructed and sequenced on an Illumina HiSeq 2500 system. A total of ~50 billion base pairs of sequences were obtained from each library. K-mer analysis using these sequences revealed that the genome size of the whitefly was ~682.3 Mb. In addition, the flow cytometric analysis estimated the haploid genome size of the whitefly to be ~690 Mb. Considering the congruency between both estimation methods, we predict the haploid genome size of B. tabaci MEAM1 to be ~680–690 Mb. Our data provide a baseline for ongoing efforts to assemble and annotate the B. tabaci genome.

  9. Variations and classification of toxic epitopes related to celiac disease among α-gliadin genes from four Aegilops genomes.

    Science.gov (United States)

    Li, Jie; Wang, Shunli; Li, Shanshan; Ge, Pei; Li, Xiaohui; Ma, Wujun; Zeller, F J; Hsam, Sai L K; Yan, Yueming

    2012-07-01

    The α-gliadins are associated with human celiac disease. A total of 23 noninterrupted full open reading frame α-gliadin genes and 19 pseudogenes were cloned and sequenced from C, M, N, and U genomes of four diploid Aegilops species. Sequence comparison of α-gliadin genes from Aegilops and Triticum species demonstrated an existence of extensive allelic variations in Gli-2 loci of the four Aegilops genomes. Specific structural features were found including the compositions and variations of two polyglutamine domains (QI and QII) and four T cell stimulatory toxic epitopes. The mean numbers of glutamine residues in the QI domain in C and N genomes and the QII domain in C, N, and U genomes were much higher than those in Triticum genomes, and the QI domain in C and N genomes and the QII domain in C, M, N, and U genomes displayed greater length variations. Interestingly, the types and numbers of four T cell stimulatory toxic epitopes in α-gliadins from the four Aegilops genomes were significantly less than those from Triticum A, B, D, and their progenitor genomes. Relationships between the structural variations of the two polyglutamine domains and the distributions of four T cell stimulatory toxic epitopes were found, resulting in the α-gliadin genes from the Aegilops and Triticum genomes to be classified into three groups.

  10. Natural selection affects multiple aspects of genetic variation at putatively peutral sites across the human genome

    DEFF Research Database (Denmark)

    Lohmueller, Kirk E; Albrechtsen, Anders; Li, Yingrui;

    2011-01-01

    throughout the genome. Further, we show that the widespread presence of weakly deleterious alleles, rather than a small number of strongly positively selected mutations, is responsible for the correlation between neutral genetic diversity and recombination rate. This work suggests that natural selection has......A major question in evolutionary biology is how natural selection has shaped patterns of genetic variation across the human genome. Previous work has documented a reduction in genetic diversity in regions of the genome with low recombination rates. However, it is unclear whether other summaries...... and that human diversity, human-chimp divergence, and average minor allele frequency are reduced near genes. Population genetic simulations show that either positive natural selection acting on favorable mutations or negative natural selection acting against deleterious mutations can explain these correlations...

  11. Natural variation in SAR11 marine bacterioplankton genomes inferred from metagenomic data

    Directory of Open Access Journals (Sweden)

    Wilhelm Larry J

    2007-11-01

    Full Text Available Abstract Background One objective of metagenomics is to reconstruct information about specific uncultured organisms from fragmentary environmental DNA sequences. We used the genome of an isolate of the marine alphaproteobacterium SAR11 ('Candidatus Pelagibacter ubique'; strain HTCC1062, obtained from the cold, productive Oregon coast, as a query sequence to study variation in SAR11 metagenome sequence data from the Sargasso Sea, a warm, oligotrophic ocean gyre. Results The average amino acid identity of SAR11 genes encoded by the metagenomic data to the query genome was only 71%, indicating significant evolutionary divergence between the coastal isolates and Sargasso Sea populations. However, an analysis of gene neighbors indicated that SAR11 genes in the Sargasso Sea metagenomic data match the gene order of the HTCC1062 genome in 96% of cases (> 85,000 observations, and that rearrangements are most frequent at predicted operon boundaries. There were no conserved examples of genes with known functions being found in the coastal isolates, but not the Sargasso Sea metagenomic data, or vice versa, suggesting that core regions of these diverse SAR11 genomes are relatively conserved in gene content. However, four hypervariable regions were observed, which may encode properties associated with variation in SAR11 ecotypes. The largest of these, HVR2, is a 48 kb region flanked by the sole 5S and 23S genes in the HTCC1062 genome, and mainly encodes genes that determine cell surface properties. A comparison of two closely related 'Candidatus Pelagibacter' genomes (HTCC1062 and HTCC1002 revealed a number of "gene indels" in core regions. Most of these were found to be polymorphic in the metagenomic data and showed evidence of purifying selection, suggesting that the same "polymorphic gene indels" are maintained in physically isolated SAR11 populations. Conclusion These findings suggest that natural selection has conserved many core features of SAR11

  12. A biometrical genome search in rats reveals the multigenic basis of blood pressure variation.

    Science.gov (United States)

    Schork, N J; Krieger, J E; Trolliet, M R; Franchini, K G; Koike, G; Krieger, E M; Lander, E S; Dzau, V J; Jacob, H J

    1995-09-01

    A genome-wide search for multiple loci influencing salt-loaded systolic blood pressure (NaSBP) variation among 188 F2 progeny from a cross between the Brown-Norway and spontaneously hypertensive rat strains was pursued in an effort to gain insight into the polygenic basis of blood pressure regulation. The results suggest that loci within five to six genomic regions collectively explain approximately 43% of the total NaSBP variation exhibited among the 188 F2 progeny. Many of these loci are in regions that previous studies have not implicated in blood pressure regulation. Ultimately, however, this study not only sheds light on the multigenic basis of blood pressure but provides further evidence that the identification of the genetic determinants of polygenic traits in mammals is possible with modern biometrical and molecular genetic tools in controlled settings (i.e., breeding paradigm and model organism).

  13. Genome-wide fine-scale recombination rate variation in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Andrew H Chan

    Full Text Available Estimating fine-scale recombination maps of Drosophila from population genomic data is a challenging problem, in particular because of the high background recombination rate. In this paper, a new computational method is developed to address this challenge. Through an extensive simulation study, it is demonstrated that the method allows more accurate inference, and exhibits greater robustness to the effects of natural selection and noise, compared to a well-used previous method developed for studying fine-scale recombination rate variation in the human genome. As an application, a genome-wide analysis of genetic variation data is performed for two Drosophila melanogaster populations, one from North America (Raleigh, USA and the other from Africa (Gikongoro, Rwanda. It is shown that fine-scale recombination rate variation is widespread throughout the D. melanogaster genome, across all chromosomes and in both populations. At the fine-scale, a conservative, systematic search for evidence of recombination hotspots suggests the existence of a handful of putative hotspots each with at least a tenfold increase in intensity over the background rate. A wavelet analysis is carried out to compare the estimated recombination maps in the two populations and to quantify the extent to which recombination rates are conserved. In general, similarity is observed at very broad scales, but substantial differences are seen at fine scales. The average recombination rate of the X chromosome appears to be higher than that of the autosomes in both populations, and this pattern is much more pronounced in the African population than the North American population. The correlation between various genomic features-including recombination rates, diversity, divergence, GC content, gene content, and sequence quality-is examined using the wavelet analysis, and it is shown that the most notable difference between D. melanogaster and humans is in the correlation between

  14. Comparative population genomics of latitudinal variation in Drosophila simulans and Drosophila melanogaster.

    Science.gov (United States)

    Machado, Heather E; Bergland, Alan O; O'Brien, Katherine R; Behrman, Emily L; Schmidt, Paul S; Petrov, Dmitri A

    2016-02-01

    Examples of clinal variation in phenotypes and genotypes across latitudinal transects have served as important models for understanding how spatially varying selection and demographic forces shape variation within species. Here, we examine the selective and demographic contributions to latitudinal variation through the largest comparative genomic study to date of Drosophila simulans and Drosophila melanogaster, with genomic sequence data from 382 individual fruit flies, collected across a spatial transect of 19 degrees latitude and at multiple time points over 2 years. Consistent with phenotypic studies, we find less clinal variation in D. simulans than D. melanogaster, particularly for the autosomes. Moreover, we find that clinally varying loci in D. simulans are less stable over multiple years than comparable clines in D. melanogaster. D. simulans shows a significantly weaker pattern of isolation by distance than D. melanogaster and we find evidence for a stronger contribution of migration to D. simulans population genetic structure. While population bottlenecks and migration can plausibly explain the differences in stability of clinal variation between the two species, we also observe a significant enrichment of shared clinal genes, suggesting that the selective forces associated with climate are acting on the same genes and phenotypes in D. simulans and D. melanogaster. © 2015 John Wiley & Sons Ltd.

  15. Impacts of both reference population size and inclusion of a residual polygenic effect on the accuracy of genomic prediction

    Directory of Open Access Journals (Sweden)

    Rensing Stephan

    2011-05-01

    Full Text Available Abstract Background The purpose of this work was to study the impact of both the size of genomic reference populations and the inclusion of a residual polygenic effect on dairy cattle genetic evaluations enhanced with genomic information. Methods Direct genomic values were estimated for German Holstein cattle with a genomic BLUP model including a residual polygenic effect. A total of 17,429 genotyped Holstein bulls were evaluated using the phenotypes of 44 traits. The Interbull genomic validation test was implemented to investigate how the inclusion of a residual polygenic effect impacted genomic estimated breeding values. Results As the number of reference bulls increased, both the variance of the estimates of single nucleotide polymorphism effects and the reliability of the direct genomic values of selection candidates increased. Fitting a residual polygenic effect in the model resulted in less biased genome-enhanced breeding values and decreased the correlation between direct genomic values and estimated breeding values of sires in the reference population. Conclusions Genetic evaluation of dairy cattle enhanced with genomic information is highly effective in increasing reliability, as well as using large genomic reference populations. We found that fitting a residual polygenic effect reduced the bias in genome-enhanced breeding values, decreased the correlation between direct genomic values and sire's estimated breeding values and made genome-enhanced breeding values more consistent in mean and variance as is the case for pedigree-based estimated breeding values.

  16. Patterns of Genome-Wide Variation in Glossina fuscipes fuscipes Tsetse Flies from Uganda.

    Science.gov (United States)

    Gloria-Soria, Andrea; Dunn, W Augustine; Telleria, Erich L; Evans, Benjamin R; Okedi, Loyce; Echodu, Richard; Warren, Wesley C; Montague, Michael J; Aksoy, Serap; Caccone, Adalgisa

    2016-06-01

    The tsetse fly Glossina fuscipes fuscipes (Gff) is the insect vector of the two forms of Human African Trypanosomiasis (HAT) that exist in Uganda. Understanding Gff population dynamics, and the underlying genetics of epidemiologically relevant phenotypes is key to reducing disease transmission. Using ddRAD sequence technology, complemented with whole-genome sequencing, we developed a panel of ∼73,000 single-nucleotide polymorphisms (SNPs) distributed across the Gff genome that can be used for population genomics and to perform genome-wide-association studies. We used these markers to estimate genomic patterns of linkage disequilibrium (LD) in Gff, and used the information, in combination with outlier-locus detection tests, to identify candidate regions of the genome under selection. LD in individual populations decays to half of its maximum value (r(2) max/2) between 1359 and 2429 bp. The overall LD estimated for the species reaches r(2) max/2 at 708 bp, an order of magnitude slower than in Drosophila Using 53 infected (Trypanosoma spp.) and uninfected flies from four genetically distinct Ugandan populations adapted to different environmental conditions, we were able to identify SNPs associated with the infection status of the fly and local environmental adaptation. The extent of LD in Gff likely facilitated the detection of loci under selection, despite the small sample size. Furthermore, it is probable that LD in the regions identified is much higher than the average genomic LD due to strong selection. Our results show that even modest sample sizes can reveal significant genetic associations in this species, which has implications for future studies given the difficulties of collecting field specimens with contrasting phenotypes for association analysis.

  17. Patterns of Genome-Wide Variation in Glossina fuscipes fuscipes Tsetse Flies from Uganda

    Directory of Open Access Journals (Sweden)

    Andrea Gloria-Soria

    2016-06-01

    Full Text Available The tsetse fly Glossina fuscipes fuscipes (Gff is the insect vector of the two forms of Human African Trypanosomiasis (HAT that exist in Uganda. Understanding Gff population dynamics, and the underlying genetics of epidemiologically relevant phenotypes is key to reducing disease transmission. Using ddRAD sequence technology, complemented with whole-genome sequencing, we developed a panel of ∼73,000 single-nucleotide polymorphisms (SNPs distributed across the Gff genome that can be used for population genomics and to perform genome-wide-association studies. We used these markers to estimate genomic patterns of linkage disequilibrium (LD in Gff, and used the information, in combination with outlier-locus detection tests, to identify candidate regions of the genome under selection. LD in individual populations decays to half of its maximum value (r2max/2 between 1359 and 2429 bp. The overall LD estimated for the species reaches r2max/2 at 708 bp, an order of magnitude slower than in Drosophila. Using 53 infected (Trypanosoma spp. and uninfected flies from four genetically distinct Ugandan populations adapted to different environmental conditions, we were able to identify SNPs associated with the infection status of the fly and local environmental adaptation. The extent of LD in Gff likely facilitated the detection of loci under selection, despite the small sample size. Furthermore, it is probable that LD in the regions identified is much higher than the average genomic LD due to strong selection. Our results show that even modest sample sizes can reveal significant genetic associations in this species, which has implications for future studies given the difficulties of collecting field specimens with contrasting phenotypes for association analysis.

  18. Insights into the dynamics of genome size and chromosome evolution in the early diverging angiosperm lineage Nymphaeales (water lilies).

    Science.gov (United States)

    Pellicer, J; Kelly, L J; Magdalena, C; Leitch, I J

    2013-08-01

    Nymphaeales are the most species-rich lineage of the earliest diverging angiosperms known as the ANA grade (Amborellales, Nymphaeales, Austrobaileyales), and they have received considerable attention from morphological, physiological, and ecological perspectives. Although phylogenetic relationships between these three lineages of angiosperms are mainly well resolved, insights at the whole genome level are still limited because of a dearth of information. To address this, genome sizes and chromosome numbers in 34 taxa, comprising 28 species were estimated and analysed together with previously published data to provide an overview of genome size and chromosome diversity in Nymphaeales. Overall, genome sizes were shown to vary 10-fold and chromosome numbers and ploidy levels ranged from 2n = 2x = 18 to 2n = 16x = ∼224. Distinct patterns of genome diversity were apparent, reflecting the differential incidence of polyploidy, changes in repetitive DNA content, and chromosome rearrangements within and between genera. Using model-based approaches, ancestral genome size and basic chromosome numbers were reconstructed to provide insights into the dynamics of genome size and chromosome number evolution. Finally, by combining additional data from Amborellales and Austrobaileyales, a comprehensive overview of genome sizes and chromosome numbers in these early diverging angiosperms is presented.

  19. Modeling grain size variations of aeolian gypsum deposits at White Sands, New Mexico, using AVIRIS imagery

    Science.gov (United States)

    Ghrefat, H.A.; Goodell, P.C.; Hubbard, B.E.; Langford, R.P.; Aldouri, R.E.

    2007-01-01

    Visible and Near-Infrared (VNIR) through Short Wavelength Infrared (SWIR) (0.4-2.5????m) AVIRIS data, along with laboratory spectral measurements and analyses of field samples, were used to characterize grain size variations in aeolian gypsum deposits across barchan-transverse, parabolic, and barchan dunes at White Sands, New Mexico, USA. All field samples contained a mineralogy of ?????100% gypsum. In order to document grain size variations at White Sands, surficial gypsum samples were collected along three Transects parallel to the prevailing downwind direction. Grain size analyses were carried out on the samples by sieving them into seven size fractions ranging from 45 to 621????m, which were subjected to spectral measurements. Absorption band depths of the size fractions were determined after applying an automated continuum-removal procedure to each spectrum. Then, the relationship between absorption band depth and gypsum size fraction was established using a linear regression. Three software processing steps were carried out to measure the grain size variations of gypsum in the Dune Area using AVIRIS data. AVIRIS mapping results, field work and laboratory analysis all show that the interdune areas have lower absorption band depth values and consist of finer grained gypsum deposits. In contrast, the dune crest areas have higher absorption band depth values and consist of coarser grained gypsum deposits. Based on laboratory estimates, a representative barchan-transverse dune (Transect 1) has a mean grain size of 1.16 ??{symbol} (449????m). The error bar results show that the error ranges from - 50 to + 50????m. Mean grain size for a representative parabolic dune (Transect 2) is 1.51 ??{symbol} (352????m), and 1.52 ??{symbol} (347????m) for a representative barchan dune (Transect 3). T-test results confirm that there are differences in the grain size distributions between barchan and parabolic dunes and between interdune and dune crest areas. The t-test results

  20. Modelling human regulatory variation in mouse: finding the function in genome-wide association studies and whole-genome sequencing.

    Directory of Open Access Journals (Sweden)

    Jean-François Schmouth

    Full Text Available An increasing body of literature from genome-wide association studies and human whole-genome sequencing highlights the identification of large numbers of candidate regulatory variants of potential therapeutic interest in numerous diseases. Our relatively poor understanding of the functions of non-coding genomic sequence, and the slow and laborious process of experimental validation of the functional significance of human regulatory variants, limits our ability to fully benefit from this information in our efforts to comprehend human disease. Humanized mouse models (HuMMs, in which human genes are introduced into the mouse, suggest an approach to this problem. In the past, HuMMs have been used successfully to study human disease variants; e.g., the complex genetic condition arising from Down syndrome, common monogenic disorders such as Huntington disease and β-thalassemia, and cancer susceptibility genes such as BRCA1. In this commentary, we highlight a novel method for high-throughput single-copy site-specific generation of HuMMs entitled High-throughput Human Genes on the X Chromosome (HuGX. This method can be applied to most human genes for which a bacterial artificial chromosome (BAC construct can be derived and a mouse-null allele exists. This strategy comprises (1 the use of recombineering technology to create a human variant-harbouring BAC, (2 knock-in of this BAC into the mouse genome using Hprt docking technology, and (3 allele comparison by interspecies complementation. We demonstrate the throughput of the HuGX method by generating a series of seven different alleles for the human NR2E1 gene at Hprt. In future challenges, we consider the current limitations of experimental approaches and call for a concerted effort by the genetics community, for both human and mouse, to solve the challenge of the functional analysis of human regulatory variation.

  1. A novel technique for measuring variations in DNA copy-number: competitive genomic polymerase chain reaction

    Directory of Open Access Journals (Sweden)

    Nakagawara Akira

    2007-07-01

    Full Text Available Background Changes in genomic copy number occur in many human diseases including cancer. Characterization of these changes is important for both basic understanding and diagnosis of these diseases. Microarrays have recently become the standard technique and are commercially available. However, it is useful to have an affordable technique to complement them. Results We describe a novel polymerase chain reaction (PCR-based technique, termed competitive genomic PCR (CGP. The main characteristic of CGP is that different adaptors are added to the sample and control genomic DNAs after appropriate restriction enzyme digestion. These adaptor-supplemented DNAs are subjected to competitive PCR using an adaptor-primer and a locus-specific primer. The amplified products are then separated according to size differences between the adaptors. CGP eliminates the tedious steps inherent in quantitative PCR and achieves moderate throughput. Assays with different X chromosome numbers showed that it can provide accurate quantification. High-resolution analysis of neuroblastoma cell lines around the MYCN locus revealed novel junctions for amplification, which were not detected by a commercial array. Conclusion CGP is a moderate throughput technique for analyzing changes in genomic copy numbers. Because CGP can measure any genomic locus using PCR primers, it is especially useful for detailed analysis of a genomic region of interest.

  2. Intraspecific variation in erythrocyte sizes among populations of Hypsiboas cordobas (Anura: Hylidae

    Directory of Open Access Journals (Sweden)

    Mariana Baraquet

    2013-12-01

    Full Text Available We studied the morphology and size of erythrocytes of H. cordobae, and analysed the geographic variation of this character along the distribution of the species, in relation to the latitudinal and altitudinal distances. Erythrocyte shape of the H. cordobae is ellipsoidal and the nuclei are also ellipsoidal and centrally oriented. Erythrocyte and nuclear size showed significant differences among populations, with the highest mean size corresponding to the population of Achiras (low altitude site and the lowest mean size to Los Linderos (high altitude site. There was no significant relationship between the latitude of each population and the both erythrocyte and nuclear size. The altitudinal variation in erythrocyte cell size may be attributable to the surface available for gas exchange; a small erythrocyte offers a possibility of greater rate of exchange than a larger one. Our results are consistent with studies of other amphibians, where intraspecific comparisons of populations at different altitudes show that individuals at higher altitudes are characterized by smaller erythrocytes.

  3. Variations in otolith patterns, sizes and body morphometrics of jack mackerel Trachurus japonicus juveniles.

    Science.gov (United States)

    Kanaji, Y; Kishida, M; Watanabe, Y; Kawamura, T; Xie, S; Yamashita, Y; Sassa, C; Tsukamoto, Y

    2010-10-01

    Variations in otolith patterns, sizes and body morphometrics of jack mackerel Trachurus japonicus juveniles were investigated. Under transmitted light, translucent (W(t)) and opaque otoliths (W(o)) were detected in juveniles collected from Wakasa Bay between July 2005 and April 2006, whereas only opaque otoliths (G(o)) were detected in Goto-nada Sea individuals between May and June 2006. Three groups of juveniles were distinguished based on differences in hatch season, otolith size and growth history, and body morphometrics. As T. japonicus has different spawning seasons according to spawning grounds, each group was estimated to hatch in different waters. Juveniles with W(t) otoliths were considered to have stayed in coastal habitat longer, as the hatch area was estimated to be near Wakasa Bay. Juveniles with W(o) and G(o) otoliths appear to recruit to coastal waters at larger size, since their hatch areas were estimated to be far from each collection area. Larger otoliths of W(t) were attributed to otolith accretion after the second growth flexion, which was observed only for W(t) . Standard length of W(t) fish at the second otolith growth flexion was estimated to correspond to recruitment size to coastal rocky reefs in Wakasa Bay. Body morphometrics were correlated with otolith size after removing body size effect, suggesting that morphological variations of T. japonicus juveniles were also associated with the timing of recruitment to coastal habitat.

  4. Population size, habitat fragmentation, and the nature of adaptive variation in a stream fish.

    Science.gov (United States)

    Fraser, Dylan J; Debes, Paul V; Bernatchez, Louis; Hutchings, Jeffrey A

    2014-09-07

    Whether and how habitat fragmentation and population size jointly affect adaptive genetic variation and adaptive population differentiation are largely unexplored. Owing to pronounced genetic drift, small, fragmented populations are thought to exhibit reduced adaptive genetic variation relative to large populations. Yet fragmentation is known to increase variability within and among habitats as population size decreases. Such variability might instead favour the maintenance of adaptive polymorphisms and/or generate more variability in adaptive differentiation at smaller population size. We investigated these alternative hypotheses by analysing coding-gene, single-nucleotide polymorphisms associated with different biological functions in fragmented brook trout populations of variable sizes. Putative adaptive differentiation was greater between small and large populations or among small populations than among large populations. These trends were stronger for genetic population size measures than demographic ones and were present despite pronounced drift in small populations. Our results suggest that fragmentation affects natural selection and that the changes elicited in the adaptive genetic composition and differentiation of fragmented populations vary with population size. By generating more variable evolutionary responses, the alteration of selective pressures during habitat fragmentation may affect future population persistence independently of, and perhaps long before, the effects of demographic and genetic stochasticity are manifest. © 2014 The Author(s) Published by the Royal Society. All rights reserved.

  5. [RAPD analysis of the intraspecific and interspecific variation and phylogenetic relationships of Aegilops L. species with the U genome].

    Science.gov (United States)

    Goriunova, S V; Chikida, N N; Kochieva, E Z

    2010-07-01

    RAPD analysis was used to study the genetic variation and phylogenetic relationships of polyploid Aegilops species with the U genome. In total, 115 DNA samples of eight polyploid species containing the U genome and the diploid species Ae. umbellulata (U) were examined. Substantial interspecific polymorphism was observed for the majority of the polyploid species with the U genome (interspecific differences, 0.01-0,2; proportion of polymorphic loci, 56.6-88.2%). Aegilops triuncialis was identified as the only alloploid species with low interspecific polymorphism (interspecific differences, 0-0.01, P = 50%) in the U-genome group. The U-genome Aegilops species proved to be separated from other species of the genus. The phylogenetic relationships were established for the U-genome species. The greatest separation within the U-genome group was observed for the US-genome species Ae. kotschyi and Ae. variabilis. The tetraploid species Ae. triaristata and Ae. columnaris, which had the UX genome, and the hexaploid species Ae. recta (UXN) were found to be related to each other and separate from the UM-genome species. A similarity was observed between the U M-genome species Ae. ovata and Ae. biuncialis, which had the UM genome, and the ancestral diploid U-genome species Ae. umbellulata. The UC-genome species Ae. triuncialis was rather separate and slightly similar to the UX-genome species.

  6. Ecological correlates of group-size variation in a resource-defense ungulate, the sedentary guanaco.

    Directory of Open Access Journals (Sweden)

    Andrea Marino

    Full Text Available For large herbivores, predation-risk, habitat structure and population density are often reported as major determinants of group size variation within and between species. However, whether the underlying causes of these relationships imply an ecological adaptation or are the result of a purely mechanistic process in which fusion and fragmentation events only depend on the rate of group meeting, is still under debate. The aim of this study was to model guanaco family and bachelor group sizes in contrasting ecological settings in order to test hypotheses regarding the adaptive significance of group-size variation. We surveyed guanaco group sizes within three wildlife reserves located in eastern Patagonia where guanacos occupy a mosaic of grasslands and shrublands. Two of these reserves have been free from predators for decades while in the third, pumas often prey on guanacos. All locations have experienced important changes in guanaco abundance throughout the study offering the opportunity to test for density effects. We found that bachelor group size increased with increasing density, as expected by the mechanistic approach, but was independent of habitat structure or predation risk. In contrast, the smaller and territorial family groups were larger in the predator-exposed than in the predator-free locations, and were larger in open grasslands than in shrublands. However, the influence of population density on these social units was very weak. Therefore, family group data supported the adaptive significance of group-size variation but did not support the mechanistic idea. Yet, the magnitude of the effects was small and between-population variation in family group size after controlling for habitat and predation was negligible, suggesting that plasticity of these social units is considerably low. Our results showed that different social units might respond differentially to local ecological conditions, supporting two contrasting hypotheses in a

  7. Variation in Genomic Methylation in Natural Populations of Chinese White Poplar

    OpenAIRE

    Kaifeng Ma; Yuepeng Song; Xiaohui Yang; Zhiyi Zhang; Deqiang Zhang

    2013-01-01

    BACKGROUND: It is thought that methylcytosine can be inherited through meiosis and mitosis, and that epigenetic variation may be under genetic control or correlation may be caused by neutral drift. However, DNA methylation also varies with tissue, developmental stage, and environmental factors. Eliminating these factors, we analyzed the levels and patterns, diversity and structure of genomic methylcytosine in the xylem of nine natural populations of Chinese white poplar. PRINCIPAL FINDINGS: O...

  8. Genome-wide association study identified CNP12587 region underlying height variation in Chinese females.

    Directory of Open Access Journals (Sweden)

    Yin-Ping Zhang

    Full Text Available INTRODUCTION: Human height is a highly heritable trait considered as an important factor for health. There has been limited success in identifying the genetic factors underlying height variation. We aim to identify sequence variants associated with adult height by a genome-wide association study of copy number variants (CNVs in Chinese. METHODS: Genome-wide CNV association analyses were conducted in 1,625 unrelated Chinese adults and sex specific subgroup for height variation, respectively. Height was measured with a stadiometer. Affymetrix SNP6.0 genotyping platform was used to identify copy number polymorphisms (CNPs. We constructed a genomic map containing 1,009 CNPs in Chinese individuals and performed a genome-wide association study of CNPs with height. RESULTS: We detected 10 significant association signals for height (p<0.05 in the whole population, 9 and 11 association signals for Chinese female and male population, respectively. A copy number polymorphism (CNP12587, chr18:54081842-54086942, p = 2.41 × 10(-4 was found to be significantly associated with height variation in Chinese females even after strict Bonferroni correction (p = 0.048. Confirmatory real time PCR experiments lent further support for CNV validation. Compared to female subjects with two copies of the CNP, carriers of three copies had an average of 8.1% decrease in height. An important candidate gene, ubiquitin-protein ligase NEDD4-like (NEDD4L, was detected at this region, which plays important roles in bone metabolism by binding to bone formation regulators. CONCLUSIONS: Our findings suggest the important genetic variants underlying height variation in Chinese.

  9. Whole Genome Analysis of 132 Clinical Saccharomyces cerevisiae Strains Reveals Extensive Ploidy Variation

    Science.gov (United States)

    Zhu, Yuan O.; Sherlock, Gavin; Petrov, Dmitri A.

    2016-01-01

    Budding yeast has undergone several independent transitions from commercial to clinical lifestyles. The frequency of such transitions suggests that clinical yeast strains are derived from environmentally available yeast populations, including commercial sources. However, despite their important role in adaptive evolution, the prevalence of polyploidy and aneuploidy has not been extensively analyzed in clinical strains. In this study, we have looked for patterns governing the transition to clinical invasion in the largest screen of clinical yeast isolates to date. In particular, we have focused on the hypothesis that ploidy changes have influenced adaptive processes. We sequenced 144 yeast strains, 132 of which are clinical isolates. We found pervasive large-scale genomic variation in both overall ploidy (34% of strains identified as 3n/4n) and individual chromosomal copy numbers (36% of strains identified as aneuploid). We also found evidence for the highly dynamic nature of yeast genomes, with 35 strains showing partial chromosomal copy number changes and eight strains showing multiple independent chromosomal events. Intriguingly, a lineage identified to be baker’s/commercial derived with a unique damaging mutation in NDC80 was particularly prone to polyploidy, with 83% of its members being triploid or tetraploid. Polyploidy was in turn associated with a >2× increase in aneuploidy rates as compared to other lineages. This dataset provides a rich source of information on the genomics of clinical yeast strains and highlights the potential importance of large-scale genomic copy variation in yeast adaptation. PMID:27317778

  10. Exploration of presence/absence variation and corresponding polymorphic markers in soybean genome

    Institute of Scientific and Technical Information of China (English)

    Yufeng Wang; Tuanjie Zhao; Junyi Gai; Jiangjie Lu; Shouyi Chen; Liping Shu; Reid GPalmer; Guangnan Xing; Yan Li; Shouping Yang; Deyue Yu

    2014-01-01

    This study was designed to reveal the genome-wide distribution of presence/absence variation (PAV) and to establish a database of polymorphic PAV markers in soybean. The 33 soybean whole-genome sequences were compared to each other with that of Wil iams 82 as a reference genome. A total of 33,127 PAVs were detected and 28,912 PAV markers with their primer sequences were designed as the database NJAUSoyPAV_1.0. The PAVs scattered on whole genome while only 518 (1.8%) over-lapped with simple sequence repeats (SSRs) in BARCSOYSSR_1.0 database. In a random sample of 800 PAVs, 713 (89.13%) showed polymorphism among the 12 differential genotypes. Using 126 PAVs and 108 SSRs to test a Chinese soybean germplasm col ection composed of 828 Glycine soja Sieb. et Zucc. and Glycine max (L.) Merr. accessions, the per locus al ele number and its variation appeared less in PAVs than in SSRs. The distinctness among al eles/bands of PCR (polymerase chain reaction) products showed better in PAVs than in SSRs, potential in accurate marker-assisted al ele selection. The association mapping results showed SSR þ PAV was more powerful than any single marker systems. The NJAUSoyPAV_1.0 database has enriched the source of PCR markers, and may fit the materials with a range of per locus al ele numbers, if jointly used with SSR markers.

  11. Genomic and functional characteristics of copy number variations in Angus cattle selected for resistance or susceptibility to gastrointestinal nematodes

    Science.gov (United States)

    Genomic structural variation is an important and abundant source of genetic and phenotypic variation. We previously reported an initial analysis of copy number variations (CNVs) in Angus cattle selected for resistance or susceptibility to intestinal nematodes. In this study, we performed a large sca...

  12. A genomic overview of short genetic variations in a basal chordate, Ciona intestinalis

    Directory of Open Access Journals (Sweden)

    Satou Yutaka

    2012-05-01

    Full Text Available Abstract Background Although the Ciona intestinalis genome contains many allelic polymorphisms, there is only limited data analyzed systematically. Establishing a dense map of genetic variations in C. intestinalis is necessary not only for linkage analysis, but also for other experimental biology including molecular developmental and evolutionary studies, because animals from natural populations are typically used for experiments. Results Here, we identified over three million candidate short genomic variations within a 110 Mb euchromatin region among five C. intestinalis individuals. The average nucleotide diversity was approximately 1.1%. Genetic variations were found at a similar density in intergenic and gene regions. Non-synonymous and nonsense nucleotide substitutions were found in 12,493 and 1,214 genes accounting for 81.9% and 8.0% of the entire gene set, respectively, and over 60% of genes in the single animal encode non-identical proteins between maternal and paternal alleles. Conclusions Our results provide a framework for studying evolution of the animal genome, as well as a useful resource for a wide range of C. intestinalis researchers.

  13. Sample size planning for the coefficient of variation from the accuracy in parameter estimation approach.

    Science.gov (United States)

    Kelley, Ken

    2007-11-01

    The accuracy in parameter estimation approach to sample size planning is developed for the coefficient of variation, where the goal of the method is to obtain an accurate parameter estimate by achieving a sufficiently narrow confidence interval. The first method allows researchers to plan sample size so that the expected width of the confidence interval for the population coefficient of variation is sufficiently narrow. A modification allows a desired degree of assurance to be incorporated into the method, so that the obtained confidence interval will be sufficiently narrow with some specified probability (e.g., 85% assurance that the 95 confidence interval width will be no wider than to units). Tables of necessary sample size are provided for a variety of scenarios that may help researchers planning a study where the coefficient of variation is of interest plan an appropriate sample size in order to have a sufficiently narrow confidence interval, optionally with somespecified assurance of the confidence interval being sufficiently narrow. Freely available computer routines have been developed that allow researchers to easily implement all of the methods discussed in the article.

  14. Estimation of hominoid ancestral population sizes under bayesian coalescent models incorporating mutation rate variation and sequencing errors.

    Science.gov (United States)

    Burgess, Ralph; Yang, Ziheng

    2008-09-01

    Estimation of population parameters for the common ancestors of humans and the great apes is important in understanding our evolutionary history. In particular, inference of population size for the human-chimpanzee common ancestor may shed light on the process by which the 2 species separated and on whether the human population experienced a severe size reduction in its early evolutionary history. In this study, the Bayesian method of ancestral inference of Rannala and Yang (2003. Bayes estimation of species divergence times and ancestral population sizes using DNA sequences from multiple loci. Genetics. 164:1645-1656) was extended to accommodate variable mutation rates among loci and random species-specific sequencing errors. The model was applied to analyze a genome-wide data set of approximately 15,000 neutral loci (7.4 Mb) aligned for human, chimpanzee, gorilla, orangutan, and macaque. We obtained robust and precise estimates for effective population sizes along the hominoid lineage extending back approximately 30 Myr to the cercopithecoid divergence. The results showed that ancestral populations were 5-10 times larger than modern humans along the entire hominoid lineage. The estimates were robust to the priors used and to model assumptions about recombination. The unusually low X chromosome divergence between human and chimpanzee could not be explained by variation in the male mutation bias or by current models of hybridization and introgression. Instead, our parameter estimates were consistent with a simple instantaneous process for human-chimpanzee speciation but showed a major reduction in X chromosome effective population size peculiar to the human-chimpanzee common ancestor, possibly due to selective sweeps on the X prior to separation of the 2 species.

  15. Analysis of genetic variation and potential applications in genome-scale metabolic modeling

    Directory of Open Access Journals (Sweden)

    João Gonçalo Rocha Cardoso

    2015-02-01

    Full Text Available Genetic variation is the motor of evolution and allows organisms to overcome the environmental challenges they encounter. It can be both beneficial and harmful in the process of engineering cell factories for the production of proteins and chemicals. Throughout the history of biotechnology, there have been efforts to exploit genetic variation in our favor to create strains with favorable phenotypes. Genetic variation can either be present in natural populations or it can be artificially created by mutagenesis and selection or adaptive laboratory evolution. On the other hand, unintended genetic variation during a long term production process may lead to significant economic losses and it is important to understand how to control this type of variation. With the emergence of next-generation sequencing technologies, genetic variation in microbial strains can now be determined on an unprecedented scale and resolution by re-sequencing thousands of strains systematically. In this article, we review challenges in the integration and analysis of large-scale re-sequencing data, present an extensive overview of bioinformatics methods for predicting the effects of genetic variants on protein function, and discuss approaches for interfacing existing bioinformatics approaches with genome-scale models of cellular processes in order to predict effects of sequence variation on cellular phenotypes.

  16. Expression, tandem repeat copy number variation and stability of four macrosatellite arrays in the human genome

    Directory of Open Access Journals (Sweden)

    Chadwick Brian P

    2010-11-01

    Full Text Available Abstract Background Macrosatellites are some of the largest variable number tandem repeats in the human genome, but what role these unusual sequences perform is unknown. Their importance to human health is clearly demonstrated by the 4q35 macrosatellite D4Z4 that is associated with the onset of the muscle degenerative disease facioscapulohumeral muscular dystrophy. Nevertheless, many other macrosatellite arrays in the human genome remain poorly characterized. Results Here we describe the organization, tandem repeat copy number variation, transmission stability and expression of four macrosatellite arrays in the human genome: the TAF11-Like array located on chromosomes 5p15.1, the SST1 arrays on 4q28.3 and 19q13.12, the PRR20 array located on chromosome 13q21.1, and the ZAV array at 9q32. All are polymorphic macrosatellite arrays that at least for TAF11-Like and SST1 show evidence of meiotic instability. With the exception of the SST1 array that is ubiquitously expressed, all are expressed at high levels in the testis and to a lesser extent in the brain. Conclusions Our results extend the number of characterized macrosatellite arrays in the human genome and provide the foundation for formulation of hypotheses to begin assessing their functional role in the human genome.

  17. Habitat area and climate stability determine geographical variation in plant species range sizes.

    Science.gov (United States)

    Morueta-Holme, Naia; Enquist, Brian J; McGill, Brian J; Boyle, Brad; Jørgensen, Peter M; Ott, Jeffrey E; Peet, Robert K; Símová, Irena; Sloat, Lindsey L; Thiers, Barbara; Violle, Cyrille; Wiser, Susan K; Dolins, Steven; Donoghue, John C; Kraft, Nathan J B; Regetz, Jim; Schildhauer, Mark; Spencer, Nick; Svenning, Jens-Christian

    2013-12-01

    Despite being a fundamental aspect of biodiversity, little is known about what controls species range sizes. This is especially the case for hyperdiverse organisms such as plants. We use the largest botanical data set assembled to date to quantify geographical variation in range size for ~ 85 000 plant species across the New World. We assess prominent hypothesised range-size controls, finding that plant range sizes are codetermined by habitat area and long- and short-term climate stability. Strong short- and long-term climate instability in large parts of North America, including past glaciations, are associated with broad-ranged species. In contrast, small habitat areas and a stable climate characterise areas with high concentrations of small-ranged species in the Andes, Central America and the Brazilian Atlantic Rainforest region. The joint roles of area and climate stability strengthen concerns over the potential effects of future climate change and habitat loss on biodiversity.

  18. Habitat area and climate stability determine geographical variation in plant species range sizes

    DEFF Research Database (Denmark)

    Morueta-Holme, Naia; Enquist, Brian J.; McGill, Brian J.

    2013-01-01

    Despite being a fundamental aspect of biodiversity, little is known about what controls species range sizes. This is especially the case for hyperdiverse organisms such as plants. We use the largest botanical data set assembled to date to quantify geographical variation in range size for ~85,000 ...... concerns over the potential effects of future climate change and habitat loss on biodiversity.......,000 plant species across the New World. We assess prominent hypothesised range-size controls, finding that plant range sizes are codetermined by habitat area and long- and short-term climate stability. Strong short- and long-term climate instability in large parts of North America, including past...... glaciations, are associated with broad-ranged species. In contrast, small habitat areas and a stable climate characterise areas with high concentrations of small-ranged species in the Andes, Central America and the Brazilian Atlantic Rainforest region. The joint roles of area and climate stability strengthen...

  19. Connecting Anxiety and Genomic Copy Number Variation: A Genome-Wide Analysis in CD-1 Mice.

    Directory of Open Access Journals (Sweden)

    Julia Brenndörfer

    Full Text Available Genomic copy number variants (CNVs have been implicated in multiple psychiatric disorders, but not much is known about their influence on anxiety disorders specifically. Using next-generation sequencing (NGS and two additional array-based genotyping approaches, we detected CNVs in a mouse model consisting of two inbred mouse lines showing high (HAB and low (LAB anxiety-related behavior, respectively. An influence of CNVs on gene expression in the central (CeA and basolateral (BLA amygdala, paraventricular nucleus (PVN, and cingulate cortex (Cg was shown by a two-proportion Z-test (p = 1.6 x 10-31, with a positive correlation in the CeA (p = 0.0062, PVN (p = 0.0046 and Cg (p = 0.0114, indicating a contribution of CNVs to the genetic predisposition to trait anxiety in the specific context of HAB/LAB mice. In order to confirm anxiety-relevant CNVs and corresponding genes in a second mouse model, we further examined CD-1 outbred mice. We revealed the distribution of CNVs by genotyping 64 CD 1 individuals using a high-density genotyping array (Jackson Laboratory. 78 genes within those CNVs were identified to show nominally significant association (48 genes, or a statistical trend in their association (30 genes with the time animals spent on the open arms of the elevated plus-maze (EPM. Fifteen of them were considered promising candidate genes of anxiety-related behavior as we could show a significant overlap (permutation test, p = 0.0051 with genes within HAB/LAB CNVs. Thus, here we provide what is to our knowledge the first extensive catalogue of CNVs in CD-1 mice and potential corresponding candidate genes linked to anxiety-related behavior in mice.

  20. Genome variations associated with viral susceptibility and calcification in Emiliania huxleyi.

    Directory of Open Access Journals (Sweden)

    Jessica U Kegel

    Full Text Available Emiliania huxleyi, a key player in the global carbon cycle is one of the best studied coccolithophores with respect to biogeochemical cycles, climatology, and host-virus interactions. Strains of E. huxleyi show phenotypic plasticity regarding growth behaviour, light-response, calcification, acidification, and virus susceptibility. This phenomenon is likely a consequence of genomic differences, or transcriptomic responses, to environmental conditions or threats such as viral infections. We used an E. huxleyi genome microarray based on the sequenced strain CCMP1516 (reference strain to perform comparative genomic hybridizations (CGH of 16 E. huxleyi strains of different geographic origin. We investigated the genomic diversity and plasticity and focused on the identification of genes related to virus susceptibility and coccolith production (calcification. Among the tested 31940 gene models a core genome of 14628 genes was identified by hybridization among 16 E. huxleyi strains. 224 probes were characterized as specific for the reference strain CCMP1516. Compared to the sequenced E. huxleyi strain CCMP1516 variation in gene content of up to 30 percent among strains was observed. Comparison of core and non-core transcripts sets in terms of annotated functions reveals a broad, almost equal functional coverage over all KOG-categories of both transcript sets within the whole annotated genome. Within the variable (non-core genome we identified genes associated with virus susceptibility and calcification. Genes associated with virus susceptibility include a Bax inhibitor-1 protein, three LRR receptor-like protein kinases, and mitogen-activated protein kinase. Our list of transcripts associated with coccolith production will stimulate further research, e.g. by genetic manipulation. In particular, the V-type proton ATPase 16 kDa proteolipid subunit is proposed to be a plausible target gene for further calcification studies.

  1. Effects of sample size and intraspecific variation in phylogenetic comparative studies: a meta-analytic review.

    Science.gov (United States)

    Garamszegi, László Z; Møller, Anders P

    2010-11-01

    Comparative analyses aim to explain interspecific variation in phenotype among taxa. In this context, phylogenetic approaches are generally applied to control for similarity due to common descent, because such phylogenetic relationships can produce spurious similarity in phenotypes (known as phylogenetic inertia or bias). On the other hand, these analyses largely ignore potential biases due to within-species variation. Phylogenetic comparative studies inherently assume that species-specific means from intraspecific samples of modest sample size are biologically meaningful. However, within-species variation is often significant, because measurement errors, within- and between-individual variation, seasonal fluctuations, and differences among populations can all reduce the repeatability of a trait. Although simulations revealed that low repeatability can increase the type I error in a phylogenetic study, researchers only exercise great care in accounting for similarity in phenotype due to common phylogenetic descent, while problems posed by intraspecific variation are usually neglected. A meta-analysis of 194 comparative analyses all adjusting for similarity due to common phylogenetic descent revealed that only a few studies reported intraspecific repeatabilities, and hardly any considered or partially dealt with errors arising from intraspecific variation. This is intriguing, because the meta-analytic data suggest that the effect of heterogeneous sampling can be as important as phylogenetic bias, and thus they should be equally controlled in comparative studies. We provide recommendations about how to handle such effects of heterogeneous sampling.

  2. The Genetic Basis of Baculum Size and Shape Variation in Mice

    Science.gov (United States)

    Schultz, Nicholas G.; Ingels, Jesse; Hillhouse, Andrew; Wardwell, Keegan; Chang, Peter L.; Cheverud, James M.; Lutz, Cathleen; Lu, Lu; Williams, Robert W.; Dean, Matthew D.

    2016-01-01

    The rapid divergence of male genitalia is a preeminent evolutionary pattern. This rapid divergence is especially striking in the baculum, a bone that occurs in the penis of many mammalian species. Closely related species often display diverse baculum morphology where no other morphological differences can be discerned. While this fundamental pattern of evolution has been appreciated at the level of gross morphology, nearly nothing is known about the genetic basis of size and shape divergence. Quantifying the genetic basis of baculum size and shape variation has been difficult because these structures generally lack obvious landmarks, so comparing them in three dimensions is not straightforward. Here, we develop a novel morphometric approach to quantify size and shape variation from three-dimensional micro-CT scans taken from 369 bacula, representing 75 distinct strains of the BXD family of mice. We identify two quantitative trait loci (QTL) that explain ∼50% of the variance in baculum size, and a third QTL that explains more than 20% of the variance in shape. Together, our study demonstrates that baculum morphology may diverge relatively easily, with mutations at a few loci of large effect that independently modulate size and shape. Based on a combination of bioinformatic investigations and new data on RNA expression, we prioritized these QTL to 16 candidate genes, which have hypothesized roles in bone morphogenesis and may enable future genetic manipulation of baculum morphology. PMID:26935419

  3. The Genetic Basis of Baculum Size and Shape Variation in Mice

    Directory of Open Access Journals (Sweden)

    Nicholas G. Schultz

    2016-05-01

    Full Text Available The rapid divergence of male genitalia is a preeminent evolutionary pattern. This rapid divergence is especially striking in the baculum, a bone that occurs in the penis of many mammalian species. Closely related species often display diverse baculum morphology where no other morphological differences can be discerned. While this fundamental pattern of evolution has been appreciated at the level of gross morphology, nearly nothing is known about the genetic basis of size and shape divergence. Quantifying the genetic basis of baculum size and shape variation has been difficult because these structures generally lack obvious landmarks, so comparing them in three dimensions is not straightforward. Here, we develop a novel morphometric approach to quantify size and shape variation from three-dimensional micro-CT scans taken from 369 bacula, representing 75 distinct strains of the BXD family of mice. We identify two quantitative trait loci (QTL that explain ∼50% of the variance in baculum size, and a third QTL that explains more than 20% of the variance in shape. Together, our study demonstrates that baculum morphology may diverge relatively easily, with mutations at a few loci of large effect that independently modulate size and shape. Based on a combination of bioinformatic investigations and new data on RNA expression, we prioritized these QTL to 16 candidate genes, which have hypothesized roles in bone morphogenesis and may enable future genetic manipulation of baculum morphology.

  4. The Genetic Basis of Baculum Size and Shape Variation in Mice.

    Science.gov (United States)

    Schultz, Nicholas G; Ingels, Jesse; Hillhouse, Andrew; Wardwell, Keegan; Chang, Peter L; Cheverud, James M; Lutz, Cathleen; Lu, Lu; Williams, Robert W; Dean, Matthew D

    2016-05-03

    The rapid divergence of male genitalia is a preeminent evolutionary pattern. This rapid divergence is especially striking in the baculum, a bone that occurs in the penis of many mammalian species. Closely related species often display diverse baculum morphology where no other morphological differences can be discerned. While this fundamental pattern of evolution has been appreciated at the level of gross morphology, nearly nothing is known about the genetic basis of size and shape divergence. Quantifying the genetic basis of baculum size and shape variation has been difficult because these structures generally lack obvious landmarks, so comparing them in three dimensions is not straightforward. Here, we develop a novel morphometric approach to quantify size and shape variation from three-dimensional micro-CT scans taken from 369 bacula, representing 75 distinct strains of the BXD family of mice. We identify two quantitative trait loci (QTL) that explain ∼50% of the variance in baculum size, and a third QTL that explains more than 20% of the variance in shape. Together, our study demonstrates that baculum morphology may diverge relatively easily, with mutations at a few loci of large effect that independently modulate size and shape. Based on a combination of bioinformatic investigations and new data on RNA expression, we prioritized these QTL to 16 candidate genes, which have hypothesized roles in bone morphogenesis and may enable future genetic manipulation of baculum morphology.

  5. Variations in basic demographics consequential to population size of governorate in Saudi Arabia.

    Science.gov (United States)

    Khraif, Rshood; Salam, Asharaf Abdul; Potty, Rajaram Subramanian; Aldosari, Ali; Elsegaey, Ibrahim; AlMutairi, Abdullah

    2016-01-01

    Saudi Arabia, divided into 5 planning regions, 13 administrative regions and further to 118 governorates (administrative units), has diverse demographic characteristics from one region to another and from one governorate to another. Rural to urban migration and an exodus of immigrants characterize the Kingdom, where development planning depend largely upon local level requirements based on economic activities. An attempt was made to analyze the population characteristics, such as population size, sex ratio, native to foreigner ratio, and households and persons per households by keeping governorate as unit of analysis. Data of two census period (2004 and 2010) was used in order to explore the situation and track the intercensal changes. Large variations in population were observed between governorates and it varied from 3686 to 5,007,886 in 2010. Governorates are divided according to the number of native population demarcating urbanization, modernization and infrastructure. During the intercensal period, the number of small governorates reduced and medium and large sized governorates increased mainly due to population growth. The average population in governorates was increased in total and in the larger governorates during the period. However, we noticed a reduction in the average population size in the small and medium sized governorates. The size of native population in a governorate influences the sex ratio, the native-foreigner ratio and the persons per household as well as the variations within the group of governorates. Analyses of lower level data shall aid not only to understand the situation but also to support local development policies.

  6. Epigenetic Variation in Monozygotic Twins: A Genome-Wide Analysis of DNA Methylation in Buccal Cells

    Directory of Open Access Journals (Sweden)

    Jenny van Dongen

    2014-05-01

    Full Text Available DNA methylation is one of the most extensively studied epigenetic marks in humans. Yet, it is largely unknown what causes variation in DNA methylation between individuals. The comparison of DNA methylation profiles of monozygotic (MZ twins offers a unique experimental design to examine the extent to which such variation is related to individual-specific environmental influences and stochastic events or to familial factors (DNA sequence and shared environment. We measured genome-wide DNA methylation in buccal samples from ten MZ pairs (age 8–19 using the Illumina 450k array and examined twin correlations for methylation level at 420,921 CpGs after QC. After selecting CpGs showing the most variation in the methylation level between subjects, the mean genome-wide correlation (rho was 0.54. The correlation was higher, on average, for CpGs within CpG islands (CGIs, compared to CGI shores, shelves and non-CGI regions, particularly at hypomethylated CpGs. This finding suggests that individual-specific environmental and stochastic influences account for more variation in DNA methylation in CpG-poor regions. Our findings also indicate that it is worthwhile to examine heritable and shared environmental influences on buccal DNA methylation in larger studies that also include dizygotic twins.

  7. Variation in genomic methylation in natural populations of chinese white poplar.

    Directory of Open Access Journals (Sweden)

    Kaifeng Ma

    Full Text Available BACKGROUND: It is thought that methylcytosine can be inherited through meiosis and mitosis, and that epigenetic variation may be under genetic control or correlation may be caused by neutral drift. However, DNA methylation also varies with tissue, developmental stage, and environmental factors. Eliminating these factors, we analyzed the levels and patterns, diversity and structure of genomic methylcytosine in the xylem of nine natural populations of Chinese white poplar. PRINCIPAL FINDINGS: On average, the relative total methylation and non-methylation levels were approximately 26.567% and 42.708% (P<0.001, respectively. Also, the relative CNG methylation level was higher than the relative CG methylation level. The relative methylation/non-methylation levels were significantly different among the nine natural populations. Epigenetic diversity ranged from 0.811 (Gansu to 1.211 (Shaanxi, and the coefficients of epigenetic differentiation (GST  = 0.159 were assessed by Shannon's diversity index. Co-inertia analysis indicated that methylation-sensitive polymorphism (MSP and genomic methylation pattern (CG-CNG profiles gave similar distributions. Using a between-group eigen analysis, we found that the Hebei and Shanxi populations were independent of each other, but the Henan population intersected with the other populations, to some degree. CONCLUSIONS: Genome methylation in Populus tomentosa presented tissue-specific characteristics and the relative 5'-CCGG methylation level was higher in xylem than in leaves. Meanwhile, the genome methylation in the xylem shows great epigenetic variation and could be fixed and inherited though mitosis. Compared to genetic structure, data suggest that epigenetic and genetic variation do not completely match.

  8. Genomic landscape of copy number variation and copy neutral loss of heterozygosity events in equine sarcoids reveals increased instability of the sarcoid genome.

    Science.gov (United States)

    Pawlina-Tyszko, Klaudia; Gurgul, Artur; Szmatoła, Tomasz; Ropka-Molik, Katarzyna; Semik-Gurgul, Ewelina; Klukowska-Rötzler, Jolanta; Koch, Christoph; Mählmann, Kathrin; Bugno-Poniewierska, Monika

    2017-09-01

    Although they are the most common neoplasms in equids, sarcoids are not fully characterized at the molecular level. Therefore, the objective of this study was to characterize the landscape of structural rearrangements, such as copy number variation (CNV) and copy neutral loss of heterozygosity (cnLOH), in the genomes of sarcoid tumor cells. This information will not only broaden our understanding of the characteristics of this genome but will also improve the general knowledge of this tumor and the mechanisms involved in its generation. To this end, Equine SNP64K Illumina microarrays were applied along with bioinformatics tools dedicated for signal intensity analysis. The analysis revealed increased instability of the genome of sarcoid cells compared with unaltered skin tissue samples, which was manifested by the prevalence of CNV and cnLOH events. Many of the identified CNVs overlapped with the other research results, but the simultaneously observed variability in the number and sizes of detected aberrations indicated a need for further studies and the development of more reliable bioinformatics algorithms. The functional analysis of genes co-localized with the identified aberrations revealed that these genes are engaged in vital cellular processes. In addition, a number of these genes directly contribute to neoplastic transformation. Furthermore, large numbers of cnLOH events identified in the sarcoids suggested that they may play no less significant roles than CNVs in the carcinogenesis of this tumor. Thus, our results indicate the importance of cnLOH and CNV in equine sarcoid oncogenesis and present a direction of future research. Copyright © 2017 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.

  9. MicroRNAs and genomic variations: from Proteus tricks to Prometheus gift.

    Science.gov (United States)

    Fabbri, Muller; Valeri, Nicola; Calin, George A

    2009-06-01

    MicroRNAs (miRNAs) are small non-coding RNAs with regulatory functions. MiRNAs are aberrantly expressed in almost all human cancers, leading to abnormal levels of target genes. Recently, an increasing number of studies have addressed whether genomic variations including germ line or somatic mutations and single-nucleotide polymorphisms can count for miRNA abnormal expression by altering their biogenesis and/or affect the ability of miRNAs to bind to target messenger RNAs. Here, we provide an extensive review of the studies that have investigated variations occurring both in miRNA genes and in target genes and we discuss the possible clinical implications of these findings. Furthermore, we propose that sequence variations in miRNAs or interactor sites located in mRNAs can be involved in cancer predisposition.

  10. Egg size variation among tropical and temperate songbirds: An embryonic temperature hypothesis

    Science.gov (United States)

    Martin, T.E.

    2008-01-01

    Species with 'slow' life history strategies (long life, low fecundity) are thought to produce high-quality offspring by investing in larger, but fewer, young. Larger eggs are indeed associated with fewer eggs across taxa and can yield higher-quality offspring. Tropical passerines appear to follow theory because they commonly exhibit slow life history strategies and produce larger, but fewer, eggs compared with northern species. Yet, I show here that relative egg mass (corrected for adult mass) varies extensively in the tropics and subtropics for the same clutch size, and this variation is unexplained. I propose a hypothesis to explain egg size variation both within the tropics and between latitudes: Relative egg mass increases in species with cooler egg temperatures and longer embryonic periods to offset associated increases in energetic requirements of embryos. Egg temperatures of birds are determined by parental incubation behavior and are often cooler among tropical passerines because of reduced parental attentiveness of eggs. Here, I show that cooler egg temperatures and longer embryonic periods explained the enigmatic variation in egg mass within and among regions, based on field studies in tropical Venezuela (36 species), subtropical Argentina (16 species), and north temperate Arizona (20 species). Alternative explanations are not supported. Thus, large egg sizes may reflect compensation for increased energetic requirements of cool egg temperatures and long embryonic periods that result from reduced parental attentiveness in tropical birds. ?? 2008 by The National Academy of Sciences of the USA.

  11. Co-variation of metabolic rates and cell-size in coccolithophores

    Directory of Open Access Journals (Sweden)

    G. Aloisi

    2015-04-01

    , the model is able to reproduce the co-variation of growth rate and cell size observed in the laboratory when these nutrients become limiting. These results support ongoing efforts to interpret coccosphere and coccolith size measurements in the context of climate change.

  12. Multi-decadal variation in size of juvenile Summer Flounder (Paralichthys dentatus) in Chesapeake Bay

    Science.gov (United States)

    Nys, Lauren N.; Fabrizio, Mary C.; Tuckey, Troy D.

    2016-01-01

    During the last quarter-century, management of Summer Flounder Paralichthys dentatus along the Atlantic coast resulted in significant increases in abundance such that rebuilding targets were recently achieved. Although spawning stock biomass is high, recruitment of young-of-the-year (YOY) Summer Flounder remains variable. Chesapeake Bay is one of the principal nursery areas for this species, but processes such as growth and survival that affect production of YOY Summer Flounder in this estuary have not been explored. Here, we investigated the relationship between abundance and size of Summer Flounder recruits from the 1988 to 2012 year classes in Chesapeake Bay. We also considered the effects of environmental factors on fish size because conditions in the bay vary spatially during the time that fish occupy nursery areas. To describe variations in Summer Flounder size, we used monthly length observations from 13,018 YOY fish captured by bottom trawl from the lower Chesapeake Bay and the James, York, and Rappahannock river subestuaries where Summer Flounder are commonly observed. We applied a generalized additive model to describe spatial, temporal, and environmental effects on observed fish size; we also considered the density of Summer Flounder and an index of productivity as factors in the model. Summer Flounder in Chesapeake Bay exhibited density-dependent and spatially related variations in mean length: larger fish were found mostly in the Bay and smaller fish in the subestuaries. Additionally, low ( 26 °C) temperatures and low salinities (fish size, indicating that individuals found in these environments were typically smaller than conspecifics inhabiting areas of moderate temperatures and higher salinities. Variable nursery habitat conditions in temperate estuaries affect fish size and, subsequently, may influence production of Summer Flounder year classes through effects on maturation and survival. As water temperatures in the mid-Atlantic region continue to

  13. Insights into the Dekkera bruxellensis genomic landscape: comparative genomics reveals variations in ploidy and nutrient utilisation potential amongst wine isolates.

    Directory of Open Access Journals (Sweden)

    Anthony R Borneman

    2014-02-01

    Full Text Available The yeast Dekkera bruxellensis is a major contaminant of industrial fermentations, such as those used for the production of biofuel and wine, where it outlasts and, under some conditions, outcompetes the major industrial yeast Saccharomyces cerevisiae. In order to investigate the level of inter-strain variation that is present within this economically important species, the genomes of four diverse D. bruxellensis isolates were compared. While each of the four strains was shown to contain a core diploid genome, which is clearly sufficient for survival, two of the four isolates have a third haploid complement of chromosomes. The sequences of these additional haploid genomes were both highly divergent from those comprising the diploid core and divergent between the two triploid strains. Similar to examples in the Saccharomyces spp. clade, where some allotriploids have arisen on the basis of enhanced ability to survive a range of environmental conditions, it is likely these strains are products of two independent hybridisation events that may have involved multiple species or distinct sub-species of Dekkera. Interestingly these triploid strains represent the vast majority (92% of isolates from across the Australian wine industry, suggesting that the additional set of chromosomes may confer a selective advantage in winery environments that has resulted in these hybrid strains all-but replacing their diploid counterparts in Australian winery settings. In addition to the apparent inter-specific hybridisation events, chromosomal aberrations such as strain-specific insertions and deletions and loss-of-heterozygosity by gene conversion were also commonplace. While these events are likely to have affected many phenotypes across these strains, we have been able to link a specific deletion to the inability to utilise nitrate by some strains of D. bruxellensis, a phenotype that may have direct impacts in the ability for these strains to compete with S

  14. Insights into the Dekkera bruxellensis genomic landscape: comparative genomics reveals variations in ploidy and nutrient utilisation potential amongst wine isolates.

    Science.gov (United States)

    Borneman, Anthony R; Zeppel, Ryan; Chambers, Paul J; Curtin, Chris D

    2014-02-01

    The yeast Dekkera bruxellensis is a major contaminant of industrial fermentations, such as those used for the production of biofuel and wine, where it outlasts and, under some conditions, outcompetes the major industrial yeast Saccharomyces cerevisiae. In order to investigate the level of inter-strain variation that is present within this economically important species, the genomes of four diverse D. bruxellensis isolates were compared. While each of the four strains was shown to contain a core diploid genome, which is clearly sufficient for survival, two of the four isolates have a third haploid complement of chromosomes. The sequences of these additional haploid genomes were both highly divergent from those comprising the diploid core and divergent between the two triploid strains. Similar to examples in the Saccharomyces spp. clade, where some allotriploids have arisen on the basis of enhanced ability to survive a range of environmental conditions, it is likely these strains are products of two independent hybridisation events that may have involved multiple species or distinct sub-species of Dekkera. Interestingly these triploid strains represent the vast majority (92%) of isolates from across the Australian wine industry, suggesting that the additional set of chromosomes may confer a selective advantage in winery environments that has resulted in these hybrid strains all-but replacing their diploid counterparts in Australian winery settings. In addition to the apparent inter-specific hybridisation events, chromosomal aberrations such as strain-specific insertions and deletions and loss-of-heterozygosity by gene conversion were also commonplace. While these events are likely to have affected many phenotypes across these strains, we have been able to link a specific deletion to the inability to utilise nitrate by some strains of D. bruxellensis, a phenotype that may have direct impacts in the ability for these strains to compete with S. cerevisiae.

  15. Poly(T) variation in heteroderid nematode mitochondrial genomes is predominantly an artefact of amplification.

    Science.gov (United States)

    Riepsamen, Angelique H; Gibson, Tracey; Rowe, Janet; Chitwood, David J; Subbotin, Sergei A; Dowton, Mark

    2011-02-01

    We assessed the rate of in vitro polymerase errors at polythymidine [poly(T)] tracts in the mitochondrial DNA (mtDNA) of a heteroderid nematode (Heterodera cajani). The mtDNA of these nematodes contain unusually high numbers of poly(T) tracts, and have previously been suggested to contain biological poly(T) length variation. However, using a cloned molecule, we observed that poly(T) variation was generated in vitro at regions containing more than six consecutive Ts. This artefactual error rate was estimated at 7.3 × 10(-5) indels/poly(T) tract >6 Ts/cycle. This rate was then compared to the rate of poly(T) variation detected after the amplification of a biological sample, in order to estimate the 'biological + artefactual' rate of poly(T) variation. There was no significant difference between the artefactual and the artefactual + biological rates, suggesting that the majority of poly(T) variation in the biological sample was artefactual. We then examined the generation of poly(T) variation in a range of templates with tracts up to 16 Ts long, utilizing a range of Heteroderidae species. We observed that T deletions occurred five times more frequently than insertions, and a trend towards increasing error rates with increasing poly(T) tract length. These findings have significant implications for studies involving genomes with many homopolymer tracts.

  16. Cytogenetics of Aspidogaster limacoides (Trematoda, Aspidogastrea): karyotype, spermatocyte division, and genome size.

    Science.gov (United States)

    Bombarová, Marta; Špakulová, Marta; Kello, Martin; Nguyen, Petr; Bazsalovicsová, Eva; Králová-Hromadová, Ivica

    2015-04-01

    A detailed cytogenetic analysis of the aspidogastrean fluke Aspidogaster limacoides revealed a karyotype consisting of six medium-sized chromosome pairs. The first and the last pairs were two-armed while four remaining were one-armed; 2n = 12, n = 1 m + 1 m - sm + 4a. Fluorescence in situ hybridization with 18S ribosomal DNA (rDNA) probe detected a single cluster of ribosomal genes (NOR) located in pericentromeric regions of the long arms of the third chromosome pair in a site of secondary constriction apparent in meiotic prophase, especially in diplotene. The silver nitrate staining showed only a single active NOR site on one of homologous chromosomes in the majority of spermatogonia and spermatocyte divisions. A course of meiosis corresponded to standard schemes. The nucleolus was apparent in early meiotic spermatocytes and disintegrated by the end of pachytene. For the first time in Aspidogastrea, the genome size was determined. The flow cytometry showed 1.21 pg DNA per haploid nucleus in A. limacoides which is in accordance with relatively low genome sizes of other flukes and tapeworms (Neodermata). A comparison of cytogenetic data available to date in the fluke sister groups Aspidogastrea and Digenea suggests that the lower chromosome number of Aspidogastrea might represent an ancestral condition and their split might have been accompanied by an increase in chromosome number via either chromosome fissions or paleopolyploidy.

  17. Population genomics of eusocial insects: the costs of a vertebrate-like effective population size.

    Science.gov (United States)

    Romiguier, J; Lourenco, J; Gayral, P; Faivre, N; Weinert, L A; Ravel, S; Ballenghien, M; Cahais, V; Bernard, A; Loire, E; Keller, L; Galtier, N

    2014-03-01

    The evolution of reproductive division of labour and social life in social insects has lead to the emergence of several life-history traits and adaptations typical of larger organisms: social insect colonies can reach masses of several kilograms, they start reproducing only when they are several years old, and can live for decades. These features and the monopolization of reproduction by only one or few individuals in a colony should affect molecular evolution by reducing the effective population size. We tested this prediction by analysing genome-wide patterns of coding sequence polymorphism and divergence in eusocial vs. noneusocial insects based on newly generated RNA-seq data. We report very low amounts of genetic polymorphism and an elevated ratio of nonsynonymous to synonymous changes – a marker of the effective population size – in four distinct species of eusocial insects, which were more similar to vertebrates than to solitary insects regarding molecular evolutionary processes. Moreover, the ratio of nonsynonymous to synonymous substitutions was positively correlated with the level of social complexity across ant species. These results are fully consistent with the hypothesis of a reduced effective population size and an increased genetic load in eusocial insects, indicating that the evolution of social life has important consequences at both the genomic and population levels. © 2014 The Authors. Journal of Evolutionary Biology © 2014 European Society For Evolutionary Biology.

  18. An initial comparative map of copy number variations in the goat (Capra hircus genome

    Directory of Open Access Journals (Sweden)

    Casadio Rita

    2010-11-01

    Full Text Available Abstract Background The goat (Capra hircus represents one of the most important farm animal species. It is reared in all continents with an estimated world population of about 800 million of animals. Despite its importance, studies on the goat genome are still in their infancy compared to those in other farm animal species. Comparative mapping between cattle and goat showed only a few rearrangements in agreement with the similarity of chromosome banding. We carried out a cross species cattle-goat array comparative genome hybridization (aCGH experiment in order to identify copy number variations (CNVs in the goat genome analysing animals of different breeds (Saanen, Camosciata delle Alpi, Girgentana, and Murciano-Granadina using a tiling oligonucleotide array with ~385,000 probes designed on the bovine genome. Results We identified a total of 161 CNVs (an average of 17.9 CNVs per goat, with the largest number in the Saanen breed and the lowest in the Camosciata delle Alpi goat. By aggregating overlapping CNVs identified in different animals we determined CNV regions (CNVRs: on the whole, we identified 127 CNVRs covering about 11.47 Mb of the virtual goat genome referred to the bovine genome (0.435% of the latter genome. These 127 CNVRs included 86 loss and 41 gain and ranged from about 24 kb to about 1.07 Mb with a mean and median equal to 90,292 bp and 49,530 bp, respectively. To evaluate whether the identified goat CNVRs overlap with those reported in the cattle genome, we compared our results with those obtained in four independent cattle experiments. Overlapping between goat and cattle CNVRs was highly significant (P Conclusions We describe a first map of goat CNVRs. This provides information on a comparative basis with the cattle genome by identifying putative recurrent interspecies CNVs between these two ruminant species. Several goat CNVs affect genes with important biological functions. Further studies are needed to evaluate the

  19. Temperature and developmental responses of body and cell size in Drosophila; effects of polyploidy and genome configuration.

    Science.gov (United States)

    Jalal, Marwa; Andersen, Tom; Hessen, Dag O

    2015-07-01

    Increased adult body size in Drosophila raised at lower temperatures could be attributed both to an increase in the cell volume and cell number. It is not clear, however, whether increased cell size is related to (or even caused by) increased nuclear volume and genome size (or configuration). Experiments with Drosophila melanogaster stocks (Oregon-R and w1118) raised at 16, 22, 24, and 28°C resulted in larger adult body and wing size with lower temperature, while eye size was less affected. The increase in wing size reflected an increase in cell size in both males and females of both stocks. The nucleus size, genome size, and DNA condensation of adult flies, embryos, and Schneider 2 cells (S2 cells, of larval origin) were estimated by flow cytometry. In both adult flies and S2 cells, both nucleus size and DNA condensation varied with temperature, while DNA content appears to be constant. From 12% to 18% of the somatic cells were tetraploid (4C) and 2-5% were octoploid (8C), and for the Oregon strain we observed an increase in the fraction of polyploid cells with decreasing temperature. The observed increase in body size (and wing size) at low temperatures could partly be linked with the cell size and DNA condensation, while corresponding changes in the haploid genome size were not observed. Copyright © 2015 Elsevier Ltd. All rights reserved.

  20. Genome-wide analysis of copy number variation in type 1 diabetes.

    Directory of Open Access Journals (Sweden)

    Britney L Grayson

    Full Text Available Type 1 diabetes (T1D tends to cluster in families, suggesting there may be a genetic component predisposing to disease. However, a recent large-scale genome-wide association study concluded that identified genetic factors, single nucleotide polymorphisms, do not account for overall familiality. Another class of genetic variation is the amplification or deletion of >1 kilobase segments of the genome, also termed copy number variations (CNVs. We performed genome-wide CNV analysis on a cohort of 20 unrelated adults with T1D and a control (Ctrl cohort of 20 subjects using the Affymetrix SNP Array 6.0 in combination with the Birdsuite copy number calling software. We identified 39 CNVs as enriched or depleted in T1D versus Ctrl. Additionally, we performed CNV analysis in a group of 10 monozygotic twin pairs discordant for T1D. Eleven of these 39 CNVs were also respectively enriched or depleted in the Twin cohort, suggesting that these variants may be involved in the development of islet autoimmunity, as the presently unaffected twin is at high risk for developing islet autoimmunity and T1D in his or her lifetime. These CNVs include a deletion on chromosome 6p21, near an HLA-DQ allele. CNVs were found that were both enriched or depleted in patients with or at high risk for developing T1D. These regions may represent genetic variants contributing to development of islet autoimmunity in T1D.

  1. HGD-Chn: The Database of Genome Diversity and Variation for Chinese Populations.

    Science.gov (United States)

    Hong-Sheng, Gui; Peng, Zhou; Cheng-Bo, Yang; Sheng-Bin, Li

    2009-04-01

    The Database of Genome Diversity and Variation for Chinese Populations is toward a more efficient utilization and sharing of the valuable yet diminishing genetic resources in China (including sample information of healthy populations, healthy pedigrees, disease population and disease pedigrees; genomic diversity data; disease-related allelic and haplotype data). Organization of the database can be divided into two parts: (1) Genetic resources of healthy people--Organizing genetic resources of healthy people. A variety of genetic markers (VNTR, STR, SNP, HLA, and enzyme markers, etc.) are chosen for their diversity among populations, with their distribution among different ethnic groups in China stored in the form of allelic frequency. A further analysis as well as an overall description of the Chinese population genetic structure is also being made possible. (2) Disease genetic resources--Four categories are mainly concerned: chromosomal diseases, monogenic diseases, polygenic diseases, and birth defects. For each kind of disease, the basic introduction and description, sample information, and allelic data of related gene are involved. Aside from research-oriented information, introductory courses oriented at general public covering fields of genomic diversity and variation, the related experimental techniques, standards and specifications could also be accessed in our website. Further more, flexible query and submit system with user-friendly interfaces are also integrated in our website to simplify the process of user-query and administrators' database maintenance work. Online data analyzing and managing tools are developed using bioinformatics algorithm and programming language for a better interpretation of the biological data.

  2. Effect of litter size on the variation in birth and weaning weights of Landrace piglets

    Directory of Open Access Journals (Sweden)

    Camila Duarte Prazeres

    2016-03-01

    Full Text Available The objective of this study was to evaluate the effect of the size class of the litter at birth on the variation in birth and weaning weights and on the survival rate of piglets from birth to weaning. For this purpose, records of individual weight at birth and weaning of piglets obtained from a database of 295 Landrace litters born between 2000 and 2010 on a pig farm in the western region of the State of Paraná were used. The litters were classified as small (up to 7 piglets, medium (8 to 13 piglets, and large (> 14 piglets according to the total number of piglets born. The data were analyzed considering the effects of the year of sow mating and size class of the litter at birth. The correlations between mean weight and variance in litter weight and size were higher for medium and large litters. The size class of the litter significantly influenced the mean weight of piglets at birth and weaning and the variance in birth weight. Piglets born in medium and large litters weighed less and exhibited greater birth weight variation and a lower survival rate until weaning than piglets born in small litters.

  3. Variation in avian brain shape: relationship with size and orbital shape.

    Science.gov (United States)

    Kawabe, Soichiro; Shimokawa, Tetsuya; Miki, Hitoshi; Matsuda, Seiji; Endo, Hideki

    2013-11-01

    There is wide variation in brain shape among birds. Differences in brain dimensions reflect species-specific sensory capacities and behavioral repertoires that are shaped by environmental and biological factors during evolution. Most previous studies aimed at defining factors impacting brain shape have used volumetric or linear measurements. However, few have explored the quantitative indices of three-dimensional (3D) brain geometry that are absolutely imperative to understanding avian evolutionary history. This study aimed: (i) to explore the relationship between brain shape and overall brain size; and (ii) to assess the relationship between brain shape and orbital shape. Avian brain endocasts were reconstructed from computed tomography images and analyzed using 3D geometric morphometrics. Principal component analysis revealed dominant regional variations in avian brain shape and shape correlations between the telencephalon and cerebellum, between the cerebellum and myelencephalon, and between the diencephalon and optic tectum. Brain shape changes relative to total brain size were determined by multivariate regression analysis. Larger brain size was associated with a relatively slender telencephalon and differences in brain orientation. The correlation between brain shape and orbital shape was assessed by two-block partial least-squares analysis. Relatively round brains with a ventrally flexed brain base were associated with rounder orbits, while narrower brains with a flat brain base were associated with more elongated orbits. The shapes of functionally associated avian brain regions are correlated, and orbital size and shape are dominant factors influencing the overall shape of the avian brain.

  4. PGen: large-scale genomic variations analysis workflow and browser in SoyKB.

    Science.gov (United States)

    Liu, Yang; Khan, Saad M; Wang, Juexin; Rynge, Mats; Zhang, Yuanxun; Zeng, Shuai; Chen, Shiyuan; Maldonado Dos Santos, Joao V; Valliyodan, Babu; Calyam, Prasad P; Merchant, Nirav; Nguyen, Henry T; Xu, Dong; Joshi, Trupti

    2016-10-06

    With the advances in next-generation sequencing (NGS) technology and significant reductions in sequencing costs, it is now possible to sequence large collections of germplasm in crops for detecting genome-scale genetic variations and to apply the knowledge towards improvements in traits. To efficiently facilitate large-scale NGS resequencing data analysis of genomic variations, we have developed "PGen", an integrated and optimized workflow using the Extreme Science and Engineering Discovery Environment (XSEDE) high-performance computing (HPC) virtual system, iPlant cloud data storage resources and Pegasus workflow management system (Pegasus-WMS). The workflow allows users to identify single nucleotide polymorphisms (SNPs) and insertion-deletions (indels), perform SNP annotations and conduct copy number variation analyses on multiple resequencing datasets in a user-friendly and seamless way. We have developed both a Linux version in GitHub ( https://github.com/pegasus-isi/PGen-GenomicVariations-Workflow ) and a web-based implementation of the PGen workflow integrated within the Soybean Knowledge Base (SoyKB), ( http://soykb.org/Pegasus/index.php ). Using PGen, we identified 10,218,140 single-nucleotide polymorphisms (SNPs) and 1,398,982 indels from analysis of 106 soybean lines sequenced at 15X coverage. 297,245 non-synonymous SNPs and 3330 copy number variation (CNV) regions were identified from this analysis. SNPs identified using PGen from additional soybean resequencing projects adding to 500+ soybean germplasm lines in total have been integrated. These SNPs are being utilized for trait improvement using genotype to phenotype prediction approaches developed in-house. In order to browse and access NGS data easily, we have also developed an NGS resequencing data browser ( http://soykb.org/NGS_Resequence/NGS_index.php ) within SoyKB to provide easy access to SNP and downstream analysis results for soybean researchers. PGen workflow has been optimized for the most

  5. Genome Sizes in Hepatica Mill: (Ranunculaceae Show a Loss of DNA, Not a Gain, in Polyploids

    Directory of Open Access Journals (Sweden)

    B. J. M. Zonneveld

    2010-01-01

    , and a possible pentaploid. The somatic nuclear DNA contents (2C-value, as measured by flow cytometry with propidium iodide, were shown to range from 33 to 80 pg. The Asiatic and American species, often considered subspecies of H. nobilis, could be clearly distinguished from European H. nobilis. DNA content confirmed the close relationships in the Asiatic species, and these are here considered as subspecies of H. asiatica. Parents for the allotetraploid species could be suggested based on their nuclear DNA content. Contrary to the increase in genome size suggested earlier for Hepatica, a significant (6%–14% loss of nuclear DNA in the natural allopolyploids was found.

  6. Transposon Insertions, Structural Variations, and SNPs Contribute to the Evolution of the Melon Genome.

    Science.gov (United States)

    Sanseverino, Walter; Hénaff, Elizabeth; Vives, Cristina; Pinosio, Sara; Burgos-Paz, William; Morgante, Michele; Ramos-Onsins, Sebastián E; Garcia-Mas, Jordi; Casacuberta, Josep Maria

    2015-10-01

    The availability of extensive databases of crop genome sequences should allow analysis of crop variability at an unprecedented scale, which should have an important impact in plant breeding. However, up to now the analysis of genetic variability at the whole-genome scale has been mainly restricted to single nucleotide polymorphisms (SNPs). This is a strong limitation as structural variation (SV) and transposon insertion polymorphisms are frequent in plant species and have had an important mutational role in crop domestication and breeding. Here, we present the first comprehensive analysis of melon genetic diversity, which includes a detailed analysis of SNPs, SV, and transposon insertion polymorphisms. The variability found among seven melon varieties representing the species diversity and including wild accessions and highly breed lines, is relatively high due in part to the marked divergence of some lineages. The diversity is distributed nonuniformly across the genome, being lower at the extremes of the chromosomes and higher in the pericentromeric regions, which is compatible with the effect of purifying selection and recombination forces over functional regions. Additionally, this variability is greatly reduced among elite varieties, probably due to selection during breeding. We have found some chromosomal regions showing a high differentiation of the elite varieties versus the rest, which could be considered as strongly selected candidate regions. Our data also suggest that transposons and SV may be at the origin of an important fraction of the variability in melon, which highlights the importance of analyzing all types of genetic variability to understand crop genome evolution.

  7. Achilles' heel of pluripotent stem cells: genetic, genomic and epigenetic variations during prolonged culture.

    Science.gov (United States)

    Rebuzzini, Paola; Zuccotti, Maurizio; Redi, Carlo Alberto; Garagna, Silvia

    2016-07-01

    Pluripotent stem cells differentiate into almost any specialized adult cell type of an organism. PSCs can be derived either from the inner cell mass of a blastocyst-giving rise to embryonic stem cells-or after reprogramming of somatic terminally differentiated cells to obtain ES-like cells, named induced pluripotent stem cells. The potential use of these cells in the clinic, for investigating in vitro early embryonic development or for screening the effects of new drugs or xenobiotics, depends on capability to maintain their genome integrity during prolonged culture and differentiation. Both human and mouse PSCs are prone to genomic and (epi)genetic instability during in vitro culture, a feature that seriously limits their real potential use. Culture-induced variations of specific chromosomes or genes, are almost all unpredictable and, as a whole, differ among independent cell lines. They may arise at different culture passages, suggesting the absence of a safe passage number maintaining genome integrity and rendering the control of genomic stability mandatory since the very early culture passages. The present review highlights the urgency for further studies on the mechanisms involved in determining (epi)genetic and chromosome instability, exploiting the knowledge acquired earlier on other cell types.

  8. Dental size variation in the Atapuerca-SH Middle Pleistocene hominids.

    Science.gov (United States)

    Bermúdez de Castro, J M; Sarmiento, S; Cunha, E; Rosas, A; Bastir, M

    2001-09-01

    The Middle Pleistocene Atapuerca-Sima de los Huesos (SH) site in Spain has yielded the largest sample of fossil hominids so far found from a single site and belonging to the same biological population. The SH dental sample includes a total of 452 permanent and deciduous teeth, representing a minimum of 27 individuals. We present a study of the dental size variation in these hominids, based on the analysis of the mandibular permanent dentition: lateral incisors, n=29; canines, n=27; third premolars, n=30; fourth premolars, n=34; first molars, n=38; second molars, n=38. We have obtained the buccolingual diameter and the crown area (measured on occlusal photographs) of these teeth, and used the bootstrap method to assess the amount of variation in the SH sample compared with the variation of a modern human sample from the Museu Antropologico of the Universidade of Coimbra (Portugal). The SH hominids have, in general terms, a dental size variation higher than that of the modern human sample. The analysis is especially conclusive for the canines. Furthermore, we have estimated the degree of sexual dimorphism of the SH sample by obtaining male and female dental subsamples by means of sexing the large sample of SH mandibular specimens. We obtained the index of sexual dimorphism (ISD=male mean/female mean) and the values were compared with those obtained from the sexed modern human sample from Coimbra, and with data found in the literature concerning several recent human populations. In all tooth classes the ISD of the SH hominids was higher than that of modern humans, but the differences were generally modest, except for the canines, thus suggesting that canine size sexual dimorphism in Homo heidelbergensis was probably greater than that of modern humans. Since the approach of sexing fossil specimens has some obvious limitations, these results should be assessed with caution. Additional data from SH and other European Middle Pleistocene sites would be necessary to test

  9. A genome-wide survey of genetic variation in gorillas using reduced representation sequencing.

    Directory of Open Access Journals (Sweden)

    Aylwyn Scally

    Full Text Available All non-human great apes are endangered in the wild, and it is therefore important to gain an understanding of their demography and genetic diversity. Whole genome assembly projects have provided an invaluable foundation for understanding genetics in all four genera, but to date genetic studies of multiple individuals within great ape species have largely been confined to mitochondrial DNA and a small number of other loci. Here, we present a genome-wide survey of genetic variation in gorillas using a reduced representation sequencing approach, focusing on the two lowland subspecies. We identify 3,006,670 polymorphic sites in 14 individuals: 12 western lowland gorillas (Gorilla gorilla gorilla and 2 eastern lowland gorillas (Gorilla beringei graueri. We find that the two species are genetically distinct, based on levels of heterozygosity and patterns of allele sharing. Focusing on the western lowland population, we observe evidence for population substructure, and a deficit of rare genetic variants suggesting a recent episode of population contraction. In western lowland gorillas, there is an elevation of variation towards telomeres and centromeres on the chromosomal scale. On a finer scale, we find substantial variation in genetic diversity, including a marked reduction close to the major histocompatibility locus, perhaps indicative of recent strong selection there. These findings suggest that despite their maintaining an overall level of genetic diversity equal to or greater than that of humans, population decline, perhaps associated with disease, has been a significant factor in recent and long-term pressures on wild gorilla populations.

  10. Population genomics of Pacific lamprey: adaptive variation in a highly dispersive species.

    Science.gov (United States)

    Hess, Jon E; Campbell, Nathan R; Close, David A; Docker, Margaret F; Narum, Shawn R

    2013-06-01

    Unlike most anadromous fishes that have evolved strict homing behaviour, Pacific lamprey (Entosphenus tridentatus) seem to lack philopatry as evidenced by minimal population structure across the species range. Yet unexplained findings of within-region population genetic heterogeneity coupled with the morphological and behavioural diversity described for the species suggest that adaptive genetic variation underlying fitness traits may be responsible. We employed restriction site-associated DNA sequencing to genotype 4439 quality filtered single nucleotide polymorphism (SNP) loci for 518 individuals collected across a broad geographical area including British Columbia, Washington, Oregon and California. A subset of putatively neutral markers (N = 4068) identified a significant amount of variation among three broad populations: northern British Columbia, Columbia River/southern coast and 'dwarf' adults (F(CT) = 0.02, P ≪ 0.001). Additionally, 162 SNPs were identified as adaptive through outlier tests, and inclusion of these markers revealed a signal of adaptive variation related to geography and life history. The majority of the 162 adaptive SNPs were not independent and formed four groups of linked loci. Analyses with matsam software found that 42 of these outlier SNPs were significantly associated with geography, run timing and dwarf life history, and 27 of these 42 SNPs aligned with known genes or highly conserved genomic regions using the genome browser available for sea lamprey. This study provides both neutral and adaptive context for observed genetic divergence among collections and thus reconciles previous findings of population genetic heterogeneity within a species that displays extensive gene flow.

  11. Copy number variation is a fundamental aspect of the placental genome.

    Science.gov (United States)

    Hannibal, Roberta L; Chuong, Edward B; Rivera-Mulia, Juan Carlos; Gilbert, David M; Valouev, Anton; Baker, Julie C

    2014-05-01

    Discovery of lineage-specific somatic copy number variation (CNV) in mammals has led to debate over whether CNVs are mutations that propagate disease or whether they are a normal, and even essential, aspect of cell biology. We show that 1,000 N polyploid trophoblast giant cells (TGCs) of the mouse placenta contain 47 regions, totaling 138 Megabases, where genomic copies are underrepresented (UR). UR domains originate from a subset of late-replicating heterochromatic regions containing gene deserts and genes involved in cell adhesion and neurogenesis. While lineage-specific CNVs have been identified in mammalian cells, classically in the immune system where V(D)J recombination occurs, we demonstrate that CNVs form during gestation in the placenta by an underreplication mechanism, not by recombination nor deletion. Our results reveal that large scale CNVs are a normal feature of the mammalian placental genome, which are regulated systematically during embryogenesis and are propagated by a mechanism of underreplication.

  12. Copy number variation is a fundamental aspect of the placental genome.

    Directory of Open Access Journals (Sweden)

    Roberta L Hannibal

    2014-05-01

    Full Text Available Discovery of lineage-specific somatic copy number variation (CNV in mammals has led to debate over whether CNVs are mutations that propagate disease or whether they are a normal, and even essential, aspect of cell biology. We show that 1,000 N polyploid trophoblast giant cells (TGCs of the mouse placenta contain 47 regions, totaling 138 Megabases, where genomic copies are underrepresented (UR. UR domains originate from a subset of late-replicating heterochromatic regions containing gene deserts and genes involved in cell adhesion and neurogenesis. While lineage-specific CNVs have been identified in mammalian cells, classically in the immune system where V(DJ recombination occurs, we demonstrate that CNVs form during gestation in the placenta by an underreplication mechanism, not by recombination nor deletion. Our results reveal that large scale CNVs are a normal feature of the mammalian placental genome, which are regulated systematically during embryogenesis and are propagated by a mechanism of underreplication.

  13. Illumina based whole mitochondrial genome of Junonia iphita reveals minor intraspecific variation

    Directory of Open Access Journals (Sweden)

    Catherine Vanlalruati

    2015-12-01

    Full Text Available In the present study, the near complete mitochondrial genome (mitogenome of Junonia iphita (Lepidoptera: Nymphalidae: Nymphalinae was determined to be 14,892 bp. The gene order and orientation are identical to those in other butterfly species. The phylogenetic tree constructed from the whole mitogenomes using the 13 protein coding genes (PCGs defines the genetic relatedness of the two J. iphita species collected from two different regions. All the Junonia species clustered together, and were further subdivided into clade one consisting of J. almana and J. orithya and clade two comprising of the two J. iphita which were collected from Indo and Indochinese subregions separated by river barrier. Comparison between the two J. iphita sequences revealed minor variations and Single Nucleotide Polymorphisms were identified at 51 sites amounting to 0.4% of the entire mitochondrial genome.

  14. An integrated map of genetic variation from 1.092 human genomes

    DEFF Research Database (Denmark)

    Abecasis, Goncalo R.; Auton, Adam; Brooks, Lisa D.

    2012-01-01

    By characterizing the geographic and functional spectrum of human genetic variation, the 1000 Genomes Project aims to build a resource to help to understand the genetic contribution to disease. Here we describe the genomes of 1,092 individuals from 14 populations, constructed using a combination...... deletions. We show that individuals from different populations carry different profiles of rare and common variants, and that low-frequency variants show substantial geographic differentiation, which is further increased by the action of purifying selection. We show that evolutionary conservation and coding...... consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways, and that each individual contains hundreds of rare non-coding variants at conserved sites, such as motif-disrupting changes in transcription-factor-binding sites...

  15. Whole genome re-sequencing reveals genome-wide variations among parental lines of 16 mapping populations in chickpea (Cicer arietinum L.).

    Science.gov (United States)

    Thudi, Mahendar; Khan, Aamir W; Kumar, Vinay; Gaur, Pooran M; Katta, Krishnamohan; Garg, Vanika; Roorkiwal, Manish; Samineni, Srinivasan; Varshney, Rajeev K

    2016-01-27

    Chickpea (Cicer arietinum L.) is the second most important grain legume cultivated by resource poor farmers in South Asia and Sub-Saharan Africa. In order to harness the untapped genetic potential available for chickpea improvement, we re-sequenced 35 chickpea genotypes representing parental lines of 16 mapping populations segregating for abiotic (drought, heat, salinity), biotic stresses (Fusarium wilt, Ascochyta blight, Botrytis grey mould, Helicoverpa armigera) and nutritionally important (protein content) traits using whole genome re-sequencing approach. A total of 192.19 Gb data, generated on 35 genotypes of chickpea, comprising 973.13 million reads, with an average sequencing depth of ~10 X for each line. On an average 92.18 % reads from each genotype were aligned to the chickpea reference genome with 82.17 % coverage. A total of 2,058,566 unique single nucleotide polymorphisms (SNPs) and 292,588 Indels were detected while comparing with the reference chickpea genome. Highest number of SNPs were identified on the Ca4 pseudomolecule. In addition, copy number variations (CNVs) such as gene deletions and duplications were identified across the chickpea parental genotypes, which were minimum in PI 489777 (1 gene deletion) and maximum in JG 74 (1,497). A total of 164,856 line specific variations (144,888 SNPs and 19,968 Indels) with the highest percentage were identified in coding regions in ICC 1496 (21 %) followed by ICCV 97105 (12 %). Of 539 miscellaneous variations, 339, 138 and 62 were inter-chromosomal variations (CTX), intra-chromosomal variations (ITX) and inversions (INV) respectively. Genome-wide SNPs, Indels, CNVs, PAVs, and miscellaneous variations identified in different mapping populations are a valuable resource in genetic research and helpful in locating genes/genomic segments responsible for economically important traits. Further, the genome-wide variations identified in the present study can be used for developing high density SNP arrays for

  16. Optimizing k-mer size using a variant grid search to enhance de novo genome assembly

    Science.gov (United States)

    Cha, Soyeon; Bird, David McK

    2016-01-01

    Largely driven by huge reductions in per-base costs, sequencing nucleic acids has become a near-ubiquitous technique in laboratories performing biological and biomedical research. Most of the effort goes to re-sequencing, but assembly of de novogenerated, raw sequence reads into contigs that span as much of the genome as possible is central to many projects. Although truly complete coverage is not realistically attainable, maximizing the amount of sequence that can be correctly assembled into contigs contributes to coverage. Here we compare three commonly used assembly algorithms (ABySS, Velvet and SOAPdenovo2), and show that empirical optimization of k-mer values has a disproportionate influence on de novo assembly of a eukaryotic genome, the nematode parasite Meloidogynechitwoodi. Each assembler was challenged with about 40 million Iluumina II paired-end reads, and assemblies performed under a range of k-mer sizes. In each instance, the optimal k-mer was 127, although based on N50 values,ABySS was more efficient than the others. That the assembly was not spurious was established using the “Core Eukaryotic Gene Mapping Approach”, which indicated that 98.79% of the M. chitwoodi genome was accounted for by the assembly. Subsequent gene finding and annotation are consistent with this and suggest that k-mer optimization contributes to the robustness of assembly. PMID:28104957

  17. Predictive Models of Recombination Rate Variation across the Drosophila melanogaster Genome

    Science.gov (United States)

    Adrian, Andrew B.; Corchado, Johnny Cruz; Comeron, Josep M.

    2016-01-01

    In all eukaryotic species examined, meiotic recombination, and crossovers in particular, occur non‐randomly along chromosomes. The cause for this non-random distribution remains poorly understood but some specific DNA sequence motifs have been shown to be enriched near crossover hotspots in a number of species. We present analyses using machine learning algorithms to investigate whether DNA motif distribution across the genome can be used to predict crossover variation in Drosophila melanogaster, a species without hotspots. Our study exposes a combinatorial non-linear influence of motif presence able to account for a significant fraction of the genome-wide variation in crossover rates at all genomic scales investigated, from 20% at 5-kb to almost 70% at 2,500-kb scale. The models are particularly predictive for regions with the highest and lowest crossover rates and remain highly informative after removing sub-telomeric and -centromeric regions known to have strongly reduced crossover rates. Transcriptional activity during early meiosis and differences in motif use between autosomes and the X chromosome add to the predictive power of the models. Moreover, we show that population-specific differences in crossover rates can be partly explained by differences in motif presence. Our results suggest that crossover distribution in Drosophila is influenced by both meiosis-specific chromatin dynamics and very local constitutive open chromatin associated with DNA motifs that prevent nucleosome stabilization. These findings provide new information on the genetic factors influencing variation in recombination rates and a baseline to study epigenetic mechanisms responsible for plastic recombination as response to different biotic and abiotic conditions and stresses. PMID:27492232

  18. Molecular phylogeny and genome size evolution of the genus Betula (Betulaceae).

    Science.gov (United States)

    Wang, Nian; McAllister, Hugh A; Bartlett, Paul R; Buggs, Richard J A

    2016-05-01

    Betula L. (birch) is a genus of approx. 60 species, subspecies or varieties with a wide distribution in the northern hemisphere, of ecological and economic importance. A new classification of Betula has recently been proposed based on morphological characters. This classification differs somewhat from previously published molecular phylogenies, which may be due to factors such as convergent evolution, hybridization, incomplete taxon sampling or misidentification of samples. While chromosome counts have been made for many species, few have had their genome size measured. The aim of this study is to produce a new phylogenetic and genome size analysis of the genus. Internal transcribed spacer (ITS) regions of nuclear ribosomal DNA were sequenced for 76 Betula samples verified by taxonomic experts, representing approx. 60 taxa, of which approx. 24 taxa have not been included in previous phylogenetic analyses. A further 49 samples from other collections were also sequenced, and 108 ITS sequences were downloaded from GenBank. Phylogenetic trees were built for these sequences. The genome sizes of 103 accessions representing nearly all described species were estimated using flow cytometry. As expected for a gene tree of a genus where hybridization and allopolyploidy occur, the ITS tree shows clustering, but not resolved monophyly, for the morphological subgenera recently proposed. Most sections show some clustering, but species of the dwarf section Apterocaryon are unusually scattered. Betula corylifolia (subgenus Nipponobetula) unexpectedly clusters with species of subgenus Aspera Unexpected placements are also found for B. maximowicziana, B. bomiensis, B. nigra and B. grossa Biogeographical disjunctions were found within Betula between Europe and North America, and also disjunctions between North-east and South-west Asia. The 2C-values for Betula ranged from 0·88 to 5·33 pg, and polyploids are scattered widely throughout the ITS phylogeny. Species with large genomes

  19. Size variation of the end Permian conodont Neogondolella at Meishan Section, Changxing, Zhejiang and its significance

    Institute of Scientific and Technical Information of China (English)

    2006-01-01

    This study is based on both a generic and species level investigation of the individual size of the latest Permian conodont Neogondolella Pa elements collected from Meishan Section A, Changxing, Zhejiang Province. In this study, an obvious size reduction of Neogondolella Pa elements within bed 24e of the upper Changxing Limestone is recognized. The size variation of the Neogondolella occurs simultaneously with some important events including the negative shift of δ13C, change in the ratio of the abundance of cyanobacterial biomarkers versus that of other general bacterial biomarkers and the shallowing of the sea water. Through the investigation of the paleoenvironmental changes and the analysis of the paleoecology of the conodont genus Neogondolella, the authors propose that the major factors for the size reduction of species of the conodont genus Neogondolella are food shortages caused by the mass extinction, the shallowing of the sea water as well as the increase in opacity of the sea water during the end Permian. The same phenomenon of Neogondolella size reduction is also observed in preliminary research from the same horizon at Shangsi Section, Sichuan Province. All the evidence suggests that there was a mass extinction that occurred at the horizon of bed 24e, and the evidence supports the viewpoint of a multi-phase mass extinction during the Permian and Triassic transition in South China.

  20. Home range size variation in female arctic grizzly bears relative to reproductive status and resource availability.

    Directory of Open Access Journals (Sweden)

    Mark A Edwards

    Full Text Available The area traversed in pursuit of resources defines the size of an animal's home range. For females, the home range is presumed to be a function of forage availability. However, the presence of offspring may also influence home range size due to reduced mobility, increased nutritional need, and behavioral adaptations of mothers to increase offspring survival. Here, we examine the relationship between resource use and variation in home range size for female barren-ground grizzly bears (Ursus arctos of the Mackenzie Delta region in Arctic Canada. We develop methods to test hypotheses of home range size that address selection of cover where cover heterogeneity is low, using generalized linear mixed-effects models and an information-theoretic approach. We found that the reproductive status of female grizzlies affected home range size but individually-based spatial availability of highly selected cover in spring and early summer was a stronger correlate. If these preferred covers in spring and early summer, a period of low resource availability for grizzly bears following den-emergence, were patchy and highly dispersed, females travelled farther regardless of the presence or absence of offspring. Increased movement to preferred covers, however, may result in greater risk to the individual or family.

  1. Home range size variation in female arctic grizzly bears relative to reproductive status and resource availability.

    Science.gov (United States)

    Edwards, Mark A; Derocher, Andrew E; Nagy, John A

    2013-01-01

    The area traversed in pursuit of resources defines the size of an animal's home range. For females, the home range is presumed to be a function of forage availability. However, the presence of offspring may also influence home range size due to reduced mobility, increased nutritional need, and behavioral adaptations of mothers to increase offspring survival. Here, we examine the relationship between resource use and variation in home range size for female barren-ground grizzly bears (Ursus arctos) of the Mackenzie Delta region in Arctic Canada. We develop methods to test hypotheses of home range size that address selection of cover where cover heterogeneity is low, using generalized linear mixed-effects models and an information-theoretic approach. We found that the reproductive status of female grizzlies affected home range size but individually-based spatial availability of highly selected cover in spring and early summer was a stronger correlate. If these preferred covers in spring and early summer, a period of low resource availability for grizzly bears following den-emergence, were patchy and highly dispersed, females travelled farther regardless of the presence or absence of offspring. Increased movement to preferred covers, however, may result in greater risk to the individual or family.

  2. Integrated analysis of copy number variation and genome-wide expression profiling in colorectal cancer tissues.

    Science.gov (United States)

    Ali Hassan, Nur Zarina; Mokhtar, Norfilza Mohd; Kok Sin, Teow; Mohamed Rose, Isa; Sagap, Ismail; Harun, Roslan; Jamal, Rahman

    2014-01-01

    Integrative analyses of multiple genomic datasets for selected samples can provide better insight into the overall data and can enhance our knowledge of cancer. The objective of this study was to elucidate the association between copy number variation (CNV) and gene expression in colorectal cancer (CRC) samples and their corresponding non-cancerous tissues. Sixty-four paired CRC samples from the same patients were subjected to CNV profiling using the Illumina HumanOmni1-Quad assay, and validation was performed using multiplex ligation probe amplification method. Genome-wide expression profiling was performed on 15 paired samples from the same group of patients using the Affymetrix Human Gene 1.0 ST array. Significant genes obtained from both array results were then overlapped. To identify molecular pathways, the data were mapped to the KEGG database. Whole genome CNV analysis that compared primary tumor and non-cancerous epithelium revealed gains in 1638 genes and losses in 36 genes. Significant gains were mostly found in chromosome 20 at position 20q12 with a frequency of 45.31% in tumor samples. Examples of genes that were associated at this cytoband were PTPRT, EMILIN3 and CHD6. The highest number of losses was detected at chromosome 8, position 8p23.2 with 17.19% occurrence in all tumor samples. Among the genes found at this cytoband were CSMD1 and DLC1. Genome-wide expression profiling showed 709 genes to be up-regulated and 699 genes to be down-regulated in CRC compared to non-cancerous samples. Integration of these two datasets identified 56 overlapping genes, which were located in chromosomes 8, 20 and 22. MLPA confirmed that the CRC samples had the highest gains in chromosome 20 compared to the reference samples. Interpretation of the CNV data in the context of the transcriptome via integrative analyses may provide more in-depth knowledge of the genomic landscape of CRC.

  3. Integrated analysis of copy number variation and genome-wide expression profiling in colorectal cancer tissues.

    Directory of Open Access Journals (Sweden)

    Nur Zarina Ali Hassan

    Full Text Available Integrative analyses of multiple genomic datasets for selected samples can provide better insight into the overall data and can enhance our knowledge of cancer. The objective of this study was to elucidate the association between copy number variation (CNV and gene expression in colorectal cancer (CRC samples and their corresponding non-cancerous tissues. Sixty-four paired CRC samples from the same patients were subjected to CNV profiling using the Illumina HumanOmni1-Quad assay, and validation was performed using multiplex ligation probe amplification method. Genome-wide expression profiling was performed on 15 paired samples from the same group of patients using the Affymetrix Human Gene 1.0 ST array. Significant genes obtained from both array results were then overlapped. To identify molecular pathways, the data were mapped to the KEGG database. Whole genome CNV analysis that compared primary tumor and non-cancerous epithelium revealed gains in 1638 genes and losses in 36 genes. Significant gains were mostly found in chromosome 20 at position 20q12 with a frequency of 45.31% in tumor samples. Examples of genes that were associated at this cytoband were PTPRT, EMILIN3 and CHD6. The highest number of losses was detected at chromosome 8, position 8p23.2 with 17.19% occurrence in all tumor samples. Among the genes found at this cytoband were CSMD1 and DLC1. Genome-wide expression profiling showed 709 genes to be up-regulated and 699 genes to be down-regulated in CRC compared to non-cancerous samples. Integration of these two datasets identified 56 overlapping genes, which were located in chromosomes 8, 20 and 22. MLPA confirmed that the CRC samples had the highest gains in chromosome 20 compared to the reference samples. Interpretation of the CNV data in the context of the transcriptome via integrative analyses may provide more in-depth knowledge of the genomic landscape of CRC.

  4. Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole-genome sequencing.

    Science.gov (United States)

    Aflitos, Saulo; Schijlen, Elio; de Jong, Hans; de Ridder, Dick; Smit, Sandra; Finkers, Richard; Wang, Jun; Zhang, Gengyun; Li, Ning; Mao, Likai; Bakker, Freek; Dirks, Rob; Breit, Timo; Gravendeel, Barbara; Huits, Henk; Struss, Darush; Swanson-Wagner, Ruth; van Leeuwen, Hans; van Ham, Roeland C H J; Fito, Laia; Guignier, Laëtitia; Sevilla, Myrna; Ellul, Philippe; Ganko, Eric; Kapur, Arvind; Reclus, Emannuel; de Geus, Bernard; van de Geest, Henri; Te Lintel Hekkert, Bas; van Haarst, Jan; Smits, Lars; Koops, Andries; Sanchez-Perez, Gabino; van Heusden, Adriaan W; Visser, Richard; Quan, Zhiwu; Min, Jiumeng; Liao, Li; Wang, Xiaoli; Wang, Guangbiao; Yue, Zhen; Yang, Xinhua; Xu, Na; Schranz, Eric; Smets, Erik; Vos, Rutger; Rauwerda, Johan; Ursem, Remco; Schuit, Cees; Kerns, Mike; van den Berg, Jan; Vriezen, Wim; Janssen, Antoine; Datema, Erwin; Jahrman, Torben; Moquet, Frederic; Bonnet, Julien; Peters, Sander

    2014-10-01

    We explored genetic variation by sequencing a selection of 84 tomato accessions and related wild species representative of the Lycopersicon, Arcanum, Eriopersicon and Neolycopersicon groups, which has yielded a huge amount of precious data on sequence diversity in the tomato clade. Three new reference genomes were reconstructed to support our comparative genome analyses. Comparative sequence alignment revealed group-, species- and accession-specific polymorphisms, explaining characteristic fruit traits and growth habits in the various cultivars. Using gene models from the annotated Heinz 1706 reference genome, we observed differences in the ratio between non-synonymous and synonymous SNPs (dN/dS) in fruit diversification and plant growth genes compared to a random set of genes, indicating positive selection and differences in selection pressure between crop accessions and wild species. In wild species, the number of single-nucleotide polymorphisms (SNPs) exceeds 10 million, i.e. 20-fold higher than found in most of the crop accessions, indicating dramatic genetic erosion of crop and heirloom tomatoes. In addition, the highest levels of heterozygosity were found for allogamous self-incompatible wild species, while facultative and autogamous self-compatible species display a lower heterozygosity level. Using whole-genome SNP information for maximum-likelihood analysis, we achieved complete tree resolution, whereas maximum-likelihood trees based on SNPs from ten fruit and growth genes show incomplete resolution for the crop accessions, partly due to the effect of heterozygous SNPs. Finally, results suggest that phylogenetic relationships are correlated with habitat, indicating the occurrence of geographical races within these groups, which is of practical importance for Solanum genome evolution studies.

  5. Advances in biotechnology and informatics to link variation in the genome to phenotypes in plants and animals.

    Science.gov (United States)

    Appels, R; Barrero, R; Bellgard, M

    2013-03-01

    Advances in our understanding of genome structure provide consistent evidence for the existence of a core genome representing species classically defined by phenotype, as well as conditionally dispensable components of the genome that shows extensive variation between individuals of a given species. Generally, conservation of phenotypic features between species reflects conserved features of the genome; however, this is evidently not necessarily always the case as demonstrated by the analysis of the tunicate chordate Oikopleura dioica. In both plants and animals, the methylation activity of DNA and histones continues to present new variables for modifying (eventually) the phenotype of an organism and provides for structural variation that builds on the point mutations, rearrangements, indels, and amplification of retrotransposable elements traditionally considered. The translation of the advances in the structure/function analysis of the genome to industry is facilitated through the capture of research outputs in "toolboxes" that remain accessible in the public domain.

  6. Complete chloroplast genomes from apomictic Taraxacum (Asteraceae): Identity and variation between three microspecies

    Science.gov (United States)

    Majeský, Ľuboš; Schwarzacher, Trude; Gornall, Richard; Heslop-Harrison, Pat

    2017-01-01

    Chloroplast DNA sequences show substantial variation between higher plant species, and less variation within species, so are typically excellent markers to investigate evolutionary, population and genetic relationships and phylogenies. We sequenced the plastomes of Taraxacum obtusifrons Markl. (O978); T. stridulum Trávniček ined. (S3); and T. amplum Markl. (A978), three apomictic triploid (2n = 3x = 24) dandelions from the T. officinale agg. We aimed to characterize the variation in plastomes, define relationships and correlations with the apomictic microspecies status, and refine placement of the microspecies in the evolutionary or phylogenetic context of the Asteraceae. The chloroplast genomes of accessions O978 and S3 were identical and 151,322 bp long (where the nuclear genes are known to show variation), while A978 was 151,349 bp long. All three genomes contained 135 unique genes, with an additional copy of the trnF-GGA gene in the LSC region and 20 duplicated genes in the IR region, along with short repeats, the typical major Inverted Repeats (IR1 and IR2, 24,431bp long), and Large and Small Single Copy regions (LSC 83,889bp and SSC 18,571bp in O978). Between the two Taraxacum plastomes types, we identified 28 SNPs. The distribution of polymorphisms suggests some parts of the Taraxacum plastome are evolving at a slower rate. There was a hemi-nested inversion in the LSC region that is common to Asteraceae, and an SSC inversion from ndhF to rps15 found only in some Asteraceae lineages. A comparative repeat analysis showed variation between Taraxacum and the phylogenetically close genus Lactuca, with many more direct repeats of 40bp or more in Lactuca (1% larger plastome than Taraxacum). When individual genes and non-coding regions were for Asteraceae phylogeny reconstruction, not all showed the same evolutionary scenario suggesting care is needed for interpretation of relationships if a limited number of markers are used. Studying genotypic diversity in

  7. Complete chloroplast genomes from apomictic Taraxacum (Asteraceae): Identity and variation between three microspecies.

    Science.gov (United States)

    M Salih, Rubar Hussein; Majeský, Ľuboš; Schwarzacher, Trude; Gornall, Richard; Heslop-Harrison, Pat

    2017-01-01

    Chloroplast DNA sequences show substantial variation between higher plant species, and less variation within species, so are typically excellent markers to investigate evolutionary, population and genetic relationships and phylogenies. We sequenced the plastomes of Taraxacum obtusifrons Markl. (O978); T. stridulum Trávniček ined. (S3); and T. amplum Markl. (A978), three apomictic triploid (2n = 3x = 24) dandelions from the T. officinale agg. We aimed to characterize the variation in plastomes, define relationships and correlations with the apomictic microspecies status, and refine placement of the microspecies in the evolutionary or phylogenetic context of the Asteraceae. The chloroplast genomes of accessions O978 and S3 were identical and 151,322 bp long (where the nuclear genes are known to show variation), while A978 was 151,349 bp long. All three genomes contained 135 unique genes, with an additional copy of the trnF-GGA gene in the LSC region and 20 duplicated genes in the IR region, along with short repeats, the typical major Inverted Repeats (IR1 and IR2, 24,431bp long), and Large and Small Single Copy regions (LSC 83,889bp and SSC 18,571bp in O978). Between the two Taraxacum plastomes types, we identified 28 SNPs. The distribution of polymorphisms suggests some parts of the Taraxacum plastome are evolving at a slower rate. There was a hemi-nested inversion in the LSC region that is common to Asteraceae, and an SSC inversion from ndhF to rps15 found only in some Asteraceae lineages. A comparative repeat analysis showed variation between Taraxacum and the phylogenetically close genus Lactuca, with many more direct repeats of 40bp or more in Lactuca (1% larger plastome than Taraxacum). When individual genes and non-coding regions were for Asteraceae phylogeny recon