WorldWideScience

Sample records for chloroplast genomics analyses

  1. Comparative chloroplast genomics: analyses including new sequences from the angiosperms Nuphar advena and Ranunculus macranthus

    Directory of Open Access Journals (Sweden)

    Boore Jeffrey L

    2007-06-01

    Full Text Available Abstract Background The number of completely sequenced plastid genomes available is growing rapidly. This array of sequences presents new opportunities to perform comparative analyses. In comparative studies, it is often useful to compare across wide phylogenetic spans and, within angiosperms, to include representatives from basally diverging lineages such as the genomes reported here: Nuphar advena (from a basal-most lineage) and Ranunculus macranthus (a basal eudicot). We report these two new plastid genome sequences and make comparisons (within angiosperms, seed plants, or all photosynthetic lineages) to evaluate features such as the status of ycf15 and ycf68 as protein coding genes, the distribution of simple sequence repeats (SSRs) and longer dispersed repeats (SDR), and patterns of nucleotide composition. Results The Nuphar [GenBank:NC_008788] and Ranunculus [GenBank:NC_008796] plastid genomes share characteristics of gene content and organization with many other chloroplast genomes. Like other plastid genomes, these genomes are A+T-rich, except for rRNA and tRNA genes. Detailed comparisons of Nuphar with Nymphaea, another Nymphaeaceae, show that more than two-thirds of these genomes exhibit at least 95% sequence identity and that most SSRs are shared. In broader comparisons, SSRs vary among genomes in terms of abundance and length and most contain repeat motifs based on A and T nucleotides. Conclusion SSR and SDR abundance varies by genome and, for SSRs, is proportional to genome size. Long SDRs are rare in the genomes assessed. SSRs occur less frequently than predicted and, although the majority of the repeat motifs do include A and T nucleotides, the A+T bias in SSRs is less than that predicted from the underlying genomic nucleotide composition. In codon usage, third positions show an A+T bias; however, variation in codon usage does not correlate with differences in A+T-richness. Thus, although plastome nucleotide composition shows "A

  2. Analyses of the complete genome and gene expression of chloroplast of sweet potato [Ipomoea batata].

    Science.gov (United States)

    Yan, Lang; Lai, Xianjun; Li, Xuedan; Wei, Changhe; Tan, Xuemei; Zhang, Yizheng

    2015-01-01

    Sweet potato [Ipomoea batatas (L.) Lam] ranks among the top seven most important food crops cultivated worldwide and is a hexaploid plant (2n=6x=90) in the Convolvulaceae family with a genome size between 2,200 and 3,000 Mb. The genomic resources for this crop are deficient due to its complicated genetic structure. Here, we report the complete nucleotide sequence of the chloroplast (cp) genome of sweet potato, which is a circular molecule of 161,303 bp in the typical quadripartite structure with large (LSC) and small (SSC) single-copy regions separated by a pair of inverted repeats (IRs). The chloroplast DNA contains a total of 145 genes, including 94 protein-encoding genes of which there are 72 single-copy and 11 double-copy genes. The organization and structure of the chloroplast genome (gene content and order, IR expansion/contraction, random repeating sequences, structural rearrangement) of sweet potato were compared with those of Ipomoea (L.) species and some important basal angiosperms, respectively. Some boundary gene-flow and gene gain-and-loss events were identified at intra- and inter-species levels. In addition, by comparison with the transcriptome sequences of sweet potato, RNA editing events and differential expression of the chloroplast functional genes were detected. Moreover, phylogenetic analysis was conducted based on 77 protein-coding genes from 33 taxa, and the result may contribute to a better understanding of the evolutionary progress of the genus Ipomoea (L.), including phylogenetic relationships, intraspecific differentiation and interspecific introgression.

  3. Comparative transcriptome and chloroplast genome analyses of two related Dipteronia species

    Directory of Open Access Journals (Sweden)

    Tao Zhou

    2016-10-01

    Full Text Available Dipteronia (order Sapindales) is an endangered genus endemic to China and has two living species, D. sinensis and D. dyeriana. The plants are closely related to the genus Acer, which is also classified in the order Sapindales. Evolutionary studies on Dipteronia have been hindered by the paucity of information on their genomes and plastids. Here, we used next generation sequencing to characterize the transcriptomes and complete chloroplast genomes of both Dipteronia species. A comparison of the transcriptomes of both species identified a total of 7,814 orthologs. Estimation of selection pressures using Ka/Ks ratios showed that only 30 of 5,435 orthologous pairs had a ratio significantly greater than 1, i.e., showing positive selection. However, 4,041 orthologs had a Ka/Ks < 0.5 (p < 0.05), suggesting that most genes had likely undergone purifying selection. Based on orthologous unigenes, 314 single copy nuclear genes were identified. Through a combination of de novo and reference guided assembly, plastid genomes were obtained; that of D. sinensis was 157,080 bp and that of D. dyeriana was 157,071 bp. Both plastid genomes encoded 87 protein coding genes, 40 tRNAs, and 8 rRNAs; no significant differences were detected in the size, gene content, and organization of the two plastomes. We used the whole chloroplast genomes to determine the phylogeny of D. sinensis and D. dyeriana and confirmed that the two species were highly divergent. Overall, our study provides comprehensive transcriptomic and chloroplast genomic resources, which will be valuable for future evolutionary studies of Dipteronia.
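
The Ka/Ks cutoffs quoted in this abstract (ratio greater than 1 read as positive selection, ratio below 0.5 read as purifying selection) can be illustrated with a minimal Python sketch. The function name and the "intermediate" label are hypothetical, and the sketch ignores the significance testing that the abstract mentions; it is not the authors' pipeline.

```python
def classify_ka_ks(ka: float, ks: float) -> str:
    """Label one ortholog pair by its Ka/Ks ratio.

    Thresholds follow the abstract (> 1 and < 0.5); the label names and the
    handling of Ks == 0 are illustrative assumptions.
    """
    if ks <= 0:
        return "undefined"      # ratio cannot be computed without synonymous changes
    ratio = ka / ks
    if ratio > 1.0:
        return "positive"       # excess of non-synonymous substitutions
    if ratio < 0.5:
        return "purifying"      # non-synonymous substitutions are depleted
    return "intermediate"


print(classify_ka_ks(ka=0.02, ks=0.10))
```

A real analysis would also attach a test statistic to each pair, since a raw ratio alone does not show that the deviation from 1 is significant.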

  4. The complete chloroplast genome sequences of five Epimedium species: lights into phylogenetic and taxonomic analyses

    Directory of Open Access Journals (Sweden)

    Yanjun Zhang

    2016-03-01

    Full Text Available Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. Here we sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly; combined with the previously reported cp genome sequence of E. koreanum, this is the first comprehensive cp genome analysis of Epimedium. The five Epimedium cp genomes exhibited a typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. Rapidly evolving cp genome regions were detected among the five cp genomes, and differences in simple sequence repeats (SSRs) and repeat sequences were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes were on the whole in accordance with the updated system of the genus, but indicated that the evolutionary relationships and the divisions of the genus need further investigation with more evidence. The availability of these cp genomes provides valuable genetic information for accurate species identification, taxonomy, phylogenetic resolution and the evolution of Epimedium, and will assist in the exploration and utilization of Epimedium plants.

  5. Comparative Chloroplast Genome Analyses of Streptophyte Green Algae Uncover Major Structural Alterations in the Klebsormidiophyceae, Coleochaetophyceae and Zygnematophyceae.

    Science.gov (United States)

    Lemieux, Claude; Otis, Christian; Turmel, Monique

    2016-01-01

    The Streptophyta comprises all land plants and six main lineages of freshwater green algae: Mesostigmatophyceae, Chlorokybophyceae, Klebsormidiophyceae, Charophyceae, Coleochaetophyceae and Zygnematophyceae. Previous comparisons of the chloroplast genome from nine streptophyte algae (including four zygnematophyceans) revealed that, although land plant chloroplast DNAs (cpDNAs) inherited most of their highly conserved structural features from green algal ancestors, considerable cpDNA changes took place during the evolution of the Zygnematophyceae, the sister group of land plants. To gain deeper insights into the evolutionary dynamics of the chloroplast genome in streptophyte algae, we sequenced the cpDNAs of nine additional taxa: two klebsormidiophyceans (Entransia fimbriata and Klebsormidium sp. SAG 51.86), one coleochaetophycean (Coleochaete scutata) and six zygnematophyceans (Cylindrocystis brebissonii, Netrium digitus, Roya obtusa, Spirogyra maxima, Cosmarium botrytis and Closterium baillyanum). Our comparative analyses of these genomes with their streptophyte algal counterparts indicate that the large inverted repeat (IR) encoding the rDNA operon experienced loss or expansion/contraction in all three sampled classes and that genes were extensively shuffled in both the Klebsormidiophyceae and Zygnematophyceae. The klebsormidiophycean genomes boast greatly expanded IRs, with the Entransia 60,590-bp IR being the largest known among green algae. The 206,025-bp Entransia cpDNA, which is one of the largest genomes among streptophytes, encodes 118 standard genes, i.e., four additional genes compared to its Klebsormidium flaccidum homolog. We inferred that seven of the 21 group II introns usually found in land plants were already present in the common ancestor of the Klebsormidiophyceae and its sister lineages. At 107,236 bp and with 117 standard genes, the Coleochaete IR-less genome is both the smallest and most compact among the streptophyte algal cpDNAs analyzed thus

  6. Complete chloroplast genome of Sedum sarmentosum and chloroplast genome evolution in Saxifragales.

    Directory of Open Access Journals (Sweden)

    Wenpan Dong

    Full Text Available Comparative chloroplast genome analyses are mostly carried out at lower taxonomic levels, such as the family and genus levels. At higher taxonomic levels, chloroplast genomes are generally used to reconstruct phylogenies. However, little attention has been paid to chloroplast genome evolution within orders. Here, we present the chloroplast genome of Sedum sarmentosum and take advantage of several available (or elucidated) chloroplast genomes to examine the evolution of chloroplast genomes in Saxifragales. The chloroplast genome of S. sarmentosum is 150,448 bp long and includes 82,212 bp of a large single-copy (LSC) region, 16,670 bp of a small single-copy (SSC) region, and a pair of 25,783 bp sequences of inverted repeats (IRs). The genome contains 131 unique genes, 18 of which are duplicated within the IRs. Based on a comparative analysis of chloroplast genomes from four representative Saxifragales families, we observed two gene losses and two pseudogenes in Paeonia obovata, and the loss of an intron was detected in the rps16 gene of Penthorum chinense. Comparisons among the 72 common protein-coding genes confirmed that the chloroplast genomes of S. sarmentosum and Paeonia obovata exhibit accelerated sequence evolution. Furthermore, a strong correlation was observed between the rates of genome evolution and genome size. The detected genome size variations are predominantly caused by the length of intergenic spacers, rather than losses of genes and introns, gene pseudogenization or IR expansion or contraction. The genome sizes of these species are negatively correlated with nucleotide substitution rates. Species with shorter duration of the life cycle tend to exhibit shorter chloroplast genomes than those with longer life cycles.
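
The region lengths reported for S. sarmentosum can be checked against the stated total, since a quadripartite plastome consists of the LSC, the SSC and two copies of the IR. The short Python sketch below does that arithmetic; the function name is hypothetical and the numbers are the ones quoted in the abstract.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total plastome length: LSC + SSC + two copies of the inverted repeat."""
    return lsc + ssc + 2 * ir


# Region lengths (bp) for Sedum sarmentosum as quoted in the abstract.
sedum_total = quadripartite_length(lsc=82_212, ssc=16_670, ir=25_783)
print(sedum_total)  # 150448, matching the reported genome length
```

The same check is a quick way to catch transcription errors in region lengths copied from an abstract or annotation table.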

  7. Comparative chloroplast genomics: Analyses including new sequences from the angiosperms Nuphar advena and Ranunculus macranthus

    Energy Technology Data Exchange (ETDEWEB)

    Raubeson, Linda A.; Peery, Rhiannon; Chumley, Timothy W.; Dziubek, Chris; Fourcade, H. Matthew; Boore, Jeffrey L.; Jansen, Robert K.

    2007-03-01

    The number of completely sequenced plastid genomes available is growing rapidly. This new array of sequences presents new opportunities to perform comparative analyses. In comparative studies, it is most useful to compare across wide phylogenetic spans and, within angiosperms, to include representatives from basally diverging lineages such as the new genomes reported here: Nuphar advena (from a basal-most lineage) and Ranunculus macranthus (from the basal group of eudicots). We report these two new plastid genome sequences and make comparisons (within angiosperms, seed plants, or all photosynthetic lineages) to evaluate features such as the status of ycf15 and ycf68 as protein coding genes, the distribution of simple sequence repeats (SSRs) and longer dispersed repeats (SDR), and patterns of nucleotide composition.

  8. A comparison of rice chloroplast genomes

    DEFF Research Database (Denmark)

    Tang, Jiabin; Xia, Hong'ai; Cao, Mengliang

    2004-01-01

    ), which are both parental varieties of the super-hybrid rice, LYP9. Based on the patterns of high sequence coverage, we partitioned chloroplast sequence variations into two classes, intravarietal and intersubspecific polymorphisms. Intravarietal polymorphisms refer to variations within 93-11 or PA64S...... to intersubspecific polymorphisms. In our study, we found that the intersubspecific variations of 93-11 (indica) and PA64S (japonica) chloroplast genomes consisted of 72 single nucleotide polymorphisms and 27 insertions or deletions. The intersubspecific polymorphism rates between 93-11 and PA64S were 0.......05% for single nucleotide polymorphisms and 0.02% for insertions or deletions, nearly 8 and 10 times lower than their respective nuclear genomes. Based on the total number of nucleotide substitutions between the two chloroplast genomes, we dated the divergence of indica and japonica chloroplast genomes...

  9. Utilization of complete chloroplast genomes for phylogenetic studies

    NARCIS (Netherlands)

    Ramlee, Shairul Izan Binti

    2016-01-01

    Chloroplast DNA sequence polymorphisms are a primary source of data in many plant phylogenetic studies. The chloroplast genome is relatively conserved in its evolution making it an ideal molecule to retain phylogenetic signals. The chloroplast genome is also largely, but not completely, free from ot

  10. Complete sequencing of five Araliaceae chloroplast genomes and the phylogenetic implications.

    Directory of Open Access Journals (Sweden)

    Rong Li

    Full Text Available BACKGROUND: The ginseng family (Araliaceae) includes a number of economically important plant species. Previous phylogenetic studies circumscribed three major clades within the core ginseng plant family, yet the internal relationships of each major group have been poorly resolved, perhaps due to rapid radiation of these lineages. Recent studies have shown that phylogenomics based on chloroplast genomes provides a viable way to resolve complex relationships. METHODOLOGY/PRINCIPAL FINDINGS: We report the complete nucleotide sequences of five Araliaceae chloroplast genomes using next-generation sequencing technology. The five chloroplast genomes are 156,333-156,459 bp in length including a pair of inverted repeats (25,551-26,108 bp) separated by the large single-copy (86,028-86,566 bp) and small single-copy (18,021-19,117 bp) regions. Each chloroplast genome contains the same 114 unique genes consisting of 30 transfer RNA genes, four ribosomal RNA genes, and 80 protein coding genes. Gene size, content, and order, AT content, and IR/SC boundary structure are similar among all Araliaceae chloroplast genomes. A total of 140 repeats were identified in the five chloroplast genomes with palindromic repeat as the most common type. Phylogenomic analyses using parsimony, likelihood, and Bayesian inference based on the complete chloroplast genomes strongly supported the monophyly of the Asian Palmate group and the Aralia-Panax group. Furthermore, the relationships among the sampled taxa within the Asian Palmate group were well resolved. Twenty-six DNA markers with the percentage of variable sites higher than 5% were identified, which may be useful for phylogenetic studies of Araliaceae. CONCLUSION: The chloroplast genomes of Araliaceae are highly conserved in all aspects of genome features. The large-scale phylogenomic data based on the complete chloroplast DNA sequences is shown to be effective for the phylogenetic reconstruction of Araliaceae.
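
The "percentage of variable sites" used above to shortlist markers can be computed from an aligned region as the share of alignment columns that contain more than one nucleotide. The Python sketch below is one plausible reading of that statistic, not the authors' procedure; the function name is hypothetical, and ignoring gaps and ambiguity codes within a column is an assumption.

```python
from typing import Sequence


def percent_variable_sites(aligned: Sequence[str]) -> float:
    """Percentage of alignment columns with more than one distinct base.

    Assumes equal-length aligned sequences; characters other than A, C, G, T
    (gaps, ambiguity codes) are ignored within a column.
    """
    if not aligned or not aligned[0]:
        raise ValueError("alignment is empty")
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must have equal aligned length")
    variable = 0
    for column in zip(*aligned):
        bases = {base.upper() for base in column if base.upper() in "ACGT"}
        if len(bases) > 1:
            variable += 1
    return 100.0 * variable / length


# Toy alignment (not real data): columns 4 and 10 differ.
toy = ["ACGTACGTAC", "ACGTACGTAA", "ACGAACGTAC"]
print(percent_variable_sites(toy))  # 20.0
```

Under this reading, a region would pass the abstract's cutoff when the returned value exceeds 5.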

  11. Chloroplast genome analysis of Australian eucalypts--Eucalyptus, Corymbia, Angophora, Allosyncarpia and Stockwellia (Myrtaceae).

    Science.gov (United States)

    Bayly, Michael J; Rigault, Philippe; Spokevicius, Antanas; Ladiges, Pauline Y; Ades, Peter K; Anderson, Charlotte; Bossinger, Gerd; Merchant, Andrew; Udovicic, Frank; Woodrow, Ian E; Tibbits, Josquin

    2013-12-01

    We present a phylogenetic analysis and comparison of structural features of chloroplast genomes for 39 species of the eucalypt group (genera Eucalyptus, Corymbia, Angophora, and outgroups Allosyncarpia and Stockwellia). We use 41 complete chloroplast genome sequences, adding 39 finished-quality chloroplast genomes to two previously published genomes. Maximum parsimony and Bayesian analyses, based on >7000 variable nucleotide positions, produced one fully resolved phylogenetic tree (35 supported nodes, 27 with 100% bootstrap support). Eucalyptus and its sister lineage Angophora+Corymbia show a deep divergence. Within Eucalyptus, three lineages are resolved: the 'eudesmid', 'symphyomyrt' and 'monocalypt' groups. Corymbia is paraphyletic with respect to Angophora. Gene content and order do not vary among eucalypt chloroplasts; length mutations, especially frame shifts, are uncommon in protein-coding genes. Some non-synonymous mutations are highly incongruent with the overall phylogenetic signal, notably in rbcL, and may be adaptive. Application of custom informatics pipelines (GYDLE Inc.) enabled direct chloroplast genome assembly, resolving each genome to finished-quality with no need for PCR gap-filling or contig order resolution. Analysis of whole chloroplast genomes resolved major eucalypt clades and revealed variable regions of the genome that will be useful in lower-level genetic studies (including phylogeography and geneflow).

  12. Complete Chloroplast Genome of Tanaecium tetragonolobum: The First Bignoniaceae Plastome.

    Directory of Open Access Journals (Sweden)

    Alison Gonçalves Nazareno

    Full Text Available Bignoniaceae is a Pantropical plant family that is especially abundant in the Neotropics. Members of the Bignoniaceae are diverse in many ecosystems and represent key components of the Tropical flora. Despite the ecological importance of the Bignoniaceae and all the efforts to reconstruct the phylogeny of this group, whole chloroplast genome information has not yet been reported for any members of the family. Here, we report the complete chloroplast genome sequence of Tanaecium tetragonolobum (Jacq.) L.G. Lohmann, which was reconstructed using de novo and referenced-based assembly of single-end reads generated by shotgun sequencing of total genomic DNA in an Illumina platform. The gene order and organization of the chloroplast genome of T. tetragonolobum exhibits the general structure of flowering plants, and is similar to other Lamiales chloroplast genomes. The chloroplast genome of T. tetragonolobum is a circular molecule of 153,776 base pairs (bp) with a quadripartite structure containing two single copy regions, a large single copy region (LSC, 84,612 bp) and a small single copy region (SSC, 17,586 bp), separated by inverted repeat regions (IRs, 25,789 bp). In addition, the chloroplast genome of T. tetragonolobum has 38.3% GC content and includes 121 genes, of which 86 are protein-coding, 31 are transfer RNA, and four are ribosomal RNA. The chloroplast genome of T. tetragonolobum presents a total of 47 tandem repeats and 347 simple sequence repeats (SSRs), with mononucleotides being the most common and di-, tri-, tetra-, and hexanucleotides occurring with less frequency. The results obtained here were compared to other chloroplast genomes of Lamiales available to date, providing new insight into the evolution of chloroplast genomes within Lamiales. Overall, the evolutionary rates of genes in Lamiales are lineage-, locus-, and region-specific, indicating that the evolutionary pattern of nucleotide substitution in chloroplast genomes of flowering
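
A figure such as the 38.3% GC content quoted above is the share of G and C among the nucleotides of the assembled sequence. The Python sketch below shows the calculation on a toy string; the function name is hypothetical, and excluding non-ACGT symbols (for example N) from the denominator is an assumption rather than a documented convention of the study.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among unambiguous bases (A, C, G, T)."""
    counted = [base for base in seq.upper() if base in "ACGT"]
    if not counted:
        return 0.0
    gc = sum(base in "GC" for base in counted)
    return 100.0 * gc / len(counted)


# Toy sequence (not real data): 2 of 8 bases are G or C.
print(gc_content("ATGCATAT"))  # 25.0
```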

  13. Comparative analysis of microsatellites in chloroplast genomes of lower and higher plants.

    Science.gov (United States)

    George, Biju; Bhatt, Bhavin S; Awasthi, Mayur; George, Binu; Singh, Achuit K

    2015-11-01

    Microsatellites, or simple sequence repeats (SSRs), contain repetitive DNA sequence where tandem repeats of one to six base pairs are present a number of times. Chloroplast genome sequences have been shown to possess extensive variations in the length, number and distribution of SSRs. However, a comparative analysis of chloroplast microsatellites is not available. Considering their potential importance in generating genomic diversity, we have systematically analysed the abundance and distribution of simple and compound microsatellites in 164 sequenced chloroplast genomes from a wide range of plants. The key findings of these studies are (1) a large number of mononucleotide repeats as compared to SSR(2-6) (di-, tri-, tetra-, penta-, hexanucleotide repeats) are present in all chloroplast genomes investigated, (2) lower plants such as algae show wide variation in relative abundance, density and distribution of microsatellite repeats as compared to flowering plants, (3) longer SSRs are excluded from coding regions of most chloroplast genomes, (4) GC content has a weak influence on number, relative abundance and relative density of mononucleotide as well as SSR(2-6). However, GC content strongly showed negative correlation with relative density (R (2) = 0.5, P plants possesses relatively more genomic diversity compared to higher plants.
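
The definition above (tandem repeats of a one- to six-base motif) maps directly onto a regular-expression search. The Python sketch below finds perfect SSRs only; the minimum copy numbers per motif length are illustrative assumptions, not the thresholds used in the study, and `find_ssrs` is a hypothetical name.

```python
import re

# Minimum number of tandem copies required per motif length (1-6 bp).
# These cutoffs are illustrative assumptions, not taken from the study.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 4, 5: 3, 6: 3}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, copies) for perfect tandem repeats in seq."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # One k-base motif followed by at least n - 1 further copies of it.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA", "ATAT"),
            # since those runs belong to the shorter motif class.
            if any(k % d == 0 and motif == motif[:d] * (k // d) for d in range(1, k)):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // k))
    return hits


# Toy sequence (not real data): a 12-copy A run and a 7-copy AT run.
toy = "GC" + "A" * 12 + "CG" + "AT" * 7 + "GC"
print(find_ssrs(toy))  # [(2, 'A', 12), (16, 'AT', 7)]
```

Compound or imperfect microsatellites, which the abstract also mentions, would need extra logic to merge neighbouring hits and tolerate mismatches.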

  14. Complete Chloroplast Genome Sequence of Dendrobium nobile from Northeastern India

    Science.gov (United States)

    Parameswaran, Sriram; Sundar, Durai

    2016-01-01

    The orchid species Dendrobium nobile belonging to the family Orchidaceae and genus Dendrobium (a vast genus that encompasses nearly 1,200 species) has an herbal medicinal history of about 2000 years in east and south Asian countries. Here, we report the complete chloroplast genome sequence of D. nobile from northeastern India for the first time.

  15. The complete chloroplast genome sequence of Brachypodium distachyon: sequence comparison and phylogenetic analysis of eight grass plastomes

    Directory of Open Access Journals (Sweden)

    Anderson Olin D

    2008-07-01

    Full Text Available Abstract Background Wheat, barley, and rye, of tribe Triticeae in the Poaceae, are among the most important crops worldwide but they present many challenges to genomics-aided crop improvement. Brachypodium distachyon, a close relative of those cereals has recently emerged as a model for grass functional genomics. Sequencing of the nuclear and organelle genomes of Brachypodium is one of the first steps towards making this species available as a tool for researchers interested in cereals biology. Findings The chloroplast genome of Brachypodium distachyon was sequenced by a combinational approach using BAC end and shotgun sequences derived from a selected BAC containing the entire chloroplast genome. Comparative analysis indicated that the chloroplast genome is conserved in gene number and organization with respect to those of other cereals. However, several Brachypodium genes evolve at a faster rate than those in other grasses. Sequence analysis reveals that rice and wheat have a ~2.1 kb deletion in their plastid genomes and this deletion must have occurred independently in both species. Conclusion We demonstrate that BAC libraries can be used to sequence plastid, and likely other organellar, genomes. As expected, the Brachypodium chloroplast genome is very similar to those of other sequenced grasses. The phylogenetic analyses and the pattern of insertions and deletions in the chloroplast genome confirmed that Brachypodium is a close relative of the tribe Triticeae. Nevertheless, we show that some large indels can arise multiple times and may confound phylogenetic reconstruction.

  16. The complete chloroplast genome sequence of Helwingia himalaica (Helwingiaceae, Aquifoliales) and a chloroplast phylogenomic analysis of the Campanulidae

    Directory of Open Access Journals (Sweden)

    Xin Yao

    2016-11-01

    Full Text Available Complete chloroplast genome sequences have been very useful for understanding phylogenetic relationships in angiosperms at the family level and above, but there are currently large gaps in coverage. We report the chloroplast genome for Helwingia himalaica, the first in the distinctive family Helwingiaceae and only the second genus to be sequenced in the order Aquifoliales. We then combine this with 36 published sequences in the large (c. 35,000 species) subclass Campanulidae in order to investigate relationships at the order and family levels. The Helwingia genome consists of 158,362 bp containing a pair of inverted repeat (IR) regions of 25,996 bp separated by a large single-copy (LSC) region and a small single-copy (SSC) region which are 87,810 and 18,560 bp, respectively. There are 142 known genes, including 94 protein-coding genes, eight ribosomal RNA genes, and 40 tRNA genes. The topology of the phylogenetic relationships between Apiales, Asterales, and Dipsacales differed between analyses based on complete genome sequences and on 36 shared protein-coding genes, showing that further studies of campanulid phylogeny are needed.

  17. The first complete chloroplast genome sequence of a lycophyte, Huperzia lucidula (Lycopodiaceae)

    Energy Technology Data Exchange (ETDEWEB)

    Wolf, Paul G.; Karol, Kenneth G.; Mandoli, Dina F.; Kuehl, Jennifer V.; Arumuganathan, K.; Ellis, Mark W.; Mishler, Brent D.; Kelch, Dean G.; Olmstead, Richard G.; Boore, Jeffrey L.

    2005-02-01

    We used a unique combination of techniques to sequence the first complete chloroplast genome of a lycophyte, Huperzia lucidula. This plant belongs to a significant clade hypothesized to represent the sister group to all other vascular plants. We used fluorescence-activated cell sorting (FACS) to isolate the organelles, rolling circle amplification (RCA) to amplify the genome, and shotgun sequencing to 8x depth coverage to obtain the complete chloroplast genome sequence. The genome is 154,373 bp, containing inverted repeats of 15,314 bp each, a large single-copy region of 104,088 bp, and a small single-copy region of 19,671 bp. Gene order is more similar to those of mosses, liverworts, and hornworts than to gene order for other vascular plants. For example, the Huperzia chloroplast genome possesses the bryophyte gene order for a previously characterized 30 kb inversion, thus supporting the hypothesis that lycophytes are sister to all other extant vascular plants. The lycophyte chloroplast genome data also enable a better reconstruction of the basal tracheophyte genome, which is useful for inferring relationships among bryophyte lineages. Several unique characters are observed in Huperzia, such as movement of the gene ndhF from the small single copy region into the inverted repeat. We present several analyses of evolutionary relationships among land plants by using nucleotide data, amino acid sequences, and by comparing gene arrangements from chloroplast genomes. The results, while still tentative pending the large number of chloroplast genomes from other key lineages that are soon to be sequenced, are intriguing in themselves, and contribute to a growing comparative database of genomic and morphological data across the green plants.

  18. A rare case of plastid protein-coding gene duplication in the chloroplast genome of Euglena archaeoplastidiata (Euglenophyta).

    Science.gov (United States)

    Bennett, Matthew S; Shiu, Shin-Han; Triemer, Richard E

    2017-03-12

    Gene duplication is an important evolutionary process that allows duplicate functions to diverge, or, in some cases, allows for new functional gains. However, in contrast to the nuclear genome, gene duplications within the chloroplast are extremely rare. Here, we present the chloroplast genome of the photosynthetic protist Euglena archaeoplastidiata. Upon annotation, it was found that the chloroplast genome contained a novel tandem direct duplication that encoded a portion of RuBisCO large subunit (rbcL) followed by a complete copy of ribosomal protein L32 (rpl32), as well as the associated intergenic sequences. Analyses of the duplicated rpl32 were inconclusive regarding selective pressures, although it was found that substitutions in the duplicated region, all non-synonymous, likely had a neutral functional effect. The duplicated region did not exhibit patterns consistent with previously described mechanisms for tandem direct duplications, and demonstrated an unknown mechanism of duplication. In addition, a comparison of this chloroplast genome to other previously characterized chloroplast genomes from the same family revealed characteristics that indicated E. archaeoplastidiata was probably more closely related to taxa in the genera Monomorphina, Cryptoglena, and Euglenaria than it was to other Euglena taxa. Taken together, the chloroplast genome of E. archaeoplastidiata demonstrated multiple characteristics unique to the euglenoid world, and has justified the longstanding curiosity regarding this enigmatic taxon.

  19. Characterization of mango (Mangifera indica L.) transcriptome and chloroplast genome.

    Science.gov (United States)

    Azim, M Kamran; Khan, Ishtaiq A; Zhang, Yong

    2014-05-01

    We characterized the mango leaf transcriptome and chloroplast genome using next generation DNA sequencing. The RNA-seq output of the mango transcriptome generated >12 million reads (total nucleotides sequenced >1 Gb). De novo transcriptome assembly generated 30,509 unigenes with lengths in the range of 300 to ≥3,000 nt and 67× depth of coverage. Blast searching against nonredundant nucleotide databases and several Viridiplantae genomic datasets annotated 24,593 mango unigenes (80% of total) and identified Citrus sinensis as the closest neighbor of mango with 9,141 (37%) matched sequences. The annotation with gene ontology and Clusters of Orthologous Group terms categorized unigene sequences into 57 and 25 classes, respectively. More than 13,500 unigenes were assigned to 293 KEGG pathways. Besides major plant biology related pathways, KEGG based gene annotation pointed out active presence of an array of biochemical pathways involved in (a) biosynthesis of bioactive flavonoids, flavones and flavonols, (b) biosynthesis of terpenoids and lignins and (c) plant hormone signal transduction. The mango transcriptome sequences revealed 235 proteases belonging to five catalytic classes of proteolytic enzymes. The draft genome of the mango chloroplast (cp) was obtained by a combination of Sanger and next generation sequencing. The draft mango cp genome size is 151,173 bp with a pair of inverted repeats of 27,093 bp separated by small and large single copy regions, respectively. Out of 139 genes in the mango cp genome, 91 were found to be protein coding. Sequence analysis revealed the cp genome of C. sinensis as the closest neighbor of mango. We found 51 short repeats in the mango cp genome that are supposed to be associated with extensive rearrangements. This is the first report of transcriptome and chloroplast genome analysis of any Anacardiaceae family member.

  20. Chloroplast genome sequence of the moss Tortula ruralis: gene content, polymorphism, and structural arrangement relative to other green plant chloroplast genomes

    OpenAIRE

    Wolf Paul G; Everett Karin DE; Mandoli Dina F; Boore Jeffrey L; Kuehl Jennifer V; Mishler Brent D; Murdock Andrew G; Oliver Melvin J; Duffy Aaron M; Karol Kenneth G

    2010-01-01

    Abstract Background Tortula ruralis, a widely distributed species in the moss family Pottiaceae, is increasingly used as a model organism for the study of desiccation tolerance and mechanisms of cellular repair. In this paper, we present the chloroplast genome sequence of T. ruralis, only the second published chloroplast genome for a moss, and the first for a vegetatively desiccation-t...

  1. The first complete chloroplast genome sequences of Ulmus species by de novo sequencing: Genome comparative and taxonomic position analysis

    Science.gov (United States)

    Zhang, Shuang; Yu, Xiao-Yue; Ren, Ya-Chao; Yang, Min-Sheng; Wang, Jin-Mao

    2017-01-01

    Elm (Ulmus) has a long history of use as a high-quality heavy hardwood famous for its resistance to drought, cold, and salt. It grows in temperate, warm temperate, and subtropical regions. This is the first report of Ulmaceae chloroplast genomes by de novo sequencing. The Ulmus chloroplast genomes exhibited a typical quadripartite structure with two single-copy regions (long single copy [LSC] and short single copy [SSC] sections) separated by a pair of inverted repeats (IRs). The lengths of the chloroplast genomes from five Ulmus ranged from 158,953 to 159,453 bp, with the largest observed in Ulmus davidiana and the smallest in Ulmus laciniata. The genomes contained 137–145 protein-coding genes, of which Ulmus davidiana var. japonica and U. davidiana had the most and U. pumila had the fewest. The five Ulmus species exhibited different evolutionary routes, as some genes had been lost. In total, 18 genes contained introns, 13 of which (trnL-TAA+, trnL-TAA−, rpoC1-, rpl2-, ndhA-, ycf1, rps12-, rps12+, trnA-TGC+, trnA-TGC-, trnV-TAC-, trnI-GAT+, and trnI-GAT) were shared among all five species. The intron of ycf1 was the longest (5,675bp) while that of trnF-AAA was the smallest (53bp). All Ulmus species except U. davidiana exhibited the same degree of amplification in the IR region. To determine the phylogenetic positions of the Ulmus species, we performed phylogenetic analyses using common protein-coding genes in chloroplast sequences of 42 other species published in NCBI. The cluster results showed the closest plants to Ulmaceae were Moraceae and Cannabaceae, followed by Rosaceae. Ulmaceae and Moraceae both belonged to Urticales, and the chloroplast genome clustering results were consistent with their traditional taxonomy. The results strongly supported the position of Ulmaceae as a member of the order Urticales. In addition, we found a potential error in the traditional taxonomies of U. davidiana and U. davidiana var. japonica, which should be confirmed with a

  2. The complete chloroplast genomes of Cannabis sativa and Humulus lupulus.

    Science.gov (United States)

    Vergara, Daniela; White, Kristin H; Keepers, Kyle G; Kane, Nolan C

    2016-09-01

    Cannabis and Humulus are sister genera comprising the entirety of the Cannabaceae sensu stricto, including C. sativa L. (marijuana, hemp), and H. lupulus L. (hops) as two economically important crops. These two plants have been used by humans for many purposes including as a fiber, food, medicine, or inebriant in the case of C. sativa, and as a flavoring component in beer brewing in the case of H. lupulus. In this study, we report the complete chloroplast genomes for two distinct hemp varieties of C. sativa, Italian "Carmagnola" and Russian "Dagestani", and one Czech variety of H. lupulus "Saazer". Both C. sativa genomes are 153 871 bp in length, while the H. lupulus genome is 153 751 bp. The genomes from the two C. sativa varieties differ in 16 single nucleotide polymorphisms (SNPs), while the H. lupulus genome differs in 1722 SNPs from both C. sativa cultivars.
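
    The SNP counts quoted above come from comparing aligned chloroplast sequences position by position. A minimal sketch of that kind of count follows, assuming the two sequences are already aligned to equal length; the function name `count_snps` and the toy fragments are hypothetical and are not taken from the study.

```python
def count_snps(seq_a: str, seq_b: str) -> int:
    """Count substitutions between two equal-length aligned sequences.

    Columns where either sequence has a gap ('-') or an ambiguous base ('N')
    are skipped, so indels are not counted as SNPs.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    skip = {"-", "N"}
    return sum(
        1
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if a not in skip and b not in skip and a != b
    )


# Toy aligned fragments (hypothetical, not real Cannabis or Humulus data).
fragment_1 = "ATGGCTAACCTTGA-CGT"
fragment_2 = "ATGGCTGACCTTGATCGA"
print(count_snps(fragment_1, fragment_2))  # 2
```

    Real comparisons would start from a whole-genome alignment produced by a dedicated aligner; this sketch only illustrates the counting step.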

  3. Cloning and molecular genetics analyses of Deschampsia antarctica Desv. chloroplast and mitochondrial DNA sequence

    Directory of Open Access Journals (Sweden)

    O.P. Savchuk

    2012-03-01

    Full Text Available Chloroplast and mitochondrial DNA sequences of Deschampsia antarctica were studied. We compared them with completely sequenced genomes of other temperate plants to identify homology.

  4. Chloroplast genome sequence of the moss Tortula ruralis: gene content, polymorphism, and structural arrangement relative to other green plant chloroplast genomes

    Directory of Open Access Journals (Sweden)

    Wolf Paul G

    2010-02-01

    Full Text Available Abstract Background Tortula ruralis, a widely distributed species in the moss family Pottiaceae, is increasingly used as a model organism for the study of desiccation tolerance and mechanisms of cellular repair. In this paper, we present the chloroplast genome sequence of T. ruralis, only the second published chloroplast genome for a moss, and the first for a vegetatively desiccation-tolerant plant. Results The Tortula chloroplast genome is ~123,500 bp, and differs in a number of ways from that of Physcomitrella patens, the first published moss chloroplast genome. For example, Tortula lacks the ~71 kb inversion found in the large single copy region of the Physcomitrella genome and other members of the Funariales. Also, the Tortula chloroplast genome lacks petN, a gene found in all known land plant plastid genomes. In addition, an unusual case of nucleotide polymorphism was discovered. Conclusions Although the chloroplast genome of Tortula ruralis differs from that of the only other sequenced moss, Physcomitrella patens, we have yet to determine the biological significance of the differences. The polymorphisms we have uncovered in the sequencing of the genome offer a rare possibility (for mosses) of the generation of DNA markers for fine-level phylogenetic studies, or to investigate individual variation within populations.

  5. Complete chloroplast genome of Trachelium caeruleum: extensive rearrangements are associated with repeats and tRNAs

    Energy Technology Data Exchange (ETDEWEB)

    Haberle, Rosemarie C.; Fourcade, Matthew L.; Boore, Jeffrey L.; Jansen, Robert K.

    2006-01-09

    features previously identified through mapping, and discovered many additional structural changes, including several partial to entire gene duplications, deterioration of at least four normally conserved chloroplast genes into gene fragments, and the nature and position of numerous repeat elements at or near inversion endpoints. The focus of this paper is on analyses of sequences at or near these rearrangements in Trachelium caeruleum. Inversions are believed to occur due to the presence of repeat elements subject to homologous recombination (Palmer, 1991; Knox et al., 1993). Repeats may facilitate inversions or other genome rearrangements (Achaz et al., 2003), and higher incidences of repeats have been correlated with greater numbers of rearrangements (Rocha, 2003). Alternatively, repeats may proliferate within a genome as a result of DNA strand repair mechanisms following a rearrangement event such as an inversion. Gene

  6. The complete chloroplast genome of Origanum vulgare L. (Lamiaceae).

    Science.gov (United States)

    Lukas, Brigitte; Novak, Johannes

    2013-10-10

    Oregano (Origanum vulgare L., Lamiaceae) is a medicinal and aromatic plant perhaps best known for flavouring pizza. New applications, e.g. as natural antioxidants for food, are emerging due to the plant's high antibacterial and antioxidant activity. The complete chloroplast (cp) genome of Origanum vulgare (GenBank/EMBL/DDBJ accession number: JX880022) consists of 151,935 bp and includes a pair of inverted repeats (IR) of 25,527 bp separated by one small and one large single copy region (SSC and LSC) of 17,745 and 83,136 bp, respectively. The genome, with an overall GC content of 38%, hosts 114 genes that cover 63% of the genome, of which 8% are introns. The comparison of the Origanum cp genome with the cp genomes of two other core Lamiales (Salvia miltiorrhiza Bunge and Sesamum indicum L.) revealed completely conserved protein-coding regions in the IR region but also in the LSC and SSC regions. Phylogenetic analysis of the lamiids based on 56 protein-coding genes gives a hint at the basic structure of the Lamiales. However, further genomes will be necessary to clarify this taxonomically complicated order. The variability of the cp genome within the genus Origanum, studied exemplarily on 16 different chloroplast DNA regions, demonstrated that in 14 of the regions analyzed the variability was extremely low (max. 0.7%), while only two regions showed a moderate variability of up to 2.3%. The cp genome of Origanum vulgare contains 27 perfect mononucleotide repeats (number of repeats>9) consisting exclusively of the nucleotides A or T. 34 perfect repeats (repeat lengths>1 and number of repeats>3) were found, of which 32 were di-, and 2 were trinucleotide repeats.
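
    The repeat thresholds in this abstract (mononucleotide runs with more than 9 repeats; longer motifs with more than 3 repeats) can be turned into a small scanner. The sketch below is an illustration only: the names `is_primitive` and `find_ssrs`, the `max_unit` cut-off of 3, and the toy sequence are assumptions, and the authors' actual repeat-detection software is not stated here.

```python
def is_primitive(unit: str) -> bool:
    """True if unit is not itself a repetition of a shorter motif ('AT' yes, 'AA' no)."""
    return unit not in (unit + unit)[1:-1]


def find_ssrs(seq: str, min_mono: int = 10, min_multi: int = 4, max_unit: int = 3):
    """Return sorted (start, unit, repeat_count) tuples for perfect SSRs in seq.

    min_mono  : minimum repeats for mononucleotide runs (">9" in the abstract)
    min_multi : minimum repeats for di-/trinucleotide motifs (">3" in the abstract)
    """
    seq = seq.upper()
    hits = []
    for k in range(1, max_unit + 1):
        min_rep = min_mono if k == 1 else min_multi
        i = 0
        while i + k <= len(seq):
            unit = seq[i:i + k]
            if set(unit) <= set("ACGT") and is_primitive(unit):
                n = 1
                # extend the run while the next k bases equal the motif
                while seq[i + n * k:i + (n + 1) * k] == unit:
                    n += 1
                if n >= min_rep:
                    hits.append((i, unit, n))
                    i += n * k  # jump past the recorded repeat
                    continue
            i += 1
    return sorted(hits)


# Toy sequence (hypothetical): A x12, AT x5 and TTC x4 embedded in filler bases.
toy = "GC" + "A" * 12 + "GCGC" + "AT" * 5 + "CCG" + "TTC" * 4 + "G"
print(find_ssrs(toy))  # [(2, 'A', 12), (18, 'AT', 5), (31, 'TTC', 4)]
```

    Start positions are 0-based. Imperfect or compound repeats are not handled by this sketch.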

  7. The complete chloroplast genome sequence of Abies nephrolepis (Pinaceae: Abietoideae)

    Directory of Open Access Journals (Sweden)

    Dong-Keun Yi

    2016-06-01

    Full Text Available The plant chloroplast (cp) genome has maintained a relatively conserved structure and gene content throughout evolution. Cp genome sequences have been used widely for resolving evolutionary and phylogenetic issues at various taxonomic levels of plants. Here, we report the complete cp genome of Abies nephrolepis. The A. nephrolepis cp genome is 121,336 base pairs (bp) in length, including a pair of short inverted repeat regions (IRa and IRb) of 139 bp each, separated by a small single copy (SSC) region of 54,323 bp and a large single copy (LSC) region of 66,735 bp. It contains 114 genes, 68 of which are protein coding genes, 35 tRNA and four rRNA genes, six open reading frames, and one pseudogene. Seventeen repeat units and 64 simple sequence repeats (SSRs) have been detected in the A. nephrolepis cp genome. Large IR sequences are located at the 42-kb inversion points (1186 bp). The A. nephrolepis cp genome is identical to that of Abies koreana, a closely related taxon. Pairwise comparison between the two cp genomes revealed 140 polymorphic sites in each. The complete cp genome sequence of A. nephrolepis has significant potential to provide information on the evolutionary pattern of Abietoideae and valuable data for the development of DNA markers for easy identification and classification.

  8. Analysis of Acorus calamus chloroplast genome and its phylogenetic implications.

    Science.gov (United States)

    Goremykin, Vadim V; Holland, Barbara; Hirsch-Ernst, Karen I; Hellwig, Frank H

    2005-09-01

    Determining the phylogenetic relationships among the major lines of angiosperms is a long-standing problem, yet the uncertainty as to the phylogenetic affinity of these lines persists. While a number of studies have suggested that the ANITA (Amborella-Nymphaeales-Illiciales-Trimeniales-Aristolochiales) grade is basal within angiosperms, studies of complete chloroplast genome sequences also suggested an alternative tree, wherein the line leading to the grasses branches first among the angiosperms. To improve taxon sampling in the existing chloroplast genome data, we sequenced the chloroplast genome of the monocot Acorus calamus. We generated a concatenated alignment (89,436 positions for 15 taxa), encompassing almost all sequences usable for phylogeny reconstruction within spermatophytes. The data still contain support for both the ANITA-basal and grasses-basal hypotheses. Using simulations we can show that were the ANITA-basal hypothesis true, parsimony (and distance-based methods with many models) would be expected to fail to recover it. The self-evident explanation for this failure appears to be a long-branch attraction (LBA) between the clade of grasses and the out-group. However, this LBA cannot explain the discrepancies observed between tree topology recovered using the maximum likelihood (ML) method and the topologies recovered using the parsimony and distance-based methods when grasses are deleted. Furthermore, the fact that neither maximum parsimony nor distance methods consistently recover the ML tree, when according to the simulations they would be expected to, when the out-group (Pinus) is deleted, suggests that either the generating tree is not correct or the best symmetric model is misspecified (or both). We demonstrate that the tree recovered under ML is extremely sensitive to model specification and that the best symmetric model is misspecified. Hence, we remain agnostic regarding phylogenetic relationships among basal angiosperm lineages.

  9. The whole chloroplast genome of wild rice (Oryza australiensis).

    Science.gov (United States)

    Wu, Zhiqiang; Ge, Song

    2016-01-01

    The whole chloroplast genome of wild rice (Oryza australiensis) is characterized in this study. The genome size is 135,224 bp, exhibiting a typical circular structure including a pair of 25,776 bp inverted repeats (IRa,b) separated by a large single-copy region (LSC) of 82,212 bp and a small single-copy region (SSC) of 12,470 bp. The overall GC content of the genome is 38.95%. 110 unique genes were annotated, including 76 protein-coding genes, 4 ribosomal RNA genes, and 30 tRNA genes. Among these, 18 are duplicated in the inverted repeat regions, 13 genes contain one intron, and 2 genes (rps12 and ycf3) have two introns.
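
    Overall GC content, as quoted in this and several other records, is the share of G and C among the called bases. A minimal sketch, assuming ambiguous bases such as N are excluded from the denominator; the function name `gc_content` and the toy input are hypothetical.

```python
def gc_content(seq: str) -> float:
    """Return GC content as a percentage of unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    called = sum(seq.count(base) for base in "ACGT")
    if called == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return 100.0 * (seq.count("G") + seq.count("C")) / called


# Toy fragment (hypothetical, not taken from the Oryza australiensis genome).
print(gc_content("ATGCGCAT"))  # 50.0
```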

  10. The complete chloroplast genome sequence of Dendropanax morbifera (Léveillé).

    Science.gov (United States)

    Kim, Kyunghee; Lee, Sang-Choon; Yang, Tae-Jin

    2016-07-01

    The complete chloroplast genome sequence of Dendropanax morbifera, an economically and medicinally important endemic tree species in Korea, was obtained by de novo assembly with whole-genome sequence data and manual correction. A circular 156 366-bp chloroplast genome showed typical chloroplast genome structure comprising a large single copy region of 86 475 bp, a small single copy region of 18 125 bp, and a pair of inverted repeats of 25 883 bp. The chloroplast genome harbored 87 protein-coding genes. Phylogenetic analysis with the chloroplast genome revealed that D. morbifera is most closely related to Dendropanax dentiger, an evergreen tree species in China and Southeastern Asia.
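
    In a typical quadripartite plastome the total length equals the LSC plus the SSC plus two copies of the IR, which gives a quick consistency check on reported region sizes. The sketch below applies it to the figures in this abstract; the function name `quadripartite_length` is hypothetical.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length (bp) of a circular plastome with one LSC, one SSC and two IR copies."""
    return lsc + ssc + 2 * ir


# Region lengths (bp) as reported for Dendropanax morbifera in the abstract above.
reported_total = 156_366
computed = quadripartite_length(lsc=86_475, ssc=18_125, ir=25_883)
print(computed, computed == reported_total)  # 156366 True
```

    A mismatch in such a check usually points to a transcription error in one of the reported numbers rather than to unusual biology.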

  11. The complete chloroplast genome sequence of an endemic monotypic genus Hagenia (Rosaceae): structural comparative analysis, gene content and microsatellite detection

    Directory of Open Access Journals (Sweden)

    Andrew W. Gichira

    2017-01-01

    Full Text Available Hagenia is an endangered monotypic genus endemic to the tropical mountains of Africa. The only species, Hagenia abyssinica (Bruce) J.F. Gmel, is an important medicinal plant producing bioactive compounds that have been traditionally used by African communities as a remedy for gastrointestinal ailments in both humans and animals. Complete chloroplast genomes have been applied in resolving phylogenetic relationships within plant families. We employed high-throughput sequencing technologies to determine the complete chloroplast genome sequence of H. abyssinica. The genome is a circular molecule of 154,961 base pairs (bp), with a pair of Inverted Repeats (IR) of 25,971 bp each, separated by two single-copy regions: a large (LSC, 84,320 bp) and a small single copy (SSC, 18,696 bp). H. abyssinica's chloroplast genome has a 37.1% GC content and encodes 112 unique genes, 78 of which code for proteins, 30 are tRNA genes and four are rRNA genes. A comparative analysis with twenty other species, sequenced to date from the family Rosaceae, revealed similarities in structural organization, gene content and arrangement. The observed size differences are attributed to the contraction/expansion of the inverted repeats. The translational initiation factor gene (infA), which had been previously reported in other chloroplast genomes, was conspicuously missing in H. abyssinica. A total of 172 microsatellites and 49 large repeat sequences were detected in the chloroplast genome. A Maximum Likelihood analysis of 71 protein-coding genes placed Hagenia in Rosoideae. The availability of a complete chloroplast genome, the first in the Sanguisorbeae tribe, is beneficial for further molecular studies on taxonomic and phylogenomic resolution within the Rosaceae family.

  12. The complete chloroplast genome sequence of an endemic monotypic genus Hagenia (Rosaceae): structural comparative analysis, gene content and microsatellite detection.

    Science.gov (United States)

    Gichira, Andrew W; Li, Zhizhong; Saina, Josphat K; Long, Zhicheng; Hu, Guangwan; Gituru, Robert W; Wang, Qingfeng; Chen, Jinming

    2017-01-01

    Hagenia is an endangered monotypic genus endemic to the tropical mountains of Africa. The only species, Hagenia abyssinica (Bruce) J.F. Gmel, is an important medicinal plant producing bioactive compounds that have been traditionally used by African communities as a remedy for gastrointestinal ailments in both humans and animals. Complete chloroplast genomes have been applied in resolving phylogenetic relationships within plant families. We employed high-throughput sequencing technologies to determine the complete chloroplast genome sequence of H. abyssinica. The genome is a circular molecule of 154,961 base pairs (bp), with a pair of Inverted Repeats (IR) of 25,971 bp each, separated by two single-copy regions: a large (LSC, 84,320 bp) and a small single copy (SSC, 18,696 bp). H. abyssinica's chloroplast genome has a 37.1% GC content and encodes 112 unique genes, 78 of which code for proteins, 30 are tRNA genes and four are rRNA genes. A comparative analysis with twenty other species, sequenced to date from the family Rosaceae, revealed similarities in structural organization, gene content and arrangement. The observed size differences are attributed to the contraction/expansion of the inverted repeats. The translational initiation factor gene (infA), which had been previously reported in other chloroplast genomes, was conspicuously missing in H. abyssinica. A total of 172 microsatellites and 49 large repeat sequences were detected in the chloroplast genome. A Maximum Likelihood analysis of 71 protein-coding genes placed Hagenia in Rosoideae. The availability of a complete chloroplast genome, the first in the Sanguisorbeae tribe, is beneficial for further molecular studies on taxonomic and phylogenomic resolution within the Rosaceae family.

  13. The complete chloroplast genome sequence of an endemic monotypic genus Hagenia (Rosaceae): structural comparative analysis, gene content and microsatellite detection

    Science.gov (United States)

    Saina, Josphat K.; Long, Zhicheng; Hu, Guangwan; Gituru, Robert W.

    2017-01-01

    Hagenia is an endangered monotypic genus endemic to the tropical mountains of Africa. The only species, Hagenia abyssinica (Bruce) J.F. Gmel, is an important medicinal plant producing bioactive compounds that have been traditionally used by African communities as a remedy for gastrointestinal ailments in both humans and animals. Complete chloroplast genomes have been applied in resolving phylogenetic relationships within plant families. We employed high-throughput sequencing technologies to determine the complete chloroplast genome sequence of H. abyssinica. The genome is a circular molecule of 154,961 base pairs (bp), with a pair of Inverted Repeats (IR) of 25,971 bp each, separated by two single-copy regions: a large (LSC, 84,320 bp) and a small single copy (SSC, 18,696 bp). H. abyssinica's chloroplast genome has a 37.1% GC content and encodes 112 unique genes, 78 of which code for proteins, 30 are tRNA genes and four are rRNA genes. A comparative analysis with twenty other species, sequenced to date from the family Rosaceae, revealed similarities in structural organization, gene content and arrangement. The observed size differences are attributed to the contraction/expansion of the inverted repeats. The translational initiation factor gene (infA), which had been previously reported in other chloroplast genomes, was conspicuously missing in H. abyssinica. A total of 172 microsatellites and 49 large repeat sequences were detected in the chloroplast genome. A Maximum Likelihood analysis of 71 protein-coding genes placed Hagenia in Rosoideae. The availability of a complete chloroplast genome, the first in the Sanguisorbeae tribe, is beneficial for further molecular studies on taxonomic and phylogenomic resolution within the Rosaceae family.

  14. Phylogenetic placement of Cynomorium in Rosales inferred from sequences of the inverted repeat region of the chloroplast genome

    Institute of Scientific and Technical Information of China (English)

    Zhi-Hong ZHANG; Chun-Qi LI; Jian-hua LI

    2009-01-01

    Cynomorium is a herbaceous holoparasite that has been placed in Santalales, Saxifragales, Myrtales, or Sapindales. The inverted repeat (IR) region of the chloroplast genome region is slow evolving and, unlike mitochondrial genes, the chloroplast genome experiences few horizontal gene transfers between the host and parasite. Thus, in the present study, we used sequences of the IR region to test the phylogenetic placements of Cynomorium. Phylogenetic analyses of the chloroplast IR sequences generated largely congruent ordinal relationships with those from previous studies of angiosperm phylogeny based on single or multiple genes. Santalales was closely related to Caryophyllales and asterids. Saxifragales formed a clade where Peridiscus was sister to the remainder of the order, whereas Paeonia was sister to the woody clade of Saxifragales. Cynomorium is not closely related to Santalales, Saxifragales, Myrtales, or Sapindales; instead, it is included in Rosales and sister to Rosaceae. The various placements of the holoparasite on the basis of different regions of the mitochondrial genome may indicate the heterogeneous nature of the genome in the parasite. However, it is unlikely that the placement of Cynomorium in Rosales is the result of chloroplast gene transfer because Cynomorium does not parasitize on rosaceous plants and there is no chloroplast gene transfer between Cynomorium and Nitraria, a confirmed host of Cynomorium and a member of Sapindales.

  15. The complete chloroplast genome provides insight into the evolution and polymorphism of Panax ginseng

    Directory of Open Access Journals (Sweden)

    Yongbing Zhao

    2015-01-01

    Full Text Available Panax ginseng C.A. Meyer (P. ginseng) is an important medicinal plant and is often used in traditional Chinese medicine. With next generation sequencing (NGS) technology, we determined the complete chloroplast genome sequences for four Chinese P. ginseng strains, which are Damaya (DMY), Ermaya (EMY), Gaolishen (GLS) and Yeshanshen (YSS). The total chloroplast genome sequence length for DMY, EMY and GLS was 156,354 bp, while that for YSS was 156,355 bp. Comparative genomic analysis of the chloroplast genome sequences indicates that gene content, GC content, and gene order in DMY are quite similar to those of its relative species, and nucleotide sequence diversity of the inverted repeat region (IR) is lower than that of its counterparts, the large single copy region (LSC) and small single copy region (SSC). A comparison among these four P. ginseng strains revealed that the chloroplast genome sequences of DMY, EMY, and GLS were identical and YSS had a 1-bp insertion at base 5472. To further study the heterogeneity in the chloroplast genome during domestication, high-resolution reads were mapped to the genome sequences to investigate the differences at the minor allele level; 208 minor allele sites with minor allele frequencies (MAF) of ≥ 0.05 were identified. The polymorphism site numbers per kb of chloroplast genome sequence for DMY, EMY, GLS, and YSS were 0.74, 0.59, 0.97, and 1.23, respectively. All the minor allele sites were located in the LSC and IR regions, and the four strains showed the same variation types (substitution base or indel) at all identified polymorphism sites. Comparison results of heterogeneity in the chloroplast genome sequences showed that the minor allele sites on the chloroplast genome were undergoing purifying selection to adapt to changing environment during the domestication process. A study of P. ginseng chloroplast genome with particular focus on minor allele sites would aid in investigating the dynamics on the chloroplast genomes and different P. ginseng

  16. Chloroplast genome evolution in early diverged leptosporangiate ferns.

    Science.gov (United States)

    Kim, Hyoung Tae; Chung, Myong Gi; Kim, Ki-Joong

    2014-05-01

    In this study, the chloroplast (cp) genome sequences from three early diverged leptosporangiate ferns were completed and analyzed in order to understand the evolution of the genome of the fern lineages. The complete cp genome sequence of Osmunda cinnamomea (Osmundales) was 142,812 base pairs (bp). The cp genome structure was similar to that of eusporangiate ferns. The gene/intron losses that frequently occurred in the cp genome of leptosporangiate ferns were not found in the cp genome of O. cinnamomea. In addition, putative RNA editing sites in the cp genome were rare in O. cinnamomea, even though the sites were frequently predicted to be present in leptosporangiate ferns. The complete cp genome sequence of Diplopterygium glaucum (Gleicheniales) was 151,007 bp and has a 9.7 kb inversion between the trnL-CAA and trnV-GCA genes when compared to O. cinnamomea. Several repeated sequences were detected around the inversion break points. The complete cp genome sequence of Lygodium japonicum (Schizaeales) was 157,142 bp and a deletion of the rpoC1 intron was detected. This intron loss was shared by all of the studied species of the genus Lygodium. The GC contents and the effective numbers of codons (ENCs) in ferns varied significantly when compared to seed plants. The ENC values of the early diverged leptosporangiate ferns showed intermediate levels between eusporangiate and core leptosporangiate ferns. However, our phylogenetic tree based on all of the cp gene sequences clearly indicated that the cp genome similarities between O. cinnamomea (Osmundales) and eusporangiate ferns are symplesiomorphies, rather than synapomorphies. Therefore, our data are in agreement with the view that Osmundales is a distinct early diverged lineage in the leptosporangiate ferns.

  17. The chloroplast genome of a Symbiodinium sp. clade C3 isolate

    KAUST Repository

    Barbrook, Adrian C.

    2014-01-01

    Dinoflagellate algae of the genus Symbiodinium form important symbioses within corals and other benthic marine animals. Dinoflagellates possess an extremely reduced plastid genome relative to those examined in plants and other algae. In dinoflagellates the plastid genes are located on small plasmids, commonly referred to as 'minicircles'. However, the chloroplast genomes of dinoflagellates have only been extensively characterised from a handful of species. There is also evidence of considerable variation in the chloroplast genome organisation across those species that have been examined. We therefore characterised the chloroplast genome from an environmental coral isolate, in this case containing a symbiont belonging to the Symbiodinium sp. clade C3. The gene content of the genome is well conserved with respect to previously characterised genomes. However, unlike previously characterised dinoflagellate chloroplast genomes we did not identify any 'empty' minicircles. The sequences of this chloroplast genome show a high rate of evolution relative to other algal species. Particularly notable was a surprisingly high level of sequence divergence within the core polypeptides of photosystem I, the reasons for which are currently unknown. This chloroplast genome also possesses distinctive codon usage and GC content. These features suggest that chloroplast genomes in Symbiodinium are highly plastic. © 2013 Adrian C. Barbrook.

  18. Sonication-based isolation and enrichment of Chlorella protothecoides chloroplasts for illumina genome sequencing

    Energy Technology Data Exchange (ETDEWEB)

    Angelova, Angelina [University of Arizona; Park, Sang-Hycuk [University of Arizona; Kyndt, John [Bellevue University; Fitzsimmons, Kevin [University of Arizona; Brown, Judith K [University of Arizona

    2013-09-01

    With the increasing world demand for biofuel, a number of oleaginous algal species are being considered as renewable sources of oil. Chlorella protothecoides Krüger synthesizes triacylglycerols (TAGs) as storage compounds that can be converted into renewable fuel utilizing an anabolic pathway that is poorly understood. The paucity of algal chloroplast genome sequences has been an important constraint to chloroplast transformation and for studying gene expression in TAG pathways. In this study, the intact chloroplasts were released from algal cells using sonication followed by sucrose gradient centrifugation, resulting in a 2.36-fold enrichment of chloroplasts from C. protothecoides, based on qPCR analysis. The C. protothecoides chloroplast genome (cpDNA) was determined using the Illumina HiSeq 2000 sequencing platform and found to be 84,576 bp (about 84.6 kb) in size, with a GC content of 30.8%. This is the first report of an optimized protocol that uses a sonication step, followed by sucrose gradient centrifugation, to release and enrich intact chloroplasts from a microalga (C. protothecoides) of sufficient quality to permit chloroplast genome sequencing with high coverage, while minimizing nuclear genome contamination. The approach is expected to guide chloroplast isolation from other oleaginous algal species for a variety of uses that benefit from enrichment of chloroplasts, ranging from biochemical analysis to genomics studies.

  19. A novel class of heat-responsive small RNAs derived from the chloroplast genome of Chinese cabbage (Brassica rapa)

    Directory of Open Access Journals (Sweden)

    de Ruiter Marjo

    2011-06-01

    Full Text Available Abstract Background Non-coding small RNAs play critical roles in various cellular processes in a wide spectrum of eukaryotic organisms. Their responses to abiotic stress have become a popular topic of economic and scientific importance in biological research. Several studies in recent years have reported a small number of non-coding small RNAs that map to chloroplast genomes. However, it remains uncertain whether small RNAs are generated from the chloroplast genome and how they respond to environmental stress, such as high temperature. Chinese cabbage is an important vegetable crop, and heat stress usually causes great losses in yields and quality. Under heat stress, the leaves become etiolated due to the disruption and disassembly of chloroplasts. In an attempt to determine the heat-responsive small RNAs in the chloroplast genome of Chinese cabbage, we carried out deep sequencing, using heat-treated samples, and analysed the proportion of small RNAs that were matched to the chloroplast genome. Results Deep sequencing provided evidence that a novel subset of small RNAs were derived from the chloroplast genome of Chinese cabbage. The chloroplast small RNAs (csRNAs) include those derived from mRNA, rRNA, tRNA and intergenic RNA. The rRNA-derived csRNAs were preferentially located at the 3'-ends of the rRNAs, while the tRNA-derived csRNAs were mainly located at 5'-termini of the tRNAs. After heat treatment, the abundance of csRNAs decreased in seedlings, except those of 24 nt in length. The novel heat-responsive csRNAs and their locations in the chloroplast were verified by Northern blotting. The regulation of putative target genes by some csRNAs was identified by real-time PCR. Our results reveal that high temperature suppresses the production of some csRNAs, which have potential roles in transcriptional or post-transcriptional regulation. Conclusions In addition to the nucleus, the chloroplast is another important organelle that generates a number of small

  20. The evolution of chloroplast genes and genomes in ferns.

    Science.gov (United States)

    Wolf, Paul G; Der, Joshua P; Duffy, Aaron M; Davidson, Jacob B; Grusz, Amanda L; Pryer, Kathleen M

    2011-07-01

    Most of the publicly available data on chloroplast (plastid) genes and genomes come from seed plants, with relatively little information from their sister group, the ferns. Here we describe several broad evolutionary patterns and processes in fern plastid genomes (plastomes), and we include some new plastome sequence data. We review what we know about the evolutionary history of plastome structure across the fern phylogeny and we compare plastome organization and patterns of evolution in ferns to those in seed plants. A large clade of ferns is characterized by a plastome that has been reorganized with respect to the ancestral gene order (a similar order that is ancestral in seed plants). We review the sequence of inversions that gave rise to this organization. We also explore global nucleotide substitution patterns in ferns versus those found in seed plants across plastid genes, and we review the high levels of RNA editing observed in fern plastomes.

  1. The evolutionary processes of mitochondrial and chloroplast genomes differ from those of nuclear genomes

    Science.gov (United States)

    Korpelainen, Helena

    2004-11-01

    This paper first introduces our present knowledge of the origin of mitochondria and chloroplasts, and the organization and inheritance patterns of their genomes, and then carries on to review the evolutionary processes influencing mitochondrial and chloroplast genomes. The differences in evolutionary phenomena between the nuclear and cytoplasmic genomes are highlighted. It is emphasized that varying inheritance patterns and copy numbers among different types of genomes, and the potential advantage achieved through the transfer of many cytoplasmic genes to the nucleus, have important implications for the evolution of nuclear, mitochondrial and chloroplast genomes. Cytoplasmic genes transferred to the nucleus have joined the more strictly controlled genetic system of the nuclear genome, including also sexual recombination, while genes retained within the cytoplasmic organelles can be involved in selection and drift processes both within and among individuals. Within-individual processes can be either intra- or intercellular. In the case of heteroplasmy, which is attributed to mutations or biparental inheritance, within-individual selection on cytoplasmic DNA may provide a mechanism by which the organism can adapt rapidly. The inheritance of cytoplasmic genomes is not universally maternal. The presence of a range of inheritance patterns indicates that different strategies have been adopted by different organisms. On the other hand, the variability occasionally observed in the inheritance mechanisms of cytoplasmic genomes reduces heritability and increases environmental components in phenotypic features and, consequently, decreases the potential for adaptive evolution.

  2. DEEP DIVISION IN THE CHLOROPHYCEAE (CHLOROPHYTA) REVEALED BY CHLOROPLAST PHYLOGENOMIC ANALYSES.

    Science.gov (United States)

    Turmel, Monique; Brouard, Jean-Simon; Gagnon, Cédric; Otis, Christian; Lemieux, Claude

    2008-06-01

    The Chlorophyceae (sensu Mattox and Stewart) is a morphologically diverse class of the Chlorophyta displaying biflagellate and quadriflagellate motile cells with varying configurations of the flagellar apparatus. Phylogenetic analyses of 18S rDNA data and combined 18S and 26S rDNA data from a broad range of chlorophycean taxa uncovered five major monophyletic groups (Chlamydomonadales, Sphaeropleales, Oedogoniales, Chaetophorales, and Chaetopeltidales) but could not resolve their branching order. To gain insight into the interrelationships of these groups, we analyzed multiple genes encoded by the chloroplast genomes of Chlamydomonas reinhardtii P. A. Dang. and Chlamydomonas moewusii Gerloff (Chlamydomonadales), Scenedesmus obliquus (Turpin) Kütz. (Sphaeropleales), Oedogonium cardiacum Wittr. (Oedogoniales), Stigeoclonium helveticum Vischer (Chaetophorales), and Floydiella terrestris (Groover et Hofstetter) Friedl et O'Kelly (Chaetopeltidales). The C. moewusii, Oedogonium, and Floydiella chloroplast DNAs were partly sequenced using a random strategy. Trees were reconstructed from nucleotide and amino acid data sets derived from 44 protein-coding genes of 11 chlorophytes and nine streptophytes as well as from 57 protein-coding genes of the six chlorophycean taxa. All best trees identified two robustly supported major lineages within the Chlorophyceae: a clade uniting the Chlamydomonadales and Sphaeropleales, and a clade uniting the Oedogoniales, Chaetophorales, and Chaetopeltidales (OCC clade). This dichotomy is independently supported by molecular signatures in chloroplast genes, such as insertions/deletions and the distribution of trans-spliced group II introns. Within the OCC clade, the sister relationship observed for the Chaetophorales and Chaetopeltidales is also strengthened by independent data. Character state reconstruction of basal body orientation allowed us to refine hypotheses regarding the evolution of the flagellar apparatus.

  3. Chloroplast Genome Evolution in Actinidiaceae: clpP Loss, Heterogenous Divergence and Phylogenomic Practice.

    Science.gov (United States)

    Wang, Wen-Cai; Chen, Si-Yun; Zhang, Xian-Zhi

    2016-01-01

    Actinidiaceae is a well-known, economically important plant family in the asterids. To elucidate chloroplast (cp) genome evolution within this family, we present complete genomes of three species from two sister genera (Clematoclethra and Actinidia) in the Actinidiaceae, obtained via the genome skimming technique. Comparative analyses revealed that genome structure and content were rather conserved among the three cp genomes in spite of different inheritance patterns, i.e., paternal in Actinidia and maternal in Clematoclethra. The clpP gene was lacking in all three sequenced cp genomes examined here, indicating that clpP gene loss is likely a conspicuous synapomorphic characteristic of cp genome evolution in Actinidiaceae. Comprehensive sequence comparisons of Actinidiaceae cp genomes uncovered apparently heterogeneous divergence patterns among the cpDNA regions, suggesting that a data-partitioned analysis is preferable for cp phylogenomics. Twenty non-coding cpDNA loci with fast evolutionary rates are further identified as potential molecular markers for systematics studies of Actinidiaceae. Moreover, the cp phylogenomic analyses including 31 angiosperm plastomes strongly supported the monophyly of Actinidia, which is sister to Clematoclethra in Actinidiaceae, a family located in the basal asterids, Ericales.

  4. Structure and organization of Marchantia polymorpha chloroplast genome. I. Cloning and gene identification.

    Science.gov (United States)

    Ohyama, K; Fukuzawa, H; Kohchi, T; Sano, T; Sano, S; Shirai, H; Umesono, K; Shiki, Y; Takeuchi, M; Chang, Z

    1988-09-20

    We have determined the complete nucleotide sequence of chloroplast DNA from a liverwort, Marchantia polymorpha, using a clone bank of chloroplast DNA fragments. The circular genome consists of 121,024 base-pairs and includes two large inverted repeats (IRA and IRB, each 10,058 base-pairs), a large single-copy region (LSC, 81,095 base-pairs), and a small single-copy region (SSC, 19,813 base-pairs). The nucleotide sequence was analysed with a computer to deduce the entire gene organization, assuming the universal genetic code and the presence of introns in the coding sequences. We detected 136 possible genes, 103 gene products of which are related to known stable RNA or protein molecules. Stable RNA genes for four species of ribosomal RNA and 32 species of tRNA were located, although one of the tRNA genes may be defective. Twenty genes encoding polypeptides involved in photosynthesis and electron transport were identified by comparison with known chloroplast genes. Twenty-five open reading frames (ORFs) show structural similarities to Escherichia coli RNA polymerase subunits, 19 ribosomal proteins and two related proteins. Seven ORFs are comparable with human mitochondrial NADH dehydrogenase genes. A computer-aided homology search predicted possible chloroplast homologues of bacterial proteins: two ORFs for bacterial 4Fe-4S-type ferredoxin, two for distinct subunits of a protein-dependent transport system, one ORF for a component of nitrogenase, and one for an antenna protein of a light-harvesting complex. The other 33 ORFs, consisting of 29 to 2136 codons, remain to be identified, but some of them seem to be conserved in evolution. Detailed information on gene identification is presented in the accompanying papers. We postulated that there were 22 introns in 20 genes (8 tRNA genes and 12 ORFs), which may be classified into the groups I and II found in fungal mitochondrial genes.
The structural gene for ribosomal protein S12 is trans-split on the opposite DNA strand

  5. The complete chloroplast genome sequence of Dianthus superbus var. longicalycinus.

    Science.gov (United States)

    Gurusamy, Raman; Lee, Do-Hyung; Park, SeonJoo

    2016-05-01

    The complete chloroplast genome (cpDNA) sequence of Dianthus superbus var. longicalycinus, an economically important traditional Chinese medicine, was reported and characterized. The cpDNA of Dianthus superbus var. longicalycinus is 149,539 bp, with 36.3% GC content. A pair of inverted repeats (IRs) of 24,803 bp is separated by a large single-copy region (LSC, 82,805 bp) and a small single-copy region (SSC, 17,128 bp). It encodes 85 protein-coding genes, 36 tRNA genes and 8 rRNA genes. Of 129 individual genes, 13 genes contain one intron and three genes contain two introns.
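The region lengths in records like the one above can be cross-checked with simple arithmetic: in a quadripartite plastome the total length should equal LSC + SSC + 2 × IR. The following is a minimal sketch of that check in Python; the function name `quadripartite_length` is an illustrative assumption, and the numbers are taken from the Dianthus and Marchantia records in this listing.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total plastome length: LSC + SSC + two copies of the inverted repeat."""
    return lsc + ssc + 2 * ir


# Region lengths (bp) as reported in the Dianthus superbus record.
dianthus_total = quadripartite_length(lsc=82_805, ssc=17_128, ir=24_803)

# Region lengths (bp) as reported in the Marchantia polymorpha record.
marchantia_total = quadripartite_length(lsc=81_095, ssc=19_813, ir=10_058)

print(dianthus_total)    # 149539, matching the reported genome size
print(marchantia_total)  # 121024, matching the reported genome size
```

A mismatch between the computed and reported totals would usually point to a transcription error in one of the region lengths rather than to a biological feature.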

  6. The complete chloroplast genome sequence of Ampelopsis: gene organization, comparative analysis and phylogenetic relationships to other angiosperms

    Directory of Open Access Journals (Sweden)

    Gurusamy Raman

    2016-03-01

    Full Text Available Ampelopsis brevipedunculata is an economically important plant that belongs to the Vitaceae family of angiosperms. The phylogenetic placement of Vitaceae is still unresolved. Recent phylogenetic studies suggested that it should be placed in various alternative families including Caryophyllaceae, asteraceae, Saxifragaceae, Dilleniaceae, or with the rest of the rosid families. However, these analyses provided weak supportive results because they were based on only one of several genes. Accordingly, complete chloroplast genome sequences are required to resolve the phylogenetic relationships among angiosperms. Recent phylogenetic analyses based on the complete chloroplast genome sequence suggested strong support for the position of Vitaceae as the earliest diverging lineage of rosids and placed it as a sister to the remaining rosids. These studies also revealed relationships among several major lineages of angiosperms; however, they highlighted the significance of taxon sampling for obtaining accurate phylogenies. In the present study, we sequenced the complete chloroplast genome of A. brevipedunculata and used these data to assess the relationships among 32 angiosperms, including 18 taxa of rosids. The Ampelopsis chloroplast genome is 161,090 bp in length, and includes a pair of inverted repeats of 26,394 bp that are separated by small and large single copy regions of 19,036 bp and 89,266 bp, respectively. The gene content and order of Ampelopsis is identical to many other unrearranged angiosperm chloroplast genomes, including Vitis and tobacco. A phylogenetic tree constructed based on 70 protein-coding genes of 33 angiosperms showed that both Saxifragales and Vitaceae diverged from the rosid clade and formed two clades with 100% bootstrap value. The position of the Vitaceae is sister to Saxifragales, and both are the basal and earliest diverging lineages. Moreover, Saxifragales forms a sister clade to Vitaceae of rosids. Overall, the results of

  7. A Phylogenetic Analysis of Chloroplast Genomes Elucidates the Relationships of the Six Economically Important Brassica Species Comprising the Triangle of U

    Science.gov (United States)

    Li, Peirong; Zhang, Shujiang; Li, Fei; Zhang, Shifan; Zhang, Hui; Wang, Xiaowu; Sun, Rifei; Bonnema, Guusje; Borm, Theo J. A.

    2017-01-01

    The Brassica genus comprises many economically important worldwide cultivated crops. The well-established model of the Brassica genus, U’s triangle, consists of three basic diploid plant species (Brassica rapa, Brassica oleracea, and Brassica nigra) and three amphidiploid species (Brassica napus, Brassica juncea, and Brassica carinata) that arose through interspecific hybridizations. Despite being extensively studied because of its commercial relevance, several aspects of the origin of the Brassica species and the relationships within and among these six species still remain open questions. Here, we successfully de novo assembled 60 complete chloroplast genomes of Brassica genotypes of all six species. A complete map of the single nucleotide variants and insertions and deletions in the chloroplast genomes of different Brassica species was produced. The chloroplast genome consists of a Large and a Small Single Copy (LSC and SSC) region between two inverted repeats, and while these regions of chloroplast genomes have very different molecular evolutionary rates, phylogenetic analyses of different regions yielded no contradicting topologies and separated the Brassica genus into four clades. B. carinata and B. juncea share their chloroplast genome with one of their hybridization donors B. nigra and B. rapa, respectively, which fits the U model. B. rapa, surprisingly, shows evidence of two types of chloroplast genomes, with one type specific to some Italian broccoletto accessions. B. napus clearly has evidence for two independent hybridization events, as it contains either B. rapa chloroplast genomes. The divergence estimation suggests that B. nigra and B. carinata diverged from the main Brassica clade 13.7 million years ago (Mya), while B. rapa and B. oleracea diverged at 2.18 Mya. The use of the complete chloroplast DNA sequence not only provides insights into comparative genome analysis but also paves the way for a better understanding of the phylogenetic

  8. Insights from the complete chloroplast genome into the evolution of Sesamum indicum L.

    Directory of Open Access Journals (Sweden)

    Haiyang Zhang

    Full Text Available Sesame (Sesamum indicum L.) is one of the oldest oilseed crops. In order to investigate the evolutionary characters according to the Sesame Genome Project, apart from sequencing its nuclear genome, we sequenced the complete chloroplast genome of S. indicum cv. Yuzhi 11 (white seeded) using Illumina and 454 sequencing. Comparisons of chloroplast genomes between S. indicum and the 18 other higher plants were then analyzed. The chloroplast genome of cv. Yuzhi 11 contains 153,338 bp and a total of 114 unique genes (KC569603). The number of chloroplast genes in sesame is the same as that in Nicotiana tabacum, Vitis vinifera and Platanus occidentalis. The variation in the length of the large single-copy (LSC) regions and inverted repeats (IR) in sesame compared to 18 other higher plant species was the main contributor to size variation in the cp genome in these species. The 77 functional chloroplast genes, except for ycf1 and ycf2, were highly conserved. The deletion of the cp ycf1 gene sequence in cp genomes may be due either to its transfer to the nuclear genome, as has occurred in sesame, or direct deletion, as has occurred in Panax ginseng and Cucumis sativus. The sesame ycf2 gene is only 5,721 bp in length and has lost about 1,179 bp. Nucleotides 1-585 of ycf2 when queried in BLAST had hits in the sesame draft genome. Five repeats (R10, R12, R13, R14 and R17) were unique to the sesame chloroplast genome. We also found that IR contraction/expansion in the cp genome alters its rate of evolution. Chloroplast genes and repeats display the signature of convergent evolution in sesame and other species. These findings provide a foundation for further investigation of cp genome evolution in Sesamum and other higher plants.

  9. The complete chloroplast DNA sequence of the green alga Oltmannsiellopsis viridis reveals a distinctive quadripartite architecture in the chloroplast genome of early diverging ulvophytes

    Directory of Open Access Journals (Sweden)

    Lemieux Claude

    2006-02-01

    Oltmannsiellopsis cpDNA more closely resembles that of Chlorella (Trebouxiophyceae) cpDNA. Conclusion The chloroplast genome of the last common ancestor of Oltmannsiellopsis and Pseudendoclonium contained a minimum of 108 genes, carried only a few group I introns, and featured a distinctive quadripartite architecture. Numerous changes were experienced by the chloroplast genome in the lineages leading to Oltmannsiellopsis and Pseudendoclonium. Our comparative analyses of chlorophyte cpDNAs support the notion that the Ulvophyceae is sister to the Chlorophyceae.

  10. The complete chloroplast genome sequence of Cephalotaxus oliveri (Cephalotaxaceae): evolutionary comparison of Cephalotaxus chloroplast DNAs and insights into the loss of inverted repeat copies in gymnosperms.

    Science.gov (United States)

    Yi, Xuan; Gao, Lei; Wang, Bo; Su, Ying-Juan; Wang, Ting

    2013-01-01

    We have determined the complete chloroplast (cp) genome sequence of Cephalotaxus oliveri. The genome is 134,337 bp in length, encodes 113 genes, and lacks inverted repeat (IR) regions. Genome-wide mutational dynamics have been investigated through comparative analysis of the cp genomes of C. oliveri and C. wilsoniana. Gene order transformation analyses indicate that when distinct isomers are considered as alternative structures for the ancestral cp genome of cupressophyte and Pinaceae lineages, it is not possible to distinguish between hypotheses favoring retention of the same IR region in cupressophyte and Pinaceae cp genomes from a hypothesis proposing independent loss of IRA and IRB. Furthermore, in cupressophyte cp genomes, the highly reduced IRs are replaced by short repeats that have the potential to mediate homologous recombination, analogous to the situation in Pinaceae. The importance of repeats in the mutational dynamics of cupressophyte cp genomes is also illustrated by the accD reading frame, which has undergone extreme length expansion in cupressophytes. This has been caused by a large insertion comprising multiple repeat sequences. Overall, we find that the distribution of repeats, indels, and substitutions is significantly correlated in Cephalotaxus cp genomes, consistent with a hypothesis that repeats play a role in inducing substitutions and indels in conifer cp genomes.

  11. The complete chloroplast genome of two Brassica species, Brassica nigra and B. oleracea.

    Science.gov (United States)

    Seol, Young-Joo; Kim, Kyunghee; Kang, Sang-Ho; Perumal, Sampath; Lee, Jonghoon; Kim, Chang-Kug

    2017-03-01

    The two Brassica species, Brassica nigra and Brassica oleracea, are important agronomic crops. The chloroplast genome sequences were generated by de novo assembly using whole genome next-generation sequences. The chloroplast genomes of B. nigra and B. oleracea were 153 633 bp and 153 366 bp in size, respectively, and showed conserved typical chloroplast structure. Both chloroplast genomes contained a total of 114 genes including 80 protein-coding genes, 30 tRNA genes, and 4 rRNA genes. Phylogenetic analysis revealed that B. oleracea is closely related to B. rapa and B. napus but B. nigra is more diverse than the neighbor species Raphanus sativus.

  12. Diversification and genetic differentiation of cultivated melon inferred from sequence polymorphism in the chloroplast genome

    OpenAIRE

    Tanaka, Katsunori; Akashi, Yukari; FUKUNAGA, Kenji; Yamamoto, Tatsuya; Aierken, Yasheng; Nishida, Hidetaka; Long, Chun Lin; Yoshino, Hiromichi; Sato, Yo-Ichiro; KATO, Kenji

    2013-01-01

    Molecular analysis encouraged discovery of genetic diversity and relationships of cultivated melon (Cucumis melo L.). We sequenced nine inter- and intra-genic regions of the chloroplast genome, about 5500 bp, using 60 melon accessions and six reference accessions of wild species of Cucumis to show intra-specific variation of the chloroplast genome. Sequence polymorphisms were detected among melon accessions and other Cucumis species, indicating intra-specific diversification of the chloroplas...

  13. Complete Chloroplast Genome Sequence of Omani Lime (Citrus aurantiifolia) and Comparative Analysis within the Rosids

    OpenAIRE

    Huei-Jiun Su; Hogenhout, Saskia A.; Al-Sadi, Abdullah M.; Chih-Horng Kuo

    2014-01-01

    The genus Citrus contains many economically important fruits that are grown worldwide for their high nutritional and medicinal value. Due to frequent hybridizations among species and cultivars, the exact number of natural species and the taxonomic relationships within this genus are unclear. To compare the differences between the Citrus chloroplast genomes and to develop useful genetic markers, we used a reference-assisted approach to assemble the complete chloroplast genome of Omani lime (C....

  14. Patterns of synonymous codon usage bias in chloroplast genomes of seed plants

    Institute of Scientific and Technical Information of China (English)

    2008-01-01

    Codon usage in the chloroplast genomes of six seed plants (Arabidopsis thaliana, Populus alba, Zea mays, Triticum aestivum, Pinus koraiensis and Cycas taitungensis) was analyzed to find general patterns of codon usage in chloroplast genomes of seed plants. The results show that chloroplast genomes of the six seed plants had similar codon usage patterns, with a strong bias towards a high representation of NNA and NNT codons. In chloroplast genomes of the six seed plants, the effective number of codons (ENC) for most genes was similar to the expected ENC based on the GC content at the third codon position, but several genes with low ENC values lay below the expected curve. All of these data indicate that codon usage was dominated by a mutational bias in chloroplast genomes of seed plants and that selection appeared to be limited to a subset of genes and to only subtly affect codon usage. In addition, four, six, eight, nine, ten and 12 codons were defined as the optimal codons in chloroplast genomes of the six seed plants.
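The "expected ENC" comparison described above can be illustrated with a short Python sketch. It assumes that the expected curve is the standard null expectation from Wright (1990), ENC = 2 + s + 29 / (s² + (1 − s)²) with s the GC fraction at third codon positions; the record itself does not state which formula was used. The function names and the `tolerance` parameter are hypothetical.

```python
def expected_enc(gc3: float) -> float:
    """Expected ENC when codon usage is shaped only by GC3 composition.

    Assumes Wright's (1990) null curve; gc3 is a fraction in [0, 1].
    """
    if not 0.0 <= gc3 <= 1.0:
        raise ValueError("gc3 must be a fraction between 0 and 1")
    return 2.0 + gc3 + 29.0 / (gc3 ** 2 + (1.0 - gc3) ** 2)


def lies_below_curve(observed_enc: float, gc3: float, tolerance: float = 0.0) -> bool:
    """True if a gene's observed ENC falls below the expected curve.

    `tolerance` is a hypothetical margin for ignoring small deviations.
    """
    return observed_enc < expected_enc(gc3) - tolerance


print(expected_enc(0.5))  # 60.5
print(expected_enc(0.0))  # 31.0
```

Genes for which `lies_below_curve` returns True are candidates for codon bias beyond what nucleotide composition alone would predict, which is the pattern the abstract attributes to a limited subset of genes.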

  15. The Complete Chloroplast Genome Sequences of the Medicinal Plant Pogostemon cablin

    Directory of Open Access Journals (Sweden)

    Yang He

    2016-06-01

    Full Text Available Pogostemon cablin, the natural source of patchouli alcohol, is an important herb in the Lamiaceae family. Here, we present the entire chloroplast genome of P. cablin. This genome, with 38.24% GC content, is 152,460 bp in length. The genome presents a typical quadripartite structure with two inverted repeats (each 25,417 bp in length), separated by one small and one large single-copy region (17,652 and 83,974 bp in length, respectively). The chloroplast genome encodes 127 genes, of which 107 genes are single-copy, including 79 protein-coding genes, four rRNA genes, and 24 tRNA genes. The genome structure, GC content, and codon usage of this chloroplast genome are similar to those of other species in the family, except that it encodes fewer protein-coding genes and tRNA genes. Phylogenetic analysis reveals that P. cablin diverged from the Scutellarioideae clade about 29.45 million years ago (Mya). Furthermore, most of the simple sequence repeats (SSRs) are short polyadenine or polythymine repeats that contribute to high AT content in the chloroplast genome. Complete sequences and annotation of the P. cablin chloroplast genome will facilitate phylogenetic, population and genetic engineering research involving this particular species.
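The polyadenine and polythymine SSRs mentioned above are the simplest repeat class to locate. The Python sketch below uses a regular expression to find runs of A or T at or above a length threshold; the function name and the default `min_len` of 10 are assumptions for illustration, not the criteria used in the study, and the input sequence is a made-up toy string.

```python
import re


def find_at_homopolymers(seq: str, min_len: int = 10) -> list[tuple[int, str]]:
    """Return (0-based start, repeat) for every run of A or T of at least min_len."""
    pattern = re.compile(r"A{%d,}|T{%d,}" % (min_len, min_len))
    return [(m.start(), m.group()) for m in pattern.finditer(seq.upper())]


# Toy sequence: a 12-nt poly-A run, a 9-nt poly-T run (too short), a 10-nt poly-T run.
toy = "GC" + "A" * 12 + "GG" + "T" * 9 + "C" + "T" * 10
for start, repeat in find_at_homopolymers(toy):
    print(start, len(repeat), repeat[0])
```

Dedicated SSR tools also report di- to hexanucleotide motifs with motif-specific thresholds; this sketch covers only mononucleotide A/T runs.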

  16. The complete chloroplast genome sequences for four Amaranthus species (Amaranthaceae)

    Science.gov (United States)

    Chaney, Lindsay; Mangelson, Ryan; Ramaraj, Thiruvarangan; Jellen, Eric N.; Maughan, Peter J.

    2016-01-01

    Premise of the study: The amaranth genus contains many important grain and weedy species. We further our understanding of the genus through the development of a complete reference chloroplast genome. Methods and Results: A high-quality Amaranthus hypochondriacus (Amaranthaceae) chloroplast genome assembly was developed using long-read technology. This reference genome was used to reconstruct the chloroplast genomes for two closely related grain species (A. cruentus and A. caudatus) and their putative progenitor (A. hybridus). The reference genome was 150,518 bp and possesses a circular structure of two inverted repeats (24,352 bp) separated by small (17,941 bp) and large (83,873 bp) single-copy regions; it encodes 111 genes, 72 for proteins. Relative to the reference chloroplast genome, an average of 210 single-nucleotide polymorphisms (SNPs) and 122 insertion/deletion polymorphisms (indels) were identified across the analyzed genomes. Conclusions: This reference chloroplast genome, along with the reported simple sequence repeats, SNPs, and indels, is an invaluable genetic resource for studying the phylogeny and genetic diversity within the amaranth genus. PMID:27672525

  17. CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences

    Directory of Open Access Journals (Sweden)

    Liu Chang

    2012-12-01

    Full Text Available Abstract Background The complete sequences of chloroplast genomes provide a wealth of information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, and powerful computational tools for annotating the genome sequences are urgently needed. Results We have developed a web server, CPGAVAS. The server accepts a complete chloroplast genome sequence as input. First, it predicts protein-coding and rRNA genes based on the identification and mapping of the most similar, full-length protein, cDNA and rRNA sequences by integrating results from Blastx, Blastn, protein2genome and est2genome programs. Second, tRNA genes and inverted repeats (IR) are identified using tRNAscan, ARAGORN and vmatch, respectively. Third, it calculates the summary statistics for the annotated genome. Fourth, it generates a circular map ready for publication. Fifth, it can create a Sequin file for GenBank submission. Last, it allows the extraction of protein and mRNA sequences for a given list of genes and species. The annotation results in GFF3 format can be edited using any compatible annotation editing tools. The edited annotations can then be uploaded to CPGAVAS for update and re-analysis repeatedly. Using known chloroplast genome sequences as a test set, we show that CPGAVAS performs comparably to another application, DOGMA, while having several superior functionalities. Conclusions CPGAVAS allows the semi-automatic and complete annotation of a chloroplast genome sequence, and the visualization, editing and analysis of the annotation results. It will become an indispensable tool for researchers studying chloroplast genomes. The software is freely accessible from http://www.herbalgenomics.org/cpgavas.

  18. Chloroplast genome sequence confirms distinctness of Australian and Asian wild rice.

    Science.gov (United States)

    Waters, Daniel L E; Nock, Catherine J; Ishikawa, Ryuji; Rice, Nicole; Henry, Robert J

    2012-01-01

    Cultivated rice (Oryza sativa) is an AA genome Oryza species that was most likely domesticated from wild populations of O. rufipogon in Asia. O. rufipogon and O. meridionalis are the only AA genome species found within Australia and occur as widespread populations across northern Australia. The chloroplast genome sequence of O. rufipogon from Asia and Australia and O. meridionalis and O. australiensis (an Australian member of the genus very distant from O. sativa) was obtained by massively parallel sequencing and compared with the chloroplast genome sequence of domesticated O. sativa. Oryza australiensis differed at more than 850 sites (single nucleotide polymorphism or indel) from each of the other samples. The other wild rice species had only around 100 differences relative to cultivated rice. The chloroplast genomes of Australian O. rufipogon and O. meridionalis were closely related with only 32 differences. The Asian O. rufipogon chloroplast genome (with only 68 differences) was closer to O. sativa than the Australian taxa (both with more than 100 differences). The chloroplast sequences emphasize the genetic distinctness of the Australian populations and their potential as a source of novel rice germplasm. The Australian O. rufipogon may be a perennial form of O. meridionalis.

  19. The Complete Chloroplast Genome of the Hare’s Ear Root, Bupleurum falcatum: Its Molecular Features

    Science.gov (United States)

    Shin, Dong-Ho; Lee, Jeong-Hoon; Kang, Sang-Ho; Ahn, Byung-Ohg; Kim, Chang-Kug

    2016-01-01

    Bupleurum falcatum, which belongs to the family Apiaceae, has long been applied for curative treatments, especially as a liver tonic, in herbal medicine. The chloroplast (cp) genome has been an ideal model to perform the evolutionary and comparative studies because of its highly conserved features and simple structure. The Apiaceae family is taxonomically close to the Araliaceae family and there have been numerous complete chloroplast genome sequences reported in the Araliaceae family, while little is known about the Apiaceae family. In this study, the complete sequence of the B. falcatum chloroplast genome was obtained. The full-length of the cp genome is 155,989 nucleotides with a 37.66% overall guanine-cytosine (GC) content and shows a quadripartite structure composed of three nomenclatural regions: a large single-copy (LSC) region, a small single-copy (SSC) region, and a pair of inverted repeat (IR) regions. The genome occupancy is 85,912-bp, 17,517-bp, and 26,280-bp for LSC, SSC, and IR, respectively. B. falcatum was shown to contain 111 unique genes (78 for protein-coding, 29 for tRNAs, and four for rRNAs, respectively) on its chloroplast genome. Genic comparison found that B. falcatum has no pseudogenes and has two gene losses, accD in the LSC and ycf15 in the IRs. A total of 55 unique tandem repeat sequences were detected in the B. falcatum cp genome. This report is the first to describe the complete chloroplast genome sequence in B. falcatum and will open up further avenues of research to understand the evolutionary panorama and the chloroplast genome conformation in related plant species. PMID:27187480

  20. The Complete Chloroplast Genome of the Hare’s Ear Root, Bupleurum falcatum: Its Molecular Features

    Directory of Open Access Journals (Sweden)

    Dong-Ho Shin

    2016-05-01

    Full Text Available Bupleurum falcatum, which belongs to the family Apiaceae, has long been applied for curative treatments, especially as a liver tonic, in herbal medicine. The chloroplast (cp) genome has been an ideal model to perform the evolutionary and comparative studies because of its highly conserved features and simple structure. The Apiaceae family is taxonomically close to the Araliaceae family and there have been numerous complete chloroplast genome sequences reported in the Araliaceae family, while little is known about the Apiaceae family. In this study, the complete sequence of the B. falcatum chloroplast genome was obtained. The full-length of the cp genome is 155,989 nucleotides with a 37.66% overall guanine-cytosine (GC) content and shows a quadripartite structure composed of three nomenclatural regions: a large single-copy (LSC) region, a small single-copy (SSC) region, and a pair of inverted repeat (IR) regions. The genome occupancy is 85,912-bp, 17,517-bp, and 26,280-bp for LSC, SSC, and IR, respectively. B. falcatum was shown to contain 111 unique genes (78 for protein-coding, 29 for tRNAs, and four for rRNAs, respectively) on its chloroplast genome. Genic comparison found that B. falcatum has no pseudogenes and has two gene losses, accD in the LSC and ycf15 in the IRs. A total of 55 unique tandem repeat sequences were detected in the B. falcatum cp genome. This report is the first to describe the complete chloroplast genome sequence in B. falcatum and will open up further avenues of research to understand the evolutionary panorama and the chloroplast genome conformation in related plant species.

  1. Genome Sequences of Populus tremula Chloroplast and Mitochondrion: Implications for Holistic Poplar Breeding.

    Directory of Open Access Journals (Sweden)

    Birgit Kersten

    Full Text Available Complete Populus genome sequences are available for the nucleus (P. trichocarpa; section Tacamahaca) and for chloroplasts (seven species), but not for mitochondria. Here, we provide the complete genome sequences of the chloroplast and the mitochondrion for the clones P. tremula W52 and P. tremula x P. alba 717-1B4 (section Populus). The organization of the chloroplast genomes of both Populus clones is described. A phylogenetic tree constructed from all available complete chloroplast DNA sequences of Populus was not congruent with the assignment of the related species to different Populus sections. In total, 3,024 variable nucleotide positions were identified among all compared Populus chloroplast DNA sequences. The 5-prime part of the LSC from trnH to atpA showed the highest frequency of variations. The variable positions included 163 positions with SNPs allowing for differentiating the two clones with P. tremula chloroplast genomes (W52, 717-1B4) from the other seven Populus individuals. These potential P. tremula-specific SNPs were displayed as a whole-plastome barcode on the P. tremula W52 chloroplast DNA sequence. Three of these SNPs and one InDel in the trnH-psbA linker were successfully validated by Sanger sequencing in an extended set of Populus individuals. The complete mitochondrial genome sequence of P. tremula is the first in the family of Salicaceae. The mitochondrial genomes of the two clones are 783,442 bp (W52) and 783,513 bp (717-1B4) in size, structurally very similar and organized as single circles. DNA sequence regions with high similarity to the W52 chloroplast sequence account for about 2% of the W52 mitochondrial genome. The mean SNP frequency was found to be nearly sixfold higher in the chloroplast than in the mitochondrial genome when comparing 717-1B4 with W52. The availability of the genomic information of all three DNA-containing cell organelles will allow a holistic approach in poplar molecular breeding in the future.

  2. Complete chloroplast genome sequence of Omani lime (Citrus aurantiifolia) and comparative analysis within the rosids.

    Directory of Open Access Journals (Sweden)

    Huei-Jiun Su

    Full Text Available The genus Citrus contains many economically important fruits that are grown worldwide for their high nutritional and medicinal value. Due to frequent hybridizations among species and cultivars, the exact number of natural species and the taxonomic relationships within this genus are unclear. To compare the differences between the Citrus chloroplast genomes and to develop useful genetic markers, we used a reference-assisted approach to assemble the complete chloroplast genome of Omani lime (C. aurantiifolia). The complete C. aurantiifolia chloroplast genome is 159,893 bp in length; the organization and gene content are similar to most of the rosids lineages characterized to date. Through comparison with the sweet orange (C. sinensis) chloroplast genome, we identified three intergenic regions and 94 simple sequence repeats (SSRs) that are potentially informative markers with resolution for interspecific relationships. These markers can be utilized to better understand the origin of cultivated Citrus. A comparison among 72 species belonging to 10 families of representative rosids lineages also provides new insights into their chloroplast genome evolution.

  3. Complete chloroplast genome sequence of Omani lime (Citrus aurantiifolia) and comparative analysis within the rosids.

    Science.gov (United States)

    Su, Huei-Jiun; Hogenhout, Saskia A; Al-Sadi, Abdullah M; Kuo, Chih-Horng

    2014-01-01

    The genus Citrus contains many economically important fruits that are grown worldwide for their high nutritional and medicinal value. Due to frequent hybridizations among species and cultivars, the exact number of natural species and the taxonomic relationships within this genus are unclear. To compare the differences between the Citrus chloroplast genomes and to develop useful genetic markers, we used a reference-assisted approach to assemble the complete chloroplast genome of Omani lime (C. aurantiifolia). The complete C. aurantiifolia chloroplast genome is 159,893 bp in length; the organization and gene content are similar to most of the rosids lineages characterized to date. Through comparison with the sweet orange (C. sinensis) chloroplast genome, we identified three intergenic regions and 94 simple sequence repeats (SSRs) that are potentially informative markers with resolution for interspecific relationships. These markers can be utilized to better understand the origin of cultivated Citrus. A comparison among 72 species belonging to 10 families of representative rosids lineages also provides new insights into their chloroplast genome evolution.

  4. Phylogenetic Relationships of the Fern Cyrtomium falcatum (Dryopteridaceae) from Dokdo Island Based on Chloroplast Genome Sequencing.

    Science.gov (United States)

    Raman, Gurusamy; Choi, Kyoung Su; Park, SeonJoo

    2016-12-02

    Cyrtomium falcatum is a popular ornamental fern cultivated worldwide. Native to the Korean Peninsula, Japan, and Dokdo Island in the Sea of Japan, it is the only fern present on Dokdo Island. We isolated and characterized the chloroplast (cp) genome of C. falcatum, and compared it with those of closely related species. The genes trnV-GAC and trnV-GAU were found to be present within the cp genome of C. falcatum, whereas trnP-GGG and rpl21 were lacking. Moreover, cp genomes of Cyrtomium devexiscapulae and Adiantum capillus-veneris lack trnP-GGG and rpl21, suggesting these are not conserved among angiosperm cp genomes. The deletion of trnR-UCG, trnR-CCG, and trnSeC in the cp genomes of C. falcatum and other eupolypod ferns indicates these genes are restricted to tree ferns, non-core leptosporangiates, and basal ferns. The C. falcatum cp genome also encoded ndhF and rps7, with GUG start codons that were only conserved in polypod ferns, and it shares two significant inversions with other ferns, including a minor inversion of the trnD-GUC region and an approximate 3 kb inversion of the trnG-trnT region. Phylogenetic analyses showed that Equisetum was found to be a sister clade to Psilotales-Ophioglossales with a 100% bootstrap (BS) value. The sister relationship between Pteridaceae and eupolypods was also strongly supported by a 100% BS, but Bayesian molecular clock analyses suggested that C. falcatum diversified in the mid-Paleogene period (45.15 ± 4.93 million years ago) and might have moved from Eurasia to Dokdo Island.

  5. The complete chloroplast genome sequence of Citrus sinensis (L.) Osbeck var 'Ridge Pineapple': organization and phylogenetic relationships to other angiosperms

    Directory of Open Access Journals (Sweden)

    Jansen Robert K

    2006-09-01

    Full Text Available Abstract Background The production of Citrus, the largest fruit crop of international economic value, has recently been imperiled due to the introduction of the bacterial disease Citrus canker. No significant improvements have been made to combat this disease by plant breeding and nuclear transgenic approaches. Chloroplast genetic engineering has a number of advantages over nuclear transformation; it not only increases transgene expression but also facilitates transgene containment, which is one of the major impediments for development of transgenic trees. We have sequenced the Citrus chloroplast genome to facilitate genetic improvement of this crop and to assess phylogenetic relationships among major lineages of angiosperms. Results The complete chloroplast genome sequence of Citrus sinensis is 160,129 bp in length, and contains 133 genes (89 protein-coding, 4 rRNAs and 30 distinct tRNAs). Genome organization is very similar to the inferred ancestral angiosperm chloroplast genome. However, in Citrus the infA gene is absent. The inverted repeat region has expanded to duplicate rps19 and the first 84 amino acids of rpl22. The rpl22 gene in the IRb region has a nonsense mutation resulting in 9 stop codons. This was confirmed by PCR amplification and sequencing using primers that flank the IR/LSC boundaries. Repeat analysis identified 29 direct and inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Comparison of protein-coding sequences with expressed sequence tags revealed six putative RNA edits, five of which resulted in non-synonymous modifications in petL, psbH, ycf2 and ndhA. Phylogenetic analyses using maximum parsimony (MP) and maximum likelihood (ML) methods of a dataset composed of 61 protein-coding genes for 30 taxa provide strong support for the monophyly of several major clades of angiosperms, including monocots, eudicots, rosids and asterids. The MP and ML trees are incongruent in three areas: the position of Amborella and

  6. Chloroplast genome sequence confirms distinctness of Australian and Asian wild rice

    OpenAIRE

    Waters, Daniel L. E.; Nock, Catherine J; Ishikawa, Ryuji; Rice, Nicole; Henry, Robert J.

    2012-01-01

    Cultivated rice (Oryza sativa) is an AA genome Oryza species that was most likely domesticated from wild populations of O. rufipogon in Asia. O. rufipogon and O. meridionalis are the only AA genome species found within Australia and occur as widespread populations across northern Australia. The chloroplast genome sequence of O. rufipogon from Asia and Australia and O. meridionalis and O. australiensis (an Australian member of the genus very distant from O. sativa) was obtained by massively pa...

  7. Analysis of complete nucleotide sequences of 12 Gossypium chloroplast genomes: origin and evolution of allotetraploids.

    Directory of Open Access Journals (Sweden)

    Qin Xu

    Full Text Available BACKGROUND: Cotton (Gossypium spp.) is a model system for the analysis of polyploidization. Although ascertaining the donor species of allotetraploid cotton has been intensively studied, sequence comparison of Gossypium chloroplast genomes is still of interest to understand the mechanisms underlying the evolution of Gossypium allotetraploids, while it is generally accepted that the parents were A- and D-genome containing species. Here we performed a comparative analysis of 13 Gossypium chloroplast genomes, twelve of which are presented here for the first time. METHODOLOGY/PRINCIPAL FINDINGS: The size of 12 chloroplast genomes under study varied from 159,959 bp to 160,433 bp. The chromosomes were highly similar having >98% sequence identity. They encoded the same set of 112 unique genes which occurred in a uniform order with only slightly different boundary junctions. Divergence due to indels as well as substitutions was examined separately for genome, coding and noncoding sequences. The genome divergence was estimated as 0.374% to 0.583% between allotetraploid species and A-genome, and 0.159% to 0.454% within allotetraploids. Forty protein-coding genes were completely identical at the protein level, and 20 intergenic sequences were completely conserved. The 9 allotetraploids shared 5 insertions and 9 deletions in whole genome, and 7-bp substitutions in protein-coding genes. The phylogenetic tree confirmed a close relationship between allotetraploids and the ancestor of A-genome, and the allotetraploids were divided into four separate groups. Progenitor allotetraploid cotton originated 0.43-0.68 million years ago (MYA). CONCLUSION: Despite high degree of conservation between the Gossypium chloroplast genomes, sequence variations among species could still be detected. Gossypium chloroplast genomes preferred for 5-bp indels and 1-3-bp indels are mainly attributed to the SSR polymorphisms. This study supports that the common ancestor of diploid A-genome

  8. The complete chloroplast and mitochondrial genomes of the green macroalga Ulva sp. UNA00071828 (Ulvophyceae, Chlorophyta).

    Directory of Open Access Journals (Sweden)

    James T Melton

    Full Text Available Sequencing mitochondrial and chloroplast genomes has become an integral part in understanding the genomic machinery and the phylogenetic histories of green algae. Previously, only three chloroplast genomes (Oltmannsiellopsis viridis, Pseudendoclonium akinetum, and Bryopsis hypnoides) and two mitochondrial genomes (O. viridis and P. akinetum) from the class Ulvophyceae have been published. Here, we present the first chloroplast and mitochondrial genomes from the ecologically and economically important marine, green algal genus Ulva. The chloroplast genome of Ulva sp. was 99,983 bp in a circular-mapping molecule that lacked inverted repeats, and thus far, was the smallest ulvophycean plastid genome. This cpDNA was a highly compact, AT-rich genome that contained a total of 102 identified genes (71 protein-coding genes, 28 tRNA genes, and three ribosomal RNA genes). Additionally, five introns were annotated in four genes: atpA (1), petB (1), psbB (2), and rrl (1). The circular-mapping mitochondrial genome of Ulva sp. was 73,493 bp and follows the expanded pattern also seen in other ulvophyceans and trebouxiophyceans. The Ulva sp. mtDNA contained 29 protein-coding genes, 25 tRNA genes, and two rRNA genes for a total of 56 identifiable genes. Ten introns were annotated in this mtDNA: cox1 (4), atp1 (1), nad3 (1), nad5 (1), and rrs (3). Double-cut-and-join (DCJ) values showed that organellar genomes across Chlorophyta are highly rearranged, in contrast to the highly conserved organellar genomes of the red algae (Rhodophyta). A phylogenomic investigation of 51 plastid protein-coding genes showed that Ulvophyceae is not monophyletic, and also placed Oltmannsiellopsis (Oltmannsiellopsidales) and Tetraselmis (Chlorodendrophyceae) closely to Ulva (Ulvales) and Pseudendoclonium (Ulothrichales).

  9. Maternal inheritance of chloroplast genome and paternal inheritance of mitochondrial genome in bananas (Musa acuminata).

    Science.gov (United States)

    Fauré, S; Noyer, J L; Carreel, F; Horry, J P; Bakry, F; Lanaud, C

    1994-03-01

    Restriction fragment length polymorphisms (RFLPs) were used as markers to determine the transmission of cytoplasmic DNA in diploid banana crosses. Progenies from two controlled crosses were studied with heterologous cytoplasmic probes. This analysis provided evidence for a strong bias towards maternal transmission of chloroplast DNA and paternal transmission of mitochondrial DNA in Musa acuminata. These results suggest the existence of two separate mechanisms of organelle transmission and selection, but no model to explain this can be proposed at the present time. Knowledge of the organelle mode of inheritance constitutes an important point for phylogeny analyses in bananas and may offer a powerful tool to confirm hybrid origins.

  10. An improved chloroplast DNA extraction procedure for whole plastid genome sequencing.

    Directory of Open Access Journals (Sweden)

    Chao Shi

    Full Text Available BACKGROUND: Chloroplast genomes supply valuable genetic information for evolutionary and functional studies in plants. The past five years have witnessed a dramatic increase in the number of completely sequenced chloroplast genomes with the application of second-generation sequencing technology in plastid genome sequencing projects. However, cost-effective high-throughput chloroplast DNA (cpDNA) extraction has become a major bottleneck restricting the application, as conventional methods struggle to balance the quality and yield of cpDNA. METHODOLOGY/PRINCIPAL FINDINGS: We first tested two traditional methods to isolate cpDNA from three species, Oryza brachyantha, Leersia japonica and Prinsepia utilis. Both of them failed to obtain properly defined cpDNA bands. However, we developed a simple but efficient method based on sucrose gradients and found that the modified protocol worked efficiently to isolate the cpDNA from the same three plant species. We sequenced the isolated DNA samples with Illumina (Solexa) sequencing technology and tested cpDNA purity by aligning sequence reads to the reference chloroplast genomes, showing that the reference genome was properly covered. We show that 40-50% cpDNA purity is achieved with our method. CONCLUSION: Here we provide an improved method used to isolate cpDNA from angiosperms. The Illumina sequencing results suggest that the isolated cpDNA has reached enough yield and sufficient purity to perform subsequent genome assembly. The cpDNA isolation protocol thus will be widely applicable to the plant chloroplast genome sequencing projects.

  11. A genomic approach for isolating chloroplast microsatellite markers for Pachyptera kerere (Bignoniaceae)

    Science.gov (United States)

    Francisco, Jessica N. C.; Nazareno, Alison G.; Lohmann, Lúcia G.

    2016-01-01

    Premise of the study: In this study, we developed chloroplast microsatellite markers (cpSSRs) for Pachyptera kerere (Bignoniaceae) to investigate the population structure and genetic diversity of this species. Methods and Results: We used Illumina HiSeq data to reconstruct the chloroplast genome of P. kerere by a combination of de novo and reference-guided assembly. We then used the chloroplast genome to develop a set of cpSSRs from intergenic regions. Overall, 24 primer pairs were designed, 21 of which amplified successfully and were polymorphic, presenting three to nine alleles per locus. The unbiased haploid diversity per locus varied from 0.207 (Pac28) to 0.817 (Pac04). All but one locus amplified for all other taxa of Pachyptera. Conclusions: The markers reported here will serve as a basis for studies to assess the genetic structure and phylogeographic history of Pachyptera. PMID:27672522
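    The abstract above reports unbiased haploid diversity per locus without giving the formula. One common definition, assumed here and not confirmed by the source, is uh = (n / (n - 1)) * (1 - sum of squared allele frequencies), where n is the number of sampled haplotypes. The sketch below implements that definition; the function name and the toy allele calls are hypothetical.

```python
from collections import Counter
from typing import Hashable, Sequence


def unbiased_haploid_diversity(alleles: Sequence[Hashable]) -> float:
    """Compute uh = n/(n-1) * (1 - sum(p_i^2)) for one haploid locus.

    `alleles` holds one allele call per sampled individual.
    This formula is an assumed definition, not taken from the abstract.
    """
    n = len(alleles)
    if n < 2:
        raise ValueError("at least two samples are needed")
    counts = Counter(alleles)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return (n / (n - 1)) * (1.0 - sum_sq)


# Toy data (hypothetical): fragment sizes for one cpSSR locus in eight samples.
toy_locus = [152, 152, 153, 153, 153, 154, 154, 155]
print(round(unbiased_haploid_diversity(toy_locus), 3))
```

    A monomorphic locus gives 0 under this definition, and the value approaches 1 as allele frequencies become more even across many alleles.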

  12. The First Chloroplast Genome Sequence of Boswellia sacra, a Resin-Producing Plant in Oman

    Science.gov (United States)

    Khan, Abdul Latif; Al-Harrasi, Ahmed; Asaf, Sajjad; Park, Chang Eon; Park, Gun-Seok; Khan, Abdur Rahim; Lee, In-Jung; Al-Rawahi, Ahmed; Shin, Jae-Ho

    2017-01-01

    Boswellia sacra (Burseraceae), a keystone endemic species, is famous for the production of fragrant oleo-gum resin. However, its genetic make-up, especially the genomic information of the chloroplast, is still unknown. Here, we describe for the first time the chloroplast (cp) genome of B. sacra. The complete cp sequence revealed a circular genome of 160,543 bp with 37.61% GC content. The cp genome has a typical quadripartite chloroplast structure with inverted repeats (IRs; 26,763 bp) separated by small single copy (SSC; 18,962 bp) and large single copy (LSC; 88,055 bp) regions. De novo assembly and annotation showed the presence of 114 unique genes with 83 protein-coding regions. The phylogenetic analysis revealed that the B. sacra cp genome is closely related to the cp genomes of Azadirachta indica and Citrus sinensis, while most of the syntenic differences were found in the non-coding regions. The pairwise distance among 76 shared genes of B. sacra and A. indica was highest for atpA, rpl2, rps12 and ycf1. The cp genome of B. sacra reveals a novel genome, which could be used for further studies to understand its diversity, taxonomy and phylogeny. PMID:28085925

  13. Comparative analysis of single nucleotide polymorphisms in the nuclear, chloroplast, and mitochondrial genomes in identification of phylogenetic association among seven melon (Cucumis melo L.) cultivars.

    Science.gov (United States)

    Zhu, Qianglong; Gao, Peng; Liu, Shi; Amanullah, Sikandar; Luan, Feishi

    2016-12-01

    A variety of melons are cultivated worldwide, and their specific biological properties make them an attractive model for molecular studies. This study aimed to investigate the single nucleotide polymorphisms (SNPs) from the mitochondrial, chloroplast, and nuclear genomes of seven melon accessions (Cucumis melo L.) to identify the phylogenetic relationships among melon cultivars with the Illumina HiSeq 2000 platform and bioinformatical analyses. The data showed that there were a total of 658 mitochondrial SNPs (207-295 in each), while there were 0-60 chloroplast SNPs among these seven melon cultivars, compared to the reference genome. Bioinformatical analysis showed that the mitochondrial tree topology was unable to separate the melon features, whereas the maximum parsimony/neighbor joining (MP/NJ) tree of the chloroplast SNPs could define melon features such as seed length, width, thickness, 100-seed weight, and type. SNPs of the nuclear genome were better than the mitochondrial and chloroplast SNPs in the identification of melon features. The data demonstrated the usefulness of mitochondrial, chloroplast, and nuclear SNPs in identification of phylogenetic associations among these seven melon cultivars.

  14. An optimized chloroplast DNA extraction protocol for grasses (Poaceae) proves suitable for whole plastid genome sequencing and SNP detection.

    Directory of Open Access Journals (Sweden)

    Kerstin Diekmann

    Full Text Available BACKGROUND: Obtaining chloroplast genome sequences is important to increase the knowledge about the fundamental biology of plastids, to understand evolutionary and ecological processes in the evolution of plants, to develop biotechnological applications (e.g. plastid engineering) and to improve the efficiency of breeding schemes. Extraction of pure chloroplast DNA is required for efficient sequencing of chloroplast genomes. Unfortunately, most protocols for extracting chloroplast DNA were developed for eudicots and do not produce sufficiently pure yields for a shotgun sequencing approach of whole plastid genomes from the monocot grasses. METHODOLOGY/PRINCIPAL FINDINGS: We have developed a simple and inexpensive method to obtain chloroplast DNA from grass species by modifying and extending protocols optimized for the use in eudicots. Many protocols for extracting chloroplast DNA require an ultracentrifugation step to efficiently separate chloroplast DNA from nuclear DNA. The developed method uses two more centrifugation steps than previously reported protocols and does not require an ultracentrifuge. CONCLUSIONS/SIGNIFICANCE: The described method delivered chloroplast DNA of very high quality from two grass species belonging to highly different taxonomic subfamilies within the grass family (Lolium perenne, Pooideae; Miscanthus x giganteus, Panicoideae). The DNA from Lolium perenne was used for whole chloroplast genome sequencing and detection of SNPs. The sequence is publicly available on EMBL/GenBank.

  15. Comparative analysis of the complete chloroplast genome sequences in psammophytic Haloxylon species (Amaranthaceae)

    Directory of Open Access Journals (Sweden)

    Wenpan Dong

    2016-11-01

    Full Text Available The Haloxylon genus belongs to the Amaranthaceae (formerly Chenopodiaceae) family. The small trees or shrubs in this genus are referred to as the King of psammophytic plants, and perform important functions in environmental protection, including wind control and sand fixation in deserts. To better understand these beneficial plants, we sequenced the chloroplast (cp) genomes of Haloxylon ammodendron (HA) and Haloxylon persicum (HP) and conducted comparative genomic analyses on these and two other representative Amaranthaceae species. Similar to other higher plants, we found that the Haloxylon cp genome is a quadripartite, double-stranded, circular DNA molecule of 151,570 bp in HA and 151,586 bp in HP. It contains a pair of inverted repeats (24,171 bp in HA and 24,177 bp in HP) that separate the genome into a large single copy region of 84,214 bp in HA and 84,217 bp in HP, and a small single copy region of 19,014 bp in HA and 19,015 bp in HP. Each Haloxylon cp genome contains 112 genes, including 78 coding, 30 tRNA, and four ribosomal RNA genes. We detected 59 different simple sequence repeat loci, including 44 mono-nucleotide, three di-nucleotide, one tri-nucleotide, and 11 tetra-nucleotide repeats. Comparative analysis revealed only 67 mutations between the two species, including 44 substitutions, 23 insertions/deletions, and two micro-inversions. The two inversions, with lengths of 14 and 3 bp, occur in the petA-psbJ intergenic region and rpl16 intron, respectively, and are predicted to form hairpin structures with repeat sequences of 27 and 19 bp, respectively, at the two ends. The ratio of transitions to transversions was 0.76. These results are valuable for future studies on Haloxylon genetic diversity and will enhance our understanding of the phylogenetic evolution of Amaranthaceae.
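    The transition-to-transversion ratio quoted above can be illustrated with a short sketch. A transition swaps two purines (A, G) or two pyrimidines (C, T); every other substitution is a transversion. The abstract gives 44 substitutions and a ratio of 0.76 but not the underlying counts; 19 transitions and 25 transversions is one split consistent with both figures, and it is used below purely as an assumption. The function names are illustrative.

```python
from typing import Iterable, Tuple

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def is_transition(ref: str, alt: str) -> bool:
    """Return True for purine<->purine or pyrimidine<->pyrimidine changes."""
    ref, alt = ref.upper(), alt.upper()
    if ref == alt or not {ref, alt} <= (PURINES | PYRIMIDINES):
        raise ValueError(f"not a valid substitution: {ref}>{alt}")
    return {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES


def ts_tv_ratio(substitutions: Iterable[Tuple[str, str]]) -> float:
    """Ratio of transitions to transversions over (ref, alt) pairs."""
    subs = list(substitutions)
    ts = sum(1 for ref, alt in subs if is_transition(ref, alt))
    tv = len(subs) - ts
    if tv == 0:
        raise ValueError("no transversions; ratio is undefined")
    return ts / tv


# Assumed split (not stated in the abstract): 19 transitions, 25 transversions.
example = [("A", "G")] * 19 + [("A", "C")] * 25
print(len(example), round(ts_tv_ratio(example), 2))
```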

  16. Diversity of chloroplast genome among local clones of cocoa (Theobroma cacao, L.) from Central Sulawesi

    Science.gov (United States)

    Suwastika, I. Nengah; Pakawaru, Nurul Aisyah; Rifka; Rahmansyah; Muslimin; Ishizaki, Yoko; Cruz, André Freire; Basri, Zainuddin; Shiina, Takashi

    2017-02-01

    Chloroplast genomes typically range in size from 120 to 170 kilobase pairs (kb) and are relatively conserved among plant species. Recent evaluations in several species have shown that certain regions are highly variable and can be used in phylogenetic analysis. Many fragments of coding regions, introns, and intergenic spacers, such as atpB-rbcL, ndhF, rbcL, rpl16, trnH-psbA, trnL-F, trnS-G, etc., have been used for phylogenetic reconstructions at various taxonomic levels. On that basis, we set out to analyze the diversity of the chloroplast genome within local cacao (Theobroma cacao L.) from Central Sulawesi. Our recent data showed that there are more than 20 clones from local farming in Central Sulawesi, which can be distinguished by phenotypic and nuclear-genome-based characterization with RAPD (Random Amplified Polymorphic DNA) and SSR (Simple Sequence Repeat) markers. In developing DNA markers for this local cacao, we also included analysis based on variation in the chloroplast genome. Several regions, such as rpl32-trnL, can be considered as chloroplast markers for our local cocoa clones. Furthermore, these markers could support phylogenetic analysis among cocoa clones.

  17. The complete chloroplast and mitochondrial genome sequences of Boea hygrometrica: insights into the evolution of plant organellar genomes.

    Directory of Open Access Journals (Sweden)

    Tongwu Zhang

    Full Text Available The complete nucleotide sequences of the chloroplast (cp) and mitochondrial (mt) genomes of resurrection plant Boea hygrometrica (Bh, Gesneriaceae) have been determined with the lengths of 153,493 bp and 510,519 bp, respectively. The smaller chloroplast genome contains more genes (147) with a 72% coding sequence, and the larger mitochondrial genome has fewer genes (65) with a coding fraction of 12%. Similar to other seed plants, the Bh cp genome has a typical quadripartite organization with a conserved gene in each region. The Bh mt genome has three recombinant sequence repeats of 222 bp, 843 bp, and 1474 bp in length, which divide the genome into a single master circle (MC) and four isomeric molecules. Compared to other angiosperms, one remarkable feature of the Bh mt genome is the frequent transfer of genetic material from the cp genome during recent Bh evolution. We also analyzed organellar genome evolution in general regarding genome features as well as compositional dynamics of sequence and gene structure/organization, providing clues for the understanding of the evolution of organellar genomes in plants. The cp-derived sequences including tRNAs found in angiosperm mt genomes support the conclusion that frequent gene transfer events may have begun early in the land plant lineage.

  18. The complete chloroplast and mitochondrial genome sequences of Boea hygrometrica: insights into the evolution of plant organellar genomes.

    Science.gov (United States)

    Zhang, Tongwu; Fang, Yongjun; Wang, Xumin; Deng, Xin; Zhang, Xiaowei; Hu, Songnian; Yu, Jun

    2012-01-01

    The complete nucleotide sequences of the chloroplast (cp) and mitochondrial (mt) genomes of resurrection plant Boea hygrometrica (Bh, Gesneriaceae) have been determined with the lengths of 153,493 bp and 510,519 bp, respectively. The smaller chloroplast genome contains more genes (147) with a 72% coding sequence, and the larger mitochondrial genome has fewer genes (65) with a coding fraction of 12%. Similar to other seed plants, the Bh cp genome has a typical quadripartite organization with a conserved gene in each region. The Bh mt genome has three recombinant sequence repeats of 222 bp, 843 bp, and 1474 bp in length, which divide the genome into a single master circle (MC) and four isomeric molecules. Compared to other angiosperms, one remarkable feature of the Bh mt genome is the frequent transfer of genetic material from the cp genome during recent Bh evolution. We also analyzed organellar genome evolution in general regarding genome features as well as compositional dynamics of sequence and gene structure/organization, providing clues for the understanding of the evolution of organellar genomes in plants. The cp-derived sequences including tRNAs found in angiosperm mt genomes support the conclusion that frequent gene transfer events may have begun early in the land plant lineage.

  19. Increasing phylogenetic resolution at low taxonomic levels using massively parallel sequencing of chloroplast genomes

    Directory of Open Access Journals (Sweden)

    Cronn Richard

    2009-12-01

    Full Text Available Abstract Background Molecular evolutionary studies share the common goal of elucidating historical relationships, and the common challenge of adequately sampling taxa and characters. Particularly at low taxonomic levels, recent divergence, rapid radiations, and conservative genome evolution yield limited sequence variation, and dense taxon sampling is often desirable. Recent advances in massively parallel sequencing make it possible to rapidly obtain large amounts of sequence data, and multiplexing makes extensive sampling of megabase sequences feasible. Is it possible to efficiently apply massively parallel sequencing to increase phylogenetic resolution at low taxonomic levels? Results We reconstruct the infrageneric phylogeny of Pinus from 37 nearly-complete chloroplast genomes (average 109 kilobases each of an approximately 120 kilobase genome) generated using multiplexed massively parallel sequencing. 30/33 ingroup nodes resolved with ≥ 95% bootstrap support; this is a substantial improvement relative to prior studies, and shows massively parallel sequencing-based strategies can produce sufficient high quality sequence to reach support levels originally proposed for the phylogenetic bootstrap. Resampling simulations show that at least the entire plastome is necessary to fully resolve Pinus, particularly in rapidly radiating clades. Meta-analysis of 99 published infrageneric phylogenies shows that whole plastome analysis should provide similar gains across a range of plant genera. A disproportionate amount of phylogenetic information resides in two loci (ycf1, ycf2), highlighting their unusual evolutionary properties. Conclusion Plastome sequencing is now an efficient option for increasing phylogenetic resolution at lower taxonomic levels in plant phylogenetic and population genetic analyses. With continuing improvements in sequencing capacity, the strategies herein should revolutionize efforts requiring dense taxon and character sampling

  20. Complex interplay among DNA modification, noncoding RNA expression and protein-coding RNA expression in Salvia miltiorrhiza chloroplast genome.

    Directory of Open Access Journals (Sweden)

    Haimei Chen

    Full Text Available Salvia miltiorrhiza is one of the most widely used medicinal plants. As a first step to develop a chloroplast-based genetic engineering method for the over-production of active components from S. miltiorrhiza, we have analyzed the genome, transcriptome, and base modifications of the S. miltiorrhiza chloroplast. Total genomic DNA and RNA were extracted from fresh leaves and then subjected to strand-specific RNA-Seq and Single-Molecule Real-Time (SMRT) sequencing analyses. Mapping the RNA-Seq reads to the genome assembly allowed us to determine the relative expression levels of 80 protein-coding genes. In addition, we identified 19 polycistronic transcription units and 136 putative antisense and intergenic noncoding RNA (ncRNA) genes. Comparison of the abundance of protein-coding transcripts (cRNA) with and without overlapping antisense ncRNAs (asRNA) suggests that the presence of asRNA is associated with increased cRNA abundance (p<0.05). Using the SMRT Portal software (v1.3.2), 2687 potential DNA modification sites and two potential DNA modification motifs were predicted. The two motifs include a TATA box-like motif (CPGDMM1, "TATANNNATNA") and an unknown motif (CPGDMM2, "WNYANTGAW"). Specifically, 35 of the 97 CPGDMM1 motifs (36.1%) and 91 of the 369 CPGDMM2 motifs (24.7%) were found to be significantly modified (p<0.01). Analysis of genes downstream of the CPGDMM1 motif revealed a significantly increased abundance of ncRNA genes that are less than 400 bp away from the significantly modified CPGDMM1 motif (p<0.01). Taken together, the present study revealed a complex interplay among DNA modifications, ncRNA and cRNA expression in the chloroplast genome.
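    The two motifs named above are written with IUPAC nucleotide ambiguity codes (N = any base, W = A or T, Y = C or T). As a rough illustration of how such motifs can be located in a sequence, the sketch below translates an IUPAC string into a regular expression and reports match positions. It is not the method used by the SMRT Portal software; the function names and the toy sequences are hypothetical, and only the forward strand is searched.

```python
import re
from typing import List

# Standard IUPAC nucleotide ambiguity codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def motif_to_regex(motif: str) -> "re.Pattern[str]":
    """Compile an IUPAC motif; the lookahead lets overlapping hits be found."""
    body = "".join(IUPAC[base] for base in motif.upper())
    return re.compile(f"(?=({body}))")


def find_motif(sequence: str, motif: str) -> List[int]:
    """Return 0-based start positions of the motif on the given strand."""
    pattern = motif_to_regex(motif)
    return [m.start() for m in pattern.finditer(sequence.upper())]


# Toy sequences (hypothetical), each containing one instance of a motif.
print(find_motif("GGTATACCCATGAGG", "TATANNNATNA"))
print(find_motif("ACTAGTGAT", "WNYANTGAW"))
```

    Searching the reverse strand as well would require running the same function on the reverse complement of the sequence.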

  1. A Comparison of the First Two Sequenced Chloroplast Genomes in Asteraceae: Lettuce and Sunflower

    Energy Technology Data Exchange (ETDEWEB)

    Timme, Ruth E.; Kuehl, Jennifer V.; Boore, Jeffrey L.; Jansen, Robert K.

    2006-01-20

    Asteraceae is the second largest family of plants, with over 20,000 species. For the past few decades, numerous phylogenetic studies have contributed to our understanding of the evolutionary relationships within this family, including comparisons of the fast-evolving chloroplast gene ndhF, rbcL, as well as non-coding DNA from the trnL intron plus the trnL-trnF intergenic spacer, matK, and, with lesser resolution, psbA-trnH. This culminated in a study by Panero and Funk in 2002 that used over 13,000 bp per taxon for the largest taxonomic revision of Asteraceae in over a hundred years. Still, some uncertainties remain, and it would be very useful to have more information on the relative rates of sequence evolution among various genes and on genome structure as a potential set of phylogenetic characters to help guide future phylogenetic structures. By way of contributing to this, we report the first two complete chloroplast genome sequences from members of the Asteraceae, those of Helianthus annuus and Lactuca sativa. These plants belong to two distantly related subfamilies, Asteroideae and Cichorioideae, respectively. In addition to these, there is only one other published chloroplast genome sequence for any plant within the larger group called Euasterids II, that of Panax ginseng (Araliaceae, 156,318 bp, AY582139). Early chloroplast genome mapping studies demonstrated that H. annuus and L. sativa share a 22 kb inversion relative to members of the subfamily Barnadesioideae. By comparison to outgroups, this inversion was shown to be derived, indicating that the Asteroideae and Cichorioideae are more closely related than either is to the Barnadesioideae. A later sequencing study found that taxa that share this 22 kb inversion also contain within this region a second, smaller, 3.3 kb inversion. These sequences also enable an analysis of patterns of shared repeats in the genomes at a fine level and of RNA editing by comparison to available EST sequences.
In addition, since

  2. Relative rates of synonymous substitutions in the mitochondrial, chloroplast and nuclear genomes of seed plants.

    Science.gov (United States)

    Drouin, Guy; Daoud, Hanane; Xia, Junnan

    2008-12-01

    Previous studies have estimated that, in angiosperms, the synonymous substitution rate of chloroplast genes is three times higher than that of mitochondrial genes and that of nuclear genes is twelve times higher than that of mitochondrial genes. Here we used 12 genes in 27 seed plant species to investigate whether these relative rates of substitutions are common to diverse seed plant groups. We find that the overall relative rate of synonymous substitutions of mitochondrial, chloroplast and nuclear genes of all seed plants is 1:3:10, that these ratios are 1:2:4 in gymnosperms but 1:3:16 in angiosperms and that they go up to 1:3:20 in basal angiosperms. Our results show that the mitochondrial, chloroplast and nuclear genomes of seed plant groups have different synonymous substitutions rates, that these rates are different in different seed plant groups and that gymnosperms have smaller ratios than angiosperms.

  3. The complete chloroplast genome sequence of American bird pepper (Capsicum annuum var. glabriusculum).

    Science.gov (United States)

    Zeng, Fan-chun; Gao, Cheng-wen; Gao, Li-zhi

    2016-01-01

    The complete chloroplast genome sequence of American bird pepper (Capsicum annuum var. glabriusculum) is reported and characterized in this study. The genome size is 156,612 bp, containing a pair of inverted repeats (IRs) of 25,776 bp separated by a large single-copy region of 87,213 bp and a small single-copy region of 17,851 bp. The chloroplast genome harbors 130 known genes, including 89 protein-coding genes, 8 ribosomal RNA genes, and 37 tRNA genes. A total of 18 of these genes are duplicated in the inverted repeat regions, 16 genes contain 1 intron, and 2 genes and one ycf have 2 introns.

  4. Characterization of the chloroplast genome sequence of oil palm (Elaeis guineensis Jacq.).

    Science.gov (United States)

    Uthaipaisanwong, P; Chanprasert, J; Shearman, J R; Sangsrakru, D; Yoocha, T; Jomchai, N; Jantasuriyarat, C; Tragoonrung, S; Tangphatsornruang, S

    2012-06-01

    Oil palm (Elaeis guineensis Jacq.) is an economically important crop, which is grown for oil production. To better understand the molecular basis of oil palm chloroplasts, we characterized the complete chloroplast (cp) genome sequence obtained from 454 pyrosequencing. The oil palm cp genome is 156,973 bp in length, consisting of a large single-copy region of 85,192 bp flanked on each side by inverted repeats of 27,071 bp, with a small single-copy region of 17,639 bp joining the repeats. The genome contains 112 unique genes: 79 protein-coding genes, 4 ribosomal RNA genes and 29 tRNA genes. By aligning the cp genome sequence with oil palm cDNA sequences, we observed 18 non-silent and 10 silent RNA editing events among 19 cp protein-coding genes. Creation of an initiation codon by RNA editing in rpl2 has been reported in several monocots and was also found in the oil palm cp genome. Fifty common chloroplast protein-coding genes from 33 plant taxa were used to construct ML and MP phylogenetic trees. Their topologies are similar and strongly support the position of E. guineensis as the sister of the closely related species Phoenix dactylifera in Arecaceae (the palm family) of the monocot subtrees.
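
A quadripartite plastome consists of one large single-copy region, one small single-copy region and two copies of the inverted repeat, so the reported region sizes should add up to the reported genome length. The short Python check below applies that arithmetic to the oil palm figures quoted above. The function name is an illustrative assumption.

```python
def quadripartite_length(lsc, ssc, ir):
    """Total plastome length: LSC + SSC + two copies of the inverted repeat."""
    return lsc + ssc + 2 * ir

# Region sizes (bp) as reported in the oil palm abstract above.
total = quadripartite_length(lsc=85_192, ssc=17_639, ir=27_071)
print(total)  # 156973, matching the reported genome length
```

The same check can be run on any record in this list that reports LSC, SSC and IR sizes.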

  5. The complete chloroplast genome sequence of Pelargonium x hortorum: organization and evolution of the largest and most highly rearranged chloroplast genome of land plants.

    Science.gov (United States)

    Chumley, Timothy W; Palmer, Jeffrey D; Mower, Jeffrey P; Fourcade, H Matthew; Calie, Patrick J; Boore, Jeffrey L; Jansen, Robert K

    2006-11-01

    The chloroplast genome of Pelargonium x hortorum has been completely sequenced. It maps as a circular molecule of 217,942 bp and is both the largest and most rearranged land plant chloroplast genome yet sequenced. It features 2 copies of a greatly expanded inverted repeat (IR) of 75,741 bp each and, consequently, diminished single-copy regions of 59,710 and 6,750 bp. Despite the increase in size and complexity of the genome, the gene content is similar to that of other angiosperms, with the exceptions of a large number of pseudogenes, the recognition of 2 open reading frames (ORF56 and ORF42) in the trnA intron with similarities to previously identified mitochondrial products (ACRS and pvs-trnA), the losses of accD and trnT-ggu and, in particular, the presence of a highly divergent set of rpoA-like ORFs rather than a single, easily recognized gene for rpoA. The 3-fold expansion of the IR (relative to most angiosperms) accounts for most of the size increase of the genome, but an additional 10% of the size increase is related to the large number of repeats found. The Pelargonium genome contains 35 times as many repeats of 31 bp or larger as the unrearranged genome of Spinacia. Most of these repeats occur near the rearrangement hotspots, and 2 different associations of repeats are localized in these regions. These associations are characterized by full or partial duplications of several genes, most of which appear to be nonfunctional copies or pseudogenes. These duplications may also be linked to the disruption of at least 1 but possibly 2 or 3 operons. We propose simple models that account for the major rearrangements with a minimum of 8 IR boundary changes and 12 inversions in addition to several insertions of duplicated sequence.

  6. High-throughput sequencing of six bamboo chloroplast genomes: phylogenetic implications for temperate woody bamboos (Poaceae: Bambusoideae).

    Directory of Open Access Journals (Sweden)

    Yun-Jie Zhang

    Full Text Available BACKGROUND: Bambusoideae is the only subfamily that contains woody members in the grass family, Poaceae. In phylogenetic analyses, Bambusoideae, Pooideae and Ehrhartoideae formed the BEP clade, yet the internal relationships of this clade are controversial. The distinctive life history (infrequent flowering and predominance of asexual reproduction) of woody bamboos makes them an interesting but taxonomically difficult group. Phylogenetic analyses based on large DNA fragments could only provide a moderate resolution of woody bamboo relationships, although a robust phylogenetic tree is needed to elucidate their evolutionary history. Phylogenomics is an alternative choice for resolving difficult phylogenies. METHODOLOGY/PRINCIPAL FINDINGS: Here we present the complete nucleotide sequences of six woody bamboo chloroplast (cp) genomes using Illumina sequencing. These genomes are similar to those of other grasses and rather conservative in evolution. We constructed a phylogeny of Poaceae from 24 complete cp genomes including 21 grass species. Within the BEP clade, we found strong support for a sister relationship between Bambusoideae and Pooideae. In a substantial improvement over prior studies, all six nodes within Bambusoideae were supported with ≥0.95 posterior probability from Bayesian inference and 5/6 nodes resolved with 100% bootstrap support in maximum parsimony and maximum likelihood analyses. We found that repeats in the cp genome could provide phylogenetic information, while caution is needed when using indels in phylogenetic analyses based on few selected genes. We also identified relatively rapidly evolving cp genome regions that have the potential to be used for further phylogenetic study in Bambusoideae. CONCLUSIONS/SIGNIFICANCE: The cp genome of Bambusoideae evolved slowly, and phylogenomics based on whole cp genome could be used to resolve major relationships within the subfamily. The difficulty in resolving the diversification among

  7. Complete sequence and comparative analysis of the chloroplast genome of coconut palm (Cocos nucifera).

    Directory of Open Access Journals (Sweden)

    Ya-Yi Huang

    Full Text Available Coconut, a member of the palm family (Arecaceae), is one of the most economically important trees used by mankind. Despite its diverse morphology, coconut is recognized taxonomically as only a single species (Cocos nucifera L.). There are two major coconut varieties, tall and dwarf, the latter of which displays traits resulting from selection by humans. We report here the complete chloroplast (cp) genome of a dwarf coconut plant, and describe the gene content and organization, inverted repeat fluctuations, repeated sequence structure, and occurrence of RNA editing. Phylogenetic relationships of monocots were inferred based on 47 chloroplast protein-coding genes. Potential nodes for events of gene duplication and pseudogenization related to inverted repeat fluctuation were mapped onto the tree using parsimony criteria. We compare our findings with those from other palm species for which complete cp genome sequences are available.

  8. Complete sequence and comparative analysis of the chloroplast genome of coconut palm (Cocos nucifera).

    Science.gov (United States)

    Huang, Ya-Yi; Matzke, Antonius J M; Matzke, Marjori

    2013-01-01

    Coconut, a member of the palm family (Arecaceae), is one of the most economically important trees used by mankind. Despite its diverse morphology, coconut is recognized taxonomically as only a single species (Cocos nucifera L.). There are two major coconut varieties, tall and dwarf, the latter of which displays traits resulting from selection by humans. We report here the complete chloroplast (cp) genome of a dwarf coconut plant, and describe the gene content and organization, inverted repeat fluctuations, repeated sequence structure, and occurrence of RNA editing. Phylogenetic relationships of monocots were inferred based on 47 chloroplast protein-coding genes. Potential nodes for events of gene duplication and pseudogenization related to inverted repeat fluctuation were mapped onto the tree using parsimony criteria. We compare our findings with those from other palm species for which complete cp genome sequences are available.

  9. Complete chloroplast genome sequence of a major invasive species, crofton weed (Ageratina adenophora).

    Directory of Open Access Journals (Sweden)

    Xiaojun Nie

    Full Text Available BACKGROUND: Crofton weed (Ageratina adenophora) is one of the most hazardous invasive plant species, which causes serious economic losses and environmental damage worldwide. However, the sequence resource and genome information of A. adenophora are rather limited, making phylogenetic identification and evolutionary studies very difficult. Here, we report the complete sequence of the A. adenophora chloroplast (cp) genome based on Illumina sequencing. METHODOLOGY/PRINCIPAL FINDINGS: The A. adenophora cp genome is 150,689 bp in length including a small single-copy (SSC) region of 18,358 bp and a large single-copy (LSC) region of 84,815 bp separated by a pair of inverted repeats (IRs) of 23,755 bp. The genome contains 130 unique genes and 18 duplicated in the IR regions, with the gene content and organization similar to other Asteraceae cp genomes. Comparative analysis identified five DNA regions (ndhD-ccsA, psbI-trnS, ndhF-ycf1, ndhI-ndhG and atpA-trnR) containing parsimony-informative characters higher than 2%, which may be potential informative markers for barcoding and phylogenetic analysis. Repeat structure, codon usage and contraction of the IR were also investigated to reveal the pattern of evolution. Phylogenetic analysis demonstrated a sister relationship between A. adenophora and Guizotia abyssinica and supported a monophyly of the Asterales. CONCLUSION: We have assembled and analyzed the chloroplast genome of A. adenophora in this study, which was the first sequenced plastome in the Eupatorieae tribe. The complete chloroplast genome information is useful for plant phylogenetic and evolutionary studies within this invasive species and also within the Asteraceae family.

  10. Analysis of synonymous codon usage in chloroplast genome of Populus alba

    Institute of Scientific and Technical Information of China (English)

    ZHOU Meng; LONG Wei; LI Xia

    2008-01-01

    The pattern of codon usage in the chloroplast genome of Populus alba was investigated. Correspondence analysis (a commonly used multivariate statistical approach) and the effective number of codons (ENc) plot method were used to analyze synonymous codon usage. The results of correspondence analysis showed that the distribution of genes on the major axis was significantly correlated with the frequency of use of G+C in the synonymously variable third position of sense codons (GC3S) (r=0.349), and the positions of genes on axis 2 and axis 3 were significantly correlated with CAI (r=-0.348, p<0.01 and r=0.602, p<0.01). The ENc for most genes was similar to the expected ENc based on the GC3S, but several genes with low ENc values lay below the expected curve. All of these data indicated that codon usage was dominated by a mutational bias in the chloroplast genome of P. alba. Natural selection for translational efficiency played only a minor role in shaping codon usage in the chloroplast genome of P. alba.
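
The abstract compares each gene's ENc with the value expected from its GC3S but does not say which expected curve was used. The sketch below assumes Wright's (1990) formula, the usual choice for ENc plots, in which ENc = 2 + s + 29 / (s^2 + (1 - s)^2) with s the GC3S value. The GC3S helper follows the common convention of skipping Met, Trp and stop codons. Both function names and the toy coding sequence are illustrative assumptions.

```python
# Codons with no synonymous alternative (Met, Trp) plus the three stop codons.
NOT_COUNTED = {"ATG", "TGG", "TAA", "TAG", "TGA"}

def gc3s(cds):
    """Fraction of G or C at the third position of synonymously variable codons."""
    cds = cds.upper()
    usable_length = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable_length, 3)]
    counted = [codon for codon in codons if codon not in NOT_COUNTED]
    if not counted:
        raise ValueError("no synonymously variable codons in sequence")
    return sum(codon[2] in "GC" for codon in counted) / len(counted)

def expected_enc(s):
    """Expected ENc under mutational bias alone (Wright 1990), s = GC3S."""
    return 2 + s + 29 / (s ** 2 + (1 - s) ** 2)

# Toy coding sequence invented for illustration: ATG GCT GCA AAG TTT TAA.
toy_cds = "ATGGCTGCAAAGTTTTAA"
s = gc3s(toy_cds)
print(s)                # 0.25
print(expected_enc(s))  # about 48.65
```

A gene whose observed ENc falls well below `expected_enc(s)` uses fewer codons than its base composition alone would predict, which is the pattern the abstract reports for several genes.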

  11. The complete chloroplast genome sequence of Pelargonium x hortorum: Organization and evolution of the largest and most highly rearranged chloroplast genome of land plants

    Energy Technology Data Exchange (ETDEWEB)

    Chumley, Timothy W.; Palmer, Jeffrey D.; Mower, Jeffrey P.; Fourcade, H. Matthew; Calie, Patrick J.; Boore, Jeffrey L.; Jansen, Robert K.

    2006-01-20

    The chloroplast genome of Pelargonium x hortorum has been completely sequenced. It maps as a circular molecule of 217,942 bp, and is both the largest and most rearranged land plant chloroplast genome yet sequenced. It features two copies of a greatly expanded inverted repeat (IR) of 75,741 bp each, and consequently diminished single copy regions of 59,710 bp and 6,750 bp. It also contains two different associations of repeated elements that contribute about 10 percent to the overall size and account for the majority of repeats found in the genome. They represent hotspots for rearrangements and gene duplications and include a large number of pseudogenes. We propose simple models that account for the major rearrangements with a minimum of eight IR boundary changes and 12 inversions in addition to several insertions of duplicated sequence. The major processes at work (duplication, IR expansion, and inversion) have disrupted at least one and possibly two or three transcriptional operons, and the genes involved in these disruptions form the core of the two major repeat associations. Despite the vast increase in size and complexity of the genome, the gene content is similar to that of other angiosperms, with the exceptions of a large number of pseudogenes as part of the repeat associations, the recognition of two open reading frames (ORF56 and ORF42) in the trnA intron with similarities to previously identified mitochondrial products (ACRS and pvs-trnA), the loss of accD and trnT-GGU, and in particular, the lack of a recognizably functional rpoA. One or all of three similar open reading frames may possibly encode the latter, however.

  12. Phylogenomic relationship of feijoa (Acca sellowiana (O.Berg) Burret) with other Myrtaceae based on complete chloroplast genome sequences.

    Science.gov (United States)

    Machado, Lilian de Oliveira; Vieira, Leila do Nascimento; Stefenon, Valdir Marcos; Oliveira Pedrosa, Fábio de; Souza, Emanuel Maltempi de; Guerra, Miguel Pedro; Nodari, Rubens Onofre

    2017-04-01

    Given their distribution, importance, and richness, Myrtaceae species comprise a model system for studying the evolution of tropical plant diversity. In addition, chloroplast (cp) genome sequencing is an efficient tool for phylogenetic relationship studies. Feijoa [Acca sellowiana (O. Berg) Burret; CN: pineapple-guava] is a Myrtaceae species that occurs naturally in southern Brazil and northern Uruguay. Feijoa is known for its exquisite perfume and flavorful fruits, pharmacological properties, ornamental value and increasing economic relevance. In the present work, we reported the complete cp genome of feijoa. The feijoa cp genome is a circular molecule of 159,370 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC 88,028 bp) and a Small Single Copy region (SSC 18,598 bp) separated by Inverted Repeat regions (IRs 26,372 bp). The genome structure, gene order, GC content and codon usage are similar to those of typical angiosperm cp genomes. When compared to other cp genome sequences of Myrtaceae, feijoa showed closest relationship with pitanga (Eugenia uniflora L.). Furthermore, a comparison of pitanga synonymous (Ks) and nonsynonymous (Ka) substitution rates revealed extremely low values. Maximum Likelihood and Bayesian Inference analyses produced phylogenomic trees identical in topology. These trees supported monophyly of three Myrtoideae clades.

  13. Complete genome sequence of chloroplast DNA (cpDNA) of Chlorella sorokiniana.

    Science.gov (United States)

    Orsini, Massimiliano; Cusano, Roberto; Costelli, Cristina; Malavasi, Veronica; Concas, Alessandro; Angius, Andrea; Cao, Giacomo

    2016-01-01

    The complete chloroplast genome sequence of Chlorella sorokiniana strain (SAG 111-8 k) is presented in this study. The genome consists of a circular chromosome of 109,811 bp, which encodes a total of 109 genes, including 74 proteins, 3 rRNAs and 31 tRNAs. Moreover, introns are not detected and all genes are present in single copy. The overall AT content of the C. sorokiniana cpDNA is 65.9%, the coding sequence is 59.1% and a large inverted repeat (IR) is not observed.

  14. Evolutionary analysis of Arabidopsis, cyanobacterial, and chloroplast genomes reveals plastid phylogeny and thousands of cyanobacterial genes in the nucleus.

    Science.gov (United States)

    Martin, William; Rujan, Tamas; Richly, Erik; Hansen, Andrea; Cornelsen, Sabine; Lins, Thomas; Leister, Dario; Stoebe, Bettina; Hasegawa, Masami; Penny, David

    2002-09-17

    Chloroplasts were once free-living cyanobacteria that became endosymbionts, but the genomes of contemporary plastids encode only approximately 5-10% as many genes as those of their free-living cousins, indicating that many genes were either lost from plastids or transferred to the nucleus during the course of plant evolution. Previous estimates have suggested that between 800 and perhaps as many as 2,000 genes in the Arabidopsis genome might come from cyanobacteria, but genome-wide phylogenetic surveys that could provide direct estimates of this number are lacking. We compared 24,990 proteins encoded in the Arabidopsis genome to the proteins from three cyanobacterial genomes, 16 other prokaryotic reference genomes, and yeast. Of 9,368 Arabidopsis proteins sufficiently conserved for primary sequence comparison, 866 detected homologues only among cyanobacteria and 834 others branched with cyanobacterial homologues in phylogenetic trees. Extrapolating from these conserved proteins to the whole genome, the data suggest that approximately 4,500 Arabidopsis protein-coding genes (approximately 18% of the total) were acquired from the cyanobacterial ancestor of plastids. These proteins encompass all functional classes, and the majority of them are targeted to cell compartments other than the chloroplast. Analysis of 15 sequenced chloroplast genomes revealed 117 nuclear-encoded proteins that are also still present in at least one chloroplast genome. A phylogeny of chloroplast genomes inferred from 41 proteins and 8,303 amino acid sites indicates that at least two independent secondary endosymbiotic events have occurred involving red algae and that amino acid composition bias in chloroplast proteins strongly affects plastid genome phylogeny.
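
The extrapolation in the abstract above can be reproduced from the numbers it quotes: the share of conserved proteins with a cyanobacterial signal is scaled up to the full protein set. The Python below is only a worked version of that arithmetic, assuming the simple proportional scaling the abstract describes. The variable names are illustrative.

```python
# Counts quoted in the abstract above.
total_proteins = 24_990      # Arabidopsis proteins compared
conserved = 9_368            # proteins conserved enough for comparison
only_cyanobacterial = 866    # homologues detected only among cyanobacteria
branch_with_cyano = 834      # branched with cyanobacterial homologues in trees

cyano_fraction = (only_cyanobacterial + branch_with_cyano) / conserved
estimated_genes = cyano_fraction * total_proteins

print(round(100 * cyano_fraction, 1))  # 18.1 (percent)
print(round(estimated_genes))          # 4535
```

Both figures agree with the rounded values in the abstract, approximately 18% and approximately 4,500 genes.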

  15. Comparative genomics of four Liliales families inferred from the complete chloroplast genome sequence of Veratrum patulum O. Loes. (Melanthiaceae).

    Science.gov (United States)

    Do, Hoang Dang Khoa; Kim, Jung Sung; Kim, Joo-Hwan

    2013-11-10

    The sequence of the chloroplast genome, which is inherited maternally, contains useful information for many scientific fields such as plant systematics, biogeography and biotechnology because its characteristics are highly conserved among species. The number of sequenced angiosperm chloroplast genomes has increased in recent years. In this study, the nucleotide sequence of the chloroplast genome (cpDNA) of Veratrum patulum Loes. (Melanthiaceae, Liliales) was analyzed completely. The circular double-stranded DNA of 153,699 bp consists of two inverted repeat (IR) regions of 26,360 bp each, a large single copy of 83,372 bp, and a small single copy of 17,607 bp. This plastome contains 81 protein-coding genes, 30 distinct tRNA and four genes of rRNA. In addition, there are six hypothetical coding regions (ycf1, ycf2, ycf3, ycf4, ycf15 and ycf68) and two open reading frames (ORF42 and ORF56), which are also found in the chloroplast genomes of the other species. The gene order and gene content of the V. patulum plastid genome are similar to those of Smilax china, Lilium longiflorum and Alstroemeria aurea, members of the Smilacaceae, Liliaceae and Alstroemeriaceae (Liliales), respectively. However, the loss of rps16 exon 2 in V. patulum results in a difference in the large single copy region in comparison with other species. The base substitution rate is quite similar among genes of these species. Additionally, the base substitution rate of the inverted repeat region was smaller than that of the single copy regions in all observed species of Liliales. The IR regions were expanded to trnH_GUG in V. patulum, a part of rps19 in L. longiflorum and A. aurea, and the whole sequence of rps19 in S. china. Furthermore, the IGS lengths of the rbcL-accD-psaI region were variable among Liliales species, suggesting that this region might be a hotspot of indel events and an informative site for phylogenetic studies in Liliales. In general, the whole chloroplast genome of V. patulum, a

  16. The complete chloroplast genome of Cupressus gigantea, an endemic conifer species to Qinghai-Tibetan Plateau.

    Science.gov (United States)

    Li, Huie; Guo, Qiqiang; Zheng, Weilie

    2016-09-01

    The complete chloroplast genome of the wild Cupressus gigantea (Cupressaceae) is determined in this study. The circular genome is 128 244 bp in length with 115 single copy genes and two duplicated genes (trnI-CAU and trnQ-UUG). This genome contains 82 protein-coding genes, four ribosomal RNA genes and 31 transfer RNA genes. In these genes, eight genes (atpF, rpoC1, ndhA, ndhB, petB, petD, rpl16 and rpl2) harbor a single intron and two genes (rps12 and ycf3) harbor two introns. This genome does not contain canonical IRs, and the overall GC content is 34.7%. A maximum parsimony phylogenetic analysis revealed that C. gigantea and C. sempervirens are more closely related.

  17. Assembly of the Complete Sitka Spruce Chloroplast Genome Using 10X Genomics’ GemCode Sequencing Data

    Science.gov (United States)

    Coombe, Lauren; Jackman, Shaun D.; Yang, Chen; Vandervalk, Benjamin P.; Moore, Richard A.; Pleasance, Stephen; Coope, Robin J.; Bohlmann, Joerg; Holt, Robert A.; Jones, Steven J. M.; Birol, Inanc

    2016-01-01

    The linked read sequencing library preparation platform by 10X Genomics produces barcoded sequencing libraries, which are subsequently sequenced using the Illumina short read sequencing technology. In this new approach, long fragments of DNA are partitioned into separate micro-reactions, where the same index sequence is incorporated into each of the sequencing fragment inserts derived from a given long fragment. In this study, we exploited this property by using reads from index sequences associated with a large number of reads, to assemble the chloroplast genome of the Sitka spruce tree (Picea sitchensis). Here we report on the first Sitka spruce chloroplast genome assembled exclusively from P. sitchensis genomic libraries prepared using the 10X Genomics protocol. We show that the resulting 124,049 base pair long genome shares high sequence similarity with the related white spruce and Norway spruce chloroplast genomes, but diverges substantially from a previously published P. sitchensis- P. thunbergii chimeric genome. The use of reads from high-frequency indices enabled separation of the nuclear genome reads from that of the chloroplast, which resulted in the simplification of the de Bruijn graphs used at the various stages of assembly. PMID:27632164
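
The abstract above describes selecting reads whose index (barcode) sequence is associated with a large number of reads, since those are enriched for the high-copy chloroplast genome. The Python below is a minimal sketch of that selection step only. The read-count threshold, function names and toy reads are illustrative assumptions, because the abstract does not state the actual cutoff or tooling.

```python
from collections import Counter

def high_frequency_barcodes(barcodes, min_reads):
    """Return the set of barcodes seen on at least `min_reads` reads."""
    counts = Counter(barcodes)
    return {barcode for barcode, n in counts.items() if n >= min_reads}

def select_reads(reads, min_reads):
    """Keep only (barcode, sequence) pairs whose barcode is high-frequency."""
    keep = high_frequency_barcodes((barcode for barcode, _ in reads), min_reads)
    return [(barcode, seq) for barcode, seq in reads if barcode in keep]

# Toy reads invented for illustration; real barcodes and reads are much longer.
reads = [
    ("AAAC", "ACGT"), ("AAAC", "CGTA"), ("AAAC", "GTAC"),
    ("GGTT", "TTTT"),
    ("CCAG", "ACAC"), ("CCAG", "CACA"),
]
selected = select_reads(reads, min_reads=3)  # hypothetical threshold
print(selected)  # only the three reads carrying barcode AAAC
```

The selected subset would then go to the assembler in place of the full read set, which is how the abstract explains the simpler de Bruijn graphs.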

  18. Identifying the Basal Angiosperm Node in Chloroplast Genome Phylogenies: Sampling One's Way Out of the Felsenstein Zone

    Energy Technology Data Exchange (ETDEWEB)

    Leebens-Mack, Jim; Raubeson, Linda A.; Cui, Liying; Kuehl, Jennifer V.; Fourcade, Matthew H.; Chumley, Timothy W.; Boore, Jeffrey L.; Jansen, Robert K.; dePamphilis, Claude W.

    2005-05-27

    While there has been strong support for Amborella and Nymphaeales (water lilies) as branching from basal-most nodes in the angiosperm phylogeny, this hypothesis has recently been challenged by phylogenetic analyses of 61 protein-coding genes extracted from the chloroplast genome sequences of Amborella, Nymphaea and 12 other available land plant chloroplast genomes. These character-rich analyses placed the monocots, represented by three grasses (Poaceae), as sister to all other extant angiosperm lineages. We have extracted protein-coding regions from draft sequences for six additional chloroplast genomes to test whether this surprising result could be an artifact of long-branch attraction due to limited taxon sampling. The added taxa include three monocots (Acorus, Yucca and Typha), a water lily (Nuphar), a ranunculid (Ranunculus), and a gymnosperm (Ginkgo). Phylogenetic analyses of the expanded DNA and protein datasets together with microstructural characters (indels) provided unambiguous support for Amborella and the Nymphaeales as branching from the basal-most nodes in the angiosperm phylogeny. However, their relative positions proved to be dependent on method of analysis, with parsimony favoring Amborella as sister to all other angiosperms, and maximum likelihood and neighbor-joining methods favoring an Amborella + Nymphaeales clade as sister. The maximum likelihood phylogeny supported the latter hypothesis, but the likelihood for the former hypothesis was not significantly different. Parametric bootstrap analysis, single gene phylogenies, estimated divergence dates and conflicting indel characters all help to illuminate the nature of the conflict in resolution of the most basal nodes in the angiosperm phylogeny. Molecular dating analyses provided median age estimates of 161 mya for the most recent common ancestor of all extant angiosperms and 145 mya for the most recent common ancestor of monocots, magnoliids and eudicots.
Whereas long sequences reduce variance in

  19. Chloroplast Genome Sequence of pigeonpea (Cajanus cajan (L.) Millspaugh) and Cajanus scarabaeoides: Genome organization and Comparison with other legumes

    Directory of Open Access Journals (Sweden)

    Tanvi Kaila

    2016-12-01

    Full Text Available Pigeonpea (Cajanus cajan (L.) Millspaugh), a diploid (2n = 22) legume crop with a genome size of 852 Mbp, serves as an important source of human dietary protein, especially in South East Asian and African regions. In this study, the draft chloroplast genomes of Cajanus cajan and Cajanus scarabaeoides were sequenced. Cajanus scarabaeoides is an important species of the Cajanus gene pool and has also been used for developing promising CMS systems by different groups. A male sterile genotype harbouring the Cajanus scarabaeoides cytoplasm was used for sequencing the plastid genome. The cp genome of Cajanus cajan is 152,242 bp long, having a quadripartite structure with LSC of 83,455 bp and SSC of 17,871 bp separated by IRs of 25,398 bp. Similarly, the cp genome of Cajanus scarabaeoides is 152,201 bp long, having a quadripartite structure in which IRs of 25,402 bp length separate 83,423 bp of LSC and 17,854 bp of SSC. The pigeonpea cp genome contains 116 unique genes, including 30 tRNA, 4 rRNA, 78 predicted protein coding genes and 5 pseudogenes. A 50 kb inversion was observed in the LSC region of the pigeonpea cp genome, consistent with other legumes. Comparison of the cp genome with other legumes revealed the contraction of IR boundaries due to the absence of the rps19 gene in the IR region. Chloroplast SSRs were mined and a total of 280 and 292 cpSSRs were identified in Cajanus scarabaeoides and Cajanus cajan respectively. RNA editing was observed at 37 sites in both Cajanus scarabaeoides and Cajanus cajan, with maximum occurrence in the ndh genes. The pigeonpea cp genome sequence would be beneficial in providing informative molecular markers which can be utilized for genetic diversity analysis and aid in understanding the plant systematics studies among major grain legumes.
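
The abstract above reports mining chloroplast SSRs but does not name the tool or the repeat thresholds. As a rough sketch of what such mining involves, the Python below finds perfect mono- and dinucleotide repeats with a regular expression. The minimum repeat counts, the function name and the toy sequence are illustrative assumptions, not the settings used in the study.

```python
import re

# Hypothetical minimum repeat counts per motif length; published studies differ.
MIN_REPEATS = {1: 8, 2: 4}

def find_ssrs(sequence, min_repeats=MIN_REPEATS):
    """Return sorted (start, unit, repeat_count) tuples for perfect SSRs."""
    sequence = sequence.upper()
    hits = []
    for unit_len, minimum in min_repeats.items():
        # One captured unit followed by at least (minimum - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, minimum - 1))
        for match in pattern.finditer(sequence):
            unit = match.group(1)
            # Skip units such as "AA" that are just a homopolymer run.
            if unit_len > 1 and len(set(unit)) == 1:
                continue
            hits.append((match.start(), unit, len(match.group(0)) // unit_len))
    return sorted(hits)

# Toy sequence invented for illustration: a run of 10 A and five copies of AT.
toy = "GC" + "A" * 10 + "GCC" + "AT" * 5 + "GG"
print(find_ssrs(toy))  # [(2, 'A', 10), (15, 'AT', 5)]
```

Longer motifs would need a stricter check for units built from shorter repeats, which this sketch does not attempt.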

  20. The evolution of chloroplast genome structure in ferns.

    Science.gov (United States)

    Wolf, Paul G; Roper, Jessie M; Duffy, Aaron M

    2010-09-01

    The plastid genome (plastome) is a rich source of phylogenetic and other comparative data in plants. Most land plants possess a plastome of similar structure. However, in a major group of plants, the ferns, a unique plastome structure has evolved. The gene order in ferns has been explained by a series of genomic inversions relative to the plastome organization of seed plants. Here, we examine for the first time the structure of the plastome across fern phylogeny. We used a PCR-based strategy to map and partially sequence plastomes. We found that a pair of partially overlapping inversions in the region of the inverted repeat occurred in the common ancestor of most ferns. However, the ancestral (seed plant) structure is still found in early diverging branches leading to the osmundoid and filmy fern lineages. We found that a second pair of overlapping inversions occurred on a branch leading to the core leptosporangiates. We also found that the unique placement of the gene matK in ferns (lacking a flanking intron) is not a result of a large-scale inversion, as previously thought. This is because the intron loss maps to an earlier point on the phylogeny than the nearby inversion. We speculate on why inversions may occur in pairs and what this may mean for the dynamics of plastome evolution.
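
The abstract above explains fern gene order as the result of pairs of partially overlapping inversions. To make that operation concrete, the Python below models a gene order as a list of (gene, strand) pairs and applies two overlapping inversions in turn. The gene labels and inversion coordinates are placeholders chosen for illustration and do not represent the actual fern rearrangements.

```python
def invert(order, start, end):
    """Invert order[start:end]: reverse the segment and flip each strand."""
    segment = [(gene, -strand) for gene, strand in reversed(order[start:end])]
    return order[:start] + segment + order[end:]

# Placeholder gene order, all on the forward strand (+1).
ancestral = [("a", 1), ("b", 1), ("c", 1), ("d", 1), ("e", 1)]

first = invert(ancestral, 0, 3)   # inverts a-b-c
second = invert(first, 1, 5)      # overlaps the first inversion
print(first)   # [('c', -1), ('b', -1), ('a', -1), ('d', 1), ('e', 1)]
print(second)  # [('c', -1), ('e', -1), ('d', -1), ('a', 1), ('b', 1)]
```

After the second step, genes a and b are back on their original strand but in a new position, so two overlapping inversions can relocate a block without changing its orientation.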

  1. A fragment of chloroplast DNA was transferred horizontally, probably from non-eudicots, to mitochondrial genome of Phaseolus.

    Science.gov (United States)

    Woloszynska, Magdalena; Bocer, Tomasz; Mackiewicz, Pawel; Janska, Hanna

    2004-11-01

    The mitochondrial genomes of some Phaseolus species contain a fragment of chloroplast trnA gene intron, named pvs-trnA for its location within the Phaseolus vulgaris sterility sequence (pvs). The purpose of this study was to determine the type of transfer (intracellular or horizontal) that gave rise to pvs-trnA. Using a PCR approach we could not find the respective portion of the trnA gene as a part of pvs outside the Phaseolus genus. However, a BLAST search revealed longer fragments of trnA present in the mitochondrial genomes of some Citrus species, Helianthus annuus and Zea mays. Based on the identity or near-identity between these mitochondrial sequences and their chloroplast counterparts, we concluded that they had relocated from chloroplasts to mitochondria via recent, independent, intracellular DNA transfers. In contrast, pvs-trnA displayed a relatively higher sequence divergence when compared with its chloroplast counterpart from Phaseolus vulgaris. Alignment of pvs-trnA with corresponding trnA fragments from 35 plant species as well as phylogenetic analysis revealed that pvs-trnA grouped with non-eudicot sequences and was well separated from all Fabales sequences. In conclusion, we propose that pvs-trnA arose via horizontal transfer of a trnA intron fragment from chloroplast of a non-eudicot plant to Phaseolus mitochondria. This is the first example of horizontal transfer of a chloroplast sequence to the mitochondrial genome in higher plants.

  2. The complete chloroplast genome sequence of date palm (Phoenix dactylifera L.).

    Directory of Open Access Journals (Sweden)

    Meng Yang

    BACKGROUND: Date palm (Phoenix dactylifera L.), a member of the Arecaceae family, is one of the three major economically important woody palms (the other two being oil palm and coconut), and its fruit is a staple food among Middle East and North African nations, as well as many other tropical and subtropical regions. Here we report a complete sequence of the date palm chloroplast (cp) genome based on pyrosequencing. METHODOLOGY/PRINCIPAL FINDINGS: After extracting 369,022 cp sequencing reads from our whole-genome-shotgun data, we put together an assembly and validated it with intensive PCR-based verification, coupled with PCR product sequencing. The date palm cp genome is 158,462 bp in length and has a typical quadripartite structure of the large (LSC, 86,198 bp) and small single-copy (SSC, 17,712 bp) regions separated by a pair of inverted repeats (IRs, 27,276 bp). Similar to what has been found among most angiosperms, the date palm cp genome harbors 112 unique genes and 19 duplicated fragments in the IR regions. The junctions between LSC/IRs and SSC/IRs show different features of sequence expansion in evolution. We identified 78 SNPs as major intravarietal polymorphisms within the population of a specific cp genome, most of which were located in genes with vital functions. Based on RNA-sequencing data, we also found 18 polycistronic transcription units and three highly expression-biased genes: atpF, trnA-UGC, and rrn23. CONCLUSIONS: Unlike most monocots, date palm has a typical cp genome similar to that of tobacco, with little rearrangement and gene loss or gain. High-throughput sequencing technology facilitates the identification of intravarietal variations in cp genomes among different cultivars. Moreover, transcriptomic analysis of cp genes provides clues for uncovering regulatory mechanisms of transcription and translation in chloroplasts.
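
    The quadripartite lengths quoted in this record can be checked arithmetically: one LSC, one SSC and two IR copies should add up to the reported total. The following is a minimal Python sketch under the assumption that the IR length in the abstract is given per copy; the function name `quadripartite_length` is illustrative and does not come from the paper.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total plastome length for one LSC, one SSC and two IR copies (bp)."""
    return lsc + ssc + 2 * ir


# Date palm region lengths as reported in the abstract above (bp).
date_palm_total = quadripartite_length(lsc=86_198, ssc=17_712, ir=27_276)
print(date_palm_total)  # 158462, matching the reported genome length
```

    A mismatch in such a check usually means the IR figure is a combined length for both copies or that one of the values was mis-transcribed, so it is worth running before reusing numbers from an abstract.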

  3. Complete chloroplast genome sequences of Drimys, Liriodendron, and Piper: Implications for the phylogeny of magnoliids and the evolution of GC content

    Energy Technology Data Exchange (ETDEWEB)

    Zhengqiu, C.; Penaflor, C.; Kuehl, J.V.; Leebens-Mack, J.; Carlson, J.; dePamphilis, C.W.; Boore, J.L.; Jansen, R.K.

    2006-06-01

    the inverted repeat due to the presence of rRNA genes and lowest in the small single copy region where most NADH genes are located. Phylogenetic analyses using maximum parsimony and maximum likelihood methods were performed on DNA sequences of 61 protein-coding genes. Trees from both analyses provided strong support for the monophyly of magnoliids and two strongly supported groups were identified, the Canellales/Piperales and the Laurales/Magnoliales. The phylogenies also provided moderate to strong support for the basal position of Amborella, and a sister relationship of magnoliids to a clade that includes monocots and eudicots. The complete sequences of three magnoliid chloroplast genomes provide new data from the largest basal angiosperm clade. Evolutionary comparisons of these new genome sequences, combined with other published angiosperm genomes, confirm that GC content is unevenly distributed across the genome by location, codon position, and functional group. Furthermore, phylogenetic analyses provide the strongest support so far for the hypothesis that the magnoliids are sister to a large clade that includes both monocots and eudicots.

  4. The complete chloroplast genome sequence of Gentiana lawrencei var. farreri (Gentianaceae) and comparative analysis with its congeneric species

    Directory of Open Access Journals (Sweden)

    Peng-Cheng Fu

    2016-09-01

    Background The chloroplast (cp) genome is useful in plant systematics, genetic diversity analysis, molecular identification and divergence dating. The genus Gentiana contains 362 species, but there are only two valuable complete cp genomes. The purpose of this study is to report the characterization of the complete cp genome of G. lawrencei var. farreri, which is endemic to the Qinghai-Tibetan Plateau (QTP). Methods Using high-throughput sequencing technology, we obtained the complete nucleotide sequence of the G. lawrencei var. farreri cp genome. A comparative analysis, including genome differences and gene divergence, was performed with its congeneric species G. straminea. The simple sequence repeats (SSRs) and phylogenetics were studied as well. Results The cp genome of G. lawrencei var. farreri is a circular molecule of 138,750 bp, containing a pair of 24,653 bp inverted repeats which are separated by small and large single-copy regions of 11,365 and 78,082 bp, respectively. The cp genome contains 130 known genes, including 85 protein coding genes (PCGs), eight ribosomal RNA genes and 37 tRNA genes. Comparative analyses indicated that G. lawrencei var. farreri is 10,241 bp shorter than its congeneric species G. straminea. Four large gaps were detected that are responsible for 85% of the total sequence loss. Further detailed analyses revealed that 10 PCGs were included in the four gaps that encode nine NADH dehydrogenase subunits. The cp gene content, order and orientation are similar to those of its congeneric species, but with some variation among the PCGs. Three genes, ndhB, ndhF and clpP, have high nonsynonymous to synonymous values. There are 34 SSRs in the G. lawrencei var. farreri cp genome, of which 25 are mononucleotide repeats; no dinucleotide repeats were detected. Comparison with the G. straminea cp genome indicated that five SSRs have length polymorphisms and 23 SSRs are species-specific. The phylogenetic analysis of 48 PCGs from 12 Gentianales

  5. RNA Editing Sites Exist in Protein-coding Genes in the Chloroplast Genome of Cycas taitungensis

    Institute of Scientific and Technical Information of China (English)

    Haiyan Chen; Likun Deng; Yuan Jiang; Ping Lu; Jianing Yu

    2011-01-01

    RNA editing is a post-transcriptional process that results in modifications of ribonucleotides at specific locations. In land plants, editing can occur in both mitochondria and chloroplasts and most commonly involves C-to-U changes, especially in seed plants. Using prediction and experimental determination, we investigated RNA editing in 40 protein-coding genes from the chloroplast genome of Cycas taitungensis. A total of 85 editing sites were identified in 25 transcripts. Comparative analysis of the published editotypes of these 25 transcripts in eight species showed that RNA editing events gradually disappear during plant evolution. Editing at the first and third codon positions disappeared more quickly than editing at the second codon position. The ndh genes have the highest editing frequency, while serine and proline codons were more frequently edited than the codons of other amino acids. These results imply that retained RNA editing sites are unevenly distributed among genes and that most of them may function by changing protein structure or interaction. Mitochondrial protein-coding genes have three times as many editing sites as chloroplast genes in Cycas, most likely due to slower evolution speed.
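
    To make the effect of a C-to-U edit concrete: at the second codon position it turns a serine codon (UCN) into a phenylalanine or leucine codon (UUN) and a proline codon (CCN) into a leucine codon (CUN). The Python sketch below illustrates this with a deliberately partial codon table; the three codons are chosen for illustration only and are not editing sites reported in the paper, and `edit_c_to_u` is a hypothetical helper name.

```python
# Partial standard codon table: only the codons needed for this illustration.
CODON_TO_AA = {
    "UCA": "Ser", "UUA": "Leu",
    "CCA": "Pro", "CUA": "Leu",
    "UCU": "Ser", "UUU": "Phe",
}


def edit_c_to_u(codon: str, position: int) -> str:
    """Return the codon with the C at the given 0-based position changed to U."""
    if codon[position] != "C":
        raise ValueError(f"no C at position {position} of {codon}")
    return codon[:position] + "U" + codon[position + 1:]


# Second-position edits (index 1) on illustrative serine and proline codons.
for codon in ("UCA", "CCA", "UCU"):
    edited = edit_c_to_u(codon, 1)
    print(codon, CODON_TO_AA[codon], "->", edited, CODON_TO_AA[edited])
```

    Each of these edits replaces a small polar or cyclic residue with a hydrophobic one, which is consistent with the abstract's suggestion that retained sites may act by changing protein structure or interaction.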

  6. Phylogenetic Relationships of the Fern Cyrtomium falcatum (Dryopteridaceae) from Dokdo Island, Sea of East Japan, Based on Chloroplast Genome Sequencing.

    Science.gov (United States)

    Raman, Gurusamy; Choi, Kyoung Su; Park, SeonJoo

    2016-12-02

    Cyrtomium falcatum is a popular ornamental fern cultivated worldwide. Native to the Korean Peninsula, Japan, and Dokdo Island in the Sea of Japan, it is the only fern present on Dokdo Island. We isolated and characterized the chloroplast (cp) genome of C. falcatum, and compared it with those of closely related species. The genes trnV-GAC and trnV-GAU were found to be present within the cp genome of C. falcatum, whereas trnP-GGG and rpl21 were lacking. Moreover, cp genomes of Cyrtomium devexiscapulae and Adiantum capillus-veneris lack trnP-GGG and rpl21, suggesting these are not conserved among angiosperm cp genomes. The deletion of trnR-UCG, trnR-CCG, and trnSeC in the cp genomes of C. falcatum and other eupolypod ferns indicates these genes are restricted to tree ferns, non-core leptosporangiates, and basal ferns. The C. falcatum cp genome also encoded ndhF and rps7, with GUG start codons that were only conserved in polypod ferns, and it shares two significant inversions with other ferns, including a minor inversion of the trnD-GUC region and an approximate 3 kb inversion of the trnG-trnT region. Phylogenetic analyses showed that Equisetum was found to be a sister clade to Psilotales-Ophioglossales with a 100% bootstrap (BS) value. The sister relationship between Pteridaceae and eupolypods was also strongly supported by a 100% BS, but Bayesian molecular clock analyses suggested that C. falcatum diversified in the mid-Paleogene period (45.15 ± 4.93 million years ago) and might have moved from Eurasia to Dokdo Island.

  7. Crystallographic and functional analyses of J-domain of JAC1 essential for chloroplast photorelocation movement in Arabidopsis thaliana.

    Science.gov (United States)

    Takano, Akira; Suetsugu, Noriyuki; Wada, Masamitsu; Kohda, Daisuke

    2010-08-01

    An auxilin-like J-domain-containing protein, JAC1, is necessary for chloroplast movement in Arabidopsis thaliana, to capture photosynthetic light efficiently under weak light conditions. Here, we performed crystallographic and functional analyses of the J-domain of JAC1. The crystal structure of the J-domain is quite similar to that of bovine auxilin, and possesses a similar positively charged surface, which probably forms the interface with the Hsp70 chaperone. The mutation of the highly conserved HPD motif of the JAC1 J-domain abrogated the chloroplast photorelocation response. These results suggest that the requirement of JAC1 in chloroplast photorelocation movement is attributable to the J-domain's cochaperone activity.

  8. The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza.

    Science.gov (United States)

    Qian, Jun; Song, Jingyuan; Gao, Huanhuan; Zhu, Yingjie; Xu, Jiang; Pang, Xiaohui; Yao, Hui; Sun, Chao; Li, Xian'en; Li, Chuyuan; Liu, Juyan; Xu, Haibin; Chen, Shilin

    2013-01-01

    Salvia miltiorrhiza is an important medicinal plant with great economic and medicinal value. The complete chloroplast (cp) genome sequence of Salvia miltiorrhiza, the first sequenced member of the Lamiaceae family, is reported here. The genome is 151,328 bp in length and exhibits a typical quadripartite structure of the large (LSC, 82,695 bp) and small (SSC, 17,555 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,539 bp). It contains 114 unique genes, including 80 protein-coding genes, 30 tRNAs and four rRNAs. The genome structure, gene order, GC content and codon usage are similar to the typical angiosperm cp genomes. Four forward, three inverted and seven tandem repeats were detected in the Salvia miltiorrhiza cp genome. Simple sequence repeat (SSR) analysis among the 30 asterid cp genomes revealed that most SSRs are AT-rich, which contributes to the overall AT richness of these cp genomes. Additionally, fewer SSRs are distributed in the protein-coding sequences compared to the non-coding regions, indicating an uneven distribution of SSRs within the cp genomes. Entire cp genome comparison of Salvia miltiorrhiza and three other Lamiales cp genomes showed a high degree of sequence similarity and a relatively high divergence of intergenic spacers. Sequence divergence analysis discovered the ten most divergent and ten most conserved genes as well as their length variation, which will be helpful for phylogenetic studies in asterids. Our analysis also supports that both regional and functional constraints affect gene sequence evolution. Further, phylogenetic analysis demonstrated a sister relationship between Salvia miltiorrhiza and Sesamum indicum. The complete cp genome sequence of Salvia miltiorrhiza reported in this paper will facilitate population, phylogenetic and cp genetic engineering studies of this medicinal plant.

  9. The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza.

    Directory of Open Access Journals (Sweden)

    Jun Qian

    Salvia miltiorrhiza is an important medicinal plant with great economic and medicinal value. The complete chloroplast (cp) genome sequence of Salvia miltiorrhiza, the first sequenced member of the Lamiaceae family, is reported here. The genome is 151,328 bp in length and exhibits a typical quadripartite structure of the large (LSC, 82,695 bp) and small (SSC, 17,555 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,539 bp). It contains 114 unique genes, including 80 protein-coding genes, 30 tRNAs and four rRNAs. The genome structure, gene order, GC content and codon usage are similar to the typical angiosperm cp genomes. Four forward, three inverted and seven tandem repeats were detected in the Salvia miltiorrhiza cp genome. Simple sequence repeat (SSR) analysis among the 30 asterid cp genomes revealed that most SSRs are AT-rich, which contributes to the overall AT richness of these cp genomes. Additionally, fewer SSRs are distributed in the protein-coding sequences compared to the non-coding regions, indicating an uneven distribution of SSRs within the cp genomes. Entire cp genome comparison of Salvia miltiorrhiza and three other Lamiales cp genomes showed a high degree of sequence similarity and a relatively high divergence of intergenic spacers. Sequence divergence analysis discovered the ten most divergent and ten most conserved genes as well as their length variation, which will be helpful for phylogenetic studies in asterids. Our analysis also supports that both regional and functional constraints affect gene sequence evolution. Further, phylogenetic analysis demonstrated a sister relationship between Salvia miltiorrhiza and Sesamum indicum. The complete cp genome sequence of Salvia miltiorrhiza reported in this paper will facilitate population, phylogenetic and cp genetic engineering studies of this medicinal plant.

  10. De novo assembly and characterization of the complete chloroplast genome of radish (Raphanus sativus L.).

    Science.gov (United States)

    Jeong, Young-Min; Chung, Won-Hyung; Mun, Jeong-Hwan; Kim, Namshin; Yu, Hee-Ju

    2014-11-01

    Radish (Raphanus sativus L.) is an edible root vegetable crop that is cultivated worldwide and whose genome has been sequenced. Here we report the complete nucleotide sequence of the radish cultivar WK10039 chloroplast (cp) genome, along with a de novo assembly strategy using whole genome shotgun sequence reads obtained by next generation sequencing. The radish cp genome is 153,368 bp in length and has a typical quadripartite structure, composed of a pair of inverted repeat regions (26,217 bp each), a large single copy region (83,170 bp), and a small single copy region (17,764 bp). The radish cp genome contains 87 predicted protein-coding genes, 37 tRNA genes, and 8 rRNA genes. Sequence analysis revealed the presence of 91 simple sequence repeats (SSRs) in the radish cp genome. Phylogenetic analysis of 62 protein-coding gene sequences from the 17 cp genomes of the Brassicaceae family suggested that the radish cp genome is most closely related to the cp genomes of Brassica rapa and Brassica napus. Comparisons with the B. rapa and B. napus cp genomes revealed highly divergent intergenic sequences and introns that can potentially be developed as diagnostic cp markers. Synonymous and nonsynonymous substitutions of cp genes suggested that nucleotide substitutions have occurred at similar rates in most genes. The complete sequence of the radish cp genome would serve as a valuable resource for the development of new molecular markers and the study of the phylogenetic relationships of Raphanus species in the Brassicaceae family.
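
    Several records in this listing report counts of simple sequence repeats (SSRs). As a rough illustration of how perfect SSRs can be located, the Python sketch below scans a sequence with regular-expression backreferences. It is not the method used in any of the papers listed here: the minimum repeat thresholds in `MIN_REPEATS` and the helper name `find_ssrs` are assumptions chosen for the example, and only mono-, di- and trinucleotide motifs are covered.

```python
import re

# Hypothetical minimum repeat counts per motif length (not taken from any paper).
MIN_REPEATS = {1: 10, 2: 5, 3: 4}


def find_ssrs(seq: str):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in MIN_REPEATS.items():
        # A motif of motif_len bases followed by at least (min_rep - 1) exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip homopolymer motifs such as "AA" or "AAA"; the mononucleotide
            # pass already reports those runs.
            if motif_len > 1 and len(set(motif)) == 1:
                continue
            hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return sorted(hits)


example = "GG" + "A" * 12 + "CC" + "AT" * 6 + "GG"
print(find_ssrs(example))  # [(2, 'A', 12), (16, 'AT', 6)]
```

    Because SSR counts depend strongly on the thresholds chosen, totals from different studies are only comparable when the same criteria were applied.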

  11. The Complete Chloroplast Genome of Capsicum annuum var. glabriusculum Using Illumina Sequencing

    Directory of Open Access Journals (Sweden)

    Sebastin Raveendar

    2015-07-01

    Chloroplast (cp) genome sequences provide a valuable source for DNA barcoding. Molecular phylogenetic studies have concentrated on DNA sequencing of conserved gene loci. However, this approach is time consuming and more difficult to implement when gene organization differs among species. Here we report the complete re-sequencing of the cp genome of Capsicum pepper (Capsicum annuum var. glabriusculum) using the Illumina platform. The total length of the cp genome is 156,817 bp with a 37.7% overall GC content. A pair of inverted repeats (IRs) of 50,284 bp were separated by a small single copy (SSC; 18,948 bp) and a large single copy (LSC; 87,446 bp). The number of cp genes in C. annuum var. glabriusculum is the same as that in other Capsicum species. Variations in the lengths of the LSC, SSC and IR regions were the main contributors to the size variation in the cp genome of this species. A total of 125 simple sequence repeats (SSRs) and 48 insertion or deletion variants were found by sequence alignment of Capsicum cp genomes. These findings provide a foundation for further investigation of cp genome evolution in Capsicum and other higher plants.

  12. The complete chloroplast genome of Capsicum annuum var. glabriusculum using Illumina sequencing.

    Science.gov (United States)

    Raveendar, Sebastin; Na, Young-Wang; Lee, Jung-Ro; Shim, Donghwan; Ma, Kyung-Ho; Lee, Sok-Young; Chung, Jong-Wook

    2015-07-20

    Chloroplast (cp) genome sequences provide a valuable source for DNA barcoding. Molecular phylogenetic studies have concentrated on DNA sequencing of conserved gene loci. However, this approach is time consuming and more difficult to implement when gene organization differs among species. Here we report the complete re-sequencing of the cp genome of Capsicum pepper (Capsicum annuum var. glabriusculum) using the Illumina platform. The total length of the cp genome is 156,817 bp with a 37.7% overall GC content. A pair of inverted repeats (IRs) of 50,284 bp were separated by a small single copy (SSC; 18,948 bp) and a large single copy (LSC; 87,446 bp). The number of cp genes in C. annuum var. glabriusculum is the same as that in other Capsicum species. Variations in the lengths of the LSC, SSC and IR regions were the main contributors to the size variation in the cp genome of this species. A total of 125 simple sequence repeats (SSRs) and 48 insertion or deletion variants were found by sequence alignment of Capsicum cp genomes. These findings provide a foundation for further investigation of cp genome evolution in Capsicum and other higher plants.

  13. Analysis of whole chloroplast genomes from the genera of the Clauseneae, the curry tribe (Rutaceae, Citrus family).

    Science.gov (United States)

    Shivakumar, Vikram S; Appelhans, Marc S; Johnson, Gabriel; Carlsen, Monica; Zimmer, Elizabeth A

    2016-12-11

    The Clauseneae (Aurantioideae, Rutaceae) is a tribe in the Citrus family that, although economically important as it contains the culinary and medicinally-useful curry tree (Bergera koenigii), has been relatively understudied. Due to the recent significant taxonomic changes made to this tribe, a closer inspection of the genetic relationships among its genera has been warranted. Whole genome skimming was used to generate chloroplast genomes from six species, representing each of the four genera (Bergera, Clausena, Glycosmis, Micromelum) in the Clauseneae tribe plus one closely related outgroup (Merrillia), using the published plastome sequence of Citrus sinensis as a reference. Phylogenetically informative character (PIC) data were analyzed using a genome alignment of the seven species, and variability frequency among the species was recorded for each coding and non-coding region, with the regions of highest variability identified for future phylogenetic studies. Non-coding regions exhibited a higher percentage of variable characters as expected, and the phylogenetic markers ycf1, matK, rpoC2, ndhF, trnS-trnG spacer, and trnH-psbA spacer proved to be among the most variable regions. Other markers that are frequently used in phylogenetic studies, e.g. rps16, atpB-rbcL, rps4-trnT, and trnL-trnF, proved to be far less variable. Phylogenetic analyses of the aligned sequences were conducted using Bayesian inference (MrBayes) and Maximum Likelihood (RAxML), yielding highly supported divisions among the four genera.

  14. Research Progress of Sugarcane Chloroplast Genome

    Institute of Scientific and Technical Information of China (English)

    吴杨; 周会

    2013-01-01

    With the development of modern molecular biology technologies, complete chloroplast genomes have been sequenced in many plant species, and the structure, function and expression of their genes have been determined. The chloroplast genome structure of most higher plants is stable, with conserved gene number, arrangement and composition. The determination of the sugarcane chloroplast genome sequence laid a good foundation for sugarcane chloroplast research. This article reviews research progress on the sugarcane chloroplast genome, covering the chloroplast genome map, gene structure and function, chloroplast RNA editing, and phylogenetic analysis in Saccharum and related genera. It also points out future research directions, including sugarcane chloroplast genetic transformation, complete chloroplast nucleotide sequence determination in Saccharum and closely related genera, and cpSSR development and application.

  15. Integrated genomic analyses of ovarian carcinoma

    NARCIS (Netherlands)

    Bell, D.; Berchuck, A.; Birrer, M.; Chien, J.; Dao, F.; Dhir, R.; DiSaia, P.; Gabra, H.; Glenn, P.; Godwin, A. K.; Gross, J.; Hartmann, L.; Huang, M.; Huntsman, D. G.; Iacocca, M.; Imielinski, M.; Kalloger, S.; Karlan, B. Y.; Levine, D. A.; Mills, G. B.; Morrison, C.; Mutch, D.; Olvera, N.; Orsulic, S.; Park, K.; Petrelli, N.; Rabeno, B.; Rader, J. S.; Sikic, B. I.; Smith-McCune, K.; Sood, A. K.; Bowtell, D.; Penny, R.; Testa, J. R.; Chang, K.; Dinh, H. H.; Drummond, J. A.; Fowler, G.; Gunaratne, P.; Hawes, A. C.; Kovar, C. L.; Lewis, L. R.; Morgan, M. B.; Newsham, I. F.; Santibanez, J.; Reid, J. G.; Trevino, L. R.; Wu, Y. -Q.; Wang, M.; Muzny, D. M.; Wheeler, D. A.; Gibbs, R. A.; Getz, G.; Lawrence, M. S.; Cibulskis, K.; Sivachenko, A. Y.; Sougnez, C.; Voet, D.; Wilkinson, J.; Bloom, T.; Ardlie, K.; Fennell, T.; Baldwin, J.; Gabriel, S.; Lander, E. S.; Ding, L.; Fulton, R. S.; Koboldt, D. C.; McLellan, M. D.; Wylie, T.; Walker, J.; O'Laughlin, M.; Dooling, D. J.; Fulton, L.; Abbott, R.; Dees, N. D.; Zhang, Q.; Kandoth, C.; Wendl, M.; Schierding, W.; Shen, D.; Harris, C. C.; Schmidt, H.; Kalicki, J.; Delehaunty, K. D.; Fronick, C. C.; Demeter, R.; Cook, L.; Wallis, J. W.; Lin, L.; Magrini, V. J.; Hodges, J. S.; Eldred, J. M.; Smith, S. M.; Pohl, C. S.; Vandin, F.; Raphael, B. J.; Weinstock, G. M.; Mardis, R.; Wilson, R. K.; Meyerson, M.; Winckler, W.; Getz, G.; Verhaak, R. G. W.; Carter, S. L.; Mermel, C. H.; Saksena, G.; Nguyen, H.; Onofrio, R. C.; Lawrence, M. S.; Hubbard, D.; Gupta, S.; Crenshaw, A.; Ramos, A. H.; Ardlie, K.; Chin, L.; Protopopov, A.; Zhang, Juinhua; Kim, T. M.; Perna, I.; Xiao, Y.; Zhang, H.; Ren, G.; Sathiamoorthy, N.; Park, R. W.; Lee, E.; Park, P. J.; Kucherlapati, R.; Absher, D. M.; Waite, L.; Sherlock, G.; Brooks, J. D.; Li, J. Z.; Xu, J.; Myers, R. M.; Laird, P. W.; Cope, L.; Herman, J. G.; Shen, H.; Weisenberger, D. J.; Noushmehr, H.; Pan, F.; Triche, T.; Berman, B. P.; Van den Berg, D. J.; Buckley, J.; Baylin, S. B.; Spellman, P. T.; Purdom, E.; Neuvial, P.; Bengtsson, H.; Jakkula, L. R.; Durinck, S.; Han, J.; Dorton, S.; Marr, H.; Choi, Y. G.; Wang, V.; Wang, N. J.; Ngai, J.; Conboy, J. G.; Parvin, B.; Feiler, H. S.; Speed, T. P.; Gray, J. W.; Levine, D. A.; Socci, N. D.; Liang, Y.; Taylor, B. S.; Schultz, N.; Borsu, L.; Lash, A. E.; Brennan, C.; Viale, A.; Sander, C.; Ladanyi, M.; Hoadley, K. A.; Meng, S.; Du, Y.; Shi, Y.; Li, L.; Turman, Y. J.; Zang, D.; Helms, E. B.; Balu, S.; Zhou, X.; Wu, J.; Topal, M. D.; Hayes, D. N.; Perou, C. M.; Getz, G.; Voet, D.; Saksena, G.; Zhang, Junihua; Zhang, H.; Wu, C. J.; Shukla, S.; Cibulskis, K.; Lawrence, M. S.; Sivachenko, A.; Jing, R.; Park, R. W.; Liu, Y.; Park, P. J.; Noble, M.; Chin, L.; Carter, H.; Kim, D.; Karchin, R.; Spellman, P. T.; Purdom, E.; Neuvial, P.; Bengtsson, H.; Durinck, S.; Han, J.; Korkola, J. E.; Heiser, L. M.; Cho, R. J.; Hu, Z.; Parvin, B.; Speed, T. P.; Gray, J. W.; Schultz, N.; Cerami, E.; Taylor, B. S.; Olshen, A.; Reva, B.; Antipin, Y.; Shen, R.; Mankoo, P.; Sheridan, R.; Ciriello, G.; Chang, W. K.; Bernanke, J. A.; Borsu, L.; Levine, D. A.; Ladanyi, M.; Sander, C.; Haussler, D.; Benz, C. C.; Stuart, J. M.; Benz, S. C.; Sanborn, J. Z.; Vaske, C. J.; Zhu, J.; Szeto, C.; Scott, G. K.; Yau, C.; Hoadley, K. A.; Du, Y.; Balu, S.; Hayes, D. N.; Perou, C. M.; Wilkerson, M. D.; Zhang, N.; Akbani, R.; Baggerly, K. A.; Yung, W. K.; Mills, G. B.; Weinstein, J. N.; Penny, R.; Shelton, T.; Grimm, D.; Hatfield, M.; Morris, S.; Yena, P.; Rhodes, P.; Sherman, M.; Paulauskis, J.; Millis, S.; Kahn, A.; Greene, J. M.; Sfeir, R.; Jensen, M. A.; Chen, J.; Whitmore, J.; Alonso, S.; Jordan, J.; Chu, A.; Zhang, Jinghui; Barker, A.; Compton, C.; Eley, G.; Ferguson, M.; Fielding, P.; Gerhard, D. S.; Myles, R.; Schaefer, C.; Shaw, K. R. Mills; Vaught, J.; Vockley, J. B.; Good, P. J.; Guyer, M. S.; Ozenberger, B.; Peterson, J.; Thomson, E.; Cramer, D. W.

    2011-01-01

    A catalogue of molecular aberrations that cause ovarian cancer is critical for developing and deploying therapies that will improve patients' lives. The Cancer Genome Atlas project has analysed messenger RNA expression, microRNA expression, promoter methylation and DNA copy number in 489 high-grade

  16. The complete chloroplast genome sequence of Podocarpus lambertii: genome structure, evolutionary aspects, gene content and SSR detection.

    Directory of Open Access Journals (Sweden)

    Leila do Nascimento Vieira

    BACKGROUND: Podocarpus lambertii (Podocarpaceae) is a native conifer from the Brazilian Atlantic Forest Biome, which is considered one of the 25 biodiversity hotspots in the world. The advancement of next-generation sequencing technologies has enabled the rapid acquisition of whole chloroplast (cp) genome sequences at low cost. Several studies have proven the potential of cp genomes as tools to understand enigmatic and basal phylogenetic relationships at different taxonomic levels, as well as further probe the structural and functional evolution of plants. In this work, we present the complete cp genome sequence of P. lambertii. METHODOLOGY/PRINCIPAL FINDINGS: The P. lambertii cp genome is 133,734 bp in length, and similar to other sequenced cupressophytes, it lacks one of the large inverted repeat regions (IR). It contains 118 unique genes and one duplicated tRNA (trnN-GUU), which occurs as an inverted repeat sequence. The rps16 gene was not found, which was previously reported for the plastid genome of another Podocarpaceae (Nageia nagi) and Araucariaceae (Agathis dammara). Structurally, P. lambertii shows 4 inversions of a large DNA fragment ∼20,000 bp compared to the Podocarpus totara cp genome. These unexpected characteristics may be attributed to geographical distance and different adaptive needs. The P. lambertii cp genome presents a total of 28 tandem repeats and 156 SSRs, with homo- and dipolymers being the most common and tri-, tetra-, penta-, and hexapolymers occurring with less frequency. CONCLUSION: The complete cp genome sequence of P. lambertii revealed significant structural changes, even in species from the same genus. These results reinforce the apparent loss of the rps16 gene in Podocarpaceae cp genomes. In addition, several SSRs in the P. lambertii cp genome are likely intraspecific polymorphism sites, which may allow highly sensitive phylogeographic and population structure studies, as well as phylogenetic studies of species of

  17. The Complete Chloroplast Genome Sequence of Podocarpus lambertii: Genome Structure, Evolutionary Aspects, Gene Content and SSR Detection

    Science.gov (United States)

    Vieira, Leila do Nascimento; Faoro, Helisson; Rogalski, Marcelo; Fraga, Hugo Pacheco de Freitas; Cardoso, Rodrigo Luis Alves; de Souza, Emanuel Maltempi; de Oliveira Pedrosa, Fábio; Nodari, Rubens Onofre; Guerra, Miguel Pedro

    2014-01-01

    Background Podocarpus lambertii (Podocarpaceae) is a native conifer from the Brazilian Atlantic Forest Biome, which is considered one of the 25 biodiversity hotspots in the world. The advancement of next-generation sequencing technologies has enabled the rapid acquisition of whole chloroplast (cp) genome sequences at low cost. Several studies have proven the potential of cp genomes as tools to understand enigmatic and basal phylogenetic relationships at different taxonomic levels, as well as further probe the structural and functional evolution of plants. In this work, we present the complete cp genome sequence of P. lambertii. Methodology/Principal Findings The P. lambertii cp genome is 133,734 bp in length, and similar to other sequenced cupressophytes, it lacks one of the large inverted repeat regions (IR). It contains 118 unique genes and one duplicated tRNA (trnN-GUU), which occurs as an inverted repeat sequence. The rps16 gene was not found, which was previously reported for the plastid genome of another Podocarpaceae (Nageia nagi) and Araucariaceae (Agathis dammara). Structurally, P. lambertii shows 4 inversions of a large DNA fragment ∼20,000 bp compared to the Podocarpus totara cp genome. These unexpected characteristics may be attributed to geographical distance and different adaptive needs. The P. lambertii cp genome presents a total of 28 tandem repeats and 156 SSRs, with homo- and dipolymers being the most common and tri-, tetra-, penta-, and hexapolymers occurring with less frequency. Conclusion The complete cp genome sequence of P. lambertii revealed significant structural changes, even in species from the same genus. These results reinforce the apparent loss of the rps16 gene in Podocarpaceae cp genomes. In addition, several SSRs in the P. lambertii cp genome are likely intraspecific polymorphism sites, which may allow highly sensitive phylogeographic and population structure studies, as well as phylogenetic studies of species of this genus.

  18. Complete Chloroplast Genome Sequence of Aquilaria sinensis (Lour.) Gilg and the Evolution Analysis within the Malvales order

    Directory of Open Access Journals (Sweden)

    Ying Wang

    2016-03-01

    Full Text Available Aquilaria sinensis (Lour.) Gilg is an important medicinal woody plant producing agarwood, which is widely used in traditional Chinese medicine. High-throughput sequencing of chloroplast (cp) genomes has enhanced the understanding of evolutionary relationships within plant families. In this study, we determined the complete cp genome sequence of A. sinensis. The size of the A. sinensis cp genome was 159,565 bp. This genome included a large single-copy region of 87,482 bp, a small single-copy region of 19,857 bp, and a pair of inverted repeats (IRa and IRb) of 26,113 bp each. The GC content of the genome was 37.11%. The A. sinensis cp genome encoded 113 functional genes, including 82 protein-coding genes, 27 tRNA genes, and 4 rRNA genes. Seven protein-coding genes and 11 RNA genes were duplicated. A total of 45 polymorphic simple-sequence repeat loci and 60 pairs of large repeats were identified. Most simple-sequence repeats were located in the noncoding sections of the large single-copy/small single-copy region and exhibited high A/T content. Moreover, 33 pairs of large repeat sequences were located in the protein-coding genes, whereas 27 pairs were located in the intergenic regions. Codon usage in the A. sinensis cp genome was biased toward codons ending in A/T. The distribution of codon usage in the A. sinensis cp genome was most similar to that in the Gonystylus bancanus cp genome. Comparative results of 82 protein-coding genes from the cp genomes of 29 species demonstrated that A. sinensis was a sister species to G. bancanus within the Malvales order. In a comparison made with the CGView Comparison Tool, the A. sinensis cp genome showed its highest sequence similarity (>90%) with the G. bancanus cp genome. This finding strongly supports the placement of A. sinensis as a sister to G. bancanus within the Malvales order. The complete A. sinensis cp genome information will be highly beneficial for further studies on this traditional
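
    The region sizes quoted in this abstract can be checked with a little arithmetic, since a quadripartite plastome is the sum of one large single-copy region, one small single-copy region and two inverted repeat copies. The following is a minimal sketch in Python; the function name quadripartite_length is an illustrative assumption and does not come from the paper.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length (bp) of a plastome made of LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


# Region sizes (bp) as quoted in the A. sinensis abstract above.
total = quadripartite_length(lsc=87_482, ssc=19_857, ir=26_113)
print(total)  # 159565, the genome size reported in the abstract
```

    The same check can be applied to any record in this list that reports all three region sizes.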

  19. Genomic analyses of the CAM plant pineapple.

    Science.gov (United States)

    Zhang, Jisen; Liu, Juan; Ming, Ray

    2014-07-01

    The innovation of crassulacean acid metabolism (CAM) photosynthesis in arid and/or low CO2 conditions is a remarkable case of adaptation in flowering plants. As the most important crop that utilizes CAM photosynthesis, the genetic and genomic resources of pineapple have been developed over many years. Genetic diversity studies using various types of DNA markers led to the reclassification of the two genera Ananas and Pseudananas and nine species into one genus Ananas and two species, A. comosus and A. macrodontes with five botanical varieties in A. comosus. Five genetic maps have been constructed using F1 or F2 populations, and high-density genetic maps generated by genotype sequencing are essential resources for sequencing and assembling the pineapple genome and for marker-assisted selection. There are abundant expressed sequence tag resources but limited genomic sequences in pineapple. Genes involved in the CAM pathway have been analysed in several CAM plants but only a few of them are from pineapple. A reference genome of pineapple is being generated and will accelerate genetic and genomic research in this major CAM crop. This reference genome of pineapple provides the foundation for studying the origin and regulatory mechanism of CAM photosynthesis, and the opportunity to evaluate the classification of Ananas species and botanical cultivars.

  20. Proliferation of group II introns in the chloroplast genome of the green alga Oedocladium carolinianum (Chlorophyceae)

    Directory of Open Access Journals (Sweden)

    Jean-Simon Brouard

    2016-10-01

    Full Text Available Background The chloroplast genome sustained extensive changes in architecture during the evolution of the Chlorophyceae, a morphologically and ecologically diverse class of green algae belonging to the Chlorophyta; however, the forces driving these changes are poorly understood. The five orders recognized in the Chlorophyceae form two major clades: the CS clade consisting of the Chlamydomonadales and Sphaeropleales, and the OCC clade consisting of the Oedogoniales, Chaetophorales, and Chaetopeltidales. In the OCC clade, considerable variations in chloroplast DNA (cpDNA) structure, size, gene order, and intron content have been observed. The large inverted repeat (IR), an ancestral feature characteristic of most green plants, is present in Oedogonium cardiacum (Oedogoniales) but is lacking in the examined members of the Chaetophorales and Chaetopeltidales. Remarkably, the Oedogonium 35.5-kb IR houses genes that were putatively acquired through horizontal DNA transfer. To better understand the dynamics of chloroplast genome evolution in the Oedogoniales, we analyzed the cpDNA of a second representative of this order, Oedocladium carolinianum. Methods The Oedocladium cpDNA was sequenced and annotated. The evolutionary distances separating Oedocladium and Oedogonium cpDNAs and two other pairs of chlorophycean cpDNAs were estimated using a 61-gene data set. Phylogenetic analysis of an alignment of group IIA introns from members of the OCC clade was performed. Secondary structures and insertion sites of oedogonialean group IIA introns were analyzed. Results The 204,438-bp Oedocladium genome is 7.9 kb larger than the Oedogonium genome, but its repertoire of conserved genes is remarkably similar and gene order differs by only one reversal. Although the 23.7-kb IR is missing the putative foreign genes found in Oedogonium, it contains sequences coding for a putative phage or bacterial DNA primase and a hypothetical protein. Intergenic sequences are 1.5-fold

  1. Bioinformatics tools for analysing viral genomic data.

    Science.gov (United States)

    Orton, R J; Gu, Q; Hughes, J; Maabar, M; Modha, S; Vattipally, S B; Wilkie, G S; Davison, A J

    2016-04-01

    The field of viral genomics and bioinformatics is experiencing a strong resurgence due to high-throughput sequencing (HTS) technology, which enables the rapid and cost-effective sequencing and subsequent assembly of large numbers of viral genomes. In addition, the unprecedented power of HTS technologies has enabled the analysis of intra-host viral diversity and quasispecies dynamics in relation to important biological questions on viral transmission, vaccine resistance and host jumping. HTS also enables the rapid identification of both known and potentially new viruses from field and clinical samples, thus adding new tools to the fields of viral discovery and metagenomics. Bioinformatics has been central to the rise of HTS applications because new algorithms and software tools are continually needed to process and analyse the large, complex datasets generated in this rapidly evolving area. In this paper, the authors give a brief overview of the main bioinformatics tools available for viral genomic research, with a particular emphasis on HTS technologies and their main applications. They summarise the major steps in various HTS analyses, starting with quality control of raw reads and encompassing activities ranging from consensus and de novo genome assembly to variant calling and metagenomics, as well as RNA sequencing.

  2. In silico analysis of Simple Sequence Repeats from chloroplast genomes of Solanaceae species

    Directory of Open Access Journals (Sweden)

    Evandro Vagner Tambarussi

    2009-01-01

    Full Text Available The availability of chloroplast genome (cpDNA) sequences of Atropa belladonna, Nicotiana sylvestris, N. tabacum, N. tomentosiformis, Solanum bulbocastanum, S. lycopersicum and S. tuberosum, which are Solanaceae species, allowed us to analyze the organization of cpSSRs in their genic and intergenic regions. In general, the number of cpSSRs in cpDNA ranged from 161 in S. tuberosum to 226 in N. tabacum, and the number of intergenic cpSSRs was higher than that of genic cpSSRs. Mononucleotide repeats were the most frequent in the studied species, but we also identified di-, tri-, tetra-, penta- and hexanucleotide repeats. Multiple alignments of all cpSSR sequences from the Solanaceae species made the identification of nucleotide variability possible, and the phylogeny was estimated by maximum parsimony. Our study showed that the plastome database can be exploited for phylogenetic analysis and biotechnological approaches.
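
    An in silico SSR scan of the kind summarised above can be illustrated with a short regular-expression search. The sketch below is written in Python and is not the authors' pipeline. The minimum repeat counts in MIN_REPEATS and the function name find_ssrs are hypothetical choices made for illustration, since the abstract does not state the thresholds used.

```python
import re

# Hypothetical minimum repeat counts per motif length (1 to 6 nt).
# The abstract does not give its thresholds; these are illustrative only.
MIN_REPEATS = {1: 8, 2: 4, 3: 3, 4: 3, 5: 3, 6: 3}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # A k-nt motif followed by at least n - 1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA"),
            # so a poly-A run is reported once, as a mononucleotide SSR.
            if any(motif == motif[:d] * (k // d)
                   for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


# Toy sequence: a 10-nt poly-A run and five copies of AT.
demo = "GC" + "A" * 10 + "GCGC" + "AT" * 5 + "CC"
print(find_ssrs(demo))  # [(2, 'A', 10), (16, 'AT', 5)]
```

    Classifying each hit as genic or intergenic, as in the study, would additionally require gene coordinates from the genome annotation, which this sketch does not model.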

  3. Balanced gene losses, duplications and intensive rearrangements led to an unusual regularly sized genome in Arbutus unedo chloroplasts.

    Directory of Open Access Journals (Sweden)

    Fernando Martínez-Alberola

    Full Text Available Completely sequenced plastomes provide a valuable source of information about the duplication, loss, and transfer events of chloroplast genes and phylogenetic data for resolving relationships among major groups of plants. Moreover, they can also be useful for exploiting chloroplast genetic engineering technology. Ericales account for approximately six per cent of eudicot diversity with 11,545 species, for which only three complete plastome sequences are currently available. With the aim of increasing the number of complete ericalean plastome sequences, and to open new perspectives in understanding Mediterranean plant adaptations, a genomic study based on the complete chloroplast genome sequencing of Arbutus unedo and an updated phylogenomic analysis of Asteridae was implemented. The chloroplast genome of A. unedo shows extensive rearrangements but a medium size (150,897 nt) in comparison to most angiosperms. A number of remarkable distinct features characterize the plastome of A. unedo: a five-fold reduction of the SSC region in relation to most angiosperms; complete loss or pseudogenization of a number of essential genes; duplication of the ndhH-D operon and its location within the two IRs; presence of large tandem repeats located near highly re-arranged regions and pseudogenes. All these features outline the primary evolutionary split between Ericaceae and other ericalean families. The newly sequenced plastome of A. unedo, together with the available asterid sequences, allowed the resolution of some uncertainties in previous phylogenies of Asteridae.

  4. Analysis of the Complete Chloroplast Genome of a Medicinal Plant, Dianthus superbus var. longicalyncinus, from a Comparative Genomics Perspective.

    Science.gov (United States)

    Raman, Gurusamy; Park, SeonJoo

    2015-01-01

    Dianthus superbus var. longicalycinus is an economically important traditional Chinese medicinal plant that is also used for ornamental purposes. In this study, the D. superbus chloroplast (cp) genome was compared with the cp genomes of close relatives such as Lychnis chalcedonica and Spinacia oleracea. D. superbus had the longest large single copy (LSC) region (82,805 bp), with some variations in the inverted repeat region A (IRA)/LSC regions. The IRs underwent both expansion and constriction during evolution of the Caryophyllaceae family; however, intense variations were not identified. The pseudogene ribosomal protein subunit S19 (rps19) was identified at the IRA/LSC junction, but was not present in the cp genome of other Caryophyllaceae family members. The translation initiation factor IF-1 (infA) and ribosomal protein subunit L23 (rpl23) genes were absent from the Dianthus cp genome. When the cp genome of Dianthus was compared with 31 other angiosperm lineages, the infA gene was found to have been lost in most members of rosids, solanales of asterids and Lychnis of Caryophyllales, whereas rpl23 gene loss or pseudogenization had occurred exclusively in Caryophyllales. Nevertheless, the cp genomes of Dianthus and Spinacia have two introns in the proteolytic subunit of ATP-dependent protease (clpP) gene, but Lychnis has lost introns from the clpP gene. Furthermore, phylogenetic analysis of individual protein-coding genes infA and rpl23 revealed that gene loss or pseudogenization occurred independently in the cp genome of Dianthus. Molecular phylogenetic analysis also demonstrated a sister relationship between Dianthus and Lychnis based on 78 protein-coding sequences. The results presented herein will contribute to studies of the evolution, molecular biology and genetic engineering of the medicinal and ornamental plant, D. superbus var. longicalycinus.

  5. Analysis of the Complete Chloroplast Genome of a Medicinal Plant, Dianthus superbus var. longicalyncinus, from a Comparative Genomics Perspective.

    Directory of Open Access Journals (Sweden)

    Gurusamy Raman

    Full Text Available Dianthus superbus var. longicalycinus is an economically important traditional Chinese medicinal plant that is also used for ornamental purposes. In this study, the D. superbus chloroplast (cp) genome was compared with the cp genomes of close relatives such as Lychnis chalcedonica and Spinacia oleracea. D. superbus had the longest large single copy (LSC) region (82,805 bp), with some variations in the inverted repeat region A (IRA)/LSC regions. The IRs underwent both expansion and constriction during evolution of the Caryophyllaceae family; however, intense variations were not identified. The pseudogene ribosomal protein subunit S19 (rps19) was identified at the IRA/LSC junction, but was not present in the cp genome of other Caryophyllaceae family members. The translation initiation factor IF-1 (infA) and ribosomal protein subunit L23 (rpl23) genes were absent from the Dianthus cp genome. When the cp genome of Dianthus was compared with 31 other angiosperm lineages, the infA gene was found to have been lost in most members of rosids, solanales of asterids and Lychnis of Caryophyllales, whereas rpl23 gene loss or pseudogenization had occurred exclusively in Caryophyllales. Nevertheless, the cp genomes of Dianthus and Spinacia have two introns in the proteolytic subunit of ATP-dependent protease (clpP) gene, but Lychnis has lost introns from the clpP gene. Furthermore, phylogenetic analysis of individual protein-coding genes infA and rpl23 revealed that gene loss or pseudogenization occurred independently in the cp genome of Dianthus. Molecular phylogenetic analysis also demonstrated a sister relationship between Dianthus and Lychnis based on 78 protein-coding sequences. The results presented herein will contribute to studies of the evolution, molecular biology and genetic engineering of the medicinal and ornamental plant, D. superbus var. longicalycinus.

  6. Genome-Facilitated Analyses of Geomicrobial Processes

    Energy Technology Data Exchange (ETDEWEB)

    Kenneth H. Nealson

    2012-05-02

    that makes up chitin, virtually all of the strains were in fact capable. This led to the discovery of a great many new genes involved with chitin and NAG metabolism (7). In a similar vein, a detailed study of the sugar utilization pathway revealed a major new insight into the regulation of sugar metabolism in this genus (19). Systems Biology and Comparative Genomics of the shewanellae: Several publications were put together describing the use of comparative genomics for analyses of the group Shewanella, and these were a logical culmination of our genomic-driven research (10,15,18). Eight graduate students received their Ph.D. degrees doing part of the work described here, and four postdoctoral fellows were supported. In addition, approximately 20 undergraduates took part in projects during the grant period.

  7. The Chloroplast Genome of Passiflora edulis (Passifloraceae) Assembled from Long Sequence Reads: Structural Organization and Phylogenomic Studies in Malpighiales

    Science.gov (United States)

    Cauz-Santos, Luiz A.; Munhoz, Carla F.; Rodde, Nathalie; Cauet, Stephane; Santos, Anselmo A.; Penha, Helen A.; Dornelas, Marcelo C.; Varani, Alessandro M.; Oliveira, Giancarlo C. X.; Bergès, Hélène; Vieira, Maria Lucia C.

    2017-01-01

    The family Passifloraceae consists of some 700 species classified in around 16 genera. Almost all its members belong to the genus Passiflora. In Brazil, the yellow passion fruit (Passiflora edulis) is of considerable economic importance, both for juice production and consumption as fresh fruit. The availability of chloroplast genomes (cp genomes) and their sequence comparisons has led to a better understanding of the evolutionary relationships within plant taxa. In this study, we obtained the complete nucleotide sequence of the P. edulis chloroplast genome, the first entirely sequenced in the Passifloraceae family. We determined its structure and organization, and also performed phylogenomic studies on the order Malpighiales and the Fabids clade. The P. edulis chloroplast genome is characterized by the presence of two copies of an inverted repeat sequence (IRA and IRB) of 26,154 bp, each separating a small single copy region of 13,378 bp and a large single copy (LSC) region of 85,720 bp. The annotation resulted in the identification of 105 unique genes, including 30 tRNAs, 4 rRNAs, and 71 protein coding genes. Also, 36 repetitive elements and 85 SSRs (microsatellites) were identified. The structure of the complete cp genome of P. edulis differs from that of other species because of rearrangement events detected by means of a comparison based on 22 members of the Malpighiales. The rearrangements were three inversions of 46,151, 3,765 and 1,631 bp, located in the LSC region. Phylogenomic analysis resulted in strongly supported trees, but this could also be a consequence of the limited taxonomic sampling used. Our results have provided a better understanding of the evolutionary relationships in the Malpighiales and the Fabids, confirming the potential of complete chloroplast genome sequences in inferring evolutionary relationships and the utility of long sequence reads for generating very accurate biological information. PMID:28344587

  8. Functional analyses of the Physcomitrella patens phytochromes in regulating chloroplast avoidance movement.

    Science.gov (United States)

    Uenaka, Hidetoshi; Kadota, Akeo

    2007-09-01

    Red light-induced chloroplast movement in Physcomitrella patens (Pp) is mediated by dichroic phytochrome in the cytoplasm. To analyze the molecular function of the photoreceptor in the cytoplasm, we developed a protoplast system in which chloroplast photomovement was exclusively dependent on the expression of phytochrome cDNA constructs introduced by polyethylene glycol (PEG) transformation. YFP was fused to the phytochrome constructs and their expression was detected by fluorescence. The chloroplast avoidance response was induced in the protoplasts expressing a YFP fusion of PHY1-PHY3, but not of PHY4 or YFP alone. Phy::yfp fluorescence was detected in the cytoplasm. No change in the location of phy1::yfp or phy2::yfp was revealed before and after photomovement. When phy1::yfp and phy2::yfp were targeted to the nucleus by fusing a nuclear localization signal to the constructs, red light avoidance was not induced. To determine the domains of PHY2 essential for avoidance response, various partially-deleted PHY2::YFP constructs were tested. The N-terminal extension domain (NTE) was found to be necessary but the C-terminal histidine kinase-related domain (HKRD) was dispensable. An avoidance response was not induced under expression of phytochrome N-terminal half domain [deleting both the PAS (Per, Arnt, Sim)-related domain (PRD) and HKRD]. GUS fusion of this N-terminal half domain, reported to be fully functional in Arabidopsis for several phyA- and phyB-regulated responses was not effective in chloroplast avoidance movement. Domain requirement and GUS fusion effect were also confirmed in PHY1. These results indicate that Pp phy1-Pp phy3 in the cytoplasm mediate chloroplast avoidance movement, and that NTE and PRD, but not HKRD, are required for their function.

  9. Complete chloroplast genomes from apomictic Taraxacum (Asteraceae): Identity and variation between three microspecies.

    Science.gov (United States)

    M Salih, Rubar Hussein; Majeský, Ľuboš; Schwarzacher, Trude; Gornall, Richard; Heslop-Harrison, Pat

    2017-01-01

    Chloroplast DNA sequences show substantial variation between higher plant species, and less variation within species, so are typically excellent markers to investigate evolutionary, population and genetic relationships and phylogenies. We sequenced the plastomes of Taraxacum obtusifrons Markl. (O978); T. stridulum Trávniček ined. (S3); and T. amplum Markl. (A978), three apomictic triploid (2n = 3x = 24) dandelions from the T. officinale agg. We aimed to characterize the variation in plastomes, define relationships and correlations with the apomictic microspecies status, and refine placement of the microspecies in the evolutionary or phylogenetic context of the Asteraceae. The chloroplast genomes of accessions O978 and S3 were identical and 151,322 bp long (where the nuclear genes are known to show variation), while A978 was 151,349 bp long. All three genomes contained 135 unique genes, with an additional copy of the trnF-GGA gene in the LSC region and 20 duplicated genes in the IR region, along with short repeats, the typical major Inverted Repeats (IR1 and IR2, 24,431 bp long), and Large and Small Single Copy regions (LSC 83,889 bp and SSC 18,571 bp in O978). Between the two Taraxacum plastome types, we identified 28 SNPs. The distribution of polymorphisms suggests some parts of the Taraxacum plastome are evolving at a slower rate. There was a hemi-nested inversion in the LSC region that is common to Asteraceae, and an SSC inversion from ndhF to rps15 found only in some Asteraceae lineages. A comparative repeat analysis showed variation between Taraxacum and the phylogenetically close genus Lactuca, with many more direct repeats of 40 bp or more in Lactuca (1% larger plastome than Taraxacum). When individual genes and non-coding regions were used for Asteraceae phylogeny reconstruction, not all showed the same evolutionary scenario, suggesting care is needed for interpretation of relationships if a limited number of markers are used. Studying genotypic diversity in

  10. Complete chloroplast genomes from apomictic Taraxacum (Asteraceae): Identity and variation between three microspecies

    Science.gov (United States)

    Majeský, Ľuboš; Schwarzacher, Trude; Gornall, Richard; Heslop-Harrison, Pat

    2017-01-01

    Chloroplast DNA sequences show substantial variation between higher plant species, and less variation within species, so are typically excellent markers to investigate evolutionary, population and genetic relationships and phylogenies. We sequenced the plastomes of Taraxacum obtusifrons Markl. (O978); T. stridulum Trávniček ined. (S3); and T. amplum Markl. (A978), three apomictic triploid (2n = 3x = 24) dandelions from the T. officinale agg. We aimed to characterize the variation in plastomes, define relationships and correlations with the apomictic microspecies status, and refine placement of the microspecies in the evolutionary or phylogenetic context of the Asteraceae. The chloroplast genomes of accessions O978 and S3 were identical and 151,322 bp long (where the nuclear genes are known to show variation), while A978 was 151,349 bp long. All three genomes contained 135 unique genes, with an additional copy of the trnF-GGA gene in the LSC region and 20 duplicated genes in the IR region, along with short repeats, the typical major Inverted Repeats (IR1 and IR2, 24,431 bp long), and Large and Small Single Copy regions (LSC 83,889 bp and SSC 18,571 bp in O978). Between the two Taraxacum plastome types, we identified 28 SNPs. The distribution of polymorphisms suggests some parts of the Taraxacum plastome are evolving at a slower rate. There was a hemi-nested inversion in the LSC region that is common to Asteraceae, and an SSC inversion from ndhF to rps15 found only in some Asteraceae lineages. A comparative repeat analysis showed variation between Taraxacum and the phylogenetically close genus Lactuca, with many more direct repeats of 40 bp or more in Lactuca (1% larger plastome than Taraxacum). When individual genes and non-coding regions were used for Asteraceae phylogeny reconstruction, not all showed the same evolutionary scenario, suggesting care is needed for interpretation of relationships if a limited number of markers are used. Studying genotypic diversity in

  11. The chloroplast genome sequence of the green alga Leptosira terrestris: multiple losses of the inverted repeat and extensive genome rearrangements within the Trebouxiophyceae

    Directory of Open Access Journals (Sweden)

    Turmel Monique

    2007-07-01

    Full Text Available Abstract Background In the Chlorophyta (the green algal phylum comprising the classes Prasinophyceae, Ulvophyceae, Trebouxiophyceae and Chlorophyceae), the chloroplast genome displays a highly variable architecture. While chlorophycean chloroplast DNAs (cpDNAs) deviate considerably from the ancestral pattern described for the prasinophyte Nephroselmis olivacea, the degree of remodelling sustained by the two ulvophyte cpDNAs completely sequenced to date is intermediate relative to those observed for chlorophycean and trebouxiophyte cpDNAs. Chlorella vulgaris (Chlorellales) is currently the only photosynthetic trebouxiophyte whose complete cpDNA sequence has been reported. To gain insights into the evolutionary trends of the chloroplast genome in the Trebouxiophyceae, we sequenced cpDNA from the filamentous alga Leptosira terrestris (Ctenocladales). Results The 195,081-bp Leptosira chloroplast genome resembles the 150,613-bp Chlorella genome in lacking a large inverted repeat (IR) but differs greatly in gene order. Six of the conserved genes present in Chlorella cpDNA are missing from the Leptosira gene repertoire. The 106 conserved genes, four introns and 11 free-standing open reading frames (ORFs) account for 48.3% of the genome sequence. This is the lowest gene density yet observed among chlorophyte cpDNAs. Contrary to the situation in Chlorella but similar to that in the chlorophycean Scenedesmus obliquus, the gene distribution is highly biased over the two DNA strands in Leptosira. Nine genes, compared to only three in Chlorella, have significantly expanded coding regions relative to their homologues in ancestral-type green algal cpDNAs. As observed in chlorophycean genomes, the rpoB gene is fragmented into two ORFs. Short repeats account for 5.1% of the Leptosira genome sequence and are present mainly in intergenic regions. Conclusion Our results highlight the great plasticity of the chloroplast genome in the Trebouxiophyceae and indicate

  12. The Chloroplast Genome of Euglena mutabilis-Cluster Arrangement, Intron Analysis, and Intrageneric Trends.

    Science.gov (United States)

    Dabbagh, Nadja; Preisfeld, Angelika

    2017-01-01

    A comparative analysis of the chloroplast genome of Euglena mutabilis underlined a high diversity in the evolution of plastids in euglenids. Gene clusters in more derived Euglenales increased in complexity, with only a few but remarkable changes in the genus Euglena. Euglena mutabilis differed from other Euglena species in a mirror-inverted arrangement of 12 of the 15 identified clusters, making it very likely that its emergence at the base of the genus Euglena, which has been considered a long-branch artifact, is in fact a plausible position. This was corroborated by many similarities in gene arrangement and orientation with Strombomonas and Monomorphina, which suggest that the genome organization of E. mutabilis in certain clusters is a plesiomorphic feature. RNA analysis allowed the exact exon-intron boundaries and the types of the 77 identified introns to be determined, in most cases unambiguously. A detailed intron study of psbC pointed at two important issues: First, the number of introns varied even between species, and no trend from few to many introns could be observed. Second, mat1 was localized in Eutreptiales exclusively in intron 1, and mat2 was not identified. With the emergence of Euglenaceae in most species, a new intron containing mat2 inserted in front of the previous intron 1 and thereby became intron 2 with mat1.

  13. Comparative Analysis of the Chloroplast Genomic Information of Cunninghamia lanceolata (Lamb.) Hook with Sibling Species from the Genera Cryptomeria D. Don, Taiwania Hayata, and Calocedrus Kurz.

    Science.gov (United States)

    Zheng, Weiwei; Chen, Jinhui; Hao, Zhaodong; Shi, Jisen

    2016-07-07

    Chinese fir (Cunninghamia lanceolata (Lamb.) Hook) is an important coniferous tree species for timber production, which accounts for ~40% of log supply from plantations in southern China. Chloroplast genetic engineering is an exciting approach for engineering several valuable tree traits. In this study, we revisited the published complete chloroplast genome sequences of Chinese fir (NC_021437) and four other coniferous species in Taxodiaceae. Comparison of their chloroplast genomes revealed three unique inversions located downstream of the gene clusters, as well as evolutionary divergence, although overall the chloroplast genomic structure of the Cupressaceae lineage was conserved. We also investigated the phylogenetic position of Chinese fir among conifers by examining gene functions, selection forces, substitution rates, and the full chloroplast genome sequence. Consistent with previous molecular systematics analysis, the results provided a well-supported phylogeny framework for the Cupressaceae that strongly confirms the "basal" position of Cunninghamia lanceolata. The structure of the Cunninghamia lanceolata chloroplast genome showed a partial lack of one IR copy; rearrangements clearly occurred, and slight evolutionary divergence appeared among the cp genomes of C. lanceolata, Taiwania cryptomerioides, Taiwania flousiana, Calocedrus formosana and Cryptomeria japonica. The information from sequence divergence and length variation of genes could be further considered for bioengineering research.

  14. A Phylogenetic Analysis of 34 Chloroplast Genomes Elucidates the Relationships between Wild and Domestic Species within the Genus Citrus.

    Science.gov (United States)

    Carbonell-Caballero, Jose; Alonso, Roberto; Ibañez, Victoria; Terol, Javier; Talon, Manuel; Dopazo, Joaquin

    2015-08-01

    The genus Citrus includes some of the most important cultivated fruit trees worldwide. Although the genus has been extensively studied because of its commercial relevance, the origin of cultivated citrus species and the history of their domestication remain open questions. Here, we present a phylogenetic analysis of the chloroplast genomes of 34 citrus genotypes, which constitutes the most comprehensive and detailed study to date on the evolution and variability of the genus Citrus. A statistical model was used to estimate divergence times between the major citrus groups. Additionally, a complete map of the variability across the genome of different citrus species was produced, including single nucleotide variants, heteroplasmic positions, indels (insertions and deletions), and large structural variants. The distribution of all these variants provided further independent support to the phylogeny obtained. An unexpected finding was the high level of heteroplasmy found in several of the analyzed genomes. The use of the complete chloroplast DNA not only paves the way for a better understanding of the phylogenetic relationships within the Citrus genus but also provides original insights into other elusive evolutionary processes, such as chloroplast inheritance, heteroplasmy, and gene selection.

  15. A trnI_CAU triplication event in the complete chloroplast genome of Paris verticillata M.Bieb. (Melanthiaceae, Liliales).

    Science.gov (United States)

    Do, Hoang Dang Khoa; Kim, Jung Sung; Kim, Joo-Hwan

    2014-06-19

    The chloroplast is an essential plant organelle responsible for photosynthesis. Gene duplication, relocation, and loss in the chloroplast genome (cpDNA) are useful for exploring the evolution and phylogeny of plant species. In this study, the complete chloroplast genome of Paris verticillata was sequenced using the 454 sequencing system and Sanger sequencing method to trace the evolutionary pattern in the tribe Parideae of the family Melanthiaceae (Liliales). The circular double-stranded cpDNA of P. verticillata (157,379 bp) consists of two inverted repeat regions each of 28,373 bp, a large single copy of 82,726 bp, and a small single copy of 17,907 bp. Gene content and order are generally similar to the previously reported cpDNA sequences within the order Liliales. However, we found that trnI_CAU was triplicated in P. verticillata. In addition, cemA is suspected to be a pseudogene due to the presence of internal stop codons created by poly(A) insertion and single small CA repeats. Such changes were not found in previously examined cpDNAs of the Melanthiaceae or other families of the Liliales, suggesting that such features are unique to the tribe Parideae of Melanthiaceae. The characteristics of P. verticillata cpDNA will provide useful information for uncovering the evolution within Paris and for further research of plastid genome evolution and phylogenetic studies in Liliales.
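
The region sizes quoted in this abstract can be cross-checked with simple arithmetic: in a quadripartite plastome the total length should equal the large single copy plus the small single copy plus two copies of the inverted repeat. The following is a minimal sketch of that check in Python; the function name `quadripartite_length` is hypothetical and is not taken from the cited paper.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length (bp) of a circular plastome made of one LSC,
    one SSC and two inverted-repeat (IR) copies."""
    return lsc + ssc + 2 * ir


# Region sizes (bp) as reported for Paris verticillata in the abstract above.
paris_total = quadripartite_length(lsc=82_726, ssc=17_907, ir=28_373)
print(paris_total)  # 157379, matching the reported genome size
```

A mismatch between the computed and reported totals would usually point to a rounding or transcription issue in one of the region sizes rather than to a biological feature.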

  16. The complete chloroplast genome sequence of sugar beet (Beta vulgaris ssp. vulgaris).

    Science.gov (United States)

    Li, Han; Cao, Hua; Cai, Yan-Fei; Wang, Ji-Hua; Qu, Su-Ping; Huang, Xing-Qi

    2014-06-01

    The complete nucleotide sequence of the sugar beet (Beta vulgaris ssp. vulgaris) chloroplast genome (cpDNA) was determined in this study. The cpDNA was 149,637 bp in length, containing a pair of 24,439 bp inverted repeat regions (IR), which were separated by small and large single copy regions (SSC and LSC) of 17,701 and 83,057 bp, respectively. 53.4% of the sugar beet cpDNA consisted of gene coding regions (protein coding and RNA genes). The gene content and relative positions of 113 individual genes (79 protein encoding genes, 30 tRNA genes, 4 rRNA genes) were almost identical to those of tobacco cpDNA. The overall AT contents of the sugar beet cpDNA were 63.6% and in the LSC, SSC and IR regions were 65.9%, 70.8% and 57.8%, respectively. Fifteen genes contained one intron, while three genes had two introns.

  17. Complete chloroplast genome sequence of an orchid model plant candidate: Erycina pusilla apply in tropical Oncidium breeding.

    Directory of Open Access Journals (Sweden)

    I-Chun Pan

    Oncidium is an important ornamental plant but the study of its functional genomics is difficult. Erycina pusilla is a fast-growing Oncidiinae species. Several characteristics including low chromosome number, small genome size, short growth period, and its ability to complete its life cycle in vitro make E. pusilla a good model candidate and parent for hybridization for orchids. Although genetic information remains limited, systematic molecular analysis of its chloroplast genome might provide useful genetic information. By combining bacterial artificial chromosome (BAC) clones and next-generation sequencing (NGS), the chloroplast (cp) genome of E. pusilla was sequenced accurately, efficiently and economically. The cp genome of E. pusilla shares 89 and 84% similarity with Oncidium Gower Ramsey and Phalaenopsis aphrodite, respectively. Comparing these 3 cp genomes, 5 regions have been identified as showing diversity. Using PCR analysis of 19 species belonging to the Epidendroideae subfamily, a conserved deletion was found in the rps15-trnN region of the Cymbidieae tribe. Because commercial Oncidium varieties in Taiwan are limited, identification of potential parents using molecular breeding methods has become very important. To demonstrate the relationship between taxonomic position and hybrid compatibility of E. pusilla, 4 DNA regions of 36 tropically adapted Oncidiinae varieties have been analyzed. The results indicated that trnF-ndhJ and trnH-psbA were suitable for phylogenetic analysis. E. pusilla proved to be phylogenetically closer to Rodriguezia and Tolumnia than Oncidium, despite its similar floral appearance to Oncidium. These results indicate the hybrid compatibility of E. pusilla, its cp genome providing important information for Oncidium breeding.

  18. Transcriptional Slippage and RNA Editing Increase the Diversity of Transcripts in Chloroplasts: Insight from Deep Sequencing of Vigna radiata Genome and Transcriptome.

    Directory of Open Access Journals (Sweden)

    Ching-Ping Lin

    We performed deep sequencing of the nuclear and organellar genomes of three mungbean genotypes: Vigna radiata ssp. sublobata TC1966, V. radiata var. radiata NM92 and the recombinant inbred line RIL59 derived from a cross between TC1966 and NM92. Moreover, we performed deep sequencing of the RIL59 transcriptome to investigate transcript variability. The mungbean chloroplast genome has a quadripartite structure including a pair of inverted repeats separated by two single copy regions. A total of 213 simple sequence repeats were identified in the chloroplast genomes of NM92 and RIL59; 78 single nucleotide variants and nine indels were discovered in comparing the chloroplast genomes of TC1966 and NM92. Analysis of the mungbean chloroplast transcriptome revealed mRNAs that were affected by transcriptional slippage and RNA editing. Transcriptional slippage frequency was positively correlated with the length of simple sequence repeats of the mungbean chloroplast genome (R² = 0.9911). In total, 41 C-to-U editing sites were found in 23 chloroplast genes and in one intergenic spacer. No editing site that swapped U to C was found. A combination of bioinformatics and experimental methods revealed that the plastid-encoded RNA polymerase-transcribed genes psbF and ndhA are affected by transcriptional slippage in mungbean and in main lineages of land plants, including three dicots (Glycine max, Brassica rapa, and Nicotiana tabacum), two monocots (Oryza sativa and Zea mays), two gymnosperms (Pinus taeda and Ginkgo biloba) and one moss (Physcomitrella patens). Transcript analysis of the rps2 gene showed that transcriptional slippage could affect transcripts at single sequence repeat regions with poly-A runs. These results indicate that transcriptional slippage together with incomplete RNA editing may cause sequence diversity of transcripts in chloroplasts of land plants.

  19. Genome size analyses of Pucciniales reveal the largest fungal genomes

    Directory of Open Access Journals (Sweden)

    Silvia Tavares

    2014-08-01

    Rust fungi (Basidiomycota, Pucciniales) are biotrophic plant pathogens which exhibit diverse complexities in their life cycles and host ranges. The completion of genome sequencing of a few rust fungi has revealed the occurrence of large genomes. Sequencing efforts for other rust fungi have been hampered by uncertainty concerning their genome sizes. Flow cytometry was recently applied to estimate the genome size of a few rust fungi, and confirmed the occurrence of large genomes in this order (averaging 151.5 Mbp, while the average for Basidiomycota was 49.9 Mbp and was 37.7 Mbp for all fungi). In this work, we have used an innovative and simple approach to simultaneously isolate nuclei from the rust and its host plant in order to estimate the genome size of 30 rust species by flow cytometry. Genome sizes varied over 10-fold, from 70 to 893 Mbp, with an average genome size value of 380.2 Mbp. Compared to the genome sizes of over 1,800 fungi, Gymnosporangium confusum possesses the largest fungal genome ever reported (893.2 Mbp). Moreover, even the smallest rust genome determined in this study is larger than the vast majority of fungal genomes (94%). The average genome size of the Pucciniales is now 305.5 Mbp, while the average Basidiomycota genome size has shifted to 70.4 Mbp and the average for all fungi reached 44.2 Mbp. Despite the fact that no correlation could be drawn between the genome sizes, the phylogenomics or the life cycle of rust fungi, it is interesting to note that rusts with Fabaceae hosts present genomes clearly larger than those with Poaceae hosts. Although this study comprises only a small fraction of the more than 7,000 rust species described, it seems already evident that the Pucciniales represent a group where genome size expansion could be a common characteristic. This is in sharp contrast to sister taxa, placing this order in a relevant position in fungal genomics research.

  20. The Chloroplast Genome of Utricularia reniformis Sheds Light on the Evolution of the ndh Gene Complex of Terrestrial Carnivorous Plants from the Lentibulariaceae Family

    Science.gov (United States)

    Silva, Saura R.; Diaz, Yani C. A.; Penha, Helen Alves; Pinheiro, Daniel G.; Fernandes, Camila C.; Miranda, Vitor F. O.; Michael, Todd P.

    2016-01-01

    Lentibulariaceae is the richest family of carnivorous plants spanning three genera including Pinguicula, Genlisea, and Utricularia. Utricularia is globally distributed, and, unlike Pinguicula and Genlisea, has both aquatic and terrestrial forms. In this study we present the analysis of the chloroplast (cp) genome of the terrestrial Utricularia reniformis. U. reniformis has a standard cp genome of 139,725 bp, encoding a gene repertoire similar to essentially all photosynthetic organisms. However, an exclusive combination of losses and pseudogenization of the plastid NAD(P)H-dehydrogenase (ndh) gene complex were observed. Comparisons among aquatic and terrestrial forms of Pinguicula, Genlisea, and Utricularia indicate that, whereas the aquatic forms retained functional copies of the eleven ndh genes, these have been lost or truncated in terrestrial forms, suggesting that the ndh function may be dispensable in terrestrial Lentibulariaceae. Phylogenetic scenarios of the ndh gene loss and recovery among Pinguicula, Genlisea, and Utricularia to the ancestral Lentibulariaceae clade are proposed. Interestingly, RNAseq analysis evidenced that U. reniformis cp genes are transcribed, including the truncated ndh genes, suggesting that these are not completely inactivated. In addition, potential novel RNA-editing sites were identified in at least six U. reniformis cp genes, while none were identified in the truncated ndh genes. Moreover, phylogenomic analyses support that Lentibulariaceae is monophyletic, belonging to the higher core Lamiales clade, corroborating the hypothesis that the first Utricularia lineage emerged in terrestrial habitats and then evolved to epiphytic and aquatic forms. Furthermore, several truncated cp genes were found interspersed with U. reniformis mitochondrial and nuclear genome scaffolds, indicating that as observed in other smaller plant genomes, such as Arabidopsis thaliana, and the related and carnivorous Genlisea nigrocaulis and G. hispidula, the

  1. Assessment of multi-enzyme operon engineering of tobacco chloroplast genome for high-level simultaneous expression of cellulolytic enzymes

    Energy Technology Data Exchange (ETDEWEB)

    Kolotilin, I. [Agriculture and Agri-Food Canada, London, ON (Canada); Pereira, E.O.; Menassa, R. [Western Ontario Univ., London, ON (Canada). Dept. of Biology; Agriculture and Agri-Food Canada, London, ON (Canada)

    2009-07-01

    The use of biofuels as an environmentally-sound substitute for depleting fossil fuels was discussed. Commercially produced biofuels are generated primarily from starch or sugar and supply only a small fraction of global fuel requirements. Although cellulosic biomass can serve as an abundant and renewable source of fermentable sugars, the cost of converting biomass to fuel is too high. Plant genetic engineering techniques are more economical for producing recombinant proteins because of the low-cost of the growing bioreactors. The transformation of the tobacco chloroplast genome has proven to be very prolific in terms of recombinant protein yield, which typically reaches 10 to 20 per cent of total soluble protein. In addition, plastid transcription-translation machinery allows for the simultaneous expression of several genes from artificial operons, providing the potential to engineer several proteins in one transformation step. The purpose of this study was to produce transplastomic tobacco plants bearing single genes as well as operons of cell wall-degrading enzymes for high-level expression. An attempt was made to reproduce an engineering approach in tobacco chloroplasts to generate a potent mini-cellulosome. The resulting enzymes were evaluated for their ability to degrade biomass. The study also examined the feasibility of using crude extracts of highly-expressing plants as an additive in the biomass fermentation process. The productivity of transplastomic plants was compared with plants transiently expressing cellulolytic enzymes directed to other cellular compartments.

  2. Complete chloroplast genome sequence of poisonous and medicinal plant Datura stramonium: organizations and implications for genetic engineering.

    Science.gov (United States)

    Yang, Yang; Dang, Yuanye; Li, Qing; Lu, Jinjian; Li, Xiwen; Wang, Yitao

    2014-01-01

    Datura stramonium is a widely used poisonous plant with great medicinal and economic value. Its chloroplast (cp) genome is 155,871 bp in length with a typical quadripartite structure of the large (LSC, 86,302 bp) and small (SSC, 18,367 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,601 bp). The genome contains 113 unique genes, including 80 protein-coding genes, 29 tRNAs and four rRNAs. A total of 11 forward, 9 palindromic and 13 tandem repeats were detected in the D. stramonium cp genome. Most simple sequence repeats (SSRs) are AT-rich and are less abundant in coding regions than in non-coding regions. Both SSRs and GC content were unevenly distributed across the cp genome. All preferred synonymous codons were found to end in A or T. The difference in GC contents of entire genomes and of the three codon positions suggests that the D. stramonium cp genome might possess different genomic organization, in part due to different mutational pressures. The five most divergent coding regions and four non-coding regions (trnH-psbA, rps4-trnS, ndhD-ccsA, and ndhI-ndhG) were identified using whole plastome alignment, which can be used to develop molecular markers for phylogenetics and barcoding studies within the Solanaceae. Phylogenetic analysis based on 68 protein-coding genes supported Datura as a sister to Solanum. This study provides valuable information for phylogenetic and cp genetic engineering studies of this poisonous and medicinal plant.
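
The abstract does not say which tool or thresholds were used to detect simple sequence repeats. As an illustration only, a perfect-repeat scan can be sketched with a regular expression; the function name `find_ssrs`, the thresholds and the toy sequence below are all assumptions, not the authors' method.

```python
import re


def find_ssrs(seq: str, motif_len: int, min_repeats: int):
    """Return (start, motif, repeat_count) for perfect tandem repeats
    whose repeating unit is exactly `motif_len` bases long."""
    pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_repeats - 1))
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        # Skip motifs that are themselves built from a shorter unit
        # (e.g. "TT" is really a mononucleotide run of "T").
        if motif in (motif + motif)[1:-1]:
            continue
        hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return hits


# Toy sequence (not real plastome data): a T10 run and an (AT)5 repeat.
toy = "GGCA" + "T" * 10 + "GC" + "AT" * 5 + "GC"
print(find_ssrs(toy, motif_len=1, min_repeats=8))  # [(4, 'T', 10)]
print(find_ssrs(toy, motif_len=2, min_repeats=4))  # [(16, 'AT', 5)]
```

Real SSR surveys usually apply different minimum repeat counts per motif length and may tolerate imperfect repeats, which this sketch does not attempt.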

  3. Complete Chloroplast Genome Sequence of Tartary Buckwheat (Fagopyrum tataricum) and Comparative Analysis with Common Buckwheat (F. esculentum).

    Directory of Open Access Journals (Sweden)

    Kwang-Soo Cho

    We report the chloroplast (cp) genome sequence of tartary buckwheat (Fagopyrum tataricum) obtained by next-generation sequencing technology and compared this with the previously reported common buckwheat (F. esculentum ssp. ancestrale) cp genome. The cp genome of F. tataricum has a total sequence length of 159,272 bp, which is 327 bp shorter than the common buckwheat cp genome. The cp gene content, order, and orientation are similar to those of common buckwheat, but with some structural variation at tandem and palindromic repeat frequencies and junction areas. A total of seven InDels (around 100 bp) were found within the intergenic sequences and the ycf1 gene. Copy number of the 21-bp tandem repeat varied between F. tataricum (four repeats) and F. esculentum (one repeat), and the InDel of the ycf1 gene was 63 bp long. Coding sequences are highly conserved at the nucleotide and amino acid levels, with about 98% homology, and four genes (rpoC2, ycf3, accD, and clpP) have high synonymous (Ks) values. PCR-based InDel markers were applied to diverse genetic resources of F. tataricum and F. esculentum, and the amplicon size was identical to that expected in silico. Therefore, these InDel markers are informative biomarkers to practically distinguish raw or processed buckwheat products derived from F. tataricum and F. esculentum.

  4. Complete Chloroplast Genome Sequence of Tartary Buckwheat (Fagopyrum tataricum) and Comparative Analysis with Common Buckwheat (F. esculentum).

    Science.gov (United States)

    Cho, Kwang-Soo; Yun, Bong-Kyoung; Yoon, Young-Ho; Hong, Su-Young; Mekapogu, Manjulatha; Kim, Kyung-Hee; Yang, Tae-Jin

    2015-01-01

    We report the chloroplast (cp) genome sequence of tartary buckwheat (Fagopyrum tataricum) obtained by next-generation sequencing technology and compared this with the previously reported common buckwheat (F. esculentum ssp. ancestrale) cp genome. The cp genome of F. tataricum has a total sequence length of 159,272 bp, which is 327 bp shorter than the common buckwheat cp genome. The cp gene content, order, and orientation are similar to those of common buckwheat, but with some structural variation at tandem and palindromic repeat frequencies and junction areas. A total of seven InDels (around 100 bp) were found within the intergenic sequences and the ycf1 gene. Copy number of the 21-bp tandem repeat varied between F. tataricum (four repeats) and F. esculentum (one repeat), and the InDel of the ycf1 gene was 63 bp long. Coding sequences are highly conserved at the nucleotide and amino acid levels, with about 98% homology, and four genes (rpoC2, ycf3, accD, and clpP) have high synonymous (Ks) values. PCR-based InDel markers were applied to diverse genetic resources of F. tataricum and F. esculentum, and the amplicon size was identical to that expected in silico. Therefore, these InDel markers are informative biomarkers to practically distinguish raw or processed buckwheat products derived from F. tataricum and F. esculentum.

  5. The chloroplast genome hidden in plain sight, open access publishing and anti-fragile distributed data sources.

    Science.gov (United States)

    McKernan, Kevin Judd

    2016-11-01

    We sequenced several cannabis genomes in June of 2011, and the first and longest contigs to emerge were the chloroplast and mitochondrial genomes. Having been a contributor to the Human Genome Project and an eye-witness to the real benefits of immediate data release, I have first-hand experience with the potential mal-investment of millions of dollars of taxpayer money narrowly averted due to the adopted global rapid data release policy. The policy was vital in reducing duplication of effort and economic waste. As a result, we felt obligated to publish the Cannabis genome data in a similar spirit and placed them immediately on a cloud-based Amazon server in August of 2011. While these rapid data release practices were heralded by many in the media, we still find that some authors fail to find or reference said work, and we hope to persuade the readership that this omission has more pervasive repercussions than bruised egos and is a regression for our community.

  6. [Technological advances in single-cell genomic analyses].

    Science.gov (United States)

    Pan, Xing-Hua; Zhu, Hai-Ying; Marjani, Sadie L

    2011-01-01

    Technological progress in genomics has transformed life science research. The main objectives of genomics are sequencing of new genomes and genome-wide identification of the function and the interaction of genes and their products. The recently developed second generation or next generation sequencing platforms and DNA microarray technology are immensely important and powerful tools for functional genomic analyses. However, their application is limited by the requirement of sufficient amounts of high quality nucleic acid samples. Therefore, when only a single cell or a very small number of cells are available or are preferred, the whole genomic sequencing or functional genomic objectives cannot be achieved conventionally and require a robust amplification method. This review highlights DNA amplification technologies and summarizes the strategies currently utilized for whole genome sequencing of a single cell, with specific focus on studies investigating microorganisms. An outline for targeted re-sequencing enabling the analysis of larger genomes is also provided. Furthermore, the review presents the emerging functional genomic applications using next-generation sequencing or microarray analysis to examine genome-wide transcriptional profile, chromatin modification and other types of protein-DNA binding profile, and CpG methylation mapping in a single cell or a very low quantity of cells. The nature of these technologies and their prospects are also addressed.

  7. Maternal inheritance of mitochondrial genomes and complex inheritance of chloroplast genomes in Actinidia Lind.: evidences from interspecific crosses.

    Science.gov (United States)

    Li, Dawei; Qi, Xiaoqiong; Li, Xinwei; Li, Li; Zhong, Caihong; Huang, Hongwen

    2013-04-01

    The inheritance pattern of chloroplast and mitochondria is a critical determinant in studying plant phylogenetics, biogeography and hybridization. To better understand chloroplast and mitochondrial inheritance patterns in Actinidia (traditionally called kiwifruit), we performed 11 artificial interspecific crosses and studied the ploidy levels, morphology, and sequence polymorphisms of chloroplast DNA (cpDNA) and mitochondrial DNA (mtDNA) of parents and progenies. Sequence analysis showed that the mtDNA haplotypes of F1 hybrids entirely matched those of the female parents, indicating strictly maternal inheritance of Actinidia mtDNA. However, the cpDNA haplotypes of F1 hybrids, which were predominantly derived from the male parent (9 crosses), could also originate from the mother (1 cross) or both parents (1 cross), demonstrating paternal, maternal, and biparental inheritance of Actinidia cpDNA. The inheritance patterns of the cpDNA in Actinidia hybrids differed according to the species and genotypes chosen to be the parents, rather than the ploidy levels of the parent selected. The multiple inheritance modes of Actinidia cpDNA contradicted the strictly paternal inheritance patterns observed in previous studies, and provided new insights into the use of cpDNA markers in studies of phylogenetics, biogeography and introgression in Actinidia and other angiosperms.

  8. Capturing the biofuel wellhead and powerhouse: the chloroplast and mitochondrial genomes of the leguminous feedstock tree Pongamia pinnata.

    Directory of Open Access Journals (Sweden)

    Stephen H Kazakoff

    Pongamia pinnata (syn. Millettia pinnata) is a novel, fast-growing arboreal legume that bears prolific quantities of oil-rich seeds suitable for the production of biodiesel and aviation biofuel. Here, we have used Illumina® 'Second Generation DNA Sequencing (2GS)' and a new short-read de novo assembler, SaSSY, to assemble and annotate the Pongamia chloroplast (152,968 bp; cpDNA) and mitochondrial (425,718 bp; mtDNA) genomes. We also show that SaSSY can be used to accurately assemble 2GS data, by re-assembling the Lotus japonicus cpDNA and in the process assemble its mtDNA (380,861 bp). The Pongamia cpDNA contains 77 unique protein-coding genes and is almost 60% gene-dense. It contains a 50 kb inversion common to other legumes, as well as a novel 6.5 kb inversion that is responsible for the non-disruptive, re-orientation of five protein-coding genes. Additionally, two copies of an inverted repeat firmly place the species outside the subclade of the Fabaceae lacking the inverted repeat. The Pongamia and L. japonicus mtDNA contain just 33 and 31 unique protein-coding genes, respectively, and like other angiosperm mtDNA, have expanded intergenic and multiple repeat regions. Through comparative analysis with Vigna radiata we measured the average synonymous and non-synonymous divergence of all three legume mitochondrial (1.59% and 2.40%, respectively) and chloroplast (8.37% and 8.99%, respectively) protein-coding genes. Finally, we explored the relatedness of Pongamia within the Fabaceae and showed the utility of the organellar genome sequences by mapping transcriptomic data to identify up- and down-regulated stress-responsive gene candidates and confirm in silico predicted RNA editing sites.

  9. Sequencing of chloroplast genomes from wheat, barley, rye and their relatives provides a detailed insight into the evolution of the Triticeae tribe.

    Directory of Open Access Journals (Sweden)

    Christopher P Middleton

    Using Roche/454 technology, we sequenced the chloroplast genomes of 12 Triticeae species, including bread wheat, barley and rye, as well as the diploid progenitors and relatives of bread wheat Triticum urartu, Aegilops speltoides and Ae. tauschii. Two wild tetraploid taxa, Ae. cylindrica and Ae. geniculata, were also included. Additionally, we incorporated wild Einkorn wheat Triticum boeoticum and its domesticated form T. monococcum and two Hordeum spontaneum (wild barley) genotypes. Chloroplast genomes were used for overall sequence comparison, phylogenetic analysis and dating of divergence times. We estimate that barley diverged from rye and wheat approximately 8-9 million years ago (MYA). The genome donors of hexaploid wheat diverged between 2.1-2.9 MYA, while rye diverged from Triticum aestivum approximately 3-4 MYA, more recently than previously estimated. Interestingly, the A genome taxa T. boeoticum and T. urartu were estimated to have diverged approximately 570,000 years ago. As these two have a reproductive barrier, the divergence time estimate also provides an upper limit for the time required for the formation of a species boundary between the two. Furthermore, we conclusively show that the chloroplast genome of hexaploid wheat was contributed by the B genome donor and that this unknown species diverged from Ae. speltoides about 980,000 years ago. Additionally, sequence alignments identified a translocation of a chloroplast segment to the nuclear genome which is specific to the rye/wheat lineage. We propose the presented phylogeny and divergence time estimates as a reference framework for future studies on Triticeae.

  10. PopGenome: an efficient Swiss army knife for population genomic analyses in R.

    Science.gov (United States)

    Pfeifer, Bastian; Wittelsbürger, Ulrich; Ramos-Onsins, Sebastian E; Lercher, Martin J

    2014-07-01

    Although many computer programs can perform population genetics calculations, they are typically limited in the analyses and data input formats they offer; few applications can process the large data sets produced by whole-genome resequencing projects. Furthermore, there is no coherent framework for the easy integration of new statistics into existing pipelines, hindering the development and application of new population genetics and genomics approaches. Here, we present PopGenome, a population genomics package for the R software environment (a de facto standard for statistical analyses). PopGenome can efficiently process genome-scale data as well as large sets of individual loci. It reads DNA alignments and single-nucleotide polymorphism (SNP) data sets in most common formats, including those used by the HapMap, 1000 human genomes, and 1001 Arabidopsis genomes projects. PopGenome also reads associated annotation files in GFF format, enabling users to easily define regions or classify SNPs based on their annotation; all analyses can also be applied to sliding windows. PopGenome offers a wide range of diverse population genetics analyses, including neutrality tests as well as statistics for population differentiation, linkage disequilibrium, and recombination. PopGenome is linked to Hudson's MS and Ewing's MSMS programs to assess statistical significance based on coalescent simulations. PopGenome's integration in R facilitates effortless and reproducible downstream analyses as well as the production of publication-quality graphics. Developers can easily incorporate new analysis methods into the PopGenome framework. PopGenome and R are freely available from CRAN (http://cran.r-project.org/) for all major operating systems under the GNU General Public License.
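
To make the idea of a sliding-window statistic concrete, the following standalone Python sketch computes nucleotide diversity (mean pairwise differences per site) over fixed-width windows of a small alignment. It does not use or reproduce PopGenome's R interface; the function names, window parameters and toy alignment are assumptions made for illustration, and gaps or ambiguous bases are treated as ordinary characters.

```python
from itertools import combinations


def nucleotide_diversity(seqs):
    """Mean number of pairwise differences per site for
    equal-length aligned sequences."""
    n_sites = len(seqs[0])
    pairs = list(combinations(seqs, 2))
    diffs = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return diffs / (len(pairs) * n_sites)


def sliding_diversity(seqs, width, step):
    """Yield (window_start, diversity) for each full-width window."""
    length = len(seqs[0])
    for start in range(0, length - width + 1, step):
        window = [s[start:start + width] for s in seqs]
        yield start, nucleotide_diversity(window)


# Toy alignment (not real data): variation is confined to the second half.
alignment = ["AAAAAAAA", "AAAATAAA", "AAAATTAA"]
for start, pi in sliding_diversity(alignment, width=4, step=4):
    print(start, round(pi, 3))
```

Non-overlapping windows are used here for brevity; choosing a step smaller than the width gives overlapping windows and a smoother profile.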

  11. Sequence and phylogenetic analyses of the chloroplast 16S rRNA, tufA, and rbcL genes from Bryopsis hypnoides

    Institute of Scientific and Technical Information of China (English)

    LÜ Fang; WANG Guangce

    2011-01-01

    Using shotgun sequencing data, the complete sequences of chloroplast 16S rRNA and tufA genes were acquired from native specimens of Bryopsis hypnoides (Qingdao, China). There are two group I introns in the 16S rRNA gene, which is structurally similar to that of Caulerpa sertularioides (Bryopsidales, Chlorophyta). The chloroplast-encoded tufA gene sequence is 1230 bp long, very AT-rich (61.5%), and is similar to previously published 16S rRNA sequences of bryopsidinean algae. Phylogenetic analyses based on chloroplast 16S rRNA and tufA gene sequence data support previous hypotheses that the Bryopsidineae, Halimedineae, and Ostreobidineae are three distinct lineages. These results also confirmed the exclusion of Avrainvillea from the family Udoteaceae. Phylogenetic analyses inferred the genus Bryopsis as sister to Derbesia; however, this clade lacked robust nodal support. Moreover, the phylogenetic tree inferred from rbcL GenBank sequences, combined with the geographical distributions of Bryopsis species, identified a strongly supported clade for three differently distributed Asian Bryopsis species. The preliminary results suggest that these organisms are of distinct regional endemism.

  12. Identification of chloroplast genome loci suitable for high-resolution phylogeographic studies of Colocasia esculenta (L.) Schott (Araceae) and closely related taxa.

    Science.gov (United States)

    Ahmed, Ibrar; Matthews, Peter J; Biggs, Patrick J; Naeem, Muhammad; McLenachan, Patricia A; Lockhart, Peter J

    2013-09-01

    Recently, we reported the chloroplast genome-wide association of oligonucleotide repeats, indels and nucleotide substitutions in aroid chloroplast genomes. We hypothesized that the distribution of oligonucleotide repeat sequences in a single representative genome can be used to identify mutational hotspots and loci suitable for population genetic, phylogenetic and phylogeographic studies. Using information on the location of oligonucleotide repeats in the chloroplast genome of taro (Colocasia esculenta), we designed 30 primer pairs to amplify and sequence polymorphic loci. The primers have been tested in a range of intra-specific to intergeneric comparisons, including ten taro samples (Colocasia esculenta) from diverse geographical locations, four other Colocasia species (C. affinis, C. fallax, C. formosana, C. gigantea) and three other aroid genera (represented by Remusatia vivipara, Alocasia brisbanensis and Amorphophallus konjac). Multiple sequence alignments for the intra-specific comparison revealed nucleotide substitutions (point mutations) at all 30 loci and microsatellite polymorphisms at 14 loci. The primer pairs reported here reveal levels of genetic variation suitable for high-resolution phylogeographic and evolutionary studies of taro and other closely related aroids. Our results confirm that information on repeat distribution can be used to identify loci suitable for such studies, and we expect that this approach can be used in other plant groups.

  13. Terpene metabolic engineering via nuclear or chloroplast genomes profoundly and globally impacts off-target pathways through metabolite signalling.

    Science.gov (United States)

    Pasoreck, Elise K; Su, Jin; Silverman, Ian M; Gosai, Sager J; Gregory, Brian D; Yuan, Joshua S; Daniell, Henry

    2016-09-01

    The impact of metabolic engineering on nontarget pathways and outcomes of metabolic engineering from different genomes are poorly understood questions. Therefore, squalene biosynthesis genes FARNESYL DIPHOSPHATE SYNTHASE (FPS) and SQUALENE SYNTHASE (SQS) were engineered via the Nicotiana tabacum chloroplast (C), nuclear (N) or both (CN) genomes to promote squalene biosynthesis. SQS levels were ~4300-fold higher in C and CN lines than in N, but all accumulated ~150-fold higher squalene due to substrate or storage limitations. Abnormal leaf and flower phenotypes, including lower pollen production and reduced fertility, were observed regardless of the compartment or level of transgene expression. Substantial changes in metabolomes of all lines were observed: levels of 65-120 unrelated metabolites, including the toxic alkaloid nicotine, changed by as much as 32-fold. Profound effects of transgenesis on nontarget gene expression included changes in the abundance of 19 076 transcripts by up to 2000-fold in CN; 7784 transcripts by up to 1400-fold in N; and 5224 transcripts by as much as 2200-fold in C. Transporter-related transcripts were induced, and cell cycle-associated transcripts were disproportionally repressed in all three lines. Transcriptome changes were validated by qRT-PCR. The mechanism underlying these large changes likely involves metabolite-mediated anterograde and/or retrograde signalling irrespective of the level of transgene expression or end product, due to imbalance of metabolic pools, offering new insight into both anticipated and unanticipated consequences of metabolic engineering.

  14. Genomic and Transcriptomic Analyses of Foodborne Bacterial Pathogens

    Science.gov (United States)

    Zhang, Wei; Dudley, Edward G.; Wade, Joseph T.

    DNA microarrays (often interchangeably called DNA chips or DNA arrays) are among the most popular analytical tools for high-throughput comparative genomic and transcriptomic analyses of foodborne bacterial pathogens. A typical DNA microarray contains hundreds to millions of small DNA probes that are chemically attached (or "printed") onto the surface of a microscopic glass slide. Depending on the specific "printing" and probe synthesis technologies for different microarray platforms, such DNA probes can be PCR amplicons or in situ synthesized short oligonucleotides. DNA microarray technologies have revolutionized the way that we investigate the biology of foodborne bacterial pathogens. The major advantage of these technologies is that DNA microarrays allow comparison of subtle genomic or transcriptomic variations between two bacterial samples, such as genomic variations between two different bacterial strains or transcriptomic alterations of the same bacterial strain under two different treatments. Some applications of comparative genomic hybridization microarrays and global gene expression microarrays have been covered in previous chapters of this book.

  15. Chloroplast DNA sequence of the green alga Oedogonium cardiacum (Chlorophyceae): Unique genome architecture, derived characters shared with the Chaetophorales and novel genes acquired through horizontal transfer

    Directory of Open Access Journals (Sweden)

    Lemieux Claude

    2008-06-01

    Background: To gain insight into the branching order of the five main lineages currently recognized in the green algal class Chlorophyceae and to expand our understanding of chloroplast genome evolution, we have undertaken the sequencing of chloroplast DNA (cpDNA) from representative taxa. The complete cpDNA sequences previously reported for Chlamydomonas (Chlamydomonadales), Scenedesmus (Sphaeropleales), and Stigeoclonium (Chaetophorales) revealed tremendous variability in their architecture, the retention of only a few ancestral gene clusters, and derived clusters shared by Chlamydomonas and Scenedesmus. Unexpectedly, our recent phylogenies inferred from these cpDNAs and the partial sequences of three other chlorophycean cpDNAs disclosed two major clades, one uniting the Chlamydomonadales and Sphaeropleales (CS clade) and the other uniting the Oedogoniales, Chaetophorales and Chaetopeltidales (OCC clade). Although molecular signatures provided strong support for this dichotomy and for the branching of the Oedogoniales as the earliest-diverging lineage of the OCC clade, more data are required to validate these phylogenies. We describe here the complete cpDNA sequence of Oedogonium cardiacum (Oedogoniales). Results: Like its three chlorophycean homologues, the 196,547-bp Oedogonium chloroplast genome displays a distinctive architecture. This genome is one of the most compact among photosynthetic chlorophytes. It has an atypical quadripartite structure, is intron-rich (17 group I and 4 group II introns), and displays 99 different conserved genes and four long open reading frames (ORFs), three of which are clustered in the spacious inverted repeat of 35,493 bp. Intriguingly, two of these ORFs (int and dpoB) revealed high similarities to genes not usually found in cpDNA. At the gene content and gene order levels, the Oedogonium genome most closely resembles its Stigeoclonium counterpart.
Characters shared by these chlorophyceans but missing in members

  16. Quantitative metagenomic analyses based on average genome size normalization

    DEFF Research Database (Denmark)

    Frank, Jeremy Alexander; Sørensen, Søren Johannes

    2011-01-01

    Over the past quarter-century, microbiologists have used DNA sequence information to aid in the characterization of microbial communities. During the last decade, this has expanded from single genes to microbial community genomics, or metagenomics, in which the gene content of an environment can...... by estimating average genome sizes. This normalization can relieve comparative biases introduced by differences in community structure, number of sequencing reads, and sequencing read lengths between different metagenomes. We demonstrate the utility of this approach by comparing metagenomes from two different...... marine sources using both conventional small-subunit (SSU) rRNA gene analyses and our quantitative method to calculate the proportion of genomes in each sample that are capable of a particular metabolic trait. With both environments, to determine what proportion of each community they make up and how...

  17. Comparative Genomic and Phylogenomic Analyses Reveal a Conserved Core Genome Shared by Estuarine and Oceanic Cyanopodoviruses

    Science.gov (United States)

    Huang, Sijun; Zhang, Si; Jiao, Nianzhi; Chen, Feng

    2015-01-01

    Podoviruses are among the major viral groups that infect the marine picocyanobacteria Prochlorococcus and Synechococcus. Here, we report the genome sequences of five Synechococcus podoviruses isolated from the estuarine environment, and perform comparative genomic and phylogenomic analyses based on a total of 20 cyanopodovirus genomes. The genomes of all the known marine cyanopodoviruses are highly syntenic. A pan-genome of 349 clustered orthologous groups was determined, among which 15 were core genes. These core genes make up nearly half of each genome in length, reflecting the high level of genome conservation among this cyanophage type. The whole genome phylogenies based on concatenated core genes and gene content were highly consistent and confirmed the separation of two discrete marine cyanopodovirus clusters, MPP-A and MPP-B. The genomes within cluster MPP-B grouped into subclusters mainly corresponding to Prochlorococcus or Synechococcus host types. Auxiliary metabolic genes tend to occur in a specific phylogenetic group of these cyanopodoviruses. All the MPP-B phages analyzed here encode the photosynthesis gene psbA, which is absent from all the MPP-A genomes thus far. Interestingly, all the MPP-B and two MPP-A Synechococcus podoviruses encode the thymidylate synthase gene thyX, while at the same genome locus all the MPP-B Prochlorococcus podoviruses encode the transaldolase gene talC. Both genes are hypothesized to have the potential to facilitate the biosynthesis of deoxynucleotides for phage replication. Inheritance of specific functional genes could be important to the evolution and ecological fitness of certain cyanophage genotypes. Our analyses demonstrate that cyanopodoviruses of estuarine and oceanic origins share a conserved core genome and suggest that accessory genes may be related to environmental adaptation. PMID:26569403
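
    The pan-genome and core-genome counts in this abstract (349 orthologous groups overall, 15 shared by all 20 genomes) correspond to a set union and a set intersection over per-genome gene content. The sketch below illustrates only that set logic; the genome names and group labels are hypothetical, and it does not reproduce the orthology clustering step the authors would have needed first.

```python
from typing import Dict, Set, Tuple


def pan_and_core(genomes: Dict[str, Set[str]]) -> Tuple[Set[str], Set[str]]:
    """Return (pan-genome, core genome) from a genome -> orthologous-group mapping."""
    if not genomes:
        raise ValueError("at least one genome is required")
    groups = list(genomes.values())
    pan = set().union(*groups)        # groups present in any genome
    core = set.intersection(*groups)  # groups present in every genome
    return pan, core


if __name__ == "__main__":
    # Hypothetical toy data: three genomes and made-up orthologous-group labels.
    toy = {
        "phage_a": {"og1", "og2", "og3"},
        "phage_b": {"og1", "og2", "og4"},
        "phage_c": {"og1", "og2", "og5", "og6"},
    }
    pan, core = pan_and_core(toy)
    print(len(pan), "groups in the pan-genome;", len(core), "core groups")
```

    Genes in the pan-genome but outside the core are the accessory genes that the abstract suggests may relate to environmental adaptation.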

  18. Comparative Genomic and Phylogenomic Analyses Reveal a Conserved Core Genome Shared by Estuarine and Oceanic Cyanopodoviruses.

    Directory of Open Access Journals (Sweden)

    Sijun Huang

    Podoviruses are among the major viral groups that infect the marine picocyanobacteria Prochlorococcus and Synechococcus. Here, we report the genome sequences of five Synechococcus podoviruses isolated from the estuarine environment, and perform comparative genomic and phylogenomic analyses based on a total of 20 cyanopodovirus genomes. The genomes of all the known marine cyanopodoviruses are highly syntenic. A pan-genome of 349 clustered orthologous groups was determined, among which 15 were core genes. These core genes make up nearly half of each genome in length, reflecting the high level of genome conservation among this cyanophage type. The whole genome phylogenies based on concatenated core genes and gene content were highly consistent and confirmed the separation of two discrete marine cyanopodovirus clusters, MPP-A and MPP-B. The genomes within cluster MPP-B grouped into subclusters mainly corresponding to Prochlorococcus or Synechococcus host types. Auxiliary metabolic genes tend to occur in a specific phylogenetic group of these cyanopodoviruses. All the MPP-B phages analyzed here encode the photosynthesis gene psbA, which is absent from all the MPP-A genomes thus far. Interestingly, all the MPP-B and two MPP-A Synechococcus podoviruses encode the thymidylate synthase gene thyX, while at the same genome locus all the MPP-B Prochlorococcus podoviruses encode the transaldolase gene talC. Both genes are hypothesized to have the potential to facilitate the biosynthesis of deoxynucleotides for phage replication. Inheritance of specific functional genes could be important to the evolution and ecological fitness of certain cyanophage genotypes. Our analyses demonstrate that cyanopodoviruses of estuarine and oceanic origins share a conserved core genome and suggest that accessory genes may be related to environmental adaptation.

  19. Chloroplast movement.

    Science.gov (United States)

    Wada, Masamitsu

    2013-09-01

    Chloroplast movement is important for plant survival under high light and for efficient photosynthesis under low light. This review introduces recent knowledge on chloroplast movement and shows how to analyze the responses and the moving mechanisms, potentially inspiring research in this field. Avoidance of strong light is mediated by the blue light receptor phototropin 2 (phot2), plausibly localized on the chloroplast envelope, and accumulation at the weak light-irradiated area is mediated by phot1 and phot2 localized on the plasma membrane. Chloroplasts move by chloroplast actin (cp-actin) filaments that must be polymerized by Chloroplast Unusual Positioning1 (CHUP1) at the front side of the moving chloroplast. To understand the signal transduction pathways and the mechanism of chloroplast movement, that is, from light capture to the motive force-generating mechanism, various methods should be employed, each suited to a different aspect. Observing chloroplast distribution patterns under different light conditions by fixed-cell sectioning is a somewhat old-fashioned technique but remains the most basic and important one. Most importantly, however, precise chloroplast behavior during and just after the induction of chloroplast movement by partial cell irradiation, using an irradiator with either a low-light or a strong-light microbeam, should be recorded by time-lapse photographs under infrared light and analyzed. Recently, various factors involved in chloroplast movement, such as cp-actin filaments and CHUP1, could be traced in Arabidopsis transgenic lines with fluorescent protein tags under a confocal laser scanning microscope (CLSM) and/or a total internal reflection fluorescence microscope (TIRFM). These methods are listed and their advantages and disadvantages are evaluated.

  20. Transfer of a eubacteria-type cell division site-determining factor CrMinD gene to the nucleus from the chloroplast genome in Chlamydomonas reinhardtii

    Institute of Scientific and Technical Information of China (English)

    LIU WeiZhong; HU Yong; ZHANG RunJie; ZHOU WeiWei; ZHU JiaYing; LIU XiangLin; HE YiKun

    2007-01-01

    MinD is a ubiquitous ATPase that plays a crucial role in selection of the division site in eubacteria, chloroplasts, and probably Archaea. In four green algae, Mesostigma viride, Nephroselmis olivacea, Chlorella vulgaris and Prototheca wickerhamii, MinD homologues are encoded in the plastid genome. However, in Arabidopsis, MinD is a nucleus-encoded, chloroplast-targeted protein involved in chloroplast division, which suggests that MinD has been transferred to the nucleus in higher land plants. Yet the lateral gene transfer (LGT) of MinD from plastid to nucleus during plastid evolution remains poorly understood. Here, we identified a nucleus-encoded MinD homologue from the unicellular green alga Chlamydomonas reinhardtii, a basal species in the green plant lineage. Overexpression of CrMinD in wild-type E. coli inhibited cell division and resulted in filamentous cell formation, clearly demonstrating the conservation of the MinD protein during the evolution of photosynthetic eukaryotes. The transient expression of CrMinD-egfp confirmed the role of the CrMinD protein in the regulation of plastid division. A search of all published plastid genome sequences of land plants found no MinD homologues, which suggests that the transfer of MinD from plastid to nucleus might have occurred before the evolution of land plants.

  1. Mechanisms of Protein Synthesis in Chloroplasts: How to Design Translatable mRNAs in Chloroplasts

    Institute of Scientific and Technical Information of China (English)

    M. Sugiura

    2007-01-01

    Chloroplast transformation provides a powerful tool to produce useful proteins in plants. After completion of the chloroplast genome sequencing from tobacco plants (Shinozaki et al., 1986; Yukawa et al., 2005), Pal Maliga's group developed the high-frequency chloroplast transformation system in tobacco (Svab and Maliga, 1993).

  2. Integrated genomic analyses of de novo pathways underlying atypical meningiomas

    Science.gov (United States)

    Harmancı, Akdes Serin; Youngblood, Mark W.; Clark, Victoria E.; Coşkun, Süleyman; Henegariu, Octavian; Duran, Daniel; Erson-Omay, E. Zeynep; Kaulen, Leon D.; Lee, Tong Ihn; Abraham, Brian J.; Simon, Matthias; Krischek, Boris; Timmer, Marco; Goldbrunner, Roland; Omay, S. Bülent; Baranoski, Jacob; Baran, Burçin; Carrión-Grant, Geneive; Bai, Hanwen; Mishra-Gorur, Ketu; Schramm, Johannes; Moliterno, Jennifer; Vortmeyer, Alexander O.; Bilgüvar, Kaya; Yasuno, Katsuhito; Young, Richard A.; Günel, Murat

    2017-01-01

    Meningiomas are mostly benign brain tumours, with a potential for becoming atypical or malignant. On the basis of comprehensive genomic, transcriptomic and epigenomic analyses, we compared benign meningiomas to atypical ones. Here, we show that the majority of primary (de novo) atypical meningiomas display loss of NF2, which co-occurs either with genomic instability or recurrent SMARCB1 mutations. These tumours harbour increased H3K27me3 signal and a hypermethylated phenotype, mainly occupying the polycomb repressive complex 2 (PRC2) binding sites in human embryonic stem cells, thereby phenocopying a more primitive cellular state. Consistent with this observation, atypical meningiomas exhibit upregulation of EZH2, the catalytic subunit of the PRC2 complex, as well as the E2F2 and FOXM1 transcriptional networks. Importantly, these primary atypical meningiomas do not harbour TERT promoter mutations, which have been reported in atypical tumours that progressed from benign ones. Our results establish the genomic landscape of primary atypical meningiomas and potential therapeutic targets. PMID:28195122

  3. The Unicellular Green Alga Chlamydomonas reinhardtii as an Experimental System to Study Chloroplast RNA Metabolism

    Science.gov (United States)

    Nickelsen, J.; Kück, U.

    Chloroplasts are typical organelles of photoautotrophic eukaryotic cells which drive a variety of functions, including photosynthesis. For many years the unicellular green alga Chlamydomonas reinhardtii has served as an experimental organism for studying photosynthetic processes. The recent development of molecular tools for this organism, together with efficient methods of genetic analysis and the availability of many photosynthesis mutants, has now made this alga a powerful model system for the analysis of chloroplast biogenesis. For example, techniques have been developed to transfer recombinant DNA into both the nuclear and the chloroplast genome. This allows both complementation tests and analyses of gene functions in vivo. Moreover, site-specific DNA recombinations in the chloroplast allow targeted gene disruption experiments, which enable "reverse genetics" to be performed. The potential of the algal system for the study of chloroplast biogenesis is illustrated in this review by the description of regulatory systems of gene expression involved in organelle biogenesis. One example concerns the regulation of trans-splicing of chloroplast mRNAs, a process which is controlled by multiple nuclear- and chloroplast-encoded factors. The second example involves the stabilization of chloroplast mRNAs. The available data lead us to predict distinct RNA elements, which interact with trans-acting factors to protect the RNA against nucleolytic attacks.

  4. Conflict amongst chloroplast DNA sequences obscures the phylogeny of a group of Asplenium ferns.

    Science.gov (United States)

    Shepherd, Lara D; Holland, Barbara R; Perrie, Leon R

    2008-07-01

    A previous study of the relationships amongst three subgroups of the Austral Asplenium ferns found conflicting signal between the two chloroplast loci investigated. Because organelle genomes like those of chloroplasts and mitochondria are thought to be non-recombining, with a single evolutionary history, we sequenced four additional chloroplast loci with the expectation that this would resolve these relationships. Instead, the conflict was only magnified. Although tree-building analyses favoured one of the three possible trees, one of the alternative trees actually had one more supporting site (six versus five) and received greater support in spectral and neighbor-net analyses. Simulations suggested that chance alone was unlikely to produce strong support for two of the possible trees and none for the third. Likelihood permutation tests indicated that the concatenated chloroplast sequence data appeared to have experienced recombination. However, recombination between the chloroplast genomes of different species would be highly atypical, and corollary supporting observations, like chloroplast heteroplasmy, are lacking. Wider taxon sampling clarified the composition of the Austral group, but the conflicting signal meant analyses (e.g., morphological evolution, biogeographic) conditional on a well-supported phylogeny could not be performed.

  5. Phototropin encoded by a single-copy gene mediates chloroplast photorelocation movements in the liverwort Marchantia polymorpha.

    Science.gov (United States)

    Komatsu, Aino; Terai, Mika; Ishizaki, Kimitsune; Suetsugu, Noriyuki; Tsuboi, Hidenori; Nishihama, Ryuichi; Yamato, Katsuyuki T; Wada, Masamitsu; Kohchi, Takayuki

    2014-09-01

    Blue-light-induced chloroplast photorelocation movement is observed in most land plants. Chloroplasts move toward weak-light-irradiated areas to efficiently absorb light (the accumulation response) and escape from strong-light-irradiated areas to avoid photodamage (the avoidance response). The plant-specific kinase phototropin (phot) is the blue-light receptor for chloroplast movements. Although the molecular mechanisms for chloroplast photorelocation movement have been analyzed, the overall aspects of signal transduction common to land plants are still unknown. Here, we show that the liverwort Marchantia polymorpha exhibits the accumulation and avoidance responses exclusively induced by blue light as well as specific chloroplast positioning in the dark. Moreover, in silico and Southern-blot analyses revealed that the M. polymorpha genome encodes a single PHOT gene, MpPHOT, and its knockout line displayed none of the chloroplast photorelocation movements, indicating that the sole MpPHOT gene mediates all types of movement. Mpphot was localized on the plasma membrane and exhibited blue-light-dependent autophosphorylation both in vitro and in vivo. Heterologous expression of MpPHOT rescued the defects in chloroplast movement of phot mutants in the fern Adiantum capillus-veneris and the seed plant Arabidopsis (Arabidopsis thaliana). These results indicate that Mpphot possesses evolutionarily conserved regulatory activities for chloroplast photorelocation movement. M. polymorpha offers a simple and versatile platform for analyzing the fundamental processes of phototropin-mediated chloroplast photorelocation movement common to land plants.

  6. cpSSR: a New Tool to Analyze Chloroplast Genome of Citrus Somatic Hybrids

    Institute of Scientific and Technical Information of China (English)

    程运江; 郭文武; 邓秀新

    2003-01-01

    Chloroplast simple sequence repeat (cpSSR) markers in Citrus were developed and successfully used to analyze chloroplast genome inheritance of Citrus somatic hybrids. Twenty-two previously reported cpSSR primer pairs from pine (Pinus thunbergii Parl.), rice (Oryza sativa L.) and tobacco (Nicotiana tabacum L.) were tested in Citrus, nine of which could amplify intensive PCR products by agarose gel electrophoresis. Chloroplast genome inheritance of Citrus somatic hybrids from nine fusions was then analyzed, and five of the nine pre-screened primer pairs showed polymorphisms by polyacrylamide gel electrophoresis. The results revealed the random inheritance nature of the chloroplast genome in all analyzed Citrus somatic hybrids, which was in agreement with previous reports based on RFLP or CAPS analyses. It was also shown that cpSSR is a more efficient tool in chloroplast genome analyses of somatic hybrids in higher plants, compared with the conventional RFLP or CAPS analyses.
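
    Chloroplast SSR loci of the kind used as markers here are commonly mononucleotide runs, which can be located in a sequence with a regular expression. The sketch below is an illustration only, not the method of this study: the function name `find_mono_ssrs`, the default threshold of 8 bases, and the toy sequence are all assumptions.

```python
import re
from typing import List, Tuple


def find_mono_ssrs(seq: str, min_len: int = 8) -> List[Tuple[int, str, int]]:
    """Find mononucleotide runs of at least min_len bases.

    Returns (0-based start, base, run length) for each run found.
    """
    if min_len < 2:
        raise ValueError("min_len must be at least 2")
    # One base captured, then the same base repeated min_len - 1 or more times.
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_len - 1))
    return [
        (m.start(), m.group(1), m.end() - m.start())
        for m in pattern.finditer(seq.upper())
    ]


if __name__ == "__main__":
    # Hypothetical fragment: a 10-base T run and a 5-base A run (too short to report).
    toy = "GCA" + "T" * 10 + "GC" + "AAAAAC"
    for start, base, length in find_mono_ssrs(toy):
        print(f"({base}){length} at position {start}")
```

    Length differences in such runs between parental genotypes are what make a locus polymorphic on a polyacrylamide gel.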

  7. Genome-wide analyses of small noncoding RNAs in streptococci

    Directory of Open Access Journals (Sweden)

    Nadja Patenge

    2015-05-01

    Streptococci represent a diverse group of Gram-positive bacteria, which colonize a wide range of hosts among animals and humans. Streptococcal species occur as commensal as well as pathogenic organisms. Many of the pathogenic species can cause severe, invasive infections in their hosts, leading to high morbidity and mortality. The consequence is tremendous suffering on the part of humans and livestock, besides the significant financial burden in the agricultural and healthcare sectors. An environmentally stimulated and tightly controlled expression of virulence factor genes is of fundamental importance for streptococcal pathogenicity. Bacterial small noncoding RNAs (sRNAs) modulate the expression of genes involved in stress response, sugar metabolism, surface composition, and other properties that are related to bacterial virulence. Even though the regulatory character is shared by this class of RNAs, variation on the molecular level results in a high diversity of functional mechanisms. The knowledge about the role of sRNAs in streptococci is still limited, but in recent years, genome-wide screens for sRNAs have been conducted in an increasing number of species. Bioinformatics prediction approaches have been employed as well as expression analyses by classical array techniques or next generation sequencing. This review will give an overview of whole genome screens for sRNAs in streptococci with a focus on describing the different methods and comparing their outcome considering sRNA conservation among species, functional similarities, and relevance for streptococcal infection.

  8. Genome-wide methylation analyses in glioblastoma multiforme.

    Directory of Open Access Journals (Sweden)

    Rose K Lai

    Few studies have investigated genome-wide methylation in glioblastoma multiforme (GBM). Our goals were to study differential methylation across the genome in gene promoters using an array-based method, as well as repetitive elements using surrogate global methylation markers. The discovery sample set for this study consisted of 54 GBM from Columbia University and Case Western Reserve University, and 24 brain controls from the New York Brain Bank. We assembled a validation dataset using methylation data of 162 TCGA GBM and 140 brain controls from dbGAP. HumanMethylation27 Analysis Bead-Chips (Illumina) were used to interrogate 26,486 informative CpG sites in both the discovery and validation datasets. Global methylation levels were assessed by analysis of L1 retrotransposon (LINE1), 5 methyl-deoxycytidine (5m-dC) and 5 hydroxylmethyl-deoxycytidine (5hm-dC) in the discovery dataset. We validated a total of 1548 CpG sites (1307 genes) that were differentially methylated in GBM compared to controls. There were more than twice as many hypomethylated genes as hypermethylated ones. Both the discovery and validation datasets found 5 tumor methylation classes. Pathway analyses showed that the top ten pathways in hypomethylated genes were all related to functions of innate and acquired immunities. Among hypermethylated pathways, the transcriptional regulatory network in embryonic stem cells was the most significant. In the study of global methylation markers, 5m-dC level was the best discriminant among methylation classes, whereas in survival analyses, a high level of LINE1 methylation was an independent, favorable prognostic factor in the discovery dataset. Based on a pathway approach, hypermethylation in genes that control stem cell differentiation was a significant, poor prognostic factor of overall survival in both the discovery and validation datasets. Approaches that target these methylated genes may be a future therapeutic goal.

  9. An internal part of the chloroplast atpA gene sequence is present in the mitochondrial genome of Triticum aestivum: molecular organisation and evolutionary aspects.

    Science.gov (United States)

    Jubier, M F; Lucas, H; Delcher, E; Hartmann, C; Quétier, F; Lejeune, B

    1990-06-01

    An internal part of the chloroplast atpA gene has been identified in the mitochondrial DNA of Triticum aestivum. It is located near the 18S-5S ribosomal genes and partially contained within a repeated sequence. Comparison of the transferred sequence with the original ct sequence reveals several nucleotide changes and shows that neither 5' nor 3' ends are present in the mt genome. No transcript of this region could be detected by Northern analysis. This sequence is present in mitochondrial genomes of other tetraploid and diploid species of Triticum, also in the vicinity of the 18S-5S ribosomal genes, suggesting a unique transfer event. The date of this event is discussed.

  10. Genetic Analysis of Chloroplast Translation

    Energy Technology Data Exchange (ETDEWEB)

    Barkan, Alice

    2005-08-15

    The assembly of the photosynthetic apparatus requires the concerted action of hundreds of genes distributed between the two physically separate genomes in the nucleus and chloroplast. Nuclear genes coordinate this process by controlling the expression of chloroplast genes in response to developmental and environmental cues. However, few regulatory factors have been identified. We used mutant phenotypes to identify nuclear genes in maize that modulate chloroplast translation, a key control point in chloroplast gene expression. This project focused on the nuclear gene crp1, required for the translation of two chloroplast mRNAs. CRP1 is related to fungal proteins involved in the translation of mitochondrial mRNAs, and is the founding member of a large gene family in plants, with ~450 members. Members of the CRP1 family are defined by a repeated 35 amino acid motif called a "PPR" motif. The PPR motif is closely related to the TPR motif, which mediates protein-protein interactions. We and others have speculated that PPR tracts adopt a structure similar to that of TPR tracts, but with a substrate binding surface adapted to bind RNA instead of protein. To understand how CRP1 influences the translation of specific chloroplast mRNAs, we sought proteins that interact with CRP1, and identified the RNAs associated with CRP1 in vivo. We showed that CRP1 is associated in vivo with the mRNAs whose translation it activates. To explore the functions of PPR proteins more generally, we sought mutations in other PPR-encoding genes: mutations in the maize PPR2 and PPR4 were shown to disrupt chloroplast ribosome biogenesis and chloroplast trans-splicing, respectively. These and other results suggest that the nuclear-encoded PPR family plays a major role in modulating the expression of the chloroplast genome in higher plants.

  11. The closest relatives of cacti: insights from phylogenetic analyses of chloroplast and mitochondrial sequences with special emphasis on relationships in the tribe Anacampseroteae.

    Science.gov (United States)

    Nyffeler, Reto

    2007-01-01

    Recent molecular and morphological systematic investigations revealed that the cacti are most closely related to Anacampseroteae, Portulaca and Talinum of the family Portulacaceae (ACPT clade of suborder Portulacineae). A combined analysis of ndhF, matK, and nad1 sequence data from the chloroplast and the mitochondrial genomes indicates that the tribe Anacampseroteae is the sister group of the family Cactaceae. This clade, together with Portulaca, is well characterized by the presence of axillary hairs or scales. Relationships within Anacampseroteae are characterized by a grade of five species of Grahamia s.l. from North and South America, and Grahamia australiana is found to be sister to the genera Anacampseros and Avonia. A comparison of vegetative characteristics indicates an evolutionary transition from woody subshrubs to dwarf perennial and highly succulent herbs during the diversification of Anacampseroteae. Available evidence from the present investigation as well as from previously published studies suggests that a revised classification of Portulacineae on the basis of inferred phylogenetic relationships might consist of a superfamily that includes Cactaceae and the three genera Anacampseros s.l. (including Avonia and Grahamia s.l.), Portulaca, and Talinum (including Talinella), either referred to three monogeneric families or to a paraphyletic family Portulacaceae*.

  12. Evolution and targeting of Omp85 homologs in the chloroplast outer envelope membrane

    Directory of Open Access Journals (Sweden)

    Philip Michael Day

    2014-10-01

    Translocon at the outer-envelope-membrane of chloroplasts 75 (Toc75) is the core component of the chloroplast protein import machinery. It belongs to the Omp85 family whose members exist in various Gram-negative bacteria, mitochondria and chloroplasts of eukaryotes. Chloroplasts of Viridiplantae contain another Omp85 homolog called outer envelope protein 80 (OEP80), whose exact function is unknown. In addition, the Arabidopsis thaliana genome encodes truncated forms of Toc75 and OEP80. Multiple studies have shown a common origin of the Omp85 homologs of cyanobacteria and chloroplasts, but their results about evolutionary relationships among cyanobacterial Omp85 (cyanoOmp85), Toc75 and OEP80 are inconsistent. The bipartite targeting sequence-dependent sorting of Toc75 has been demonstrated, but the targeting mechanisms of other chloroplast Omp85 homologs remain largely unexplored. This study aimed to address these unresolved issues in order to further our understanding of chloroplast evolution. Sequence alignments and recently determined structures of bacterial Omp85 homologs were used to predict structures of chloroplast Omp85 homologs. The results enabled us to identify amino acid residues that may indicate functional divergence of Toc75 from cyanoOmp85 and OEP80. Phylogenetic analyses using Omp85 homologs from various cyanobacteria and chloroplasts provided strong support for the grouping of Toc75 and OEP80 sister to cyanoOmp85. However, this support was diminished when the analysis included Omp85 homologs from other bacteria and mitochondria. Finally, results of import assays using isolated chloroplasts support outer membrane localization of OEP80tr and indicate that OEP80 may carry a cleavable targeting sequence.

  13. Chloroplast DNA Diversity of Oak Species in Eastern Romania

    Directory of Open Access Journals (Sweden)

    Ioan Calin MOLDOVAN

    2010-12-01

    The chloroplast DNA of 34 sessile oak (Quercus petraea) and 27 pedunculate oak (Q. robur) populations covering the entire natural distribution of the two oak species in Eastern Romania was investigated using four large regions of the chloroplast genome by the PCR-RFLP technique. A total of seven chloroplast DNA haplotypes sensu lato were observed by analysing 305 mature trees. However, owing to the high resolution of the electrophoresis method, a total of 22 chloroplast variants could be detected, with new mutations and fragment combinations in two of the amplified regions: psbC/trnD and trnT/trnF. All of the haplotypes belong to the phylogenetic lineages A and E, which originate from the Balkan Peninsula. Most of the genetic diversity is distributed among populations (GST=0.779). The chloroplast DNA haplotypes are shared by the two oak species. Different dispersal abilities may explain the higher value of genetic differentiation among populations in sessile oak than in pedunculate oak.
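
    The GST statistic quoted in this abstract expresses the share of total haplotype diversity that lies among populations. The following is a minimal sketch, assuming Nei's definition GST = (HT - HS) / HT with unweighted population means and no small-sample correction; the abstract does not say which estimator the authors used, and the function names and haplotype labels are hypothetical.

```python
from collections import Counter


def gene_diversity(freqs):
    """Nei's gene diversity H = 1 - sum(p_i^2) for one set of frequencies."""
    return 1.0 - sum(p * p for p in freqs)


def gst(populations):
    """Return G_ST = (H_T - H_S) / H_T.

    populations: list of lists of haplotype labels, one list per population.
    Assumption: every population is weighted equally, no bias correction.
    """
    per_pop_freqs = []
    for pop in populations:
        counts = Counter(pop)
        n = len(pop)
        per_pop_freqs.append({h: c / n for h, c in counts.items()})

    # H_S: mean within-population diversity
    h_s = sum(gene_diversity(f.values()) for f in per_pop_freqs) / len(per_pop_freqs)

    # H_T: diversity of the mean haplotype frequencies across populations
    haplotypes = set().union(*per_pop_freqs)
    mean_freqs = [
        sum(f.get(h, 0.0) for f in per_pop_freqs) / len(per_pop_freqs)
        for h in haplotypes
    ]
    h_t = gene_diversity(mean_freqs)

    # With no diversity at all, differentiation is undefined; report 0.0
    return (h_t - h_s) / h_t if h_t > 0 else 0.0


# Toy data (made up): two populations fixed for different haplotypes
fixed = gst([["A", "A", "A"], ["B", "B", "B"]])
# Toy data (made up): two populations with identical haplotype mixes
mixed = gst([["A", "B"], ["A", "B"]])
```

    In the toy data, populations fixed for different haplotypes give the maximum value, while identical populations give zero; a value such as 0.779 sits close to the first case.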

  14. Arabidopsis thaliana leaves with altered chloroplast numbers and chloroplast movement exhibit impaired adjustments to both low and high light

    OpenAIRE

    Königer, Martina; Delamaide, Joy A.; Marlow, Elizabeth D.; Harris, Gary C.

    2008-01-01

    The effects of chloroplast number and size on the capacity for blue light-dependent chloroplast movement, the ability to increase light absorption under low light, and the susceptibility to photoinhibition were investigated in Arabidopsis thaliana. Leaves of wild-type and chloroplast number mutants with mean chloroplast numbers ranging from 120 to two per mesophyll cell were analysed. Chloroplast movement was monitored as changes in light transmission through the leaves. Light transmission wa...

  15. Genome-wide transcription analyses in rice using tiling microarrays

    DEFF Research Database (Denmark)

    Li, Lei; Wang, Xiangfeng; Stolc, Viktor;

    2006-01-01

    Sequencing and computational annotation revealed several features, including high gene numbers, unusual composition of the predicted genes and a large number of genes lacking homology to known genes, that distinguish the rice (Oryza sativa) genome from that of other fully sequenced model species. ... We report here a full-genome transcription analysis of the indica rice subspecies using high-density oligonucleotide tiling microarrays. Our results provided expression data support for the existence of 35,970 (81.9%) annotated gene models and identified 5,464 unique transcribed intergenic regions ... activity between duplicated segments of the genome. Collectively, our results provide the first whole-genome transcription map useful for further understanding the rice genome. Publication date: 2006-Jan ...

  16. Dynamics of Chloroplast Translation during Chloroplast Differentiation in Maize.

    Directory of Open Access Journals (Sweden)

    Prakitchai Chotewutmontri

    2016-07-01

    Chloroplast genomes in land plants contain approximately 100 genes, the majority of which reside in polycistronic transcription units derived from cyanobacterial operons. The expression of chloroplast genes is integrated into developmental programs underlying the differentiation of photosynthetic cells from non-photosynthetic progenitors. In C4 plants, the partitioning of photosynthesis between two cell types, bundle sheath and mesophyll, adds an additional layer of complexity. We used ribosome profiling and RNA-seq to generate a comprehensive description of chloroplast gene expression at four stages of chloroplast differentiation, as displayed along the maize seedling leaf blade. The rate of protein output of most genes increases early in development and declines once the photosynthetic apparatus is mature. The developmental dynamics of protein output fall into several patterns. Programmed changes in mRNA abundance make a strong contribution to the developmental shifts in protein output, but output is further adjusted by changes in translational efficiency. RNAs with prioritized translation early in development are largely involved in chloroplast gene expression, whereas those with prioritized translation in photosynthetic tissues are generally involved in photosynthesis. Differential gene expression in bundle sheath and mesophyll chloroplasts results primarily from differences in mRNA abundance, but differences in translational efficiency amplify mRNA-level effects in some instances. In most cases, rates of protein output approximate steady-state protein stoichiometries, implying a limited role for proteolysis in eliminating unassembled or damaged proteins under non-stress conditions. Tuned protein output results from gene-specific trade-offs between translational efficiency and mRNA abundance, both of which span a large dynamic range. Analysis of ribosome footprints at sites of RNA editing showed that the chloroplast translation machinery

  17. Dynamics of Chloroplast Translation during Chloroplast Differentiation in Maize.

    Science.gov (United States)

    Chotewutmontri, Prakitchai; Barkan, Alice

    2016-07-01

    Chloroplast genomes in land plants contain approximately 100 genes, the majority of which reside in polycistronic transcription units derived from cyanobacterial operons. The expression of chloroplast genes is integrated into developmental programs underlying the differentiation of photosynthetic cells from non-photosynthetic progenitors. In C4 plants, the partitioning of photosynthesis between two cell types, bundle sheath and mesophyll, adds an additional layer of complexity. We used ribosome profiling and RNA-seq to generate a comprehensive description of chloroplast gene expression at four stages of chloroplast differentiation, as displayed along the maize seedling leaf blade. The rate of protein output of most genes increases early in development and declines once the photosynthetic apparatus is mature. The developmental dynamics of protein output fall into several patterns. Programmed changes in mRNA abundance make a strong contribution to the developmental shifts in protein output, but output is further adjusted by changes in translational efficiency. RNAs with prioritized translation early in development are largely involved in chloroplast gene expression, whereas those with prioritized translation in photosynthetic tissues are generally involved in photosynthesis. Differential gene expression in bundle sheath and mesophyll chloroplasts results primarily from differences in mRNA abundance, but differences in translational efficiency amplify mRNA-level effects in some instances. In most cases, rates of protein output approximate steady-state protein stoichiometries, implying a limited role for proteolysis in eliminating unassembled or damaged proteins under non-stress conditions. Tuned protein output results from gene-specific trade-offs between translational efficiency and mRNA abundance, both of which span a large dynamic range. Analysis of ribosome footprints at sites of RNA editing showed that the chloroplast translation machinery does not generally
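
    The abstract treats protein output as the product of mRNA abundance and translational efficiency. A minimal sketch of how translational efficiency is commonly estimated from paired ribosome profiling and RNA-seq counts follows; the normalization (reads per million, optional pseudocount), the function name, and the counts are assumptions for illustration, not the authors' pipeline.

```python
def translational_efficiency(ribo_counts, rna_counts, pseudocount=0.0):
    """Per-gene ratio of normalized ribosome footprints to normalized mRNA reads.

    ribo_counts, rna_counts: dicts mapping gene name -> raw read count.
    Gene length cancels in the ratio because both libraries cover the same gene.
    """
    ribo_total = sum(ribo_counts.values())
    rna_total = sum(rna_counts.values())
    efficiency = {}
    for gene, ribo in ribo_counts.items():
        if gene not in rna_counts:
            continue  # skip genes without a matching RNA-seq measurement
        ribo_rpm = (ribo + pseudocount) / ribo_total * 1e6
        rna_rpm = (rna_counts[gene] + pseudocount) / rna_total * 1e6
        if rna_rpm == 0:
            continue  # efficiency is undefined without mRNA signal
        efficiency[gene] = ribo_rpm / rna_rpm
    return efficiency


# Hypothetical counts for two real chloroplast genes (values are made up)
ribo = {"psbA": 200, "rbcL": 100}
rna = {"psbA": 100, "rbcL": 100}
te = translational_efficiency(ribo, rna)
```

    With equal mRNA counts, the gene with twice the footprint count comes out with twice the efficiency, which is the kind of mRNA-independent adjustment the abstract describes.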

  18. Chloroplast outer envelope protein CHUP1 is essential for chloroplast anchorage to the plasma membrane and chloroplast movement.

    Science.gov (United States)

    Oikawa, Kazusato; Yamasato, Akihiro; Kong, Sam-Geun; Kasahara, Masahiro; Nakai, Masato; Takahashi, Fumio; Ogura, Yasunobu; Kagawa, Takatoshi; Wada, Masamitsu

    2008-10-01

    Chloroplasts change their intracellular distribution in response to light intensity. Previously, we isolated the chloroplast unusual positioning1 (chup1) mutant of Arabidopsis (Arabidopsis thaliana). This mutant is defective in normal chloroplast relocation movement and shows aggregation of chloroplasts at the bottom of palisade mesophyll cells. The isolated gene encodes a protein with an actin-binding motif. Here, we used biochemical analyses to determine the subcellular localization of full-length CHUP1 on the chloroplast outer envelope. A CHUP1-green fluorescent protein (GFP) fusion, which was detected at the outermost part of mesophyll cell chloroplasts, complemented the chup1 phenotype, but GFP-CHUP1, which was localized mainly in the cytosol, did not. Overexpression of the N-terminal hydrophobic region (NtHR) of CHUP1 fused with GFP (NtHR-GFP) induced a chup1-like phenotype, indicating a dominant-negative effect on chloroplast relocation movement. A similar pattern was found in chloroplast OUTER ENVELOPE PROTEIN7 (OEP7)-GFP transformants, and a protein containing OEP7 in place of NtHR complemented the mutant phenotype. Physiological analyses of transgenic Arabidopsis plants expressing truncated CHUP1 in a chup1 mutant background and cytoskeletal inhibitor experiments showed that the coiled-coil region of CHUP1 anchors chloroplasts firmly on the plasma membrane, consistent with the localization of coiled-coil GFP on the plasma membrane. Thus, CHUP1 localization on chloroplasts, with the N terminus inserted into the chloroplast outer envelope and the C terminus facing the cytosol, is essential for CHUP1 function, and the coiled-coil region of CHUP1 prevents chloroplast aggregation and participates in chloroplast relocation movement.

  19. Genomic analyses provide insights into the history of tomato breeding.

    Science.gov (United States)

    Lin, Tao; Zhu, Guangtao; Zhang, Junhong; Xu, Xiangyang; Yu, Qinghui; Zheng, Zheng; Zhang, Zhonghua; Lun, Yaoyao; Li, Shuai; Wang, Xiaoxuan; Huang, Zejun; Li, Junming; Zhang, Chunzhi; Wang, Taotao; Zhang, Yuyang; Wang, Aoxue; Zhang, Yancong; Lin, Kui; Li, Chuanyou; Xiong, Guosheng; Xue, Yongbiao; Mazzucato, Andrea; Causse, Mathilde; Fei, Zhangjun; Giovannoni, James J; Chetelat, Roger T; Zamir, Dani; Städler, Thomas; Li, Jingfu; Ye, Zhibiao; Du, Yongchen; Huang, Sanwen

    2014-11-01

    The histories of crop domestication and breeding are recorded in genomes. Although tomato is a model species for plant biology and breeding, the nature of human selection that altered its genome remains largely unknown. Here we report a comprehensive analysis of tomato evolution based on the genome sequences of 360 accessions. We provide evidence that domestication and improvement focused on two independent sets of quantitative trait loci (QTLs), resulting in modern tomato fruit ∼100 times larger than its ancestor. Furthermore, we discovered a major genomic signature for modern processing tomatoes, identified the causative variants that confer pink fruit color and precisely visualized the linkage drag associated with wild introgressions. This study outlines the accomplishments as well as the costs of historical selection and provides molecular insights toward further improvement.

  20. Local repeat sequence organization of an intergenic spacer in the chloroplast genome of Chlamydomonas reinhardtii leads to DNA expansion and sequence scrambling: a complex mode of “copy-choice replication”?

    Indian Academy of Sciences (India)

    Mahendra D Wagle; Subhojit Sen; Basuthkar J Rao

    2001-12-01

    Parent-specific, randomly amplified polymorphic DNA (RAPD) markers were obtained from total genomic DNA of Chlamydomonas reinhardtii. Such parent-specific RAPD bands (genomic fingerprints) segregated uniparentally (through mt+) in a cross between a pair of polymorphic interfertile strains of Chlamydomonas (C. reinhardtii and C. minnesotti), suggesting that they originated from the chloroplast genome. Southern analysis mapped the RAPD markers to the chloroplast genome. One of the RAPD markers, "P2" (1.6 kb), was cloned, sequenced and fine mapped to the 3 kb region encompassing the 3′ end of 23S, the full 5S and the intergenic region between 5S and psbA. This region seems divergent enough between the two parents that a specific PCR designed for a parent-specific chloroplast sequence within this region amplified a marker in that parent only and not in the other, indicating the utility of a RAPD scan for locating genomic regions of sequence divergence. Remarkably, the RAPD product "P2" seems to have originated from PCR amplification of a much smaller (about 600 bp), but highly repeat-rich (direct and inverted) domain of the 3 kb region in a manner that yielded no linear sequence alignment with its own template sequence. The amplification yielded the same uniquely "sequence-scrambled" product, whether the template used for PCR was total cellular DNA, chloroplast DNA or a plasmid clone DNA corresponding to that region. The PCR product, a "unique" new sequence, had lost the repetitive organization of the template genome from which it had originated and perhaps represented a "complex path" of copy-choice replication.

  1. GEMBASSY: an EMBOSS associated software package for comprehensive genome analyses.

    Science.gov (United States)

    Itaya, Hidetoshi; Oshita, Kazuki; Arakawa, Kazuharu; Tomita, Masaru

    2013-08-29

    The popular European Molecular Biology Open Software Suite (EMBOSS) currently contains over 400 tools used in various areas of bioinformatics research, equipped with sophisticated development frameworks for interoperability and tool discoverability as well as rich documentation and various user interfaces. In order to further strengthen EMBOSS in the field of genomics, we here present a novel EMBOSS associated software (EMBASSY) package named GEMBASSY, which adds more than 50 analysis tools from the G-language Genome Analysis Environment and its Representational State Transfer (REST) and SOAP web services. GEMBASSY basically contains wrapper programs of G-language REST/SOAP web services to provide intuitive and easy access to various annotations within complete genome flatfiles, as well as tools for analyzing nucleic composition, calculating codon usage, and visualizing genomic information. For example, analysis methods such as those for calculating distance between sequences by genomic signatures and for predicting gene expression levels from codon usage bias are effective in the interpretation of meta-genomic and meta-transcriptomic data. GEMBASSY tools can be used seamlessly with other EMBOSS tools and UNIX command line tools. The source code written in C is available from GitHub (https://github.com/celery-kotone/GEMBASSY/) and the distribution package is freely available from the GEMBASSY web site (http://www.g-language.org/gembassy/).
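
    Two of the analyses named in this abstract, nucleotide composition and codon usage, reduce to simple counting. The sketch below is a standalone illustration in plain Python; it does not call GEMBASSY, EMBOSS, or the G-language services, and the function names and the example sequence are hypothetical.

```python
from collections import Counter


def gc_content(seq):
    """Fraction of G and C among unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    total = sum(seq.count(base) for base in "ACGT")
    if total == 0:
        return 0.0
    return (seq.count("G") + seq.count("C")) / total


def codon_counts(cds):
    """Count codons in a coding sequence read in frame from position 0.

    Trailing bases that do not complete a codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return Counter(cds[i:i + 3] for i in range(0, usable, 3))


# Made-up coding sequence: Met, Ala, Ala, stop
example = "ATGGCTGCTTAA"
composition = gc_content(example)
usage = codon_counts(example)
```

    Relative synonymous codon usage and similar bias measures are built on top of counts like these by grouping codons that encode the same amino acid.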

  2. Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree.

    Science.gov (United States)

    Kuravadi, Nagesh A; Yenagi, Vijay; Rangiah, Kannan; Mahesh, H B; Rajamani, Anantharamanan; Shirke, Meghana D; Russiachand, Heikham; Loganathan, Ramya Malarini; Shankara Lingu, Chandana; Siddappa, Shilpa; Ramamurthy, Aishwarya; Sathyanarayana, B N; Gowda, Malali

    2015-01-01

    Neem (Azadirachta indica A. Juss) is one of the most versatile tropical evergreen tree species, known in India since the Vedic period (1500 BC-600 BC). The neem tree is a rich source of limonoids, which have a wide spectrum of activity against insect pests and microbial pathogens. Complex tetranortriterpenoids such as azadirachtin, salanin and nimbin are the major active principles isolated from neem seed. Absolutely nothing is known about the biochemical pathways of these metabolites in the neem tree. To identify genes and pathways in neem, we sequenced neem genomes and transcriptomes using next generation sequencing technologies. Assembly of Illumina and 454 sequencing reads resulted in 267 Mb, which accounts for 70% of the estimated size of the neem genome. We predicted 44,495 genes in the neem genome, of which 32,278 genes were expressed in neem tissues. The neem genome consists of about 32.5% (87 Mb) repetitive DNA elements. The neem tree is phylogenetically related to citrus (Citrus sinensis). Comparative analysis anchored 62% (161 Mb) of assembled neem genomic contigs onto citrus chromosomes. An ultrahigh performance liquid chromatography-mass spectrometry-selected reaction monitoring (UHPLC-MS/SRM) method was used to quantify azadirachtin, nimbin, and salanin from neem tissues. Weighted Correlation Network Analysis (WGCNA) of expressed genes and metabolites resulted in the identification of possible candidate genes involved in the azadirachtin biosynthesis pathway. This study provides genomic and transcriptomic resources, together with quantification of the three major neem metabolites, which will accelerate basic research in neem to understand biochemical pathways.

  3. Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree

    Directory of Open Access Journals (Sweden)

    Nagesh A. Kuravadi

    2015-08-01

    Neem (Azadirachta indica A. Juss) is one of the most versatile tropical evergreen tree species, known in India since the Vedic period (1500 BC–600 BC). The neem tree is a rich source of limonoids, which have a wide spectrum of activity against insect pests and microbial pathogens. Complex tetranortriterpenoids such as azadirachtin, salanin and nimbin are the major active principles isolated from neem seed. Absolutely nothing is known about the biochemical pathways of these metabolites in the neem tree. To identify genes and pathways in neem, we sequenced neem genomes and transcriptomes using next generation sequencing technologies. Assembly of Illumina and 454 sequencing reads resulted in 267 Mb, which accounts for 70% of the estimated size of the neem genome. We predicted 44,495 genes in the neem genome, of which 32,278 genes were expressed in neem tissues. The neem genome consists of about 32.5% (87 Mb) repetitive DNA elements. The neem tree is phylogenetically related to citrus (Citrus sinensis). Comparative analysis anchored 62% (161 Mb) of assembled neem genomic contigs onto citrus chromosomes. An ultrahigh performance liquid chromatography-mass spectrometry-selected reaction monitoring (UHPLC-MS/SRM) method was used to quantify azadirachtin, nimbin, and salanin from neem tissues. Weighted Correlation Network Analysis (WGCNA) of expressed genes and metabolites resulted in the identification of possible candidate genes involved in the azadirachtin biosynthesis pathway. This study provides genomic and transcriptomic resources, together with quantification of the three major neem metabolites, which will accelerate basic research in neem to understand biochemical pathways.

  4. Adaptive Evolutionary Analysis of Chloroplast Genes in Euphyllophytes Based on Complete Chloroplast Genome Sequences

    Institute of Scientific and Technical Information of China (English)

    王博; 高磊; 苏应娟; 王艇

    2012-01-01

    Euphyllophytes comprise ferns, gymnosperms, and angiosperms. Relatively abundant chloroplast genome sequence data are available for them. In this research, chloroplast gene sequences of 29 euphyllophyte species were extracted from their completely sequenced chloroplast genomes; an adaptive evolutionary analysis was then performed on the chloroplast genes by running PAML under models allowing ω (the nonsynonymous/synonymous rate ratio) to vary among sites. The results showed that: (1) the percentages of chloroplast genes under positive selection in ferns, gymnosperms, and angiosperms were 6.5%, 7.5% and 19.2%, respectively, and the number of positively selected genes in angiosperms was markedly larger than in ferns and gymnosperms; (2) most positively selected genes were genetic system or photosynthesis-related genes, whose encoded proteins function in chloroplast protein synthesis, gene transcription, energy transformation and regulation, and photosynthesis. We infer that the chloroplast functional genes may have played key roles during the adaptation of euphyllophytes to terrestrial ecosystems.
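
    Site-model analyses of the nonsynonymous/synonymous rate ratio are usually read through a likelihood ratio test between a null model that forbids a ratio above 1 and an alternative that allows it. The sketch below assumes a model pair that differs by two free parameters (as in the commonly used M1a/M2a and M7/M8 comparisons); the abstract does not state which pairs were run, and the log-likelihood values are made up. For two degrees of freedom the chi-square tail probability has the closed form exp(-x/2), so no statistics library is needed.

```python
import math


def lrt_pvalue_df2(lnl_null, lnl_alt):
    """Likelihood ratio test for nested models differing by two parameters.

    Returns (test statistic, p-value), where the statistic is
    2 * (lnL_alt - lnL_null) and the p-value is the chi-square
    survival function with 2 degrees of freedom, exp(-stat / 2).
    """
    stat = 2.0 * (lnl_alt - lnl_null)
    stat = max(stat, 0.0)  # guard against small negative values from optimizer noise
    return stat, math.exp(-stat / 2.0)


# Hypothetical log-likelihoods for one gene under the null and alternative models
stat, p_value = lrt_pvalue_df2(lnl_null=-1000.0, lnl_alt=-995.0)
positively_selected = p_value < 0.05
```

    A gene would be counted as positively selected only if the test is significant and the alternative model places some sites in a class with a ratio above 1; the percentages in the abstract are counts of genes passing such a criterion.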

  5. Polymerase chain reaction-single strand conformation polymorphism analyses of nuclear and chloroplast DNA provide evidence for recombination, multiple introductions and nascent speciation in the Caulerpa taxifolia complex.

    Science.gov (United States)

    Meusnier, I; Valero, M; Destombe, C; Godé, C; Desmarais, E; Bonhomme, F; Stam, W T; Olsen, J L

    2002-11-01

    Independent lines of evidence support an Australian origin for the Mediterranean populations of the tropical alga Caulerpa taxifolia. To complement previous biogeographical studies based on nuclear rDNA internal transcribed spacer (ITS), a new chloroplast marker was developed: the cp 16S rDNA intron-2. Sequence variability for both nuclear and chloroplast markers was assessed in 110 individuals using single strand conformation polymorphism. Comparison of intrapopulation genetic diversity between invasive Mediterranean and 'native' Australian populations revealed the occurrence of two divergent and widespread clades. The first clade grouped nontropical invasive populations with inshore-mainland populations from Australia, while the second clustered all offshore-island populations studied so far. Despite our finding of nine distinct nuclear and five distinct chloroplast profiles, a single nucleocytoplasmic combination was characteristic of the invasive populations and sexual reproduction was found to be very rare. C. taxifolia is clearly a complex of genetically and ecologically differentiated sibling species or subspecies.

  6. Isolation of Chloroplasts from Plant Protoplasts.

    Science.gov (United States)

    Lung, Shiu-Cheung; Smith, Matthew D; Chuong, Simon D X

    2015-10-01

    Chloroplasts can be isolated from higher plants directly following homogenization; however, the resulting yield, purity, and intactness are often low, necessitating a large amount of starting material. This protocol is optimized to produce a high yield of pure chloroplasts from isolated Arabidopsis protoplasts. The two-part method is a simple, scaled-down, and low-cost procedure that readily provides healthy mesophyll protoplasts, which are then ruptured to release intact chloroplasts. Chloroplasts isolated using this method are competent for use in biochemical, cellular, and molecular analyses.

  7. Mutation analyses of integrated HBV genome in hepatitis B patients

    Institute of Scientific and Technical Information of China (English)

    Peilin Wang; Xiuhai Wang; Shuying Cong; Hongming Ma; Xuecheng Zhang

    2008-01-01

    Little has been learnt in the last 30 years about detection of the HBV genome and analysis of its mutations in hepatitis B fathers (HBF) and their children. In this study, we used nested polymerase chain reaction (PCR), fluorescence in situ hybridization (FISH), and DNA sequencing analysis to examine the integrated HBV genome in paraffin-embedded testis tissues, which were taken as samples from HBF, and in peripheral blood mononuclear cells (PBMC) from 74 cases of HBFs and their children who were born after their fathers' HBV infection (caHBF). We found that HBV DNA existed in testis tissues, mainly in the basilar parts of the seminiferous tubules, and also in PBMC of HBF. It was also documented that there were point mutations at multiple loci, as well as insertions and deletions of nucleotides, in integrated HBV genomes, and the types of gene mutations in the HBFs were similar to those in caHBF. This study addresses the major types of gene mutations in the integrated HBV genome in human patients and also presents reliable evidence of possible genetic transmission of hepatitis B.

  8. Comparative Genomic Analyses of the Human NPHP1 Locus Reveal Complex Genomic Architecture and Its Regional Evolution in Primates.

    Directory of Open Access Journals (Sweden)

    Bo Yuan

    2015-12-01

    Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100) is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs) are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases; about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR) between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV) haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual's susceptibility to acquiring disease-associated alleles.

  9. GEMBASSY: an EMBOSS associated software package for comprehensive genome analyses

    OpenAIRE

    Itaya, Hidetoshi; Oshita, Kazuki; Arakawa, Kazuharu; Tomita, Masaru

    2013-01-01

    The popular European Molecular Biology Open Software Suite (EMBOSS) currently contains over 400 tools used in various areas of bioinformatics research, equipped with sophisticated development frameworks for interoperability and tool discoverability as well as rich documentation and various user interfaces. In order to further strengthen EMBOSS in the field of genomics, we here present a novel EMBOSS associated software (EMBASSY) package named GEMBASSY, which adds more than 50 analysis tools from t...

  10. Impact of chromatin structures on DNA processing for genomic analyses.

    Directory of Open Access Journals (Sweden)

    Leonid Teytelman

    Chromatin has an impact on recombination, repair, replication, and evolution of DNA. Here we report that chromatin structure also affects laboratory DNA manipulation in ways that distort the results of chromatin immunoprecipitation (ChIP) experiments. We initially discovered this effect at the Saccharomyces cerevisiae HMR locus, where we found that silenced chromatin was refractory to shearing, relative to euchromatin. Using input samples from ChIP-Seq studies, we detected a similar bias throughout the heterochromatic portions of the yeast genome. We also observed significant chromatin-related effects at telomeres, protein binding sites, and genes, reflected in the variation of input-Seq coverage. Experimental tests of candidate regions showed that chromatin influenced shearing at some loci, and that chromatin could also lead to enriched or depleted DNA levels in prepared samples, independently of shearing effects. Our results suggested that assays relying on immunoprecipitation of chromatin will be biased by intrinsic differences between regions packaged into different chromatin structures, biases that have been largely ignored to date. These results established the pervasiveness of this bias genome-wide, and suggested that this bias can be used to detect differences in chromatin structures across the genome.

  11. Primers for the Amplification of the Circular Chloroplast DNA from the A-genome Group of Cultivated Cotton

    Institute of Scientific and Technical Information of China (English)

    IBRAHIM Rashid Ismael Hag; AZUMA Jun-Ichi; SAKAMOTO Masahiro

    2008-01-01

    The availability of plastid genome sequences is one of the bases for comparative, functional, and structural genomic studies of plastid-containing organisms, in addition to the application of plastid genetic engineering technology. Past efforts to sequence plastid genomes involved complicated preparation protocols. One procedure starts with the isolation of plastids, which is tiresome and time-consuming, followed by a second step to extract plastid DNA from the isolated plastids, and finally the construction of a plasmid or bacterial artificial chromosome (BAC) library.

  12. Polymerase chain reaction-single strand conformation polymorphism analyses of nuclear and chloroplast DNA provide evidence for recombination, multiple introductions and nascent speciation in the Caulerpa taxifolia complex

    NARCIS (Netherlands)

    Meusnier, I; Valero, M; Destombe, C; Gode, E.; Desmarais, E.; Bonhomme, F.; Stam, W.T.; Olsen, J.L.

    2002-01-01

    Independent lines of evidence support an Australian origin for the Mediterranean populations of the tropical alga Caulerpa taxifolia. To complement previous biogeographical studies based on nuclear rDNA internal transcribed spacer (ITS), a new chloroplast marker was developed - the cp 16S rDNA intro

  13. Genomic Analyses of Bacterial Porin-Cytochrome Gene Clusters

    Directory of Open Access Journals (Sweden)

    Liang eShi

    2014-11-01

    The porin-cytochrome (Pcc) protein complex is responsible for trans-outer membrane electron transfer during extracellular reduction of Fe(III) by the dissimilatory metal-reducing bacterium Geobacter sulfurreducens PCA. The identified and characterized Pcc complex of G. sulfurreducens PCA consists of a porin-like outer-membrane protein, a periplasmic 8-heme c-type cytochrome (c-Cyt) and an outer-membrane 12-heme c-Cyt, and the genes encoding the Pcc proteins are clustered in the same regions of the genome (i.e., the pcc gene clusters) of G. sulfurreducens PCA. A survey of additional microbial genomes has identified the pcc gene clusters in all sequenced Geobacter spp. and other bacteria from six different phyla, including Anaeromyxobacter dehalogenans 2CP-1, A. dehalogenans 2CP-C, Anaeromyxobacter sp. K, Candidatus Kuenenia stuttgartiensis, Denitrovibrio acetiphilus DSM 12809, Desulfurispirillum indicum S5, Desulfurivibrio alkaliphilus AHT2, Desulfurobacterium thermolithotrophum DSM 11699, Desulfuromonas acetoxidans DSM 684, Ignavibacterium album JCM 16511, and Thermovibrio ammonificans HB-1. The numbers of genes in the pcc gene clusters vary, ranging from two to nine. Similar to the metal-reducing (Mtr) gene clusters of other Fe(III)-reducing bacteria, such as Shewanella spp., additional genes that encode putative c-Cyts with predicted cellular localizations at the cytoplasmic membrane, periplasm and outer membrane often associate with the pcc gene clusters. This suggests that the Pcc-associated c-Cyts may be part of the pathways for extracellular electron transfer reactions. The presence of pcc gene clusters in microorganisms that do not reduce solid-phase Fe(III) and Mn(IV) oxides, such as D. alkaliphilus AHT2 and I. album JCM 16511, also suggests that some of the pcc gene clusters may be involved in extracellular electron transfer reactions with substrates other than Fe(III) and Mn(IV) oxides.

  14. Genomic and comparative genomic analyses of Rickettsia heilongjiangensis provide insight into its evolution and pathogenesis.

    Science.gov (United States)

    Duan, Changsong; Xiong, Xiaolu; Qi, Yong; Gong, Wenping; Jiao, Jun; Wen, Bohai

    2014-08-01

    Rickettsia heilongjiangensis, the causative agent of far eastern spotted fever, is an obligate intracellular gram-negative bacterium that belongs to the spotted fever group rickettsiae. To understand the evolution and pathogenesis of R. heilongjiangensis, we analyzed its genome and compared it with other rickettsial genomes available in GenBank. The R. heilongjiangensis chromosome contains 1333 genes, including 1297 protein coding genes and 36 RNA coding genes. The genome also contains 121 pseudogenes, 54 insertion sequences, and 39 tandem repeats. Sixteen genes encoding the major components of the type IV secretion systems were identified in the R. heilongjiangensis genome. In total, 37 β-barrel outer membrane proteins were predicted in the genome, eight of which have been previously confirmed to be outer membrane proteins. In addition, 266 potential virulence factor genes, seven partially deleted antibiotic resistance genes, and a genomic island were identified in the genome. The codon usage in the genome is compatible with its low GC content, and the amino acid usage shows apparent bias. A comparative genomic analysis showed that R. heilongjiangensis and R. japonica share one unique fragment that may be a target sequence for a diagnostic assay. The orthologs of 37 genes of R. heilongjiangensis were found in pathogenic R. rickettsii str. Sheila Smith but not in non-pathogenic R. rickettsii str. Iowa, which may explain why R. heilongjiangensis is pathogenic. Pan-genome analysis showed that R. heilongjiangensis and 42 other rickettsiae strains share 693 core genes with a pan-genome size of 4837 genes. The pan-genome-based phylogeny showed that R. heilongjiangensis was closely related to R. japonica.
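The core-genome and pan-genome counts quoted in this abstract follow from simple set operations once each strain's genes have been grouped into ortholog families. The sketch below is only an illustration of that idea: the function name `core_and_pan` and the toy family identifiers are hypothetical, and the record does not say which clustering tool the authors used.

```python
from typing import Dict, Set, Tuple


def core_and_pan(genomes: Dict[str, Set[str]]) -> Tuple[Set[str], Set[str]]:
    """Return (core, pan) gene-family sets for a collection of genomes.

    genomes maps a strain name to the set of ortholog-family IDs it carries.
    Core = families present in every strain; pan = families present in any.
    """
    family_sets = list(genomes.values())
    if not family_sets:
        return set(), set()
    core = set.intersection(*family_sets)
    pan = set.union(*family_sets)
    return core, pan


# Toy input with made-up family IDs (not data from the study).
toy = {
    "strain_A": {"fam1", "fam2", "fam3"},
    "strain_B": {"fam1", "fam2", "fam4"},
    "strain_C": {"fam1", "fam2", "fam5", "fam6"},
}
core, pan = core_and_pan(toy)
print(len(core), len(pan))
```

With real data the hard part is the ortholog clustering that produces the family IDs; the set arithmetic itself stays this small.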

  15. Complete Chloroplast and Mitochondrial Genome Sequences of the Hydrocarbon Oil-Producing Green Microalga Botryococcus braunii Race B (Showa).

    Science.gov (United States)

    Blifernez-Klassen, Olga; Wibberg, Daniel; Winkler, Anika; Blom, Jochen; Goesmann, Alexander; Kalinowski, Jörn; Kruse, Olaf

    2016-06-09

    The green alga Botryococcus braunii is capable of the production and excretion of high quantities of long-chain hydrocarbons and exopolysaccharides. In this study, we present the complete plastid and mitochondrial genomes of the hydrocarbon-producing microalga Botryococcus braunii race B (Showa), with a total length of 156,498 and 129,356 bp, respectively.

  16. Diversity and genome dynamics of marine cyanophages using metagenomic analyses.

    Science.gov (United States)

    Ma, Yingfei; Allen, Lisa Zeigler; Palenik, Brian

    2014-12-01

    Cyanophages are abundant in the oceanic environment and directly impact cyanobacterial distributions, physiological processes and evolution. Two samples collected from coastal Maine in July and September 2009 were enriched for Synechococcus cells using flow cytometry and examined through metagenomic sequencing. Homology-based sequence prediction indicated cyanophages, largely myoviruses, accounted for almost half the reads and provided insights into environmental infection events. T4-phage core-gene phylogenetic reconstruction revealed unique diversity among uncultured cyanophages and reference isolates resulting in identification of a new phylogenetic cluster. Genomic comparison of reference cyanophage strains S-SM2 and Syn1 with putative homologous contigs recovered from metagenomes provided evidence that gene insertion, deletion and recombination have occurred among, and are likely important for diversification of, natural populations. Identification of putative genetic exchange between cyanophage and non-cyanophage viruses, i.e. Micromonas virus and Pelagibacter phage, supports hypotheses related to a significant role for viruses in mediating transfer of genetic material between taxonomically diverse organisms with overlapping ecological niches.

  17. Functional and genomic analyses of alpha-solenoid proteins.

    Directory of Open Access Journals (Sweden)

    David Fournier

    Full Text Available Alpha-solenoids are flexible protein structural domains formed by ensembles of alpha-helical repeats (Armadillo and HEAT repeats among others). While homology can be used to detect many of these repeats, some alpha-solenoids have very little sequence homology to proteins of known structure and we expect that many remain undetected. We previously developed a method for detection of alpha-helical repeats based on a neural network trained on a dataset of protein structures. Here we improved the detection algorithm and updated the training dataset using recently solved structures of alpha-solenoids. Unexpectedly, we identified occurrences of alpha-solenoids in solved protein structures that escaped attention, for example within the core of the catalytic subunit of PI3KC. Our results expand the current set of known alpha-solenoids. Application of our tool to the protein universe allowed us to detect their significant enrichment in proteins interacting with many proteins, confirming that alpha-solenoids are generally involved in protein-protein interactions. We then studied the taxonomic distribution of alpha-solenoids to discuss an evolutionary scenario for the emergence of this type of domain, speculating that alpha-solenoids have emerged in multiple taxa in independent events by convergent evolution. We observe a higher rate of alpha-solenoids in eukaryotic genomes and in some prokaryotic families, such as Cyanobacteria and Planctomycetes, which could be associated with increased cellular complexity. The method is available at http://cbdm.mdc-berlin.de/~ard2/.

  18. Genomic analyses identify molecular subtypes of pancreatic cancer.

    Science.gov (United States)

    Bailey, Peter; Chang, David K; Nones, Katia; Johns, Amber L; Patch, Ann-Marie; Gingras, Marie-Claude; Miller, David K; Christ, Angelika N; Bruxner, Tim J C; Quinn, Michael C; Nourse, Craig; Murtaugh, L Charles; Harliwong, Ivon; Idrisoglu, Senel; Manning, Suzanne; Nourbakhsh, Ehsan; Wani, Shivangi; Fink, Lynn; Holmes, Oliver; Chin, Venessa; Anderson, Matthew J; Kazakoff, Stephen; Leonard, Conrad; Newell, Felicity; Waddell, Nick; Wood, Scott; Xu, Qinying; Wilson, Peter J; Cloonan, Nicole; Kassahn, Karin S; Taylor, Darrin; Quek, Kelly; Robertson, Alan; Pantano, Lorena; Mincarelli, Laura; Sanchez, Luis N; Evers, Lisa; Wu, Jianmin; Pinese, Mark; Cowley, Mark J; Jones, Marc D; Colvin, Emily K; Nagrial, Adnan M; Humphrey, Emily S; Chantrill, Lorraine A; Mawson, Amanda; Humphris, Jeremy; Chou, Angela; Pajic, Marina; Scarlett, Christopher J; Pinho, Andreia V; Giry-Laterriere, Marc; Rooman, Ilse; Samra, Jaswinder S; Kench, James G; Lovell, Jessica A; Merrett, Neil D; Toon, Christopher W; Epari, Krishna; Nguyen, Nam Q; Barbour, Andrew; Zeps, Nikolajs; Moran-Jones, Kim; Jamieson, Nigel B; Graham, Janet S; Duthie, Fraser; Oien, Karin; Hair, Jane; Grützmann, Robert; Maitra, Anirban; Iacobuzio-Donahue, Christine A; Wolfgang, Christopher L; Morgan, Richard A; Lawlor, Rita T; Corbo, Vincenzo; Bassi, Claudio; Rusev, Borislav; Capelli, Paola; Salvia, Roberto; Tortora, Giampaolo; Mukhopadhyay, Debabrata; Petersen, Gloria M; Munzy, Donna M; Fisher, William E; Karim, Saadia A; Eshleman, James R; Hruban, Ralph H; Pilarsky, Christian; Morton, Jennifer P; Sansom, Owen J; Scarpa, Aldo; Musgrove, Elizabeth A; Bailey, Ulla-Maja Hagbo; Hofmann, Oliver; Sutherland, Robert L; Wheeler, David A; Gill, Anthony J; Gibbs, Richard A; Pearson, John V; Waddell, Nicola; Biankin, Andrew V; Grimmond, Sean M

    2016-03-01

    Integrated genomic analysis of 456 pancreatic ductal adenocarcinomas identified 32 recurrently mutated genes that aggregate into 10 pathways: KRAS, TGF-β, WNT, NOTCH, ROBO/SLIT signalling, G1/S transition, SWI-SNF, chromatin modification, DNA repair and RNA processing. Expression analysis defined 4 subtypes: (1) squamous; (2) pancreatic progenitor; (3) immunogenic; and (4) aberrantly differentiated endocrine exocrine (ADEX) that correlate with histopathological characteristics. Squamous tumours are enriched for TP53 and KDM6A mutations, upregulation of the TP63∆N transcriptional network, hypermethylation of pancreatic endodermal cell-fate determining genes and have a poor prognosis. Pancreatic progenitor tumours preferentially express genes involved in early pancreatic development (FOXA2/3, PDX1 and MNX1). ADEX tumours displayed upregulation of genes that regulate networks involved in KRAS activation, exocrine (NR5A2 and RBPJL), and endocrine differentiation (NEUROD1 and NKX2-2). Immunogenic tumours contained upregulated immune networks including pathways involved in acquired immune suppression. These data infer differences in the molecular evolution of pancreatic cancer subtypes and identify opportunities for therapeutic development.

  19. Whole-genome analyses of speciation events in pathogenic Brucellae

    Energy Technology Data Exchange (ETDEWEB)

    Chain, Patrick S. G. [Lawrence Livermore National Laboratory (LLNL); Comerci, Diego J. [Universidad Nacional de General San Martin; Tolmasky, Marcelo E. [California State University; Larimer, Frank W [ORNL; Malfatti, Stephanie [Lawrence Livermore National Laboratory (LLNL); Vergez, Lisa [Lawrence Livermore National Laboratory (LLNL); Aguero, Fernan [Universidad Nacional de General San Martin; Land, Miriam L [ORNL; Ugalde, Rodolfo A. [Universidad Nacional de General San Martin; Garcia, Emilio [Lawrence Livermore National Laboratory (LLNL)

    2005-12-01

    Despite their high DNA identity and a proposal to group classical Brucella species as biovars of Brucella melitensis, the commonly recognized Brucella species can be distinguished by distinct biochemical and fatty acid characters, as well as by a marked host range (e.g., Brucella suis for swine, B. melitensis for sheep and goats, and Brucella abortus for cattle). Here we present the genome of B. abortus 2308, the virulent prototype biovar 1 strain, and its comparison to the two other human pathogenic Brucella species and to B. abortus field isolate 9-941. The global distribution of pseudogenes, deletions, and insertions supports previous indications that B. abortus and B. melitensis share a common ancestor that diverged from B. suis. With the exception of a dozen genes, the genetic complements of both B. abortus strains are identical, whereas the three species differ in gene content and pseudogenes. The pattern of species-specific gene inactivations affecting transcriptional regulators and outer membrane proteins suggests that these inactivations may play an important role in the establishment of host specificity and may have been a primary driver of speciation in the genus Brucella. Despite being nonmotile, the brucellae contain flagellum gene clusters and display species-specific flagellar gene inactivations, which lead to the putative generation of different versions of flagellum-derived structures and may contribute to differences in host specificity and virulence. Metabolic changes such as the lack of complete metabolic pathways for the synthesis of numerous compounds (e.g., glycogen, biotin, NAD, and choline) are consistent with adaptation of brucellae to an intracellular life-style.

  20. Genome-based comparative analyses of Antarctic and temperate species of Paenibacillus.

    Directory of Open Access Journals (Sweden)

    Melissa Dsouza

    Full Text Available Antarctic soils represent a unique environment characterised by extremes of temperature, salinity, elevated UV radiation, low nutrient and low water content. Despite the harshness of this environment, members of 15 bacterial phyla have been identified in soils of the Ross Sea Region (RSR). However, the survival mechanisms and ecological roles of these phyla are largely unknown. The aim of this study was to investigate whether strains of Paenibacillus darwinianus owe their resilience to substantial genomic changes. For this, genome-based comparative analyses were performed on three P. darwinianus strains, isolated from gamma-irradiated RSR soils, together with nine temperate, soil-dwelling Paenibacillus spp. The genome of each strain was sequenced to over 1,000-fold coverage, then assembled into contigs totalling approximately 3 Mbp per genome. Based on the occurrence of essential, single-copy genes, genome completeness was estimated at approximately 88%. Genome analysis revealed between 3,043 and 3,091 protein-coding sequences (CDSs), primarily associated with two-component systems, sigma factors, transporters, sporulation and genes induced by cold-shock, oxidative and osmotic stresses. These comparative analyses provide an insight into the metabolic potential of P. darwinianus, revealing potential adaptive mechanisms for survival in Antarctic soils. However, a large proportion of these mechanisms were also identified in temperate Paenibacillus spp., suggesting that these mechanisms are beneficial for growth and survival in a range of soil environments. These analyses have also revealed that the P. darwinianus genomes contain significantly fewer CDSs and have a lower paralogous content. Notwithstanding the incompleteness of the assemblies, the large differences in genome sizes, determined by the number of genes in paralogous clusters and the CDS content, are indicative of genome content scaling. Finally, these sequences are a resource for further

  1. Expressing PHB synthetic genes through chloroplast genetic engineering

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    A chloroplast integration and expression vector containing expression cassettes for the phbB, phbA, phbC and aadA genes was constructed and bombarded into the tobacco chloroplast genome. Transplastomic plants were analyzed by PCR and Southern blot, and their homoplastomy was also assessed. Northern dot and RT-PCR analyses were employed to investigate transgene expression at the transcriptional level. The results indicate that the chloroplast transformation system is suitable for poly-3-hydroxybutyrate (PHB) production.

  2. The geometric increase in meta-analyses from China in the genomic era.

    Directory of Open Access Journals (Sweden)

    John P A Ioannidis

    Full Text Available Meta-analyses are increasingly popular. It is unknown whether this popularity is driven by specific countries and specific meta-analyses types. PubMed was used to identify meta-analyses since 1995 (last update 9/1/2012) and catalogue their types and country of origin. We focused more on meta-analyses from China (the current top producer of meta-analyses) versus the USA (top producer until recently). The annual number of meta-analyses from China increased 40-fold between 2003 and 2011 versus 2.4-fold for the USA. The growth of Chinese meta-analyses was driven by genetics (110-fold increase in 2011 versus 2003). The HuGE Navigator identified 612 meta-analyses of genetic association studies published in 2012 from China versus only 109 from the USA. We compared in-depth 50 genetic association meta-analyses from China versus 50 from USA in 2012. Meta-analyses from China almost always used only literature-based data (92%), and focused on one or two genes (94%) and variants (78%) identified with candidate gene approaches (88%), while many USA meta-analyses used genome-wide approaches and raw data. Both groups usually concluded favorably for the presence of genetic associations (80% versus 74%), but nominal significance (P<0.05) typically sufficed in the China group. Meta-analyses from China typically neglected genome-wide data, and often included candidate gene studies published in Chinese-language journals. Overall, there is an impressive rise of meta-analyses from China, particularly on genetic associations. Since most claimed candidate gene associations are likely false-positives, there is an urgent global need to incorporate genome-wide data and state-of-the art statistical inferences to avoid a flood of false-positive genetic meta-analyses.

  3. The geometric increase in meta-analyses from China in the genomic era.

    Science.gov (United States)

    Ioannidis, John P A; Chang, Christine Q; Lam, Tram Kim; Schully, Sheri D; Khoury, Muin J

    2013-01-01

    Meta-analyses are increasingly popular. It is unknown whether this popularity is driven by specific countries and specific meta-analyses types. PubMed was used to identify meta-analyses since 1995 (last update 9/1/2012) and catalogue their types and country of origin. We focused more on meta-analyses from China (the current top producer of meta-analyses) versus the USA (top producer until recently). The annual number of meta-analyses from China increased 40-fold between 2003 and 2011 versus 2.4-fold for the USA. The growth of Chinese meta-analyses was driven by genetics (110-fold increase in 2011 versus 2003). The HuGE Navigator identified 612 meta-analyses of genetic association studies published in 2012 from China versus only 109 from the USA. We compared in-depth 50 genetic association meta-analyses from China versus 50 from USA in 2012. Meta-analyses from China almost always used only literature-based data (92%), and focused on one or two genes (94%) and variants (78%) identified with candidate gene approaches (88%), while many USA meta-analyses used genome-wide approaches and raw data. Both groups usually concluded favorably for the presence of genetic associations (80% versus 74%), but nominal significance (P<0.05) typically sufficed in the China group. Meta-analyses from China typically neglected genome-wide data, and often included candidate gene studies published in Chinese-language journals. Overall, there is an impressive rise of meta-analyses from China, particularly on genetic associations. Since most claimed candidate gene associations are likely false-positives, there is an urgent global need to incorporate genome-wide data and state-of-the art statistical inferences to avoid a flood of false-positive genetic meta-analyses.

  4. Chloroplast protein and centrosomal genes, a tRNA intron, and odd telomeres in an unusually compact eukaryotic genome, the cryptomonad nucleomorph.

    Science.gov (United States)

    Zauner, S; Fraunholz, M; Wastl, J; Penny, S; Beaton, M; Cavalier-Smith, T; Maier, U G; Douglas, S

    2000-01-04

    Cells of several major algal groups are evolutionary chimeras of two radically different eukaryotic cells. Most of these "cells within cells" lost the nucleus of the former algal endosymbiont. But after hundreds of millions of years cryptomonads still retain the nucleus of their former red algal endosymbiont as a tiny relict organelle, the nucleomorph, which has three minute linear chromosomes, but their function and the nature of their ends have been unclear. We report extensive cryptomonad nucleomorph sequences (68.5 kb), from one end of each of the three chromosomes of Guillardia theta. Telomeres of the nucleomorph chromosomes differ dramatically from those of other eukaryotes, being repeats of the 23-mer sequence (AG)(7)AAG(6)A, not a typical hexamer (commonly TTAGGG). The subterminal regions comprising the rRNA cistrons and one protein-coding gene are exactly repeated at all three chromosome ends. Gene density (one per 0.8 kb) is the highest for any cellular genome. None of the 38 protein-coding genes has spliceosomal introns, in marked contrast to the chlorarachniophyte nucleomorph. Most identified nucleomorph genes are for gene expression or protein degradation; histone, tubulin, and putatively centrosomal ranbpm genes are probably important for chromosome segregation. No genes for primary or secondary metabolism have been found. Two of the three tRNA genes have introns, one in a hitherto undescribed location. Intergenic regions are exceptionally short; three genes transcribed by two different RNA polymerases overlap their neighbors. The reported sequences encode two essential chloroplast proteins, FtsZ and rubredoxin, thus explaining why cryptomonad nucleomorphs persist.
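The telomere repeat in this abstract is printed with its subscripts flattened into parentheses. Assuming "(AG)(7)AAG(6)A" stands for seven AG dinucleotides, then A, A, six G, and a final A, the expansion can be checked against the stated length of 23 nucleotides. The short sketch below does only that arithmetic; the variable name is illustrative.

```python
# Expand the repeat unit under the assumed reading (AG)x7 + A + A + Gx6 + A.
telomere_unit = "AG" * 7 + "A" + "A" + "G" * 6 + "A"

print(telomere_unit)       # the expanded repeat unit
print(len(telomere_unit))  # should match the 23-mer stated in the abstract
```

The length agrees with the abstract's "23-mer", which supports this reading of the flattened notation.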

  5. Comparative Analysis of Codon Usage Patterns Among Mitochondrion, Chloroplast and Nuclear Genes in Triticum aestivum L.

    Institute of Scientific and Technical Information of China (English)

    Wen-Juan Zhang; Jie Zhou; Zuo-Feng Li; Li Wang; Xun Gu; Yang Zhong

    2007-01-01

    In many organisms, the difference in codon usage patterns among genes reflects variation in local base compositional biases and the intensity of natural selection. In this study, a comparative analysis was performed to investigate the characteristics of codon bias and the factors shaping the codon usage patterns among mitochondrion, chloroplast and nuclear genes in common wheat (Triticum aestivum L.). GC contents in nuclear genes were higher than those in mitochondrion and chloroplast genes. The neutrality and correspondence analyses indicated that the codon usage in nuclear genes would be a result of relatively strong mutational bias, while the codon usage patterns of mitochondrion and chloroplast genes were more conserved in GC content and influenced by translation level. The Parity Rule 2 (PR2) plot analysis showed that pyrimidines were used more frequently than purines at the third codon position in the three genomes. In addition, using a new alternative strategy, 11, 12, and 24 triplets were defined as preferred codons in the mitochondrion, chloroplast and nuclear genes, respectively. These findings suggested that the mitochondrion, chloroplast and nuclear genes shared particularly different features of codon usage and evolutionary constraints.
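The third-position quantities behind a PR2 plot are easy to compute from a coding sequence. The sketch below is a simplified illustration, not the authors' pipeline: it counts third positions over all codons (PR2 analyses are often restricted to four-fold degenerate codons), and the function name `third_position_stats` and its dictionary keys are hypothetical.

```python
from collections import Counter


def third_position_stats(cds: str) -> dict:
    """Summarise base usage at third codon positions of a coding sequence.

    Returns GC3 (G+C fraction at third positions) and the two PR2 axes,
    A3/(A3+T3) and G3/(G3+C3). Values below 0.5 on the first axis mean
    T is preferred over A; below 0.5 on the second mean C over G.
    """
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    thirds = Counter(seq[i + 2] for i in range(0, usable, 3))
    a, t, g, c = (thirds[base] for base in "ATGC")

    def ratio(num: int, den: int) -> float:
        return num / den if den else float("nan")

    return {
        "gc3": ratio(g + c, a + t + g + c),
        "a3_over_at3": ratio(a, a + t),
        "g3_over_gc3": ratio(g, g + c),
    }


# Toy sequence: codons GCC GCC GCT GCA, third positions C, C, T, A.
print(third_position_stats("GCCGCCGCTGCA"))
```

A gene in which pyrimidines dominate the third position, as the abstract reports, would tend to show both ratios below 0.5.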

  6. Nitrogen control of chloroplast differentiation

    Energy Technology Data Exchange (ETDEWEB)

    Schmidt, G.W.

    1992-07-01

    This project is directed toward understanding how the availability of nitrogen affects the accumulation of chloroplast pigments and proteins functioning in energy transduction and carbon metabolism. Molecular analyses are performed with Chlamydomonas reinhardtii grown in a continuous culture system such that ammonium concentration is maintained at a low steady-state concentration so as to limit cell division. As compared to chloroplasts from cells of non-limiting nitrogen provisions, chloroplasts of N-limited cells are profoundly chlorophyll-deficient but still assimilate carbon for deposition as starch and as storage lipids. Chlorophyll deficiency arises by limiting accumulation of appropriate nuclear-encoded mRNAs and by depressed rates of translation of chloroplast mRNAs for apoproteins of reaction centers. Chloroplast translational effects can be partially ascribed to diminished rates of chlorophyll biosynthesis in N-limited cells, but pigment levels are not determinants for expression of the nuclear light-harvesting protein genes. Consequently, other signals that are responsive to nitrogen availability mediate transcriptional or post-transcriptional processes for accumulation of the mRNAs for LHC apoproteins and other mRNAs whose abundance is dependent upon high nitrogen levels. Conversely, limited nitrogen availability promotes accumulation of other proteins involved in carbon metabolism and oxidative electron transport in chloroplasts. Hence, thylakoids of N-limited cells exhibit enhanced chlororespiratory activities wherein oxygen serves as the electron acceptor in a pathway that involves plastoquinone and other electron carrier proteins that remain to be thoroughly characterized. Ongoing and future studies are also outlined.

  7. Comparative Genome Analyses of Vibrio anguillarum Strains Reveal a Link with Pathogenicity Traits

    Science.gov (United States)

    Castillo, Daniel; D'Alvise, Paul; Xu, Ruiqi; Zhang, Faxing; Middelboe, Mathias

    2017-01-01

    ABSTRACT Vibrio anguillarum is a marine bacterium that can cause vibriosis in many fish and shellfish species, leading to high mortalities and economic losses in aquaculture. Although putative virulence factors have been identified, the mechanism of pathogenesis of V. anguillarum is not fully understood. Here, we analyzed whole-genome sequences of a collection of V. anguillarum strains and compared them to virulence of the strains as determined in larval challenge assays. Previously identified virulence factors were globally distributed among the strains, with some genetic diversity. However, the pan-genome revealed that six out of nine high-virulence strains possessed a unique accessory genome that was attributed to pathogenic genomic islands, prophage-like elements, virulence factors, and a new set of gene clusters involved in biosynthesis, modification, and transport of polysaccharides. In contrast, V. anguillarum strains that were medium to nonvirulent had a high degree of genomic homogeneity. Finally, we found that a phylogeny based on the core genomes clustered the strains with moderate to no virulence, while six out of nine high-virulence strains represented phylogenetically separate clusters. Hence, we suggest a link between genotype and virulence characteristics of Vibrio anguillarum, which can be used to unravel the molecular evolution of V. anguillarum and can also be important from survey and diagnostic perspectives. IMPORTANCE Comparative genome analysis of strains of a pathogenic bacterial species can be a powerful tool to discover acquisition of mobile genetic elements related to virulence. Here, we compared 28 V. anguillarum strains that differed in virulence in fish larval models. By pan-genome analyses, we found that six of nine highly virulent strains had a unique core and accessory genome. In contrast, V. anguillarum strains that were medium to nonvirulent had low genomic diversity. Integration of genomic and phenotypic features provides

  8. Phylogenetic relationships of Ruteae (Rutaceae): new evidence from the chloroplast genome and comparisons with non-molecular data.

    Science.gov (United States)

    Salvo, Gabriele; Bacchetta, Gianluigi; Ghahremaninejad, Farrokh; Conti, Elena

    2008-12-01

    Phylogenetic analyses of three cpDNA markers (matK, rpl16, and trnL-trnF) were performed to evaluate previous treatments of Ruteae based on morphology and phytochemistry that contradicted each other, especially regarding the taxonomic status of Haplophyllum and Dictamnus. Trees derived from morphological, phytochemical, and molecular datasets of Ruteae were then compared to look for possible patterns of agreement among them. Furthermore, non-molecular characters were mapped on the molecular phylogeny to identify uniquely derived states and patterns of homoplasy in the morphological and phytochemical datasets. The phylogenetic analyses determined that Haplophyllum and Ruta form reciprocally exclusive monophyletic groups and that Dictamnus is not closely related to the other genera of Ruteae. The different types of datasets were partly incongruent with each other. The discordant phylogenetic patterns between the phytochemical and molecular trees might be best explained in terms of convergence in secondary chemical compounds. Finally, only a few non-molecular synapomorphies provided support for the clades of the molecular tree, while most of the morphological characters traditionally used for taxonomic purposes were found to be homoplasious. Within the context of the phylogenetic relationships supported by molecular data, Ruta, the type genus for the family, can only be diagnosed by using a combination of plesiomorphic, homoplasious, and autapomorphic morphological character states.

  9. Genomics and Comparative Genomic Analyses Provide Insight into the Taxonomy and Pathogenic Potential of Novel Emmonsia Pathogens

    Science.gov (United States)

    Yang, Ying; Ye, Qiang; Li, Kang; Li, Zongwei; Bo, Xiaochen; Li, Zhen; Xu, Yingchun; Wang, Shengqi; Wang, Peng; Chen, Huipeng; Wang, Junzhi

    2017-01-01

    Over the last 50 years, newly described species of Emmonsia-like fungi have been implicated globally as sources of systemic human mycosis (emmonsiosis). Their ability to convert into yeast-like cells capable of replication and extra-pulmonary dissemination during the course of infection differentiates them from classical Emmonsia species. Immunocompromised patients are at highest risk of emmonsiosis and exhibit high mortality rates. In order to investigate the molecular basis for pathogenicity of the newly described Emmonsia species, genomic sequencing and comparative genomic analyses of Emmonsia sp. 5z489, which was isolated from a non-deliberately immunosuppressed diabetic patient in China and represents a novel seventh isolate of Emmonsia-like fungi, were performed. The genome of 5z489 was 35.5 Mbp in length, which is ~5 Mbp larger than other Emmonsia strains. Further, 9,188 protein genes were predicted in the 5z489 genome and 16% of the assembly was identified as repetitive elements, which is the largest abundance in Emmonsia species. Phylogenetic analyses based on whole genome data classified 5z489 and CAC-2015a, another novel isolate, as members of the genus Emmonsia. Our analyses showed that divergences among Emmonsia occurred much earlier than other genera within the family Ajellomycetaceae, suggesting relatively distant evolutionary relationships among the genus. Through comparisons of Emmonsia species, we discovered significant pathogenicity characteristics within the genus as well as putative virulence factors that may play a role in the infection and pathogenicity of the novel Emmonsia strains. Moreover, our analyses revealed a novel distribution mode of DNA methylation patterns across the genome of 5z489, with >50% of methylated bases located in intergenic regions. These methylation patterns differ considerably from other reported fungi, where most methylation occurs in repetitive loci. It is unclear if this difference is related to physiological

  10. Bootstrap, Bayesian probability and maximum likelihood mapping: exploring new tools for comparative genome analyses

    Directory of Open Access Journals (Sweden)

    Gogarten J Peter

    2002-02-01

    Full Text Available Abstract Background Horizontal gene transfer (HGT) played an important role in shaping microbial genomes. In addition to genes under sporadic selection, HGT also affects housekeeping genes and those involved in information processing, even ribosomal RNA encoding genes. Here we describe tools that provide an assessment and graphic illustration of the mosaic nature of microbial genomes. Results We adapted the Maximum Likelihood (ML) mapping to the analyses of all detected quartets of orthologous genes found in four genomes. We have automated the assembly and analyses of these quartets of orthologs given the selection of four genomes. We compared the ML-mapping approach to more rigorous Bayesian probability and Bootstrap mapping techniques. The latter two approaches appear to be more conservative than the ML-mapping approach, but qualitatively all three approaches give equivalent results. All three tools were tested on mitochondrial genomes, which presumably were inherited as a single linkage group. Conclusions In some instances of interphylum relationships we find nearly equal numbers of quartets strongly supporting the three possible topologies. In contrast, our analyses of genome quartets containing the cyanobacterium Synechocystis sp. indicate that a large part of the cyanobacterial genome is related to that of low GC Gram positives. Other groups that had been suggested as sister groups to the cyanobacteria contain many fewer genes that group with the Synechocystis orthologs. Interdomain comparisons of genome quartets containing the archaeon Halobacterium sp. revealed that Halobacterium sp. shares more genes with Bacteria that live in the same environment than with Bacteria that are more closely related based on rRNA phylogeny. Many of these genes encode proteins involved in substrate transport and metabolism and in information storage and processing. The performed analyses demonstrate that relationships among prokaryotes cannot be accurately

  11. Comparative Genome Analyses of Vibrio anguillarum Strains Reveal a Link with Pathogenicity Traits

    DEFF Research Database (Denmark)

    Castillo, Daniel; D'Alvise, Paul; Xu, Ruiqi

    2017-01-01

    Vibrio anguillarum is a marine bacterium that can cause vibriosis in many fish and shellfish species, leading to high mortalities and economic losses in aquaculture. Although putative virulence factors have been identified, the mechanism of pathogenesis of V. anguillarum is not fully understood....... anguillarum strains that were medium to nonvirulent had a high degree of genomic homogeneity. Finally, we found that a phylogeny based on the core genomes clustered the strains with moderate to no virulence, while six out of nine high-virulence strains represented phylogenetically separate clusters. Hence, we suggest...... be a powerful tool to discover acquisition of mobile genetic elements related to virulence. Here, we compared 28 V. anguillarum strains that differed in virulence in fish larval models. By pan-genome analyses, we found that six of nine highly virulent strains had a unique core and accessory genome. In contrast...

  12. Genome-wide meta-analyses identify multiple loci associated with smoking behavior

    NARCIS (Netherlands)

    H. Furberg (Helena); Y. Kim (Yunjung); J. Dackor (Jennifer); E.A. Boerwinkle (Eric); N. Franceschini (Nora); D. Ardissino (Diego); L. Bernardinelli (Luisa); P.M. Mannucci (Pier); F. Mauri (Francesco); P.A. Merlini (Piera); D. Absher (Devin); T.L. Assimes (Themistocles); S.P. Fortmann (Stephen); C. Iribarren (Carlos); J.W. Knowles (Joshua); T. Quertermous (Thomas); L. Ferrucci (Luigi); T. Tanaka (Toshiko); J.C. Bis (Joshua); T. Haritunians (Talin); B. McKnight (Barbara); B.M. Psaty (Bruce); K.D. Taylor (Kent); E.L. Thacker (Evan); P. Almgren (Peter); L. Groop (Leif); C. Ladenvall (Claes); M. Boehnke (Michael); A.U. Jackson (Anne); K.L. Mohlke (Karen); H.M. Stringham (Heather); J. Tuomilehto (Jaakko); E.J. Benjamin (Emelia); S.J. Hwang; D. Levy (Daniel); S.R. Preis; R.S. Vasan (Ramachandran Srini); J. Duan (Jubao); P.V. Gejman (Pablo); D.F. Levinson (Douglas); A.R. Sanders (Alan); J. Shi (Jianxin); E.H. Lips (Esther); J.D. McKay (James); A. Agudo (Antonio); L. Barzan (Luigi); V. Bencko (Vladimir); S. Benhamou (Simone); X. Castellsagué (Xavier); C. Canova (Cristina); D.I. Conway (David); E. Fabianova (Eleonora); L. Foretova (Lenka); V. Janout (Vladimir); C.M. Healy (Claire); I. Holcátová (Ivana); K. Kjaerheim (Kristina); P. Lagiou; J. Lissowska (Jolanta); R. Lowry (Ray); T.V. MacFarlane (Tatiana); D. Mates (Dana); L. Richiardi (Lorenzo); P. Rudnai (Peter); N. Szeszenia-Dabrowska (Neonilia); D. Zaridze; A. Znaor (Ariana); M. Lathrop (Mark); P. Brennan (Paul); S. Bandinelli (Stefania); T.M. Frayling (Timothy); J.M. Guralnik (Jack); Y. Milaneschi (Yuri); J.R.B. Perry (John); D. Altshuler (David); R. Elosua (Roberto); S. Kathiresan (Sekar); G. Lucas (Gavin); O. Melander (Olle); V. Salomaa (Veikko); S.M. Schwartz (Stephen); B.F. Voight (Benjamin); B.W.J.H. Penninx (Brenda); J.H. Smit (Johannes); N. Vogelzangs (Nicole); D.I. Boomsma (Dorret); E.J.C. de Geus (Eco); J.M. Vink (Jacqueline); G.A.H.M. Willemsen (Gonneke); S.J. Chanock (Stephen); F. Gu (Fangyi); S.E. Hankinson (Susan); D. Hunter (David); A. Hofman (Albert); H.W. Tiemeier (Henning); A.G. Uitterlinden (André); P. Tikka-Kleemola (Päivi); S. Walter (Stefan); D.I. Chasman (Daniel); B.M. Everett (Brendan); G. Pare (Guillaume); P.M. Ridker (Paul); M.D. Li (Ming); H.H. Maes (Hermine); J. Audrain-Mcgovern (Janet); D. Posthuma (Danielle); L.M. Thornton (Laura); C. Lerman (Caryn); J. Kaprio (Jaakko); J.E. Rose (Jed); J.P.A. Ioannidis (John); P. Kraft (Peter); D.Y. Lin (Dan); P.F. Sullivan (Patrick); C.J. O'Donnell (Christopher)

    2010-01-01

    Consistent but indirect evidence has implicated genetic factors in smoking behavior. We report meta-analyses of several smoking phenotypes within cohorts of the Tobacco and Genetics Consortium (n = 74,053). We also partnered with the European Network of Genetic and Genomic Epidemiology (

  13. RSIADB, a collective resource for genome and transcriptome analyses in Rhizoctonia solani AG1 IA.

    Science.gov (United States)

    Chen, Lei; Ai, Peng; Zhang, Jinfeng; Deng, Qiming; Wang, Shiquan; Li, Shuangcheng; Zhu, Jun; Li, Ping; Zheng, Aiping

    2016-01-01

    Rice [Oryza sativa (L.)] feeds more than half of the world's population. Rhizoctonia solani is a major fungal pathogen of rice causing extreme crop losses in all rice-growing regions of the world. R. solani AG1 IA is a major cause of sheath blight in rice. In this study, we constructed a comprehensive and user-friendly web-based database, RSIADB, to analyse its draft genome and transcriptome. The database was built using the genome sequence (10,489 genes) and annotation information for R. solani AG1 IA. A total of six RNAseq samples of R. solani AG1 IA were also analysed, corresponding to 10, 18, 24, 32, 48 and 72 h after infection of rice leaves. The RSIADB database enables users to search, browse, and download gene sequences for R. solani AG1 IA, and mine the data using BLAST, Sequence Extractor, Browse and Construction Diagram tools that were integrated into the database. RSIADB is an important genomic resource for scientists working with R. solani AG1 IA and will assist researchers in analysing the annotated genome and transcriptome of this pathogen. This resource will facilitate studies on gene function, pathogenesis factors and secreted proteins, as well as provide an avenue for comparative analyses of genes expressed during different stages of infection. Database URL: http://genedenovoweb.ticp.net:81/rsia/index.php.

  14. Comparative transcriptome analyses and genome assembly of Fusarium oxysporum f. sp. cubense

    NARCIS (Netherlands)

    Dita, M.A.; Herai, R.; Waalwijk, C.; Yamagishi, M.; Giachetto, P.; Ferreira, G.; Souza, de M.; Kema, G.H.J.

    2013-01-01

    Fusarium oxysporum f. sp. cubense (Foc), the causal agent of Fusarium wilt of banana, is a highly destructive and genetically diverse pathogen. Despite its economic importance, genomic information about Foc is limited and no transcriptomic analyses have been reported so far. By using 454 sequencing

  15. Whole-genome analyses of Korean native and Holstein cattle breeds by massively parallel sequencing.

    Directory of Open Access Journals (Sweden)

    Jung-Woo Choi

    Full Text Available A main goal of cattle genomics is to identify DNA differences that account for variations in economically important traits. In this study, we performed whole-genome analyses of three important cattle breeds in Korea--Hanwoo, Jeju Heugu, and Korean Holstein--using the Illumina HiSeq 2000 sequencing platform. We achieved 25.5-, 29.6-, and 29.5-fold coverage of the Hanwoo, Jeju Heugu, and Korean Holstein genomes, respectively, and identified a total of 10.4 million single nucleotide polymorphisms (SNPs), of which 54.12% were found to be novel. We also detected 1,063,267 insertions-deletions (InDels) across the genomes (78.92% novel). Annotations of the datasets identified a total of 31,503 nonsynonymous SNPs and 859 frameshift InDels that could affect phenotypic variations in traits of interest. Furthermore, genome-wide copy number variation regions (CNVRs) were detected by comparing the Hanwoo, Jeju Heugu, and previously published Chikso genomes against that of Korean Holstein. A total of 992, 284, and 1881 CNVRs, respectively, were detected throughout the genome. Moreover, 53, 65, 45, and 82 putative regions of homozygosity (ROH) were identified in Hanwoo, Jeju Heugu, Chikso, and Korean Holstein, respectively. The results of this study provide a valuable foundation for further investigations to dissect the molecular mechanisms underlying variation in economically important traits in cattle and to develop genetic markers for use in cattle breeding.

  16. The complete mitochondrial genomes of four cockroaches (Insecta: Blattodea) and phylogenetic analyses within cockroaches.

    Science.gov (United States)

    Cheng, Xue-Fang; Zhang, Le-Ping; Yu, Dan-Na; Storey, Kenneth B; Zhang, Jia-Yong

    2016-07-15

    Three complete mitochondrial genomes of Blaberidae (Insecta: Blattodea) (Gromphadorhina portentosa, Panchlora nivea, Blaptica dubia) and one complete mt genome of Blattidae (Insecta: Blattodea) (Shelfordella lateralis) were sequenced to further understand the characteristics of cockroach mitogenomes and reconstruct the phylogenetic relationships of Blattodea. The gene order and orientation of these four cockroach genomes were similar to known cockroach mt genomes, and each contained 13 protein-coding genes (PCGs), 2 ribosomal RNA (rRNA) genes, 22 transfer RNA (tRNA) genes and one control region. The mt genomes of Blattodea exhibited a characteristically high A+T composition (70.7%-74.3%) and dominant usage of the TAA stop codon. The AT content of the whole mt genome, PCGs and total tRNAs in G. portentosa was the lowest among known cockroaches. A 71-bp intergenic spacer region between trnQ and trnM was a unique feature of B. dubia, absent in other cockroaches, which can be explained by the duplication/random loss model. Based on the nucleotide and amino acid datasets of the 13 PCGs, neighbor-joining (NJ), maximum parsimony (MP), maximum likelihood (ML) and Bayesian inference (BI) analyses were used to rebuild the phylogenetic relationships of cockroaches. All phylogenetic analyses consistently placed Isoptera as the sister cluster to Cryptocercidae of Blattodea. Ectobiidae and Blaberidae (Blaberoidea) formed a sister clade to Blattidae. Corydiidae is a sister clade of all the remaining cockroach species, with high support in NJ and MP analyses of nucleotide and amino acid datasets, and in ML and BI analyses of the amino acid dataset.
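
    The A+T composition quoted above is the percentage of A and T bases in a sequence. The snippet below is a minimal sketch of that calculation, assuming a plain nucleotide string as input; the function name and the choice to ignore ambiguous bases such as N are illustrative assumptions, not details taken from the study.

```python
def at_content(sequence):
    """Return the A+T percentage of a nucleotide sequence.

    Only unambiguous bases (A, C, G, T) are counted; anything else,
    such as N or gap characters, is ignored (an assumption of this sketch).
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    at_count = sum(1 for b in bases if b in "AT")
    return 100.0 * at_count / len(bases)


# Gene tally from the abstract: 13 PCGs + 2 rRNA genes + 22 tRNA genes.
total_mt_genes = 13 + 2 + 22
```

    Applied to a full mitogenome sequence, `at_content` would give the kind of figure reported in the 70.7%-74.3% range; `total_mt_genes` evaluates to 37, the gene count implied by the abstract's breakdown.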

  17. Complex chloroplast RNA metabolism: just debugging the genetic programme?

    Directory of Open Access Journals (Sweden)

    Schmitz-Linneweber Christian

    2008-08-01

    Full Text Available Abstract Background The gene expression system of chloroplasts is far more complex than that of their cyanobacterial progenitor. This gain in complexity affects in particular RNA metabolism, specifically the transcription and maturation of RNA. Mature chloroplast RNA is generated by a plethora of nuclear-encoded proteins acquired or recruited during plant evolution, comprising additional RNA polymerases and sigma factors, and sequence-specific RNA maturation factors promoting RNA splicing, editing, end formation and translatability. Despite years of intensive research, we still lack a comprehensive explanation for this complexity. Results We inspected the available literature and genome databases for information on components of RNA metabolism in land plant chloroplasts. In particular, new inventions of chloroplast-specific mechanisms and the expansion of some gene/protein families detected in land plants lead us to suggest that the primary function of the additional nuclear-encoded components found in chloroplasts is the transgenomic suppression of point mutations, fixation of which occurred due to an enhanced genetic drift exhibited by chloroplast genomes. We further speculate that a fast evolution of transgenomic suppressors occurred after the water-to-land transition of plants. Conclusion Our inspections indicate that several chloroplast-specific mechanisms evolved in land plants to remedy point mutations that occurred after the water-to-land transition. Thus, the complexity of chloroplast gene expression evolved to guarantee the functionality of chloroplast genetic information and may not, with some exceptions, be involved in regulatory functions.

  18. Diversity of protist plastids (chloroplasts) and its causation analyses

    Institute of Scientific and Technical Information of China (English)

    张玉娟; 谭欢

    2012-01-01

    Eukaryotic chloroplasts normally possess a typical structure and function. However, the plastids (chloroplasts) of unicellular protists have various atypical structures and functions, such as multi-membrane-bound plastids without a nucleomorph, multi-membrane-bound plastids with a nucleomorph, and plastids with the smallest genomes, revealing the rich diversity of plastids. Here we review the diversity of plastids in diverse protists and explore the underlying causes of this diversity: the primary, secondary and tertiary endosymbiosis events that plastids underwent during evolution.

  19. Genomic analyses inform on migration events during the peopling of Eurasia

    KAUST Repository

    Pagani, Luca

    2016-09-20

    High-coverage whole-genome sequence studies have so far focused on a limited number of geographically restricted populations, or been targeted at specific diseases, such as cancer. Nevertheless, the availability of high-resolution genomic data has led to the development of new methodologies for inferring population history and refuelled the debate on the mutation rate in humans. Here we present the Estonian Biocentre Human Genome Diversity Panel (EGDP), a dataset of 483 high-coverage human genomes from 148 populations worldwide, including 379 new genomes from 125 populations, which we group into diversity and selection sets. We analyse this dataset to refine estimates of continent-wide patterns of heterozygosity, long- and short-distance gene flow, archaic admixture, and changes in effective population size through time as well as for signals of positive or balancing selection. We find a genetic signature in present-day Papuans that suggests that at least 2% of their genome originates from an early and largely extinct expansion of anatomically modern humans (AMHs) out of Africa. Together with evidence from the western Asian fossil record, and admixture between AMHs and Neanderthals predating the main Eurasian expansion, our results contribute to the mounting evidence for the presence of AMHs out of Africa earlier than 75,000 years ago. © 2016 Macmillan Publishers Limited, part of Springer Nature.

  20. Genomic analyses inform on migration events during the peopling of Eurasia

    Science.gov (United States)

    Pagani, Luca; Lawson, Daniel John; Jagoda, Evelyn; Mörseburg, Alexander; Eriksson, Anders; Mitt, Mario; Clemente, Florian; Hudjashov, Georgi; Degiorgio, Michael; Saag, Lauri; Wall, Jeffrey D.; Cardona, Alexia; Mägi, Reedik; Sayres, Melissa A. Wilson; Kaewert, Sarah; Inchley, Charlotte; Scheib, Christiana L.; Järve, Mari; Karmin, Monika; Jacobs, Guy S.; Antao, Tiago; Iliescu, Florin Mircea; Kushniarevich, Alena; Ayub, Qasim; Tyler-Smith, Chris; Xue, Yali; Yunusbayev, Bayazit; Tambets, Kristiina; Mallick, Chandana Basu; Saag, Lehti; Pocheshkhova, Elvira; Andriadze, George; Muller, Craig; Westaway, Michael C.; Lambert, David M.; Zoraqi, Grigor; Turdikulova, Shahlo; Dalimova, Dilbar; Sabitov, Zhaxylyk; Sultana, Gazi Nurun Nahar; Lachance, Joseph; Tishkoff, Sarah; Momynaliev, Kuvat; Isakova, Jainagul; Damba, Larisa D.; Gubina, Marina; Nymadawa, Pagbajabyn; Evseeva, Irina; Atramentova, Lubov; Utevska, Olga; Ricaut, François-Xavier; Brucato, Nicolas; Sudoyo, Herawati; Letellier, Thierry; Cox, Murray P.; Barashkov, Nikolay A.; Škaro, Vedrana; Mulahasanović, Lejla; Primorac, Dragan; Sahakyan, Hovhannes; Mormina, Maru; Eichstaedt, Christina A.; Lichman, Daria V.; Abdullah, Syafiq; Chaubey, Gyaneshwer; Wee, Joseph T. S.; Mihailov, Evelin; Karunas, Alexandra; Litvinov, Sergei; Khusainova, Rita; Ekomasova, Natalya; Akhmetova, Vita; Khidiyatova, Irina; Marjanović, Damir; Yepiskoposyan, Levon; Behar, Doron M.; Balanovska, Elena; Metspalu, Andres; Derenko, Miroslava; Malyarchuk, Boris; Voevoda, Mikhail; Fedorova, Sardana A.; Osipova, Ludmila P.; Lahr, Marta Mirazón; Gerbault, Pascale; Leavesley, Matthew; Migliano, Andrea Bamberg; Petraglia, Michael; Balanovsky, Oleg; Khusnutdinova, Elza K.; Metspalu, Ene; Thomas, Mark G.; Manica, Andrea; Nielsen, Rasmus; Villems, Richard; Willerslev, Eske; Kivisild, Toomas; Metspalu, Mait

    2016-10-01

    High-coverage whole-genome sequence studies have so far focused on a limited number of geographically restricted populations, or been targeted at specific diseases, such as cancer. Nevertheless, the availability of high-resolution genomic data has led to the development of new methodologies for inferring population history and refuelled the debate on the mutation rate in humans. Here we present the Estonian Biocentre Human Genome Diversity Panel (EGDP), a dataset of 483 high-coverage human genomes from 148 populations worldwide, including 379 new genomes from 125 populations, which we group into diversity and selection sets. We analyse this dataset to refine estimates of continent-wide patterns of heterozygosity, long- and short-distance gene flow, archaic admixture, and changes in effective population size through time as well as for signals of positive or balancing selection. We find a genetic signature in present-day Papuans that suggests that at least 2% of their genome originates from an early and largely extinct expansion of anatomically modern humans (AMHs) out of Africa. Together with evidence from the western Asian fossil record, and admixture between AMHs and Neanderthals predating the main Eurasian expansion, our results contribute to the mounting evidence for the presence of AMHs out of Africa earlier than 75,000 years ago.

  1. Analysis of Protein Import into Chloroplasts Isolated from Stressed Plants.

    Science.gov (United States)

    Ling, Qihua; Jarvis, Paul

    2016-11-01

    Chloroplasts are organelles with many vital roles in plants, which include not only photosynthesis but numerous other metabolic and signaling functions. Furthermore, chloroplasts are critical for plant responses to various abiotic stresses, such as salinity and osmotic stresses. A chloroplast may contain up to ~3,000 different proteins, some of which are encoded by its own genome. However, the majority of chloroplast proteins are encoded in the nucleus and synthesized in the cytosol, and these proteins need to be imported into the chloroplast through translocons at the chloroplast envelope membranes. Recent studies have shown that the chloroplast protein import can be actively regulated by stress. To biochemically investigate such regulation of protein import under stress conditions, we developed the method described here as a quick and straightforward procedure that can easily be achieved in any laboratory. In this method, plants are grown under normal conditions and then exposed to stress conditions in liquid culture. Plant material is collected, and chloroplasts are then released by homogenization. The crude homogenate is separated by density gradient centrifugation, enabling isolation of the intact chloroplasts. Chloroplast yield is assessed by counting, and chloroplast intactness is checked under a microscope. For the protein import assays, purified chloroplasts are incubated with (35)S radiolabeled in vitro translated precursor proteins, and time-course experiments are conducted to enable comparisons of import rates between genotypes under stress conditions. We present data generated using this method which show that the rate of protein import into chloroplasts from a regulatory mutant is specifically altered under osmotic stress conditions.

  2. Genome-wide analyses of HTLV-1aD strains from Cape Verde, Africa

    Science.gov (United States)

    Zanella, Louise; de Pina-Araujo I, Isabel; Morgado, Mariza G; Vicente, Ana Carolina

    2016-01-01

    We characterised and reported the first full-length genomes of Human T-cell Lymphotropic Virus Type 1 subgroup HTLV-1aD (CV21 and CV79). This subgroup is one of the major determinants of HTLV-1 infections in North and West Africa, and recombinant strains involving this subgroup have been recently demonstrated. The CV21 and CV79 strains from Cape Verde/Africa were characterised as pure HTLV-1aD genomes; comparative analyses including HTLV-1 subtypes and subgroups revealed HTLV-1aD signatures in the envelope, pol, and pX regions. These genomes provide original information that will contribute to further studies on HTLV-1a epidemiology and evolution. PMID:27653363

  3. Lifestyle transitions in plant pathogenic Colletotrichum fungi deciphered by genome and transcriptome analyses.

    Science.gov (United States)

    O'Connell, Richard J; Thon, Michael R; Hacquard, Stéphane; Amyotte, Stefan G; Kleemann, Jochen; Torres, Maria F; Damm, Ulrike; Buiate, Ester A; Epstein, Lynn; Alkan, Noam; Altmüller, Janine; Alvarado-Balderrama, Lucia; Bauser, Christopher A; Becker, Christian; Birren, Bruce W; Chen, Zehua; Choi, Jaeyoung; Crouch, Jo Anne; Duvick, Jonathan P; Farman, Mark A; Gan, Pamela; Heiman, David; Henrissat, Bernard; Howard, Richard J; Kabbage, Mehdi; Koch, Christian; Kracher, Barbara; Kubo, Yasuyuki; Law, Audrey D; Lebrun, Marc-Henri; Lee, Yong-Hwan; Miyara, Itay; Moore, Neil; Neumann, Ulla; Nordström, Karl; Panaccione, Daniel G; Panstruga, Ralph; Place, Michael; Proctor, Robert H; Prusky, Dov; Rech, Gabriel; Reinhardt, Richard; Rollins, Jeffrey A; Rounsley, Steve; Schardl, Christopher L; Schwartz, David C; Shenoy, Narmada; Shirasu, Ken; Sikhakolli, Usha R; Stüber, Kurt; Sukno, Serenella A; Sweigard, James A; Takano, Yoshitaka; Takahara, Hiroyuki; Trail, Frances; van der Does, H Charlotte; Voll, Lars M; Will, Isa; Young, Sarah; Zeng, Qiandong; Zhang, Jingze; Zhou, Shiguo; Dickman, Martin B; Schulze-Lefert, Paul; Ver Loren van Themaat, Emiel; Ma, Li-Jun; Vaillancourt, Lisa J

    2012-09-01

    Colletotrichum species are fungal pathogens that devastate crop plants worldwide. Host infection involves the differentiation of specialized cell types that are associated with penetration, growth inside living host cells (biotrophy) and tissue destruction (necrotrophy). We report here genome and transcriptome analyses of Colletotrichum higginsianum infecting Arabidopsis thaliana and Colletotrichum graminicola infecting maize. Comparative genomics showed that both fungi have large sets of pathogenicity-related genes, but families of genes encoding secreted effectors, pectin-degrading enzymes, secondary metabolism enzymes, transporters and peptidases are expanded in C. higginsianum. Genome-wide expression profiling revealed that these genes are transcribed in successive waves that are linked to pathogenic transitions: effectors and secondary metabolism enzymes are induced before penetration and during biotrophy, whereas most hydrolases and transporters are upregulated later, at the switch to necrotrophy. Our findings show that preinvasion perception of plant-derived signals substantially reprograms fungal gene expression and indicate previously unknown functions for particular fungal cell types.

  4. Genome-wide analyses of aggressiveness in attention-deficit hyperactivity disorder.

    Science.gov (United States)

    Brevik, Erlend J; van Donkelaar, Marjolein M J; Weber, Heike; Sánchez-Mora, Cristina; Jacob, Christian; Rivero, Olga; Kittel-Schneider, Sarah; Garcia-Martínez, Iris; Aebi, Marcel; van Hulzen, Kimm; Cormand, Bru; Ramos-Quiroga, Josep A; Lesch, Klaus-Peter; Reif, Andreas; Ribasés, Marta; Franke, Barbara; Posserud, Maj-Britt; Johansson, Stefan; Lundervold, Astri J; Haavik, Jan; Zayats, Tetyana

    2016-07-01

    Aggressiveness is a behavioral trait that has the potential to be harmful to individuals and society. With an estimated heritability of about 40%, genetics is important in its development. We performed an exploratory genome-wide association (GWA) analysis of childhood aggressiveness in attention deficit hyperactivity disorder (ADHD) to gain insight into the underlying biological processes associated with this trait. Our primary sample consisted of 1,060 adult ADHD patients (aADHD). To further explore the genetic architecture of childhood aggressiveness, we performed enrichment analyses of suggestive genome-wide associations observed in aADHD among GWA signals of dimensions of oppositionality (defiant/vindictive and irritable dimensions) in childhood ADHD (cADHD). No single polymorphism reached genome-wide significance (P aggressiveness and provide targets for further genetic exploration of aggressiveness across psychiatric disorders. © 2016 The Authors. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics Published by Wiley Periodicals, Inc.

  5. Mitochondrial genome analyses suggest multiple Trichuris species in humans, baboons, and pigs from different geographical regions

    DEFF Research Database (Denmark)

    Hawash, Mohamed B. F.; Andersen, Lee O.; Gasser, Robin B.;

    2015-01-01

    BACKGROUND: The whipworms Trichuris trichiura and Trichuris suis are two parasitic nematodes of humans and pigs, respectively. Although whipworms in human and non-human primates historically have been referred to as T. trichiura, recent reports suggest that several Trichuris spp. are found...... in primates. METHODS AND FINDINGS: We sequenced and annotated complete mitochondrial genomes of Trichuris recovered from a human in Uganda, an olive baboon in the US, a hamadryas baboon in Denmark, and two pigs from Denmark and Uganda. Comparative analyses using other published mitochondrial genomes...... of Trichuris recovered from a human and a porcine host in China and from a françois' leaf-monkey (China) were performed, including phylogenetic analyses and pairwise genetic and amino acid distances. Genetic and protein distances between human Trichuris in Uganda and China were high (~19% and 15%, respectively...

  6. Phylogenetic analyses of cyanobacterial genomes: Quantification of horizontal gene transfer events

    OpenAIRE

    Zhaxybayeva, Olga; Gogarten, J. Peter; Charlebois, Robert L.; Doolittle, W Ford; Papke, R Thane

    2006-01-01

    Using 1128 protein-coding gene families from 11 completely sequenced cyanobacterial genomes, we attempt to quantify horizontal gene transfer events within cyanobacteria, as well as between cyanobacteria and other phyla. A novel method of detecting and enumerating potential horizontal gene transfer events within a group of organisms based on analyses of “embedded quartets” allows us to identify phylogenetic signal consistent with a plurality of gene families, as well as to delineate cases of c...

  7. Comparative and functional genomic analyses of the pathogenicity of phytopathogen Xanthomonas campestris pv. campestris.

    Science.gov (United States)

    Qian, Wei; Jia, Yantao; Ren, Shuang-Xi; He, Yong-Qiang; Feng, Jia-Xun; Lu, Ling-Feng; Sun, Qihong; Ying, Ge; Tang, Dong-Jie; Tang, Hua; Wu, Wei; Hao, Pei; Wang, Lifeng; Jiang, Bo-Le; Zeng, Shenyan; Gu, Wen-Yi; Lu, Gang; Rong, Li; Tian, Yingchuan; Yao, Zhijian; Fu, Gang; Chen, Baoshan; Fang, Rongxiang; Qiang, Boqin; Chen, Zhu; Zhao, Guo-Ping; Tang, Ji-Liang; He, Chaozu

    2005-06-01

    Xanthomonas campestris pathovar campestris (Xcc) is the causative agent of crucifer black rot disease, which causes severe losses in agricultural yield world-wide. This bacterium is a model organism for studying plant-bacteria interactions. We sequenced the complete genome of Xcc 8004 (5,148,708 bp), which is highly conserved relative to that of Xcc ATCC 33913. Comparative genomics analysis indicated that, in addition to a significant genomic-scale rearrangement across the replication axis between two IS1478 elements, loss and acquisition of blocks of genes, rather than point mutations, constitute the main genetic variation between the two Xcc strains. Screening of a high-density transposon insertional mutant library (16,512 clones) of Xcc 8004 against a host plant (Brassica oleracea) identified 75 nonredundant, single-copy insertions in protein-coding sequences (CDSs) and intergenic regions. In addition to known virulence factors, full virulence was found to require several additional metabolic pathways and regulatory systems, such as fatty acid degradation, type IV secretion system, cell signaling, and amino acid and nucleotide metabolism. Among the identified pathogenicity-related genes, three of unknown function were found in Xcc 8004-specific chromosomal segments, revealing a direct correlation between genomic dynamics and Xcc virulence. The present combination of comparative and functional genomic analyses provides valuable information about the genetic basis of Xcc pathogenicity, which may offer novel insight toward the development of efficient methods for prevention of this important plant disease.

  8. Quasispecies Analyses of the HIV-1 Near-full-length Genome With Illumina MiSeq.

    Science.gov (United States)

    Ode, Hirotaka; Matsuda, Masakazu; Matsuoka, Kazuhiro; Hachiya, Atsuko; Hattori, Junko; Kito, Yumiko; Yokomaku, Yoshiyuki; Iwatani, Yasumasa; Sugiura, Wataru

    2015-01-01

    Human immunodeficiency virus type-1 (HIV-1) exhibits high between-host genetic diversity and within-host heterogeneity, recognized as quasispecies. Because HIV-1 quasispecies fluctuate in terms of multiple factors, such as antiretroviral exposure and host immunity, analyzing the HIV-1 genome is critical for selecting effective antiretroviral therapy and understanding within-host viral coevolution mechanisms. Here, to obtain HIV-1 genome sequence information that includes minority variants, we sought to develop a method for evaluating quasispecies throughout the HIV-1 near-full-length genome using the Illumina MiSeq benchtop deep sequencer. To ensure the reliability of minority mutation detection, we applied an analysis method of sequence read mapping onto a consensus sequence derived from de novo assembly followed by iterative mapping and subsequent unique error correction. Deep sequencing analyses of an HIV-1 clone showed that the analysis method reduced erroneous base prevalence below 1% in each sequence position and discarded only 1%-frequency sequences throughout the genome. When we evaluated sequences of pol genes from 18 treatment-naïve patients' samples, the deep sequencing results were in agreement with Sanger sequencing and identified numerous additional minority mutations. The results suggest that our deep sequencing method would be suitable for identifying within-host viral population dynamics throughout the genome.
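
    Minority-variant detection of the kind described above comes down to counting bases at each position and keeping non-consensus bases whose frequency clears an error threshold. The sketch below illustrates only that final counting step, assuming reads that are already aligned to equal length. It does not reproduce the authors' pipeline (de novo assembly, iterative mapping, error correction); the function name and input format are hypothetical, and the 1% default simply mirrors the error level quoted in the abstract.

```python
from collections import Counter


def minority_variants(aligned_reads, min_freq=0.01):
    """List (position, base, frequency) for non-majority bases.

    aligned_reads: equal-length strings covering the same region
    (a simplifying assumption; real reads need mapping first).
    min_freq: lowest frequency reported, here 1% by default.
    """
    if not aligned_reads:
        return []
    length = len(aligned_reads[0])
    if any(len(read) != length for read in aligned_reads):
        raise ValueError("reads must be aligned to the same length")

    calls = []
    for pos in range(length):
        # Count only unambiguous bases at this column.
        counts = Counter(r[pos] for r in aligned_reads if r[pos] in "ACGT")
        depth = sum(counts.values())
        if depth == 0:
            continue
        major_base, _ = counts.most_common(1)[0]
        for base, n in sorted(counts.items()):
            freq = n / depth
            if base != major_base and freq >= min_freq:
                calls.append((pos, base, freq))
    return calls
```

    For example, 98 reads of `ACGT` plus 2 reads of `ATGT` yield a single call at position 1 (0-based) for base `T` at frequency 0.02; raising `min_freq` to 0.05 suppresses it.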

  9. Ferns, mosses and liverworts as model systems for light-mediated chloroplast movements.

    Science.gov (United States)

    Suetsugu, Noriyuki; Higa, Takeshi; Wada, Masamitsu

    2016-11-17

    Light-induced chloroplast movement is found in most plant species, including algae and land plants. In land plants with multiple small chloroplasts, under weak light conditions, the chloroplasts move towards the light and accumulate on the periclinal cell walls to efficiently perceive light for photosynthesis (the accumulation response). Under strong light conditions, chloroplasts escape from light to avoid photodamage (the avoidance response). In most plant species, blue light induces chloroplast movement, and phototropin receptor kinases are the blue light receptors. Molecular mechanisms for photoreceptors, signal transduction and chloroplast motility systems are being studied using the model plant Arabidopsis thaliana. However, to further understand the molecular mechanisms and evolutionary history of chloroplast movement in green plants, analyses using other plant systems are required. Here, we review recent works on chloroplast movement in green algae, liverwort, mosses and ferns that provide new insights on chloroplast movement.

  10. New insights into dynamic actin-based chloroplast photorelocation movement.

    Science.gov (United States)

    Kong, Sam-Geun; Wada, Masamitsu

    2011-09-01

    Chloroplast movement is essential for plants to survive under various environmental light conditions. Phototropins, plant-specific blue-light-activated receptor kinases, mediate the response by perceiving light intensity and direction. Recently, novel chloroplast actin (cp-actin) filaments have been identified as playing a pivotal role in the directional chloroplast photorelocation movement. Encouraging progress has recently been made in this field of research through molecular genetics and cell biological analyses. This review describes factors that have been identified as being involved in chloroplast movement and their roles in the regulation of cp-actin filaments, thus providing a basis for reflection on their biochemical activities and functions.

  11. Complete Chloroplast Genome of the Medicinal Plant Paris polyphylla var. chinensis (Melanthiaceae)

    Institute of Scientific and Technical Information of China (English)

    李晓娟; 杨振艳; 黄玉玲; 纪运恒

    2015-01-01

    To characterize the chloroplast genome (cp genome) of Paris polyphylla var. chinensis, it was compared with those of 10 species within Liliales using phylogenomic methods based on complete chloroplast genomes. The results showed that the cp genome of P. polyphylla var. chinensis was 158307 bp in length and displayed a typical quadripartite structure including two inverted repeat regions (IRA and IRB, 27473 bp), one small single-copy region (SSC, 18175 bp) and one large single-copy region (LSC, 85187 bp). It contained 115 unique genes, including 81 protein-coding genes, 30 tRNAs and 4 rRNAs. The genome structure, gene content and gene arrangement of the 10 Liliales species cp genomes were very similar. The cemA gene of P. polyphylla var. chinensis was a pseudogene with poly(A) and CA SSR patterns after the start codon, and the loci of its premature stop codons differ from those of Paris verticillata. In conclusion, the cp genome of P. polyphylla var. chinensis was conserved. The cemA structure and pseudogenization might play an important role in evolution and phylogeny, and the location of the stop codons in cemA was useful for distinguishing P. polyphylla var. chinensis from P. verticillata.

  12. Direct chloroplast sequencing: comparison of sequencing platforms and analysis tools for whole chloroplast barcoding.

    Directory of Open Access Journals (Sweden)

    Marta Brozynska

    Full Text Available Direct sequencing of total plant DNA using next generation sequencing technologies generates a whole chloroplast genome sequence that has the potential to provide a barcode for use in plant and food identification. Advances in DNA sequencing platforms may make this an attractive approach for routine plant identification. The HiSeq (Illumina) and Ion Torrent (Life Technologies) sequencing platforms were used to sequence total DNA from rice to identify polymorphisms in the whole chloroplast genome sequence of a wild rice plant relative to cultivated rice (cv. Nipponbare). Consensus chloroplast sequences were produced by mapping sequence reads to the reference rice chloroplast genome or by de novo assembly and mapping of the resulting contigs to the reference sequence. A total of 122 polymorphisms (SNPs and indels) between the wild and cultivated rice chloroplasts were predicted by these different sequencing and analysis methods. Of these, a total of 102 polymorphisms including 90 SNPs were predicted by both platforms. Indels were more variable with different sequencing methods, with almost all discrepancies found in homopolymers. The Ion Torrent platform gave no apparent false SNP but was less reliable for indels. The methods should be suitable for routine barcoding using appropriate combinations of sequencing platform and data analysis.

  13. Phylogenetic analysis of the genus Avena based on chloroplast intergenic spacer psbA-trnH and single-copy nuclear gene Acc1.

    Science.gov (United States)

    Yan, Hong-Hai; Baum, Bernard R; Zhou, Ping-Ping; Zhao, Jun; Wei, Yu-Ming; Ren, Chang-Zhong; Xiong, Fang-Qiu; Liu, Gang; Zhong, Lin; Zhao, Gang; Peng, Yuan-Ying

    2014-05-01

    Two uncorrelated nucleotide sequences, chloroplast intergenic spacer psbA-trnH and acetyl CoA carboxylase gene (Acc1), were used to perform phylogenetic analyses in 75 accessions of the genus Avena, representing 13 diploids, seven tetraploids, and four hexaploids, by maximum parsimony and Bayesian inference. Phylogenetic analyses based on the chloroplast intergenic spacer psbA-trnH confirmed that the A genome diploid might be the maternal donor of species of the genus Avena. Two haplotypes of the Acc1 gene region were obtained from the AB genome tetraploids, indicating an allopolyploid origin for the tetraploid species. Among the AB genome species, both gene trees revealed differences between Avena agadiriana and the other species, suggesting that an AS genome diploid might be the A genome donor and the other genome diploid donor might be the Ac genome diploid Avena canariensis or the Ad genome diploid Avena damascena. Three haplotypes of the Acc1 gene have been detected among the ACD genome hexaploid species. The haplotype that seems to represent the D genome clustered with the tetraploid species Avena murphyi and Avena maroccana, which supported the CD genomic designation instead of AC for A. murphyi and A. maroccana.

  14. Matching of array CGH and gene expression microarray features for the purpose of integrative genomic analyses

    Directory of Open Access Journals (Sweden)

    van Wieringen Wessel N

    2012-05-01

    Full Text Available Abstract Background An increasing number of genomic studies interrogating more than one molecular level are being published. Bioinformatics follows biological practice, and recent years have seen a surge in methodology for the integrative analysis of genomic data. Often such analyses require knowledge of which elements of one platform link to those of another. Although important, many integrative analyses do not detail the matching of the platforms, or do so insufficiently. Results We describe, illustrate and discuss six matching procedures. They are implemented in the R-package sigaR (available from Bioconductor). The principles underlying the presented matching procedures are generic, and can be combined to form new matching approaches or be applied to the matching of other platforms. Illustration of the matching procedures on a variety of data sets reveals how the procedures differ in the use of the available data, and may even lead to different results for individual genes. Conclusions Matching of data from multiple genomics platforms is an important preprocessing step for many integrative bioinformatic analyses, for which we present six generic procedures, both old and new. They have been implemented in the R-package sigaR, available from Bioconductor.

  15. The mitochondrial genome of Atrijuglans hetaohei Yang (Lepidoptera: Gelechioidea) and related phylogenetic analyses.

    Science.gov (United States)

    Wang, Qiqi; Zhang, Zhengqing; Tang, Guanghui

    2016-04-25

    Complete mitochondrial genome sequences are of great importance for better understanding the genome-level characteristics and phylogenetic relationships among related species. In this study, the complete mitochondrial genome of Atrijuglans hetaohei Yang was sequenced and analyzed. It is 15,379 bp in length (GenBank: KT581634) and contains a typical set of 13 protein-coding genes, 22 tRNA genes, two rRNA genes and a non-coding region (control region). Except for the cox1 gene, which is initiated by a CGA codon, all protein-coding genes start with ATN codons and end with the stop codon T, TA or TAA. All tRNAs have a typical clover-leaf secondary structure, except for trnS1, of which the DHU arm could not form a stable stem-loop structure. The secondary structures of rrnL and rrnS consist of 49 helices and 33 helices, respectively. Phylogenetic analyses of the complete mitochondrial genome sequences and of the amino acid sequences for 13 mitochondrial protein-coding genes among related species support the view that A. hetaohei is more closely related to the Gelechioidea than to the Yponomeutoidea. This result is consistent with a previous classification based on morphology.

  16. FtsZ-less prokaryotic cell division as well as FtsZ- and dynamin-less chloroplast and non-photosynthetic plastid division

    Directory of Open Access Journals (Sweden)

    Shin-ya Miyagishima

    2014-09-01

    Full Text Available The chloroplast division machinery is a mixture of a stromal FtsZ-based complex descended from a cyanobacterial ancestor of chloroplasts and a cytosolic dynamin-related protein (DRP) 5B-based complex derived from the eukaryotic host. Molecular genetic studies have shown that each component of the division machinery is normally essential for chloroplast division. However, several exceptions have been found. In the absence of the FtsZ ring, nonphotosynthetic plastids are able to proliferate, likely by elongation and budding. Depletion of DRP5B impairs, but does not stop, chloroplast division. Chloroplasts in glaucophytes, which possess a peptidoglycan (PG) layer, divide without DRP5B. Certain parasitic eukaryotes possess nonphotosynthetic plastids of secondary endosymbiotic origin, but neither FtsZ nor DRP5B is encoded in their genomes. Elucidation of the FtsZ- and/or DRP5B-less chloroplast division mechanism will lead to a better understanding of the function and evolution of the chloroplast division machinery and the finding of the as-yet-unknown mechanism that is likely involved in chloroplast division. Recent studies have shown that FtsZ was lost from a variety of prokaryotes, many of which lost PG by regressive evolution. In addition, even some of the FtsZ-bearing bacteria are able to divide when FtsZ and PG are depleted experimentally. In some cases, alternative mechanisms for cell division, such as budding by an increase of the cell surface-to-volume ratio, are proposed. Although PG is believed to have been lost from chloroplasts other than in glaucophytes, there is some indirect evidence for the existence of PG in chloroplasts. Such information is also useful for understanding how nonphotosynthetic plastids are able to divide in FtsZ-depleted cells and the reason for the retention of FtsZ in chloroplast division. Here we summarize information to facilitate analyses of FtsZ- and/or DRP5B-less chloroplast and nonphotosynthetic plastid

  17. Chloroplast signaling within, between and beyond cells.

    Directory of Open Access Journals (Sweden)

    Krzysztof Bobik

    2015-10-01

    Full Text Available The most conspicuous function of the plastid is oxygenic photosynthesis of chloroplasts, yet plastids are super-factories that produce a plethora of compounds that are indispensable for proper plant physiology and development. Given their origins as free-living prokaryotes, it is not surprising that the plastid possesses its own genome whose expression is essential to plastid function. This semi-autonomous character of plastids requires the existence of sophisticated regulatory mechanisms that provide reliable communication between them and other cellular compartments. Such intracellular signaling is necessary for coordinating whole-cell responses to constantly varying environmental cues and cellular metabolic needs. This is achieved by plastids acting as receivers and transmitters of specific signals that coordinate expression of the nuclear and plastid genomes according to particular needs. In this review we will consider the so-called retrograde signaling occurring between plastids and nucleus, and between plastids and other organelles. Another important role of the plastid we will discuss is the involvement of plastid signaling in biotic and abiotic stress that, in addition to influencing retrograde signaling, has direct effects on several cellular compartments including the cell wall. We will also review recent evidence pointing to an intriguing function of chloroplasts in regulating intercellular symplasmic transport. Finally, we consider an intriguing yet neglected aspect of plant biology, chloroplast signaling from the perspective of the entire plant. Thus, accumulating evidence highlights that chloroplasts, with their complex signaling pathways, provide a mechanism for exquisite regulation of plant development, metabolism and responses to the environment. As chloroplast processes are targeted for engineering for improved productivity the effect of such modifications on chloroplast signaling will have to be carefully considered in order

  18. Auxin and chloroplast movements.

    Science.gov (United States)

    Eckstein, Aleksandra; Krzeszowiec, Weronika; Waligórski, Piotr; Gabryś, Halina

    2016-03-01

    Auxin is involved in a wide spectrum of physiological processes in plants, including responses controlled by the blue light photoreceptors phototropins: phototropic bending and stomatal movement. However, the role of auxin in phototropin-mediated chloroplast movements has never been studied. To address this question we searched for potential interactions between auxin and the chloroplast movement signaling pathway using different experimental approaches and two model plants, Arabidopsis thaliana and Nicotiana tabacum. We observed that the disturbance of auxin homeostasis by shoot decapitation caused a decrease in chloroplast movement parameters, which could be rescued by exogenous auxin application. In several cases, the impairment of polar auxin transport, by chemical inhibitors or in auxin carrier mutants, had a similar negative effect on chloroplast movements. This inhibition was not correlated with changes in auxin levels. Chloroplast relocations were also affected by the antiauxin p-chlorophenoxyisobutyric acid and mutations in genes encoding some of the elements of the SCF(TIR1)-Aux/IAA auxin receptor complex. The observed changes in chloroplast movement parameters are not prominent, which points to a modulatory role of auxin in this process. Taken together, the obtained results suggest that auxin acts indirectly to regulate chloroplast movements, presumably by regulating gene expression via the SCF(TIR1)-Aux/IAA-ARF pathway. Auxin does not seem to be involved in controlling the expression of phototropins.

  19. Cloning and Analysis of a cDNA Encoding psbL and psbJ Genes in Rice Chloroplast Genome

    Institute of Scientific and Technical Information of China (English)

    顾克余; 罗林广; 苏昌潮; 翟虎渠

    2001-01-01

    A 505 bp cDNA was cloned from the leaves of the rice (Oryza sativa L.) Shanyou 63 combination. DNA sequence analysis showed that it is a part of the rice chloroplast genome. Homology comparison with known sequences in GenBank found that it encodes a 38-amino-acid peptide deduced from the psbL gene and a 40-amino-acid peptide deduced from the psbJ gene of rice chloroplast PSII. Northern hybridization showed that the cDNA was differentially displayed in hybrid F1 and its parental lines.

  20. [Pathological Diagnoses and Whole-genome Sequence Analyses of the Jaagsiekte Sheep Retrovirus in Xinjiang, China].

    Science.gov (United States)

    Yang, Sufang; Liang, Tian; Zhao, Qingliang; Zhang, Dianqing; Si, Junqiang; Zhang, Jing; Yang, Xia; Sheng, Jinliang

    2015-05-01

    To carry out pathologic diagnoses and whole-genome sequence analyses of the Jaagsiekte sheep retrovirus (JSRV) in Xinjiang, China, we first observed sheep suspected to have the JSRV. Then, the extracted virus suspension was observed by transmission electron microscopy (TEM). Total RNAs from lungs of JSRV-infected sheep were extracted and reverse-transcribed using a cDNA synthesis kit. Six pairs of primers were designed according to the exogenous reference virus strain (AF105220). Reverse transcription-polymerase chain reaction was carried out from JSRV-infected tissue, and the whole genome of the JSRV was sequenced. Our results showed: flow of nasal fluid ("wheelbarrow test"); different sizes of adenoma lesions in the lungs; papillary hyperplasia of alveolar epithelial cells; alveolar cavity filled with macrophages; dissolute nuclei in central lesions. TEM revealed JSRV particles with a diameter of 88 nm to 125.4 nm. The full length of the viral genome sequence was 7456 bp. BLAST analyses showed nucleotide homology of 96% and 95% with the representative strains from the USA (AF105220) and UK (AF357971), respectively. Nucleotide homology was 89.8% and 89.9% with the endogenous Jaagsiekte sheep retrovirus Inner Mongolia strain (DQ838493) and USA strain (EF680300), respectively. The specific pathogenic amino-acid sequence "YXXM" was found in the TM district, similar to the exogenous JSRV: this gene has been reported to be oncogenic. This is the first report of the complete genomic sequence of the exogenous JSRV from Xinjiang, and could lay the foundation for study of the biological characteristics and pathogenic mechanisms of the pulmonary adenomatosis virus in sheep.

  1. Identification of the most informative regions of the mitochondrial genome for phylogenetic and coalescent analyses.

    Science.gov (United States)

    Non, A L; Kitchen, A; Mulligan, C J

    2007-09-01

    Analysis of complete mitochondrial genome sequences is becoming increasingly common in genetic studies. The availability of full genome datasets enables an analysis of the information content distributed throughout the mitochondrial genome in order to optimize the research design of future evolutionary studies. The goal of our study was to identify informative regions of the human mitochondrial genome using two criteria: (1) accurate reconstruction of a phylogeny and (2) consistent estimates of time to most recent common ancestor (TMRCA). We created two series of datasets by deleting individual genes of varied length and by deleting 10 equal-size fragments throughout the coding region. Phylogenies were statistically compared to the full-coding-region tree, while coalescent methods were used to estimate the TMRCA and associated credible intervals. Individual fragments important for maintaining a phylogeny similar to the full-coding-region tree encompassed bp 577-2122 and 11,399-16,023, including all or part of 12S rRNA, 16S rRNA, ND4, ND5, ND6, and cytb. The control region only tree was the most poorly resolved with the majority of the tree manifest as an unresolved polytomy. Coalescent estimates of TMRCA were less sensitive to removal of any particular fragment(s) than reconstruction of a consistent phylogeny. Overall, we discovered that half the genome, i.e., bp 3669-11,398, could be removed with no significant change in the phylogeny (p(AU)=0.077) while still maintaining overlap of TMRCA 95% credible intervals. Thus, sequencing a contiguous fragment from bp 11,399 through the control region to bp 3668 would create a dataset that optimizes the information necessary for phylogenetic and coalescent analyses and also takes advantage of the wealth of data already available on the control region.

  2. Exploring photosynthesis evolution by comparative analysis of metabolic networks between chloroplasts and photosynthetic bacteria

    Directory of Open Access Journals (Sweden)

    Hou Jing

    2006-04-01

    Full Text Available Abstract Background Chloroplasts descended from cyanobacteria and have a drastically reduced genome following an endosymbiotic event. Many genes of the ancestral cyanobacterial genome have been transferred to the plant nuclear genome by horizontal gene transfer. However, a selective set of metabolism pathways is maintained in chloroplasts using both chloroplast genome encoded and nuclear genome encoded enzymes. As an organelle specialized for carrying out photosynthesis, does the chloroplast metabolic network have properties adapted for higher efficiency of photosynthesis? We compared metabolic network properties of chloroplasts and prokaryotic photosynthetic organisms, mostly cyanobacteria, based on metabolic maps derived from genome data to identify features of chloroplast network properties that are different from cyanobacteria and to analyze possible functional significance of those features. Results The properties of the entire metabolic network and the sub-network that consists of reactions directly connected to the Calvin Cycle have been analyzed using hypergraph representation. Results showed that the whole metabolic networks in chloroplasts and cyanobacteria both possess small-world network properties. Although the number of compounds and reactions in chloroplasts is less than that in cyanobacteria, the chloroplast's metabolic network has a longer average path length, a larger diameter, and is Calvin Cycle-centered, indicating an overall less-dense network structure with specific and local high density areas in chloroplasts. Moreover, the chloroplast metabolic network exhibits a better modular organization than cyanobacterial ones. Enzymes involved in the same metabolic processes tend to cluster into the same module in chloroplasts. Conclusion In summary, the differences in metabolic network properties may reflect the evolutionary changes during endosymbiosis that led to the improvement of the photosynthesis efficiency in higher plants. Our
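
    The two network statistics named in this abstract, average path length and diameter, can be illustrated with a minimal sketch. It assumes an ordinary undirected graph stored as adjacency lists, which is a simplification of the hypergraph representation the abstract mentions. The toy networks `dense` and `chain` and both function names are hypothetical and are not data or code from the study.

```python
from collections import deque


def bfs_distances(graph, source):
    """Return shortest-path hop counts from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbor in graph[node]:
            if neighbor not in dist:
                dist[neighbor] = dist[node] + 1
                queue.append(neighbor)
    return dist


def path_length_and_diameter(graph):
    """Average shortest-path length and diameter over all reachable node pairs."""
    total, pairs, diameter = 0, 0, 0
    for source in graph:
        for target, d in bfs_distances(graph, source).items():
            if target != source:
                total += d
                pairs += 1
                diameter = max(diameter, d)
    average = total / pairs if pairs else 0.0
    return average, diameter


# Hypothetical toy networks (undirected adjacency lists), not study data.
dense = {
    "a": ["b", "c", "d"],
    "b": ["a", "c", "d"],
    "c": ["a", "b", "d"],
    "d": ["a", "b", "c"],
}
chain = {"a": ["b"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c"]}

if __name__ == "__main__":
    print(path_length_and_diameter(dense))
    print(path_length_and_diameter(chain))
```

    In this toy comparison the chain-like network has the longer average path length and the larger diameter, which is the sense in which the abstract calls a network "less dense".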

  3. Aye-aye population genomic analyses highlight an important center of endemism in northern Madagascar.

    Science.gov (United States)

    Perry, George H; Louis, Edward E; Ratan, Aakrosh; Bedoya-Reina, Oscar C; Burhans, Richard C; Lei, Runhua; Johnson, Steig E; Schuster, Stephan C; Miller, Webb

    2013-04-09

    We performed a population genomics study of the aye-aye, a highly specialized nocturnal lemur from Madagascar. Aye-ayes have low population densities and extensive range requirements that could make this flagship species particularly susceptible to extinction. Therefore, knowledge of genetic diversity and differentiation among aye-aye populations is critical for conservation planning. Such information may also advance our general understanding of Malagasy biogeography, as aye-ayes have the largest species distribution of any lemur. We generated and analyzed whole-genome sequence data for 12 aye-ayes from three regions of Madagascar (North, West, and East). We found that the North population is genetically distinct, with strong differentiation from other aye-ayes over relatively short geographic distances. For comparison, the average FST value between the North and East aye-aye populations, which are separated by only 248 km, is over 2.1 times greater than that observed between human Africans and Europeans. This finding is consistent with prior watershed- and climate-based hypotheses of a center of endemism in northern Madagascar. Taken together, these results suggest a strong and long-term biogeographical barrier to gene flow. Thus, the specific attention that should be directed toward preserving large, contiguous aye-aye habitats in northern Madagascar may also benefit the conservation of other distinct taxonomic units. To help facilitate future ecological- and conservation-motivated population genomic analyses by noncomputational biologists, the analytical toolkit used in this study is available on the Galaxy Web site.
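
    For readers unfamiliar with the FST statistic cited above, the following minimal sketch computes Wright's classical FST for a single biallelic locus in two equally weighted populations, as (HT - HS) / HT. The abstract does not state which estimator the authors used, so this formula, the function name `fst_biallelic` and the allele frequencies are illustrative assumptions only.

```python
def fst_biallelic(p1, p2):
    """Wright's F_ST for one biallelic locus and two equally weighted populations.

    p1 and p2 are the frequencies of the same allele in each population.
    """
    p_bar = (p1 + p2) / 2
    # Expected heterozygosity of the pooled population.
    h_total = 2 * p_bar * (1 - p_bar)
    if h_total == 0:
        return 0.0  # locus is monomorphic overall, so there is no differentiation to measure
    # Mean expected heterozygosity within the two populations.
    h_within = (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2
    return (h_total - h_within) / h_total


if __name__ == "__main__":
    # Hypothetical allele frequencies, not values from the study.
    print(fst_biallelic(0.5, 0.5))  # identical frequencies
    print(fst_biallelic(0.2, 0.8))  # diverged frequencies
    print(fst_biallelic(1.0, 0.0))  # fixed differences
```

    Values near 0 indicate populations with similar allele frequencies, and values near 1 indicate nearly fixed differences. A genome-wide figure such as the one in the abstract would combine many loci.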

  4. Evolutionary trajectories of snake genes and genomes revealed by comparative analyses of five-pacer viper

    Science.gov (United States)

    Yin, Wei; Wang, Zong-ji; Li, Qi-ye; Lian, Jin-ming; Zhou, Yang; Lu, Bing-zheng; Jin, Li-jun; Qiu, Peng-xin; Zhang, Pei; Zhu, Wen-bo; Wen, Bo; Huang, Yi-jun; Lin, Zhi-long; Qiu, Bi-tao; Su, Xing-wen; Yang, Huan-ming; Zhang, Guo-jie; Yan, Guang-mei; Zhou, Qi

    2016-01-01

    Snakes have numerous features distinctive from other tetrapods and a rich history of genome evolution that is still obscure. Here, we report the high-quality genome of the five-pacer viper, Deinagkistrodon acutus, and comparative analyses with other representative snake and lizard genomes. We map the evolutionary trajectories of transposable elements (TEs), developmental genes and sex chromosomes onto the snake phylogeny. TEs exhibit dynamic lineage-specific expansion, and many viper TEs show brain-specific gene expression along with their nearby genes. We detect signatures of adaptive evolution in olfactory, venom and thermal-sensing genes and also functional degeneration of genes associated with vision and hearing. Lineage-specific relaxation of functional constraints on respective Hox and Tbx limb-patterning genes supports fossil evidence for a successive loss of forelimbs then hindlimbs during snake evolution. Finally, we infer that the ZW sex chromosome pair had undergone at least three recombination suppression events in the ancestor of advanced snakes. These results altogether forge a framework for our deep understanding into snakes' history of molecular evolution. PMID:27708285

  5. A new database (GCD) on genome composition for eukaryote and prokaryote genome sequences and their initial analyses.

    Science.gov (United States)

    Kryukov, Kirill; Sumiyama, Kenta; Ikeo, Kazuho; Gojobori, Takashi; Saitou, Naruya

    2012-01-01

    Eukaryote genomes contain many noncoding regions, and they are quite complex. To understand these complexities, we constructed a database, Genome Composition Database, for the whole genome composition statistics for 101 eukaryote genome data, as well as more than 1,000 prokaryote genomes. Frequencies of all possible one to ten oligonucleotides were counted for each genome, and these observed values were compared with expected values computed under observed oligonucleotide frequencies of length 1-4. Deviations from expected values were much larger for eukaryotes than prokaryotes, except for fungal genomes. Mammalian genomes showed the largest deviation among animals. The results of comparison are available online at http://esper.lab.nig.ac.jp/genome-composition-database/.
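
    The observed-versus-expected comparison described in this abstract can be sketched as follows. This is a minimal illustration that uses only single-base frequencies (length 1) to compute the expected value, whereas the abstract describes expectations built from oligonucleotide lengths 1 to 4. The function names and the example sequence are hypothetical and are not taken from the database.

```python
from collections import Counter
from math import prod


def kmer_counts(seq, k):
    """Count overlapping k-mers (oligonucleotides of length k) in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def observed_over_expected(seq, k):
    """Ratio of each k-mer's observed frequency to the frequency expected
    if bases occurred independently at their observed single-base rates."""
    base_freq = {base: n / len(seq) for base, n in Counter(seq).items()}
    counts = kmer_counts(seq, k)
    total = sum(counts.values())
    return {
        kmer: (n / total) / prod(base_freq[base] for base in kmer)
        for kmer, n in counts.items()
    }


if __name__ == "__main__":
    seq = "ATATATATGCGC"  # hypothetical toy sequence, not real genome data
    for kmer, ratio in sorted(observed_over_expected(seq, 2).items()):
        print(kmer, round(ratio, 3))
```

    A ratio well above or below 1 marks an oligonucleotide that is over- or under-represented relative to base composition alone, which is the kind of deviation the abstract reports as larger in eukaryotes than in prokaryotes.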

  6. Comparative Genomic and Transcriptional Analyses of CRISPR Systems Across the Genus Pyrobaculum

    Directory of Open Access Journals (Sweden)

    David L Bernick

    2012-07-01

    Full Text Available Within the domain Archaea, the CRISPR immune system appears to be nearly ubiquitous based on computational genome analyses. Initial studies in bacteria demonstrated that the CRISPR system targets invading plasmid and viral DNA. Recent experiments in the model archaeon Pyrococcus furiosus uncovered a novel RNA-targeting variant of the CRISPR system potentially unique to archaea. Because our understanding of CRISPR system evolution in other archaea is limited, we have taken a comparative genomic and transcriptomic view of the CRISPR arrays across six diverse species within the crenarchaeal genus Pyrobaculum. We present transcriptional data from each of four species in the genus (P. aerophilum, P. islandicum, P. calidifontis, P. arsenaticum), analyzing mature CRISPR-associated small RNA abundance from over 20 arrays. Within the genus, there is remarkable conservation of CRISPR array structure, as well as unique features that have not been studied in other archaeal systems. These unique features include: a nearly invariant CRISPR promoter, conservation of direct repeat families, the 5' polarity of CRISPR-associated small RNA abundance, and a novel CRISPR-specific association with homologues of nurA and herA. These analyses provide a genus-level evolutionary perspective on archaeal CRISPR systems, broadening our understanding beyond existing non-comparative model systems.

  7. Emergence and evolutionary analysis of the human DDR network: implications in comparative genomics and downstream analyses.

    Science.gov (United States)

    Arcas, Aida; Fernández-Capetillo, Oscar; Cases, Ildefonso; Rojas, Ana M

    2014-04-01

    The DNA damage response (DDR) is a crucial signaling network that preserves the integrity of the genome. This network is an ensemble of distinct but often overlapping subnetworks, where different components fulfill distinct functions in precise spatial and temporal scenarios. To understand how these elements have been assembled together in humans, we performed comparative genomic analyses in 47 selected species to trace back their emergence using systematic phylogenetic analyses and estimated gene ages. The emergence of the contribution of posttranslational modifications to the complex regulation of DDR was also investigated. This is the first time a systematic analysis has focused on the evolution of DDR subnetworks as a whole. Our results indicate that a DDR core, mostly constructed around metabolic activities, appeared soon after the emergence of eukaryotes, and that additional regulatory capacities appeared later through a complex evolutionary process. Potential key posttranslational modifications were also in place then, with interacting pairs preferentially appearing at the same evolutionary time, although modifications often led to the subsequent acquisition of new targets. We also found extensive gene loss in essential modules of the regulatory network in fungi, plants, and arthropods, which is important for their validation as model organisms for DDR studies.

  8. Genome-wide association and linkage analyses localize a progressive retinal atrophy locus in Persian cats.

    Science.gov (United States)

    Alhaddad, Hasan; Gandolfi, Barbara; Grahn, Robert A; Rah, Hyung-Chul; Peterson, Carlyn B; Maggs, David J; Good, Kathryn L; Pedersen, Niels C; Lyons, Leslie A

    2014-08-01

    Hereditary eye diseases of animals serve as excellent models of human ocular disorders and assist in the development of gene and drug therapies for inherited forms of blindness. Several primary hereditary eye conditions affecting various ocular tissues and having different rates of progression have been documented in domestic cats. Gene therapy for canine retinopathies has been successful, thus the cat could be a gene therapy candidate for other forms of retinal degenerations. The current study investigates a hereditary, autosomal recessive, retinal degeneration specific to Persian cats. A multi-generational pedigree segregating for this progressive retinal atrophy was genotyped using a 63 K SNP array and analyzed via genome-wide linkage and association methods. A multi-point parametric linkage analysis localized the blindness phenotype to a ~1.75 Mb region with significant LOD scores (Z ≈ 14, θ = 0.00) on cat chromosome E1. Genome-wide TDT, sib-TDT, and case-control analyses also consistently supported significant association within the same region on chromosome E1, which is homologous to human chromosome 17. Using haplotype analysis, a ~1.3 Mb region was identified as highly associated with progressive retinal atrophy in Persian cats. Several genes within the region are reasonable candidates for the causative gene and should be considered for molecular analyses.

  9. Phylogenetic analyses of complete mitochondrial genome sequences suggest a basal divergence of the enigmatic rodent Anomalurus

    Directory of Open Access Journals (Sweden)

    Gissi Carmela

    2007-02-01

    Full Text Available Abstract Background Phylogenetic relationships between Lagomorpha, Rodentia and Primates and their allies (Euarchontoglires) have long been debated. While it is now generally agreed that Rodentia constitutes a monophyletic sister-group of Lagomorpha and that this clade (Glires) is sister to Primates and Dermoptera, higher-level relationships within Rodentia remain contentious. Results We have sequenced and performed extensive evolutionary analyses on the mitochondrial genome of the scaly-tailed flying squirrel Anomalurus sp., an enigmatic rodent whose phylogenetic affinities have been obscure and extensively debated. Our phylogenetic analyses of the coding regions of available complete mitochondrial genome sequences from Euarchontoglires suggest that Anomalurus is a sister taxon to the Hystricognathi, and that this clade represents the most basal divergence among sampled Rodentia. Bayesian dating methods incorporating a relaxed molecular clock provide divergence-time estimates which are consistently in agreement with the fossil record and which indicate a rapid radiation within Glires around 60 million years ago. Conclusion Taken together, the data presented provide a working hypothesis as to the phylogenetic placement of Anomalurus, underline the utility of mitochondrial sequences in the resolution of even relatively deep divergences and go some way to explaining the difficulty of conclusively resolving higher-level relationships within Glires with available data and methodologies.

  10. Genome-wide analyses reveal a role for peptide hormones in planarian germline development.

    Directory of Open Access Journals (Sweden)

    James J Collins

    Full Text Available Bioactive peptides (i.e., neuropeptides or peptide hormones) represent the largest class of cell-cell signaling molecules in metazoans and are potent regulators of neural and physiological function. In vertebrates, peptide hormones play an integral role in endocrine signaling between the brain and the gonads that controls reproductive development, yet few of these molecules have been shown to influence reproductive development in invertebrates. Here, we define a role for peptide hormones in controlling reproductive physiology of the model flatworm, the planarian Schmidtea mediterranea. Based on our observation that defective neuropeptide processing results in defects in reproductive system development, we employed peptidomic and functional genomic approaches to characterize the planarian peptide hormone complement, identifying 51 prohormone genes and validating 142 peptides biochemically. Comprehensive in situ hybridization analyses of prohormone gene expression revealed the unanticipated complexity of the flatworm nervous system and identified a prohormone specifically expressed in the nervous system of sexually reproducing planarians. We show that this member of the neuropeptide Y superfamily is required for the maintenance of mature reproductive organs and differentiated germ cells in the testes. Additionally, comparative analyses of our biochemically validated prohormones with the genomes of the parasitic flatworms Schistosoma mansoni and Schistosoma japonicum identified new schistosome prohormones and validated half of all predicted peptide-encoding genes in these parasites. These studies describe the peptide hormone complement of a flatworm on a genome-wide scale and reveal a previously uncharacterized role for peptide hormones in flatworm reproduction. Furthermore, they suggest new opportunities for using planarians as free-living models for understanding the reproductive biology of flatworm parasites.

  11. Deciphering Clostridium tyrobutyricum Metabolism Based on the Whole-Genome Sequence and Proteome Analyses

    Directory of Open Access Journals (Sweden)

    Joungmin Lee

    2016-06-01

    Full Text Available Clostridium tyrobutyricum is a Gram-positive anaerobic bacterium that efficiently produces butyric acid and is considered a promising host for anaerobic production of bulk chemicals. Due to limited knowledge on the genetic and metabolic characteristics of this strain, however, little progress has been made in metabolic engineering of this strain. Here we report the complete genome sequence of C. tyrobutyricum KCTC 5387 (ATCC 25755), which consists of a 3.07-Mbp chromosome and a 63-kbp plasmid. The results of genomic analyses suggested that C. tyrobutyricum produces butyrate from butyryl-coenzyme A (butyryl-CoA) through acetate reassimilation by CoA transferase, differently from Clostridium acetobutylicum, which uses the phosphotransbutyrylase-butyrate kinase pathway; this was validated by reverse transcription-PCR (RT-PCR) of related genes, protein expression levels, in vitro CoA transferase assay, and fed-batch fermentation. In addition, the changes in protein expression levels during the course of batch fermentations on glucose were examined by shotgun proteomics. Unlike C. acetobutylicum, the expression levels of proteins involved in glycolytic and fermentative pathways in C. tyrobutyricum did not decrease even at the stationary phase. Proteins related to energy conservation mechanisms, including Rnf complex, NfnAB, and pyruvate-phosphate dikinase that are absent in C. acetobutylicum, were identified. Such features explain why this organism can produce butyric acid to a much higher titer and better tolerate toxic metabolites. This study presenting the complete genome sequence, global protein expression profiles, and genome-based metabolic characteristics during the batch fermentation of C. tyrobutyricum will be valuable in designing strategies for metabolic engineering of this strain.

  12. Molecular Characterization of Five Potyviruses Infecting Korean Sweet Potatoes Based on Analyses of Complete Genome Sequences

    Directory of Open Access Journals (Sweden)

    Hae-Ryun Kwak

    2015-12-01

    Full Text Available Sweet potatoes (Ipomoea batatas L.) are grown extensively, in tropical and temperate regions, and are important food crops worldwide. In Korea, potyviruses, including Sweet potato feathery mottle virus (SPFMV), Sweet potato virus C (SPVC), Sweet potato virus G (SPVG), Sweet potato virus 2 (SPV2), and Sweet potato latent virus (SPLV), have been detected in sweet potato fields at a high (~95%) incidence. In the present work, complete genome sequences of 18 isolates, representing the five potyviruses mentioned above, were compared with previously reported genome sequences. The complete genomes consisted of 10,081 to 10,830 nucleotides, excluding the poly-A tails. Their genomic organizations were typical of the Potyvirus genus, including one target open reading frame coding for a putative polyprotein. Based on phylogenetic analyses and sequence comparisons, the Korean SPFMV isolates belonged to the strains RC and O with >98% nucleotide sequence identity. Korean SPVC isolates had 99% identity to the Japanese isolate SPVC-Bungo and 70% identity to the SPFMV isolates. The Korean SPVG isolates showed 99% identity to the three previously reported SPVG isolates. Korean SPV2 isolates had 97% identity to the SPV2 GWB-2 isolate from the USA. Korean SPLV isolates had a relatively low (88%) nucleotide sequence identity with the Taiwanese SPLV-TW isolates, and they were phylogenetically distantly related to SPFMV isolates. Recombination analysis revealed that possible recombination events occurred in the P1, HC-Pro and NIa-NIb regions of SPFMV and SPLV isolates and these regions were identified as hotspots for recombination in the sweet potato potyviruses.

  13. Complete mitochondrial genome sequences of three bats species and whole genome mitochondrial analyses reveal patterns of codon bias and lend support to a basal split in Chiroptera.

    Science.gov (United States)

    Meganathan, P R; Pagan, Heidi J T; McCulloch, Eve S; Stevens, Richard D; Ray, David A

    2012-01-15

    Order Chiroptera is a unique group of mammals whose members have attained self-powered flight as their main mode of locomotion. Much speculation persists regarding bat evolution; however, lack of sufficient molecular data hampers evolutionary and conservation studies. Of ~1200 species, complete mitochondrial genome sequences are available for only eleven. Additional sequences should be generated if we are to resolve many questions concerning these fascinating mammals. Herein, we describe the complete mitochondrial genomes of three bats: Corynorhinus rafinesquii, Lasiurus borealis and Artibeus lituratus. We also compare the currently available mitochondrial genomes and analyze codon usage in Chiroptera. C. rafinesquii, L. borealis and A. lituratus mitochondrial genomes are 16438 bp, 17048 bp and 16709 bp, respectively. Genome organization and gene arrangements are similar to other bats. Phylogenetic analyses using complete mitochondrial genome sequences support previously established phylogenetic relationships and suggest utility in future studies focusing on the evolutionary aspects of these species. Comprehensive analyses of available bat mitochondrial genomes reveal distinct nucleotide patterns and synonymous codon preferences corresponding to different chiropteran families. These patterns suggest that mutational and selection forces are acting to different extents within Chiroptera and shape their mitochondrial genomes.
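    The synonymous codon preferences mentioned in the record above are commonly summarised with relative synonymous codon usage (RSCU): the observed count of a codon divided by the count expected if all codons in its synonymous family were used equally. The following is a minimal sketch of that calculation, not the authors' pipeline; the function names `codon_counts` and `rscu` and the toy sequence are hypothetical, and the synonymous family must be supplied by the caller because the appropriate genetic code (for example the vertebrate mitochondrial code) is not encoded here.

```python
from collections import Counter


def codon_counts(seq):
    """Count in-frame codons of a coding sequence, ignoring a trailing partial codon."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return Counter(seq[i:i + 3] for i in range(0, usable, 3))


def rscu(counts, family):
    """Relative synonymous codon usage for one synonymous codon family.

    RSCU = observed count / (family total / family size).
    A value of 1.0 means no bias; values above 1.0 mean the codon is preferred.
    """
    total = sum(counts[c] for c in family)
    if total == 0:
        # Amino acid not used in this sequence: report zeros rather than divide by zero.
        return {c: 0.0 for c in family}
    expected = total / len(family)
    return {c: counts[c] / expected for c in family}


# Toy example (hypothetical sequence): four leucine codons from the CTN family.
leu_family = ["CTA", "CTC", "CTG", "CTT"]
example = rscu(codon_counts("CTACTACTTCTG"), leu_family)
```

    In this toy case CTA appears twice among four leucine codons, so its RSCU is 2.0, while the unused CTC scores 0.0.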

  14. Comparative genome analyses reveal distinct structure in the saltwater crocodile MHC.

    Science.gov (United States)

    Jaratlerdsiri, Weerachai; Deakin, Janine; Godinez, Ricardo M; Shan, Xueyan; Peterson, Daniel G; Marthey, Sylvain; Lyons, Eric; McCarthy, Fiona M; Isberg, Sally R; Higgins, Damien P; Chong, Amanda Y; John, John St; Glenn, Travis C; Ray, David A; Gongora, Jaime

    2014-01-01

    The major histocompatibility complex (MHC) is a dynamic genome region with an essential role in the adaptive immunity of vertebrates, especially antigen presentation. The MHC is generally divided into subregions (classes I, II and III) containing genes of similar function across species, but with different gene number and organisation. Crocodylia (crocodilians) are widely distributed and represent an evolutionary distinct group among higher vertebrates, but the genomic organisation of MHC within this lineage has been largely unexplored. Here, we studied the MHC region of the saltwater crocodile (Crocodylus porosus) and compared it with that of other taxa. We characterised genomic clusters encompassing MHC class I and class II genes in the saltwater crocodile based on sequencing of bacterial artificial chromosomes. Six gene clusters spanning ∼452 kb were identified to contain nine MHC class I genes, six MHC class II genes, three TAP genes, and a TRIM gene. These MHC class I and class II genes were in separate scaffold regions and were greater in length (2-6 times longer) than their counterparts in well-studied fowl B loci, suggesting that the compaction of avian MHC occurred after the crocodilian-avian split. Comparative analyses between the saltwater crocodile MHC and that from the alligator and gharial showed large syntenic areas (>80% identity) with similar gene order. Comparisons with other vertebrates showed that the saltwater crocodile had MHC class I genes located along with TAP, consistent with birds studied. Linkage between MHC class I and TRIM39 observed in the saltwater crocodile resembled MHC in eutherians compared, but absent in avian MHC, suggesting that the saltwater crocodile MHC appears to have gene organisation intermediate between these two lineages. These observations suggest that the structure of the saltwater crocodile MHC, and other crocodilians, can help determine the MHC that was present in the ancestors of archosaurs.

  15. Comparative genome analyses reveal distinct structure in the saltwater crocodile MHC.

    Directory of Open Access Journals (Sweden)

    Weerachai Jaratlerdsiri

    Full Text Available The major histocompatibility complex (MHC) is a dynamic genome region with an essential role in the adaptive immunity of vertebrates, especially antigen presentation. The MHC is generally divided into subregions (classes I, II and III) containing genes of similar function across species, but with different gene number and organisation. Crocodylia (crocodilians) are widely distributed and represent an evolutionary distinct group among higher vertebrates, but the genomic organisation of MHC within this lineage has been largely unexplored. Here, we studied the MHC region of the saltwater crocodile (Crocodylus porosus) and compared it with that of other taxa. We characterised genomic clusters encompassing MHC class I and class II genes in the saltwater crocodile based on sequencing of bacterial artificial chromosomes. Six gene clusters spanning ∼452 kb were identified to contain nine MHC class I genes, six MHC class II genes, three TAP genes, and a TRIM gene. These MHC class I and class II genes were in separate scaffold regions and were greater in length (2-6 times longer) than their counterparts in well-studied fowl B loci, suggesting that the compaction of avian MHC occurred after the crocodilian-avian split. Comparative analyses between the saltwater crocodile MHC and that from the alligator and gharial showed large syntenic areas (>80% identity) with similar gene order. Comparisons with other vertebrates showed that the saltwater crocodile had MHC class I genes located along with TAP, consistent with birds studied. Linkage between MHC class I and TRIM39 observed in the saltwater crocodile resembled MHC in eutherians compared, but absent in avian MHC, suggesting that the saltwater crocodile MHC appears to have gene organisation intermediate between these two lineages. These observations suggest that the structure of the saltwater crocodile MHC, and other crocodilians, can help determine the MHC that was present in the ancestors of archosaurs.

  16. Genome-wide meta-analyses identify multiple loci associated with smoking behavior.

    LENUS (Irish Health Repository)

    2010-05-01

    Consistent but indirect evidence has implicated genetic factors in smoking behavior. We report meta-analyses of several smoking phenotypes within cohorts of the Tobacco and Genetics Consortium (n = 74,053). We also partnered with the European Network of Genetic and Genomic Epidemiology (ENGAGE) and Oxford-GlaxoSmithKline (Ox-GSK) consortia to follow up the 15 most significant regions (n > 140,000). We identified three loci associated with number of cigarettes smoked per day. The strongest association was a synonymous 15q25 SNP in the nicotinic receptor gene CHRNA3 (rs1051730[A], beta = 1.03, standard error (s.e.) = 0.053, P = 2.8 × 10^-73). Two 10q25 SNPs (rs1329650[G], beta = 0.367, s.e. = 0.059, P = 5.7 × 10^-10; and rs1028936[A], beta = 0.446, s.e. = 0.074, P = 1.3 × 10^-9) and one 9q13 SNP in EGLN2 (rs3733829[G], beta = 0.333, s.e. = 0.058, P = 1.0 × 10^-8) also exceeded genome-wide significance for cigarettes per day. For smoking initiation, eight SNPs exceeded genome-wide significance, with the strongest association at a nonsynonymous SNP in BDNF on chromosome 11 (rs6265[C], odds ratio (OR) = 1.06, 95% confidence interval (CI) 1.04-1.08, P = 1.8 × 10^-8). One SNP located near DBH on chromosome 9 (rs3025343[G], OR = 1.12, 95% CI 1.08-1.18, P = 3.6 × 10^-8) was significantly associated with smoking cessation.

  17. Arabidopsis thaliana leaves with altered chloroplast numbers and chloroplast movement exhibit impaired adjustments to both low and high light.

    Science.gov (United States)

    Königer, Martina; Delamaide, Joy A; Marlow, Elizabeth D; Harris, Gary C

    2008-01-01

    The effects of chloroplast number and size on the capacity for blue light-dependent chloroplast movement, the ability to increase light absorption under low light, and the susceptibility to photoinhibition were investigated in Arabidopsis thaliana. Leaves of wild-type and chloroplast number mutants with mean chloroplast numbers ranging from 120 to two per mesophyll cell were analysed. Chloroplast movement was monitored as changes in light transmission through the leaves. Light transmission was used as an indicator of the ability of leaves to optimize light absorption. The ability of leaves to deal with 3 h of high light stress at 10 °C and their capacity to recover in low light was determined by measuring photochemical efficiencies of PSII using chlorophyll a fluorescence. Chloroplast movement was comparable in leaves ranging in chloroplast numbers from 120 to 30 per mesophyll cell: the final light transmission levels after exposure to 0.1 (accumulation response) and 100 µmol photons m^-2 s^-1 (avoidance response) were indistinguishable, the chloroplasts responded quickly to small increases in light intensity and the kinetics of movement were similar. However, when chloroplast numbers per mesophyll cell decreased to 18 or below, the accumulation response was significantly reduced. The avoidance response was only impaired in mutants with nine or fewer chloroplasts, both in terms of final transmission levels and the speed of movement. Only mutants lacking both blue light receptors (phot1/phot2) or those with drastically reduced chloroplast numbers and severely impacted avoidance responses showed a reduced ability to recover from high light stress.

  18. In silico phylogenetic and virulence gene profile analyses of avian pathogenic Escherichia coli genome sequences

    Directory of Open Access Journals (Sweden)

    Thaís C.G. Rojas

    2014-02-01

    Full Text Available Avian pathogenic Escherichia coli (APEC) infections are responsible for significant losses in the poultry industry worldwide. A zoonotic risk has been attributed to APEC strains because they present similarities to extraintestinal pathogenic E. coli (ExPEC) associated with illness in humans, mainly urinary tract infections and neonatal meningitis. Here, we present in silico analyses with pathogenic E. coli genome sequences, including recently available APEC genomes. The phylogenetic tree, based on multi-locus sequence typing (MLST) of seven housekeeping genes, revealed high diversity in the allelic composition. Nevertheless, despite this diversity, the phylogenetic tree was able to cluster the different pathotypes together. An in silico virulence gene profile was also determined for each of these strains, through the presence or absence of 83 well-known virulence genes/traits described in pathogenic E. coli strains. The MLST phylogeny and the virulence gene profiles demonstrated a certain genetic similarity between Brazilian APEC strains, APEC isolated in the United States, UPEC (uropathogenic E. coli) and diarrheagenic strains isolated from humans. This correlation corroborates and reinforces the zoonotic potential hypothesis proposed for APEC.

  19. Genome-Wide Identification, Evolutionary, and Expression Analyses of Histone H3 Variants in Plants

    Directory of Open Access Journals (Sweden)

    Jinteng Cui

    2015-01-01

    Full Text Available Histone variants alter the nucleosome structure and play important roles in chromosome segregation, transcription, DNA repair, and sperm compaction. Histone H3 is encoded by many genes in most eukaryotic species and is the histone that contains the largest variety of posttranslational modifications. Compared with the metazoan H3 variants, little is known about the complex evolutionary history of H3 variant proteins in plants. Here, we report the identification and the evolutionary and expression analyses of histone H3 variants from genomes in major branches of the plant tree of life. First, we identified all the histone three related (HTR) genes from the examined genomes; we then classified the variants into four groups (centromeric H3, H3.1, H3.3 and H3-like) by phylogenetic analysis, intron information, and alignment. We further demonstrated that the H3 variants have evolved under strong purifying selection, indicating the conservation of HTR proteins. Expression analysis revealed that HTR genes have a wide expression profile in maize and rice development and play important roles in development.

  20. Phylogenetic analyses of endoparasitic Acanthocephala based on mitochondrial genomes suggest secondary loss of sensory organs.

    Science.gov (United States)

    Weber, Mathias; Wey-Fabrizius, Alexandra R; Podsiadlowski, Lars; Witek, Alexander; Schill, Ralph O; Sugár, László; Herlyn, Holger; Hankeln, Thomas

    2013-01-01

    The metazoan taxon Syndermata (Monogononta, Bdelloidea, Seisonidea, Acanthocephala) comprises species with vastly different lifestyles. The focus of this study is on the phylogeny within the syndermatan subtaxon Acanthocephala (thorny-headed worms, obligate endoparasites). In order to investigate the controversially discussed phylogenetic relationships of acanthocephalan subtaxa we have sequenced the mitochondrial (mt) genomes of Echinorhynchus truttae (Palaeacanthocephala), Paratenuisentis ambiguus (Eoacanthocephala), Macracanthorhynchus hirudinaceus (Archiacanthocephala), and Philodina citrina (Bdelloidea). In doing so, we present the largest molecular phylogenetic dataset so far for this question comprising all major subgroups of Acanthocephala. Alongside with publicly available mt genome data of four additional syndermatans as well as 18 other lophotrochozoan (spiralian) taxa and one outgroup representative, the derived protein-coding sequences were used for Maximum Likelihood as well as Bayesian phylogenetic analyses. We achieved entirely congruent results, whereupon monophyletic Archiacanthocephala represent the sister taxon of a clade comprising Eoacanthocephala and monophyletic Palaeacanthocephala (Echinorhynchida). This topology suggests the secondary loss of lateral sensory organs (sensory pores) within Palaeacanthocephala and is further in line with the emergence of apical sensory organs in the stem lineage of Archiacanthocephala.

  1. Genomic and secretomic analyses reveal unique features of the lignocellulolytic enzyme system of Penicillium decumbens.

    Science.gov (United States)

    Liu, Guodong; Zhang, Lei; Wei, Xiaomin; Zou, Gen; Qin, Yuqi; Ma, Liang; Li, Jie; Zheng, Huajun; Wang, Shengyue; Wang, Chengshu; Xun, Luying; Zhao, Guo-Ping; Zhou, Zhihua; Qu, Yinbo

    2013-01-01

    Many Penicillium species can produce extracellular enzyme systems with good lignocellulose hydrolysis performance. However, these species and their enzyme systems are still poorly understood and explored owing to the lack of genetic information. Here, we present the genomic and secretomic analyses of Penicillium decumbens, which has been used in industrial production of lignocellulolytic enzymes in China for more than fifteen years. Comparative genomics analysis with the phylogenetically most similar species Penicillium chrysogenum revealed that P. decumbens has evolved with more genes involved in plant cell wall degradation, but fewer genes in cellular metabolism and regulation. Compared with the widely used cellulase producer Trichoderma reesei, P. decumbens has a lignocellulolytic enzyme system with more diverse components, particularly for cellulose binding domain-containing proteins and hemicellulases. Further, proteomic analysis of secretomes revealed that P. decumbens produced significantly more lignocellulolytic enzymes in the medium with cellulose-wheat bran as the carbon source than with glucose. The results expand our knowledge on the genetic information of lignocellulolytic enzyme systems in Penicillium species, and will facilitate rational strain improvement for the production of highly efficient enzyme systems used in lignocellulose utilization from Penicillium species.

  2. Extending pathways and processes using molecular interaction networks to analyse cancer genome data

    Directory of Open Access Journals (Sweden)

    Krasnogor Natalio

    2010-12-01

    Full Text Available Abstract Background Cellular processes and pathways, whose deregulation may contribute to the development of cancers, are often represented as cascades of proteins transmitting a signal from the cell surface to the nucleus. However, recent functional genomic experiments have identified thousands of interactions for the signalling canonical proteins, challenging the traditional view of pathways as independent functional entities. Combining information from pathway databases and interaction networks obtained from functional genomic experiments is therefore a promising strategy to obtain more robust pathway and process representations, facilitating the study of cancer-related pathways. Results We present a methodology for extending pre-defined protein sets representing cellular pathways and processes by mapping them onto a protein-protein interaction network, and extending them to include densely interconnected interaction partners. The added proteins display distinctive network topological features and molecular function annotations, and can be proposed as putative new components, and/or as regulators of the communication between the different cellular processes. Finally, these extended pathways and processes are used to analyse their enrichment in pancreatic mutated genes. Significant associations between mutated genes and certain processes are identified, enabling an analysis of the influence of previously non-annotated cancer mutated genes. Conclusions The proposed method for extending cellular pathways helps to explain the functions of cancer mutated genes by exploiting the synergies of canonical knowledge and large-scale interaction data.
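    The idea of extending a pre-defined protein set with densely interconnected interaction partners can be illustrated with a small sketch. It is not the authors' published criterion: the function name `extend_pathway` and the two thresholds (`min_links`, `min_fraction`) are assumptions chosen for illustration, and the network is a hypothetical toy graph stored as a dictionary of neighbour sets. A candidate protein is added when it has enough interactions with pathway members and when those interactions make up a large enough share of all its interactions.

```python
def extend_pathway(network, pathway, min_links=2, min_fraction=0.5):
    """Return the pathway plus candidate proteins densely connected to it.

    network      : dict mapping each protein to the set of its interaction partners
    pathway      : iterable of proteins in the pre-defined set
    min_links    : minimum number of partners inside the pathway (assumed threshold)
    min_fraction : minimum share of a candidate's partners inside the pathway
                   (assumed threshold)

    This is a single pass: proteins added here are not used to recruit
    further candidates.
    """
    pathway = set(pathway)
    added = set()
    for protein, partners in network.items():
        if protein in pathway or not partners:
            continue
        inside = len(partners & pathway)
        if inside >= min_links and inside / len(partners) >= min_fraction:
            added.add(protein)
    return pathway | added


# Hypothetical toy network: X interacts only with pathway members,
# Y has one link into the pathway and two links elsewhere.
toy_network = {
    "A": {"B", "C", "X"},
    "B": {"A", "C", "X"},
    "C": {"A", "B", "Y"},
    "X": {"A", "B"},
    "Y": {"C", "Z1", "Z2"},
    "Z1": {"Y"},
    "Z2": {"Y"},
}
extended = extend_pathway(toy_network, {"A", "B", "C"})
```

    With these thresholds only X joins the set, because both of its partners are pathway members, whereas Y has a single link into the pathway.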

  3. Genomic and secretomic analyses reveal unique features of the lignocellulolytic enzyme system of Penicillium decumbens.

    Directory of Open Access Journals (Sweden)

    Guodong Liu

    Full Text Available Many Penicillium species can produce extracellular enzyme systems with good lignocellulose hydrolysis performance. However, these species and their enzyme systems are still poorly understood and explored owing to the lack of genetic information. Here, we present the genomic and secretomic analyses of Penicillium decumbens, which has been used in industrial production of lignocellulolytic enzymes in China for more than fifteen years. Comparative genomics analysis with the phylogenetically most similar species Penicillium chrysogenum revealed that P. decumbens has evolved with more genes involved in plant cell wall degradation, but fewer genes in cellular metabolism and regulation. Compared with the widely used cellulase producer Trichoderma reesei, P. decumbens has a lignocellulolytic enzyme system with more diverse components, particularly for cellulose binding domain-containing proteins and hemicellulases. Further, proteomic analysis of secretomes revealed that P. decumbens produced significantly more lignocellulolytic enzymes in the medium with cellulose-wheat bran as the carbon source than with glucose. The results expand our knowledge on the genetic information of lignocellulolytic enzyme systems in Penicillium species, and will facilitate rational strain improvement for the production of highly efficient enzyme systems used in lignocellulose utilization from Penicillium species.

  4. Crowdsourcing genomic analyses of ash and ash dieback – power to the people

    Directory of Open Access Journals (Sweden)

    MacLean Dan

    2013-02-01

    Full Text Available Abstract Ash dieback is a devastating fungal disease of ash trees that has swept across Europe and recently reached the UK. This emergent pathogen has received little study in the past and its effect threatens to overwhelm the ash population. In response to this we have produced some initial genomics datasets and taken the unusual step of releasing them to the scientific community for analysis without first performing our own. In this manner we hope to ‘crowdsource’ analyses and bring the expertise of the community to bear on this problem as quickly as possible. Our data has been released through our website at oadb.tsl.ac.uk and a public GitHub repository.

  5. Mitochondrial Genome Analyses Suggest Multiple Trichuris Species in Humans, Baboons, and Pigs from Different Geographical Regions.

    Directory of Open Access Journals (Sweden)

    Mohamed B F Hawash

    Full Text Available The whipworms Trichuris trichiura and Trichuris suis are two parasitic nematodes of humans and pigs, respectively. Although whipworms in human and non-human primates historically have been referred to as T. trichiura, recent reports suggest that several Trichuris spp. are found in primates. We sequenced and annotated complete mitochondrial genomes of Trichuris recovered from a human in Uganda, an olive baboon in the US, a hamadryas baboon in Denmark, and two pigs from Denmark and Uganda. Comparative analyses using other published mitochondrial genomes of Trichuris recovered from a human and a porcine host in China and from a François' leaf monkey (China) were performed, including phylogenetic analyses and pairwise genetic and amino acid distances. Genetic and protein distances between human Trichuris in Uganda and China were high (~19% and 15%, respectively), suggesting that they represented different species. Trichuris from the olive baboon in US was genetically related to human Trichuris in China, while the other from the hamadryas baboon in Denmark was nearly identical to human Trichuris from Uganda. Baboon-derived Trichuris was genetically distinct from Trichuris from François' leaf monkey, suggesting multiple whipworm species circulating among non-human primates. The genetic and protein distances between pig Trichuris from Denmark and other regions were roughly 9% and 6%, respectively, while Chinese and Ugandan whipworms were more closely related. Our results indicate that Trichuris species infecting humans and pigs are phylogenetically distinct across geographical regions, which might have important implications for the implementation of suitable and effective control strategies in different regions. Moreover, we provide support for the hypothesis that Trichuris infecting primates represents a complex of cryptic species with some species being able to infect both humans and non-human primates.
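    Pairwise genetic distances of the kind quoted in the record above are, in their simplest form, the proportion of aligned sites at which two sequences differ (the uncorrected p-distance). The sketch below shows that calculation under stated assumptions: the sequences are already aligned to equal length, gaps are written as "-", and sites with a gap in either sequence are skipped. The function name `p_distance` and the example sequences are hypothetical, and the record does not say which distance measure or substitution model the authors used.

```python
def p_distance(a, b, gap="-"):
    """Uncorrected pairwise distance: differing sites / compared sites.

    Both sequences must come from the same alignment (equal length).
    Sites where either sequence has a gap are ignored.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == gap or y == gap:
            continue
        compared += 1
        if x != y:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


# Hypothetical aligned fragments differing at 2 of 10 sites.
distance = p_distance("ACGTACGTAC", "ACGTTCGTAA")
```

    The same function works for aligned amino acid sequences, which is one simple way to obtain a protein distance alongside the nucleotide one.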

  6. Chloroplast Redox Poise

    DEFF Research Database (Denmark)

    Steccanella, Verdiana

    The redox state of the chloroplast is maintained by a delicate balance between energy production and consumption and is affected by the need to avoid increased production of reactive oxygen species (ROS). Redox power and ROS generated in the chloroplast are essential for maintaining physiological metabolic pathways and for optimizing chloroplast functions. The redox poise of photosynthetic electron transport components like plastoquinone is crucial to initiate signaling cascades and might also be involved in key biosynthetic pathways such as chlorophyll biosynthesis. We, therefore, explored... the redox status of the plastoquinone pool and chlorophyll biosynthesis. Furthermore, in the plant cell, the equilibrium between redox reactions and ROS signals is also maintained by various balancing mechanisms among which the thioredoxin reductase-thioredoxin system (TR-Trx) stands out as a mediator...

  7. Automatic Chloroplast Movement Analysis.

    Science.gov (United States)

    Johansson, Henrik; Zeidler, Mathias

    2016-01-01

    In response to low or high intensities of light, the chloroplasts in the mesophyll cells of the leaf are able to increase or decrease their exposure to light by accumulating at the upper and lower sides or along the side walls of the cell respectively. This movement, regulated by the phototropin blue light photoreceptors phot1 and phot2, results in a decreased or increased transmission of light through the leaf. This way the plant is able to optimize harvesting of the incoming light or avoid damage caused by excess light. Here we describe a method that indirectly measures the movement of chloroplasts by taking advantage of the resulting change in leaf transmittance. By using a microplate reader, quantitative measurements of chloroplast accumulation or avoidance can be monitored over time, for multiple samples with relatively little hands-on time.
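    The transmittance-based readout described in the record above can be sketched as a simple calculation: average a few readings taken before the light treatment as a baseline, then report each later reading as a change from that baseline. A decrease points to accumulation and an increase to avoidance, as stated in the abstract. The function names, the number of baseline readings, the tolerance, and the example values are all assumptions for illustration; they are not taken from the published method.

```python
def transmittance_change(readings, n_baseline=3):
    """Change in leaf transmittance relative to the mean of the first readings.

    readings   : transmittance values over time for one sample (e.g. percent)
    n_baseline : number of initial readings averaged as the baseline (assumed)
    """
    if n_baseline < 1 or len(readings) <= n_baseline:
        raise ValueError("need at least one reading after the baseline window")
    baseline = sum(readings[:n_baseline]) / n_baseline
    return [r - baseline for r in readings[n_baseline:]]


def classify_response(delta, tolerance=0.5):
    """Label a transmittance change; the tolerance is an arbitrary illustrative value."""
    if delta > tolerance:
        return "avoidance"      # more light passes through the leaf
    if delta < -tolerance:
        return "accumulation"   # less light passes through the leaf
    return "no clear response"


# Hypothetical readings: three baseline points, then two points under low light.
deltas = transmittance_change([10.0, 10.0, 10.0, 9.0, 8.5])
labels = [classify_response(d) for d in deltas]
```

    Running the same two functions over every well of a plate would give one time course per sample.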

  8. Transformation of phaG and phaC Genes into Tobacco Chloroplast Genome and Genetic Analysis

    Institute of Scientific and Technical Information of China (English)

    王玉华; 吴忠义; 张秀海; 王永勤; 黄丛林; 贾敬芬

    2009-01-01

    present, novel efforts are focused on using the transgenic plants as bioreactors to produce PHAs. Both 3-hydroxyacyl-CoA-ACP-transferase and type II PHA synthase are the key enzymes for mcl-PHA biosynthesis. The gene phaG encoding 3-hydroxyacyl-CoA-ACP-transferase was placed under the control of psbA-pro and psbA-ter of rice to construct the phaG expression cassette, and the gene phaC encoding type II PHA synthase was placed under the control of prm and rbcL-ter of rice to construct the phaC expression cassette; both were ligated with the screening marker gene aadA expression cassette prm-aadA-TpsbA-ter. These recombined fragments were cloned between the plastid rbcL and accD genes of tobacco for targeting to the large single copy region of the chloroplast genome. The chloroplast expression vector pTGC was constructed and then transformed into the tobacco chloroplast genome through particle bombardment. Six transplastomic tobacco plants were obtained by spectinomycin screening. PCR and Southern blot analysis confirmed integration of phaG and phaC genes into the chloroplast genome of T0 and T1 transgenic plants, and T1 transgenic plants exhibited homogenization. The expression of phaC and phaG at the transcription level was detected by reverse transcriptase-polymerase chain reaction (RT-PCR). Recombinant transgenes in the tobacco chloroplast genome were maternally inherited and were not transmitted via pollen when out-crossed with untransformed female plants.

  9. Chloroplast: The Trojan Horse in Plant-Virus Interaction.

    Science.gov (United States)

    Bhattacharyya, Dhriti; Chakraborty, Supriya

    2017-01-05

    The chloroplast is one of the most dynamic organelles of a plant cell. It carries out photosynthesis, synthesizes major phytohormones, takes an active part in defence response, and is crucial for inter-organelle signaling. Viruses, on the other hand, are extremely strategic in manipulating the internal environment of the host cell. The chloroplast, a prime target for viruses, undergoes enormous structural and functional damage during viral infection. In fact, a large proportion of the affected gene products in a virus-infected plant are closely associated with the chloroplast and the photosynthesis process. Although the chloroplast is deficient in gene-silencing machinery, it elicits an effector-triggered immune response against viral pathogens. Virus infection induces the organelle to produce an extensive network of stromules, which are involved in both viral propagation and anti-viral defence. Studies over the last few decades have made the involvement of the chloroplast in regulating plant-virus interaction increasingly evident. The current review presents an exhaustive account of these facts, with their implications for pathogenicity. We have attempted to highlight the intricacies of chloroplast-virus interaction and explain the existing gaps in current knowledge, which should encourage virologists to utilize chloroplast genome-based antiviral resistance in economically important crops.

  10. A novel chloroplast-localized protein EMB1303 is required for chloroplast development in Arabidopsis

    Institute of Scientific and Technical Information of China (English)

    Xiaozhen Huang; Xiaoyan Zhang; Shuhua Yang

    2009-01-01

    To understand the molecular mechanisms underlying chloroplast development, we isolated and characterized the albino mutant emb1303-1 in Arabidopsis. The mutant displayed a severe dwarf phenotype with small albino rosette leaves and short roots on a synthetic medium containing sucrose. It is pigment-deficient and seedling lethal when grown in soil. Embryo development was delayed in the mutant, although seed germination was not significantly impaired. The plastids of emb1303-1 were arrested in early developmental stages without the classical stack of thylakoid membrane. Genetic and molecular analyses uncovered that the EMB1303 gene encodes a novel chloroplast-localized protein. Microarray and RT-PCR analyses revealed that a number of nuclear- and plastid-encoded genes involved in photosynthesis and chloroplast biogenesis were substantially downregulated in the mutant. Moreover, the accumulation of several major chloroplast proteins was severely compromised in emb1303-1. These results suggest that EMB1303 is essential for chloroplast development.

  11. Genome Sequence of Azospirillum brasilense CBG497 and Comparative Analyses of Azospirillum Core and Accessory Genomes provide Insight into Niche Adaptation

    Directory of Open Access Journals (Sweden)

    Victor González

    2012-09-01

    Full Text Available Bacteria of the genus Azospirillum colonize roots of important cereals and grasses, and promote plant growth by several mechanisms, notably phytohormone synthesis. The genomes of several Azospirillum strains belonging to different species, isolated from various host plants and locations, were recently sequenced and published. In this study, an additional genome of an A. brasilense strain, isolated from maize grown on an alkaline soil in the northeast of Mexico, strain CBG497, was obtained. Comparative genomic analyses were performed on this new genome and three other genomes (A. brasilense Sp245, A. lipoferum 4B and Azospirillum sp. B510). The Azospirillum core genome was established and consists of 2,328 proteins, representing between 30% to 38% of the total encoded proteins within a genome. It is mainly chromosomally-encoded and contains 74% of genes of ancestral origin shared with some aquatic relatives. The non-ancestral part of the core genome is enriched in genes involved in signal transduction, in transport and in metabolism of carbohydrates and amino-acids, and in surface properties features linked to adaptation in fluctuating environments, such as soil and rhizosphere. Many genes involved in colonization of plant roots, plant-growth promotion (such as those involved in phytohormone biosynthesis), and properties involved in rhizosphere adaptation (such as catabolism of phenolic compounds, uptake of iron) are restricted to a particular strain and/or species, strongly suggesting niche-specific adaptation.

  12. Genome Sequence of Azospirillum brasilense CBG497 and Comparative Analyses of Azospirillum Core and Accessory Genomes provide Insight into Niche Adaptation

    Science.gov (United States)

    Wisniewski-Dyé, Florence; Lozano, Luis; Acosta-Cruz, Erika; Borland, Stéphanie; Drogue, Benoît; Prigent-Combaret, Claire; Rouy, Zoé; Barbe, Valérie; Mendoza Herrera, Alberto; González, Victor; Mavingui, Patrick

    2012-01-01

    Bacteria of the genus Azospirillum colonize roots of important cereals and grasses, and promote plant growth by several mechanisms, notably phytohormone synthesis. The genomes of several Azospirillum strains belonging to different species, isolated from various host plants and locations, were recently sequenced and published. In this study, an additional genome of an A. brasilense strain, isolated from maize grown on an alkaline soil in the northeast of Mexico, strain CBG497, was obtained. Comparative genomic analyses were performed on this new genome and three other genomes (A. brasilense Sp245, A. lipoferum 4B and Azospirillum sp. B510). The Azospirillum core genome was established and consists of 2,328 proteins, representing between 30% to 38% of the total encoded proteins within a genome. It is mainly chromosomally-encoded and contains 74% of genes of ancestral origin shared with some aquatic relatives. The non-ancestral part of the core genome is enriched in genes involved in signal transduction, in transport and in metabolism of carbohydrates and amino-acids, and in surface properties features linked to adaptation in fluctuating environments, such as soil and rhizosphere. Many genes involved in colonization of plant roots, plant-growth promotion (such as those involved in phytohormone biosynthesis), and properties involved in rhizosphere adaptation (such as catabolism of phenolic compounds, uptake of iron) are restricted to a particular strain and/or species, strongly suggesting niche-specific adaptation. PMID:24705077

  13. Expression of human soluble TRAIL in Chlamydomonas reinhardtii chloroplast

    Institute of Scientific and Technical Information of China (English)

    YANG Zongqi; LI Yinü; CHEN Feng; LI Dong; ZHANG Zhifang; LIU Yanxin; ZHENG Dexian; WANG Yong; SHEN Guifang

    2006-01-01

    Tumor necrosis factor-related apoptosis-inducing ligand (TRAIL) selectively induces apoptosis in various tumor cells and virus-infected cells, but rarely in normal cells. A chloroplast expression vector, p64TRAIL, containing the cDNA coding for the soluble TRAIL (sTRAIL), was constructed with clpP-trnL-petB-chlL-rpl23-rpl2 as Chlamydomonas reinhardtii plastid homologous recombination fragments and the spectinomycin-resistance aadA gene as a selectable marker. The plasmid p64TRAIL was transferred into the chloroplast genome of C. reinhardtii by the biolistic method. Three independently transformed lines were obtained by selection on 100 mg/L spectinomycin. PCR amplification, Southern blot analysis of the sTRAIL coding region DNA and cultivation of cells in the dark all showed that the exogenous DNA had been integrated into the chloroplast genome of C. reinhardtii. Western blot analysis showed that human soluble TRAIL was expressed in the C. reinhardtii chloroplast. Densitometric analysis of the Western blot indicated that the expressed human sTRAIL protein in the chloroplasts of C. reinhardtii accounted for about 0.43%-0.67% of the total soluble proteins. These experimental results demonstrated the possibility of using transgenic chloroplasts of green alga as bioreactors for production of biopharmaceuticals.

  14. Pan-genome analyses identify lineage- and niche-specific markers of evolution and adaptation in Epsilonproteobacteria

    Directory of Open Access Journals (Sweden)

    Ying Zhang

    2014-03-01

    Full Text Available The rapidly increasing availability of complete bacterial genomes has created new opportunities for reconstructing bacterial evolution, but it has also highlighted the difficulty of fully understanding the genomic and functional variations occurring among different lineages. Using the class Epsilonproteobacteria as a case study, we investigated the composition, flexibility, and function of its pan-genomes. Models were constructed to extrapolate the expansion of pan-genomes at three different taxonomic levels. The results show that, for Epsilonproteobacteria, the seemingly large genome variations among strains of the same species are less noticeable when compared with groups at higher taxonomic ranks, indicating that genome stability is imposed by the potential existence of taxonomic boundaries. The analysis of pan-genomes has also defined a set of universally conserved core genes, based on which a phylogenetic tree was constructed to confirm that thermophilic species from deep-sea hydrothermal vents represent the most ancient lineages of Epsilonproteobacteria. Moreover, by comparing the flexible genome of a chemoautotrophic deep-sea vent species to (1) genomes of species belonging to the same genus, but inhabiting different environments, and (2) genomes of other vent species, but belonging to different genera, we were able to delineate the relative importance of lineage-specific versus niche-specific genes. This result not only emphasizes the overall importance of phylogenetic proximity in shaping the variable part of the genome, but also highlights the adaptive functions of niche-specific genes. Overall, by modeling the expansion of pan-genomes and analyzing core and flexible genes, this study provides snapshots on how the complex processes of gene acquisition, conservation, and removal affect the evolution of different species, and contribute to the metabolic diversity and versatility of Epsilonproteobacteria.
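
    The core, pan and flexible gene sets discussed in this abstract can be illustrated with plain set operations. The following is a minimal sketch, not the authors' pipeline: the strain names, gene-family identifiers and the helper functions `core_and_pan` and `pan_genome_curve` are hypothetical, and real analyses first cluster proteins into orthologous families before counting.

```python
def core_and_pan(genomes):
    """Return (core, pan) gene-family sets for a dict of strain -> set of families."""
    gene_sets = list(genomes.values())
    core = set.intersection(*gene_sets)  # families present in every genome
    pan = set.union(*gene_sets)          # families present in at least one genome
    return core, pan


def pan_genome_curve(genomes, order):
    """Pan-genome size after adding genomes one at a time in the given order."""
    seen = set()
    sizes = []
    for name in order:
        seen |= genomes[name]
        sizes.append(len(seen))
    return sizes


# Hypothetical toy data: three strains with made-up gene-family labels.
toy = {
    "strain_A": {"g1", "g2", "g3", "g4"},
    "strain_B": {"g1", "g2", "g3", "g5"},
    "strain_C": {"g1", "g2", "g6"},
}

core, pan = core_and_pan(toy)
flexible = pan - core  # the variable ("flexible") part of the pan-genome
curve = pan_genome_curve(toy, ["strain_A", "strain_B", "strain_C"])
print(sorted(core), len(pan), sorted(flexible), curve)
```

    A curve that keeps rising as genomes are added suggests an open pan-genome; extrapolation models such as those mentioned in the abstract are fitted to curves of this kind, usually averaged over many random genome orders.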

  15. Genome Analyses of an Aggressive and Invasive Lineage of the Irish Potato Famine Pathogen

    Science.gov (United States)

    Raffaele, Sylvain; Bain, Ruairidh A.; Cooke, Louise R.; Etherington, Graham J.; Deahl, Kenneth L.; Farrer, Rhys A.; Gilroy, Eleanor M.; Goss, Erica M.; Grünwald, Niklaus J.; Hein, Ingo; MacLean, Daniel; McNicol, James W.; Randall, Eva; Oliva, Ricardo F.; Pel, Mathieu A.; Shaw, David S.; Squires, Julie N.; Taylor, Moray C.; Vleeshouwers, Vivianne G. A. A.; Birch, Paul R. J.; Lees, Alison K.; Kamoun, Sophien

    2012-01-01

    Pest and pathogen losses jeopardise global food security and ever since the 19th century Irish famine, potato late blight has exemplified this threat. The causal oomycete pathogen, Phytophthora infestans, undergoes major population shifts in agricultural systems via the successive emergence and migration of asexual lineages. The phenotypic and genotypic bases of these selective sweeps are largely unknown but management strategies need to adapt to reflect the changing pathogen population. Here, we used molecular markers to document the emergence of a lineage, termed 13_A2, in the European P. infestans population, and its rapid displacement of other lineages to exceed 75% of the pathogen population across Great Britain in less than three years. We show that isolates of the 13_A2 lineage are among the most aggressive on cultivated potatoes, outcompete other aggressive lineages in the field, and overcome previously effective forms of plant host resistance. Genome analyses of a 13_A2 isolate revealed extensive genetic and expression polymorphisms particularly in effector genes. Copy number variations, gene gains and losses, amino-acid replacements and changes in expression patterns of disease effector genes within the 13_A2 isolate likely contribute to enhanced virulence and aggressiveness to drive this population displacement. Importantly, 13_A2 isolates carry intact and in planta induced Avrblb1, Avrblb2 and Avrvnt1 effector genes that trigger resistance in potato lines carrying the corresponding R immune receptor genes Rpi-blb1, Rpi-blb2, and Rpi-vnt1.1. These findings point towards a strategy for deploying genetic resistance to mitigate the impact of the 13_A2 lineage and illustrate how pathogen population monitoring, combined with genome analysis, informs the management of devastating disease epidemics. PMID:23055926

  16. Genome analyses of an aggressive and invasive lineage of the Irish potato famine pathogen.

    Directory of Open Access Journals (Sweden)

    David E L Cooke

    Full Text Available Pest and pathogen losses jeopardise global food security and ever since the 19th century Irish famine, potato late blight has exemplified this threat. The causal oomycete pathogen, Phytophthora infestans, undergoes major population shifts in agricultural systems via the successive emergence and migration of asexual lineages. The phenotypic and genotypic bases of these selective sweeps are largely unknown but management strategies need to adapt to reflect the changing pathogen population. Here, we used molecular markers to document the emergence of a lineage, termed 13_A2, in the European P. infestans population, and its rapid displacement of other lineages to exceed 75% of the pathogen population across Great Britain in less than three years. We show that isolates of the 13_A2 lineage are among the most aggressive on cultivated potatoes, outcompete other aggressive lineages in the field, and overcome previously effective forms of plant host resistance. Genome analyses of a 13_A2 isolate revealed extensive genetic and expression polymorphisms particularly in effector genes. Copy number variations, gene gains and losses, amino-acid replacements and changes in expression patterns of disease effector genes within the 13_A2 isolate likely contribute to enhanced virulence and aggressiveness to drive this population displacement. Importantly, 13_A2 isolates carry intact and in planta induced Avrblb1, Avrblb2 and Avrvnt1 effector genes that trigger resistance in potato lines carrying the corresponding R immune receptor genes Rpi-blb1, Rpi-blb2, and Rpi-vnt1.1. These findings point towards a strategy for deploying genetic resistance to mitigate the impact of the 13_A2 lineage and illustrate how pathogen population monitoring, combined with genome analysis, informs the management of devastating disease epidemics.

  17. Establishment of a Gene Expression System in Rice Chloroplast and Obtainment of PPT-Resistant Rice Plants

    Institute of Scientific and Technical Information of China (English)

    LI Yi-nü; SUN Bing-yao; SU Ning; MENG Xiang-xun; ZHANG Zhi-fang; SHEN Gui-fang

    2009-01-01

    In contrast to the random integration of foreign genes in nuclear transformation, the introduction of genes via chloroplast genetic engineering is characterized by a site-specific pattern via homologous recombination. To establish an expression system for alien genes in rice chloroplast, the intergenic region of ndhF and trnL was selected as the target for site-specific integration of the PPT-resistant bar gene in this study. Two DNA fragments suitable for homologous recombination were cloned from rice chloroplast genome DNA using the PCR technique, and the chloroplast-specific expression vector pRB was constructed by fusing a modified 16S rRNA gene promoter to the bar gene together with the terminator of the psbA gene 3' sequence. Chloroplast transformation was carried out by biolistic bombardment of sterile rice calli with the pRB construct. Subsequently, the regenerated plantlets and seeds of progeny arising from reciprocal crosses to the wild-type lines were obtained. Molecular analysis suggested that the bar gene has been integrated into the rice chloroplast genome. Genetic analysis revealed that the bar gene could be transmitted and expressed normally in the chloroplast genome. Thus, the bar gene conferred not only selection pressure for the transformation of the rice chloroplast genome, but a PPT-resistant trait for rice plants as well. It is suggested that an efficient gene expression system in the rice chloroplast has been established by the chloroplast transformation technique.

  18. Comparative studies on codon usage pattern of chloroplasts and their host nuclear genes in four plant species

    Indian Academy of Sciences (India)

    Qingpo Liu; Qingzhong Xue

    2005-04-01

    A detailed comparison was made of codon usage of chloroplast genes with their host (nuclear) genes in the four angiosperm species Oryza sativa, Zea mays, Triticum aestivum and Arabidopsis thaliana. The average GC content of the entire genes, and at the three codon positions individually, was higher in nuclear than in chloroplast genes, suggesting different genomic organization and mutation pressures in nuclear and chloroplast genes. The results of Nc-plots and neutrality plots suggested that nucleotide compositional constraint had a large contribution to codon usage bias of nuclear genes in O. sativa, Z. mays, and T. aestivum, whereas natural selection was likely to be playing a large role in codon usage bias in chloroplast genomes. Correspondence analysis and chi-test showed that regardless of the genomic environment (species) of the host, the codon usage pattern of chloroplast genes differed from nuclear genes of their host species by their AU-richness. All the chloroplast genomes have predominantly A- and/or U-ending codons, whereas nuclear genomes have G-, C- or U-ending codons as their optimal codons. These findings suggest that the chloroplast genome might display particular characteristics of codon usage that are different from its host nuclear genome. However, one feature common to both chloroplast and nuclear genomes in this study was that pyrimidines were found more frequently than purines at the synonymous codon position of optimal codons.
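
    The comparison of GC content "at the three codon positions individually" can be made concrete with a short calculation. This is a minimal sketch under stated assumptions, not the authors' code: the function name `gc_by_codon_position` and the toy coding sequence are hypothetical, and the input is assumed to be an in-frame coding sequence starting at the first codon position.

```python
def gc_by_codon_position(cds):
    """Return (GC1, GC2, GC3): GC fraction at codon positions 1, 2 and 3.

    Assumes `cds` is in frame; trailing bases that do not complete a codon
    and ambiguous symbols (e.g. N) are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    gc = [0, 0, 0]
    total = [0, 0, 0]
    for i in range(usable):
        base = cds[i]
        if base in "ACGTU":
            pos = i % 3
            total[pos] += 1
            if base in "GC":
                gc[pos] += 1
    return tuple(g / t if t else 0.0 for g, t in zip(gc, total))


# Hypothetical toy sequence: codons ATG GCC AAA TAA.
gc1, gc2, gc3 = gc_by_codon_position("ATGGCCAAATAA")
print(gc1, gc2, gc3)
```

    GC3 is the quantity usually plotted in Nc-plots and neutrality plots, because the third position carries most synonymous variation; low GC3 corresponds to the A- and U-ending codon preference the abstract reports for chloroplast genes.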

  19. Distribution pattern changes of actin filaments during chloroplast movement in Adiantum capillus-veneris.

    Science.gov (United States)

    Tsuboi, Hidenori; Wada, Masamitsu

    2012-05-01

    Chloroplasts change their positions in a cell in response to light intensities. The photoreceptors involved in chloroplast photo-relocation movements and the behavior of chloroplasts during their migration were identified in our previous studies, but the mechanism of movement has yet to be clarified. In this study, the behavior of actin filaments under various light conditions was observed in Adiantum capillus-veneris gametophytes. In chloroplasts staying in one place under a weak light condition and not moving, circular structures composed of actin filaments were observed around the chloroplast periphery. In contrast, short actin filaments were observed at the leading edge of moving chloroplasts induced by partial cell irradiation. In the dark, the circular structures found under the weak light condition disappeared and then reappeared around the moving chloroplasts. Mutant analyses revealed that the disappearance of the circular actin structure was mediated by the blue light photoreceptor, phototropin2.

  20. Comprehensive genomic analyses associate UGT8 variants with musical ability in a Mongolian population

    Science.gov (United States)

    Park, Hansoo; Lee, Seungbok; Kim, Hyun-Jin; Ju, Young Seok; Shin, Jong-Yeon; Hong, Dongwan; von Grotthuss, Marcin; Lee, Dong-Sung; Park, Changho; Kim, Jennifer Hayeon; Kim, Boram; Yoo, Yun Joo; Cho, Sung-Il; Sung, Joohon; Lee, Charles; Kim, Jong-Il; Seo, Jeong-Sun

    2012-01-01

    Background Musical abilities such as recognising music and singing performance serve as means for communication and are instruments in sexual selection. Specific regions of the brain have been found to be activated by musical stimuli, but these have rarely been extended to the discovery of genes and molecules associated with musical ability. Methods A total of 1008 individuals from 73 families were enrolled and a pitch-production accuracy test was applied to determine musical ability. To identify genetic loci and variants that contribute to musical ability, we conducted family-based linkage and association analyses, and incorporated the results with data from exome sequencing and array comparative genomic hybridisation analyses. Results We found significant evidence of linkage at 4q23 with the nearest marker D4S2986 (LOD=3.1), whose supporting interval overlaps a previous study in Finnish families, and identified an intergenic single nucleotide polymorphism (SNP) (rs1251078, p=8.4×10^−17) near UGT8, a gene highly expressed in the central nervous system and known to act in brain organisation. In addition, a non-synonymous SNP in UGT8 was revealed to be highly associated with musical ability (rs4148254, p=8.0×10^−17), and a 6.2 kb copy number loss near UGT8 showed a plausible association with musical ability (p=2.9×10^−6). Conclusions This study provides new insight into the genetics of musical ability, exemplifying a methodology to assign functional significance to synonymous and non-coding alleles by integrating multiple experimental methods. PMID:23118445

  1. Comparative genomic analyses of nickel, cobalt and vitamin B12 utilization

    Directory of Open Access Journals (Sweden)

    Gelfand Mikhail S

    2009-02-01

    Full Text Available Abstract Background Nickel (Ni) and cobalt (Co) are trace elements required for a variety of biological processes. Ni is directly coordinated by proteins, whereas Co is mainly used as a component of vitamin B12. Although a number of Ni and Co-dependent enzymes have been characterized, systematic evolutionary analyses of utilization of these metals are limited. Results We carried out comparative genomic analyses to examine occurrence and evolutionary dynamics of the use of Ni and Co at the level of (i) transport systems, and (ii) metalloproteomes. Our data show that both metals are widely used in bacteria and archaea. Cbi/NikMNQO is the most common prokaryotic Ni/Co transporter, while Ni-dependent urease and Ni-Fe hydrogenase, and B12-dependent methionine synthase (MetH), ribonucleotide reductase and methylmalonyl-CoA mutase are the most widespread metalloproteins for Ni and Co, respectively. Occurrence of other metalloenzymes showed a mosaic distribution and a new B12-dependent protein family was predicted. Deltaproteobacteria and Methanosarcina generally have larger Ni- and Co-dependent proteomes. On the other hand, utilization of these two metals is limited in eukaryotes, and very few of these organisms utilize both of them. The Ni-utilizing eukaryotes are mostly fungi (except Saccharomycotina) and plants, whereas most B12-utilizing organisms are animals. The NiCoT transporter family is the most widespread eukaryotic Ni transporter, and eukaryotic urease and MetH are the most common Ni- and B12-dependent enzymes, respectively. Finally, investigation of environmental and other conditions and identity of organisms that show dependence on Ni or Co revealed that host-associated organisms (particularly obligate intracellular parasites and endosymbionts) have a tendency for loss of Ni/Co utilization. Conclusion Our data provide information on the evolutionary dynamics of Ni and Co utilization and highlight widespread use of these metals in the three

  2. GUN1, a Jack-Of-All-Trades in Chloroplast Protein Homeostasis and Signaling

    Science.gov (United States)

    Colombo, Monica; Tadini, Luca; Peracchio, Carlotta; Ferrari, Roberto; Pesaresi, Paolo

    2016-01-01

    The GENOMES UNCOUPLED 1 (GUN1) gene has been reported to encode a chloroplast-localized pentatricopeptide-repeat protein, which acts to integrate multiple indicators of plastid developmental stage and altered plastid function, as part of chloroplast-to-nucleus retrograde communication. However, the molecular mechanisms underlying signal integration by GUN1 have remained elusive, up until the recent identification of a set of GUN1-interacting proteins, by co-immunoprecipitation and mass-spectrometric analyses, as well as protein–protein interaction assays. Here, we review the molecular functions of the different GUN1 partners and propose a major role for GUN1 as coordinator of chloroplast translation, protein import, and protein degradation. This regulatory role is implemented through proteins that, in most cases, are part of multimeric protein complexes and whose precise functions vary depending on their association states. Within this framework, GUN1 may act as a platform to promote specific functions by bringing the interacting enzymes into close proximity with their substrates, or may inhibit processes by sequestering particular pools of specific interactors. Furthermore, the interactions of GUN1 with enzymes of the tetrapyrrole biosynthesis (TPB) pathway support the involvement of tetrapyrroles as signaling molecules in retrograde communication. PMID:27713755

  3. Highly variable chloroplast markers for evaluating plant phylogeny at low taxonomic levels and for DNA barcoding.

    Directory of Open Access Journals (Sweden)

    Wenpan Dong

    Full Text Available BACKGROUND: At present, plant molecular systematics and DNA barcoding techniques rely heavily on the use of chloroplast gene sequences. Because of the relatively low evolutionary rates of chloroplast genes, there are very few choices suitable for molecular studies on angiosperms at low taxonomic levels, and for DNA barcoding of species. METHODOLOGY/PRINCIPAL FINDINGS: We scanned the entire chloroplast genomes of 12 genera to search for highly variable regions. The sequence data of 9 genera were from GenBank and 3 genera were of our own. We identified nearly 5% of the most variable loci from all variable loci in the chloroplast genomes of each genus, and then selected 23 loci that were present in at least three genera. The 23 loci included 4 coding regions, 2 introns, and 17 intergenic spacers. Of the 23 loci, the most variable (in order from highest variability to lowest) were intergenic regions ycf1-a, trnK, rpl32-trnL, and trnH-psbA, followed by trnS(UGA)-trnG(UCC), petA-psbJ, rps16-trnQ, ndhC-trnV, ycf1-b, ndhF, rpoB-trnC, psbE-petL, and rbcL-accD. Three loci, trnS(UGA)-trnG(UCC), trnT-psbD, and trnW-psaJ, showed very high nucleotide diversity per site (π values) across three genera. Other loci may have strong potential for resolving phylogenetic and species identification problems at the species level. The loci accD-psaI, rbcL-accD, rpl32-trnL, rps16-trnQ, and ycf1 are absent from some genera. To amplify and sequence the highly variable loci identified in this study, we designed primers from their conserved flanking regions. We tested the applicability of the primers to amplify target sequences in eight species representing basal angiosperms, monocots, eudicots, rosids, and asterids, and confirmed that the primers amplified the desired sequences of these species. SIGNIFICANCE/CONCLUSIONS: Chloroplast genome sequences contain regions that are highly variable. Such regions are the first consideration when screening the suitable loci to resolve
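
    Nucleotide diversity per site (π), the statistic used in this record to rank loci, is commonly computed as the mean proportion of differing sites over all pairs of aligned sequences. The sketch below assumes that common definition and equal weighting of sequences; it is not taken from the paper, and the function name `nucleotide_diversity` and the toy alignment are hypothetical. Sites with gaps or ambiguous bases are skipped pair by pair.

```python
from itertools import combinations


def nucleotide_diversity(alignment):
    """Mean pairwise proportion of differing sites (pi) for aligned sequences."""
    if len(alignment) < 2:
        raise ValueError("need at least two sequences")
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("sequences must be aligned to the same length")

    total = 0.0
    pairs = 0
    for a, b in combinations(alignment, 2):
        compared = 0
        diffs = 0
        for x, y in zip(a.upper(), b.upper()):
            # Only count sites where both sequences have an unambiguous base.
            if x in "ACGT" and y in "ACGT":
                compared += 1
                if x != y:
                    diffs += 1
        if compared:
            total += diffs / compared
            pairs += 1
    return total / pairs if pairs else 0.0


# Hypothetical toy alignment of one locus in three accessions.
toy_locus = ["ACGTACGT", "ACGTACGA", "ACCTACGA"]
pi = nucleotide_diversity(toy_locus)
print(round(pi, 4))
```

    Applying such a function locus by locus (or in sliding windows along aligned plastomes) and sorting by π is one way to shortlist highly variable regions of the kind listed in the abstract.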

  4. Genome-Wide Analyses of Individual Strongyloides stercoralis (Nematoda: Rhabditoidea) Provide Insights into Population Structure and Reproductive Life Cycles

    Science.gov (United States)

    Aung, Myo Pa Pa Thet Hnin Htwe; Afrin, Tanzila; Nagayasu, Eiji; Tanaka, Ryusei; Higashiarakawa, Miwa; Win, Kyu Kyu; Hirata, Tetsuo; Htike, Wah Win; Fujita, Jiro; Maruyama, Haruhiko

    2016-01-01

    The helminth Strongyloides stercoralis, which is transmitted through soil, infects 30–100 million people worldwide. S. stercoralis reproduces sexually outside the host as well as asexually within the host, which causes a life-long infection. To understand the population structure and transmission patterns of this parasite, we re-sequenced the genomes of 33 individual S. stercoralis nematodes collected in Myanmar (prevalent region) and Japan (non-prevalent region). We utilised a method combining whole genome amplification and next-generation sequencing techniques to detect 298,202 variant positions (0.6% of the genome) compared with the reference genome. Phylogenetic analyses of SNP data revealed an unambiguous geographical separation and sub-populations that correlated with the host geographical origin, particularly for the Myanmar samples. The relatively higher heterozygosity in the genomes of the Japanese samples can possibly be explained by the independent evolution of two haplotypes of diploid genomes through asexual reproduction during the auto-infection cycle, suggesting that analysing heterozygosity is useful and necessary to infer infection history and geographical prevalence. PMID:28033376

  5. Chloroplast DNA Copy Number May Link to Sex Determination in Leucadendron (Proteaceae)

    Directory of Open Access Journals (Sweden)

    MADE PHARMAWATI

    2009-03-01

    Full Text Available Leucadendron (Proteaceae) is a South African genus, the flowers of which have become a popular item in the Australian cut-flower industry. All species are dioecious. In general the female flowers are the more desirable as cut flowers. The availability of a molecular marker linked to sex determination is therefore needed both to maximize the efficiency of breeding programs and to supply markets with flowers from the preferred sex. The polymerase chain reaction-based method of suppression subtractive hybridization (SSH) combined with mirror orientation selection (MOS) was applied in an attempt to identify genome differences between male and female plants of Leucadendron discolor. Screening of 416 clones from a male-subtracted genomic DNA library and 282 clones from a female-subtracted library identified 13 candidates for male-specific genomic fragments. Sequence analyses of the 13 candidate DNA fragments showed that they were fragments of the chloroplast DNA, raising the possibility that chloroplast DNA copy number is linked to sex determination in Leucadendron.

  6. Genome-wide meta-analyses of multiancestry cohorts identify multiple new susceptibility loci for refractive error and myopia

    NARCIS (Netherlands)

    V.J.M. Verhoeven (Virginie); P.G. Hysi (Pirro); R. Wojciechowski (Robert); Q. Fan (Qiao); J. Guggenheim (Jean); R. Höhn (René); S. MacGregor (Stuart); A.W. Hewit (Alex); A. Nag (Abhishek); C-Y. Cheng (Ching-Yu); E. Yonova-Doing (Ekaterina); X. Zhou (Xin); M.K. Ikram (Kamran); G.H.S. Buitendijk (Gabrielle); G. Mcmahon (George); J.P. Kemp (John); B.S. Pourcain (Beate); C.L. Simpson (Claire); M.J. Mäkelä; T. Lehtimäki (Terho); M. Kähönen (Mika); A.D. Paterson (Andrew); M. Hosseini (Mehran); H.S. Wong (Hoi Suen); L. Xu (Liang); J.B. Jonas; O. Pärssinen (Olavi); J. Wedenoja (Juho); S.P. Yip (Shea Ping); D.W.H. Ho (Daniel); C.P. Pang (Chi); L.J. Chen (Li); K.P. Burdon (Kathryn); J.E. Craig (Jamie); B.E.K. Klein (Barbara); B.E.K. Klein (Barbara); T. Haller (Toomas); A. Metspalu (Andres); C.C. Khor; E.S. Tai (Shyong); T. Aung (Tin); E.N. Vithana (Eranga); W.-T. Tay (Wan-Ting); V.A. Barathi (Veluchamy); P. Chen (Ping); R. Li (Rui); J. Liao (Jie); Y. Zheng (Yuhui); R.T.H. Ong (Rick Twee-Hee); A. Döring (Angela); D.M. Evans (David); N. Timpson (Nicholas); A. Verkerk; T. Meitinger (Thomas); O. Raitakari (Olli); F. Hawthorne (Felicia); T.D. Spector (Timothy); L.C. Karssen (Lennart); M. Pirastu (Mario); D. Murgia (Daniela); W.Q. Ang (Wei); A. Mishra (Aniket); G.W. Montgomery (Grant); C.E. Pennell (Craig); P. Cumberland (Phillippa); I. Cotlarciuc (Ioana); P. Mitchell (Paul); J.J. Wang (Jie Jin); M. Schache (Maria); S. Janmahasathian (Sarayut); R.P. Igo Jr. (Robert); J.H. Lass Jr. (Jonathan); E.Y. Chew (Emily); S.K. Iyengar (Sudha); T.G.M.F. Gorgels (Theo); I. Rudan (Igor); C. Hayward (Caroline); A.F. Wright (Alan); O. Polasek (Ozren); Z. Vatavuk (Zoran); J.F. Wilson (James); B. Fleck (Brian); T. Zeller (Tanja); A. Mirshahi (Alireza); C. Müller (Christian); A.G. Uitterlinden (André); F. Rivadeneira Ramirez (Fernando); J.R. Vingerling (Hans); A. Hofman (Albert); B.A. Oostra (Ben); N. Amin (Najaf); A.A.B. Bergen (Arthur); Y.Y. Teo (Yik Ying); J.S. Rahi (Jugnoo); V. Vitart (Veronique); C. Williams (Cathy); P.N. Baird (Paul); T.Y. Wong (Tien); K. Oexle (Konrad); A.F.H. Pfeiffer (Andreas); D.A. Mackey (David); T.L. Young (Terri); C.M. van Duijn (Cock); S-M. Saw (Seang-Mei); J.E. Bailey-Wilson (Joan); D.E. Stambolian (Dwight); C.C.W. Klaver (Caroline); C.J. Hammond (Christopher)

    2013-01-01

    Refractive error is the most common eye disorder worldwide and is a prominent cause of blindness. Myopia affects over 30% of Western populations and up to 80% of Asians. The CREAM consortium conducted genome-wide meta-analyses, including 37,382 individuals from 27 studies of European anc

  7. Screening Potential DNA Barcode Regions of Chloroplast Coding Genome for Citrus and Its Related Genera

    Institute of Scientific and Technical Information of China (English)

    于杰; 闫化学; 鲁振华; 周志钦

    2011-01-01

    [Objective] Four coding regions of the chloroplast genome of Citrus and its close relatives were analyzed to find suitable DNA barcoding markers for species identification and to lay a foundation for further study of non-coding regions. [Method] Four chloroplast DNA regions (matK, rpoB, rpoC1 and rbcL) of 59 Citrus accessions were sequenced, and the sequences were aligned and manually corrected. The intergeneric, interspecific and intraspecific genetic distances were calculated, and a phylogenetic tree of all the accessions tested was built from the distance data obtained. [Result] The intergeneric and interspecific sequence variation of matK was the highest among the four coding regions tested and differed significantly from that of the other regions studied. In contrast, no obvious variation was found in the rpoB and rpoC1 regions. The sequence variation of rbcL was intermediate among the fragments sequenced. [Conclusion] The matK sequence could be used as a potential candidate fragment for future DNA barcoding studies of Citrus and its closely related genera.

  8. Deciphering the cryptic genome: genome-wide analyses of the rice pathogen Fusarium fujikuroi reveal complex regulation of secondary metabolism and novel metabolites.

    Directory of Open Access Journals (Sweden)

    Philipp Wiemann

    Full Text Available The fungus Fusarium fujikuroi causes "bakanae" disease of rice due to its ability to produce gibberellins (GAs), but it is also known for producing harmful mycotoxins. However, the genetic capacity for the whole arsenal of natural compounds and their role in the fungus' interaction with rice remained unknown. Here, we present a high-quality genome sequence of F. fujikuroi that was assembled into 12 scaffolds corresponding to the 12 chromosomes described for the fungus. We used the genome sequence along with ChIP-seq, transcriptome, proteome, and HPLC-FTMS-based metabolome analyses to identify the potential secondary metabolite biosynthetic gene clusters and to examine their regulation in response to nitrogen availability and plant signals. The results indicate that expression of most but not all gene clusters correlates with proteome and ChIP-seq data. Comparison of the F. fujikuroi genome to those of six other fusaria revealed that only a small number of gene clusters are conserved among these species, thus providing new insights into the divergence of secondary metabolism in the genus Fusarium. Notably, GA biosynthetic genes are present in some related species, but GA biosynthesis is limited to F. fujikuroi, suggesting that this provides a selective advantage during infection of the preferred host plant rice. Among the genome sequences analyzed, one cluster that includes a polyketide synthase gene (PKS19) and another that includes a non-ribosomal peptide synthetase gene (NRPS31) are unique to F. fujikuroi. The metabolites derived from these clusters were identified by HPLC-FTMS-based analyses of engineered F. fujikuroi strains overexpressing cluster genes. In planta expression studies suggest a specific role for the PKS19-derived product during rice infection. Thus, our results indicate that combined comparative genomics and genome-wide experimental analyses identified novel genes and secondary metabolites that contribute to the evolutionary

  9. Comparative and functional genomic analyses of the pathogenicity of phytopathogen Xanthomonas campestris pv. campestris

    OpenAIRE

    Qian, Wei; Jia, Yantao; Ren, Shuang-Xi; He, Yong-Qiang; Feng, Jia-Xun; Lu, Ling-Feng; Sun, Qihong; Ying, Ge; Tang, Dong-Jie; Tang, Hua; Wu, Wei; Hao, Pei; Wang, Lifeng; Jiang, Bo-Le; Zeng, Shenyan

    2005-01-01

    Xanthomonas campestris pathovar campestris (Xcc) is the causative agent of crucifer black rot disease, which causes severe losses in agricultural yield worldwide. This bacterium is a model organism for studying plant-bacteria interactions. We sequenced the complete genome of Xcc 8004 (5,148,708 bp), which is highly conserved relative to that of Xcc ATCC 33913. Comparative genomics analysis indicated that, in addition to a significant genomic-scale rearrangement across the replication axis bet...

  10. Genome-wide divergence and linkage disequilibrium analyses for Capsicum baccatum revealed by genome-anchored single nucleotide polymorphisms

    Science.gov (United States)

    Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to show the distribution of these 2 important incompatible cultivated pepper species. Estimated mean nucleotide...

  11. Mollusc-algal chloroplast endosymbiosis. Photosynthesis, thylakoid protein maintenance, and chloroplast gene expression continue for many months in the absence of the algal nucleus.

    Science.gov (United States)

    Green, B J; Li, W Y; Manhart, J R; Fox, T C; Summer, E J; Kennedy, R A; Pierce, S K; Rumpho, M E

    2000-09-01

    Early in its life cycle, the marine mollusc Elysia chlorotica Gould forms an intracellular endosymbiotic association with chloroplasts of the chromophytic alga Vaucheria litorea C. Agardh. As a result, the dark green sea slug can be sustained in culture solely by photoautotrophic CO2 fixation for at least 9 months if provided with only light and a source of CO2. Here we demonstrate that the sea slug symbiont chloroplasts maintain photosynthetic oxygen evolution and electron transport activity through photosystems I and II for several months in the absence of any external algal food supply. This activity is correlated to the maintenance of functional levels of chloroplast-encoded photosystem proteins, due in part at least to de novo protein synthesis of chloroplast proteins in the sea slug. Levels of at least one putative algal nuclear encoded protein, a light-harvesting complex protein homolog, were also maintained throughout the 9-month culture period. The chloroplast genome of V. litorea was found to be 119.1 kb, similar to that of other chromophytic algae. Southern analysis and polymerase chain reaction did not detect an algal nuclear genome in the slug, in agreement with earlier microscopic observations. Therefore, the maintenance of photosynthetic activity in the captured chloroplasts is regulated solely by the algal chloroplast and animal nuclear genomes.

  12. INNOVATIVE STRATEGIES TO IDENTIFY M. TUBERCULOSIS ANTIGENS AND EPITOPES USING GENOME-WIDE ANALYSES

    Directory of Open Access Journals (Sweden)

    Annemieke Geluk

    2014-06-01

    Full Text Available In view of the fact that only a small part of the Mtb expressome has been explored for identification of antigens capable of activating human T-cell responses, which is critically required for the design of better TB vaccination strategies, more emphasis should be placed on innovative ways to discover new Mtb antigens and explore their function at the several stages of infection. Better protective antigens for TB vaccines are urgently needed, also in view of the disappointing results of the MVA85 vaccine, which failed to induce additional protection in BCG-vaccinated infants [54]. Moreover, immune responses to relevant antigens may be useful to identify TB-specific biomarker signatures. Here we describe the potency of novel tools and strategies to reveal such Mtb antigens. Using proteins specific for different Mtb infection phases, many new antigens of the latency-associated Mtb DosR regulon as well as Rpf proteins, associated with resuscitating TB, were discovered that were recognized by CD4+ and CD8+ T-cells. Furthermore, by employing MHC binding algorithms and bioinformatics combined with high-throughput human T-cell screens and tetramers, HLA-class Ia restricted poly-functional CD8+ T-cells were identified in TB patients. Comparable methods led to the identification of HLA-E-restricted Mtb epitopes recognized by CD8+ T-cells. A genome-wide unbiased antigen discovery approach was applied to analyse the in vivo Mtb gene expression profiles in the lungs of mice, resulting in the identification of IVE-TB antigens, which are expressed during infection in the lung, the main target organ of Mtb. IVE-TB antigens induce strong T-cell responses in long-term latently Mtb-infected individuals, and represent an interesting new group of TB antigens for vaccination. In summary, new tools have helped expand our view on the Mtb antigenome involved in human cellular immunity and provided new candidates for TB vaccination.

  13. Genome-wide association analyses identify new susceptibility loci for oral cavity and pharyngeal cancer.

    Science.gov (United States)

    Lesseur, Corina; Diergaarde, Brenda; Olshan, Andrew F; Wünsch-Filho, Victor; Ness, Andrew R; Liu, Geoffrey; Lacko, Martin; Eluf-Neto, José; Franceschi, Silvia; Lagiou, Pagona; Macfarlane, Gary J; Richiardi, Lorenzo; Boccia, Stefania; Polesel, Jerry; Kjaerheim, Kristina; Zaridze, David; Johansson, Mattias; Menezes, Ana M; Curado, Maria Paula; Robinson, Max; Ahrens, Wolfgang; Canova, Cristina; Znaor, Ariana; Castellsagué, Xavier; Conway, David I; Holcátová, Ivana; Mates, Dana; Vilensky, Marta; Healy, Claire M; Szeszenia-Dąbrowska, Neonila; Fabiánová, Eleonóra; Lissowska, Jolanta; Grandis, Jennifer R; Weissler, Mark C; Tajara, Eloiza H; Nunes, Fabio D; de Carvalho, Marcos B; Thomas, Steve; Hung, Rayjean J; Peters, Wilbert H M; Herrero, Rolando; Cadoni, Gabriella; Bueno-de-Mesquita, H Bas; Steffen, Annika; Agudo, Antonio; Shangina, Oxana; Xiao, Xiangjun; Gaborieau, Valérie; Chabrier, Amélie; Anantharaman, Devasena; Boffetta, Paolo; Amos, Christopher I; McKay, James D; Brennan, Paul

    2016-12-01

    We conducted a genome-wide association study of oral cavity and pharyngeal cancer in 6,034 cases and 6,585 controls from Europe, North America and South America. We detected eight significantly associated loci (P < 5 × 10^-8), seven of which are new for these cancer sites. Oral and pharyngeal cancers combined were associated with loci at 6p21.32 (rs3828805, HLA-DQB1), 10q26.13 (rs201982221, LHPP) and 11p15.4 (rs1453414, OR52N2-TRIM5). Oral cancer was associated with two new regions, 2p23.3 (rs6547741, GPN1) and 9q34.12 (rs928674, LAMC3), and with known cancer-related loci: 9p21.3 (rs8181047, CDKN2B-AS1) and 5p15.33 (rs10462706, CLPTM1L). Oropharyngeal cancer associations were limited to the human leukocyte antigen (HLA) region, and classical HLA allele imputation showed a protective association with the class II haplotype HLA-DRB1*1301-HLA-DQA1*0103-HLA-DQB1*0603 (odds ratio (OR) = 0.59, P = 2.7 × 10^-9). Stratified analyses on a subgroup of oropharyngeal cases with information available on human papillomavirus (HPV) status indicated that this association was considerably stronger in HPV-positive (OR = 0.23, P = 1.6 × 10^-6) than in HPV-negative (OR = 0.75, P = 0.16) cancers.
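
    The figures quoted in this abstract can be read against the genome-wide significance threshold it cites. The sketch below is illustrative only: the dictionary labels and function names are made up for this example, and the comment linking the threshold to a Bonferroni correction for roughly one million independent tests reflects a common convention, not something the abstract states. The stratified HPV analyses are follow-up comparisons, so the threshold check on them is shown for orientation rather than as the authors' criterion.

```python
import math

# Threshold quoted in the abstract. By common convention it corresponds to
# 0.05 / 1,000,000 independent tests (an assumption, not stated in the source).
GENOME_WIDE_ALPHA = 5e-8

# (odds ratio, P-value) pairs copied from the abstract; labels are illustrative.
reported = {
    "class II haplotype, oropharyngeal cancer": (0.59, 2.7e-9),
    "HPV-positive subgroup": (0.23, 1.6e-6),
    "HPV-negative subgroup": (0.75, 0.16),
}


def passes_genome_wide(p_value, alpha=GENOME_WIDE_ALPHA):
    """Return True when the P-value is below the genome-wide threshold."""
    return p_value < alpha


def effect_magnitude(odds_ratio):
    """Absolute log odds ratio: distance of the effect from OR = 1."""
    return abs(math.log(odds_ratio))


for label, (odds_ratio, p_value) in reported.items():
    print(
        f"{label}: OR={odds_ratio}, P={p_value:g}, "
        f"genome-wide={passes_genome_wide(p_value)}, "
        f"|ln OR|={effect_magnitude(odds_ratio):.2f}"
    )
```

    On the log scale an odds ratio of 0.23 lies further from 1 than 0.59 or 0.75, which is the sense in which the protective association is stronger in the HPV-positive subgroup.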

  14. Functional and comparative genomics analyses of pmp22 in medaka fish

    Directory of Open Access Journals (Sweden)

    Kawarabayasi Yutaka

    2009-06-01

    Full Text Available Abstract Background Pmp22, a member of the junction protein family Claudin/EMP/PMP22, plays an important role in myelin formation. Increase of pmp22 transcription causes peripheral neuropathy, Charcot-Marie-Tooth disease type 1A (CMT1A). The pathophysiological phenotype of CMT1A is aberrant axonal myelination which induces a reduction in nerve conduction velocity (NCV). Several CMT1A model rodents have been established by overexpressing pmp22. Thus, it is thought that pmp22 expression must be tightly regulated for correct myelin formation in mammals. Interestingly, the myelin sheath is also present in other jawed vertebrates. The purpose of this study is to analyze the evolutionary conservation of the association between pmp22 transcription level and vertebrate myelin formation, and to find the conserved non-coding sequences for pmp22 regulation by comparative genomics analyses between jawed fishes and mammals. Results A transgenic pmp22 over-expression medaka fish line was established. The transgenic fish had approximately one fifth the peripheral NCV values of controls, and aberrant myelination of transgenic fish in the peripheral nerve system (PNS) was observed. We successfully confirmed that medaka fish pmp22 has the same exon-intron structure as mammals, and identified some known conserved regulatory motifs. Furthermore, we found novel conserved sequences in the first intron and 3'UTR. Conclusion Medaka fish undergo abnormalities in the PNS when pmp22 transcription increases. This result indicates that an adequate pmp22 transcription level is necessary for correct myelination of jawed vertebrates. Comparison of pmp22 orthologs between distantly related species identifies evolutionary conserved sequences that contribute to precise regulation of pmp22 expression.

  15. The complete mitochondrial genome of Trabala vishnou guttata (Lepidoptera: Lasiocampidae) and the related phylogenetic analyses.

    Science.gov (United States)

    Wu, Liuyu; Xiong, Xiao; Wang, Xuming; Xin, Tianrong; Wang, Jing; Zou, Zhiwen; Xia, Bin

    2016-12-01

    The bluish yellow lappet moth, Trabala vishnou guttata, is an extraordinarily important pest in China. Its complete mitochondrial genome was sequenced for the first time, using traditional PCR amplification and primer walking. The genome is 15,281 bp long and includes 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes, two ribosomal RNA (rRNA) genes, and an A + T-rich region. The gene order and orientation of the T. vishnou guttata mitogenome are identical to those of the other sequenced Lasiocampidae species. The overall nucleotide composition of T. vishnou guttata is A (40.27 %), T (40.59 %), C (11.58 %) and G (7.56 %). All PCGs initiate with an orthodox ATN start codon except coxI, which starts with CGA. Three PCGs (coxI, coxII and nad4) use the incomplete stop codon T, while the other 10 PCGs terminate with the complete stop codon TAA. All tRNA genes have a typical clover-leaf structure except for the absence of a dihydrouridine arm in trnS (AGN). The A + T-rich region is 383 bp long. A phylogeny was built to reveal the genetic relationship between T. vishnou guttata and other lepidopteran species, based on the nucleotide sequences of the 13 PCGs of the sequenced species (32 taxa) and using maximum likelihood and Bayesian methods. The phylogenetic analyses show that T. vishnou guttata and its closely related species (Dendrolimus taxa) cluster in the Lasiocampidae group. Lasiocampidae is sister to the other families in Bombycoidea, with a bootstrap value of 83 % and a posterior probability of 0.75. This study supports the view that Lasiocampidae may be independent from Bombycoidea.
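
    The base composition reported in this abstract is enough to derive the summary statistics usually quoted for mitogenomes. The sketch below is a minimal illustration: the four percentages are copied from the abstract, while the skew formulas, (A - T)/(A + T) and (G - C)/(G + C), are the conventional definitions and are an assumption here, since the abstract does not report skew values.

```python
# Base composition (percent) as reported in the abstract.
composition = {"A": 40.27, "T": 40.59, "C": 11.58, "G": 7.56}


def skew(x, y):
    """Conventional composition skew: (x - y) / (x + y)."""
    return (x - y) / (x + y)


at_content = composition["A"] + composition["T"]
gc_content = composition["G"] + composition["C"]
at_skew = skew(composition["A"], composition["T"])
gc_skew = skew(composition["G"], composition["C"])

print(f"A+T = {at_content:.2f} %, G+C = {gc_content:.2f} %")
print(f"AT skew = {at_skew:.4f}, GC skew = {gc_skew:.4f}")
```

    The four percentages sum to 100 %, giving an A + T content of 80.86 %; both skews come out negative, meaning T slightly outnumbers A and C outnumbers G on the reported strand.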

  16. Insight in genome-wide association of metabolite quantitative traits by exome sequence analyses.

    Science.gov (United States)

    Demirkan, Ayşe; Henneman, Peter; Verhoeven, Aswin; Dharuri, Harish; Amin, Najaf; van Klinken, Jan Bert; Karssen, Lennart C; de Vries, Boukje; Meissner, Axel; Göraler, Sibel; van den Maagdenberg, Arn M J M; Deelder, André M; C 't Hoen, Peter A; van Duijn, Cornelia M; van Dijk, Ko Willems

    2015-01-01

    Metabolite quantitative traits carry great promise for epidemiological studies, and their genetic background has been addressed using Genome-Wide Association Studies (GWAS). Thus far, the role of less common variants has not been exhaustively studied. Here, we set out a GWAS for metabolite quantitative traits in serum, followed by exome sequence analysis to zoom in on putative causal variants in the associated genes. 1H Nuclear Magnetic Resonance (1H-NMR) spectroscopy experiments yielded successful quantification of 42 unique metabolites in 2,482 individuals from The Erasmus Rucphen Family (ERF) study. Heritability of metabolites was estimated by SOLAR. GWAS was performed by linear mixed models, using HapMap imputations. Based on physical vicinity and pathway analyses, candidate genes were screened for coding region variation using exome sequence data. Heritability estimates for metabolites ranged between 10% and 52%. GWAS replicated three known loci at metabolome-wide significance: CPS1 with glycine (P-value = 1.27×10^-32), PRODH with proline (P-value = 1.11×10^-19), SLC16A9 with carnitine level (P-value = 4.81×10^-14) and uncovered a novel association between DMGDH and dimethyl-glycine (P-value = 1.65×10^-19) level. In addition, we found three novel, suggestively significant loci: TNP1 with pyruvate (P-value = 1.26×10^-8), KCNJ16 with 3-hydroxybutyrate (P-value = 1.65×10^-8) and 2p12 locus with valine (P-value = 3.49×10^-8). Exome sequence analysis identified potentially causal coding and regulatory variants located in the genes CPS1, KCNJ2 and PRODH, and revealed allelic heterogeneity for CPS1 and PRODH. Combined GWAS and exome analyses of metabolites detected by high-resolution 1H-NMR is a robust approach to uncover metabolite quantitative trait loci (mQTL), and the likely causative variants in these loci. It is anticipated that insight in the genetics of intermediate phenotypes will provide additional insight into the

  17. Insight in genome-wide association of metabolite quantitative traits by exome sequence analyses.

    Directory of Open Access Journals (Sweden)

    Ayşe Demirkan

    2015-01-01

    Full Text Available Metabolite quantitative traits carry great promise for epidemiological studies, and their genetic background has been addressed using Genome-Wide Association Studies (GWAS). Thus far, the role of less common variants has not been exhaustively studied. Here, we set out a GWAS for metabolite quantitative traits in serum, followed by exome sequence analysis to zoom in on putative causal variants in the associated genes. 1H Nuclear Magnetic Resonance (1H-NMR) spectroscopy experiments yielded successful quantification of 42 unique metabolites in 2,482 individuals from The Erasmus Rucphen Family (ERF) study. Heritability of metabolites was estimated by SOLAR. GWAS was performed by linear mixed models, using HapMap imputations. Based on physical vicinity and pathway analyses, candidate genes were screened for coding region variation using exome sequence data. Heritability estimates for metabolites ranged between 10% and 52%. GWAS replicated three known loci at metabolome-wide significance: CPS1 with glycine (P-value = 1.27×10^-32), PRODH with proline (P-value = 1.11×10^-19), SLC16A9 with carnitine level (P-value = 4.81×10^-14) and uncovered a novel association between DMGDH and dimethyl-glycine (P-value = 1.65×10^-19) level. In addition, we found three novel, suggestively significant loci: TNP1 with pyruvate (P-value = 1.26×10^-8), KCNJ16 with 3-hydroxybutyrate (P-value = 1.65×10^-8) and 2p12 locus with valine (P-value = 3.49×10^-8). Exome sequence analysis identified potentially causal coding and regulatory variants located in the genes CPS1, KCNJ2 and PRODH, and revealed allelic heterogeneity for CPS1 and PRODH. Combined GWAS and exome analyses of metabolites detected by high-resolution 1H-NMR is a robust approach to uncover metabolite quantitative trait loci (mQTL), and the likely causative variants in these loci. It is anticipated that insight in the genetics of intermediate phenotypes will provide additional insight

  18. Analyses of pig genomes provide insight into porcine demography and evolution

    NARCIS (Netherlands)

    Groenen, M.A.M.; Megens, H.J.W.C.; Frantz, L.A.F.; Bosse, M.; Crooijmans, R.P.M.A.; Dibbits, B.W.; Madsen, O.; Paudel, Y.

    2012-01-01

    For 10,000 years pigs and humans have shared a close and complex relationship. From domestication to modern breeding practices, humans have shaped the genomes of domestic pigs. Here we present the assembly and analysis of the genome sequence of a female domestic Duroc pig (Sus scrofa) and a comparis

  19. The plant ontology as a tool for comparative plant anatomy and genomic analyses

    Science.gov (United States)

    Plant science is now a major player in the fields of genomics, gene expression analysis, phenomics and metabolomics. Recent advances in sequencing technologies have led to a windfall of data, with new species being added rapidly to the list of species whose genomes have been decoded. The Plant Ontol...

  20. Comparative genomic and morphological analyses of Listeria phages isolated from farm environments.

    Science.gov (United States)

    Denes, Thomas; Vongkamjan, Kitiya; Ackermann, Hans-Wolfgang; Moreno Switt, Andrea I; Wiedmann, Martin; den Bakker, Henk C

    2014-08-01

    The genus Listeria is ubiquitous in the environment and includes the globally important food-borne pathogen Listeria monocytogenes. While the genomic diversity of Listeria has been well studied, considerably less is known about the genomic and morphological diversity of Listeria bacteriophages. In this study, we sequenced and analyzed the genomes of 14 Listeria phages isolated mostly from New York dairy farm environments as well as one related Enterococcus faecalis phage to obtain information on genome characteristics and diversity. We also examined 12 of the phages by electron microscopy to characterize their morphology. These Listeria phages, based on gene orthology and morphology, together with previously sequenced Listeria phages could be classified into five orthoclusters, including one novel orthocluster. One orthocluster (orthocluster I) consists of large genome (~135-kb) myoviruses belonging to the genus “Twort-like viruses,” three orthoclusters (orthoclusters II to IV) contain small-genome (36- to 43-kb) siphoviruses with icosahedral heads, and the novel orthocluster V contains medium-sized-genome (~66-kb) siphoviruses with elongated heads. A novel orthocluster (orthocluster VI) of E. faecalis phages, with medium-sized genomes (~56 kb), was identified, which grouped together and shares morphological features with the novel Listeria phage orthocluster V. This new group of phages (i.e., orthoclusters V and VI) is composed of putative lytic phages that may prove to be useful in phage-based applications for biocontrol, detection, and therapeutic purposes.

  1. New Insights into Dynamic Actin-Based Chloroplast Photorelocation Movement

    Institute of Scientific and Technical Information of China (English)

    Sam-Geun Kong; Masamitsu Wada

    2011-01-01

    Chloroplast movement is essential for plants to survive under various environmental light conditions. Phototropins, plant-specific blue-light-activated receptor kinases, mediate the response by perceiving light intensity and direction. Recently, novel chloroplast actin (cp-actin) filaments have been identified as playing a pivotal role in the directional chloroplast photorelocation movement. Encouraging progress has recently been made in this field of research through molecular genetics and cell biological analyses. This review describes factors that have been identified as being involved in chloroplast movement and their roles in the regulation of cp-actin filaments, thus providing a basis for reflection on their biochemical activities and functions.

  2. Genome-wide analyses of Epstein-Barr virus reveal conserved RNA structures and a novel stable intronic sequence RNA

    OpenAIRE

    2013-01-01

    Background Epstein-Barr virus (EBV) is a human herpesvirus implicated in cancer and autoimmune disorders. Little is known concerning the roles of RNA structure in this important human pathogen. This study provides the first comprehensive genome-wide survey of RNA and RNA structure in EBV. Results Novel EBV RNAs and RNA structures were identified by computational modeling and RNA-Seq analyses of EBV. Scans of the genomic sequences of four EBV strains (EBV-1, EBV-2, GD1, and GD2) and of the clo...

  3. Comparative genomic analyses of Streptococcus mutans provide insights into chromosomal shuffling and species-specific content

    Directory of Open Access Journals (Sweden)

    Nakai Kenta

    2009-08-01

    Full Text Available Abstract Background Streptococcus mutans is the major pathogen of dental caries, and it occasionally causes infective endocarditis. While the pathogenicity of this species is distinct from other human pathogenic streptococci, the species-specific evolution of the genus Streptococcus and its genomic diversity are poorly understood. Results We have sequenced the complete genome of S. mutans serotype c strain NN2025, and compared it with the genome of UA159. The NN2025 genome is composed of 2,013,587 bp, and the two strains show a highly conserved core genome. However, comparison of the two S. mutans strains showed a large genomic inversion across the replication axis producing an X-shaped symmetrical DNA dot plot. This phenomenon was also observed between other streptococcal species, indicating that streptococcal genetic rearrangements across the replication axis play an important role in Streptococcus genetic shuffling. We further confirmed the genomic diversity among 95 clinical isolates using long-PCR analysis. Genomic diversity in S. mutans appears to occur frequently between insertion sequence (IS) elements and transposons, and these diversity regions consist of restriction/modification systems, antimicrobial peptide synthesis systems, and transporters. S. mutans may preferentially reject phage infection by clustered regularly interspaced short palindromic repeats (CRISPRs). In particular, the CRISPR-2 region, which is highly divergent between strains, in NN2025 has long repeated spacer sequences corresponding to the streptococcal phage genome. Conclusion These observations suggest that S. mutans strains evolve through chromosomal shuffling and that phage infection is not needed for gene acquisition. In contrast, S. pyogenes tolerates phage infection for acquisition of virulence determinants for niche adaptation.

  4. Integrative genomic analyses reveal an androgen-driven somatic alteration landscape in early-onset prostate cancer

    DEFF Research Database (Denmark)

    Weischenfeldt, Joachim; Simon, Ronald; Feuerbach, Lars;

    2013-01-01

    Early-onset prostate cancer (EO-PCA) represents the earliest clinical manifestation of prostate cancer. To compare the genomic alteration landscapes of EO-PCA with "classical" (elderly-onset) PCA, we performed deep sequencing-based genomics analyses in 11 tumors diagnosed at young age, and pursued comparative assessments with seven elderly-onset PCA genomes. Remarkable age-related differences in structural rearrangement (SR) formation became evident, suggesting distinct disease pathomechanisms. Whereas EO-PCAs harbored a prevalence of balanced SRs, with a specific abundance of androgen-regulated ETS gene fusions including TMPRSS2:ERG, elderly-onset PCAs displayed primarily non-androgen-associated SRs. Data from a validation cohort of > 10,000 patients showed age-dependent androgen receptor levels and a prevalence of SRs affecting androgen-regulated genes, further substantiating the activity...

  5. Lifestyle transitions in plant pathogenic Colletotrichum fungi deciphered by genome and transcriptome analyses

    NARCIS (Netherlands)

    O'Connell, R.J.; Thon, M.R.; Hacquard, S.; Amyotte, S.G.; Kleemann, J.; Torres, M.F.; Damm, U.; Buiate, E.A.; Epstein, L.; Alkan, N.; Altmuller, J.; Alvarado-Balderrama, L.; Bauser, C.A.; Becker, C.; Birren, B.W.; Chen, Z.; Choi, J.; Crouch, J.A.; Duvick, J.P.; Farman, M.A.; Gan, P.; Heiman, D.; Henrissat, B.; Howard, R.J.; Kabbage, M.; Koch, C.; Kracher, B.; Kubo, Y.; Law, A.D.; Lebrun, M.-H.; Lee, Y.-H.; Miyara, I.; Moore, N.; Neumann, U.; Nordstrom, K.; Panaccione, D.G.; Panstruga, R.; Place, M.; Proctor, R.H.; Prusky, D.; Rech, G.; Reinhardt, R.; Rollins, J.A.; Rounsley, S.; Schardl, C.L.; Schwartz, D.C.; Shenoy, N.; Shirasu, K.; Sikhakolli, U.R.; Stuber, K.; Sukno, S.A.; Sweigard, J.A.; Takano, Y.; Takahara, H.; Trail, F.; Does, H.C.; Voll, L.M.; Will, I.; Young, S.; Zeng, Q.; Zhang, Jingze; Zhou, S.; Dickman, M.B.; Schulze-Lefert, P.; Verloren van Themaat, E.; Ma, L.-J.; Vaillancourt, L.J.

    2012-01-01

    Colletotrichum species are fungal pathogens that devastate crop plants worldwide. Host infection involves the differentiation of specialized cell types that are associated with penetration, growth inside living host cells (biotrophy) and tissue destruction (necrotrophy). We report here genome and tr

  6. Membrane heredity and early chloroplast evolution.

    Science.gov (United States)

    Cavalier-Smith, T

    2000-04-01

    Membrane heredity was central to the unique symbiogenetic origin from cyanobacteria of chloroplasts in the ancestor of Plantae (green plants, red algae, glaucophytes) and to subsequent lateral transfers of plastids to form even more complex photosynthetic chimeras. Each symbiogenesis integrated disparate genomes and several radically different genetic membranes into a more complex cell. The common ancestor of Plantae evolved transit machinery for plastid protein import. In later secondary symbiogeneses, signal sequences were added to target proteins across host perialgal membranes: independently into green algal plastids (euglenoids, chlorarachneans) and red algal plastids (alveolates, chromists). Conservatism and innovation during early plastid diversification are discussed.

  7. Whole-genome analyses resolve early branches in the tree of life of modern birds

    DEFF Research Database (Denmark)

    Sicheritz-Pontén, Thomas; Li, Cai; Li, Bo

    2014-01-01

    To better determine the history of modern birds, we performed a genome-scale phylogenetic analysis of 48 species representing all orders of Neoaves using phylogenomic methods created to handle genome-scale data. We recovered a highly resolved tree that confirms previously controversial sister or ...... levels of incomplete lineage sorting that occurred during a rapid radiation after the Cretaceous-Paleogene mass extinction event about 66 million years ago....

  8. A Nucleus-Encoded Chloroplast Protein YL1 Is Involved in Chloroplast Development and Efficient Biogenesis of Chloroplast ATP Synthase in Rice

    Science.gov (United States)

    Chen, Fei; Dong, Guojun; Wu, Limin; Wang, Fang; Yang, Xingzheng; Ma, Xiaohui; Wang, Haili; Wu, Jiahuan; Zhang, Yanli; Wang, Huizhong; Qian, Qian; Yu, Yanchun

    2016-01-01

    Chloroplast ATP synthase (cpATPase) is an important thylakoid membrane-associated photosynthetic complex involved in the light-dependent reactions of photosynthesis. In this study, we isolated and characterized a rice (Oryza sativa) mutant, yellow leaf 1 (yl1), which exhibits chlorotic leaves throughout developmental stages. The yl1 mutant showed reduced chlorophyll contents, abnormal chloroplast morphology, and decreased photochemical efficiency. Moreover, YL1 deficiency disrupts the expression of genes associated with chloroplast development and photosynthesis. Molecular and genetic analyses revealed that YL1 is a nucleus-encoded protein with a predicted transmembrane domain in its carboxyl-terminus that is conserved in the higher plant kingdom. YL1 localizes to chloroplasts and is preferentially expressed in green tissues containing chloroplasts. Immunoblot analyses showed that inactivation of YL1 leads to drastically reduced accumulation of AtpA (α) and AtpB (β), two core subunits of the CF1αβ subcomplex of cpATPase. In addition, a severe decrease (ca. 41.7%) in cpATPase activity was observed in the yl1-1 mutant compared with the wild type. Furthermore, yeast two-hybrid and bimolecular fluorescence complementation assays revealed a specific interaction between YL1 and the AtpB subunit of cpATPase. Taken together, our results suggest that YL1 is a plant lineage-specific auxiliary factor involved in the biogenesis of the cpATPase complex, possibly via interacting with the β-subunit. PMID:27585744

  9. Genome-Wide Gene Expression Profile Analyses Identify CTTN as a Potential Prognostic Marker in Esophageal Cancer

    OpenAIRE

    2014-01-01

    Aim Esophageal squamous cell carcinoma (ESCC) is one of the most common fatal malignancies of the digestive tract. Its prognosis is poor mainly due to the lack of reliable markers for early detection and prognostic prediction. Here we aim to identify the molecules involved in ESCC carcinogenesis and those that could serve as prognostic markers and as new molecular therapeutic targets. Methods We performed genome-wide gene expression profile analyses of 10 primary ESCCs and their adjacent normal ti...

  10. Whole-genome resequencing analyses of five pig breeds, including Korean wild and native, and three European origin breeds.

    Science.gov (United States)

    Choi, Jung-Woo; Chung, Won-Hyong; Lee, Kyung-Tai; Cho, Eun-Seok; Lee, Si-Woo; Choi, Bong-Hwan; Lee, Sang-Heon; Lim, Wonjun; Lim, Dajeong; Lee, Yun-Gyeong; Hong, Joon-Ki; Kim, Doo-Wan; Jeon, Hyeon-Jeong; Kim, Jiwoong; Kim, Namshin; Kim, Tae-Hun

    2015-08-01

    Pigs have been one of the most important sources of meat for humans, and their productivity has been substantially improved by recent strong selection. Here, we present whole-genome resequencing analyses of 55 pigs of five breeds representing Korean native pigs, wild boar and three European origin breeds. A total of 1,673.1 Gb of sequence reads were mapped to the Swine reference assembly, covering ∼99.2% of the reference genome, at an average of ∼11.7-fold coverage. We detected 20,123,573 single-nucleotide polymorphisms (SNPs), of which 25.5% were novel. We extracted 35,458 non-synonymous SNPs in 9,904 genes, which may contribute to traits of interest. The whole SNP sets were further used to assess the population structures of the breeds, using multiple methodologies, including phylogenetic, similarity matrix, and population structure analysis. They showed clear population clusters with respect to each breed. Furthermore, we scanned the whole genomes to identify signatures of selection throughout the genome. The result revealed several promising loci that might underlie economically important traits in pigs, such as the CLDN1 and TWIST1 genes. These discoveries provide useful genomic information for further study of the discrete genetic mechanisms associated with economically important traits in pigs.
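
    The coverage figure quoted in this abstract can be sanity-checked with simple arithmetic. The sketch below is illustrative only: the function name is hypothetical, and the reference genome size of 2.6 Gb is an assumed round value that the abstract does not state.

```python
def mean_fold_coverage(total_bases: float, n_samples: int, genome_size: float) -> float:
    """Average per-sample depth = total mapped bases / (samples * genome size)."""
    if n_samples <= 0 or genome_size <= 0:
        raise ValueError("n_samples and genome_size must be positive")
    return total_bases / (n_samples * genome_size)


# Figures taken from the abstract.
TOTAL_BASES = 1673.1e9   # 1,673.1 Gb of mapped sequence reads
N_PIGS = 55              # 55 resequenced animals

# ASSUMPTION: reference genome size is not given in the abstract; 2.6 Gb is a
# round illustrative value.
ASSUMED_GENOME_SIZE = 2.6e9

depth = mean_fold_coverage(TOTAL_BASES, N_PIGS, ASSUMED_GENOME_SIZE)
print(f"{depth:.1f}x")  # prints 11.7x
```

    Under that assumed genome size the result agrees with the reported average of about 11.7-fold; a different reference size would shift the estimate proportionally.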

  11. Complete genome sequence and transcriptomics analyses reveal pigment biosynthesis and regulatory mechanisms in an industrial strain, Monascus purpureus YY-1.

    Science.gov (United States)

    Yang, Yue; Liu, Bin; Du, Xinjun; Li, Ping; Liang, Bin; Cheng, Xiaozhen; Du, Liangcheng; Huang, Di; Wang, Lei; Wang, Shuo

    2015-02-09

    Monascus has been used to produce natural colorants and food supplements for more than one thousand years, and more than one billion people eat Monascus-fermented products in their daily lives. In this study, using next-generation sequencing and optical mapping approaches, a 24.1-Mb complete genome of an industrial strain, Monascus purpureus YY-1, was obtained. This genome consists of eight chromosomes and 7,491 genes. Phylogenetic analysis at the genome level provides convincing evidence for the evolutionary position of M. purpureus. We provide the first comprehensive prediction of the biosynthetic pathway for Monascus pigment. Comparative genomic analyses show that the genome of M. purpureus is 13.6-40% smaller than those of closely related filamentous fungi and has undergone significant gene losses, most of which likely occurred during its specialized adaptation to starch-based foods. Comparative transcriptome analysis reveals that carbon starvation stress, resulting from the use of relatively low-quality carbon sources, contributes to the high yield of pigments by repressing central carbon metabolism and augmenting the acetyl-CoA pool. Our work provides important insights into the evolution of this economically important fungus and lays a foundation for future genetic manipulation and engineering of this strain.

  12. The Naked Mole Rat Genome Resource: facilitating analyses of cancer and longevity-related adaptations

    Science.gov (United States)

    Keane, Michael; Craig, Thomas; Alföldi, Jessica; Berlin, Aaron M.; Johnson, Jeremy; Seluanov, Andrei; Gorbunova, Vera; Di Palma, Federica; Lindblad-Toh, Kerstin; Church, George M.; de Magalhães, João Pedro

    2014-01-01

    Motivation: The naked mole rat (Heterocephalus glaber) is an exceptionally long-lived and cancer-resistant rodent native to East Africa. Although its genome was previously sequenced, here we report a new assembly sequenced by us with substantially higher N50 values for scaffolds and contigs. Results: We analyzed the annotation of this new improved assembly and identified candidate genomic adaptations which may have contributed to the evolution of the naked mole rat’s extraordinary traits, including in regions of p53, and the hyaluronan receptors CD44 and HMMR (RHAMM). Furthermore, we developed a freely available web portal, the Naked Mole Rat Genome Resource (http://www.naked-mole-rat.org), featuring the data and results of our analysis, to assist researchers interested in the genome and genes of the naked mole rat, and also to facilitate further studies on this fascinating species. Availability and implementation: The Naked Mole Rat Genome Resource is freely available online at http://www.naked-mole-rat.org. This resource is open source and the source code is available at https://github.com/maglab/naked-mole-rat-portal. Contact: jp@senescence.info PMID:25172923
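
    The N50 statistic mentioned in this abstract summarizes assembly contiguity: it is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the assembly. A minimal sketch follows; the function name and the toy lengths are hypothetical and are not taken from the naked mole rat assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length at which the cumulative sum reaches half of the
        # total assembly size is the N50.
        if running >= half_total:
            return length


# Toy example with made-up lengths: total is 20, so half is 10;
# 6 + 5 = 11 reaches the halfway point, giving an N50 of 5.
print(n50([2, 3, 4, 5, 6]))  # prints 5
```

    A higher N50 for the same total assembly size means that fewer, longer pieces account for half of the sequence.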

  13. Genomic analyses of the microsporidian Nosema ceranae, an emergent pathogen of honey bees.

    Directory of Open Access Journals (Sweden)

    R Scott Cornman

    2009-06-01

    Recent steep declines in honey bee health have severely impacted the beekeeping industry, presenting new risks for agricultural commodities that depend on insect pollination. Honey bee declines could reflect increased pressures from parasites and pathogens. The incidence of the microsporidian pathogen Nosema ceranae has increased significantly in the past decade. Here we present a draft assembly (7.86 MB) of the N. ceranae genome derived from pyrosequence data, including initial gene models and genomic comparisons with other members of this highly derived fungal lineage. N. ceranae has a strongly AT-biased genome (74% A+T) and a diversity of repetitive elements, complicating the assembly. Of 2,614 predicted protein-coding sequences, we conservatively estimate that 1,366 have homologs in the microsporidian Encephalitozoon cuniculi, the most closely related published genome sequence. We identify genes conserved among microsporidia that lack clear homology outside this group, which are of special interest as potential virulence factors in this group of obligate parasites. A substantial fraction of the diminutive N. ceranae proteome consists of novel and transposable-element proteins. For a majority of well-supported gene models, a conserved sense-strand motif can be found within 15 bases upstream of the start codon; a previously uncharacterized version of this motif is also present in E. cuniculi. These comparisons provide insight into the architecture, regulation, and evolution of microsporidian genomes, and will drive investigations into honey bee-Nosema interactions.

  14. From reptilian phylogenomics to reptilian genomes: analyses of c-Jun and DJ-1 proto-oncogenes.

    Science.gov (United States)

    Katsu, Y; Braun, E L; Guillette, L J; Iguchi, T

    2009-01-01

    Genome projects have revolutionized our understanding of both molecular biology and evolution, but there has been a limited collection of genomic data from reptiles. This is surprising given the pivotal position of reptiles in vertebrate phylogeny and the potential utility of information from reptiles for understanding a number of biological phenomena, such as sex determination. Although there are many potential uses for genomic data, one important and useful approach is phylogenomics. Here we report cDNA sequences for the c-Jun(JUN) and DJ-1(PARK7) proto-oncogenes from 3 reptiles (the American alligator, Nile crocodile, and Florida red-belly turtle), show that both genes are expressed in the alligator, and integrate them into analyses of their homologs from other organisms. With these taxa it was possible to conduct analyses that include all major vertebrate lineages. Analyses of c-Jun revealed an unexpected but well-supported frog-turtle clade while analyses of DJ-1 revealed a topology largely congruent with expectation based upon other data. The conflict between the c-Jun topology and expectation appears to reflect the overlap between c-Jun and a CpG island in most taxa, including crocodilians. This CpG island is absent in the frog and turtle, and convergence in base composition appears to be at least partially responsible for the signal uniting these taxa. Noise reduction approaches can eliminate the unexpected frog-turtle clade, demonstrating that multiple signals are present in the c-Jun alignment. We used phylogenetic methods to visualize these signals; we suggest that examining both historical and non-historical signals will prove important for phylogenomic analyses.

  15. The glutathione peroxidase gene family of Lotus japonicus: characterization of genomic clones, expression analyses and immunolocalization in legumes.

    Science.gov (United States)

    Ramos, Javier; Matamoros, Manuel A; Naya, Loreto; James, Euan K; Rouhier, Nicolas; Sato, Shusei; Tabata, Satoshi; Becana, Manuel

    2009-01-01

    Despite the multiple roles played by antioxidants in rhizobia-legume symbioses, little is known about glutathione peroxidases (GPXs) in legumes. Here the characterization of six GPX genes of Lotus japonicus is reported. Expression of GPX genes was analysed by quantitative reverse transcription-polymerase chain reaction in L. japonicus and Lotus corniculatus plants exposed to various treatments known to generate reactive oxygen and/or nitrogen species. LjGPX1 and LjGPX3 were the most abundantly expressed genes in leaves, roots and nodules. Compared with roots, LjGPX1 and LjGPX6 were highly expressed in leaves and LjGPX3 and LjGPX6 in nodules. In roots, salinity decreased GPX4 expression, aluminium decreased expression of the six genes, and cadmium caused up-regulation of GPX3, GPX4 and GPX5 after 1 h and down-regulation of GPX1, GPX2, GPX4 and GPX6 after 3-24 h. Exposure of roots to sodium nitroprusside (a nitric oxide donor) for 1 h increased the mRNA levels of GPX4 and GPX6 by 3.3- and 30-fold, respectively. Thereafter, the GPX6 mRNA level remained consistently higher than that of the control. Immunogold labelling revealed the presence of GPX proteins in root and nodule amyloplasts and in leaf chloroplasts of L. japonicus and other legumes. Labelling was associated with starch grains. These results underscore the differential regulation of GPX expression in response to cadmium, aluminium and nitric oxide, and strongly support a role for GPX6 and possibly other GPX genes in stress and/or metabolic signalling.

  16. Systems genetics of obesity in an F2 pig model by genome-wide association, genetic network and pathway analyses

    Directory of Open Access Journals (Sweden)

    Lisette J. A. Kogelman

    2014-07-01

    Obesity is a complex condition with world-wide exponentially rising prevalence rates, linked with severe diseases like Type 2 Diabetes. Economic and welfare consequences have led to a raised interest in a better understanding of the biological and genetic background. To date, whole genome investigations focusing on single genetic variants have achieved limited success, and the importance of including genetic interactions is becoming evident. Here, the aim was to perform an integrative genomic analysis in an F2 pig resource population that was constructed with an aim to maximize genetic variation of obesity-related phenotypes and genotyped using the 60K SNP chip. Firstly, Genome Wide Association (GWA) analysis was performed on the Obesity Index to locate candidate genomic regions that were further validated using combined Linkage Disequilibrium Linkage Analysis and investigated by evaluation of haplotype blocks. We built Weighted Interaction SNP Hub (WISH) and differentially wired (DW) networks using genotypic correlations amongst obesity-associated SNPs resulting from GWA analysis. GWA results and SNP modules detected by WISH and DW analyses were further investigated by functional enrichment analyses. The functional annotation of SNPs revealed several genes associated with obesity, e.g. NPC2 and OR4D10. Moreover, gene enrichment analyses identified several significantly associated pathways, over and above the GWA study results, that may influence obesity and obesity related diseases, e.g. metabolic processes. WISH networks based on genotypic correlations allowed further identification of various gene ontology terms and pathways related to obesity and related traits, which were not identified by the GWA study. In conclusion, this is the first study to develop a (genetic) obesity index and employ systems genetics in a porcine model to provide important insights into the complex genetic architecture associated with obesity and many biological pathways

  17. Genome constitution of Narcissus variety, 'Tete-a-Tete', analysed through GISH and NBS profiling

    NARCIS (Netherlands)

    Wu, H.; Ramanna, M.S.; Arens, P.; Tuyl, van J.M.

    2011-01-01

    The Narcissus variety, ‘Tête-à-Tête’, has been the most popular variety since 1949, and a well known allotriploid (2n = 3x = 24 + B) of spontaneous origin. Because the identity of one of the parents of this variety was uncertain, the genome constitution of ‘Tête-à-Tête’ was investigated by using gen

  18. MRSA transmission on a neonatal intensive care unit: epidemiological and genome-based phylogenetic analyses.

    Directory of Open Access Journals (Sweden)

    Ulrich Nübel

    BACKGROUND: Methicillin-resistant Staphylococcus aureus (MRSA) may cause prolonged outbreaks of infections in neonatal intensive care units (NICUs). While the specific factors favouring MRSA spread on neonatal wards are not well understood, colonized infants, their relatives, or health-care workers may all be sources for MRSA transmission. Whole-genome sequencing may provide a new tool for elucidating transmission pathways of MRSA at a local scale. METHODS AND FINDINGS: We applied whole-genome sequencing to trace MRSA spread in a NICU and performed a case-control study to identify risk factors for MRSA transmission. MRSA genomes had accumulated sequence variation sufficiently fast to reflect epidemiological linkage among individual patients, between infants and their mothers, and between infants and staff members, such that the relevance of individual nurses' nasal MRSA colonization for prolonged transmission could be evaluated. In addition to confirming previously reported risk factors, we identified an increased risk of transmission from infants with as yet unknown MRSA colonisation, in contrast to known MRSA-positive infants. CONCLUSIONS: The integration of epidemiological (temporal, spatial) and genomic data enabled the phylogenetic testing of several hypotheses on specific MRSA transmission routes within a neonatal intensive-care unit. The pronounced risk of transmission emanating from undetected MRSA carriers suggested that increasing the frequency or speed of microbiological diagnostics could help to reduce transmission of MRSA.

  19. Integrative genome analyses identify key somatic driver mutations of small-cell lung cancer

    NARCIS (Netherlands)

    Peifer, Martin; Fernandez-Cuesta, Lynnette; Sos, Martin L.; George, Julie; Seidel, Danila; Kasper, Lawryn H.; Plenker, Dennis; Leenders, Frauke; Sun, Ruping; Zander, Thomas; Menon, Roopika; Koker, Mirjam; Dahmen, Ilona; Mueller, Christian; Di Cerbo, Vincenzo; Schildhaus, Hans-Ulrich; Altmueller, Janine; Baessmann, Ingelore; Becker, Christian; de Wilde, Bram; Vandesompele, Jo; Boehm, Diana; Ansen, Sascha; Gabler, Franziska; Wilkening, Ines; Heynck, Stefanie; Heuckmann, Johannes M.; Lu, Xin; Carter, Scott L.; Cibulskis, Kristian; Banerji, Shantanu; Getz, Gad; Park, Kwon-Sik; Rauh, Daniel; Gruetter, Christian; Fischer, Matthias; Pasqualucci, Laura; Wright, Gavin; Wainer, Zoe; Russell, Prudence; Petersen, Iver; Chen, Yuan; Stoelben, Erich; Ludwig, Corinna; Schnabel, Philipp; Hoffmann, Hans; Muley, Thomas; Brockmann, Michael; Engel-Riedel, Walburga; Muscarella, Lucia A.; Fazio, Vito M.; Groen, Harry; Timens, Wim; Sietsma, Hannie; Thunnissen, Erik; Smit, Egbert; Heideman, Danielle A. M.; Snijders, Peter J. F.; Cappuzzo, Federico; Ligorio, Claudia; Damiani, Stefania; Field, John; Solberg, Steinar; Brustugun, Odd Terje; Lund-Iversen, Marius; Saenger, Joerg; Clement, Joachim H.; Soltermann, Alex; Moch, Holger; Weder, Walter; Solomon, Benjamin; Soria, Jean-Charles; Validire, Pierre; Besse, Benjamin; Brambilla, Elisabeth; Brambilla, Christian; Lantuejoul, Sylvie; Lorimier, Philippe; Schneider, Peter M.; Hallek, Michael; Pao, William; Meyerson, Matthew; Sage, Julien; Shendure, Jay; Schneider, Robert; Buettner, Reinhard; Wolf, Juergen; Nuernberg, Peter; Perner, Sven; Heukamp, Lukas C.; Brindle, Paul K.; Haas, Stefan; Thomas, Roman K.

    2012-01-01

    Small-cell lung cancer (SCLC) is an aggressive lung tumor subtype with poor prognosis(1-3). We sequenced 29 SCLC exomes, 2 genomes and 15 transcriptomes and found an extremely high mutation rate of 7.4 +/- 1 protein-changing mutations per million base pairs. Therefore, we conducted integrated analys

  20. Multidimensional Genome-wide Analyses Show Accurate FVIII Integration by ZFN in Primary Human Cells

    Science.gov (United States)

    Sivalingam, Jaichandran; Kenanov, Dimitar; Han, Hao; Nirmal, Ajit Johnson; Ng, Wai Har; Lee, Sze Sing; Masilamani, Jeyakumar; Phan, Toan Thang; Maurer-Stroh, Sebastian; Kon, Oi Lian

    2016-01-01

    Costly coagulation factor VIII (FVIII) replacement therapy is a barrier to optimal clinical management of hemophilia A. Therapy using FVIII-secreting autologous primary cells is potentially efficacious and more affordable. Zinc finger nucleases (ZFN) mediate transgene integration into the AAVS1 locus but comprehensive evaluation of off-target genome effects is currently lacking. In light of serious adverse effects in clinical trials which employed genome-integrating viral vectors, this study evaluated potential genotoxicity of ZFN-mediated transgenesis using different techniques. We employed deep sequencing of predicted off-target sites, copy number analysis, whole-genome sequencing, and RNA-seq in primary human umbilical cord-lining epithelial cells (CLECs) with AAVS1 ZFN-mediated FVIII transgene integration. We combined molecular features to enhance the accuracy and activity of ZFN-mediated transgenesis. Our data showed a low frequency of ZFN-associated indels, no detectable off-target transgene integrations or chromosomal rearrangements. ZFN-modified CLECs had very few dysregulated transcripts and no evidence of activated oncogenic pathways. We also showed AAVS1 ZFN activity and durable FVIII transgene secretion in primary human dermal fibroblasts, bone marrow- and adipose tissue-derived stromal cells. Our study suggests that, with close attention to the molecular design of genome-modifying constructs, AAVS1 ZFN-mediated FVIII integration in several primary human cell types may be safe and efficacious. PMID:26689265

  1. Genome-wide association analyses identify variants in developmental genes associated with hypospadias

    DEFF Research Database (Denmark)

    Geller, Frank; Feenstra, Bjarke; Carstensen, Lisbeth;

    2014-01-01

    Hypospadias is a common congenital condition in boys in which the urethra opens on the underside of the penis. We performed a genome-wide association study on 1,006 surgery-confirmed hypospadias cases and 5,486 controls from Denmark. After replication genotyping of an additional 1,972 cases and 1...

  2. Deciphering heterogeneity in pig genome assembly Sscrofa9 by isochore and isochore-like region analyses.

    Directory of Open Access Journals (Sweden)

    Wenqian Zhang

    BACKGROUND: The isochore, a large DNA sequence with relatively small GC variance, is one of the most important structures in eukaryotic genomes. Although the isochore has been widely studied in humans and other species, little is known about its distribution in pigs. PRINCIPAL FINDINGS: In this paper, we construct a map of long homogeneous genome regions (LHGRs), i.e., isochores and isochore-like regions, in pigs to provide an intuitive version of GC heterogeneity in each chromosome. The LHGR pattern study not only quantifies heterogeneities, but also reveals some primary characteristics of the chromatin organization, including the following: (1) the majority of LHGRs belong to GC-poor families and are long; (2) a high gene density tends to occur with the appearance of GC-rich LHGRs; and (3) the density of LINE repeats decreases with an increase in the GC content of LHGRs. Furthermore, a portion of LHGRs with particular GC ranges (50%-51% and 54%-55%) tend to have abnormally high gene densities, suggesting that biased gene conversion (BGC), as well as time- and energy-saving principles, could be of importance to the formation of genome organization. CONCLUSION: This study significantly improves our knowledge of chromatin organization in the pig genome. Correlations between the different biological features (e.g., gene density and repeat density) and GC content of LHGRs provide a unique glimpse of in silico gene and repeats prediction.
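
    The idea of a region with "relatively small GC variance" can be illustrated by computing GC content in fixed windows and measuring its spread. This is only a sketch of the general idea, not the segmentation method used in the study; the function names, the non-overlapping window scheme and the toy sequence are all assumptions made for illustration.

```python
from statistics import pstdev


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        return 0.0
    return (seq.count("G") + seq.count("C")) / counted


def window_gc(seq: str, window: int) -> list:
    """GC fraction of consecutive non-overlapping windows (trailing partial window dropped)."""
    if window <= 0:
        raise ValueError("window must be positive")
    return [gc_fraction(seq[i:i + window])
            for i in range(0, len(seq) - window + 1, window)]


# Toy sequence: a GC-only stretch followed by an AT-only stretch.
values = window_gc("GGCCAATT", 4)
print(values)          # prints [1.0, 0.0]
print(pstdev(values))  # prints 0.5
```

    A homogeneous region in this sense would show a small standard deviation across its windows, whereas the toy sequence above is deliberately heterogeneous. Real analyses operate on windows many kilobases long.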

  3. Meta-analyses of genome-wide association studies identify multiple loci associated with pulmonary function

    NARCIS (Netherlands)

    D.B. Hancock (Dana); M. Eijgelsheim (Mark); J.B. Wilk (Jemma); S.A. Gharib (Sina); L.R. Loehr (Laura); K. Marciante (Kristin); N. Franceschini (Nora); Y.M.T.A. van Durme; T.H. Chen; R.G. Barr (Graham); M.B. Schabath (Matthew); D.J. Couper (David); G.G. Brusselle (Guy); B.M. Psaty (Bruce); P. Tikka-Kleemola (Päivi); J.I. Rotter (Jerome); A.G. Uitterlinden (André); A. Hofman (Albert); N.M. Punjabi (Naresh); F. Rivadeneira Ramirez (Fernando); A.C. Morrison (Alanna); P.L. Enright (Paul); K.E. North (Kari); S.R. Heckbert (Susan); T. Lumley (Thomas); B.H.Ch. Stricker (Bruno); G.T. O'Connor (George); S.J. London (Stephanie)

    2010-01-01

    Spirometric measures of lung function are heritable traits that reflect respiratory health and predict morbidity and mortality. We meta-analyzed genome-wide association studies for two clinically important lung-function measures: forced expiratory volume in the first second (FEV1) and it

  4. Genomic analyses of DNA transformation and penicillin resistance in Streptococcus pneumoniae clinical isolates.

    Science.gov (United States)

    Fani, Fereshteh; Leprohon, Philippe; Zhanel, George G; Bergeron, Michel G; Ouellette, Marc

    2014-01-01

    Alterations in penicillin-binding proteins, the target enzymes for β-lactam antibiotics, are recognized as primary penicillin resistance mechanisms in Streptococcus pneumoniae. Few studies have analyzed penicillin resistance at the genome scale, however, and we report the sequencing of S. pneumoniae R6 transformants generated while reconstructing the penicillin resistance phenotypes from three penicillin-resistant clinical isolates by serial genome transformation. The genome sequences of the three last-level transformants T2-18209, T5-1983, and T3-55938 revealed that 16.2 kb, 82.7 kb, and 137.2 kb of their genomes had been replaced with 5, 20, and 37 recombinant sequence segments derived from their respective parental clinical isolates, documenting the extent of DNA transformation between strains. A role in penicillin resistance was confirmed for some of the mutations identified in the transformants. Several multiple recombination events were also found to have happened at single loci coding for penicillin-binding proteins (PBPs) that increase resistance. Sequencing of the transformants with MICs for penicillin similar to those of the parent clinical strains confirmed the importance of mosaic PBP2x, -2b, and -1a as a driving force in penicillin resistance. A role in resistance for mosaic PBP2a was also observed for two of the resistant clinical isolates.

  5. Psychiatric genome-wide association study analyses implicate neuronal, immune and histone pathways

    DEFF Research Database (Denmark)

    O'Dushlaine, Colm; Rossin, Lizzy; Lee, Phil H.

    2015-01-01

    Genome-wide association studies (GWAS) of psychiatric disorders have identified multiple genetic associations with such disorders, but better methods are needed to derive the underlying biological mechanisms that these signals indicate. We sought to identify biological pathways in GWAS data from ...

  6. Psychiatric genome-wide association study analyses implicate neuronal, immune and histone pathways

    NARCIS (Netherlands)

    O'Dushlaine, Colm; Rossin, Lizzy; Lee, Phil H.; Duncan, Laramie; Parikshak, Neelroop N.; Newhouse, Stephen; Ripke, Stephan; Neale, Benjamin M.; Purcell, Shaun M.; Posthuma, Danielle; Nurnberger, John I.; Lee, S. Hong; Faraone, Stephen V.; Perlis, Roy H.; Mowry, Bryan J.; Thapar, Anita; Goddard, Michael E.; Witte, John S.; Absher, Devin; Agartz, Ingrid; Akil, Huda; Amin, Farooq; Andreassen, Ole A.; Anjorin, Adebayo; Anney, Richard; Anttila, Verneri; Arking, Dan E.; Asherson, Philip; Azevedo, Maria H.; Backlund, Lena; Badner, Judith A.; Bailey, Anthony J.; Banaschewski, Tobias; Barchas, Jack D.; Barnes, Michael R.; Barrett, Thomas B.; Bass, Nicholas; Battaglia, Agatino; Bauer, Michael; Bayes, Monica; Bellivier, Frank; Bergen, Sarah E.; Berrettini, Wade; Betancur, Catalina; Bettecken, Thomas; Biederman, Joseph; Binder, Elisabeth B.; Black, Donald W.; Blackwood, Douglas H. R.; Bloss, Cinnamon S.; Boehnke, Michael; Boomsma, Dorret I.; Breuer, Rene; Bruggeman, Richard; Cormican, Paul; Buccola, Nancy G.; Buitelaar, Jan K.; Bunney, William E.; Buxbaum, Joseph D.; Byerley, William F.; Byrne, Enda M.; Caesar, Sian; Cahn, Wiepke; Cantor, Rita M.; Casas, Miguel; Chakravarti, Aravinda; Chambert, Kimberly; Choudhury, Khalid; Cichon, Sven; Mattheisen, Manuel; Cloninger, C. Robert; Collier, David A.; Cook, Edwin H.; Coon, Hilary; Cormand, Bru; Corvin, Aiden; Coryell, William H.; Craig, David W.; Craig, Ian W.; Crosbie, Jennifer; Cuccaro, Michael L.; Curtis, David; Czamara, Darina; Datta, Susmita; Dawson, Geraldine; Day, Richard; De Geus, Eco J.; Degenhardt, Franziska; Djurovic, Srdjan; Donohoe, Gary J.; Doyle, Alysa E.; Duan, Jubao; Dudbridge, Frank; Duketis, Eftichia; Ebstein, Richard P.; Edenberg, Howard J.; Elia, Josephine; Ennis, Sean; Etain, Bruno; Fanous, Ayman; Farmer, Anne E.; Ferrier, I. 
Nicol; Flicldnger, Matthew; Fombonne, Eric; Foroud, Tatiana; Frank, Josef; Franke, Barbara; Fraser, Christine; Freedman, Robert; Freimer, Nelson B.; Freitag, Christine M.; Friedl, Marion; Frisen, Louise; Gailagher, Louise; Gejman, Pablo V.; Georgieva, Lyudmila; Gershon, Elliot S.; Giegling, Ina; Gill, Michael; Gordon, Scott D.; Gordon-Smith, Katherine; Green, Elaine K.; Greenwood, Tiffany A.; Grice, Dorothy E.; Gross, Magdalena; Grozeva, Detelina; Guan, Weihua; Gurling, Hugh; De Haan, Lieuwe; Haines, Jonathan L.; Hakonarson, Hakon; Hallmayer, Joachim; Hamilton, Steven P.; Hamshere, Marian L.; Hansen, Thomas F.; Hartmann, Annette M.; Hautzinger, Martin; Heath, Andrew C.; Henders, Anjali K.; Herms, Stefan; Hickie, Ian B.; Hipolito, Maria; Hoefels, Susanne; Holsboer, Florian; Hoogendijk, Witte J.; Hottenga, Jouke-Jan; Hultman, Christina M.; Hus, Vanessa; Ingason, Andres; Ising, Marcus; Jamain, Stephane; Jones, Edward G.; Jones, Ian; Jones, Lisa; Tzeng, Jung-Ying; Kaehler, Anna K.; Kahn, Rene S.; Kandaswamy, Radhika; Keller, Matthew C.; Kennedy, James L.; Kenny, Elaine; Kent, Lindsey; Kim, Yunjung; Kirov, George K.; Klauck, Sabine M.; Klei, Lambertus; Knowles, James A.; Kohli, Martin A.; Koller, Daniel L.; Konte, Bettina; Korszun, Ania; Krabbendam, Lydia; Krasucki, Robert; Kuntsi, Jonna; Kwan, Phoenix; Landen, Mikael; Laengstroem, Niklas; Lathrop, Mark; Lawrence, Jacob; Lawson, William B.; Leboyer, Marion; Ledbetter, David H.; Lencz, Todd; Lesch, Klaus-Peter; Levinson, Douglas F.; Lewis, Cathryn M.; Li, Jun; Lichtenstein, Paul; Lieberman, Jeffrey A.; Lin, Dan-Yu; Linszen, Don H.; Liu, Chunyu; Lohoff, Falk W.; Loo, Sandra K.; Lord, Catherine; Lowe, Jennifer K.; Lucae, Susanne; MacIntyre, Donald J.; Madden, Pamela A. F.; Maestrini, Elena; Magnusson, Patrik K. 
E.; Mahon, Pamela B.; Maier, Wolfgang; Malhotra, Anil K.; Mane, Shrikant M.; Martin, Christa L.; Martin, Nicholas G.; Matthews, Keith; Mattingsdal, Morten; McCarroll, Steven A.; McGhee, Kevin A.; McGough, James J.; McGrath, Patrick J.; McGuffin, Peter; McInnis, Melvin G.; McIntosh, Andrew; McKinney, Rebecca; McLean, Alan W.; McMahon, Francis J.; McMahon, William M.; McQuillin, Andrew; Medeiros, Helena; Medland, Sarah E.; Meier, Sandra; Melle, Ingrid; Meng, Fan; Meyer, Jobst; Middeldorp, Christel M.; Middleton, Lefkos; Milanova, Vihra; Miranda, Ana; Monaco, Anthony P.; Montgomery, Grant W.; Moran, Jennifer L.; Moreno-De-Luca, Daniel; Morken, Gunnar; Morris, Derek W.; Morrow, Eric M.; Moskvina, Valentina; Muglia, Pierandrea; Muehleisen, Thomas W.; Muir, Walter J.; Mueller-Myhsok, Bertram; Murtha, Michael; Myers, Richard M.; Myin-Germeys, Inez; Neale, Michael C.; Nelson, Stan F.; Nievergelt, Caroline M.; Nikolov, Ivan; Nimgaonkar, Vishwajit; Nolen, Willem A.; Noethen, Markus M.; Nwulia, Evaristus A.; Nyholt, Dale R.; Oades, Robert D.; Olincy, Ann; Oliveira, Guiomar; Olsen, Line; Ophoff, Roel A.; Osby, Urban; Owen, Michael J.; Palotie, Aarno; Parr, Jeremy R.; Paterson, Andrew D.; Pato, Carlos N.; Pato, Michele T.; Penninx, Brenda W.; Pergadia, Michele L.; Pericak-Vance, Margaret A.; Pickard, Benjamin S.; Pimm, Jonathan; Piven, Joseph; Potash, James B.; Poustka, Fritz; Propping, Peter; Puri, Vinay; Quested, Digby J.; Quinn, Emma M.; Ramos-Quiroga, Josep Antoni; Rasmussen, Henrik B.; Raychaudhuri, Soumya; Rehnstroem, Karola; Reif, Andreas; Ribases, Marta; Rice, John P.; Rietschel, Marcella; Roeder, Kathryn; Roeyers, Herbert; Rothenberger, Aribert; Rouleau, Guy; Ruderfer, Douglas; Rujescu, Dan; Sanders, Alan R.; Sanders, Stephan J.; Santangelo, Susan L.; Sergeant, Joseph A.; Schachar, Russell; Schalling, Martin; Schatzberg, Alan F.; Scheftner, William A.; Schellenberg, Gerard D.; Scherer, Stephen W.; Schork, Nicholas J.; Schulze, Thomas G.; Schumacher, Johannes; Schwarz, 
Markus; Scolnick, Edward; Scott, Laura J.; Shi, Jianxin; Shilling, Paul D.; Shyn, Stanley I.; Silverman, Jeremy M.; Slager, Susan L.; Smalley, Susan L.; Smit, Johannes H.; Smith, Erin N.; Sonuga-Barke, Edmund J. S.; Cair, David St.; State, Matthew; Steffens, Michael; Steinhausen, Hans-Christoph; Strauss, John S.; Strohmaier, Jana; Stroup, T. Scott; Sutdiffe, James S.; Szatmari, Peter; Szelinger, Szabocls; Thirumalai, Srinivasa; Thompson, Robert C.; Todorov, Alexandre A.; Tozzi, Federica; Treutlein, Jens; Uhr, Manfred; Van den Oord, Edwin J. C. G.; Van Grootheest, Gerard; Van Os, Jim; Vicente, Astrid M.; Vieland, Veronica J.; Vincent, John B.; Visscher, Peter M.; Walsh, Christopher A.; Wassink, Thomas H.; Watson, Stanley J.; Weissman, Myrna M.; Werge, Thomas; Wienker, Thomas F.; Wijsman, Ellen M.; Willemsen, Gonneke; Williams, Nigel; Willsey, A. Jeremy; Witt, Stephanie H.; Xu, Wei; Young, Allan H.; Yu, Timothy W.; Zammit, Stanley; Zandi, Peter P.; Zhang, Peng; Zitman, Frans G.; Zoellner, Sebastian; Devlin, Bernie; Kelsoe, John R.; Sklar, Pamela; Daly, Mark J.; O'Donovan, Michael C.; Craddock, Nicholas; Kendler, Kenneth S.; Weiss, Lauren A.; Wray, Naomi R.; Zhao, Zhaoming; Geschwind, Daniel H.; Sullivan, Patrick F.; Smoller, Jordan W.; Holmans, Peter A.; Breen, Gerome

    2015-01-01

    Genome-wide association studies (GWAS) of psychiatric disorders have identified multiple genetic associations with such disorders, but better methods are needed to derive the underlying biological mechanisms that these signals indicate. We sought to identify biological pathways in GWAS data from ove

  7. Genome-wide association analyses identify 18 new loci associated with serum urate concentrations

    NARCIS (Netherlands)

    Köttgen, Anna; Albrecht, Eva; Teumer, Alexander; Vitart, Veronique; Krumsiek, Jan; Hundertmark, Claudia; Pistis, Giorgio; Ruggiero, Daniela; O'Seaghdha, Conall M; Haller, Toomas; Yang, Qiong; Tanaka, Toshiko; Johnson, Andrew D; Kutalik, Zoltán; Smith, Albert V; Shi, Julia; Struchalin, Maksim; Middelberg, Rita P S; Brown, Morris J; Gaffo, Angelo L; Pirastu, Nicola; Li, Guo; Hayward, Caroline; Zemunik, Tatijana; Huffman, Jennifer; Yengo, Loic; Zhao, Jing Hua; Demirkan, Ayse; Feitosa, Mary F; Liu, Xuan; Malerba, Giovanni; Lopez, Lorna M; van der Harst, Pim; Li, Xinzhong; Kleber, Marcus E; Hicks, Andrew A; Nolte, Ilja M; Johansson, Asa; Murgia, Federico; Wild, Sarah H; Bakker, Stephan J L; Peden, John F; Dehghan, Abbas; Steri, Maristella; Tenesa, Albert; Lagou, Vasiliki; Salo, Perttu; Mangino, Massimo; Rose, Lynda M; Lehtimäki, Terho; Woodward, Owen M; Okada, Yukinori; Tin, Adrienne; Müller, Christian; Oldmeadow, Christopher; Putku, Margus; Czamara, Darina; Kraft, Peter; Frogheri, Laura; Thun, Gian Andri; Grotevendt, Anne; Gislason, Gauti Kjartan; Harris, Tamara B; Launer, Lenore J; McArdle, Patrick; Shuldiner, Alan R; Boerwinkle, Eric; Coresh, Josef; Schmidt, Helena; Schallert, Michael; Martin, Nicholas G; Montgomery, Grant W; Kubo, Michiaki; Nakamura, Yusuke; Tanaka, Toshihiro; Munroe, Patricia B; Samani, Nilesh J; Jacobs, David R; Liu, Kiang; D'Adamo, Pio; Ulivi, Sheila; Rotter, Jerome I; Psaty, Bruce M; Vollenweider, Peter; Waeber, Gerard; Campbell, Susan; Devuyst, Olivier; Navarro, Pau; Kolcic, Ivana; Hastie, Nicholas; Balkau, Beverley; Froguel, Philippe; Esko, Tõnu; Salumets, Andres; Khaw, Kay Tee; Langenberg, Claudia; Wareham, Nicholas J; Isaacs, Aaron; Kraja, Aldi; Zhang, Qunyuan; Wild, Philipp S; Scott, Rodney J; Holliday, Elizabeth G; Org, Elin; Viigimaa, Margus; Bandinelli, Stefania; Metter, Jeffrey E; Lupo, Antonio; Trabetti, Elisabetta; Sorice, Rossella; Döring, Angela; Lattka, Eva; Strauch, Konstantin; Theis, Fabian; Waldenberger, Melanie; Wichmann, 
H-Erich; Davies, Gail; Gow, Alan J; Bruinenberg, Marcel; Stolk, Ronald P; Kooner, Jaspal S; Zhang, Weihua; Winkelmann, Bernhard R; Boehm, Bernhard O; Lucae, Susanne; Penninx, Brenda W; Smit, Johannes H; Curhan, Gary; Mudgal, Poorva; Plenge, Robert M; Portas, Laura; Persico, Ivana; Kirin, Mirna; Wilson, James F; Mateo Leach, Irene; van Gilst, Wiek H; Goel, Anuj; Ongen, Halit; Hofman, Albert; Rivadeneira, Fernando; Uitterlinden, Andre G; Imboden, Medea; von Eckardstein, Arnold; Cucca, Francesco; Nagaraja, Ramaiah; Piras, Maria Grazia; Nauck, Matthias; Schurmann, Claudia; Budde, Kathrin; Ernst, Florian; Farrington, Susan M; Theodoratou, Evropi; Prokopenko, Inga; Stumvoll, Michael; Jula, Antti; Perola, Markus; Salomaa, Veikko; Shin, So-Youn; Spector, Tim D; Sala, Cinzia; Ridker, Paul M; Kähönen, Mika; Viikari, Jorma; Hengstenberg, Christian; Nelson, Christopher P; Meschia, James F; Nalls, Michael A; Sharma, Pankaj; Singleton, Andrew B; Kamatani, Naoyuki; Zeller, Tanja; Burnier, Michel; Attia, John; Laan, Maris; Klopp, Norman; Hillege, Hans L; Kloiber, Stefan; Choi, Hyon; Pirastu, Mario; Tore, Silvia; Probst-Hensch, Nicole M; Völzke, Henry; Gudnason, Vilmundur; Parsa, Afshin; Schmidt, Reinhold; Whitfield, John B; Fornage, Myriam; Gasparini, Paolo; Siscovick, David S; Polašek, Ozren; Campbell, Harry; Rudan, Igor; Bouatia-Naji, Nabila; Metspalu, Andres; Loos, Ruth J F; van Duijn, Cornelia M; Borecki, Ingrid B; Ferrucci, Luigi; Gambaro, Giovanni; Deary, Ian J; Wolffenbuttel, Bruce H R; Chambers, John C; März, Winfried; Pramstaller, Peter P; Snieder, Harold; Gyllensten, Ulf; Wright, Alan F; Navis, Gerjan; Watkins, Hugh; Witteman, Jacqueline C M; Sanna, Serena; Schipf, Sabine; Dunlop, Malcolm G; Tönjes, Anke; Ripatti, Samuli; Soranzo, Nicole; Toniolo, Daniela; Chasman, Daniel I; Raitakari, Olli; Kao, W H Linda; Ciullo, Marina; Fox, Caroline S; Caulfield, Mark; Bochud, Murielle; Gieger, Christian

    2013-01-01

    Elevated serum urate concentrations can cause gout, a prevalent and painful inflammatory arthritis. By combining data from >140,000 individuals of European ancestry within the Global Urate Genetics Consortium (GUGC), we identified and replicated 28 genome-wide significant loci in association with se

  8. Genome sequencing and analyses of the postharvest fungus Penicillium expansum R21

    Science.gov (United States)

    Blue mold is the vernacular name of a common postharvest disease of stored apples, pears and quince that is caused by several common species of Penicillium. This study reports the draft genome sequence of Penicillium expansum strain R21, a strain isolated from a Red Delicious apple in 2011 in Pennsy...

  9. Metabolic model for the filamentous ‘Candidatus Microthrix parvicella’ based on genomic and metagenomic analyses

    DEFF Research Database (Denmark)

    McIlroy, Simon Jon; Kristiansen, Rikke; Albertsen, Mads;

    2013-01-01

    acids as triacylglycerols. Utilisation of trehalose and/or polyphosphate stores or partial oxidation of long-chain fatty acids may supply the energy required for anaerobic lipid uptake and storage. Comparing the genome sequence of this isolate with metagenomes from two full-scale wastewater treatment...

  10. Comparative analyses of the complete mitochondrial genomes of Ascaris lumbricoides and Ascaris suum from humans and pigs.

    Science.gov (United States)

    Liu, Guo-Hua; Wu, Chang-Yi; Song, Hui-Qun; Wei, Shu-Jun; Xu, Min-Jun; Lin, Rui-Qing; Zhao, Guang-Hui; Huang, Si-Yang; Zhu, Xing-Quan

    2012-01-15

    Ascaris lumbricoides and Ascaris suum are parasitic nematodes living in the small intestine of humans and pigs, and can cause the disease ascariasis. There has long been controversy as to whether the two ascaridoid taxa represent the same species, owing to their close morphological resemblance. However, complete mitochondrial (mt) genome data have been lacking for A. lumbricoides despite the human and animal health significance and global socio-economic impact of these parasites. In the present study, we sequenced the complete mt genomes of A. lumbricoides and A. suum (China isolate), which were 14,303 bp and 14,311 bp in size, respectively. The identity of the mt genomes was 98.1% between A. lumbricoides and A. suum (China isolate), and 98.5% between A. suum (China isolate) and A. suum (USA isolate). Both genomes are circular and consist of 36 genes, including 12 genes for proteins, 2 genes for rRNA and 22 genes for tRNA, consistent with all other ascaridoid species studied to date. All genes are transcribed in the same direction and have a nucleotide composition high in A and T (71.7% for A. lumbricoides and 71.8% for A. suum). The AT bias had a significant effect on both the codon usage pattern and amino acid composition of proteins. Phylogenetic analyses using concatenated amino acid sequences of 12 protein-coding genes, with three different computational algorithms (Bayesian analysis, maximum likelihood and maximum parsimony), all clustered A. lumbricoides and A. suum in one clade with high statistical support, indicating that the two are very closely related. These mt genome data and the results provide some additional genetic evidence that A. lumbricoides and A. suum may represent the same species. The mt genome data presented in this study are also useful novel markers for studying the molecular epidemiology and population genetics of Ascaris.
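
    The two summary statistics quoted in this abstract, A+T content and pairwise identity, can be illustrated with a minimal Python sketch. The function names are hypothetical, and the identity definition used here (matches divided by ungapped aligned columns) is an assumption; the abstract does not say how identity was calculated.

```python
def at_content(seq):
    """Return the percentage of A and T among unambiguous bases (A, C, G, T)."""
    counted = [base for base in seq.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A, C, G or T bases")
    at = sum(1 for base in counted if base in "AT")
    return 100.0 * at / len(counted)


def percent_identity(aligned_a, aligned_b):
    """Percent identity over aligned columns in which neither sequence has a gap.

    Assumption: both inputs are rows of the same pairwise alignment, with '-'
    marking gaps. Other identity definitions (e.g. counting gaps as mismatches)
    would give different values.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x != "-" and y != "-"
    ]
    if not pairs:
        raise ValueError("no ungapped columns to compare")
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)
```

    Applied to the full mt genome sequences, at_content would be expected to land near the reported 71.7% and 71.8%, although the exact value depends on how ambiguous bases are handled.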

  11. Genome-wide meta-analyses of smoking behaviors in African Americans.

    Science.gov (United States)

    David, S P; Hamidovic, A; Chen, G K; Bergen, A W; Wessel, J; Kasberger, J L; Brown, W M; Petruzella, S; Thacker, E L; Kim, Y; Nalls, M A; Tranah, G J; Sung, Y J; Ambrosone, C B; Arnett, D; Bandera, E V; Becker, D M; Becker, L; Berndt, S I; Bernstein, L; Blot, W J; Broeckel, U; Buxbaum, S G; Caporaso, N; Casey, G; Chanock, S J; Deming, S L; Diver, W R; Eaton, C B; Evans, D S; Evans, M K; Fornage, M; Franceschini, N; Harris, T B; Henderson, B E; Hernandez, D G; Hitsman, B; Hu, J J; Hunt, S C; Ingles, S A; John, E M; Kittles, R; Kolb, S; Kolonel, L N; Le Marchand, L; Liu, Y; Lohman, K K; McKnight, B; Millikan, R C; Murphy, A; Neslund-Dudas, C; Nyante, S; Press, M; Psaty, B M; Rao, D C; Redline, S; Rodriguez-Gil, J L; Rybicki, B A; Signorello, L B; Singleton, A B; Smoller, J; Snively, B; Spring, B; Stanford, J L; Strom, S S; Swan, G E; Taylor, K D; Thun, M J; Wilson, A F; Witte, J S; Yamamura, Y; Yanek, L R; Yu, K; Zheng, W; Ziegler, R G; Zonderman, A B; Jorgenson, E; Haiman, C A; Furberg, H

    2012-05-22

    The identification and exploration of genetic loci that influence smoking behaviors have been conducted primarily in populations of European ancestry. Here we report results of the first genome-wide association study meta-analysis of smoking behavior in African Americans in the Study of Tobacco in Minority Populations Genetics Consortium (n = 32,389). We identified one non-coding single-nucleotide polymorphism (SNP; rs2036527[A]) on chromosome 15q25.1 associated with smoking quantity (cigarettes per day), which exceeded genome-wide significance (β = 0.040, s.e. = 0.007, P = 1.84 × 10^-8). This variant is present in the 5'-distal enhancer region of the CHRNA5 gene and defines the primary index signal reported in studies of European ancestry populations. No other SNP reached genome-wide significance for smoking initiation (SI, ever vs never smoking), age of SI, or smoking cessation (SC, former vs current smoking). Informative associations that approached genome-wide significance included three modestly correlated variants at 15q25.1 within PSMA4, CHRNA5 and CHRNA3 for smoking quantity, which are associated with a second signal previously reported in studies of European ancestry populations, and a signal represented by three SNPs in the SPOCK2 gene on chr10q22.1. The association at 15q25.1 confirms this region as an important susceptibility locus for smoking quantity in men and women of African ancestry. Larger studies will be needed to validate the suggestive loci that did not reach genome-wide significance and further elucidate the contribution of genetic variation to disparities in cigarette consumption, SC and smoking-attributable disease between African Americans and European Americans.
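
    As a rough check on the reported effect estimate, a two-sided Wald test can be computed from β and its standard error. The sketch below is illustrative only: the 5 × 10^-8 threshold is the conventional genome-wide cutoff and is an assumption here (the abstract does not state the threshold used), and because β and s.e. are rounded in the abstract, the result will not reproduce the published P value exactly.

```python
import math

# Conventional genome-wide significance threshold (assumed, not from the abstract).
GENOME_WIDE_ALPHA = 5e-8


def wald_p_value(beta, se):
    """Two-sided P value for H0: beta = 0 under a normal approximation."""
    if se <= 0:
        raise ValueError("standard error must be positive")
    z = beta / se
    # For a standard normal Z, P(|Z| > |z|) = erfc(|z| / sqrt(2)).
    return math.erfc(abs(z) / math.sqrt(2.0))


# Rounded values quoted in the abstract for rs2036527[A].
p = wald_p_value(0.040, 0.007)
is_significant = p < GENOME_WIDE_ALPHA
```

    With these rounded inputs the P value falls in the 10^-8 range, the same order of magnitude as the reported figure and below the assumed threshold.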

  12. Acinetobacter seifertii Isolated from China: Genomic Sequence and Molecular Epidemiology Analyses.

    Science.gov (United States)

    Yang, Yunxing; Wang, Jianfeng; Fu, Ying; Ruan, Zhi; Yu, Yunsong

    2016-03-01

    Clinical infections caused by Acinetobacter spp. are an increasing public health concern because of their global occurrence and ability to acquire multidrug resistance. The Acinetobacter calcoaceticus-Acinetobacter baumannii (ACB) complex encompasses A. calcoaceticus, A. baumannii, A. pittii (formerly genomic species 3), and A. nosocomialis (formerly genomic species 13TU), which are predominantly responsible for clinical pathogenesis in the Acinetobacter genus. In our previous study, we reported a putative novel species identified among 385 non-A. baumannii spp. strains on the basis of the rpoB gene phylogenetic tree. Here, the putative novel species was identified as A. seifertii based on the whole-genome phylogenetic tree. A. seifertii was recognized as a novel member of the ACB complex, close to A. baumannii and A. nosocomialis. Furthermore, we studied the characteristics of 10 A. seifertii isolates, which were distributed widely across 6 provinces in China and mainly caused infections in the elderly or children. To define their taxonomic status and characteristics, biochemical reactions, antimicrobial susceptibility testing, pulsed field gel electrophoresis (PFGE), multilocus sequence typing (MLST), and whole-genome sequence analysis were performed. The phenotypic characteristics failed to distinguish A. seifertii from other species in the ACB complex. Most of the A. seifertii isolates were susceptible to antibiotics commonly used for nosocomial Acinetobacter spp. infections, but one isolate (strain A362) was resistant to ampicillin/sulbactam, ceftazidime and amikacin. The different MLST and PFGE patterns suggested that the 10 isolates were not identical and lacked clonal relatedness. Our study reports for the first time the molecular epidemiological and genomic features of widely disseminated A. seifertii in China. These observations could enrich the knowledge of infections caused by non-A. baumannii species and may provide a scientific basis for future clinical treatment.

  13. Endophytic life strategies decoded by genome and transcriptome analyses of the mutualistic root symbiont Piriformospora indica.

    Directory of Open Access Journals (Sweden)

    Alga Zuccaro

    2011-10-01

    Recent sequencing projects have provided deep insight into fungal lifestyle-associated genomic adaptations. Here we report on the 25 Mb genome of the mutualistic root symbiont Piriformospora indica (Sebacinales, Basidiomycota) and provide a global characterization of fungal transcriptional responses associated with the colonization of living and dead barley roots. Extensive comparative analysis of the P. indica genome with other Basidiomycota and Ascomycota fungi that have diverse lifestyle strategies identified features typically associated with both biotrophism and saprotrophism. The tightly controlled expression of the lifestyle-associated gene sets during the onset of the symbiosis, revealed by microarray analysis, argues for a biphasic root colonization strategy of P. indica. This is supported by a cytological study that shows an early biotrophic growth followed by a cell death-associated phase. About 10% of the fungal genes induced during the biotrophic colonization encoded putative small secreted proteins (SSPs), including several lectin-like proteins and members of a P. indica-specific gene family (DELD) with a conserved novel seven-amino-acid motif at the C-terminus. Similar to effectors found in other filamentous organisms, the occurrence of the DELDs correlated with the presence of transposable elements in gene-poor, repeat-rich regions of the genome. This is the first in-depth genomic study describing a mutualistic symbiont with a biphasic lifestyle. Our findings provide a significant advance in understanding development of biotrophic plant symbionts and suggest a series of incremental shifts along the continuum from saprotrophy towards biotrophy in the evolution of mycorrhizal association from decomposer fungi.

  14. Update on Chloroplast Research: New Tools, New Topics, and New Trends

    Institute of Scientific and Technical Information of China (English)

    Ute Armbruster; Paolo Pesaresi; Mathias Pribil; Alexander Hertle; Dario Leister

    2011-01-01

    Chloroplasts, the green differentiation form of plastids, are the sites of photosynthesis and other important plant functions. Genetic and genomic technologies have greatly boosted the rate of discovery and functional characterization of chloroplast proteins during the past decade. Indeed, data obtained using high-throughput methodologies, in particular proteomics and transcriptomics, are now routinely used to assign functions to chloroplast proteins. Our knowledge of many chloroplast processes, notably photosynthesis and photorespiration, has reached such an advanced state that biotechnological approaches to crop improvement now seem feasible. Meanwhile, efforts to identify the entire complement of chloroplast proteins and their interactions are progressing rapidly, making the organelle a prime target for systems biology research in plants.

  15. Comparison of intraspecific, interspecific and intergeneric chloroplast diversity in Cycads

    Science.gov (United States)

    Jiang, Guo-Feng; Hinsinger, Damien Daniel; Strijk, Joeri Sergej

    2016-01-01

    Cycads are among the most threatened plant species. Increasing the availability of genomic information by adding whole chloroplast data is a fundamental step in supporting phylogenetic studies and conservation efforts. Here, we assemble a dataset encompassing three taxonomic levels in cycads, including ten genera, three species in the genus Cycas and two individuals of C. debaoensis. Repeated sequences, SSRs and variations of the chloroplast were analyzed at the intraspecific, interspecific and intergeneric scale, and using our sequence data, we reconstruct a phylogenomic tree for cycads. The chloroplast was 162,094 bp in length, with 133 genes annotated, including 87 protein-coding, 37 tRNA and 8 rRNA genes. We found 7 repeated sequences and 39 SSRs. Seven loci showed promising levels of variation for application in DNA barcoding. The chloroplast phylogeny confirmed the division of Cycadales into two suborders, each of them monophyletic, revealing a contradiction with the current family circumscription and its evolution. Finally, 10 intraspecific SNPs were found. Our results showed that, despite the extremely restricted distribution range of C. debaoensis, complete chloroplast data are useful not only in intraspecific studies but also for improving our understanding of cycad evolution and defining conservation strategies for this emblematic group. PMID:27558458

  16. Spontaneous capture of oilseed rape (Brassica napus) chloroplasts by wild B. rapa: implications for the use of chloroplast transformation for biocontainment.

    Science.gov (United States)

    Haider, Nadia; Allainguillaume, Joel; Wilkinson, Mike J

    2009-04-01

    Environmental concerns over the cultivation of Genetically Modified (GM) crops largely centre on the ecological consequences following gene flow to wild relatives. One attractive solution is to deploy biocontainment measures that prevent hybridization. Chloroplast transformation is the most advanced biocontainment method but is compromised by chloroplast capture (hybridization through the maternal lineage). To date, however, there is a paucity of information on the frequency of chloroplast capture in the wild. Oilseed rape (Brassica napus, AACC) frequently hybridises with wild Brassica rapa (AA, as paternal parent) and yields B. rapa-like introgressed individuals after only two generations. In this study we used chloroplast CAPS markers that differentiate between the two species to survey wild and weedy populations of B. rapa for the capture of B. napus chloroplasts. A total of 464 B. rapa plants belonging to 14 populations growing either in close proximity to B. napus (i.e. sympatric 1 km) were assessed for chloroplast capture using PCR (trnL-F) and CAPS (trnT-L-Xba I) markers. The screen revealed that two sympatric B. rapa populations included 53 plants that possessed the chloroplast of B. napus. In order to discount these B. rapa plants as F(1) crop-wild hybrids, we used a C-genome-specific marker and found that 45 out of 53 plants lacked the C-genome and so were at least second generation introgressants. The most plausible explanation is that these individuals represent multiple cases of chloroplast capture following introgressive hybridisation through the female germ line from the crop. The abundance of such plants in sympatric sites thereby questions whether the use of chloroplast transformation would provide a sufficient biocontainment for GM oilseed rape in the United Kingdom.

  17. An Exploration into Fern Genome Space.

    Science.gov (United States)

    Wolf, Paul G; Sessa, Emily B; Marchant, Daniel Blaine; Li, Fay-Wei; Rothfels, Carl J; Sigel, Erin M; Gitzendanner, Matthew A; Visger, Clayton J; Banks, Jo Ann; Soltis, Douglas E; Soltis, Pamela S; Pryer, Kathleen M; Der, Joshua P

    2015-08-26

    Ferns are one of the few remaining major clades of land plants for which a complete genome sequence is lacking. Knowledge of genome space in ferns will enable broad-scale comparative analyses of land plant genes and genomes, provide insights into genome evolution across green plants, and shed light on genetic and genomic features that characterize ferns, such as their high chromosome numbers and large genome sizes. As part of an initial exploration into fern genome space, we used a whole genome shotgun sequencing approach to obtain low-density coverage (∼0.4X to 2X) for six fern species from the Polypodiales (Ceratopteris, Pteridium, Polypodium, Cystopteris), Cyatheales (Plagiogyria), and Gleicheniales (Dipteris). We explore these data to characterize the proportion of the nuclear genome represented by repetitive sequences (including DNA transposons, retrotransposons, ribosomal DNA, and simple repeats) and protein-coding genes, and to extract chloroplast and mitochondrial genome sequences. Such initial sweeps of fern genomes can provide information useful for selecting a promising candidate fern species for whole genome sequencing. We also describe variation of genomic traits across our sample and highlight some differences and similarities in repeat structure between ferns and seed plants.

  18. Sequencing and analyses of all known human rhinovirus genomes reveal structure and evolution.

    Science.gov (United States)

    Palmenberg, Ann C; Spiro, David; Kuzmickas, Ryan; Wang, Shiliang; Djikeng, Appolinaire; Rathe, Jennifer A; Fraser-Liggett, Claire M; Liggett, Stephen B

    2009-04-03

    Infection by human rhinovirus (HRV) is a major cause of upper and lower respiratory tract disease worldwide and displays considerable phenotypic variation. We examined diversity by completing the genome sequences for all known serotypes (n = 99). Superimposition of capsid crystal structure and optimal-energy RNA configurations established alignments and phylogeny. These revealed conserved motifs; clade-specific diversity, including a potential newly identified species (HRV-D); mutations in field isolates; and recombination. In analogy with poliovirus, a hypervariable 5' untranslated region tract may affect virulence. A configuration consistent with nonscanning internal ribosome entry was found in all HRVs and may account for rapid translation. The data density from complete sequences of the reference HRVs provided high resolution for this degree of modeling and serves as a platform for full genome-based epidemiologic studies and antiviral or vaccine development.

  19. Genome-wide analyses of recombination suggest that Giardia intestinalis assemblages represent different species.

    Science.gov (United States)

    Xu, Feifei; Jerlström-Hultqvist, Jon; Andersson, Jan O

    2012-10-01

    Giardia intestinalis is a major cause of waterborne enteric disease in humans. The species is divided into eight assemblages suggested to represent separate Giardia species based on host specificities and the genetic divergence of marker genes. We have investigated whether genome-wide recombination occurs between assemblages using the three available G. intestinalis genomes. First, the relative nonsynonymous substitution rates were compared for 4,009 positional homologs. The vast majority of these comparisons indicate genetic isolation without interassemblage recombination. Only a region of 6 kbp suggests genetic exchange between assemblages A and E, followed by gene conversion events. Second, recombination-detecting software fails to identify within-gene recombination between the different assemblages for most of the homologs. Our results indicate a very low frequency of recombination between the syntenic core genes, suggesting that G. intestinalis assemblages are genetically isolated lineages and thus should be viewed as separate Giardia species.

  20. Classification and regression tree (CART) analyses of genomic signatures reveal sets of tetramers that discriminate temperature optima of archaea and bacteria

    Directory of Open Access Journals (Sweden)

    Betsey Dexter Dyer

    2008-01-01

    Classification and regression tree (CART) analysis was applied to genome-wide tetranucleotide frequencies (genomic signatures) of 195 archaea and bacteria. Although genomic signatures have typically been used to classify evolutionary divergence, in this study, convergent evolution was the focus. Temperature optima for most of the organisms examined could be distinguished by CART analyses of tetranucleotide frequencies. This suggests that pervasive (nonlinear) qualities of genomes may reflect certain environmental conditions (such as temperature) in which those genomes evolved. The predominant use of GAGA and AGGA as the discriminating tetramers in CART models suggests that purine-loading and codon biases of thermophiles may explain some of the results.
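
    The tetranucleotide frequencies that serve as input features for such a CART model can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' pipeline: the function name is hypothetical, overlapping windows are counted on a single strand only, and windows containing ambiguous bases are skipped. The abstract does not specify any of these choices.

```python
from collections import Counter
from itertools import product


def tetramer_frequencies(seq):
    """Return relative frequencies of all 256 tetranucleotides in seq.

    Assumptions (not taken from the source): overlapping windows, one strand,
    and windows with any non-ACGT character are ignored.
    """
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - 3):
        kmer = seq[i:i + 4]
        if all(base in "ACGT" for base in kmer):
            counts[kmer] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence has no unambiguous tetramers")
    # Emit every possible tetramer so that each genome yields a vector of equal length.
    all_tetramers = ("".join(p) for p in product("ACGT", repeat=4))
    return {kmer: counts[kmer] / total for kmer in all_tetramers}
```

    Each genome then becomes a 256-element feature vector; entries such as freqs["GAGA"] and freqs["AGGA"] correspond to the discriminating tetramers named in the abstract and could be passed to any decision-tree implementation.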

  1. Comparative genomic analyses of the cyanobacterium, Lyngbya aestuarii BL J, a powerful hydrogen producer.

    Directory of Open Access Journals (Sweden)

    Ankita Kothari

    2013-12-01

    The filamentous, non-heterocystous cyanobacterium Lyngbya aestuarii is an important contributor to marine intertidal microbial mat systems worldwide. The recent isolate L. aestuarii BL J is an unusually powerful hydrogen producer. Here we report a morphological, ultrastructural and genomic characterization of this strain to set the basis for future systems studies and applications of this organism. The filaments contain circa 17 μm wide trichomes, composed of stacked disk-like short cells (2 μm long), encased in a prominent, laminated exopolysaccharide sheath. Cellular division occurs by transversal centripetal growth of cross-walls, where several rounds of division proceed simultaneously. Filament division occurs by cell self-immolation of one or groups of cells (necridial cells) at the breakage point. Short, sheath-less, motile filaments (hormogonia) are also formed. Morphologically and phylogenetically L. aestuarii belongs to a clade of important cyanobacteria that includes members of the marine Trichodesmium and Hydrocoleum genera, as well as terrestrial Microcoleus vaginatus strains, and alkaliphilic strains of Arthrospira. A draft genome of strain BL J was compared to those of other cyanobacteria in order to ascertain some of its ecological constraints and biotechnological potential. The genome had an average GC content of 41.1%. Of the 6.87 Mb sequenced, 6.44 Mb was present as large contigs (>10,000 bp). It contained 6515 putative protein-encoding genes, of which 43% encode proteins of known functional role, 26% corresponded to proteins with domain or family assignments, 19.6% encode conserved hypothetical proteins, and 11.3% encode apparently unique hypothetical proteins. The strain’s genome reveals its adaptations to a life of exposure to intense solar radiation and desiccation. It likely employs the storage compounds, glycogen and cyanophycin but no polyhydroxyalkanoates, and can produce the osmolytes, trehalose and glycine

  2. Combined array CGH plus SNP genome analyses in a single assay for optimized clinical testing.

    Science.gov (United States)

    Wiszniewska, Joanna; Bi, Weimin; Shaw, Chad; Stankiewicz, Pawel; Kang, Sung-Hae L; Pursley, Amber N; Lalani, Seema; Hixson, Patricia; Gambin, Tomasz; Tsai, Chun-hui; Bock, Hans-Georg; Descartes, Maria; Probst, Frank J; Scaglia, Fernando; Beaudet, Arthur L; Lupski, James R; Eng, Christine; Cheung, Sau Wai; Bacino, Carlos; Patel, Ankita

    2014-01-01

    In clinical diagnostics, both array comparative genomic hybridization (array CGH) and single nucleotide polymorphism (SNP) genotyping have proven to be powerful genomic technologies utilized for the evaluation of developmental delay, multiple congenital anomalies, and neuropsychiatric disorders. Differences in the ability to resolve genomic changes between these arrays may constitute an implementation challenge for clinicians: which platform (SNP vs array CGH) might best detect the underlying genetic cause for the disease in the patient? While only SNP arrays enable the detection of copy number neutral regions of absence of heterozygosity (AOH), they have limited ability to detect single-exon copy number variants (CNVs) due to the distribution of SNPs across the genome. To provide comprehensive clinical testing for both CNVs and copy-neutral AOH, we enhanced our custom-designed high-resolution oligonucleotide array that has exon-targeted coverage of 1860 genes with 60,000 SNP probes, referred to as Chromosomal Microarray Analysis - Comprehensive (CMA-COMP). Of the 3240 cases evaluated by this array, clinically significant CNVs were detected in 445 cases including 21 cases with exonic events. In addition, 162 cases (5.0%) showed at least one AOH region >10 Mb. We demonstrate that even though this array has a lower density of SNP probes than other commercially available SNP arrays, it reliably detected AOH events >10 Mb as well as exonic CNVs beyond the detection limitations of SNP genotyping. Thus, combining SNP probes and exon-targeted array CGH into one platform provides clinically useful genetic screening in an efficient manner.

  3. UniPrimer: A Web-Based Primer Design Tool for Comparative Analyses of Primate Genomes

    Directory of Open Access Journals (Sweden)

    Nomin Batnyam

    2012-01-01

    Whole genome sequences of various primates have been released due to advanced DNA-sequencing technology. A combination of computational data mining and the polymerase chain reaction (PCR) assay to validate the data is an excellent method for conducting comparative genomics. Thus, designing primers for PCR is an essential procedure for a comparative analysis of primate genomes. Here, we developed and introduced UniPrimer for use in those studies. UniPrimer is a web-based tool that designs PCR and DNA-sequencing primers. It compares the sequences from six different primates (human, chimpanzee, gorilla, orangutan, gibbon, and rhesus macaque) and designs primers on the regions conserved across species. UniPrimer is linked to the RepeatMasker, Primer3Plus, and OligoCalc software to produce primers with high accuracy, and to UCSC In-Silico PCR to confirm whether the designed primers work. To test the performance of UniPrimer, we designed primers on sample sequences using UniPrimer and manually designed primers for the same sequences. The comparison of the two processes showed that UniPrimer was more effective than manual work in terms of saving time and reducing errors.

  4. Comprehensive Comparative Genomic and Transcriptomic Analyses of the Legume Genes Controlling the Nodulation Process.

    Science.gov (United States)

    Qiao, Zhenzhen; Pingault, Lise; Nourbakhsh-Rey, Mehrnoush; Libault, Marc

    2016-01-01

    Nitrogen is one of the most essential plant nutrients and one of the major factors limiting crop productivity. To achieve more sustainable agriculture, there is a need to maximize biological nitrogen fixation, a feature of legumes. To enhance our understanding of the molecular mechanisms controlling the interaction between legumes and rhizobia, the symbiotic partner that fixes and assimilates atmospheric nitrogen for the plant, researchers have taken advantage of genetic and genomic resources developed across different legume models (e.g., Medicago truncatula, Lotus japonicus, Glycine max, and Phaseolus vulgaris) to identify key regulatory protein-coding genes of the nodulation process. In this study, we present the results of a comprehensive comparative genomic analysis to highlight orthologous and paralogous relationships between the legume genes controlling nodulation. Mining large transcriptomic datasets, we also identified several orthologous and paralogous genes whose expression is induced during nodulation across legume plant species. This comprehensive study offers new insights into the evolution of the nodulation process in legume plants and will benefit the scientific community interested in the transfer of functional genomic information between species.

  5. Genome-wide pathway association studies of multiple correlated quantitative phenotypes using principle component analyses.

    Directory of Open Access Journals (Sweden)

    Feng Zhang

    Full Text Available Genome-wide pathway association studies provide novel insight into the biological mechanisms underlying complex diseases. Current pathway association studies primarily focus on a single important disease phenotype, which is sometimes insufficient to characterize the clinical manifestations of complex diseases. We present a multi-phenotype pathway association study (MPPAS) approach using principal component analysis (PCA). In our approach, PCA is first applied to multiple correlated quantitative phenotypes to extract a set of orthogonal phenotypic components. The extracted phenotypic components are then used for pathway association analysis instead of the original quantitative phenotypes. Four statistics were proposed for PCA-based MPPAS in this study. Simulations using real data from the HapMap project were conducted to evaluate the power and type I error rates of PCA-based MPPAS under various scenarios considering sample sizes and additive and interactive genetic effects. A real genome-wide association study data set of bone mineral density (BMD) at hip and spine was also analyzed by PCA-based MPPAS. Simulation studies illustrated the performance of PCA-based MPPAS for identifying the causal pathways underlying complex diseases. Genome-wide MPPAS of BMD detected associations between BMD and the KENNY_CTNNB1_TARGETS_UP and LONGEVITYPATHWAY pathways in this study. We aim to provide an applicable MPPAS approach, which may help to gain a deeper understanding of the potential biological mechanisms behind association results for complex diseases.
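
    The first step described above, extracting orthogonal components from correlated quantitative phenotypes, can be sketched as follows. This is a generic PCA via singular value decomposition, not the authors' code, and it does not implement the four MPPAS statistics, which the abstract does not define. The function name `phenotype_components` and the choice to standardize each trait are assumptions; NumPy is assumed to be installed, and every trait is assumed to have non-zero variance.

```python
import numpy as np


def phenotype_components(phenotypes):
    """Extract orthogonal phenotypic components from correlated traits.

    phenotypes: array-like of shape (n_samples, n_traits).
    Returns (scores, explained), where `scores` has the same shape as the
    input and holds one uncorrelated component per column, and `explained`
    is the fraction of total variance carried by each component.
    """
    X = np.asarray(phenotypes, dtype=float)
    # Standardize each trait so that measurement units do not dominate
    # (an illustrative choice; assumes no trait is constant).
    X = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    # SVD of the standardized matrix: the columns of U * S are the
    # principal component scores, ordered by decreasing variance.
    U, S, _Vt = np.linalg.svd(X, full_matrices=False)
    scores = U * S
    explained = S**2 / np.sum(S**2)
    return scores, explained
```

    In a PCA-based workflow of the kind described, the columns of `scores` would replace the original traits as the response variables in the downstream pathway association test.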

  6. Genome, Transcriptome, and Functional Analyses of Penicillium expansum Provide New Insights Into Secondary Metabolism and Pathogenicity.

    Science.gov (United States)

    Ballester, Ana-Rosa; Marcet-Houben, Marina; Levin, Elena; Sela, Noa; Selma-Lázaro, Cristina; Carmona, Lourdes; Wisniewski, Michael; Droby, Samir; González-Candelas, Luis; Gabaldón, Toni

    2015-03-01

    The relationship between secondary metabolism and infection in pathogenic fungi has remained largely elusive. The genus Penicillium comprises a group of plant pathogens with varying host specificities and with the ability to produce a wide array of secondary metabolites. The genomes of three Penicillium expansum strains, the main postharvest pathogen of pome fruit, and one Penicillium italicum strain, a postharvest pathogen of citrus fruit, were sequenced and compared with 24 other fungal species. A genomic analysis of gene clusters responsible for the production of secondary metabolites was performed. Putative virulence factors in P. expansum were identified by means of a transcriptomic analysis of apple fruits during the course of infection. Despite a major genome contraction, P. expansum is the Penicillium species with the largest potential for the production of secondary metabolites. Results using knockout mutants clearly demonstrated that neither patulin nor citrinin is required by P. expansum to successfully infect apples. Li et al. (MPMI-12-14-0398-FI) reported similar results and conclusions in their recently accepted paper.

  7. Genetic variants associated with subjective well-being, depressive symptoms, and neuroticism identified through genome-wide analyses

    DEFF Research Database (Denmark)

    Okbay, Aysu; Baselmans, Bart M L; De Neve, Jan-Emmanuel

    2016-01-01

    Very few genetic variants have been associated with depression and neuroticism, likely because of limitations on sample size in previous studies. Subjective well-being, a phenotype that is genetically correlated with both of these traits, has not yet been studied with genome-wide data. We conducted...... genome-wide association studies of three phenotypes: subjective well-being (n = 298,420), depressive symptoms (n = 161,460), and neuroticism (n = 170,911). We identify 3 variants associated with subjective well-being, 2 variants associated with depressive symptoms, and 11 variants associated...... with neuroticism, including 2 inversion polymorphisms. The two loci associated with depressive symptoms replicate in an independent depression sample. Joint analyses that exploit the high genetic correlations between the phenotypes (|ρ̂| ≈ 0.8) strengthen the overall credibility of the findings and allow us...

  8. Combined genomic and structural analyses of a cultured magnetotactic bacterium reveals its niche adaptation to a dynamic environment

    Directory of Open Access Journals (Sweden)

    Ana Carolina Vieira Araujo

    2016-10-01

    Full Text Available Abstract Background Magnetotactic bacteria (MTB) are a unique group of prokaryotes that have a potentially high impact on global geochemical cycling of significant primary elements because of their metabolic plasticity and their ability to biomineralize iron-rich magnetic particles called magnetosomes. Understanding the genetic composition of the few cultivated MTB, along with the unique morphological features of this group of bacteria, may provide an important framework for discerning their potential biogeochemical roles in natural environments. Results Genomic and ultrastructural analyses were combined to characterize the cultivated magnetotactic coccus Magnetofaba australis strain IT-1. Cells of this species synthesize a single chain of elongated, cuboctahedral magnetite (Fe3O4) magnetosomes that cause them to align along magnetic field lines while they swim, propelled by two bundles of flagella at velocities up to 300 μm s−1. High-speed microscopy imaging showed that the cells move in a straight line rather than in the helical trajectory described for other magnetotactic cocci. Specific genes within the genome of Mf. australis strain IT-1 suggest the strain is capable of nitrogen fixation, sulfur reduction and oxidation, synthesis of intracellular polyphosphate granules, and iron transport with low and high affinity. Mf. australis strain IT-1 and Magnetococcus marinus strain MC-1 are closely related phylogenetically, although similarity values between their homologous proteins are not very high. Conclusion Mf. australis strain IT-1 inhabits a constantly changing environment, and its complete genome sequence reveals great metabolic plasticity for dealing with these changes. Aside from its chemoautotrophic and chemoheterotrophic metabolism, genomic data indicate the cells are capable of nitrogen fixation, possess high- and low-affinity iron transporters, and might be capable of reducing and oxidizing a number of sulfur compounds. The relatively

  9. Protein disorder in plants: a view from the chloroplast

    Directory of Open Access Journals (Sweden)

    Yruela Inmaculada

    2012-09-01

    Full Text Available Abstract Background The intrinsically unstructured state of some proteins, observed in all living organisms, is essential for basic cellular functions. In this field the available information from plants is limited, but a point has been reached where these proteins can be comprehensively classified on the basis of disorder, function and evolution. Results Our analysis of plant genomes confirms that nuclear-encoded proteins follow the same trend as those of other multicellular eukaryotes; however, chloroplast- and mitochondria-encoded proteins conserve the patterns of Archaea and Bacteria, in agreement with their phylogenetic origin. Based on current knowledge about gene transfer from the chloroplast to the nucleus, we report a strong correlation between the rate of disorder of transferred and nuclear-encoded proteins, even for polypeptides that play functional roles back in the chloroplast. We further investigate this trend by reviewing the set of chloroplast ribosomal proteins, one of the most representative transferred gene clusters, finding that the ribosomal large subunit, assembled from a majority of nuclear-encoded proteins, is clearly more unstructured than the small one, which integrates mostly plastid-encoded proteins. Conclusions Our observations suggest that the evolutionary dynamics of the plant nucleus adds disordered segments to genes alike, regardless of their origin, with the notable exception of proteins currently encoded in both genomes, probably due to functional constraints.

  10. Streptococcal taxonomy based on genome sequence analyses [v1; ref status: indexed, http://f1000r.es/o1]

    Directory of Open Access Journals (Sweden)

    Cristiane C Thompson

    2013-03-01

    Full Text Available The identification of the clinically relevant viridans streptococci group, at species level, is still problematic. The aim of this study was to extract taxonomic information from the complete genome sequences of 67 streptococci, comprising 19 species, by means of genomic analyses, multilocus sequence analysis (MLSA), average amino acid identity (AAI), genomic signatures, genome-to-genome distances (GGD) and codon usage bias. We then attempted to determine the usefulness of these genomic tools for species identification in streptococci. Our results showed that MLSA, AAI and GGD analyses are robust markers to identify streptococci at the species level, for instance, S. pneumoniae, S. mitis, and S. oralis. A Streptococcus species can be defined as a group of strains that share ≥ 95% DNA similarity in MLSA and AAI, and > 70% DNA identity in GGD. This approach allows an advanced understanding of bacterial diversity.
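
    The species definition stated in the abstract can be written as a simple pairwise check. This is only a sketch of the thresholds as quoted (≥ 95% in MLSA and AAI, > 70% in GGD); the function name `same_species` and the assumption that all three values are already available as percentages for a pair of strains are illustrative, and the computation of MLSA, AAI, and GGD values themselves is not shown.

```python
def same_species(mlsa: float, aai: float, ggd: float) -> bool:
    """Apply the thresholds quoted in the abstract to one pair of strains.

    mlsa, aai: percent similarity (0-100); the rule requires >= 95.
    ggd: percent identity from genome-to-genome distance (0-100);
         the rule requires strictly more than 70.
    """
    return mlsa >= 95.0 and aai >= 95.0 and ggd > 70.0


# Hypothetical values for illustration only, not data from the study.
print(same_species(mlsa=97.2, aai=96.5, ggd=81.0))  # meets all three thresholds
print(same_species(mlsa=93.0, aai=96.5, ggd=81.0))  # fails the MLSA threshold
```

    Note the asymmetry between the inclusive 95% bounds and the strict 70% bound, which follows the wording of the abstract.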

  11. Genome-wide association analyses for fatty acid composition in porcine muscle and abdominal fat tissues.

    Directory of Open Access Journals (Sweden)

    Bin Yang

    Full Text Available Fatty acid composition is an important phenotypic trait in pigs as it affects nutritional, technical and sensory quality of pork. Here, we reported a genome-wide association study (GWAS) for fatty acid composition in the longissimus muscle and abdominal fat tissues of 591 White Duroc×Erhualian F2 animals and in muscle samples of 282 Chinese Sutai pigs. A total of 46 loci surpassing the suggestive significance level were identified on 15 pig chromosomes (SSC) for 12 fatty acids, revealing the complex genetic architecture of fatty acid composition in pigs. Of the 46 loci, 15 on SSC5, 7, 14 and 16 reached the genome-wide significance level. The two most significant SNPs were ss131535508 (P = 2.48×10^-25) at 41.39 Mb on SSC16 for C20:0 in abdominal fat and ss478935891 (P = 3.29×10^-13) at 121.31 Mb on SSC14 for muscle C18:0. A meta-analysis of GWAS identified 4 novel loci and enhanced the association strength at 6 loci compared to those evidenced in a single population, suggesting the presence of common underlying variants. The longissimus muscle and abdominal fat showed consistent association profiles at most of the identified loci and distinct association signals at several loci. All loci have specific effects on fatty acid composition, except for two loci on SSC4 and SSC7 affecting multiple fatness traits. Several promising candidate genes were found in the neighboring regions of the lead SNPs at the genome-wide significant loci, such as SCD for C18:0 and C16:1 on SSC14 and ELOVL7 for C20:0 on SSC16. The findings provide insights into the molecular basis of fatty acid composition in pigs, and would benefit the final identification of the underlying mutations.

  12. Lineage-specific evolution of the vertebrate Otopetrin gene family revealed by comparative genomic analyses

    Directory of Open Access Journals (Sweden)

    Ryan Joseph F

    2011-01-01

    Full Text Available Abstract Background Mutations in the Otopetrin 1 gene (Otop1) in mice and fish produce an unusual bilateral vestibular pathology that involves the absence of otoconia without hearing impairment. The encoded protein, Otop1, is the only functionally characterized member of the Otopetrin Domain Protein (ODP) family; the extended sequence and structural preservation of ODP proteins in metazoans suggest a conserved functional role. Here, we use the tools of sequence- and cytogenetic-based comparative genomics to study the Otop1 and the Otop2-Otop3 genes and to establish their genomic context in 25 vertebrates. We extend our evolutionary study to include the gene mutated in Usher syndrome (USH) subtype 1G (Ush1g), both because of the head-to-tail clustering of Ush1g with Otop2 and because Otop1 and Ush1g mutations result in inner ear phenotypes. Results We established that OTOP1 is the boundary gene of an inversion polymorphism on human chromosome 4p16 that originated in the common human-chimpanzee lineage more than 6 million years ago. Other lineage-specific evolutionary events included a three-fold expansion of the Otop genes in Xenopus tropicalis and of Ush1g in teleost fish. The tight physical linkage between Otop2 and Ush1g is conserved in all vertebrates. To further understand the functional organization of the Ush1g-Otop2 locus, we deduced a putative map of binding sites for CCCTC-binding factor (CTCF), a mammalian insulator transcription factor, from genome-wide chromatin immunoprecipitation-sequencing (ChIP-seq) data in mouse and human embryonic stem (ES) cells combined with detection of CTCF-binding motifs. Conclusions The results presented here clarify the evolutionary history of the vertebrate Otop and Ush1g families, and establish a framework for studying the possible interaction(s) of Ush1g and Otop in developmental pathways.

  13. Genome-wide meta-analyses identify three loci associated with primary biliary cirrhosis

    Science.gov (United States)

    Liu, Xiangdong; Invernizzi, Pietro; Lu, Yue; Kosoy, Roman; Lu, Yan; Bianchi, Ilaria; Podda, Mauro; Xu, Chun; Xie, Gang; Macciardi, Fabio; Selmi, Carlo; Lupoli, Sara; Shigeta, Russell; Ransom, Michael; Lleo, Ana; Lee, Annette T; Mason, Andrew L; Myers, Robert P; Peltekian, Kevork M; Ghent, Cameron N; Bernuzzi, Francesca; Zuin, Massimo; Rosina, Floriano; Borghesio, Elisabetta; Floreani, Annarosa; Lazzari, Roberta; Niro, Grazia; Andriulli, Angelo; Muratori, Luigi; Muratori, Paolo; Almasio, Piero L; Andreone, Pietro; Margotti, Marzia; Brunetto, Maurizia; Coco, Barbara; Alvaro, Domenico; Bragazzi, Maria C; Marra, Fabio; Pisano, Alessandro; Rigamonti, Cristina; Colombo, Massimo; Marzioni, Marco; Benedetti, Antonio; Fabris, Luca; Strazzabosco, Mario; Portincasa, Piero; Palmieri, Vincenzo O; Tiribelli, Claudio; Croce, Lory; Bruno, Savino; Rossi, Sonia; Vinci, Maria; Prisco, Cleofe; Mattalia, Alberto; Toniutto, Pierluigi; Picciotto, Antonio; Galli, Andrea; Ferrari, Carlo; Colombo, Silvia; Casella, Giovanni; Morini, Lorenzo; Caporaso, Nicola; Colli, Agostino; Spinzi, Giancarlo; Montanari, Renzo; Gregersen, Peter K; Heathcote, E Jenny; Hirschfield, Gideon M; Siminovitch, Katherine A; Amos, Christopher I; Gershwin, M Eric; Seldin, Michael F

    2011-01-01

    A genome-wide association screen for primary biliary cirrhosis risk alleles was performed in an Italian cohort. The results from the Italian cohort replicated IL12A and IL12RB associations, and a combined meta-analysis using a Canadian dataset identified newly associated loci at SPIB (P = 7.9 × 10^-11, odds ratio (OR) = 1.46), IRF5-TNPO3 (P = 2.8 × 10^-10, OR = 1.63) and 17q12-21 (P = 1.7 × 10^-10, OR = 1.38). PMID:20639880

  14. A high-throughput method for detection of DNA in chloroplasts using flow cytometry

    Directory of Open Access Journals (Sweden)

    Oldenburg Delene J

    2007-03-01

    Full Text Available Abstract Background The amount of DNA in the chloroplasts of some plant species has been shown recently to decline dramatically during leaf development. A high-throughput method of DNA detection in chloroplasts is now needed in order to facilitate the further investigation of this process using large numbers of tissue samples. Results The DNA-binding fluorophores 4',6-diamidino-2-phenylindole (DAPI), SYBR Green I (SG), SYTO 42, and SYTO 45 were assessed for their utility in flow cytometric analysis of DNA in Arabidopsis chloroplasts. Fluorescence microscopy and real-time quantitative PCR (qPCR) were used to validate flow cytometry data. We found neither DAPI nor SYTO 45 suitable for flow cytometric analysis of chloroplast DNA (cpDNA) content, but did find changes in cpDNA content during development by flow cytometry using SG and SYTO 42. The latter dye provided more sensitive detection, and the results were similar to those from the fluorescence microscopic analysis. Differences in SYTO 42 fluorescence were found to correlate with differences in cpDNA content as determined by qPCR using three primer sets widely spaced across the chloroplast genome, suggesting that the whole genome undergoes copy number reduction during development, rather than selective reduction/degradation of subgenomic regions. Conclusion Flow cytometric analysis of chloroplasts stained with SYTO 42 is a high-throughput method suitable for determining changes in cpDNA content during development and for sorting chloroplasts on the basis of DNA content.

  15. Comparison of analyses of the XVth QTLMAS common dataset III: Genomic Estimations of Breeding Values

    Directory of Open Access Journals (Sweden)

    Demeure Olivier

    2012-05-01

    Full Text Available Abstract Background The QTLMAS XVth dataset consisted of pedigree, marker genotypes and quantitative trait performances of animals with a sib family structure. Pedigree and genotypes concerned 3,000 progeny, of which 2,000 were phenotyped. The trait was regulated by 8 QTLs which displayed additive, imprinting or epistatic effects. The 1,000 unphenotyped progeny were considered as candidates for selection, and their Genomic Estimated Breeding Values (GEBV) were evaluated by participants of the XVth QTLMAS workshop. This paper aims at comparing the GEBV estimation results obtained by seven participants in the workshop. Methods From the known QTL genotypes of each candidate, two "true" genomic values (TV) were estimated by the organizers: the genotypic value of the candidate (TGV) and the expectation of its progeny genotypic values (TBV). GEBV were computed by the participants following different statistical methods: random linear models (including BLUP and Ridge Regression), variable selection techniques (LASSO, Elastic Net) and Bayesian methods. Accuracy was evaluated by the correlation between TV (TGV or TBV) and the GEBV presented by participants. Rank correlation of the best 10% of individuals and error in predictions were also evaluated. Bias was tested by regression of TV on GEBV. Results Large differences between methods were found for all criteria and types of genetic values (TGV, TBV). In general, the criteria consistently ranked methods belonging to the same family. Conclusions Bayesian methods - A
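
    Two of the evaluation criteria named above, accuracy as the correlation between TV and GEBV and bias as the regression of TV on GEBV, can be sketched in a few lines. This is an illustrative implementation, not the workshop organizers' code; the function name `accuracy_and_bias` is hypothetical, and reading a regression slope of 1 as "unbiased" is the usual convention in genomic evaluation rather than something stated in the abstract.

```python
from math import sqrt
from typing import Sequence, Tuple


def accuracy_and_bias(tv: Sequence[float], gebv: Sequence[float]) -> Tuple[float, float]:
    """Return (accuracy, slope) for one set of predictions.

    accuracy: Pearson correlation between true values (TV) and GEBV.
    slope:    coefficient of the least-squares regression of TV on GEBV;
              values far from 1 suggest over- or under-dispersed predictions.
    Assumes neither input is constant.
    """
    n = len(tv)
    if n != len(gebv) or n < 2:
        raise ValueError("tv and gebv must have the same length (at least 2)")
    mean_tv = sum(tv) / n
    mean_gebv = sum(gebv) / n
    # Centered sums of squares and cross-products.
    sxy = sum((g - mean_gebv) * (t - mean_tv) for g, t in zip(gebv, tv))
    sxx = sum((g - mean_gebv) ** 2 for g in gebv)
    syy = sum((t - mean_tv) ** 2 for t in tv)
    accuracy = sxy / sqrt(sxx * syy)
    slope = sxy / sxx  # TV is the response, GEBV the predictor
    return accuracy, slope
```

    Predictions that rank candidates perfectly but are inflated by a constant factor would give an accuracy of 1 with a slope below 1, which is why the two criteria are reported separately.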

  16. Genomic translational research: Paving the way to individualized cardiac functional analyses and personalized cardiology.

    Science.gov (United States)

    Pasipoularides, Ares

    2017-03-01

    For most of Medicine's past, the best that physicians could do to cope with disease prevention and treatment was based on the expected response of an average patient. Currently, however, a more personalized/precise approach to cardiology and medicine in general is becoming possible, as the cost of sequencing a human genome has declined substantially. As a result, we are witnessing an era of precipitous advances in biomedicine and burgeoning understanding of the genetic basis of cardiovascular and other diseases, reminiscent of the resurgence of innovations in physico-mathematical sciences and biology-anatomy-cardiology in the Renaissance, a parallel time of radical change and reformation of medical knowledge, education and practice. Now on the horizon is an individualized, diverse, patient-centered approach to medical practice that encompasses the development of new, gene-based diagnostics and preventive medicine tactics, and offers the broadest range of personalized therapies based on pharmacogenetics. Over time, translation of genomic and high-tech approaches unquestionably will transform clinical practice in cardiology and medicine as a whole, with the adoption of new personalized medicine approaches and procedures. Clearly, future prospects far outweigh present accomplishments, which are best viewed as a promising start. It is now essential for pluridisciplinary health care providers to examine the drivers and barriers to the clinical adoption of this emerging revolutionary paradigm, in order to expedite the realization of its potential. So, we are not there yet, but we are definitely on our way.

  17. Intrinsic disorder in Viral Proteins Genome-Linked: experimental and predictive analyses

    Directory of Open Access Journals (Sweden)

    Van Dorsselaer Alain

    2009-02-01

    Full Text Available Abstract Background VPgs are viral proteins linked to the 5' end of some viral genomes. Interactions between several VPgs and eukaryotic translation initiation factors eIF4Es are critical for plant infection. However, VPgs are not restricted to phytoviruses, being also involved in genome replication and protein translation of several animal viruses. To date, structural data are still limited to small picornaviral VPgs. Recently three phytoviral VPgs were shown to be natively unfolded proteins. Results In this paper, we report the bacterial expression, purification and biochemical characterization of two phytoviral VPgs, namely the VPgs of Rice yellow mottle virus (RYMV, genus Sobemovirus) and Lettuce mosaic virus (LMV, genus Potyvirus). Using far-UV circular dichroism and size exclusion chromatography, we show that RYMV and LMV VPgs are predominantly or partly unstructured in solution, respectively. Using several disorder predictors, we show that both proteins are predicted to possess disordered regions. We next extend these results to 14 VPgs representative of the viral diversity. Disordered regions were predicted in all VPg sequences regardless of genus and family. Conclusion Based on these results, we propose that intrinsic disorder is a common feature of VPgs. The functional role of intrinsic disorder is discussed in light of the biological roles of VPgs.

  18. Genome-Wide Methylome Analyses Reveal Novel Epigenetic Regulation Patterns in Schizophrenia and Bipolar Disorder

    Science.gov (United States)

    Li, Yongsheng; Camarillo, Cynthia; Xu, Juan; Arana, Tania Bedard; Xiao, Yun; Zhao, Zheng; Chen, Hong; Ramirez, Mercedes; Zavala, Juan; Escamilla, Michael A.; Armas, Regina; Mendoza, Ricardo; Ontiveros, Alfonso; Nicolini, Humberto; Jerez Magaña, Alvaro Antonio; Rubin, Lewis P.; Li, Xia; Xu, Chun

    2015-01-01

    Schizophrenia (SZ) and bipolar disorder (BP) are complex genetic disorders. Their appearance is also likely informed by as yet only partially described epigenetic contributions. Using a sequencing-based method for genome-wide analysis, we quantitatively compared the blood DNA methylation landscapes of SZ and BP subjects with those of controls, both drawn from an understudied population, Hispanics along the US-Mexico border. Remarkably, we identified thousands of differentially methylated regions for SZ and BP preferentially located in promoters, 3′-UTRs and 5′-UTRs of genes. Distinct patterns of aberrant methylation of promoter sequences were located surrounding transcription start sites. In these instances, aberrant methylation occurred in CpG islands (CGIs), in flanking regions, and in CGI-sparse promoters. Pathway analysis of genes displaying these distinct aberrant promoter methylation patterns showed enhancement of epigenetic changes in numerous genes previously related to psychiatric disorders and neurodevelopment. Integration of gene expression data further suggests that in SZ aberrant promoter methylation is significantly associated with altered gene transcription. In particular, we found significant associations between (1) promoter CGI hypermethylation and gene repression and (2) CGI 3′-shore hypomethylation and increased gene expression. Finally, we constructed a specific methylation analysis platform that facilitates viewing and comparing aberrant genome methylation in human neuropsychiatric disorders. PMID:25734057

  19. Genome-wide methylome analyses reveal novel epigenetic regulation patterns in schizophrenia and bipolar disorder.

    Science.gov (United States)

    Li, Yongsheng; Camarillo, Cynthia; Xu, Juan; Arana, Tania Bedard; Xiao, Yun; Zhao, Zheng; Chen, Hong; Ramirez, Mercedes; Zavala, Juan; Escamilla, Michael A; Armas, Regina; Mendoza, Ricardo; Ontiveros, Alfonso; Nicolini, Humberto; Magaña, Alvaro Antonio Jerez; Rubin, Lewis P; Li, Xia; Xu, Chun

    2015-01-01

    Schizophrenia (SZ) and bipolar disorder (BP) are complex genetic disorders. Their appearance is also likely informed by as yet only partially described epigenetic contributions. Using a sequencing-based method for genome-wide analysis, we quantitatively compared the blood DNA methylation landscapes of SZ and BP subjects with those of controls, both drawn from an understudied population, Hispanics along the US-Mexico border. Remarkably, we identified thousands of differentially methylated regions for SZ and BP preferentially located in promoters, 3'-UTRs and 5'-UTRs of genes. Distinct patterns of aberrant methylation of promoter sequences were located surrounding transcription start sites. In these instances, aberrant methylation occurred in CpG islands (CGIs), in flanking regions, and in CGI-sparse promoters. Pathway analysis of genes displaying these distinct aberrant promoter methylation patterns showed enhancement of epigenetic changes in numerous genes previously related to psychiatric disorders and neurodevelopment. Integration of gene expression data further suggests that in SZ aberrant promoter methylation is significantly associated with altered gene transcription. In particular, we found significant associations between (1) promoter CGI hypermethylation and gene repression and (2) CGI 3'-shore hypomethylation and increased gene expression. Finally, we constructed a specific methylation analysis platform that facilitates viewing and comparing aberrant genome methylation in human neuropsychiatric disorders.

  20. Genome-Wide Methylome Analyses Reveal Novel Epigenetic Regulation Patterns in Schizophrenia and Bipolar Disorder

    Directory of Open Access Journals (Sweden)

    Yongsheng Li

    2015-01-01

    Full Text Available Schizophrenia (SZ) and bipolar disorder (BP) are complex genetic disorders. Their appearance is also likely informed by as yet only partially described epigenetic contributions. Using a sequencing-based method for genome-wide analysis, we quantitatively compared the blood DNA methylation landscapes of SZ and BP subjects with those of controls, both drawn from an understudied population, Hispanics along the US-Mexico border. Remarkably, we identified thousands of differentially methylated regions for SZ and BP preferentially located in promoters, 3′-UTRs and 5′-UTRs of genes. Distinct patterns of aberrant methylation of promoter sequences were located surrounding transcription start sites. In these instances, aberrant methylation occurred in CpG islands (CGIs), in flanking regions, and in CGI-sparse promoters. Pathway analysis of genes displaying these distinct aberrant promoter methylation patterns showed enhancement of epigenetic changes in numerous genes previously related to psychiatric disorders and neurodevelopment. Integration of gene expression data further suggests that in SZ aberrant promoter methylation is significantly associated with altered gene transcription. In particular, we found significant associations between (1) promoter CGI hypermethylation and gene repression and (2) CGI 3′-shore hypomethylation and increased gene expression. Finally, we constructed a specific methylation analysis platform that facilitates viewing and comparing aberrant genome methylation in human neuropsychiatric disorders.

  1. Integrated Genomic and Network-Based Analyses of Complex Diseases and Human Disease Network.

    Science.gov (United States)

    Al-Harazi, Olfat; Al Insaif, Sadiq; Al-Ajlan, Monirah A; Kaya, Namik; Dzimiri, Nduna; Colak, Dilek

    2016-06-20

    A disease phenotype generally reflects various pathobiological processes that interact in a complex network. The highly interconnected nature of the human protein interaction network (interactome) indicates that, at the molecular level, it is difficult to consider diseases as being independent of one another. Recently, genome-wide molecular measurements, data mining and bioinformatics approaches have provided the means to explore human diseases from a molecular basis. The exploration of diseases and a system of disease relationships based on the integration of genome-wide molecular data with the human interactome could offer a powerful perspective for understanding the molecular architecture of diseases. Recently, subnetwork markers have proven to be more robust and reliable than individual biomarker genes selected based on gene expression profiles alone, and achieve higher accuracy in disease classification. We have applied one of these methodologies to idiopathic dilated cardiomyopathy (IDCM) data that we have generated using a microarray and identified significant subnetworks associated with the disease. In this paper, we review the recent endeavours in this direction, and summarize the existing methodologies and computational tools for network-based analysis of complex diseases and molecular relationships among apparently different disorders and human disease network. We also discuss the future research trends and topics of this promising field.

  2. Analysis of protein interactions at native chloroplast membranes by ellipsometry.

    Directory of Open Access Journals (Sweden)

    Verena Kriechbaumer

    Full Text Available Membrane bound receptors play vital roles in cell signaling, and are the target for many drugs, yet their interactions with ligands are difficult to study by conventional techniques due to the technical difficulty of monitoring these interactions in lipid environments. In particular, the ability to analyse the behaviour of membrane proteins in their native membrane environment is limited. Here, we have developed a quantitative approach to detect specific interactions between low-abundance chaperone receptors within native chloroplast membranes and their soluble chaperone partners. Langmuir-Schaefer film deposition was used to deposit native chloroplasts onto gold-coated glass slides, and interactions between the molecular chaperones Hsp70 and Hsp90 and their receptors in the chloroplast membranes were detected and quantified by total internal reflection ellipsometry (TIRE. We show that native chloroplast membranes deposited on gold-coated glass slides using Langmuir-Schaefer films retain functional receptors capable of binding chaperones with high specificity and affinity. Taking into account the low chaperone receptor abundance in native membranes, these binding properties are consistent with data generated using soluble forms of the chloroplast chaperone receptors, OEP61 and Toc64. Therefore, we conclude that chloroplasts have the capacity to selectively bind chaperones, consistent with the notion that chaperones play an important role in protein targeting to chloroplasts. Importantly, this method of monitoring by TIRE does not require any protein labelling. This novel combination of techniques should be applicable to a wide variety of membranes and membrane protein receptors, thus presenting the opportunity to quantify protein interactions involved in fundamental cellular processes, and to screen for drugs that target membrane proteins.

  3. Post-genomic analyses of fungal lignocellulosic biomass degradation reveal the unexpected potential of the plant pathogen Ustilago maydis

    Directory of Open Access Journals (Sweden)

    Couturier Marie

    2012-02-01

    Full Text Available Abstract Background Filamentous fungi are potent biomass degraders due to their ability to thrive in ligno(hemi)cellulose-rich environments. During the last decade, fungal genome sequencing initiatives have yielded abundant information on the genes that are putatively involved in lignocellulose degradation. At present, additional experimental studies are essential to provide insights into the fungal secreted enzymatic pools involved in lignocellulose degradation. Results In this study, we performed a wide analysis of 20 filamentous fungi for which genomic data are available to investigate their biomass-hydrolysis potential. A comparison of fungal genomes and secretomes using enzyme activity profiling revealed discrepancies in the carbohydrate-active enzyme (CAZyme) sets dedicated to the plant cell wall. Investigation of the contribution made by each secretome to the saccharification of wheat straw demonstrated that most of them individually supplemented the industrial Trichoderma reesei CL847 enzymatic cocktail. Unexpectedly, the most striking effect was obtained with the phytopathogen Ustilago maydis, which improved the release of total sugars by 57% and of glucose by 22%. Proteomic analyses of the best-performing secretomes indicated a specific enzymatic mechanism of U. maydis that is likely to involve oxido-reductases and hemicellulases. Conclusion This study provides insight into the lignocellulose-degradation mechanisms of filamentous fungi and allows for the identification of a number of enzymes that are potentially useful to further improve the industrial lignocellulose bioconversion process.

  4. Volatile terpenes from actinomycetes: a biosynthetic study correlating chemical analyses to genome data.

    Science.gov (United States)

    Rabe, Patrick; Citron, Christian A; Dickschat, Jeroen S

    2013-11-25

    The volatile terpenes of 24 actinomycetes whose genomes have been sequenced (or are currently being sequenced) were collected by use of a closed-loop stripping apparatus and identified by GC/MS. The analytical data were compared against a phylogenetic analysis of all 192 currently available sequences of bacterial terpene cyclases (excluding geosmin and 2-methylisoborneol synthases). In addition to the several groups of terpenes with known biosynthetic origin, selinadienes were identified as a large group of biosynthetically related sesquiterpenes that are produced by several streptomycetes. The detection of a large number of previously unrecognised side products of known terpene cyclases proved to be particularly important for an in depth understanding of biosynthetic pathways to known terpenes in actinomycetes. Interpretation of the chemical analytical data in the context of the phylogenetic tree of bacterial terpene cyclases pointed to the function of three new enzymes: (E)-β-caryophyllene synthase, selina-3,7(11)-diene synthase and aristolochene synthase.

  5. Genomic DNA restriction endonuclease from Pasteurella multocida isolated from Indonesia, katha strain and reference strains and analysed by PFGE

    Directory of Open Access Journals (Sweden)

    Supar

    2003-10-01

    Full Text Available Pasteurella multocida strains cause disease in a wide range of domestic and wild animals in Indonesia. The most important serotypes are associated with haemorrhagic septicaemia (HS) in cattle and buffaloes and with cholera in ducks and chickens. HS associated with P. multocida in large ruminants in Indonesia is controlled by killed whole-cell vaccines produced from the P. multocida Katha strain. No discriminatory molecular biology technique has yet been applied to investigate P. multocida isolates from different geographic locations in Indonesia. The purpose of this study was to observe the genetic diversity among P. multocida isolates from various geographic locations and to compare them with the Katha vaccine strain and other reference strains. A total of 38 isolates and strains of P. multocida were analysed by means of pulsed-field gel electrophoresis (PFGE). Each sample was grown in nutrient broth, and cells were separated by centrifugation. The whole-cell pellet was mixed with agarose and cast into agarose plugs. The genomic DNA of each sample was digested in situ (in the plug) with the restriction endonuclease ApaI and/or BamHI. The digested genomic DNA of each sample was analysed by PFGE, and the restriction profile of each sample was compared with the others. ApaI digestion followed by PFGE showed that 34 of the 38 P. multocida samples could be differentiated into 16 ApaI types, whereas BamHI digestion differentiated these samples into 20 BamHI types. The genomic DNA restriction patterns of Indonesian P. multocida isolates from cattle and buffaloes with haemorrhagic septicaemia differed from those of the Katha vaccine strain, the poultry strains and the reference strains currently kept at the Balitvet Culture Collection (BCC) unit. Two P. multocida isolates derived from ducks with cholera

  6. Genetic and molecular analyses of PEG10 reveal new aspects of genomic organization, transcription and translation.

    Science.gov (United States)

    Lux, Heike; Flammann, Heiko; Hafner, Mathias; Lux, Andreas

    2010-01-13

    The paternally expressed gene PEG10 is a retrotransposon-derived gene adapted through mammalian evolution located on human chromosome 7q21. PEG10 codes for at least two proteins, PEG10-RF1 and PEG10-RF1/2, by -1 frameshift translation. Overexpression or reinduced PEG10 expression was seen in malignancies, like hepatocellular carcinoma or B-cell acute and chronic lymphocytic leukemia. PEG10 was also shown to promote adipocyte differentiation. Experimental evidence suggests that the PEG10-RF1 protein is an inhibitor of apoptosis and mediates cell proliferation. Here we present new data on the genomic organization of PEG10 by identifying the major transcription start site and a new splice variant, and report the cloning and analysis of 1.9 kb of the PEG10 promoter. Furthermore, we show for the first time that PEG10 translation is initiated at a non-AUG start codon upstream of the previously predicted AUG codon as well as at the AUG codon. The finding that PEG10 translation is initiated at different sites adds a new aspect to the already interesting feature of PEG10's -1 frameshift translation mechanism. It is now important to unravel the cellular functions of the PEG10 protein variants and how they are related to normal or pathological conditions. The generated promoter-reporter constructs can be used for future studies to investigate how PEG10 expression is regulated. In summary, our study provides new data on the genomic organization as well as expression and translation of PEG10, a prerequisite in order to study and understand the role of PEG10 in cancer, embryonic development and normal cell homeostasis.

  7. Genetic and molecular analyses of PEG10 reveal new aspects of genomic organization, transcription and translation.

    Directory of Open Access Journals (Sweden)

    Heike Lux

    Full Text Available The paternally expressed gene PEG10 is a retrotransposon-derived gene adapted through mammalian evolution located on human chromosome 7q21. PEG10 codes for at least two proteins, PEG10-RF1 and PEG10-RF1/2, by -1 frameshift translation. Overexpression or reinduced PEG10 expression was seen in malignancies, like hepatocellular carcinoma or B-cell acute and chronic lymphocytic leukemia. PEG10 was also shown to promote adipocyte differentiation. Experimental evidence suggests that the PEG10-RF1 protein is an inhibitor of apoptosis and mediates cell proliferation. Here we present new data on the genomic organization of PEG10 by identifying the major transcription start site and a new splice variant, and report the cloning and analysis of 1.9 kb of the PEG10 promoter. Furthermore, we show for the first time that PEG10 translation is initiated at a non-AUG start codon upstream of the previously predicted AUG codon as well as at the AUG codon. The finding that PEG10 translation is initiated at different sites adds a new aspect to the already interesting feature of PEG10's -1 frameshift translation mechanism. It is now important to unravel the cellular functions of the PEG10 protein variants and how they are related to normal or pathological conditions. The generated promoter-reporter constructs can be used for future studies to investigate how PEG10 expression is regulated. In summary, our study provides new data on the genomic organization as well as expression and translation of PEG10, a prerequisite in order to study and understand the role of PEG10 in cancer, embryonic development and normal cell homeostasis.

  8. Genome-wide association analyses identify SPOCK as a key novel gene underlying age at menarche.

    Directory of Open Access Journals (Sweden)

    Yao-Zhong Liu

    2009-03-01

    Full Text Available For females, menarche is a most significant physiological event. Age at menarche (AAM) is a trait with high genetic determination and is associated with major complex diseases in women. However, specific genes for AAM variation are largely unknown. To identify genetic factors underlying AAM variation, a genome-wide association study (GWAS) examining about 380,000 SNPs was conducted in 477 Caucasian women. A follow-up replication study was performed to validate our major GWAS findings using two independent Caucasian cohorts with 854 siblings and 762 unrelated subjects, respectively, and one Chinese cohort of 1,387 unrelated subjects, all females. Our GWAS identified a novel gene, SPOCK (Sparc/Osteonectin, CWCV, and Kazal-like domains proteoglycan), which had seven SNPs associated with AAM with genome-wide false discovery rate (FDR) q<0.05. Six most significant SNPs of the gene were selected for validation in three independent replication cohorts. All of the six SNPs were replicated in at least one cohort. In particular, SNPs rs13357391 and rs1859345 were replicated both within and across different ethnic groups in all three cohorts, with p values of 5.09 x 10(-3) and 4.37 x 10(-3), respectively, in the Chinese cohort and combined p values (obtained by Fisher's method) of 5.19 x 10(-5) and 1.02 x 10(-4), respectively, in all three replication cohorts. Interestingly, SPOCK can inhibit activation of MMP-2 (matrix metalloproteinase-2), a key factor promoting endometrial menstrual breakdown and onset of menstrual bleeding. Our findings, together with the functional relevance, strongly supported that the SPOCK gene underlies variation of AAM.
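The abstract above reports combined p values obtained by Fisher's method. As a minimal sketch of how that combination works in general (not the authors' code), the statistic X = -2 * sum(ln p_i) is compared against a chi-square distribution with 2k degrees of freedom, where k is the number of independent tests. The function name `fisher_combined_p` and the input values in the usage line are hypothetical; the per-cohort p values needed to reproduce the published figures are not all given in the abstract.

```python
import math


def fisher_combined_p(p_values):
    """Combine independent p-values with Fisher's method.

    The test statistic is X = -2 * sum(ln p_i), which follows a chi-square
    distribution with 2k degrees of freedom under the null hypothesis.
    For even degrees of freedom the upper tail has a closed form:
        P(X > x) = exp(-x/2) * sum_{i=0}^{k-1} (x/2)^i / i!
    """
    if not p_values or any(not (0.0 < p <= 1.0) for p in p_values):
        raise ValueError("p-values must be non-empty and lie in (0, 1]")
    k = len(p_values)
    half_stat = -sum(math.log(p) for p in p_values)  # this is X / 2
    term = 1.0   # (x/2)^0 / 0!
    total = 1.0
    for i in range(1, k):
        term *= half_stat / i  # builds (x/2)^i / i! incrementally
        total += term
    return math.exp(-half_stat) * total


# Hypothetical p-values from three independent cohorts (illustration only).
print(fisher_combined_p([0.01, 0.02, 0.03]))
```

Fisher's method assumes the tests are independent; with related samples (such as sibling cohorts) a different combination rule may be more appropriate.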

  9. Comparative sequence analyses of genome and transcriptome reveal novel transcripts and variants in the Asian elephant Elephas maximus.

    Science.gov (United States)

    Reddy, Puli Chandramouli; Sinha, Ishani; Kelkar, Ashwin; Habib, Farhat; Pradhan, Saurabh J; Sukumar, Raman; Galande, Sanjeev

    2015-12-01

    The Asian elephant Elephas maximus and the African elephant Loxodonta africana, which diverged 5-7 million years ago, exhibit differences in their physiology, behaviour and morphology. A comparative genomics approach would be useful and necessary for evolutionary and functional genetic studies of elephants. We performed sequencing of E. maximus and mapped it to L. africana at ~15X coverage. Through comparative sequence analyses, we have identified Asian elephant specific homozygous, non-synonymous single nucleotide variants (SNVs) that map to 1514 protein coding genes, many of which are involved in olfaction. We also present the first report of a high-coverage transcriptome sequence in E. maximus from peripheral blood lymphocytes. We have identified 103 novel protein coding transcripts and 66 long non-coding (lnc)RNAs. We also report the presence of 181 protein domains unique to elephants when compared to other Afrotheria species. Each of these findings can be further investigated to gain a better understanding of functional differences unique to elephant species, as well as those unique to elephantids in comparison with other mammals. This work therefore provides a valuable resource to explore the immense research potential of comparative analyses of transcriptome and genome sequences in the Asian elephant.

  10. Comparative sequence analyses of genome and transcriptome reveal novel transcripts and variants in the Asian elephant Elephas maximus

    Indian Academy of Sciences (India)

    Puli Chandramouli Reddy; Ishani Sinha; Ashwin Kelkar; Farhat Habib; Saurabh J Pradhan; Raman Sukumar; Sanjeev Galande

    2015-12-01

    The Asian elephant Elephas maximus and the African elephant Loxodonta africana, which diverged 5-7 million years ago, exhibit differences in their physiology, behaviour and morphology. A comparative genomics approach would be useful and necessary for evolutionary and functional genetic studies of elephants. We performed sequencing of E. maximus and mapped it to L. africana at ~15X coverage. Through comparative sequence analyses, we have identified Asian elephant specific homozygous, non-synonymous single nucleotide variants (SNVs) that map to 1514 protein coding genes, many of which are involved in olfaction. We also present the first report of a high-coverage transcriptome sequence in E. maximus from peripheral blood lymphocytes. We have identified 103 novel protein coding transcripts and 66 long non-coding (lnc)RNAs. We also report the presence of 181 protein domains unique to elephants when compared to other Afrotheria species. Each of these findings can be further investigated to gain a better understanding of functional differences unique to elephant species, as well as those unique to elephantids in comparison with other mammals. This work therefore provides a valuable resource to explore the immense research potential of comparative analyses of transcriptome and genome sequences in the Asian elephant.

  11. Genomic Analyses Reveal Demographic History and Temperate Adaptation of the Newly Discovered Honey Bee Subspecies Apis mellifera sinisxinyuan n. ssp.

    Science.gov (United States)

    Chen, Chao; Liu, Zhiguang; Pan, Qi; Chen, Xiao; Wang, Huihua; Guo, Haikun; Liu, Shidong; Lu, Hongfeng; Tian, Shilin; Li, Ruiqiang; Shi, Wei

    2016-05-01

    Studying the genetic signatures of climate-driven selection can produce insights into local adaptation and the potential impacts of climate change on populations. The honey bee (Apis mellifera) is an interesting species to study local adaptation because it originated in tropical/subtropical climatic regions and subsequently spread into temperate regions. However, little is known about the genetic basis of its adaptation to temperate climates. Here, we resequenced the whole genomes of ten individual bees from a newly discovered population in temperate China and downloaded resequenced data from 35 individuals from other populations. We found that the new population is an undescribed subspecies in the M-lineage of A. mellifera (Apis mellifera sinisxinyuan). Analyses of population history show that long-term global temperature has strongly influenced the demographic history of A. m. sinisxinyuan and its divergence from other subspecies. Further analyses comparing temperate and tropical populations identified several candidate genes related to fat body and the Hippo signaling pathway that are potentially involved in adaptation to temperate climates. Our results provide insights into the demographic history of the newly discovered A. m. sinisxinyuan, as well as the genetic basis of adaptation of A. mellifera to temperate climates at the genomic level. These findings will facilitate the selective breeding of A. mellifera to improve the survival of overwintering colonies.

  12. Genome-wide linkage, exome sequencing and functional analyses identify ABCB6 as the pathogenic gene of dyschromatosis universalis hereditaria.

    Directory of Open Access Journals (Sweden)

    Hong Liu

    Full Text Available BACKGROUND: The molecular basis of dyschromatosis universalis hereditaria (DUH), a genetic disorder of abnormal pigmentation, had remained unclear until recently, when ABCB6 was reported as a causative gene of DUH. METHODOLOGY: We performed a genome-wide linkage scan using the Illumina Human 660W-Quad BeadChip and exome sequencing analyses using Agilent SureSelect Human All Exon Kits in a multiplex Chinese DUH family to identify the pathogenic mutations, and verified the candidate mutations using Sanger sequencing. Quantitative RT-PCR and immunohistochemistry were performed to verify the expression of the pathogenic gene. Zebrafish was also used to confirm the functional role of ABCB6 in melanocytes and pigmentation. RESULTS: Genome-wide linkage (assuming autosomal dominant inheritance mode) and exome sequencing analyses identified ABCB6 as the disease candidate gene by discovering a coding mutation (c.1358C>T; p.Ala453Val) that co-segregates with the disease phenotype. Further mutation analysis of ABCB6 in four other DUH families and two sporadic cases by Sanger sequencing confirmed the mutation (c.1358C>T; p.Ala453Val) and discovered a second, co-segregating coding mutation (c.964A>C; p.Ser322Lys) in one of the four families. Both mutations were heterozygous in DUH patients and not present in the 1000 Genome Project and dbSNP database as well as 1,516 unrelated Chinese healthy controls. Expression analysis in human skin and mutagenesis interrogation in zebrafish confirmed the functional role of ABCB6 in melanocytes and pigmentation. Given the involvement of ABCB6 mutations in coloboma, we performed ophthalmological examination of the DUH carriers of ABCB6 mutations and found ocular abnormalities in them. CONCLUSION: Our study has advanced our understanding of DUH pathogenesis and revealed the shared pathological mechanism between pigmentary DUH and ocular coloboma.

  13. Genome and phylogenetic analyses of Trypanosoma evansi reveal extensive similarity to T. brucei and multiple independent origins for dyskinetoplasty.

    Science.gov (United States)

    Carnes, Jason; Anupama, Atashi; Balmer, Oliver; Jackson, Andrew; Lewis, Michael; Brown, Rob; Cestari, Igor; Desquesnes, Marc; Gendrin, Claire; Hertz-Fowler, Christiane; Imamura, Hideo; Ivens, Alasdair; Kořený, Luděk; Lai, De-Hua; MacLeod, Annette; McDermott, Suzanne M; Merritt, Chris; Monnerat, Severine; Moon, Wonjong; Myler, Peter; Phan, Isabelle; Ramasamy, Gowthaman; Sivam, Dhileep; Lun, Zhao-Rong; Lukeš, Julius; Stuart, Ken; Schnaufer, Achim

    2015-01-01

    Two key biological features distinguish Trypanosoma evansi from the T. brucei group: independence from the tsetse fly as obligatory vector, and independence from the need for functional mitochondrial DNA (kinetoplast or kDNA). In an effort to better understand the molecular causes and consequences of these differences, we sequenced the genome of an akinetoplastic T. evansi strain from China and compared it to the T. b. brucei reference strain. The annotated T. evansi genome shows extensive similarity to the reference, with 94.9% of the predicted T. b. brucei coding sequences (CDS) having an ortholog in T. evansi, and 94.6% of the non-repetitive orthologs having a nucleotide identity of 95% or greater. Interestingly, several procyclin-associated genes (PAGs) were disrupted or not found in this T. evansi strain, suggesting a selective loss of function in the absence of the insect life-cycle stage. Surprisingly, orthologous sequences were found in T. evansi for all 978 nuclear CDS predicted to represent the mitochondrial proteome in T. brucei, although a small number of these may have lost functionality. Consistent with previous results, the F1FO-ATP synthase γ subunit was found to have an A281 deletion, which is involved in generation of a mitochondrial membrane potential in the absence of kDNA. Candidates for CDS that are absent from the reference genome were identified in supplementary de novo assemblies of T. evansi reads. Phylogenetic analyses show that the sequenced strain belongs to a dominant group of clonal T. evansi strains with worldwide distribution that also includes isolates classified as T. equiperdum. At least three other types of T. evansi or T. equiperdum have emerged independently. Overall, the elucidation of the T. evansi genome sequence reveals extensive similarity to T. brucei and supports the contention that T. evansi should be classified as a subspecies of T. brucei.

  14. Two types of chloroplast gene promoters in Chlamydomonas reinhardtii.

    Science.gov (United States)

    Klein, U; De Camp, J D; Bogorad, L

    1992-04-15

    Structures of the promoters of Chlamydomonas reinhardtii plastid atpB and 16S rRNA-encoding genes were analyzed in vivo. Chimeric constructs, containing the Chlamydomonas chloroplast atpB or 16S rRNA-encoding gene promoter coupled to the Escherichia coli uidA (beta-glucuronidase, GUS) reporter gene and bordered by C. reinhardtii chloroplast sequences, were stably introduced into the chloroplast of Chlamydomonas by microprojectile bombardment. Activity of the promoters in the chloroplast of GUS gene-positive transformants was assayed by measuring the abundance of GUS transcripts and determining the relative rates of GUS transcription in vivo. Deletion analyses of the 16S rRNA gene and atpB promoter fragments showed that the two promoters differ structurally. The 16S rRNA gene promoter resembles the bacterial sigma 70 type with typical -10 and -35 elements. The atpB promoter, on the other hand, lacks a conserved motif in the -35 region but contains, in the -10 region, a characteristic octameric palindrome (TATAATAT) that is conserved in the promoter sequences of some other C. reinhardtii chloroplast genes. For maximum activity, the atpB promoter requires sequences of approximately 22 base pairs upstream and approximately 60 base pairs downstream of the transcription start site.

  15. Comparison of Genome-Wide Association Methods in Analyses of Admixed Populations with Complex Familial Relationships

    DEFF Research Database (Denmark)

    Kadri, Naveen; Guldbrandtsen, Bernt; Sørensen, Peter;

    2014-01-01

    Population structure is known to cause false-positive detection in association studies. We compared the power, precision, and type-I error rates of various association models in analyses of a simulated dataset with structure at the population (admixture from two populations; P) and family (K) levels. We also compared type-I error rates among models in analyses of publicly available human and dog datasets. The models corrected for none, one, or both structure levels. Correction for K was performed with linear mixed models incorporating familial relationships estimated from pedigrees or genetic... corrected for P. In contrast, correction for P alone in linear models was insufficient. The power and precision of linear mixed models with and without correction for P were similar. Furthermore, power, precision, and type-I error rate were comparable in linear mixed models incorporating pedigree...

  16. Loss of matK RNA editing in seed plant chloroplasts

    Directory of Open Access Journals (Sweden)

    Maier Uwe G

    2009-08-01

    Full Text Available Abstract Background RNA editing in chloroplasts of angiosperms proceeds by C-to-U conversions at specific sites. Nuclear-encoded factors are required for the recognition of cis-elements located immediately upstream of editing sites. The ensemble of editing sites in a chloroplast genome differs widely between species, and editing sites are thought to evolve rapidly. However, large-scale analyses of the evolution of individual editing sites have not yet been undertaken. Results Here, we analyzed the evolution of two chloroplast editing sites, matK-2 and matK-3, for which DNA sequences from thousands of angiosperm species are available. Both sites are found in most major taxa, including deep-branching families such as the Nymphaeaceae. However, 36 isolated taxa scattered across the entire tree lack a C at one of the two matK editing sites. Tests of several exemplary species from this in silico analysis of matK processing unexpectedly revealed that one of the two sites remains unedited in almost half of all species examined. A comparison of sequences between editors and non-editors showed that specific nucleotides co-evolve with the C at the matK editing sites, suggesting that these nucleotides are critical for editing-site recognition. Conclusion (i) Both matK editing sites were present in the common ancestor of all angiosperms and have been independently lost multiple times during angiosperm evolution. (ii) The editing activities corresponding to matK-2 and matK-3 are unstable. (iii) A small number of third-codon positions in the vicinity of editing sites are selectively constrained independent of the presence of the editing site, most likely because of interacting RNA-binding proteins.

  17. Multistage genome-wide association meta-analyses identified two new loci for bone mineral density.

    Science.gov (United States)

    Zhang, Lei; Choi, Hyung Jin; Estrada, Karol; Leo, Paul J; Li, Jian; Pei, Yu-Fang; Zhang, Yinping; Lin, Yong; Shen, Hui; Liu, Yao-Zhong; Liu, Yongjun; Zhao, Yingchun; Zhang, Ji-Gang; Tian, Qing; Wang, Yu-ping; Han, Yingying; Ran, Shu; Hai, Rong; Zhu, Xue-Zhen; Wu, Shuyan; Yan, Han; Liu, Xiaogang; Yang, Tie-Lin; Guo, Yan; Zhang, Feng; Guo, Yan-fang; Chen, Yuan; Chen, Xiangding; Tan, Lijun; Zhang, Lishu; Deng, Fei-Yan; Deng, Hongyi; Rivadeneira, Fernando; Duncan, Emma L; Lee, Jong Young; Han, Bok Ghee; Cho, Nam H; Nicholson, Geoffrey C; McCloskey, Eugene; Eastell, Richard; Prince, Richard L; Eisman, John A; Jones, Graeme; Reid, Ian R; Sambrook, Philip N; Dennison, Elaine M; Danoy, Patrick; Yerges-Armstrong, Laura M; Streeten, Elizabeth A; Hu, Tian; Xiang, Shuanglin; Papasian, Christopher J; Brown, Matthew A; Shin, Chan Soo; Uitterlinden, André G; Deng, Hong-Wen

    2014-04-01

    Aiming to identify novel genetic variants and to confirm previously identified genetic variants associated with bone mineral density (BMD), we conducted a three-stage genome-wide association (GWA) meta-analysis in 27 061 study subjects. Stage 1 meta-analyzed seven GWA samples and 11 140 subjects for BMDs at the lumbar spine, hip and femoral neck, followed by a Stage 2 in silico replication of 33 SNPs in 9258 subjects, and by a Stage 3 de novo validation of three SNPs in 6663 subjects. Combining evidence from all the stages, we have identified two novel loci that have not been reported previously at the genome-wide significance (GWS; 5.0 × 10(-8)) level: 14q24.2 (rs227425, P-value 3.98 × 10(-13), SMOC1) in the combined sample of males and females and 21q22.13 (rs170183, P-value 4.15 × 10(-9), CLDN14) in the female-specific sample. The two newly identified SNPs were also significant in the GEnetic Factors for OSteoporosis consortium (GEFOS, n = 32 960) summary results. We have also independently confirmed 13 previously reported loci at the GWS level: 1p36.12 (ZBTB40), 1p31.3 (GPR177), 4p16.3 (FGFRL1), 4q22.1 (MEPE), 5q14.3 (MEF2C), 6q25.1 (C6orf97, ESR1), 7q21.3 (FLJ42280, SHFM1), 7q31.31 (FAM3C, WNT16), 8q24.12 (TNFRSF11B), 11p15.3 (SOX6), 11q13.4 (LRP5), 13q14.11 (AKAP11) and 16q24 (FOXL1). Gene expression analysis in osteogenic cells implied potential functional association of the two candidate genes (SMOC1 and CLDN14) in bone metabolism. Our findings independently confirm previously identified biological pathways underlying bone metabolism and contribute to the discovery of novel pathways, thus providing valuable insights into the intervention and treatment of osteoporosis.

  18. Intra-individual polymorphism in chloroplasts from NGS data: where does it come from and how to handle it?

    Science.gov (United States)

    Scarcelli, N; Mariac, C; Couvreur, T L P; Faye, A; Richard, D; Sabot, F; Berthouly-Salazar, C; Vigouroux, Y

    2016-03-01

    Next-generation sequencing allows access to a large quantity of genomic data. In plants, several studies used whole chloroplast genome sequences for inferring phylogeography or phylogeny. Even though the chloroplast is a haploid organelle, NGS plastome data identified a nonnegligible number of intra-individual polymorphic SNPs. Such observations could have several causes such as sequencing errors, the presence of heteroplasmy or transfer of chloroplast sequences in the nuclear and mitochondrial genomes. The occurrence of allelic diversity has practical important impacts on the identification of diversity, the analysis of the chloroplast data and beyond that, significant evolutionary questions. In this study, we show that the observed intra-individual polymorphism of chloroplast sequence data is probably the result of plastid DNA transferred into the mitochondrial and/or the nuclear genomes. We further assess nine different bioinformatics pipelines' error rates for SNP and genotypes calling using SNPs identified in Sanger sequencing. Specific pipelines are adequate to deal with this issue, optimizing both specificity and sensitivity. Our results will allow a proper use of whole chloroplast NGS sequence and will allow a better handling of NGS chloroplast sequence diversity.

  19. BioSMACK: a linux live CD for genome-wide association analyses.

    Science.gov (United States)

    Hong, Chang Bum; Kim, Young Jin; Moon, Sanghoon; Shin, Young-Ah; Go, Min Jin; Kim, Dong-Joon; Lee, Jong-Young; Cho, Yoon Shin

    2012-01-01

    Recent advances in high-throughput genotyping technologies have enabled us to conduct a genome-wide association study (GWAS) on a large cohort. However, analyzing millions of single nucleotide polymorphisms (SNPs) is still a difficult task for researchers conducting a GWAS. Several difficulties, such as compatibilities and dependencies, are often encountered by researchers using analytical tools during the installation of software. This is a huge obstacle to any research institute without computing facilities and specialists. Therefore, a proper research environment is an urgent need for researchers working on GWAS. We developed BioSMACK to provide a research environment for GWAS that requires no configuration and is easy to use. BioSMACK is based on the Ubuntu Live CD that offers a complete Linux-based operating system environment without installation. Moreover, we provide users with a GWAS manual consisting of a series of guidelines for GWAS and useful examples. BioSMACK is freely available at http://ksnp.cdc.go.kr/biosmack.

  20. Genomic and transcriptomic analyses of the tangerine pathotype of Alternaria alternata in response to oxidative stress

    Science.gov (United States)

    Wang, Mingshuang; Sun, Xuepeng; Yu, Dongliang; Xu, Jianping; Chung, Kuangren; Li, Hongye

    2016-01-01

    The tangerine pathotype of Alternaria alternata produces the A. citri toxin (ACT) and is the causal agent of citrus brown spot that results in significant yield losses worldwide. Both the production of ACT and the ability to detoxify reactive oxygen species (ROS) are required for A. alternata pathogenicity in citrus. In this study, we report the 34.41 Mb genome sequence of strain Z7 of the tangerine pathotype of A. alternata. The host selective ACT gene cluster in strain Z7 was identified, which included 25 genes with 19 of them not reported previously. Of these, 10 genes were present only in the tangerine pathotype, representing the most likely candidate genes for this pathotype specialization. A transcriptome analysis of the global effects of H2O2 on gene expression revealed 1108 up-regulated and 498 down-regulated genes. Expression of the genes encoding catalase, peroxiredoxin, thioredoxin and glutathione was highly induced. Genes encoding several protein families including kinases, transcription factors, transporters, cytochrome P450, ubiquitin and heat shock proteins were found associated with adaptation to oxidative stress. Our data not only revealed the molecular basis of ACT biosynthesis but also provided new insights into the potential pathways by which the phytopathogen A. alternata copes with oxidative stress. PMID:27582273

  1. Super-sparse principal component analyses for high-throughput genomic data

    Directory of Open Access Journals (Sweden)

    Lee Youngjo

    2010-06-01

    Full Text Available Abstract Background Principal component analysis (PCA) has gained popularity as a method for the analysis of high-dimensional genomic data. However, it is often difficult to interpret the results because the principal components are linear combinations of all variables, and the coefficients (loadings) are typically nonzero. These nonzero values also reflect poor estimation of the true vector loadings; for example, for gene expression data, biologically we expect only a portion of the genes to be expressed in any tissue, and an even smaller fraction to be involved in a particular process. Sparse PCA methods have recently been introduced for reducing the number of nonzero coefficients, but these existing methods are not satisfactory for high-dimensional data applications because they still give too many nonzero coefficients. Results Here we propose a new PCA method that uses two innovations to produce an extremely sparse loading vector: (i) a random-effect model on the loadings that leads to an unbounded penalty at the origin and (ii) shrinkage of the singular values obtained from the singular value decomposition of the data matrix. We develop a stable computing algorithm by modifying the nonlinear iterative partial least squares (NIPALS) algorithm, and illustrate the method with an analysis of the NCI cancer dataset that contains 21,225 genes. Conclusions The new method has better performance than several existing methods, particularly in the estimation of the loading vectors.
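
    The idea of forcing small loadings to exactly zero inside a NIPALS-style iteration can be illustrated with a minimal sketch. This is not the method of the abstract above: it swaps in a plain soft-threshold (lasso-type) penalty for the authors' random-effect penalty and omits their singular-value shrinkage. The function names, the threshold `lam`, and the toy data are all hypothetical.

```python
import math


def soft_threshold(values, lam):
    """Shrink each value toward zero by lam; values within lam of zero become 0."""
    return [math.copysign(max(abs(v) - lam, 0.0), v) for v in values]


def unit(values):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(v * v for v in values))
    if norm == 0.0:
        raise ValueError("zero loading vector (constant data or lam too large)")
    return [v / norm for v in values]


def sparse_first_loading(X, lam=0.0, n_iter=500, tol=1e-12):
    """First loading vector via a NIPALS-style alternation with soft-thresholding.

    X is a list of rows (samples) by columns (variables). With lam=0 this is
    ordinary power iteration for the first principal component.
    """
    n, p = len(X), len(X[0])
    means = [sum(row[j] for row in X) / n for j in range(p)]
    Xc = [[row[j] - means[j] for j in range(p)] for row in X]  # centre columns

    # Start from the indicator of the highest-variance column.
    col_ss = [sum(row[j] ** 2 for row in Xc) for j in range(p)]
    w = [0.0] * p
    w[col_ss.index(max(col_ss))] = 1.0

    for _ in range(n_iter):
        # Scores: project every sample onto the current loading vector.
        t = [sum(row[j] * w[j] for j in range(p)) for row in Xc]
        # Loadings: regress every variable on the scores (up to scale).
        w_new = [sum(Xc[i][j] * t[i] for i in range(n)) for j in range(p)]
        # Sparsity step: threshold the unit-length loadings, then renormalise.
        w_new = unit(soft_threshold(unit(w_new), lam))
        change = max(abs(a - b) for a, b in zip(w, w_new))
        w = w_new
        if change < tol:
            break
    return w


# Toy data: two strongly correlated variables and one weak one.
X = [
    [2.0, 2.1, 0.1],
    [-1.0, -0.9, -0.1],
    [0.5, 0.4, 0.1],
    [-1.5, -1.6, -0.1],
]
dense = sparse_first_loading(X)            # ordinary first loading vector
sparse = sparse_first_loading(X, lam=0.2)  # weak variable is dropped
print(dense)
print(sparse)
```

    In this sketch `lam` is applied to the unit-length loading vector, so it can be read as a cut-off on the relative size of a loading; choosing it is left to the user.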

  2. Phylogenomic analyses reveal the diversity of laccase-coding genes in Fonsecaea genomes

    Science.gov (United States)

    Feng, Peiying; Weiss, Vinicius Almir; Vicente, Vania Aparecida; Stielow, J. Benjamin; de Hoog, Sybren

    2017-01-01

    The genus Fonsecaea comprises black yeast-like fungi of clinical relevance, including etiologic agents of chromoblastomycosis and cerebral phaeohyphomycosis. Presence of melanin and assimilation of monoaromatic hydrocarbons and alkylbenzenes have been proposed as virulence factors. Multicopper oxidase (MCO) is a family of enzymes including laccases, ferroxidases and ascorbate oxidases which are able to catalyze the oxidation of various aromatic organic compounds with the reduction of molecular oxygen to water. Additionally, laccases are required for the production of fungal melanins, a cell-wall black pigment recognized as a key polymer for pathogenicity and extremotolerance in black yeast-like fungi. Although the activity of laccase enzymes has previously been reported in many wood-rotting fungi, the diversity of laccase genes in Fonsecaea has not yet been assessed. In this study, we identified and characterized laccase-coding genes and determined their genomic location in five clinical and environmental Fonsecaea species. The identification of laccases sensu stricto will provide insights into carbon acquisition strategies as well as melanin production in Fonsecaea. PMID:28187150

  3. Genome-Wide Association Analyses Point to Candidate Genes for Electric Shock Avoidance in Drosophila melanogaster.

    Science.gov (United States)

    Appel, Mirjam; Scholz, Claus-Jürgen; Müller, Tobias; Dittrich, Marcus; König, Christian; Bockstaller, Marie; Oguz, Tuba; Khalili, Afshin; Antwi-Adjei, Emmanuel; Schauer, Tamas; Margulies, Carla; Tanimoto, Hiromu; Yarali, Ayse

    2015-01-01

    Electric shock is a common stimulus for nociception-research and the most widely used reinforcement in aversive associative learning experiments. Yet, nothing is known about the mechanisms it recruits at the periphery. To help fill this gap, we undertook a genome-wide association analysis using 38 inbred Drosophila melanogaster strains, which avoided shock to varying extents. We identified 514 genes whose expression levels and/or sequences co-varied with shock avoidance scores. We independently scrutinized 14 of these genes using mutants, validating the effect of 7 of them on shock avoidance. This emphasizes the value of our candidate gene list as a guide for follow-up research. In addition, by integrating our association results with external protein-protein interaction data we obtained a shock avoidance-associated network of 38 genes. Both this network and the original candidate list contained a substantial number of genes that affect mechanosensory bristles, which are hair-like organs distributed across the fly's body. These results may point to a potential role for mechanosensory bristles in shock sensation. Thus, we not only provide a first list of candidate genes for shock avoidance, but also point to an interesting new hypothesis on nociceptive mechanisms.

  4. Chloroplast in Plant-Virus Interaction

    Science.gov (United States)

    Zhao, Jinping; Zhang, Xian; Hong, Yiguo; Liu, Yule

    2016-01-01

    In plants, the chloroplast is the organelle that conducts photosynthesis. It has been known for approximately 70 years that the chloroplast is involved in virus infection of plants. Recently, the subject of chloroplast-virus interplay has been receiving more and more attention. In this article we discuss the different aspects of chloroplast-virus interaction in three sections: the effect of virus infection on the structure and function of the chloroplast, the role of the chloroplast in the virus infection cycle, and the function of the chloroplast in host defense against viruses. In particular, we focus on the characterization of chloroplast protein-viral protein interactions that underlie the interplay between chloroplast and virus. It can be summarized that the chloroplast is a common target of plant viruses for viral pathogenesis or propagation; and conversely, the chloroplast and its components also can play active roles in plant defense against viruses. Chloroplast photosynthesis-related genes/proteins (CPRGs/CPRPs) are suggested to play a central role during the complex chloroplast-virus interaction. PMID:27757106

  5. Clinical, polysomnographic and genome-wide association analyses of narcolepsy with cataplexy

    DEFF Research Database (Denmark)

    Luca, Gianina; Haba-Rubio, José; Dauvilliers, Yves

    2013-01-01

    The aim of this study was to describe the clinical and PSG characteristics of narcolepsy with cataplexy and their genetic predisposition by using the retrospective patient database of the European Narcolepsy Network (EU-NN). We have analysed retrospective data of 1099 patients with narcolepsy...... diagnostic delay were young age at diagnosis, cataplexy as the first symptom and higher frequency of cataplexy attacks. The mean multiple sleep latency negatively correlated with Epworth Sleepiness Scale (ESS) and with the number of sleep-onset rapid eye movement periods (SOREMPs), but none...

  6. Quality control and conduct of genome-wide association meta-analyses

    DEFF Research Database (Denmark)

    Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C

    2014-01-01

    at the study file level, the meta-level across studies and the meta-analysis output level. Real-world examples highlight issues experienced and solutions developed by the GIANT Consortium that has conducted meta-analyses including data from 125 studies comprising more than 330,000 individuals. We provide...... a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for the use of a powerful and flexible software package called EasyQC. Precise timings will be greatly influenced by consortium size. For consortia of comparable...

  7. Genome-wide identification and expression analyses of cytochrome P450 genes in mulberry (Morus notabilis)

    Institute of Scientific and Technical Information of China (English)

    Bi Ma; Yiwei Luo; Ling Jia; Xiwu Qi; Qiwei Zeng; Zhonghuai Xiang; Ningjia He

    2014-01-01

    Cytochrome P450s play critical roles in the biosynthesis of physiologically important compounds in plants. These compounds often act as defense toxins to prevent herbivory. In the present study, a total of 174 P450 genes of mulberry (Morus notabilis C.K.Schn) were identified based on bioinformatics analyses. These mulberry P450 genes were divided into nine clans and 47 families and were found to be expressed in a tissue-preferential manner. These genes were compared to the P450 genes in Arabidopsis thaliana. Families CYP80, CYP92, CYP728, CYP733, CYP736, and CYP749 were found to exist in mulberry, and they may play important roles in the biosynthesis of mulberry secondary metabolites. Analyses of the functional and metabolic pathways of these genes indicated that mulberry P450 genes may participate in the metabolism of lipids, other secondary metabolites, xenobiotics, amino acids, cofactors, vitamins, terpenoids, and polyketides. These results provide a foundation for understanding of the structures and biological functions of mulberry P450 genes.

  8. Genomic and evolutionary analyses of Tango transposons in Aedes aegypti, Anopheles gambiae and other mosquito species.

    Science.gov (United States)

    Coy, M R; Tu, Z

    2007-08-01

    Tango is a transposon of the Tc1 family and was originally discovered in the African malaria mosquito, Anopheles gambiae. Here we report a systematic analysis of the genome sequence of the yellow fever mosquito, Aedes aegypti, which uncovered three distinct Tango transposons. We name the only An. gambiae Tango transposon AgTango1 and the three Ae. aegypti Tango elements AeTango1-3. Like AgTango1, AeTango1 and AeTango2 elements both have members that retain characteristics of autonomous elements such as intact open reading frames and terminal inverted repeats (TIRs). AeTango3 is a degenerate transposon with no full-length members. All full-length Tango transposons contain subterminal direct repeats within their TIRs. AgTango1 and AeTango1-3 form a single clade among other Tc1 transposons. Within this clade, AgTango1 and AeTango1 are closely related and share approximately 80% identity at the amino acid level, which exceeds the level of similarity of the majority of host genes in the two species. A survey of Tango in other mosquito species was carried out using degenerate PCR. Tango was isolated and sequenced in all members of the An. gambiae species complex, Aedes albopictus and Ochlerotatus atropalpus. Oc. atropalpus contains a rich diversity of Tango elements, while Tango elements in Ae. albopictus and the An. gambiae species complex all belong to Tango1. No Tango was detected in Culex pipiens quinquefasciatus, Anopheles stephensi, Anopheles dirus, Anopheles farauti or Anopheles albimanus using degenerate PCR. Bioinformatic searches of the Cx. p. quinquefasciatus (~10 x coverage) and An. stephensi (0.33 x coverage) databases also failed to uncover any Tango elements. Although other evolutionary scenarios cannot be ruled out, there are indications that Tango1 underwent horizontal transfer among divergent mosquito species.

  9. Whole-genome analyses of the speciation events in the pathogenic Brucellae

    Energy Technology Data Exchange (ETDEWEB)

    Chain, P; Comerci, D; Tolmasky, M; Larimer, F; Malfatti, S; Vergez, L; Aguero, F; Land, M; Ugalde, R; Garcia, E

    2005-07-14

    Despite their high DNA identity and a proposal to group classical Brucella species as biovars of B. melitensis, the commonly recognized Brucella species can be distinguished by distinct biochemical and fatty acid characters as well as by a marked host range (e.g. B. suis for swine, B. melitensis for sheep and goats, B. abortus for cattle). Here we present the genome of B. abortus 2308, the virulent prototype biovar 1 strain, and its comparison to the two other human pathogenic Brucellae species and to the B. abortus field isolate 9-941. The global distribution of pseudogenes, deletions and insertions supports previous indications that B. abortus and B. melitensis share a common ancestor that diverged from B. suis. With the exception of a dozen genes, the genetic complement of both B. abortus strains is identical, whereas the three species differ in gene content and pseudogenes. The pattern of species-specific gene inactivations affecting transcriptional regulators and outer membrane proteins suggests that these inactivations may play an important role in the establishment of host-specificity and may have been a primary driver of speciation in the Brucellae. Despite being non-motile, the Brucellae contain flagellum gene clusters and display species-specific flagellar gene inactivations, which lead to the putative generation of different versions of flagellum-derived structures, and may contribute to differences in host-specificity and virulence. Metabolic changes such as the lack of complete metabolic pathways for the synthesis of numerous compounds (e.g. glycogen, biotin, NAD, and choline) are consistent with adaptation of Brucellae to an intracellular lifestyle.

  10. Comparative Genome Analyses of Streptococcus suis Isolates from Endocarditis Demonstrate Persistence of Dual Phenotypic Clones.

    Directory of Open Access Journals (Sweden)

    Mari Tohya

    Full Text Available Many bacterial species coexist in the same niche as heterogeneous clones with different phenotypes; however, understanding of infectious diseases by polyphenotypic bacteria is still limited. In the present study, encapsulation in isolates of the porcine pathogen Streptococcus suis from persistent endocarditis lesions was examined. Coexistence of both encapsulated and unencapsulated S. suis isolates was found in 26 out of 59 endocarditis samples. The isolates were serotype 2, and belonged to two different sequence types (STs), ST1 and ST28. The genomes of each of the 26 pairs of encapsulated and unencapsulated isolates from the 26 samples were sequenced. The data showed that each pair of isolates had one or more unique nonsynonymous mutations in the cps gene, and the encapsulated and unencapsulated isolates from the same samples were closest to each other. Pairwise comparisons of the sequences of cps genes in 7 pairs of encapsulated and unencapsulated isolates identified insertion/deletions (indels) ranging from one to 104 bp in different cps genes of unencapsulated isolates. Capsule expression was restored in a subset of unencapsulated isolates by complementation in trans with cps expression vectors. Examination of gene content common to isolates indicated that mutation frequency was higher in ST28 pairs than in ST1 pairs. Genes within mobile genetic elements were mutation hot spots among ST28 isolates. Taken all together, our results demonstrate the coexistence of dual phenotype (encapsulated and unencapsulated) bacterial clones and suggest that the dual phenotypes arose independently in each farm by means of spontaneous mutations in cps genes.

  11. Chloroplast evolution: secondary symbiogenesis and multiple losses.

    Science.gov (United States)

    Cavalier-Smith, T

    2002-01-22

    Chloroplasts originated from cyanobacteria only once, but have been laterally transferred to other lineages by symbiogenetic cell mergers. Such secondary symbiogenesis is rarer and chloroplast losses commoner than often assumed.

  12. Integrative functional genomic analyses implicate specific molecular pathways and circuits in autism.

    Science.gov (United States)

    Parikshak, Neelroop N; Luo, Rui; Zhang, Alice; Won, Hyejung; Lowe, Jennifer K; Chandran, Vijayendran; Horvath, Steve; Geschwind, Daniel H

    2013-11-21

    Genetic studies have identified dozens of autism spectrum disorder (ASD) susceptibility genes, raising two critical questions: (1) do these genetic loci converge on specific biological processes, and (2) where does the phenotypic specificity of ASD arise, given its genetic overlap with intellectual disability (ID)? To address this, we mapped ASD and ID risk genes onto coexpression networks representing developmental trajectories and transcriptional profiles representing fetal and adult cortical laminae. ASD genes tightly coalesce in modules that implicate distinct biological functions during human cortical development, including early transcriptional regulation and synaptic development. Bioinformatic analyses suggest that translational regulation by FMRP and transcriptional coregulation by common transcription factors connect these processes. At a circuit level, ASD genes are enriched in superficial cortical layers and glutamatergic projection neurons. Furthermore, we show that the patterns of ASD and ID risk genes are distinct, providing a biological framework for further investigating the pathophysiology of ASD.

  13. Genomic and in silico analyses of CRBN gene and thalidomide embryopathy in humans.

    Science.gov (United States)

    Vianna, Fernanda Sales Luiz; Kowalski, Thayne Woycinck; Tovo-Rodrigues, Luciana; Tagliani-Ribeiro, Alice; Godoy, Bibiane Armiliato; Fraga, Lucas Rosa; Sanseverino, Maria Teresa Vieira; Hutz, Mara Helena; Schuler-Faccini, Lavínia

    2016-12-01

    Thalidomide causes Thalidomide Embryopathy (TE), but is largely used to treat several conditions. Investigations with Cereblon, a thalidomide target protein encoded by CRBN gene, have helped to understand thalidomide therapeutic and teratogenic properties. We sequenced CRBN-thalidomide binding region in 38 TE individuals and 136 Brazilians without congenital anomalies, and performed in silico analyses. Eight variants were identified, seven intronic and one in 3'UTR. TE individuals had rare variants in higher frequency than the non-affected group (p=0.04). The genotype rs1620675 CC was related to neurological anomalies in TE individuals (p=0.004). Bioinformatics analysis suggested this genotype leads to potential alterations in splicing sites and binding to transcription factors. Comparison of the Cereblon-thalidomide binding domains in mammals demonstrated that CRBN is highly conserved across species. All the variants require evaluation in functional assays in order to understand their role in Cereblon-thalidomide binding and complex interactions that lead to TE.

  14. The KAC family of kinesin-like proteins is essential for the association of chloroplasts with the plasma membrane in land plants.

    Science.gov (United States)

    Suetsugu, Noriyuki; Sato, Yoshikatsu; Tsuboi, Hidenori; Kasahara, Masahiro; Imaizumi, Takato; Kagawa, Takatoshi; Hiwatashi, Yuji; Hasebe, Mitsuyasu; Wada, Masamitsu

    2012-11-01

    Chloroplasts require association with the plasma membrane for movement in response to light and for appropriate positioning within the cell to capture photosynthetic light efficiently. In Arabidopsis, CHLOROPLAST UNUSUAL POSITIONING 1 (CHUP1), KINESIN-LIKE PROTEIN FOR ACTIN-BASED CHLOROPLAST MOVEMENT 1 (KAC1) and KAC2 are required for both the proper movement of chloroplasts and the association of chloroplasts with the plasma membrane, through the reorganization of short actin filaments located on the periphery of the chloroplasts. Here, we show that KAC and CHUP1 orthologs (AcKAC1, AcCHUP1A and AcCHUP1B, and PpKAC1 and PpKAC2) play important roles in chloroplast positioning in the fern Adiantum capillus-veneris and the moss Physcomitrella patens. The knockdown of AcKAC1 and two AcCHUP1 genes induced the aggregation of chloroplasts around the nucleus. Analyses of A. capillus-veneris mutants containing perinuclear-aggregated chloroplasts confirmed that AcKAC1 is required for chloroplast-plasma membrane association. In addition, P. patens lines in which two KAC genes had been knocked out showed an aggregated chloroplast phenotype similar to that of the fern kac1 mutants. These results indicate that chloroplast positioning and movement are mediated through the activities of KAC and CHUP1 proteins, which are conserved in land plants.

  15. Subcellular localization of extracytoplasmic proteins in monoderm bacteria: rational secretomics-based strategy for genomic and proteomic analyses.

    Directory of Open Access Journals (Sweden)

    Sandra Renier

    Full Text Available Genome-scale prediction of subcellular localization (SCL) is not only useful for inferring protein function but also for supporting proteomic data. In line with the secretome concept, a rational and original analytical strategy mimicking the secretion steps that determine ultimate SCL was developed for Gram-positive (monoderm) bacteria. Based on the biology of protein secretion, a flowchart and decision trees were designed considering (i) membrane targeting, (ii) protein secretion systems, (iii) membrane retention, and (iv) cell-wall retention by domains or post-translocational modifications, as well as (v) incorporation to cell-surface supramolecular structures. Using Listeria monocytogenes as a case study, results were compared with known data set from SCL predictors and experimental proteomics. While in good agreement with experimental extracytoplasmic fractions, the secretomics-based method outperforms other genomic analyses, which were simply not intended to be as inclusive. Compared to all other localization predictors, this method does not only supply a static snapshot of protein SCL but also offers the full picture of the secretion process dynamics: (i) the protein routing is detailed, (ii) the number of distinct SCL and protein categories is comprehensive, (iii) the description of protein type and topology is provided, (iv) the SCL is unambiguously differentiated from the protein category, and (v) the multiple SCL and protein category are fully considered. In that sense, the secretomics-based method is much more than a SCL predictor. Besides a major step forward in genomics and proteomics of protein secretion, the secretomics-based method appears as a strategy of choice to generate in silico hypotheses for experimental testing.

  16. Genomic analyses of metal resistance genes in three plant growth promoting bacteria of legume plants in Northwest mine tailings, China

    Institute of Scientific and Technical Information of China (English)

    Pin Xie; Xiuli Hao; Martin Herzberg; Yantao Luo; Dietrich H.Nies; Gehong Wei

    2015-01-01

    To better understand the diversity of metal resistance genetic determinants in microbes that survive in metal tailings in northwest China, a region with highly elevated levels of heavy metals, genomic analyses were conducted using the genome sequences of three native metal-resistant plant growth promoting bacteria (PGPB). The analyses show that Mesorhizobium amorphae CCNWGS0123 contains metal transporters from the P-type ATPase, CDF (Cation Diffusion Facilitator), HupE/UreJ and CHR (chromate ion transporter) families involved in copper, zinc, nickel and chromate resistance and homeostasis. Meanwhile, the putative CopA/CueO system is expected to mediate copper resistance in Sinorhizobium meliloti CCNWSX0020, while the ZntA transporter, assisted by putative CzcD, determines zinc tolerance in Agrobacterium tumefaciens CCNWGS0286. The greenhouse experiment provides consistent evidence of the plant growth promoting effects of these microbes on their hosts through nitrogen fixation and/or indoleacetic acid (IAA) secretion, indicating potential for in situ phytoremediation in the mine tailing regions of China.

  17. Transcriptional analyses of the region of the equine herpesvirus type 4 genome encoding glycoproteins I and E.

    Science.gov (United States)

    Damiani, A M; Jang, H K; Matsumura, T; Yokoyama, N; Miyazawa, T; Mikami, T

    1999-01-01

    To map the transcripts encoding the equine herpesvirus type 4 (EHV-4) glycoproteins I (gI) and E (gE), transcriptional analyses were performed at the right part of the unique short segment of EHV-4 genome. The results revealed that the gI gene is encoded by a 1.6-kb transcript which is 3' coterminal with a 3.0-kb gD mRNA while the gE gene is encoded by two transcripts of 3.5- and 2.4-kb in size. The transcriptional patterns described in this study for the EHV-4 gI and gE are similar to those found in the equivalent region of herpes simplex virus type 1 and feline herpesvirus type 1. Characterization of EHV-4 gI and gE glycoprotein genes may facilitate future studies to define their roles in the EHV-4 infection.

  18. Genetic variants associated with subjective well-being, depressive symptoms and neuroticism identified through genome-wide analyses

    Science.gov (United States)

    Derringer, Jaime; Gratten, Jacob; Lee, James J; Liu, Jimmy Z; de Vlaming, Ronald; Ahluwalia, Tarunveer S; Buchwald, Jadwiga; Cavadino, Alana; Frazier-Wood, Alexis C; Davies, Gail; Furlotte, Nicholas A; Garfield, Victoria; Geisel, Marie Henrike; Gonzalez, Juan R; Haitjema, Saskia; Karlsson, Robert; van der Laan, Sander W; Ladwig, Karl-Heinz; Lahti, Jari; van der Lee, Sven J; Miller, Michael B; Lind, Penelope A; Liu, Tian; Matteson, Lindsay; Mihailov, Evelin; Minica, Camelia C; Nolte, Ilja M; Mook-Kanamori, Dennis O; van der Most, Peter J; Oldmeadow, Christopher; Qian, Yong; Raitakari, Olli; Rawal, Rajesh; Realo, Anu; Rueedi, Rico; Schmidt, Börge; Smith, Albert V; Stergiakouli, Evie; Tanaka, Toshiko; Taylor, Kent; Thorleifsson, Gudmar; Wedenoja, Juho; Wellmann, Juergen; Westra, Harm-Jan; Willems, Sara M; Zhao, Wei; Amin, Najaf; Bakshi, Andrew; Bergmann, Sven; Bjornsdottir, Gyda; Boyle, Patricia A; Cherney, Samantha; Cox, Simon R; Davis, Oliver S P; Ding, Jun; Direk, Nese; Eibich, Peter; Emeny, Rebecca T; Fatemifar, Ghazaleh; Faul, Jessica D; Ferrucci, Luigi; Forstner, Andreas J; Gieger, Christian; Gupta, Richa; Harris, Tamara B; Harris, Juliette M; Holliday, Elizabeth G; Hottenga, Jouke-Jan; De Jager, Philip L; Kaakinen, Marika A; Kajantie, Eero; Karhunen, Ville; Kolcic, Ivana; Kumari, Meena; Launer, Lenore J; Franke, Lude; Li-Gao, Ruifang; Liewald, David C; Koini, Marisa; Loukola, Anu; Marques-Vidal, Pedro; Montgomery, Grant W; Mosing, Miriam A; Paternoster, Lavinia; Pattie, Alison; Petrovic, Katja E; Pulkki-Råback, Laura; Quaye, Lydia; Räikkönen, Katri; Rudan, Igor; Scott, Rodney J; Smith, Jennifer A; Sutin, Angelina R; Trzaskowski, Maciej; Vinkhuyzen, Anna E; Yu, Lei; Zabaneh, Delilah; Attia, John R; Bennett, David A; Berger, Klaus; Bertram, Lars; Boomsma, Dorret I; Snieder, Harold; Chang, Shun-Chiao; Cucca, Francesco; Deary, Ian J; van Duijn, Cornelia M; Eriksson, Johan G; Bültmann, Ute; de Geus, Eco J C; Groenen, Patrick J F; Gudnason, Vilmundur; Hansen, 
Torben; Hartman, Catharine A; Haworth, Claire M A; Hayward, Caroline; Heath, Andrew C; Hinds, David A; Hyppönen, Elina; Iacono, William G; Järvelin, Marjo-Riitta; Jöckel, Karl-Heinz; Kaprio, Jaakko; Kardia, Sharon L R; Keltikangas-Järvinen, Liisa; Kraft, Peter; Kubzansky, Laura D; Lehtimäki, Terho; Magnusson, Patrik K E; Martin, Nicholas G; McGue, Matt; Metspalu, Andres; Mills, Melinda; de Mutsert, Renée; Oldehinkel, Albertine J; Pasterkamp, Gerard; Pedersen, Nancy L; Plomin, Robert; Polasek, Ozren; Power, Christine; Rich, Stephen S; Rosendaal, Frits R; den Ruijter, Hester M; Schlessinger, David; Schmidt, Helena; Svento, Rauli; Schmidt, Reinhold; Alizadeh, Behrooz Z; Sørensen, Thorkild I A; Spector, Tim D; Starr, John M; Stefansson, Kari; Steptoe, Andrew; Terracciano, Antonio; Thorsteinsdottir, Unnur; Thurik, A Roy; Timpson, Nicholas J; Tiemeier, Henning; Uitterlinden, André G; Vollenweider, Peter; Wagner, Gert G; Weir, David R; Yang, Jian; Conley, Dalton C; Smith, George Davey; Hofman, Albert; Johannesson, Magnus; Laibson, David I; Medland, Sarah E; Meyer, Michelle N; Pickrell, Joseph K; Esko, Tõnu; Krueger, Robert F; Beauchamp, Jonathan P; Koellinger, Philipp D; Benjamin, Daniel J; Bartels, Meike; Cesarini, David

    2016-01-01

    We conducted genome-wide association studies of three phenotypes: subjective well-being (N = 298,420), depressive symptoms (N = 161,460), and neuroticism (N = 170,910). We identified three variants associated with subjective well-being, two with depressive symptoms, and eleven with neuroticism, including two inversion polymorphisms. The two depressive symptoms loci replicate in an independent depression sample. Joint analyses that exploit the high genetic correlations between the phenotypes (|ρ̂| ≈ 0.8) strengthen the overall credibility of the findings, and allow us to identify additional variants. Across our phenotypes, loci regulating expression in central nervous system and adrenal/pancreas tissues are strongly enriched for association. PMID:27089181

  19. Metabolomic and Functional Genomic Analyses Reveal Varietal Differences in Bioactive Compounds of Cooked Rice

    Science.gov (United States)

    Heuberger, Adam L.; Lewis, Matthew R.; Chen, Ming-Hsuan; Brick, Mark A.; Leach, Jan E.; Ryan, Elizabeth P.

    2010-01-01

    Emerging evidence supports that cooked rice (Oryza sativa L.) contains metabolites with biomedical activities, yet little is known about the genetic diversity that is responsible for metabolite variation and differences in health traits. Metabolites from ten diverse varieties of cooked rice were detected using ultra performance liquid chromatography coupled to mass spectrometry. A total of 3,097 compounds were detected, of which 25% differed among the ten varieties. Multivariate analyses of the metabolite profiles showed that the chemical diversity among the varieties cluster according to their defined subspecies classifications: indica, japonica, and aus. Metabolite-specific genetic diversity in rice was investigated by analyzing a collection of single nucleotide polymorphisms (SNPs) in genes from biochemical pathways of nutritional importance. Two classes of bioactive compounds, phenolics and vitamin E, contained nonsynonymous SNPs and SNPs in the 5′ and 3′ untranslated regions for genes in their biosynthesis pathways. Total phenolics and tocopherol concentrations were determined to examine the effect of the genetic diversity among the ten varieties. Per gram of cooked rice, total phenolics ranged from 113.7 to 392.6 µg (gallic acid equivalents), and total tocopherols ranged between 7.2 and 20.9 µg. The variation in the cooked rice metabolome and quantities of bioactive components supports that the SNP-based genetic diversity influenced nutritional components in rice, and that this approach may guide rice improvement strategies for plant and human health. PMID:20886119

  20. Transcriptome sequencing and genome-wide association analyses reveal lysosomal function and actin cytoskeleton remodeling in schizophrenia and bipolar disorder.

    Science.gov (United States)

    Zhao, Z; Xu, J; Chen, J; Kim, S; Reimers, M; Bacanu, S-A; Yu, H; Liu, C; Sun, J; Wang, Q; Jia, P; Xu, F; Zhang, Y; Kendler, K S; Peng, Z; Chen, X

    2015-05-01

    Schizophrenia (SCZ) and bipolar disorder (BPD) are severe mental disorders with high heritability. Clinicians have long noticed the similarities of clinical symptoms between these disorders. In recent years, accumulating evidence indicates some shared genetic liabilities. However, what is shared remains elusive. In this study, we conducted whole transcriptome analysis of post-mortem brain tissues (cingulate cortex) from SCZ, BPD and control subjects, and identified differentially expressed genes in these disorders. We found 105 and 153 genes differentially expressed in SCZ and BPD, respectively. By comparing the t-test scores, we found that many of the genes differentially expressed in SCZ and BPD are concordant in their expression level (q⩽0.01, 53 genes; q⩽0.05, 213 genes; q⩽0.1, 885 genes). Using genome-wide association data from the Psychiatric Genomics Consortium, we found that these differentially and concordantly expressed genes were enriched in association signals for both SCZ (Pgenes show concordant expression and association for both SCZ and BPD. Pathway analyses of these genes indicated that they are involved in the lysosome, Fc gamma receptor-mediated phagocytosis, regulation of actin cytoskeleton pathways, along with several cancer pathways. Functional analyses of these genes revealed an interconnected pathway network centered on lysosomal function and the regulation of actin cytoskeleton. These pathways and their interacting network were principally confirmed by an independent transcriptome sequencing data set of the hippocampus. Dysregulation of lysosomal function and cytoskeleton remodeling has direct impacts on endocytosis, phagocytosis, exocytosis, vesicle trafficking, neuronal maturation and migration, neurite outgrowth and synaptic density and plasticity, and different aspects of these processes have been implicated in SCZ and BPD.

  1. Evolution of chloroplast vesicle transport.

    Science.gov (United States)

    Westphal, Sabine; Soll, Jürgen; Vothknecht, Ute C

    2003-02-01

    Vesicle traffic plays a central role in eukaryotic transport. The presence of a vesicle transport system inside chloroplasts of spermatophytes raises the question of its phylogenetic origin. To elucidate the evolution of this transport system we analyzed organisms belonging to different lineages that arose from the first photosynthetic eukaryote, i.e. glaucocystophytes, chlorophytes, rhodophytes, and charophytes/embryophytes. Intriguingly, vesicle transport is not apparent in any group other than embryophytes. The transfer of this eukaryotic-type vesicle transport system from the cytosol into the chloroplast thus seems a late evolutionary development that was acquired by land plants in order to adapt to new environmental challenges.

  2. The Mitochondrial Genomes of Aquila fasciata and Buteo lagopus (Aves, Accipitriformes): Sequence, Structure and Phylogenetic Analyses.

    Directory of Open Access Journals (Sweden)

    Lan Jiang

    Full Text Available The family Accipitridae is one of the largest groups of non-passerine birds, including 68 genera and 243 species globally distributed. In the present study, we determined the complete mitochondrial sequences of two species of accipitrid, namely Aquila fasciata and Buteo lagopus, and conducted a comparative mitogenome analysis across the family. The mitogenome lengths of A. fasciata and B. lagopus are 18,513 and 18,559 bp, with A + T contents of 54.2% and 55.0%, respectively. For the mtDNAs of both accipitrid birds, obvious positive AT-skew and negative GC-skew biases were detected for all 12 PCGs encoded by the H strand, whereas the reverse was found in MT-ND6 encoded by the L strand. One extra nucleotide 'C' is present at position 174 of the MT-ND3 gene of A. fasciata, which is not observed in that of B. lagopus. Six conserved sequence boxes in Domain II, named boxes F, E, D, C, CSBa, and CSBb, respectively, were recognized in the CRs of A. fasciata and B. lagopus. Rates and patterns of mitochondrial gene evolution within Accipitridae were also estimated. The highest dN/dS was detected for the MT-ATP8 gene (0.32493) among Accipitridae, while the lowest was for the MT-CO1 gene (0.01415). Mitophylogenetic analysis supported the robust monophyly of Accipitriformes, and Cathartidae was basal to the balance of the order. Moreover, we performed phylogenetic analyses using two other data sets (two mitochondrial loci, and combined nuclear and mitochondrial loci). Our results indicate that the subfamily Aquilinae and all currently polytypic genera of this subfamily are monophyletic. These two novel mtDNA data will be useful in refining the phylogenetic relationships and evolutionary processes of Accipitriformes.

  3. The Mitochondrial Genomes of Aquila fasciata and Buteo lagopus (Aves, Accipitriformes): Sequence, Structure and Phylogenetic Analyses.

    Science.gov (United States)

    Jiang, Lan; Chen, Juan; Wang, Ping; Ren, Qiongqiong; Yuan, Jian; Qian, Chaoju; Hua, Xinghong; Guo, Zhichun; Zhang, Lei; Yang, Jianke; Wang, Ying; Zhang, Qin; Ding, Hengwu; Bi, De; Zhang, Zongmeng; Wang, Qingqing; Chen, Dongsheng; Kan, Xianzhao

    2015-01-01

    The family Accipitridae is one of the largest groups of non-passerine birds, including 68 genera and 243 species globally distributed. In the present study, we determined the complete mitochondrial sequences of two species of accipitrid, namely Aquila fasciata and Buteo lagopus, and conducted a comparative mitogenome analysis across the family. The mitogenome lengths of A. fasciata and B. lagopus are 18,513 and 18,559 bp, with A + T contents of 54.2% and 55.0%, respectively. For the mtDNAs of both accipitrid birds, obvious positive AT-skew and negative GC-skew biases were detected for all 12 PCGs encoded by the H strand, whereas the reverse was found in MT-ND6 encoded by the L strand. One extra nucleotide 'C' is present at position 174 of the MT-ND3 gene of A. fasciata, which is not observed in that of B. lagopus. Six conserved sequence boxes in Domain II, named boxes F, E, D, C, CSBa, and CSBb, respectively, were recognized in the CRs of A. fasciata and B. lagopus. Rates and patterns of mitochondrial gene evolution within Accipitridae were also estimated. The highest dN/dS was detected for the MT-ATP8 gene (0.32493) among Accipitridae, while the lowest was for the MT-CO1 gene (0.01415). Mitophylogenetic analysis supported the robust monophyly of Accipitriformes, and Cathartidae was basal to the balance of the order. Moreover, we performed phylogenetic analyses using two other data sets (two mitochondrial loci, and combined nuclear and mitochondrial loci). Our results indicate that the subfamily Aquilinae and all currently polytypic genera of this subfamily are monophyletic. These two novel mtDNA data will be useful in refining the phylogenetic relationships and evolutionary processes of Accipitriformes.
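
    The A + T content and strand skews reported in this record are conventionally computed from base counts as AT-skew = (A - T)/(A + T) and GC-skew = (G - C)/(G + C). The following is a minimal sketch of that calculation, assuming the standard skew definitions; the function name and the toy sequence are hypothetical and are not taken from the study.

```python
from collections import Counter


def base_composition(seq: str) -> dict:
    """Return A+T fraction, AT-skew and GC-skew for a DNA string.

    Assumes the conventional definitions:
      AT-skew = (A - T) / (A + T), GC-skew = (G - C) / (G + C).
    Characters other than A, C, G, T (e.g. N) are ignored.
    """
    counts = Counter(seq.upper())
    a, t, g, c = (counts[base] for base in "ATGC")
    total = a + t + g + c
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return {
        "at_content": (a + t) / total,
        "at_skew": (a - t) / (a + t) if a + t else 0.0,
        "gc_skew": (g - c) / (g + c) if g + c else 0.0,
    }


# Hypothetical toy sequence (not real mitogenome data): 4 A, 2 T, 3 C, 1 G.
toy = "AAAATTCCCG"
result = base_composition(toy)
print(result)
```

    On this toy input the AT-skew is positive and the GC-skew is negative, the same sign pattern the abstract describes for the H-strand protein-coding genes.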

  4. Spliced leader-based metatranscriptomic analyses lead to recognition of hidden genomic features in dinoflagellates.

    Science.gov (United States)

    Lin, Senjie; Zhang, Huan; Zhuang, Yunyun; Tran, Bao; Gill, John

    2010-11-16

    Environmental transcriptomics (metatranscriptomics) for a specific lineage of eukaryotic microbes (e.g., Dinoflagellata) would be instrumental for unraveling the genetic mechanisms by which these microbes respond to the natural environment, but it has not been exploited because of technical difficulties. Using the recently discovered dinoflagellate mRNA-specific spliced leader as a selective primer, we constructed cDNA libraries (e-cDNAs) from one marine and two freshwater plankton assemblages. Small-scale sequencing of the e-cDNAs revealed functionally diverse transcriptomes proven to be of dinoflagellate origin. A set of dinoflagellate common genes and transcripts of dominant dinoflagellate species were identified. Further analyses of the dataset prompted us to delve into the existing, largely unannotated dinoflagellate EST datasets (DinoEST). Consequently, all four nucleosome core histones, two histone modification proteins, and a nucleosome assembly protein were detected, clearly indicating the presence of nucleosome-like machinery long thought not to exist in dinoflagellates. The isolation of rhodopsin from taxonomically and ecotypically diverse dinoflagellates and its structural similarity and phylogenetic affinity to xanthorhodopsin suggest a common genetic potential in dinoflagellates to use solar energy nonphotosynthetically. Furthermore, we found 55 cytoplasmic ribosomal proteins (RPs) from the e-cDNAs and 24 more from DinoEST, showing that the dinoflagellate phylum possesses all 79 eukaryotic RPs. Our results suggest that a sophisticated eukaryotic molecular machine operates in dinoflagellates that likely encodes many more unsuspected physiological capabilities and, meanwhile, demonstrate that unique spliced leaders are useful for profiling lineage-specific microbial transcriptomes in situ.

  5. Nitrogen control of chloroplast differentiation. Annual progress report

    Energy Technology Data Exchange (ETDEWEB)

    Schmidt, G.W.

    1992-07-01

    This project is directed toward understanding how the availability of nitrogen affects the accumulation of chloroplast pigments and proteins functioning in energy transduction and carbon metabolism. Molecular analyses are performed with Chlamydomonas reinhardtii grown in a continuous culture system such that ammonium is maintained at a low steady-state concentration so as to limit cell division. As compared to chloroplasts from cells of non-limiting nitrogen provisions, chloroplasts of N-limited cells are profoundly chlorophyll-deficient but still assimilate carbon for deposition as starch and as storage lipids. Chlorophyll deficiency arises by limiting accumulation of appropriate nuclear-encoded mRNAs and by depressed rates of translation of chloroplast mRNAs for apoproteins of reaction centers. Chloroplast translational effects can be partially ascribed to diminished rates of chlorophyll biosynthesis in N-limited cells, but pigment levels are not determinants for expression of the nuclear light-harvesting protein genes. Consequently, other signals that are responsive to nitrogen availability mediate transcriptional or post-transcriptional processes for accumulation of the mRNAs for LHC apoproteins and other mRNAs whose abundance is dependent upon high nitrogen levels. Conversely, limited nitrogen availability promotes accumulation of other proteins involved in carbon metabolism and oxidative electron transport in chloroplasts. Hence, thylakoids of N-limited cells exhibit enhanced chlororespiratory activities wherein oxygen serves as the electron acceptor in a pathway that involves plastoquinone and other electron carrier proteins that remain to be thoroughly characterized. Ongoing and future studies are also outlined.

  6. Localization of phosphatidylcholine in outer envelope membrane of spinach chloroplasts

    Science.gov (United States)

    1985-01-01

    We have examined the effects of phospholipase C from Bacillus cereus on the extent of phospholipid hydrolysis in envelope membrane vesicles and in intact chloroplasts. When isolated envelope vesicles were incubated in presence of phospholipase C, phosphatidylcholine and phosphatidylglycerol, but not phosphatidylinositol, were totally converted into diacylglycerol if they were available to the enzyme (i.e., when the vesicles were sonicated in presence of phospholipase C). These experiments demonstrate that phospholipase C can be used to probe the availability of phosphatidylcholine and phosphatidylglycerol in the cytosolic leaflet of the outer envelope membrane from spinach chloroplasts. When isolated, purified, intact chloroplasts were incubated with low amounts of phospholipase C (0.3 U/mg chlorophyll) under very mild conditions (12 degrees C for 1 min), greater than 80% of phosphatidylcholine molecules and almost none of phosphatidylglycerol molecules were hydrolyzed. Since we have also demonstrated, by using several different methods (phase-contrast and electron microscopy, immunochemical and electrophoretic analyses) that isolated spinach chloroplasts, and especially their outer envelope membrane, remained intact after mild treatment with phospholipase C, we can conclude that there is a marked asymmetric distribution of phospholipids across the outer envelope membrane of spinach chloroplasts. Phosphatidylcholine, the major polar lipid of the outer envelope membrane, is almost entirely accessible from the cytosolic side of the membrane and therefore is probably localized in the outer leaflet of the outer envelope bilayer. On the contrary, phosphatidylglycerol, the major polar lipid in the inner envelope membrane and the thylakoids, is probably not accessible to phospholipase C from the cytosol and therefore is probably localized mostly in the inner leaflet of the outer envelope membrane and in the other chloroplast membranes. PMID:3988805

  7. Hybridization Capture Using RAD Probes (hyRAD), a New Tool for Performing Genomic Analyses on Collection Specimens.

    Directory of Open Access Journals (Sweden)

    Tomasz Suchan

    Full Text Available In recent years, many protocols aimed at reproducibly sequencing reduced-genome subsets in non-model organisms have been published. Among them, RAD-sequencing is one of the most widely used. It relies on digesting DNA with specific restriction enzymes and performing size selection on the resulting fragments. Despite its acknowledged utility, this method is of limited use with degraded DNA samples, such as those isolated from museum specimens, as these samples are less likely to harbor fragments long enough to comprise two restriction sites, making possible ligation of the adapter sequences (in the case of double-digest RAD) or performing size selection of the resulting fragments (in the case of single-digest RAD). Here, we address these limitations by presenting a novel method called hybridization RAD (hyRAD). In this approach, biotinylated RAD fragments, covering a random fraction of the genome, are used as baits for capturing homologous fragments from genomic shotgun sequencing libraries. This simple and cost-effective approach allows sequencing of orthologous loci even from highly degraded DNA samples, opening new avenues of research in the field of museum genomics. Not relying on the restriction site presence, it improves among-sample loci coverage. In a trial study, hyRAD allowed us to obtain a large set of orthologous loci from fresh and museum samples from a non-model butterfly species, with a high proportion of single nucleotide polymorphisms present in all eight analyzed specimens, including 58-year-old museum samples. The utility of the method was further validated using 49 museum and fresh samples of a Palearctic grasshopper species for which the spatial genetic structure was previously assessed using mtDNA amplicons. The application of the method is eventually discussed in a wider context. As it does not rely on the restriction site presence, it is therefore not sensitive to among-sample loci polymorphisms in the restriction sites

  8. Molecular analysis of the chloroplast Cu/Zn-SOD gene (AhCSD2) in peanut

    Institute of Scientific and Technical Information of China (English)

    Xiurong Zhang; Qian Wan; Fengzhen Liu; Kun Zhang; Aiqing Sun; Bing Luo; Li Sun; Yongshan Wan

    2015-01-01

    Superoxide dismutase (SOD, EC 1.15.1.1) plays a key role in response to drought stress, and differences in SOD activity changes among cultivars are important under drought conditions. We obtained the full-length DNA of the chloroplast Cu/Zn-SOD gene (AhCSD2) from 11 allotetraploid cultivars and 5 diploid wild species in peanut. BLAST search against the peanut genome showed that the AhCSD2 genes gCSD2-1 and gCSD2-2 are located at the tops of chromosome A03 (A genome) and B03 (B genome), respectively, and both contain 8 exons and 7 introns. Nucleotide sequence analyses indicated that gCSD2-2 sequences were identical among all the tested cultivars, while gCSD2-1 sequences showed allelic variations. The amino acid sequences deduced from gCSD2-1 and gCSD2-2 both contain a chloroplast transit peptide and are distinguished by 6 amino acid (aa) residue differences. The other 2 aa residue variations in the mature peptide regions give rise to three-dimensional structure changes of the protein deduced from the genes gCSD2-1 and gCSD2-2. Sequences analyses of cultivars and wild species showed that gCSD2-2 of Arachis hypogaea and gAipCSD2 (Arachis ipaensis) are identical, and despite the abundant polymorphic loci between gCSD2-1 of A. hypogaea and sequences from A genome wild species, the deduced amino acid sequence of AhCSD2-1 (A. hypogaea) is identical to that of AduCSD2 (Arachis duranensis), whereas AcoCSD2 (Arachis correntina) and AcaCSD2 (Arachis cardenasii) both have 2 aa differences in the transit peptide region compared with AhCSD2-1 (A. hypogaea). Based on the Peanut Genome Project, promoter prediction revealed many stress-related cis-acting elements within the potential promoter regions (pp-A and pp-B). pp-A contains more binding sites for drought-associated transcriptional factors than pp-B. We hypothesize that the marked changes in SOD activity in different cultivars under drought stress are tightly regulated by transcription factors through transcription and

  9. Molecular analysis of the chloroplast Cu/Zn-SOD gene (AhCSD2) in peanut

    Institute of Scientific and Technical Information of China (English)

    Xiurong Zhang; Qian Wan; Fengzhen Liu; Kun Zhang; Aiqing Sun; Bing Luo; Li Sun; Yongshan Wan

    2015-01-01

    Superoxide dismutase (SOD, EC 1.15.1.1) plays a key role in response to drought stress, and differences in SOD activity changes among cultivars are important under drought conditions. We obtained the full-length DNA of the chloroplast Cu/Zn-SOD gene (AhCSD2) from 11 allotetraploid cultivars and 5 diploid wild species in peanut. BLAST search against the peanut genome showed that the AhCSD2 genes gCSD2-1 and gCSD2-2 are located at the tops of chromosome A03 (A genome) and B03 (B genome), respectively, and both contain 8 exons and 7 introns. Nucleotide sequence analyses indicated that gCSD2-2 sequences were identical among all the tested cultivars, while gCSD2-1 sequences showed allelic variations. The amino acid sequences deduced from gCSD2-1 and gCSD2-2 both contain a chloroplast transit peptide and are distinguished by 6 amino acid (aa) residue differences. The other 2 aa residue variations in the mature peptide regions give rise to three-dimensional structure changes of the protein deduced from the genes gCSD2-1 and gCSD2-2. Sequences analyses of cultivars and wild species showed that gCSD2-2 of Arachis hypogaea and gAipCSD2 (Arachis ipaensis) are identical, and despite the abundant polymorphic loci between gCSD2-1 of A. hypogaea and sequences from A genome wild species, the deduced amino acid sequence of AhCSD2-1 (A. hypogaea) is identical to that of AduCSD2 (Arachis duranensis), whereas AcoCSD2 (Arachis correntina) and AcaCSD2 (Arachis cardenasii) both have 2 aa differences in the transit peptide region compared with AhCSD2-1 (A. hypogaea). Based on the Peanut Genome Project, promoter prediction revealed many stress-related cis-acting elements within the potential promoter regions (pp-A and pp-B). pp-A contains more binding sites for drought-associated transcriptional factors than pp-B. We hypothesize that the marked changes in SOD activity in different cultivars under drought stress are tightly regulated by transcription factors through transcription and

  10. The Chlamydomonas Genome Reveals the Evolution of Key Animal and Plant Functions

    Energy Technology Data Exchange (ETDEWEB)

    Merchant, Sabeeha S

    2007-04-09

    Chlamydomonas reinhardtii is a unicellular green alga whose lineage diverged from land plants over 1 billion years ago. It is a model system for studying chloroplast-based photosynthesis, as well as the structure, assembly, and function of eukaryotic flagella (cilia), which were inherited from the common ancestor of plants and animals, but lost in land plants. We sequenced the 120-megabase nuclear genome of Chlamydomonas and performed comparative phylogenomic analyses, identifying genes encoding uncharacterized proteins that are likely associated with the function and biogenesis of chloroplasts or eukaryotic flagella. Analyses of the Chlamydomonas genome advance our understanding of the ancestral eukaryotic cell, reveal previously unknown genes associated with photosynthetic and flagellar functions, and establish links between ciliopathy and the composition and function of flagella.

  11. Development of the First Chloroplast Microsatellite Loci in Ginkgo biloba (Ginkgoaceae)

    Directory of Open Access Journals (Sweden)

    Chun-Xiang Xie

    2013-07-01

    Full Text Available Premise of the study: To investigate population genetics, phylogeography, and cultivar origin of Ginkgo biloba, chloroplast microsatellite primers were developed. Methods and Results: Twenty-one chloroplast microsatellite markers were identified referring to the two published chloroplast genomes of G. biloba. Polymorphisms were assessed on four natural populations from the two refugia in China. Eight loci were detected to be polymorphic in these populations. The number of alleles per locus ranged from three to seven, and the unbiased haploid diversity per locus varied from 0.441 to 0.807. Conclusions: For the first time, we developed 21 chloroplast microsatellite markers for G. biloba, including 13 monomorphic and eight polymorphic ones within the assessed natural populations. These markers should provide a powerful tool for the study of genetic variation of both natural and cultivated populations of G. biloba, as well as cultivars.
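
    The "unbiased haploid diversity" reported per locus is commonly computed as h = n/(n - 1) × (1 - Σ p_i²), where n is the number of sampled haplotypes and p_i are the allele frequencies. The sketch below illustrates that estimator, assuming this is the definition the authors used; the record does not state the formula, and the function name and allele sizes are hypothetical.

```python
from collections import Counter


def unbiased_haploid_diversity(alleles):
    """Unbiased haploid diversity for one locus.

    Assumes the estimator h = n / (n - 1) * (1 - sum(p_i ** 2)),
    with n sampled haplotypes and allele frequencies p_i.
    """
    n = len(alleles)
    if n < 2:
        raise ValueError("need at least two sampled haplotypes")
    freqs = [count / n for count in Counter(alleles).values()]
    return (n / (n - 1)) * (1.0 - sum(p * p for p in freqs))


# Hypothetical fragment sizes (bp) scored at one microsatellite locus.
locus = [150, 150, 152, 154]
print(round(unbiased_haploid_diversity(locus), 3))
```

    A monomorphic locus (a single allele in every sample) gives h = 0 under this estimator, which is how the 13 monomorphic markers in the record would score.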

  12. Transcriptome analysis of ectopic chloroplast development in green curd cauliflower (Brassica oleracea L. var. botrytis)

    Directory of Open Access Journals (Sweden)

    Zhou Xiangjun

    2011-11-01

    Full Text Available Abstract Background: Chloroplasts are the green plastids where photosynthesis takes place. The biogenesis of chloroplasts requires the coordinated expression of both nuclear and chloroplast genes and is regulated by developmental and environmental signals. Despite extensive studies of this process, the genetic basis and the regulatory control of chloroplast biogenesis and development remain to be elucidated. Results: The green cauliflower mutant causes ectopic development of chloroplasts in the curd tissue of the plant, turning the otherwise white curd green. To investigate the transcriptional control of chloroplast development, we compared gene expression between green and white curds using the RNA-seq approach. Deep sequencing produced over 15 million reads with lengths of 86 base pairs from each cDNA library. A total of 7,155 genes were found to exhibit at least 3-fold changes in expression between green and white curds. These included light-regulated genes, genes encoding chloroplast constituents, and genes involved in chlorophyll biosynthesis. Moreover, we discovered that the cauliflower ELONGATED HYPOCOTYL5 (BoHY5) was expressed at a higher level in green curds than in white curds and that 2,616 HY5-targeted genes, including 1,600 up-regulated genes and 1,016 down-regulated genes, were differentially expressed in green in comparison to white curd tissue. All of these 1,600 up-regulated genes were HY5-targeted genes in the light. Conclusions: The genome-wide profiling of gene expression by RNA-seq in green curds led to the identification of large numbers of genes associated with chloroplast development, and suggested the role of regulatory genes in the high hierarchy of light signaling pathways in mediating the ectopic chloroplast development in the green curd cauliflower mutant.

  13. Photosynthesis of root chloroplasts developed in Arabidopsis lines overexpressing GOLDEN2-LIKE transcription factors.

    Science.gov (United States)

    Kobayashi, Koichi; Sasaki, Daichi; Noguchi, Ko; Fujinuma, Daiki; Komatsu, Hirohisa; Kobayashi, Masami; Sato, Mayuko; Toyooka, Kiminori; Sugimoto, Keiko; Niyogi, Krishna K; Wada, Hajime; Masuda, Tatsuru

    2013-08-01

    In plants, genes involved in photosynthesis are encoded separately in nuclei and plastids, and tight cooperation between these two genomes is therefore required for the development of functional chloroplasts. Golden2-like (GLK) transcription factors are involved in chloroplast development, directly targeting photosynthesis-associated nuclear genes for up-regulation. Although overexpression of GLKs leads to chloroplast development in non-photosynthetic organs, the mechanisms of coordination between the nuclear gene expression influenced by GLKs and the photosynthetic processes inside chloroplasts are largely unknown. To elucidate the impact of GLK-induced expression of photosynthesis-associated nuclear genes on the construction of photosynthetic systems, chloroplast morphology and photosynthetic characteristics in greenish roots of Arabidopsis thaliana lines overexpressing GLKs were compared with those in wild-type roots and leaves. Overexpression of GLKs caused up-regulation of not only their direct targets but also non-target nuclear and plastid genes, leading to global induction of chloroplast biogenesis in the root. Large antennae relative to reaction centers were observed in wild-type roots and were further enhanced by GLK overexpression due to the increased expression of target genes associated with peripheral light-harvesting antennae. Photochemical efficiency was lower in the root chloroplasts than in leaf chloroplasts, suggesting that the imbalance in the photosynthetic machinery decreases the efficiency of light utilization in root chloroplasts. Despite the low photochemical efficiency, root photosynthesis contributed to carbon assimilation in Arabidopsis. Moreover, GLK overexpression increased CO₂ fixation and promoted phototrophic performance of the root, showing the potential of root photosynthesis to improve effective carbon utilization in plants.

  14. A nuclear mutant of Chlamydomonas that exhibits increased sensitivity to UV irradiation, reduced recombination of nuclear genes, and altered transmission of chloroplast genes.

    Science.gov (United States)

    Rosen, H; Newman, S M; Boynton, J E; Gillham, N W

    1991-01-01

    Meiotic progeny of Chlamydomonas reinhardtii normally receive chloroplast genomes only from the mt+ parent. However, exceptional zygotes, which transmit the chloroplast genomes of both parents or, more rarely, only those of the mt- parent, arise at a low frequency. Mutations at the mt(+)-linked mat-3 locus were found previously to elevate the transmission of chloroplast genomes from the mt- parent, resulting in a much higher than normal frequency of exceptional zygotes. In this paper we demonstrate that an ultraviolet-sensitive nuclear mutation mapping at the uvsE1 locus, which is unlinked to mating type, also promotes chloroplast genome transmission from the mt- parent. This mutant, which was previously shown to reduce recombination of nuclear genes in meiosis, acts synergistically with the mat-3-3 mutation to produce an extremely high frequency of exceptional zygotes. Through the use of restriction fragment length polymorphisms existing in the chloroplast genomes of C. reinhardtii and the interfertile strain C. smithii, we show that chloroplast DNA fragments from the mt- parent normally begin to disappear shortly after zygote formation. However, this process appears to be blocked totally in the absence of wild-type uvsE1 and mat-3 gene products. Our findings are consistent with the hypothesis that both gene products contribute to the mechanism responsible for uniparental inheritance of the chloroplast genome from the mt+ parent.

  15. Large-scale genome-wide association studies and meta-analyses of longitudinal change in adult lung function.

    Directory of Open Access Journals (Sweden)

    Wenbo Tang

    Full Text Available Genome-wide association studies (GWAS) have identified numerous loci influencing cross-sectional lung function, but less is known about genes influencing longitudinal change in lung function. We performed GWAS of the rate of change in forced expiratory volume in the first second (FEV1) in 14 longitudinal, population-based cohort studies comprising 27,249 adults of European ancestry using linear mixed effects model and combined cohort-specific results using fixed effect meta-analysis to identify novel genetic loci associated with longitudinal change in lung function. Gene expression analyses were subsequently performed for identified genetic loci. As a secondary aim, we estimated the mean rate of decline in FEV1 by smoking pattern, irrespective of genotypes, across these 14 studies using meta-analysis. The overall meta-analysis produced suggestive evidence for association at the novel IL16/STARD5/TMC3 locus on chromosome 15 (P = 5.71 × 10⁻⁷). In addition, meta-analysis using the five cohorts with ≥3 FEV1 measurements per participant identified the novel ME3 locus on chromosome 11 (P = 2.18 × 10⁻⁸) at genome-wide significance. Neither locus was associated with FEV1 decline in two additional cohort studies. We confirmed gene expression of IL16, STARD5, and ME3 in multiple lung tissues. Publicly available microarray data confirmed differential expression of all three genes in lung samples from COPD patients compared with controls. Irrespective of genotypes, the combined estimate for FEV1 decline was 26.9, 29.2 and 35.7 mL/year in never, former, and persistent smokers, respectively. In this large-scale GWAS, we identified two novel genetic loci in association with the rate of change in FEV1 that harbor candidate genes with biologically plausible functional links to lung function.

  16. Large-Scale Genome-Wide Association Studies and Meta-Analyses of Longitudinal Change in Adult Lung Function

    Science.gov (United States)

    Tang, Wenbo; Kowgier, Matthew; Loth, Daan W.; Soler Artigas, María; Joubert, Bonnie R.; Hodge, Emily; Gharib, Sina A.; Smith, Albert V.; Ruczinski, Ingo; Gudnason, Vilmundur; Mathias, Rasika A.; Harris, Tamara B.; Hansel, Nadia N.; Launer, Lenore J.; Barnes, Kathleen C.; Hansen, Joyanna G.; Albrecht, Eva; Aldrich, Melinda C.; Allerhand, Michael; Barr, R. Graham; Brusselle, Guy G.; Couper, David J.; Curjuric, Ivan; Davies, Gail; Deary, Ian J.; Dupuis, Josée; Fall, Tove; Foy, Millennia; Franceschini, Nora; Gao, Wei; Gläser, Sven; Gu, Xiangjun; Hancock, Dana B.; Heinrich, Joachim; Hofman, Albert; Imboden, Medea; Ingelsson, Erik; James, Alan; Karrasch, Stefan; Koch, Beate; Kritchevsky, Stephen B.; Kumar, Ashish; Lahousse, Lies; Li, Guo; Lind, Lars; Lindgren, Cecilia; Liu, Yongmei; Lohman, Kurt; Lumley, Thomas; McArdle, Wendy L.; Meibohm, Bernd; Morris, Andrew P.; Morrison, Alanna C.; Musk, Bill; North, Kari E.; Palmer, Lyle J.; Probst-Hensch, Nicole M.; Psaty, Bruce M.; Rivadeneira, Fernando; Rotter, Jerome I.; Schulz, Holger; Smith, Lewis J.; Sood, Akshay; Starr, John M.; Strachan, David P.; Teumer, Alexander; Uitterlinden, André G.; Völzke, Henry; Voorman, Arend; Wain, Louise V.; Wells, Martin T.; Wilk, Jemma B.; Williams, O. Dale; Heckbert, Susan R.; Stricker, Bruno H.; London, Stephanie J.; Fornage, Myriam; Tobin, Martin D.; O′Connor, George T.; Hall, Ian P.; Cassano, Patricia A.

    2014-01-01

    Background: Genome-wide association studies (GWAS) have identified numerous loci influencing cross-sectional lung function, but less is known about genes influencing longitudinal change in lung function. Methods: We performed GWAS of the rate of change in forced expiratory volume in the first second (FEV1) in 14 longitudinal, population-based cohort studies comprising 27,249 adults of European ancestry using linear mixed effects model and combined cohort-specific results using fixed effect meta-analysis to identify novel genetic loci associated with longitudinal change in lung function. Gene expression analyses were subsequently performed for identified genetic loci. As a secondary aim, we estimated the mean rate of decline in FEV1 by smoking pattern, irrespective of genotypes, across these 14 studies using meta-analysis. Results: The overall meta-analysis produced suggestive evidence for association at the novel IL16/STARD5/TMC3 locus on chromosome 15 (P = 5.71 × 10⁻⁷). In addition, meta-analysis using the five cohorts with ≥3 FEV1 measurements per participant identified the novel ME3 locus on chromosome 11 (P = 2.18 × 10⁻⁸) at genome-wide significance. Neither locus was associated with FEV1 decline in two additional cohort studies. We confirmed gene expression of IL16, STARD5, and ME3 in multiple lung tissues. Publicly available microarray data confirmed differential expression of all three genes in lung samples from COPD patients compared with controls. Irrespective of genotypes, the combined estimate for FEV1 decline was 26.9, 29.2 and 35.7 mL/year in never, former, and persistent smokers, respectively. Conclusions: In this large-scale GWAS, we identified two novel genetic loci in association with the rate of change in FEV1 that harbor candidate genes with biologically plausible functional links to lung function. PMID:24983941
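
    The fixed-effect meta-analysis used to combine cohort-specific results is typically an inverse-variance weighted average: each cohort estimate β_i is weighted by w_i = 1/SE_i², giving a pooled β = Σ w_i β_i / Σ w_i with SE = sqrt(1/Σ w_i). The sketch below illustrates that textbook calculation, assuming inverse-variance weighting; the record does not specify the authors' software or exact procedure, and the function name and cohort numbers are hypothetical.

```python
import math


def fixed_effect_meta(betas, ses):
    """Inverse-variance weighted fixed-effect meta-analysis.

    betas: per-cohort effect estimates; ses: their standard errors.
    Returns (pooled_beta, pooled_se, z_score).
    """
    if len(betas) != len(ses) or not betas:
        raise ValueError("betas and ses must be non-empty and of equal length")
    if any(se <= 0 for se in ses):
        raise ValueError("standard errors must be positive")
    weights = [1.0 / (se * se) for se in ses]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    return pooled_beta, pooled_se, pooled_beta / pooled_se


# Hypothetical per-cohort SNP effects on FEV1 change (mL/year) and SEs.
beta, se, z = fixed_effect_meta([1.0, 3.0], [1.0, 1.0])
print(beta, round(se, 4), round(z, 3))
```

    With equal standard errors the pooled estimate is the plain average of the cohort estimates, and the pooled standard error shrinks by a factor of sqrt(k) for k cohorts, which is why combining 14 cohorts increases power to detect small effects.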

  17. Cross-Disorder Genome-Wide Analyses Suggest a Complex Genetic Relationship Between Tourette Syndrome and Obsessive-Compulsive Disorder

    Science.gov (United States)

    Yu, Dongmei; Mathews, Carol A.; Scharf, Jeremiah M.; Neale, Benjamin M.; Davis, Lea K.; Gamazon, Eric R.; Derks, Eske M.; Evans, Patrick; Edlund, Christopher K.; Crane, Jacquelyn; Fagerness, Jesen A.; Osiecki, Lisa; Gallagher, Patience; Gerber, Gloria; Haddad, Stephen; Illmann, Cornelia; McGrath, Lauren M.; Mayerfeld, Catherine; Arepalli, Sampath; Barlassina, Cristina; Barr, Cathy L.; Bellodi, Laura; Benarroch, Fortu; Berrió, Gabriel Bedoya; Bienvenu, O. Joseph; Black, Donald; Bloch, Michael H.; Brentani, Helena; Bruun, Ruth D.; Budman, Cathy L.; Camarena, Beatriz; Campbell, Desmond D.; Cappi, Carolina; Cardona Silgado, Julio C.; Cavallini, Maria C.; Chavira, Denise A.; Chouinard, Sylvain; Cook, Edwin H.; Cookson, M. R.; Coric, Vladimir; Cullen, Bernadette; Cusi, Daniele; Delorme, Richard; Denys, Damiaan; Dion, Yves; Eapen, Valsama; Egberts, Karin; Falkai, Peter; Fernandez, Thomas; Fournier, Eduardo; Garrido, Helena; Geller, Daniel; Gilbert, Donald; Girard, Simon L.; Grabe, Hans J.; Grados, Marco A.; Greenberg, Benjamin D.; Gross-Tsur, Varda; Grünblatt, Edna; Hardy, John; Heiman, Gary A.; Hemmings, Sian M.J.; Herrera, Luis D.; Hezel, Dianne M.; Hoekstra, Pieter J.; Jankovic, Joseph; Kennedy, James L.; King, Robert A.; Konkashbaev, Anuar I.; Kremeyer, Barbara; Kurlan, Roger; Lanzagorta, Nuria; Leboyer, Marion; Leckman, James F.; Lennertz, Leonhard; Liu, Chunyu; Lochner, Christine; Lowe, Thomas L.; Lupoli, Sara; Macciardi, Fabio; Maier, Wolfgang; Manunta, Paolo; Marconi, Maurizio; McCracken, James T.; Mesa Restrepo, Sandra C.; Moessner, Rainald; Moorjani, Priya; Morgan, Jubel; Muller, Heike; Murphy, Dennis L.; Naarden, Allan L.; Ochoa, William Cornejo; Ophoff, Roel A.; Pakstis, Andrew J.; Pato, Michele T.; Pato, Carlos N.; Piacentini, John; Pittenger, Christopher; Pollak, Yehuda; Rauch, Scott L.; Renner, Tobias; Reus, Victor I.; Richter, Margaret A.; Riddle, Mark A.; Robertson, Mary M.; Romero, Roxana; Rosário, Maria C.; Rosenberg, David; Ruhrmann, Stephan; Sabatti, Chiara; Salvi, Erika; Sampaio, Aline S.; Samuels, Jack; Sandor, Paul; Service, Susan K.; Sheppard, Brooke; Singer, Harvey S.; Smit, Jan H.; Stein, Dan J.; Strengman, Eric; Tischfield, Jay A.; Turiel, Maurizio; Valencia Duarte, Ana V.; Vallada, Homero; Veenstra-VanderWeele, Jeremy; Walitza, Susanne; Walkup, John; Wang, Ying; Weale, Mike; Weiss, Robert; Wendland, Jens R.; Westenberg, Herman G.M.; Yao, Yin; Hounie, Ana G.; Miguel, Euripedes C.; Nicolini, Humberto; Wagner, Michael; Ruiz-Linares, Andres; Cath, Danielle C.; McMahon, William; Posthuma, Danielle; Oostra, Ben A.; Nestadt, Gerald; Rouleau, Guy A.; Purcell, Shaun; Jenike, Michael A.; Heutink, Peter; Hanna, Gregory L.; Conti, David V.; Arnold, Paul D.; Freimer, Nelson; Stewart, S. Evelyn; Knowles, James A.; Cox, Nancy J.; Pauls, David L.

    2014-01-01

    Obsessive-compulsive disorder (OCD) and Tourette Syndrome (TS) are highly heritable neurodevelopmental disorders that are thought to share genetic risk factors. However, the identification of definitive susceptibility genes for these etiologically complex disorders remains elusive. Here, we report a combined genome-wide association study (GWAS) of TS and OCD in 2723 cases (1310 with OCD, 834 with TS, 579 with OCD plus TS/chronic tics (CT)), 5667 ancestry-matched controls, and 290 OCD parent-child trios. Although no individual single nucleotide polymorphisms (SNPs) achieved genome-wide significance, the GWAS signals were enriched for SNPs strongly associated with variations in brain gene expression levels, i.e. expression quantitative trait loci (eQTLs), suggesting the presence of true functional variants that contribute to risk of these disorders. Polygenic score analyses identified a significant polygenic component for OCD (p=2×10⁻⁴), predicting 3.2% of the phenotypic variance in an independent data set. In contrast, TS had a smaller, non-significant polygenic component, predicting only 0.6% of the phenotypic variance (p=0.06). No significant polygenic signal was detected across the two disorders, although the sample is likely underpowered to detect a modest shared signal. Furthermore, the OCD polygenic signal was significantly attenuated when cases with both OCD and TS/CT were included in the analysis (p=0.01). Previous work has shown that TS and OCD have some degree of shared genetic variation. However, the data from this study suggest that there are also distinct components to the genetic architectures of TS and OCD. Furthermore, OCD with co-occurring TS/CT may have different underlying genetic susceptibility compared to OCD alone. PMID:25158072

  18. Signal integration by chloroplast phosphorylation networks: An update

    Directory of Open Access Journals (Sweden)

    Anna Schoenberg

    2012-11-01

    Forty years after the initial discovery of light-dependent protein phosphorylation at the thylakoid membrane system, we are now beginning to understand how chloroplast phosphorylation networks decode information on the metabolic status of the organelle and relay it to long-term adaptations in plastid and nuclear gene expression. With the help of genetics and functional genomics tools, chloroplast kinases and several hundred phosphoproteins have been identified that now await detailed functional characterization. The regulation and the target protein spectrum of some kinases are understood, but this information is fragmentary with respect to kinase and target protein crosstalk in a changing environment. In this review we highlight the most recent advances in the field and discuss approaches that might lead to a comprehensive understanding of plastid signal integration by protein phosphorylation.

  19. CHUP1 mediates actin-based light-induced chloroplast avoidance movement in the moss Physcomitrella patens.

    Science.gov (United States)

    Usami, Hiroka; Maeda, Takuma; Fujii, Yusuke; Oikawa, Kazusato; Takahashi, Fumio; Kagawa, Takatoshi; Wada, Masamitsu; Kasahara, Masahiro

    2012-12-01

    Chloroplasts change their intracellular distribution in response to light intensity. CHUP1 (CHLOROPLAST UNUSUAL POSITIONING1) is indispensable for this response in Arabidopsis thaliana. However, whether CHUP1 is involved in light-induced chloroplast movement in other plants is unknown. In this study, CHUP1 orthologues were isolated from a moss, Physcomitrella patens, and a fern, Adiantum capillus-veneris, by cDNA library screening and PCR cloning based on the P. patens genome sequence. Functional motifs found in CHUP1 of A. thaliana were conserved among the CHUP1 orthologues. In addition to the putative functional regions, the C-terminal regions (approximately 250 amino acids), which are unique to CHUP1s, were highly conserved. Green fluorescent protein (GFP) fusions of P. patens CHUP1s (PpCHUP1A, PpCHUP1B and PpCHUP1C) were transiently expressed in protoplast cells. All GFP fusions were localized on the chloroplasts. Because P. patens uses both microtubules and actin filaments for chloroplast movement, light-induced chloroplast avoidance movement of chup1 disruptants of P. patens was examined in the presence of cytoskeletal inhibitors. When actin filaments were disrupted by cytochalasin B, the wild type (WT) and all chup1 disruptants showed chloroplast avoidance movement. However, when microtubules were disrupted by oryzalin, chloroplasts in ∆chup1A and ∆chup1A/B rarely moved and stayed in the strong light-irradiated area. On the other hand, WT, ∆chup1B and ∆chup1C showed chloroplast avoidance movement. These results suggest that PpCHUP1A predominantly mediates the actin-based light-induced chloroplast avoidance movement. This study reveals that CHUP1 functions on the chloroplasts and is involved in the actin-based light-induced chloroplast avoidance movement in P. patens.

  20. Chloroplast avoidance movement is not functional in plants grown under strong sunlight.

    Science.gov (United States)

    Higa, Takeshi; Wada, Masamitsu

    2016-04-01

    Chloroplast movement in nine climbing plant species was investigated. It is thought that chloroplasts generally escape from strong light to avoid photodamage but accumulate towards weak light to perform photosynthesis effectively. Unexpectedly, however, the leaves of climbing plants grown under strong sunlight showed very low or no chloroplast photorelocation responses to either weak or strong blue light when detected by red light transmittance through leaves. Direct observations of Cayratia japonica leaves, for example, revealed that the average number of chloroplasts in upper periclinal walls of palisade tissue cells was only 1.2 after weak blue-light irradiation and almost all of the chloropl