annotation est-ssr characterization: Topics by WorldWideScience.org

Sample records for annotation est-ssr characterization

Mining and comparative survey of EST-SSR markers among members of Euphorbiaceae family.

Science.gov (United States)

Sen, Surojit; Dehury, Budheswar; Sahu, Jagajjit; Rathi, Sunayana; Yadav, Raj Narain Singh

2018-04-06

Euphorbiaceae represents flowering plants family of tropical and sub-tropical region rich in secondary metabolites of economic importance. To understand and assess the genetic makeup among the members, this study was undertaken to characterize and compare SSR markers from publicly available ESTs and GSSs of nine selected species of the family. Mining of SSRs was performed by MISA, primer designing by Primer3, while functional annotation, gene ontology (GO) and enrichment analysis were performed by Blast2GO. A total 12,878 number of SSRs were detected from 101,701 number of EST sequences. SSR density ranged from 1 SSR/3.22 kb to 1 SSR/15.65 kb. A total of 1873 primer pairs were designed for the annotated SSR-Contigs. About 77.07% SSR-ESTs could be assigned a significant match to the protein database. 3037 unique SSR-FDM were assigned and IPR003657 (WRKY Domain) was found to be the most dominant FDM among the members. 1810 unique GO terms obtained were further subjected to enrichment analysis to obtain 513 statistically significant GO terms mapped to the SSR containing ESTs. Most frequent enriched GO terms were, GO:0003824 for molecular function, GO:0006350 for biological process and GO:0005886 for cellular component, justifying the richness of defensive secondary metabolites and phytomedicine within the family. The results from this study provides tangible insight to genetic make-up and distribution of SSRs. Functional annotation corresponded many genes of unknown functions which may be considered as novel genes or genes responsible for stress specific secondary metabolites. Further studies are required to understand stress specific genes accountable for leveraging the synthesis of secondary metabolites.
Development and characterization of EST-SSR markers in Bombax ceiba (Malvaceae).

Science.gov (United States)

Ju, Miao-Miao; Ma, Huan-Cheng; Xin, Pei-Yao; Zhou, Zhi-Li; Tian, Bin

2015-04-01

Bombax ceiba (Malvaceae), commonly known as silk cotton tree, is a multipurpose tree species of tropical forests. Novel expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed and characterized for the species using transcriptome analysis. A total of 33 new EST-SSR markers were developed for B. ceiba, of which 13 showed polymorphisms across the 24 individuals from four distant populations tested in the study. The results showed that the number of alleles per polymorphic locus ranged from two to four, and the expected heterozygosity and observed heterozygosity per locus varied from 0.043 to 0.654 and from 0 to 0.609, respectively. These newly developed EST-SSR markers can be used in phylogeographic and population genetic studies to investigate the origin of B. ceiba populations. Furthermore, these EST-SSR markers could also greatly promote the development of molecular breeding studies pertaining to silk cotton tree.
Characterization and multiplexing of EST-SSR primers in Cynodon (Poaceae) species1.

Science.gov (United States)

Jewell, Margaret C; Frere, Celine H; Prentis, Peter J; Lambrides, Christopher J; Godwin, Ian D

2010-10-01

Cynodon species are multiple-use grasses that display varying levels of adaptation to biotic and abiotic stress. Previously identified EST-SSR primers were characterized and multiplexed to assess the level of genetic diversity present within a collection of almost 1200 Cynodon accessions from across Australia. • Two multiplex reactions were developed comprising a total of 16 EST-SSR markers. All SSR markers amplified across different Cynodon species and different levels of ploidy. The number of alleles ranged from one to eight per locus and the total number of alleles for the germplasm collection was 79. • The 16 markers show sufficient variation for the characterization of Cynodon core collections and analysis of population genetic diversity in Cynodon grasses.
Exploiting EST databases for the development and characterization of EST-SSR markers in castor bean (Ricinus communis L.

Directory of Open Access Journals (Sweden)

Yang Jun-Bo

2010-12-01

Full Text Available Abstract Background The castor bean (Ricinus communis L., a monotypic species in the spurge family (Euphorbiaceae, 2n = 20, is an important non-edible oilseed crop widely cultivated in tropical, sub-tropical and temperate countries for its high economic value. Because of the high level of ricinoleic acid (over 85% in its seed oil, the castor bean seed derivatives are often used in aviation oil, lubricants, nylon, dyes, inks, soaps, adhesive and biodiesel. Due to lack of efficient molecular markers, little is known about the population genetic diversity and the genetic relationships among castor bean germplasm. Efficient and robust molecular markers are increasingly needed for breeding and improving varieties in castor bean. The advent of modern genomics has produced large amounts of publicly available DNA sequence data. In particular, expressed sequence tags (ESTs provide valuable resources to develop gene-associated SSR markers. Results In total, 18,928 publicly available non-redundant castor bean EST sequences, representing approximately 17.03 Mb, were evaluated and 7732 SSR sites in 5,122 ESTs were identified by data mining. Castor bean exhibited considerably high frequency of EST-SSRs. We developed and characterized 118 polymorphic EST-SSR markers from 379 primer pairs flanking repeats by screening 24 castor bean samples collected from different countries. A total of 350 alleles were identified from 118 polymorphic SSR loci, ranging from 2-6 per locus (A with an average of 2.97. The EST-SSR markers developed displayed moderate gene diversity (He with an average of 0.41. Genetic relationships among 24 germplasms were investigated using the genotypes of 350 alleles, showing geographic pattern of genotypes across genetic diversity centers of castor bean. Conclusion Castor bean EST sequences exhibited considerably high frequency of SSR sites, and were rich resources for developing EST-SSR markers. These EST-SSR markers would be particularly
Mining and characterization of EST-SSR markers for Zingiber officinale Roscoe with transferability to other species of Zingiberaceae.

Science.gov (United States)

Awasthi, Praveen; Singh, Ashish; Sheikh, Gulfam; Mahajan, Vidushi; Gupta, Ajai Prakash; Gupta, Suphla; Bedi, Yashbir S; Gandhi, Sumit G

2017-10-01

Zingiber officinale is a model spice herb, well known for its medicinal value. It is primarily a vegetatively propagated commercial crop. However, considerable diversity in its morphology, fiber content and chemoprofiles has been reported. The present study explores the utility of EST-derived markers in studying genetic diversity in different accessions of Z. officinale and their cross transferability within the Zingiberaceae family. A total of 38,115 ESTs sequences were assembled to generate 7850 contigs and 10,762 singletons. SSRs were searched in the unigenes and 515 SSR-containing ESTs were identified with a frequency of 1 SSR per 25.21 kb of the genome. These ESTs were also annotated using BLAST2GO. Primers were designed for 349 EST-SSRs and 25 primer pairs were randomly picked for EST SSR study. Out of these, 16 primer pairs could be optimized for amplification in different accessions of Z. officinale as well as other species belonging to Zingiberaceae. GES454, GES466, GES480 and GES486 markers were found to exhibit 100% cross-transferability among different members of Zingiberaceae.
Characterization of the Kenaf (Hibiscus cannabinus) Global Transcriptome Using Illumina Paired-End Sequencing and Development of EST-SSR Markers

Science.gov (United States)

Li, Hui; Li, Defang; Chen, Anguo; Tang, Huijuan; Li, Jianjun; Huang, Siqi

2016-01-01

Kenaf (Hibiscus cannabinus L.) is an economically important natural fiber crop grown worldwide. However, only 20 expressed tag sequences (ESTs) for kenaf are available in public databases. The aim of this study was to develop large-scale simple sequence repeat (SSR) markers to lay a solid foundation for the construction of genetic linkage maps and marker-assisted breeding in kenaf. We used Illumina paired-end sequencing technology to generate new EST-simple sequences and MISA software to mine SSR markers. We identified 71,318 unigenes with an average length of 1143 nt and annotated these unigenes using four different protein databases. Overall, 9324 complementary pairs were designated as EST-SSR markers, and their quality was validated using 100 randomly selected SSR markers. In total, 72 primer pairs reproducibly amplified target amplicons, and 61 of these primer pairs detected significant polymorphism among 28 kenaf accessions. Thus, in this study, we have developed large-scale SSR markers for kenaf, and this new resource will facilitate construction of genetic linkage maps, investigation of fiber growth and development in kenaf, and also be of value to novel gene discovery and functional genomic studies. PMID:26960153
Characterization of the Kenaf (Hibiscus cannabinus) Global Transcriptome Using Illumina Paired-End Sequencing and Development of EST-SSR Markers.

Science.gov (United States)

Li, Hui; Li, Defang; Chen, Anguo; Tang, Huijuan; Li, Jianjun; Huang, Siqi

2016-01-01

Kenaf (Hibiscus cannabinus L.) is an economically important natural fiber crop grown worldwide. However, only 20 expressed tag sequences (ESTs) for kenaf are available in public databases. The aim of this study was to develop large-scale simple sequence repeat (SSR) markers to lay a solid foundation for the construction of genetic linkage maps and marker-assisted breeding in kenaf. We used Illumina paired-end sequencing technology to generate new EST-simple sequences and MISA software to mine SSR markers. We identified 71,318 unigenes with an average length of 1143 nt and annotated these unigenes using four different protein databases. Overall, 9324 complementary pairs were designated as EST-SSR markers, and their quality was validated using 100 randomly selected SSR markers. In total, 72 primer pairs reproducibly amplified target amplicons, and 61 of these primer pairs detected significant polymorphism among 28 kenaf accessions. Thus, in this study, we have developed large-scale SSR markers for kenaf, and this new resource will facilitate construction of genetic linkage maps, investigation of fiber growth and development in kenaf, and also be of value to novel gene discovery and functional genomic studies.
Characterization of variable EST SSR markers for Norway spruce (Picea abies L.

Directory of Open Access Journals (Sweden)

Spiess Nadine

2011-10-01

Full Text Available Abstract Background Norway spruce is widely distributed across Europe and the predominant tree of the Alpine region. Fast growth and the fact that timber can be harvested cost-effectively in relatively young populations define its status as one of the economically most important tree species of Northern Europe. In this study, EST derived simple sequence repeat (SSR markers were developed for the assessment of putative functional diversity in Austrian Norway spruce stands. Results SSR sequences were identified by analyzing 14,022 publicly available EST sequences. Tri-nucleotide repeat motifs were most abundant in the data set followed by penta- and hexa-nucleotide repeats. Specific primer pairs were designed for sixty loci. Among these, 27 displayed polymorphism in a testing population of 16 P. abies individuals sampled across Austria and in an additional screening population of 96 P. abies individuals from two geographically distinct Austrian populations. Allele numbers per locus ranged from two to 17 with observed heterozygosity ranging from 0.075 to 0.99. Conclusions We have characterized variable EST SSR markers for Norway spruce detected in expressed genes. Due to their moderate to high degree of variability in the two tested screening populations, these newly developed SSR markers are well suited for the analysis of stress related functional variation present in Norway spruce populations.
[EST-SSR identification, markers development of Ligusticum chuanxiong based on Ligusticum chuanxiong transcriptome sequences].

Science.gov (United States)

Yuan, Can; Peng, Fang; Yang, Ze-Mao; Zhong, Wen-Juan; Mou, Fang-Sheng; Gong, Yi-Yun; Ji, Pei-Cheng; Pu, De-Qiang; Huang, Hai-Yan; Yang, Xiao; Zhang, Chao

2017-09-01

Ligusticum chuanxiong is a well-known traditional Chinese medicine plant. The study on its molecular markers development and germplasm resources is very important. In this study, we obtained 24 422 unigenes by assembling transcriptome sequencing reads of L. chuanxiong root. EST-SSR was detected and 4 073 SSR loci were identified. EST-SSR distribution and characteristic analysis results showed that the mono-nucleotide repeats were the main repeat types, accounting for 41.0%. In addition, the sequences containing SSR were functionally annotated in Gene Ontology (GO) and KEGG pathway and were assigned to 49 GO categories, 242 KEGG pathways, among them 2 201 sequences were annotated against Nr database. By validating 235 EST-SSRs,74 primer pairs were ultimately proved to have high quality amplification. Subsequently, genetic diversity analysis, UPGMA cluster analysis, PCoA analysis and population structure analysis of 34 L. chuanxiong germplasm resources were carried out with 74 primer pairs. In both UPGMA tree and PCoA results, L. chuanxiong resources were clustered into two groups, which are believed to be partial related to their geographical distribution. In this study, EST-SSRs in L. chuanxiong was firstly identified, and newly developed molecular markers would contribute significantly to further genetic diversity study, the purity detection, gene mapping, and molecular breeding. Copyright© by the Chinese Pharmaceutical Association.
Characterization and development of EST-derived SSR markers in cultivated sweetpotato (Ipomoea batatas

Directory of Open Access Journals (Sweden)

Li Yujun

2011-10-01

Full Text Available Abstract Background Currently there exists a limited availability of genetic marker resources in sweetpotato (Ipomoea batatas, which is hindering genetic research in this species. It is necessary to develop more molecular markers for potential use in sweetpotato genetic research. With the newly developed next generation sequencing technology, large amount of transcribed sequences of sweetpotato have been generated and are available for identifying SSR markers by data mining. Results In this study, we investigated 181,615 ESTs for the identification and development of SSR markers. In total, 8,294 SSRs were identified from 7,163 SSR-containing unique ESTs. On an average, one SSR was found per 7.1 kb of EST sequence with tri-nucleotide motifs (42.9% being the most abundant followed by di- (41.2%, tetra- (9.2%, penta- (3.7% and hexa-nucleotide (3.1% repeat types. The top five motifs included AG/CT (26.9%, AAG/CTT (13.5%, AT/TA (10.6%, CCG/CGG (5.8% and AAT/ATT (4.5%. After removing possible duplicate of published EST-SSRs of sweetpotato, a total of non-repeat 7,958 SSR motifs were identified. Based on these SSR-containing sequences, 1,060 pairs of high-quality SSR primers were designed and used for validation of the amplification and assessment of the polymorphism between two parents of one mapping population (E Shu 3 Hao and Guang 2k-30 and eight accessions of cultivated sweetpotatoes. The results showed that 816 primer pairs could yield reproducible and strong amplification products, of which 195 (23.9% and 342 (41.9% primer pairs exhibited polymorphism between E Shu 3 Hao and Guang 2k-30 and among the 8 cultivated sweetpotatoes, respectively. Conclusion This study gives an insight into the frequency, type and distribution of sweetpotato EST-SSRs and demonstrates successful development of EST-SSR markers in cultivated sweetpotato. These EST-SSR markers could enrich the current resource of molecular markers for the sweetpotato community and would
Development of genomic SSR and potential EST-SSR markers in ...

African Journals Online (AJOL)

In addition, forty four EST-SSRs which can be amplified with expected sizes were identified from a B. chinense root cDNA library. The genomic SSR markers and potential EST-SSR markers developed in the present study should be useful for genetic diversity and molecular marker assistant selection breeding research in ...
Development and Characterization of 37 Novel EST-SSR Markers in Pisum sativum (Fabaceae

Directory of Open Access Journals (Sweden)

Xiaofeng Zhuang

2013-01-01

Full Text Available Premise of the study: Simple sequence repeat markers were developed based on expressed sequence tags (EST-SSR and screened for polymorphism among 23 Pisum sativum individuals to assist development and refinement of pea linkage maps. In particular, the SSR markers were developed to assist in mapping of white mold disease resistance quantitative trait loci. Methods and Results: Primer pairs were designed for 46 SSRs identified in EST contiguous sequences assembled from a 454 pyrosequenced transcriptome of the pea cultivar, ‘LIFTER’. Thirty-seven SSR markers amplified PCR products, of which 11 (30% SSR markers produced polymorphism in 23 individuals, including parents of recombinant inbred lines, with two to four alleles. The observed and expected heterozygosities ranged from 0 to 0.43 and from 0.31 to 0.83, respectively. Conclusions: These EST-SSR markers for pea will be useful for refinement of pea linkage maps, and will likely be useful for comparative mapping of pea and as tools for marker-based pea breeding.
Comparison of relative efficiency of genomic SSR and EST-SSR markers in estimating genetic diversity in sugarcane.

Science.gov (United States)

Parthiban, S; Govindaraj, P; Senthilkumar, S

2018-03-01

Twenty-five primer pairs developed from genomic simple sequence repeats (SSR) were compared with 25 expressed sequence tags (EST) SSRs to evaluate the efficiency of these two sets of primers using 59 sugarcane genetic stocks. The mean polymorphism information content (PIC) of genomic SSR was higher (0.72) compared to the PIC value recorded by EST-SSR marker (0.62). The relatively low level of polymorphism in EST-SSR markers may be due to the location of these markers in more conserved and expressed sequences compared to genomic sequences which are spread throughout the genome. Dendrogram based on the genomic SSR and EST-SSR marker data showed differences in grouping of genotypes. A total of 59 sugarcane accessions were grouped into 6 and 4 clusters using genomic SSR and EST-SSR, respectively. The highly efficient genomic SSR could subcluster the genotypes of some of the clusters formed by EST-SSR markers. The difference in dendrogram observed was probably due to the variation in number of markers produced by genomic SSR and EST-SSR and different portion of genome amplified by both the markers. The combined dendrogram (genomic SSR and EST-SSR) more clearly showed the genetic relationship among the sugarcane genotypes by forming four clusters. The mean genetic similarity (GS) value obtained using EST-SSR among 59 sugarcane accessions was 0.70, whereas the mean GS obtained using genomic SSR was 0.63. Although relatively lower level of polymorphism was displayed by the EST-SSR markers, genetic diversity shown by the EST-SSR was found to be promising as they were functional marker. High level of PIC and low genetic similarity values of genomic SSR may be more useful in DNA fingerprinting, selection of true hybrids, identification of variety specific markers and genetic diversity analysis. Identification of diverse parents based on cluster analysis can be effectively done with EST-SSR as the genetic similarity estimates are based on functional attributes related to
Novel and Stress Relevant EST Derived SSR Markers Developed and Validated in Peanut

Science.gov (United States)

Bosamia, Tejas C.; Mishra, Gyan P.; Thankappan, Radhakrishnan; Dobaria, Jentilal R.

2015-01-01

With the aim to increase the number of functional markers in resource poor crop like cultivated peanut (Arachis hypogaea), large numbers of available expressed sequence tags (ESTs) in the public databases, were employed for the development of novel EST derived simple sequence repeat (SSR) markers. From 16424 unigenes, 2784 (16.95%) SSRs containing unigenes having 3373 SSR motifs were identified. Of these, 2027 (72.81%) sequences were annotated and 4124 gene ontology terms were assigned. Among different SSR motif-classes, tri-nucleotide repeats (33.86%) were the most abundant followed by di-nucleotide repeats (27.51%) while AG/CT (20.7%) and AAG/CTT (13.25%) were the most abundant repeat-motifs. A total of 2456 EST-SSR novel primer pairs were designed, of which 366 unigenes having relevance to various stresses and other functions, were PCR validated using a set of 11 diverse peanut genotypes. Of these, 340 (92.62%) primer pairs yielded clear and scorable PCR products and 39 (10.66%) primer pairs exhibited polymorphisms. Overall, the number of alleles per marker ranged from 1-12 with an average of 3.77 and the PIC ranged from 0.028 to 0.375 with an average of 0.325. The identified EST-SSRs not only enriched the existing molecular markers kitty, but would also facilitate the targeted research in marker-trait association for various stresses, inter-specific studies and genetic diversity analysis in peanut. PMID:26046991
Genetic diversity in soybean germplasm identified by SSR and EST-SSR markers Diversidade genética em germoplasma de soja identificada por marcadores SSR e EST-SSR

Directory of Open Access Journals (Sweden)

Bruno Mello Mulato

2010-03-01

Full Text Available The objectives of this work were to investigate the genetic variation in 79 soybean (Glycine max accessions from different regions of the world, to cluster the accessions based on their similarity, and to test the correlation between the two types of markers used. Simple sequence repeat markers present in genomic (SSR and in expressed regions (EST-SSR were used. Thirty SSR primer-pairs were selected (20 genomic and 10 EST-SSR based on their distribution on the 20 genetic linkage groups of soybean, on their trinucleotide repetition unit and on their polymorphism information content. All analyzed loci were polymorphic, and 259 alleles were found. The number of alleles per locus varied from 2-21, with an average of 8.63. The accessions exhibit a significant number of rare alleles, with genotypes 19, 35, 63 and 65 carrying the greater number of exclusive alleles. Accessions 75 and 79 were the most similar and accessions 31 and 35, and 40 and 78, were the most divergent ones. A low correlation between SSR and EST-SSR data was observed, thus genomic and expressed microsatellite markers are required for an appropriate analysis of genetic diversity in soybean. The genetic diversity observed was high and allowed the formation of five groups and several subgroups. A moderate relationship between genetic divergence and geographic origin of accessions was observed.Os objetivos deste trabalho foram avaliar a diversidade genética de 79 acessos de soja de diferentes regiões do mundo, agrupá-los de acordo com a similaridade e testar a correlação entre os dois tipos de marcadores utilizados. Foram utilizados marcadores microssatélites genômicos (SSR e funcionais (EST-SSR. Trinta pares de primers SSR foram selecionados (20 genômicos e 10 EST-SSR de acordo com sua distribuição nos 20 grupos de ligação da soja, com sua unidade de repetição trinucleotídica e com seu conteúdo de informação polimórfica. Todos os lócus analisados foram polim
Development and characterization of polymorphic EST based SSR markers in barley (Hordeum vulgare).

Science.gov (United States)

Jo, Won-Sam; Kim, Hye-Yeong; Kim, Kyung-Min

2017-08-01

In barley, breeding using good genetic characteristics can improve the quality or quantity of crop characters from one generation to the next generation. The development of effective molecular markers in barley is crucial for understanding and analyzing the diversity of useful alleles. In this study, we conducted genetic relationship analysis using expressed sequence tag-simple sequence repeat (EST-SSR) markers for barley identification and assessment of barley cultivar similarity. Seeds from 82 cultivars, including 31 each of naked and hulled barley from the Korea Seed and Variety Service and 20 of malting barley from the RDA-Genebank Information Center, were analyzed in this study. A cDNA library of the cultivar Gwanbori was constructed for use in analysis of genetic relationships, and 58 EST-SSR markers were developed and characterized. In total, 47 SSR markers were employed to analyze polymorphisms. A relationship dendrogram based on the polymorphism data was constructed to compare genetic diversity. We found that the polymorphism information content among the examined cultivars was 0.519, which indicates that there is low genetic diversity among Korean barley cultivars. The results obtained in this study may be useful in preventing redundant investment in new cultivars and in resolving disputes over seed patents. Our approach can be used by companies and government groups to develop different cultivars with distinguishable markers. In addition, the developed markers can be used for quantitative trait locus analysis to improve both the quantity and the quality of cultivated barley.
ESAP plus: a web-based server for EST-SSR marker development.

Science.gov (United States)

Ponyared, Piyarat; Ponsawat, Jiradej; Tongsima, Sissades; Seresangtakul, Pusadee; Akkasaeng, Chutipong; Tantisuwichwong, Nathpapat

2016-12-22

Simple sequence repeats (SSRs) have become widely used as molecular markers in plant genetic studies due to their abundance, high allelic variation at each locus and simplicity to analyze using conventional PCR amplification. To study plants with unknown genome sequence, SSR markers from Expressed Sequence Tags (ESTs), which can be obtained from the plant mRNA (converted to cDNA), must be utilized. With the advent of high-throughput sequencing technology, huge EST sequence data have been generated and are now accessible from many public databases. However, SSR marker identification from a large in-house or public EST collection requires a computational pipeline that makes use of several standard bioinformatic tools to design high quality EST-SSR primers. Some of these computational tools are not users friendly and must be tightly integrated with reference genomic databases. A web-based bioinformatic pipeline, called EST Analysis Pipeline Plus (ESAP Plus), was constructed for assisting researchers to develop SSR markers from a large EST collection. ESAP Plus incorporates several bioinformatic scripts and some useful standard software tools necessary for the four main procedures of EST-SSR marker development, namely 1) pre-processing, 2) clustering and assembly, 3) SSR mining and 4) SSR primer design. The proposed pipeline also provides two alternative steps for reducing EST redundancy and identifying SSR loci. Using public sugarcane ESTs, ESAP Plus automatically executed the aforementioned computational pipeline via a simple web user interface, which was implemented using standard PHP, HTML, CSS and Java scripts. With ESAP Plus, users can upload raw EST data and choose various filtering options and parameters to analyze each of the four main procedures through this web interface. All input EST data and their predicted SSR results will be stored in the ESAP Plus MySQL database. Users will be notified via e-mail when the automatic process is completed and they can
Characterization and comparison of EST-SSR and TRAP markers for genetic analysis of the Japanese persimmon Diospyros kaki.

Science.gov (United States)

Luo, C; Zhang, F; Zhang, Q L; Guo, D Y; Luo, Z R

2013-01-09

We developed and characterized expressed sequence tags (ESTs)-simple sequence repeats (SSRs) and targeted region amplified polymorphism (TRAP) markers to examine genetic relationships in the persimmon genus Diospyros gene pool. In total, we characterized 14 EST-SSR primer pairs and 36 TRAP primer combinations, which were amplified across 20 germplasms of 4 species in the genus Diospyros. We used various genetic parameters, including effective multiplex ratio (EMR), diversity index (DI), and marker index (MI), to test the utility of these markers. TRAP markers gave higher EMR (24.85) but lower DI (0.33), compared to EST-SSRs (EMR = 3.65, DI = 0.34). TRAP gave a very high MI (8.08), which was about 8 times than the MI of EST-SSR (1.25). These markers were utilized for phylogenetic inference of 20 genotypes of Diospyros kaki Thunb. and allied species, with a result that all kaki genotypes clustered closely and 3 allied species formed an independent group. These markers could be further exploited for large-scale genetic relationship inference.
Differential transferability of EST-SSR primers developed from diploid species Pseudoroegneria spicata, Thinopyrum bessarabicum, and Th. elongatum

Science.gov (United States)

Simple sequence repeat technology based on expressed sequence tag (EST-SSR) is a useful genomic tool for genome mapping, characterizing plant species relationships, elucidating genome evolution, and tracing genes on alien chromosome segments. EST-SSR primers developed from three perennial diploid T...
Development and characterization of novel EST-SSR markers and their application for genetic diversity analysis of Jerusalem artichoke (Helianthus tuberosus L.).

Science.gov (United States)

Mornkham, T; Wangsomnuk, P P; Mo, X C; Francisco, F O; Gao, L Z; Kurzweil, H

2016-10-24

Jerusalem artichoke (Helianthus tuberosus L.) is a perennial tuberous plant and a traditional inulin-rich crop in Thailand. It has become the most important source of inulin and has great potential for use in chemical and food industries. In this study, expressed sequence tag (EST)-based simple sequence repeat (SSR) markers were developed from 40,362 Jerusalem artichoke ESTs retrieved from the NCBI database. Among 23,691 non-redundant identified ESTs, 1949 SSR motifs harboring 2 to 6 nucleotides with varied repeat motifs were discovered from 1676 assembled sequences. Seventy-nine primer pairs were generated from EST sequences harboring SSR motifs. Our results show that 43 primers are polymorphic for the six studied populations, while the remaining 36 were either monomorphic or failed to amplify. These 43 SSR loci exhibited a high level of genetic diversity among populations, with allele numbers varying from 2 to 7, with an average of 3.95 alleles per loci. Heterozygosity ranged from 0.096 to 0.774, with an average of 0.536; polymorphic index content ranged from 0.096 to 0.854, with an average of 0.568. Principal component analysis and neighbor-joining analysis revealed that the six populations could be divided into six clusters. Our results indicate that these newly characterized EST-SSR markers may be useful in the exploration of genetic diversity and range expansion of the Jerusalem artichoke, and in cross-species application for the genus Helianthus.

Transcriptomic resources for the medicinal legume Mucuna pruriens: de novo transcriptome assembly, annotation, identification and validation of EST-SSR markers.

Science.gov (United States)

Sathyanarayana, N; Pittala, Ranjith Kumar; Tripathi, Pankaj Kumar; Chopra, Ratan; Singh, Heikham Russiachand; Belamkar, Vikas; Bhardwaj, Pardeep Kumar; Doyle, Jeff J; Egan, Ashley N

2017-05-25

The medicinal legume Mucuna pruriens (L.) DC. has attracted attention worldwide as a source of the anti-Parkinson's drug L-Dopa. It is also a popular green manure cover crop that offers many agronomic benefits including high protein content, nitrogen fixation and soil nutrients. The plant currently lacks genomic resources and there is limited knowledge on gene expression, metabolic pathways, and genetics of secondary metabolite production. Here, we present transcriptomic resources for M. pruriens, including a de novo transcriptome assembly and annotation, as well as differential transcript expression analyses between root, leaf, and pod tissues. We also develop microsatellite markers and analyze genetic diversity and population structure within a set of Indian germplasm accessions. One-hundred ninety-one million two hundred thirty-three thousand two hundred forty-two bp cleaned reads were assembled into 67,561 transcripts with mean length of 626 bp and N50 of 987 bp. Assembled sequences were annotated using BLASTX against public databases with over 80% of transcripts annotated. We identified 7,493 simple sequence repeat (SSR) motifs, including 787 polymorphic repeats between the parents of a mapping population. 134 SSRs from expressed sequenced tags (ESTs) were screened against 23 M. pruriens accessions from India, with 52 EST-SSRs retained after quality control. Population structure analysis using a Bayesian framework implemented in fastSTRUCTURE showed nearly similar groupings as with distance-based (neighbor-joining) and principal component analyses, with most of the accessions clustering per geographical origins. Pair-wise comparison of transcript expression in leaves, roots and pods identified 4,387 differentially expressed transcripts with the highest number occurring between roots and leaves. Differentially expressed transcripts were enriched with transcription factors and transcripts annotated as belonging to secondary metabolite pathways. The M
First genetic linkage map of Taraxacum koksaghyz Rodin based on AFLP, SSR, COS and EST-SSR markers.

Science.gov (United States)

Arias, Marina; Hernandez, Monica; Remondegui, Naroa; Huvenaars, Koen; van Dijk, Peter; Ritter, Enrique

2016-08-04

Taraxacum koksaghyz Rodin (TKS) has been studied in many occasions as a possible alternative source for natural rubber production of good quality and for inulin production. Some tire companies are already testing TKS tire prototypes. There are also many investigations on the production of bio-fuels from inulin and inulin applications for health improvement and in the food industry. A limited amount of genomic resources exist for TKS and particularly no genetic linkage map is available in this species. We have constructed the first TKS genetic linkage map based on AFLP, COS, SSR and EST-SSR markers. The integrated linkage map with eight linkage groups (LG), representing the eight chromosomes of Russian dandelion, has 185 individual AFLP markers from parent 1, 188 individual AFLP markers from parent 2, 75 common AFLP markers and 6 COS, 1 SSR and 63 EST-SSR loci. Blasting the EST-SSR sequences against known sequences from lettuce allowed a partial alignment of our TKS map with a lettuce map. Blast searches against plant gene databases revealed some homologies with useful genes for downstream applications in the future.
Expressed Sequence Tag-Simple Sequence Repeat (EST-SSR Marker Resources for Diversity Analysis of Mango (Mangifera indica L.

Directory of Open Access Journals (Sweden)

Natalie L. Dillon

2014-01-01

Full Text Available In this study, a collection of 24,840 expressed sequence tags (ESTs generated from five mango (Mangifera indica L. cDNA libraries was mined for EST-based simple sequence repeat (SSR markers. Over 1,000 ESTs with SSR motifs were detected from more than 24,000 EST sequences with di- and tri-nucleotide repeat motifs the most abundant. Of these, 25 EST-SSRs in genes involved in plant development, stress response, and fruit color and flavor development pathways were selected, developed into PCR markers and characterized in a population of 32 mango selections including M. indica varieties, and related Mangifera species. Twenty-four of the 25 EST-SSR markers exhibited polymorphisms, identifying a total of 86 alleles with an average of 5.38 alleles per locus, and distinguished between all Mangifera selections. Private alleles were identified for Mangifera species. These newly developed EST-SSR markers enhance the current 11 SSR mango genetic identity panel utilized by the Australian Mango Breeding Program. The current panel has been used to identify progeny and parents for selection and the application of this extended panel will further improve and help to design mango hybridization strategies for increased breeding efficiency.
Mining and gene ontology based annotation of SSR markers from expressed sequence tags of Humulus lupulus

Science.gov (United States)

Singh, Swati; Gupta, Sanchita; Mani, Ashutosh; Chaturvedi, Anoop

2012-01-01

Humulus lupulus is commonly known as hops, a member of the family moraceae. Currently many projects are underway leading to the accumulation of voluminous genomic and expressed sequence tag sequences in public databases. The genetically characterized domains in these databases are limited due to non-availability of reliable molecular markers. The large data of EST sequences are available in hops. The simple sequence repeat markers extracted from EST data are used as molecular markers for genetic characterization, in the present study. 25,495 EST sequences were examined and assembled to get full-length sequences. Maximum frequency distribution was shown by mononucleotide SSR motifs i.e. 60.44% in contig and 62.16% in singleton where as minimum frequency are observed for hexanucleotide SSR in contig (0.09%) and pentanucleotide SSR in singletons (0.12%). Maximum trinucleotide motifs code for Glutamic acid (GAA) while AT/TA were the most frequent repeat of dinucleotide SSRs. Flanking primer pairs were designed in-silico for the SSR containing sequences. Functional categorization of SSRs containing sequences was done through gene ontology terms like biological process, cellular component and molecular function. PMID:22368382
Analysis of inheritance mode in chrysanthemum using EST-derived SSR markers

NARCIS (Netherlands)

Park, Sang Kun; Arens, Paul; Esselink, Danny; Lim, Jin Hee; Shin, Hak Ki

2015-01-01

To study the inheritance mode of hexaploid chrysanthemum (random or preferential chromosome pairing), a segregation analysis was carried out using SSR markers derived from chrysanthemum ESTs in the public domain. A total of 248 EST-SSR primer pairs were screened in chrysanthemum cultivars
WGSSAT: A High-Throughput Computational Pipeline for Mining and Annotation of SSR Markers From Whole Genomes.

Science.gov (United States)

Pandey, Manmohan; Kumar, Ravindra; Srivastava, Prachi; Agarwal, Suyash; Srivastava, Shreya; Nagpure, Naresh S; Jena, Joy K; Kushwaha, Basdeo

2018-03-16

Mining and characterization of Simple Sequence Repeat (SSR) markers from whole genomes provide valuable information about biological significance of SSR distribution and also facilitate development of markers for genetic analysis. Whole genome sequencing (WGS)-SSR Annotation Tool (WGSSAT) is a graphical user interface pipeline developed using Java Netbeans and Perl scripts which facilitates in simplifying the process of SSR mining and characterization. WGSSAT takes input in FASTA format and automates the prediction of genes, noncoding RNA (ncRNA), core genes, repeats and SSRs from whole genomes followed by mapping of the predicted SSRs onto a genome (classified according to genes, ncRNA, repeats, exonic, intronic, and core gene region) along with primer identification and mining of cross-species markers. The program also generates a detailed statistical report along with visualization of mapped SSRs, genes, core genes, and RNAs. The features of WGSSAT were demonstrated using Takifugu rubripes data. This yielded a total of 139 057 SSR, out of which 113 703 SSR primer pairs were uniquely amplified in silico onto a T. rubripes (fugu) genome. Out of 113 703 mined SSRs, 81 463 were from coding region (including 4286 exonic and 77 177 intronic), 7 from RNA, 267 from core genes of fugu, whereas 105 641 SSR and 601 SSR primer pairs were uniquely mapped onto the medaka genome. WGSSAT is tested under Ubuntu Linux. The source code, documentation, user manual, example dataset and scripts are available online at https://sourceforge.net/projects/wgssat-nbfgr.
Development, cross-species/genera transferability of novel EST-SSR markers and their utility in revealing population structure and genetic diversity in sugarcane

KAUST Repository

Singh, Ram K.

2013-07-01

Sugarcane (Saccharum spp. hybrid) with complex polyploid genome requires a large number of informative DNA markers for various applications in genetics and breeding. Despite the great advances in genomic technology, it is observed in several crop species, especially in sugarcane, the availability of molecular tools such as microsatellite markers are limited. Now-a-days EST-SSR markers are preferred to genomic SSR (gSSR) as they represent only the functional part of the genome, which can be easily associated with desired trait. The present study was taken up with a new set of 351 EST-SSRs developed from the 4085 non redundant EST sequences of two Indian sugarcane cultivars. Among these EST-SSRs, TNR containing motifs were predominant with a frequency of 51.6%. Thirty percent EST-SSRs showed homology with annotated protein. A high frequency of SSRs was found in the 5\\'UTR and in the ORF (about 27%) and a low frequency was observed in the 3\\'UTR (about 8%). Two hundred twenty-seven EST-SSRs were evaluated, in sugarcane, allied genera of sugarcane and cereals, and 134 of these have revealed polymorphism with a range of PIC value 0.12 to 0.99. The cross transferability rate ranged from 87.0% to 93.4% in Saccharum complex, 80.0% to 87.0% in allied genera, and 76.0% to 80.0% in cereals. Cloning and sequencing of EST-SSR size variant amplicons revealed that the variation in the number of repeat-units was the main source of EST-SSR fragment polymorphism. When 124 sugarcane accessions were analyzed for population structure using model-based approach, seven genetically distinct groups or admixtures thereof were observed in sugarcane. Results of principal coordinate analysis or UPGMA to evaluate genetic relationships delineated also the 124 accessions into seven groups. Thus, a high level of polymorphism adequate genetic diversity and population structure assayed with the EST-SSR markers not only suggested their utility in various applications in genetics and genomics in
A second generation framework for the analysis of microsatellites in expressed sequence tags and the development of EST-SSR markers for a conifer, Cryptomeria japonica

Directory of Open Access Journals (Sweden)

Ueno Saneyoshi

2012-04-01

Full Text Available Abstract Background Microsatellites or simple sequence repeats (SSRs in expressed sequence tags (ESTs are useful resources for genome analysis because of their abundance, functionality and polymorphism. The advent of commercial second generation sequencing machines has lead to new strategies for developing EST-SSR markers, necessitating the development of bioinformatic framework that can keep pace with the increasing quality and quantity of sequence data produced. We describe an open scheme for analyzing ESTs and developing EST-SSR markers from reads collected by Sanger sequencing and pyrosequencing of sugi (Cryptomeria japonica. Results We collected 141,097 sequence reads by Sanger sequencing and 1,333,444 by pyrosequencing. After trimming contaminant and low quality sequences, 118,319 Sanger and 1,201,150 pyrosequencing reads were passed to the MIRA assembler, generating 81,284 contigs that were analysed for SSRs. 4,059 SSRs were found in 3,694 (4.54% contigs, giving an SSR frequency lower than that in seven other plant species with gene indices (5.4–21.9%. The average GC content of the SSR-containing contigs was 41.55%, compared to 40.23% for all contigs. Tri-SSRs were the most common SSRs; the most common motif was AT, which was found in 655 (46.3% di-SSRs, followed by the AAG motif, found in 342 (25.9% tri-SSRs. Most (72.8% tri-SSRs were in coding regions, but 55.6% of the di-SSRs were in non-coding regions; the AT motif was most abundant in 3′ untranslated regions. Gene ontology (GO annotations showed that six GO terms were significantly overrepresented within SSR-containing contigs. Forty–four EST-SSR markers were developed from 192 primer pairs using two pipelines: read2Marker and the newly-developed CMiB, which combines several open tools. Markers resulting from both pipelines showed no differences in PCR success rate and polymorphisms, but PCR success and polymorphism were significantly affected by the expected PCR product size
A second generation framework for the analysis of microsatellites in expressed sequence tags and the development of EST-SSR markers for a conifer, Cryptomeria japonica

Science.gov (United States)

2012-01-01

Background Microsatellites or simple sequence repeats (SSRs) in expressed sequence tags (ESTs) are useful resources for genome analysis because of their abundance, functionality and polymorphism. The advent of commercial second generation sequencing machines has lead to new strategies for developing EST-SSR markers, necessitating the development of bioinformatic framework that can keep pace with the increasing quality and quantity of sequence data produced. We describe an open scheme for analyzing ESTs and developing EST-SSR markers from reads collected by Sanger sequencing and pyrosequencing of sugi (Cryptomeria japonica). Results We collected 141,097 sequence reads by Sanger sequencing and 1,333,444 by pyrosequencing. After trimming contaminant and low quality sequences, 118,319 Sanger and 1,201,150 pyrosequencing reads were passed to the MIRA assembler, generating 81,284 contigs that were analysed for SSRs. 4,059 SSRs were found in 3,694 (4.54%) contigs, giving an SSR frequency lower than that in seven other plant species with gene indices (5.4–21.9%). The average GC content of the SSR-containing contigs was 41.55%, compared to 40.23% for all contigs. Tri-SSRs were the most common SSRs; the most common motif was AT, which was found in 655 (46.3%) di-SSRs, followed by the AAG motif, found in 342 (25.9%) tri-SSRs. Most (72.8%) tri-SSRs were in coding regions, but 55.6% of the di-SSRs were in non-coding regions; the AT motif was most abundant in 3′ untranslated regions. Gene ontology (GO) annotations showed that six GO terms were significantly overrepresented within SSR-containing contigs. Forty–four EST-SSR markers were developed from 192 primer pairs using two pipelines: read2Marker and the newly-developed CMiB, which combines several open tools. Markers resulting from both pipelines showed no differences in PCR success rate and polymorphisms, but PCR success and polymorphism were significantly affected by the expected PCR product size and number of SSR
annot8r: GO, EC and KEGG annotation of EST datasets

Directory of Open Access Journals (Sweden)

Schmid Ralf

2008-04-01

Full Text Available Abstract Background The expressed sequence tag (EST methodology is an attractive option for the generation of sequence data for species for which no completely sequenced genome is available. The annotation and comparative analysis of such datasets poses a formidable challenge for research groups that do not have the bioinformatics infrastructure of major genome sequencing centres. Therefore, there is a need for user-friendly tools to facilitate the annotation of non-model species EST datasets with well-defined ontologies that enable meaningful cross-species comparisons. To address this, we have developed annot8r, a platform for the rapid annotation of EST datasets with GO-terms, EC-numbers and KEGG-pathways. Results annot8r automatically downloads all files relevant for the annotation process and generates a reference database that stores UniProt entries, their associated Gene Ontology (GO, Enzyme Commission (EC and Kyoto Encyclopaedia of Genes and Genomes (KEGG annotation and additional relevant data. For each of GO, EC and KEGG, annot8r extracts a specific sequence subset from the UniProt dataset based on the information stored in the reference database. These three subsets are then formatted for BLAST searches. The user provides the protein or nucleotide sequences to be annotated and annot8r runs BLAST searches against these three subsets. The BLAST results are parsed and the corresponding annotations retrieved from the reference database. The annotations are saved both as flat files and also in a relational postgreSQL results database to facilitate more advanced searches within the results. annot8r is integrated with the PartiGene suite of EST analysis tools. Conclusion annot8r is a tool that assigns GO, EC and KEGG annotations for data sets resulting from EST sequencing projects both rapidly and efficiently. The benefits of an underlying relational database, flexibility and the ease of use of the program make it ideally suited for non
EST-PAC a web package for EST annotation and protein sequence prediction

Directory of Open Access Journals (Sweden)

Strahm Yvan

2006-10-01

Full Text Available Abstract With the decreasing cost of DNA sequencing technology and the vast diversity of biological resources, researchers increasingly face the basic challenge of annotating a larger number of expressed sequences tags (EST from a variety of species. This typically consists of a series of repetitive tasks, which should be automated and easy to use. The results of these annotation tasks need to be stored and organized in a consistent way. All these operations should be self-installing, platform independent, easy to customize and amenable to using distributed bioinformatics resources available on the Internet. In order to address these issues, we present EST-PAC a web oriented multi-platform software package for expressed sequences tag (EST annotation. EST-PAC provides a solution for the administration of EST and protein sequence annotations accessible through a web interface. Three aspects of EST annotation are automated: 1 searching local or remote biological databases for sequence similarities using Blast services, 2 predicting protein coding sequence from EST data and, 3 annotating predicted protein sequences with functional domain predictions. In practice, EST-PAC integrates the BLASTALL suite, EST-Scan2 and HMMER in a relational database system accessible through a simple web interface. EST-PAC also takes advantage of the relational database to allow consistent storage, powerful queries of results and, management of the annotation process. The system allows users to customize annotation strategies and provides an open-source data-management environment for research and education in bioinformatics.
Construction of an EST-SSR-based interspecific transcriptome ...

Indian Academy of Sciences (India)

Construction of an EST-SSR-based interspecific transcriptome linkage map of fibre development in cotton. CHUANXIANG LIU, DAOJUN YUAN and ZHONGXU LIN. ∗. National Key Laboratory of Crop Genetic Improvement and National Centre of Plant Gene Research (Wuhan),. Huazhong Agricultural University, Wuhan ...
Development of Novel Polymorphic EST-SSR Markers in Bailinggu (Pleurotus tuoliensis for Crossbreeding

Directory of Open Access Journals (Sweden)

Yueting Dai

2017-11-01

Full Text Available Identification of monokaryons and their mating types and discrimination of hybrid offspring are key steps for the crossbreeding of Pleurotus tuoliensis (Bailinggu. However, conventional crossbreeding methods are troublesome and time consuming. Using RNA-seq technology, we developed new expressed sequence tag-simple sequence repeat (EST-SSR markers for Bailinggu to easily and rapidly identify monokaryons and their mating types, genetic diversity and hybrid offspring. We identified 1110 potential EST-based SSR loci from a newly-sequenced Bailinggu transcriptome and then randomly selected 100 EST-SSRs for further validation. Results showed that 39, 43 and 34 novel EST-SSR markers successfully identified monokaryons from their parent dikaryons, differentiated two different mating types and discriminated F1 and F2 hybrid offspring, respectively. Furthermore, a total of 86 alleles were detected in 37 monokaryons using 18 highly informative EST-SSRs. The observed number of alleles per locus ranged from three to seven. Cluster analysis revealed that these monokaryons have a relatively high level of genetic diversity. Transfer rates of the EST-SSRs in the monokaryons of closely-related species Pleurotus eryngii var. ferulae and Pleurotus ostreatus were 72% and 64%, respectively. Therefore, our study provides new SSR markers and an efficient method to enhance the crossbreeding of Bailinggu and closely-related species.
Development of Novel Polymorphic EST-SSR Markers in Bailinggu (Pleurotus tuoliensis) for Crossbreeding

Science.gov (United States)

Dai, Yueting; Su, Wenying; Song, Bing; Li, Yu; Fu, Yongping

2017-01-01

Identification of monokaryons and their mating types and discrimination of hybrid offspring are key steps for the crossbreeding of Pleurotus tuoliensis (Bailinggu). However, conventional crossbreeding methods are troublesome and time consuming. Using RNA-seq technology, we developed new expressed sequence tag-simple sequence repeat (EST-SSR) markers for Bailinggu to easily and rapidly identify monokaryons and their mating types, genetic diversity and hybrid offspring. We identified 1110 potential EST-based SSR loci from a newly-sequenced Bailinggu transcriptome and then randomly selected 100 EST-SSRs for further validation. Results showed that 39, 43 and 34 novel EST-SSR markers successfully identified monokaryons from their parent dikaryons, differentiated two different mating types and discriminated F1 and F2 hybrid offspring, respectively. Furthermore, a total of 86 alleles were detected in 37 monokaryons using 18 highly informative EST-SSRs. The observed number of alleles per locus ranged from three to seven. Cluster analysis revealed that these monokaryons have a relatively high level of genetic diversity. Transfer rates of the EST-SSRs in the monokaryons of closely-related species Pleurotus eryngii var. ferulae and Pleurotus ostreatus were 72% and 64%, respectively. Therefore, our study provides new SSR markers and an efficient method to enhance the crossbreeding of Bailinggu and closely-related species. PMID:29149037
Genetic diversity of cucumber estimated by morpho-physiological and EST-SSR markers.

Science.gov (United States)

Pandey, Sudhakar; Ansari, Waquar Akhter; Pandey, Maneesh; Singh, Bijendra

2018-02-01

In the present study, genetic variation among 40 cucumber genotypes was analyzed by means of morpho-physiological traits and 21 EST-SSR markers. Diversity was observed for morpho-physiological characters like days to 50% female flowering (37-46.9, number of fruits/plant (1.33-5.80), average fruit weight (41-333), vine length (36-364), relative water content (58.5-92.7), electrolyte leakage (15.9-37.1), photosynthetic efficiency (0.40-0.75) and chlorophyll concentration index (11.1-28.6). The pair wise Jaccard similarity coefficient ranged from 0.00 to 0.27 for quantitative traits and 0.24 to 0.96 for EST-SSR markers indicating that the accessions represent genetically diverse populations. With twenty-one EST-SSR markers, polymorphism revealed among 40 cucumber genotypes, number of alleles varied 2-6 with an average 3.05. Polymorphism information content varied from 0.002 to 0.989 (mean = 0.308). The number of effective allele (Ne), expected heterozygosity (He) and unbiased expected heterozygosity (uHe) of these EST-SSRs were 1.079-1.753, 0.074-0.428 and 0.074-0.434, respectively. Same 21 EST-SSR markers transferability checked in four other Cucumis species: snapmelon ( Cucumis melo var. momordica ), muskmelon ( Cucumis melo L.), pickling melon ( Cucumis melo var. conomon ) and wild muskmelon ( Cucumis melo var. agrestis ) with frequency of 61.9, 95.2, 76.2, and 76.2%, respectively. Present study provides useful information on variability, which can assist geneticists with desirable traits for cucumber germplasm utilization. Observed physiological parameters may assists in selection of genotype for abiotic stress tolerance also, EST-SSR markers may be useful for genetic studies in related species.
Exploration of genetic diversity among medicinally important genus Epimedium species based on genomic and EST-SSR marker.

Science.gov (United States)

Yousaf, Zubaida; Hu, Weiming; Zhang, Yanjun; Zeng, Shaohua; Wang, Ying

2015-01-01

Epimedium species has gained prime importance due to their medicinal and economic values. Therefore, in this study, 26 genomic SSR and 10 EST-SSR markers were developed for 13 medicinal species of the Epimedium genus and one out-group species Vancouveria hexandra W. J. Hooker to explore the existing genetic diversity. A total of 100 alleles by genomic SSR and 65 by EST-SSR were detected. The genomic SSR markers were presented between 2-7 alleles per locus. The observed heterozygosity (Ho) and expected heterozygosity (He) ranged from 0.00 to 4.5 and 0.0254 to 2.8108, respectively. Similarly, for EST-SSR, these values were ranged from 3.00 to 4.00 and 1.9650 to 2.7142. The number of alleles for EST-SSR markers ranged from 3 to 10 with an average of 3.51 per loci. It has been concluded that medicinally important species of the genus Epimedium possesses lower intraspecific genetic variation.
Genetic diversity of Phytophthora sojae isolates in Heilongjiang Province in China assessed by RAPD and EST-SSR

Science.gov (United States)

Wu, J. J.; Xu, P. F.; Liu, L. J.; Wang, J. S.; Lin, W. G.; Zhang, S. Z.; Wei, L.

Random-amplified polymorphic DNA (RAPD) and EST-SSR markers were used to estimate the genetic relationship among thirty-nine P.sojae isolates from three locations in Heilongjiang Province, and nine isolates from Ohio in America were made as reference strains. 10 of 50 RAPD primers and 5 of 33 EST-SSR were polymorphic across 48 P.sojae isolates. Similarity values among P.sojae isolates were from 49% to 82% based on the RAPD data. The similarities based on EST-SSR markers ranged from 47% to 85%. The genetic diversity revealed by EST-SSR marker analysis was higher than that obtained from RAPD. The similarity matrices for the SSR data and the RAPD data were moderately correlated (r = 0.47). Genetic similarity coefficients were also relatively lower, which demonstrated complicated genetic background within each location. The high similarity values range revealed the ability of RAPD/EST-SSR markers to distinguish even among morphological similar phytophthora.
Development and Characterization of 1,906 EST-SSR Markers from Unigenes in Jute (Corchorus spp..

Directory of Open Access Journals (Sweden)

Liwu Zhang

Full Text Available Jute, comprising white and dark jute, is the second important natural fiber crop after cotton worldwide. However, the lack of expressed sequence tag-derived simple sequence repeat (EST-SSR markers has resulted in a large gap in the improvement of jute. Previously, de novo 48,914 unigenes from white jute were assembled. In this study, 1,906 EST-SSRs were identified from these assembled uingenes. Among these markers, di-, tri- and tetra-nucleotide repeat types were the abundant types (12.0%, 56.9% and 21.6% respectively. The AG-rich or GA-rich nucleotide repeats were the predominant. Subsequently, a sample of 116 SSRs, located in genes encoding transcription factors and cellulose synthases, were selected to survey polymorphisms among12 diverse jute accessions. Of these, 83.6% successfully amplified at least one fragment and detected polymorphism among the 12diverse genotypes, indicating that the newly developed SSRs are of good quality. Furthermore, the genetic similarity coefficients of all the 12 accessions were evaluated using 97 polymorphic SSRs. The cluster analysis divided the jute accessions into two main groups with genetic similarity coefficient of 0.61. These EST-SSR markers not only enrich molecular markers of jute genome, but also facilitate genetic and genomic researches in jute.
Development of EST-SSR markers in flowering Chinese cabbage (Brassica campestris L. ssp. chinensis var. utilis Tsen et Lee based on de novo transcriptomic assemblies.

Directory of Open Access Journals (Sweden)

Jingfang Chen

Full Text Available Flowering Chinese cabbage is one of the most important vegetable crops in southern China. Genetic improvement of various agronomic traits in this crop is underway to meet high market demand in the region, but the progress is hampered by limited number of molecular markers available in this crop. This study aimed to develop EST-SSR markers from transcriptome sequences generated by next-generation sequencing. RNA-seq of eight cabbage samples identified 48,975 unigenes. Of these unigenes, 23,267 were annotated in 56 gene ontology (GO categories, 6,033 were mapped to 131 KEGG pathways, and 7,825 were assigned to clusters of orthologous groups (COGs. From the unigenes, 8,165 EST-SSR loci were identified and 98.57% of them were 1-3 nucleotide repeats with 14.32%, 41.08% and 43.17% of mono-, di- and tri-nucleotide repeats, respectively. Fifty-eight types of motifs were identified with A/T, AG/CT, AT/AT, AC/GT, AAG/CTT and AGG/CCT the most abundant. The lengths of repeated nucleotide sequences in all SSR loci ranged from 12 to 60 bp, with most (88.51% under 20 bp. Among 170 primer pairs were randomly selected from a total of 4,912 SSR primers we designed, 48 yielded unambiguously polymorphic bands with high reproducibility. Cluster analysis using 48 SSRs classified 34 flowering Chinese cabbage cultivars into three groups. A large number of EST-SSR markers identified in this study will facilitate marker-assisted selection in the breeding programs of flowering Chinese cabbage.
Development of EST-SSR markers in flowering Chinese cabbage (Brassica campestris L. ssp. chinensis var. utilis Tsen et Lee) based on de novo transcriptomic assemblies.

Science.gov (United States)

Chen, Jingfang; Li, Ronghua; Xia, Yanshi; Bai, Guihua; Guo, Peiguo; Wang, Zhiliang; Zhang, Hua; Siddique, Kadambot H M

2017-01-01

Flowering Chinese cabbage is one of the most important vegetable crops in southern China. Genetic improvement of various agronomic traits in this crop is underway to meet high market demand in the region, but the progress is hampered by limited number of molecular markers available in this crop. This study aimed to develop EST-SSR markers from transcriptome sequences generated by next-generation sequencing. RNA-seq of eight cabbage samples identified 48,975 unigenes. Of these unigenes, 23,267 were annotated in 56 gene ontology (GO) categories, 6,033 were mapped to 131 KEGG pathways, and 7,825 were assigned to clusters of orthologous groups (COGs). From the unigenes, 8,165 EST-SSR loci were identified and 98.57% of them were 1-3 nucleotide repeats with 14.32%, 41.08% and 43.17% of mono-, di- and tri-nucleotide repeats, respectively. Fifty-eight types of motifs were identified with A/T, AG/CT, AT/AT, AC/GT, AAG/CTT and AGG/CCT the most abundant. The lengths of repeated nucleotide sequences in all SSR loci ranged from 12 to 60 bp, with most (88.51%) under 20 bp. Among 170 primer pairs were randomly selected from a total of 4,912 SSR primers we designed, 48 yielded unambiguously polymorphic bands with high reproducibility. Cluster analysis using 48 SSRs classified 34 flowering Chinese cabbage cultivars into three groups. A large number of EST-SSR markers identified in this study will facilitate marker-assisted selection in the breeding programs of flowering Chinese cabbage.

In silico comparative analysis of EST-SSRs in three cotton genomes ...

African Journals Online (AJOL)

The range of repeat number change in each HG was wider in Gr-Gh. The annotation of the SSR-ESTs showed that more Gene Ontology (GO) items targeted by SSR-ESTs of Ga and Gr than those of Gh. This study gave us new insights into the difference between the three cotton genomes, which will be more helpful to ...
Development and Validation of EST-SSR Markers from the Transcriptome of Adzuki Bean (Vigna angularis).

Science.gov (United States)

Chen, Honglin; Liu, Liping; Wang, Lixia; Wang, Suhua; Somta, Prakit; Cheng, Xuzhen

2015-01-01

The adzuki bean (Vigna angularis (Ohwi) Ohwi and Ohashi) is an important grain legume of Asia. It is cultivated mainly in China, Japan and Korea. Despite its importance, few genomic resources are available for molecular genetic research of adzuki bean. In this study, we developed EST-SSR markers for the adzuki bean through next-generation sequencing. More than 112 million high-quality cDNA sequence reads were obtained from adzuki bean using Illumina paired-end sequencing technology, and the sequences were de novo assembled into 65,950 unigenes. The average length of the unigenes was 1,213 bp. Among the unigenes, 14,547 sequences contained a unique simple sequence repeat (SSR) and 3,350 sequences contained more than one SSR. A total of 7,947 EST-SSRs were identified as potential molecular markers, with mono-nucleotide A/T repeats (99.0%) as the most abundant motif class, followed by AG/CT (68.4%), AAG/CTT (30.0%), AAAG/CTTT (26.2%), AAAAG/CTTTT (16.1%), and AACGGG/CCCGTT (6.0%). A total of 500 SSR markers were randomly selected for validation, of which 296 markers produced reproducible amplicons with 38 polymorphic markers among the 32 adzuki bean genotypes selected from diverse geographical locations across China. The large number of SSR-containing sequences and EST-SSR markers will be valuable for genetic analysis of the adzuki bean and related Vigna species.
Isolation and characterization of twenty-nine novel EST-SSR ...

Indian Academy of Sciences (India)

All results were adjusted for multiple simul- taneous ... ated from HWE after Bonferroni correction (adjusted P = 0.0017) ... ity test of the designed SSR markers. In the 30 examined. S. undulata samples, the PIC value of the 29 newly devel-.
Validation of dbEST-SSRs and transferability of some other solanaceous species SSR in ashwagandha [Withania Somnifera (L.) Dunal].

Science.gov (United States)

Parmar, Eva K; Fougat, Ranbir S; Patel, Chandni B; Zala, Harshvardhan N; Patel, Mahesh A; Patel, Swati K; Kumar, Sushil

2015-12-01

Cross-species transferability and expressed sequence tags (ESTs) in public databases are cost-effective means for developing simple sequence repeats (SSRs) for less-studied species like medicinal plants. In this study, 11 EST-SSR markers developed from 742 available ESTs of Withania Somnifera EST sequences and 95 SSR primer pairs derived from other solanaceous crops (tomato, eggplant, chili, and tobacco) were utilized for their amplification and validation. Out of 11, 10 EST-SSRs showed good amplification quality and produced 13 loci with a product size ranging between 167 and 291 bp. Similarly, of the 95 cross-genera SSR loci assayed, 20 (21 %) markers showed the transferability of 5, 27, 32, and 14.2 % from eggplant, chili, tomato, and tobacco, respectively, to ashwagandha. In toto, these 30 SSR markers reported here will be valuable resources and may be applicable for the analysis of intra- and inter-specific genetic diversity in ashwagandha for which till date no information about SSR is available.
AcEST(EST sequences of Adiantum capillus-veneris and their annotation) - AcEST | LSDB Archive [Life Science Database Archive metadata

Lifescience Database Archive (English)

Full Text Available List Contact us AcEST AcEST(EST sequences of Adiantum capillus-veneris and their annotation) Data detail Dat...a name AcEST(EST sequences of Adiantum capillus-veneris and their annotation) DOI 10.18908/lsdba.nbdc00839-0...01 Description of data contents EST sequence of Adiantum capillus-veneris and its annotation (clone ID, libr...le search URL http://togodb.biosciencedbc.jp/togodb/view/archive_acest#en Data acquisition method Capillary ...ainst UniProtKB/Swiss-Prot and UniProtKB/TrEMBL databases) Number of data entries Adiantum capillus-veneris
Exploiting the transcriptome of Euphrates Poplar, Populus euphratica (Salicaceae to develop and characterize new EST-SSR markers and construct an EST-SSR database.

Directory of Open Access Journals (Sweden)

Fang K Du

Full Text Available BACKGROUND: Microsatellite markers or Simple Sequence Repeats (SSRs are the most popular markers in population/conservation genetics. However, the development of novel microsatellite markers has been impeded by high costs, a lack of available sequence data and technical difficulties. New species-specific microsatellite markers were required to investigate the evolutionary history of the Euphratica tree, Populus euphratica, the only tree species found in the desert regions of Western China and adjacent Central Asian countries. METHODOLOGY/PRINCIPAL FINDINGS: A total of 94,090 non-redundant Expressed Sequence Tags (ESTs from P. euphratica comprising around 63 Mb of sequence data were searched for SSRs. 4,202 SSRs were found in 3,839 ESTs, with 311 ESTs containing multiple SSRs. The most common motif types were trinucleotides (37% and hexanucleotides (33% repeats. We developed primer pairs for all of the identified EST-SSRs (eSSRs and selected 673 of these pairs at random for further validation. 575 pairs (85% gave successful amplification, of which, 464 (80.7% were polymorphic in six to 24 individuals from natural populations across Northern China. We also tested the transferability of the polymorphic eSSRs to nine other Populus species. In addition, to facilitate the use of these new eSSR markers by other researchers, we mapped them onto Populus trichocarpa scaffolds in silico and compiled our data into a web-based database (http://202.205.131.253:8080/poplar/resources/static_page/index.html. CONCLUSIONS: The large set of validated eSSRs identified in this work will have many potential applications in studies on P. euphratica and other poplar species, in fields such as population genetics, comparative genomics, linkage mapping, QTL, and marker-assisted breeding. Their use will be facilitated by their incorporation into a user-friendly web-based database.
Characterization and Development of EST-SSRs by Deep Transcriptome Sequencing in Chinese Cabbage (Brassica rapa L. ssp. pekinensis

Directory of Open Access Journals (Sweden)

Qian Ding

2015-01-01

Full Text Available Simple sequence repeats (SSRs are among the most important markers for population analysis and have been widely used in plant genetic mapping and molecular breeding. Expressed sequence tag-SSR (EST-SSR markers, located in the coding regions, are potentially more efficient for QTL mapping, gene targeting, and marker-assisted breeding. In this study, we investigated 51,694 nonredundant unigenes, assembled from clean reads from deep transcriptome sequencing with a Solexa/Illumina platform, for identification and development of EST-SSRs in Chinese cabbage. In total, 10,420 EST-SSRs with over 12 bp were identified and characterized, among which 2744 EST-SSRs are new and 2317 are known ones showing polymorphism with previously reported SSRs. A total of 7877 PCR primer pairs for 1561 EST-SSR loci were designed, and primer pairs for twenty-four EST-SSRs were selected for primer evaluation. In nineteen EST-SSR loci (79.2%, amplicons were successfully generated with high quality. Seventeen (89.5% showed polymorphism in twenty-four cultivars of Chinese cabbage. The polymorphic alleles of each polymorphic locus were sequenced, and the results showed that most polymorphisms were due to variations of SSR repeat motifs. The EST-SSRs identified and characterized in this study have important implications for developing new tools for genetics and molecular breeding in Chinese cabbage.
DEVELOPMENT OF EST-SSR MARKERS TO ASSESS GENETIC DIVERSITY OF BROCCOLI AND ITS RELATED SPECIES

Directory of Open Access Journals (Sweden)

Nur Kholilatul Izzah

2017-01-01

Full Text Available Development of Expressed Sequence Tag-Simple Sequence Repeat (EST-SSR markers derived from public database is known to be more efficient, faster and low cost. The objective of this study was to generate a new set of EST-SSR markers for broccoli and its related species and their usefulness for assessing their genetic diversity. A total of 202 Brassica oleracea ESTs were retrieved from NCBI and then assembled into 172 unigenes by means of CAP3 program. Identification of SSRs was carried out using web-based tool, RepeatMasker software. Afterwards, EST-SSR markers were developed using Primer3 program. Among the identified SSRs, trinucleotide repeats were the most common repeat types, which accounted for about 50%. A total of eight primer pairs were successfully designed and yielded amplification products. Among them, five markers were polymorphic and displayed a total of 30 alleles with an average number of six alleles per locus. The polymorphic markers were subsequently used for analyzing genetic diversity of 36 B. oleracea cultivars including 22 broccoli, five cauliflower and nine kohlrabi cultivars based on genetic similarity matrix as implemented in NTSYS program. At similarity coefficient of 61%, a UPGMA clustering dendrogram effectively separated 36 genotypes into three main groups, where 30 out of 36 genotypes were clearly discriminated. The result obtained in the present study would help breeders in selecting parental lines for crossing. Moreover, the novel EST-SSR markers developed in the study could be a valuable tool for differentiating cultivars of broccoli and related species.
Three vibrio-resistance related EST-SSR markers revealed by selective genotyping in the clam Meretrix meretrix.

Science.gov (United States)

Nie, Qing; Yue, Xin; Chai, Xueliang; Wang, Hongxia; Liu, Baozhong

2013-08-01

The clam Meretrix meretrix is an important commercial bivalve distributed in the coastal areas of South and Southeast Asia. In this study, marker-trait association analyses were performed based on the stock materials of M. meretrix with different vibrio-resistance profile obtained by selective breeding. Forty-eight EST-SSR markers were screened and 27 polymorphic SSRs of them were genotyped in the clam stocks with different resistance to Vibrio parahaemolyticus (11-R and 11-S) and to Vibrio harveyi (09-R and 09-C). Allele frequency distributions of the SSRs among different stocks were compared using Pearson's Chi-square test, and three functional EST-SSR markers (MM959, MM4765 and MM8364) were found to be associated with vibrio-resistance trait. The 140-bp allele of MM959 and 128-bp allele of MM4765 had significantly higher frequencies in resistant groups (11-R and 09-R) than in susceptive/control groups (11-S and 09-C) (P SSR markers were consistent with the three subgroups distinctions. The putative functions of contig959, contig4765 and contig8364 also suggested that the three SSR-involved genes might play important roles in immunity of M. meretrix. All these results supported that EST-SSR markers MM959, MM4765 and MM8364 were associated with vibrio-resistance and would be useful for marker-assisted selection (MAS) in M. meretrix genetic breeding. Copyright © 2013 Elsevier Ltd. All rights reserved.
Development of novel EST-SSR markers for ploidy identification based on de novo transcriptome assembly for Misgurnus anguillicaudatus.

Science.gov (United States)

Feng, Bing; Yi, Soojin V; Zhang, Manman; Zhou, Xiaoyun

2018-01-01

The co-existence of several ploidy types in natural populations makes the cyprinid loach Misgurnus anguillicaudatus an exciting model system to study the genetic and phenotypic consequences of ploidy variations. A first step in such effort is to identify the specific ploidy of an individual. Currently popular methods of karyotyping via cytological preparation or flow cytometry require a large amount of tissue (such as blood) samples, which can be damaging or fatal to the fishes. Here, we developed novel microsatellite markers (SSR markers) from M. anguillicaudatus and show that they can effectively discriminate ploidy using samples collected in a minimally invasive way. Specifically, we generated whole genome transcriptomes from multiple M. anguillicaudatus using the Illumina paired-end sequencing. Approximately 150 million raw reads were assembled into 76,544 non-redundant unigenes. A total of 8,194 potential SSR markers were identified. We selected 98 pairs with more than five tandem repeats for further assays. Out of 45 putative EST-SSR markers that successfully amplified and harbored polymorphism in diploids, 11 markers displayed high variability in tetraploids. We further demonstrate that a set of five EST-SSR markers selected from these are sufficient to distinguish ploidy levels, by first validating them on 69 reference specimens with known ploidy levels and then subsequently using fresh-collected 96 ploidy-unknown specimens. The results from EST-SSR markers are highly concordant with those from independent flow cytometry analysis. The novel EST-SSR markers developed here should facilitate genetic studies of polyploidy in the emerging model system M. anguillicaudatus.
Transcriptome sequencing of mung bean (Vigna radiate L.) genes and the identification of EST-SSR markers.

Science.gov (United States)

Chen, Honglin; Wang, Lixia; Wang, Suhua; Liu, Chunji; Blair, Matthew Wohlgemuth; Cheng, Xuzhen

2015-01-01

Mung bean (Vigna radiate (L.) Wilczek) is an important traditional food legume crop, with high economic and nutritional value. It is widely grown in China and other Asian countries. Despite its importance, genomic information is currently unavailable for this crop plant species or some of its close relatives in the Vigna genus. In this study, more than 103 million high quality cDNA sequence reads were obtained from mung bean using Illumina paired-end sequencing technology. The processed reads were assembled into 48,693 unigenes with an average length of 874 bp. Of these unigenes, 25,820 (53.0%) and 23,235 (47.7%) showed significant similarity to proteins in the NCBI non-redundant protein and nucleotide sequence databases, respectively. Furthermore, 19,242 (39.5%) could be classified into gene ontology categories, 18,316 (37.6%) into Swiss-Prot categories and 10,918 (22.4%) into KOG database categories (E-value SSR), and 2,303 sequences contained more than one SSR together in the same expressed sequence tag (EST). A total of 13,134 EST-SSRs were identified as potential molecular markers, with mono-nucleotide A/T repeats being the most abundant motif class and G/C repeats being rare. In this SSR analysis, we found five main repeat motifs: AG/CT (30.8%), GAA/TTC (12.6%), AAAT/ATTT (6.8%), AAAAT/ATTTT (6.2%) and AAAAAT/ATTTTT (1.9%). A total of 200 SSR loci were randomly selected for validation by PCR amplification as EST-SSR markers. Of these, 66 marker primer pairs produced reproducible amplicons that were polymorphic among 31 mung bean accessions selected from diverse geographical locations. The large number of SSR-containing sequences found in this study will be valuable for the construction of a high-resolution genetic linkage maps, association or comparative mapping and genetic analyses of various Vigna species.
Development and characterization of EST-SSR markers for Ottelia acuminata var. jingxiensis (Hydrocharitaceae).

Science.gov (United States)

Li, Zhi-Zhong; Lu, Meng-Xue; Saina, Josphat K; Gichira, Andrew W; Wang, Qing-Feng; Chen, Jin-Ming

2017-11-01

Simple sequence repeat (SSR) markers were derived from transcriptomic data for Ottelia acuminata (Hydrocharitaceae), a species comprising five endemic and highly endangered varieties in China. Sixteen novel SSR markers were developed for O. acuminata var. jingxiensis . One to eight alleles per locus were found, with a mean of 2.896. The observed and expected heterozygosity ranged from 0.000 to 1.000 and 0.000 to 0.793, respectively. Interestingly, in cross-varietal amplification, 13 out of the 16 loci were successfully amplified in O. acuminata var. acuminata , and 12 amplified in each of the other three varieties of O. acuminata . These newly developed SSR markers will facilitate further study of genetic variation and provide important genetic data needed for appropriate conservation of natural populations of all varieties of O. acuminata .
Genetic diversity and relationships among different tomato varieties revealed by EST-SSR markers.

Science.gov (United States)

Korir, N K; Diao, W; Tao, R; Li, X; Kayesh, E; Li, A; Zhen, W; Wang, S

2014-01-08

The genetic diversity and relationship of 42 tomato varieties sourced from different geographic regions was examined with EST-SSR markers. The genetic diversity was between 0.18 and 0.77, with a mean of 0.49; the polymorphic information content ranged from 0.17 to 0.74, with a mean of 0.45. This indicates a fairly high degree of diversity among these tomato varieties. Based on the cluster analysis using unweighted pair-group method with arithmetic average (UPGMA), all the tomato varieties fell into 5 groups, with no obvious geographical distribution characteristics despite their diverse sources. The principal component analysis (PCA) supported the clustering result; however, relationships among varieties were more complex in the PCA scatterplot than in the UPGMA dendrogram. This information about the genetic relationships between these tomato lines helps distinguish these 42 varieties and will be useful for tomato variety breeding and selection. We confirm that the EST-SSR marker system is useful for studying genetic diversity among tomato varieties. The high degree of polymorphism and the large number of bands obtained per assay shows that SSR is the most informative marker system for tomato genotyping for purposes of rights/protection and for the tomato industry in general. It is recommended that these varieties be subjected to identification using an SSR-based manual cultivar identification diagram strategy or other easy-to-use and referable methods so as to provide a complete set of information concerning genetic relationships and a readily usable means of identifying these varieties.
Exploiting Illumina Sequencing for the Development of 95 Novel Polymorphic EST-SSR Markers in Common Vetch (Vicia sativa subsp. sativa

Directory of Open Access Journals (Sweden)

Zhipeng Liu

2014-05-01

Full Text Available The common vetch (Vicia sativa subsp. sativa, a self-pollinating and diploid species, is one of the most important annual legumes in the world due to its short growth period, high nutritional value, and multiple usages as hay, grain, silage, and green manure. The available simple sequence repeat (SSR markers for common vetch, however, are insufficient to meet the developing demand for genetic and molecular research on this important species. Here, we aimed to develop and characterise several polymorphic EST-SSR markers from the vetch Illumina transcriptome. A total number of 1,071 potential EST-SSR markers were identified from 1025 unigenes whose lengths were greater than 1,000 bp, and 450 primer pairs were then designed and synthesized. Finally, 95 polymorphic primer pairs were developed for the 10 common vetch accessions, which included 50 individuals. Among the 95 EST-SSR markers, the number of alleles ranged from three to 13, and the polymorphism information content values ranged from 0.09 to 0.98. The observed heterozygosity values ranged from 0.00 to 1.00, and the expected heterozygosity values ranged from 0.11 to 0.98. These 95 EST-SSR markers developed from the vetch Illumina transcriptome could greatly promote the development of genetic and molecular breeding studies pertaining to in this species.
Assessment of Functional EST-SSR Markers (Sugarcane in Cross-Species Transferability, Genetic Diversity among Poaceae Plants, and Bulk Segregation Analysis

Directory of Open Access Journals (Sweden)

Shamshad Ul Haq

2016-01-01

Full Text Available Expressed sequence tags (ESTs are important resource for gene discovery, gene expression and its regulation, molecular marker development, and comparative genomics. We procured 10000 ESTs and analyzed 267 EST-SSRs markers through computational approach. The average density was one SSR/10.45 kb or 6.4% frequency, wherein trinucleotide repeats (66.74% were the most abundant followed by di- (26.10%, tetra- (4.67%, penta- (1.5%, and hexanucleotide (1.2% repeats. Functional annotations were done and after-effect newly developed 63 EST-SSRs were used for cross transferability, genetic diversity, and bulk segregation analysis (BSA. Out of 63 EST-SSRs, 42 markers were identified owing to their expansion genetics across 20 different plants which amplified 519 alleles at 180 loci with an average of 2.88 alleles/locus and the polymorphic information content (PIC ranged from 0.51 to 0.93 with an average of 0.83. The cross transferability ranged from 25% for wheat to 97.22% for Schlerostachya, with an average of 55.86%, and genetic relationships were established based on diversification among them. Moreover, 10 EST-SSRs were recognized as important markers between bulks of pooled DNA of sugarcane cultivars through BSA. This study highlights the employability of the markers in transferability, genetic diversity in grass species, and distinguished sugarcane bulks.
Rapid identification of red-flesh loquat cultivars using EST-SSR markers based on manual cultivar identification diagram strategy.

Science.gov (United States)

Li, X Y; Xu, H X; Chen, J W

2014-04-29

Manual cultivar identification diagram is a new strategy for plant cultivar identification based on DNA markers, providing information to efficiently separate cultivars. We tested 25 pairs of apple EST-SSR primers for amplification of PCR products from loquat cultivars. These EST-SSR primers provided clear amplification products from the loquat cultivars, with a relatively high transferability rate of 84% to loquat; 11 pairs of primers amplified polymorphic products. After analysis of 24 red-fleshed loquat accessions, we found that only 7 pairs of primers could clearly separate all of them. A cultivar identification diagram of the 24 cultivars was constructed using polymorphic bands from the DNA fingerprints and EST-SSR primers. Any two of the 24 cultivars could be rapidly separated from each other, according to the polymorphic bands from the cultivars; the corresponding primers were marked in the correct position on the cultivar identification diagram. This red-flesh loquat cultivar identification diagram can separate the 24 red-flesh loquat cultivars, which is of benefit for loquat cultivar identification for germplasm management and breeding programs.
EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries.

Science.gov (United States)

Smith, Robin P; Buchser, William J; Lemmon, Marcus B; Pardinas, Jose R; Bixby, John L; Lemmon, Vance P

2008-04-10

Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. We have developed "EST Express", a suite of analytical tools that identify and annotate ESTs originating from specific mRNA populations. The software consists of a user-friendly GUI powered by PHP and MySQL that allows for online collaboration between researchers and continuity with UniGene, Entrez Gene and RefSeq. Two key features of the software include a novel, simplified Entrez Gene parser and tools to manage cDNA library sequencing projects. We have tested the software on a large data set (2,016 samples) produced by subtractive hybridization. EST Express is an open-source, cross-platform web server application that imports sequences from cDNA libraries, such as those generated through subtractive hybridization or yeast two-hybrid screens. It then provides several layers of annotation based on Entrez Gene and RefSeq to allow the user to highlight useful genes and manage cDNA library projects.
EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries

Directory of Open Access Journals (Sweden)

Pardinas Jose R

2008-04-01

Full Text Available Abstract Background Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. Results We have developed "EST Express", a suite of analytical tools that identify and annotate ESTs originating from specific mRNA populations. The software consists of a user-friendly GUI powered by PHP and MySQL that allows for online collaboration between researchers and continuity with UniGene, Entrez Gene and RefSeq. Two key features of the software include a novel, simplified Entrez Gene parser and tools to manage cDNA library sequencing projects. We have tested the software on a large data set (2,016 samples produced by subtractive hybridization. Conclusion EST Express is an open-source, cross-platform web server application that imports sequences from cDNA libraries, such as those generated through subtractive hybridization or yeast two-hybrid screens. It then provides several layers of annotation based on Entrez Gene and RefSeq to allow the user to highlight useful genes and manage cDNA library projects.
simple sequence repeats (EST-SSR)

African Journals Online (AJOL)

Yomi

2012-01-19

Jan 19, 2012 ... 212 primer pairs selected, based on repeat patterns of n≥8 for di-, tri-, tetra- and penta-nucleotide repeat ... Cluster analysis revealed a high genetic similarity among the sugarcane (Saccharum spp.) breeding lines which could reduce the genetic gain in ..... The multiple allele characteristic of SSR com-.
EST-SSR marker revealed effective over biochemical and morphological scepticism towards identification of specific turmeric (Curcuma longa L.) cultivars.

Science.gov (United States)

Sahoo, Ambika; Jena, Sudipta; Kar, Basudeba; Sahoo, Suprava; Ray, Asit; Singh, Subhashree; Joshi, Raj Kumar; Acharya, Laxmikanta; Nayak, Sanghamitra

2017-05-01

Turmeric (Curcuma longa L., family Zingiberaceae) is one of the most economically important plants for its use in food, medicine, and cosmetic industries. Cultivar identification is a major constraint in turmeric, owing to high degree of morphological similarity that in turn, affects its commercialization. The present study addresses this constraint, using EST-SSR marker based, molecular identification of 8 elite cultivars and 88 accessions in turmeric. Fifty EST-SSR primers were screened against eight cultivars of turmeric (Suroma, Roma, Lakadong, Megha, Alleppey Supreme, Kedaram, Pratibha, and Suvarna); out of which 11 primers showed polymorphic banding pattern. The polymorphic information content (PIC) of these primers ranged from 0.13 to 0.48. However, only three SSR loci (CSSR 14, CSSR 15, and CSSR 18) gave reproducible unique banding pattern clearly distinguishing the cultivars 'Lakadong' and 'Suvarna' from other cultivars tested. These three unique SSR markers also proved to be effective in identification of 'Lakadong' cultivars when analysed with 88 accessions of turmeric collected from different agro-climatic regions. Furthermore, two identified cultivars (Lakadong and Suvarna) could also be precisely differentiated when analysed and based on phylogenetic tree, with other 94 genotypes of turmeric. The novel SSR markers can be used for identification and authentication of two commercially important turmeric cultivars 'Lakadong' and 'Suvarna'.

Genetic Diversity and Association of EST-SSR and SCoT Markers with Rust Traits in Orchardgrass (Dactylis glomerata L.).

Science.gov (United States)

Yan, Haidong; Zhang, Yu; Zeng, Bing; Yin, Guohua; Zhang, Xinquan; Ji, Yang; Huang, Linkai; Jiang, Xiaomei; Liu, Xinchun; Peng, Yan; Ma, Xiao; Yan, Yanhong

2016-01-08

Orchardgrass (Dactylis glomerata L.), is a well-known perennial forage species; however, rust diseases have caused a noticeable reduction in the quality and production of orchardgrass. In this study, genetic diversity was assessed and the marker-trait associations for rust were examined using 18 EST-SSR and 21 SCoT markers in 75 orchardgrass accessions. A high level of genetic diversity was detected in orchardgrass with an average genetic diversity index of 0.369. For the EST-SSR and SCoT markers, 164 and 289 total bands were obtained, of which 148 (90.24%) and 272 (94.12%) were polymorphic, respectively. Results from an AMOVA analysis showed that more genetic variance existed within populations (87.57%) than among populations (12.43%). Using a parameter marker index, the efficiencies of the EST-SSR and SCoT markers were compared to show that SCoTs have higher marker efficiency (8.07) than EST-SSRs (4.82). The results of a UPGMA cluster analysis and a STRUCTURE analysis were both correlated with the geographic distribution of the orchardgrass accessions. Linkage disequilibrium analysis revealed an average r² of 0.1627 across all band pairs, indicating a high extent of linkage disequilibrium in the material. An association analysis between the rust trait and 410 bands from the EST-SSR and SCoT markers using TASSEL software revealed 20 band panels were associated with the rust trait in both 2011 and 2012. The 20 bands obtained from association analysis could be used in breeding programs for lineage selection to prevent great losses of orchardgrass caused by rust, and provide valuable information for further association mapping using this collection of orchardgrass.
Development and Evaluation of a Novel Set of EST-SSR Markers Based on Transcriptome Sequences of Black Locust (Robinia pseudoacacia L.).

Science.gov (United States)

Guo, Qi; Wang, Jin-Xing; Su, Li-Zhuo; Lv, Wei; Sun, Yu-Han; Li, Yun

2017-07-07

Black locust ( Robinia pseudoacacia L. of the family Fabaceae) is an ecologically and economically important deciduous tree. However, few genomic resources are available for this forest species, and few effective expressed sequence tag-derived simple sequence repeat (EST-SSR) markers have been developed to date. In this study, paired-end sequencing was used to sequence transcriptomes of R. pseudoacacia by the Illumina HiSeq TM2000 platform, and EST-SSR loci were identified by de novo assembly. Furthermore, a total of 1697 primer pairs were successfully designed, from which 286 primers met the selection screening criteria; 94 pairs were randomly selected and tested for validation using polymerase chain reaction amplification. Forty-five primers were verified as polymorphic, with clear bands. The polymorphism information content values were 0.033-0.765, the number of alleles per locus ranged from 2 to 10, and the observed and expected heterozygosities were 0.000-0.931 and 0.035-0.810, respectively, indicating a high level of informativeness. Subsequently, 45 polymorphic EST-SSR loci were tested for amplification efficiency, using the verified primers, in an additional nine species of Leguminosae, 23 loci were amplified in more than three species, of which two loci were amplified successfully in all species. These EST-SSR markers provide a valuable tool for investigating the genetic diversity and population structure of R . pseudoacacia , constructing a DNA fingerprint database, performing quantitative trait locus mapping, and preserving genetic information.
EST-derived SSR markers used as anchor loci for the construction of a consensus linkage map in ryegrass (Lolium spp.

Directory of Open Access Journals (Sweden)

Studer Bruno

2010-08-01

Full Text Available Abstract Background Genetic markers and linkage mapping are basic prerequisites for marker-assisted selection and map-based cloning. In the case of the key grassland species Lolium spp., numerous mapping populations have been developed and characterised for various traits. Although some genetic linkage maps of these populations have been aligned with each other using publicly available DNA markers, the number of common markers among genetic maps is still low, limiting the ability to compare candidate gene and QTL locations across germplasm. Results A set of 204 expressed sequence tag (EST-derived simple sequence repeat (SSR markers has been assigned to map positions using eight different ryegrass mapping populations. Marker properties of a subset of 64 EST-SSRs were assessed in six to eight individuals of each mapping population and revealed 83% of the markers to be polymorphic in at least one population and an average number of alleles of 4.88. EST-SSR markers polymorphic in multiple populations served as anchor markers and allowed the construction of the first comprehensive consensus map for ryegrass. The integrated map was complemented with 97 SSRs from previously published linkage maps and finally contained 284 EST-derived and genomic SSR markers. The total map length was 742 centiMorgan (cM, ranging for individual chromosomes from 70 cM of linkage group (LG 6 to 171 cM of LG 2. Conclusions The consensus linkage map for ryegrass based on eight mapping populations and constructed using a large set of publicly available Lolium EST-SSRs mapped for the first time together with previously mapped SSR markers will allow for consolidating existing mapping and QTL information in ryegrass. Map and markers presented here will prove to be an asset in the development for both molecular breeding of ryegrass as well as comparative genetics and genomics within grass species.
An annotated genetic map of loblolly pine based on microsatellite and cDNA markers

Directory of Open Access Journals (Sweden)

Wimalanathan Kokulapalan

2011-01-01

Full Text Available Abstract Background Previous loblolly pine (Pinus taeda L. genetic linkage maps have been based on a variety of DNA polymorphisms, such as AFLPs, RAPDs, RFLPs, and ESTPs, but only a few SSRs (simple sequence repeats, also known as simple tandem repeats or microsatellites, have been mapped in P. taeda. The objective of this study was to integrate a large set of SSR markers from a variety of sources and published cDNA markers into a composite P. taeda genetic map constructed from two reference mapping pedigrees. A dense genetic map that incorporates SSR loci will benefit complete pine genome sequencing, pine population genetics studies, and pine breeding programs. Careful marker annotation using a variety of references further enhances the utility of the integrated SSR map. Results The updated P. taeda genetic map, with an estimated genome coverage of 1,515 cM(Kosambi across 12 linkage groups, incorporated 170 new SSR markers and 290 previously reported SSR, RFLP, and ESTP markers. The average marker interval was 3.1 cM. Of 233 mapped SSR loci, 84 were from cDNA-derived sequences (EST-SSRs and 149 were from non-transcribed genomic sequences (genomic-SSRs. Of all 311 mapped cDNA-derived markers, 77% were associated with NCBI Pta UniGene clusters, 67% with RefSeq proteins, and 62% with functional Gene Ontology (GO terms. Duplicate (i.e., redundant accessory and paralogous markers were tentatively identified by evaluating marker sequences by their UniGene cluster IDs, clone IDs, and relative map positions. The average gene diversity, He, among polymorphic SSR loci, including those that were not mapped, was 0.43 for 94 EST-SSRs and 0.72 for 83 genomic-SSRs. The genetic map can be viewed and queried at http://www.conifergdb.org/pinemap. Conclusions Many polymorphic and genetically mapped SSR markers are now available for use in P. taeda population genetics, studies of adaptive traits, and various germplasm management applications. Annotating mapped
Genome-Wide Characterization of Simple Sequence Repeat (SSR) Loci in Chinese Jujube and Jujube SSR Primer Transferability

Science.gov (United States)

Xiao, Jing; Zhao, Jin; Liu, Mengjun; Liu, Ping; Dai, Li; Zhao, Zhihui

2015-01-01

Chinese jujube (Ziziphus jujuba), an economically important species in the Rhamnaceae family, is a popular fruit tree in Asia. Here, we surveyed and characterized simple sequence repeats (SSRs) in the jujube genome. A total of 436,676 SSR loci were identified, with an average distance of 0.93 Kb between the loci. A large proportion of the SSRs included mononucleotide, dinucleotide and trinucleotide repeat motifs, which accounted for 64.87%, 24.40%, and 8.74% of all repeats, respectively. Among the mononucleotide repeats, A/T was the most common, whereas AT/TA was the most common dinucleotide repeat. A total of 30,565 primer pairs were successfully designed and screened using a series of criteria. Moreover, 725 of 1,000 randomly selected primer pairs were effective among 6 cultivars, and 511 of these primer pairs were polymorphic. Sequencing the amplicons of two SSRs across three jujube cultivars revealed variations in the repeats. The transferability of jujube SSR primers proved that 35/64 SSRs could be transferred across family boundary. Using jujube SSR primers, clustering analysis results from 15 species were highly consistent with the Angiosperm Phylogeny Group (APGIII) System. The genome-wide characterization of SSRs in Chinese jujube is very valuable for whole-genome characterization and marker-assisted selection in jujube breeding. In addition, the transferability of jujube SSR primers could provide a solid foundation for their further utilization. PMID:26000739
Development of a novel set of EST-SSR markers and cross-species amplification in Tamarix africana (Tamaricaceae).

Science.gov (United States)

Terzoli, Serena; Beritognolo, Isacco; Sabatti, Maurizio; Kuzminsky, Elena

2010-06-01

Tamarix plants are resistant to abiotic stresses and have become invasive in North America. Their taxonomy is troublesome, and few molecular makers are available to enable species identification or to track the spread of specific invasive genotypes. Transcriptome sequencing projects offer a potential source for the development of new markers. • Thirteen polymorphic simple sequence repeats (SSRs) markers derived from Expressed Sequence Tags (ESTs) from Tamarix hispida, T. androssowii, T. ramosissima, and T. albiflonum were identified and screened on 24 samples of T. africana to detect polymorphism. The number of alleles per locus ranged from two to eight, with an average of 4.3 alleles per locus, and the mean expected heterozygosity was 0.453. • Amplification products of these 13 loci were also generated for T. gallica. These new EST-SSR markers will be useful in genetic characterization of Tamarix, as additional tools for taxonomic clarification, and for studying invasive populations where they are a threat.
Pairagon+N-SCAN_EST: a model-based gene annotation pipeline

DEFF Research Database (Denmark)

Arumugam, Manimozhiyan; Wei, Chaochun; Brown, Randall H

2006-01-01

This paper describes Pairagon+N-SCAN_EST, a gene annotation pipeline that uses only native alignments. For each expressed sequence it chooses the best genomic alignment. Systems like ENSEMBL and ExoGean rely on trans alignments, in which expressed sequences are aligned to the genomic loci...... with de novo gene prediction by using N-SCAN_EST. N-SCAN_EST is based on a generalized HMM probability model augmented with a phylogenetic conservation model and EST alignments. It can predict complete transcripts by extending or merging EST alignments, but it can also predict genes in regions without EST...
Construction of a genetic map using EST-SSR markers and QTL analysis of major agronomic characters in hexaploid sweet potato (Ipomoea batatas (L.) Lam).

Science.gov (United States)

Kim, Jin-Hee; Chung, Il Kyung; Kim, Kyung-Min

2017-01-01

The Sweet potato, Ipomoea batatas (L.) Lam, is difficult to study in genetics and genomics because it is a hexaploid. The sweet potato study not have been performed domestically or internationally. In this study was performed to construct genetic map and quantitative trait loci (QTL) analysis. A total of 245 EST-SSR markers were developed, and the map was constructed by using 210 of those markers. The total map length was 1508.1 cM, and the mean distance between markers was 7.2 cM. Fifteen characteristics were investigated for QTLs analysis. According to those, the Four QTLs were identified, and The LOD score was 3.0. Further studies need to develop molecular markers in terms of EST-SSR markers for doing to be capable of efficient breeding. The genetic map created here using EST-SSR markers will facilitate planned breeding of sweet potato cultivars with various desirable traits.
Development and mapping of a public reference set of SSR markers in Lolium perenne L.

NARCIS (Netherlands)

Bach, J.L.; Muylle, H.; Arens, P.F.P.; Andersen, C.H.; Bach Holm, P.; Ghesquiere, M.; Julier, B.; Lubberstedt, T.; Nielsen, K.K.; Riek, de J.; Roldán-Ruiz, I.; Roulund, N.; Taylor, C.; Vosman, B.J.; Barre, P.

2005-01-01

We report on the characterization and mapping of 76 simple sequence repeat (SSR) markers for Lolium perenne. These markers are publicly available or obtained either from genomic libraries enriched for SSR motifs or L. perenne expressed sequence tag (EST) clones. Four L. perenne mapping populations
Comparative high-throughput transcriptome sequencing and development of SiESTa, the Silene EST annotation database

Directory of Open Access Journals (Sweden)

Marais Gabriel AB

2011-07-01

Full Text Available Abstract Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO terms, and thousands of single-nucleotide polymorphisms (SNPs were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49% that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch. Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to
Comparative high-throughput transcriptome sequencing and development of SiESTa, the Silene EST annotation database

Science.gov (United States)

2011-01-01

Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch. Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to further develop Silene as a
High polymorphism in Est-SSR loci for cellulose synthase and β-amylase of sugarcane varieties (Saccharum spp.) used by the industrial sector for ethanol production.

Science.gov (United States)

Augusto, Raphael; Maranho, Rone Charles; Mangolin, Claudete Aparecida; Pires da Silva Machado, Maria de Fátima

2015-01-01

High and low polymorphisms in simple sequence repeats of expressed sequence tag (EST-SSR) for specific proteins and enzymes, such as β-amylase, cellulose synthase, xyloglucan endotransglucosylase, fructose 1,6-bisphosphate aldolase, and fructose 1,6-bisphosphatase, were used to illustrate the genetic divergence within and between varieties of sugarcane (Saccharum spp.) and to guide the technological paths to optimize ethanol production from lignocellulose biomass. The varieties RB72454, RB867515, RB92579, and SP813250 on the second stage of cutting, all grown in the state of Paraná (PR), and the varieties RB92579 and SP813250 cultured in the PR state and in Northeastern Brazil, state of Pernambuco (PE), were analyzed using five EST-SSR primers for EstC66, EstC67, EstC68, EstC69, and EstC91 loci. Genetic divergence was evident in the EstC67 and EstC69 loci for β-amylase and cellulose synthase, respectively, among the four sugarcane varieties. An extremely high level of genetic differentiation was also detected in the EstC67 locus from the RB82579 and SP813250 varieties cultured in the PR and PE states. High polymorphism in SSR of the cellulose synthase locus may explain the high variability of substrates used in pretreatment and enzymatic hydrolysis processes, which has been an obstacle to effective industrial adaptations.
Population structure and genetic diversity characterization of a sunflower association mapping population using SSR and SNP markers.

Science.gov (United States)

Filippi, Carla V; Aguirre, Natalia; Rivas, Juan G; Zubrzycki, Jeremias; Puebla, Andrea; Cordes, Diego; Moreno, Maria V; Fusari, Corina M; Alvarez, Daniel; Heinz, Ruth A; Hopp, Horacio E; Paniego, Norma B; Lia, Veronica V

2015-02-13

Argentina has a long tradition of sunflower breeding, and its germplasm is a valuable genetic resource worldwide. However, knowledge of the genetic constitution and variability levels of the Argentinean germplasm is still scarce, rendering the global map of cultivated sunflower diversity incomplete. In this study, 42 microsatellite loci and 384 single nucleotide polymorphisms (SNPs) were used to characterize the first association mapping population used for quantitative trait loci mapping in sunflower, along with a selection of allied open-pollinated and composite populations from the germplasm bank of the National Institute of Agricultural Technology of Argentina. The ability of different kinds of markers to assess genetic diversity and population structure was also evaluated. The analysis of polymorphism in the set of sunflower accessions studied here showed that both the microsatellites and SNP markers were informative for germplasm characterization, although to different extents. In general, the estimates of genetic variability were moderate. The average genetic diversity, as quantified by the expected heterozygosity, was 0.52 for SSR loci and 0.29 for SNPs. Within SSR markers, those derived from non-coding regions were able to capture higher levels of diversity than EST-SSR. A significant correlation was found between SSR and SNP- based genetic distances among accessions. Bayesian and multivariate methods were used to infer population structure. Evidence for the existence of three different genetic groups was found consistently across data sets (i.e., SSR, SNP and SSR + SNP), with the maintainer/restorer status being the most prevalent characteristic associated with group delimitation. The present study constitutes the first report comparing the performance of SSR and SNP markers for population genetics analysis in cultivated sunflower. We show that the SSR and SNP panels examined here, either used separately or in conjunction, allowed consistent
Genic and Intergenic SSR Database Generation, SNPs Determination and Pathway Annotations, in Date Palm (Phoenix dactylifera L.).

Science.gov (United States)

Mokhtar, Morad M; Adawy, Sami S; El-Assal, Salah El-Din S; Hussein, Ebtissam H A

2016-01-01

The present investigation was carried out aiming to use the bioinformatics tools in order to identify and characterize, simple sequence repeats within the third Version of the date palm genome and develop a new SSR primers database. In addition single nucleotide polymorphisms (SNPs) that are located within the SSR flanking regions were recognized. Moreover, the pathways for the sequences assigned by SSR primers, the biological functions and gene interaction were determined. A total of 172,075 SSR motifs was identified on date palm genome sequence with a frequency of 450.97 SSRs per Mb. Out of these, 130,014 SSRs (75.6%) were located within the intergenic regions with a frequency of 499 SSRs per Mb. While, only 42,061 SSRs (24.4%) were located within the genic regions with a frequency of 347.5 SSRs per Mb. A total of 111,403 of SSR primer pairs were designed, that represents 291.9 SSR primers per Mb. Out of the 111,403, only 31,380 SSR primers were in the genic regions, while 80,023 primers were in the intergenic regions. A number of 250,507 SNPs were recognized in 84,172 SSR flanking regions, which represents 75.55% of the total SSR flanking regions. Out of 12,274 genes only 463 genes comprising 896 SSR primers were mapped onto 111 pathways using KEGG data base. The most abundant enzymes were identified in the pathway related to the biosynthesis of antibiotics. We tested 1031 SSR primers using both publicly available date palm genome sequences as templates in the in silico PCR reactions. Concerning in vitro validation, 31 SSR primers among those used in the in silico PCR were synthesized and tested for their ability to detect polymorphism among six Egyptian date palm cultivars. All tested primers have successfully amplified products, but only 18 primers detected polymorphic amplicons among the studied date palm cultivars.
Development and validation of new SSR markers from expressed regions in the garlic genome

Directory of Open Access Journals (Sweden)

Meryem Ipek

2015-02-01

Full Text Available Only a limited number of simple sequence repeat (SSR markers is available for the genome of garlic (Allium sativum L. despite the fact that SSR markers have become one of the most preferred DNA marker systems. To develop new SSR markers for the garlic genome, garlic expressed sequence tags (ESTs at the publicly available GarlicEST database were screened for SSR motifs and a total of 132 SSR motifs were identified. Primer pairs were designed for 50 SSR motifs and 24 of these primer pairs were selected as SSR markers based on their consistent amplification patterns and polymorphisms. In addition, two SSR markers were developed from the sequences of garlic cDNA-AFLP fragments. The use of 26 EST-SSR markers for the assessment of genetic relationship was tested using 31 garlic genotypes. Twenty six EST-SSR markers amplified 130 polymorphic DNA fragments and the number of polymorphic alleles per SSR marker ranged from 2 to 13 with an average of 5 alleles. Observed heterozygosity and polymorphism information content (PIC of the SSR markers were between 0.23 and 0.88, and 0.20 and 0.87, respectively. Twenty one out of the 31 garlic genotypes were analyzed in a previous study using AFLP markers and the garlic genotypes clustered together with AFLP markers were also grouped together with EST-SSR markers demonstrating high concordance between AFLP and EST-SSR marker systems and possible immediate application of EST-SSR markers for fingerprinting of garlic clones. EST-SSR markers could be used in genetic studies such as genetic mapping, association mapping, genetic diversity and comparison of the genomes of Allium species.
Development and characterization of genic SSR markers from low ...

Indian Academy of Sciences (India)

Development and characterization of genic SSR markers from low depth genome ... A variety of molecular markers are currently ... chloroform method (Sambrook et al. 1989). ..... Available online, http://www.iucnredlist.org/details/168255/0.
The characterization of a new set of EST-derived simple sequence repeat (SSR markers as a resource for the genetic analysis of Phaseolus vulgaris

Directory of Open Access Journals (Sweden)

Borba Tereza CO

2011-05-01

Full Text Available Abstract Background Over recent years, a growing effort has been made to develop microsatellite markers for the genomic analysis of the common bean (Phaseolus vulgaris to broaden the knowledge of the molecular genetic basis of this species. The availability of large sets of expressed sequence tags (ESTs in public databases has given rise to an expedient approach for the identification of SSRs (Simple Sequence Repeats, specifically EST-derived SSRs. In the present work, a battery of new microsatellite markers was obtained from a search of the Phaseolus vulgaris EST database. The diversity, degree of transferability and polymorphism of these markers were tested. Results From 9,583 valid ESTs, 4,764 had microsatellite motifs, from which 377 were used to design primers, and 302 (80.11% showed good amplification quality. To analyze transferability, a group of 167 SSRs were tested, and the results showed that they were 82% transferable across at least one species. The highest amplification rates were observed between the species from the Phaseolus (63.7%, Vigna (25.9%, Glycine (19.8%, Medicago (10.2%, Dipterix (6% and Arachis (1.8% genera. The average PIC (Polymorphism Information Content varied from 0.53 for genomic SSRs to 0.47 for EST-SSRs, and the average number of alleles per locus was 4 and 3, respectively. Among the 315 newly tested SSRs in the BJ (BAT93 X Jalo EEP558 population, 24% (76 were polymorphic. The integration of these segregant loci into a framework map composed of 123 previously obtained SSR markers yielded a total of 199 segregant loci, of which 182 (91.5% were mapped to 14 linkage groups, resulting in a map length of 1,157 cM. Conclusions A total of 302 newly developed EST-SSR markers, showing good amplification quality, are available for the genetic analysis of Phaseolus vulgaris. These markers showed satisfactory rates of transferability, especially between species that have great economic and genomic values. Their diversity
SSR allelic variation in almond (Prunus dulcis Mill.).

Science.gov (United States)

Xie, Hua; Sui, Yi; Chang, Feng-Qi; Xu, Yong; Ma, Rong-Cai

2006-01-01

Sixteen SSR markers including eight EST-SSR and eight genomic SSRs were used for genetic diversity analysis of 23 Chinese and 15 international almond cultivars. EST- and genomic SSR markers previously reported in species of Prunus, mainly peach, proved to be useful for almond genetic analysis. DNA sequences of 117 alleles of six of the 16 SSR loci were analysed to reveal sequence variation among the 38 almond accessions. For the four SSR loci with AG/CT repeats, no insertions or deletions were observed in the flanking regions of the 98 alleles sequenced. Allelic size variation of these loci resulted exclusively from differences in the structures of repeat motifs, which involved interruptions or occurrences of new motif repeats in addition to varying number of AG/CT repeats. Some alleles had a high number of uninterrupted repeat motifs, indicating that SSR mutational patterns differ among alleles at a given SSR locus within the almond species. Allelic homoplasy was observed in the SSR loci because of base substitutions, interruptions or compound repeat motifs. Substitutions in the repeat regions were found at two SSR loci, suggesting that point mutations operate on SSRs and hinder the further SSR expansion by introducing repeat interruptions to stabilize SSR loci. Furthermore, it was shown that some potential point mutations in the flanking regions are linked with new SSR repeat motif variation in almond and peach.
Development of eSSR-Markers in Setaria italica and Their Applicability in Studying Genetic Diversity, Cross-Transferability and Comparative Mapping in Millet and Non-Millet Species.

Science.gov (United States)

Kumari, Kajal; Muthamilarasan, Mehanathan; Misra, Gopal; Gupta, Sarika; Subramanian, Alagesan; Parida, Swarup Kumar; Chattopadhyay, Debasis; Prasad, Manoj

2013-01-01

Foxtail millet (Setariaitalica L.) is a tractable experimental model crop for studying functional genomics of millets and bioenergy grasses. But the limited availability of genomic resources, particularly expressed sequence-based genic markers is significantly impeding its genetic improvement. Considering this, we attempted to develop EST-derived-SSR (eSSR) markers and utilize them in germplasm characterization, cross-genera transferability and in silico comparative mapping. From 66,027 foxtail millet EST sequences 24,828 non-redundant ESTs were deduced, representing ~16 Mb, which revealed 534 (~2%) eSSRs in 495 SSR containing ESTs at a frequency of 1/30 kb. A total of 447 pp were successfully designed, of which 327 were mapped physically onto nine chromosomes. About 106 selected primer pairs representing the foxtail millet genome showed high-level of cross-genera amplification at an average of ~88% in eight millets and four non-millet species. Broad range of genetic diversity (0.02-0.65) obtained in constructed phylogenetic tree using 40 eSSR markers demonstrated its utility in germplasm characterizations and phylogenetics. Comparative mapping of physically mapped eSSR markers showed considerable proportion of sequence-based orthology and syntenic relationship between foxtail millet chromosomes and sorghum (~68%), maize (~61%) and rice (~42%) chromosomes. Synteny analysis of eSSRs of foxtail millet, rice, maize and sorghum suggested the nested chromosome fusion frequently observed in grass genomes. Thus, for the first time we had generated large-scale eSSR markers in foxtail millet and demonstrated their utility in germplasm characterization, transferability, phylogenetics and comparative mapping studies in millets and bioenergy grass species.
A fast and cost-effective approach to develop and map EST-SSR markers: oak as a case study

Directory of Open Access Journals (Sweden)

Cherubini Marcello

2010-10-01

Full Text Available Abstract Background Expressed Sequence Tags (ESTs are a source of simple sequence repeats (SSRs that can be used to develop molecular markers for genetic studies. The availability of ESTs for Quercus robur and Quercus petraea provided a unique opportunity to develop microsatellite markers to accelerate research aimed at studying adaptation of these long-lived species to their environment. As a first step toward the construction of a SSR-based linkage map of oak for quantitative trait locus (QTL mapping, we describe the mining and survey of EST-SSRs as well as a fast and cost-effective approach (bin mapping to assign these markers to an approximate map position. We also compared the level of polymorphism between genomic and EST-derived SSRs and address the transferability of EST-SSRs in Castanea sativa (chestnut. Results A catalogue of 103,000 Sanger ESTs was assembled into 28,024 unigenes from which 18.6% presented one or more SSR motifs. More than 42% of these SSRs corresponded to trinucleotides. Primer pairs were designed for 748 putative unigenes. Overall 37.7% (283 were found to amplify a single polymorphic locus in a reference full-sib pedigree of Quercus robur. The usefulness of these loci for establishing a genetic map was assessed using a bin mapping approach. Bin maps were constructed for the male and female parental tree for which framework linkage maps based on AFLP markers were available. The bin set consisting of 14 highly informative offspring selected based on the number and position of crossover sites. The female and male maps comprised 44 and 37 bins, with an average bin length of 16.5 cM and 20.99 cM, respectively. A total of 256 EST-SSRs were assigned to bins and their map position was further validated by linkage mapping. EST-SSRs were found to be less polymorphic than genomic SSRs, but their transferability rate to chestnut, a phylogenetically related species to oak, was higher. Conclusion We have generated a bin map for oak

Exploiting transcriptome data for the development and characterization of gene-based SSR markers related to cold tolerance in oil palm (Elaeis guineensis).

Science.gov (United States)

Xiao, Yong; Zhou, Lixia; Xia, Wei; Mason, Annaliese S; Yang, Yaodong; Ma, Zilong; Peng, Ming

2014-12-19

The oil palm (Elaeis guineensis, 2n = 32) has the highest oil yield of any crop species, as well as comprising the richest dietary source of provitamin A. For the tropical species, the best mean growth temperature is about 27°C, with a minimal growth temperature of 15°C. Hence, the plantation area is limited into the geographical ranges of 10°N to 10°S. Enhancing cold tolerance capability will increase the total cultivation area and subsequently oil productivity of this tropical species. Developing molecular markers related to cold tolerance would be helpful for molecular breeding of cold tolerant Elaeis guineensis. In total, 5791 gene-based SSRs were identified in 51,452 expressed sequences from Elaeis guineensis transcriptome data: approximately one SSR was detected per 10 expressed sequences. Of these 5791 gene-based SSRs, 916 were derived from expressed sequences up- or down-regulated at least two-fold in response to cold stress. A total of 182 polymorphic markers were developed and characterized from 442 primer pairs flanking these cold-responsive SSR repeats. The polymorphic information content (PIC) of these polymorphic SSR markers across 24 lines of Elaeis guineensis varied from 0.08 to 0.65 (mean = 0.31 ± 0.12). Using in-silico mapping, 137 (75.3%) of the 182 polymorphic SSR markers were located onto the 16 Elaeis guineensis chromosomes. Total coverage of 473 Mbp was achieved, with an average physical distance of 3.4 Mbp between adjacent markers (range 96 bp - 20.8 Mbp). Meanwhile, Comparative analysis of transcriptome under cold stress revealed that one ICE1 putative ortholog, five CBF putative orthologs, 19 NAC transcription factors and four cold-induced orhologs were up-regulated at least two fold in response to cold stress. Interestingly, 5' untranslated region of both Unigene21287 (ICE1) and CL2628.Contig1 (NAC) both contained an SSR markers. In the present study, a series of SSR markers were developed based on sequences
Development of eSSR-Markers in Setaria italica and Their Applicability in Studying Genetic Diversity, Cross-Transferability and Comparative Mapping in Millet and Non-Millet Species.

Directory of Open Access Journals (Sweden)

Kajal Kumari

Full Text Available Foxtail millet (Setariaitalica L. is a tractable experimental model crop for studying functional genomics of millets and bioenergy grasses. But the limited availability of genomic resources, particularly expressed sequence-based genic markers is significantly impeding its genetic improvement. Considering this, we attempted to develop EST-derived-SSR (eSSR markers and utilize them in germplasm characterization, cross-genera transferability and in silico comparative mapping. From 66,027 foxtail millet EST sequences 24,828 non-redundant ESTs were deduced, representing ~16 Mb, which revealed 534 (~2% eSSRs in 495 SSR containing ESTs at a frequency of 1/30 kb. A total of 447 pp were successfully designed, of which 327 were mapped physically onto nine chromosomes. About 106 selected primer pairs representing the foxtail millet genome showed high-level of cross-genera amplification at an average of ~88% in eight millets and four non-millet species. Broad range of genetic diversity (0.02-0.65 obtained in constructed phylogenetic tree using 40 eSSR markers demonstrated its utility in germplasm characterizations and phylogenetics. Comparative mapping of physically mapped eSSR markers showed considerable proportion of sequence-based orthology and syntenic relationship between foxtail millet chromosomes and sorghum (~68%, maize (~61% and rice (~42% chromosomes. Synteny analysis of eSSRs of foxtail millet, rice, maize and sorghum suggested the nested chromosome fusion frequently observed in grass genomes. Thus, for the first time we had generated large-scale eSSR markers in foxtail millet and demonstrated their utility in germplasm characterization, transferability, phylogenetics and comparative mapping studies in millets and bioenergy grass species.
Development and characterization of genomic SSR markers in Cynodon transvaalensis Burtt-Davy.

Science.gov (United States)

Tan, Chengcheng; Wu, Yanqi; Taliaferro, Charles M; Bell, Greg E; Martin, Dennis L; Smith, Mike W

2014-08-01

Simple sequence repeat (SSR) markers are a major molecular tool for genetic and genomic research that have been extensively developed and used in major crops. However, few are available in African bermudagrass (Cynodon transvaalensis Burtt-Davy), an economically important warm-season turfgrass species. African bermudagrass is mainly used for hybridizations with common bermudagrass [C. dactylon var. dactylon (L.) Pers.] in the development of superior interspecific hybrid turfgrass cultivars. Accordingly, the major objective of this study was to develop and characterize a large set of SSR markers. Genomic DNA of C. transvaalensis '4200TN 24-2' from an Oklahoma State University (OSU) turf nursery was extracted for construction of four SSR genomic libraries enriched with [CA](n), [GA](n), [AAG](n), and [AAT](n) as core repeat motifs. A total of 3,064 clones were sequenced at the OSU core facility. The sequences were categorized into singletons and contiguous sequences to exclude redundancy. From the two sequence categories, 1,795 SSR loci were identified. After excluding duplicate SSRs by comparison with previously developed SSR markers using a nucleotide basic local alignment tool, 1,426 unique primer pairs (PPs) were designed. Out of the 1,426 designed PPs, 981 (68.8 %) amplified alleles of the expected size in the donor DNA. Polymorphisms of the SSR PPs tested in eight C. transvaalensis plants were 93 % polymorphic with 544 markers effective in all genotypes. Inheritance of the SSRs was examined in six F(1) progeny of African parents 'T577' × 'Uganda', indicating 917 markers amplified heritable alleles. The SSR markers developed in the study are the first large set of co-dominant markers in African bermudagrass and should be highly valuable for molecular and traditional breeding research.
Development of SSR Markers and Assessment of Genetic Diversity in Medicinal Chrysanthemum morifolium Cultivars.

Science.gov (United States)

Feng, Shangguo; He, Renfeng; Lu, Jiangjie; Jiang, Mengying; Shen, Xiaoxia; Jiang, Yan; Wang, Zhi'an; Wang, Huizhong

2016-01-01

Chrysanthemum morifolium, is a well-known flowering plant worldwide, and has a high commercial, floricultural, and medicinal value. In this study, simple-sequence repeat (SSR) markers were generated from EST datasets and were applied to assess the genetic diversity among 32 cultivars. A total of 218 in silico SSR loci were identified from 7300 C. morifolium ESTs retrieved from GenBank. Of all SSR loci, 61.47% of them (134) were hexa-nucleotide repeats, followed by tri-nucleotide repeats (17.89%), di-nucleotide repeats (12.39%), tetra-nucleotide repeats (4.13%), and penta-nucleotide repeats (4.13%). In this study, 17 novel EST-SSR markers were verified. Along with 38 SSR markers reported previously, 55 C. morifolium SSR markers were selected for further genetic diversity analysis. PCR amplification of these EST-SSRs produced 1319 fragments, 1306 of which showed polymorphism. The average polymorphism information content of the SSR primer pairs was 0.972 (0.938-0.993), which showed high genetic diversity among C. morifolium cultivars. Based on SSR markers, 32 C. morifolium cultivars were separated into two main groups by partitioning of the clusters using the unweighted pair group method with arithmetic mean dendrogram, which was further supported by a principal coordinate analysis plot. Phylogenetic relationship among C. morifolium cultivars as revealed by SSR markers was highly consistent with the classification of medicinal C. morifolium populations according to their origin and ecological distribution. Our results demonstrated that SSR markers were highly reproducible and informative, and could be used to evaluate genetic diversity and relationships among medicinal C. morifolium cultivars.
Characterization of novel SSR markers in diverse sainfoin (Onobrychis viciifolia) germplasm.

Science.gov (United States)

Kempf, Katharina; Mora-Ortiz, Marina; Smith, Lydia M J; Kölliker, Roland; Skøt, Leif

2016-08-30

Sainfoin is a perennial forage legume with beneficial properties for animal husbandry due to the presence of secondary metabolites. However, worldwide cultivation of sainfoin is marginal due to the lack of varieties with good agronomic performance, adapted to a broad range of environmental conditions. Little is known about the genetics of sainfoin and only few genetic markers are available to assist breeding and genetic investigations. The objective of this study was to develop a set of SSR markers useful for genetic studies in sainfoin and their characterization in diverse germplasm. A set of 400 SSR primer combinations were tested for amplification and their ability to detect polymorphisms in a set of 32 sainfoin individuals, representing distinct varieties or landraces. Alleles were scored for presence or absence and polymorphism information content of each SSR locus was calculated with an adapted formula taking into account the tetraploid character of sainfoin. Relationships among individuals were visualized using cluster and principle components analysis. Of the 400 primer combinations tested, 101 reliably detected polymorphisms among the 32 sainfoin individuals. Among the 1154 alleles amplified 250 private alleles were observed. The number of alleles per locus ranged from 2 to 24 with an average of 11.4 alleles. The average polymorphism information content reached values of 0.14 to 0.36. The clustering of the 32 individuals suggested a separation into two groups depending on the origin of the accessions. The SSR markers characterized and tested in this study provide a valuable tool to detect polymorphisms in sainfoin for future genetic studies and breeding programs. As a proof of concept, we showed that these markers can be used to separate sainfoin individuals based on their origin.
tropiTree: An NGS-Based EST-SSR Resource for 24 Tropical Tree Species

Science.gov (United States)

Russell, Joanne R.; Hedley, Peter E.; Cardle, Linda; Dancey, Siobhan; Morris, Jenny; Booth, Allan; Odee, David; Mwaura, Lucy; Omondi, William; Angaine, Peter; Machua, Joseph; Muchugi, Alice; Milne, Iain; Kindt, Roeland; Jamnadass, Ramni; Dawson, Ian K.

2014-01-01

The development of genetic tools for non-model organisms has been hampered by cost, but advances in next-generation sequencing (NGS) have created new opportunities. In ecological research, this raises the prospect for developing molecular markers to simultaneously study important genetic processes such as gene flow in multiple non-model plant species within complex natural and anthropogenic landscapes. Here, we report the use of bar-coded multiplexed paired-end Illumina NGS for the de novo development of expressed sequence tag-derived simple sequence repeat (EST-SSR) markers at low cost for a range of 24 tree species. Each chosen tree species is important in complex tropical agroforestry systems where little is currently known about many genetic processes. An average of more than 5,000 EST-SSRs was identified for each of the 24 sequenced species, whereas prior to analysis 20 of the species had fewer than 100 nucleotide sequence citations. To make results available to potential users in a suitable format, we have developed an open-access, interactive online database, tropiTree (http://bioinf.hutton.ac.uk/tropiTree), which has a range of visualisation and search facilities, and which is a model for the efficient presentation and application of NGS data. PMID:25025376
SNP mining porcine ESTs with MAVIANT, a novel tool for SNP evaluation and annotation

DEFF Research Database (Denmark)

Panitz, Frank; Stengaard, Henrik; Hornshoj, Henrik

2007-01-01

MOTIVATION: Single nucleotide polymorphisms (SNPs) analysis is an important means to study genetic variation. A fast and cost-efficient approach to identify large numbers of novel candidates is the SNP mining of large scale sequencing projects. The increasing availability of sequence trace data...... manual annotation, which is immediately accessible and can be easily shared with external collaborators. RESULTS: Large-scale SNP mining of polymorphisms bases on porcine EST sequences yielded more than 7900 candidate SNPs in coding regions (cSNPs), which were annotated relative to the human genome. Non...
Characterization and compilation of polymorphic simple sequence repeat (SSR markers of peanut from public database

Directory of Open Access Journals (Sweden)

Zhao Yongli

2012-07-01

Full Text Available Abstract Background There are several reports describing thousands of SSR markers in the peanut (Arachis hypogaea L. genome. There is a need to integrate various research reports of peanut DNA polymorphism into a single platform. Further, because of lack of uniformity in the labeling of these markers across the publications, there is some confusion on the identities of many markers. We describe below an effort to develop a central comprehensive database of polymorphic SSR markers in peanut. Findings We compiled 1,343 SSR markers as detecting polymorphism (14.5% within a total of 9,274 markers. Amongst all polymorphic SSRs examined, we found that AG motif (36.5% was the most abundant followed by AAG (12.1%, AAT (10.9%, and AT (10.3%.The mean length of SSR repeats in dinucleotide SSRs was significantly longer than that in trinucleotide SSRs. Dinucleotide SSRs showed higher polymorphism frequency for genomic SSRs when compared to trinucleotide SSRs, while for EST-SSRs, the frequency of polymorphic SSRs was higher in trinucleotide SSRs than in dinucleotide SSRs. The correlation of the length of SSR and the frequency of polymorphism revealed that the frequency of polymorphism was decreased as motif repeat number increased. Conclusions The assembled polymorphic SSRs would enhance the density of the existing genetic maps of peanut, which could also be a useful source of DNA markers suitable for high-throughput QTL mapping and marker-assisted selection in peanut improvement and thus would be of value to breeders.
Development of 23 novel polymorphic EST-SSR markers for the endangered relict conifer Metasequoia glyptostroboides.

Science.gov (United States)

Jin, Yuqing; Bi, Quanxin; Guan, Wenbin; Mao, Jian-Feng

2015-09-01

Metasequoia glyptostroboides is an endangered relict conifer species endemic to China. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed using transcriptome mining for future genetic and functional studies. We collected 97,565 unigene sequences generated by 454 pyrosequencing. A bioinformatics analysis identified 2087 unique and putative microsatellites, from which 96 novel microsatellite markers were developed. Fifty-three of the 96 primer sets successfully amplified clear fragments of the expected sizes; 23 of those loci were polymorphic. The number of alleles per locus ranged from two to eight, with an average of three, and the observed and expected heterozygosity values ranged from 0 to 1.0 and 0.117 to 0.813, respectively. These microsatellite loci will enrich the genetic resources to develop functional studies and conservation strategies for this endangered relict species.
Functional molecular markers (EST-SSR) in the full-sib reciprocal recurrent selection program of maize (Zea mays L.).

Science.gov (United States)

Galvão, K S C; Ramos, H C C; Santos, P H A D; Entringer, G C; Vettorazzi, J C F; Pereira, M G

2015-07-03

This study aimed to improve grain yield in the full-sib reciprocal recurrent selection program of maize from the North Fluminense State University. In the current phase of the program, the goal is to maintain, or even increase, the genetic variability within and among populations, in order to increase heterosis of the 13th cycle of reciprocal recurrent selection. Microsatellite expressed sequence tags (EST-SSRs) were used as a tool to assist the maximization step of genetic variability, targeting the functional genome. Eighty S1 progenies of the 13th recur-rent selection cycle, 40 from each population (CIMMYT and Piranão), were analyzed using 20 EST-SSR loci. Genetic diversity, observed heterozygosity, information content of polymorphism, and inbreeding co-efficient were estimated. Subsequently, analysis of genetic dissimilarity, molecular variance, and a graphical dispersion of genotypes were conducted. The number of alleles in the CIMMYT population ranged from 1 to 6, while in the Piranão population the range was from 2 to 8, with a mean of 3.65 and 4.35, respectively. As evidenced by the number of alleles, the Shannon index showed greater diversity for the Piranão population (1.04) in relation to the CIMMYT population (0.89). The genic SSR markers were effective in clustering genotypes into their respective populations before selection and an increase in the variation between populations after selection was observed. The results indicate that the study populations have expressive genetic diversity, which cor-responds to the functional genome, indicating that this strategy may contribute to genetic gain, especially in association with the grain yield of future hybrids.
Extrapolative microRNA precursor based SSR mining from tea EST database in respect to agronomic traits.

Science.gov (United States)

Hazra, Anjan; Dasgupta, Nirjhar; Sengupta, Chandan; Das, Sauren

2017-07-06

Tea (Camellia sinensis, (L.) Kuntze) is considered as most popular drink across the world and it is widely consumed beverage for its several health-benefit characteristics. These positive traits primarily rely on its regulatory networks of different metabolic pathways. Development of microsatellite markers from the conserved genomic regions are being worthwhile for reviewing the genetic diversity of closely related species or self-pollinated species. Although several SSR markers have been reported, in tea, the trait-specific Simple Sequence Repeat (SSR) markers, leading to be useful in marker assisted breeding technique, are yet to be identified. Micro RNAs are short, non-coding RNA molecules, involved in post transcriptional mode of gene regulation and thus effects on related phenotype. Present study deals with identification of the microsatellite motifs within the reported and predicted miRNA precursors that are effectively followed by designing of primers from SSR flanking regions in order to PCR validation. In addition to the earlier reports, two new miRNAs are predicting here from tea expressed tag sequence database. Furthermore, 18 SSR motifs are found to be in 13 of all 33 predicted miRNAs. Trinucleotide motifs are most abundant among all followed by dinucleotides. Since, miRNA based SSR markers are evidenced to have significant role on genetic fingerprinting study, these outcomes would pave the way in developing novel markers for tagging tea specific agronomic traits as well as substantiating non-conventional breeding program.
Development of a set of SSR markers for genetic polymorphism detection and interspecific hybrid jute breeding

Directory of Open Access Journals (Sweden)

Dipnarayan Saha

2017-10-01

Full Text Available Corchorus capsularis (white jute and C. olitorius (dark jute are the two principal cultivated species of jute that produce natural bast fiber of commercial importance. We have identified 4509 simple sequence repeat (SSR loci from 34,163 unigene sequences of C. capsularis to develop a non-redundant set of 2079 flanking primer pairs. Among the SSRs, trinucleotide repeats were most frequent (60% followed by dinucleotide repeats (37.6%. Annotation of the SSR-containing unigenes revealed their putative functions in various biological and molecular processes, including responses to biotic and abiotic signals. Eighteen expressed gene-derived SSR (eSSR markers were successfully mapped to the existing single-nucleotide polymorphism (SNP linkage map of jute, providing additional anchor points. Amplification of 72% of the 74 randomly selected primer pairs was successful in a panel of 24 jute accessions, comprising five and twelve accessions of C. capsularis and C. olitorius, respectively, and seven wild jute species. Forty-three primer pairs produced an average of 2.7 alleles and 58.1% polymorphism in a panel of 24 jute accessions. The mean PIC value was 0.34 but some markers showed PIC values higher than 0.5, suggesting that these markers can efficiently measure genetic diversity and serve for mapping of quantitative trait loci (QTLs in jute. A primer polymorphism survey with parents of a wide-hybridized population between a cultivated jute and its wild relative revealed their efficacy for interspecific hybrid identification. For ready accessibility of jute eSSR primers, we compiled all information in a user-friendly web database, JuteMarkerdb (http://jutemarkerdb.icar.gov.in/ for the first time in jute. This eSSR resource in jute is expected to be of use in characterization of germplasm, interspecific hybrid and variety identification, and marker-assisted breeding of superior-quality jute.
Development of a core set of SSR markers for the characterization of Gossypium germplasm

Science.gov (United States)

Molecular markers such as simple sequence repeats (SSR) are a useful tool for characterizing genetic diversity of Gossypium germplasm collections. Genetic profiles by DNA fingerprinting of cotton accessions can only be compared among different collections if a common set of molecular markers are us...
Development and characterization of genomic SSR markers for Anneslea fragrans (Pentaphylacaceae).

Science.gov (United States)

Sun, Lijing; Meng, Kaikai; Liao, Boyong; Li, Chunmei; Zhang, Yue; Liao, Wenbo; Chen, Sufang

2017-10-01

The genus Anneslea (Pentaphylacaceae) contains four species and six varieties, most of which are locally endemic. Here, simple sequence repeat (SSR) markers were developed for the conservation of these species. The genome of A. fragrans was sequenced and de novo assembled into 445,162 contigs, of which 30,409 SSR loci were detected. Primers for 100 SSR loci were validated with PCR amplification in three populations of A. fragrans . Seventy-nine loci successfully amplified, and 30 were polymorphic. The mean number of alleles, observed heterozygosity, and expected heterozygosity were 7.01 ± 1.60, 0.817 ± 0.241, and 0.796 ± 0.145, respectively. Most primers could be amplified in Ternstroemia gymnanthera , T. kwangtungensis , and Cleyera pachyphylla . Our study demonstrated that shotgun genome sequencing is an efficient way to develop genomic SSR markers for nonmodel species. These genomic SSR loci will be valuable in population genetic studies in Anneslea and its relatives.
Development of 23 novel polymorphic EST-SSR markers for the endangered relict conifer Metasequoia glyptostroboides1

Science.gov (United States)

Jin, Yuqing; Bi, Quanxin; Guan, Wenbin; Mao, Jian-Feng

2015-01-01

Premise of the study: Metasequoia glyptostroboides is an endangered relict conifer species endemic to China. In this study, expressed sequence tag–simple sequence repeat (EST-SSR) markers were developed using transcriptome mining for future genetic and functional studies. Methods and Results: We collected 97,565 unigene sequences generated by 454 pyrosequencing. A bioinformatics analysis identified 2087 unique and putative microsatellites, from which 96 novel microsatellite markers were developed. Fifty-three of the 96 primer sets successfully amplified clear fragments of the expected sizes; 23 of those loci were polymorphic. The number of alleles per locus ranged from two to eight, with an average of three, and the observed and expected heterozygosity values ranged from 0 to 1.0 and 0.117 to 0.813, respectively. Conclusions: These microsatellite loci will enrich the genetic resources to develop functional studies and conservation strategies for this endangered relict species. PMID:26421250
Genome-wide microsatellite characterization and marker development in the sequenced Brassica crop species.

Science.gov (United States)

Shi, Jiaqin; Huang, Shunmou; Zhan, Jiepeng; Yu, Jingyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong

2014-02-01

Although much research has been conducted, the pattern of microsatellite distribution has remained ambiguous, and the development/utilization of microsatellite markers has still been limited/inefficient in Brassica, due to the lack of genome sequences. In view of this, we conducted genome-wide microsatellite characterization and marker development in three recently sequenced Brassica crops: Brassica rapa, Brassica oleracea and Brassica napus. The analysed microsatellite characteristics of these Brassica species were highly similar or almost identical, which suggests that the pattern of microsatellite distribution is likely conservative in Brassica. The genomic distribution of microsatellites was highly non-uniform and positively or negatively correlated with genes or transposable elements, respectively. Of the total of 115 869, 185 662 and 356 522 simple sequence repeat (SSR) markers developed with high frequencies (408.2, 343.8 and 356.2 per Mb or one every 2.45, 2.91 and 2.81 kb, respectively), most represented new SSR markers, the majority had determined physical positions, and a large number were genic or putative single-locus SSR markers. We also constructed a comprehensive database for the newly developed SSR markers, which was integrated with public Brassica SSR markers and annotated genome components. The genome-wide SSR markers developed in this study provide a useful tool to extend the annotated genome resources of sequenced Brassica species to genetic study/breeding in different Brassica species.
Gene-based SSR markers for common bean (Phaseolus vulgaris L.) derived from root and leaf tissue ESTs: an integration of the BMc series.

Science.gov (United States)

Blair, Matthew W; Hurtado, Natalia; Chavarro, Carolina M; Muñoz-Torres, Monica C; Giraldo, Martha C; Pedraza, Fabio; Tomkins, Jeff; Wing, Rod

2011-03-22

Sequencing of cDNA libraries for the development of expressed sequence tags (ESTs) as well as for the discovery of simple sequence repeats (SSRs) has been a common method of developing microsatellites or SSR-based markers. In this research, our objective was to further sequence and develop common bean microsatellites from leaf and root cDNA libraries derived from the Andean gene pool accession G19833 and the Mesoamerican gene pool accession DOR364, mapping parents of a commonly used reference map. The root libraries were made from high and low phosphorus treated plants. A total of 3,123 EST sequences from leaf and root cDNA libraries were screened and used for direct simple sequence repeat discovery. From these EST sequences we found 184 microsatellites; the majority containing tri-nucleotide motifs, many of which were GC rich (ACC, AGC and AGG in particular). Di-nucleotide motif microsatellites were about half as common as the tri-nucleotide motif microsatellites but most of these were AGn microsatellites with a moderate number of ATn microsatellites in root ESTs followed by few ACn and no GCn microsatellites. Out of the 184 new SSR loci, 120 new microsatellite markers were developed in the BMc (Bean Microsatellites from cDNAs) series and these were evaluated for their capacity to distinguish bean diversity in a germplasm panel of 18 genotypes. We developed a database with images of the microsatellites and their polymorphism information content (PIC), which averaged 0.310 for polymorphic markers. The present study produced information about microsatellite frequency in root and leaf tissues of two important genotypes for common bean genomics: namely G19833, the Andean genotype selected for whole genome shotgun sequencing from race Peru, and DOR364 a race Mesoamerica subgroup 2 genotype that is a small-red seeded, released variety in Central America. Both race Peru and Mesoamerica subgroup 2 (small red beans) have been understudied in comparison to race Nueva
Exploiting BAC-end sequences for the mining, characterization and utility of new short sequences repeat (SSR) markers in Citrus.

Science.gov (United States)

Biswas, Manosh Kumar; Chai, Lijun; Mayer, Christoph; Xu, Qiang; Guo, Wenwu; Deng, Xiuxin

2012-05-01

The aim of this study was to develop a large set of microsatellite markers based on publicly available BAC-end sequences (BESs), and to evaluate their transferability, discriminating capacity of genotypes and mapping ability in Citrus. A set of 1,281 simple sequence repeat (SSR) markers were developed from the 46,339 Citrus clementina BAC-end sequences (BES), of them 20.67% contained SSR longer than 20 bp, corresponding to roughly one perfect SSR per 2.04 kb. The most abundant motifs were di-nucleotide (16.82%) repeats. Among all repeat motifs (TA/AT)n is the most abundant (8.38%), followed by (AG/CT)n (4.51%). Most of the BES-SSR are located in the non-coding region, but 1.3% of BES-SSRs were found to be associated with transposable element (TE). A total of 400 novel SSR primer pairs were synthesized and their transferability and polymorphism tested on a set of 16 Citrus and Citrus relative's species. Among these 333 (83.25%) were successfully amplified and 260 (65.00%) showed cross-species transferability with Poncirus trifoliata and Fortunella sp. These cross-species transferable markers could be useful for cultivar identification, for genomic study of Citrus, Poncirus and Fortunella sp. Utility of the developed SSR marker was demonstrated by identifying a set of 118 markers each for construction of linkage map of Citrus reticulata and Poncirus trifoliata. Genetic diversity and phylogenetic relationship among 40 Citrus and its related species were conducted with the aid of 25 randomly selected SSR primer pairs and results revealed that citrus genomic SSRs are superior to genic SSR for genetic diversity and germplasm characterization of Citrus spp.
Genetic variation, population structure and linkage disequilibrium in Switchgrass with ISSR, SCoT and EST-SSR markers.

Science.gov (United States)

Zhang, Yu; Yan, Haidong; Jiang, Xiaomei; Wang, Xiaoli; Huang, Linkai; Xu, Bin; Zhang, Xinquan; Zhang, Lexin

2016-01-01

To evaluate genetic variation, population structure, and the extent of linkage disequilibrium (LD), 134 switchgrass ( Panicum virgatum L.) samples were analyzed with 51 markers, including 16 ISSRs, 20 SCoTs, and 15 EST-SSRs. In this study, a high level of genetic variation was observed in the switchgrass samples and they had an average Nei's gene diversity index (H) of 0.311. A total of 793 bands were obtained, of which 708 (89.28 %) were polymorphic. Using a parameter marker index (MI), the efficiency of the three types of markers (ISSR, SCoT, and EST-SSR) in the study were compared and we found that SCoT had a higher marker efficiency than the other two markers. The 134 switchgrass samples could be divided into two sub-populations based on STRUCTURE, UPGMA clustering, and principal coordinate analyses (PCA), and upland and lowland ecotypes could be separated by UPGMA clustering and PCA analyses. Linkage disequilibrium analysis revealed an average r 2 of 0.035 across all 51 markers, indicating a trend of higher LD in sub-population 2 than that in sub-population 1 ( P < 0.01). The population structure revealed in this study will guide the design of future association studies using these switchgrass samples.
Simple sequence repeat (SSR) vs. sequence-related amplified polymorphism (SRAP) markers for Cynara cardunculus characterization

Energy Technology Data Exchange (ETDEWEB)

Casadevall, R.; Martin, E.; Cravero, V.

2011-07-01

A little is known about the genetic variability present in globe artichoke, cultivated and wild cardoons. This knowledge is very important for efficient genetic resources utilization, and to gain a better understanding of genetic structure of this botanical varieties. With the aims to determine genetic distances between Cynara cardunculus accessions and to compare two molecular markers systems for their efficiency to differ between botanical varieties, a molecular characterization of sixteen accessions from different geographical origins was performed. Seven SSR and seven SRAP markers were used for varieties characterization and to calculate genetic distances between them. Both distance matrices were subjected to cluster analysis. Exclusive SSR alleles were found for globe artichoke and for wild cardoon, but non exclusive alleles were found for cultivated cardoon. For both markers systems two major groups were identified, one of them included mostly globe artichoke accessions and the other one grouped mainly cardoons. The differences observed in the sub-cluster conformation with each marker systems may be due to intrinsic characteristics of the markers. Concluding, both kind of molecular markers are valuable tools for studying genetic distances between C. cardunculus accessions although they give different information. Nevertheless, SSR electrophoretic profiles are simpler to score than SRAP markers because they consist of just a few bands. As well, bands are highly informative because of the great number of alleles existing in population and they are codominant markers. In addition, SSRs use would reduce time and costs. (Author) 31 refs.

Kmer-SSR: a fast and exhaustive SSR search algorithm.

Science.gov (United States)

Pickett, Brandon D; Miller, Justin B; Ridge, Perry G

2017-12-15

One of the main challenges with bioinformatics software is that the size and complexity of datasets necessitate trading speed for accuracy, or completeness. To combat this problem of computational complexity, a plethora of heuristic algorithms have arisen that report a 'good enough' solution to biological questions. However, in instances such as Simple Sequence Repeats (SSRs), a 'good enough' solution may not accurately portray results in population genetics, phylogenetics and forensics, which require accurate SSRs to calculate intra- and inter-species interactions. We present Kmer-SSR, which finds all SSRs faster than most heuristic SSR identification algorithms in a parallelized, easy-to-use manner. The exhaustive Kmer-SSR option has 100% precision and 100% recall and accurately identifies every SSR of any specified length. To identify more biologically pertinent SSRs, we also developed several filters that allow users to easily view a subset of SSRs based on user input. Kmer-SSR, coupled with the filter options, accurately and intuitively identifies SSRs quickly and in a more user-friendly manner than any other SSR identification algorithm. The source code is freely available on GitHub at https://github.com/ridgelab/Kmer-SSR. perry.ridge@byu.edu. © The Author(s) 2017. Published by Oxford University Press.
De novo Assembly, Characterization of Immature Seed Transcriptome and Development of Genic-SSR Markers in Black Gram [Vigna mungo (L. Hepper].

Directory of Open Access Journals (Sweden)

J Souframanien

Full Text Available Black gram [V. mungo (L. Hepper] is an important legume crop extensively grown in south and south-east Asia, where it is a major source of dietary protein for its predominantly vegetarian population. However, lack of genomic information and markers has become a limitation for genetic improvement of this crop. Here, we report the transcriptome sequencing of the immature seeds of black gram cv. TU94-2, by Illumina paired end sequencing technology to generate transcriptome sequences for gene discovery and genic-SSR marker development. A total of 17.2 million paired-end reads were generated and 48,291 transcript contigs (TCS were assembled with an average length of 443 bp. Based on sequence similarity search, 33,766 TCS showed significant similarity to known proteins. Among these, only 29,564 TCS were annotated with gene ontology (GO functional categories. A total number of 138 unique KEGG (Kyoto Encyclopedia of Genes and Genomes pathways were identified, of which majority of TCS are grouped into purine metabolism (678 followed by pyrimidine metabolism (263. A total of 48,291 TCS were searched for SSRs and 1,840 SSRs were identified in 1,572 TCS with an average frequency of one SSR per 11.9 kb. The tri-nucleotide repeats were most abundant (35% followed by di-nucleotide repeats (32%. PCR primer pairs were successfully designed for 933 SSR loci. Sequences analyses indicate that about 64.4% and 35.6% of the SSR motifs were present in the coding sequences (CDS and untranslated regions (UTRs respectively. Tri-nucleotide repeats (57.3% were preferentially present in the CDS. The rate of successful amplification and polymorphism were investigated using selected primers among 18 black gram accessions. Genic-SSR markers developed from the Illumina paired end sequencing of black gram immature seed transcriptome will provide a valuable resource for genetic diversity, evolution, linkage mapping, comparative genomics and marker-assisted selection in black gram.
De novo Assembly, Characterization of Immature Seed Transcriptome and Development of Genic-SSR Markers in Black Gram [Vigna mungo (L.) Hepper

Science.gov (United States)

Souframanien, J.; Reddy, Kandali Sreenivasulu

2015-01-01

Black gram [V. mungo (L.) Hepper] is an important legume crop extensively grown in south and south-east Asia, where it is a major source of dietary protein for its predominantly vegetarian population. However, lack of genomic information and markers has become a limitation for genetic improvement of this crop. Here, we report the transcriptome sequencing of the immature seeds of black gram cv. TU94-2, by Illumina paired end sequencing technology to generate transcriptome sequences for gene discovery and genic-SSR marker development. A total of 17.2 million paired-end reads were generated and 48,291 transcript contigs (TCS) were assembled with an average length of 443 bp. Based on sequence similarity search, 33,766 TCS showed significant similarity to known proteins. Among these, only 29,564 TCS were annotated with gene ontology (GO) functional categories. A total number of 138 unique KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways were identified, of which majority of TCS are grouped into purine metabolism (678) followed by pyrimidine metabolism (263). A total of 48,291 TCS were searched for SSRs and 1,840 SSRs were identified in 1,572 TCS with an average frequency of one SSR per 11.9 kb. The tri-nucleotide repeats were most abundant (35%) followed by di-nucleotide repeats (32%). PCR primer pairs were successfully designed for 933 SSR loci. Sequences analyses indicate that about 64.4% and 35.6% of the SSR motifs were present in the coding sequences (CDS) and untranslated regions (UTRs) respectively. Tri-nucleotide repeats (57.3%) were preferentially present in the CDS. The rate of successful amplification and polymorphism were investigated using selected primers among 18 black gram accessions. Genic-SSR markers developed from the Illumina paired end sequencing of black gram immature seed transcriptome will provide a valuable resource for genetic diversity, evolution, linkage mapping, comparative genomics and marker-assisted selection in black gram. PMID
Characterization of functional SSR markers in Prosopis alba and their transferability across Prosopis species

Directory of Open Access Journals (Sweden)

María F. Pomponio

2015-08-01

Full Text Available Aim of study: The aim of the study was to characterize functional microsatellite markers in Prosopis alba and examine the transferability to species from the Prosopis genus. Area of the study: samples were obtained from natural populations of Argentina. Material and Methods: Eleven SSR functional markers related to stress and metabolism were amplified in a sample of 152 genotypes from P.alba, P. denudans, P. hassleriP. chilensis, P. flexuosa, and interspecific hybrids. Main results: In P. alba, the PIC average value was 0.36; and 6 out of the 11 primers showed high values of polymorphism ranging from 0.40 to 0.71. The cross-species transferability was high with high percentages of polymorphic loci. Research highlights: The SSR markers developed in P.alba were easily transferred to other Prosopis species which did not have functional markers.
Development of genomic SSR markers for fingerprinting lettuce (Lactuca sativa L.) cultivars and mapping genes.

Science.gov (United States)

Rauscher, Gilda; Simko, Ivan

2013-01-22

Lettuce (Lactuca sativa L.) is the major crop from the group of leafy vegetables. Several types of molecular markers were developed that are effectively used in lettuce breeding and genetic studies. However only a very limited number of microsattelite-based markers are publicly available. We have employed the method of enriched microsatellite libraries to develop 97 genomic SSR markers. Testing of newly developed markers on a set of 36 Lactuca accession (33 L. sativa, and one of each L. serriola L., L. saligna L., and L. virosa L.) revealed that both the genetic heterozygosity (UHe = 0.56) and the number of loci per SSR (Na = 5.50) are significantly higher for genomic SSR markers than for previously developed EST-based SSR markers (UHe = 0.32, Na = 3.56). Fifty-four genomic SSR markers were placed on the molecular linkage map of lettuce. Distribution of markers in the genome appeared to be random, with the exception of possible cluster on linkage group 6. Any combination of 32 genomic SSRs was able to distinguish genotypes of all 36 accessions. Fourteen of newly developed SSR markers originate from fragments with high sequence similarity to resistance gene candidates (RGCs) and RGC pseudogenes. Analysis of molecular variance (AMOVA) of L. sativa accessions showed that approximately 3% of genetic diversity was within accessions, 79% among accessions, and 18% among horticultural types. The newly developed genomic SSR markers were added to the pool of previously developed EST-SSRs markers. These two types of SSR-based markers provide useful tools for lettuce cultivar fingerprinting, development of integrated molecular linkage maps, and mapping of genes.
Development of genic SSR markers from transcriptome sequencing of pear buds.

Science.gov (United States)

Yue, Xiao-yan; Liu, Guo-qin; Zong, Yu; Teng, Yuan-wen; Cai, Dan-ying

2014-04-01

A total of 8375 genic simple sequence repeat (SSR) loci were discovered from a unigene set assembled from 116282 transcriptomic unigenes in this study. Dinucleotide repeat motifs were the most common with a frequency of 65.11%, followed by trinucleotide (32.81%). A total of 4100 primer pairs were designed from the SSR loci. Of these, 343 primer pairs (repeat length ≥15 bp) were synthesized with an M13 tail and tested for stable amplification and polymorphism in four Pyrus accessions. After the preliminary test, 104 polymorphic genic SSR markers were developed; dinucleotide and trinucleotide repeats represented 97.11% (101) of these. Twenty-eight polymorphic genic SSR markers were selected randomly to further validate genetic diversity among 28 Pyrus accessions. These markers displayed a high level of polymorphism. The number of alleles at these SSR loci ranged from 2 to 17, with a mean of 9.43 alleles per locus, and the polymorphism information content (PIC) values ranged from 0.26 to 0.91. The UPGMA (unweighted pair-group method with arithmetic average) cluster analysis grouped the 28 Pyrus accessions into two groups: Oriental pears and Occidental pears, which are congruent to the traditional taxonomy, demonstrating their effectiveness in analyzing Pyrus phylogenetic relationships, enriching rare Pyrus EST-SSR resources, and confirming the potential value of a pear transcriptome database for the development of new SSR markers.
galaxieEST: addressing EST identity through automated phylogenetic analysis.

Science.gov (United States)

Nilsson, R Henrik; Rajashekar, Balaji; Larsson, Karl-Henrik; Ursing, Björn M

2004-07-05

Research involving expressed sequence tags (ESTs) is intricately coupled to the existence of large, well-annotated sequence repositories. Comparatively complete and satisfactory annotated public sequence libraries are, however, available only for a limited range of organisms, rendering the absence of sequences and gene structure information a tangible problem for those working with taxa lacking an EST or genome sequencing project. Paralogous genes belonging to the same gene family but distinguished by derived characteristics are particularly prone to misidentification and erroneous annotation; high but incomplete levels of sequence similarity are typically difficult to interpret and have formed the basis of many unsubstantiated assumptions of orthology. In these cases, a phylogenetic study of the query sequence together with the most similar sequences in the database may be of great value to the identification process. In order to facilitate this laborious procedure, a project to employ automated phylogenetic analysis in the identification of ESTs was initiated. galaxieEST is an open source Perl-CGI script package designed to complement traditional similarity-based identification of EST sequences through employment of automated phylogenetic analysis. It uses a series of BLAST runs as a sieve to retrieve nucleotide and protein sequences for inclusion in neighbour joining and parsimony analyses; the output includes the BLAST output, the results of the phylogenetic analyses, and the corresponding multiple alignments. galaxieEST is available as an on-line web service for identification of fungal ESTs and for download / local installation for use with any organism group at http://galaxie.cgb.ki.se/galaxieEST.html. By addressing sequence relatedness in addition to similarity, galaxieEST provides an integrative view on EST origin and identity, which may prove particularly useful in cases where similarity searches return one or more pertinent, but not full, matches and
(SSR) markers for analysis of genetic diversity in African rice ...

African Journals Online (AJOL)

Bonny Oloka

2015-05-06

May 6, 2015 ... and conservation. To address this knowledge gap, 10 highly polymorphic rice simple sequence repeat. (SSR) markers were used to characterize 99 rice genotypes to determine their diversity and place them in their different population groups. The SSR markers were multiplexed in 3 panels to increase their.
Transcriptome analysis of the desert locust central nervous system: production and annotation of a Schistocerca gregaria EST database.

Science.gov (United States)

Badisco, Liesbeth; Huybrechts, Jurgen; Simonet, Gert; Verlinden, Heleen; Marchal, Elisabeth; Huybrechts, Roger; Schoofs, Liliane; De Loof, Arnold; Vanden Broeck, Jozef

2011-03-21

The desert locust (Schistocerca gregaria) displays a fascinating type of phenotypic plasticity, designated as 'phase polyphenism'. Depending on environmental conditions, one genome can be translated into two highly divergent phenotypes, termed the solitarious and gregarious (swarming) phase. Although many of the underlying molecular events remain elusive, the central nervous system (CNS) is expected to play a crucial role in the phase transition process. Locusts have also proven to be interesting model organisms in a physiological and neurobiological research context. However, molecular studies in locusts are hampered by the fact that genome/transcriptome sequence information available for this branch of insects is still limited. We have generated 34,672 raw expressed sequence tags (EST) from the CNS of desert locusts in both phases. These ESTs were assembled in 12,709 unique transcript sequences and nearly 4,000 sequences were functionally annotated. Moreover, the obtained S. gregaria EST information is highly complementary to the existing orthopteran transcriptomic data. Since many novel transcripts encode neuronal signaling and signal transduction components, this paper includes an overview of these sequences. Furthermore, several transcripts being differentially represented in solitarious and gregarious locusts were retrieved from this EST database. The findings highlight the involvement of the CNS in the phase transition process and indicate that this novel annotated database may also add to the emerging knowledge of concomitant neuronal signaling and neuroplasticity events. In summary, we met the need for novel sequence data from desert locust CNS. To our knowledge, we hereby also present the first insect EST database that is derived from the complete CNS. The obtained S. gregaria EST data constitute an important new source of information that will be instrumental in further unraveling the molecular principles of phase polyphenism, in further establishing
Transcriptome analysis of the desert locust central nervous system: production and annotation of a Schistocerca gregaria EST database.

Directory of Open Access Journals (Sweden)

Liesbeth Badisco

Full Text Available BACKGROUND: The desert locust (Schistocerca gregaria displays a fascinating type of phenotypic plasticity, designated as 'phase polyphenism'. Depending on environmental conditions, one genome can be translated into two highly divergent phenotypes, termed the solitarious and gregarious (swarming phase. Although many of the underlying molecular events remain elusive, the central nervous system (CNS is expected to play a crucial role in the phase transition process. Locusts have also proven to be interesting model organisms in a physiological and neurobiological research context. However, molecular studies in locusts are hampered by the fact that genome/transcriptome sequence information available for this branch of insects is still limited. METHODOLOGY: We have generated 34,672 raw expressed sequence tags (EST from the CNS of desert locusts in both phases. These ESTs were assembled in 12,709 unique transcript sequences and nearly 4,000 sequences were functionally annotated. Moreover, the obtained S. gregaria EST information is highly complementary to the existing orthopteran transcriptomic data. Since many novel transcripts encode neuronal signaling and signal transduction components, this paper includes an overview of these sequences. Furthermore, several transcripts being differentially represented in solitarious and gregarious locusts were retrieved from this EST database. The findings highlight the involvement of the CNS in the phase transition process and indicate that this novel annotated database may also add to the emerging knowledge of concomitant neuronal signaling and neuroplasticity events. CONCLUSIONS: In summary, we met the need for novel sequence data from desert locust CNS. To our knowledge, we hereby also present the first insect EST database that is derived from the complete CNS. The obtained S. gregaria EST data constitute an important new source of information that will be instrumental in further unraveling the molecular
76 FR 68243 - Social Security Rulings, SSR 91-1c and SSR 66-18c; Rescission of Social Security Rulings (SSR) 66...

Science.gov (United States)

2011-11-03

..., Social Security Online, at http://www.socialsecurity.gov . SUPPLEMENTARY INFORMATION: SSRs make available... SOCIAL SECURITY ADMINISTRATION [Docket No. SSA-2011-0068] Social Security Rulings, SSR 91-1c and SSR 66-18c; Rescission of Social Security Rulings (SSR) 66-18c and SSR 91-1c AGENCY: Social Security...
Wheat EST resources for functional genomics of abiotic stress

Directory of Open Access Journals (Sweden)

Links Matthew G

2006-06-01

Full Text Available Abstract Background Wheat is an excellent species to study freezing tolerance and other abiotic stresses. However, the sequence of the wheat genome has not been completely characterized due to its complexity and large size. To circumvent this obstacle and identify genes involved in cold acclimation and associated stresses, a large scale EST sequencing approach was undertaken by the Functional Genomics of Abiotic Stress (FGAS project. Results We generated 73,521 quality-filtered ESTs from eleven cDNA libraries constructed from wheat plants exposed to various abiotic stresses and at different developmental stages. In addition, 196,041 ESTs for which tracefiles were available from the National Science Foundation wheat EST sequencing program and DuPont were also quality-filtered and used in the analysis. Clustering of the combined ESTs with d2_cluster and TGICL yielded a few large clusters containing several thousand ESTs that were refractory to routine clustering techniques. To resolve this problem, the sequence proximity and "bridges" were identified by an e-value distance graph to manually break clusters into smaller groups. Assembly of the resolved ESTs generated a 75,488 unique sequence set (31,580 contigs and 43,908 singletons/singlets. Digital expression analyses indicated that the FGAS dataset is enriched in stress-regulated genes compared to the other public datasets. Over 43% of the unique sequence set was annotated and classified into functional categories according to Gene Ontology. Conclusion We have annotated 29,556 different sequences, an almost 5-fold increase in annotated sequences compared to the available wheat public databases. Digital expression analysis combined with gene annotation helped in the identification of several pathways associated with abiotic stress. The genomic resources and knowledge developed by this project will contribute to a better understanding of the different mechanisms that govern stress tolerance in
Development and Testing of New Gene-Homologous EST-SSRs for Eucalyptus gomphocephala (Myrtaceae

Directory of Open Access Journals (Sweden)

Donna Bradbury

2013-07-01

Full Text Available Premise of the study: New microsatellite (simple sequence repeat [SSR] primers were developed from Eucalyptus expressed sequence tags (ESTs and optimized for genetic studies of the southwestern Australian tree E. gomphocephala, which is severely impacted by tree health decline and habitat fragmentation. Methods and Results: A total of 133 gene-homologous EST-SSR primer pairs were designed for Eucalyptus, and 44 were screened in E. gomphocephala. Of these, 17 produced reliable amplification products and 11 were polymorphic. Between two and 13 alleles were observed per locus, and observed heterozygosities ranged from 0.172 to 0.867. All 17 EST-SSRs that amplified E. gomphocephala cross-amplified to at least one of E. marginata, E. camaldulensis, and E. victrix. Conclusions: This set of EST-SSR primer pairs will be valuable tools for future population genetic studies of E. gomphocephala and other eucalypts, particularly for studying gene-linked variation and informing seed-sourcing strategies for ecological restoration.
Characterization of functional SSR markers in Prosopis alba and their transferability across Prosopis species

OpenAIRE

María F. Pomponio; Cintia Acuña; Vivien Pentreath; Diego L. Lauenstein; Susana M. Poltri; Susana Torales

2015-01-01

Aim of study: The aim of the study was to characterize functional microsatellite markers in Prosopis alba and examine the transferability to species from the Prosopis genus. Area of the study: samples were obtained from natural populations of Argentina. Material and Methods: Eleven SSR functional markers related to stress and metabolism were amplified in a sample of 152 genotypes from P.alba, P. denudans, P. hassleriP. chilensis, P. flexuosa, and interspecific hybrids. Main res...
Identification and characterization of salt responsive miRNA-SSR markers in rice (Oryza sativa).

Science.gov (United States)

Mondal, Tapan Kumar; Ganie, Showkat Ahmad

2014-02-10

Salinity is an important abiotic stress that affects agricultural production and productivity. It is a complex trait that is regulated by different molecular mechanisms. miRNAs are non-coding RNAs which are highly conserved and regulate gene expression. Simple sequence repeats (SSRs) are robust molecular markers for studying genetic diversity. Although several SSR markers are available now, challenge remains to identify the trait-specific SSRs which can be used for marker assisted breeding. In order to understand the genetic diversity of salt responsive-miRNA genes in rice, SSR markers were mined from 130 members of salt-responsive miRNA genes of rice and validated among the contrasting panels of tolerant as well as susceptible rice genotypes, each with 12 genotypes. Although 12 miR-SSRs were found to be polymorphic, only miR172b-SSR was able to differentiate the tolerant and susceptible genotypes in 2 different groups. It had also been found that miRNA genes were more diverse in susceptible genotypes than the tolerant one (as indicated by polymorphic index content) which might interfere to form the stem-loop structure of premature miRNA and their subsequent synthesis in susceptible genotypes. Thus, we concluded that length variations of the repeats in salt responsive miRNA genes may be responsible for a possible sensitivity to salinity adaptation. This is the first report of characterization of trait specific miRNA derived SSRs in plants. Copyright © 2013 Elsevier B.V. All rights reserved.
Improved annotation of 3' untranslated regions and complex loci by combination of strand-specific direct RNA sequencing, RNA-Seq and ESTs.

Directory of Open Access Journals (Sweden)

Nicholas J Schurch

Full Text Available The reference annotations made for a genome sequence provide the framework for all subsequent analyses of the genome. Correct and complete annotation in addition to the underlying genomic sequence is particularly important when interpreting the results of RNA-seq experiments where short sequence reads are mapped against the genome and assigned to genes according to the annotation. Inconsistencies in annotations between the reference and the experimental system can lead to incorrect interpretation of the effect on RNA expression of an experimental treatment or mutation in the system under study. Until recently, the genome-wide annotation of 3' untranslated regions received less attention than coding regions and the delineation of intron/exon boundaries. In this paper, data produced for samples in Human, Chicken and A. thaliana by the novel single-molecule, strand-specific, Direct RNA Sequencing technology from Helicos Biosciences which locates 3' polyadenylation sites to within +/- 2 nt, were combined with archival EST and RNA-Seq data. Nine examples are illustrated where this combination of data allowed: (1 gene and 3' UTR re-annotation (including extension of one 3' UTR by 5.9 kb; (2 disentangling of gene expression in complex regions; (3 clearer interpretation of small RNA expression and (4 identification of novel genes. While the specific examples displayed here may become obsolete as genome sequences and their annotations are refined, the principles laid out in this paper will be of general use both to those annotating genomes and those seeking to interpret existing publically available annotations in the context of their own experimental data.
Development and characterization of polymorphic genomic-SSR markers in Asian long-horned beetle (Anoplophora glabripennis).

Science.gov (United States)

Liu, Zhaoyang; Tao, Jing; Luo, Youqing

2017-12-01

The Asian long-horned beetle (ALB), Anoplophora glabripennis (Motschulsky) (Coleoptera: Cerambycidae: Lamiinae), is a wood-borer and polyphagous xylophage that is native to Asia. It infests and seriously harms healthy trees, and therefore is a cause for considerable environmental concern. The analysis of population genetic structure of ALB and sibling species Anoplophora nobilis (Ganglbauer) will not only help to clarify the relationship between environmental variables and mechanisms of speciation, but also will enhance our understanding of evolutionary processes. However, the known genetic markers, particularly microsatellites, are limited for this species. SSRLocator software was used to analyze the distribution and frequencies of genomic simple sequence repeat (SSR), to infer the basic characteristics of repeat motifs, and to design primers. We developed SSR loci of 2-6 repeated units, including 10,650 perfect SSRs, and found 140 types of repeat motifs. A total of 2621 SSR markers were discovered in ALB whole-genome shotgun sequences. 48 pairs of SSR primers were randomly chosen from 2621 SSR markers, and half of these 48 pairs were polymorphic containing 4 di-, 7 tri-, 2 tetra-, and 11-hexamer SSRs. Four populations test the effectiveness of the primers. These results suggest that our method for whole-genome SSR screening is feasible and efficient, and the SSR markers developed in this study are suitable for further population genetics studies of ALB. Moreover, they may also be useful for the development of SSRs for other Coleoptera.
Transcriptome Sequencing and Development of Genic SSR Markers of an Endangered Chinese Endemic Genus Dipteronia Oliver (Aceraceae).

Science.gov (United States)

Zhou, Tao; Li, Zhong-Hu; Bai, Guo-Qing; Feng, Li; Chen, Chen; Wei, Yue; Chang, Yong-Xia; Zhao, Gui-Fang

2016-02-23

Dipteronia Oliver (Aceraceae) is an endangered Chinese endemic genus consisting of two living species, Dipteronia sinensis and Dipteronia dyeriana. However, studies on the population genetics and evolutionary analyses of Dipteronia have been hindered by limited genomic resources and genetic markers. Here, the generation, de novo assembly and annotation of transcriptome datasets, and a large set of microsatellite or simple sequence repeat (SSR) markers derived from Dipteronia have been described. After Illumina pair-end sequencing, approximately 93.2 million reads were generated and assembled to yield a total of 99,358 unigenes. A majority of these unigenes (53%, 52,789) had at least one blast hit against the public protein databases. Further, 12,377 SSR loci were detected and 4179 primer pairs were designed for experimental validation. Of these 4179 primer pairs, 435 primer pairs were randomly selected to test polymorphism. Our results show that products from 132 primer pairs were polymorphic, in which 97 polymorphic SSR markers were further selected to analyze the genetic diversity of 10 natural populations of Dipteronia. The identification of SSR markers during our research will provide the much valuable data for population genetic analyses and evolutionary studies in Dipteronia.
Development and Characterization of Microsatellite Markers from the Transcriptome of Firmiana danxiaensis (Malvaceae s.l.

Directory of Open Access Journals (Sweden)

Qiang Fan

2013-11-01

Full Text Available Premise of the study: Firmiana consists of 12–16 species, many of which are narrow endemics. Expressed sequence tag (EST–simple sequence repeat (SSR markers were developed and characterized for size polymorphism in four Firmiana species. Methods and Results: A total of 102 EST-SSR primer pairs were designed based on the transcriptome sequences of F. danxiaensis; these were then characterized in four Firmiana species—F. danxiaensis, F. kwangsiensis, F. hainanensis, and F. simplex. In these four species, 17 primer pairs were successfully amplified, and 14 were polymorphic in at least one species. The number of alleles ranged from one to 13, and the observed and expected heterozygosities ranged from 0 to 1 and 0 to 0.925, respectively. The lowest level of polymorphism was observed in F. danxiaensis. Conclusions: These polymorphic EST-SSR markers are valuable for conservation genetics studies in the endangered Firmiana species.
Identification and characterization of gene-based SSR markers in date palm (Phoenix dactylifera L.

Directory of Open Access Journals (Sweden)

Zhao Yongli

2012-12-01

Full Text Available Abstract Background Date palm (Phoenix dactylifera L. is an important tree in the Middle East and North Africa due to the nutritional value of its fruit. Molecular Breeding would accelerate genetic improvement of fruit tree through marker assisted selection. However, the lack of molecular markers in date palm restricts the application of molecular breeding. Results In this study, we analyzed 28,889 EST sequences from the date palm genome database to identify simple-sequence repeats (SSRs and to develop gene-based markers, i.e. expressed sequence tag-SSRs (EST-SSRs. We identified 4,609 ESTs as containing SSRs, among which, trinucleotide motifs (69.7% were the most common, followed by tetranucleotide (10.4% and dinucleotide motifs (9.6%. The motif AG (85.7% was most abundant in dinucleotides, while motifs AGG (26.8%, AAG (19.3%, and AGC (16.1% were most common among trinucleotides. A total of 4,967 primer pairs were designed for EST-SSR markers from the computational data. In a follow up laboratory study, we tested a sample of 20 random selected primer pairs for amplification and polymorphism detection using genomic DNA from date palm cultivars. Nearly one-third of these primer pairs detected DNA polymorphism to differentiate the twelve date palm cultivars used. Functional categorization of EST sequences containing SSRs revealed that 3,108 (67.4% of such ESTs had homology with known proteins. Conclusion Date palm EST sequences exhibits a good resource for developing gene-based markers. These genic markers identified in our study may provide a valuable genetic and genomic tool for further genetic research and varietal development in date palm, such as diversity study, QTL mapping, and molecular breeding.

Annotation of novel neuropeptide precursors in the migratory locust based on transcript screening of a public EST database and mass spectrometry

Directory of Open Access Journals (Sweden)

De Loof Arnold

2006-08-01

Full Text Available Abstract Background For holometabolous insects there has been an explosion of proteomic and peptidomic information thanks to large genome sequencing projects. Heterometabolous insects, although comprising many important species, have been far less studied. The migratory locust Locusta migratoria, a heterometabolous insect, is one of the most infamous agricultural pests. They undergo a well-known and profound phase transition from the relatively harmless solitary form to a ferocious gregarious form. The underlying regulatory mechanisms of this phase transition are not fully understood, but it is undoubtedly that neuropeptides are involved. However, neuropeptide research in locusts is hampered by the absence of genomic information. Results Recently, EST (Expressed Sequence Tag databases from Locusta migratoria were constructed. Using bioinformatical tools, we searched these EST databases specifically for neuropeptide precursors. Based on known locust neuropeptide sequences, we confirmed the sequence of several previously identified neuropeptide precursors (i.e. pacifastin-related peptides, which consolidated our method. In addition, we found two novel neuroparsin precursors and annotated the hitherto unknown tachykinin precursor. Besides one of the known tachykinin peptides, this EST contained an additional tachykinin-like sequence. Using neuropeptide precursors from Drosophila melanogaster as a query, we succeeded in annotating the Locusta neuropeptide F, allatostatin-C and ecdysis-triggering hormone precursor, which until now had not been identified in locusts or in any other heterometabolous insect. For the tachykinin precursor, the ecdysis-triggering hormone precursor and the allatostatin-C precursor, translation of the predicted neuropeptides in neural tissues was confirmed with mass spectrometric techniques. Conclusion In this study we describe the annotation of 6 novel neuropeptide precursors and the neuropeptides they encode from the
De novo characterization of Larimichthys crocea transcriptome for growth-/immune-related gene identification and massive microsatellite (SSR) marker development

Science.gov (United States)

Han, Zhaofang; Xiao, Shijun; Liu, Xiande; Liu, Yang; Li, Jiakai; Xie, Yangjie; Wang, Zhiyong

2017-03-01

The large yellow croaker, Larimichthys crocea is an important marine fish in China with a high economic value. In the last decade, the stock conservation and aquaculture industry of this species have been facing severe challenges because of wild population collapse and degeneration of important economic traits. However, genes contributing to growth and immunity in L. crocea have not been thoroughly analyzed, and available molecular markers are still not sufficient for genetic resource management and molecular selection. In this work, we sequenced the transcriptome in L. crocea liver tissue with a Roche 454 sequencing platform and assembled the transcriptome into 93 801 transcripts. Of them, 38 856 transcripts were successfully annotated in nt, nr, Swiss-Prot, InterPro, COG, GO and KEGG databases. Based on the annotation information, 3 165 unigenes related to growth and immunity were identified. Additionally, a total of 6 391 simple sequence repeats (SSRs) were identified from the transcriptome, among which 4 498 SSRs had enough flanking regions to design primers for polymerase chain reactions (PCR). To access the polymorphism of these markers, 30 primer pairs were randomly selected for PCR amplification and validation in 30 individuals, and 12 primer pairs (40.0%) exhibited obvious length polymorphisms. This work applied RNA-Seq to assemble and analyze a live transcriptome in L. crocea. With gene annotation and sequence information, genes related to growth and immunity were identified and massive SSR markers were developed, providing valuable genetic resources for future gene functional analysis and selective breeding of L. crocea.
Development and Molecular Characterization of Novel Polymorphic Genomic DNA SSR Markers in Lentinula edodes.

Science.gov (United States)

Moon, Suyun; Lee, Hwa-Yong; Shim, Donghwan; Kim, Myungkil; Ka, Kang-Hyeon; Ryoo, Rhim; Ko, Han-Gyu; Koo, Chang-Duck; Chung, Jong-Wook; Ryu, Hojin

2017-06-01

Sixteen genomic DNA simple sequence repeat (SSR) markers of Lentinula edodes were developed from 205 SSR motifs present in 46.1-Mb long L. edodes genome sequences. The number of alleles ranged from 3-14 and the major allele frequency was distributed from 0.17-0.96. The values of observed and expected heterozygosity ranged from 0.00-0.76 and 0.07-0.90, respectively. The polymorphic information content value ranged from 0.07-0.89. A dendrogram, based on 16 SSR markers clustered by the paired hierarchical clustering' method, showed that 33 shiitake cultivars could be divided into three major groups and successfully identified. These SSR markers will contribute to the efficient breeding of this species by providing diversity in shiitake varieties. Furthermore, the genomic information covered by the markers can provide a valuable resource for genetic linkage map construction, molecular mapping, and marker-assisted selection in the shiitake mushroom.
Analysis of SSR information in EST resources of sugarcane

Science.gov (United States)

Expressed sequence tags ( ESTs) offer the opportunity to exploit single, low -copy, conserved sequence motifs for the development of simple sequence repeats ( SSRs). The total of 262 113 ESTs of sugarcane (Saccharum officinarum) in the database of NCBI were downloaded and analyzed, which resulted in...
Leaf Transcriptome Sequencing for Identifying Genic-SSR Markers and SNP Heterozygosity in Crossbred Mango Variety 'Amrapali' (Mangifera indica L.).

Science.gov (United States)

Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar

2016-01-01

Mango (Mangifera indica L.) is called "king of fruits" due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties 'Neelam', 'Dashehari' and their hybrid 'Amrapali' using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango.
Construction of new EST-SSRs for Fusarium resistant wheat breeding.

Science.gov (United States)

Yumurtaci, Aysen; Sipahi, Hulya; Al-Abdallat, Ayed; Jighly, Abdulqader; Baum, Michael

2017-06-01

Surveying Fusarium resistance in wheat with easy applicable molecular markers such as simple sequence repeats (SSRs) is a prerequest for molecular breeding. Expressed sequence tags (ESTs) are one of the main sources for development of new SSR candidates. Therefore, 18.292 publicly available wheat ESTs were mined and genotyping of newly developed 55 EST-SSR derived primer pairs produced clear fragments in ten wheat cultivars carrying different levels of Fusarium resistance. Among the proved markers, 23 polymorphic EST-SSRs were obtained and related alleles were mostly found on B and D genome. Based on the fragment profiling and similarity analysis, a 327bp amplicon, which was a product of contig 1207 (chromosome 5BL), was detected only in Fusarium head blight (FHB) resistant cultivars (CM82036 and Sumai) and the amino acid sequences showed a similarity to pathogen related proteins. Another FHB resistance related EST-SSR, Contig 556 (chromosome 1BL) produced a 151bp fragment in Sumai and was associated to wax2-like protein. A polymorphic 204bp fragment, derived from Contig 578 (chromosome 1DL), was generated from root rot (FRR) resistant cultivars (2-49; Altay2000 and Sunco). A total of 98 alleles were displayed with an average of 1.8 alleles per locus and the polymorphic information content (PIC) ranged from 0.11 to 0.78. Dendrogram tree with two main and five sub-groups were displayed the highest genetic relationship between FRR resistant cultivars (2-49 and Altay2000), FRR sensitive cultivars (Seri82 and Scout66) and FHB resistant cultivars (CM82036 and Sumai). Thus, exploitation of these candidate EST-SSRs may help to genotype other wheat sources for Fusarium resistance. Copyright © 2017 Elsevier Ltd. All rights reserved.
Leaf Transcriptome Sequencing for Identifying Genic-SSR Markers and SNP Heterozygosity in Crossbred Mango Variety ‘Amrapali’ (Mangifera indica L.)

Science.gov (United States)

Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar

2016-01-01

Mango (Mangifera indica L.) is called “king of fruits” due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties ‘Neelam’, ‘Dashehari’ and their hybrid ‘Amrapali’ using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango. PMID:27736892
Identification, validation and cross-species transferability of novel Lavandula EST-SSRs.

Science.gov (United States)

Adal, Ayelign M; Demissie, Zerihun A; Mahmoud, Soheil S

2015-04-01

We identified and characterized EST-SSRs with strong discrimination power against Lavandula angustifolia and Lavandula x intermedia . The markers also showed considerable cross-species transferability rate into six related Lavandula species. Lavenders (Lavandula) are important economical crops grown around the globe for essential oil production. In an attempt to develop genetic markers for these plants, we analyzed over 13,000 unigenes developed from L. angustifolia and L. x intermedia EST databases, and identified 3,459 simple sequence repeats (SSR), which were dominated by trinucleotides (41.2 %) and dinucleotides (31.45 %). Approximately, 19 % of the unigenes contained at least one SSR marker, over 60 % of which were localized in the UTRs. Only 252 EST-SSRs were 18 bp or longer from which 31 loci were validated, and 24 amplified discrete fragments with 85 % polymorphism in L. x intermedia and L. angustifolia. The average number of alleles in L. x intermedia and L. angustifolia were 3.42 and 3.71 per marker with average PIC values of 0.47 and 0.52, respectively. These values suggest a moderate to strong level of informativeness for the markers, with some loci producing unique fingerprints. The cross-species transferability rate of the markers ranges 50-100 % across eight species. The utility of these markers was assessed in eight Lavandula species and 15 L. angustifolia and L. x intermedia cultivars, and the dendrogram deduced from their similarity indexes successfully delineated the species into their respective sections and the cultivars into their respective species. These markers have potential for application in fingerprinting, diversity studies and marker-assisted breeding of Lavandula.
Improvement of SSR core design for ABWR-II

International Nuclear Information System (INIS)

Moriwaki, Masanao; Aoyama, Motoo; Okada, Hiroyuki; Kitamura, Hideya; Sakurada, Koichi; Tanabe, Akira

2003-01-01

In order to enhance the spectral shift effect in the ABWR-II reactor, a novel core design to bring out better performance of spectral shift rods (SSRs) is studied. The SSR is a new type of water rod, in which the water level develops naturally during operation and changes according to the coolant flow rate through the channel. By using the SSR, the average moderator density, which is directly related to core reactivity, can be controlled over a wide range by the core flow rate. In the new SSR core design, two types of SSR bundles, in which settings for the SSR water levels are different, are utilized and loaded according to flow distribution in the core. This two-region SSR core design allows wide variation in the average SSR water level, thus improving fuel economy. Enhancement of SSR function in the two-region SSR core increases the uranium saving factor by about 25%, from the 6% of the conventional uniform SSR core to about 8%. (author)
SSR and morphological trait based population structure analysis of 130 diverse flax (Linum usitatissimum L.) accessions.

Science.gov (United States)

Choudhary, Shashi Bhushan; Sharma, Hariom Kumar; Kumar, Arroju Anil; Maruthi, Rangappa Thimmaiah; Mitra, Jiban; Chowdhury, Isholeena; Singh, Binay Kumar; Karmakar, Pran Gobinda

2017-02-01

A total of 130 flax accessions of diverse morphotypes and worldwide origin were assessed for genetic diversity and population structure using 11 morphological traits and microsatellite markers (15 gSSRs and 7 EST-SSRs). Analysis performed after classifying these accessions on the basis of plant height, branching pattern, seed size, Indian/foreign origin into six categories called sub-populations viz. fibre type exotic, fibre type indigenous, intermediate type exotic, intermediate type indigenous, linseed type exotic and linseed type indigenous. The study assessed different diversity indices, AMOVA, population structure and included a principal coordinate analysis based on different marker systems. The highest diversity was exhibited by gSSR markers (SI=0.46; He=0.31; P=85.11). AMOVA based on all markers explained significant difference among fibre type, intermediate type and linseed type populations of flax. In terms of variation explained by different markers, EST-SSR markers (12%) better differentiated flax populations compared to morphological (9%) and gSSR (6%) markers at P=0.01. The maximum Nei's unbiased genetic distance (D=0.11) was observed between fibre type and linseed type exotic sub-populations based on EST-SSR markers. The combined structure analysis by using all markers grouped Indian fibre type accessions (63.4%) in a separate cluster along with the Indian intermediate type (48.7%), whereas Indian accessions (82.16%) of linseed type constituted an independent cluster. These findings were supported by the results of the principal coordinate analysis. Morphological markers employed in the study found complementary with microsatellite based markers in deciphering genetic diversity and population structure of the flax germplasm. Copyright © 2016 Académie des sciences. Published by Elsevier Masson SAS. All rights reserved.
The use of sequence-based SSR mining for the development of a vast collection of microsatellites in Aquilegia Formosa

Science.gov (United States)

Brandon Schlautman; Vera Pfeiffer; Juan Zalapa; Johanne Brunet

2014-01-01

Numerous microsatellite markers were developed for Aquilegia formosafrom sequences deposited within the Expressed Sequence Tag (EST), Genomic Survey Sequence (GSS), and Nucleotide databases in NCBI. Microsatellites (SSRs) were identified and primers were designed for 9 SSR containing sequences in the Nucleotide database, 3803 sequences in the EST...
SA-SSR: a suffix array-based algorithm for exhaustive and efficient SSR discovery in large genetic sequences.

Science.gov (United States)

Pickett, B D; Karlinsey, S M; Penrod, C E; Cormier, M J; Ebbert, M T W; Shiozawa, D K; Whipple, C J; Ridge, P G

2016-09-01

Simple Sequence Repeats (SSRs) are used to address a variety of research questions in a variety of fields (e.g. population genetics, phylogenetics, forensics, etc.), due to their high mutability within and between species. Here, we present an innovative algorithm, SA-SSR, based on suffix and longest common prefix arrays for efficiently detecting SSRs in large sets of sequences. Existing SSR detection applications are hampered by one or more limitations (i.e. speed, accuracy, ease-of-use, etc.). Our algorithm addresses these challenges while being the most comprehensive and correct SSR detection software available. SA-SSR is 100% accurate and detected >1000 more SSRs than the second best algorithm, while offering greater control to the user than any existing software. SA-SSR is freely available at http://github.com/ridgelab/SA-SSR CONTACT: perry.ridge@byu.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Novel expressed sequence tag- simple sequence repeats (EST ...

African Journals Online (AJOL)

Using different bioinformatic criteria, the SUCEST database was used to mine for simple sequence repeat (SSR) markers. Among 42,189 clusters, 1,425 expressed sequence tag- simple sequence repeats (EST-SSRs) were identified in silico. Trinucleotide repeats were the most abundant SSRs detected. Of 212 primer pairs ...
Two EST-derived marker systems for cultivar identification in tree peony.

Science.gov (United States)

Zhang, J J; Shu, Q Y; Liu, Z A; Ren, H X; Wang, L S; De Keyser, E

2012-02-01

Tree peony (Paeonia suffruticosa Andrews), a woody deciduous shrub, belongs to the section Moutan DC. in the genus of Paeonia of the Paeoniaceae family. To increase the efficiency of breeding, two EST-derived marker systems were developed based on a tree peony expressed sequence tag (EST) database. Using target region amplification polymorphism (TRAP), 19 of 39 primer pairs showed good amplification for 56 accessions with amplicons ranging from 120 to 3,000 bp long, among which 99.3% were polymorphic. In contrast, 7 of 21 primer pairs demonstrated adequate amplification with clear bands for simple sequence repeats (SSRs) developed from ESTs, and a total of 33 alleles were found in 56 accessions. The similarity matrices generated by TRAP and EST-SSR markers were compared, and the Mantel test (r = 0.57778, P = 0.0020) showed a moderate correlation between the two types of molecular markers. TRAP markers were suitable for DNA fingerprinting and EST-SSR markers were more appropriate for discriminating synonyms (the same cultivars with different names due to limited information exchanged among different geographic areas). The two sets of EST-derived markers will be used further for genetic linkage map construction and quantitative trait locus detection in tree peony.
Development and Characterization of Simple Sequence Repeat (SSR) Markers Based on RNA-Sequencing of Medicago sativa and In silico Mapping onto the M. truncatula Genome

Science.gov (United States)

Wang, Zan; Yu, Guohui; Shi, Binbin; Wang, Xuemin; Qiang, Haiping; Gao, Hongwen

2014-01-01

Sufficient codominant genetic markers are needed for various genetic investigations in alfalfa since the species is an outcrossing autotetraploid. With the newly developed next generation sequencing technology, a large amount of transcribed sequences of alfalfa have been generated and are available for identifying SSR markers by data mining. A total of 54,278 alfalfa non-redundant unigenes were assembled through the Illumina HiSeqTM 2000 sequencing technology. Based on 3,903 unigene sequences, 4,493 SSRs were identified. Tri-nucleotide repeats (56.71%) were the most abundant motif class while AG/CT (21.7%), AGG/CCT (19.8%), AAC/GTT (10.3%), ATC/ATG (8.8%), and ACC/GGT (6.3%) were the subsequent top five nucleotide repeat motifs. Eight hundred and thirty- seven EST-SSR primer pairs were successfully designed. Of these, 527 (63%) primer pairs yielded clear and scored PCR products and 372 (70.6%) exhibited polymorphisms. High transferability was observed for ssp falcata at 99.2% (523) and 71.7% (378) in M. truncatula. In addition, 313 of 527 SSR marker sequences were in silico mapped onto the eight M. truncatula chromosomes. Thirty-six polymorphic SSR primer pairs were used in the genetic relatedness analysis of 30 Chinese alfalfa cultivated accessions generating a total of 199 scored alleles. The mean observed heterozygosity and polymorphic information content were 0.767 and 0.635, respectively. The codominant markers not only enriched the current resources of molecular markers in alfalfa, but also would facilitate targeted investigations in marker-trait association, QTL mapping, and genetic diversity analysis in alfalfa. PMID:24642969
Identification, characterization and utilization of unigene derived microsatellite markers in tea (Camellia sinensis L.

Directory of Open Access Journals (Sweden)

Mohapatra Trilochan

2009-05-01

Full Text Available Abstract Background Despite great advances in genomic technology observed in several crop species, the availability of molecular tools such as microsatellite markers has been limited in tea (Camellia sinensis L.. The development of microsatellite markers will have a major impact on genetic analysis, gene mapping and marker assisted breeding. Unigene derived microsatellite (UGMS markers identified from publicly available sequence database have the advantage of assaying variation in the expressed component of the genome with unique identity and position. Therefore, they can serve as efficient and cost effective alternative markers in such species. Results Considering the multiple advantages of UGMS markers, 1,223 unigenes were predicted from 2,181 expressed sequence tags (ESTs of tea (Camellia sinensis L.. A total of 109 (8.9% unigenes containing 120 SSRs were identified. SSR abundance was one in every 3.55 kb of EST sequences. The microsatellites mainly comprised of di (50.8%, tri (30.8%, tetra (6.6%, penta (7.5% and few hexa (4.1% nucleotide repeats. Among the dinucleotide repeats, (GAn.(TCn were most abundant (83.6%. Ninety six primer pairs could be designed form 83.5% of SSR containing unigenes. Of these, 61 (63.5% primer pairs were experimentally validated and used to investigate the genetic diversity among the 34 accessions of different Camellia spp. Fifty one primer pairs (83.6% were successfully cross transferred to the related species at various levels. Functional annotation of the unigenes containing SSRs was done through gene ontology (GO characterization. Thirty six (60% of them revealed significant sequence similarity with the known/putative proteins of Arabidopsis thaliana. Polymorphism information content (PIC ranged from 0.018 to 0.972 with a mean value of 0.497. The average heterozygosity expected (HE and observed (Ho obtained was 0.654 and 0.413 respectively, thereby suggesting highly heterogeneous nature of tea. Further, test
An efficient identification strategy of clonal tea cultivars using long-core motif SSR markers.

Science.gov (United States)

Wang, Rang Jian; Gao, Xiang Feng; Kong, Xiang Rui; Yang, Jun

2016-01-01

Microsatellites, or simple sequence repeats (SSRs), especially those with long-core motifs (tri-, tetra-, penta-, and hexa-nucleotide) represent an excellent tool for DNA fingerprinting. SSRs with long-core motifs are preferred since neighbor alleles are more easily separated and identified from each other, which render the interpretation of electropherograms and the true alleles more reliable. In the present work, with the purpose of characterizing a set of core SSR markers with long-core motifs for well fingerprinting clonal cultivars of tea (Camellia sinensis), we analyzed 66 elite clonal tea cultivars in China with 33 initially-chosen long-core motif SSR markers covering all the 15 linkage groups of tea plant genome. A set of 6 SSR markers were conclusively selected as core SSR markers after further selection. The polymorphic information content (PIC) of the core SSR markers was >0.5, with ≤5 alleles in each marker containing 10 or fewer genotypes. Phylogenetic analysis revealed that the core SSR markers were not strongly correlated with the trait 'cultivar processing-property'. The combined probability of identity (PID) between two random cultivars for the whole set of 6 SSR markers was estimated to be 2.22 × 10(-5), which was quite low, confirmed the usefulness of the proposed SSR markers for fingerprinting analyses in Camellia sinensis. Moreover, for the sake of quickly discriminating the clonal tea cultivars, a cultivar identification diagram (CID) was subsequently established using these core markers, which fully reflected the identification process and provided the immediate information about which SSR markers were needed to identify a cultivar chosen among the tested ones. The results suggested that long-core motif SSR markers used in the investigation contributed to the accurate and efficient identification of the clonal tea cultivars and enabled the protection of intellectual property.
Simple sequence repeat marker loci discovery using SSR primer.

Science.gov (United States)

Robinson, Andrew J; Love, Christopher G; Batley, Jacqueline; Barker, Gary; Edwards, David

2004-06-12

Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. With the increase in the availability of DNA sequence information, an automated process to identify and design PCR primers for amplification of SSR loci would be a useful tool in plant breeding programs. We report an application that integrates SPUTNIK, an SSR repeat finder, with Primer3, a PCR primer design program, into one pipeline tool, SSR Primer. On submission of multiple FASTA formatted sequences, the script screens each sequence for SSRs using SPUTNIK. The results are parsed to Primer3 for locus-specific primer design. The script makes use of a Web-based interface, enabling remote use. This program has been written in PERL and is freely available for non-commercial users by request from the authors. The Web-based version may be accessed at http://hornbill.cspp.latrobe.edu.au/
EST-SSR

African Journals Online (AJOL)

user2

2013-02-27

Feb 27, 2013 ... Due to their high abundance, multi- allelic nature ... different root colors and origins, and 10 related species in Brassica were selected for ... analysis and transferability study, genomic DNA of two radish advanced inbred ...
Genome size, cytogenetic data and transferability of EST-SSRs markers in wild and cultivated species of the genus Theobroma L. (Byttnerioideae, Malvaceae)

Science.gov (United States)

da Silva, Rangeline Azevedo; Souza, Gustavo; Lemos, Lívia Santos Lima; Lopes, Uilson Vanderlei; Patrocínio, Nara Geórgia Ribeiro Braz; Alves, Rafael Moysés; Marcellino, Lucília Helena; Clement, Didier; Micheli, Fabienne

2017-01-01

The genus Theobroma comprises several trees species native to the Amazon. Theobroma cacao L. plays a key economic role mainly in the chocolate industry. Both cultivated and wild forms are described within the genus. Variations in genome size and chromosome number have been used for prediction purposes including the frequency of interspecific hybridization or inference about evolutionary relationships. In this study, the nuclear DNA content, karyotype and genetic diversity using functional microsatellites (EST-SSR) of seven Theobroma species were characterized. The nuclear content of DNA for all analyzed Theobroma species was 1C = ~ 0.46 pg. These species presented 2n = 20 with small chromosomes and only one pair of terminal heterochromatic bands positively stained (CMA+/DAPI− bands). The small size of Theobroma ssp. genomes was equivalent to other Byttnerioideae species, suggesting that the basal lineage of Malvaceae have smaller genomes and that there was an expansion of 2C values in the more specialized family clades. A set of 20 EST-SSR primers were characterized for related species of Theobroma, in which 12 loci were polymorphic. The polymorphism information content (PIC) ranged from 0.23 to 0.65, indicating a high level of information per locus. Combined results of flow cytometry, cytogenetic data and EST-SSRs markers will contribute to better describe the species and infer about the evolutionary relationships among Theobroma species. In addition, the importance of a core collection for conservation purposes is highlighted. PMID:28187131

Genome size, cytogenetic data and transferability of EST-SSRs markers in wild and cultivated species of the genus Theobroma L. (Byttnerioideae, Malvaceae.

Directory of Open Access Journals (Sweden)

Rangeline Azevedo da Silva

Full Text Available The genus Theobroma comprises several trees species native to the Amazon. Theobroma cacao L. plays a key economic role mainly in the chocolate industry. Both cultivated and wild forms are described within the genus. Variations in genome size and chromosome number have been used for prediction purposes including the frequency of interspecific hybridization or inference about evolutionary relationships. In this study, the nuclear DNA content, karyotype and genetic diversity using functional microsatellites (EST-SSR of seven Theobroma species were characterized. The nuclear content of DNA for all analyzed Theobroma species was 1C = ~ 0.46 pg. These species presented 2n = 20 with small chromosomes and only one pair of terminal heterochromatic bands positively stained (CMA+/DAPI- bands. The small size of Theobroma ssp. genomes was equivalent to other Byttnerioideae species, suggesting that the basal lineage of Malvaceae have smaller genomes and that there was an expansion of 2C values in the more specialized family clades. A set of 20 EST-SSR primers were characterized for related species of Theobroma, in which 12 loci were polymorphic. The polymorphism information content (PIC ranged from 0.23 to 0.65, indicating a high level of information per locus. Combined results of flow cytometry, cytogenetic data and EST-SSRs markers will contribute to better describe the species and infer about the evolutionary relationships among Theobroma species. In addition, the importance of a core collection for conservation purposes is highlighted.
An online conserved SSR discovery through cross-species comparison

Directory of Open Access Journals (Sweden)

Tun-Wen Pai

2009-02-01

Full Text Available Tun-Wen Pai1, Chien-Ming Chen1, Meng-Chang Hsiao1, Ronshan Cheng2, Wen-Shyong Tzou3, Chin-Hua Hu31Department of Computer Science and Engineering; 2Department of Aquaculture, 3Institute of Bioscience and Biotechnology, National Taiwan Ocean University, Keelung, Taiwan, Republic of ChinaAbstract: Simple sequence repeats (SSRs play important roles in gene regulation and genome evolution. Although there exist several online resources for SSR mining, most of them only extract general SSR patterns without providing functional information. Here, an online search tool, CG-SSR (Comparative Genomics SSR discovery, has been developed for discovering potential functional SSRs from vertebrate genomes through cross-species comparison. In addition to revealing SSR candidates in conserved regions among various species, it also combines accurate coordinate and functional genomics information. CG-SSR is the first comprehensive and efficient online tool for conserved SSR discovery.Keywords: microsatellites, genome, comparative genomics, functional SSR, gene ontology, conserved region
DNA Fingerprinting Eastern Redbud Cultivars (Cercis canadensis) Using SSR Markers

Science.gov (United States)

In this study we present data for a subset of SSR loci, 76 out of the 130 high-quality loci, which were selected out of hundreds of SSR loci identified from a SSR-enriched library. SSR markers are abundant in eukaryotic genomes and are highly reproducible. Previously, we have used SSR markers to e...
FullSSR: Microsatellite Finder and Primer Designer

Directory of Open Access Journals (Sweden)

Sebastián Metz

2016-01-01

Full Text Available Microsatellites are genomic sequences comprised of tandem repeats of short nucleotide motifs widely used as molecular markers in population genetics. FullSSR is a new bioinformatic tool for microsatellite (SSR loci detection and primer design using genomic data from NGS assay. The software was tested with 2000 sequences of Oryza sativa shotgun sequencing project from the National Center of Biotechnology Information Trace Archive and with partial genome sequencing with ROCHE 454® from Caiman latirostris, Salvator merianae, Aegla platensis, and Zilchiopsis collastinensis. FullSSR performance was compared against other similar SSR search programs. The results of the use of this kind of approach depend on the parameters set by the user. In addition, results can be affected by the analyzed sequences because of differences among the genomes. FullSSR simplifies the detection of SSRs and primer design on a big data set. The command line interface of FullSSR was intended to be used as part of genomic analysis tools pipeline; however, it can be used as a stand-alone program because the results are easily interpreted for a nonexpert user.
Molecular characterization of a peanut variety and its derivatives based on SSR and COP analysis

Institute of Scientific and Technical Information of China (English)

Xiaoping REN; Boshou LIAO; Huifang JIANG; Zhongyuan YUAN; Yuning CHEN; Xiaojing ZHOU; Li HUANG; Jiaquan HUANG; Yong LEI; Liying YAN

2016-01-01

Despite the economic importance of the peanut,no studies have been carried out to determine the correlation between genetic distances based on molecular markers and on coefficient of parentage (COP) data.In this study,simple sequence repeat (SSR) and pedigree data were used to assess the genetic distance between the Fuhuasheng variety and its derivative cultivars.A total of 39 SSR polymorphism primers were used,and 151 bands were obtained,with an average of 2.04 bands in each primer.The genetic SSR-based distance (GD) values ranged from 0.02 to 0.81,while the COP-based GD ranged from 0.25 to 0.98.Certain Fuhuasheng loci displayed higher transmission rates.These loci or nearby chromosomal regions might be associated with desirable traits in Fuhuasheng variety,thus being frequently selected in breeding programs.Therefore,it can be suggested that COP analysis should be the preferred method for estimating genetic diversity invarieties with available complete pedigree information and parents.In this case,marker analysis would provide the best estimations.
SSR marker development and intraspecific genetic divergence exploration of Chrysanthemum indicum based on transcriptome analysis.

Science.gov (United States)

Han, Zhengzhou; Ma, Xinye; Wei, Min; Zhao, Tong; Zhan, Ruoting; Chen, Weiwen

2018-04-25

Chrysanthemum indicum L., an important ancestral species of the flowering plant chrysanthemum, can be used as medicine and for functional food development. Due to the lack of hereditary information for this species and the difficulty of germplasm identification, we herein provide new genetic insight from the perspective of intraspecific transcriptome comparison and present single sequence repeat (SSR) molecular marker recognition technology. Through the study of a diploid germplasm (DIWNT) and a tetraploid germplasm (DIWT), the following outcome were obtained. (1) A significant difference in Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations for specific homologous genes was observed using the OrthoMCL method for the identification of homologous gene families between the two cytotypes. Ka/Ks analysis of common, single-copy homologous family members also revealed a greater difference among genes that experienced positive selection than among those experiencing positive selection. (2) Of more practical value, 2575 SSR markers were predicted and partly verified. We used TaxonGap as a visual tool to inspect genotype uniqueness and screen for high-performance molecular loci; we recommend four primers of 65 randomly selected primers with a combined identification success rate of 88.6% as priorities for further development of DNA fingerprinting of C. indicum germplasm. The SSR technology based on next-generation sequencing was proved to be successful in the identification of C. indicum germplasms. And the information on the intraspecfic genetic divergence generated by transcriptome comparison deepened the understanding of this complex species' nature.
Development of unigene-derived SSR markers in cowpea (Vigna unguiculata) and their transferability to other Vigna species.

Science.gov (United States)

Gupta, S K; Gopalakrishna, T

2010-07-01

Unigene sequences available in public databases provide a cost-effective and valuable source for the development of molecular markers. In this study, the identification and development of unigene-based SSR markers in cowpea (Vigna unguiculata (L.) Walp.) is presented. A total of 1071 SSRs were identified in 15 740 cowpea unigene sequences downloaded from the National Center for Biotechnology Information. The most frequent SSR motifs present in the unigenes were trinucleotides (59.7%), followed by dinucleotides (34.8%), pentanucleotides (4%), and tetranucleotides (1.5%). The copy number varied from 6 to 33 for dinucleotide, 5 to 29 for trinucleotide, 5 to 7 for tetranucleotide, and 4 to 6 for pentanucleotide repeats. Primer pairs were successfully designed for 803 SSR motifs and 102 SSR markers were finally characterized and validated. Putative function was assigned to 64.7% of the unigene SSR markers based on significant homology to reported proteins. About 31.7% of the SSRs were present in coding sequences and 68.3% in untranslated regions of the genes. About 87% of the SSRs located in the coding sequences were trinucleotide repeats. Allelic variation at 32 SSR loci produced 98 alleles in 20 cowpea genotypes. The polymorphic information content for the SSR markers varied from 0.10 to 0.83 with an average of 0.53. These unigene SSR markers showed a high rate of transferability (88%) across other Vigna species, thereby expanding their utility. Alignment of unigene sequences with soybean genomic sequences revealed the presence of introns in amplified products of some of the SSR markers. This study presents the distribution of SSRs in the expressed portion of the cowpea genome and is the first report of the development of functional unigene-based SSR markers in cowpea. These SSR markers would play an important role in molecular mapping, comparative genomics, and marker-assisted selection strategies in cowpea and other Vigna species.
Development and identification of SSR markers associated with starch properties and β-carotene content in the storage root of sweet potato (Ipomoea batatas L.

Directory of Open Access Journals (Sweden)

Kai eZhang

2016-03-01

Full Text Available Sweet potato (Ipomoea batatas L. is a nutritious food crop and, based on the high starch content of its storage root, a potential bioethanol feedstock. Enhancing the nutritional value and starch quantity of storage roots are important goals of sweet potato breeding programs aimed at developing improved varieties for direct consumption, processing, and industrial uses. However, developing improved lines of sweet potato is challenging due to the genetic complexity of this plant and the lack of genome information. Short sequence repeat (SSR markers are powerful molecular tools for tracking important loci in crops and for molecular-based breeding strategies; however, few SSR markers and marker-trait associations have hitherto been identified in sweet potato. In this study, we identified 1,824 SSRs by using a de novo assembly of publicly available ESTs and mRNAs in sweet potato, and designed 1,476 primer pairs based on SSR-containing sequences. We mapped 214 pairs of primers in a natural population comprised of 239 germplasms, and identified 1,278 alleles with an average of 5.972 alleles per locus and a major allele frequency of 0.7702. Population structure analysis revealed two subpopulations in this panel of germplasms, and phenotypic characterization demonstrated that this panel is suitable for association mapping of starch-related traits. We identified 32, 16, and 17 SSR markers associated with starch content, β-carotene content, and starch composition in the storage root, respectively, using association analysis and further evaluation of a subset of sweet potato genotypes with various characteristics. The SSR markers identified here can be used to select varieties with desired traits and to investigate the genetic mechanism underlying starch and carotenoid formation in the starchy roots of sweet potato.
Development and Identification of SSR Markers Associated with Starch Properties and β-Carotene Content in the Storage Root of Sweet Potato (Ipomoea batatas L.).

Science.gov (United States)

Zhang, Kai; Wu, Zhengdan; Tang, Daobin; Lv, Changwen; Luo, Kai; Zhao, Yong; Liu, Xun; Huang, Yuanxin; Wang, Jichun

2016-01-01

Sweet potato (Ipomoea batatas L.) is a nutritious food crop and, based on the high starch content of its storage root, a potential bioethanol feedstock. Enhancing the nutritional value and starch quantity of storage roots are important goals of sweet potato breeding programs aimed at developing improved varieties for direct consumption, processing, and industrial uses. However, developing improved lines of sweet potato is challenging due to the genetic complexity of this plant and the lack of genome information. Short sequence repeat (SSR) markers are powerful molecular tools for tracking important loci in crops and for molecular-based breeding strategies; however, few SSR markers and marker-trait associations have hitherto been identified in sweet potato. In this study, we identified 1824 SSRs by using a de novo assembly of publicly available ESTs and mRNAs in sweet potato, and designed 1476 primer pairs based on SSR-containing sequences. We mapped 214 pairs of primers in a natural population comprised of 239 germplasms, and identified 1278 alleles with an average of 5.972 alleles per locus and a major allele frequency of 0.7702. Population structure analysis revealed two subpopulations in this panel of germplasms, and phenotypic characterization demonstrated that this panel is suitable for association mapping of starch-related traits. We identified 32, 16, and 17 SSR markers associated with starch content, β-carotene content, and starch composition in the storage root, respectively, using association analysis and further evaluation of a subset of sweet potato genotypes with various characteristics. The SSR markers identified here can be used to select varieties with desired traits and to investigate the genetic mechanism underlying starch and carotenoid formation in the starchy roots of sweet potato.
[SSR loci information analysis in transcriptome of Andrographis paniculata].

Science.gov (United States)

Li, Jun-Ren; Chen, Xiu-Zhen; Tang, Xiao-Ting; He, Rui; Zhan, Ruo-Ting

2018-06-01

To study the SSR loci information and develop molecular markers, a total of 43 683 Unigenes in transcriptome of Andrographis paniculata were used to explore SSR. The distribution frequency of SSR and the basic characteristics of repeat motifs were analyzed using MicroSAtellite software, SSR primers were designed by Primer 3.0 software and then validated by PCR. Moreover, the gene function analysis of SSR Unigene was obtained by Blast. The results showed that 14 135 SSR loci were found in the transcriptome of A. paniculata, which distributed in 9 973 Unigenes with a distribution frequency of 32.36%. Di-nucleotide and Tri-nucleotide repeat were the main types, accounted for 75.54% of all SSRs. The repeat motifs of AT/AT and CCG/CGG were the predominant repeat types of Di-nucleotide and Tri-nucleotide, respectively. A total of 4 740 pairs of SSR primers with the potential to produce polymorphism were designed for maker development. Ten pairs of primers in 20 pairs of randomly picked primers produced fragments with expected molecular size. The gene function of Unigenes containing SSR were mostly related to the basic metabolism function of A. paniculata. The SSR markers in transcriptome of A. paniculata show rich type, strong specificity and high potential of polymorphism, which will benefit the candidate gene mining and marker-assisted breeding. Copyright© by the Chinese Pharmaceutical Association.
ESTIMA, a tool for EST management in a multi-project environment.

Science.gov (United States)

Kumar, Charu G; LeDuc, Richard; Gong, George; Roinishivili, Levan; Lewin, Harris A; Liu, Lei

2004-11-04

Single-pass, partial sequencing of complementary DNA (cDNA) libraries generates thousands of chromatograms that are processed into high quality expressed sequence tags (ESTs), and then assembled into contigs representative of putative genes. Usually, to be of value, ESTs and contigs must be associated with meaningful annotations, and made available to end-users. A web application, Expressed Sequence Tag Information Management and Annotation (ESTIMA), has been created to meet the EST annotation and data management requirements of multiple high-throughput EST sequencing projects. It is anchored on individual ESTs and organized around different properties of ESTs including chromatograms, base-calling quality scores, structure of assembled transcripts, and multiple sources of comparison to infer functional annotation, Gene Ontology associations, and cDNA library information. ESTIMA consists of a relational database schema and a set of interactive query interfaces. These are integrated with a suite of web-based tools that allow a user to query and retrieve information. Further, query results are interconnected among the various EST properties. ESTIMA has several unique features. Users may run their own EST processing pipeline, search against preferred reference genomes, and use any clustering and assembly algorithm. The ESTIMA database schema is very flexible and accepts output from any EST processing and assembly pipeline. ESTIMA has been used for the management of EST projects of many species, including honeybee (Apis mellifera), cattle (Bos taurus), songbird (Taeniopygia guttata), corn rootworm (Diabrotica vergifera), catfish (Ictalurus punctatus, Ictalurus furcatus), and apple (Malus x domestica). The entire resource may be downloaded and used as is, or readily adapted to fit the unique needs of other cDNA sequencing projects. The scripts used to create the ESTIMA interface are freely available to academic users in an archived format from http
Sequencing and analysis of full-length cDNAs, 5'-ESTs and 3'-ESTs from a cartilaginous fish, the elephant shark (Callorhinchus milii).

KAUST Repository

Brenner, Sydney

2012-10-08

Cartilaginous fishes are the most ancient group of living jawed vertebrates (gnathostomes) and are, therefore, an important reference group for understanding the evolution of vertebrates. The elephant shark (Callorhinchus milii), a holocephalan cartilaginous fish, has been identified as a model cartilaginous fish genome because of its compact genome (∼910 Mb) and a genome project has been initiated to obtain its whole genome sequence. In this study, we have generated and sequenced full-length enriched cDNA libraries of the elephant shark using the \\'oligo-capping\\' method and Sanger sequencing. A total of 6,778 full-length protein-coding cDNA and 10,701 full-length noncoding cDNA were sequenced from six tissues (gills, intestine, kidney, liver, spleen, and testis) of the elephant shark. Analysis of their polyadenylation signals showed that polyadenylation usage in elephant shark is similar to that in mammals. Furthermore, both coding and noncoding transcripts of the elephant shark use the same proportion of canonical polyadenylation sites. Besides BLASTX searches, protein-coding transcripts were annotated by Gene Ontology, InterPro domain, and KEGG pathway analyses. By comparing elephant shark genes to bony vertebrate genes, we identified several ancient genes present in elephant shark but differentially lost in tetrapods or teleosts. Only ∼6% of elephant shark noncoding cDNA showed similarity to known noncoding RNAs (ncRNAs). The rest are either highly divergent ncRNAs or novel ncRNAs. In addition to full-length transcripts, 30,375 5\\'-ESTs and 41,317 3\\'-ESTs were sequenced and annotated. The clones and transcripts generated in this study are valuable resources for annotating transcription start sites, exon-intron boundaries, and UTRs of genes in the elephant shark genome, and for the functional characterization of protein sequences. These resources will also be useful for annotating genes in other cartilaginous fishes whose genomes have been targeted for
Sequencing and analysis of full-length cDNAs, 5'-ESTs and 3'-ESTs from a cartilaginous fish, the elephant shark (Callorhinchus milii).

KAUST Repository

Brenner, Sydney; Kodzius, Rimantas; Tan, Yue Ying; Tay, Alice; Tay, Boon-Hui; Venkatesh, Byrappa

2012-01-01

Cartilaginous fishes are the most ancient group of living jawed vertebrates (gnathostomes) and are, therefore, an important reference group for understanding the evolution of vertebrates. The elephant shark (Callorhinchus milii), a holocephalan cartilaginous fish, has been identified as a model cartilaginous fish genome because of its compact genome (∼910 Mb) and a genome project has been initiated to obtain its whole genome sequence. In this study, we have generated and sequenced full-length enriched cDNA libraries of the elephant shark using the 'oligo-capping' method and Sanger sequencing. A total of 6,778 full-length protein-coding cDNA and 10,701 full-length noncoding cDNA were sequenced from six tissues (gills, intestine, kidney, liver, spleen, and testis) of the elephant shark. Analysis of their polyadenylation signals showed that polyadenylation usage in elephant shark is similar to that in mammals. Furthermore, both coding and noncoding transcripts of the elephant shark use the same proportion of canonical polyadenylation sites. Besides BLASTX searches, protein-coding transcripts were annotated by Gene Ontology, InterPro domain, and KEGG pathway analyses. By comparing elephant shark genes to bony vertebrate genes, we identified several ancient genes present in elephant shark but differentially lost in tetrapods or teleosts. Only ∼6% of elephant shark noncoding cDNA showed similarity to known noncoding RNAs (ncRNAs). The rest are either highly divergent ncRNAs or novel ncRNAs. In addition to full-length transcripts, 30,375 5'-ESTs and 41,317 3'-ESTs were sequenced and annotated. The clones and transcripts generated in this study are valuable resources for annotating transcription start sites, exon-intron boundaries, and UTRs of genes in the elephant shark genome, and for the functional characterization of protein sequences. These resources will also be useful for annotating genes in other cartilaginous fishes whose genomes have been targeted for whole
Genetic diversity of wild and cultivated Rubus species in Colombia using AFLP and SSR markers

Directory of Open Access Journals (Sweden)

Sandra Bibiana Aguilar

2007-01-01

Full Text Available The Andean blackberry belongs to the genus Rubus, the largest of the Rosaceae family and one of the mostdiverse of the plant kingdom. In Colombia Rubus glaucus Benth, known as the Andean raspberry or blackberry, is one of thenine edible of the genus out of forty-four reported species. In this study wild and cultivated genotypes, collected in the CentralAndes of Colombia were analyzed by AFLP and SSR markers. Sexual reproduction seems to play an important role inmaintaining the genetic variability in R. glaucus, and the viability of using the SSR of Rubus alceifolius to characterizeColombian Rubus species was clearly demonstrated. All species evaluated produced very specific banding patterns,differentiating them from the others. Both AFLP and SSR produced bands exclusive to each of the following species: R.robustus, R. urticifolius, R. glaucus, and R. rosifolius. The SSR markers differentiated diploid and tetraploid genotypes of R.glaucus.
Analysis of the genetic diversity of super sweet corn inbred lines using SSR and SSAP markers.

Science.gov (United States)

Ko, W R; Sa, K J; Roy, N S; Choi, H-J; Lee, J K

2016-01-22

In this study, we compared the efficiency of simple sequence repeat (SSR) and sequence specific amplified polymorphism (SSAP) markers for analyzing genetic diversity, genetic relationships, and population structure of 87 super sweet corn inbred lines from different origins. SSR markers showed higher average gene diversity and Shannon's information index than SSAP markers. To assess genetic relationships and characterize inbred lines using SSR and SSAP markers, genetic similarity (GS) matrices were constructed. The dendrogram using SSR marker data showed a complex pattern with nine clusters and a GS of 53.0%. For SSAP markers, three clusters were observed with a GS of 50.8%. Results of combined marker data showed six clusters with 53.5% GS. To analyze the genetic population structure of SSR and SSAP marker data, the 87 inbred lines were divided into groups I, II, and admixed based on the membership probability threshold of 0.8. Using combined marker data, the population structure was K = 3 and was divided into groups I, II, III, and admixed. This study represents a comparative analysis of SSR and SSAP marker data for the study of genetic diversity and genetic relationships in super sweet corn inbred lines. Our results would be useful for maize-breeding programs in Korea.
Functional markers based molecular characterization and cloning of resistance gene analogs encoding NBS-LRR disease resistance proteins in finger millet (Eleusine coracana).

Science.gov (United States)

Panwar, Preety; Jha, Anand Kumar; Pandey, P K; Gupta, Arun K; Kumar, Anil

2011-06-01

Magnaporthe grisea, the blast fungus is one of the main pathological threats to finger millet crop worldwide. A systematic search for the blast resistance gene analogs was carried out, using functional molecular markers. Three-fourths of the recognition-dependent disease resistance genes (R-genes) identified in plants encodes nucleotide binding site (NBS) leucine-rich repeat (LRR) proteins. NBS-LRR homologs have only been isolated on a limited scale from Eleusine coracana. Genomic DNA sequences sharing homology with NBS region of resistance gene analogs were isolated and characterized from resistant genotypes of finger millet using PCR based approach with primers designed from conserved regions of NBS domain. Attempts were made to identify molecular markers linked to the resistance gene and to differentiate the resistant bulk from the susceptible bulk. A total of 9 NBS-LRR and 11 EST-SSR markers generated 75.6 and 73.5% polymorphism respectively amongst 73 finger millet genotypes. NBS-5, NBS-9, NBS-3 and EST-SSR-04 markers showed a clear polymorphism which differentiated resistant genotypes from susceptible genotypes. By comparing the banding pattern of different resistant and susceptible genotypes, five DNA amplifications of NBS and EST-SSR primers (NBS-05(504,) NBS-09(711), NBS-07(688), NBS-03(509) and EST-SSR-04(241)) were identified as markers for the blast resistance in resistant genotypes. Principal coordinate plot and UPGMA analysis formed similar groups of the genotypes and placed most of the resistant genotypes together showing a high level of genetic relatedness and the susceptible genotypes were placed in different groups on the basis of differential disease score. Our results provided a clue for the cloning of finger millet blast resistance gene analogs which not only facilitate the process of plant breeding but also molecular characterization of blast resistance gene analogs from Eleusine coracana.
Characterization of genetic diversity in chickpea using SSR markers, Start Codon Targeted Polymorphism (SCoT) and Conserved DNA-Derived Polymorphism (CDDP).

Science.gov (United States)

Hajibarat, Zahra; Saidi, Abbas; Hajibarat, Zohreh; Talebi, Reza

2015-07-01

To evaluate the genetic diversity among 48 genotypes of chickpea comprising cultivars, landraces and internationally developed improved lines genetic distances were evaluated using three different molecular marker techniques: Simple Sequence Repeat (SSR); Start Codon Targeted (SCoT) and Conserved DNA-derived Polymorphism (CDDP). Average polymorphism information content (PIC) for SSR, SCoT and CDDP markers was 0.47, 0.45 and 0.45, respectively, and this revealed that three different marker types were equal for the assessment of diversity amongst genotypes. Cluster analysis for SSR and SCoT divided the genotypes in to three distinct clusters and using CDDP markers data, genotypes grouped in to five clusters. There were positive significant correlation (r = 0.43, P SSR markers. These results suggest that efficiency of SSR, SCOT and CDDP markers was relatively the same in fingerprinting of chickpea genotypes. To our knowledge, this is the first detailed report of using targeted DNA region molecular marker (CDDP) for genetic diversity analysis in chickpea in comparison with SCoT and SSR markers. Overall, our results are able to prove the suitability of SCoT and CDDP markers for genetic diversity analysis in chickpea for their high rates of polymorphism and their potential for genome diversity and germplasm conservation.
An EST dataset for Metasequoia glyptostroboides buds: the first EST resource for molecular genomics studies in Metasequoia.

Science.gov (United States)

Zhao, Ying; Thammannagowda, Shivegowda; Staton, Margaret; Tang, Sha; Xia, Xinli; Yin, Weilun; Liang, Haiying

2013-03-01

The "living fossil" Metasequoia glyptostroboides Hu et Cheng, commonly known as dawn redwood or Chinese redwood, is the only living species in the genus and is valued for its essential oil and crude extracts that have great potential for anti-fungal activity. Despite its paleontological significance and economical value as a rare relict species, genomic resources of Metasequoia are very limited. In order to gain insight into the molecular mechanisms behind the formation of reproductive buds and the transition from vegetative phase to reproductive phase in Metasequoia, we performed sequencing of expressed sequence tags from Metasequoia vegetative buds and female buds. By using the 454 pyrosequencing technology, a total of 1,571,764 high-quality reads were generated, among which 733,128 were from vegetative buds and 775,636 were from female buds. These EST reads were clustered and assembled into 114,124 putative unique transcripts (PUTs) with an average length of 536 bp. The 97,565 PUTs that were at least 100 bp in length were functionally annotated by a similarity search against public databases and assigned with Gene Ontology (GO) terms. A total of 59 known floral gene families and 190 isotigs involved in hormone regulation were captured in the dataset. Furthermore, a set of PUTs differentially expressed in vegetative and reproductive buds, as well as SSR motifs and high confidence SNPs, were identified. This is the first large-scale expressed sequence tags ever generated in Metasequoia and the first evidence for floral genes in this critically endangered deciduous conifer species.
Assessment of inter- and intra-cultivar variations in olive using SSR markers

Directory of Open Access Journals (Sweden)

Ahmet Ipek

2012-10-01

Full Text Available Olive (Olea europaea L. production in the world has been made by using many cultivars, and the genetic uniformity of commercial cultivars is important for standard olive oil and table olive production. The genetic variation among and within commonly cultivated olive cultivars in Turkey was analyzed using SSR markers. A total of 135 leaf samples were collected from 11 commonly cultivated olive cultivars from 11 provinces in four geographical regions of Turkey. Seven SSR primer pairs generated 46 SSR markers, and the number of SSR markers per primer pair ranged from 4 (UDO-14 to 9 (GAPU-89 with an average of 6.57. This high level of SSR polymorphism suggests that olive production in Turkey has been made using genetically diverse olive cultivars and this high level of genetic variation is probably due to the location of Turkey in the center of the origin of olive. The UPGMA dendrogram, developed to visualize the estimated genetic relationships among the 135 samples, demonstrated that the clustering of olive cultivars was not based on geographical regions of cultivation. Presence of genetic variation was detected within a nationwide grown Turkish olive cultivar, called 'Gemlik'. Olive growers successfully discriminated olive cultivars with distinct morphological and pomological characters. However, there was some confusion about the identification of cultivars with similar phenotypic traits. To prevent misidentification of olive cultivars and to minimize intra-cultivar variation, certified propagation materials which were characterized using DNA based molecular markers should be used during the establishment of new olive orchards.
ESTIMA, a tool for EST management in a multi-project environment

Directory of Open Access Journals (Sweden)

Lewin Harris A

2004-11-01

Full Text Available Abstract Background Single-pass, partial sequencing of complementary DNA (cDNA libraries generates thousands of chromatograms that are processed into high quality expressed sequence tags (ESTs, and then assembled into contigs representative of putative genes. Usually, to be of value, ESTs and contigs must be associated with meaningful annotations, and made available to end-users. Results A web application, Expressed Sequence Tag Information Management and Annotation (ESTIMA, has been created to meet the EST annotation and data management requirements of multiple high-throughput EST sequencing projects. It is anchored on individual ESTs and organized around different properties of ESTs including chromatograms, base-calling quality scores, structure of assembled transcripts, and multiple sources of comparison to infer functional annotation, Gene Ontology associations, and cDNA library information. ESTIMA consists of a relational database schema and a set of interactive query interfaces. These are integrated with a suite of web-based tools that allow a user to query and retrieve information. Further, query results are interconnected among the various EST properties. ESTIMA has several unique features. Users may run their own EST processing pipeline, search against preferred reference genomes, and use any clustering and assembly algorithm. The ESTIMA database schema is very flexible and accepts output from any EST processing and assembly pipeline. ESTIMA has been used for the management of EST projects of many species, including honeybee (Apis mellifera, cattle (Bos taurus, songbird (Taeniopygia guttata, corn rootworm (Diabrotica vergifera, catfish (Ictalurus punctatus, Ictalurus furcatus, and apple (Malus x domestica. The entire resource may be downloaded and used as is, or readily adapted to fit the unique needs of other cDNA sequencing projects. Conclusions The scripts used to create the ESTIMA interface are freely available to academic users in

(SSR) markers

African Journals Online (AJOL)

acer

2013-06-26

Jun 26, 2013 ... analysis was in general agreement with PCoA in discrimi- nating the cultivars. Conclusions. Estimation of morphological diversity may provide addi- tional information on the present finding. Nonetheless, the 29 SSR markers provided considerable genetic reso- lution and this genetic diversity analysis ...
(SSR) markers

African Journals Online (AJOL)

SAM

2014-07-30

Jul 30, 2014 ... India and the country is currently the leading producer, consumer and exporter of ... registration with the competent authority for plant variety protection. Conventionally ... detection of duplicates, parental verification in crosses, gene tagging in .... allelic patterns as revealed by the current set of SSR markers.
Segregation analysis of microsatellite (SSR) markers in sugarcane polyploids.

Science.gov (United States)

Lu, X; Zhou, H; Pan, Y-B; Chen, C Y; Zhu, J R; Chen, P H; Li, Y-R; Cai, Q; Chen, R K

2015-12-28

No information is available on segregation analysis of DNA markers involving both pollen and self-progeny. Therefore, we used capillary electrophoresis- and fluorescence-based DNA fingerprinting together with single pollen collection and polymerase chain reaction (PCR) to investigate simple sequence repeat (SSR) marker segregation among 964 single pollens and 288 self-progenies (S1) of sugarcane cultivar LCP 85-384. Twenty SSR DNA fragments (alleles) were amplified by five polymorphic SSR markers. Only one non-parental SSR allele was observed in 2392 PCRs. SSR allele inheritance was in accordance with Mendelian laws of segregation and independent assortment. Highly significant correlation coefficients were found between frequencies of observed and expected genotypes in pollen and S1 populations. Within the S1 population, the most frequent genotype of each SSR marker was the parental genotype of the same marker. The number of genotypes was higher in pollen than S1 population. PIC values of the five SSR markers were greater in pollen than S1 populations. Eleven of 20 SSR alleles (55%) were segregated in accordance with Mendelian segregation ratios expected from pollen and S1 populations of a 2n = 10x polyploid. Six of 20 SSR alleles were segregated in a 3:1 (presence:absence) ratio and were simplex markers. Four and one alleles were segregated in 77:4 and 143:1 ratios and considered duplex and triplex markers, respectively. Segregation ratios of remaining alleles were unexplainable. The results provide information about selection of crossing parents, estimation of seedling population optimal size, and promotion of efficient selection, which may be valuable for sugarcane breeders.
An RNA Sequencing Transcriptome Analysis of Grasspea (Lathyrus sativus L.) and Development of SSR and KASP Markers.

Science.gov (United States)

Hao, Xiaopeng; Yang, Tao; Liu, Rong; Hu, Jinguo; Yao, Yang; Burlyaeva, Marina; Wang, Yan; Ren, Guixing; Zhang, Hongyan; Wang, Dong; Chang, Jianwu; Zong, Xuxiao

2017-01-01

Grasspea ( Lathyrus sativus L., 2n = 14) has great agronomic potential because of its ability to survive under extreme conditions, such as drought and flood. However, this legume is less investigated because of its sparse genomic resources and very slow breeding process. In this study, 570 million quality-filtered and trimmed cDNA sequence reads with total length of over 82 billion bp were obtained using the Illumina NextSeq TM 500 platform. Approximately two million contigs and 142,053 transcripts were assembled from our RNA-Seq data, which resulted in 27,431 unigenes with an average length of 1,250 bp and maximum length of 48,515 bp. The unigenes were of high-quality. For example, the stay-green (SGR) gene of grasspea was aligned with the SGR gene of pea with high similarity. Among these unigenes, 3,204 EST-SSR primers were designed, 284 of which were randomly chosen for validation. Of these validated unigenes, 87 (30.6%) EST-SSR primers produced polymorphic amplicons among 43 grasspea accessions selected from different geographical locations. Meanwhile, 146,406 SNPs were screened and 50 SNP loci were randomly chosen for the kompetitive allele-specific PCR (KASP) validation. Over 80% (42) SNP loci were successfully transformed to KASP markers. Comparison of the dendrograms according to the SSR and KASP markers showed that the different marker systems are partially consistent with the dendrogram constructed in our study.
Next-Generation Sequencing of the Chrysanthemum nankingense (Asteraceae) Transcriptome Permits Large-Scale Unigene Assembly and SSR Marker Discovery

Science.gov (United States)

Wang, Haibin; Jiang, Jiafu; Chen, Sumei; Qi, Xiangyu; Peng, Hui; Li, Pirui; Song, Aiping; Guan, Zhiyong; Fang, Weimin; Liao, Yuan; Chen, Fadi

2013-01-01

Background Simple sequence repeats (SSRs) are ubiquitous in eukaryotic genomes. Chrysanthemum is one of the largest genera in the Asteraceae family. Only few Chrysanthemum expressed sequence tag (EST) sequences have been acquired to date, so the number of available EST-SSR markers is very low. Methodology/Principal Findings Illumina paired-end sequencing technology produced over 53 million sequencing reads from C. nankingense mRNA. The subsequent de novo assembly yielded 70,895 unigenes, of which 45,789 (64.59%) unigenes showed similarity to the sequences in NCBI database. Out of 45,789 sequences, 107 have hits to the Chrysanthemum Nr protein database; 679 and 277 sequences have hits to the database of Helianthus and Lactuca species, respectively. MISA software identified a large number of putative EST-SSRs, allowing 1,788 primer pairs to be designed from the de novo transcriptome sequence and a further 363 from archival EST sequence. Among 100 primer pairs randomly chosen, 81 markers have amplicons and 20 are polymorphic for genotypes analysis in Chrysanthemum. The results showed that most (but not all) of the assays were transferable across species and that they exposed a significant amount of allelic diversity. Conclusions/Significance SSR markers acquired by transcriptome sequencing are potentially useful for marker-assisted breeding and genetic analysis in the genus Chrysanthemum and its related genera. PMID:23626799
Transferability of simple sequence repeat (SSR) markers developed in guava (Psidium guajava L.) to four Myrtaceae species.

Science.gov (United States)

Rai, Manoj K; Phulwaria, Mahendra; Shekhawat, N S

2013-08-01

Present study demonstrated the cross-genera transferability of 23 simple sequence repeat (SSR) primer pairs developed for guava (Psidium guajava L.) to four new targets, two species of eucalypts (Eucalyptus citriodora, Eucalyptus camaldulensis), bottlebrush (Callistemon lanceolatus) and clove (Syzygium aromaticum), belonging to the family Myrtaceae and subfamily Myrtoideae. Off the 23 SSR loci assayed, 18 (78.2%) gave cross-amplification in E. citriodora, 14 (60.8%) in E. camaldulensis and 17-17 (73.9%) in C. lanceolatus and S. aromaticum. Eight primer pairs were found to be transferable to all four species. The number of alleles detected at each locus ranged from one to nine, with an average of 4.8, 2.6, 4.5 and 4.6 alleles in E. citriodora, E. camaldulensis, C. lanceolatus and S. aromaticum, respectively. The high levels of cross-genera transferability of guava SSRs may be applicable for the analysis of intra- and inter specific genetic diversity of target species, especially in E. citriodora, C. lanceolatus and S. aromaticum, for which till date no information about EST-derived as well as genomic SSR is available.
Generation and application of SSR markers in avocado

International Nuclear Information System (INIS)

Sharon, D.; Lavi, U.; Cregan, P.B.; Hillel, J.

1998-01-01

Simple Sequence Repeat (SSR) DNA markers were generated and applied to avocado. An SSR marker is based on a pair of primers which are synthesized on the basis of DNA sequences flanking a micro satellite. These markers are PCR based, quite polymorphic and abundant in several species. These are the markers, of choice in the human genome. The number of SSR markers in the avocado genome was calculated to be about 45,000, with the A/T micro satellite being the most frequent (1 in 40 kb). SSR markers are quite expensive to generate due to the required multi-step procedure; Screening a genomic library, about 66% of the positive clones turned out after sequencing to be SSR containing clones. In only about 55% of these, was it possible to synthesize primers and, of this group, only about 50% of the markers were useful for typing a specific family. Typing of five avocado cultivars using 59 SSR markers results in one to eight alleles per locus, mean heterozygosity ranging between 0.51 and 0.66 and gene diversity ranging between 0.42 and 0.66. The SSR markers were used to estimate the genetic relationships between various Persea species. The number of alleles in these species ranged between five and twelve with heterozygosity levels between 0.11-0.78 and gene diversity between 0.69-0.89. A preliminary genetic map, based on these SSR markers together with some DNA fingerprints (DFP) and randomly amplified polymorphic DNA (RAPD) markers, was drawn. The map consists of 12 linkage group having two to five markers each. Linkage analysis with several quantitative trait loci (QTLs) was performed by genetic typing and phenotypic assessment of the progeny of a controlled cross. The results of the interval mapping suggest that the gene(s) coding for the existence of fibers in the flesh, are probably linked to linkage group 3. (author)
Generation and application of SSR markers in avocado

Energy Technology Data Exchange (ETDEWEB)

Sharon, D; Lavi, U [Institute of Horticulture, ARO Volcani Center, Bet-Dagan (Israel); Cregan, P B [United States Department of Agriculture, Agricultural Research Service, Beltsville, Maryland (United States); Hillel, J [Faculty of Agriculture, Hebrew University of Jerusalem, Rehovot (Israel)

1998-10-01

Simple Sequence Repeat (SSR) DNA markers were generated and applied to avocado. An SSR marker is based on a pair of primers which are synthesized on the basis of DNA sequences flanking a micro satellite. These markers are PCR based, quite polymorphic and abundant in several species. These are the markers, of choice in the human genome. The number of SSR markers in the avocado genome was calculated to be about 45,000, with the A/T micro satellite being the most frequent (1 in 40 kb). SSR markers are quite expensive to generate due to the required multi-step procedure; Screening a genomic library, about 66% of the positive clones turned out after sequencing to be SSR containing clones. In only about 55% of these, was it possible to synthesize primers and, of this group, only about 50% of the markers were useful for typing a specific family. Typing of five avocado cultivars using 59 SSR markers results in one to eight alleles per locus, mean heterozygosity ranging between 0.51 and 0.66 and gene diversity ranging between 0.42 and 0.66. The SSR markers were used to estimate the genetic relationships between various Persea species. The number of alleles in these species ranged between five and twelve with heterozygosity levels between 0.11-0.78 and gene diversity between 0.69-0.89. A preliminary genetic map, based on these SSR markers together with some DNA fingerprints (DFP) and randomly amplified polymorphic DNA (RAPD) markers, was drawn. The map consists of 12 linkage group having two to five markers each. Linkage analysis with several quantitative trait loci (QTLs) was performed by genetic typing and phenotypic assessment of the progeny of a controlled cross. The results of the interval mapping suggest that the gene(s) coding for the existence of fibers in the flesh, are probably linked to linkage group 3. (author) 20 refs, 3 figs, 8 tabs
SSR markers: a tool for species identification in Psidium (Myrtaceae).

Science.gov (United States)

Tuler, A C; Carrijo, T T; Nóia, L R; Ferreira, A; Peixoto, A L; da Silva Ferreira, M F

2015-11-01

Molecular DNA markers are used for detection of polymorphisms in individuals. As they are independent of developmental stage of the plant and environmental influences, they can be useful tools in taxonomy. The alleles of simple sequence repeat (SSR) markers (or microsatellites) are traditionally used to identify taxonomic units. This application demands the laborious and costly delimitation of exclusive alleles in order to avoid homoplasy. Here, we propose a method for identification of species based on the amplification profile of groups of SSR markers obtained by a transferability study. The approach considers that the SSR are conserved among related species. In this context, using Psidium as a model, 141 SSR markers developed for Psidium guajava were transferred to 13 indigenous species of Psidium from the Atlantic Rainforest. Transferability of the markers was high and 28 SSR were conserved in all species. Four SSR groups were defined and they can help in the identification of all 13 Psidium species studied. A group of 31 SSR was genotyped, with one to six alleles each. The H0 varied from 0.0 to 0.46, and PIC from 0.0 to 0.74. Cluster analysis revealed shared alleles among species. The high percentage of SSR transferability found in Psidium evidences the narrow phylogenetic relationship existing among these species since transferability occurs by the preservation of the microsatellites and anchoring regions. The proposed method was useful for distinguishing the species of Psidium, being useful in taxonomic studies.
Genome-Wide Development of MicroRNA-Based SSR Markers in Medicago truncatula with Their Transferability Analysis and Utilization in Related Legume Species.

Science.gov (United States)

Min, Xueyang; Zhang, Zhengshe; Liu, Yisong; Wei, Xingyi; Liu, Zhipeng; Wang, Yanrong; Liu, Wenxian

2017-11-18

Microsatellite (simple sequence repeats, SSRs) marker is one of the most widely used markers in marker-assisted breeding. As one type of functional markers, MicroRNA-based SSR (miRNA-SSR) markers have been exploited mainly in animals, but the development and characterization of miRNA-SSR markers in plants are still limited. In the present study, miRNA-SSR markers for Medicago truncatula ( M. truncatula ) were developed and their cross-species transferability in six leguminous species was evaluated. A total of 169 primer pairs were successfully designed from 130 M. truncatula miRNA genes, the majority of which were mononucleotide repeats (70.41%), followed by dinucleotide repeats (14.20%), compound repeats (11.24%) and trinucleotide repeats (4.14%). Functional classification of SSR-containing miRNA genes showed that all targets could be grouped into three Gene Ontology (GO) categories: 17 in biological process, 11 in molecular function, and 14 in cellular component. The miRNA-SSR markers showed high transferability in other six leguminous species, ranged from 74.56% to 90.53%. Furthermore, 25 Mt - miRNA-SSR markers were used to evaluate polymorphisms in 20 alfalfa accessions, and the polymorphism information content (PIC) values ranged from 0.39 to 0.89 with an average of 0.71, the allele number per marker varied from 3 to 18 with an average of 7.88, indicating a high level of informativeness. The present study is the first time developed and characterized of M. truncatula miRNA-SSRs and demonstrated their utility in transferability, these novel markers will be valuable for genetic diversity analysis, marker-assisted selection and genotyping in leguminous species.
GarlicESTdb: an online database and mining tool for garlic EST sequences

Directory of Open Access Journals (Sweden)

Choi Sang-Haeng

2009-05-01

Full Text Available Abstract Background Allium sativum., commonly known as garlic, is a species in the onion genus (Allium, which is a large and diverse one containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary, medicinal use and health benefits. Currently, the interest in garlic is highly increasing due to nutritional and pharmaceutical value including high blood pressure and cholesterol, atherosclerosis and cancer. For all that, there are no comprehensive databases available for Expressed Sequence Tags(EST of garlic for gene discovery and future efforts of genome annotation. That is why we developed a new garlic database and applications to enable comprehensive analysis of garlic gene expression. Description GarlicESTdb is an integrated database and mining tool for large-scale garlic (Allium sativum EST sequencing. A total of 21,595 ESTs collected from an in-house cDNA library were used to construct the database. The analysis pipeline is an automated system written in JAVA and consists of the following components: automatic preprocessing of EST reads, assembly of raw sequences, annotation of the assembled sequences, storage of the analyzed information into MySQL databases, and graphic display of all processed data. A web application was implemented with the latest J2EE (Java 2 Platform Enterprise Edition software technology (JSP/EJB/JavaServlet for browsing and querying the database, for creation of dynamic web pages on the client side, and for mapping annotated enzymes to KEGG pathways, the AJAX framework was also used partially. The online resources, such as putative annotation, single nucleotide polymorphisms (SNP and tandem repeat data sets, can be searched by text, explored on the website, searched using BLAST, and downloaded. To archive more significant BLAST results, a curation system was introduced with which biologists can easily edit best-hit annotation
GarlicESTdb: an online database and mining tool for garlic EST sequences.

Science.gov (United States)

Kim, Dae-Won; Jung, Tae-Sung; Nam, Seong-Hyeuk; Kwon, Hyuk-Ryul; Kim, Aeri; Chae, Sung-Hwa; Choi, Sang-Haeng; Kim, Dong-Wook; Kim, Ryong Nam; Park, Hong-Seog

2009-05-18

Allium sativum., commonly known as garlic, is a species in the onion genus (Allium), which is a large and diverse one containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary, medicinal use and health benefits. Currently, the interest in garlic is highly increasing due to nutritional and pharmaceutical value including high blood pressure and cholesterol, atherosclerosis and cancer. For all that, there are no comprehensive databases available for Expressed Sequence Tags(EST) of garlic for gene discovery and future efforts of genome annotation. That is why we developed a new garlic database and applications to enable comprehensive analysis of garlic gene expression. GarlicESTdb is an integrated database and mining tool for large-scale garlic (Allium sativum) EST sequencing. A total of 21,595 ESTs collected from an in-house cDNA library were used to construct the database. The analysis pipeline is an automated system written in JAVA and consists of the following components: automatic preprocessing of EST reads, assembly of raw sequences, annotation of the assembled sequences, storage of the analyzed information into MySQL databases, and graphic display of all processed data. A web application was implemented with the latest J2EE (Java 2 Platform Enterprise Edition) software technology (JSP/EJB/JavaServlet) for browsing and querying the database, for creation of dynamic web pages on the client side, and for mapping annotated enzymes to KEGG pathways, the AJAX framework was also used partially. The online resources, such as putative annotation, single nucleotide polymorphisms (SNP) and tandem repeat data sets, can be searched by text, explored on the website, searched using BLAST, and downloaded. To archive more significant BLAST results, a curation system was introduced with which biologists can easily edit best-hit annotation information for others to view. The Garlic
Development of SSR markers and construction of a linkage map in jute

Indian Academy of Sciences (India)

We recently initiated a programme to develop simple sequence repeat markers and reported a set of 2469 SSR that were developed using four SSR-enriched libraries (Mir et al. 2009). In this communication, we report an additional set of 607 novel SSR in 393 SSR containing sequences. However, primers could be ...
New device based on the super spatial resolution (SSR) method

International Nuclear Information System (INIS)

Soluri, A.; Atzeni, G.; Ucci, A.; Bellone, T.; Cusanno, F.; Rodilossi, G.; Massari, R.

2013-01-01

Recently it have been described that innovative methods, namely Super Spatial Resolution (SSR), can be used to improve the scintigraphic imaging. The aim of SSR techniques is the enhancement of the resolution of an imaging system, using information from several images. In this paper we describe a new experimental apparatus that could be used for molecular imaging and small animal imaging. In fact we present a new device, completely automated, that uses the SSR method and provides images with better spatial resolution in comparison to the original resolution. Preliminary small animal imaging studies confirm the feasibility of a very high resolution system in scintigraphic imaging and the possibility to have gamma cameras using the SSR method, to perform the applications on functional imaging. -- Highlights: • Super spatial resolution brings a high resolution image from scintigraphic images. • Resolution improvement depends on the signal to noise ratio of the original images. • The SSR shows significant improvement on spatial resolution in scintigraphic images. • The SSR method is potentially utilizable for all scintigraphic devices
Development and characterization of a Psathyrostachys huashanica Keng 7Ns chromosome addition line with leaf rust resistance.

Directory of Open Access Journals (Sweden)

Wanli Du

Full Text Available The aim of this study was to characterize a Triticum aestivum-Psathyrostachys huashanica Keng (2n = 2x = 14, NsNs disomic addition line 2-1-6-3. Individual line 2-1-6-3 plants were analyzed using cytological, genomic in situ hybridization (GISH, EST-SSR, and EST-STS techniques. The alien addition line 2-1-6-3 was shown to have two P. huashanica chromosomes, with a meiotic configuration of 2n = 44 = 22 II. We tested 55 EST-SSR and 336 EST-STS primer pairs that mapped onto seven different wheat chromosomes using DNA from parents and the P. huashanica addition line. One EST-SSR and nine EST-STS primer pairs indicated that the additional chromosome of P. huashanica belonged to homoeologous group 7, the diagnostic fragments of five EST-STS markers (BE404955, BE591127, BE637663, BF482781 and CD452422 were cloned, sequenced and compared. The results showed that the amplified polymorphic bands of P. huashanica and disomic addition line 2-1-6-3 shared 100% sequence identity, which was designated as the 7Ns disomic addition line. Disomic addition line 2-1-6-3 was evaluated to test the leaf rust resistance of adult stages in the field. We found that one pair of the 7Ns genome chromosomes carried new leaf rust resistance gene(s. Moreover, wheat line 2-1-6-3 had a superior numbers of florets and grains per spike, which were associated with the introgression of the paired P. huashanica chromosomes. These high levels of disease resistance and stable, excellent agronomic traits suggest that this line could be utilized as a novel donor in wheat breeding programs.
Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

Directory of Open Access Journals (Sweden)

Gao Zhihong

2010-07-01

Full Text Available Abstract Background Expressed Sequence Tag (EST has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons and that have been deposited in NCBI (accession IDs: GW868575 - GW873047, among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65% and low in the peach (46%, and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species.
Association of AFLP and SSR markers with agronomic and fibre ...

Indian Academy of Sciences (India)

We have attempted to tag yield and fibre quality traits with AFLP and SSR markers using F2 and F3 populations of a cross between two Gossypium hirsutum varieties, PS56-4 and RS2013. Out of 50 AFLP primer combinations and 177 SSR primer pairs tested, 32 AFLP and four SSR primers were chosen for genotyping F2 ...
Reimagining SSR in Contexts of Security Pluralism

Directory of Open Access Journals (Sweden)

Megan Price

2017-07-01

Full Text Available Within the repertoire of international stabilization interventions, security sector reform (SSR and other conventional efforts to strengthen security and governance institutions remain central. There is increasing recognition that the policies and practices operating under the rubric of SSR are blind to the empirical reality of 'security pluralism' in most stabilization contexts. In these contexts, both security providers directly authorized by the state (police, army and a multitude of other coercive actors engage in producing and reproducing order, and enjoy varying degrees of public authority and legitimacy. Recognizing this, research was undertaken in three cities (Beirut, Nairobi, and Tunis to discern the conditions enabling various security providers to forge constructive relations with local populations and governance actors. Drawing on insights generated by these case studies, this article problematizes conventional state-centric approaches and argues for a bold reimagining of SSR. It makes the case for an SSR approach that prioritizes promoting the accountability and responsiveness of all security providers, integrating efforts to strengthen the social determinants of security, and enabling a phased transition from relational to rules-based systems of security provision and governance.
An Efficient Strategy Combining SSR Markers- and Advanced QTL-seq-driven QTL Mapping Unravels Candidate Genes Regulating Grain Weight in Rice.

Science.gov (United States)

Daware, Anurag; Das, Sweta; Srivastava, Rishi; Badoni, Saurabh; Singh, Ashok K; Agarwal, Pinky; Parida, Swarup K; Tyagi, Akhilesh K

2016-01-01

Development and use of genome-wide informative simple sequence repeat (SSR) markers and novel integrated genomic strategies are vital to drive genomics-assisted breeding applications and for efficient dissection of quantitative trait loci (QTLs) underlying complex traits in rice. The present study developed 6244 genome-wide informative SSR markers exhibiting in silico fragment length polymorphism based on repeat-unit variations among genomic sequences of 11 indica, japonica, aus , and wild rice accessions. These markers were mapped on diverse coding and non-coding sequence components of known cloned/candidate genes annotated from 12 chromosomes and revealed a much higher amplification (97%) and polymorphic potential (88%) along with wider genetic/functional diversity level (16-74% with a mean 53%) especially among accessions belonging to indica cultivar group, suggesting their utility in large-scale genomics-assisted breeding applications in rice. A high-density 3791 SSR markers-anchored genetic linkage map (IR 64 × Sonasal) spanning 2060 cM total map-length with an average inter-marker distance of 0.54 cM was generated. This reference genetic map identified six major genomic regions harboring robust QTLs (31% combined phenotypic variation explained with a 5.7-8.7 LOD) governing grain weight on six rice chromosomes. One strong grain weight major QTL region ( OsqGW5.1 ) was narrowed-down by integrating traditional QTL mapping with high-resolution QTL region-specific integrated SSR and single nucleotide polymorphism markers-based QTL-seq analysis and differential expression profiling. This led us to delineate two natural allelic variants in two known cis -regulatory elements (RAV1AAT and CARGCW8GAT) of glycosyl hydrolase and serine carboxypeptidase genes exhibiting pronounced seed-specific differential regulation in low (Sonasal) and high (IR 64) grain weight mapping parental accessions. Our genome-wide SSR marker resource (polymorphic within/between diverse
COGNATE: comparative gene annotation characterizer.

Science.gov (United States)

Wilbrandt, Jeanne; Misof, Bernhard; Niehuis, Oliver

2017-07-17

The comparison of gene and genome structures across species has the potential to reveal major trends of genome evolution. However, such a comparative approach is currently hampered by a lack of standardization (e.g., Elliott TA, Gregory TR, Philos Trans Royal Soc B: Biol Sci 370:20140331, 2015). For example, testing the hypothesis that the total amount of coding sequences is a reliable measure of potential proteome diversity (Wang M, Kurland CG, Caetano-Anollés G, PNAS 108:11954, 2011) requires the application of standardized definitions of coding sequence and genes to create both comparable and comprehensive data sets and corresponding summary statistics. However, such standard definitions either do not exist or are not consistently applied. These circumstances call for a standard at the descriptive level using a minimum of parameters as well as an undeviating use of standardized terms, and for software that infers the required data under these strict definitions. The acquisition of a comprehensive, descriptive, and standardized set of parameters and summary statistics for genome publications and further analyses can thus greatly benefit from the availability of an easy to use standard tool. We developed a new open-source command-line tool, COGNATE (Comparative Gene Annotation Characterizer), which uses a given genome assembly and its annotation of protein-coding genes for a detailed description of the respective gene and genome structure parameters. Additionally, we revised the standard definitions of gene and genome structures and provide the definitions used by COGNATE as a working draft suggestion for further reference. Complete parameter lists and summary statistics are inferred using this set of definitions to allow down-stream analyses and to provide an overview of the genome and gene repertoire characteristics. COGNATE is written in Perl and freely available at the ZFMK homepage ( https://www.zfmk.de/en/COGNATE ) and on github ( https

Gene mining a marama bean expressed sequence tags (ESTs ...

African Journals Online (AJOL)

The authors reported the identification of genes associated with embryonic development and microsatellite sequences. The future direction will entail characterization of these genes using gene over-expression and mutant assays. Key words: Namibia, simple sequence repeats (SSR), data mining, homology searches, ...
MEETING: Chlamydomonas Annotation Jamboree - October 2003

Energy Technology Data Exchange (ETDEWEB)

Grossman, Arthur R

2007-04-13

Shotgun sequencing of the nuclear genome of Chlamydomonas reinhardtii (Chlamydomonas throughout) was performed at an approximate 10X coverage by JGI. Roughly half of the genome is now contained on 26 scaffolds, all of which are at least 1.6 Mb, and the coverage of the genome is ~95%. There are now over 200,000 cDNA sequence reads that we have generated as part of the Chlamydomonas genome project (Grossman, 2003; Shrager et al., 2003; Grossman et al. 2007; Merchant et al., 2007); other sequences have also been generated by the Kasuza sequence group (Asamizu et al., 1999; Asamizu et al., 2000) or individual laboratories that have focused on specific genes. Shrager et al. (2003) placed the reads into distinct contigs (an assemblage of reads with overlapping nucleotide sequences), and contigs that group together as part of the same genes have been designated ACEs (assembly of contigs generated from EST information). All of the reads have also been mapped to the Chlamydomonas nuclear genome and the cDNAs and their corresponding genomic sequences have been reassembled, and the resulting assemblage is called an ACEG (an Assembly of contiguous EST sequences supported by genomic sequence) (Jain et al., 2007). Most of the unique genes or ACEGs are also represented by gene models that have been generated by the Joint Genome Institute (JGI, Walnut Creek, CA). These gene models have been placed onto the DNA scaffolds and are presented as a track on the Chlamydomonas genome browser associated with the genome portal (http://genome.jgi-psf.org/Chlre3/Chlre3.home.html). Ultimately, the meeting grant awarded by DOE has helped enormously in the development of an annotation pipeline (a set of guidelines used in the annotation of genes) and resulted in high quality annotation of over 4,000 genes; the annotators were from both Europe and the USA. Some of the people who led the annotation initiative were Arthur Grossman, Olivier Vallon, and Sabeeha Merchant (with many individual
SAT, a flexible and optimized Web application for SSR marker development

Directory of Open Access Journals (Sweden)

Rami Jean-François

2007-11-01

Full Text Available Abstract Background Simple Sequence Repeats (SSRs, or microsatellites, are among the most powerful genetic markers known. A common method for the development of SSR markers is the construction of genomic DNA libraries enriched for SSR sequences, followed by DNA sequencing. However, designing optimal SSR markers from bulk sequence data is a laborious and time-consuming process. Results SAT (SSR Analysis Tool is a user-friendly Web application developed to minimize tedious manual operations and reduce errors. This tool facilitates the integration, analysis and display of sequence data from SSR-enriched libraries. SAT is designed to successively perform base calling and quality evaluation of chromatograms, eliminate cloning vector, adaptors and low quality sequences, detect chimera or partially digested sequences, search for SSR motifs, cluster and assemble the redundant sequences, and design SSR primer pairs. An additional virtual PCR step establishes primer specificity. Users may modify the different parameters of each step of the SAT analysis. Although certain steps are compulsory, such as SSR motifs search and sequence assembly, users do not have to run the entire pipeline, and they can choose selectively which steps to perform. A database allows users to store and query results, and to redo individual steps of the workflow. Conclusion The SAT Web application is available at http://sat.cirad.fr/sat, and a standalone command-line version is also freely downloadable. Users must send an email to the SAT administrator tropgene@cirad.fr to request a login and password.
Analysis of the leaf transcriptome of Musa acuminata during interaction with Mycosphaerella musicola: gene assembly, annotation and marker development.

Science.gov (United States)

Passos, Marco A N; de Cruz, Viviane Oliveira; Emediato, Flavia L; de Teixeira, Cristiane Camargo; Azevedo, Vânia C Rennó; Brasileiro, Ana C M; Amorim, Edson P; Ferreira, Claudia F; Martins, Natalia F; Togawa, Roberto C; Júnior, Georgios J Pappas; da Silva, Orzenil Bonfim; Miller, Robert N G

2013-02-05

Although banana (Musa sp.) is an important edible crop, contributing towards poverty alleviation and food security, limited transcriptome datasets are available for use in accelerated molecular-based breeding in this genus. 454 GS-FLX Titanium technology was employed to determine the sequence of gene transcripts in genotypes of Musa acuminata ssp. burmannicoides Calcutta 4 and M. acuminata subgroup Cavendish cv. Grande Naine, contrasting in resistance to the fungal pathogen Mycosphaerella musicola, causal organism of Sigatoka leaf spot disease. To enrich for transcripts under biotic stress responses, full length-enriched cDNA libraries were prepared from whole plant leaf materials, both uninfected and artificially challenged with pathogen conidiospores. The study generated 846,762 high quality sequence reads, with an average length of 334 bp and totalling 283 Mbp. De novo assembly generated 36,384 and 35,269 unigene sequences for M. acuminata Calcutta 4 and Cavendish Grande Naine, respectively. A total of 64.4% of the unigenes were annotated through Basic Local Alignment Search Tool (BLAST) similarity analyses against public databases.Assembled sequences were functionally mapped to Gene Ontology (GO) terms, with unigene functions covering a diverse range of molecular functions, biological processes and cellular components. Genes from a number of defense-related pathways were observed in transcripts from each cDNA library. Over 99% of contig unigenes mapped to exon regions in the reference M. acuminata DH Pahang whole genome sequence. A total of 4068 genic-SSR loci were identified in Calcutta 4 and 4095 in Cavendish Grande Naine. A subset of 95 potential defense-related gene-derived simple sequence repeat (SSR) loci were validated for specific amplification and polymorphism across M. acuminata accessions. Fourteen loci were polymorphic, with alleles per polymorphic locus ranging from 3 to 8 and polymorphism information content ranging from 0.34 to 0.82. A large set
The Physalis peruviana leaf transcriptome: assembly, annotation and gene model prediction

Directory of Open Access Journals (Sweden)

Garzón-Martínez Gina A

2012-04-01

Full Text Available Abstract Background Physalis peruviana commonly known as Cape gooseberry is a member of the Solanaceae family that has an increasing popularity due to its nutritional and medicinal values. A broad range of genomic tools is available for other Solanaceae, including tomato and potato. However, limited genomic resources are currently available for Cape gooseberry. Results We report the generation of a total of 652,614 P. peruviana Expressed Sequence Tags (ESTs, using 454 GS FLX Titanium technology. ESTs, with an average length of 371 bp, were obtained from a normalized leaf cDNA library prepared using a Colombian commercial variety. De novo assembling was performed to generate a collection of 24,014 isotigs and 110,921 singletons, with an average length of 1,638 bp and 354 bp, respectively. Functional annotation was performed using NCBI’s BLAST tools and Blast2GO, which identified putative functions for 21,191 assembled sequences, including gene families involved in all the major biological processes and molecular functions as well as defense response and amino acid metabolism pathways. Gene model predictions in P. peruviana were obtained by using the genomes of Solanum lycopersicum (tomato and Solanum tuberosum (potato. We predict 9,436 P. peruviana sequences with multiple-exon models and conserved intron positions with respect to the potato and tomato genomes. Additionally, to study species diversity we developed 5,971 SSR markers from assembled ESTs. Conclusions We present the first comprehensive analysis of the Physalis peruviana leaf transcriptome, which will provide valuable resources for development of genetic tools in the species. Assembled transcripts with gene models could serve as potential candidates for marker discovery with a variety of applications including: functional diversity, conservation and improvement to increase productivity and fruit quality. P. peruviana was estimated to be phylogenetically branched out before the
The Physalis peruviana leaf transcriptome: assembly, annotation and gene model prediction.

Science.gov (United States)

Garzón-Martínez, Gina A; Zhu, Z Iris; Landsman, David; Barrero, Luz S; Mariño-Ramírez, Leonardo

2012-04-25

Physalis peruviana commonly known as Cape gooseberry is a member of the Solanaceae family that has an increasing popularity due to its nutritional and medicinal values. A broad range of genomic tools is available for other Solanaceae, including tomato and potato. However, limited genomic resources are currently available for Cape gooseberry. We report the generation of a total of 652,614 P. peruviana Expressed Sequence Tags (ESTs), using 454 GS FLX Titanium technology. ESTs, with an average length of 371 bp, were obtained from a normalized leaf cDNA library prepared using a Colombian commercial variety. De novo assembling was performed to generate a collection of 24,014 isotigs and 110,921 singletons, with an average length of 1,638 bp and 354 bp, respectively. Functional annotation was performed using NCBI's BLAST tools and Blast2GO, which identified putative functions for 21,191 assembled sequences, including gene families involved in all the major biological processes and molecular functions as well as defense response and amino acid metabolism pathways. Gene model predictions in P. peruviana were obtained by using the genomes of Solanum lycopersicum (tomato) and Solanum tuberosum (potato). We predict 9,436 P. peruviana sequences with multiple-exon models and conserved intron positions with respect to the potato and tomato genomes. Additionally, to study species diversity we developed 5,971 SSR markers from assembled ESTs. We present the first comprehensive analysis of the Physalis peruviana leaf transcriptome, which will provide valuable resources for development of genetic tools in the species. Assembled transcripts with gene models could serve as potential candidates for marker discovery with a variety of applications including: functional diversity, conservation and improvement to increase productivity and fruit quality. P. peruviana was estimated to be phylogenetically branched out before the divergence of five other Solanaceae family members, S
Large-scale development of PIP and SSR markers and their complementary applied in Nicotiana.

Science.gov (United States)

Huang, L; Cao, H; Yang, L; Yu, Yu; Wang, Yu

2013-08-01

PIP (Potential Intron Polymorphism) and SSR (Simple Sequence Repeats) were used in many species, but large-scale development and combined use of these two markers have not been reported in tobacco. In this study, a total of 12,388 PIP and 76,848 SSR markers were designed and uploaded to a web-accessible database (http://yancao.sdau.edu.cn/tgb/). E-PCR analysis showed that PIP and SSR rarely overlapped and were strongly complementary in the tobacco genome. The density was 3.07 PIP and 1.72 SSR markers per 10 kb of the known sequences. A total of 153 and 166 alleles were detectedby 22 PIP and 22 SSR markers in 64 Nicotiana accessions. SSR produced higher PIC (polymorphism information content) values and identified more alleles than PIP, whereas PIP could identify larger numbers of rare alleles. Mantel testing demonstrated a high correlation coefficient (r = 0.949, P SSR. The UPGMA dendrogram created from the combined PIP and SSR markers was clearer and more reliable than the individual PIP or SSR dendrograms. It suggested that PIP and SSR can make up the deficiency of molecular markers not only in tobacco but other plant.
Genetic Characterization of Turkish Snake Melon (Cucumis melo L. subsp. melo flexuosus Group) Accessions Revealed by SSR Markers.

Science.gov (United States)

Solmaz, Ilknur; Kacar, Yildiz Aka; Simsek, Ozhan; Sari, Nebahat

2016-08-01

Snake melon is an important cucurbit crop especially in the Southeastern and the Mediterranean region of Turkey. It is consumed as fresh or pickled. The production is mainly done with the local landraces in the country. Turkey is one of the secondary diversification centers of melon and possesses valuable genetic resources which have different morphological characteristics in case of snake melon. Genetic diversity of snake melon genotypes collected from different regions of Turkey and reference genotypes obtained from World Melon Gene Bank in Avignon-France was examined using 13 simple sequence repeat (SSR) markers. A total of 69 alleles were detected, with an average of 5.31 alleles per locus. The polymorphism information content of SSR markers ranged from 0.19 to 0.57 (average 0.38). Based on cluster analysis, two major groups were defined. The first major group included only one accession (61), while the rest of all accessions grouped in the second major group and separated into different sub-clusters. Based on SSR markers, cluster analysis indicated that considerably high genetic variability exists among the examined accessions; however, Turkish snake melon accessions were grouped together with the reference snake melon accessions.
Experiment data report for semiscale Mod-2A primary feed and bleed experiment series (Tests S-SR-1 and S-SR-2)

International Nuclear Information System (INIS)

Fogdall, S.P.

1982-10-01

This report presents test data recorded for Tests S-SR-1 and S-SR-2 of the Semiscale Mod-2A Primary Feed and Bleed Tests. These tests are part of a series of Semiscale tests that investigate the thermal-hydraulic phenomena resulting from a hypothesized loss-of-coolant accident (LOCA) or abnormal operating transient. These tests provide experimental data for assessing the analytical capability of computer codes used in LOCA and operational transient analysis. The primary objectives of Tests S-SR-1 and -2 were to provide data on primary system recovery through the use of primary feed and bleed cooling, with no heat transfer to the secondaries. Data was obtained using high- and low-head pump curves for the safety injection (SI) pumps. This report presents the uninterpreted data from Tests S-SR-1 and -2 for analysis. The data, presented as graphs in engineering units, have been analyzed only to the extent necessary to ensure that they are reasonable and consistent
SSR-CE/FD assessment of Guizhou approved sugarcane cultivars and regional materials

Science.gov (United States)

Twelve sugarcane genotypes (three cultivars and nine clones involved in regional tests) from Guizhou Province, China were analyzed using SSR-capillary electrophoresis/fluorescence detection (SSR-CE/FD) technology to construct the SSR fingerprints and assess the genetic diversity. A total of 131 DNA ...
Molecular characterization and genetic diversity of Jatropha curcas L. in Costa Rica

Science.gov (United States)

Vásquez-Mayorga, Marcela; Fuchs, Eric J.; Hernández, Eduardo J.; Herrera, Franklin; Hernández, Jesús; Moreira, Ileana; Arnáez, Elizabeth

2017-01-01

We estimated the genetic diversity of 50 Jatropha curcas samples from the Costa Rican germplasm bank using 18 EST-SSR, one G-SSR and nrDNA-ITS markers. We also evaluated the phylogenetic relationships among samples using nuclear ribosomal ITS markers. Non-toxicity was evaluated using G-SSRs and SCARs markers. A Neighbor-Joining (NJ) tree and a Maximum Likelihood (ML) tree were constructed using SSR markers and ITS sequences, respectively. Heterozygosity was moderate (He = 0.346), but considerable compared to worldwide values for J. curcas. The PIC (PIC = 0.274) and inbreeding coefficient (f = − 0.102) were both low. Clustering was not related to the geographical origin of accessions. International accessions clustered independently of collection sites, suggesting a lack of genetic structure, probably due to the wide distribution of this crop and ample gene flow. Molecular markers identified only one non-toxic accession (JCCR-24) from Mexico. This work is part of a countrywide effort to characterize the genetic diversity of the Jatropha curcas germplasm bank in Costa Rica. PMID:28289556
Molecular characterization and genetic diversity of Jatropha curcas L. in Costa Rica

Directory of Open Access Journals (Sweden)

Marcela Vásquez-Mayorga

2017-02-01

Full Text Available We estimated the genetic diversity of 50 Jatropha curcas samples from the Costa Rican germplasm bank using 18 EST-SSR, one G-SSR and nrDNA-ITS markers. We also evaluated the phylogenetic relationships among samples using nuclear ribosomal ITS markers. Non-toxicity was evaluated using G-SSRs and SCARs markers. A Neighbor-Joining (NJ tree and a Maximum Likelihood (ML tree were constructed using SSR markers and ITS sequences, respectively. Heterozygosity was moderate (He = 0.346, but considerable compared to worldwide values for J. curcas. The PIC (PIC = 0.274 and inbreeding coefficient (f = − 0.102 were both low. Clustering was not related to the geographical origin of accessions. International accessions clustered independently of collection sites, suggesting a lack of genetic structure, probably due to the wide distribution of this crop and ample gene flow. Molecular markers identified only one non-toxic accession (JCCR-24 from Mexico. This work is part of a countrywide effort to characterize the genetic diversity of the Jatropha curcas germplasm bank in Costa Rica.
Genetic variation and DNA fingerprinting of durian types in Malaysia using simple sequence repeat (SSR) markers.

Science.gov (United States)

Siew, Ging Yang; Ng, Wei Lun; Tan, Sheau Wei; Alitheen, Noorjahan Banu; Tan, Soon Guan; Yeap, Swee Keong

2018-01-01

Durian ( Durio zibethinus ) is one of the most popular tropical fruits in Asia. To date, 126 durian types have been registered with the Department of Agriculture in Malaysia based on phenotypic characteristics. Classification based on morphology is convenient, easy, and fast but it suffers from phenotypic plasticity as a direct result of environmental factors and age. To overcome the limitation of morphological classification, there is a need to carry out genetic characterization of the various durian types. Such data is important for the evaluation and management of durian genetic resources in producing countries. In this study, simple sequence repeat (SSR) markers were used to study the genetic variation in 27 durian types from the germplasm collection of Universiti Putra Malaysia. Based on DNA sequences deposited in Genbank, seven pairs of primers were successfully designed to amplify SSR regions in the durian DNA samples. High levels of variation among the 27 durian types were observed (expected heterozygosity, H E = 0.35). The DNA fingerprinting power of SSR markers revealed by the combined probability of identity (PI) of all loci was 2.3×10 -3 . Unique DNA fingerprints were generated for 21 out of 27 durian types using five polymorphic SSR markers (the other two SSR markers were monomorphic). We further tested the utility of these markers by evaluating the clonal status of shared durian types from different germplasm collection sites, and found that some were not clones. The findings in this preliminary study not only shows the feasibility of using SSR markers for DNA fingerprinting of durian types, but also challenges the current classification of durian types, e.g., on whether the different types should be called "clones", "varieties", or "cultivars". Such matters have a direct impact on the regulation and management of durian genetic resources in the region.
Development of SSR markers for a Tibetan medicinal plant, Lancea tibetica (Phrymaceae), based on RAD sequencing.

Science.gov (United States)

Tian, Zunzhe; Zhang, Faqi; Liu, Hairui; Gao, Qingbo; Chen, Shilong

2016-11-01

Lancea tibetica (Phrymaceae), a Tibetan medicinal plant, is endemic to the Qinghai-Tibet Plateau. The over-exploitation of wild L. tibetica has led to the destruction of many populations. To enhance protection and management, biological research, especially population genetic studies, should be carried out on L. tibetica . Simple sequence repeat (SSR) markers of L. tibetica were developed to analyze population diversity. Four thousand four hundred and forty-one SSR loci were identified for L. tibetica based on restriction-site associated DNA (RAD) sequencing on the Illumina HiSeq platform. One hundred SSR loci were arbitrarily selected for primer design, and 38 of them were successfully amplified. These markers were tested on 56 individuals from three populations of L. tibetica , and 10 markers displayed polymorphisms. The total number of alleles per locus ranged from three to eight, and observed and expected heterozygosities ranged from 0.200 to 1.000 and 0.683 to 0.879, respectively. We tested for cross-amplification of these 10 markers in the related species L. hirsuta and found that nine could be successfully amplified. The SSR markers characterized here are the first to be developed and tested in L. tibetica . They will be useful for future population genetic studies on L. tibetica and closely related species.
Transcriptome characterization of the South African abalone Haliotis midae using sequencing-by-synthesis

Directory of Open Access Journals (Sweden)

Roodt-Wilding Rouvay

2011-03-01

Full Text Available Abstract Background Worldwide, the genus Haliotis is represented by 56 extant species and several of these are commercially cultured. Among the six abalone species found in South Africa, Haliotis midae is the only aquacultured species. Despite its economic importance, genomic sequence resources for H. midae, and for abalone in general, are still scarce. Next generation sequencing technologies provide a fast and efficient tool to generate large sequence collections that can be used to characterize the transcriptome and identify expressed genes associated with economically important traits like growth and disease resistance. Results More than 25 million short reads generated by the Illumina Genome Analyzer were de novo assembled in 22,761 contigs with an average size of 260 bp. With a stringent E-value threshold of 10-10, 3,841 contigs (16.8% had a BLAST homologous match against the Genbank non-redundant (NR protein database. Most of these sequences were annotated using the gene ontology (GO and eukaryotic orthologous groups of proteins (KOG databases and assigned to various functional categories. According to annotation results, many gene families involved in immune response were identified. Thousands of simple sequence repeats (SSR and single nucleotide polymorphisms (SNP were detected. Setting stringent parameters to ensure a high probability of amplification, 420 primer pairs in 181 contigs containing SSR loci were designed. Conclusion This data represents the most comprehensive genomic resource for the South African abalone H. midae to date. The amount of assembled sequences demonstrated the utility of the Illumina sequencing technology in the transcriptome characterization of a non-model species. It allowed the development of several markers and the identification of promising candidate genes for future studies on population and functional genomics in H. midae and in other abalone species.
Genetic diversity and population structure of Brassica oleracea germplasm in Ireland using SSR markers.

Science.gov (United States)

El-Esawi, Mohamed A; Germaine, Kieran; Bourke, Paula; Malone, Renee

2016-01-01

The most economically important Brassica oleracea species is endangered in Ireland, with no prior reported genetic characterization studies. This study assesses the genetic diversity, population structure and relationships of B. oleracea germplasm in Ireland using microsatellite (SSRs) markers. A total of 118 individuals from 25 accessions of Irish B. oleracea were genotyped. The SSR loci used revealed a total of 47 alleles. The observed heterozygosity (0.699) was higher than the expected one (0.417). Moreover, the average values of fixation indices (F) were negative, indicating excess of heterozygotes in all accessions. Polymorphic information content (PIC) values of SSR loci ranged from 0.27 to 0.66, with an average of 0.571, and classified 10 loci as informative markers (PIC>0.5) to differentiate among the accessions studied. The genetic differentiation among accessions showed that 27.1% of the total genetic variation was found among accessions, and 72.9% of the variation resided within accessions. The averages of total heterozygosity (H(T)) and intra-accession genetic diversity (H(S)) were 0.577 and 0.442, respectively. Cluster analysis of SSR data distinguished among kale and Brussels sprouts cultivars. This study provided a new insight into the exploitation of the genetically diverse spring cabbages accessions, revealing a high genetic variation, as potential resources for future breeding programs. SSR loci were effective for differentiation among the accessions studied. Copyright © 2016 Académie des sciences. Published by Elsevier SAS. All rights reserved.
A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

Directory of Open Access Journals (Sweden)

Alamar Santiago

2009-09-01

Full Text Available Abstract Background Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. Conclusion The new
A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

Science.gov (United States)

Marques, M Carmen; Alonso-Cantabrana, Hugo; Forment, Javier; Arribas, Raquel; Alamar, Santiago; Conejero, Vicente; Perez-Amador, Miguel A

2009-01-01

Background Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. Conclusion The new EST collection denotes an
RAPD and SSR Polymorphisms in Mutant Lines of Transgenic Wheat Mediated by Low Energy Ion Beam

International Nuclear Information System (INIS)

Wang Tiegu; Huang Qunce; Feng Weisen

2007-01-01

Two types of markers-random amplified polymorphic DNA (RAPD) and simple sequence repeat DNA (SSR)-have been used to characterize the genetic diversity among nine mutant lines of transgenic wheat intermediated by low energy ion beam and their four receptor cultivars. The objectives of this study were to analyze RAPD-based and SSR-based genetic variance among transgenic wheat lines and with their receptors, and to find specific genetic markers of special traits of transgenic wheat lines. 170 RAPD primers were amplified to 733 fragments in all the experimental materials. There were 121 polymorphic fragments out of the 733 fragments with a ratio of polymorphic fragments of 16.5%. 29 SSR primer pairs were amplified to 83 fragments in all the experiment materials. There were 57 polymorphic fragments out of the 83 fragments with a ratio of polymorphic fragments of 68.7%. The dendrograms were prepared based on a genetic distance matrix using the UPGMA (Unweighted Pair-group Method with Arithmetic averaging) algorithm, which corresponded well to the results of the wheat pedigree analysis and separated the 13 genotypes into four groups. Association analysis between RAPD and SSR markers with the special traits of transgenic wheat mutant lines discovered that three RAPD markers, s1, opt-16, and f14, were significantly associated with the muticate trait, while three SSR markers, Rht8 (Xgwm261), Rht-B1b, and Rht-D1b, highly associated with the dwarf trait. These markers will be useful for marker-assistant breeding and can be used as candidate markers for further gene mapping and cloning
RAPD and SSR Polymorphisms in Mutant Lines of Transgenic Wheat Mediated by Low Energy Ion Beam

Science.gov (United States)

Wang, Tiegu; Huang, Qunce; Feng, Weisen

2007-10-01

Two types of markers-random amplified polymorphic DNA (RAPD) and simple sequence repeat DNA (SSR)-have been used to characterize the genetic diversity among nine mutant lines of transgenic wheat intermediated by low energy ion beam and their four receptor cultivars. The objectives of this study were to analyze RAPD-based and SSR-based genetic variance among transgenic wheat lines and with their receptors, and to find specific genetic markers of special traits of transgenic wheat lines. 170 RAPD primers were amplified to 733 fragments in all the experimental materials. There were 121 polymorphic fragments out of the 733 fragments with a ratio of polymorphic fragments of 16.5%. 29 SSR primer pairs were amplified to 83 fragments in all the experiment materials. There were 57 polymorphic fragments out of the 83 fragments with a ratio of polymorphic fragments of 68.7%. The dendrograms were prepared based on a genetic distance matrix using the UPGMA (Unweighted Pair-group Method with Arithmetic averaging) algorithm, which corresponded well to the results of the wheat pedigree analysis and separated the 13 genotypes into four groups. Association analysis between RAPD and SSR markers with the special traits of transgenic wheat mutant lines discovered that three RAPD markers, s1, opt-16, and f14, were significantly associated with the muticate trait, while three SSR markers, Rht8 (Xgwm261), Rht-B1b, and Rht-D1b, highly associated with the dwarf trait. These markers will be useful for marker-assistant breeding and can be used as candidate markers for further gene mapping and cloning.

RAPD and SSR Polymorphisms in Mutant Lines of Transgenic Wheat Mediated by Low Energy Ion Beam

Energy Technology Data Exchange (ETDEWEB)

Tiegu, Wang [Henan Provincial Key Laboratory of Ion Beam Bio-Engineering, Zhengzhou University, Zhengzhou 450052 (China); Qunce, Huang [Henan Provincial Key Laboratory of Ion Beam Bio-Engineering, Zhengzhou University, Zhengzhou 450052 (China); Weisen, Feng [Luoyang Institute of Agricultural Science, Luoyang 471022 (China)

2007-10-15

Two types of markers-random amplified polymorphic DNA (RAPD) and simple sequence repeat DNA (SSR)-have been used to characterize the genetic diversity among nine mutant lines of transgenic wheat intermediated by low energy ion beam and their four receptor cultivars. The objectives of this study were to analyze RAPD-based and SSR-based genetic variance among transgenic wheat lines and with their receptors, and to find specific genetic markers of special traits of transgenic wheat lines. 170 RAPD primers were amplified to 733 fragments in all the experimental materials. There were 121 polymorphic fragments out of the 733 fragments with a ratio of polymorphic fragments of 16.5%. 29 SSR primer pairs were amplified to 83 fragments in all the experiment materials. There were 57 polymorphic fragments out of the 83 fragments with a ratio of polymorphic fragments of 68.7%. The dendrograms were prepared based on a genetic distance matrix using the UPGMA (Unweighted Pair-group Method with Arithmetic averaging) algorithm, which corresponded well to the results of the wheat pedigree analysis and separated the 13 genotypes into four groups. Association analysis between RAPD and SSR markers with the special traits of transgenic wheat mutant lines discovered that three RAPD markers, s1, opt-16, and f14, were significantly associated with the muticate trait, while three SSR markers, Rht8 (Xgwm261), Rht-B1b, and Rht-D1b, highly associated with the dwarf trait. These markers will be useful for marker-assistant breeding and can be used as candidate markers for further gene mapping and cloning.
SSR: Its Effects on Students' Reading Habits after They Complete the Program.

Science.gov (United States)

Wiesendanger, Katherine D.; Bader, Lois

1989-01-01

Studies the effect of sustained silent reading (SSR) on recreational reading habits after termination of instruction. Finds that SSR students read more than those not in the program, and that SSR has no impact on above average readers, great impact on average readers, and little impact on below average readers. (RS)
New gSSR and EST-SSR markers reveal high genetic diversity in the invasive plant Ambrosia artemisiifolia L. and can be transferred to other invasive Ambrosia species.

Science.gov (United States)

Meyer, Lucie; Causse, Romain; Pernin, Fanny; Scalone, Romain; Bailly, Géraldine; Chauvel, Bruno; Délye, Christophe; Le Corre, Valérie

2017-01-01

Ambrosia artemisiifolia L., (common ragweed), is an annual invasive and highly troublesome plant species originating from North America that has become widespread across Europe. New sets of genomic and expressed sequence tag (EST) based simple sequence repeats (SSRs) markers were developed in this species using three approaches. After validation, 13 genomic SSRs and 13 EST-SSRs were retained and used to characterize the genetic diversity and population genetic structure of Ambrosia artemisiifolia populations from the native (North America) and invasive (Europe) ranges of the species. Analysing the mating system based on maternal families did not reveal any departure from complete allogamy and excess homozygosity was mostly due the presence of null alleles. High genetic diversity and patterns of genetic structure in Europe suggest two main introduction events followed by secondary colonization events. Cross-species transferability of the newly developed markers to other invasive species of the Ambrosia genus was assessed. Sixty-five percent and 75% of markers, respectively, were transferable from A. artemisiifolia to Ambrosia psilostachya and Ambrosia tenuifolia. 40% were transferable to Ambrosia trifida, this latter species being seemingly more phylogenetically distantly related to A. artemisiifolia than the former two.
On the status of Ukrainian SSR before the beginning of perestroika

Directory of Open Access Journals (Sweden)

А. А. Омарова

2014-12-01

Full Text Available The article studies legal status of Ukrainian SSR before the beginning of Perestroika. Constitutional acts, laws and other regulatory acts are analyzed which fixed the status of Ukrainian SSR within USSR. On the basis of study of regulatory sources and monographs we came to conclusion that a contradiction existed between legally fixed and actual status of Ukrainian SSR within USSR before the beginning of Perestroika.
Association of SSR markers with contents of fatty acids in olive oil and genetic diversity analysis of an olive core collection.

Science.gov (United States)

Ipek, M; Ipek, A; Seker, M; Gul, M K

2015-03-27

The purpose of this research was to characterize an olive core collection using some agronomic characters and simple sequence repeat (SSR) markers and to determine SSR markers associated with the content of fatty acids in olive oil. SSR marker analysis demonstrated the presence of a high amount of genetic variation between the olive cultivars analyzed. A UPGMA dendrogram demonstrated that olive cultivars did not cluster on the basis of their geographic origin. Fatty acid components of olive oil in these cultivars were determined. The results also showed that there was a great amount of variation between the olive cultivars in terms of fatty acid composition. For example, oleic acid content ranged from 57.76 to 76.9% with standard deviation of 5.10%. Significant correlations between fatty acids of olive oil were observed. For instance, a very high negative correlation (-0.812) between oleic and linoleic acids was detected. A structured association analysis between the content of fatty acids in olive oil and SSR markers was performed. STRUCTURE analysis assigned olive cultivars to two gene pools (K = 2). Assignment of olive cultivars to these gene pools was not based on geographical origin. Association between fatty acid traits and SSR markers was evaluated using the general linear model of TASSEL. Significant associations were determined between five SSR markers and stearic, oleic, linoleic, and linolenic acids of olive oil. Very high associations (P < 0.001) between ssrOeUA-DCA14 and stearic acid and between GAPU71B and oleic acid indicated that these markers could be used for marker-assisted selection in olive.
DNA profiling of pineapple cultivars in Japan discriminated by SSR markers

Science.gov (United States)

Shoda, Moriyuki; Urasaki, Naoya; Sakiyama, Sumisu; Terakami, Shingo; Hosaka, Fumiko; Shigeta, Narumi; Nishitani, Chikako; Yamamoto, Toshiya

2012-01-01

We developed 18 polymorphic simple sequence repeat (SSR) markers in pineapple (Ananas comosus) by using genomic libraries enriched for GA and CA motifs. The markers were used to genotype 31 pineapple accessions, including seven cultivars and 11 breeding lines from Okinawa Prefecture, 12 foreign accessions and one from a related species. These SSR loci were highly polymorphic: the 31 accessions contained three to seven alleles per locus, with an average of 4.1. The values of expected heterozygosity ranged from 0.09 to 0.76, with an average of 0.52. All 31 accessions could be successfully differentiated by the 18 SSR markers, with the exception of ‘N67-10’ and ‘Hawaiian Smooth Cayenne’. A single combination of three markers TsuAC004, TsuAC010 and TsuAC041, was enough to distinguish all accessions with one exception. A phenogram based on the SSR genotypes did not show any distinct groups, but it suggested that pineapples bred in Japan are genetically diversed. We reconfirmed the parentage of 14 pineapple accessions by comparing the SSR alleles at 17 SSR loci in each accession and its reported parents. The obtained information will contribute substantially to protecting plant breeders’ rights. PMID:23341750
De novo comparative transcriptome analysis of genes involved in fruit morphology of pumpkin cultivars with extreme size difference and development of EST-SSR markers.

Science.gov (United States)

Xanthopoulou, Aliki; Ganopoulos, Ioannis; Psomopoulos, Fotis; Manioudaki, Maria; Moysiadis, Theodoros; Kapazoglou, Aliki; Osathanunkul, Maslin; Michailidou, Sofia; Kalivas, Apostolos; Tsaftaris, Athanasios; Nianiou-Obeidat, Irini; Madesis, Panagiotis

2017-07-30

The genetic basis of fruit size and shape was investigated for the first time in Cucurbita species and genetic loci associated with fruit morphology have been identified. Although extensive genomic resources are available at present for tomato (Solanum lycopersicum), cucumber (Cucumis sativus), melon (Cucumis melo) and watermelon (Citrullus lanatus), genomic databases for Cucurbita species are limited. Recently, our group reported the generation of pumpkin (Cucurbita pepo) transcriptome databases from two contrasting cultivars with extreme fruit sizes. In the current study we used these databases to perform comparative transcriptome analysis in order to identify genes with potential roles in fruit morphology and fruit size. Differential Gene Expression (DGE) analysis between cv. 'Munchkin' (small-fruit) and cv. 'Big Moose' (large-fruit) revealed a variety of candidate genes associated with fruit morphology with significant differences in gene expression between the two cultivars. In addition, we have set the framework for generating EST-SSR markers, which discriminate different C. pepo cultivars and show transferability to related Cucurbitaceae species. The results of the present study will contribute to both further understanding the molecular mechanisms regulating fruit morphology and furthermore identifying the factors that determine fruit size. Moreover, they may lead to the development of molecular marker tools for selecting genotypes with desired morphological traits. Copyright © 2017. Published by Elsevier B.V.
simple sequence repeat (SSR)

African Journals Online (AJOL)

In the present study, 78 mapped simple sequence repeat (SSR) markers representing 11 linkage groups of adzuki bean were evaluated for transferability to mungbean and related Vigna spp. 41 markers amplified characteristic bands in at least one Vigna species. The transferability percentage across the genotypes ranged ...
Annotation of the protein coding regions of the equine genome

DEFF Research Database (Denmark)

Hestand, Matthew S.; Kalbfleisch, Theodore S.; Coleman, Stephen J.

2015-01-01

Current gene annotation of the horse genome is largely derived from in silico predictions and cross-species alignments. Only a small number of genes are annotated based on equine EST and mRNA sequences. To expand the number of equine genes annotated from equine experimental evidence, we sequenced m...... and appear to be small errors in the equine reference genome, since they are also identified as homozygous variants by genomic DNA resequencing of the reference horse. Taken together, we provide a resource of equine mRNA structures and protein coding variants that will enhance equine and cross...
ConiferEST: an integrated bioinformatics system for data reprocessing and mining of conifer expressed sequence tags (ESTs).

Science.gov (United States)

Liang, Chun; Wang, Gang; Liu, Lin; Ji, Guoli; Fang, Lin; Liu, Yuansheng; Carter, Kikia; Webb, Jason S; Dean, Jeffrey F D

2007-05-29

With the advent of low-cost, high-throughput sequencing, the amount of public domain Expressed Sequence Tag (EST) sequence data available for both model and non-model organism is growing exponentially. While these data are widely used for characterizing various genomes, they also present a serious challenge for data quality control and validation due to their inherent deficiencies, particularly for species without genome sequences. ConiferEST is an integrated system for data reprocessing, visualization and mining of conifer ESTs. In its current release, Build 1.0, it houses 172,229 loblolly pine EST sequence reads, which were obtained from reprocessing raw DNA sequencer traces using our software--WebTraceMiner. The trace files were downloaded from NCBI Trace Archive. ConiferEST provides biologists unique, easy-to-use data visualization and mining tools for a variety of putative sequence features including cloning vector segments, adapter sequences, restriction endonuclease recognition sites, polyA and polyT runs, and their corresponding Phred quality values. Based on these putative features, verified sequence features such as 3' and/or 5' termini of cDNA inserts in either sense or non-sense strand have been identified in-silico. Interestingly, only 30.03% of the designated 3' ESTs were found to have an authenticated 5' terminus in the non-sense strand (i.e., polyT tails), while fewer than 5.34% of the designated 5' ESTs had a verified 5' terminus in the sense strand. Such previously ignored features provide valuable insight for data quality control and validation of error-prone ESTs, as well as the ability to identify novel functional motifs embedded in large EST datasets. We found that "double-termini adapters" were effective indicators of potential EST chimeras. For all sequences with in-silico verified termini/terminus, we used InterProScan to assign protein domain signatures, results of which are available for in-depth exploration using our biologist
ConiferEST: an integrated bioinformatics system for data reprocessing and mining of conifer expressed sequence tags (ESTs

Directory of Open Access Journals (Sweden)

Carter Kikia

2007-05-01

Full Text Available Abstract Background With the advent of low-cost, high-throughput sequencing, the amount of public domain Expressed Sequence Tag (EST sequence data available for both model and non-model organism is growing exponentially. While these data are widely used for characterizing various genomes, they also present a serious challenge for data quality control and validation due to their inherent deficiencies, particularly for species without genome sequences. Description ConiferEST is an integrated system for data reprocessing, visualization and mining of conifer ESTs. In its current release, Build 1.0, it houses 172,229 loblolly pine EST sequence reads, which were obtained from reprocessing raw DNA sequencer traces using our software – WebTraceMiner. The trace files were downloaded from NCBI Trace Archive. ConiferEST provides biologists unique, easy-to-use data visualization and mining tools for a variety of putative sequence features including cloning vector segments, adapter sequences, restriction endonuclease recognition sites, polyA and polyT runs, and their corresponding Phred quality values. Based on these putative features, verified sequence features such as 3' and/or 5' termini of cDNA inserts in either sense or non-sense strand have been identified in-silico. Interestingly, only 30.03% of the designated 3' ESTs were found to have an authenticated 5' terminus in the non-sense strand (i.e., polyT tails, while fewer than 5.34% of the designated 5' ESTs had a verified 5' terminus in the sense strand. Such previously ignored features provide valuable insight for data quality control and validation of error-prone ESTs, as well as the ability to identify novel functional motifs embedded in large EST datasets. We found that "double-termini adapters" were effective indicators of potential EST chimeras. For all sequences with in-silico verified termini/terminus, we used InterProScan to assign protein domain signatures, results of which are available
The UniProtKB/Swiss-Prot knowledgebase and its Plant Proteome Annotation Program.

Science.gov (United States)

Schneider, Michel; Lane, Lydie; Boutet, Emmanuel; Lieberherr, Damien; Tognolli, Michael; Bougueleret, Lydie; Bairoch, Amos

2009-04-13

The UniProt knowledgebase, UniProtKB, is the main product of the UniProt consortium. It consists of two sections, UniProtKB/Swiss-Prot, the manually curated section, and UniProtKB/TrEMBL, the computer translation of the EMBL/GenBank/DDBJ nucleotide sequence database. Taken together, these two sections cover all the proteins characterized or inferred from all publicly available nucleotide sequences. The Plant Proteome Annotation Program (PPAP) of UniProtKB/Swiss-Prot focuses on the manual annotation of plant-specific proteins and protein families. Our major effort is currently directed towards the two model plants Arabidopsis thaliana and Oryza sativa. In UniProtKB/Swiss-Prot, redundancy is minimized by merging all data from different sources in a single entry. The proposed protein sequence is frequently modified after comparison with ESTs, full length transcripts or homologous proteins from other species. The information present in manually curated entries allows the reconstruction of all described isoforms. The annotation also includes proteomics data such as PTM and protein identification MS experimental results. UniProtKB and the other products of the UniProt consortium are accessible online at www.uniprot.org.
Molecular characterization of barley (Hordeum vulgare L. accessions of the Serbian GeneBank by SSR fingerprinting

Directory of Open Access Journals (Sweden)

Šurlan-Momirović Gordana

2013-01-01

Full Text Available Molecular diversity of 145 barley (Hordeum vulgare subsp. vulgare L. accessions from the Serbian GenBank was assessed by single sequence repeats (SSR markers. A set of 15 SSRs, covering all chromosomes of the diploid barley genome with 2-3 SSR markers per chromosome, with a range of 4-18 alleles per locus were used. In total, 15 loci and 119 alleles were detected, with an average of 7.93 alleles per locus. The Polymorphic information content value ranged from 0.220 to 0.782 with a mean value of 0.534. Regarding the growth habit and row type groups, gene diversity was comparatively higher for the spring (0.616 and six-rowed accessions (0.616 than for the winter and two- rowed accessions (0.322 and 0.478, respectively. Analysis of molecular variance showed that all sources of variation were significant (P < 0.01, but the between-group component was predominant (76.85% for growth habit and 89.45% for row type. Unweighted Pair Group Method with Arithmetic Mean (UPGMA cluster analysis based on the shared allele distance (DSA matrix estimated on the SSR data assigned the genotypes into two clusters - the first smaller consisting of the six 6-rowed spring cultivars and the second comprising six subclusters. Genotype MBR1012 was separated from all other genotypes that constitute UPGMA tree. The associations of genotypes belonging to different growth habit and row type groups were assessed using Principal Coordinate Analysis revealing separation of winter growth habit group from facultative one. The use of the STRUCTURE clustering algorithm allowed the identification of 2 subpopulations of genotypes. [Projekat Ministarstva nauke Republike Srbije, br. TR31092
CGKB: an annotation knowledge base for cowpea (Vigna unguiculata L. methylation filtered genomic genespace sequences

Directory of Open Access Journals (Sweden)

Spraggins Thomas A

2007-04-01

potential domains on annotated GSS were analyzed using the HMMER package against the Pfam database. The annotated GSS were also assigned with Gene Ontology annotation terms and integrated with 228 curated plant metabolic pathways from the Arabidopsis Information Resource (TAIR knowledge base. The UniProtKB-Swiss-Prot ENZYME database was used to assign putative enzymatic function to each GSS. Each GSS was also analyzed with the Tandem Repeat Finder (TRF program in order to identify potential SSRs for molecular marker discovery. The raw sequence data, processed annotation, and SSR results were stored in relational tables designed in key-value pair fashion using a PostgreSQL relational database management system. The biological knowledge derived from the sequence data and processed results are represented as views or materialized views in the relational database management system. All materialized views are indexed for quick data access and retrieval. Data processing and analysis pipelines were implemented using the Perl programming language. The web interface was implemented in JavaScript and Perl CGI running on an Apache web server. The CPU intensive data processing and analysis pipelines were run on a computer cluster of more than 30 dual-processor Apple XServes. A job management system called Vela was created as a robust way to submit large numbers of jobs to the Portable Batch System (PBS. Conclusion CGKB is an integrated and annotated resource for cowpea GSS with features of homology-based and HMM-based annotations, enzyme and pathway annotations, GO term annotation, toolkits, and a large number of other facilities to perform complex queries. The cowpea GSS, chloroplast sequences, mitochondrial sequences, retroelements, and SSR sequences are available as FASTA formatted files and downloadable at CGKB. This database and web interface are publicly accessible at http://cowpeagenomics.med.virginia.edu/CGKB/.
Assessment of Multiple GNSS Real-Time SSR Products from Different Analysis Centers

Directory of Open Access Journals (Sweden)

Zhiyu Wang

2018-03-01

Full Text Available The real-time State Space Representation (SSR product of the GNSS (Global Navigation Satellite System orbit and clock is one of the most essential corrections for real-time precise point positioning (PPP. In this work, the performance of current SSR products from eight analysis centers were assessed by comparing it with the final product and the accuracy of real-time PPP. Numerical results showed that (1 the accuracies of the GPS SSR product were better than 8 cm for the satellite orbit and 0.3 ns for the satellite clock; (2 the accuracies of the GLONASS (GLObalnaya NAvigatsionnaya Sputnikovaya Sistema SSR product were better than 10 cm for orbit RMS (Root Mean Square and 0.6 ns for clock STD (Standard Deviation; and (3 the accuracies of the BDS (BeiDou Navigation Satellite System and Galileo SSR products from CLK93 were about 14.54 and 4.42 cm for the orbit RMS and 0.32 and 0.18 ns for the clock STD, respectively. The simulated kinematic PPP results obtained using the SSR products from CLK93 and CLK51 performed better than those using other SSR products; and the accuracy of PPP based on all products was better than 6 and 10 cm in the horizontal and vertical directions, respectively. The real-time kinematic PPP experiment carried out in Beijing, Tianjin, and Shijiazhuang, China indicated that the SSR product CLK93 from Centre National d’Etudes Spatiales (CNES had a better performance than CAS01. Moreover, the PPP with GPS + BDS dual systems had a higher accuracy than those with only a GPS single system.
Use of EST-SSR Markers for Evaluating Genetic Diversity and Fingerprinting Celery (Apium graveolens L. Cultivars

Directory of Open Access Journals (Sweden)

Nan Fu

2014-02-01

Full Text Available Celery (Apium graveolens L. is one of the most economically important vegetables worldwide, but genetic and genomic resources supporting celery molecular breeding are quite limited, thus few studies on celery have been conducted so far. In this study we made use of simple sequence repeat (SSR markers generated from previous celery transcriptome sequencing and attempted to detect the genetic diversity and relationships of commonly used celery accessions and explore the efficiency of the primers used for cultivars identification. Analysis of molecular variance (AMOVA of Apium graveolens L. var. dulce showed that approximately 43% of genetic diversity was within accessions, 45% among accessions, and 22% among horticultural types. The neighbor-joining tree generated by unweighted pair group method with arithmetic mean (UPGMA, and population structure analysis, as well as principal components analysis (PCA, separated the cultivars into clusters corresponding to the geographical areas where they originated. Genetic distance analysis suggested that genetic variation within Apium graveolens was quite limited. Genotypic diversity showed any combinations of 55 genic SSRs were able to distinguish the genotypes of all 30 accessions.
SSR Analysis of Genetic Diversity Among 192 Diploid Potato Cultivars

Directory of Open Access Journals (Sweden)

Xiaoyan Song

2016-05-01

Full Text Available In potato breeding, it is difficult to improve the traits of interest at the tetraploid level due to the tetrasomic inheritance. A promising alternative is diploid breeding. Thus it is necessary to assess the genetic diversity of diploid potato germplasm for efficient exploration and deployment of desirable traits. In this study, we used SSR markers to evaluate the genetic diversity of diploid potato cultivars. To screen polymorphic SSR markers, 55 pairs of SSR primers were employed to amplify 39 cultivars with relatively distant genetic relationships. Among them, 12 SSR markers with high polymorphism located at 12 chromosomes were chosen to evaluate the genetic diversity of 192 diploid potato cultivars. The primers produced 6 to 18 bands with an average of 8.2 bands per primer. In total, 98 bands were amplified from 192 cultivars, and 97 of them were polymorphic. Cluster analysis using UPGMA showed the genetic relationships of all accessions tested: 186 of the 192 accessions could be distinguished by only 12 pairs of SSR primers, and the 192 diploid cultivars were divided into 11 groups, and 83.3% constituted the first group. Clustering results showed relatively low genetic diversity among 192 diploid cultivars, with closer relationship at the molecular level. The results can provide molecular basis for diploid potato breeding.
PineElm_SSRdb: a microsatellite marker database identified from genomic, chloroplast, mitochondrial and EST sequences of pineapple (Ananas comosus (L.) Merrill).

Science.gov (United States)

Chaudhary, Sakshi; Mishra, Bharat Kumar; Vivek, Thiruvettai; Magadum, Santoshkumar; Yasin, Jeshima Khan

2016-01-01

Simple Sequence Repeats or microsatellites are resourceful molecular genetic markers. There are only few reports of SSR identification and development in pineapple. Complete genome sequence of pineapple available in the public domain can be used to develop numerous novel SSRs. Therefore, an attempt was made to identify SSRs from genomic, chloroplast, mitochondrial and EST sequences of pineapple which will help in deciphering genetic makeup of its germplasm resources. A total of 359511 SSRs were identified in pineapple (356385 from genome sequence, 45 from chloroplast sequence, 249 in mitochondrial sequence and 2832 from EST sequences). The list of EST-SSR markers and their details are available in the database. PineElm_SSRdb is an open source database available for non-commercial academic purpose at http://app.bioelm.com/ with a mapping tool which can develop circular maps of selected marker set. This database will be of immense use to breeders, researchers and graduates working on Ananas spp. and to others working on cross-species transferability of markers, investigating diversity, mapping and DNA fingerprinting.
Improving Microbial Genome Annotations in an Integrated Database Context

Science.gov (United States)

Chen, I-Min A.; Markowitz, Victor M.; Chu, Ken; Anderson, Iain; Mavromatis, Konstantinos; Kyrpides, Nikos C.; Ivanova, Natalia N.

2013-01-01

Effective comparative analysis of microbial genomes requires a consistent and complete view of biological data. Consistency regards the biological coherence of annotations, while completeness regards the extent and coverage of functional characterization for genomes. We have developed tools that allow scientists to assess and improve the consistency and completeness of microbial genome annotations in the context of the Integrated Microbial Genomes (IMG) family of systems. All publicly available microbial genomes are characterized in IMG using different functional annotation and pathway resources, thus providing a comprehensive framework for identifying and resolving annotation discrepancies. A rule based system for predicting phenotypes in IMG provides a powerful mechanism for validating functional annotations, whereby the phenotypic traits of an organism are inferred based on the presence of certain metabolic reactions and pathways and compared to experimentally observed phenotypes. The IMG family of systems are available at http://img.jgi.doe.gov/. PMID:23424620
Improving microbial genome annotations in an integrated database context.

Directory of Open Access Journals (Sweden)

I-Min A Chen

Full Text Available Effective comparative analysis of microbial genomes requires a consistent and complete view of biological data. Consistency regards the biological coherence of annotations, while completeness regards the extent and coverage of functional characterization for genomes. We have developed tools that allow scientists to assess and improve the consistency and completeness of microbial genome annotations in the context of the Integrated Microbial Genomes (IMG family of systems. All publicly available microbial genomes are characterized in IMG using different functional annotation and pathway resources, thus providing a comprehensive framework for identifying and resolving annotation discrepancies. A rule based system for predicting phenotypes in IMG provides a powerful mechanism for validating functional annotations, whereby the phenotypic traits of an organism are inferred based on the presence of certain metabolic reactions and pathways and compared to experimentally observed phenotypes. The IMG family of systems are available at http://img.jgi.doe.gov/.

Arduino Based RFID Line Switching Using SSR

Directory of Open Access Journals (Sweden)

Michael E.

2017-10-01

Full Text Available The importance of line switching cannot be overemphasized as they are used to connect and disconnect substations to and from a distribution grid. At the cradle of technology line switching was achieved via the use of manual switches or fuses which could endanger life as a result of electrocution when expose during maintenance. This ill prompted the development of automated line switching using relays and contactors. With time this tends to fail as a result of wearing of the contact which is as a result of arcing and low voltage. To avert all these ills this paper presents Arduino based Radio Frequency Identification RFID line switching using Solid State Relay SSR. This is to ensure the safety of operators or technologist and to also avert the problem associated with relays and contactors using SSR. This was achieved using RFID RC-522 reader ardriuno Uno SSR and other discrete components. The system was tested and worked perfectly reducing the risk of electrocution and eliminating damage wearing of the contacts common with contactors and relays.
Single nucleotide polymorphism isolated from a novel EST dataset in garden asparagus (Asparagus officinalis L.).

Science.gov (United States)

Mercati, Francesco; Riccardi, Paolo; Leebens-Mack, Jim; Abenavoli, Maria Rosa; Falavigna, Agostino; Sunseri, Francesco

2013-04-01

Single nucleotide polymorphisms (SNPs) and simple sequence repeats (SSR) are abundant and evenly distributed co-dominant molecular markers in plant genomes. SSRs are valuable for marker assisted breeding and positional cloning of genes associated traits of interest. Although several high throughput platforms have been developed to identify SNP and SSR markers for analysis of segregant plant populations, breeding in garden asparagus (Asparagus officinalis L.) has been limited by a low content of such markers. In this study massively parallel GS-FLX pyro-sequencing technology (454 Life Sciences) has been used to sequence and compare transcriptome from two genotypes: a rust tolerant male (1770) and a susceptible female (G190). A total of 122,963 and 99,368 sequence reads, with an average length of 245.7bp, have been recovered from accessions 1770 and 190 respectively. A computational pipeline has been used to predict and visually inspect putative SNPs and SSR sequences. Analysis of Gene Ontology (GO) slim annotation assignments for all assembled uniscripts indicated that the 24,403 assemblies represent genes from a broad array of functions. Further, over 1800 putative SNPs and 1000 SSRs were detected. One hundred forty-four SNPs together with 60 selected SSRs were validated and used to develop a preliminary genetic map by using a large BC(1) population, derived from 1770 and G190. The abundance of SNPs and SSRs provides a foundation for the development of saturated genetic maps and their utilization in assisted asparagus breeding programs. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Transcriptome sequencing of lentil based on second-generation technology permits large-scale unigene assembly and SSR marker discovery

Directory of Open Access Journals (Sweden)

Materne Michael

2011-05-01

Full Text Available Abstract Background Lentil (Lens culinaris Medik. is a cool-season grain legume which provides a rich source of protein for human consumption. In terms of genomic resources, lentil is relatively underdeveloped, in comparison to other Fabaceae species, with limited available data. There is hence a significant need to enhance such resources in order to identify novel genes and alleles for molecular breeding to increase crop productivity and quality. Results Tissue-specific cDNA samples from six distinct lentil genotypes were sequenced using Roche 454 GS-FLX Titanium technology, generating c. 1.38 × 106 expressed sequence tags (ESTs. De novo assembly generated a total of 15,354 contigs and 68,715 singletons. The complete unigene set was sequence-analysed against genome drafts of the model legume species Medicago truncatula and Arabidopsis thaliana to identify 12,639, and 7,476 unique matches, respectively. When compared to the genome of Glycine max, a total of 20,419 unique hits were observed corresponding to c. 31% of the known gene space. A total of 25,592 lentil unigenes were subsequently annoated from GenBank. Simple sequence repeat (SSR-containing ESTs were identified from consensus sequences and a total of 2,393 primer pairs were designed. A subset of 192 EST-SSR markers was screened for validation across a panel 12 cultivated lentil genotypes and one wild relative species. A total of 166 primer pairs obtained successful amplification, of which 47.5% detected genetic polymorphism. Conclusions A substantial collection of ESTs has been developed from sequence analysis of lentil genotypes using second-generation technology, permitting unigene definition across a broad range of functional categories. As well as providing resources for functional genomics studies, the unigene set has permitted significant enhancement of the number of publicly-available molecular genetic markers as tools for improvement of this species.
Salzburger State Reactance Scale (SSR Scale): Validation of a Scale Measuring State Reactance.

Science.gov (United States)

Sittenthaler, Sandra; Traut-Mattausch, Eva; Steindl, Christina; Jonas, Eva

This paper describes the construction and empirical evaluation of an instrument for measuring state reactance, the Salzburger State Reactance (SSR) Scale. The results of a confirmatory factor analysis supported a hypothesized three-factor structure: experience of reactance, aggressive behavioral intentions, and negative attitudes. Correlations with divergent and convergent measures support the validity of this structure. The SSR Subscales were strongly related to the other state reactance measures. Moreover, the SSR Subscales showed modest positive correlations with trait measures of reactance. The SSR Subscales correlated only slightly or not at all with neighboring constructs (e.g., autonomy, experience of control). The only exception was fairness scales, which showed moderate correlations with the SSR Subscales. Furthermore, a retest analysis confirmed the temporal stability of the scale. Suggestions for further validation of this questionnaire are discussed.
SSRPT (SSR Pointer Trackeer) for Cassini Mission Operations - A Ground Data Analysis Tool

Science.gov (United States)

Kan, E.

1998-01-01

Tracking the resources of the two redundant Solid State Recorders (SSR) is a necessary routine for Cassini spacecraft mission operations. Instead of relying on a full-fledged spacecraft hardware/software simulator to track and predict the SSR recording and playback pointer positions, a stand-alone SSR Pointer Tracker tool was developed as part of JPL's Multimission Spacecraft Analysis system.
SSR-based association mapping of salt tolerance in cotton (Gossypium hirsutum L.).

Science.gov (United States)

Zhao, Y L; Wang, H M; Shao, B X; Chen, W; Guo, Z J; Gong, H Y; Sang, X H; Wang, J J; Ye, W W

2016-05-25

The identification of simple sequence repeat (SSR) markers associated with salt tolerance in cotton contributes to molecular assisted selection (MAS), which can improve the efficiency of traditional breeding. In this study, 134 samples of upland cotton cultivars were selected. The seedling emergence rates were tested under 0.3% NaCl stress. A total of 74 SSR markers were used to scan the genomes of these samples. To identify SSR markers associated with salt tolerance, an association analysis was performed between salt tolerance and SSR markers using TASSEL 2.1, based on the analysis of genetic structure using Structure 2.3.4. The results showed that the seedling emergence rates of 134 cultivars were significantly different, and 27 salt-sensitive and 10 salt-tolerant cultivars were identified. A total of 148 loci were found in 74 SSR markers involving 246 allelic variations, which ranged from 2 to 7 with an average of 3.32 per SSR marker. The gene diversity ranged from 0.0295 to 0.4959, with the average being 0.2897. The polymorphic information content ranged from0.0290 to 0.3729, with the average being 0.2381. This natural population was classified into two subgroups by Structure 2.3.4, containing 89 and 45 samples, respectively. Finally, eight SSR sites associated with salt tolerance ware found through an association analysis, with the rate of explanation ranging from 2.91 to 7.82% and an average of 4.32%. These results provide reference data for the use MAS for salt tolerance in cotton.
Genetic diversity of wild and cultivated genotypes of pigeonpea through RAPD and SSR markers.

Science.gov (United States)

Walunjkar, Babasaheb C; Parihar, Akarsh; Singh, Nirbhay Kumar; Parmar, L D

2015-03-01

Eight wild and four cultivated pigeonpea genotypes were subjected to RAPD and microsatellite analysis, with 40 primers each. Out of these, eight RAPD and five SSR primers were found polymorphic. RAPD primers showed 100% polymorphism and produced a total of 517 DNA fragments, whereas SSR primers produced 67 fragments and they too showed 100% polymorphism. The RAPD markers revealed highest similarity co-efficient of 0.93 (GT-100 and ICPL-87), whereas the highest similarity co-efficient obtained with SSR markers was 1.00 (GTH-1 and GT-100). Average PIC value obtained with RAPD and SSR were 0.90 and 0.18, respectively. The arithmetic mean heterozygosity and marker index were 0.90 and 22.47 respectively with RAPD marker, whereas the corresponding values for SSR markers were 0.18 and 33.66. Moreover; the four wild genotypes (Cajanus scarabaeoides, Rhyncosia rufescence, Cajanus cajanifolius and Rhyncosia canna) and the four cultivars (GTH-1, GT-100, ICPL-87 and GT-1) grouped distinctly in the same subgroups of the dendrograms obtained with both RAPD and SSR analysis. Therefore, the findings of SSR supplement and validate the results obtained with RAPD analysis.
Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery

Directory of Open Access Journals (Sweden)

Benkman Craig W

2010-03-01

Full Text Available Abstract Background Massively parallel sequencing of cDNA is now an efficient route for generating enormous sequence collections that represent expressed genes. This approach provides a valuable starting point for characterizing functional genetic variation in non-model organisms, especially where whole genome sequencing efforts are currently cost and time prohibitive. The large and complex genomes of pines (Pinus spp. have hindered the development of genomic resources, despite the ecological and economical importance of the group. While most genomic studies have focused on a single species (P. taeda, genomic level resources for other pines are insufficiently developed to facilitate ecological genomic research. Lodgepole pine (P. contorta is an ecologically important foundation species of montane forest ecosystems and exhibits substantial adaptive variation across its range in western North America. Here we describe a sequencing study of expressed genes from P. contorta, including their assembly and annotation, and their potential for molecular marker development to support population and association genetic studies. Results We obtained 586,732 sequencing reads from a 454 GS XLR70 Titanium pyrosequencer (mean length: 306 base pairs. A combination of reference-based and de novo assemblies yielded 63,657 contigs, with 239,793 reads remaining as singletons. Based on sequence similarity with known proteins, these sequences represent approximately 17,000 unique genes, many of which are well covered by contig sequences. This sequence collection also included a surprisingly large number of retrotransposon sequences, suggesting that they are highly transcriptionally active in the tissues we sampled. We located and characterized thousands of simple sequence repeats and single nucleotide polymorphisms as potential molecular markers in our assembled and annotated sequences. High quality PCR primers were designed for a substantial number of the SSR loci
Development of ESTs from chickpea roots and their use in diversity analysis of the Cicer genus

Directory of Open Access Journals (Sweden)

Eshwar K

2005-08-01

Full Text Available Abstract Background Chickpea is a major crop in many drier regions of the world where it is an important protein-rich food and an increasingly valuable traded commodity. The wild annual Cicer species are known to possess unique sources of resistance to pests and diseases, and tolerance to environmental stresses. However, there has been limited utilization of these wild species by chickpea breeding programs due to interspecific crossing barriers and deleterious linkage drag. Molecular genetic diversity analysis may help predict which accessions are most likely to produce fertile progeny when crossed with chickpea cultivars. While, trait-markers may provide an effective tool for breaking linkage drag. Although SSR markers are the assay of choice for marker-assisted selection of specific traits in conventional breeding populations, they may not provide reliable estimates of interspecific diversity, and may lose selective power in backcross programs based on interspecific introgressions. Thus, we have pursued the development of gene-based markers to resolve these problems and to provide candidate gene markers for QTL mapping of important agronomic traits. Results An EST library was constructed after subtractive suppressive hybridization (SSH of root tissue from two very closely related chickpea genotypes (Cicer arietinum. A total of 106 EST-based markers were designed from 477 sequences with functional annotations and these were tested on C. arietinum. Forty-four EST markers were polymorphic when screened across nine Cicer species (including the cultigen. Parsimony and PCoA analysis of the resultant EST-marker dataset indicated that most accessions cluster in accordance with the previously defined classification of primary (C. arietinum, C. echinospermum and C. reticulatum, secondary (C. pinnatifidum, C. bijugum and C. judaicum, and tertiary (C. yamashitae, C. chrossanicum and C. cuneatum gene-pools. A large proportion of EST alleles (45% were only
Cultivar identification and genetic relationship of pineapple (Ananas comosus) cultivars using SSR markers.

Science.gov (United States)

Lin, Y S; Kuan, C S; Weng, I S; Tsai, C C

2015-11-25

The genetic relationships among 27 pineapple [Ananas comosus (L.) Merr.] cultivars and lines were examined using 16 simple sequence repeat (SSR) markers. The number of alleles per locus of the SSR markers ranged from 2 to 6 (average 3.19), for a total of 51 alleles. Similarity coefficients were calculated on the basis of 51 amplified bands. A dendrogram was created according to the 16 SSR markers by the unweighted pair-group method. The banding patterns obtained from the SSR primers allowed most of the cultivars and lines to be distinguished, with the exception of vegetative clones. According to the dendrogram, the 27 pineapple cultivars and lines were clustered into three main clusters and four individual clusters. As expected, the dendrogram showed that derived cultivars and lines are closely related to their parental cultivars; the genetic relationships between pineapple cultivars agree with the genealogy of their breeding history. In addition, the analysis showed that there is no obvious correlation between SSR markers and morphological characters. In conclusion, SSR analysis is an efficient method for pineapple cultivar identification and can offer valuable informative characters to identify pineapple cultivars in Taiwan.
An accurate and efficient method for large-scale SSR genotyping and applications.

Science.gov (United States)

Li, Lun; Fang, Zhiwei; Zhou, Junfei; Chen, Hong; Hu, Zhangfeng; Gao, Lifen; Chen, Lihong; Ren, Sheng; Ma, Hongyu; Lu, Long; Zhang, Weixiong; Peng, Hai

2017-06-02

Accurate and efficient genotyping of simple sequence repeats (SSRs) constitutes the basis of SSRs as an effective genetic marker with various applications. However, the existing methods for SSR genotyping suffer from low sensitivity, low accuracy, low efficiency and high cost. In order to fully exploit the potential of SSRs as genetic marker, we developed a novel method for SSR genotyping, named as AmpSeq-SSR, which combines multiplexing polymerase chain reaction (PCR), targeted deep sequencing and comprehensive analysis. AmpSeq-SSR is able to genotype potentially more than a million SSRs at once using the current sequencing techniques. In the current study, we simultaneously genotyped 3105 SSRs in eight rice varieties, which were further validated experimentally. The results showed that the accuracies of AmpSeq-SSR were nearly 100 and 94% with a single base resolution for homozygous and heterozygous samples, respectively. To demonstrate the power of AmpSeq-SSR, we adopted it in two applications. The first was to construct discriminative fingerprints of the rice varieties using 3105 SSRs, which offer much greater discriminative power than the 48 SSRs commonly used for rice. The second was to map Xa21, a gene that confers persistent resistance to rice bacterial blight. We demonstrated that genome-scale fingerprints of an organism can be efficiently constructed and candidate genes, such as Xa21 in rice, can be accurately and efficiently mapped using an innovative strategy consisting of multiplexing PCR, targeted sequencing and computational analysis. While the work we present focused on rice, AmpSeq-SSR can be readily extended to animals and micro-organisms. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
The research of SSR which can be restrained by photovoltaic grid connected

Science.gov (United States)

Li, Kuan; Liu, Meng; Zheng, Wei; Li, Yudun; Wang, Xin

2018-02-01

Utilization of photovoltaic power generation has attracted considerable attention, and it is growing rapidly due to its environmental benefits. The series capacitive compensation is needed to be introduced into the lines which could improve the transmission capacity. However, the series capacitive compensation may lead to sub-synchronous resonance(SSR). This paper proposes a method to restrain the SSR based on photovoltaic grid connected which is caused by series capacitive compensation. Sub-synchronous oscillation damping controller (SSDC) is designed based on complex torque coefficient approach, and the SSDC is added to the PV power station’s main controller to damp SSR. IEEE Second benchmark model is used as simulation model based on PSCAD/EMTDC. The results show that the designed SSDC could restrain SSR and improve stability in PV grid connected effectively.
Identification and genetic mapping of highly polymorphic microsatellite loci from an EST database of the septoria tritici blotch pathogen Mycosphaerella graminicola

NARCIS (Netherlands)

Goodwin, S.B.; Lee, van der T.A.J.; Cavaletto, J.R.; Lintel Hekkert, te B.; Crane, C.F.; Kema, G.H.J.

2007-01-01

A database of 30,137 EST sequences from Mycosphaerella graminicola, the septoria tritici blotch fungus of wheat, was scanned with a custom software pipeline for di- and trinucleotide units repeated tandemly six or more times. The bioinformatics analysis identified 109 putative SSR loci, and for 99
Phylogenetic and Diversity Analysis of Dactylis glomerata Subspecies Using SSR and IT-ISJ Markers.

Science.gov (United States)

Yan, Defei; Zhao, Xinxin; Cheng, Yajuan; Ma, Xiao; Huang, Linkai; Zhang, Xinquan

2016-10-31

The genus Dactylis , an important forage crop, has a wide geographical distribution in temperate regions. While this genus is thought to include a single species, Dactylis glomerata , this species encompasses many subspecies whose relationships have not been fully characterized. In this study, the genetic diversity and phylogenetic relationships of nine representative Dactylis subspecies were examined using SSR and IT-ISJ markers. In total, 21 pairs of SSR primers and 15 pairs of IT-ISJ primers were used to amplify 295 polymorphic bands with polymorphic rates of 100%. The average polymorphic information contents (PICs) of SSR and IT-ISJ markers were 0.909 and 0.780, respectively. The combined data of the two markers indicated a high level of genetic diversity among the nine D. glomerata subspecies, with a Nei's gene diversity index value of 0.283 and Shannon's diversity of 0.448. Preliminarily phylogenetic analysis results revealed that the 20 accessions could be divided into three groups (A, B, C). Furthermore, they could be divided into five clusters, which is similar to the structure analysis with K = 5. Phylogenetic placement in these three groups may be related to the distribution ranges and the climate types of the subspecies in each group. Group A contained eight accessions of four subspecies, originating from the west Mediterranean, while Group B contained seven accessions of three subspecies, originating from the east Mediterranean.
Mapping of QTLs for frost tolerance and heading time using SSR ...

African Journals Online (AJOL)

STORAGESEVER

2008-10-19

Oct 19, 2008 ... using SSR markers in bread wheat. Omid Sofalian1*, Seyyed A. ... Key words: Bread wheat, frost tolerance, heading time, QTL mapping, single marker analysis, SSR. INTRODUCTION. Abiotic stresses are crucial ... cultivars are divided into two types (winter and spring growth habit) depending on their need ...
Cloning, analysis and functional annotation of expressed sequence tags from the Earthworm Eisenia fetida

Science.gov (United States)

Pirooznia, Mehdi; Gong, Ping; Guan, Xin; Inouye, Laura S; Yang, Kuan; Perkins, Edward J; Deng, Youping

2007-01-01

Background Eisenia fetida, commonly known as red wiggler or compost worm, belongs to the Lumbricidae family of the Annelida phylum. Little is known about its genome sequence although it has been extensively used as a test organism in terrestrial ecotoxicology. In order to understand its gene expression response to environmental contaminants, we cloned 4032 cDNAs or expressed sequence tags (ESTs) from two E. fetida libraries enriched with genes responsive to ten ordnance related compounds using suppressive subtractive hybridization-PCR. Results A total of 3144 good quality ESTs (GenBank dbEST accession number EH669363–EH672369 and EL515444–EL515580) were obtained from the raw clone sequences after cleaning. Clustering analysis yielded 2231 unique sequences including 448 contigs (from 1361 ESTs) and 1783 singletons. Comparative genomic analysis showed that 743 or 33% of the unique sequences shared high similarity with existing genes in the GenBank nr database. Provisional function annotation assigned 830 Gene Ontology terms to 517 unique sequences based on their homology with the annotated genomes of four model organisms Drosophila melanogaster, Mus musculus, Saccharomyces cerevisiae, and Caenorhabditis elegans. Seven percent of the unique sequences were further mapped to 99 Kyoto Encyclopedia of Genes and Genomes pathways based on their matching Enzyme Commission numbers. All the information is stored and retrievable at a highly performed, web-based and user-friendly relational database called EST model database or ESTMD version 2. Conclusion The ESTMD containing the sequence and annotation information of 4032 E. fetida ESTs is publicly accessible at . PMID:18047730
Development and characterization of EST-SSR markers for Artocarpus hypargyreus (Moraceae).

Science.gov (United States)

Liu, Haijun; Tan, Weizheng; Sun, Hongbin; Liu, Yu; Meng, Kaikai; Liao, Wenbo

2016-12-01

Polymorphic microsatellite markers were developed for Artocarpus hypargyreus (Moraceae), a threatened species endemic to China, to investigate the genetic diversity and structure of the species. Based on the transcriptome data of A. hypargyreus , 63 primer pairs were preliminarily designed and tested, of which 34 were successfully amplified and 10 displayed clear polymorphisms across the 67 individuals from four populations of A. hypargyreus . The results showed the number of alleles per locus ranged from three to 10, and the observed heterozygosity and expected heterozygosity per locus varied from 0.000 to 0.706 and from 0.328 to 0.807, respectively. These microsatellite markers will be useful in exploring genetic diversity and structure of A. hypargyreus . Furthermore, most loci were successfully cross-amplified in A. nitidus and A. heterophyllus , indicating that they will be of great value for genetic study across this genus.
Data of 10 SSR markers for genomes of homo sapiens and monkeys.

Science.gov (United States)

Reddy, K K V V V S; Raju, S Viswanadha; Someswara Rao, Chinta

2017-06-01

In this data, we present 10 Simple Sequence Repeat(SSR) markers TAGA, TCAT, GAAT, AGAT, AGAA, GATA, TATC, CTTT, TCTG and TCTA which are extracted from the genomes of homo sapiens and monkeys using string matching mechanism [1]. All loci showed 4 Base Pair(bp) in allele size, indicating that there are some polymorphisms between individuals correlating to the number of SSR repeats that maybe useful for the detection of similarity among the genotypes. Collectively, these data show that the SSR extraction is a valuable method to illustrate genetic variation of genomes.
Cross-species amplification of 105 Lolium perenne SSR loci in 23 species within the Poaceae

DEFF Research Database (Denmark)

Jensen, Louise Bach; Holm, Preben Bach; Lübberstedt, Thomas

2007-01-01

Amplification of 105 Lolium perenne SSR markers was studied in 23 grass species representing seven tribes from three subfamilies of Poaceae. Twelve of the SSR markers are published for the first time. Between 2% and 96% of the SSR markers could be amplified within a given species. A subset of eight...... SSR markers was evaluated for polymorphism across nine of the 23 grass species. Four to seven of the markers were polymorphic within each species, with an average detection of 2.4 alleles per species....
Development of advanced BWR fuel bundle with spectral shift rod (3) -transient analysis of ABWR core with SSR

International Nuclear Information System (INIS)

Ikegawa, T.; Chaki, M.; Ohga, Y.; Abe, M.

2010-01-01

The spectral shift rod (SSR) is a new type of water rod, utilized instead of the conventional water rod, in which a water level develops during core operation. The water level can be changed according to the fuel channel flow rate. In this study, ABWR plant performance with SSR fuel bundles under transient conditions has been evaluated using the TRACG code. The TRACG code, which can treat three-dimensional hydrodynamic calculations in a reactor pressure vessel, is well suited for evaluating the reactor transient performance with the SSR fuel bundles because it can calculate the water levels in the SSR at each channel grouping and therefore evaluate the core reactivity according to the water level changes in the SSR. 'Generator load rejection with total turbine bypass failure' and 'Recirculation flow control failure with increasing flow' were selected as cases which may increase the reactivity with the increasing water level in the SSR. It was found that the absolute value of the void reactivity coefficient in the SSR core was larger than that in the conventional water rod core because the core averaged void fraction in the SSR core, which has the vapor region above the water level in the SSR, was larger than that in the conventional water rod core. Therefore, AMCPR for the SSR core was a little larger than that for the conventional water rod core; however, the difference was smaller than 0.02 because the inlet of the SSR ascending path was designed to be small enough to prevent the rapid water level increase in the SSR. (authors)

Caracterização de genótipos de mirtilo utilizando marcadores moleculares Characterization of blueberry genotypes using molecular markers

Directory of Open Access Journals (Sweden)

Sergio Delmar dos Anjos e Silva

2008-03-01

Full Text Available O cultivo do mirtilo está em expansão no Brasil, em especial em regiões de clima temperado, onde há grande demanda em relação a cultivares adaptadas às condições edafoclimáticas regionais. O objetivo deste trabalho foi caracterizar genótipos de mirtilo do programa de melhoramento da Embrapa Clima Temperado, utilizando marcadores moleculares do tipo RAPD e SSR. Foram caracterizados 40 genótipos de mirtilo por RAPD e oito cultivares por microssatélites. Os nove primers utilizados na técnica de RAPD geraram 89 marcadores. A similaridade genética entre os genótipos variou de 64 a 89%. Utilizando a similaridade média (66%, foram obtidos quatro grupos. Foram gerados 11 marcadores a partir de três pares de primers de microssatélites. A similaridade genética entre as cultivares variou de 25 a 75%. Com similaridade média (42,4%, foram obtidos três grupos. Com apenas três pares de primers de SSR, foi possível definir o padrão das oito cultivares de mirtilo, revelando a eficiência da técnica de microssatélite na caracterização de genótipos dessa espécie. Esses resultados revelam a eficiência dos marcadores tipo RAPD e SSR na caracterização de genótipos de mirtilo. Entretanto, os marcadores tipo microssatélites geram resultados mais precisos, sendo os mais recomendados para uso em programas de melhoramento e identificação de cultivares.The blueberry crop planting area is increasing in Brazil, especially in Temperate Climate Zones, generating demands relating to suitable cultivars adapted to regional climate and soil conditions. This work aimed to characterize blueberry genotypes from Embrapa Clima Temperado breeding program, using RAPD and SSR molecular markers. There were characterized 40 blueberry genotypes using RAPD and 8 cultivars using SSR molecular markers. The 9 RAPD primers generated 89 markers. The genetic similarity ranged from 64 to 89%. Through the average similarity (66%, it was possible to identify four
[Study on quality evaluation of sequence and SSR information in transcriptome of Astragalus membranacus].

Science.gov (United States)

Chang, Yue; Yang, Song; Liu, Zhen-Peng; Ren, Wei-Chao; Liu, Jie; Ma, Wei

2016-04-01

In this study, 454/Roche GS FLX sequencing technology was used to obtain the data of the Astragalus membranaceus. Four hundred and fifty-four Sequencing System Software was applied to carry out the transcription of the group from scratch. Using MISA tools, 9 893 unigenes were selected for the sequence of the genome of A. membranaceus, and the information of SSR locus was analyzed. According to the result, the average length of reads was 413 bp, about 86% of the reads was involved in the splicing, the length of the N50 was 1 205 bp, the number of unigenes was measured by the whole transcript. 1 729 SSR loci in the A. membranaceus transcriptome were searched, the occurrence frequency of SSR was 9.24%, the frequency of SSR in the whole transcriptome was 13.42%, the average length of SSR was 7.97 kb. One hundred and twenty-seven kinds of core repeat sequences were found, the dominant type was TG/AC type of dinucleotide, it appeared to account for 4.25% of the total SSR locus. The results of the sequence of the transcription of the A. membranaceus transcriptome revealed the overall expression, and a large number of unigenessequence was obtained, and the SSR locus in the genome of the A. membranaceus is high, and the type is diverse, and the polymorphism of the gene is high. Copyright© by the Chinese Pharmaceutical Association.
A new set of validated SSR primers for application in mulberry ...

Indian Academy of Sciences (India)

user

2018-01-11

Jan 11, 2018 ... We chose 37 genomic sequences that had high number of di, tri and tetra SSR motifs from ... primers failed to produce any amplification products and ... Sericulture is a cottage industry that employs 8.03 million people across ... The resolving power of the SSR markers ranged from 0.11 (MulSatG100717) to.
Identification of molecular performance from oil palm clones based on SSR markers

Science.gov (United States)

Putri, Lollie Agustina P.; Basyuni, M.; Bayu, Eva S.; Arvita, D.; Arifiyanto, D.; Syahputra, I.

2018-03-01

In Indonesia, the oil palms are an important economic crop, producing food and raw materials for the food, confectionary, cosmetics and oleo-chemical industrial demands of oil palm products. Clonal oil palm offers the potential for greater productivity because it is possible to establish uniform tree stands comprising identical copies (clones) of a limited number of highly productive oil palms. Unfortunately, tissue culture sometimes accentuates the expression of detects in oil palm, particularly when embryogenesis is induced in particullar callus for prolonged periods. This research is conducted by taking individual tree sample of clone germplasm two years old. The purpose of this research is to molecular performance analysis of some oil palm clones based on SSR markers. A total of 30 trees oil palm clones were used for analysis. In this experiment, the DNA profile diversity was assessed using five loci of oil palm’s specific SSR markers. The results of the experiment indicated out of 3 SSR markers (FR-0779, FR-3663 and FR-0782) showed monomorphic of PCR product and 2 SSR markers (FR-0783 and FR- 3745) showed polymorphic of PCR product. There are 10 total number of PCR product. These preliminary results demonstrated SSR marker can be used to evaluate genetic relatedness among trees of oil palm clones.
Surviving extreme polar winters by desiccation: clues from Arctic springtail (Onychiurus arcticus EST libraries

Directory of Open Access Journals (Sweden)

Kube Michael

2007-12-01

Full Text Available Abstract Background Ice, snow and temperatures of -14°C are conditions which most animals would find difficult, if not impossible, to survive in. However this exactly describes the Arctic winter, and the Arctic springtail Onychiurus arcticus regularly survives these extreme conditions and re-emerges in the spring. It is able to do this by reducing the amount of water in its body to almost zero: a process that is called "protective dehydration". The aim of this project was to generate clones and sequence data in the form of ESTs to provide a platform for the future molecular characterisation of the processes involved in protective dehydration. Results Five normalised libraries were produced from both desiccating and rehydrating populations of O. arcticus from stages that had previously been defined as potentially informative for molecular analyses. A total of 16,379 EST clones were generated and analysed using Blast and GO annotation. 40% of the clones produced significant matches against the Swissprot and trembl databases and these were further analysed using GO annotation. Extraction and analysis of GO annotations proved an extremely effective method for identifying generic processes associated with biochemical pathways, proving more efficient than solely analysing Blast data output. A number of genes were identified, which have previously been shown to be involved in water transport and desiccation such as members of the aquaporin family. Identification of these clones in specific libraries associated with desiccation validates the computational analysis by library rather than producing a global overview of all libraries combined. Conclusion This paper describes for the first time EST data from the arctic springtail (O. arcticus. This significantly enhances the number of Collembolan ESTs in the public databases, providing useful comparative data within this phylum. The use of GO annotation for analysis has facilitated the identification of a
Genetic variability of a Brazilian Capsicum frutescens germplasm collection using morphological characteristics and SSR markers.

Science.gov (United States)

Carvalho, S I C; Bianchetti, L B; Ragassi, C F; Ribeiro, C S C; Reifschneider, F J B; Buso, G S C; Faleiro, F G

2017-07-06

Characterization studies provide essential information for the conservation and use of germplasm in plant breeding programs. In this study, 103 Capsicum frutescens L. accessions from the Active Germplasm Bank of Embrapa Hortaliças, representative of all five Brazilian geographic regions, were characterized based on morphological characteristics and microsatellite (or simple sequence repeat - SSR) molecular markers. Morphological characterization was carried out using 57 descriptors, and molecular characterization was based on 239 alleles from 24 microsatellite loci. From the estimates of genetic distances among accessions, based on molecular characterization, a cluster analysis was carried out, and a dendrogram was established. Correlations between morphological and molecular variables were also estimated. Twelve morphological descriptors were monomorphic for the set of C. frutescens accessions, and those with the highest degree of polymorphism were stem length (14.0 to 62.0 cm), stem diameter (1.0 to 4.2 cm), days to flowering (90 to 129), days to fruiting (100 to 140), fruit weight (0.1 to 1.4 g), fruit length (0.6 to 4.6 cm), and fruit wall thickness (0.25 to 1.5 mm). The polymorphism information content for the SSR loci varied from 0.36 (EPMS 417) to 0.75 (CA49), with an overall mean of 0.57. The correlation value between morphological and molecular characterization data was 0.6604, which was statistically significant. Fourteen accessions were described as belonging to the morphological type tabasco, 85 were described as malagueta, and four were malaguetinha, a morphological type confirmed in this study. The typical morphological pattern of malagueta was described. Six similarity groups were established for C. frutescens based on the dendrogram and are discussed individually. The genetic variability analyzed in the study highlights the importance of characterizing genetic resources available for the development of new C. frutescens cultivars with the potential
Genetic variation of maize (Zea mays L.) mutants based on ssr analysis

Energy Technology Data Exchange (ETDEWEB)

Hongni, Qin; Yilin, Cai; Chunrong, Yang; Guoqiang, Wang [Maize Research Institute of Southwest University, Chongqing (China)

2008-12-15

52 SSR primers that gave stable profiles amplified in sample of the Maize inbred line '082' and its 48 mutants were selected from 97 primers, and produced 170 polymorphic amplified fragments. The average number of allele per SSR locus was 3.27 with a range from 2 to 6. The polymorphism information content for the SSR loci varied from 0.039 to 0.715 with an average of 0.327. Genetic similarities among the 49 materials ranged from 0.377 to 1.000 with an average of 0.823. The 49 materials were divided into 6 groups by UPGMA. The results indicated that distinct variation existed among mutants. (authors)
Genetic variation of maize (Zea mays L.) mutants based on ssr analysis

International Nuclear Information System (INIS)

Qin Hongni; Cai Yilin; Yang Chunrong; Wang Guoqiang

2008-01-01

52 SSR primers that gave stable profiles amplified in sample of the Maize inbred line '082' and its 48 mutants were selected from 97 primers, and produced 170 polymorphic amplified fragments. The average number of allele per SSR locus was 3.27 with a range from 2 to 6. The polymorphism information content for the SSR loci varied from 0.039 to 0.715 with an average of 0.327. Genetic similarities among the 49 materials ranged from 0.377 to 1.000 with an average of 0.823. The 49 materials were divided into 6 groups by UPGMA. The results indicated that distinct variation existed among mutants. (authors)
Annotation-based feature extraction from sets of SBML models.

Science.gov (United States)

Alm, Rebekka; Waltemath, Dagmar; Wolfien, Markus; Wolkenhauer, Olaf; Henkel, Ron

2015-01-01

Model repositories such as BioModels Database provide computational models of biological systems for the scientific community. These models contain rich semantic annotations that link model entities to concepts in well-established bio-ontologies such as Gene Ontology. Consequently, thematically similar models are likely to share similar annotations. Based on this assumption, we argue that semantic annotations are a suitable tool to characterize sets of models. These characteristics improve model classification, allow to identify additional features for model retrieval tasks, and enable the comparison of sets of models. In this paper we discuss four methods for annotation-based feature extraction from model sets. We tested all methods on sets of models in SBML format which were composed from BioModels Database. To characterize each of these sets, we analyzed and extracted concepts from three frequently used ontologies, namely Gene Ontology, ChEBI and SBO. We find that three out of the methods are suitable to determine characteristic features for arbitrary sets of models: The selected features vary depending on the underlying model set, and they are also specific to the chosen model set. We show that the identified features map on concepts that are higher up in the hierarchy of the ontologies than the concepts used for model annotations. Our analysis also reveals that the information content of concepts in ontologies and their usage for model annotation do not correlate. Annotation-based feature extraction enables the comparison of model sets, as opposed to existing methods for model-to-keyword comparison, or model-to-model comparison.
Genetic Diversity in Various Accessions of Pineapple [Ananas comosus (L.) Merr.] Using ISSR and SSR Markers.

Science.gov (United States)

Wang, Jian-Sheng; He, Jun-Hu; Chen, Hua-Rui; Chen, Ye-Yuan; Qiao, Fei

2017-12-01

Inter simple sequence repeat (ISSR) and simple sequence repeat (SSR) markers were used to assess the genetic diversity of 36 pineapple accessions that were introduced from 10 countries/regions. Thirteen ISSR primers amplified 96 bands, of which 91 (93.65%) were polymorphic, whereas 20 SSR primers amplified 73 bands, of which 70 (96.50%) were polymorphic. Nei's gene diversity (h = 0.28), Shannon's information index (I = 0.43), and polymorphism information content (PIC = 0.29) generated using the SSR primers were higher than that with ISSR primers (h = 0.23, I = 0.37, PIC = 0.24), thereby suggesting that the SSR system is more efficient than the ISSR system in assessing genetic diversity in various pineapple accessions. Mean genetic similarities were 0.74, 0.61, and 0.69, as determined using ISSR, SSR, and combined ISSR/SSR, respectively. These results suggest that the genetic diversity among pineapple accessions is very high. We clustered the 36 pineapple accessions into three or five groups on the basis of the phylogenetic trees constructed based on the results of ISSR, SSR, and combined ISSR/SSR analyses using the unweighted pair-group with arithmetic averaging (UPGMA) method. The results of principal components analysis (PCA) also supported the UPGMA clustering. These results will be useful not only for the scientific conservation and management of pineapple germplasm but also for the improvement of the current pineapple breeding strategies.
Construction of an SSR and RAD-Marker Based Molecular Linkage Map of Vigna vexillata (L.) A. Rich.

Science.gov (United States)

Marubodee, Rusama; Ogiso-Tanaka, Eri; Isemura, Takehisa; Chankaew, Sompong; Kaga, Akito; Naito, Ken; Ehara, Hiroshi; Tomooka, Norihiko

2015-01-01

Vigna vexillata (L.) A. Rich. (tuber cowpea) is an underutilized crop for consuming its tuber and mature seeds. Wild form of V. vexillata is a pan-tropical perennial herbaceous plant which has been used by local people as a food. Wild V. vexillata has also been considered as useful gene(s) source for V. unguiculata (cowpea), since it was reported to have various resistance gene(s) for insects and diseases of cowpea. To exploit the potential of V. vexillata, an SSR-based linkage map of V. vexillata was developed. A total of 874 SSR markers successfully amplified single DNA fragment in V. vexillata among 1,336 SSR markers developed from Vigna angularis (azuki bean), V. unguiculata and Phaseolus vulgaris (common bean). An F2 population of 300 plants derived from a cross between salt resistant (V1) and susceptible (V5) accessions was used for mapping. A genetic linkage map was constructed using 82 polymorphic SSR markers loci, which could be assigned to 11 linkage groups spanning 511.5 cM in length with a mean distance of 7.2 cM between adjacent markers. To develop higher density molecular linkage map and to confirm SSR markers position in a linkage map, RAD markers were developed and a combined SSR and RAD markers linkage map of V. vexillata was constructed. A total of 559 (84 SSR and 475 RAD) markers loci could be assigned to 11 linkage groups spanning 973.9 cM in length with a mean distance of 1.8 cM between adjacent markers. Linkage and genetic position of all SSR markers in an SSR linkage map were confirmed. When an SSR genetic linkage map of V. vexillata was compared with those of V. radiata and V. unguiculata, it was suggested that the structure of V. vexillata chromosome was considerably differentiated. This map is the first SSR and RAD marker-based V. vexillata linkage map which can be used for the mapping of useful traits.
High-throughput development of genome-wide locus-specific informative SSR markers in wheat

Science.gov (United States)

Although simple sequence repeat (SSR) markers are not new, they are still useful and often used markers in molecular mapping and marker-assisted breeding, particularly in developing countries. However, locus-specific SSR markers could be more useful and informative in wheat breeding and genetic stud...
A framework for annotating human genome in disease context.

Science.gov (United States)

Xu, Wei; Wang, Huisong; Cheng, Wenqing; Fu, Dong; Xia, Tian; Kibbe, Warren A; Lin, Simon M

2012-01-01

Identification of gene-disease association is crucial to understanding disease mechanism. A rapid increase in biomedical literatures, led by advances of genome-scale technologies, poses challenge for manually-curated-based annotation databases to characterize gene-disease associations effectively and timely. We propose an automatic method-The Disease Ontology Annotation Framework (DOAF) to provide a comprehensive annotation of the human genome using the computable Disease Ontology (DO), the NCBO Annotator service and NCBI Gene Reference Into Function (GeneRIF). DOAF can keep the resulting knowledgebase current by periodically executing automatic pipeline to re-annotate the human genome using the latest DO and GeneRIF releases at any frequency such as daily or monthly. Further, DOAF provides a computable and programmable environment which enables large-scale and integrative analysis by working with external analytic software or online service platforms. A user-friendly web interface (doa.nubic.northwestern.edu) is implemented to allow users to efficiently query, download, and view disease annotations and the underlying evidences.
Improved annotation through genome-scale metabolic modeling of Aspergillus oryzae

DEFF Research Database (Denmark)

Vongsangnak, Wanwipa; Olsen, Peter; Hansen, Kim

2008-01-01

Background: Since ancient times the filamentous fungus Aspergillus oryzae has been used in the fermentation industry for the production of fermented sauces and the production of industrial enzymes. Recently, the genome sequence of A. oryzae with 12,074 annotated genes was released but the number...... to a genome scale metabolic model of A. oryzae. Results: Our assembled EST sequences we identified 1,046 newly predicted genes in the A. oryzae genome. Furthermore, it was possible to assign putative protein functions to 398 of the newly predicted genes. Noteworthy, our annotation strategy resulted...... model was validated and shown to correctly describe the phenotypic behavior of A. oryzae grown on different carbon sources. Conclusion: A much enhanced annotation of the A. oryzae genome was performed and a genomescale metabolic model of A. oryzae was reconstructed. The model accurately predicted...
Development and genetic mapping of SSR markers in foxtail millet [Setaria italica (L.) P. Beauv.].

Science.gov (United States)

Jia, Xiaoping; Zhang, Zhongbao; Liu, Yinghui; Zhang, Chengwei; Shi, Yunsu; Song, Yanchun; Wang, Tianyu; Li, Yu

2009-02-01

SSR markers are desirable markers in analysis of genetic diversity, quantitative trait loci mapping and gene locating. In this study, SSR markers were developed from two genomic libraries enriched for (GA)n and (CA)n of foxtail millet [Setaria italica (L.) P. Beauv.], a crop of historical importance in China. A total of 100 SSR markers among the 193 primer pairs detected polymorphism between two mapping parents of an F(2) population, i.e. "B100" of cultivated S. italica and "A10" of wild S. viridis. Excluding 14 markers with unclear amplifications, and five markers unlinked with any linkage group, a foxtail millet SSR linkage map was constructed by integrating 81 new developed SSR markers with 20 RFLP anchored markers. The 81 SSRs covered nine chromosomes of foxtail millet. The length of the map was 1,654 cM, with an average interval distance between markers of 16.4 cM. The 81 SSR markers were not evenly distributed throughout the nine chromosomes, with Ch.8 harbouring the least (3 markers) and Ch.9 harbouring the most (18 markers). To verify the usefulness of the SSR markers developed, 37 SSR markers were randomly chosen to analyze genetic diversity of 40 foxtail millet accessions. Totally 228 alleles were detected, with an average 6.16 alleles per locus. Polymorphism information content (PIC) value for each locus ranged from 0.413 to 0.847, with an average of 0.697. A positive correlation between PIC and number of alleles and between PIC and number of repeat unit were found [0.802 and 0.429, respectively (P < 0.01)]. UPGMA analysis revealed that the 40 foxtail millet cultivars could be grouped into five clusters in which the landraces' grouping was largely consistent with ecotypes while the breeding varieties from different provinces in China tended to be grouped together.
Genome survey of pistachio (Pistacia vera L.) by next generation sequencing: Development of novel SSR markers and genetic diversity in Pistacia species.

Science.gov (United States)

Ziya Motalebipour, Elmira; Kafkas, Salih; Khodaeiaminjan, Mortaza; Çoban, Nergiz; Gözel, Hatice

2016-12-07

Pistachio (Pistacia vera L.) is one of the most important nut crops in the world. There are about 11 wild species in the genus Pistacia, and they have importance as rootstock seed sources for cultivated P. vera and forest trees. Published information on the pistachio genome is limited. Therefore, a genome survey is necessary to obtain knowledge on the genome structure of pistachio by next generation sequencing. Simple sequence repeat (SSR) markers are useful tools for germplasm characterization, genetic diversity analysis, and genetic linkage mapping, and may help to elucidate genetic relationships among pistachio cultivars and species. To explore the genome structure of pistachio, a genome survey was performed using the Illumina platform at approximately 40× coverage depth in the P. vera cv. Siirt. The K-mer analysis indicated that pistachio has a genome that is about 600 Mb in size and is highly heterozygous. The assembly of 26.77 Gb Illumina data produced 27,069 scaffolds at N50 = 3.4 kb with a total of 513.5 Mb. A total of 59,280 SSR motifs were detected with a frequency of 8.67 kb. A total of 206 SSRs were used to characterize 24 P. vera cultivars and 20 wild Pistacia genotypes (four genotypes from each five wild Pistacia species) belonging to P. atlantica, P. integerrima, P. chinenesis, P. terebinthus, and P. lentiscus genotypes. Overall 135 SSR loci amplified in all 44 cultivars and genotypes, 41 were polymorphic in six Pistacia species. The novel SSR loci developed from cultivated pistachio were highly transferable to wild Pistacia species. The results from a genome survey of pistachio suggest that the genome size of pistachio is about 600 Mb with a high heterozygosity rate. This information will help to design whole genome sequencing strategies for pistachio. The newly developed novel polymorphic SSRs in this study may help germplasm characterization, genetic diversity, and genetic linkage mapping studies in the genus Pistacia.
Estimating the annotation error rate of curated GO database sequence annotations

Directory of Open Access Journals (Sweden)

Brown Alfred L

2007-05-01

Full Text Available Abstract Background Annotations that describe the function of sequences are enormously important to researchers during laboratory investigations and when making computational inferences. However, there has been little investigation into the data quality of sequence function annotations. Here we have developed a new method of estimating the error rate of curated sequence annotations, and applied this to the Gene Ontology (GO sequence database (GOSeqLite. This method involved artificially adding errors to sequence annotations at known rates, and used regression to model the impact on the precision of annotations based on BLAST matched sequences. Results We estimated the error rate of curated GO sequence annotations in the GOSeqLite database (March 2006 at between 28% and 30%. Annotations made without use of sequence similarity based methods (non-ISS had an estimated error rate of between 13% and 18%. Annotations made with the use of sequence similarity methodology (ISS had an estimated error rate of 49%. Conclusion While the overall error rate is reasonably low, it would be prudent to treat all ISS annotations with caution. Electronic annotators that use ISS annotations as the basis of predictions are likely to have higher false prediction rates, and for this reason designers of these systems should consider avoiding ISS annotations where possible. Electronic annotators that use ISS annotations to make predictions should be viewed sceptically. We recommend that curators thoroughly review ISS annotations before accepting them as valid. Overall, users of curated sequence annotations from the GO database should feel assured that they are using a comparatively high quality source of information.
Contributions to In Silico Genome Annotation

KAUST Repository

Kalkatawi, Manal M.

2017-11-30

Genome annotation is an important topic since it provides information for the foundation of downstream genomic and biological research. It is considered as a way of summarizing part of existing knowledge about the genomic characteristics of an organism. Annotating different regions of a genome sequence is known as structural annotation, while identifying functions of these regions is considered as a functional annotation. In silico approaches can facilitate both tasks that otherwise would be difficult and timeconsuming. This study contributes to genome annotation by introducing several novel bioinformatics methods, some based on machine learning (ML) approaches. First, we present Dragon PolyA Spotter (DPS), a method for accurate identification of the polyadenylation signals (PAS) within human genomic DNA sequences. For this, we derived a novel feature-set able to characterize properties of the genomic region surrounding the PAS, enabling development of high accuracy optimized ML predictive models. DPS considerably outperformed the state-of-the-art results. The second contribution concerns developing generic models for structural annotation, i.e., the recognition of different genomic signals and regions (GSR) within eukaryotic DNA. We developed DeepGSR, a systematic framework that facilitates generating ML models to predict GSR with high accuracy. To the best of our knowledge, no available generic and automated method exists for such task that could facilitate the studies of newly sequenced organisms. The prediction module of DeepGSR uses deep learning algorithms to derive highly abstract features that depend mainly on proper data representation and hyperparameters calibration. DeepGSR, which was evaluated on recognition of PAS and translation initiation sites (TIS) in different organisms, yields a simpler and more precise representation of the problem under study, compared to some other hand-tailored models, while producing high accuracy prediction results. Finally
Development and validation of genic-SSR markers in sesame by RNA-seq.

Science.gov (United States)

Zhang, Haiyang; Wei, Libin; Miao, Hongmei; Zhang, Tide; Wang, Cuiying

2012-07-16

Sesame (Sesamum indicum L.) is one of the most important oil crops; however, a lack of useful molecular markers hinders current genetic research. We performed transcriptome sequencing of samples from different sesame growth and developmental stages, and mining of genic-SSR markers to identify valuable markers for sesame molecular genetics research. In this study, 75 bp and 100 bp paired-end RNA-seq was used to sequence 24 cDNA libraries, and 42,566 uni-transcripts were assembled from more than 260 million filtered reads. The total length of uni-transcript sequences was 47.99 Mb, and 7,324 SSRs (SSRs ≥15 bp) and 4,440 SSRs (SSRs ≥18 bp) were identified. On average, there was one genic-SSR per 6.55 kb (SSRs ≥15 bp) or 10.81 kb (SSRs ≥18 bp). Among perfect SSRs (≥18 bp), di-nucleotide motifs (48.01%) were the most abundant, followed by tri- (20.96%), hexa- (25.37%), penta- (2.97%), tetra- (2.12%), and mono-nucleotides (0.57%). The top four motif repeats were (AG/CT)n [1,268 (34.51%)], (CA/TG)n [281 (7.65%)], (AT/AT)n [215 (5.85%)], and (GAA/TTC)n [131 (3.57%)]. A total of 2,164 SSR primer pairs were identified in the 4,440 SSR-containing sequences (≥18 bp), and 300 SSR primer pairs were randomly chosen for validation. These SSR markers were amplified and validated in 25 sesame accessions (24 cultivated accessions, one wild species). 276 (92.0%) primer pairs yielded PCR amplification products in 24 cultivars. Thirty two primer pairs (11.59%) exhibited polymorphisms. Moreover, 203 primer pairs (67.67%) yielded PCR amplicons in the wild accession and 167 (60.51%) were polymorphic between species. A UPGMA dendrogram based on genetic similarity coefficients showed that the correlation between genotype and geographical source was low and that the genetic basis of sesame in China is narrow, as previously reported. The 32 polymorphic primer pairs were validated using an F2 mapping population; 18 primer pairs exhibited polymorphisms between the parents, and 14
A simple ssr analysis for genetic diversity estimation of maize landraces

Directory of Open Access Journals (Sweden)

Ignjatovic-Micic Dragana

2015-01-01

Full Text Available collection of 2217 landraces from western Balkan (former Yugoslavia is maintained at Maize Research Institute Zemun Polje gene bank. Nine flint and nine dent accessions from six agro-ecological groups (races, chosen on the basis of diverse pedigrees, were analyzed for genetic relatedness using phenotypic and simple sequence repeat (SSR markers. One of the aims was to establish a reliable set of SSR markers for a rapid diversity analysis using polyacrilamide gels and ethidium bromide staining. In the principal component analysis (PCA the first three principal components accounted for 80.86% of total variation and separated most of the flint from dent landraces. Ten SSR primers revealed a total of 56 and 63 alleles in flint and dent landraces, respectively, with low stuttering and good allele resolution on the gels. High average PIC value (0.822 also supports informativeness and utility of the markers used in this study. Higher genetic variation was observed among flint genotypes, as genetic distances between flint landraces covered a larger range of values (0.11- 0.38 than between dent (0.22 - 0.33 genotypes. Both phenotypic and SSR analyses distinguished flint and dent landraces, but neither of them could abstract agro-ecological groups. The SSR method used gave clear, easy to read band patterns that could be used for reliable allele frequency determination. Genetic diversity revealed for both markers indicated that the landraces were highly adapted to specific environmental conditions and purposes and could be valuable sources of genetic variability. [Projekat Ministarstva nauke Republike Srbije, br. TR31028: Exploitation of maize diversity to improve grain quality and drought tolerance

Use of SSR markers for DNA fingerprinting and diversity analysis of Pakistani sugarcane (Saccharum spp. hybrids) cultivars

Science.gov (United States)

In recent years SSR markers have been used widely for genetic analysis. The objective of this study was to use an SSR-based marker system to develop the molecular fingerprints and analyze the genetic relationship of sugarcane cultivars grown in Pakistan. Twenty-one highly polymorphic SSR markers wer...
Transcriptome survey of Patagonian southern beech Nothofagus nervosa (= N. Alpina: assembly, annotation and molecular marker discovery

Directory of Open Access Journals (Sweden)

Torales Susana L

2012-07-01

Full Text Available Abstract Background Nothofagus nervosa is one of the most emblematic native tree species of Patagonian temperate forests. Here, the shotgun RNA-sequencing (RNA-Seq of the transcriptome of N. nervosa, including de novo assembly, functional annotation, and in silico discovery of potential molecular markers to support population and associations genetic studies, are described. Results Pyrosequencing of a young leaf cDNA library generated a total of 111,814 high quality reads, with an average length of 447 bp. De novo assembly using Newbler resulted into 3,005 tentative isotigs (including alternative transcripts. The non-assembled sequences (singletons were clustered with CD-HIT-454 to identify natural and artificial duplicates from pyrosequencing reads, leading to 21,881 unique singletons. 15,497 out of 24,886 non-redundant sequences or unigenes, were successfully annotated against a plant protein database. A substantial number of simple sequence repeat markers (SSRs were discovered in the assembled and annotated sequences. More than 40% of the SSR sequences were inside ORF sequences. To confirm the validity of these predicted markers, a subset of 73 SSRs selected through functional annotation evidences were successfully amplified from six seedlings DNA samples, being 14 polymorphic. Conclusions This paper is the first report that shows a highly precise representation of the mRNAs diversity present in young leaves of a native South American tree, N. nervosa, as well as its in silico deduced putative functionality. The reported Nothofagus transcriptome sequences represent a unique resource for genetic studies and provide a tool to discover genes of interest and genetic markers that will greatly aid questions involving evolution, ecology, and conservation using genetic and genomic approaches in the genus.
Transcriptomic analysis, genic SSR development, and genetic diversity of proso millet (Panicum miliaceum; Poaceae).

Science.gov (United States)

Hou, Siyu; Sun, Zhaoxia; Li, Yaoshen; Wang, Yijie; Ling, Hubin; Xing, Guofang; Han, Yuanhuai; Li, Hongying

2017-07-01

Proso millet ( Panicum miliaceum ; Poaceae) is a minor crop with good nutritional qualities and strong tolerance to drought stress and soil infertility. However, studies on genetic diversity have been limited due to a lack of efficient genetic markers. Illumina sequencing technology was used to generate short read sequences of proso millet, and de novo transcriptome assemblies were used to develop a de novo assembly of proso millet. Genic simple sequence repeat (SSR) markers were identified and used to detect polymorphism among 56 accessions. Population structure and genetic similarity coefficient were estimated. In total, 25,341 unique gene sequences and 4724 SSR loci were obtained from the transcriptome, of which 229 pairs of SSR primers were validated, which resulted in 14 polymorphic genic SSR primers exhibiting 43 total alleles. According to the ratio of polymorphic markers (6.1%, 14/229), there are potentially 288 polymorphic genic SSR markers available for genetic assay development in the future. Bayesian population analyses showed that the 56 accessions comprised two distinct groups. A genetic structure and cluster assay indicated that the accessions from the Loess Plateau of China shared a high genetic similarity coefficient with those from other regions and that there was no correlation between genetic diversity and geographic origin. The transcriptome sequencing data and millet-specific SSR markers developed in this study establish an excellent resource for gene discovery and may improve the development of breeding programs in proso millet in the future.
Genetic Diversity of Namibian Pennisetum glaucum (L.) R. BR. (Pearl Millet) Landraces Analyzed by SSR and Morphological Markers.

Science.gov (United States)

McBenedict, Billy; Chimwamurombe, Percy; Kwembeya, Ezekeil; Maggs-Kölling, Gillian

2016-01-01

Current Pennisetum glaucum (L.) R. BR. cultivars in Namibia have overall poor performance posing a threat to the nation's food security because this crop is staple for over 70% of the Namibian population. The crop suffers from undesirable production traits such as susceptibility to diseases, low yield, and prolonged reproductive cycle. This study aimed to understand the genetic diversity of the crop in Namibia by simple sequence repeats (SSRs) and morphology analysis. A total of 1441 genotypes were collected from the National Gene Bank representing all the Namibian landraces. A sample of 96 genotypes was further analyzed by SSR using Shannon-Wiener diversity index and revealed a value of 0.45 indicating low genetic diversity. Ordination using Principal Coordinate Analysis (PCoA) on SSR data confirmed clusters generated by UPGMA for the 96 P. glaucum accessions. UPGMA phenograms of 29 morphological characterized genotypes were generated for SSR and morphology data and the two trees revealed 78% resemblance. Lodging susceptibility, tillering attitude, spike density, fodder yield potential, early vigour, and spike shape were the phenotypic characters upon which some clusters were based in both datasets. It is recommended that efforts should be made to widen the current gene pool in Namibia.
Genetic Diversity of Namibian Pennisetum glaucum (L. R. BR. (Pearl Millet Landraces Analyzed by SSR and Morphological Markers

Directory of Open Access Journals (Sweden)

Billy McBenedict

2016-01-01

Full Text Available Current Pennisetum glaucum (L. R. BR. cultivars in Namibia have overall poor performance posing a threat to the nation’s food security because this crop is staple for over 70% of the Namibian population. The crop suffers from undesirable production traits such as susceptibility to diseases, low yield, and prolonged reproductive cycle. This study aimed to understand the genetic diversity of the crop in Namibia by simple sequence repeats (SSRs and morphology analysis. A total of 1441 genotypes were collected from the National Gene Bank representing all the Namibian landraces. A sample of 96 genotypes was further analyzed by SSR using Shannon-Wiener diversity index and revealed a value of 0.45 indicating low genetic diversity. Ordination using Principal Coordinate Analysis (PCoA on SSR data confirmed clusters generated by UPGMA for the 96 P. glaucum accessions. UPGMA phenograms of 29 morphological characterized genotypes were generated for SSR and morphology data and the two trees revealed 78% resemblance. Lodging susceptibility, tillering attitude, spike density, fodder yield potential, early vigour, and spike shape were the phenotypic characters upon which some clusters were based in both datasets. It is recommended that efforts should be made to widen the current gene pool in Namibia.
Characterization of the global transcriptome for Pyropia haitanensis (Bangiales, Rhodophyta) and development of cSSR markers.

Science.gov (United States)

Xie, Chaotian; Li, Bing; Xu, Yan; Ji, Dehua; Chen, Changsheng

2013-02-16

Pyropia haitanensis is an economically important mariculture crop in China and is also valuable in life science research. However, the lack of genetic information of this organism hinders the understanding of the molecular mechanisms of specific traits. Thus, high-throughput sequencing is needed to generate a number of transcriptome sequences to be used for gene discovery and molecular marker development. In this study, high-throughput sequencing was used to analyze the global transcriptome of P. haitanensis. Approximately 103 million 90 bp paired-end reads were generated using an Illumina HiSeq 2000. De novo assembly with paired-end information yielded 24,575 unigenes with an average length of 645 bp. Based on sequence similarity searches with known proteins, a total of 16,377 (66.64%) genes were identified. Of these annotated unigenes, 5,471 and 9,168 unigenes were assigned to gene ontology and clusters of orthologous groups, respectively. Searching against the KEGG database indicated that 12,167 (49.51%) unigenes mapped to 124 KEGG pathways. Among the carbon fixation pathways, almost all the essential genes related to the C3- and C4-pathways for P. haitanensis were discovered. Significantly different expression levels of three key genes (Rubisco, PEPC and PEPCK) in different lifecycle stages of P. haitanensis indicated that the carbon fixation pathway in the conchocelis and thallus were different, and the C4-like pathway might play important roles in the conchocelis stage. In addition, 2,727 cSSRs loci were identified in the unigenes. Among them, trinucleotide SSRs were the dominant repeat motif (87.17%, 2,377) and GCC/CCG motifs were the most common repeats (60.07%, 1,638). High quality primers to 824 loci were designed and 100 primer pairs were randomly evaluated in six strains of P. haitanensis. Eighty-seven primer pairs successfully yielded amplicons. This study generated a large number of putative P. haitanensis transcript sequences, which can be used for
Genetic diversity and DNA fingerprinting in jute (Corchorus spp. based on SSR markers

Directory of Open Access Journals (Sweden)

Liwu Zhang

2015-10-01

Full Text Available Genetic diversity analysis and DNA finger printing are very useful in breeding programs, seed conservation and management. Jute (Corchorus spp. is the second most important natural fiber crop after cotton. DNA fingerprinting studies in jute using SSR markers are limited. In this study, 58 jute accessions, including two control varieties (Huangma 179 and Kuanyechangguo from the official variety registry in China were evaluated with 28 pairs of SSR primers. A total of 184 polymorphic loci were identified. Each primer detected 3 to 15 polymorphic loci, with an average of 6.6. The 58 jute accessions were DNA-fingerprinted with 67 SSR markers from the 28 primer pairs. These markers differentiated the 58 jute accessions from one another, with CoSSR305-120 and CoSSR174-195 differentiating Huangma 179 and Kuanyechangguo, respectively. NTSYS-pc2.10 software was used to analyze the genetic diversity in the 58 jute accessions. Their genetic similarity coefficients ranged from 0.520 to 0.910 with an average of 0.749, indicating relatively great genetic diversity among them. The 58 jute accessions were divided into four groups with the coefficient 0.710 used as a value for classification, consistent with their species and pedigrees. All these results may be useful both for protection of intellectual property rights of jute accessions and for jute improvement.
Genetic diversity and DNA fingerprinting in jute(Corchorus spp.) based on SSR markers

Institute of Scientific and Technical Information of China (English)

Liwu; Zhang; Rongrong; Cai; Minhang; Yuan; Aifen; Tao; Jiantang; Xu; Lihui; Lin; Pingping; Fang; Jianmin; Qi

2015-01-01

Genetic diversity analysis and DNA finger printing are very useful in breeding programs,seed conservation and management. Jute(Corchorus spp.) is the second most important natural fiber crop after cotton. DNA fingerprinting studies in jute using SSR markers are limited. In this study, 58 jute accessions, including two control varieties(Huangma 179 and Kuanyechangguo) from the official variety registry in China were evaluated with 28 pairs of SSR primers. A total of 184 polymorphic loci were identified. Each primer detected 3 to 15 polymorphic loci, with an average of 6.6. The 58 jute accessions were DNA-fingerprinted with 67 SSR markers from the 28 primer pairs. These markers differentiated the 58 jute accessions from one another, with Co SSR305-120 and Co SSR174-195 differentiating Huangma 179 and Kuanyechangguo, respectively. NTSYS-pc2.10 software was used to analyze the genetic diversity in the 58 jute accessions. Their genetic similarity coefficients ranged from 0.520 to 0.910 with an average of 0.749, indicating relatively great genetic diversity among them. The 58 jute accessions were divided into four groups with the coefficient 0.710 used as a value for classification, consistent with their species and pedigrees. All these results may be useful both for protection of intellectual property rights of jute accessions and for jute improvement.
Novel and highly informative Capsicum SSR markers and their cross-species transferability.

Science.gov (United States)

Buso, G S C; Reis, A M M; Amaral, Z P S; Ferreira, M E

2016-09-23

This study was undertaken primarily to develop new simple sequence repeat (SSR) markers for Capsicum. As part of this project aimed at broadening the use of molecular tools in Capsicum breeding, two genomic libraries enriched for AG/TC repeat sequences were constructed for Capsicum annuum. A total of 475 DNA clones were sequenced from both libraries and 144 SSR markers were tested on cultivated and wild species of Capsicum. Forty-five SSR markers were randomly selected to genotype a panel of 48 accessions of the Capsicum germplasm bank. The number of alleles per locus ranged from 2 to 11, with an average of 6 alleles. The polymorphism information content was on average 0.60, ranging from 0.20 to 0.83. The cross-species transferability to seven cultivated and wild Capsicum species was tested with a set of 91 SSR markers. We found that a high proportion of the loci produced amplicons in all species tested. C. frutescens had the highest number of transferable markers, whereas the wild species had the lowest. Our results indicate that the new markers can be readily used in genetic analyses of Capsicum.
Cloning and characterization of a pyrethroid pesticide decomposing esterase gene, Est3385, from Rhodopseudomonas palustris PSB-S.

Science.gov (United States)

Luo, Xiangwen; Zhang, Deyong; Zhou, Xuguo; Du, Jiao; Zhang, Songbai; Liu, Yong

2018-05-09

Full length open reading frame of pyrethroid detoxification gene, Est3385, contains 963 nucleotides. This gene was identified and cloned based on the genome sequence of Rhodopseudomonas palustris PSB-S available at the GneBank. The predicted amino acid sequence of Est3385 shared moderate identities (30-46%) with the known homologous esterases. Phylogenetic analysis revealed that Est3385 was a member in the esterase family I. Recombinant Est3385 was heterologous expressed in E. coli, purified and characterized for its substrate specificity, kinetics and stability under various conditions. The optimal temperature and pH for Est3385 were 35 °C and 6.0, respectively. This enzyme could detoxify various pyrethroid pesticides and degrade the optimal substrate fenpropathrin with a Km and Vmax value of 0.734 ± 0.013 mmol·l -1 and 0.918 ± 0.025 U·µg -1 , respectively. No cofactor was found to affect Est3385 activity but substantial reduction of enzymatic activity was observed when metal ions were applied. Taken together, a new pyrethroid degradation esterase was identified and characterized. Modification of Est3385 with protein engineering toolsets should enhance its potential for field application to reduce the pesticide residue from agroecosystems.
GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing.

Science.gov (United States)

Wang, Xuewen; Wang, Le

2016-01-01

Simple sequence repeats (SSRs), also referred to as microsatellites, are highly variable tandem DNAs that are widely used as genetic markers. The increasing availability of whole-genome and transcript sequences provides information resources for SSR marker development. However, efficient software is required to efficiently identify and display SSR information along with other gene features at a genome scale. We developed novel software package Genome-wide Microsatellite Analyzing Tool Package (GMATA) integrating SSR mining, statistical analysis and plotting, marker design, polymorphism screening and marker transferability, and enabled simultaneously display SSR markers with other genome features. GMATA applies novel strategies for SSR analysis and primer design in large genomes, which allows GMATA to perform faster calculation and provides more accurate results than existing tools. Our package is also capable of processing DNA sequences of any size on a standard computer. GMATA is user friendly, only requires mouse clicks or types inputs on the command line, and is executable in multiple computing platforms. We demonstrated the application of GMATA in plants genomes and reveal a novel distribution pattern of SSRs in 15 grass genomes. The most abundant motifs are dimer GA/TC, the A/T monomer and the GCG/CGC trimer, rather than the rich G/C content in DNA sequence. We also revealed that SSR count is a linear to the chromosome length in fully assembled grass genomes. GMATA represents a powerful application tool that facilitates genomic sequence analyses. GAMTA is freely available at http://sourceforge.net/projects/gmata/?source=navbar.
Genetic Diversity and Identification of Chinese-Grown Pecan Using ISSR and SSR Markers

Directory of Open Access Journals (Sweden)

Zhong-Ren Guo

2011-12-01

Full Text Available Pecan is an important horticultural nut crop originally from North America and now widely cultivated in China for its high ecological, ornamental and economic value. Currently, there are over one hundred cultivars grown in China, including introduced American cultivars and Chinese seedling breeding cultivars. Molecular markers were used to assess the genetic diversity of these cultivars and to identify the pedigrees of fine pecan plants with good characteristics and no cultivar-related data. A total of 77 samples grown in China were studied, including 14 introduced cultivars, 12 domestic seedling breeding cultivars, and 49 fine pecan plants with no cultivar data, together with Carya cathayensis and Juglans nigra. A total of 77 ISSR and 19 SSR primers were prescreened; 10 ISSR and eight SSR primers were selected, yielding a total of 94 amplified bands (100% polymorphic in the range of 140–1,950 bp for the ISSR and 70 amplified bands (100% polymorphic in the range of 50–350 bp for SSR markers. Genetic diversity analyses indicated Chinese-grown pecan cultivars and fine plants had significant diversity at the DNA level. The dengrograms constructed with ISSR, SSR or combined data were very similar, but showed very weak grouping association with morphological characters. However, the progeny were always grouped with the parents. The great diversity found among the Chinese cultivars and the interesting germplasm of the fine pecan plants analyzed in this study are very useful for increasing the diversity of the pecan gene pool. All 77 accessions in this study could be separated based on the ISSR and SSR fingerprints produced by one or more primers. The results of our study also showed that ISSR and SSR techniques were both suitable for genetic diversity analyses and the identification of pecan resources.
Genetic diversity and identification of Chinese-grown pecan using ISSR and SSR markers.

Science.gov (United States)

Jia, Xiao-Dong; Wang, Tao; Zhai, Min; Li, Yong-Rong; Guo, Zhong-Ren

2011-12-06

Pecan is an important horticultural nut crop originally from North America and now widely cultivated in China for its high ecological, ornamental and economic value. Currently, there are over one hundred cultivars grown in China, including introduced American cultivars and Chinese seedling breeding cultivars. Molecular markers were used to assess the genetic diversity of these cultivars and to identify the pedigrees of fine pecan plants with good characteristics and no cultivar-related data. A total of 77 samples grown in China were studied, including 14 introduced cultivars, 12 domestic seedling breeding cultivars, and 49 fine pecan plants with no cultivar data, together with Carya cathayensis and Juglans nigra. A total of 77 ISSR and 19 SSR primers were prescreened; 10 ISSR and eight SSR primers were selected, yielding a total of 94 amplified bands (100% polymorphic) in the range of 140-1,950 bp for the ISSR and 70 amplified bands (100% polymorphic) in the range of 50-350 bp for SSR markers. Genetic diversity analyses indicated Chinese-grown pecan cultivars and fine plants had significant diversity at the DNA level. The dengrograms constructed with ISSR, SSR or combined data were very similar, but showed very weak grouping association with morphological characters. However, the progeny were always grouped with the parents. The great diversity found among the Chinese cultivars and the interesting germplasm of the fine pecan plants analyzed in this study are very useful for increasing the diversity of the pecan gene pool. All 77 accessions in this study could be separated based on the ISSR and SSR fingerprints produced by one or more primers. The results of our study also showed that ISSR and SSR techniques were both suitable for genetic diversity analyses and the identification of pecan resources.
Development and validation of new SSR markers from expressed regions in the garlic genome

Science.gov (United States)

Limited number of simple sequence repeat (SSR) markers is available for the genome of garlic (Allium sativum L.) although SSR markers have become one of the most preferred marker systems because they are typically co-dominant, reproducible, cross species transferable and highly polymorphic. In this ...
Development and characterization of EST-SSR markers for Artocarpus hypargyreus (Moraceae)1

Science.gov (United States)

Liu, Haijun; Tan, Weizheng; Sun, Hongbin; Liu, Yu; Meng, Kaikai; Liao, Wenbo

2016-01-01

Premise of the study: Polymorphic microsatellite markers were developed for Artocarpus hypargyreus (Moraceae), a threatened species endemic to China, to investigate the genetic diversity and structure of the species. Methods and Results: Based on the transcriptome data of A. hypargyreus, 63 primer pairs were preliminarily designed and tested, of which 34 were successfully amplified and 10 displayed clear polymorphisms across the 67 individuals from four populations of A. hypargyreus. The results showed the number of alleles per locus ranged from three to 10, and the observed heterozygosity and expected heterozygosity per locus varied from 0.000 to 0.706 and from 0.328 to 0.807, respectively. Conclusions: These microsatellite markers will be useful in exploring genetic diversity and structure of A. hypargyreus. Furthermore, most loci were successfully cross-amplified in A. nitidus and A. heterophyllus, indicating that they will be of great value for genetic study across this genus. PMID:28101438
Phylogenetic molecular function annotation

International Nuclear Information System (INIS)

Engelhardt, Barbara E; Jordan, Michael I; Repo, Susanna T; Brenner, Steven E

2009-01-01

It is now easier to discover thousands of protein sequences in a new microbial genome than it is to biochemically characterize the specific activity of a single protein of unknown function. The molecular functions of protein sequences have typically been predicted using homology-based computational methods, which rely on the principle that homologous proteins share a similar function. However, some protein families include groups of proteins with different molecular functions. A phylogenetic approach for predicting molecular function (sometimes called 'phylogenomics') is an effective means to predict protein molecular function. These methods incorporate functional evidence from all members of a family that have functional characterizations using the evolutionary history of the protein family to make robust predictions for the uncharacterized proteins. However, they are often difficult to apply on a genome-wide scale because of the time-consuming step of reconstructing the phylogenies of each protein to be annotated. Our automated approach for function annotation using phylogeny, the SIFTER (Statistical Inference of Function Through Evolutionary Relationships) methodology, uses a statistical graphical model to compute the probabilities of molecular functions for unannotated proteins. Our benchmark tests showed that SIFTER provides accurate functional predictions on various protein families, outperforming other available methods.
DNA Profiles of MTG (Moderat Tahan Gano) Oil Palm Variety Based on SSR Marker

Science.gov (United States)

Putri, L. A. P.; Setiado, H.; Hardianti, R.

2017-03-01

The oil palm, an economically important tree in Indonesia, has been one of the world’s major sources of edible oil and a significant precursor of biodiesel fuel. The objectives of this study were to know DNA profile of commercial MTG (Moderat Tahan Gano) oil palm variety collections. A total of 10 trees MTG oil palm variety were used for analysis. In this experiment, the DNA profile diversity was assessed using mEgCIR0174 and SSR-1 loci of oil palm’s specific SSR markers. The results of the experiment indicated out of 3 alleles of pcr product of mEgCIR0174 (198, 203 and 208 bp) and SSR-1 (201, 217 and 232 bp). These preliminary results demonstrated SSR marker can be used to evaluate genetic relatedness among trees of MTG (Moderat Tahan Gano) oil palm variety derived from different crossing or difference to desease resistance trait or misslabeled.
SSR markers reveal genetic variation between improved cassava ...

African Journals Online (AJOL)

SERVER

2007-12-03

Dec 3, 2007 ... Numerical Taxonomy and Multivariate Analysis System software .... DNA extraction and polymerase chain reaction with SSR .... Primers that have PIC value falling between 0.50 to 0.70 ..... Woodman, Lee M, Porter K (2000).
APPLICATION OF RYE SSR MARKERS FOR DETECTION OF GENETIC DIVERSITY IN TRITICALE

Directory of Open Access Journals (Sweden)

Želmíra Balážová

2016-06-01

Full Text Available Present study aims to testify usefulness of particular rye SSR markers for the detection of genetic diversity degree in the set of 20 triticale cultivars coming from different European countries. For this purpose, a set of six rye SSR markers were used. The set of six polymorphic markers provided 22 alleles with an average frequency of 3.67 alleles per locus. The number of alleles ranged between 2 (SCM43 and 5 (SCM28, SCM86. Resulting from the number and frequency of alleles diversity index (DI, polymorphic information content (PIC and probability of identity (PI were calculated. An average value of PIC for 6 SSR markers was 0.505, the highest value was calculated for rye SSR marker SCM86 (0.706. Based on UPGMA algorithm, a dendrogram was constructed. In dendrogram cultivars were divided into two main clusters. The first cluster contained two cultivars, Russian cultivar Greneder and Slovak cultivar Largus, and second included 18 cultivars. Genetically the closest were two Greek cultivars (Niobi and Thisbi and were close to other Greek cultivar Vrodi. It was possible to separate triticale cultivars of spring and winter form in dendrogram. Results showed the utility of rye microsatellite markers for estimation of genetic diversity of European triticale genotypes leading to genotype identification.
Impedance characteristics of DFIGs considering the impacts of DFIG numbers and its application on SSR analysis

Science.gov (United States)

You, J. J.; Ning, W. Y.; Jiang, H.; Dong, Y. X.; Liu, H.; Li, Y.

2017-05-01

Sub-synchronous resonance (SSR) occurred in the power system with both Doubly Fed Induction Generators (DFIGs) and series-compensated lines. There have been several papers focusing on the analysis of such SSR phenomenon. To the authors’ knowledge, the impacts of DFIG numbers are seldom considered in existing papers. In this paper, the impedance characteristics of DFIGs under different numbers are analyzed. Then the relationship between DFIG numbers and SSR characteristics is discussed. The contributions to be included in the paper are as following: 1) the impedance of N DFIGs is roughly equal to 1/N of the impedance of one DFIG. 2) The SSR risk of system is related with both the numbers of DFIGs and the parameters of the series-compensated lines.

Comparative assessment of genetic diversity in Sesamum indicum L. using RAPD and SSR markers.

Science.gov (United States)

Dar, Aejaz Ahmad; Mudigunda, Sushma; Mittal, Pramod Kumar; Arumugam, Neelakantan

2017-05-01

Sesame (Sesamum indicum L.) is an ancient oilseed crop known for its nutty seeds and high-quality edible oil. It is an unexplored crop with a great economic potential. The present study deals with assessment of genetic diversity in the crop. Twenty two RAPD and 18 SSR primers were used for analysis of the 47 different sesame accessions grown in different agroclimatic zones of India. A total of 256 bands were obtained with RAPD primers, of which 191 were polymorphic. SSR primers gave 64 DNA bands, of which all of were polymorphic. The Jaccard's similarity coefficient of RAPD, SSR, and pooled RAPD and SSR data ranged from 0.510 to 0.885, 0.167 to 0.867, and 0.505 to 0.853, respectively. Maximum polymorphic information content was reported with SSRs (0.194) compared to RAPDs (0.186). Higher marker index was observed with RAPDs (1.426) than with SSRs (0.621). Similarly, maximum resolving power was found with RAPD (4.012) primers than with SSRs (0.884). The RAPD primer RPI-B11 and SSR primer S16 were the most informative in terms of describing genetic variability among the varieties under study. At a molecular level, the seed coat colour was distinguishable by the presence and absence of a group of marker amplicon/s. White and brown seeded varieties clustered close to each other, while black seeded varieties remained distanced from the cluster. In the present study, we found higher variability in Sesamum indicum L. using RAPD and SSR markers and these could assist in DNA finger printing, conservation of germplasm, and crop improvement.
Assessment of genetic diversity in Brassica juncea Brassicaceae genotypes using phenotypic differences and SSR markers

Directory of Open Access Journals (Sweden)

Vinu V.

2013-12-01

Full Text Available Las especies de mostaza del género Brassica representan uno de los cultivos de semillas oleaginosas más importantes en India, sin embargo, su diversidad genética es poco conocida. Para la utilización de genotipos en programas de cultivos resulta esencial un mayor conocimiento sobre este tema. Debido a ello, se evaluó la diversidad genética entre 44 genotipos de mostaza de la India Brassica juncea incluyendo variedades y líneas puras de diferentes zonas agro-climáticas de la India y algunos genotipos exóticos Australia, Polonia y China. Para ello, se utilizaron marcadores SSR específicos del genoma A y B y datos fenotípicos del rendimiento de 12 cosechas y sus características. De los 143 primers evaluados, 134 reportaron polimorfismo y un total de 355 alelos fueron amplificados. Se generaron dendrogramas a partir de los coeficientes de similitud de Jaccard y de disimilitud Manhattan, basados en un algoritmo de vinculación promedio UPGMA. Se utilizaron datos de marcadores genéticos y datos fenotípicos. Los genotipos se agruparon en cuatro grupos basados en las distancias genéticas. Ambos patrones de agrupamiento, tanto los basados en los coeficientes de similitud de Jaccard como los basados en los de disimilitud Manhattan, separaron independientemente los genotipos por su genealogía y origen, de una manera efectiva. El PCoA reveló que la agrupación de genotipos basada en datos de marcadores SSR, es más convincente que los datos fenotípicos, sin embargo, se observó que la correlación entre las matrices de distancia fenotípica y genética resultó muy baja r=0.11. Por lo tanto, para estudios de diversidad basados en marcadores moleculares es necesario realizar más pruebas. Los marcadores SSR constituyen herramientas más eficientes para discriminar entre genotipos de B. juncea, que las características cuantitativas.
Comparison of SSR and SNP markers in estimation of genetic diversity and population structure of Indian rice varieties.

Science.gov (United States)

Singh, Nivedita; Choudhury, Debjani Roy; Singh, Amit Kumar; Kumar, Sundeep; Srinivasan, Kalyani; Tyagi, R K; Singh, N K; Singh, Rakesh

2013-01-01

Simple sequence repeat (SSR) and Single Nucleotide Polymorphic (SNP), the two most robust markers for identifying rice varieties were compared for assessment of genetic diversity and population structure. Total 375 varieties of rice from various regions of India archived at the Indian National GeneBank, NBPGR, New Delhi, were analyzed using thirty six genetic markers, each of hypervariable SSR (HvSSR) and SNP which were distributed across 12 rice chromosomes. A total of 80 alleles were amplified with the SSR markers with an average of 2.22 alleles per locus whereas, 72 alleles were amplified with SNP markers. Polymorphic information content (PIC) values for HvSSR ranged from 0.04 to 0.5 with an average of 0.25. In the case of SNP markers, PIC values ranged from 0.03 to 0.37 with an average of 0.23. Genetic relatedness among the varieties was studied; utilizing an unrooted tree all the genotypes were grouped into three major clusters with both SSR and SNP markers. Analysis of molecular variance (AMOVA) indicated that maximum diversity was partitioned between and within individual level but not between populations. Principal coordinate analysis (PCoA) with SSR markers showed that genotypes were uniformly distributed across the two axes with 13.33% of cumulative variation whereas, in case of SNP markers varieties were grouped into three broad groups across two axes with 45.20% of cumulative variation. Population structure were tested using K values from 1 to 20, but there was no clear population structure, therefore Ln(PD) derived Δk was plotted against the K to determine the number of populations. In case of SSR maximum Δk was at K=5 whereas, in case of SNP maximum Δk was found at K=15, suggesting that resolution of population was higher with SNP markers, but SSR were more efficient for diversity analysis.
Characterization of genic microsatellite markers derived from expressed sequence tags in Pacific abalone ( Haliotis discus hannai)

Science.gov (United States)

Li, Qi; Shu, Jing; Zhao, Cui; Liu, Shikai; Kong, Lingfeng; Zheng, Xiaodong

2010-01-01

Simple sequence repeat (SSR) markers were developed from the expressed sequence tags (ESTs) of Pacific abalone ( Haliotis discus hannai). Repeat motifs were found in 4.95% of the ESTs at a frequency of one repeat every 10.04 kb of EST sequences, after redundancy elimination. Seventeen polymorphic EST-SSRs were developed. The number of alleles per locus varied from 2-17, with an average of 6.8 alleles per locus. The expected and observed heterozygosities ranged from 0.159 to 0.928 and from 0.132 to 0.922, respectively. Twelve of the 17 loci (70.6%) were successfully amplified in H. diversicolor. Seventeen loci segregated in three families, with three showing the presence of null alleles (17.6%). The adequate level of variability and low frequency of null alleles observed in H. discus hannai, together with the high rate of transportability across Haliotis species, make this set of EST-SSR markers an important tool for comparative mapping, marker-assisted selection, and evolutionary studies, not only in the Pacific abalone, but also in related species.
Thermal hydraulic test of advanced fuel bundle with spectral shift rod (SSR) for BWR. Effect of thermal hydraulic parameters on steady state characteristics

International Nuclear Information System (INIS)

Kondo, Takao; Kitou, Kazuaki; Chaki, Masao; Ohga, Yukiharu; Makigami, Takeshi

2011-01-01

Japanese national project of next generation light water reactor (LWR) development started in 2008. Under this project, spectral shift rod (SSR) is being developed. SSR, which replaces conventional water rod (WR) of boiling water reactor (BWR) fuel bundle, was invented to enhance the BWR's merit, spectral shift effect for uranium saving. In SSR, water boils by neutron and gamma-ray direct heating and water level is formed as a boundary of the upper steam region and the lower water region. This SSR water level can be controlled by core flow rate, which amplifies the change of average core void fraction, resulting in the amplified spectral shift effect. This paper presents the steady state test results of the base geometry case in SSR thermal hydraulic test, which was conducted under the national project of next generation LWR. In the test, thermal hydraulic parameters, such as flow rate, pressure, inlet subcooling and heater rod power are changed to evaluate these effects on SSR water level and other SSR characteristics. In the test results, SSR water level rose as flow rate rose, which showed controllability of SSR water level by flow rate. The sensitivities of other thermal hydraulic parameters on SSR water level were also evaluated. The obtained data of parameter's sensitivities is various enough for the further analytical evaluation. The fluctuation of SSR water level was also measured to be small enough. As a result, it was confirmed that SSR's steady state performance was as planned and that SSR design concept is feasible. (author)
Transcriptomic analysis, genic SSR development, and genetic diversity of proso millet (Panicum miliaceum; Poaceae)1

Science.gov (United States)

Hou, Siyu; Sun, Zhaoxia; Li, Yaoshen; Wang, Yijie; Ling, Hubin; Xing, Guofang; Han, Yuanhuai; Li, Hongying

2017-01-01

Premise of the study: Proso millet (Panicum miliaceum; Poaceae) is a minor crop with good nutritional qualities and strong tolerance to drought stress and soil infertility. However, studies on genetic diversity have been limited due to a lack of efficient genetic markers. Methods: Illumina sequencing technology was used to generate short read sequences of proso millet, and de novo transcriptome assemblies were used to develop a de novo assembly of proso millet. Genic simple sequence repeat (SSR) markers were identified and used to detect polymorphism among 56 accessions. Population structure and genetic similarity coefficient were estimated. Results: In total, 25,341 unique gene sequences and 4724 SSR loci were obtained from the transcriptome, of which 229 pairs of SSR primers were validated, which resulted in 14 polymorphic genic SSR primers exhibiting 43 total alleles. According to the ratio of polymorphic markers (6.1%, 14/229), there are potentially 288 polymorphic genic SSR markers available for genetic assay development in the future. Bayesian population analyses showed that the 56 accessions comprised two distinct groups. Discussion: A genetic structure and cluster assay indicated that the accessions from the Loess Plateau of China shared a high genetic similarity coefficient with those from other regions and that there was no correlation between genetic diversity and geographic origin. The transcriptome sequencing data and millet-specific SSR markers developed in this study establish an excellent resource for gene discovery and may improve the development of breeding programs in proso millet in the future. PMID:28791202
78 FR 8217 - Social Security Ruling, SSR 13-1p; Titles II and XVI: Agency Processes for Addressing Allegations...

Science.gov (United States)

2013-02-05

... SOCIAL SECURITY ADMINISTRATION [Docket No. SSA-2012-0071] Social Security Ruling, SSR 13-1p; Titles II and XVI: Agency Processes for Addressing Allegations of Unfairness, Prejudice, Partiality, Bias... the third column, the fourth line under the ``Summary'' heading, change ``SSR-13-Xp'' to ``SSR-13-1p...
An annotated corpus with nanomedicine and pharmacokinetic parameters.

Science.gov (United States)

Lewinski, Nastassja A; Jimenez, Ivan; McInnes, Bridget T

2017-01-01

A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP) approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP) efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration's Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the evaluation of nanomedicine entity extraction. The data were manually annotated for 21 entity mentions consisting of nanomedicine physicochemical characterization, exposure, and biologic response information of 41 Food and Drug Administration-approved nanomedicines. We evaluate the reliability of the manual annotations and demonstrate the use of the corpus by evaluating two state-of-the-art named entity extraction systems, OpenNLP and Stanford NER. The annotated corpus is available open source and, based on these results, guidelines and suggestions for future development of additional nanomedicine corpora are provided.
Development of simple sequence repeat markers and diversity analysis in alfalfa (Medicago sativa L.).

Science.gov (United States)

Wang, Zan; Yan, Hongwei; Fu, Xinnian; Li, Xuehui; Gao, Hongwen

2013-04-01

Efficient and robust molecular markers are essential for molecular breeding in plant. Compared to dominant and bi-allelic markers, multiple alleles of simple sequence repeat (SSR) markers are particularly informative and superior in genetic linkage map and QTL mapping in autotetraploid species like alfalfa. The objective of this study was to enrich SSR markers directly from alfalfa expressed sequence tags (ESTs). A total of 12,371 alfalfa ESTs were retrieved from the National Center for Biotechnology Information. Total 774 SSR-containing ESTs were identified from 716 ESTs. On average, one SSR was found per 7.7 kb of EST sequences. Tri-nucleotide repeats (48.8 %) was the most abundant motif type, followed by di-(26.1 %), tetra-(11.5 %), penta-(9.7 %), and hexanucleotide (3.9 %). One hundred EST-SSR primer pairs were successfully designed and 29 exhibited polymorphism among 28 alfalfa accessions. The allele number per marker ranged from two to 21 with an average of 6.8. The PIC values ranged from 0.195 to 0.896 with an average of 0.608, indicating a high level of polymorphism of the EST-SSR markers. Based on the 29 EST-SSR markers, assessment of genetic diversity was conducted and found that Medicago sativa ssp. sativa was clearly different from the other subspecies. The high transferability of those EST-SSR markers was also found for relative species.
Design, validation and annotation of transcriptome-wide oligonucleotide probes for the oligochaete annelid Eisenia fetida.

Directory of Open Access Journals (Sweden)

Ping Gong

Full Text Available High density oligonucleotide probe arrays have increasingly become an important tool in genomics studies. In organisms with incomplete genome sequence, one strategy for oligo probe design is to reduce the number of unique probes that target every non-redundant transcript through bioinformatic analysis and experimental testing. Here we adopted this strategy in making oligo probes for the earthworm Eisenia fetida, a species for which we have sequenced transcriptome-scale expressed sequence tags (ESTs. Our objectives were to identify unique transcripts as targets, to select an optimal and non-redundant oligo probe for each of these target ESTs, and to annotate the selected target sequences. We developed a streamlined and easy-to-follow approach to the design, validation and annotation of species-specific array probes. Four 244K-formatted oligo arrays were designed using eArray and were hybridized to a pooled E. fetida cRNA sample. We identified 63,541 probes with unsaturated signal intensities consistently above the background level. Target transcripts of these probes were annotated using several sequence alignment algorithms. Significant hits were obtained for 37,439 (59% probed targets. We validated and made publicly available 63.5K oligo probes so the earthworm research community can use them to pursue ecological, toxicological, and other functional genomics questions. Our approach is efficient, cost-effective and robust because it (1 does not require a major genomics core facility; (2 allows new probes to be easily added and old probes modified or eliminated when new sequence information becomes available, (3 is not bioinformatics-intensive upfront but does provide opportunities for more in-depth annotation of biological functions for target genes; and (4 if desired, EST orthologs to the UniGene clusters of a reference genome can be identified and selected in order to improve the target gene specificity of designed probes. This approach is
Chromosomal location of genomic SSR markers associated

Indian Academy of Sciences (India)

Among the same earlier tested 230 primers, one SSR marker (Xgwm311) also amplified a fragment which is present in the resistant parent and in the resistant bulks, but absent in the susceptible parent and in the susceptible bulks. To understand the chromosome group location of these diagnostic markers, Xgwm382 and ...
Physical location of SSR regions and cytogenetic instabilities in ...

Indian Academy of Sciences (India)

2014-08-18

Aug 18, 2014 ... RESEARCH NOTE. Physical location of SSR regions and cytogenetic instabilities in Pinus ... first cytogenetic study in Scots pine using SSRs in FISH experiments. ... Science, Mannheim, Germany) according to manufacturer's.
Development of genomic SSR and potential EST-SSR markers in ...

African Journals Online (AJOL)

PRECIOUS

2009-11-16

Nov 16, 2009 ... Institute of Medicinal Plant Development (IMPLAD), Chinese Academy of Medical Sciences (CAMS) and Peking .... from three sources of B. fatcatum in Taiwan (Liu et al., ..... Ministry of Finance People's Republic of China (No.
Characterizing and annotating the genome using RNA-seq data.

Science.gov (United States)

Chen, Geng; Shi, Tieliu; Shi, Leming

2017-02-01

Bioinformatics methods for various RNA-seq data analyses are in fast evolution with the improvement of sequencing technologies. However, many challenges still exist in how to efficiently process the RNA-seq data to obtain accurate and comprehensive results. Here we reviewed the strategies for improving diverse transcriptomic studies and the annotation of genetic variants based on RNA-seq data. Mapping RNA-seq reads to the genome and transcriptome represent two distinct methods for quantifying the expression of genes/transcripts. Besides the known genes annotated in current databases, many novel genes/transcripts (especially those long noncoding RNAs) still can be identified on the reference genome using RNA-seq. Moreover, owing to the incompleteness of current reference genomes, some novel genes are missing from them. Genome- guided and de novo transcriptome reconstruction are two effective and complementary strategies for identifying those novel genes/transcripts on or beyond the reference genome. In addition, integrating the genes of distinct databases to conduct transcriptomics and genetics studies can improve the results of corresponding analyses.
Tibiofemoral subchondral surface ratio (SSR) is a predictor of osteoarthritis symptoms and radiographic progression: data from the Osteoarthritis Initiative (OAI).

Science.gov (United States)

Everhart, J S; Siston, R A; Flanigan, D C

2014-06-01

Symptomatic knee osteoarthritis (OA) is poorly correlated with radiographic severity, but subchondral bone measures may be useful for risk assessment as bone shape is grossly unaffected at early radiographic stages. We sought to determine whether compartment-specific size mismatch in the naturally asymmetric tibiofemoral joint, measured as tibiofemoral subchondral surface ratio (SSR): (1) predicts incident symptoms, (2) predicts incident or progressive OA, (3) is reproducible and time invariant. OA Initiative participants with baseline MRIs and up to 48-month follow-up (n = 1,338) were analyzed. Logistic regression was used to determine the association between SSR and incident symptoms, incident OA, and progression of OA after adjusting for demographic, radiologic, injury-related, and lifestyle-related factors. Reproducibility was assessed as % coefficient of variation (CV) on repeat MRI studies at baseline and 24 months. Increased medial SSR is protective against incident symptoms at 48 months (per 0.1 increase: OR 0.48 CI 0.30, 0.75; P = 0.001). Increased lateral SSR values are protective against lateral OA incidence (OR 0.23 CI 0.06, 0.77; P = 0.016) or progression (OR 0.66 CI 0.43, 0.99; P = 0.049) at 24 months. Both medial and lateral SSR are stable over time (medial: mean change 0.001 SD 0.016; lateral: mean change 0.000 SD 0.017) and are highly reproducible (3.0% CV medial SSR; 2.7% CV lateral SSR). A larger medial SSR is protective against developing OA-related symptoms. A larger lateral SSR is protective against lateral OA incidence or progression. Finally, lateral and medial SSR are stable over time and are highly reproducible across MRI studies. Copyright © 2014 Osteoarthritis Research Society International. Published by Elsevier Ltd. All rights reserved.
nGASP - the nematode genome annotation assessment project

Energy Technology Data Exchange (ETDEWEB)

Coghlan, A; Fiedler, T J; McKay, S J; Flicek, P; Harris, T W; Blasiar, D; Allen, J; Stein, L D

2008-12-19

While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets for 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase. The most accurate gene-finders were 'combiner' algorithms, which made use of transcript- and protein-alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second place. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene level sensitivity of combiners was 78% and their specificity was 42%, which is nearly the same accuracy as reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs were the most challenging for gene-finders. While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets for 10 Mb of the C
Development of Novel SSR Markers for Flax (Linum usitatissimum L.) Using Reduced-Representation Genome Sequencing.

Science.gov (United States)

Wu, Jianzhong; Zhao, Qian; Wu, Guangwen; Zhang, Shuquan; Jiang, Tingbo

2016-01-01

Flax ( Linum usitatissimum L.) is a major fiber and oil yielding crop grown in northeastern China. Identification of flax molecular markers is a key step toward improving flax yield and quality via marker-assisted breeding. Simple sequence repeat (SSR) markers, which are based on genomic structural variation, are considered the most valuable type of genetic marker for this purpose. In this study, we screened 1574 microsatellites from Linum usitatissimum L. obtained using reduced representation genome sequencing (RRGS) to systematically identify SSR markers. The resulting set of microsatellites consisted mainly of trinucleotide (56.10%) and dinucleotide (35.23%) repeats, with each motif consisting of 5-8 repeats. We then evaluated marker sensitivity and specificity based on samples of 48 flax isolates obtained from northeastern China. Using the new SSR panel, the results demonstrated that fiber flax and oilseed flax varieties clustered into two well separated groups. The novel SSR markers developed in this study show potential value for selection of varieties for use in flax breeding programs.
Ubiquitous Annotation Systems

DEFF Research Database (Denmark)

Hansen, Frank Allan

2006-01-01

Ubiquitous annotation systems allow users to annotate physical places, objects, and persons with digital information. Especially in the field of location based information systems much work has been done to implement adaptive and context-aware systems, but few efforts have focused on the general...... requirements for linking information to objects in both physical and digital space. This paper surveys annotation techniques from open hypermedia systems, Web based annotation systems, and mobile and augmented reality systems to illustrate different approaches to four central challenges ubiquitous annotation...... systems have to deal with: anchoring, structuring, presentation, and authoring. Through a number of examples each challenge is discussed and HyCon, a context-aware hypermedia framework developed at the University of Aarhus, Denmark, is used to illustrate an integrated approach to ubiquitous annotations...
Comparison of SSR and SNP markers in estimation of genetic diversity and population structure of Indian rice varieties.

Directory of Open Access Journals (Sweden)

Nivedita Singh

Full Text Available Simple sequence repeat (SSR and Single Nucleotide Polymorphic (SNP, the two most robust markers for identifying rice varieties were compared for assessment of genetic diversity and population structure. Total 375 varieties of rice from various regions of India archived at the Indian National GeneBank, NBPGR, New Delhi, were analyzed using thirty six genetic markers, each of hypervariable SSR (HvSSR and SNP which were distributed across 12 rice chromosomes. A total of 80 alleles were amplified with the SSR markers with an average of 2.22 alleles per locus whereas, 72 alleles were amplified with SNP markers. Polymorphic information content (PIC values for HvSSR ranged from 0.04 to 0.5 with an average of 0.25. In the case of SNP markers, PIC values ranged from 0.03 to 0.37 with an average of 0.23. Genetic relatedness among the varieties was studied; utilizing an unrooted tree all the genotypes were grouped into three major clusters with both SSR and SNP markers. Analysis of molecular variance (AMOVA indicated that maximum diversity was partitioned between and within individual level but not between populations. Principal coordinate analysis (PCoA with SSR markers showed that genotypes were uniformly distributed across the two axes with 13.33% of cumulative variation whereas, in case of SNP markers varieties were grouped into three broad groups across two axes with 45.20% of cumulative variation. Population structure were tested using K values from 1 to 20, but there was no clear population structure, therefore Ln(PD derived Δk was plotted against the K to determine the number of populations. In case of SSR maximum Δk was at K=5 whereas, in case of SNP maximum Δk was found at K=15, suggesting that resolution of population was higher with SNP markers, but SSR were more efficient for diversity analysis.
Soil-characterization and soil-amendment use on coal surface mine lands: An annotated bibliography. Information Circular/1991

International Nuclear Information System (INIS)

Norland, M.R.; Veith, D.L.

1991-01-01

The U.S. Bureau of Mines Report on United States and Canadian Literature pertaining to soil characterization and the use of soil amendments as a part of the reclamation process of coal surface-mined lands contains 1,280 references. The references were published during the 1977 to 1988 period. Each reference is evaluated by keywords, providing the reader with a means of rapidly sorting through the references to locate those articles with the coal mining regions and subjects of interest. All references are annotated

Ontological Annotation with WordNet

Energy Technology Data Exchange (ETDEWEB)

Sanfilippo, Antonio P.; Tratz, Stephen C.; Gregory, Michelle L.; Chappell, Alan R.; Whitney, Paul D.; Posse, Christian; Paulson, Patrick R.; Baddeley, Bob; Hohimer, Ryan E.; White, Amanda M.

2006-06-06

Semantic Web applications require robust and accurate annotation tools that are capable of automating the assignment of ontological classes to words in naturally occurring text (ontological annotation). Most current ontologies do not include rich lexical databases and are therefore not easily integrated with word sense disambiguation algorithms that are needed to automate ontological annotation. WordNet provides a potentially ideal solution to this problem as it offers a highly structured lexical conceptual representation that has been extensively used to develop word sense disambiguation algorithms. However, WordNet has not been designed as an ontology, and while it can be easily turned into one, the result of doing this would present users with serious practical limitations due to the great number of concepts (synonym sets) it contains. Moreover, mapping WordNet to an existing ontology may be difficult and requires substantial labor. We propose to overcome these limitations by developing an analytical platform that (1) provides a WordNet-based ontology offering a manageable and yet comprehensive set of concept classes, (2) leverages the lexical richness of WordNet to give an extensive characterization of concept class in terms of lexical instances, and (3) integrates a class recognition algorithm that automates the assignment of concept classes to words in naturally occurring text. The ensuing framework makes available an ontological annotation platform that can be effectively integrated with intelligence analysis systems to facilitate evidence marshaling and sustain the creation and validation of inference models.
Genetic diversity and apple leaf spot disease resistance characterization assessed by SSR markers

Directory of Open Access Journals (Sweden)

Gustavo H.F. Klabunde

2016-09-01

Full Text Available Among the cultivation problems of apple production in Brazil, Apple Leaf Spot (ALS disease represents one of the main breeding challenges. This study aims at analyzing the genetic diversity among 152 apple scion accessions available at the Apple Gene Bank of EPAGRI, located in Caçador, Santa Catarina/ Brazil. Eleven genomic SSR loci were analyzed to assess genetic diversity of ALS resistant and susceptible accessions. Results revealed high genetic diversity of the studied accessions, being 120 exclusive alleles (67 unique from scion accessions resistant to ALS, and a mean PIC of 0.823. The locus Probability of Identity (I ranged from 0.017 to 0.089. The combined I was 4.11 x 10-16, and the Power of Exclusion was 99.99999259%. In addition, the DNA fingerprint patterns will contribute as additional descriptors to select parental for crosses and early identification of apple accessions for breeding purposes, and also for cultivar protection.
Development and characterization of microsatellite markers for Morus spp. and assessment of their transferability to other closely related species

Science.gov (United States)

2013-01-01

Background Adoption of genomics based breeding has emerged as a promising approach for achieving comprehensive crop improvement. Such an approach is more relevant in the case of perennial species like mulberry. However, unavailability of genomic resources of co-dominant marker systems has been the major constraint for adopting molecular breeding to achieve genetic enhancement of Mulberry. The goal of this study was to develop and characterize a large number of locus specific genic and genomic SSR markers which can be effectively used for molecular characterization of mulberry species/genotypes. Result We analyzed a total of 3485 DNA sequences including genomic and expressed sequences (ESTs) of mulberry (Morus alba L.) genome. We identified 358 sequences to develop appropriate microsatellite primer pairs representing 222 genomic and 136 EST regions. Primers amplifying locus specific regions of Dudia white (a genotype of Morus alba L), were identified and 137 genomic and 51 genic SSR markers were standardized. A two pronged strategy was adopted to assess the applicability of these SSR markers using mulberry species and genotypes along with a few closely related species belonging to the family Moraceae viz., Ficus, Fig and Jackfruit. While 100% of these markers amplified specific loci on the mulberry genome, 79% were transferable to other related species indicating the robustness of these markers and the potential they hold in analyzing the molecular and genetic diversity among mulberry germplasm as well as other related species. The inherent ability of these markers in detecting heterozygosity combined with a high average polymorphic information content (PIC) of 0.559 ranging between 0.076 and 0.943 clearly demonstrates their potential as genomic resources in diversity analysis. The dissimilarity coefficient determined based on Neighbor joining method, revealed that the markers were successful in segregating the mulberry species, genotypes and other related species
Comparative evaluation of genetic diversity using RAPD, SSR and ...

Indian Academy of Sciences (India)

1Department of Molecular Biology and Genetic Engineering, 2Ranichauri Hill Campus, G. B. Pant University of Agriculture ... Comparison of RAPD, SSR and cytochrome P450 gene based markers, in terms of the ...... project programme.
Supplementary data: Development of SSR markers and construction ...

Indian Academy of Sciences (India)

Supplementary data: Development of SSR markers and construction of a linkage map in jute. Moumita Das, Sumana Banerjee, Raman Dhariwal, Shailendra Vyas, Reyazul R. Mir, Niladri Topdar, Avijit Kundu, Jitendra P. Khurana, Akhilesh K. Tyagi,. Debabrata Sarkar, Mohit K. Sinha, Harindra S. Balyan and Pushpendra K.
In silico characterization of microsatellites in Eucalyptus spp.: abundance, length variation and transposon associations

Directory of Open Access Journals (Sweden)

Edenilson Rabello

2005-01-01

Full Text Available This study assessed the abundance of microsatellites, or simple sequence repeats (SSR, in 19 Eucalyptus EST libraries from FORESTs, containing cDNA sequences from five species: E. grandis, E. globulus, E. saligna, E. urophylla and E. camaldulensis. Overall, a total of 11,534 SSRs and 8,447 SSR-containing sequences (25.5% of total ESTs were identified, with an average of 1 SSR/2.5 kb when considering all motifs and 1 SSR/3.1 kb when mononucleotides were not included. Dimeric repeats were the most abundant (41.03%, followed by trimerics (36.11% and monomerics (19.59%. The most frequent motifs were A/T (87.24% for monomerics, AG/CT (94.44% for dimerics, CCG/CGG (37.87% for trimerics, AAGG/CCTT (18.75% for tetramerics, AGAGG/CCTCT (14.04% for pentamerics and ACGGCG/CGCCGT (6.30% for hexamerics. According to sequence length, Class II or potentially variable markers were the most commonly found, followed by Class III. Two sequences presented high similarity to previously published Eucalyptus sequences from the NCBI database, EMBRA_72 and EMBRA_122. Local blastn search for transposons did not reveal the presence of any transposable elements with a cut-off value of 10-50. The large number of microsatellites identified will contribute to the refinement of marker-assisted mapping and to the discovery of novel markers for virtually all genes of economic interest.
Automatic Function Annotations for Hoare Logic

Directory of Open Access Journals (Sweden)

Daniel Matichuk

2012-11-01

Full Text Available In systems verification we are often concerned with multiple, inter-dependent properties that a program must satisfy. To prove that a program satisfies a given property, the correctness of intermediate states of the program must be characterized. However, this intermediate reasoning is not always phrased such that it can be easily re-used in the proofs of subsequent properties. We introduce a function annotation logic that extends Hoare logic in two important ways: (1 when proving that a function satisfies a Hoare triple, intermediate reasoning is automatically stored as function annotations, and (2 these function annotations can be exploited in future Hoare logic proofs. This reduces duplication of reasoning between the proofs of different properties, whilst serving as a drop-in replacement for traditional Hoare logic to avoid the costly process of proof refactoring. We explain how this was implemented in Isabelle/HOL and applied to an experimental branch of the seL4 microkernel to significantly reduce the size and complexity of existing proofs.
Development of Genome-Wide SSR Markers from Angelica gigas Nakai Using Next Generation Sequencing.

Science.gov (United States)

Gil, Jinsu; Um, Yurry; Kim, Serim; Kim, Ok Tae; Koo, Sung Cheol; Reddy, Chinreddy Subramanyam; Kim, Seong-Cheol; Hong, Chang Pyo; Park, Sin-Gi; Kim, Ho Bang; Lee, Dong Hoon; Jeong, Byung-Hoon; Chung, Jong-Wook; Lee, Yi

2017-09-21

Angelica gigas Nakai is an important medicinal herb, widely utilized in Asian countries especially in Korea, Japan, and China. Although it is a vital medicinal herb, the lack of sequencing data and efficient molecular markers has limited the application of a genetic approach for horticultural improvements. Simple sequence repeats (SSRs) are universally accepted molecular markers for population structure study. In this study, we found over 130,000 SSRs, ranging from di- to deca-nucleotide motifs, using the genome sequence of Manchu variety (MV) of A. gigas, derived from next generation sequencing (NGS). From the putative SSR regions identified, a total of 16,496 primer sets were successfully designed. Among them, we selected 848 SSR markers that showed polymorphism from in silico analysis and contained tri- to hexa-nucleotide motifs. We tested 36 SSR primer sets for polymorphism in 16 A. gigas accessions. The average polymorphism information content (PIC) was 0.69; the average observed heterozygosity ( H O ) values, and the expected heterozygosity ( H E ) values were 0.53 and 0.73, respectively. These newly developed SSR markers would be useful tools for molecular genetics, genotype identification, genetic mapping, molecular breeding, and studying species relationships of the Angelica genus.
Annotation sémantique à partir de textes : Cas des observations dans les Bulletins de Santé du végétal

OpenAIRE

Zargayouna , H.; Roussey , C.; Ouardani , S.

2017-01-01

9ème atelier Recherche d'Information SEmantique (RISE 2017) adossé à la conférence IC 2017 de la Plateforme Francophone d'Intelligence Artificielle, Caen, FRA, 04-/07/2017 - 04/07/2017; National audience; Dans cet article, nous décrivons un schéma d'annotation pour annoter les observations extraites des bulletins agricoles disponibles sur le web. Le but est de proposer un processus d'annotation automatique qui permet de générer des annotations complexes accessibles via un sparql endpoint. Nou...
PAVE: Program for assembling and viewing ESTs

Directory of Open Access Journals (Sweden)

Bomhoff Matthew

2009-08-01

Full Text Available Abstract Background New sequencing technologies are rapidly emerging. Many laboratories are simultaneously working with the traditional Sanger ESTs and experimenting with ESTs generated by the 454 Life Science sequencers. Though Sanger ESTs have been used to generate contigs for many years, no program takes full advantage of the 5' and 3' mate-pair information, hence, many tentative transcripts are assembled into two separate contigs. The new 454 technology has the benefit of high-throughput expression profiling, but introduces time and space problems for assembling large contigs. Results The PAVE (Program for Assembling and Viewing ESTs assembler takes advantage of the 5' and 3' mate-pair information by requiring that the mate-pairs be assembled into the same contig and joined by n's if the two sub-contigs do not overlap. It handles the depth of 454 data sets by "burying" similar ESTs during assembly, which retains the expression level information while circumventing time and space problems. PAVE uses MegaBLAST for the clustering step and CAP3 for assembly, however it assembles incrementally to enforce the mate-pair constraint, bury ESTs, and reduce incorrect joins and splits. The PAVE data management system uses a MySQL database to store multiple libraries of ESTs along with their metadata; the management system allows multiple assemblies with variations on libraries and parameters. Analysis routines provide standard annotation for the contigs including a measure of differentially expressed genes across the libraries. A Java viewer program is provided for display and analysis of the results. Our results clearly show the benefit of using the PAVE assembler to explicitly use mate-pair information and bury ESTs for large contigs. Conclusion The PAVE assembler provides a software package for assembling Sanger and/or 454 ESTs. The assembly software, data management software, Java viewer and user's guide are freely available.
PAVE: program for assembling and viewing ESTs.

Science.gov (United States)

Soderlund, Carol; Johnson, Eric; Bomhoff, Matthew; Descour, Anne

2009-08-26

New sequencing technologies are rapidly emerging. Many laboratories are simultaneously working with the traditional Sanger ESTs and experimenting with ESTs generated by the 454 Life Science sequencers. Though Sanger ESTs have been used to generate contigs for many years, no program takes full advantage of the 5' and 3' mate-pair information, hence, many tentative transcripts are assembled into two separate contigs. The new 454 technology has the benefit of high-throughput expression profiling, but introduces time and space problems for assembling large contigs. The PAVE (Program for Assembling and Viewing ESTs) assembler takes advantage of the 5' and 3' mate-pair information by requiring that the mate-pairs be assembled into the same contig and joined by n's if the two sub-contigs do not overlap. It handles the depth of 454 data sets by "burying" similar ESTs during assembly, which retains the expression level information while circumventing time and space problems. PAVE uses MegaBLAST for the clustering step and CAP3 for assembly, however it assembles incrementally to enforce the mate-pair constraint, bury ESTs, and reduce incorrect joins and splits. The PAVE data management system uses a MySQL database to store multiple libraries of ESTs along with their metadata; the management system allows multiple assemblies with variations on libraries and parameters. Analysis routines provide standard annotation for the contigs including a measure of differentially expressed genes across the libraries. A Java viewer program is provided for display and analysis of the results. Our results clearly show the benefit of using the PAVE assembler to explicitly use mate-pair information and bury ESTs for large contigs. The PAVE assembler provides a software package for assembling Sanger and/or 454 ESTs. The assembly software, data management software, Java viewer and user's guide are freely available.
Construction of an integrated genetic linkage map for the A genome of Brassica napus using SSR markers derived from sequenced BACs in B. rapa

Directory of Open Access Journals (Sweden)

King Graham J

2010-10-01

Full Text Available Abstract Background The Multinational Brassica rapa Genome Sequencing Project (BrGSP has developed valuable genomic resources, including BAC libraries, BAC-end sequences, genetic and physical maps, and seed BAC sequences for Brassica rapa. An integrated linkage map between the amphidiploid B. napus and diploid B. rapa will facilitate the rapid transfer of these valuable resources from B. rapa to B. napus (Oilseed rape, Canola. Results In this study, we identified over 23,000 simple sequence repeats (SSRs from 536 sequenced BACs. 890 SSR markers (designated as BrGMS were developed and used for the construction of an integrated linkage map for the A genome in B. rapa and B. napus. Two hundred and nineteen BrGMS markers were integrated to an existing B. napus linkage map (BnaNZDH. Among these mapped BrGMS markers, 168 were only distributed on the A genome linkage groups (LGs, 18 distrubuted both on the A and C genome LGs, and 33 only distributed on the C genome LGs. Most of the A genome LGs in B. napus were collinear with the homoeologous LGs in B. rapa, although minor inversions or rearrangements occurred on A2 and A9. The mapping of these BAC-specific SSR markers enabled assignment of 161 sequenced B. rapa BACs, as well as the associated BAC contigs to the A genome LGs of B. napus. Conclusion The genetic mapping of SSR markers derived from sequenced BACs in B. rapa enabled direct links to be established between the B. napus linkage map and a B. rapa physical map, and thus the assignment of B. rapa BACs and the associated BAC contigs to the B. napus linkage map. This integrated genetic linkage map will facilitate exploitation of the B. rapa annotated genomic resources for gene tagging and map-based cloning in B. napus, and for comparative analysis of the A genome within Brassica species.
SeqAnt: A web service to rapidly identify and annotate DNA sequence variations

Directory of Open Access Journals (Sweden)

Patel Viren

2010-09-01

Full Text Available Abstract Background The enormous throughput and low cost of second-generation sequencing platforms now allow research and clinical geneticists to routinely perform single experiments that identify tens of thousands to millions of variant sites. Existing methods to annotate variant sites using information from publicly available databases via web browsers are too slow to be useful for the large sequencing datasets being routinely generated by geneticists. Because sequence annotation of variant sites is required before functional characterization can proceed, the lack of a high-throughput pipeline to efficiently annotate variant sites can act as a significant bottleneck in genetics research. Results SeqAnt (Sequence Annotator is an open source web service and software package that rapidly annotates DNA sequence variants and identifies recessive or compound heterozygous loci in human, mouse, fly, and worm genome sequencing experiments. Variants are characterized with respect to their functional type, frequency, and evolutionary conservation. Annotated variants can be viewed on a web browser, downloaded in a tab-delimited text file, or directly uploaded in a BED format to the UCSC genome browser. To demonstrate the speed of SeqAnt, we annotated a series of publicly available datasets that ranged in size from 37 to 3,439,107 variant sites. The total time to completely annotate these data completely ranged from 0.17 seconds to 28 minutes 49.8 seconds. Conclusion SeqAnt is an open source web service and software package that overcomes a critical bottleneck facing research and clinical geneticists using second-generation sequencing platforms. SeqAnt will prove especially useful for those investigators who lack dedicated bioinformatics personnel or infrastructure in their laboratories.
GENETIC DIVERSITY OF S3 MAIZE GENOTYPES RESISTANT TO DOWNY MILDEW BASED ON SSR MARKERS

Directory of Open Access Journals (Sweden)

Amran Muis

2016-02-01

Full Text Available The compulsory requirement for releasing new high yielding maize varieties is resistance to downy mildew. The study aimed to determine the level of homozygosity, genetic diversity, and genetic distance of 30 S3 genotypes of maize. Number of primers to be used were 30 polymorphic SSR loci which are distributed over the entire maize genomes. The S3 genotypes used were resistant to downy mildew with homozygosity level of >80%, genetic distance between the test and tester strains >0.7, and anthesis silking interval (ASI between inbred lines and tester lines was maximum 3 days. The results showed that 30 SSR primers used were spread evenly across the maize genomes which were manifested in the representation of SSR loci on each chromosome of a total of 10 chromosomes. The levels of polymorphism ranged from 0.13 to 0.78, an average of 0.51, and the number of alleles ranged from 2 to 8 alleles per SSR locus, an average of 4 alleles per SSR locus. The size of nucleotides in each locus also varied from 70 to 553 bp. Cophenetic correlation value (r at 0.67 indicated that the Unweighted Pair-Group Method Using Arithmetic Averages (UPGMA was less reliable for differentiating genotypes in five groups. Of the total of 30 genotypes analyzed, 17 genotypes had homozygosity level of >80% so it can be included in the hybrid assembly program.
Diversity of garlic (Allium sativum L.) using SSR, EST and AFLP markers

Science.gov (United States)

Germplasm from the center of origin/diversity is important for the breeding and fingerprinting crop plants. In this study we utilized both dominant and co-dominant markers for the characterization of garlic samples from diverse geographic origins to assess the relative utility of these markers to id...
Genetic variation assessment of acid lime accessions collected from south of Iran using SSR and ISSR molecular markers.

Science.gov (United States)

Sharafi, Ata Allah; Abkenar, Asad Asadi; Sharafi, Ali; Masaeli, Mohammad

2016-01-01

Iran has a long history of acid lime cultivation and propagation. In this study, genetic variation in 28 acid lime accessions from five regions of south of Iran, and their relatedness with other 19 citrus cultivars were analyzed using Simple Sequence Repeat (SSR) and Inter-Simple Sequence Repeat (ISSR) molecular markers. Nine primers for SSR and nine ISSR primers were used for allele scoring. In total, 49 SSR and 131 ISSR polymorphic alleles were detected. Cluster analysis of SSR and ISSR data showed that most of the acid lime accessions (19 genotypes) have hybrid origin and genetically distance with nucellar of Mexican lime (9 genotypes). As nucellar of Mexican lime are susceptible to phytoplasma, these acid lime genotypes can be used to evaluate their tolerance against biotic constricts like lime "witches' broom disease".
Simple sequence repeat (SSR)-based genetic variability among ...

African Journals Online (AJOL)

The objective of this study was to compare if simple sequence repeat (SSR) markers could correctly identify peanut genotypes with difference in specific leaf weight (SLW) and relative water content (RWC). Four peanut genotypes and two water regimes (FC and 1/3 available water; 1/3 AW) were arranged in factorial ...
Insight into the genetic variability analysis and cultivar identification of tall fescue by using SSR markers.

Science.gov (United States)

Fu, Kaixin; Guo, Zhihui; Zhang, Xinquan; Fan, Yan; Wu, Wendan; Li, Daxu; Peng, Yan; Huang, Linkai; Sun, Ming; Bai, Shiqie; Ma, Xiao

2016-01-01

Genetic diversity of 19 forage-type and 2 turf-type cultivars of tall fescue ( Festuca arundinacea Schreb.) was revealed using SSR markers in an attempt to explore the genetic relationships among them, and examine potential use of SSR markers to identify cultivars by bulked samples. A total of 227 clear band was scored with 14 SSR primers and out of which 201 (88.6 %) were found polymorphic. The percentage of polymorphic bands (PPB) per primer pair varied from 62.5 to 100 % with an average of 86.9 %. The polymorphism information content (PIC) value ranged from 0.116 to 0.347 with an average of 0.257 and the highest PIC value (0.347) was noticed for primer NFA040 followed by NFA113 (0.346) whereas the highest discriminating power (D) of 1 was shown in NFA037 and LMgSSR02-01C. A Neighbor-joining dendrogram and the principal component analysis identified six major clusters and grouped the cultivars in agreement with their breeding histories. STRUCTURE analysis divided these cultivars into 3 sub-clades which correspond to distance based groupings. These findings indicates that SSR markers by bulking strategy are a useful tool to measure genetic diversity among tall fescue cultivars and could be used to supplement morphological data for plant variety protection.
Towards Viral Genome Annotation Standards, Report from the 2010 NCBI Annotation Workshop.

Science.gov (United States)

Brister, James Rodney; Bao, Yiming; Kuiken, Carla; Lefkowitz, Elliot J; Le Mercier, Philippe; Leplae, Raphael; Madupu, Ramana; Scheuermann, Richard H; Schobel, Seth; Seto, Donald; Shrivastava, Susmita; Sterk, Peter; Zeng, Qiandong; Klimke, William; Tatusova, Tatiana

2010-10-01

Improvements in DNA sequencing technologies portend a new era in virology and could possibly lead to a giant leap in our understanding of viral evolution and ecology. Yet, as viral genome sequences begin to fill the world's biological databases, it is critically important to recognize that the scientific promise of this era is dependent on consistent and comprehensive genome annotation. With this in mind, the NCBI Genome Annotation Workshop recently hosted a study group tasked with developing sequence, function, and metadata annotation standards for viral genomes. This report describes the issues involved in viral genome annotation and reviews policy recommendations presented at the NCBI Annotation Workshop.
Towards Viral Genome Annotation Standards, Report from the 2010 NCBI Annotation Workshop

Directory of Open Access Journals (Sweden)

Qiandong Zeng

2010-10-01

Full Text Available Improvements in DNA sequencing technologies portend a new era in virology and could possibly lead to a giant leap in our understanding of viral evolution and ecology. Yet, as viral genome sequences begin to fill the world’s biological databases, it is critically important to recognize that the scientific promise of this era is dependent on consistent and comprehensive genome annotation. With this in mind, the NCBI Genome Annotation Workshop recently hosted a study group tasked with developing sequence, function, and metadata annotation standards for viral genomes. This report describes the issues involved in viral genome annotation and reviews policy recommendations presented at the NCBI Annotation Workshop.

Molecular diversity and phylogeny of Triticum-Aegilops species possessing D genome revealed by SSR and ISSR markers

Directory of Open Access Journals (Sweden)

Moradkhani Hoda

2015-12-01

Full Text Available The aim of this study is investigation the applicability of SSR and ISSR markers in evaluating the genetic relationships in twenty accessions of Aegilops and Triticum species with D genome in different ploidy levels. Totally, 119 bands and 46 alleles were detected using ten primers for ISSR and SSR markers, respectively. Polymorphism Information Content values for all primers ranged from 0.345 to 0.375 with an average of 0.367 for SSR, and varied from 0.29 to 0.44 with the average 0.37 for ISSR marker. Analysis of molecular variance (AMOVA revealed that 81% (ISSR and 84% (SSR of variability was partitioned among individuals within populations. Comparing the genetic diversity of Aegilops and Triticum accessions, based on genetic parameters, shows that genetic variation of Ae. crassa and Ae. tauschii species are higher than other species, especially in terms of Nei’s gene diversity. Cluster analysis, based on both markers, separated total accessions in three groups. However, classification based on SSR marker data was not conformed to classification according to ISSR marker data. Principal co-ordinate analysis (PCoA for SSR and ISSR data showed that, the first two components clarified 53.48% and 49.91% of the total variation, respectively. This analysis (PCoA, also, indicated consistent patterns of genetic relationships for ISSR data sets, however, the grouping of accessions was not completely accorded to their own geographical origins. Consequently, a high level of genetic diversity was revealed from the accessions sampled from different eco-geographical regions of Iran.
SSR

African Journals Online (AJOL)

RAJU

2013-07-24

Jul 24, 2013 ... bioinformatics tools for development of genic molecular markers is, therefore, crucial in this phase. This mini-review mainly ... cing, EST generation and analysis has resulted in the .... SNP discovery from EST sequences include clustering, sequence ...... Integrated with Primer Design and PCR Simulation.
Automating Ontological Annotation with WordNet

Energy Technology Data Exchange (ETDEWEB)

Sanfilippo, Antonio P.; Tratz, Stephen C.; Gregory, Michelle L.; Chappell, Alan R.; Whitney, Paul D.; Posse, Christian; Paulson, Patrick R.; Baddeley, Bob L.; Hohimer, Ryan E.; White, Amanda M.

2006-01-22

Semantic Web applications require robust and accurate annotation tools that are capable of automating the assignment of ontological classes to words in naturally occurring text (ontological annotation). Most current ontologies do not include rich lexical databases and are therefore not easily integrated with word sense disambiguation algorithms that are needed to automate ontological annotation. WordNet provides a potentially ideal solution to this problem as it offers a highly structured lexical conceptual representation that has been extensively used to develop word sense disambiguation algorithms. However, WordNet has not been designed as an ontology, and while it can be easily turned into one, the result of doing this would present users with serious practical limitations due to the great number of concepts (synonym sets) it contains. Moreover, mapping WordNet to an existing ontology may be difficult and requires substantial labor. We propose to overcome these limitations by developing an analytical platform that (1) provides a WordNet-based ontology offering a manageable and yet comprehensive set of concept classes, (2) leverages the lexical richness of WordNet to give an extensive characterization of concept class in terms of lexical instances, and (3) integrates a class recognition algorithm that automates the assignment of concept classes to words in naturally occurring text. The ensuing framework makes available an ontological annotation platform that can be effectively integrated with intelligence analysis systems to facilitate evidence marshaling and sustain the creation and validation of inference models.
Molecular genetic diversity assessment of Citrus species grown in Iran revealed by SSR, ISSR and CAPS molecular markers

Directory of Open Access Journals (Sweden)

Ata Allah Sharafi

2017-12-01

Full Text Available In this study, genetic diversity in 19 citrus cultivars was analyzed using Simple Sequence Repeat (SSR, Inter-simple Sequence Repeat (ISSR and cleaved amplified polymorphic sequence (CAPS markers. Nine primers for SSR, nine ISSR primers and two primers for CAPS were used for allele scoring. One chloroplast DNA region (rbcL-ORF106 and one mitochondrial DNA region (18S-5S were analyzed using cleaved amplified polymorphic sequence (CAPS marker in 19 citrus accessions grown in Iran. In total, 45 SSR and 131 ISSR polymorphic alleles and tree organelle genome types were detected. Cluster analysis of SSR and ISSR data was performed using UPGMA method and based on Jaccard's coefficient. The result of this investigation showed that the SSR and ISSR primers were highly informative and efficient in detecting genetic variability and relationships of the citrus accessions. And CAPS marker analysis Results showed that Bakraee and one of off type Mexican lime had banding pattern similar to Clementine Mandarin, while Pummelo regarded as maternal parent of other studied genotypes Citron regarded as father parent showed definite banding pattern among 19 studied genotypes which it confirmed Cytoplasmic inheritance from mother cellular organelles.
Semi-Semantic Annotation: A guideline for the URDU.KON-TB treebank POS annotation

Directory of Open Access Journals (Sweden)

Qaiser ABBAS

2016-12-01

Full Text Available This work elaborates the semi-semantic part of speech annotation guidelines for the URDU.KON-TB treebank: an annotated corpus. A hierarchical annotation scheme was designed to label the part of speech and then applied on the corpus. This raw corpus was collected from the Urdu Wikipedia and the Jang newspaper and then annotated with the proposed semi-semantic part of speech labels. The corpus contains text of local & international news, social stories, sports, culture, finance, religion, traveling, etc. This exercise finally contributed a part of speech annotation to the URDU.KON-TB treebank. Twenty-two main part of speech categories are divided into subcategories, which conclude the morphological, and semantical information encoded in it. This article reports the annotation guidelines in major; however, it also briefs the development of the URDU.KON-TB treebank, which includes the raw corpus collection, designing & employment of annotation scheme and finally, its statistical evaluation and results. The guidelines presented as follows, will be useful for linguistic community to annotate the sentences not only for the national language Urdu but for the other indigenous languages like Punjab, Sindhi, Pashto, etc., as well.
Molecular Assortment of Lens Species with Different Adaptations to Drought Conditions Using SSR Markers

Science.gov (United States)

Singh, Dharmendra; Singh, Chandan Kumar; Tomar, Ram Sewak Singh; Taunk, Jyoti; Singh, Ranjeet; Maurya, Sadhana; Chaturvedi, Ashish Kumar; Pal, Madan; Singh, Rajendra; Dubey, Sarawan Kumar

2016-01-01

The success of drought tolerance breeding programs can be enhanced through molecular assortment of germplasm. This study was designed to characterize molecular diversity within and between Lens species with different adaptations to drought stress conditions using SSR markers. Drought stress was applied at seedling stage to study the effects on morpho-physiological traits under controlled condition, where tolerant cultivars and wilds showed 12.8–27.6% and 9.5–23.2% reduction in seed yield per plant respectively. When juxtaposed to field conditions, the tolerant cultivars (PDL-1 and PDL-2) and wild (ILWL-314 and ILWL-436) accessions showed 10.5–26.5% and 7.5%–15.6% reduction in seed yield per plant, respectively under rain-fed conditions. The reductions in seed yield in the two tolerant cultivars and wilds under severe drought condition were 48–49% and 30.5–45.3% respectively. A set of 258 alleles were identified among 278 genotypes using 35 SSR markers. Genetic diversity and polymorphism information contents varied between 0.321–0.854 and 0.299–0.836, with mean value of 0.682 and 0.643, respectively. All the genotypes were clustered into 11 groups based on SSR markers. Tolerant genotypes were grouped in cluster 6 while sensitive ones were mainly grouped into cluster 7. Wild accessions were separated from cultivars on the basis of both population structure and cluster analysis. Cluster analysis has further grouped the wild accessions on the basis of species and sub-species into 5 clusters. Physiological and morphological characters under drought stress were significantly (P = 0.05) different among microsatellite clusters. These findings suggest that drought adaptation is variable among wild and cultivated genotypes. Also, genotypes from contrasting clusters can be selected for hybridization which could help in evolution of better segregants for improving drought tolerance in lentil. PMID:26808306
Molecular breeding in Brassica for salt tolerance: importance of microsatellite (SSR) markers for molecular breeding in Brassica.

Science.gov (United States)

Kumar, Manu; Choi, Ju-Young; Kumari, Nisha; Pareek, Ashwani; Kim, Seong-Ryong

2015-01-01

Salinity is one of the important abiotic factors for any crop management in irrigated as well as rainfed areas, which leads to poor harvests. This yield reduction in salt affected soils can be overcome by improving salt tolerance in crops or by soil reclamation. Salty soils can be reclaimed by leaching the salt or by cultivation of salt tolerance crops. Salt tolerance is a quantitative trait controlled by several genes. Poor knowledge about mechanism of its inheritance makes slow progress in its introgression into target crops. Brassica is known to be a good reclamation crop. Inter and intra specific variation within Brassica species shows potential of molecular breeding to raise salinity tolerant genotypes. Among the various molecular markers, SSR markers are getting high attention, since they are randomly sparsed, highly variable and show co-dominant inheritance. Furthermore, as sequencing techniques are improving and softwares to find SSR markers are being developed, SSR markers technology is also evolving rapidly. Comparative SSR marker studies targeting Arabidopsis thaliana and Brassica species which lie in the same family will further aid in studying the salt tolerance related QTLs and subsequent identification of the "candidate genes" and finding out the origin of important QTLs. Although, there are a few reports on molecular breeding for improving salt tolerance using molecular markers in Brassica species, usage of SSR markers has a big potential to improve salt tolerance in Brassica crops. In order to obtain best harvests, role of SSR marker driven breeding approaches play important role and it has been discussed in this review especially for the introgression of salt tolerance traits in crops.
Molecular breeding in Brassica for salt tolerance: importance of microsatellite (SSR) markers for molecular breeding in Brassica

Science.gov (United States)

Kumar, Manu; Choi, Ju-Young; Kumari, Nisha; Pareek, Ashwani; Kim, Seong-Ryong

2015-01-01

Salinity is one of the important abiotic factors for any crop management in irrigated as well as rainfed areas, which leads to poor harvests. This yield reduction in salt affected soils can be overcome by improving salt tolerance in crops or by soil reclamation. Salty soils can be reclaimed by leaching the salt or by cultivation of salt tolerance crops. Salt tolerance is a quantitative trait controlled by several genes. Poor knowledge about mechanism of its inheritance makes slow progress in its introgression into target crops. Brassica is known to be a good reclamation crop. Inter and intra specific variation within Brassica species shows potential of molecular breeding to raise salinity tolerant genotypes. Among the various molecular markers, SSR markers are getting high attention, since they are randomly sparsed, highly variable and show co-dominant inheritance. Furthermore, as sequencing techniques are improving and softwares to find SSR markers are being developed, SSR markers technology is also evolving rapidly. Comparative SSR marker studies targeting Arabidopsis thaliana and Brassica species which lie in the same family will further aid in studying the salt tolerance related QTLs and subsequent identification of the “candidate genes” and finding out the origin of important QTLs. Although, there are a few reports on molecular breeding for improving salt tolerance using molecular markers in Brassica species, usage of SSR markers has a big potential to improve salt tolerance in Brassica crops. In order to obtain best harvests, role of SSR marker driven breeding approaches play important role and it has been discussed in this review especially for the introgression of salt tolerance traits in crops. PMID:26388887
BioAnnote: a software platform for annotating biomedical documents with application in medical learning environments.

Science.gov (United States)

López-Fernández, H; Reboiro-Jato, M; Glez-Peña, D; Aparicio, F; Gachet, D; Buenaga, M; Fdez-Riverola, F

2013-07-01

Automatic term annotation from biomedical documents and external information linking are becoming a necessary prerequisite in modern computer-aided medical learning systems. In this context, this paper presents BioAnnote, a flexible and extensible open-source platform for automatically annotating biomedical resources. Apart from other valuable features, the software platform includes (i) a rich client enabling users to annotate multiple documents in a user friendly environment, (ii) an extensible and embeddable annotation meta-server allowing for the annotation of documents with local or remote vocabularies and (iii) a simple client/server protocol which facilitates the use of our meta-server from any other third-party application. In addition, BioAnnote implements a powerful scripting engine able to perform advanced batch annotations. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Chado controller: advanced annotation management with a community annotation system.

Science.gov (United States)

Guignon, Valentin; Droc, Gaëtan; Alaux, Michael; Baurens, Franc-Christophe; Garsmeur, Olivier; Poiron, Claire; Carver, Tim; Rouard, Mathieu; Bocs, Stéphanie

2012-04-01

We developed a controller that is compliant with the Chado database schema, GBrowse and genome annotation-editing tools such as Artemis and Apollo. It enables the management of public and private data, monitors manual annotation (with controlled vocabularies, structural and functional annotation controls) and stores versions of annotation for all modified features. The Chado controller uses PostgreSQL and Perl. The Chado Controller package is available for download at http://www.gnpannot.org/content/chado-controller and runs on any Unix-like operating system, and documentation is available at http://www.gnpannot.org/content/chado-controller-doc The system can be tested using the GNPAnnot Sandbox at http://www.gnpannot.org/content/gnpannot-sandbox-form valentin.guignon@cirad.fr; stephanie.sidibe-bocs@cirad.fr Supplementary data are available at Bioinformatics online.
An annotated corpus with nanomedicine and pharmacokinetic parameters

Directory of Open Access Journals (Sweden)

Lewinski NA

2017-10-01

Full Text Available Nastassja A Lewinski,1 Ivan Jimenez,1 Bridget T McInnes2 1Department of Chemical and Life Science Engineering, Virginia Commonwealth University, Richmond, VA, 2Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA Abstract: A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration’s Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the evaluation of nanomedicine entity extraction. The data were manually annotated for 21 entity mentions consisting of nanomedicine physicochemical characterization, exposure, and biologic response information of 41 Food and Drug Administration-approved nanomedicines. We evaluate the reliability of the manual annotations and demonstrate the use of the corpus by evaluating two state-of-the-art named entity extraction systems, OpenNLP and Stanford NER. The annotated corpus is available open source and, based on these results, guidelines and suggestions for future development of additional nanomedicine corpora are provided. Keywords: nanotechnology, informatics, natural language processing, text mining, corpora
Analysis of esterase isozyme and SSR for mutagenic progenies induced by space mutation in mustard

International Nuclear Information System (INIS)

Shen Jinjuan; Liu Yihua; Zhang Zhaorong; Ran Guangkui; Zhao Shouzhong; Xiao Li

2012-01-01

Seeds of five mustard (Brassica juncea Coss) varieties were carried into outer space by 'Shijian No.8' satellite. After five years' consecutive planting and selection, ten relatively stable mutant lines were obtained, which had significant variation in agronomic and economic characters. The mutant lines and their original varieties without space mutation treatment as control were studied by esterase isozyme and SSR analyses. Electrophoresis analysis of esterase isozymes indicated that there were differences between mutant lines and their controls in enzyme types and enzyme activity. Different mustard varieties had different enzymographs, and so did the mutants induced by space mutation, which shows different sensitivity among different mustard varieties. The SSR analysis showed that large differences were found in the SSR loci between mutant lines and their original variety, the variation frequency was between 9.52% and 57.14% with an average frequency of 26.19% for all the mutant lines. Among the mutant SSR loci, about 56.36% showed changes in band number and 43.64% in molecular weight. These results indicated that the ten mutant lines had large genetic difference in phenotype, genomic sequence and gene expression, and the outer space mutation would be an effective method to develop new mustard germplasm and variety. (authors)
Large-scale development of SSR markers in tobacco and construction of a linkage map in flue-cured tobacco.

Science.gov (United States)

Tong, Zhijun; Xiao, Bingguang; Jiao, Fangchan; Fang, Dunhuang; Zeng, Jianmin; Wu, Xingfu; Chen, Xuejun; Yang, Jiankang; Li, Yongping

2016-06-01

Tobacco (Nicotiana tabacum L.), particularly flue-cured tobacco, is one of the most economically important nonfood crops and is also an important model system in plant biotechnology. Despite its importance, only limited molecular marker resources are available for genome analysis, genetic mapping, and breeding. Simple sequence repeats (SSR) are one of the most widely-used molecular markers, having significant advantages including that they are generally co-dominant, easy to use, abundant in eukaryotic organisms, and produce highly reproducible results. In this study, based on the genome sequence data of flue-cured tobacco (K326), we developed a total of 13,645 mostly novel SSR markers, which were working in a set of eighteen tobacco varieties of four different types. A mapping population of 213 backcross (BC1) individuals, which were derived from an intra-type cross between two flue-cured tobacco varieties, Y3 and K326, was selected for mapping. Based on the newly developed SSR markers as well as published SSR markers, we constructed a genetic map consisting of 626 SSR loci distributed across 24 linkage groups and covering a total length of 1120.45 cM with an average distance of 1.79 cM between adjacent markers, which is the highest density map of flue-cured tobacco till date.
Genome-wide analysis of SSR and ILP markers in trees: diversity profiling, alternate distribution, and applications in duplication.

Science.gov (United States)

Xia, Xinyao; Luan, Lin Lin; Qin, Guanghua; Yu, Li Fang; Wang, Zhi Wei; Dong, Wan Chen; Song, Yumin; Qiao, Yuling; Zhang, Xian Sheng; Sang, Ya Lin; Yang, Long

2017-12-20

Molecular markers are efficient tools for breeding and genetic studies. However, despite their ecological and economic importance, their development and application have long been hampered. In this study, we identified 524,170 simple sequence repeat (SSR), 267,636 intron length polymorphism (ILP), and 11,872 potential intron polymorphism (PIP) markers from 16 tree species based on recently available genome sequences. Larger motifs, including hexamers and heptamers, accounted for most of the seven different types of SSR loci. Within these loci, A/T bases comprised a significantly larger proportion of sequence than G/C. SSR and ILP markers exhibited an alternative distribution pattern. Most SSRs were monomorphic markers, and the proportions of polymorphic markers were positively correlated with genome size. By verifying with all 16 tree species, 54 SSR, 418 ILP, and four PIP universal markers were obtained, and their efficiency was examined by PCR. A combination of five SSR and six ILP markers were used for the phylogenetic analysis of 30 willow samples, revealing a positive correlation between genetic diversity and geographic distance. We also found that SSRs can be used as tools for duplication analysis. Our findings provide important foundations for the development of breeding and genetic studies in tree species.
Development of expressed sequence tag-simple sequence repeat markers for genetic characterization and population structure analysis of Praxelis clematidea (Asteraceae).

Science.gov (United States)

Wang, Q Z; Huang, M; Downie, S R; Chen, Z X

2016-05-23

Invasive plants tend to spread aggressively in new habitats and an understanding of their genetic diversity and population structure is useful for their management. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed for the invasive plant species Praxelis clematidea (Asteraceae) from 5548 Stevia rebaudiana (Asteraceae) expressed sequence tags (ESTs). A total of 133 microsatellite-containing ESTs (2.4%) were identified, of which 56 (42.1%) were hexanucleotide repeat motifs and 50 (37.6%) were trinucleotide repeat motifs. Of the 24 primer pairs designed from these 133 ESTs, 7 (29.2%) resulted in significant polymorphisms. The number of alleles per locus ranged from 5 to 9. The relatively high genetic diversity (H = 0.2667, I = 0.4212, and P = 100%) of P. clematidea was related to high gene flow (Nm = 1.4996) among populations. The coefficient of population differentiation (GST = 0.2500) indicated that most genetic variation occurred within populations. A Mantel test suggested that there was significant correlation between genetic distance and geographical distribution (r = 0.3192, P = 0.012). These results further support the transferability of EST-SSR markers between closely related genera of the same family.
Development of a simple sequence repeat (SSR) marker set to ...

African Journals Online (AJOL)

GREGORY

2010-08-23

Aug 23, 2010 ... varieties. Tuber seeds of most of these varieties are not produced and distributed in an organized way ... races from Canary Islands using 19 SSR markers. The ... The aim of the current study was to determine a set of.
Linkage Map of a Gene Controlling Zero Tannins (zt-1 in Faba Bean (Vicia faba L. with SSR and ISSR Markers

Directory of Open Access Journals (Sweden)

Wanwei Hou

2018-05-01

Full Text Available Faba bean (Vicia faba L., a partially allogamous species, is rich in protein. Condensed tannins limit the use of faba beans as food and feed. Two recessive genes, zt-1 and zt-2, control the zero tannin content in faba bean and promote a white flower phenotype. To determine the inheritance and develop a linkage map for the zt-1 gene in the faba bean germplasm M3290, F2 and F3 progenies were derived from the purple flower and high tannin content genotypes Qinghai12 and zt-1 line M3290, respectively. Genetic analysis verified a single recessive gene for zero tannin content and flower colour. In total, 596 SSR markers and 100 ISSR markers were used to test the polymorphisms between the parents and bulks for the contrasting flower colour via Bulked Segregant Analysis (BSA. Subsequently, six SSR markers and seven ISSR markers were used to genotype the entire 413 F2 population. Linkage analysis showed that the zt-1 gene was closely linked to the SSR markers SSR84 and M78, with genetic distances of 2.9 and 5.8 cM, respectively. The two flanked SSR markers were used to test 34 faba bean genotypes with different flower colours. The closely linked SSR marker SSR84 predicted the zt-1 genotypes with absolute accuracy. The results from the marker-assisted selection (MAS from this study could provide a solid foundation for further faba bean breeding programmes.
The Non-Peptide Vasopressin V1b Receptor Antagonist, SSR149415, Ameliorates Spermatogenesis Function in a Mouse Model of Chronic Social Defeat Stress.

Science.gov (United States)

Wang, Bin; Zhou, Jian; Zhuang, Yan-Yan; Wang, Liang-Liang; Pu, Jin-Xian; Huang, Yu-Hua; Xia, Fei; Lv, Jin-Xing

2017-11-01

To determine the effects of SSR149415 on testis and spermatogenesis in male mice subjected to chronic social defeat stress, C57BL/6 male mice were divided into two groups: Control and Stress. Then Stress group was subdivided into four subgroups administered water, SSR149415 (1 mg/kg/day), SSR149415 (10 mg/kg/day), SSR149415 (30 mg/kg/day), respectively. The behavioral alterations revealed by social interaction test and open field test were measured. The physical indices, including body weight and gonad weight (testis and epididymis) as well as testis/body weight and cauda epididymis/body weight were detected. Serum hormones, including testosterone, luteinizing hormone (LH), and follicle-stimulating hormone (FSH) were determined. Sperm count and abnormality as well as testicular histology structure were assessed. The germ cells apoptosis were also evaluated. Chronic social defeat stress-induced behavioral abnormality, as well as gonad atrophy (testis and epididymis) was significantly alleviated in stressed male mice exposed to SSR149415. Regressed serum testosterone levels and elevated serum FSH and LH levels exhibited by stressed male mice were observably reversed following SSR149415 administration. Chronic social defeat stress-induced damage in testicular histology structure and semen quality were also improved after SSR149415 administration. In addition, SSR149415 significantly reversed chronic social defeat stress-induced germ cells apoptosis. Overall, we provide clear evidence indicating the amelioration of chronic social defeat stress-induced behavioral abnormality and testicular dysfunction via SSR149415, promoting the development of drug-directed therapy against this disease. J. Cell. Biochem. 118: 3891-3898, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Confamiliar transferability of simple sequence repeat (SSR) markers from cotton (Gossypium hirsutum L.) and jute (Corchorus olitorius L.) to twenty two Malvaceous species.

Science.gov (United States)

Satya, Pratik; Paswan, Pramod Kumar; Ghosh, Swagata; Majumdar, Snehalata; Ali, Nasim

2016-06-01

Cross-species transferability is a quick and economic method to enrich SSR database, particularly for minor crops where little genomic information is available. However, transferability of SSR markers varies greatly between species, genera and families of plant species. We assessed confamiliar transferability of SSR markers from cotton (Gossypium hirsutum) and jute (Corchorus olitorius) to 22 species distributed in different taxonomic groups of Malvaceae. All the species selected were potential industrial crop species having little or no genomic resources or SSR database. Of the 14 cotton SSR loci tested, 13 (92.86 %) amplified in G. arboreum and 71.43 % exhibited cross-genera transferability. Nine out of 11 jute SSRs (81.81 %) showed cross-transferability across genera. SSRs from both the species exhibited high polymorphism and resolving power in other species. The correlation between transferability of cotton and jute SSRs were highly significant (r = 0.813). The difference in transferability among species was also significant for both the marker groups. High transferability was observed at genus, tribe and subfamily level. At tribe level, transferability of jute SSRs (41.04 %) was higher than that of cotton SSRs (33.74 %). The tribe Byttnerieae exhibited highest SSR transferability (48.7 %). The high level of cross-genera transferability (>50 %) in ten species of Malvaceae, where no SSR resource is available, calls for large scale transferability testing from the enriched SSR databases of cotton and jute.
FragKB: structural and literature annotation resource of conserved peptide fragments and residues.

Directory of Open Access Journals (Sweden)

Ashish V Tendulkar

Full Text Available BACKGROUND: FragKB (Fragment Knowledgebase is a repository of clusters of structurally similar fragments from proteins. Fragments are annotated with information at the level of sequence, structure and function, integrating biological descriptions derived from multiple existing resources and text mining. METHODOLOGY: FragKB contains approximately 400,000 conserved fragments from 4,800 representative proteins from PDB. Literature annotations are extracted from more than 1,700 articles and are available for over 12,000 fragments. The underlying systematic annotation workflow of FragKB ensures efficient update and maintenance of this database. The information in FragKB can be accessed through a web interface that facilitates sequence and structural visualization of fragments together with known literature information on the consequences of specific residue mutations and functional annotations of proteins and fragment clusters. FragKB is accessible online at http://ubio.bioinfo.cnio.es/biotools/fragkb/. SIGNIFICANCE: The information presented in FragKB can be used for modeling protein structures, for designing novel proteins and for functional characterization of related fragments. The current release is focused on functional characterization of proteins through inspection of conservation of the fragments.

Generation of a predicted protein database from EST data and application to iTRAQ analyses in grape (Vitis vinifera cv. Cabernet Sauvignon) berries at ripening initiation

Science.gov (United States)

Lücker, Joost; Laszczak, Mario; Smith, Derek; Lund, Steven T

2009-01-01

Background iTRAQ is a proteomics technique that uses isobaric tags for relative and absolute quantitation of tryptic peptides. In proteomics experiments, the detection and high confidence annotation of proteins and the significance of corresponding expression differences can depend on the quality and the species specificity of the tryptic peptide map database used for analysis of the data. For species for which finished genome sequence data are not available, identification of proteins relies on similarity to proteins from other species using comprehensive peptide map databases such as the MSDB. Results We were interested in characterizing ripening initiation ('veraison') in grape berries at the protein level in order to better define the molecular control of this important process for grape growers and wine makers. We developed a bioinformatic pipeline for processing EST data in order to produce a predicted tryptic peptide database specifically targeted to the wine grape cultivar, Vitis vinifera cv. Cabernet Sauvignon, and lacking truncated N- and C-terminal fragments. By searching iTRAQ MS/MS data generated from berry exocarp and mesocarp samples at ripening initiation, we determined that implementation of the custom database afforded a large improvement in high confidence peptide annotation in comparison to the MSDB. We used iTRAQ MS/MS in conjunction with custom peptide db searches to quantitatively characterize several important pathway components for berry ripening previously described at the transcriptional level and confirmed expression patterns for these at the protein level. Conclusion We determined that a predicted peptide database for MS/MS applications can be derived from EST data using advanced clustering and trimming approaches and successfully implemented for quantitative proteome profiling. Quantitative shotgun proteome profiling holds great promise for characterizing biological processes such as fruit ripening initiation and may be further
Development of genic-SSR markers by deep transcriptome sequencing in pigeonpea [Cajanus cajan (L.) Millspaugh

Science.gov (United States)

2011-01-01

Background Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. Results In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. Conclusion We developed 550 validated genic-SSR
SSR: What's in "School Science Review" for "PSR" Readers?

Science.gov (United States)

Lakin, Liz

2004-01-01

This article summarises ideas and developments in teaching and learning in science of relevance to "Primary Science Review" ("PSR") readers from three recent issues (309, 310, and 311) of "School Science Review" ("SSR"), the ASE journal for science education 11-19. The themes running through these are: ICT, the implications for science education…
Optimization of the Geometric Beta for the SSR2 Cavities of the Project X

International Nuclear Information System (INIS)

Solyak, N.; Vostrikov, A.; Yakovlev, V.P.; Awida, M.H.; Berrutti, P.; Gonin, I.V.

2012-01-01

Project X based on the 3 GeV CW superconducting Linac and is currently in the R and D phase. The CW SC Linac starts from a low-energy SCRF section (2.1 - 165 MeV) containing three different types of resonators. HWR f = 162.5 MHz (2.1 - 11 MeV) having beta= 0.11, SSR1 f = 325 MHz (11 - 35 MeV) having beta = 0.21. In this paper we present the analysis that lead to the final design of SSR2 f = 325 MHz cavity (35 - 165 MeV). We present the results of optimization of the geometric beta and the comparison between single, double and triple spoke resonators used in Project X frontend. A β optimization has been carried out for the last spoke cavity section of Project X front end. The optimization process of β opt for a single spoke resonator family SSR2 shown that β opt = 0.47 looks better than the previous choice, which is β opt = 0.4. This change can save some cavities and provide the same final energy for this section, 160 MeV. Single double and triple spoke resonator performances have been compared. The best option is the single spoke resonator SSR2 because the NTTF of a multi-spoke resonator is much narrower than a single one. In the energy range considered (40-160 MeV) the most efficient resonator is the single spoke one.
De novo transcriptomic analysis of cowpea (Vigna unguiculata L. Walp.) for genic SSR marker development.

Science.gov (United States)

Chen, Honglin; Wang, Lixia; Liu, Xiaoyan; Hu, Liangliang; Wang, Suhua; Cheng, Xuzhen

2017-07-11

Cowpea [Vigna unguiculata (L.) Walp.] is one of the most important legumes in tropical and semi-arid regions. However, there is relatively little genomic information available for genetic research on and breeding of cowpea. The objectives of this study were to analyse the cowpea transcriptome and develop genic molecular markers for future genetic studies of this genus. Approximately 54 million high-quality cDNA sequence reads were obtained from cowpea based on Illumina paired-end sequencing technology and were de novo assembled to generate 47,899 unigenes with an N50 length of 1534 bp. Sequence similarity analysis revealed 36,289 unigenes (75.8%) with significant similarity to known proteins in the non-redundant (Nr) protein database, 23,471 unigenes (49.0%) with BLAST hits in the Swiss-Prot database, and 20,654 unigenes (43.1%) with high similarity in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Further analysis identified 5560 simple sequence repeats (SSRs) as potential genic molecular markers. Validating a random set of 500 SSR markers yielded 54 polymorphic markers among 32 cowpea accessions. This transcriptomic analysis of cowpea provided a valuable set of genomic data for characterizing genes with important agronomic traits in Vigna unguiculata and a new set of genic SSR markers for further genetic studies and breeding in cowpea and related Vigna species.
A SNP and SSR based genetic map of asparagus bean (Vigna. unguiculata ssp. sesquipedialis and comparison with the broader species.

Directory of Open Access Journals (Sweden)

Pei Xu

Full Text Available Asparagus bean (Vigna. unguiculata ssp. sesquipedialis is a distinctive subspecies of cowpea [Vigna. unguiculata (L. Walp.] that apparently originated in East Asia and is characterized by extremely long and thin pods and an aggressive climbing growth habit. The crop is widely cultivated throughout Asia for the production of immature pods known as 'long beans' or 'asparagus beans'. While the genome of cowpea ssp. unguiculata has been characterized recently by high-density genetic mapping and partial sequencing, little is known about the genome of asparagus bean. We report here the first genetic map of asparagus bean based on SNP and SSR markers. The current map consists of 375 loci mapped onto 11 linkage groups (LGs, with 191 loci detected by SNP markers and 184 loci by SSR markers. The overall map length is 745 cM, with an average marker distance of 1.98 cM. There are four high marker-density blocks distributed on three LGs and three regions of segregation distortion (SDRs identified on two other LGs, two of which co-locate in chromosomal regions syntenic to SDRs in soybean. Synteny between asparagus bean and the model legume Lotus. japonica was also established. This work provides the basis for mapping and functional analysis of genes/QTLs of particular interest in asparagus bean, as well as for comparative genomics study of cowpea at the subspecies level.
A SNP and SSR Based Genetic Map of Asparagus Bean (Vigna. unguiculata ssp. sesquipedialis) and Comparison with the Broader Species

Science.gov (United States)

Xu, Pei; Wu, Xiaohua; Wang, Baogen; Liu, Yonghua; Ehlers, Jeffery D.; Close, Timothy J.; Roberts, Philip A.; Diop, Ndeye-Ndack; Qin, Dehui; Hu, Tingting; Lu, Zhongfu; Li, Guojing

2011-01-01

Asparagus bean (Vigna. unguiculata ssp. sesquipedialis) is a distinctive subspecies of cowpea [Vigna. unguiculata (L.) Walp.] that apparently originated in East Asia and is characterized by extremely long and thin pods and an aggressive climbing growth habit. The crop is widely cultivated throughout Asia for the production of immature pods known as ‘long beans’ or ‘asparagus beans’. While the genome of cowpea ssp. unguiculata has been characterized recently by high-density genetic mapping and partial sequencing, little is known about the genome of asparagus bean. We report here the first genetic map of asparagus bean based on SNP and SSR markers. The current map consists of 375 loci mapped onto 11 linkage groups (LGs), with 191 loci detected by SNP markers and 184 loci by SSR markers. The overall map length is 745 cM, with an average marker distance of 1.98 cM. There are four high marker-density blocks distributed on three LGs and three regions of segregation distortion (SDRs) identified on two other LGs, two of which co-locate in chromosomal regions syntenic to SDRs in soybean. Synteny between asparagus bean and the model legume Lotus. japonica was also established. This work provides the basis for mapping and functional analysis of genes/QTLs of particular interest in asparagus bean, as well as for comparative genomics study of cowpea at the subspecies level. PMID:21253606
Global comparative analysis of ESTs from the southern cattle tick, Rhipicephalus (Boophilus microplus

Directory of Open Access Journals (Sweden)

Pertea Geo

2007-10-01

Full Text Available Abstract Background The southern cattle tick, Rhipicephalus (Boophilus microplus, is an economically important parasite of cattle and can transmit several pathogenic microorganisms to its cattle host during the feeding process. Understanding the biology and genomics of R. microplus is critical to developing novel methods for controlling these ticks. Results We present a global comparative genomic analysis of a gene index of R. microplus comprised of 13,643 unique transcripts assembled from 42,512 expressed sequence tags (ESTs, a significant fraction of the complement of R. microplus genes. The source material for these ESTs consisted of polyA RNA from various tissues, lifestages, and strains of R. microplus, including larvae exposed to heat, cold, host odor, and acaricide. Functional annotation using RPS-Blast analysis identified conserved protein domains in the conceptually translated gene index and assigned GO terms to those database transcripts which had informative BlastX hits. Blast Score Ratio and SimiTri analysis compared the conceptual transcriptome of the R. microplus database to other eukaryotic proteomes and EST databases, including those from 3 ticks. The most abundant protein domains in BmiGI were also analyzed by SimiTri methodology. Conclusion These results indicate that a large fraction of BmiGI entries have no homologs in other sequenced genomes. Analysis with the PartiGene annotation pipeline showed 64% of the members of BmiGI could not be assigned GO annotation, thus minimal information is available about a significant fraction of the tick genome. This highlights the important insights in tick biology which are likely to result from a tick genome sequencing project. Global comparative analysis identified some tick genes with unexpected phylogenetic relationships which detailed analysis attributed to gene losses in some members of the animal kingdom. Some tick genes were identified which had close orthologues to mammalian genes
SSR marker based DNA fingerprinting and diversity study in rice ...

African Journals Online (AJOL)

The genetic diversity and DNA fingerprinting of 15 elite rice genotypes using 30 SSR primers on chromosome numbers 7-12 was investigated. The results revealed that all the primers showed distinct polymorphism among the cultivars studied indicating the robust nature of microsatellites in revealing polymorphism. Cluster ...
MixtureTree annotator: a program for automatic colorization and visual annotation of MixtureTree.

Directory of Open Access Journals (Sweden)

Shu-Chuan Chen

Full Text Available The MixtureTree Annotator, written in JAVA, allows the user to automatically color any phylogenetic tree in Newick format generated from any phylogeny reconstruction program and output the Nexus file. By providing the ability to automatically color the tree by sequence name, the MixtureTree Annotator provides a unique advantage over any other programs which perform a similar function. In addition, the MixtureTree Annotator is the only package that can efficiently annotate the output produced by MixtureTree with mutation information and coalescent time information. In order to visualize the resulting output file, a modified version of FigTree is used. Certain popular methods, which lack good built-in visualization tools, for example, MEGA, Mesquite, PHY-FI, TreeView, treeGraph and Geneious, may give results with human errors due to either manually adding colors to each node or with other limitations, for example only using color based on a number, such as branch length, or by taxonomy. In addition to allowing the user to automatically color any given Newick tree by sequence name, the MixtureTree Annotator is the only method that allows the user to automatically annotate the resulting tree created by the MixtureTree program. The MixtureTree Annotator is fast and easy-to-use, while still allowing the user full control over the coloring and annotating process.
A Fast Silver Staining Protocol Enabling Simple and Efficient Detection of SSR Markers using a Non-denaturing Polyacrylamide Gel.

Science.gov (United States)

Huang, Ling; Deng, Xiaohui; Li, Ronghua; Xia, Yanshi; Bai, Guihua; Siddique, Kadambot H M; Guo, Peiguo

2018-04-20

Simple Sequence Repeat (SSR) is one of the most effective markers used in plant and animal genetic research and molecular breeding programs. Silver staining is a widely used method for the detection of SSR markers in a polyacrylamide gel. However, conventional protocols for silver staining are technically demanding and time-consuming. Like many other biological laboratory techniques, silver staining protocols have been steadily optimized to improve detection efficiency. Here, we report a simplified silver staining method that significantly reduces reagent costs and enhances the detection resolution and picture clarity. The new method requires two major steps (impregnation and development) and three reagents (silver nitrate, sodium hydroxide, and formaldehyde), and only 7 min of processing for a non-denaturing polyacrylamide gel. Compared to previously reported protocols, this new method is easier, quicker and uses fewer chemical reagents for SSR detection. Therefore, this simple, low-cost, and effective silver staining protocol will benefit genetic mapping and marker-assisted breeding by a quick generation of SSR marker data.
Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

Science.gov (United States)

Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

2004-02-01

To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.
Cranberry SSR multiplexing panels for DNA horticultural fingerprinting and genetic studies

Science.gov (United States)

Cranberry (Vaccinium macrocarpon) is in need of inexpensive high-throughput DNA fingerprinting methods for genetic research and germplasm purity testing for agricultural purposes. Therefore, we designed and validated 16-multiplexing panels containing 61 evenly distributed simple sequence (SSR) marke...
Evaluating Hierarchical Structure in Music Annotations.

Science.gov (United States)

McFee, Brian; Nieto, Oriol; Farbood, Morwaread M; Bello, Juan Pablo

2017-01-01

Music exhibits structure at multiple scales, ranging from motifs to large-scale functional components. When inferring the structure of a piece, different listeners may attend to different temporal scales, which can result in disagreements when they describe the same piece. In the field of music informatics research (MIR), it is common to use corpora annotated with structural boundaries at different levels. By quantifying disagreements between multiple annotators, previous research has yielded several insights relevant to the study of music cognition. First, annotators tend to agree when structural boundaries are ambiguous. Second, this ambiguity seems to depend on musical features, time scale, and genre. Furthermore, it is possible to tune current annotation evaluation metrics to better align with these perceptual differences. However, previous work has not directly analyzed the effects of hierarchical structure because the existing methods for comparing structural annotations are designed for "flat" descriptions, and do not readily generalize to hierarchical annotations. In this paper, we extend and generalize previous work on the evaluation of hierarchical descriptions of musical structure. We derive an evaluation metric which can compare hierarchical annotations holistically across multiple levels. sing this metric, we investigate inter-annotator agreement on the multilevel annotations of two different music corpora, investigate the influence of acoustic properties on hierarchical annotations, and evaluate existing hierarchical segmentation algorithms against the distribution of inter-annotator agreement.
Evaluating Hierarchical Structure in Music Annotations

Directory of Open Access Journals (Sweden)

Brian McFee

2017-08-01

Full Text Available Music exhibits structure at multiple scales, ranging from motifs to large-scale functional components. When inferring the structure of a piece, different listeners may attend to different temporal scales, which can result in disagreements when they describe the same piece. In the field of music informatics research (MIR, it is common to use corpora annotated with structural boundaries at different levels. By quantifying disagreements between multiple annotators, previous research has yielded several insights relevant to the study of music cognition. First, annotators tend to agree when structural boundaries are ambiguous. Second, this ambiguity seems to depend on musical features, time scale, and genre. Furthermore, it is possible to tune current annotation evaluation metrics to better align with these perceptual differences. However, previous work has not directly analyzed the effects of hierarchical structure because the existing methods for comparing structural annotations are designed for “flat” descriptions, and do not readily generalize to hierarchical annotations. In this paper, we extend and generalize previous work on the evaluation of hierarchical descriptions of musical structure. We derive an evaluation metric which can compare hierarchical annotations holistically across multiple levels. sing this metric, we investigate inter-annotator agreement on the multilevel annotations of two different music corpora, investigate the influence of acoustic properties on hierarchical annotations, and evaluate existing hierarchical segmentation algorithms against the distribution of inter-annotator agreement.
An automated annotation tool for genomic DNA sequences using

Indian Academy of Sciences (India)

Genomic sequence data are often available well before the annotated sequence is published. We present a method for analysis of genomic DNA to identify coding sequences using the GeneScan algorithm and characterize these resultant sequences by BLAST. The routines are used to develop a system for automated ...
A set of tetra-nucleotide core motif SSR markers for efficient identification of potato (Solanum tuberosum) cultivars.

Science.gov (United States)

Kishine, Masahiro; Tsutsumi, Katsuji; Kitta, Kazumi

2017-12-01

Simple sequence repeat (SSR) is a popular tool for individual fingerprinting. The long-core motif (e.g. tetra-, penta-, and hexa-nucleotide) simple sequence repeats (SSRs) are preferred because they make it easier to separate and distinguish neighbor alleles. In the present study, a new set of 8 tetra-nucleotide SSRs in potato ( Solanum tuberosum ) is reported. By using these 8 markers, 72 out of 76 cultivars obtained from Japan and the United States were clearly discriminated, while two pairs, both of which arose from natural variation, showed identical profiles. The combined probability of identity between two random cultivars for the set of 8 SSR markers was estimated to be 1.10 × 10 -8 , confirming the usefulness of the proposed SSR markers for fingerprinting analyses of potato.
Pipeline to upgrade the genome annotations

Directory of Open Access Journals (Sweden)

Lijin K. Gopi

2017-12-01

Full Text Available Current era of functional genomics is enriched with good quality draft genomes and annotations for many thousands of species and varieties with the support of the advancements in the next generation sequencing technologies (NGS. Around 25,250 genomes, of the organisms from various kingdoms, are submitted in the NCBI genome resource till date. Each of these genomes was annotated using various tools and knowledge-bases that were available during the period of the annotation. It is obvious that these annotations will be improved if the same genome is annotated using improved tools and knowledge-bases. Here we present a new genome annotation pipeline, strengthened with various tools and knowledge-bases that are capable of producing better quality annotations from the consensus of the predictions from different tools. This resource also perform various additional annotations, apart from the usual gene predictions and functional annotations, which involve SSRs, novel repeats, paralogs, proteins with transmembrane helices, signal peptides etc. This new annotation resource is trained to evaluate and integrate all the predictions together to resolve the overlaps and ambiguities of the boundaries. One of the important highlights of this resource is the capability of predicting the phylogenetic relations of the repeats using the evolutionary trace analysis and orthologous gene clusters. We also present a case study, of the pipeline, in which we upgrade the genome annotation of Nelumbo nucifera (sacred lotus. It is demonstrated that this resource is capable of producing an improved annotation for a better understanding of the biology of various organisms.
Community annotation and bioinformatics workforce development in concert--Little Skate Genome Annotation Workshops and Jamborees.

Science.gov (United States)

Wang, Qinghua; Arighi, Cecilia N; King, Benjamin L; Polson, Shawn W; Vincent, James; Chen, Chuming; Huang, Hongzhan; Kingham, Brewster F; Page, Shallee T; Rendino, Marc Farnum; Thomas, William Kelley; Udwary, Daniel W; Wu, Cathy H

2012-01-01

Recent advances in high-throughput DNA sequencing technologies have equipped biologists with a powerful new set of tools for advancing research goals. The resulting flood of sequence data has made it critically important to train the next generation of scientists to handle the inherent bioinformatic challenges. The North East Bioinformatics Collaborative (NEBC) is undertaking the genome sequencing and annotation of the little skate (Leucoraja erinacea) to promote advancement of bioinformatics infrastructure in our region, with an emphasis on practical education to create a critical mass of informatically savvy life scientists. In support of the Little Skate Genome Project, the NEBC members have developed several annotation workshops and jamborees to provide training in genome sequencing, annotation and analysis. Acting as a nexus for both curation activities and dissemination of project data, a project web portal, SkateBase (http://skatebase.org) has been developed. As a case study to illustrate effective coupling of community annotation with workforce development, we report the results of the Mitochondrial Genome Annotation Jamborees organized to annotate the first completely assembled element of the Little Skate Genome Project, as a culminating experience for participants from our three prior annotation workshops. We are applying the physical/virtual infrastructure and lessons learned from these activities to enhance and streamline the genome annotation workflow, as we look toward our continuing efforts for larger-scale functional and structural community annotation of the L. erinacea genome.
Community annotation and bioinformatics workforce development in concert—Little Skate Genome Annotation Workshops and Jamborees

Science.gov (United States)

Wang, Qinghua; Arighi, Cecilia N.; King, Benjamin L.; Polson, Shawn W.; Vincent, James; Chen, Chuming; Huang, Hongzhan; Kingham, Brewster F.; Page, Shallee T.; Farnum Rendino, Marc; Thomas, William Kelley; Udwary, Daniel W.; Wu, Cathy H.

2012-01-01

Recent advances in high-throughput DNA sequencing technologies have equipped biologists with a powerful new set of tools for advancing research goals. The resulting flood of sequence data has made it critically important to train the next generation of scientists to handle the inherent bioinformatic challenges. The North East Bioinformatics Collaborative (NEBC) is undertaking the genome sequencing and annotation of the little skate (Leucoraja erinacea) to promote advancement of bioinformatics infrastructure in our region, with an emphasis on practical education to create a critical mass of informatically savvy life scientists. In support of the Little Skate Genome Project, the NEBC members have developed several annotation workshops and jamborees to provide training in genome sequencing, annotation and analysis. Acting as a nexus for both curation activities and dissemination of project data, a project web portal, SkateBase (http://skatebase.org) has been developed. As a case study to illustrate effective coupling of community annotation with workforce development, we report the results of the Mitochondrial Genome Annotation Jamborees organized to annotate the first completely assembled element of the Little Skate Genome Project, as a culminating experience for participants from our three prior annotation workshops. We are applying the physical/virtual infrastructure and lessons learned from these activities to enhance and streamline the genome annotation workflow, as we look toward our continuing efforts for larger-scale functional and structural community annotation of the L. erinacea genome. PMID:22434832

Development and Characterization of Microsatellite Markers for the Cape Gooseberry Physalis peruviana

OpenAIRE

Simbaqueba, Jaime; S?nchez, Pilar; Sanchez, Erika; N??ez Zarantes, Victor Manuel; Chacon, Maria Isabel; Barrero, Luz Stella; Mari?o-Ram?rez, Leonardo

2011-01-01

Physalis peruviana, commonly known as Cape gooseberry, is an Andean Solanaceae fruit with high nutritional value and interesting medicinal properties. In the present study we report the development and characterization of microsatellite loci from a P. peruviana commercial Colombian genotype. We identified 932 imperfect and 201 perfect Simple Sequence Repeats (SSR) loci in untranslated regions (UTRs) and 304 imperfect and 83 perfect SSR loci in coding regions from the assembled Physalis peruvi...
Genetic variation of european maize genotypes (zea mays l. Detected using ssr markers

Directory of Open Access Journals (Sweden)

Martin Vivodík

2017-01-01

Full Text Available The SSR molecular markers were used to assess genetic diversity in 40 old European maize genotypes. Ten SSR primers revealed a total of 65 alleles ranging from 4 (UMC1060 to 8 (UMC2002 and UMC1155 alleles per locus with a mean value of 6.50 alleles per locus. The PIC values ranged from 0.713 (UMC1060 to 0.842 (UMC2002 with an average value of 0.810 and the DI value ranged from 0.734 (UMC1060 to 0.848 (UMC2002 with an average value of 0.819. 100% of used SSR markers had PIC and DI values higher than 0.7 that means high polymorphism of chosen markers used for analysis. Probability of identity (PI was low ranged from 0.004 (UMC1072 to 0.022 (UMC1060 with an average of 0.008. A dendrogram was constructed from a genetic distance matrix based on profiles of the 10 maize SSR loci using the unweighted pair-group method with the arithmetic average (UPGMA. According to analysis, the collection of 40 diverse accessions of maize was clustered into four clusters. The first cluster contained nine genotypes of maize, while the second cluster contained the four genotypes of maize. The third cluster contained 5 maize genotypes. Cluster 4 contained five genotypes from Hungary (22.73%, two genotypes from Poland (9.10%, seven genotypes of maize from Union of Soviet Socialist Republics (31.81%, six genotypes from Czechoslovakia (27.27%, one genotype from Slovak Republic (4.55% and one genotype of maize is from Yugoslavia (4.55%. We could not distinguish 4 maize genotypes grouped in cluster 4, (Voroneskaja and Kocovska Skora and 2 Hungarian maize genotypes - Feheres Sarga Filleres and Mindszentpusztai Feher, which are genetically the closest.
Characterization of EST-based SSR loci in the spruce budworm, Choristoneura fumiferana (Lepidoptera: Tortricidae)

Science.gov (United States)

B.M.T. Brunet; D. Doucet; B.R. Sturtevant; F.A.H. Sperling

2013-01-01

After identifying 114 microsatellite loci from Choristoneura fumiferana expressed sequence tags, 87 loci were assayed in a panel of 11 wild-caught individuals, giving 29 polymorphic loci. Further analysis of 20 of these loci on 31 individuals collected from a single population in northern Minnesota identified 14 in Hardy-Weinberg equilibrium.
Reasoning with Annotations of Texts

OpenAIRE

Ma , Yue; Lévy , François; Ghimire , Sudeep

2011-01-01

International audience; Linguistic and semantic annotations are important features for text-based applications. However, achieving and maintaining a good quality of a set of annotations is known to be a complex task. Many ad hoc approaches have been developed to produce various types of annotations, while comparing those annotations to improve their quality is still rare. In this paper, we propose a framework in which both linguistic and domain information can cooperate to reason with annotat...
Large-scale inference of gene function through phylogenetic annotation of Gene Ontology terms: case study of the apoptosis and autophagy cellular processes.

Science.gov (United States)

Feuermann, Marc; Gaudet, Pascale; Mi, Huaiyu; Lewis, Suzanna E; Thomas, Paul D

2016-01-01

We previously reported a paradigm for large-scale phylogenomic analysis of gene families that takes advantage of the large corpus of experimentally supported Gene Ontology (GO) annotations. This 'GO Phylogenetic Annotation' approach integrates GO annotations from evolutionarily related genes across ∼100 different organisms in the context of a gene family tree, in which curators build an explicit model of the evolution of gene functions. GO Phylogenetic Annotation models the gain and loss of functions in a gene family tree, which is used to infer the functions of uncharacterized (or incompletely characterized) gene products, even for human proteins that are relatively well studied. Here, we report our results from applying this paradigm to two well-characterized cellular processes, apoptosis and autophagy. This revealed several important observations with respect to GO annotations and how they can be used for function inference. Notably, we applied only a small fraction of the experimentally supported GO annotations to infer function in other family members. The majority of other annotations describe indirect effects, phenotypes or results from high throughput experiments. In addition, we show here how feedback from phylogenetic annotation leads to significant improvements in the PANTHER trees, the GO annotations and GO itself. Thus GO phylogenetic annotation both increases the quantity and improves the accuracy of the GO annotations provided to the research community. We expect these phylogenetically based annotations to be of broad use in gene enrichment analysis as well as other applications of GO annotations.Database URL: http://amigo.geneontology.org/amigo. © The Author(s) 2016. Published by Oxford University Press.
Fingerprinting and genetic purity assessment of F1 barley hybrids and their salt-tolerant parental lines using nSSR molecular markers.

Science.gov (United States)

Ben Romdhane, Mériam; Riahi, Leila; Jardak, Rahma; Ghorbel, Abdelwahed; Zoghlami, Nejia

2018-01-01

Hybridity and the genuineness of hybrids are prominent characteristics for quality control of seeds and thereby for varietal improvement. In the current study, the cross between two local barley genotypes (Ardhaoui: female; Testour: male) previously identified as susceptible/tolerant to salt stress in Tunisia was achieved. The hybrid genetic purity of the generated F 1 putative hybrids and the fingerprinting of the parents along with their offspring were assessed using a set of 17 nuclear SSR markers. Among the analyzed loci, 11 nSSR were shown polymorphic among the parents and their offspring. Based on the applied 11 polymorphic SSR loci, a total of 28 alleles were detected with an average of 2.54 alleles per locus. The locus HVM33 presented the highest number of alleles. The highest polymorphism information content value was detected for the locus HVM33 (0.6713) whereas the lowest PIC value (0.368) was revealed by the loci BMAC0156 , EBMAC0970 and BMAG0013 with a mean value of 0.4619. The probabilities of identical genotypes PI for the 11 microsatellite markers were 8.63 × 10 -7 . Banding patterns among parents and hybrids showed polymorphic fragments. The 11 SSR loci had produced unique fingerprints for each analyzed genotype and segregate between the two parental lines and their four hybrids. Parentage analysis confirms the hybrid purity of the four analyzed genotypes. Six Tunisian barley accessions were used as an outgroup in the multivariate analysis to confirm the efficiency of the employed 11 nSSR markers in genetic differentiation among various barley germplasms. Thus, neighbor joining and factorial analysis revealed clearly the discrimination among the parental lines, the four hybrids and the outgroup accessions. Out of the detected polymorphic 11 nuclear SSR markers, a set of five markers ( HVM33 , WMC1E8 , BMAC0154 , BMAC0040 and BMAG0007 ) were shown to be sufficient and informative enough to discriminate among the six genotypes representing the two
Identification of SSR and RAPD markers associated with QTLs of ...

African Journals Online (AJOL)

SERVER

2008-04-03

Apr 3, 2008 ... Parents and. F3 families had significant differences in studied traits (p ≤. 0.01). In this study, SSR and RAPD markers were used together for constructing linkage groups and rescanning the genome of rapeseed to identify QTLs controlling winter survival and related traits. For this, the parental polymorphism ...
Annotated ESTs from various tissues of the brown planthopper Nilaparvata lugens: a genomic resource for studying agricultural pests.

Science.gov (United States)

Noda, Hiroaki; Kawai, Sawako; Koizumi, Yoko; Matsui, Kageaki; Zhang, Qiang; Furukawa, Shigetoyo; Shimomura, Michihiko; Mita, Kazuei

2008-03-03

The brown planthopper (BPH), Nilaparvata lugens (Hemiptera, Delphacidae), is a serious insect pests of rice plants. Major means of BPH control are application of agricultural chemicals and cultivation of BPH resistant rice varieties. Nevertheless, BPH strains that are resistant to agricultural chemicals have developed, and BPH strains have appeared that are virulent against the resistant rice varieties. Expressed sequence tag (EST) analysis and related applications are useful to elucidate the mechanisms of resistance and virulence and to reveal physiological aspects of this non-model insect, with its poorly understood genetic background. More than 37,000 high-quality ESTs, excluding sequences of mitochondrial genome, microbial genomes, and rDNA, have been produced from 18 libraries of various BPH tissues and stages. About 10,200 clusters have been made from whole EST sequences, with average EST size of 627 bp. Among the top ten most abundantly expressed genes, three are unique and show no homology in BLAST searches. The actin gene was highly expressed in BPH, especially in the thorax. Tissue-specifically expressed genes were extracted based on the expression frequency among the libraries. An EST database is available at our web site. The EST library will provide useful information for transcriptional analyses, proteomic analyses, and gene functional analyses of BPH. Moreover, specific genes for hemimetabolous insects will be identified. The microarray fabricated based on the EST information will be useful for finding genes related to agricultural and biological problems related to this pest.
Alleviation SSR and Low Frequency Power Oscillations in Series Compensated Transmission Line using SVC Supplementary Controllers

Science.gov (United States)

Kumar, Sanjiv; Kumar, Narendra

2017-06-01

In this work, supplementary sub-synchronous damping controllers (SSDC) are proposed for damping sub-synchronous oscillations in power systems with series compensated transmission lines. Series compensation have extensively been used as effective means of increasing the power transfer capability of a transmission lines and improving transient stability limits of power systems. Series compensation with transmission lines may cause sub-synchronous resonance (SSR). The eigenvalue investigation tool is used to ascertain the existence of SSR. It is shown that the addition of supplementary controller is able to stabilize all unstable modes for T-network model. Eigenvalue investigation and time domain transient simulation of detailed nonlinear system are considered to investigate the performance of the controllers. The efficacies of the suggested supplementary controllers are compared on the IEEE first benchmark model for computer simulations of SSR by means of time domain simulation in Matlab/Simulink environment. Supplementary SSDC are considered in order to compare effectiveness of SSDC during higher loading in alleviating the small signal stability problem.
Discovery and annotation of small proteins using genomics, proteomics and computational approaches

Energy Technology Data Exchange (ETDEWEB)

Yang, Xiaohan; Tschaplinski, Timothy J.; Hurst, Gregory B.; Jawdy, Sara; Abraham, Paul E.; Lankford, Patricia K.; Adams, Rachel M.; Shah, Manesh B.; Hettich, Robert L.; Lindquist, Erika; Kalluri, Udaya C.; Gunter, Lee E.; Pennacchio, Christa; Tuskan, Gerald A.

2011-03-02

Small proteins (10 200 amino acids aa in length) encoded by short open reading frames (sORF) play important regulatory roles in various biological processes, including tumor progression, stress response, flowering, and hormone signaling. However, ab initio discovery of small proteins has been relatively overlooked. Recent advances in deep transcriptome sequencing make it possible to efficiently identify sORFs at the genome level. In this study, we obtained 2.6 million expressed sequence tag (EST) reads from Populus deltoides leaf transcriptome and reconstructed full-length transcripts from the EST sequences. We identified an initial set of 12,852 sORFs encoding proteins of 10 200 aa in length. Three computational approaches were then used to enrich for bona fide protein-coding sORFs from the initial sORF set: (1) codingpotential prediction, (2) evolutionary conservation between P. deltoides and other plant species, and (3) gene family clustering within P. deltoides. As a result, a high-confidence sORF candidate set containing 1469 genes was obtained. Analysis of the protein domains, non-protein-coding RNA motifs, sequence length distribution, and protein mass spectrometry data supported this high-confidence sORF set. In the high-confidence sORF candidate set, known protein domains were identified in 1282 genes (higher-confidence sORF candidate set), out of which 611 genes, designated as highest-confidence candidate sORF set, were supported by proteomics data. Of the 611 highest-confidence candidate sORF genes, 56 were new to the current Populus genome annotation. This study not only demonstrates that there are potential sORF candidates to be annotated in sequenced genomes, but also presents an efficient strategy for discovery of sORFs in species with no genome annotation yet available.
Characterization of 10 new nuclear microsatellite markers in Acca sellowiana (Myrtaceae).

Science.gov (United States)

Klabunde, Gustavo H F; Olkoski, Denise; Vilperte, Vinicius; Zucchi, Maria I; Nodari, Rubens O

2014-06-01

Microsatellite primers were identified and characterized in Acca sellowiana in order to expand the limited number of pre-existing polymorphic markers for use in population genetic studies for conservation, phylogeography, breeding, and domestication. • A total of 10 polymorphic microsatellite primers were designed from clones obtained from a simple sequence repeat (SSR)-enriched genomic library. The primers amplified di- and trinucleotide repeats with four to 27 alleles per locus. In all tested populations, the observed heterozygosity ranged from 0.269 to 1.0. • These new polymorphic SSR markers will allow future genetic studies to be denser, either for genetic structure characterization of natural populations or for studies involving genetic breeding and domestication process in A. sellowiana.
Semantic annotation of consumer health questions.

Science.gov (United States)

Kilicoglu, Halil; Ben Abacha, Asma; Mrabet, Yassine; Shooshan, Sonya E; Rodriguez, Laritza; Masterton, Kate; Demner-Fushman, Dina

2018-02-06

Consumers increasingly use online resources for their health information needs. While current search engines can address these needs to some extent, they generally do not take into account that most health information needs are complex and can only fully be expressed in natural language. Consumer health question answering (QA) systems aim to fill this gap. A major challenge in developing consumer health QA systems is extracting relevant semantic content from the natural language questions (question understanding). To develop effective question understanding tools, question corpora semantically annotated for relevant question elements are needed. In this paper, we present a two-part consumer health question corpus annotated with several semantic categories: named entities, question triggers/types, question frames, and question topic. The first part (CHQA-email) consists of relatively long email requests received by the U.S. National Library of Medicine (NLM) customer service, while the second part (CHQA-web) consists of shorter questions posed to MedlinePlus search engine as queries. Each question has been annotated by two annotators. The annotation methodology is largely the same between the two parts of the corpus; however, we also explain and justify the differences between them. Additionally, we provide information about corpus characteristics, inter-annotator agreement, and our attempts to measure annotation confidence in the absence of adjudication of annotations. The resulting corpus consists of 2614 questions (CHQA-email: 1740, CHQA-web: 874). Problems are the most frequent named entities, while treatment and general information questions are the most common question types. Inter-annotator agreement was generally modest: question types and topics yielded highest agreement, while the agreement for more complex frame annotations was lower. Agreement in CHQA-web was consistently higher than that in CHQA-email. Pairwise inter-annotator agreement proved most
A Resource of Quantitative Functional Annotation for Homo sapiens Genes.

Science.gov (United States)

Taşan, Murat; Drabkin, Harold J; Beaver, John E; Chua, Hon Nian; Dunham, Julie; Tian, Weidong; Blake, Judith A; Roth, Frederick P

2012-02-01

The body of human genomic and proteomic evidence continues to grow at ever-increasing rates, while annotation efforts struggle to keep pace. A surprisingly small fraction of human genes have clear, documented associations with specific functions, and new functions continue to be found for characterized genes. Here we assembled an integrated collection of diverse genomic and proteomic data for 21,341 human genes and make quantitative associations of each to 4333 Gene Ontology terms. We combined guilt-by-profiling and guilt-by-association approaches to exploit features unique to the data types. Performance was evaluated by cross-validation, prospective validation, and by manual evaluation with the biological literature. Functional-linkage networks were also constructed, and their utility was demonstrated by identifying candidate genes related to a glioma FLN using a seed network from genome-wide association studies. Our annotations are presented-alongside existing validated annotations-in a publicly accessible and searchable web interface.
Annotating and Interpreting Linear and Cyclic Peptide Tandem Mass Spectra.

Science.gov (United States)

Niedermeyer, Timo Horst Johannes

2016-01-01

Nonribosomal peptides often possess pronounced bioactivity, and thus, they are often interesting hit compounds in natural product-based drug discovery programs. Their mass spectrometric characterization is difficult due to the predominant occurrence of non-proteinogenic monomers and, especially in the case of cyclic peptides, the complex fragmentation patterns observed. This makes nonribosomal peptide tandem mass spectra annotation challenging and time-consuming. To meet this challenge, software tools for this task have been developed. In this chapter, the workflow for using the software mMass for the annotation of experimentally obtained peptide tandem mass spectra is described. mMass is freely available (http://www.mmass.org), open-source, and the most advanced and user-friendly software tool for this purpose. The software enables the analyst to concisely annotate and interpret tandem mass spectra of linear and cyclic peptides. Thus, it is highly useful for accelerating the structure confirmation and elucidation of cyclic as well as linear peptides and depsipeptides.
Factors Affecting SSR in Holstein Dairy Cows

Directory of Open Access Journals (Sweden)

Alireza Heravi Mosavi

2016-08-01

Full Text Available Introduction Secondary sex ratio (SSR is the proportion of males to females at birth. It has been shown in many different mammalian species, many factors are associated with SSR. Changes in secondary sex ratio in dairy cows is considered economically important and the ability to change it could affect the revenues and profitability of a dairy farm. Thus, sperm or embryo sexing techniques in recent years has attracted more attention. Most breed of dairy cattle are more likely to have female calf is born to use them as replacement heifers and in order to maintain their productive herd number. On the contrary, when the goal is the production of meat, bull calves due to higher growth rates and production efficiency, are more convenient and more economically efficient. The aim of present study was to investigate some key factors affecting SSR in Iranian Holstein cows. According to Fisher, the sex ratio in the population under the control of natural selection is not always the same. There is overwhelming evidence to support the theory that shows Fisher Primary and secondary sex ratio sex ratio can deviate from this balance and natural selection caused a change in this ratio can be in certain circumstances. For example, the secondary sex ratio of 52:48 has been reported in dairy cows. Studies on mammalian species suggest that several factors, including latitude of the location, the dominant regional climate model, time and frequency of mating to ovulation, diet, age of parents, physical score, breed and produced eggs from ovarian left or right can have a significant effect on the secondary sex ratio. Weather conditions may modify the internal environment and the effect on physiological mechanisms or through the impact on the frequency and type of foods available to parents, the secondary sex ratio is impressive. The impact on the quantity and quality of parent's access to food sources in many species of mammals, the sex ratio has been fixed. Previous
Predicting word sense annotation agreement

DEFF Research Database (Denmark)

Martinez Alonso, Hector; Johannsen, Anders Trærup; Lopez de Lacalle, Oier

2015-01-01

High agreement is a common objective when annotating data for word senses. However, a number of factors make perfect agreement impossible, e.g. the limitations of the sense inventories, the difficulty of the examples or the interpretation preferences of the annotations. Estimating potential...... agreement is thus a relevant task to supplement the evaluation of sense annotations. In this article we propose two methods to predict agreement on word-annotation instances. We experiment with a continuous representation and a three-way discretization of observed agreement. In spite of the difficulty...
Identification and differentiation of Ricinus communis L. using SSR markers

Directory of Open Access Journals (Sweden)

Zdenka Gálová

2015-12-01

Full Text Available Normal 0 false false false CS JA X-NONE The castor-oil plant (Ricinus communis L., a member of the spurge family (Euphorbiaceae, is a versatile industrial oil crop that is cultivated in many tropical and subtropical regions of the world. Castor oil is of continuing importance to the global specialty chemical industry because it is the only commercial source of a hydroxylated fatty acid. Castor also has tremendous future potential as an industrial oilseed crop because of its high seed oil content, unique fatty acid composition, potentially high oil yields and ability to be grown under drought and saline conditions. Knowledge of genetic variability is important for breeding programs to provide the basis for developing desirable genotypes. The aim of this study was to assess genetic diversity within the set of 60 ricin genotypes using 10 SSR primers. Ten SSR primers revealed a total of 67 alleles ranging from 4 to 9 alleles per locus with a mean value of 6.70 alleles per locus. The PIC values ranged from 0.719 to 0.860 with an average value of 0.813 and the DI value ranged from 0.745 to 0.862 with an average value of 0.821. Probability of identity (PI was low ranged from 0.004 to 0.018 with an average of 0.008. A dendrogram was constructed from a genetic distance matrix based on profiles of the 10 SSR loci using the unweighted pair-group method with the arithmetic average (UPGMA. According to analysis, the collection of 60 diverse accessions of castor bean was clustered into six clusters. We could not distinguish 2 genotypes grouped in cluster 1, RM-96 and RM-98, which are genetically the closest. Knowledge on the genetic diversity of castor can be used to future breeding programs of castor.
Molecular genetic alterations in egfr CA-SSR-1 microsatellite and egfr copy number changes are associated with aggressiveness in thymoma.

Science.gov (United States)

Conti, Salvatore; Gallo, Enzo; Sioletic, Stefano; Facciolo, Francesco; Palmieri, Giovannella; Lauriola, Libero; Evoli, Amelia; Martucci, Robert; Di Benedetto, Anna; Novelli, Flavia; Giannarelli, Diana; Deriu, Gloria; Granone, Pierluigi; Ottaviano, Margaret; Muti, Paola; Pescarmona, Edoardo; Marino, Mirella

2016-03-01

The key role of egfr in thymoma pathogenesis has been questioned following the failure in identifying recurrent genetic alterations of egfr coding sequences and relevant egfr amplification rate. We investigated the role of the non-coding egfr CA simple sequence repeat 1 (CA-SSR-1) in a thymoma case series. We used sequencing and egfr-fluorescence in situ hybridization (FISH) to genotype 43 thymomas; (I) for polymorphisms and somatic loss of heterozygosity of the non-coding egfr CA-SSR-1 microsatellite and (II) for egfr gene copy number changes. We found two prevalent CA-SSR-1 genotypes: a homozygous 16 CA repeat and a heterozygous genotype, bearing alleles with 16 and 20 CA repeats. The average combined allele length was correlated with tumor subtype: shorter sequences were significantly associated with the more aggressive WHO thymoma subtype group including B2/B3, B3 and B3/C histotypes. Four out of 29 informative cases analysed for somatic CA-SSR-1 loss of heterozygosity showed allelic imbalance (AI), 3/4 with loss of the longer allele. By egfr-FISH analysis, 9 out of 33 cases were FISH positive. Moreover, the two integrated techniques demonstrated that 3 out of 4 CA-SSR-1-AI positive cases with short allele relative prevalence showed significantly low or high chromosome 7 "polysomy"/increased gene copy number by egfr-FISH. Our molecular and genetic and follow up data indicated that CA-SSR-1-allelic imbalance with short allele relative prevalence significantly correlated with EGFR 3+ immunohistochemical score, increased egfr Gene Copy Number, advanced stage and with relapsing/metastatic behaviour in thymomas.
Identification of SSR and RAPD markers associated with QTLs of ...

African Journals Online (AJOL)

Because of importance of winter survival in winter type of Brassica napus, this study was performed to identify the QTLs controlling winter survival and related traits using SSR and RAPD markers. For this, an F2:3 population of 200 families derived from crossing between cv. 'SLMO46' (winter type and cold resistant) and cv.
Alignment-Annotator web server: rendering and annotating sequence alignments.

Science.gov (United States)

Gille, Christoph; Fähling, Michael; Weyand, Birgit; Wieland, Thomas; Gille, Andreas

2014-07-01

Alignment-Annotator is a novel web service designed to generate interactive views of annotated nucleotide and amino acid sequence alignments (i) de novo and (ii) embedded in other software. All computations are performed at server side. Interactivity is implemented in HTML5, a language native to web browsers. The alignment is initially displayed using default settings and can be modified with the graphical user interfaces. For example, individual sequences can be reordered or deleted using drag and drop, amino acid color code schemes can be applied and annotations can be added. Annotations can be made manually or imported (BioDAS servers, the UniProt, the Catalytic Site Atlas and the PDB). Some edits take immediate effect while others require server interaction and may take a few seconds to execute. The final alignment document can be downloaded as a zip-archive containing the HTML files. Because of the use of HTML the resulting interactive alignment can be viewed on any platform including Windows, Mac OS X, Linux, Android and iOS in any standard web browser. Importantly, no plugins nor Java are required and therefore Alignment-Anotator represents the first interactive browser-based alignment visualization. http://www.bioinformatics.org/strap/aa/ and http://strap.charite.de/aa/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

4-(2-Chloro-4-methoxy-5-methylphenyl)-N-[(1S)-2-cyclopropyl-1-(3-fluoro-4-methylphenyl)ethyl]5-methyl-N-(2-propynyl)-1,3-thiazol-2-amine hydrochloride (SSR125543A): a potent and selective corticotrophin-releasing factor(1) receptor antagonist. I. Biochemical and pharmacological characterization.

Science.gov (United States)

Gully, Danielle; Geslin, Michel; Serva, Laurence; Fontaine, Evelyne; Roger, Pierre; Lair, Christine; Darre, Valerie; Marcy, Claudine; Rouby, Pierre-Eric; Simiand, Jacques; Guitard, Josette; Gout, Georgette; Steinberg, Regis; Rodier, Daniel; Griebel, Guy; Soubrie, Philippe; Pascal, Marc; Pruss, Rebecca; Scatton, Bernard; Maffrand, Jean-Pierre; Le Fur, Gerard

2002-04-01

4-(2-Chloro-4-methoxy-5-methylphenyl)-N-[(1S)-2-cyclopropyl-1- (3-fluoro-4-methylphenyl)ethyl]5-methyl-N-(2-propynyl)-1,3-thiazol-2-amine hydrochloride (SSR125543A), a new 2-aminothiazole derivative, shows nanomolar affinity for human cloned or native corticotrophin-releasing factor (CRF)(1) receptors (pK(i) values of 8.73 and 9.08, respectively), and a 1000-fold selectivity for CRF(1) versus CRF(2 alpha) receptor and CRF binding protein. SSR125543A antagonizes CRF-induced stimulation of cAMP synthesis in human retinoblastoma Y 79 cells (IC(50) = 3.0 +/- 0.4 nM) and adrenocorticotropin hormone (ACTH) secretion in mouse pituitary tumor AtT-20 cells. SSR125543A is devoid of agonist activity in these models. Its brain penetration was demonstrated in rats by using an ex vivo [(125)I-Tyr(0)] ovine CRF binding assay. SSR125543A displaced radioligand binding to the CRF(1) receptor in the brain with an ID(50) of 6.5 mg/kg p.o. (duration of action >24 h). SSR125543A also inhibited the increase in plasma ACTH levels elicited in rats by i.v. CRF (4 microg/kg) injection (ID(50) = 1, 5, or 5 mg/kg i.v., i.p., and p.o., respectively); this effect lasted for more than 6 h when the drug was given orally at a dose of 30 mg/kg. SSR125543A (10 mg/kg p.o.) reduced by 73% the increase in plasma ACTH levels elicited by a 15-min restraint stress in rats. Moreover, SSR125543A (20 mg/kg i.p.) also antagonized the increase of hippocampal acetylcholine release induced by i.c.v. injection of 1 microg of CRF in rats. Finally, SSR125543A reduced forepaw treading induced by i.c.v. injection of 1 microg of CRF in gerbils (ID(50) = approximately 10 mg/kg p.o.). Altogether, these data indicate that SSR125543A is a potent, selective, and orally active CRF(1) receptor antagonist.
Annotated ESTs from various tissues of the brown planthopper Nilaparvata lugens: A genomic resource for studying agricultural pests

Directory of Open Access Journals (Sweden)

Zhang Qiang

2008-03-01

Full Text Available Abstract Background The brown planthopper (BPH, Nilaparvata lugens (Hemiptera, Delphacidae, is a serious insect pests of rice plants. Major means of BPH control are application of agricultural chemicals and cultivation of BPH resistant rice varieties. Nevertheless, BPH strains that are resistant to agricultural chemicals have developed, and BPH strains have appeared that are virulent against the resistant rice varieties. Expressed sequence tag (EST analysis and related applications are useful to elucidate the mechanisms of resistance and virulence and to reveal physiological aspects of this non-model insect, with its poorly understood genetic background. Results More than 37,000 high-quality ESTs, excluding sequences of mitochondrial genome, microbial genomes, and rDNA, have been produced from 18 libraries of various BPH tissues and stages. About 10,200 clusters have been made from whole EST sequences, with average EST size of 627 bp. Among the top ten most abundantly expressed genes, three are unique and show no homology in BLAST searches. The actin gene was highly expressed in BPH, especially in the thorax. Tissue-specifically expressed genes were extracted based on the expression frequency among the libraries. An EST database is available at our web site. Conclusion The EST library will provide useful information for transcriptional analyses, proteomic analyses, and gene functional analyses of BPH. Moreover, specific genes for hemimetabolous insects will be identified. The microarray fabricated based on the EST information will be useful for finding genes related to agricultural and biological problems related to this pest.
SSR-Based DNA Fingerprinting and Diversity Assessment Among Indian Germplasm of Euryale ferox: an Aquatic Underutilized and Neglected Food Crop.

Science.gov (United States)

Kumar, Nitish; Shikha, Divya; Kumari, Swati; Choudhary, Binod Kumar; Kumar, Lokendra; Singh, Indu Shekhar

2017-10-30

Euryale ferox is native to Southeast Asia and China, and it is one of the important aquatic food crops propagated mostly in eastern part of India. The aim of the present study was to characterize and evaluate the genetic diversity of ex situ collections of E. ferox germplasm from different geographical states of India using microsatellite (simple sequence repeats (SSRs)) markers. Ten SSR markers were analyzed to assess DNA fingerprinting and genetic diversity of 16 cultivated germplasm of E. ferox. Total 37 polymorphic alleles were recorded with an average of 3.7 allele frequency per primer. The polymorphic information content value varied from 0.204 to 0.735 with mean of 0.448. A high range of heterozygosity (Ho 0.228; He 0.512) was detected in the present study. The neighbor-joining (N-J) tree and the principle coordinate analysis showed that the germplasm divided in to three main clusters. The results of the present investigation comply that SSR markers are effective for computing genetic assessment of genetic diversity and similarity with classifying cultivated varieties of E. ferox. Evaluation of genetic diversity among Indian E. ferox germplasm could provide useful information for genetic improvement.
Development and mapping of SSR markers linked to resistance-gene homologue clusters in common bean

Institute of Scientific and Technical Information of China (English)

Luz; Nayibe; Garzon; Matthew; Wohlgemuth; Blair

2014-01-01

Common bean is an important but often a disease-susceptible legume crop of temperate,subtropical and tropical regions worldwide. The crop is affected by bacterial, fungal and viral pathogens. The strategy of resistance-gene homologue(RGH) cloning has proven to be an efficient tool for identifying markers and R(resistance) genes associated with resistances to diseases. Microsatellite or SSR markers can be identified by physical association with RGH clones on large-insert DNA clones such as bacterial artificial chromosomes(BACs). Our objectives in this work were to identify RGH-SSR in a BAC library from the Andean genotype G19833 and to test and map any polymorphic markers to identify associations with known positions of disease resistance genes. We developed a set of specific probes designed for clades of common bean RGH genes and then identified positive BAC clones and developed microsatellites from BACs having SSR loci in their end sequences. A total of 629 new RGH-SSRs were identified and named BMr(bean microsatellite RGH-associated markers). A subset of these markers was screened for detecting polymorphism in the genetic mapping population DOR364 × G19833. A genetic map was constructed with a total of 264 markers,among which were 80 RGH loci anchored to single-copy RFLP and SSR markers. Clusters of RGH-SSRs were observed on most of the linkage groups of common bean and in positions associated with R-genes and QTL. The use of these new markers to select for disease resistance is discussed.
Large-scale Identification of Expressed Sequence Tags (ESTs from Nicotianatabacum by Normalized cDNA Library Sequencing

Directory of Open Access Journals (Sweden)

Alvarez S Perez

2014-12-01

Full Text Available An expressed sequence tags (EST resource for tobacco plants (Nicotianatabacum was established using high-throughput sequencing of randomly selected clones from one cDNA library representing a range of plant organs (leaf, stem, root and root base. Over 5000 ESTs were generated from the 3’ ends of 8000 clones, analyzed by BLAST searches and categorized functionally. All annotated ESTs were classified into 18 functional categories, unique transcripts involved in energy were the largest group accounting for 831 (32.32% of the annotated ESTs. After excluding 2450 non-significant tentative unique transcripts (TUTs, 100 unique sequences (1.67% of total TUTs were identified from the N. tabacum database. In the array result two genes strongly related to the tobacco mosaic virus (TMV were obtained, one basic form of pathogenesis-related protein 1 precursor (TBT012G08 and ubiquitin (TBT087G01. Both of them were found in the variety Hongda, some other important genes were classified into two groups, one of these implicated in plant development like those genes related to a photosynthetic process (chlorophyll a-b binding protein, photosystem I, ferredoxin I and III, ATP synthase and a further group including genes related to plant stress response (ubiquitin, ubiquitin-like protein SMT3, glycine-rich RNA binding protein, histones and methallothionein. The interesting finding in this study is that two of these genes have never been reported before in N. tabacum (ubiquitin-like protein SMT3 and methallothionein. The array results were confirmed using quantitative PCR.
Identification of new SSR markers linked to leaf chlorophyll content, flag leaf senescence and cell membrane stability traits in wheat under water stressed condition.

Science.gov (United States)

Barakat, Mohamed N; Saleh, Mohamed; Al-Doss, Abdullah A; Moustafa, Khaled A; Elshafei, Adel A; Al-Qurainy, Fahed H

2015-03-01

Segregating F4 families from the cross between drought sensitive (Yecora Rojo) and drought tolerant (Pavon 76) genotypes were made to identify SSR markers linked to leaf chlorophyll content, flag leaf senescence and cell membrane stability traits in wheat (Triticum aestivum L.) under water-stressed condition and to map quantitative trait locus (QTL) for the three physiological traits. The parents and 150 F4 families were evaluated phenotypically for drought tolerance using two irrigation treatments (2500 and 7500 m3/ha). Using 400 SSR primers tested for polymorphism in testing parental and F4 families genotypes, the results revealed that QTL for leaf chlorophyll content, flag leaf senescence and cell membrane stability traits were associated with 12, 5 and 12 SSR markers, respectively and explained phenotypic variation ranged from 6 to 42%. The SSR markers for physiological traits had genetic distances ranged from 12.5 to 25.5 cM. These SSR markers can be further used in breeding programs for drought tolerance in wheat.
Genome-Wide Analysis of Simple Sequence Repeats and Efficient Development of Polymorphic SSR Markers Based on Whole Genome Re-Sequencing of Multiple Isolates of the Wheat Stripe Rust Fungus.

Directory of Open Access Journals (Sweden)

Huaiyong Luo

Full Text Available The biotrophic parasitic fungus Puccinia striiformis f. sp. tritici (Pst causes stripe rust, a devastating disease of wheat, endangering global food security. Because the Pst population is highly dynamic, it is difficult to develop wheat cultivars with durable and highly effective resistance. Simple sequence repeats (SSRs are widely used as molecular markers in genetic studies to determine population structure in many organisms. However, only a small number of SSR markers have been developed for Pst. In this study, a total of 4,792 SSR loci were identified using the whole genome sequences of six isolates from different regions of the world, with a marker density of one SSR per 22.95 kb. The majority of the SSRs were di- and tri-nucleotide repeats. A database containing 1,113 SSR markers were established. Through in silico comparison, the previously reported SSR markers were found mainly in exons, whereas the SSR markers in the database were mostly in intergenic regions. Furthermore, 105 polymorphic SSR markers were confirmed in silico by their identical positions and nucleotide variations with INDELs identified among the six isolates. When 104 in silico polymorphic SSR markers were used to genotype 21 Pst isolates, 84 produced the target bands, and 82 of them were polymorphic and revealed the genetic relationships among the isolates. The results show that whole genome re-sequencing of multiple isolates provides an ideal resource for developing SSR markers, and the newly developed SSR markers are useful for genetic and population studies of the wheat stripe rust fungus.
Genome-Wide Analysis of Simple Sequence Repeats and Efficient Development of Polymorphic SSR Markers Based on Whole Genome Re-Sequencing of Multiple Isolates of the Wheat Stripe Rust Fungus.

Science.gov (United States)

Luo, Huaiyong; Wang, Xiaojie; Zhan, Gangming; Wei, Guorong; Zhou, Xinli; Zhao, Jing; Huang, Lili; Kang, Zhensheng

2015-01-01

The biotrophic parasitic fungus Puccinia striiformis f. sp. tritici (Pst) causes stripe rust, a devastating disease of wheat, endangering global food security. Because the Pst population is highly dynamic, it is difficult to develop wheat cultivars with durable and highly effective resistance. Simple sequence repeats (SSRs) are widely used as molecular markers in genetic studies to determine population structure in many organisms. However, only a small number of SSR markers have been developed for Pst. In this study, a total of 4,792 SSR loci were identified using the whole genome sequences of six isolates from different regions of the world, with a marker density of one SSR per 22.95 kb. The majority of the SSRs were di- and tri-nucleotide repeats. A database containing 1,113 SSR markers were established. Through in silico comparison, the previously reported SSR markers were found mainly in exons, whereas the SSR markers in the database were mostly in intergenic regions. Furthermore, 105 polymorphic SSR markers were confirmed in silico by their identical positions and nucleotide variations with INDELs identified among the six isolates. When 104 in silico polymorphic SSR markers were used to genotype 21 Pst isolates, 84 produced the target bands, and 82 of them were polymorphic and revealed the genetic relationships among the isolates. The results show that whole genome re-sequencing of multiple isolates provides an ideal resource for developing SSR markers, and the newly developed SSR markers are useful for genetic and population studies of the wheat stripe rust fungus.
Construction of 12 EST libraries and characterization of a 12,226 EST dataset for chicory (Cichorium intybus root, leaves and nodules in the context of carbohydrate metabolism investigation

Directory of Open Access Journals (Sweden)

Boutry Marc

2009-01-01

Full Text Available Abstract Background The industrial chicory, Cichorium intybus, is a member of the Asteraceae family that accumulates fructan of the inulin type in its root. Inulin is a low calories sweetener, a texture agent and a health promoting ingredient due to its prebiotic properties. Average inulin chain length is a critical parameter that is genotype and temperature dependent. In the context of the study of carbohydrate metabolism and to get insight into the transcriptome of chicory root and to visualize temporal changes of gene expression during the growing season, we obtained and characterized 10 cDNA libraries from chicory roots regularly sampled in field during a growing season. A leaf and a nodule libraries were also obtained for comparison. Results Approximately 1,000 Expressed Sequence Tags (EST were obtained from each of twelve cDNA libraries resulting in a 12,226 EST dataset. Clustering of these ESTs returned 1,922 contigs and 4,869 singlets for a total of 6,791 putative unigenes. All ESTs were compared to public sequence databases and functionally classified. Data were specifically searched for sequences related to carbohydrate metabolism. Season wide evolution of functional classes was evaluated by comparing libraries at the level of functional categories and unigenes distribution. Conclusion This chicory EST dataset provides a season wide outlook of the genes expressed in the root and to a minor extent in leaves and nodules. The dataset contains more than 200 sequences related to carbohydrate metabolism and 3,500 new ESTs when compared to other recently released chicory EST datasets, probably because of the season wide coverage of the root samples. We believe that these sequences will contribute to accelerate research and breeding of the industrial chicory as well as of closely related species.
Identification of SSR markers closely linked to the yellow seed coat color gene in heading Chinese cabbage (Brassica rapa L. ssp. pekinensis).

Science.gov (United States)

Ren, Yanjing; Wu, Junqing; Zhao, Jing; Hao, Lingyu; Zhang, Lugang

2017-02-15

Research on the yellow-seeded variety of heading Chinese cabbage will aid in broadening its germplasm resources and lay a foundation for AA genome research in Brassica crops. Here, an F 2 segregating population of 1575 individuals was constructed from two inbred lines (brown-seeded '92S105' and yellow-seeded '91-125'). This population was used to identify the linkage molecular markers of the yellow seed coat trait using simple sequence repeat (SSR) techniques combined with a bulk segregant analysis (BSA). Of the 144 SSR primer pairs on the A01-A10 chromosomes from the Brassica database (http://brassicadb.org/brad/), two pairs located on the A06 chromosome showed polymorphic bands between the bulk DNA pools of eight brown-seeded and eight yellow-seeded F 2 progeny. Based on the genome sequence, 454 SSR markers were designed to A06 to detect these polymorphic bands and were synthesized. Six SSR markers linked to the seed coat color gene were successfully selected for fine linkage genetic map construction, in which the two closest flanking markers, SSR449a and SSR317, mapped the Brsc-ye gene to a 40.2 kb region with distances of 0.07 and 0.06 cM, respectively. The molecular markers obtained in this report will assist in the marker-assisted selection and breeding of yellow-seeded lines in Brassica rapa L. and other close species. © 2017. Published by The Company of Biologists Ltd.
Objective-guided image annotation.

Science.gov (United States)

Mao, Qi; Tsang, Ivor Wai-Hung; Gao, Shenghua

2013-04-01

Automatic image annotation, which is usually formulated as a multi-label classification problem, is one of the major tools used to enhance the semantic understanding of web images. Many multimedia applications (e.g., tag-based image retrieval) can greatly benefit from image annotation. However, the insufficient performance of image annotation methods prevents these applications from being practical. On the other hand, specific measures are usually designed to evaluate how well one annotation method performs for a specific objective or application, but most image annotation methods do not consider optimization of these measures, so that they are inevitably trapped into suboptimal performance of these objective-specific measures. To address this issue, we first summarize a variety of objective-guided performance measures under a unified representation. Our analysis reveals that macro-averaging measures are very sensitive to infrequent keywords, and hamming measure is easily affected by skewed distributions. We then propose a unified multi-label learning framework, which directly optimizes a variety of objective-specific measures of multi-label learning tasks. Specifically, we first present a multilayer hierarchical structure of learning hypotheses for multi-label problems based on which a variety of loss functions with respect to objective-guided measures are defined. And then, we formulate these loss functions as relaxed surrogate functions and optimize them by structural SVMs. According to the analysis of various measures and the high time complexity of optimizing micro-averaging measures, in this paper, we focus on example-based measures that are tailor-made for image annotation tasks but are seldom explored in the literature. Experiments show consistency with the formal analysis on two widely used multi-label datasets, and demonstrate the superior performance of our proposed method over state-of-the-art baseline methods in terms of example-based measures on four
Annotation of mammalian primary microRNAs

Directory of Open Access Journals (Sweden)

Enright Anton J

2008-11-01

Full Text Available Abstract Background MicroRNAs (miRNAs are important regulators of gene expression and have been implicated in development, differentiation and pathogenesis. Hundreds of miRNAs have been discovered in mammalian genomes. Approximately 50% of mammalian miRNAs are expressed from introns of protein-coding genes; the primary transcript (pri-miRNA is therefore assumed to be the host transcript. However, very little is known about the structure of pri-miRNAs expressed from intergenic regions. Here we annotate transcript boundaries of miRNAs in human, mouse and rat genomes using various transcription features. The 5' end of the pri-miRNA is predicted from transcription start sites, CpG islands and 5' CAGE tags mapped in the upstream flanking region surrounding the precursor miRNA (pre-miRNA. The 3' end of the pri-miRNA is predicted based on the mapping of polyA signals, and supported by cDNA/EST and ditags data. The predicted pri-miRNAs are also analyzed for promoter and insulator-associated regulatory regions. Results We define sets of conserved and non-conserved human, mouse and rat pre-miRNAs using bidirectional BLAST and synteny analysis. Transcription features in their flanking regions are used to demarcate the 5' and 3' boundaries of the pri-miRNAs. The lengths and boundaries of primary transcripts are highly conserved between orthologous miRNAs. A significant fraction of pri-miRNAs have lengths between 1 and 10 kb, with very few introns. We annotate a total of 59 pri-miRNA structures, which include 82 pre-miRNAs. 36 pri-miRNAs are conserved in all 3 species. In total, 18 of the confidently annotated transcripts express more than one pre-miRNA. The upstream regions of 54% of the predicted pri-miRNAs are found to be associated with promoter and insulator regulatory sequences. Conclusion Little is known about the primary transcripts of intergenic miRNAs. Using comparative data, we are able to identify the boundaries of a significant proportion of
A high density barley microsatellite consensus map with 775 SSR loci

NARCIS (Netherlands)

Varshney, R.K.; Marcel, T.C.; Ramsay, L.; Russell, J.; Roder, M.S.; Stein, N.; Waugh, R.; Langridge, P.; Niks, R.E.; Graner, A.

2007-01-01

A microsatellite or simple sequence repeat (SSR) consensus map of barley was constructed by joining six independent genetic maps based on the mapping populations 'Igri x Franka', 'Steptoe x Morex', 'OWBRec x OWBDom', 'Lina x Canada Park', 'L94 x Vada' and 'SusPtrit x Vada'. Segregation data for
High levels of heterozygosity found for 15 SSR loci in Solanum chacoense

Science.gov (United States)

Genetic variation is a necessary prerequisite for improving domesticated plants through breeding; without it, breeding progress would be impossible. Genetic variation can be readily ascertained with co-dominant DNA markers, such as simple sequence repeats (SSRs). Twenty-four SSR markers specifically...
Generation and analysis of blueberry transcriptome sequences from leaves, developing fruit, and flower buds from cold acclimation through deacclimation.

Science.gov (United States)

Rowland, Lisa J; Alkharouf, Nadim; Darwish, Omar; Ogden, Elizabeth L; Polashock, James J; Bassil, Nahla V; Main, Dorrie

2012-04-02

There has been increased consumption of blueberries in recent years fueled in part because of their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease. Very few genomic resources have been available for blueberry, however. Further development of genomic resources like expressed sequence tags (ESTs), molecular markers, and genetic linkage maps could lead to more rapid genetic improvement. Marker-assisted selection could be used to combine traits for climatic adaptation with fruit and nutritional quality traits. Efforts to sequence the transcriptome of the commercial highbush blueberry (Vaccinium corymbosum) cultivar Bluecrop and use the sequences to identify genes associated with cold acclimation and fruit development and develop SSR markers for mapping studies are presented here. Transcriptome sequences were generated from blueberry fruit at different stages of development, flower buds at different stages of cold acclimation, and leaves by next-generation Roche 454 sequencing. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The assembled sequences were annotated and functionally mapped to Gene Ontology (GO) terms. Frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR was performed to confirm their differential expression patterns. Overall, 14 out of 17 of the genes examined had differential expression patterns similar to what was predicted from their reads alone. The assembled sequences were also mined for SSRs. From these sequences, 15,886 blueberry EST-SSR loci were identified. Primers were designed from 7,705 of the SSR-containing sequences with adequate flanking sequence. One hundred
Generation and analysis of blueberry transcriptome sequences from leaves, developing fruit, and flower buds from cold acclimation through deacclimation

Directory of Open Access Journals (Sweden)

Rowland Lisa J

2012-04-01

Full Text Available Abstract Background There has been increased consumption of blueberries in recent years fueled in part because of their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease. Very few genomic resources have been available for blueberry, however. Further development of genomic resources like expressed sequence tags (ESTs, molecular markers, and genetic linkage maps could lead to more rapid genetic improvement. Marker-assisted selection could be used to combine traits for climatic adaptation with fruit and nutritional quality traits. Results Efforts to sequence the transcriptome of the commercial highbush blueberry (Vaccinium corymbosum cultivar Bluecrop and use the sequences to identify genes associated with cold acclimation and fruit development and develop SSR markers for mapping studies are presented here. Transcriptome sequences were generated from blueberry fruit at different stages of development, flower buds at different stages of cold acclimation, and leaves by next-generation Roche 454 sequencing. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The assembled sequences were annotated and functionally mapped to Gene Ontology (GO terms. Frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR was performed to confirm their differential expression patterns. Overall, 14 out of 17 of the genes examined had differential expression patterns similar to what was predicted from their reads alone. The assembled sequences were also mined for SSRs. From these sequences, 15,886 blueberry EST-SSR loci were identified. Primers were designed from 7,705 of the SSR-containing sequences
Generation and analysis of blueberry transcriptome sequences from leaves, developing fruit, and flower buds from cold acclimation through deacclimation

Science.gov (United States)

2012-01-01

Background There has been increased consumption of blueberries in recent years fueled in part because of their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease. Very few genomic resources have been available for blueberry, however. Further development of genomic resources like expressed sequence tags (ESTs), molecular markers, and genetic linkage maps could lead to more rapid genetic improvement. Marker-assisted selection could be used to combine traits for climatic adaptation with fruit and nutritional quality traits. Results Efforts to sequence the transcriptome of the commercial highbush blueberry (Vaccinium corymbosum) cultivar Bluecrop and use the sequences to identify genes associated with cold acclimation and fruit development and develop SSR markers for mapping studies are presented here. Transcriptome sequences were generated from blueberry fruit at different stages of development, flower buds at different stages of cold acclimation, and leaves by next-generation Roche 454 sequencing. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The assembled sequences were annotated and functionally mapped to Gene Ontology (GO) terms. Frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR was performed to confirm their differential expression patterns. Overall, 14 out of 17 of the genes examined had differential expression patterns similar to what was predicted from their reads alone. The assembled sequences were also mined for SSRs. From these sequences, 15,886 blueberry EST-SSR loci were identified. Primers were designed from 7,705 of the SSR-containing sequences with adequate flanking
Annotation and sequence diversity of transposable elements in common bean (Phaseolus vulgaris

Directory of Open Access Journals (Sweden)

Scott eJackson

2014-07-01

Full Text Available Common bean (Phaseolus vulgaris is an important legume crop grown and consumed worldwide. With the availability of the common bean genome sequence, the next challenge is to annotate the genome and characterize functional DNA elements. Transposable elements (TEs are the most abundant component of plant genomes and can dramatically affect genome evolution and genetic variation. Thus, it is pivotal to identify TEs in the common bean genome. In this study, we performed a genome-wide transposon annotation in common bean using a combination of homology and sequence structure-based methods. We developed a 2.12-Mb transposon database which includes 791 representative transposon sequences and is available upon request or from www.phytozome.org. Of note, nearly all transposons in the database are previously unrecognized TEs. More than 5,000 transposon-related expressed sequence tags (ESTs were detected which indicates that some transposons may be transcriptionally active. Two Ty1-copia retrotransposon families were found to encode the envelope-like protein which has rarely been identified in plant genomes. Also, we identified an extra open reading frame (ORF termed ORF2 from 15 Ty3-gypsy families that was located between the ORF encoding the retrotransposase and the 3’LTR. The ORF2 was in opposite transcriptional orientation to retrotransposase. Sequence homology searches and phylogenetic analysis suggested that the ORF2 may have an ancient origin, but its function is not clear. This transposon data provides a useful resource for understanding the genome organization and evolution and may be used to identify active TEs for developing transposon-tagging system in common bean and other related genomes.
Genetic Variation and Association Analysis of the SSR Markers Linked to the Major Drought-Yield QTLs of Rice.

Science.gov (United States)

Tabkhkar, Narjes; Rabiei, Babak; Samizadeh Lahiji, Habibollah; Hosseini Chaleshtori, Maryam

2018-02-24

Drought is one of the major abiotic stresses, which hampers the production of rice worldwide. Informative molecular markers are valuable tools for improving the drought tolerance in various varieties of rice. The present study was conducted to evaluate the informative simple sequence repeat (SSR) markers in a diverse set of rice genotypes. The genetic diversity analyses of the 83 studied rice genotypes were performed using 34 SSR markers closely linked to the major quantitative trait loci (QTLs) of grain yield under drought stress (qDTYs). In general, our results indicated high levels of polymorphism. In addition, we screened these rice genotypes at the reproductive stage under both drought stress and nonstressful conditions. The results of the regression analysis demonstrated a significant relationship between 11 SSR marker alleles and the plant paddy weight under stressful conditions. Under the nonstressful conditions, 16 SSR marker alleles showed a significant correlation with the plant paddy weight. Finally, four markers (RM279, RM231, RM166, and RM231) demonstrated a significant association with the plant paddy weight under both stressful and nonstressful conditions. These informative-associated alleles may be useful for improving the crop yield under both drought stress and nonstressful conditions in breeding programs.
Reading Actively Online: An Exploratory Investigation of Online Annotation Tools for Inquiry Learning / La lecture active en ligne: étude exploratoire sur les outils d'annotation en ligne pour l'apprentissage par l’enquête

Directory of Open Access Journals (Sweden)

Jingyan Lu

2012-11-01

Full Text Available This study seeks to design and facilitate active reading among secondary school students with an online annotation tool – Diigo. Two classes of different academic performance levels were recruited to examine their annotation behavior and perceptions of Diigo. We wanted to determine whether the two classes differed in how they used Diigo; how they perceived Diigo; and whether how they used Diigo was related to how they perceived it. Using annotation data and surveys in which students reported on their use and perceptions of Diigo, we found that although the tool facilitated individual annotations, the two classes used and perceived it differently. Overall, the study showed Diigo to be a promising tool for enhancing active reading in the inquiry learning process. Cette étude vise à concevoir et à faciliter la lecture active chez les élèves du secondaire grâce à l’outil d'annotation en ligne Diigo. Deux classes avec des niveaux de rendement scolaire différents ont été retenues afin qu’on examine leur manière d’annoter et leur perception de Diigo. Nous avons voulu déterminer si les deux classes diffèrent dans leur façon d’utiliser Diigo, leur perception de Diigo, et si leur manière d’utiliser Diigo était liée à leur perception. En utilisant les données d'annotation et d'enquêtes dans lesquelles les élèves relataient leur utilisation et leur perception de Diigo, nous avons constaté que, même si l'outil a facilité les annotations individuelles, les deux classes l’ont utilisé et perçu différemment. Dans l'ensemble, l'étude a montré que Diigo est un outil prometteur pour l'amélioration de la lecture active dans le processus d'apprentissage par enquête.

Localized infusions of the partial alpha 7 nicotinic receptor agonist SSR180711 evoke rapid and transient increases in prefrontal glutamate release

DEFF Research Database (Denmark)

Bortz, D M; Mikkelsen, J D; Bruno, J P

2013-01-01

The ability of local infusions of the alpha 7 nicotinic acetycholine receptor (α7 nAChR) partial agonist SSR180711 to evoke glutamate release in prefrontal cortex was determined in awake rats using a microelectrode array. Infusions of SSR180711 produced dose-dependent increases in glutamate levels...
The first set of EST resource for gene discovery and marker development in pigeonpea (Cajanus cajan L.

Directory of Open Access Journals (Sweden)

Byregowda Munishamappa

2010-03-01

Full Text Available Abstract Background Pigeonpea (Cajanus cajan (L. Millsp is one of the major grain legume crops of the tropics and subtropics, but biotic stresses [Fusarium wilt (FW, sterility mosaic disease (SMD, etc.] are serious challenges for sustainable crop production. Modern genomic tools such as molecular markers and candidate genes associated with resistance to these stresses offer the possibility of facilitating pigeonpea breeding for improving biotic stress resistance. Availability of limited genomic resources, however, is a serious bottleneck to undertake molecular breeding in pigeonpea to develop superior genotypes with enhanced resistance to above mentioned biotic stresses. With an objective of enhancing genomic resources in pigeonpea, this study reports generation and analysis of comprehensive resource of FW- and SMD- responsive expressed sequence tags (ESTs. Results A total of 16 cDNA libraries were constructed from four pigeonpea genotypes that are resistant and susceptible to FW ('ICPL 20102' and 'ICP 2376' and SMD ('ICP 7035' and 'TTB 7' and a total of 9,888 (9,468 high quality ESTs were generated and deposited in dbEST of GenBank under accession numbers GR463974 to GR473857 and GR958228 to GR958231. Clustering and assembly analyses of these ESTs resulted into 4,557 unique sequences (unigenes including 697 contigs and 3,860 singletons. BLASTN analysis of 4,557 unigenes showed a significant identity with ESTs of different legumes (23.2-60.3%, rice (28.3%, Arabidopsis (33.7% and poplar (35.4%. As expected, pigeonpea ESTs are more closely related to soybean (60.3% and cowpea ESTs (43.6% than other plant ESTs. Similarly, BLASTX similarity results showed that only 1,603 (35.1% out of 4,557 total unigenes correspond to known proteins in the UniProt database (≤ 1E-08. Functional categorization of the annotated unigenes sequences showed that 153 (3.3% genes were assigned to cellular component category, 132 (2.8% to biological process, and 132 (2
Guidelines for visualizing and annotating rule-based models†

Science.gov (United States)

Chylek, Lily A.; Hu, Bin; Blinov, Michael L.; Emonet, Thierry; Faeder, James R.; Goldstein, Byron; Gutenkunst, Ryan N.; Haugh, Jason M.; Lipniacki, Tomasz; Posner, Richard G.; Yang, Jin; Hlavacek, William S.

2011-01-01

Rule-based modeling provides a means to represent cell signaling systems in a way that captures site-specific details of molecular interactions. For rule-based models to be more widely understood and (re)used, conventions for model visualization and annotation are needed. We have developed the concepts of an extended contact map and a model guide for illustrating and annotating rule-based models. An extended contact map represents the scope of a model by providing an illustration of each molecule, molecular component, direct physical interaction, post-translational modification, and enzyme-substrate relationship considered in a model. A map can also illustrate allosteric effects, structural relationships among molecular components, and compartmental locations of molecules. A model guide associates elements of a contact map with annotation and elements of an underlying model, which may be fully or partially specified. A guide can also serve to document the biological knowledge upon which a model is based. We provide examples of a map and guide for a published rule-based model that characterizes early events in IgE receptor (FcεRI) signaling. We also provide examples of how to visualize a variety of processes that are common in cell signaling systems but not considered in the example model, such as ubiquitination. An extended contact map and an associated guide can document knowledge of a cell signaling system in a form that is visual as well as executable. As a tool for model annotation, a map and guide can communicate the content of a model clearly and with precision, even for large models. PMID:21647530
Guidelines for visualizing and annotating rule-based models.

Science.gov (United States)

Chylek, Lily A; Hu, Bin; Blinov, Michael L; Emonet, Thierry; Faeder, James R; Goldstein, Byron; Gutenkunst, Ryan N; Haugh, Jason M; Lipniacki, Tomasz; Posner, Richard G; Yang, Jin; Hlavacek, William S

2011-10-01

Rule-based modeling provides a means to represent cell signaling systems in a way that captures site-specific details of molecular interactions. For rule-based models to be more widely understood and (re)used, conventions for model visualization and annotation are needed. We have developed the concepts of an extended contact map and a model guide for illustrating and annotating rule-based models. An extended contact map represents the scope of a model by providing an illustration of each molecule, molecular component, direct physical interaction, post-translational modification, and enzyme-substrate relationship considered in a model. A map can also illustrate allosteric effects, structural relationships among molecular components, and compartmental locations of molecules. A model guide associates elements of a contact map with annotation and elements of an underlying model, which may be fully or partially specified. A guide can also serve to document the biological knowledge upon which a model is based. We provide examples of a map and guide for a published rule-based model that characterizes early events in IgE receptor (FcεRI) signaling. We also provide examples of how to visualize a variety of processes that are common in cell signaling systems but not considered in the example model, such as ubiquitination. An extended contact map and an associated guide can document knowledge of a cell signaling system in a form that is visual as well as executable. As a tool for model annotation, a map and guide can communicate the content of a model clearly and with precision, even for large models.
Genetic diversity based on SSR markers in maize (Zea mays L.)

Indian Academy of Sciences (India)

Home; Journals; Journal of Genetics; Volume 87; Issue 3. Genetic diversity based on SSR markers in maize (Zea mays L.) landraces from Wuling mountain region in China. Yao Qi-Lun Fang Ping Kang Ke-Cheng Pan Guang-Tang. Research Note Volume 87 Issue 3 December 2008 pp 287-291 ...
Generation of a predicted protein database from EST data and application to iTRAQ analyses in grape (Vitis vinifera cv. Cabernet Sauvignon berries at ripening initiation

Directory of Open Access Journals (Sweden)

Smith Derek

2009-01-01

Full Text Available Abstract Background iTRAQ is a proteomics technique that uses isobaric tags for relative and absolute quantitation of tryptic peptides. In proteomics experiments, the detection and high confidence annotation of proteins and the significance of corresponding expression differences can depend on the quality and the species specificity of the tryptic peptide map database used for analysis of the data. For species for which finished genome sequence data are not available, identification of proteins relies on similarity to proteins from other species using comprehensive peptide map databases such as the MSDB. Results We were interested in characterizing ripening initiation ('veraison' in grape berries at the protein level in order to better define the molecular control of this important process for grape growers and wine makers. We developed a bioinformatic pipeline for processing EST data in order to produce a predicted tryptic peptide database specifically targeted to the wine grape cultivar, Vitis vinifera cv. Cabernet Sauvignon, and lacking truncated N- and C-terminal fragments. By searching iTRAQ MS/MS data generated from berry exocarp and mesocarp samples at ripening initiation, we determined that implementation of the custom database afforded a large improvement in high confidence peptide annotation in comparison to the MSDB. We used iTRAQ MS/MS in conjunction with custom peptide db searches to quantitatively characterize several important pathway components for berry ripening previously described at the transcriptional level and confirmed expression patterns for these at the protein level. Conclusion We determined that a predicted peptide database for MS/MS applications can be derived from EST data using advanced clustering and trimming approaches and successfully implemented for quantitative proteome profiling. Quantitative shotgun proteome profiling holds great promise for characterizing biological processes such as fruit ripening
Genetic diversity in watermelon (Citrullus lanatus) landraces from Zimbabwe revealed by RAPD and SSR markers.

Science.gov (United States)

Mujaju, C; Sehic, J; Werlemark, G; Garkava-Gustavsson, L; Fatih, M; Nybom, H

2010-08-01

Low polymorphism in cultivated watermelon has been reported in previous studies, based mainly on US Plant Introductions and watermelon cultivars, most of which were linked to breeding programmes associated with disease resistance. Since germplasm sampled in a putative centre of origin in southern Africa may harbour considerably higher variability, DNA marker-based diversity was estimated among 81 seedlings from eight accessions of watermelon collected in Zimbabwe; five accessions of cow-melons (Citrullus lanatus var. citroides) and three of sweet watermelons (C. lanatus var. lanatus). Two molecular marker methods were used, random amplified polymorphic DNA (RAPD) and simple sequence repeats (SSR) also known as microsatellite DNA. Ten RAPD primers produced 138 markers of which 122 were polymorphic. Nine SSR primer pairs detected a total of 43 alleles with an average of 4.8 alleles per locus. The polymorphic information content (PIC) ranged from 0.47 to 0.77 for the RAPD primers and from 0.39 to 0.97 for the SSR loci. Similarity matrices obtained with SSR and RAPD, respectively, were highly correlated but only RAPD was able to provide each sample with an individual-specific DNA profile. Dendrograms and multidimensional scaling (MDS) produced two major clusters; one with the five cow-melon accessions and the other with the three sweet watermelon accessions. One of the most variable cow-melon accessions took an intermediate position in the MDS analysis, indicating the occurrence of gene flow between the two subspecies. Analysis of molecular variation (AMOVA) attributed most of the variability to within-accessions, and contrary to previous reports, sweet watermelon accessions apparently contain diversity of the same magnitude as the cow-melons.
Concept annotation in the CRAFT corpus.

Science.gov (United States)

Bada, Michael; Eckert, Miriam; Evans, Donald; Garcia, Kristin; Shipley, Krista; Sitnikov, Dmitry; Baumgartner, William A; Cohen, K Bretonnel; Verspoor, Karin; Blake, Judith A; Hunter, Lawrence E

2012-07-09

Manually annotated corpora are critical for the training and evaluation of automated methods to identify concepts in biomedical text. This paper presents the concept annotations of the Colorado Richly Annotated Full-Text (CRAFT) Corpus, a collection of 97 full-length, open-access biomedical journal articles that have been annotated both semantically and syntactically to serve as a research resource for the biomedical natural-language-processing (NLP) community. CRAFT identifies all mentions of nearly all concepts from nine prominent biomedical ontologies and terminologies: the Cell Type Ontology, the Chemical Entities of Biological Interest ontology, the NCBI Taxonomy, the Protein Ontology, the Sequence Ontology, the entries of the Entrez Gene database, and the three subontologies of the Gene Ontology. The first public release includes the annotations for 67 of the 97 articles, reserving two sets of 15 articles for future text-mining competitions (after which these too will be released). Concept annotations were created based on a single set of guidelines, which has enabled us to achieve consistently high interannotator agreement. As the initial 67-article release contains more than 560,000 tokens (and the full set more than 790,000 tokens), our corpus is among the largest gold-standard annotated biomedical corpora. Unlike most others, the journal articles that comprise the corpus are drawn from diverse biomedical disciplines and are marked up in their entirety. Additionally, with a concept-annotation count of nearly 100,000 in the 67-article subset (and more than 140,000 in the full collection), the scale of conceptual markup is also among the largest of comparable corpora. The concept annotations of the CRAFT Corpus have the potential to significantly advance biomedical text mining by providing a high-quality gold standard for NLP systems. The corpus, annotation guidelines, and other associated resources are freely available at http://bionlp-corpora.sourceforge.net/CRAFT/index.shtml.
Essential Requirements for Digital Annotation Systems

Directory of Open Access Journals (Sweden)

ADRIANO, C. M.

2012-06-01

Full Text Available Digital annotation systems are usually based on partial scenarios and arbitrary requirements. Accidental and essential characteristics are usually mixed in non explicit models. Documents and annotations are linked together accidentally according to the current technology, allowing for the development of disposable prototypes, but not to the support of non-functional requirements such as extensibility, robustness and interactivity. In this paper we perform a careful analysis on the concept of annotation, studying the scenarios supported by digital annotation tools. We also derived essential requirements based on a classification of annotation systems applied to existing tools. The analysis performed and the proposed classification can be applied and extended to other type of collaborative systems.
Microsatellites for the genus Cucurbita and an SSR-based genetic linkage map of Cucurbita pepo L.

Science.gov (United States)

Gong, L.; Stift, G.; Kofler, R.; Pachner, M.

2008-01-01

Until recently, only a few microsatellites have been available for Cucurbita, thus their development is highly desirable. The Austrian oil-pumpkin variety Gleisdorfer Ölkürbis (C. pepo subsp. pepo) and the C. moschata cultivar Soler (Puerto Rico) were used for SSR development. SSR-enriched partial genomic libraries were established and 2,400 clones were sequenced. Of these 1,058 (44%) contained an SSR at least four repeats long. Primers were designed for 532 SSRs; 500 primer pairs produced fragments of expected size. Of these, 405 (81%) amplified polymorphic fragments in a set of 12 genotypes: three C. moschata, one C. ecuadorensis, and eight C. pepo representing all eight cultivar groups. On an average, C. pepo and C. moschata produced 3.3 alleles per primer pair, showing high inter-species transferability. There were 187 SSR markers detecting polymorphism between the USA oil-pumpkin variety “Lady Godiva” (O5) and the Italian crookneck variety “Bianco Friulano” (CN), which are the parents of our previous F2 mapping population. It has been used to construct the first published C. pepo map, containing mainly RAPD and AFLP markers. Now the updated map comprises 178 SSRs, 244 AFLPs, 230 RAPDs, five SCARs, and two morphological traits (h and B). It contains 20 linkage groups with a map density of 2.9 cM. The observed genome coverage (Co) is 86.8%. Electronic supplementary material The online version of this article (doi:10.1007/s00122-008-0750-2) contains supplementary material, which is available to authorized users. PMID:18379753
Citrus sinensis annotation project (CAP): a comprehensive database for sweet orange genome.

Science.gov (United States)

Wang, Jia; Chen, Dijun; Lei, Yang; Chang, Ji-Wei; Hao, Bao-Hai; Xing, Feng; Li, Sen; Xu, Qiang; Deng, Xiu-Xin; Chen, Ling-Ling

2014-01-01

Citrus is one of the most important and widely grown fruit crop with global production ranking firstly among all the fruit crops in the world. Sweet orange accounts for more than half of the Citrus production both in fresh fruit and processed juice. We have sequenced the draft genome of a double-haploid sweet orange (C. sinensis cv. Valencia), and constructed the Citrus sinensis annotation project (CAP) to store and visualize the sequenced genomic and transcriptome data. CAP provides GBrowse-based organization of sweet orange genomic data, which integrates ab initio gene prediction, EST, RNA-seq and RNA-paired end tag (RNA-PET) evidence-based gene annotation. Furthermore, we provide a user-friendly web interface to show the predicted protein-protein interactions (PPIs) and metabolic pathways in sweet orange. CAP provides comprehensive information beneficial to the researchers of sweet orange and other woody plants, which is freely available at http://citrus.hzau.edu.cn/.
Making web annotations persistent over time

Energy Technology Data Exchange (ETDEWEB)

Sanderson, Robert [Los Alamos National Laboratory; Van De Sompel, Herbert [Los Alamos National Laboratory

2010-01-01

As Digital Libraries (DL) become more aligned with the web architecture, their functional components need to be fundamentally rethought in terms of URIs and HTTP. Annotation, a core scholarly activity enabled by many DL solutions, exhibits a clearly unacceptable characteristic when existing models are applied to the web: due to the representations of web resources changing over time, an annotation made about a web resource today may no longer be relevant to the representation that is served from that same resource tomorrow. We assume the existence of archived versions of resources, and combine the temporal features of the emerging Open Annotation data model with the capability offered by the Memento framework that allows seamless navigation from the URI of a resource to archived versions of that resource, and arrive at a solution that provides guarantees regarding the persistence of web annotations over time. More specifically, we provide theoretical solutions and proof-of-concept experimental evaluations for two problems: reconstructing an existing annotation so that the correct archived version is displayed for all resources involved in the annotation, and retrieving all annotations that involve a given archived version of a web resource.
Semantic annotation in biomedicine: the current landscape.

Science.gov (United States)

Jovanović, Jelena; Bagheri, Ebrahim

2017-09-22

The abundance and unstructured nature of biomedical texts, be it clinical or research content, impose significant challenges for the effective and efficient use of information and knowledge stored in such texts. Annotation of biomedical documents with machine intelligible semantics facilitates advanced, semantics-based text management, curation, indexing, and search. This paper focuses on annotation of biomedical entity mentions with concepts from relevant biomedical knowledge bases such as UMLS. As a result, the meaning of those mentions is unambiguously and explicitly defined, and thus made readily available for automated processing. This process is widely known as semantic annotation, and the tools that perform it are known as semantic annotators.Over the last dozen years, the biomedical research community has invested significant efforts in the development of biomedical semantic annotation technology. Aiming to establish grounds for further developments in this area, we review a selected set of state of the art biomedical semantic annotators, focusing particularly on general purpose annotators, that is, semantic annotation tools that can be customized to work with texts from any area of biomedicine. We also examine potential directions for further improvements of today's annotators which could make them even more capable of meeting the needs of real-world applications. To motivate and encourage further developments in this area, along the suggested and/or related directions, we review existing and potential practical applications and benefits of semantic annotators.
Analysis of a normalised expressed sequence tag (EST) library from a key pollinator, the bumblebee Bombus terrestris.

Science.gov (United States)

Sadd, Ben M; Kube, Michael; Klages, Sven; Reinhardt, Richard; Schmid-Hempel, Paul

2010-02-15

The bumblebee, Bombus terrestris (Order Hymenoptera), is of widespread importance. This species is extensively used for commercial pollination in Europe, and along with other Bombus spp. is a key member of natural pollinator assemblages. Furthermore, the species is studied in a wide variety of biological fields. The objective of this project was to create a B. terrestris EST resource that will prove to be valuable in obtaining a deeper understanding of this significant social insect. A normalised cDNA library was constructed from the thorax and abdomen of B. terrestris workers in order to enhance the discovery of rare genes. A total of 29'428 ESTs were sequenced. Subsequent clustering resulted in 13'333 unique sequences. Of these, 58.8 percent had significant similarities to known proteins, with 54.5 percent having a "best-hit" to existing Hymenoptera sequences. Comparisons with the honeybee and other insects allowed the identification of potential candidates for gene loss, pseudogene evolution, and possible incomplete annotation in the honeybee genome. Further, given the focus of much basic research and the perceived threat of disease to natural and commercial populations, the immune system of bumblebees is a particularly relevant component. Although the library is derived from unchallenged bees, we still uncover transcription of a number of immune genes spanning the principally described insect immune pathways. Additionally, the EST library provides a resource for the discovery of genetic markers that can be used in population level studies. Indeed, initial screens identified 589 simple sequence repeats and 854 potential single nucleotide polymorphisms. The resource that these B. terrestris ESTs represent is valuable for ongoing work. The ESTs provide direct evidence of transcriptionally active regions, but they will also facilitate further functional genomics, gene discovery and future genome annotation. These are important aspects in obtaining a greater
Development of an SSR-based identification key for Tunisian local almonds

Directory of Open Access Journals (Sweden)

Hassouna Gouta

2012-04-01

Full Text Available Ten simple sequence repeat (SSR loci were used to study polymorphism in 54 almond genotypes. All genotypes used in this study originated from almond-growing areas in Tunisia with different climatic conditions ranging from the sub-humid to the arid and are preserved in the national collection at Sidi Bouzid. Using ten SSR, 130 alleles and 250 genotypes were revealed. In order to develop an identification key for each accession, the data were analysed separately for each microsatellite marker. The most polymorphic microsatellite (CPDCT042 was used as a first marker. Two microsatellite loci (CPDCT042 and CPDCT025 were sufficient to discriminate among all accessions studied. Neighbour-joining clustering and principal coordinate analysis were performed to arrange the genotypes according to their genetic relationships and origin. The results are discussed in the context of almond collection management, conformity checks, identification of homonyms, and screening of the local almond germplasm. Furthermore, this microsatellite-based key is a first step toward a marker-assisted identification almond database.
Analysis of cold resistance and identification of SSR markers linked to cold resistance genes in Brassica rapa L.

Science.gov (United States)

Huang, Zhen; Zhang, Xuexian; Jiang, Shouhua; Qin, Mengfan; Zhao, Na; Lang, Lina; Liu, Yaping; Tian, Zhengshu; Liu, Xia; Wang, Yang; Zhang, Binbin; Xu, Aixia

2017-06-01

Currently, cold temperatures are one of the main factors threatening rapeseed production worldwide; thus, it is imperative to identify cold-resistant germplasm and to cultivate cold-resistant rapeseed varieties. In this study, the cold resistance of four Brassica rapa varieties was analyzed. The cold resistance of Longyou6 and Longyou7 was better than that of Tianyou2 and Tianyou4. Thus, an F 2 population derived from Longyou6 and Tianyou4 was used to study the correlation of cold resistance and physiological indexes. Our results showed that the degree of frost damage was related to the relative conductivity and MDA content (r1 = 0.558 and r2 = 0.447, respectively). In order to identify the markers related to cold resistance, 504 pairs of SSR (simple sequence repeats) primers were used to screen the two parents and F 2 population. Four and five SSR markers had highly significant positive correlation to relative conductivity and MDA, respectively. In addition, three of these SSR markers had a highly significant positive correlation to both of these two indexes. These three SSR markers were subsequently confirmed to be used to distinguish between cold-resistant and non-cold-resistant varieties. The results of this study will lay a solid foundation for the mapping of cold-resistant genes and molecular markers assisted selection for the cold-resistance.
Magnetic characterization techniques for nanomaterials

CERN Document Server

2017-01-01

Sixth volume of a 40 volume series on nanoscience and nanotechnology, edited by the renowned scientist Challa S.S.R. Kumar. This handbook gives a comprehensive overview about Magnetic Characterization Techniques for Nanomaterials. Modern applications and state-of-the-art techniques are covered and make this volume an essential reading for research scientists in academia and industry.
Development of EST-SSR markers for Elaeocarpus photiniifolia (Elaeocarpaceae), an endemic taxon of the Bonin Islands.

Science.gov (United States)

Sugai, Kyoko; Setsuko, Suzuki; Uchiyama, Kentaro; Murakami, Noriaki; Kato, Hidetoshi; Yoshimaru, Hiroshi

2012-02-01

Expressed sequence tag (EST)-derived microsatellite markers were developed for Elaeocarpus photiniifolia, an endemic taxon of the Bonin Islands. Initially, a complementary DNA (cDNA) library was constructed by de novo pyrosequencing of total RNA extracted from a seedling. A total of 267 primer pairs were designed from the library. Of the 48 tested loci, 25 loci were polymorphic among 41 individuals representing the entire geographical range of the species, with the number of alleles per locus and expected heterozygosity ranging from two to 14 and 0.09 to 0.86, respectively. Most loci were transferable to a related species, E. sylvestris. The developed markers will be useful for evaluating the genetic structure of E. photiniifolia.
Active learning reduces annotation time for clinical concept extraction.

Science.gov (United States)

Kholghi, Mahnoosh; Sitbon, Laurianne; Zuccon, Guido; Nguyen, Anthony

2017-10-01

To investigate: (1) the annotation time savings by various active learning query strategies compared to supervised learning and a random sampling baseline, and (2) the benefits of active learning-assisted pre-annotations in accelerating the manual annotation process compared to de novo annotation. There are 73 and 120 discharge summary reports provided by Beth Israel institute in the train and test sets of the concept extraction task in the i2b2/VA 2010 challenge, respectively. The 73 reports were used in user study experiments for manual annotation. First, all sequences within the 73 reports were manually annotated from scratch. Next, active learning models were built to generate pre-annotations for the sequences selected by a query strategy. The annotation/reviewing time per sequence was recorded. The 120 test reports were used to measure the effectiveness of the active learning models. When annotating from scratch, active learning reduced the annotation time up to 35% and 28% compared to a fully supervised approach and a random sampling baseline, respectively. Reviewing active learning-assisted pre-annotations resulted in 20% further reduction of the annotation time when compared to de novo annotation. The number of concepts that require manual annotation is a good indicator of the annotation time for various active learning approaches as demonstrated by high correlation between time rate and concept annotation rate. Active learning has a key role in reducing the time required to manually annotate domain concepts from clinical free text, either when annotating from scratch or reviewing active learning-assisted pre-annotations. Copyright © 2017 Elsevier B.V. All rights reserved.
Similarity maps and hierarchical clustering for annotating FT-IR spectral images.

Science.gov (United States)

Zhong, Qiaoyong; Yang, Chen; Großerüschkamp, Frederik; Kallenbach-Thieltges, Angela; Serocka, Peter; Gerwert, Klaus; Mosig, Axel

2013-11-20

Unsupervised segmentation of multi-spectral images plays an important role in annotating infrared microscopic images and is an essential step in label-free spectral histopathology. In this context, diverse clustering approaches have been utilized and evaluated in order to achieve segmentations of Fourier Transform Infrared (FT-IR) microscopic images that agree with histopathological characterization. We introduce so-called interactive similarity maps as an alternative annotation strategy for annotating infrared microscopic images. We demonstrate that segmentations obtained from interactive similarity maps lead to similarly accurate segmentations as segmentations obtained from conventionally used hierarchical clustering approaches. In order to perform this comparison on quantitative grounds, we provide a scheme that allows to identify non-horizontal cuts in dendrograms. This yields a validation scheme for hierarchical clustering approaches commonly used in infrared microscopy. We demonstrate that interactive similarity maps may identify more accurate segmentations than hierarchical clustering based approaches, and thus are a viable and due to their interactive nature attractive alternative to hierarchical clustering. Our validation scheme furthermore shows that performance of hierarchical two-means is comparable to the traditionally used Ward's clustering. As the former is much more efficient in time and memory, our results suggest another less resource demanding alternative for annotating large spectral images.

Characterization of expressed sequence tag-derived simple sequence repeat markers for Aspergillus flavus: emphasis on variability of isolates from the southern United States.

Science.gov (United States)

Wang, Xinwang; Wadl, Phillip A; Wood-Jones, Alicia; Windham, Gary; Trigiano, Robert N; Scruggs, Mary; Pilgrim, Candace; Baird, Richard

2012-12-01

Simple sequence repeat (SSR) markers were developed from Aspergillus flavus expressed sequence tag (EST) database to conduct an analysis of genetic relationships of Aspergillus isolates from numerous host species and geographical regions, but primarily from the United States. Twenty-nine primers were designed from 362 tri-nucleotide EST-SSR sequences. Eighteen polymorphic loci were used to genotype 96 Aspergillus species isolates. The number of alleles detected per locus ranged from 2 to 24 with a mean of 8.2 alleles. Haploid diversity ranged from 0.28 to 0.91. Genetic distance matrix was used to perform principal coordinates analysis (PCA) and to generate dendrograms using unweighted pair group method with arithmetic mean (UPGMA). Two principal coordinates explained more than 75 % of the total variation among the isolates. One clade was identified for A. flavus isolates (n = 87) with the other Aspergillus species (n = 7) using PCA, but five distinct clusters were present when the others taxa were excluded from the analysis. Six groups were noted when the EST-SSR data were compared using UPGMA. However, the latter PCA or UPGMA comparison resulted in no direct associations with host species, geographical region or aflatoxin production. Furthermore, there was no direct correlation to visible morphological features such as sclerotial types. The isolates from Mississippi Delta region, which contained the largest percentage of isolates, did not show any unusual clustering except for isolates K32, K55, and 199. Further studies of these three isolates are warranted to evaluate their pathogenicity, aflatoxin production potential, additional gene sequences (e.g., RPB2), and morphological comparisons.
A database of annotated tentative orthologs from crop abiotic stress transcripts.

Science.gov (United States)

Balaji, Jayashree; Crouch, Jonathan H; Petite, Prasad V N S; Hoisington, David A

2006-10-07

A minimal requirement to initiate a comparative genomics study on plant responses to abiotic stresses is a dataset of orthologous sequences. The availability of a large amount of sequence information, including those derived from stress cDNA libraries allow for the identification of stress related genes and orthologs associated with the stress response. Orthologous sequences serve as tools to explore genes and their relationships across species. For this purpose, ESTs from stress cDNA libraries across 16 crop species including 6 important cereal crops and 10 dicots were systematically collated and subjected to bioinformatics analysis such as clustering, grouping of tentative orthologous sets, identification of protein motifs/patterns in the predicted protein sequence, and annotation with stress conditions, tissue/library source and putative function. All data are available to the scientific community at http://intranet.icrisat.org/gt1/tog/homepage.htm. We believe that the availability of annotated plant abiotic stress ortholog sets will be a valuable resource for researchers studying the biology of environmental stresses in plant systems, molecular evolution and genomics.
AFLP/SSR mapping of resistance genes to Alectra vogelii in cowpea ...

African Journals Online (AJOL)

To find and map the resistance gene to A. vogelii in cowpea, a F2 population from a cross involving a resistant parent IT81D-994 and a susceptible TVX3236 was screened. Amplified fragment length polymorphism (AFLP) in combination with Single Sequence Repeat (SSR) analysis was used to identify markers that may be ...
Screening of the White Button Mushroom (Agaricusbisporus Homokaryons and Producing New Hybrid Strain by SSR Markers

Directory of Open Access Journals (Sweden)

marzieh nourashrafeddin

2018-02-01

detection. Ten SSR primers showed polymorphism in parental control samples that were used to this experiment. The isolates were divided into two general homoallelic and heteroallelic groups and seven isolates from homoallellic group, which showed one-band pattern, characterized as putative homokaryon. Genetic similarity was calculated by NTSYSpc software version 2.02 e using UPGMA method. In the next step of experiment, the isolates (4 and 8 had minimum genetic similarity that was crossed to produce hybrid. In order to confirm the hybrid formation, PCR-SSR reaction with a primer (AbSSR 45 was performed. Results and Discussions: Basidiospores were collected and allowed to germinate on CEA medium. Putative homokaryons were different in colony morphology and growth rate compared to the original heterokaryons. Mycelium samples showed different colony morphology including tomentose, apprised and strandy mycelium. Different growth rate can be affected by genetic factors in nucleus and mitoconderia. After four weeks, mycelium browning was appeared in liquid compost extract medium and created a disturbance in DNA extraction. To solve this problem, DNA was extracted from three-week old mycelium. Mycelium browning may cause by phenolic compounds produced by mycelium and enzymes that catalyze melanin biosynthesis reactions. Ten primers were used to homokaryon isolation. These primers were situated on the 9 linkage groups of 13 haploid chromosomes. Seven isolates were distinguished as putative homokaryon that showed one-band in all primers on the gel electrophoresis. The results of genetic similarity calculation showed that this index was variable between 0.17 to 0.67in 7 homokaryon isolates and the minimum genetic similarity (0.17 was observed between isolates 4 and 8. These two isolates were crossed and the result of this crossing was N1 hybrid. Also, other homokaryon isolates were crossed and mating incompatibility was observed in some of them. According to these observations, it is
Pattern analysis approach reveals restriction enzyme cutting abnormalities and other cDNA library construction artifacts using raw EST data

Directory of Open Access Journals (Sweden)

Zhou Sun

2012-05-01

Full Text Available Abstract Background Expressed Sequence Tag (EST sequences are widely used in applications such as genome annotation, gene discovery and gene expression studies. However, some of GenBank dbEST sequences have proven to be “unclean”. Identification of cDNA termini/ends and their structures in raw ESTs not only facilitates data quality control and accurate delineation of transcription ends, but also furthers our understanding of the potential sources of data abnormalities/errors present in the wet-lab procedures for cDNA library construction. Results After analyzing a total of 309,976 raw Pinus taeda ESTs, we uncovered many distinct variations of cDNA termini, some of which prove to be good indicators of wet-lab artifacts, and characterized each raw EST by its cDNA terminus structure patterns. In contrast to the expected patterns, many ESTs displayed complex and/or abnormal patterns that represent potential wet-lab errors such as: a failure of one or both of the restriction enzymes to cut the plasmid vector; a failure of the restriction enzymes to cut the vector at the correct positions; the insertion of two cDNA inserts into a single vector; the insertion of multiple and/or concatenated adapters/linkers; the presence of 3′-end terminal structures in designated 5′-end sequences or vice versa; and so on. With a close examination of these artifacts, many problematic ESTs that have been deposited into public databases by conventional bioinformatics pipelines or tools could be cleaned or filtered by our methodology. We developed a software tool for Abnormality Filtering and Sequence Trimming for ESTs (AFST, http://code.google.com/p/afst/ using a pattern analysis approach. To compare AFST with other pipelines that submitted ESTs into dbEST, we reprocessed 230,783 Pinus taeda and 38,709 Arachis hypogaea GenBank ESTs. We found 7.4% of Pinus taeda and 29.2% of Arachis hypogaea GenBank ESTs are “unclean” or abnormal, all of which could be cleaned
Genome Survey Sequencing of Luffa Cylindrica L. and Microsatellite High Resolution Melting (SSR-HRM) Analysis for Genetic Relationship of Luffa Genotypes.

Science.gov (United States)

An, Jianyu; Yin, Mengqi; Zhang, Qin; Gong, Dongting; Jia, Xiaowen; Guan, Yajing; Hu, Jin

2017-09-11

Luffa cylindrica (L.) Roem. is an economically important vegetable crop in China. However, the genomic information on this species is currently unknown. In this study, for the first time, a genome survey of L. cylindrica was carried out using next-generation sequencing (NGS) technology. In total, 43.40 Gb sequence data of L. cylindrica , about 54.94× coverage of the estimated genome size of 789.97 Mb, were obtained from HiSeq 2500 sequencing, in which the guanine plus cytosine (GC) content was calculated to be 37.90%. The heterozygosity of genome sequences was only 0.24%. In total, 1,913,731 contigs (>200 bp) with 525 bp N 50 length and 1,410,117 scaffolds (>200 bp) with 885.01 Mb total length were obtained. From the initial assembled L. cylindrica genome, 431,234 microsatellites (SSRs) (≥5 repeats) were identified. The motif types of SSR repeats included 62.88% di-nucleotide, 31.03% tri-nucleotide, 4.59% tetra-nucleotide, 0.96% penta-nucleotide and 0.54% hexa-nucleotide. Eighty genomic SSR markers were developed, and 51/80 primers could be used in both "Zheda 23" and "Zheda 83". Nineteen SSRs were used to investigate the genetic diversity among 32 accessions through SSR-HRM analysis. The unweighted pair group method analysis (UPGMA) dendrogram tree was built by calculating the SSR-HRM raw data. SSR-HRM could be effectively used for genotype relationship analysis of Luffa species.
Development of SSR markers for the short arm of rye chromosome 1

Czech Academy of Sciences Publication Activity Database

Kofler, R.; Stift, G.; Gong, L.; Suchánková, Pavla; Šimková, Hana; Doležel, Jaroslav; Lelley, T.

2007-01-01

Roč. 71, - (2007), s. 279-281 ISSN 0723-7812 R&D Projects: GA ČR GA521/04/0607 Institutional research plan: CEZ:AV0Z50380511 Source of funding: V - iné verejné zdroje Keywords : SSR markers * rye chromosome 1 Subject RIV: GE - Plant Breeding
Early experiences with crowdsourcing airway annotations in chest CT

DEFF Research Database (Denmark)

Cheplygina, Veronika; Perez-Rovira, Adria; Kuo, Wieying

2016-01-01

Measuring airways in chest computed tomography (CT) images is important for characterizing diseases such as cystic fibrosis, yet very time-consuming to perform manually. Machine learning algorithms offer an alternative, but need large sets of annotated data to perform well. We investigate whether...... a number of further research directions and provide insight into the challenges of crowdsourcing in medical images from the perspective of first-time users....
RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes

Energy Technology Data Exchange (ETDEWEB)

Brettin, Thomas; Davis, James J.; Disz, Terry; Edwards, Robert A.; Gerdes, Svetlana; Olsen, Gary J.; Olson, Robert; Overbeek, Ross; Parrello, Bruce; Pusch, Gordon D.; Shukla, Maulik; Thomason, James A.; Stevens, Rick; Vonstein, Veronika; Wattam, Alice R.; Xia, Fangfang

2015-02-10

The RAST (Rapid Annotation using Subsystem Technology) annotation engine was built in 2008 to annotate bacterial and archaeal genomes. It works by offering a standard software pipeline for identifying genomic features (i.e., protein-encoding genes and RNA) and annotating their functions. Recently, in order to make RAST a more useful research tool and to keep pace with advancements in bioinformatics, it has become desirable to build a version of RAST that is both customizable and extensible. In this paper, we describe the RAST tool kit (RASTtk), a modular version of RAST that enables researchers to build custom annotation pipelines. RASTtk offers a choice of software for identifying and annotating genomic features as well as the ability to add custom features to an annotation job. RASTtk also accommodates the batch submission of genomes and the ability to customize annotation protocols for batch submissions. This is the first major software restructuring of RAST since its inception.
RASTtk: a modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes.

Science.gov (United States)

Brettin, Thomas; Davis, James J; Disz, Terry; Edwards, Robert A; Gerdes, Svetlana; Olsen, Gary J; Olson, Robert; Overbeek, Ross; Parrello, Bruce; Pusch, Gordon D; Shukla, Maulik; Thomason, James A; Stevens, Rick; Vonstein, Veronika; Wattam, Alice R; Xia, Fangfang

2015-02-10

The RAST (Rapid Annotation using Subsystem Technology) annotation engine was built in 2008 to annotate bacterial and archaeal genomes. It works by offering a standard software pipeline for identifying genomic features (i.e., protein-encoding genes and RNA) and annotating their functions. Recently, in order to make RAST a more useful research tool and to keep pace with advancements in bioinformatics, it has become desirable to build a version of RAST that is both customizable and extensible. In this paper, we describe the RAST tool kit (RASTtk), a modular version of RAST that enables researchers to build custom annotation pipelines. RASTtk offers a choice of software for identifying and annotating genomic features as well as the ability to add custom features to an annotation job. RASTtk also accommodates the batch submission of genomes and the ability to customize annotation protocols for batch submissions. This is the first major software restructuring of RAST since its inception.
PROBLEMS OF THE EFFICIENCY INCREASING OF TRANSPORTATION BY AIR OF UKRAINIAN SSR (1960-1980

Directory of Open Access Journals (Sweden)

Anatoliy Gorban

2015-11-01

Full Text Available The article is devoted to the problems of the efficiency increasing of the air transportation. The difficulties of increasing the efficiency of transportation by air in Ukrainian SSR in 1960-1980 were researched, factors that adversely affected the organization of the transport sector were determined and depicted. The article analyzes what caused such difficulties and it was found out that the causes of these difficulties are connected with the organizational problems of air transport of Ukrainian SSR, which negatively affected the operation of the industry. The central aim of the research is to focus on the main problems of air transport of Ukrainian SSR. So, we should say that the transport operation of those years was distributed too unevenly and was dependent on the population density of the territory of the republic. Purpose of the article is to determine, compile and analyze the factors that negatively affected the organization of air transportation of the Ukrainian republic and reduced the efficiency of its operation. Results of the research shows technical, organization and economical deficiency of air transport of Ukrainian SSR which caused the ineffectiveness of this type of transport and determines the nature of such difficulties. Statement of the problem. During the specified period (1960–1980 the air transport had undergone rapid development. Many new airlines were opened, airports were being built and reconstructed, the terms of exploiting of turbojet aircrafts were increased, the speed of planes was increasing. All these facts ensured safe and reliable air connection of all district centers, connected Ukraine with the other Soviet republics and foreign countries by air corridors. Ukrainian Department of Civil Aviation became the biggest regional Department of the Ministry of Civil Aviation of the USSR. But, at the same time the intensity of the increase of cargo and passenger transportation since 1970s led to accumulation of
A comprehensive characterization of simple sequence repeats in pepper genomes provides valuable resources for marker development in Capsicum.

Science.gov (United States)

Cheng, Jiaowen; Zhao, Zicheng; Li, Bo; Qin, Cheng; Wu, Zhiming; Trejo-Saavedra, Diana L; Luo, Xirong; Cui, Junjie; Rivera-Bustamante, Rafael F; Li, Shuaicheng; Hu, Kailin

2016-01-07

The sequences of the full set of pepper genomes including nuclear, mitochondrial and chloroplast are now available for use. However, the overall of simple sequence repeats (SSR) distribution in these genomes and their practical implications for molecular marker development in Capsicum have not yet been described. Here, an average of 868,047.50, 45.50 and 30.00 SSR loci were identified in the nuclear, mitochondrial and chloroplast genomes of pepper, respectively. Subsequently, systematic comparisons of various species, genome types, motif lengths, repeat numbers and classified types were executed and discussed. In addition, a local database composed of 113,500 in silico unique SSR primer pairs was built using a homemade bioinformatics workflow. As a pilot study, 65 polymorphic markers were validated among a wide collection of 21 Capsicum genotypes with allele number and polymorphic information content value per marker raging from 2 to 6 and 0.05 to 0.64, respectively. Finally, a comparison of the clustering results with those of a previous study indicated the usability of the newly developed SSR markers. In summary, this first report on the comprehensive characterization of SSR motifs in pepper genomes and the very large set of SSR primer pairs will benefit various genetic studies in Capsicum.
La implementación de la política pública de salud sexual y reproductiva (SSR en el Eje Cafetero colombiano: el caso del embarazo adolescente

Directory of Open Access Journals (Sweden)

Sara E. del Castillo Matamoros

2008-05-01

Full Text Available La política nacional de salud sexual y reproductiva (SSR definida en Colombia en 2002 por el Ministerio de la Protección Social para los años 2002 a 2006 señala los temas prioritarios en este campo: maternidad segura, planificación familiar, salud sexual y reproductiva de las y los adolescentes, cáncer de cuello uterino, infecciones de transmisión sexual y reproductiva, VIH/SIDA, y violencia doméstica y sexual. La investigación que se reporta se ha focalizado en el análisis cuantitativo y cualitativo del proceso de implementación de la política de salud sexual y reproductiva de los y las adolescentes en los tres departamentos y capitales que conforman el llamado “Eje Cafetero colombiano”. Los resultados del estudio mostraron que, si bien se mide una reducción en el número de nacimientos en adolescentes (10-19 años en la región vinculada al estudio entre 2003 y 2005, no se pudieron determinar estrategias y actividades explicativas de ello. La política ha permitido visibilizar y legitimar acciones específicas en este campo para la población adolescente. Sin embargo, se observó la existencia de importantes en las metodologías de recolección y sistematización de los datos relativos a la salud sexual y reproductiva de la población adolescente según las entidades estudiadas, así como la ausencia de retroalimentación cuantitativa utilizable para los actores de la política. En conclusión, se emiten unas recomendaciones para el ajuste de dicha política de SSR.
AIGO: Towards a unified framework for the Analysis and the Inter-comparison of GO functional annotations

Directory of Open Access Journals (Sweden)

Defoin-Platel Michael

2011-11-01

Full Text Available Abstract Background In response to the rapid growth of available genome sequences, efforts have been made to develop automatic inference methods to functionally characterize them. Pipelines that infer functional annotation are now routinely used to produce new annotations at a genome scale and for a broad variety of species. These pipelines differ widely in their inference algorithms, confidence thresholds and data sources for reasoning. This heterogeneity makes a comparison of the relative merits of each approach extremely complex. The evaluation of the quality of the resultant annotations is also challenging given there is often no existing gold-standard against which to evaluate precision and recall. Results In this paper, we present a pragmatic approach to the study of functional annotations. An ensemble of 12 metrics, describing various aspects of functional annotations, is defined and implemented in a unified framework, which facilitates their systematic analysis and inter-comparison. The use of this framework is demonstrated on three illustrative examples: analysing the outputs of state-of-the-art inference pipelines, comparing electronic versus manual annotation methods, and monitoring the evolution of publicly available functional annotations. The framework is part of the AIGO library (http://code.google.com/p/aigo for the Analysis and the Inter-comparison of the products of Gene Ontology (GO annotation pipelines. The AIGO library also provides functionalities to easily load, analyse, manipulate and compare functional annotations and also to plot and export the results of the analysis in various formats. Conclusions This work is a step toward developing a unified framework for the systematic study of GO functional annotations. This framework has been designed so that new metrics on GO functional annotations can be added in a very straightforward way.
Simple sequence repeat (SSR) markers are effective for identifying ...

African Journals Online (AJOL)

DNA was extracted from newly formed leaves and amplified using 21 simple sequence repeat (SSR) markers (NH001c, NH002b, NH005b, NH007b, NH008b, NH009b, NH011b, NH013b, NH012a, NH014a, NH015a, NH017a, KA4b, KA5, KA14, KA16, KB16, KU10, BGA35, BGT23b and HGA8b). The data was analyzed by ...
Computer systems for annotation of single molecule fragments

Science.gov (United States)

Schwartz, David Charles; Severin, Jessica

2016-07-19

There are provided computer systems for visualizing and annotating single molecule images. Annotation systems in accordance with this disclosure allow a user to mark and annotate single molecules of interest and their restriction enzyme cut sites thereby determining the restriction fragments of single nucleic acid molecules. The markings and annotations may be automatically generated by the system in certain embodiments and they may be overlaid translucently onto the single molecule images. An image caching system may be implemented in the computer annotation systems to reduce image processing time. The annotation systems include one or more connectors connecting to one or more databases capable of storing single molecule data as well as other biomedical data. Such diverse array of data can be retrieved and used to validate the markings and annotations. The annotation systems may be implemented and deployed over a computer network. They may be ergonomically optimized to facilitate user interactions.
Overexpressed Genes/ESTs and Characterization of Distinct Amplicons on 17823 in Breast Cancer Cells

Directory of Open Access Journals (Sweden)

Ayse E. Erson

2001-01-01

Full Text Available 17823 is a frequent site of gene amplification in breast cancer. Several lines of evidence suggest the presence of multiple amplicons on 17823. To characterize distinct amplicons on 17823 and localize putative oncogenes, we screened genes and expressed sequence tags (ESTs in existing physical and radiation hybrid maps for amplification and overexpression in breast cancer cell lines by semiquantitative duplex PCR, semiquantitative duplex RT-PCR, Southern blot, Northern blot analyses. We identified two distinct amplicons on 17823, one including TBX2 and another proximal region including RPS6KB1 (PS6K and MUL. In addition to these previously reported overexpressed genes, we also identified amplification and overexpression of additional uncharacterized genes and ESTs, some of which suggest potential oncogenic activity. In conclusion, we have further defined two distinct regions of gene amplification and overexpression on 17823 with identification of new potential oncogene candidates. Based on the amplification and overexpression patterns of known and as of yet unrecognized genes on 17823, it is likely that some of these genes mapping to the discrete amplicons function as oncogenes and contribute to tumor progression in breast cancer cells.
Image annotation under X Windows

Science.gov (United States)

Pothier, Steven

1991-08-01

A mechanism for attaching graphic and overlay annotation to multiple bits/pixel imagery while providing levels of performance approaching that of native mode graphics systems is presented. This mechanism isolates programming complexity from the application programmer through software encapsulation under the X Window System. It ensures display accuracy throughout operations on the imagery and annotation including zooms, pans, and modifications of the annotation. Trade-offs that affect speed of display, consumption of memory, and system functionality are explored. The use of resource files to tune the display system is discussed. The mechanism makes use of an abstraction consisting of four parts; a graphics overlay, a dithered overlay, an image overly, and a physical display window. Data structures are maintained that retain the distinction between the four parts so that they can be modified independently, providing system flexibility. A unique technique for associating user color preferences with annotation is introduced. An interface that allows interactive modification of the mapping between image value and color is discussed. A procedure that provides for the colorization of imagery on 8-bit display systems using pixel dithering is explained. Finally, the application of annotation mechanisms to various applications is discussed.
Motion lecture annotation system to learn Naginata performances

Science.gov (United States)

Kobayashi, Daisuke; Sakamoto, Ryota; Nomura, Yoshihiko

2013-12-01

This paper describes a learning assistant system using motion capture data and annotation to teach "Naginata-jutsu" (a skill to practice Japanese halberd) performance. There are some video annotation tools such as YouTube. However these video based tools have only single angle of view. Our approach that uses motion-captured data allows us to view any angle. A lecturer can write annotations related to parts of body. We have made a comparison of effectiveness between the annotation tool of YouTube and the proposed system. The experimental result showed that our system triggered more annotations than the annotation tool of YouTube.
MIPS: analysis and annotation of proteins from whole genomes.

Science.gov (United States)

Mewes, H W; Amid, C; Arnold, R; Frishman, D; Güldener, U; Mannhaupt, G; Münsterkötter, M; Pagel, P; Strack, N; Stümpflen, V; Warfsmann, J; Ruepp, A

2004-01-01

The Munich Information Center for Protein Sequences (MIPS-GSF), Neuherberg, Germany, provides protein sequence-related information based on whole-genome analysis. The main focus of the work is directed toward the systematic organization of sequence-related attributes as gathered by a variety of algorithms, primary information from experimental data together with information compiled from the scientific literature. MIPS maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the database of complete cDNAs (German Human Genome Project, NGFN), the database of mammalian protein-protein interactions (MPPI), the database of FASTA homologies (SIMAP), and the interface for the fast retrieval of protein-associated information (QUIPOS). The Arabidopsis thaliana database, the rice database, the plant EST databases (MATDB, MOsDB, SPUTNIK), as well as the databases for the comprehensive set of genomes (PEDANT genomes) are described elsewhere in the 2003 and 2004 NAR database issues, respectively. All databases described, and the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de).

In vivo imaging of neuroinflammation in the rodent brain with [{sup 11}C]SSR180575, a novel indoleacetamide radioligand of the translocator protein (18 kDa)

Energy Technology Data Exchange (ETDEWEB)

Chauveau, Fabien [CEA, DSV, IBM, Service Hospitalier Frederic Joliot, Orsay (France); Universite Paris Sud, INSERM U1023, Orsay (France); Universite Lyon 1, Creatis, CNRS UMR 5220, INSERM U630, INSA Lyon, Lyon (France); Boutin, Herve [CEA, DSV, IBM, Service Hospitalier Frederic Joliot, Orsay (France); Universite Paris Sud, INSERM U1023, Orsay (France); University of Manchester, Faculty of Life Sciences, Manchester (United Kingdom); Camp, Nadja van; Tavitian, Bertrand [CEA, DSV, IBM, Service Hospitalier Frederic Joliot, Orsay (France); Universite Paris Sud, INSERM U1023, Orsay (France); Thominiaux, Cyrille; Dolle, Frederic [CEA, DSV, IBM, Service Hospitalier Frederic Joliot, Orsay (France); Hantraye, Philippe [CEA, DSV, IBM, MIRCEN, Fontenay-aux-Roses (France); Rivron, Luc [Sanofi-Aventis, GMPK-Global Isotope Chemistry and Metabolite Synthesis Department (ICMS), Paris (France); Marguet, Frank; Castel, Marie-Noelle; Rooney, Thomas; Benavides, Jesus [Sanofi-Aventis, CNS Department, Paris (France)

2011-03-15

Neuroinflammation is involved in neurological disorders through the activation of microglial cells. Imaging of neuroinflammation with radioligands for the translocator protein (18 kDa) (TSPO) could prove to be an attractive biomarker for disease diagnosis and therapeutic evaluation. The indoleacetamide-derived 7-chloro-N,N,5-trimethyl-4-oxo-3-phenyl-3,5-dihydro-4H-pyridazino[4,5-b]indole-1-acetamide, SSR180575, is a selective high-affinity TSPO ligand in human and rodents with neuroprotective effects. Here we report the radiolabelling of SSR180575 with {sup 11}C and in vitro and in vivo imaging in an acute model of neuroinflammation in rats. The image contrast and the binding of [{sup 11}C]SSR180575 are higher than that obtained with the isoquinoline-based TSPO radioligand, [{sup 11}C]PK11195. Competition studies demonstrate that [{sup 11}C]SSR180575 has high specific binding for the TSPO. [{sup 11}C]SSR180575 is the first PET radioligand for the TSPO based on an indoleacetamide scaffold designed for imaging neuroinflammation in animal models and in the clinic. (orig.)
BEACON: automated tool for Bacterial GEnome Annotation ComparisON

KAUST Repository

Kalkatawi, Manal M.

2015-08-18

Background Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). Results The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACON’s utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27 %, while the number of genes without any function assignment is reduced. Conclusions We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/
BEACON: automated tool for Bacterial GEnome Annotation ComparisON.

Science.gov (United States)

Kalkatawi, Manal; Alam, Intikhab; Bajic, Vladimir B

2015-08-18

Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACON's utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27%, while the number of genes without any function assignment is reduced. We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/ .
JGI Plant Genomics Gene Annotation Pipeline

Energy Technology Data Exchange (ETDEWEB)

Shu, Shengqiang; Rokhsar, Dan; Goodstein, David; Hayes, David; Mitros, Therese

2014-07-14

Plant genomes vary in size and are highly complex with a high amount of repeats, genome duplication and tandem duplication. Gene encodes a wealth of information useful in studying organism and it is critical to have high quality and stable gene annotation. Thanks to advancement of sequencing technology, many plant species genomes have been sequenced and transcriptomes are also sequenced. To use these vastly large amounts of sequence data to make gene annotation or re-annotation in a timely fashion, an automatic pipeline is needed. JGI plant genomics gene annotation pipeline, called integrated gene call (IGC), is our effort toward this aim with aid of a RNA-seq transcriptome assembly pipeline. It utilizes several gene predictors based on homolog peptides and transcript ORFs. See Methods for detail. Here we present genome annotation of JGI flagship green plants produced by this pipeline plus Arabidopsis and rice except for chlamy which is done by a third party. The genome annotations of these species and others are used in our gene family build pipeline and accessible via JGI Phytozome portal whose URL and front page snapshot are shown below.
Annotating temporal information in clinical narratives.

Science.gov (United States)

Sun, Weiyi; Rumshisky, Anna; Uzuner, Ozlem

2013-12-01

Temporal information in clinical narratives plays an important role in patients' diagnosis, treatment and prognosis. In order to represent narrative information accurately, medical natural language processing (MLP) systems need to correctly identify and interpret temporal information. To promote research in this area, the Informatics for Integrating Biology and the Bedside (i2b2) project developed a temporally annotated corpus of clinical narratives. This corpus contains 310 de-identified discharge summaries, with annotations of clinical events, temporal expressions and temporal relations. This paper describes the process followed for the development of this corpus and discusses annotation guideline development, annotation methodology, and corpus quality. Copyright © 2013 Elsevier Inc. All rights reserved.
New genes expressed in human brains: implications for annotating evolving genomes.

Science.gov (United States)

Zhang, Yong E; Landback, Patrick; Vibranovski, Maria; Long, Manyuan

2012-11-01

New genes have frequently formed and spread to fixation in a wide variety of organisms, constituting abundant sets of lineage-specific genes. It was recently reported that an excess of primate-specific and human-specific genes were upregulated in the brains of fetuses and infants, and especially in the prefrontal cortex, which is involved in cognition. These findings reveal the prevalent addition of new genetic components to the transcriptome of the human brain. More generally, these findings suggest that genomes are continually evolving in both sequence and content, eroding the conservation endowed by common ancestry. Despite increasing recognition of the importance of new genes, we highlight here that these genes are still seriously under-characterized in functional studies and that new gene annotation is inconsistent in current practice. We propose an integrative approach to annotate new genes, taking advantage of functional and evolutionary genomic methods. We finally discuss how the refinement of new gene annotation will be important for the detection of evolutionary forces governing new gene origination. Copyright © 2012 WILEY Periodicals, Inc.
Genetic diversity of sweet sorghum germplasm in Mexico using AFLP and SSR markers

Science.gov (United States)

The objective of this work was to evaluate the diversity and genetic relationships between lines and varieties of the sweet sorghum (Sorghum bicolor) germplasm bank of the National Institute for Forestry, Agriculture and Livestock Research, Mexico, using AFLP and SSR markers. The molecular markers ...
Characterization of 10 new nuclear microsatellite markers in Acca sellowiana (Myrtaceae)1

Science.gov (United States)

Klabunde, Gustavo H. F.; Olkoski, Denise; Vilperte, Vinicius; Zucchi, Maria I.; Nodari, Rubens O.

2014-01-01

• Premise of the study: Microsatellite primers were identified and characterized in Acca sellowiana in order to expand the limited number of pre-existing polymorphic markers for use in population genetic studies for conservation, phylogeography, breeding, and domestication. • Methods and Results: A total of 10 polymorphic microsatellite primers were designed from clones obtained from a simple sequence repeat (SSR)–enriched genomic library. The primers amplified di- and trinucleotide repeats with four to 27 alleles per locus. In all tested populations, the observed heterozygosity ranged from 0.269 to 1.0. • Conclusions: These new polymorphic SSR markers will allow future genetic studies to be denser, either for genetic structure characterization of natural populations or for studies involving genetic breeding and domestication process in A. sellowiana. PMID:25202632
Surface science tools for nanomaterials characterization

CERN Document Server

2015-01-01

Fourth volume of a 40volume series on nano science and nanotechnology, edited by the renowned scientist Challa S.S.R. Kumar. This handbook gives a comprehensive overview about Surface Science Tools for Nanomaterials Characterization. Modern applications and state-of-the-art techniques are covered and make this volume an essential reading for research scientists in academia and industry.
Characterization of some bread wheat genotypes using molecular markers for drought tolerance.

Science.gov (United States)

Ateş Sönmezoğlu, Özlem; Terzi, Begüm

2018-02-01

Because of its wide geographical adaptation and importance in human nutrition, wheat is one of the most important crops in the world. However, wheat yield has reduced due to drought stress posing threat to sustainability and world food security in agricultural production. The first stage of drought tolerant variety breeding occurs on the molecular and biochemical characterization and classification of wheat genotypes. The aim of the present study is characterization of widely grown bread wheat cultivars and breeding lines for drought tolerance so as to be adapted to different regions in Turkey. The genotypes were screened with molecular markers for the presence of QTLs mapped to different chromosomes. Results of the molecular studies identified and detected 15 polymorphic SSR markers which gave the clearest PCR bands among the control genotypes. At the end of the research, bread wheat genotypes which were classified for tolerance or sensitivity to drought and the genetic similarity within control varieties were determined by molecular markers. According to SSR based dendrogram, two main groups were obtained for drought tolerance. At end of the molecular screening with SSR primers, genetic similarity coefficients were obtained that ranged from 0.14 to 0.71. The ones numbered 8 and 11 were the closest genotypes to drought tolerant cultivar Gerek 79 and the furthest genotypes from this cultivar were number 16 and to drought sensitive cultivar Sultan 95. The genotypes as drought tolerance due to their SSR markers scores are expected to provide useful information for drought related molecular breeding studies.
Facilitating functional annotation of chicken microarray data

Directory of Open Access Journals (Sweden)

Gresham Cathy R

2009-10-01

Full Text Available Abstract Background Modeling results from chicken microarray studies is challenging for researchers due to little functional annotation associated with these arrays. The Affymetrix GenChip chicken genome array, one of the biggest arrays that serve as a key research tool for the study of chicken functional genomics, is among the few arrays that link gene products to Gene Ontology (GO. However the GO annotation data presented by Affymetrix is incomplete, for example, they do not show references linked to manually annotated functions. In addition, there is no tool that facilitates microarray researchers to directly retrieve functional annotations for their datasets from the annotated arrays. This costs researchers amount of time in searching multiple GO databases for functional information. Results We have improved the breadth of functional annotations of the gene products associated with probesets on the Affymetrix chicken genome array by 45% and the quality of annotation by 14%. We have also identified the most significant diseases and disorders, different types of genes, and known drug targets represented on Affymetrix chicken genome array. To facilitate functional annotation of other arrays and microarray experimental datasets we developed an Array GO Mapper (AGOM tool to help researchers to quickly retrieve corresponding functional information for their dataset. Conclusion Results from this study will directly facilitate annotation of other chicken arrays and microarray experimental datasets. Researchers will be able to quickly model their microarray dataset into more reliable biological functional information by using AGOM tool. The disease, disorders, gene types and drug targets revealed in the study will allow researchers to learn more about how genes function in complex biological systems and may lead to new drug discovery and development of therapies. The GO annotation data generated will be available for public use via AgBase website and
Effective damping for SSR analysis of parallel turbine-generators

International Nuclear Information System (INIS)

Agrawal, B.L.; Farmer, R.G.

1988-01-01

Damping is a dominant parameter in studies to determine SSR problem severity and countermeasure requirements. To reach valid conclusions for multi-unit plants, it is essential that the net effective damping of unequally loaded units be known. For the Palo Verde Nuclear Generating Station, extensive testing and analysis have been performed to verify and develop an accurate means of determining the effective damping of unequally loaded units in parallel. This has led to a unique and simple algorithm which correlates well with two other analytic techniques
Comparison of the effectiveness of ISJ and SSR markers and detection of outlier loci in conservation genetics of Pulsatilla patens populations.

Science.gov (United States)

Bilska, Katarzyna; Szczecińska, Monika

2016-01-01

Research into the protection of rare and endangered plant species involves genetic analyses to determine their genetic variation and genetic structure. Various categories of genetic markers are used for this purpose. Microsatellites, also known as simple sequence repeats (SSR), are the most popular category of markers in population genetics research. In most cases, microsatellites account for a large part of the noncoding DNA and exert a neutral effect on the genome. Neutrality is a desirable feature in evaluations of genetic differences between populations, but it does not support analyses of a population's ability to adapt to a given environment or its evolutionary potential. Despite the numerous advantages of microsatellites, non-neutral markers may supply important information in conservation genetics research. They are used to evaluate adaptation to specific environmental conditions and a population's adaptive potential. The aim of this study was to compare the level of genetic variation in Pulsatilla patens populations revealed by neutral SSR markers and putatively adaptive ISJ markers (intron-exon splice junction). The experiment was conducted on 14 Polish populations of P. patens and three P. patens populations from the nearby region of Vitebsk in Belarus. A total of 345 individuals were examined. Analyses were performed with the use of eight SSR primers specific to P. patens and three ISJ primers. SSR markers revealed a higher level of genetic variation than ISJ markers ( H e = 0.609, H e = 0.145, respectively). An analysis of molecular variance (AMOVA) revealed that, the overall genetic diversity between the analyzed populations defined by parameters F ST and Φ PT for SSR (20%) and Φ PT for ISJ (21%) markers was similar. Analysis conducted in the Structure program divided analyzed populations into two groups (SSR loci) and three groups (ISJ markers). Mantel test revealed correlations between the geographic distance and genetic diversity of Polish
Dictionary-driven protein annotation.

Science.gov (United States)

Rigoutsos, Isidore; Huynh, Tien; Floratos, Aris; Parida, Laxmi; Platt, Daniel

2002-09-01

Computational methods seeking to automatically determine the properties (functional, structural, physicochemical, etc.) of a protein directly from the sequence have long been the focus of numerous research groups. With the advent of advanced sequencing methods and systems, the number of amino acid sequences that are being deposited in the public databases has been increasing steadily. This has in turn generated a renewed demand for automated approaches that can annotate individual sequences and complete genomes quickly, exhaustively and objectively. In this paper, we present one such approach that is centered around and exploits the Bio-Dictionary, a collection of amino acid patterns that completely covers the natural sequence space and can capture functional and structural signals that have been reused during evolution, within and across protein families. Our annotation approach also makes use of a weighted, position-specific scoring scheme that is unaffected by the over-representation of well-conserved proteins and protein fragments in the databases used. For a given query sequence, the method permits one to determine, in a single pass, the following: local and global similarities between the query and any protein already present in a public database; the likeness of the query to all available archaeal/ bacterial/eukaryotic/viral sequences in the database as a function of amino acid position within the query; the character of secondary structure of the query as a function of amino acid position within the query; the cytoplasmic, transmembrane or extracellular behavior of the query; the nature and position of binding domains, active sites, post-translationally modified sites, signal peptides, etc. In terms of performance, the proposed method is exhaustive, objective and allows for the rapid annotation of individual sequences and full genomes. Annotation examples are presented and discussed in Results, including individual queries and complete genomes that were
Use of Simple Sequence Repeat (SSR) markers for DNA fingerprinting and diversity analysis of sugarcane (Saccharum spp.) cultivars resistant and susceptible to red rot

Science.gov (United States)

In recent years SSR markers have been used widely for the genetic analysis. The objective of present research was to use SSR markers to develop DNA-based genetic identification and analyze genetic relationship of sugarcane cultivars grown in Pakistan either resistant or susceptible to red rot. Twent...
Annotated outline for the SCP conceptual design report: Office of Geologic Repositories

International Nuclear Information System (INIS)

1987-06-01

The Nuclear Waste Policy Act of 1982 (NWPA) requires that site characterization plans (SCPs) be submitted to the Nuclear Regulatory Commission (NRC), affected States and Indian tribes, and the general public for review and comment prior to the sinking of shafts at a candidate repository site. The SCP is also required by the NRC licensing procedures for the disposal of high-level waste. An Annotated Outline (AO) for Site Characterization Plans (OGR/B-5) has been prepared to provide DOE's standard format and guidance for preparation of SCPs. Consistent with the AO for SCPs. Chapter 6 of the SCP is to provide the requirements and references the media-specific design data base, describe the current design concepts, and discuss design information needs. In order to develop this design information, the Office of Geologic Repositories program is planning a SCP conceptual design phase as part of the overall repository design process. This phase is the first step in the design process, and the result and design can be expected to change as the program moves through the site characterization phase. The Annotated Outline which follows provides the standard format and guidance for the preparation of the SCP Conceptual Design Reports. It is considered to meet the intent of NRC's proposed Generic Technical Position philosophy contained therein. The SCP Conceptual Design Report will be the primary basis for preparation of Chapter 6 of the SCP and will be stand-alone reference document for the SCP. Appendix 1 to this Annotated Outline provides a correlation between Chapter 6 of the SCP and SCP Conceptual Design Report for the information purposes
The effectiveness of annotated (vs. non-annotated) digital pathology slides as a teaching tool during dermatology and pathology residencies.

Science.gov (United States)

Marsch, Amanda F; Espiritu, Baltazar; Groth, John; Hutchens, Kelli A

2014-06-01

With today's technology, paraffin-embedded, hematoxylin & eosin-stained pathology slides can be scanned to generate high quality virtual slides. Using proprietary software, digital images can also be annotated with arrows, circles and boxes to highlight certain diagnostic features. Previous studies assessing digital microscopy as a teaching tool did not involve the annotation of digital images. The objective of this study was to compare the effectiveness of annotated digital pathology slides versus non-annotated digital pathology slides as a teaching tool during dermatology and pathology residencies. A study group composed of 31 dermatology and pathology residents was asked to complete an online pre-quiz consisting of 20 multiple choice style questions, each associated with a static digital pathology image. After completion, participants were given access to an online tutorial composed of digitally annotated pathology slides and subsequently asked to complete a post-quiz. A control group of 12 residents completed a non-annotated version of the tutorial. Nearly all participants in the study group improved their quiz score, with an average improvement of 17%, versus only 3% (P = 0.005) in the control group. These results support the notion that annotated digital pathology slides are superior to non-annotated slides for the purpose of resident education. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Irradiation facilities on the TRIGA-SSR thermal column

Energy Technology Data Exchange (ETDEWEB)

Roth, C; Aioanei, L; Preda, M; Gugiu, D [Institute for Nuclear Research, Pitesti (Romania); Garlea, I; Kelerman, C; Garlea, C [SENDRA ' Nuclear Technologies' ltd. Bucharest (Romania)

2004-07-01

The development of thermal and intermediate energy neutron irradiation facilities at the steady state core of the Romanian TRIGA Reactor is described. The reference thermal neutron irradiation facility consists of a dry spherical cavity placed into the graphite thermal column of the SSR core and the intermediate energy neutron irradiation facility is a {sigma}{sigma} system located into the thermal flux cavity. The implementation of the irradiation facilities into the under-water thermal column represented an important challenge from the standpoint of instrumentation solutions. The neutron flux and spectrum measurements were performed using foil activation techniques and fission rate measurements by sealed fission chambers, followed by spectrum unfolding procedure. The absolute fission reaction measurements, using calibrated fission chambers, allow the neutron flux density unit transfer from international reference neutron fields. The MCNP-4C code package was used for neutron spectrum computations in the thermal flux cavity and in the {sigma}{sigma} system. The neutron characterization program demonstrates the accuracy of the spectrum characteristics and neutron flux densities reported to the local monitoring system count rates. Some discrepancies, as compared to other similar facilities, were identified and discussed. These are caused by thermal column particularities: the presence of a water layer between the graphite cells (thermal neutron absorption) and smaller geometrical dimensions (neutron escape phenomena). Based on these results the metrological certification process, according to Romanian metrological laws requirements, is now in progress. (nevyjel)
Automatic annotation of head velocity and acceleration in Anvil

DEFF Research Database (Denmark)

Jongejan, Bart

2012-01-01

We describe an automatic face tracker plugin for the ANVIL annotation tool. The face tracker produces data for velocity and for acceleration in two dimensions. We compare the annotations generated by the face tracking algorithm with independently made manual annotations for head movements....... The annotations are a useful supplement to manual annotations and may help human annotators to quickly and reliably determine onset of head movements and to suggest which kind of head movement is taking place....
Analysis of genetic diversity among rapeseed cultivars and breeding lines by srap and ssr molecular markers

International Nuclear Information System (INIS)

Channa, S.A.; Tian, H.

2016-01-01

The knowledge of genetic diversity is very important for developing new rapeseed (Brassica napus L.) cultivars. The genetic diversity among 77 rapeseed accessions, including 22 varieties and 55 advanced breeding lines were analyzed by 47 sequence-related amplified polymorphism (SRAP) and 56 simple sequence repeat (SSR) primers. A total of 270 SRAP and 194 SSR polymorphic fragments were detected with an average of 5.74 and 3.46 for SRAP and SSR primer, respectively. The cluster analysis grouped the 77 accessions into five major clusters. Cluster I contained spring and winter type varieties from Czech Republic and semi-winter varieties and their respective breeding lines from China. The 16 elite breeding lines discovered in Cluster II, III, IV and V indicated higher genetic distance than accessions in Cluster I. The principal component analysis and structure analysis exhibited similar results to the cluster analysis. Analysis of molecular variance revealed that genetic diversity of the selected breeding lines was comparable to the rapeseed varieties, and variation among varieties and lines was significant. The diverse and unique group of 16 elite breeding lines detected in this study can be utilized in the future breeding program as a source for development of commercial varieties with more desirable characters. (author)

[SSR analysis on stress effect of transgenic hybrid poplar 741 on Clostera anachoreta (Fabricius) (Lepidoptera: Notodontidae)].

Science.gov (United States)

Liu, Jun Xia; Song, Xiao Ying; Jiang, Wen Hu; Zhou, Guo Na; Gao, Bao Jia

2016-12-01

The genetic differentiation of the experimental population of Clostera anachoreta fed on different resistant transgenic 741 poplar leaves was analyzed by SSR molecular marker technique to investigate stress effect of transgenic poplar Bt gene as food on target insect. The experimental population of C. anachoreta fed on transgenic 741 poplar high resistant strains 'Pb29', medium resis-tant strains 'Pb17' and non-transgenic poplar (CK), and the screened ten pairs of SSR primers were used. The results showed that 76 alleles were observed in ten pairs of primers. The average allele was 7.6, the average effective number of alleles was 2.2, the average observed heterozygosity was 0.5167, the average expected heterozygosity was 0.5167, and the average percentage of polymorphic loci was 96.7%. The genetic diversity level of C. anachoreta experimental population fed on transgenic poplar 741 was significantly higher than that fed on non-transgenic populations, and C. anachoreta fed on high resistance had the lowest genetic similarity with CK samples, which showed an increasing trend of the genetic diversity of the experimental population fed on transgenic Bt poplar. It was thus clear that transgenic hybrid poplar 741 had stress effects on genetic differentiation of C. anachoreta experimental population by SSR.
Mesotext. Framing and exploring annotations

NARCIS (Netherlands)

Boot, P.; Boot, P.; Stronks, E.

2007-01-01

From the introduction: Annotation is an important item on the wish list for digital scholarly tools. It is one of John Unsworth’s primitives of scholarship (Unsworth 2000). Especially in linguistics,a number of tools have been developed that facilitate the creation of annotations to source material
Annotating images by mining image search results

NARCIS (Netherlands)

Wang, X.J.; Zhang, L.; Li, X.; Ma, W.Y.

2008-01-01

Although it has been studied for years by the computer vision and machine learning communities, image annotation is still far from practical. In this paper, we propose a novel attempt at model-free image annotation, which is a data-driven approach that annotates images by mining their search
Novel STATCOM Controller for Mitigating SSR and Damping Power System Oscillations in a Series Compensated Wind Parks

DEFF Research Database (Denmark)

Bak-Jensen, Birgitte; El-Moursi, M. S.; Abdel-Rahman, Mansour Hassan

2010-01-01

This paper addresses implementation issues associated with a novel damping control algorithm for a STATCOM in a series compensated wind park for mitigating SSR (subsynchronous resonance) and damping power system oscillations. The IEEE first benchmark model on subsynchronous resonance is adopted...... the SSR, damping the power system oscillation and enhancing the transient stability margin in response to different SCRs....... in the STATCOM control structure. The performances of the controllers are tested in steady state operation and in response to system contingencies, taking into account the impact of short circuit ratios (SCRs). Simulation results are presented to demonstrate the capability of the controllers for mitigating...
In vivo imaging of neuro-inflammation in the rodent brain with [11C]SSR180575, a novel indoleacetamide radioligand of the translocator protein (18 kDa)

International Nuclear Information System (INIS)

Chauveau, F.; Boutin, H.; Van Camp, N.; Thominiaux, C.; Dolle, F.; Tavitian, B.; Chauveau, F.; Boutin, H.; Van Camp, N.; Tavitian, B.; Hantraye, P.; Rivron, L.; Marguet, F.; Castel, M.N.; Rooney, T.; Benavides, J.; Chauveau, F.; Boutin, H.

2011-01-01

Purpose Neuro-inflammation is involved in neurological disorders through the activation of micro-glial cells. Imaging of neuro-inflammation with radioligands for the translocator protein (18 kDa) (TSPO) could prove to be an attractive bio-marker for disease diagnosis and therapeutic evaluation. The indoleacetamide-derived 7-chloro-N, N, 5- trimethyl-4-oxo-3-phenyl-3, 5-dihydro-4H-pyridazino[4, 5-b] indole-1-acetamide, SSR180575, is a selective high-affinity TSPO ligand in human and rodents with neuro-protective effects. Methods Here we report the radiolabelling of SSR180575 with 11 C and in vitro and in vivo imaging in an acute model of neuro-inflammation in rats. Results The image contrast and the binding of [ 11 C] SSR180575 are higher than that obtained with the isoquinoline- based TSPO radioligand, [ 11 C]PK11195. Competition studies demonstrate that [ 11 C]SSR180575 has high specific binding for the TSPO. Conclusion [ 11 C]SSR180575 is the first PET radioligand for the TSPO based on an indoleacetamide scaffold designed for imaging neuro-inflammation in animal models and in the clinic. (authors)
WormBase: Annotating many nematode genomes.

Science.gov (United States)

Howe, Kevin; Davis, Paul; Paulini, Michael; Tuli, Mary Ann; Williams, Gary; Yook, Karen; Durbin, Richard; Kersey, Paul; Sternberg, Paul W

2012-01-01

WormBase (www.wormbase.org) has been serving the scientific community for over 11 years as the central repository for genomic and genetic information for the soil nematode Caenorhabditis elegans. The resource has evolved from its beginnings as a database housing the genomic sequence and genetic and physical maps of a single species, and now represents the breadth and diversity of nematode research, currently serving genome sequence and annotation for around 20 nematodes. In this article, we focus on WormBase's role of genome sequence annotation, describing how we annotate and integrate data from a growing collection of nematode species and strains. We also review our approaches to sequence curation, and discuss the impact on annotation quality of large functional genomics projects such as modENCODE.
Teaching and Learning Communities through Online Annotation

Science.gov (United States)

van der Pluijm, B.

2016-12-01

What do colleagues do with your assigned textbook? What they say or think about the material? Want students to be more engaged in their learning experience? If so, online materials that complement standard lecture format provide new opportunity through managed, online group annotation that leverages the ubiquity of internet access, while personalizing learning. The concept is illustrated with the new online textbook "Processes in Structural Geology and Tectonics", by Ben van der Pluijm and Stephen Marshak, which offers a platform for sharing of experiences, supplementary materials and approaches, including readings, mathematical applications, exercises, challenge questions, quizzes, alternative explanations, and more. The annotation framework used is Hypothes.is, which offers a free, open platform markup environment for annotation of websites and PDF postings. The annotations can be public, grouped or individualized, as desired, including export access and download of annotations. A teacher group, hosted by a moderator/owner, limits access to members of a user group of teachers, so that its members can use, copy or transcribe annotations for their own lesson material. Likewise, an instructor can host a student group that encourages sharing of observations, questions and answers among students and instructor. Also, the instructor can create one or more closed groups that offers study help and hints to students. Options galore, all of which aim to engage students and to promote greater responsibility for their learning experience. Beyond new capacity, the ability to analyze student annotation supports individual learners and their needs. For example, student notes can be analyzed for key phrases and concepts, and identify misunderstandings, omissions and problems. Also, example annotations can be shared to enhance notetaking skills and to help with studying. Lastly, online annotation allows active application to lecture posted slides, supporting real-time notetaking
Displaying Annotations for Digitised Globes

Science.gov (United States)

Gede, Mátyás; Farbinger, Anna

2018-05-01

Thanks to the efforts of the various globe digitising projects, nowadays there are plenty of old globes that can be examined as 3D models on the computer screen. These globes usually contain a lot of interesting details that an average observer would not entirely discover for the first time. The authors developed a website that can display annotations for such digitised globes. These annotations help observers of the globe to discover all the important, interesting details. Annotations consist of a plain text title, a HTML formatted descriptive text and a corresponding polygon and are stored in KML format. The website is powered by the Cesium virtual globe engine.
THE DIMENSIONS OF COMPOSITION ANNOTATION.

Science.gov (United States)

MCCOLLY, WILLIAM

ENGLISH TEACHER ANNOTATIONS WERE STUDIED TO DETERMINE THE DIMENSIONS AND PROPERTIES OF THE ENTIRE SYSTEM FOR WRITING CORRECTIONS AND CRITICISMS ON COMPOSITIONS. FOUR SETS OF COMPOSITIONS WERE WRITTEN BY STUDENTS IN GRADES 9 THROUGH 13. TYPESCRIPTS OF THE COMPOSITIONS WERE ANNOTATED BY CLASSROOM ENGLISH TEACHERS. THEN, 32 ENGLISH TEACHERS JUDGED…
Evaluation of three automated genome annotations for Halorhabdus utahensis.

Directory of Open Access Journals (Sweden)

Peter Bakke

2009-07-01

Full Text Available Genome annotations are accumulating rapidly and depend heavily on automated annotation systems. Many genome centers offer annotation systems but no one has compared their output in a systematic way to determine accuracy and inherent errors. Errors in the annotations are routinely deposited in databases such as NCBI and used to validate subsequent annotation errors. We submitted the genome sequence of halophilic archaeon Halorhabdus utahensis to be analyzed by three genome annotation services. We have examined the output from each service in a variety of ways in order to compare the methodology and effectiveness of the annotations, as well as to explore the genes, pathways, and physiology of the previously unannotated genome. The annotation services differ considerably in gene calls, features, and ease of use. We had to manually identify the origin of replication and the species-specific consensus ribosome-binding site. Additionally, we conducted laboratory experiments to test H. utahensis growth and enzyme activity. Current annotation practices need to improve in order to more accurately reflect a genome's biological potential. We make specific recommendations that could improve the quality of microbial annotation projects.
Development of Pineapple Microsatellite Markers and Germplasm Genetic Diversity Analysis

Directory of Open Access Journals (Sweden)

Suping Feng

2013-01-01

Full Text Available Two methods were used to develop pineapple microsatellite markers. Genomic library-based SSR development: using selectively amplified microsatellite assay, 86 sequences were generated from pineapple genomic library. 91 (96.8% of the 94 Simple Sequence Repeat (SSR loci were dinucleotide repeats (39 AC/GT repeats and 52 GA/TC repeats, accounting for 42.9% and 57.1%, resp., and the other three were mononucleotide repeats. Thirty-six pairs of SSR primers were designed; 24 of them generated clear bands of expected sizes, and 13 of them showed polymorphism. EST-based SSR development: 5659 pineapple EST sequences obtained from NCBI were analyzed; among 1397 nonredundant EST sequences, 843 were found containing 1110 SSR loci (217 of them contained more than one SSR locus. Frequency of SSRs in pineapple EST sequences is 1SSR/3.73 kb, and 44 types were found. Mononucleotide, dinucleotide, and trinucleotide repeats dominate, accounting for 95.6% in total. AG/CT and AGC/GCT were the dominant type of dinucleotide and trinucleotide repeats, accounting for 83.5% and 24.1%, respectively. Thirty pairs of primers were designed for each of randomly selected 30 sequences; 26 of them generated clear and reproducible bands, and 22 of them showed polymorphism. Eighteen pairs of primers obtained by the one or the other of the two methods above that showed polymorphism were selected to carry out germplasm genetic diversity analysis for 48 breeds of pineapple; similarity coefficients of these breeds were between 0.59 and 1.00, and they can be divided into four groups accordingly. Amplification products of five SSR markers were extracted and sequenced, corresponding repeat loci were found and locus mutations are mainly in copy number of repeats and base mutations in the flanking region.
Development of Pineapple Microsatellite Markers and Germplasm Genetic Diversity Analysis

Science.gov (United States)

Tong, Helin; Chen, You; Wang, Jingyi; Chen, Yeyuan; Sun, Guangming; He, Junhu; Wu, Yaoting

2013-01-01

Two methods were used to develop pineapple microsatellite markers. Genomic library-based SSR development: using selectively amplified microsatellite assay, 86 sequences were generated from pineapple genomic library. 91 (96.8%) of the 94 Simple Sequence Repeat (SSR) loci were dinucleotide repeats (39 AC/GT repeats and 52 GA/TC repeats, accounting for 42.9% and 57.1%, resp.), and the other three were mononucleotide repeats. Thirty-six pairs of SSR primers were designed; 24 of them generated clear bands of expected sizes, and 13 of them showed polymorphism. EST-based SSR development: 5659 pineapple EST sequences obtained from NCBI were analyzed; among 1397 nonredundant EST sequences, 843 were found containing 1110 SSR loci (217 of them contained more than one SSR locus). Frequency of SSRs in pineapple EST sequences is 1SSR/3.73 kb, and 44 types were found. Mononucleotide, dinucleotide, and trinucleotide repeats dominate, accounting for 95.6% in total. AG/CT and AGC/GCT were the dominant type of dinucleotide and trinucleotide repeats, accounting for 83.5% and 24.1%, respectively. Thirty pairs of primers were designed for each of randomly selected 30 sequences; 26 of them generated clear and reproducible bands, and 22 of them showed polymorphism. Eighteen pairs of primers obtained by the one or the other of the two methods above that showed polymorphism were selected to carry out germplasm genetic diversity analysis for 48 breeds of pineapple; similarity coefficients of these breeds were between 0.59 and 1.00, and they can be divided into four groups accordingly. Amplification products of five SSR markers were extracted and sequenced, corresponding repeat loci were found and locus mutations are mainly in copy number of repeats and base mutations in the flanking region. PMID:24024187
MimoSA: a system for minimotif annotation

Directory of Open Access Journals (Sweden)

Kundeti Vamsi

2010-06-01

Full Text Available Abstract Background Minimotifs are short peptide sequences within one protein, which are recognized by other proteins or molecules. While there are now several minimotif databases, they are incomplete. There are reports of many minimotifs in the primary literature, which have yet to be annotated, while entirely novel minimotifs continue to be published on a weekly basis. Our recently proposed function and sequence syntax for minimotifs enables us to build a general tool that will facilitate structured annotation and management of minimotif data from the biomedical literature. Results We have built the MimoSA application for minimotif annotation. The application supports management of the Minimotif Miner database, literature tracking, and annotation of new minimotifs. MimoSA enables the visualization, organization, selection and editing functions of minimotifs and their attributes in the MnM database. For the literature components, Mimosa provides paper status tracking and scoring of papers for annotation through a freely available machine learning approach, which is based on word correlation. The paper scoring algorithm is also available as a separate program, TextMine. Form-driven annotation of minimotif attributes enables entry of new minimotifs into the MnM database. Several supporting features increase the efficiency of annotation. The layered architecture of MimoSA allows for extensibility by separating the functions of paper scoring, minimotif visualization, and database management. MimoSA is readily adaptable to other annotation efforts that manually curate literature into a MySQL database. Conclusions MimoSA is an extensible application that facilitates minimotif annotation and integrates with the Minimotif Miner database. We have built MimoSA as an application that integrates dynamic abstract scoring with a high performance relational model of minimotif syntax. MimoSA's TextMine, an efficient paper-scoring algorithm, can be used to
Genetic Assessment of Moroccan Tomato (Solanum lycopersicum L. Genotypes by RAPD and SSR Markers

Directory of Open Access Journals (Sweden)

Rajae Amraoui

2017-06-01

Full Text Available For the first time eight local tomato cultivars collected from four different regions of Morocco were assessed with RAPD and SSR methods. Most of RAPD markers give monomorphic banding profiles. Only OPU03 marker showed a total of 4 polymorphic amplicons out of 8 recorded in FIGUIG2 cultivar. The analysis with SSR markers gives more polymorphism. The number of alleles amplified assessed from 2 to 5 alleles among cultivars. The similarity matrix subjected by the unweighted pairgroup arithmetic method (UPGMA clustering grouped the cultivars in four groups where FIGUIG2 cultivar formed a separate and more distant cluster. In addition this cultivar holds the very high percentage of uniformity (99% indicating that is an homogeneous traditional cultivar with high purity. This genotype can be conserved and used in breeding programs. More traditional Moroccan cultivars must be collected in order to determine their genetic structure.
Annotate-it: a Swiss-knife approach to annotation, analysis and interpretation of single nucleotide variation in human disease.

Science.gov (United States)

Sifrim, Alejandro; Van Houdt, Jeroen Kj; Tranchevent, Leon-Charles; Nowakowska, Beata; Sakai, Ryo; Pavlopoulos, Georgios A; Devriendt, Koen; Vermeesch, Joris R; Moreau, Yves; Aerts, Jan

2012-01-01

The increasing size and complexity of exome/genome sequencing data requires new tools for clinical geneticists to discover disease-causing variants. Bottlenecks in identifying the causative variation include poor cross-sample querying, constantly changing functional annotation and not considering existing knowledge concerning the phenotype. We describe a methodology that facilitates exploration of patient sequencing data towards identification of causal variants under different genetic hypotheses. Annotate-it facilitates handling, analysis and interpretation of high-throughput single nucleotide variant data. We demonstrate our strategy using three case studies. Annotate-it is freely available and test data are accessible to all users at http://www.annotate-it.org.
Annotated chemical patent corpus: a gold standard for text mining.

Directory of Open Access Journals (Sweden)

Saber A Akhondi

Full Text Available Exploring the chemical and biological space covered by patent applications is crucial in early-stage medicinal chemistry activities. Patent analysis can provide understanding of compound prior art, novelty checking, validation of biological assays, and identification of new starting points for chemical exploration. Extracting chemical and biological entities from patents through manual extraction by expert curators can take substantial amount of time and resources. Text mining methods can help to ease this process. To validate the performance of such methods, a manually annotated patent corpus is essential. In this study we have produced a large gold standard chemical patent corpus. We developed annotation guidelines and selected 200 full patents from the World Intellectual Property Organization, United States Patent and Trademark Office, and European Patent Office. The patents were pre-annotated automatically and made available to four independent annotator groups each consisting of two to ten annotators. The annotators marked chemicals in different subclasses, diseases, targets, and modes of action. Spelling mistakes and spurious line break due to optical character recognition errors were also annotated. A subset of 47 patents was annotated by at least three annotator groups, from which harmonized annotations and inter-annotator agreement scores were derived. One group annotated the full set. The patent corpus includes 400,125 annotations for the full set and 36,537 annotations for the harmonized set. All patents and annotated entities are publicly available at www.biosemantics.org.
Development and characterization of microsatellite markers for the Cape gooseberry Physalis peruviana.

Science.gov (United States)

Simbaqueba, Jaime; Sánchez, Pilar; Sanchez, Erika; Núñez Zarantes, Victor Manuel; Chacon, Maria Isabel; Barrero, Luz Stella; Mariño-Ramírez, Leonardo

2011-01-01

Physalis peruviana, commonly known as Cape gooseberry, is an Andean Solanaceae fruit with high nutritional value and interesting medicinal properties. In the present study we report the development and characterization of microsatellite loci from a P. peruviana commercial Colombian genotype. We identified 932 imperfect and 201 perfect Simple Sequence Repeats (SSR) loci in untranslated regions (UTRs) and 304 imperfect and 83 perfect SSR loci in coding regions from the assembled Physalis peruviana leaf transcriptome. The UTR SSR loci were used for the development of 162 primers for amplification. The efficiency of these primers was tested via PCR in a panel of seven P. peruviana accessions including Colombia, Kenya and Ecuador ecotypes and one closely related species Physalis floridana. We obtained an amplification rate of 83% and a polymorphic rate of 22%. Here we report the first P. peruviana specific microsatellite set, a valuable tool for a wide variety of applications, including functional diversity, conservation and improvement of the species.
New Hypervariable SSR Markers for Diversity Analysis, Hybrid Purity Testing and Trait Mapping in Pigeonpea [Cajanus cajan (L.) Millspaugh].

Science.gov (United States)

Bohra, Abhishek; Jha, Rintu; Pandey, Gaurav; Patil, Prakash G; Saxena, Rachit K; Singh, Indra P; Singh, D; Mishra, R K; Mishra, Ankita; Singh, F; Varshney, Rajeev K; Singh, N P

2017-01-01

Draft genome sequence in pigeonpea offers unprecedented opportunities for genomics assisted crop improvement via enabling access to genome-wide genetic markers. In the present study, 421 hypervariable simple sequence repeat (SSR) markers from the pigeonpea genome were screened on a panel of eight pigeonpea genotypes yielding marker validation and polymorphism percentages of 95.24 and 54.11%, respectively. The SSR marker assay uncovered a total of 570 alleles with three as an average number of alleles per marker. Similarly, the mean values for gene diversity and PIC were 0.44 and 0.37, respectively. The number of polymorphic markers ranged from 39 to 89 for different parental combinations. Further, 60 of these SSRs were assayed on 94 genotypes, and model based clustering using STRUCTURE resulted in the identification of the two subpopulations ( K = 2). This remained in close agreement with the clustering patterns inferred from genetic distance (GD)-based approaches i.e., dendrogram, factorial and principal coordinate analysis (PCoA). The AMOVA accounted majority of the genetic variation within groups (89%) in comparison to the variation existing between the groups (11%). A subset of these markers was implicated for hybrid purity testing. We also demonstrated utility of these SSR markers in trait mapping through association and bi-parental linkage analyses. The general linear (GLM) and mixed linear (MLM) models both detected a single SSR marker (CcGM03681) with R 2 = 16.4 as associated with the resistance to Fusarium wilt variant 2. Similarly, by using SSR data in a segregating backcross population, the corresponding restorer-of-fertility ( Rf ) locus was putatively mapped at 39 cM with the marker CcGM08896. However, The marker-trait associations (MTAs) detected here represent a very preliminary type and hence demand deeper investigations for conclusive evidence. Given their ability to reveal polymorphism in simple agarose gels, the hypervariable SSRs are valuable
Comparison of 454-ESTs from Huperzia serrata and Phlegmariurus carinatus reveals putative genes involved in lycopodium alkaloid biosynthesis and developmental regulation

Directory of Open Access Journals (Sweden)

Steinmetz André

2010-09-01

Full Text Available Abstract Background Plants of the Huperziaceae family, which comprise the two genera Huperzia and Phlegmariurus, produce various types of lycopodium alkaloids that are used to treat a number of human ailments, such as contusions, swellings and strains. Huperzine A, which belongs to the lycodine type of lycopodium alkaloids, has been used as an anti-Alzheimer's disease drug candidate. Despite their medical importance, little genomic or transcriptomic data are available for the members of this family. We used massive parallel pyrosequencing on the Roche 454-GS FLX Titanium platform to generate a substantial EST dataset for Huperzia serrata (H. serrata and Phlegmariurus carinatus (P. carinatus as representative members of the Huperzia and Phlegmariurus genera, respectively. H. serrata and P. carinatus are important plants for research on the biosynthesis of lycopodium alkaloids. We focused on gene discovery in the areas of bioactive compound biosynthesis and transcriptional regulation as well as genetic marker detection in these species. Results For H. serrata, 36,763 unique putative transcripts were generated from 140,930 reads totaling over 57,028,559 base pairs; for P. carinatus, 31,812 unique putative transcripts were generated from 79,920 reads totaling over 30,498,684 base pairs. Using BLASTX searches of public databases, 16,274 (44.3% unique putative transcripts from H. serrata and 14,070 (44.2% from P. carinatus were assigned to at least one protein. Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG orthology annotations revealed that the functions of the unique putative transcripts from these two species cover a similarly broad set of molecular functions, biological processes and biochemical pathways. In particular, a total of 20 H. serrata candidate cytochrome P450 genes, which are more abundant in leaves than in roots and might be involved in lycopodium alkaloid biosynthesis, were found based on the comparison of H
Genetic diversity analysis of Amomum tsao-ko in Jinping County of Yunnan Province using SSR markers

Science.gov (United States)

Ma, Mengli; Wang, Tiantao; Lei, En; Meng, Hengling; Xie, Linyan; Zhu, Kunlong; Duan, Shaoze; Li, Wenqiang; Lu, Bingyue

2017-08-01

Genetic diversity analysis is very important for germplasm resources conservation and utilization. The objective of this study was to assess the genetic diversity among 44 individuals of Amomum tsao-ko from Jinping County of Yunnan Province using 5 selected SSR (simple sequence repeat) markers. A total of 23 polymorphic loci were detected among these germplasms, with an average of 4.6 polymorphic loci per SSR primer combination. The percentage of polymorphic loci was 100%, whereas the mean effective number of alleles (Ne), observed heterozygosity(Ho), expected heterozygosity (He), Shannon's information index (I), and the mean polymorphism information content (PIC) were 3. 410, 0. 491, 0. 679, 1.266 and 0. 672, respectively, indicating that the Amomum tsao-ko germplasms from Jinping County had high genetic diversity.

Systematic interpretation of microarray data using experiment annotations

Directory of Open Access Journals (Sweden)

Frohme Marcus

2006-12-01

Full Text Available Abstract Background Up to now, microarray data are mostly assessed in context with only one or few parameters characterizing the experimental conditions under study. More explicit experiment annotations, however, are highly useful for interpreting microarray data, when available in a statistically accessible format. Results We provide means to preprocess these additional data, and to extract relevant traits corresponding to the transcription patterns under study. We found correspondence analysis particularly well-suited for mapping such extracted traits. It visualizes associations both among and between the traits, the hereby annotated experiments, and the genes, revealing how they are all interrelated. Here, we apply our methods to the systematic interpretation of radioactive (single channel and two-channel data, stemming from model organisms such as yeast and drosophila up to complex human cancer samples. Inclusion of technical parameters allows for identification of artifacts and flaws in experimental design. Conclusion Biological and clinical traits can act as landmarks in transcription space, systematically mapping the variance of large datasets from the predominant changes down toward intricate details.
Diverse Image Annotation

KAUST Repository

Wu, Baoyuan

2017-11-09

In this work we study the task of image annotation, of which the goal is to describe an image using a few tags. Instead of predicting the full list of tags, here we target for providing a short list of tags under a limited number (e.g., 3), to cover as much information as possible of the image. The tags in such a short list should be representative and diverse. It means they are required to be not only corresponding to the contents of the image, but also be different to each other. To this end, we treat the image annotation as a subset selection problem based on the conditional determinantal point process (DPP) model, which formulates the representation and diversity jointly. We further explore the semantic hierarchy and synonyms among the candidate tags, and require that two tags in a semantic hierarchy or in a pair of synonyms should not be selected simultaneously. This requirement is then embedded into the sampling algorithm according to the learned conditional DPP model. Besides, we find that traditional metrics for image annotation (e.g., precision, recall and F1 score) only consider the representation, but ignore the diversity. Thus we propose new metrics to evaluate the quality of the selected subset (i.e., the tag list), based on the semantic hierarchy and synonyms. Human study through Amazon Mechanical Turk verifies that the proposed metrics are more close to the humans judgment than traditional metrics. Experiments on two benchmark datasets show that the proposed method can produce more representative and diverse tags, compared with existing image annotation methods.
Diverse Image Annotation

KAUST Repository

Wu, Baoyuan; Jia, Fan; Liu, Wei; Ghanem, Bernard

2017-01-01

In this work we study the task of image annotation, of which the goal is to describe an image using a few tags. Instead of predicting the full list of tags, here we target for providing a short list of tags under a limited number (e.g., 3), to cover as much information as possible of the image. The tags in such a short list should be representative and diverse. It means they are required to be not only corresponding to the contents of the image, but also be different to each other. To this end, we treat the image annotation as a subset selection problem based on the conditional determinantal point process (DPP) model, which formulates the representation and diversity jointly. We further explore the semantic hierarchy and synonyms among the candidate tags, and require that two tags in a semantic hierarchy or in a pair of synonyms should not be selected simultaneously. This requirement is then embedded into the sampling algorithm according to the learned conditional DPP model. Besides, we find that traditional metrics for image annotation (e.g., precision, recall and F1 score) only consider the representation, but ignore the diversity. Thus we propose new metrics to evaluate the quality of the selected subset (i.e., the tag list), based on the semantic hierarchy and synonyms. Human study through Amazon Mechanical Turk verifies that the proposed metrics are more close to the humans judgment than traditional metrics. Experiments on two benchmark datasets show that the proposed method can produce more representative and diverse tags, compared with existing image annotation methods.
Annotating individual human genomes.

Science.gov (United States)

Torkamani, Ali; Scott-Van Zeeland, Ashley A; Topol, Eric J; Schork, Nicholas J

2011-10-01

Advances in DNA sequencing technologies have made it possible to rapidly, accurately and affordably sequence entire individual human genomes. As impressive as this ability seems, however, it will not likely amount to much if one cannot extract meaningful information from individual sequence data. Annotating variations within individual genomes and providing information about their biological or phenotypic impact will thus be crucially important in moving individual sequencing projects forward, especially in the context of the clinical use of sequence information. In this paper we consider the various ways in which one might annotate individual sequence variations and point out limitations in the available methods for doing so. It is arguable that, in the foreseeable future, DNA sequencing of individual genomes will become routine for clinical, research, forensic, and personal purposes. We therefore also consider directions and areas for further research in annotating genomic variants. Copyright © 2011 Elsevier Inc. All rights reserved.
ANNOTATING INDIVIDUAL HUMAN GENOMES*

Science.gov (United States)

Torkamani, Ali; Scott-Van Zeeland, Ashley A.; Topol, Eric J.; Schork, Nicholas J.

2014-01-01

Advances in DNA sequencing technologies have made it possible to rapidly, accurately and affordably sequence entire individual human genomes. As impressive as this ability seems, however, it will not likely to amount to much if one cannot extract meaningful information from individual sequence data. Annotating variations within individual genomes and providing information about their biological or phenotypic impact will thus be crucially important in moving individual sequencing projects forward, especially in the context of the clinical use of sequence information. In this paper we consider the various ways in which one might annotate individual sequence variations and point out limitations in the available methods for doing so. It is arguable that, in the foreseeable future, DNA sequencing of individual genomes will become routine for clinical, research, forensic, and personal purposes. We therefore also consider directions and areas for further research in annotating genomic variants. PMID:21839162
Protein sequence annotation in the genome era: the annotation concept of SWISS-PROT+TREMBL.

Science.gov (United States)

Apweiler, R; Gateau, A; Contrino, S; Martin, M J; Junker, V; O'Donovan, C; Lang, F; Mitaritonna, N; Kappus, S; Bairoch, A

1997-01-01

SWISS-PROT is a curated protein sequence database which strives to provide a high level of annotation, a minimal level of redundancy and high level of integration with other databases. Ongoing genome sequencing projects have dramatically increased the number of protein sequences to be incorporated into SWISS-PROT. Since we do not want to dilute the quality standards of SWISS-PROT by incorporating sequences without proper sequence analysis and annotation, we cannot speed up the incorporation of new incoming data indefinitely. However, as we also want to make the sequences available as fast as possible, we introduced TREMBL (TRanslation of EMBL nucleotide sequence database), a supplement to SWISS-PROT. TREMBL consists of computer-annotated entries in SWISS-PROT format derived from the translation of all coding sequences (CDS) in the EMBL nucleotide sequence database, except for CDS already included in SWISS-PROT. While TREMBL is already of immense value, its computer-generated annotation does not match the quality of SWISS-PROTs. The main difference is in the protein functional information attached to sequences. With this in mind, we are dedicating substantial effort to develop and apply computer methods to enhance the functional information attached to TREMBL entries.
The GATO gene annotation tool for research laboratories

Directory of Open Access Journals (Sweden)

A. Fujita

2005-11-01

Full Text Available Large-scale genome projects have generated a rapidly increasing number of DNA sequences. Therefore, development of computational methods to rapidly analyze these sequences is essential for progress in genomic research. Here we present an automatic annotation system for preliminary analysis of DNA sequences. The gene annotation tool (GATO is a Bioinformatics pipeline designed to facilitate routine functional annotation and easy access to annotated genes. It was designed in view of the frequent need of genomic researchers to access data pertaining to a common set of genes. In the GATO system, annotation is generated by querying some of the Web-accessible resources and the information is stored in a local database, which keeps a record of all previous annotation results. GATO may be accessed from everywhere through the internet or may be run locally if a large number of sequences are going to be annotated. It is implemented in PHP and Perl and may be run on any suitable Web server. Usually, installation and application of annotation systems require experience and are time consuming, but GATO is simple and practical, allowing anyone with basic skills in informatics to access it without any special training. GATO can be downloaded at [http://mariwork.iq.usp.br/gato/]. Minimum computer free space required is 2 MB.
GSV Annotated Bibliography

Energy Technology Data Exchange (ETDEWEB)

Roberts, Randy S. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Pope, Paul A. [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Jiang, Ming [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Trucano, Timothy G. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Aragon, Cecilia R. [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Ni, Kevin [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Wei, Thomas [Argonne National Lab. (ANL), Argonne, IL (United States); Chilton, Lawrence K. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Bakel, Alan [Argonne National Lab. (ANL), Argonne, IL (United States)

2010-09-14

The following annotated bibliography was developed as part of the geospatial algorithm verification and validation (GSV) project for the Simulation, Algorithms and Modeling program of NA-22. Verification and Validation of geospatial image analysis algorithms covers a wide range of technologies. Papers in the bibliography are thus organized into the following five topic areas: Image processing and analysis, usability and validation of geospatial image analysis algorithms, image distance measures, scene modeling and image rendering, and transportation simulation models. Many other papers were studied during the course of the investigation including. The annotations for these articles can be found in the paper "On the verification and validation of geospatial image analysis algorithms".
Generation of Unbiased Ionospheric Corrections in Brazilian Region for GNSS positioning based on SSR concept

Science.gov (United States)

Monico, J. F. G.; De Oliveira, P. S., Jr.; Morel, L.; Fund, F.; Durand, S.; Durand, F.

2017-12-01

Mitigation of ionospheric effects on GNSS (Global Navigation Satellite System) signals is very challenging, especially for GNSS positioning applications based on SSR (State Space Representation) concept, which requires the knowledge of spatial correlated errors with considerable accuracy level (centimeter). The presence of satellite and receiver hardware biases on GNSS measurements difficult the proper estimation of ionospheric corrections, reducing their physical meaning. This problematic can lead to ionospheric corrections biased of several meters and often presenting negative values, which is physically not possible. In this contribution, we discuss a strategy to obtain SSR ionospheric corrections based on GNSS measurements from CORS (Continuous Operation Reference Stations) Networks with minimal presence of hardware biases and consequently physical meaning. Preliminary results are presented on generation and application of such corrections for simulated users located in Brazilian region under high level of ionospheric activity.
Mining and Development of Novel SSR Markers Using Next Generation Sequencing (NGS Data in Plants

Directory of Open Access Journals (Sweden)

Sima Taheri

2018-02-01

Full Text Available Microsatellites, or simple sequence repeats (SSRs, are one of the most informative and multi-purpose genetic markers exploited in plant functional genomics. However, the discovery of SSRs and development using traditional methods are laborious, time-consuming, and costly. Recently, the availability of high-throughput sequencing technologies has enabled researchers to identify a substantial number of microsatellites at less cost and effort than traditional approaches. Illumina is a noteworthy transcriptome sequencing technology that is currently used in SSR marker development. Although 454 pyrosequencing datasets can be used for SSR development, this type of sequencing is no longer supported. This review aims to present an overview of the next generation sequencing, with a focus on the efficient use of de novo transcriptome sequencing (RNA-Seq and related tools for mining and development of microsatellites in plants.
Solar Tutorial and Annotation Resource (STAR)

Science.gov (United States)

Showalter, C.; Rex, R.; Hurlburt, N. E.; Zita, E. J.

2009-12-01

We have written a software suite designed to facilitate solar data analysis by scientists, students, and the public, anticipating enormous datasets from future instruments. Our “STAR" suite includes an interactive learning section explaining 15 classes of solar events. Users learn software tools that exploit humans’ superior ability (over computers) to identify many events. Annotation tools include time slice generation to quantify loop oscillations, the interpolation of event shapes using natural cubic splines (for loops, sigmoids, and filaments) and closed cubic splines (for coronal holes). Learning these tools in an environment where examples are provided prepares new users to comfortably utilize annotation software with new data. Upon completion of our tutorial, users are presented with media of various solar events and asked to identify and annotate the images, to test their mastery of the system. Goals of the project include public input into the data analysis of very large datasets from future solar satellites, and increased public interest and knowledge about the Sun. In 2010, the Solar Dynamics Observatory (SDO) will be launched into orbit. SDO’s advancements in solar telescope technology will generate a terabyte per day of high-quality data, requiring innovation in data management. While major projects develop automated feature recognition software, so that computers can complete much of the initial event tagging and analysis, still, that software cannot annotate features such as sigmoids, coronal magnetic loops, coronal dimming, etc., due to large amounts of data concentrated in relatively small areas. Previously, solar physicists manually annotated these features, but with the imminent influx of data it is unrealistic to expect specialized researchers to examine every image that computers cannot fully process. A new approach is needed to efficiently process these data. Providing analysis tools and data access to students and the public have proven
SSR240612 [(2R)-2-[((3R)-3-(1,3-benzodioxol-5-yl)-3-[[(6-methoxy-2-naphthyl)sulfonyl]amino]propanoyl)amino]-3-(4-[[2R,6S)-2,6-dimethylpiperidinyl]methyl]phenyl)-N-isopropyl-N-methylpropanamide hydrochloride], a new nonpeptide antagonist of the bradykinin B1 receptor: biochemical and pharmacological characterization.

Science.gov (United States)

Gougat, Jean; Ferrari, Bernard; Sarran, Lionel; Planchenault, Claudine; Poncelet, Martine; Maruani, Jeanne; Alonso, Richard; Cudennec, Annie; Croci, Tiziano; Guagnini, Fabio; Urban-Szabo, Katalin; Martinolle, Jean-Pierre; Soubrié, Philippe; Finance, Olivier; Le Fur, Gérard

2004-05-01

The biochemical and pharmacological properties of a novel non-peptide antagonist of the bradykinin (BK) B(1) receptor, SSR240612 [(2R)-2-[((3R)-3-(1,3-benzodioxol-5-yl)-3-[[(6-methoxy-2-naphthyl)sulfonyl]amino]propanoyl)amino]-3-(4-[[2R,6S)-2,6-dimethylpiperidinyl]methyl]phenyl)-N-isopropyl-N-methylpropanamide hydrochloride] were evaluated. SSR240612 inhibited the binding of [(3)H]Lys(0)-des-Arg(9)-BK to the B(1) receptor in human fibroblast MRC5 and to recombinant human B(1) receptor expressed in human embryonic kidney cells with inhibition constants (K(i)) of 0.48 and 0.73 nM, respectively. The compound selectivity for B(1) versus B(2) receptors was in the range of 500- to 1000-fold. SSR240612 inhibited Lys(0)-desAr(9)-BK (10 nM)-induced inositol monophosphate formation in human fibroblast MRC5, with an IC(50) of 1.9 nM. It also antagonized des-Arg(9)-BK-induced contractions of isolated rabbit aorta and mesenteric plexus of rat ileum with a pA(2) of 8.9 and 9.4, respectively. Antagonistic properties of SSR240612 were also demonstrated in vivo. SSR240612 inhibited des-Arg(9)-BK-induced paw edema in mice (3 and 10 mg/kg p.o. and 0.3 and 1 mg/kg i.p.). Moreover, SSR240612 reduced capsaicin-induced ear edema in mice (0.3, 3 and 30 mg/kg p.o.) and tissue destruction and neutrophil accumulation in the rat intestine following splanchnic artery occlusion/reperfusion (0.3 mg/kg i.v.). The compound also inhibited thermal hyperalgesia induced by UV irradiation (1 and 3 mg/kg p.o.) and the late phase of nociceptive response to formalin in rats (10 and 30 mg/kg p.o.). Finally, SSR240612 (20 and 30 mg/kg p.o.) prevented neuropathic thermal pain induced by sciatic nerve constriction in the rat. In conclusion, SSR240612 is a new, potent, and orally active specific non-peptide bradykinin B(1) receptor antagonist.
Discovering gene annotations in biomedical text databases

Directory of Open Access Journals (Sweden)

Ozsoyoglu Gultekin

2008-03-01

Full Text Available Abstract Background Genes and gene products are frequently annotated with Gene Ontology concepts based on the evidence provided in genomics articles. Manually locating and curating information about a genomic entity from the biomedical literature requires vast amounts of human effort. Hence, there is clearly a need forautomated computational tools to annotate the genes and gene products with Gene Ontology concepts by computationally capturing the related knowledge embedded in textual data. Results In this article, we present an automated genomic entity annotation system, GEANN, which extracts information about the characteristics of genes and gene products in article abstracts from PubMed, and translates the discoveredknowledge into Gene Ontology (GO concepts, a widely-used standardized vocabulary of genomic traits. GEANN utilizes textual "extraction patterns", and a semantic matching framework to locate phrases matching to a pattern and produce Gene Ontology annotations for genes and gene products. In our experiments, GEANN has reached to the precision level of 78% at therecall level of 61%. On a select set of Gene Ontology concepts, GEANN either outperforms or is comparable to two other automated annotation studies. Use of WordNet for semantic pattern matching improves the precision and recall by 24% and 15%, respectively, and the improvement due to semantic pattern matching becomes more apparent as the Gene Ontology terms become more general. Conclusion GEANN is useful for two distinct purposes: (i automating the annotation of genomic entities with Gene Ontology concepts, and (ii providing existing annotations with additional "evidence articles" from the literature. The use of textual extraction patterns that are constructed based on the existing annotations achieve high precision. The semantic pattern matching framework provides a more flexible pattern matching scheme with respect to "exactmatching" with the advantage of locating approximate
Assessment of genetic diversity in ragi [Eleusine coracana (L.) Gaertn] using morphological, RAPD and SSR markers.

Science.gov (United States)

Prabhu, Kalapad Santosh; Das, Anath Bandhu; Dikshit, Nilamani

2018-04-25

Finger millet (Eleusine coracana L. Gaertn., 2n=36) is one of the most important minor crops, commonly known as 'ragi' and used as a staple food grain in more than 25 countries including Africa and south Asia. Twenty-seven accessions of ragi were collected from different parts of India and were evaluated for morpho-genetic diversity studies. Simple sequence repeat (SSR) and random amplified polymorphic DNA (RAPD) markers were used for assessment of genetic diversity among 27 genotypes of E. coracana. High degree of similarity (90%) was obtained between 'IC49979A' and 'IC49974B' genotypes, whereas low level of similarity (9.09%) was found between 'IC204141' and 'IC49985' as evident in morphological and DNA markers. A total of 64 SSR and 301 RAPD amplicons were produced, out of which 87.50% and 77.20% DNA fragments showed polymorphism, respectively. The clustering pattern obtained among the genotypes corresponded well with their morphological and cytological data with a monophyletic origin of this species which was further supported by high bootstrap values and principal component analysis. Cluster analysis showed that ragi accessions were categorised into three distinct groups. Genotypes IC344761, IC340116, IC340127, IC49965 and IC49985 found accession specific in RAPD and SSR markers. The variation among ragi accessions might be used as potential source of germplasm for crop improvement.
CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences

Science.gov (United States)

2012-01-01

Background The complete sequences of chloroplast genomes provide wealthy information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, powerful computational tools annotating the genome sequences are in urgent need. Results We have developed a web server CPGAVAS. The server accepts a complete chloroplast genome sequence as input. First, it predicts protein-coding and rRNA genes based on the identification and mapping of the most similar, full-length protein, cDNA and rRNA sequences by integrating results from Blastx, Blastn, protein2genome and est2genome programs. Second, tRNA genes and inverted repeats (IR) are identified using tRNAscan, ARAGORN and vmatch respectively. Third, it calculates the summary statistics for the annotated genome. Fourth, it generates a circular map ready for publication. Fifth, it can create a Sequin file for GenBank submission. Last, it allows the extractions of protein and mRNA sequences for given list of genes and species. The annotation results in GFF3 format can be edited using any compatible annotation editing tools. The edited annotations can then be uploaded to CPGAVAS for update and re-analyses repeatedly. Using known chloroplast genome sequences as test set, we show that CPGAVAS performs comparably to another application DOGMA, while having several superior functionalities. Conclusions CPGAVAS allows the semi-automatic and complete annotation of a chloroplast genome sequence, and the visualization, editing and analysis of the annotation results. It will become an indispensible tool for researchers studying chloroplast genomes. The software is freely accessible from http://www.herbalgenomics.org/cpgavas. PMID:23256920
CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences

Directory of Open Access Journals (Sweden)

Liu Chang

2012-12-01

Full Text Available Abstract Background The complete sequences of chloroplast genomes provide wealthy information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, powerful computational tools annotating the genome sequences are in urgent need. Results We have developed a web server CPGAVAS. The server accepts a complete chloroplast genome sequence as input. First, it predicts protein-coding and rRNA genes based on the identification and mapping of the most similar, full-length protein, cDNA and rRNA sequences by integrating results from Blastx, Blastn, protein2genome and est2genome programs. Second, tRNA genes and inverted repeats (IR are identified using tRNAscan, ARAGORN and vmatch respectively. Third, it calculates the summary statistics for the annotated genome. Fourth, it generates a circular map ready for publication. Fifth, it can create a Sequin file for GenBank submission. Last, it allows the extractions of protein and mRNA sequences for given list of genes and species. The annotation results in GFF3 format can be edited using any compatible annotation editing tools. The edited annotations can then be uploaded to CPGAVAS for update and re-analyses repeatedly. Using known chloroplast genome sequences as test set, we show that CPGAVAS performs comparably to another application DOGMA, while having several superior functionalities. Conclusions CPGAVAS allows the semi-automatic and complete annotation of a chloroplast genome sequence, and the visualization, editing and analysis of the annotation results. It will become an indispensible tool for researchers studying chloroplast genomes. The software is freely accessible from http://www.herbalgenomics.org/cpgavas.
Annotating the human genome with Disease Ontology

Science.gov (United States)

Osborne, John D; Flatow, Jared; Holko, Michelle; Lin, Simon M; Kibbe, Warren A; Zhu, Lihua (Julie); Danila, Maria I; Feng, Gang; Chisholm, Rex L

2009-01-01

Background The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases. Results We used the Unified Medical Language System (UMLS) MetaMap Transfer tool (MMTx) to discover gene-disease relationships from the GeneRIF database. We utilized a comprehensive subset of UMLS, which is disease-focused and structured as a directed acyclic graph (the Disease Ontology), to filter and interpret results from MMTx. The results were validated against the Homayouni gene collection using recall and precision measurements. We compared our results with the widely used Online Mendelian Inheritance in Man (OMIM) annotations. Conclusion The validation data set suggests a 91% recall rate and 97% precision rate of disease annotation using GeneRIF, in contrast with a 22% recall and 98% precision using OMIM. Our thesaurus-based approach allows for comparisons to be made between disease containing databases and allows for increased accuracy in disease identification through synonym matching. The much higher recall rate of our approach demonstrates that annotating human genome with Disease Ontology and GeneRIF for diseases dramatically increases the coverage of the disease annotation of human genome. PMID:19594883
MIPS bacterial genomes functional annotation benchmark dataset.

Science.gov (United States)

Tetko, Igor V; Brauner, Barbara; Dunger-Kaltenbach, Irmtraud; Frishman, Goar; Montrone, Corinna; Fobo, Gisela; Ruepp, Andreas; Antonov, Alexey V; Surmeli, Dimitrij; Mewes, Hans-Wernen

2005-05-15

Any development of new methods for automatic functional annotation of proteins according to their sequences requires high-quality data (as benchmark) as well as tedious preparatory work to generate sequence parameters required as input data for the machine learning methods. Different program settings and incompatible protocols make a comparison of the analyzed methods difficult. The MIPS Bacterial Functional Annotation Benchmark dataset (MIPS-BFAB) is a new, high-quality resource comprising four bacterial genomes manually annotated according to the MIPS functional catalogue (FunCat). These resources include precalculated sequence parameters, such as sequence similarity scores, InterPro domain composition and other parameters that could be used to develop and benchmark methods for functional annotation of bacterial protein sequences. These data are provided in XML format and can be used by scientists who are not necessarily experts in genome annotation. BFAB is available at http://mips.gsf.de/proj/bfab
Annotating non-coding regions of the genome.

Science.gov (United States)

Alexander, Roger P; Fang, Gang; Rozowsky, Joel; Snyder, Michael; Gerstein, Mark B

2010-08-01

Most of the human genome consists of non-protein-coding DNA. Recently, progress has been made in annotating these non-coding regions through the interpretation of functional genomics experiments and comparative sequence analysis. One can conceptualize functional genomics analysis as involving a sequence of steps: turning the output of an experiment into a 'signal' at each base pair of the genome; smoothing this signal and segmenting it into small blocks of initial annotation; and then clustering these small blocks into larger derived annotations and networks. Finally, one can relate functional genomics annotations to conserved units and measures of conservation derived from comparative sequence analysis.
Sequencing and De Novo Transcriptome Assembly of Brachypodium sylvaticum (Poaceae

Directory of Open Access Journals (Sweden)

Samuel E. Fox

2013-03-01

Full Text Available Premise of the study: We report the de novo assembly and characterization of the transcriptomes of Brachypodium sylvaticum (slender false-brome accessions from native populations of Spain and Greece, and an invasive population west of Corvallis, Oregon, USA. Methods and Results: More than 350 million sequence reads from the mRNA libraries prepared from three B. sylvaticum genotypes were assembled into 120,091 (Corvallis, 104,950 (Spain, and 177,682 (Greece transcript contigs. In comparison with the B. distachyon Bd21 reference genome and GenBank protein sequences, we estimate >90% exome coverage for B. sylvaticum. The transcripts were assigned Gene Ontology and InterPro annotations. Brachypodium sylvaticum sequence reads aligned against the Bd21 genome revealed 394,654 single-nucleotide polymorphisms (SNPs and >20,000 simple sequence repeat (SSR DNA sites. Conclusions: To our knowledge, this is the first report of transcriptome sequencing of invasive plant species with a closely related sequenced reference genome. The sequences and identified SNP variant and SSR sites will provide tools for developing novel genetic markers for use in genotyping and characterization of invasive behavior of B. sylvaticum.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.