WorldWideScience

Sample records for species est database

  1. Analysis of expressed sequence tags from Actinidia: applications of a cross species EST database for gene discovery in the areas of flavor, health, color and ripening

    Directory of Open Access Journals (Sweden)

    Richardson Annette C

    2008-07-01

    Full Text Available Abstract Background Kiwifruit (Actinidia spp. are a relatively new, but economically important crop grown in many different parts of the world. Commercial success is driven by the development of new cultivars with novel consumer traits including flavor, appearance, healthful components and convenience. To increase our understanding of the genetic diversity and gene-based control of these key traits in Actinidia, we have produced a collection of 132,577 expressed sequence tags (ESTs. Results The ESTs were derived mainly from four Actinidia species (A. chinensis, A. deliciosa, A. arguta and A. eriantha and fell into 41,858 non redundant clusters (18,070 tentative consensus sequences and 23,788 EST singletons. Analysis of flavor and fragrance-related gene families (acyltransferases and carboxylesterases and pathways (terpenoid biosynthesis is presented in comparison with a chemical analysis of the compounds present in Actinidia including esters, acids, alcohols and terpenes. ESTs are identified for most genes in color pathways controlling chlorophyll degradation and carotenoid biosynthesis. In the health area, data are presented on the ESTs involved in ascorbic acid and quinic acid biosynthesis showing not only that genes for many of the steps in these pathways are represented in the database, but that genes encoding some critical steps are absent. In the convenience area, genes related to different stages of fruit softening are identified. Conclusion This large EST resource will allow researchers to undertake the tremendous challenge of understanding the molecular basis of genetic diversity in the Actinidia genus as well as provide an EST resource for comparative fruit genomics. The various bioinformatics analyses we have undertaken demonstrates the extent of coverage of ESTs for genes encoding different biochemical pathways in Actinidia.

  2. Signaling pathways in a Citrus EST database

    Directory of Open Access Journals (Sweden)

    Angela Mehta

    2007-01-01

    Full Text Available Citrus spp. are economically important crops, which in Brazil are grown mainly in the State of São Paulo. Citrus cultures are attacked by several pathogens, causing severe yield losses. In order to better understand this culture, the Millenium Project (IAC Cordeirópolis was launched in order to sequence Citrus ESTs (expressed sequence tags from different tissues, including leaf, bark, fruit, root and flower. Plants were submitted to biotic and abiotic stresses and investigated under different development stages (adult vs. juvenile. Several cDNA libraries were constructed and the sequences obtained formed the Citrus ESTs database with almost 200,000 sequences. Searches were performed in the Citrus database to investigate the presence of different signaling pathway components. Several of the genes involved in the signaling of sugar, calcium, cytokinin, plant hormones, inositol phosphate, MAPKinase and COP9 were found in the citrus genome and are discussed in this paper. The results obtained may indicate that similar mechanisms described in other plants, such as Arabidopsis, occur in citrus. Further experimental studies must be conducted in order to understand the different signaling pathways present.

  3. EST data - RED | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available switchLanguage; BLAST Search Image Search Home About Archive Update History Data ...st.zip File URL: ftp://ftp.biosciencedbc.jp/archive/red/LATEST/red_est.zip File size: 629 KB Simple search U...ase Database Description Download License Update History of This Database Site Policy | Contact Us EST data - RED | LSDB Archive ...

  4. Database Description - AcEST | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available abase Description General information of database Database name AcEST Alternative n...hi, Tokyo-to 192-0397 Tel: +81-42-677-1111(ext.3654) E-mail: Database classificat...eneris Taxonomy ID: 13818 Database description This is a database of EST sequences of Adiantum capillus-vene...(3): 223-227. External Links: Original website information Database maintenance site Plant Environmental Res...base Database Description Download License Update History of This Database Site Policy | Contact Us Database Description - AcEST | LSDB Archive ...

  5. Update History of This Database - AcEST | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available switchLanguage; BLAST Search Image Search Home About Archive Update History Data ...List Contact us AcEST Update History of This Database Date Update contents 2013/01/10 Errors found on AcEST ...s Database Database Description Download License Update History of This Data...base Site Policy | Contact Us Update History of This Database - AcEST | LSDB Archive ... ...Conting data have been correceted. For details, please refer to the following page. Data correction 2010/03/29 AcEST English archi

  6. Putative resistance genes in the CitEST database

    Directory of Open Access Journals (Sweden)

    Simone Guidetti-Gonzalez

    2007-01-01

    Full Text Available Disease resistance in plants is usually associated with the activation of a wide variety of defense responses to prevent pathogen replication and/or movement. The ability of the host plant to recognize the pathogen and to activate defense responses is regulated by direct or indirect interaction between the products of plant resistance (R and pathogen avirulence (Avr genes. Attempted infection of plants by avirulent pathogens elicits a battery of defenses often followed by the collapse of the challenged host cells. Localized host cell death may help to prevent the pathogen from spreading to uninfected tissues, known as hypersensitive response (HR. When either the plant or the pathogen lacks its cognate gene, activation of the plant’s defense responses fails to occur or is delayed and does not prevent pathogen colonization. In the CitEST database, we identified 1,300 reads related to R genes in Citrus which have been reported in other plant species. These reads were translated in silico, and alignments of their amino acid sequences revealed the presence of characteristic domains and motifs that are specific to R gene classes. The description of the reads identified suggests that they function as resistance genes in citrus.

  7. Construction, database integration, and application of an Oenothera EST library.

    Science.gov (United States)

    Mrácek, Jaroslav; Greiner, Stephan; Cho, Won Kyong; Rauwolf, Uwe; Braun, Martha; Umate, Pavan; Altstätter, Johannes; Stoppel, Rhea; Mlcochová, Lada; Silber, Martina V; Volz, Stefanie M; White, Sarah; Selmeier, Renate; Rudd, Stephen; Herrmann, Reinhold G; Meurer, Jörg

    2006-09-01

    Coevolution of cellular genetic compartments is a fundamental aspect in eukaryotic genome evolution that becomes apparent in serious developmental disturbances after interspecific organelle exchanges. The genus Oenothera represents a unique, at present the only available, resource to study the role of the compartmentalized plant genome in diversification of populations and speciation processes. An integrated approach involving cDNA cloning, EST sequencing, and bioinformatic data mining was chosen using Oenothera elata with the genetic constitution nuclear genome AA with plastome type I. The Gene Ontology system grouped 1621 unique gene products into 17 different functional categories. Application of arrays generated from a selected fraction of ESTs revealed significantly differing expression profiles among closely related Oenothera species possessing the potential to generate fertile and incompatible plastid/nuclear hybrids (hybrid bleaching). Furthermore, the EST library provides a valuable source of PCR-based polymorphic molecular markers that are instrumental for genotyping and molecular mapping approaches.

  8. Exploiting a wheat EST database to assess genetic diversity.

    Science.gov (United States)

    Karakas, Ozge; Gurel, Filiz; Uncuoglu, Ahu Altinkut

    2010-10-01

    Expressed sequence tag (EST) markers have been used to assess variety and genetic diversity in wheat (Triticum aestivum). In this study, 1549 ESTs from wheat infested with yellow rust were used to examine the genetic diversity of six susceptible and resistant wheat cultivars. The aim of using these cultivars was to improve the competitiveness of public wheat breeding programs through the intensive use of modern, particularly marker-assisted, selection technologies. The F(2) individuals derived from cultivar crosses were screened for resistance to yellow rust at the seedling stage in greenhouses and adult stage in the field to identify DNA markers genetically linked to resistance. Five hundred and sixty ESTs were assembled into 136 contigs and 989 singletons. BlastX search results showed that 39 (29%) contigs and 96 (10%) singletons were homologous to wheat genes. The database-matched contigs and singletons were assigned to eight functional groups related to protein synthesis, photosynthesis, metabolism and energy, stress proteins, transporter proteins, protein breakdown and recycling, cell growth and division and reactive oxygen scavengers. PCR analyses with primers based on the contigs and singletons showed that the most polymorphic functional categories were photosynthesis (contigs) and metabolism and energy (singletons). EST analysis revealed considerable genetic variability among the Turkish wheat cultivars resistant and susceptible to yellow rust disease and allowed calculation of the mean genetic distance between cultivars, with the greatest similarity (0.725) being between Harmankaya99 and Sönmez2001, and the lowest (0.622) between Aytin98 and Izgi01.

  9. Exploiting a wheat EST database to assess genetic diversity

    Directory of Open Access Journals (Sweden)

    Ozge Karakas

    2010-01-01

    Full Text Available Expressed sequence tag (EST markers have been used to assess variety and genetic diversity in wheat (Triticum aestivum. In this study, 1549 ESTs from wheat infested with yellow rust were used to examine the genetic diversity of six susceptible and resistant wheat cultivars. The aim of using these cultivars was to improve the competitiveness of public wheat breeding programs through the intensive use of modern, particularly marker-assisted, selection technologies. The F2 individuals derived from cultivar crosses were screened for resistance to yellow rust at the seedling stage in greenhouses and adult stage in the field to identify DNA markers genetically linked to resistance. Five hundred and sixty ESTs were assembled into 136 contigs and 989 singletons. BlastX search results showed that 39 (29% contigs and 96 (10% singletons were homologous to wheat genes. The database-matched contigs and singletons were assigned to eight functional groups related to protein synthesis, photosynthesis, metabolism and energy, stress proteins, transporter proteins, protein breakdown and recycling, cell growth and division and reactive oxygen scavengers. PCR analyses with primers based on the contigs and singletons showed that the most polymorphic functional categories were photosynthesis (contigs and metabolism and energy (singletons. EST analysis revealed considerable genetic variability among the Turkish wheat cultivars resistant and susceptible to yellow rust disease and allowed calculation of the mean genetic distance between cultivars, with the greatest similarity (0.725 being between Harmankaya99 and Sönmez2001, and the lowest (0.622 between Aytin98 and Izgi01.

  10. AcEST(EST sequences of Adiantum capillus-veneris and their annotation) - AcEST | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us AcEST AcEST(EST sequences of Adiantum capillus-veneris and their annotation) Data detail Dat...a name AcEST(EST sequences of Adiantum capillus-veneris and their annotation) DOI 10.18908/lsdba.nbdc00839-0...01 Description of data contents EST sequence of Adiantum capillus-veneris and its annotation (clone ID, libr...le search URL http://togodb.biosciencedbc.jp/togodb/view/archive_acest#en Data acquisition method Capillary ...ainst UniProtKB/Swiss-Prot and UniProtKB/TrEMBL databases) Number of data entries Adiantum capillus-veneris

  11. GarlicESTdb: an online database and mining tool for garlic EST sequences

    Directory of Open Access Journals (Sweden)

    Choi Sang-Haeng

    2009-05-01

    Full Text Available Abstract Background Allium sativum., commonly known as garlic, is a species in the onion genus (Allium, which is a large and diverse one containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary, medicinal use and health benefits. Currently, the interest in garlic is highly increasing due to nutritional and pharmaceutical value including high blood pressure and cholesterol, atherosclerosis and cancer. For all that, there are no comprehensive databases available for Expressed Sequence Tags(EST of garlic for gene discovery and future efforts of genome annotation. That is why we developed a new garlic database and applications to enable comprehensive analysis of garlic gene expression. Description GarlicESTdb is an integrated database and mining tool for large-scale garlic (Allium sativum EST sequencing. A total of 21,595 ESTs collected from an in-house cDNA library were used to construct the database. The analysis pipeline is an automated system written in JAVA and consists of the following components: automatic preprocessing of EST reads, assembly of raw sequences, annotation of the assembled sequences, storage of the analyzed information into MySQL databases, and graphic display of all processed data. A web application was implemented with the latest J2EE (Java 2 Platform Enterprise Edition software technology (JSP/EJB/JavaServlet for browsing and querying the database, for creation of dynamic web pages on the client side, and for mapping annotated enzymes to KEGG pathways, the AJAX framework was also used partially. The online resources, such as putative annotation, single nucleotide polymorphisms (SNP and tandem repeat data sets, can be searched by text, explored on the website, searched using BLAST, and downloaded. To archive more significant BLAST results, a curation system was introduced with which biologists can easily edit best-hit annotation

  12. GarlicESTdb: an online database and mining tool for garlic EST sequences.

    Science.gov (United States)

    Kim, Dae-Won; Jung, Tae-Sung; Nam, Seong-Hyeuk; Kwon, Hyuk-Ryul; Kim, Aeri; Chae, Sung-Hwa; Choi, Sang-Haeng; Kim, Dong-Wook; Kim, Ryong Nam; Park, Hong-Seog

    2009-05-18

    Allium sativum., commonly known as garlic, is a species in the onion genus (Allium), which is a large and diverse one containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary, medicinal use and health benefits. Currently, the interest in garlic is highly increasing due to nutritional and pharmaceutical value including high blood pressure and cholesterol, atherosclerosis and cancer. For all that, there are no comprehensive databases available for Expressed Sequence Tags(EST) of garlic for gene discovery and future efforts of genome annotation. That is why we developed a new garlic database and applications to enable comprehensive analysis of garlic gene expression. GarlicESTdb is an integrated database and mining tool for large-scale garlic (Allium sativum) EST sequencing. A total of 21,595 ESTs collected from an in-house cDNA library were used to construct the database. The analysis pipeline is an automated system written in JAVA and consists of the following components: automatic preprocessing of EST reads, assembly of raw sequences, annotation of the assembled sequences, storage of the analyzed information into MySQL databases, and graphic display of all processed data. A web application was implemented with the latest J2EE (Java 2 Platform Enterprise Edition) software technology (JSP/EJB/JavaServlet) for browsing and querying the database, for creation of dynamic web pages on the client side, and for mapping annotated enzymes to KEGG pathways, the AJAX framework was also used partially. The online resources, such as putative annotation, single nucleotide polymorphisms (SNP) and tandem repeat data sets, can be searched by text, explored on the website, searched using BLAST, and downloaded. To archive more significant BLAST results, a curation system was introduced with which biologists can easily edit best-hit annotation information for others to view. The Garlic

  13. Pepper EST database: comprehensive in silico tool for analyzing the chili pepper (Capsicum annuum transcriptome

    Directory of Open Access Journals (Sweden)

    Kim Woo Taek

    2008-10-01

    Full Text Available Abstract Background There is no dedicated database available for Expressed Sequence Tags (EST of the chili pepper (Capsicum annuum, although the interest in a chili pepper EST database is increasing internationally due to the nutritional, economic, and pharmaceutical value of the plant. Recent advances in high-throughput sequencing of the ESTs of chili pepper cv. Bukang have produced hundreds of thousands of complementary DNA (cDNA sequences. Therefore, a chili pepper EST database was designed and constructed to enable comprehensive analysis of chili pepper gene expression in response to biotic and abiotic stresses. Results We built the Pepper EST database to mine the complexity of chili pepper ESTs. The database was built on 122,582 sequenced ESTs and 116,412 refined ESTs from 21 pepper EST libraries. The ESTs were clustered and assembled into virtual consensus cDNAs and the cDNAs were assigned to metabolic pathway, Gene Ontology (GO, and MIPS Functional Catalogue (FunCat. The Pepper EST database is designed to provide a workbench for (i identifying unigenes in pepper plants, (ii analyzing expression patterns in different developmental tissues and under conditions of stress, and (iii comparing the ESTs with those of other members of the Solanaceae family. The Pepper EST database is freely available at http://genepool.kribb.re.kr/pepper/. Conclusion The Pepper EST database is expected to provide a high-quality resource, which will contribute to gaining a systemic understanding of plant diseases and facilitate genetics-based population studies. The database is also expected to contribute to analysis of gene synteny as part of the chili pepper sequencing project by mapping ESTs to the genome.

  14. REDVET está indexada en Academia Journals Database

    OpenAIRE

    Redaccion Veterinaria.org

    2007-01-01

    El slogan de la Academia Journals Database, base de datos y directorio bibliográfico es “La difusión de conocimientos científicos de calidad controlada” por lo que es una satisfacción para REDVET que se nos valore por esta institución como una revista de calidad.

  15. Analysis of newly established EST databases reveals similarities between heart regeneration in newt and fish

    Directory of Open Access Journals (Sweden)

    Weis Patrick

    2010-01-01

    Full Text Available Abstract Background The newt Notophthalmus viridescens possesses the remarkable ability to respond to cardiac damage by formation of new myocardial tissue. Surprisingly little is known about changes in gene activities that occur during the course of regeneration. To begin to decipher the molecular processes, that underlie restoration of functional cardiac tissue, we generated an EST database from regenerating newt hearts and compared the transcriptional profile of selected candidates with genes deregulated during zebrafish heart regeneration. Results A cDNA library of 100,000 cDNA clones was generated from newt hearts 14 days after ventricular injury. Sequencing of 11520 cDNA clones resulted in 2894 assembled contigs. BLAST searches revealed 1695 sequences with potential homology to sequences from the NCBI database. BLAST searches to TrEMBL and Swiss-Prot databases assigned 1116 proteins to Gene Ontology terms. We also identified a relatively large set of 174 ORFs, which are likely to be unique for urodele amphibians. Expression analysis of newt-zebrafish homologues confirmed the deregulation of selected genes during heart regeneration. Sequences, BLAST results and GO annotations were visualized in a relational web based database followed by grouping of identified proteins into clusters of GO Terms. Comparison of data from regenerating zebrafish hearts identified biological processes, which were uniformly overrepresented during cardiac regeneration in newt and zebrafish. Conclusion We concluded that heart regeneration in newts and zebrafish led to the activation of similar sets of genes, which suggests that heart regeneration in both species might follow similar principles. The design of the newly established newt EST database allows identification of molecular pathways important for heart regeneration.

  16. All 5' EST - KOME | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available switchLanguage; BLAST Search Image Search Home About Archive Update History Data ...n of data contents 5' EST sequences Data file File name: CSV: kome_est_5end_all.zip File URL: ftp://ftp.biosciencedbc.jp/archiv...fasta.zip File URL: ftp://ftp.biosciencedbc.jp/archive/kome/LATEST/kome_est_5end_...se Description Download License Update History of This Database Site Policy | Contact Us All 5' EST - KOME | LSDB Archive ...

  17. All 3' EST - KOME | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available switchLanguage; BLAST Search Image Search Home About Archive Update History Data ...n of data contents 3' EST sequences Data file File name: CSV: kome_est_3end_all.zip File URL: ftp://ftp.biosciencedbc.jp/archiv...fasta.zip File URL: ftp://ftp.biosciencedbc.jp/archive/kome/LATEST/kome_est_3end_...se Description Download License Update History of This Database Site Policy | Contact Us All 3' EST - KOME | LSDB Archive ...

  18. License - AcEST | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us AcEST License to Use This Database Last updated : 2010/02/15 You may use this database in co...he Standard License specifies the license terms regarding the use of this database and the requirements you ...must follow in using this database. The Additional License specifies those items ... License The Standard License for this database is the license specified in the Creative Commons Attribution...-Share Alike 2.1 Japan . If you use data from this database, please be sure attribute this database

  19. Exploiting the transcriptome of Euphrates Poplar, Populus euphratica (Salicaceae to develop and characterize new EST-SSR markers and construct an EST-SSR database.

    Directory of Open Access Journals (Sweden)

    Fang K Du

    Full Text Available BACKGROUND: Microsatellite markers or Simple Sequence Repeats (SSRs are the most popular markers in population/conservation genetics. However, the development of novel microsatellite markers has been impeded by high costs, a lack of available sequence data and technical difficulties. New species-specific microsatellite markers were required to investigate the evolutionary history of the Euphratica tree, Populus euphratica, the only tree species found in the desert regions of Western China and adjacent Central Asian countries. METHODOLOGY/PRINCIPAL FINDINGS: A total of 94,090 non-redundant Expressed Sequence Tags (ESTs from P. euphratica comprising around 63 Mb of sequence data were searched for SSRs. 4,202 SSRs were found in 3,839 ESTs, with 311 ESTs containing multiple SSRs. The most common motif types were trinucleotides (37% and hexanucleotides (33% repeats. We developed primer pairs for all of the identified EST-SSRs (eSSRs and selected 673 of these pairs at random for further validation. 575 pairs (85% gave successful amplification, of which, 464 (80.7% were polymorphic in six to 24 individuals from natural populations across Northern China. We also tested the transferability of the polymorphic eSSRs to nine other Populus species. In addition, to facilitate the use of these new eSSR markers by other researchers, we mapped them onto Populus trichocarpa scaffolds in silico and compiled our data into a web-based database (http://202.205.131.253:8080/poplar/resources/static_page/index.html. CONCLUSIONS: The large set of validated eSSRs identified in this work will have many potential applications in studies on P. euphratica and other poplar species, in fields such as population genetics, comparative genomics, linkage mapping, QTL, and marker-assisted breeding. Their use will be facilitated by their incorporation into a user-friendly web-based database.

  20. License - ClEST | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available switchLanguage; BLAST Search Image Search Home About Archive Update History Data ...License Update History of This Database Site Policy | Contact Us License - ClEST | LSDB Archive ...

  1. Exploiting EST databases for the development and characterization of EST-SSR markers in castor bean (Ricinus communis L.

    Directory of Open Access Journals (Sweden)

    Yang Jun-Bo

    2010-12-01

    Full Text Available Abstract Background The castor bean (Ricinus communis L., a monotypic species in the spurge family (Euphorbiaceae, 2n = 20, is an important non-edible oilseed crop widely cultivated in tropical, sub-tropical and temperate countries for its high economic value. Because of the high level of ricinoleic acid (over 85% in its seed oil, the castor bean seed derivatives are often used in aviation oil, lubricants, nylon, dyes, inks, soaps, adhesive and biodiesel. Due to lack of efficient molecular markers, little is known about the population genetic diversity and the genetic relationships among castor bean germplasm. Efficient and robust molecular markers are increasingly needed for breeding and improving varieties in castor bean. The advent of modern genomics has produced large amounts of publicly available DNA sequence data. In particular, expressed sequence tags (ESTs provide valuable resources to develop gene-associated SSR markers. Results In total, 18,928 publicly available non-redundant castor bean EST sequences, representing approximately 17.03 Mb, were evaluated and 7732 SSR sites in 5,122 ESTs were identified by data mining. Castor bean exhibited considerably high frequency of EST-SSRs. We developed and characterized 118 polymorphic EST-SSR markers from 379 primer pairs flanking repeats by screening 24 castor bean samples collected from different countries. A total of 350 alleles were identified from 118 polymorphic SSR loci, ranging from 2-6 per locus (A with an average of 2.97. The EST-SSR markers developed displayed moderate gene diversity (He with an average of 0.41. Genetic relationships among 24 germplasms were investigated using the genotypes of 350 alleles, showing geographic pattern of genotypes across genetic diversity centers of castor bean. Conclusion Castor bean EST sequences exhibited considerably high frequency of SSR sites, and were rich resources for developing EST-SSR markers. These EST-SSR markers would be particularly

  2. About Libraries - AcEST | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available switchLanguage; BLAST Search Image Search Home About Archive Update History Data ...ormat text file. Data file File name: acest_library.zip File URL: ftp://ftp.biosciencedbc.jp/archive/acest/L...ATEST/acest_library.zip File size: 2KB Simple search URL http://togodb.biosciencedbc.jp/togodb/view/archiv...s Database Database Description Download License Update History of This Database Site Policy | Contact Us About Libraries - AcEST | LSDB Archive ...

  3. Database Description - ClEST | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available filiation: National Institute of Advanced Industrial Science and Technology Contact address Symbiotic Evolut...2012 Aug; 47(3):233-243. External Links: Original website information Database maintenance site Symbiotic Ev

  4. Clone - ClEST | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available switchLanguage; BLAST Search Image Search Home About Archive Update History Data ...le URL: ftp://ftp.biosciencedbc.jp/archive/clest/LATEST/clest_clone.zip File size: 660 KB Simple search URL ...ion Download License Update History of This Database Site Policy | Contact Us Clone - ClEST | LSDB Archive ...

  5. Cluster - ClEST | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available switchLanguage; BLAST Search Image Search Home About Archive Update History Data ...e File name: clest_cluster.zip File URL: ftp://ftp.biosciencedbc.jp/archive/clest/LATEST/clest_cluster.zip F...ownload License Update History of This Database Site Policy | Contact Us Cluster - ClEST | LSDB Archive ...

  6. Efficient Serial and Parallel Algorithms for Selection of Unique Oligos in EST Databases.

    Science.gov (United States)

    Mata-Montero, Manrique; Shalaby, Nabil; Sheppard, Bradley

    2013-01-01

    Obtaining unique oligos from an EST database is a problem of great importance in bioinformatics, particularly in the discovery of new genes and the mapping of the human genome. Many algorithms have been developed to find unique oligos, many of which are much less time consuming than the traditional brute force approach. An algorithm was presented by Zheng et al. (2004) which finds the solution of the unique oligos search problem efficiently. We implement this algorithm as well as several new algorithms based on some theorems included in this paper. We demonstrate how, with these new algorithms, we can obtain unique oligos much faster than with previous ones. We parallelize these new algorithms to further improve the time of finding unique oligos. All algorithms are run on ESTs obtained from a Barley EST database.

  7. Identification, validation and cross-species transferability of novel Lavandula EST-SSRs.

    Science.gov (United States)

    Adal, Ayelign M; Demissie, Zerihun A; Mahmoud, Soheil S

    2015-04-01

    We identified and characterized EST-SSRs with strong discrimination power against Lavandula angustifolia and Lavandula x intermedia . The markers also showed considerable cross-species transferability rate into six related Lavandula species. Lavenders (Lavandula) are important economical crops grown around the globe for essential oil production. In an attempt to develop genetic markers for these plants, we analyzed over 13,000 unigenes developed from L. angustifolia and L. x intermedia EST databases, and identified 3,459 simple sequence repeats (SSR), which were dominated by trinucleotides (41.2 %) and dinucleotides (31.45 %). Approximately, 19 % of the unigenes contained at least one SSR marker, over 60 % of which were localized in the UTRs. Only 252 EST-SSRs were 18 bp or longer from which 31 loci were validated, and 24 amplified discrete fragments with 85 % polymorphism in L. x intermedia and L. angustifolia. The average number of alleles in L. x intermedia and L. angustifolia were 3.42 and 3.71 per marker with average PIC values of 0.47 and 0.52, respectively. These values suggest a moderate to strong level of informativeness for the markers, with some loci producing unique fingerprints. The cross-species transferability rate of the markers ranges 50-100 % across eight species. The utility of these markers was assessed in eight Lavandula species and 15 L. angustifolia and L. x intermedia cultivars, and the dendrogram deduced from their similarity indexes successfully delineated the species into their respective sections and the cultivars into their respective species. These markers have potential for application in fingerprinting, diversity studies and marker-assisted breeding of Lavandula.

  8. tropiTree: An NGS-Based EST-SSR Resource for 24 Tropical Tree Species

    Science.gov (United States)

    Russell, Joanne R.; Hedley, Peter E.; Cardle, Linda; Dancey, Siobhan; Morris, Jenny; Booth, Allan; Odee, David; Mwaura, Lucy; Omondi, William; Angaine, Peter; Machua, Joseph; Muchugi, Alice; Milne, Iain; Kindt, Roeland; Jamnadass, Ramni; Dawson, Ian K.

    2014-01-01

    The development of genetic tools for non-model organisms has been hampered by cost, but advances in next-generation sequencing (NGS) have created new opportunities. In ecological research, this raises the prospect for developing molecular markers to simultaneously study important genetic processes such as gene flow in multiple non-model plant species within complex natural and anthropogenic landscapes. Here, we report the use of bar-coded multiplexed paired-end Illumina NGS for the de novo development of expressed sequence tag-derived simple sequence repeat (EST-SSR) markers at low cost for a range of 24 tree species. Each chosen tree species is important in complex tropical agroforestry systems where little is currently known about many genetic processes. An average of more than 5,000 EST-SSRs was identified for each of the 24 sequenced species, whereas prior to analysis 20 of the species had fewer than 100 nucleotide sequence citations. To make results available to potential users in a suitable format, we have developed an open-access, interactive online database, tropiTree (http://bioinf.hutton.ac.uk/tropiTree), which has a range of visualisation and search facilities, and which is a model for the efficient presentation and application of NGS data. PMID:25025376

  9. DEVELOPMENT OF EST-SSR MARKERS TO ASSESS GENETIC DIVERSITY OF BROCCOLI AND ITS RELATED SPECIES

    Directory of Open Access Journals (Sweden)

    Nur Kholilatul Izzah

    2017-01-01

    Full Text Available Development of Expressed Sequence Tag-Simple Sequence Repeat (EST-SSR markers derived from public database is known to be more efficient, faster and low cost. The objective of this study was to generate a new set of EST-SSR markers for broccoli and its related species and their usefulness for assessing their genetic diversity. A total of 202 Brassica oleracea ESTs were retrieved from NCBI and then assembled into 172 unigenes by means of CAP3 program. Identification of SSRs was carried out using web-based tool, RepeatMasker software. Afterwards, EST-SSR markers were developed using Primer3 program. Among the identified SSRs, trinucleotide repeats were the most common repeat types, which accounted for about 50%. A total of eight primer pairs were successfully designed and yielded amplification products. Among them, five markers were polymorphic and displayed a total of 30 alleles with an average number of six alleles per locus. The polymorphic markers were subsequently used for analyzing genetic diversity of 36 B. oleracea cultivars including 22 broccoli, five cauliflower and nine kohlrabi cultivars based on genetic similarity matrix as implemented in NTSYS program. At similarity coefficient of 61%, a UPGMA clustering dendrogram effectively separated 36 genotypes into three main groups, where 30 out of 36 genotypes were clearly discriminated. The result obtained in the present study would help breeders in selecting parental lines for crossing. Moreover, the novel EST-SSR markers developed in the study could be a valuable tool for differentiating cultivars of broccoli and related species.

  10. The mining of toxin-like polypeptides from EST database by single residue distribution analysis.

    Science.gov (United States)

    Kozlov, Sergey; Grishin, Eugene

    2011-01-31

    Novel high throughput sequencing technologies require permanent development of bioinformatics data processing methods. Among them, rapid and reliable identification of encoded proteins plays a pivotal role. To search for particular protein families, the amino acid sequence motifs suitable for selective screening of nucleotide sequence databases may be used. In this work, we suggest a novel method for simplified representation of protein amino acid sequences named Single Residue Distribution Analysis, which is applicable both for homology search and database screening. Using the procedure developed, a search for amino acid sequence motifs in sea anemone polypeptides was performed, and 14 different motifs with broad and low specificity were discriminated. The adequacy of motifs for mining toxin-like sequences was confirmed by their ability to identify 100% toxin-like anemone polypeptides in the reference polypeptide database. The employment of novel motifs for the search of polypeptide toxins in Anemonia viridis EST dataset allowed us to identify 89 putative toxin precursors. The translated and modified ESTs were scanned using a special algorithm. In addition to direct comparison with the motifs developed, the putative signal peptides were predicted and homology with known structures was examined. The suggested method may be used to retrieve structures of interest from the EST databases using simple amino acid sequence motifs as templates. The efficiency of the procedure for directed search of polypeptides is higher than that of most currently used methods. Analysis of 39939 ESTs of sea anemone Anemonia viridis resulted in identification of five protein precursors of earlier described toxins, discovery of 43 novel polypeptide toxins, and prediction of 39 putative polypeptide toxin sequences. In addition, two precursors of novel peptides presumably displaying neuronal function were disclosed.

  11. The mining of toxin-like polypeptides from EST database by single residue distribution analysis

    Directory of Open Access Journals (Sweden)

    Grishin Eugene

    2011-01-01

    Full Text Available Abstract Background Novel high throughput sequencing technologies require permanent development of bioinformatics data processing methods. Among them, rapid and reliable identification of encoded proteins plays a pivotal role. To search for particular protein families, the amino acid sequence motifs suitable for selective screening of nucleotide sequence databases may be used. In this work, we suggest a novel method for simplified representation of protein amino acid sequences named Single Residue Distribution Analysis, which is applicable both for homology search and database screening. Results Using the procedure developed, a search for amino acid sequence motifs in sea anemone polypeptides was performed, and 14 different motifs with broad and low specificity were discriminated. The adequacy of motifs for mining toxin-like sequences was confirmed by their ability to identify 100% toxin-like anemone polypeptides in the reference polypeptide database. The employment of novel motifs for the search of polypeptide toxins in Anemonia viridis EST dataset allowed us to identify 89 putative toxin precursors. The translated and modified ESTs were scanned using a special algorithm. In addition to direct comparison with the motifs developed, the putative signal peptides were predicted and homology with known structures was examined. Conclusions The suggested method may be used to retrieve structures of interest from the EST databases using simple amino acid sequence motifs as templates. The efficiency of the procedure for directed search of polypeptides is higher than that of most currently used methods. Analysis of 39939 ESTs of sea anemone Anemonia viridis resulted in identification of five protein precursors of earlier described toxins, discovery of 43 novel polypeptide toxins, and prediction of 39 putative polypeptide toxin sequences. In addition, two precursors of novel peptides presumably displaying neuronal function were disclosed.

  12. Comparative high-throughput transcriptome sequencing and development of SiESTa, the Silene EST annotation database

    Directory of Open Access Journals (Sweden)

    Marais Gabriel AB

    2011-07-01

    Full Text Available Abstract Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO terms, and thousands of single-nucleotide polymorphisms (SNPs were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49% that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch. Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to

  13. Comparative high-throughput transcriptome sequencing and development of SiESTa, the Silene EST annotation database

    Science.gov (United States)

    2011-01-01

    Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch. Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to further develop Silene as a

  14. ESTminer: a Web interface for mining EST contig and cluster databases.

    Science.gov (United States)

    Huang, Yecheng; Pumphrey, Janie; Gingle, Alan R

    2005-03-01

    ESTminer is a Web application and database schema for interactive mining of expressed sequence tag (EST) contig and cluster datasets. The Web interface contains a query frame that allows the selection of contigs/clusters with specific cDNA library makeup or a threshold number of members. The results are displayed as color-coded tree nodes, where the color indicates the fractional size of each cDNA library component. The nodes are expandable, revealing library statistics as well as EST or contig members, with links to sequence data, GenBank records or user configurable links. Also, the interface allows 'queries within queries' where the result set of a query is further filtered by the subsequent query. ESTminer is implemented in Java/JSP and the package, including MySQL and Oracle schema creation scripts, is available from http://cggc.agtec.uga.edu/Data/download.asp agingle@uga.edu.

  15. An elm EST database for identifying leaf beetle egg-induced defense genes

    Directory of Open Access Journals (Sweden)

    Büchel Kerstin

    2012-06-01

    Full Text Available Abstract Background Plants can defend themselves against herbivorous insects prior to the onset of larval feeding by responding to the eggs laid on their leaves. In the European field elm (Ulmus minor, egg laying by the elm leaf beetle ( Xanthogaleruca luteola activates the emission of volatiles that attract specialised egg parasitoids, which in turn kill the eggs. Little is known about the transcriptional changes that insect eggs trigger in plants and how such indirect defense mechanisms are orchestrated in the context of other biological processes. Results Here we present the first large scale study of egg-induced changes in the transcriptional profile of a tree. Five cDNA libraries were generated from leaves of (i untreated control elms, and elms treated with (ii egg laying and feeding by elm leaf beetles, (iii feeding, (iv artificial transfer of egg clutches, and (v methyl jasmonate. A total of 361,196 ESTs expressed sequence tags (ESTs were identified which clustered into 52,823 unique transcripts (Unitrans and were stored in a database with a public web interface. Among the analyzed Unitrans, 73% could be annotated by homology to known genes in the UniProt (Plant database, particularly to those from Vitis, Ricinus, Populus and Arabidopsis. Comparative in silico analysis among the different treatments revealed differences in Gene Ontology term abundances. Defense- and stress-related gene transcripts were present in high abundance in leaves after herbivore egg laying, but transcripts involved in photosynthesis showed decreased abundance. Many pathogen-related genes and genes involved in phytohormone signaling were expressed, indicative of jasmonic acid biosynthesis and activation of jasmonic acid responsive genes. Cross-comparisons between different libraries based on expression profiles allowed the identification of genes with a potential relevance in egg-induced defenses, as well as other biological processes, including signal transduction

  16. An elm EST database for identifying leaf beetle egg-induced defense genes.

    Science.gov (United States)

    Büchel, Kerstin; McDowell, Eric; Nelson, Will; Descour, Anne; Gershenzon, Jonathan; Hilker, Monika; Soderlund, Carol; Gang, David R; Fenning, Trevor; Meiners, Torsten

    2012-06-15

    Plants can defend themselves against herbivorous insects prior to the onset of larval feeding by responding to the eggs laid on their leaves. In the European field elm (Ulmus minor), egg laying by the elm leaf beetle ( Xanthogaleruca luteola) activates the emission of volatiles that attract specialised egg parasitoids, which in turn kill the eggs. Little is known about the transcriptional changes that insect eggs trigger in plants and how such indirect defense mechanisms are orchestrated in the context of other biological processes. Here we present the first large scale study of egg-induced changes in the transcriptional profile of a tree. Five cDNA libraries were generated from leaves of (i) untreated control elms, and elms treated with (ii) egg laying and feeding by elm leaf beetles, (iii) feeding, (iv) artificial transfer of egg clutches, and (v) methyl jasmonate. A total of 361,196 ESTs expressed sequence tags (ESTs) were identified which clustered into 52,823 unique transcripts (Unitrans) and were stored in a database with a public web interface. Among the analyzed Unitrans, 73% could be annotated by homology to known genes in the UniProt (Plant) database, particularly to those from Vitis, Ricinus, Populus and Arabidopsis. Comparative in silico analysis among the different treatments revealed differences in Gene Ontology term abundances. Defense- and stress-related gene transcripts were present in high abundance in leaves after herbivore egg laying, but transcripts involved in photosynthesis showed decreased abundance. Many pathogen-related genes and genes involved in phytohormone signaling were expressed, indicative of jasmonic acid biosynthesis and activation of jasmonic acid responsive genes. Cross-comparisons between different libraries based on expression profiles allowed the identification of genes with a potential relevance in egg-induced defenses, as well as other biological processes, including signal transduction, transport and primary metabolism

  17. Statistics information of rice EST mapping results - RGP estmap2001 | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us RGP estmap2001 Statistics information of rice EST mapping results Data detail Data name Statistics...of This Database Site Policy | Contact Us Statistics information of rice EST mapping results - RGP estmap2001 | LSDB Archive ...

  18. Candidate gene database and transcript map for peach, a model species for fruit trees.

    Science.gov (United States)

    Horn, Renate; Lecouls, Anne-Claire; Callahan, Ann; Dandekar, Abhaya; Garay, Lilibeth; McCord, Per; Howad, Werner; Chan, Helen; Verde, Ignazio; Main, Doreen; Jung, Sook; Georgi, Laura; Forrest, Sam; Mook, Jennifer; Zhebentyayeva, Tatyana; Yu, Yeisoo; Kim, Hye Ran; Jesudurai, Christopher; Sosinski, Bryon; Arús, Pere; Baird, Vance; Parfitt, Dan; Reighard, Gregory; Scorza, Ralph; Tomkins, Jeffrey; Wing, Rod; Abbott, Albert Glenn

    2005-05-01

    Peach (Prunus persica) is a model species for the Rosaceae, which includes a number of economically important fruit tree species. To develop an extensive Prunus expressed sequence tag (EST) database for identifying and cloning the genes important to fruit and tree development, we generated 9,984 high-quality ESTs from a peach cDNA library of developing fruit mesocarp. After assembly and annotation, a putative peach unigene set consisting of 3,842 ESTs was defined. Gene ontology (GO) classification was assigned based on the annotation of the single "best hit" match against the Swiss-Prot database. No significant homology could be found in the GenBank nr databases for 24.3% of the sequences. Using core markers from the general Prunus genetic map, we anchored bacterial artificial chromosome (BAC) clones on the genetic map, thereby providing a framework for the construction of a physical and transcript map. A transcript map was developed by hybridizing 1,236 ESTs from the putative peach unigene set and an additional 68 peach cDNA clones against the peach BAC library. Hybridizing ESTs to genetically anchored BACs immediately localized 11.2% of the ESTs on the genetic map. ESTs showed a clustering of expressed genes in defined regions of the linkage groups. [The data were built into a regularly updated Genome Database for Rosaceae (GDR), available at (http://www.genome.clemson.edu/gdr/).].

  19. Ontology and diversity of transcript-associated microsatellites mined from a globe artichoke EST database

    Science.gov (United States)

    Scaglione, Davide; Acquadro, Alberto; Portis, Ezio; Taylor, Christopher A; Lanteri, Sergio; Knapp, Steven J

    2009-01-01

    Background The globe artichoke (Cynara cardunculus var. scolymus L.) is a significant crop in the Mediterranean basin. Despite its commercial importance and its both dietary and pharmaceutical value, knowledge of its genetics and genomics remains scant. Microsatellite markers have become a key tool in genetic and genomic analysis, and we have exploited recently acquired EST (expressed sequence tag) sequence data (Composite Genome Project - CGP) to develop an extensive set of microsatellite markers. Results A unigene assembly was created from over 36,000 globe artichoke EST sequences, containing 6,621 contigs and 12,434 singletons. Over 12,000 of these unigenes were functionally assigned on the basis of homology with Arabidopsis thaliana reference proteins. A total of 4,219 perfect repeats, located within 3,308 unigenes was identified and the gene ontology (GO) analysis highlighted some GO term's enrichments among different classes of microsatellites with respect to their position. Sufficient flanking sequence was available to enable the design of primers to amplify 2,311 of these microsatellites, and a set of 300 was tested against a DNA panel derived from 28 C. cardunculus genotypes. Consistent amplification and polymorphism was obtained from 236 of these assays. Their polymorphic information content (PIC) ranged from 0.04 to 0.90 (mean 0.66). Between 176 and 198 of the assays were informative in at least one of the three available mapping populations. Conclusion EST-based microsatellites have provided a large set of de novo genetic markers, which show significant amounts of polymorphism both between and within the three taxa of C. cardunculus. They are thus well suited as assays for phylogenetic analysis, the construction of genetic maps, marker-assisted breeding, transcript mapping and other genomic applications in the species. PMID:19785740

  20. Validation of dbEST-SSRs and transferability of some other solanaceous species SSR in ashwagandha [Withania Somnifera (L.) Dunal].

    Science.gov (United States)

    Parmar, Eva K; Fougat, Ranbir S; Patel, Chandni B; Zala, Harshvardhan N; Patel, Mahesh A; Patel, Swati K; Kumar, Sushil

    2015-12-01

    Cross-species transferability and expressed sequence tags (ESTs) in public databases are cost-effective means for developing simple sequence repeats (SSRs) for less-studied species like medicinal plants. In this study, 11 EST-SSR markers developed from 742 available ESTs of Withania Somnifera EST sequences and 95 SSR primer pairs derived from other solanaceous crops (tomato, eggplant, chili, and tobacco) were utilized for their amplification and validation. Out of 11, 10 EST-SSRs showed good amplification quality and produced 13 loci with a product size ranging between 167 and 291 bp. Similarly, of the 95 cross-genera SSR loci assayed, 20 (21 %) markers showed the transferability of 5, 27, 32, and 14.2 % from eggplant, chili, tomato, and tobacco, respectively, to ashwagandha. In toto, these 30 SSR markers reported here will be valuable resources and may be applicable for the analysis of intra- and inter-specific genetic diversity in ashwagandha for which till date no information about SSR is available.

  1. SPECIES DATABASES AND THE BIOINFORMATICS REVOLUTION.

    Science.gov (United States)

    Biological databases are having a growth spurt. Much of this results from research in genetics and biodiversity, coupled with fast-paced developments in information technology. The revolution in bioinformatics, defined by Sugden and Pennisi (2000) as the "tools and techniques for...

  2. A global database of ant species abundances

    Czech Academy of Sciences Publication Activity Database

    Gibb, H.; Dunn, R. R.; Sanders, N. J.; Grossman, B. F.; Photakis, M.; Abril, S.; Agosti, D.; Andersen, A. N.; Angulo, E.; Armbrecht, I.; Arnan, X.; Baccaro, F. B.; Bishop, T. R.; Boulay, R.; Brühl, C.; Castracani, C.; Cerdá, X.; Del Toro, I.; Delsinne, T.; Diaz, M.; Donoso, D. A.; Ellison, A. M.; Enríquez, M. L.; Fayle, Tom Maurice; Feener, D. H.; Fisher, B. L.; Fisher, R. N.; Fitzpatrick, M. C.; Gómez, C.; Gotelli, N. J.; Gove, A.; Grasso, D. A.; Groc, S.; Guenard, B.; Gunawardene, N.; Heterick, B.; Hoffmann, B.; Janda, Milan; Jenkins, C.; Kaspari, M.; Klimeš, Petr; Lach, L.; Laeger, T.; Lattke, J.; Leponce, M.; Lessard, J.-P.; Longino, J.; Lucky, A.; Luke, S. H.; Majer, J.; McGlynn, T. P.; Menke, S.; Mezger, D.; Mori, A.; Moses, Jimmy; Munyai, T. C.; Pacheco, R.; Paknia, O.; Pearce-Duvet, J.; Pfeiffer, M.; Philpott, S. M.; Resasco, J.; Retana, J.; Silva, R. R.; Sorger, M. D.; Souza, J.; Suarez, A.; Tista, M.; Vasconcelos, H. L.; Vonshak, M.; Weisser, M. D.; Yates, M.; Parr, C. L.

    2017-01-01

    Roč. 98, č. 3 (2017), s. 883-884 ISSN 0012-9658 R&D Projects: GA ČR GB14-36098G; GA ČR GAP505/12/2467; GA ČR GPP505/12/P875 Institutional support: RVO:60077344 Keywords : abundance * ants * database Subject RIV: EH - Ecology, Behaviour OBOR OECD: Ecology Impact factor: 4.809, year: 2016 http://onlinelibrary.wiley.com/doi/10.1002/ecy.1682/abstract

  3. Generation of a predicted protein database from EST data and application to iTRAQ analyses in grape (Vitis vinifera cv. Cabernet Sauvignon) berries at ripening initiation

    Science.gov (United States)

    Lücker, Joost; Laszczak, Mario; Smith, Derek; Lund, Steven T

    2009-01-01

    Background iTRAQ is a proteomics technique that uses isobaric tags for relative and absolute quantitation of tryptic peptides. In proteomics experiments, the detection and high confidence annotation of proteins and the significance of corresponding expression differences can depend on the quality and the species specificity of the tryptic peptide map database used for analysis of the data. For species for which finished genome sequence data are not available, identification of proteins relies on similarity to proteins from other species using comprehensive peptide map databases such as the MSDB. Results We were interested in characterizing ripening initiation ('veraison') in grape berries at the protein level in order to better define the molecular control of this important process for grape growers and wine makers. We developed a bioinformatic pipeline for processing EST data in order to produce a predicted tryptic peptide database specifically targeted to the wine grape cultivar, Vitis vinifera cv. Cabernet Sauvignon, and lacking truncated N- and C-terminal fragments. By searching iTRAQ MS/MS data generated from berry exocarp and mesocarp samples at ripening initiation, we determined that implementation of the custom database afforded a large improvement in high confidence peptide annotation in comparison to the MSDB. We used iTRAQ MS/MS in conjunction with custom peptide db searches to quantitatively characterize several important pathway components for berry ripening previously described at the transcriptional level and confirmed expression patterns for these at the protein level. Conclusion We determined that a predicted peptide database for MS/MS applications can be derived from EST data using advanced clustering and trimming approaches and successfully implemented for quantitative proteome profiling. Quantitative shotgun proteome profiling holds great promise for characterizing biological processes such as fruit ripening initiation and may be further

  4. Generation of a predicted protein database from EST data and application to iTRAQ analyses in grape (Vitis vinifera cv. Cabernet Sauvignon berries at ripening initiation

    Directory of Open Access Journals (Sweden)

    Smith Derek

    2009-01-01

    Full Text Available Abstract Background iTRAQ is a proteomics technique that uses isobaric tags for relative and absolute quantitation of tryptic peptides. In proteomics experiments, the detection and high confidence annotation of proteins and the significance of corresponding expression differences can depend on the quality and the species specificity of the tryptic peptide map database used for analysis of the data. For species for which finished genome sequence data are not available, identification of proteins relies on similarity to proteins from other species using comprehensive peptide map databases such as the MSDB. Results We were interested in characterizing ripening initiation ('veraison' in grape berries at the protein level in order to better define the molecular control of this important process for grape growers and wine makers. We developed a bioinformatic pipeline for processing EST data in order to produce a predicted tryptic peptide database specifically targeted to the wine grape cultivar, Vitis vinifera cv. Cabernet Sauvignon, and lacking truncated N- and C-terminal fragments. By searching iTRAQ MS/MS data generated from berry exocarp and mesocarp samples at ripening initiation, we determined that implementation of the custom database afforded a large improvement in high confidence peptide annotation in comparison to the MSDB. We used iTRAQ MS/MS in conjunction with custom peptide db searches to quantitatively characterize several important pathway components for berry ripening previously described at the transcriptional level and confirmed expression patterns for these at the protein level. Conclusion We determined that a predicted peptide database for MS/MS applications can be derived from EST data using advanced clustering and trimming approaches and successfully implemented for quantitative proteome profiling. Quantitative shotgun proteome profiling holds great promise for characterizing biological processes such as fruit ripening

  5. A global database of ant species abundances

    Science.gov (United States)

    Gibb, Heloise; Dunn, Rob R.; Sanders, Nathan J.; Grossman, Blair F.; Photakis, Manoli; Abril, Silvia; Agosti, Donat; Andersen, Alan N.; Angulo, Elena; Armbrecht, Ingre; Arnan, Xavier; Baccaro, Fabricio B.; Bishop, Tom R.; Boulay, Raphael; Bruhl, Carsten; Castracani, Cristina; Cerda, Xim; Del Toro, Israel; Delsinne, Thibaut; Diaz, Mireia; Donoso, David A.; Ellison, Aaron M.; Enriquez, Martha L.; Fayle, Tom M.; Feener Jr., Donald H.; Fisher, Brian L.; Fisher, Robert N.; Fitpatrick, Matthew C.; Gomez, Cristanto; Gotelli, Nicholas J.; Gove, Aaron; Grasso, Donato A.; Groc, Sarah; Guenard, Benoit; Gunawardene, Nihara; Heterick, Brian; Hoffmann, Benjamin; Janda, Milan; Jenkins, Clinton; Kaspari, Michael; Klimes, Petr; Lach, Lori; Laeger, Thomas; Lattke, John; Leponce, Maurice; Lessard, Jean-Philippe; Longino, John; Lucky, Andrea; Luke, Sarah H.; Majer, Jonathan; McGlynn, Terrence P.; Menke, Sean; Mezger, Dirk; Mori, Alessandra; Moses, Jimmy; Munyai, Thinandavha Caswell; Pacheco, Renata; Paknia, Omid; Pearce-Duvet, Jessica; Pfeiffer, Martin; Philpott, Stacy M.; Resasco, Julian; Retana, Javier; Silva, Rogerio R.; Sorger, Magdalena D.; Souza, Jorge; Suarez, Andrew V.; Tista, Melanie; Vasconcelos, Heraldo L.; Vonshak, Merav; Weiser, Michael D.; Yates, Michelle; Parr, Catherine L.

    2017-01-01

    What forces structure ecological assemblages? A key limitation to general insights about assemblage structure is the availability of data that are collected at a small spatial grain (local assemblages) and a large spatial extent (global coverage). Here, we present published and unpublished data from 51,388 ant abundance and occurrence records of more than 2693 species and 7953 morphospecies from local assemblages collected at 4212 locations around the world. Ants were selected because they are diverse and abundant globally, comprise a large fraction of animal biomass in most terrestrial communities, and are key contributors to a range of ecosystem functions. Data were collected between 1949 and 2014, and include, for each geo-referenced sampling site, both the identity of the ants collected and details of sampling design, habitat type and degree of disturbance. The aim of compiling this dataset was to provide comprehensive species abundance data in order to test relationships between assemblage structure and environmental and biogeographic factors. Data were collected using a variety of standardised methods, such as pitfall and Winkler traps, and will be valuable for studies investigating large-scale forces structuring local assemblages. Understanding such relationships is particularly critical under current rates of global change. We encourage authors holding additional data on systematically collected ant assemblages, especially those in dry and cold, and remote areas, to contact us and contribute their data to this growing dataset.

  6. PineElm_SSRdb: a microsatellite marker database identified from genomic, chloroplast, mitochondrial and EST sequences of pineapple (Ananas comosus (L.) Merrill).

    Science.gov (United States)

    Chaudhary, Sakshi; Mishra, Bharat Kumar; Vivek, Thiruvettai; Magadum, Santoshkumar; Yasin, Jeshima Khan

    2016-01-01

    Simple Sequence Repeats or microsatellites are resourceful molecular genetic markers. There are only few reports of SSR identification and development in pineapple. Complete genome sequence of pineapple available in the public domain can be used to develop numerous novel SSRs. Therefore, an attempt was made to identify SSRs from genomic, chloroplast, mitochondrial and EST sequences of pineapple which will help in deciphering genetic makeup of its germplasm resources. A total of 359511 SSRs were identified in pineapple (356385 from genome sequence, 45 from chloroplast sequence, 249 in mitochondrial sequence and 2832 from EST sequences). The list of EST-SSR markers and their details are available in the database. PineElm_SSRdb is an open source database available for non-commercial academic purpose at http://app.bioelm.com/ with a mapping tool which can develop circular maps of selected marker set. This database will be of immense use to breeders, researchers and graduates working on Ananas spp. and to others working on cross-species transferability of markers, investigating diversity, mapping and DNA fingerprinting.

  7. Transcriptome analysis of the desert locust central nervous system: production and annotation of a Schistocerca gregaria EST database.

    Science.gov (United States)

    Badisco, Liesbeth; Huybrechts, Jurgen; Simonet, Gert; Verlinden, Heleen; Marchal, Elisabeth; Huybrechts, Roger; Schoofs, Liliane; De Loof, Arnold; Vanden Broeck, Jozef

    2011-03-21

    The desert locust (Schistocerca gregaria) displays a fascinating type of phenotypic plasticity, designated as 'phase polyphenism'. Depending on environmental conditions, one genome can be translated into two highly divergent phenotypes, termed the solitarious and gregarious (swarming) phase. Although many of the underlying molecular events remain elusive, the central nervous system (CNS) is expected to play a crucial role in the phase transition process. Locusts have also proven to be interesting model organisms in a physiological and neurobiological research context. However, molecular studies in locusts are hampered by the fact that genome/transcriptome sequence information available for this branch of insects is still limited. We have generated 34,672 raw expressed sequence tags (EST) from the CNS of desert locusts in both phases. These ESTs were assembled in 12,709 unique transcript sequences and nearly 4,000 sequences were functionally annotated. Moreover, the obtained S. gregaria EST information is highly complementary to the existing orthopteran transcriptomic data. Since many novel transcripts encode neuronal signaling and signal transduction components, this paper includes an overview of these sequences. Furthermore, several transcripts being differentially represented in solitarious and gregarious locusts were retrieved from this EST database. The findings highlight the involvement of the CNS in the phase transition process and indicate that this novel annotated database may also add to the emerging knowledge of concomitant neuronal signaling and neuroplasticity events. In summary, we met the need for novel sequence data from desert locust CNS. To our knowledge, we hereby also present the first insect EST database that is derived from the complete CNS. The obtained S. gregaria EST data constitute an important new source of information that will be instrumental in further unraveling the molecular principles of phase polyphenism, in further establishing

  8. Transcriptome analysis of the desert locust central nervous system: production and annotation of a Schistocerca gregaria EST database.

    Directory of Open Access Journals (Sweden)

    Liesbeth Badisco

    Full Text Available BACKGROUND: The desert locust (Schistocerca gregaria displays a fascinating type of phenotypic plasticity, designated as 'phase polyphenism'. Depending on environmental conditions, one genome can be translated into two highly divergent phenotypes, termed the solitarious and gregarious (swarming phase. Although many of the underlying molecular events remain elusive, the central nervous system (CNS is expected to play a crucial role in the phase transition process. Locusts have also proven to be interesting model organisms in a physiological and neurobiological research context. However, molecular studies in locusts are hampered by the fact that genome/transcriptome sequence information available for this branch of insects is still limited. METHODOLOGY: We have generated 34,672 raw expressed sequence tags (EST from the CNS of desert locusts in both phases. These ESTs were assembled in 12,709 unique transcript sequences and nearly 4,000 sequences were functionally annotated. Moreover, the obtained S. gregaria EST information is highly complementary to the existing orthopteran transcriptomic data. Since many novel transcripts encode neuronal signaling and signal transduction components, this paper includes an overview of these sequences. Furthermore, several transcripts being differentially represented in solitarious and gregarious locusts were retrieved from this EST database. The findings highlight the involvement of the CNS in the phase transition process and indicate that this novel annotated database may also add to the emerging knowledge of concomitant neuronal signaling and neuroplasticity events. CONCLUSIONS: In summary, we met the need for novel sequence data from desert locust CNS. To our knowledge, we hereby also present the first insect EST database that is derived from the complete CNS. The obtained S. gregaria EST data constitute an important new source of information that will be instrumental in further unraveling the molecular

  9. Characterization and multiplexing of EST-SSR primers in Cynodon (Poaceae) species1.

    Science.gov (United States)

    Jewell, Margaret C; Frere, Celine H; Prentis, Peter J; Lambrides, Christopher J; Godwin, Ian D

    2010-10-01

    Cynodon species are multiple-use grasses that display varying levels of adaptation to biotic and abiotic stress. Previously identified EST-SSR primers were characterized and multiplexed to assess the level of genetic diversity present within a collection of almost 1200 Cynodon accessions from across Australia. • Two multiplex reactions were developed comprising a total of 16 EST-SSR markers. All SSR markers amplified across different Cynodon species and different levels of ploidy. The number of alleles ranged from one to eight per locus and the total number of alleles for the germplasm collection was 79. • The 16 markers show sufficient variation for the characterization of Cynodon core collections and analysis of population genetic diversity in Cynodon grasses.

  10. Gene Discovery in the Apicomplexa as Revealed by EST Sequencing and Assembly of a Comparative Gene Database

    Science.gov (United States)

    Li, Li; Brunk, Brian P.; Kissinger, Jessica C.; Pape, Deana; Tang, Keliang; Cole, Robert H.; Martin, John; Wylie, Todd; Dante, Mike; Fogarty, Steven J.; Howe, Daniel K.; Liberator, Paul; Diaz, Carmen; Anderson, Jennifer; White, Michael; Jerome, Maria E.; Johnson, Emily A.; Radke, Jay A.; Stoeckert, Christian J.; Waterston, Robert H.; Clifton, Sandra W.; Roos, David S.; Sibley, L. David

    2003-01-01

    Large-scale EST sequencing projects for several important parasites within the phylum Apicomplexa were undertaken for the purpose of gene discovery. Included were several parasites of medical importance (Plasmodium falciparum, Toxoplasma gondii) and others of veterinary importance (Eimeria tenella, Sarcocystis neurona, and Neospora caninum). A total of 55,192 ESTs, deposited into dbEST/GenBank, were included in the analyses. The resulting sequences have been clustered into nonredundant gene assemblies and deposited into a relational database that supports a variety of sequence and text searches. This database has been used to compare the gene assemblies using BLAST similarity comparisons to the public protein databases to identify putative genes. Of these new entries, ∼15%–20% represent putative homologs with a conservative cutoff of p neurona: , , , , , , , , , , , , , –, –, –, –, –. Eimeria tenella: –, –, –, –, –, –, –, –, – , –, –, –, –, –, –, –, –, –, –, –. Neospora caninum: –, –, , – , –, –.] PMID:12618375

  11. EST Table - KAIKOcDNA | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available uences registered to public database as of September 2011. Data file File name: kaiko_cdna_main.zip File URL...: ftp://ftp.biosciencedbc.jp/archive/kaiko-cdna/LATEST/kaiko_cdna_main.zip File s...ize: 157 MB Simple search URL http://togodb.biosciencedbc.jp/togodb/view/kaiko_cdna_main#en Data acquisition

  12. Identification and genetic mapping of highly polymorphic microsatellite loci from an EST database of the septoria tritici blotch pathogen Mycosphaerella graminicola.

    Science.gov (United States)

    Goodwin, Stephen B; van der Lee, Theo A J; Cavaletto, Jessica R; Te Lintel Hekkert, Bas; Crane, Charles F; Kema, Gert H J

    2007-05-01

    A database of 30,137 EST sequences from Mycosphaerella graminicola, the septoria tritici blotch fungus of wheat, was scanned with a custom software pipeline for di- and trinucleotide units repeated tandemly six or more times. The bioinformatics analysis identified 109 putative SSR loci, and for 99 of them, flanking primers were developed successfully and tested for amplification and polymorphism by PCR on five field isolates of diverse origin, including the parents of the standard M. graminicola mapping population. Seventy-seven of the 99 primer pairs generated an easily scored banding pattern and 51 were polymorphic, with up to four alleles per locus, among the isolates tested. Among these 51 loci, 23 were polymorphic between the parents of the mapping population. Twenty-one of these as well as two previously published microsatellite loci were positioned on the existing genetic linkage map of M. graminicola on 13 of the 24 linkage groups. Most (66%) of the primer pairs also amplified bands in the closely related barley pathogen Septoria passerinii, but only six were polymorphic among four isolates tested. A subset of the primer pairs also revealed polymorphisms when tested with DNA from the related banana black leaf streak (Black Sigatoka) pathogen, M. fijiensis. The EST database provided an excellent source of new, highly polymorphic microsatellite markers that can be multiplexed for high-throughput genetic analyses of M. graminicola and related species.

  13. Numeric databases on the kinetics of transient species in solution

    International Nuclear Information System (INIS)

    Helman, W.P.; Hug, G.L.; Carmichael, Ian; Ross, A.B.

    1988-01-01

    A description is given of data compilations on the kinetics of transient species in solution. In particular information is available for the reactions of radicals in aqueous solution and for excited states such as singlet molecular oxygen and those of metal complexes in solution. Methods for compilation and use of the information in computer-readable form are also described. Emphasis is placed on making the database available for online searching. (author)

  14. Annotation of novel neuropeptide precursors in the migratory locust based on transcript screening of a public EST database and mass spectrometry

    Directory of Open Access Journals (Sweden)

    De Loof Arnold

    2006-08-01

    Full Text Available Abstract Background For holometabolous insects there has been an explosion of proteomic and peptidomic information thanks to large genome sequencing projects. Heterometabolous insects, although comprising many important species, have been far less studied. The migratory locust Locusta migratoria, a heterometabolous insect, is one of the most infamous agricultural pests. They undergo a well-known and profound phase transition from the relatively harmless solitary form to a ferocious gregarious form. The underlying regulatory mechanisms of this phase transition are not fully understood, but it is undoubtedly that neuropeptides are involved. However, neuropeptide research in locusts is hampered by the absence of genomic information. Results Recently, EST (Expressed Sequence Tag databases from Locusta migratoria were constructed. Using bioinformatical tools, we searched these EST databases specifically for neuropeptide precursors. Based on known locust neuropeptide sequences, we confirmed the sequence of several previously identified neuropeptide precursors (i.e. pacifastin-related peptides, which consolidated our method. In addition, we found two novel neuroparsin precursors and annotated the hitherto unknown tachykinin precursor. Besides one of the known tachykinin peptides, this EST contained an additional tachykinin-like sequence. Using neuropeptide precursors from Drosophila melanogaster as a query, we succeeded in annotating the Locusta neuropeptide F, allatostatin-C and ecdysis-triggering hormone precursor, which until now had not been identified in locusts or in any other heterometabolous insect. For the tachykinin precursor, the ecdysis-triggering hormone precursor and the allatostatin-C precursor, translation of the predicted neuropeptides in neural tissues was confirmed with mass spectrometric techniques. Conclusion In this study we describe the annotation of 6 novel neuropeptide precursors and the neuropeptides they encode from the

  15. Differential transferability of EST-SSR primers developed from diploid species Pseudoroegneria spicata, Thinopyrum bessarabicum, and Th. elongatum

    Science.gov (United States)

    Simple sequence repeat technology based on expressed sequence tag (EST-SSR) is a useful genomic tool for genome mapping, characterizing plant species relationships, elucidating genome evolution, and tracing genes on alien chromosome segments. EST-SSR primers developed from three perennial diploid T...

  16. DataTri, a database of American triatomine species occurrence

    Science.gov (United States)

    Ceccarelli, Soledad; Balsalobre, Agustín; Medone, Paula; Cano, María Eugenia; Gurgel Gonçalves, Rodrigo; Feliciangeli, Dora; Vezzani, Darío; Wisnivesky-Colli, Cristina; Gorla, David E.; Marti, Gerardo A.; Rabinovich, Jorge E.

    2018-04-01

    Trypanosoma cruzi, the causative agent of Chagas disease, is transmitted to mammals - including humans - by insect vectors of the subfamily Triatominae. We present the results of a compilation of triatomine occurrence and complementary ecological data that represents the most complete, integrated and updated database (DataTri) available on triatomine species at a continental scale. This database was assembled by collecting the records of triatomine species published from 1904 to 2017, spanning all American countries with triatomine presence. A total of 21815 georeferenced records were obtained from published literature, personal fieldwork and data provided by colleagues. The data compiled includes 24 American countries, 14 genera and 135 species. From a taxonomic perspective, 67.33% of the records correspond to the genus Triatoma, 20.81% to Panstrongylus, 9.01% to Rhodnius and the remaining 2.85% are distributed among the other 11 triatomine genera. We encourage using DataTri information in various areas, especially to improve knowledge of the geographical distribution of triatomine species and its variations in time.

  17. Predicted sub-populations in a marine shrimp proteome as revealed by combined EST and cDNA data from multiple Penaeus species

    Directory of Open Access Journals (Sweden)

    Kotewong Rattanawadee

    2010-11-01

    Full Text Available Abstract Background Many species of marine shrimp in the Family Penaeidae, viz. Penaeus (Litopenaeus vannamei, Penaeus monodon, Penaeus (Fenneropenaeus chinensis, and Penaeus (Marsupenaeus japonicus, are animals of economic importance in the aquaculture industry. Yet information about their DNA and protein sequences is lacking. In order to predict their collective proteome, we combined over 270,000 available EST and cDNA sequences from the 4 shrimp species with all protein sequences of Drosophila melanogaster and Caenorhabditis elegans. EST data from 4 other crustaceans, the crab Carcinus maenas, the lobster Homarus americanus (Decapoda, the water flea Daphnia pulex, and the brine shrimp Artemia franciscana were also used. Findings Similarity searches from EST collections of the 4 shrimp species matched 64% of the protein sequences of the fruit fly, but only 45% of nematode proteins, indicating that the shrimp proteome content is more similar to that of an insect than a nematode. Combined results with 4 additional non-shrimp crustaceans increased matching to 78% of fruit fly and 56% of nematode proteins, suggesting that present shrimp EST collections still lack sequences for many conserved crustacean proteins. Analysis of matching data revealed the presence of 4 EST groups from shrimp, namely sequences for proteins that are both fruit fly-like and nematode-like, fruit fly-like only, nematode-like only, and non-matching. Gene ontology profiles of proteins for the 3 matching EST groups were analyzed. For non-matching ESTs, a small fraction matched protein sequences from other species in the UniProt database, including other crustacean-specific proteins. Conclusions Shrimp ESTs indicated that the shrimp proteome is comprised of sub-populations of proteins similar to those common to both insect and nematode models, those present specifically in either model, or neither. Combining small EST collections from related species to compensate for their

  18. Extrapolative microRNA precursor based SSR mining from tea EST database in respect to agronomic traits.

    Science.gov (United States)

    Hazra, Anjan; Dasgupta, Nirjhar; Sengupta, Chandan; Das, Sauren

    2017-07-06

    Tea (Camellia sinensis, (L.) Kuntze) is considered as most popular drink across the world and it is widely consumed beverage for its several health-benefit characteristics. These positive traits primarily rely on its regulatory networks of different metabolic pathways. Development of microsatellite markers from the conserved genomic regions are being worthwhile for reviewing the genetic diversity of closely related species or self-pollinated species. Although several SSR markers have been reported, in tea, the trait-specific Simple Sequence Repeat (SSR) markers, leading to be useful in marker assisted breeding technique, are yet to be identified. Micro RNAs are short, non-coding RNA molecules, involved in post transcriptional mode of gene regulation and thus effects on related phenotype. Present study deals with identification of the microsatellite motifs within the reported and predicted miRNA precursors that are effectively followed by designing of primers from SSR flanking regions in order to PCR validation. In addition to the earlier reports, two new miRNAs are predicting here from tea expressed tag sequence database. Furthermore, 18 SSR motifs are found to be in 13 of all 33 predicted miRNAs. Trinucleotide motifs are most abundant among all followed by dinucleotides. Since, miRNA based SSR markers are evidenced to have significant role on genetic fingerprinting study, these outcomes would pave the way in developing novel markers for tagging tea specific agronomic traits as well as substantiating non-conventional breeding program.

  19. Exploration of genetic diversity among medicinally important genus Epimedium species based on genomic and EST-SSR marker.

    Science.gov (United States)

    Yousaf, Zubaida; Hu, Weiming; Zhang, Yanjun; Zeng, Shaohua; Wang, Ying

    2015-01-01

    Epimedium species has gained prime importance due to their medicinal and economic values. Therefore, in this study, 26 genomic SSR and 10 EST-SSR markers were developed for 13 medicinal species of the Epimedium genus and one out-group species Vancouveria hexandra W. J. Hooker to explore the existing genetic diversity. A total of 100 alleles by genomic SSR and 65 by EST-SSR were detected. The genomic SSR markers were presented between 2-7 alleles per locus. The observed heterozygosity (Ho) and expected heterozygosity (He) ranged from 0.00 to 4.5 and 0.0254 to 2.8108, respectively. Similarly, for EST-SSR, these values were ranged from 3.00 to 4.00 and 1.9650 to 2.7142. The number of alleles for EST-SSR markers ranged from 3 to 10 with an average of 3.51 per loci. It has been concluded that medicinally important species of the genus Epimedium possesses lower intraspecific genetic variation.

  20. First Microsatellite Markers Developed from Cupuassu ESTs: Application in Diversity Analysis and Cross-Species Transferability to Cacao.

    Science.gov (United States)

    Ferraz Dos Santos, Lucas; Moreira Fregapani, Roberta; Falcão, Loeni Ludke; Togawa, Roberto Coiti; Costa, Marcos Mota do Carmo; Lopes, Uilson Vanderlei; Peres Gramacho, Karina; Alves, Rafael Moyses; Micheli, Fabienne; Marcellino, Lucilia Helena

    2016-01-01

    The cupuassu tree (Theobroma grandiflorum) (Willd. ex Spreng.) Schum. is a fruitful species from the Amazon with great economical potential, due to the multiple uses of its fruit´s pulp and seeds in the food and cosmetic industries, including the production of cupulate, an alternative to chocolate. In order to support the cupuassu breeding program and to select plants presenting both pulp/seed quality and fungal disease resistance, SSRs from Next Generation Sequencing ESTs were obtained and used in diversity analysis. From 8,330 ESTs, 1,517 contained one or more SSRs (1,899 SSRs identified). The most abundant motifs identified in the EST-SSRs were hepta- and trinucleotides, and they were found with a minimum and maximum of 2 and 19 repeats, respectively. From the 1,517 ESTs containing SSRs, 70 ESTs were selected based on their functional annotation, focusing on pulp and seed quality, as well as resistance to pathogens. The 70 ESTs selected contained 77 SSRs, and among which, 11 were polymorphic in cupuassu genotypes. These EST-SSRs were able to discriminate the cupuassu genotype in relation to resistance/susceptibility to witches' broom disease, as well as to pulp quality (SST/ATT values). Finally, we showed that these markers were transferable to cacao genotypes, and that genome availability might be used as a predictive tool for polymorphism detection and primer design useful for both Theobroma species. To our knowledge, this is the first report involving EST-SSRs from cupuassu and is also a pioneer in the analysis of marker transferability from cupuassu to cacao. Moreover, these markers might contribute to develop or saturate the cupuassu and cacao genetic maps, respectively.

  1. First Microsatellite Markers Developed from Cupuassu ESTs: Application in Diversity Analysis and Cross-Species Transferability to Cacao.

    Directory of Open Access Journals (Sweden)

    Lucas Ferraz Dos Santos

    Full Text Available The cupuassu tree (Theobroma grandiflorum (Willd. ex Spreng. Schum. is a fruitful species from the Amazon with great economical potential, due to the multiple uses of its fruit´s pulp and seeds in the food and cosmetic industries, including the production of cupulate, an alternative to chocolate. In order to support the cupuassu breeding program and to select plants presenting both pulp/seed quality and fungal disease resistance, SSRs from Next Generation Sequencing ESTs were obtained and used in diversity analysis. From 8,330 ESTs, 1,517 contained one or more SSRs (1,899 SSRs identified. The most abundant motifs identified in the EST-SSRs were hepta- and trinucleotides, and they were found with a minimum and maximum of 2 and 19 repeats, respectively. From the 1,517 ESTs containing SSRs, 70 ESTs were selected based on their functional annotation, focusing on pulp and seed quality, as well as resistance to pathogens. The 70 ESTs selected contained 77 SSRs, and among which, 11 were polymorphic in cupuassu genotypes. These EST-SSRs were able to discriminate the cupuassu genotype in relation to resistance/susceptibility to witches' broom disease, as well as to pulp quality (SST/ATT values. Finally, we showed that these markers were transferable to cacao genotypes, and that genome availability might be used as a predictive tool for polymorphism detection and primer design useful for both Theobroma species. To our knowledge, this is the first report involving EST-SSRs from cupuassu and is also a pioneer in the analysis of marker transferability from cupuassu to cacao. Moreover, these markers might contribute to develop or saturate the cupuassu and cacao genetic maps, respectively.

  2. First Microsatellite Markers Developed from Cupuassu ESTs: Application in Diversity Analysis and Cross-Species Transferability to Cacao

    Science.gov (United States)

    Ferraz dos Santos, Lucas; Moreira Fregapani, Roberta; Falcão, Loeni Ludke; Togawa, Roberto Coiti; Costa, Marcos Mota do Carmo; Lopes, Uilson Vanderlei; Peres Gramacho, Karina; Alves, Rafael Moyses

    2016-01-01

    The cupuassu tree (Theobroma grandiflorum) (Willd. ex Spreng.) Schum. is a fruitful species from the Amazon with great economical potential, due to the multiple uses of its fruit´s pulp and seeds in the food and cosmetic industries, including the production of cupulate, an alternative to chocolate. In order to support the cupuassu breeding program and to select plants presenting both pulp/seed quality and fungal disease resistance, SSRs from Next Generation Sequencing ESTs were obtained and used in diversity analysis. From 8,330 ESTs, 1,517 contained one or more SSRs (1,899 SSRs identified). The most abundant motifs identified in the EST-SSRs were hepta- and trinucleotides, and they were found with a minimum and maximum of 2 and 19 repeats, respectively. From the 1,517 ESTs containing SSRs, 70 ESTs were selected based on their functional annotation, focusing on pulp and seed quality, as well as resistance to pathogens. The 70 ESTs selected contained 77 SSRs, and among which, 11 were polymorphic in cupuassu genotypes. These EST-SSRs were able to discriminate the cupuassu genotype in relation to resistance/susceptibility to witches’ broom disease, as well as to pulp quality (SST/ATT values). Finally, we showed that these markers were transferable to cacao genotypes, and that genome availability might be used as a predictive tool for polymorphism detection and primer design useful for both Theobroma species. To our knowledge, this is the first report involving EST-SSRs from cupuassu and is also a pioneer in the analysis of marker transferability from cupuassu to cacao. Moreover, these markers might contribute to develop or saturate the cupuassu and cacao genetic maps, respectively. PMID:26949967

  3. Identification and genetic mapping of highly polymorphic microsatellite loci from an EST database of the septoria tritici blotch pathogen Mycosphaerella graminicola

    NARCIS (Netherlands)

    Goodwin, S.B.; Lee, van der T.A.J.; Cavaletto, J.R.; Lintel Hekkert, te B.; Crane, C.F.; Kema, G.H.J.

    2007-01-01

    A database of 30,137 EST sequences from Mycosphaerella graminicola, the septoria tritici blotch fungus of wheat, was scanned with a custom software pipeline for di- and trinucleotide units repeated tandemly six or more times. The bioinformatics analysis identified 109 putative SSR loci, and for 99

  4. Mining and characterization of EST-SSR markers for Zingiber officinale Roscoe with transferability to other species of Zingiberaceae.

    Science.gov (United States)

    Awasthi, Praveen; Singh, Ashish; Sheikh, Gulfam; Mahajan, Vidushi; Gupta, Ajai Prakash; Gupta, Suphla; Bedi, Yashbir S; Gandhi, Sumit G

    2017-10-01

    Zingiber officinale is a model spice herb, well known for its medicinal value. It is primarily a vegetatively propagated commercial crop. However, considerable diversity in its morphology, fiber content and chemoprofiles has been reported. The present study explores the utility of EST-derived markers in studying genetic diversity in different accessions of Z. officinale and their cross transferability within the Zingiberaceae family. A total of 38,115 ESTs sequences were assembled to generate 7850 contigs and 10,762 singletons. SSRs were searched in the unigenes and 515 SSR-containing ESTs were identified with a frequency of 1 SSR per 25.21 kb of the genome. These ESTs were also annotated using BLAST2GO. Primers were designed for 349 EST-SSRs and 25 primer pairs were randomly picked for EST SSR study. Out of these, 16 primer pairs could be optimized for amplification in different accessions of Z. officinale as well as other species belonging to Zingiberaceae. GES454, GES466, GES480 and GES486 markers were found to exhibit 100% cross-transferability among different members of Zingiberaceae.

  5. Development of a novel set of EST-SSR markers and cross-species amplification in Tamarix africana (Tamaricaceae).

    Science.gov (United States)

    Terzoli, Serena; Beritognolo, Isacco; Sabatti, Maurizio; Kuzminsky, Elena

    2010-06-01

    Tamarix plants are resistant to abiotic stresses and have become invasive in North America. Their taxonomy is troublesome, and few molecular makers are available to enable species identification or to track the spread of specific invasive genotypes. Transcriptome sequencing projects offer a potential source for the development of new markers. • Thirteen polymorphic simple sequence repeats (SSRs) markers derived from Expressed Sequence Tags (ESTs) from Tamarix hispida, T. androssowii, T. ramosissima, and T. albiflonum were identified and screened on 24 samples of T. africana to detect polymorphism. The number of alleles per locus ranged from two to eight, with an average of 4.3 alleles per locus, and the mean expected heterozygosity was 0.453. • Amplification products of these 13 loci were also generated for T. gallica. These new EST-SSR markers will be useful in genetic characterization of Tamarix, as additional tools for taxonomic clarification, and for studying invasive populations where they are a threat.

  6. QualitySNP: a pipeline for detecting single nucleotide polymorphisms and insertions/deletions in EST data from diploid and polyploid species

    Directory of Open Access Journals (Sweden)

    Voorrips Roeland E

    2006-10-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs are important tools in studying complex genetic traits and genome evolution. Computational strategies for SNP discovery make use of the large number of sequences present in public databases (in most cases as expressed sequence tags (ESTs and are considered to be faster and more cost-effective than experimental procedures. A major challenge in computational SNP discovery is distinguishing allelic variation from sequence variation between paralogous sequences, in addition to recognizing sequencing errors. For the majority of the public EST sequences, trace or quality files are lacking which makes detection of reliable SNPs even more difficult because it has to rely on sequence comparisons only. Results We have developed a new algorithm to detect reliable SNPs and insertions/deletions (indels in EST data, both with and without quality files. Implemented in a pipeline called QualitySNP, it uses three filters for the identification of reliable SNPs. Filter 1 screens for all potential SNPs and identifies variation between or within genotypes. Filter 2 is the core filter that uses a haplotype-based strategy to detect reliable SNPs. Clusters with potential paralogs as well as false SNPs caused by sequencing errors are identified. Filter 3 screens SNPs by calculating a confidence score, based upon sequence redundancy and quality. Non-synonymous SNPs are subsequently identified by detecting open reading frames of consensus sequences (contigs with SNPs. The pipeline includes a data storage and retrieval system for haplotypes, SNPs and alignments. QualitySNP's versatility is demonstrated by the identification of SNPs in EST datasets from potato, chicken and humans. Conclusion QualitySNP is an efficient tool for SNP detection, storage and retrieval in diploid as well as polyploid species. It is available for running on Linux or UNIX systems. The program, test data, and user manual are available at

  7. Validation of minor species of the MIPAS2D database

    Directory of Open Access Journals (Sweden)

    Enzo Papandrea

    2014-01-01

    Full Text Available The MIPAS2D [Dinelli et al., 2010] database has been developed applying the tomographic analysis technique GMTR [Carlotti et al., 2001] to measurements acquired in the nominal observation mode of the complete MIPAS (Michelson Interferometer for Passive Atmosphere Sounding [Fischer et al., 2008] mission. […

  8. Taxonomic evaluation of selected Ganoderma species and database sequence validation

    Directory of Open Access Journals (Sweden)

    Suldbold Jargalmaa

    2017-07-01

    Full Text Available Species in the genus Ganoderma include several ecologically important and pathogenic fungal species whose medicinal and economic value is substantial. Due to the highly similar morphological features within the Ganoderma, identification of species has relied heavily on DNA sequencing using BLAST searches, which are only reliable if the GenBank submissions are accurately labeled. In this study, we examined 113 specimens collected from 1969 to 2016 from various regions in Korea using morphological features and multigene analysis (internal transcribed spacer, translation elongation factor 1-α, and the second largest subunit of RNA polymerase II. These specimens were identified as four Ganoderma species: G. sichuanense, G. cf. adspersum, G. cf. applanatum, and G. cf. gibbosum. With the exception of G. sichuanense, these species were difficult to distinguish based solely on morphological features. However, phylogenetic analysis at three different loci yielded concordant phylogenetic information, and supported the four species distinctions with high bootstrap support. A survey of over 600 Ganoderma sequences available on GenBank revealed that 65% of sequences were either misidentified or ambiguously labeled. Here, we suggest corrected annotations for GenBank sequences based on our phylogenetic validation and provide updated global distribution patterns for these Ganoderma species.

  9. Taxonomic evaluation of selected Ganoderma species and database sequence validation

    Science.gov (United States)

    Jargalmaa, Suldbold; Eimes, John A.; Park, Myung Soo; Park, Jae Young; Oh, Seung-Yoon

    2017-01-01

    Species in the genus Ganoderma include several ecologically important and pathogenic fungal species whose medicinal and economic value is substantial. Due to the highly similar morphological features within the Ganoderma, identification of species has relied heavily on DNA sequencing using BLAST searches, which are only reliable if the GenBank submissions are accurately labeled. In this study, we examined 113 specimens collected from 1969 to 2016 from various regions in Korea using morphological features and multigene analysis (internal transcribed spacer, translation elongation factor 1-α, and the second largest subunit of RNA polymerase II). These specimens were identified as four Ganoderma species: G. sichuanense, G. cf. adspersum, G. cf. applanatum, and G. cf. gibbosum. With the exception of G. sichuanense, these species were difficult to distinguish based solely on morphological features. However, phylogenetic analysis at three different loci yielded concordant phylogenetic information, and supported the four species distinctions with high bootstrap support. A survey of over 600 Ganoderma sequences available on GenBank revealed that 65% of sequences were either misidentified or ambiguously labeled. Here, we suggest corrected annotations for GenBank sequences based on our phylogenetic validation and provide updated global distribution patterns for these Ganoderma species. PMID:28761785

  10. World Register of marine Cave Species (WoRCS: a new Thematic Species Database for marine and anchialine cave biodiversity

    Directory of Open Access Journals (Sweden)

    Vasilis Gerovasileiou

    2016-09-01

    Full Text Available Scientific exploration of marine cave environments and anchialine ecosystems over recent decades has led to outstanding discoveries of novel taxa, increasing our knowledge of biodiversity. However, biological research on underwater caves has taken place only in a few areas of the world and relevant information remains fragmented in isolated publications and databases. This fragmentation makes assessing the conservation status of marine cave species especially problematic, and this issue should be addressed urgently given the stresses resulting from planned and rampant development in the coastal zone worldwide. The goal of the World Register of marine Cave Species (WoRCS initiative is to create a comprehensive taxonomic and ecological database of known species from marine caves and anchialine systems worldwide and to present this as a Thematic Species Database (TSD of the World Register of marine Species (WoRMS. WoRCS will incorporate ecological data (e.g., type of environment, salinity regimes, and cave zone as well as geographical information on the distribution of species in cave and anchialine environments. Biodiversity data will be progressively assembled from individual database sources at regional, national or local levels, as well as from literature sources (estimate: >20,000 existing records of cave-dwelling species scattered in several databases. Information will be organized in the WoRCS database following a standard glossary based on existing terminology. Cave-related information will be managed by the WoRCS thematic editors with all data dynamically linked to WoRMS and its team of taxonomic editors. In order to mobilize data into global biogeographic databases, a Gazetteer of the Marine and Anchialine Caves of the World will be established. The presence records of species could be eventually georeferenced for submission to the Ocean Biogeographic Information System (OBIS and constitute an important dataset for biogeographical and

  11. OryzaGenome: Genome Diversity Database of Wild Oryza Species

    KAUST Repository

    Ohyanagi, Hajime

    2015-11-18

    The species in the genus Oryza, encompassing nine genome types and 23 species, are a rich genetic resource and may have applications in deeper genomic analyses aiming to understand the evolution of plant genomes. With the advancement of next-generation sequencing (NGS) technology, a flood of Oryza species reference genomes and genomic variation information has become available in recent years. This genomic information, combined with the comprehensive phenotypic information that we are accumulating in our Oryzabase, can serve as an excellent genotype-phenotype association resource for analyzing rice functional and structural evolution, and the associated diversity of the Oryza genus. Here we integrate our previous and future phenotypic/habitat information and newly determined genotype information into a united repository, named OryzaGenome, providing the variant information with hyperlinks to Oryzabase. The current version of OryzaGenome includes genotype information of 446 O. rufipogon accessions derived by imputation and of 17 accessions derived by imputation-free deep sequencing. Two variant viewers are implemented: SNP Viewer as a conventional genome browser interface and Variant Table as a textbased browser for precise inspection of each variant one by one. Portable VCF (variant call format) file or tabdelimited file download is also available. Following these SNP (single nucleotide polymorphism) data, reference pseudomolecules/ scaffolds/contigs and genome-wide variation information for almost all of the closely and distantly related wild Oryza species from the NIG Wild Rice Collection will be available in future releases. All of the resources can be accessed through http://viewer.shigen.info/oryzagenome/.

  12. Federated or cached searches: providing expected performance from multiple invasive species databases

    Science.gov (United States)

    Graham, Jim; Jarnevich, Catherine S.; Simpson, Annie; Newman, Gregory J.; Stohlgren, Thomas J.

    2011-01-01

    Invasive species are a universal global problem, but the information to identify them, manage them, and prevent invasions is stored around the globe in a variety of formats. The Global Invasive Species Information Network is a consortium of organizations working toward providing seamless access to these disparate databases via the Internet. A distributed network of databases can be created using the Internet and a standard web service protocol. There are two options to provide this integration. First, federated searches are being proposed to allow users to search “deep” web documents such as databases for invasive species. A second method is to create a cache of data from the databases for searching. We compare these two methods, and show that federated searches will not provide the performance and flexibility required from users and a central cache of the datum are required to improve performance.

  13. Databases

    Digital Repository Service at National Institute of Oceanography (India)

    Kunte, P.D.

    Information on bibliographic as well as numeric/textual databases relevant to coastal geomorphology has been included in a tabular form. Databases cover a broad spectrum of related subjects like coastal environment and population aspects, coastline...

  14. The Coral Trait Database, a curated database of trait information for coral species from the global oceans

    Science.gov (United States)

    Madin, Joshua S.; Anderson, Kristen D.; Andreasen, Magnus Heide; Bridge, Tom C. L.; Cairns, Stephen D.; Connolly, Sean R.; Darling, Emily S.; Diaz, Marcela; Falster, Daniel S.; Franklin, Erik C.; Gates, Ruth D.; Hoogenboom, Mia O.; Huang, Danwei; Keith, Sally A.; Kosnik, Matthew A.; Kuo, Chao-Yang; Lough, Janice M.; Lovelock, Catherine E.; Luiz, Osmar; Martinelli, Julieta; Mizerek, Toni; Pandolfi, John M.; Pochon, Xavier; Pratchett, Morgan S.; Putnam, Hollie M.; Roberts, T. Edward; Stat, Michael; Wallace, Carden C.; Widman, Elizabeth; Baird, Andrew H.

    2016-03-01

    Trait-based approaches advance ecological and evolutionary research because traits provide a strong link to an organism’s function and fitness. Trait-based research might lead to a deeper understanding of the functions of, and services provided by, ecosystems, thereby improving management, which is vital in the current era of rapid environmental change. Coral reef scientists have long collected trait data for corals; however, these are difficult to access and often under-utilized in addressing large-scale questions. We present the Coral Trait Database initiative that aims to bring together physiological, morphological, ecological, phylogenetic and biogeographic trait information into a single repository. The database houses species- and individual-level data from published field and experimental studies alongside contextual data that provide important framing for analyses. In this data descriptor, we release data for 56 traits for 1547 species, and present a collaborative platform on which other trait data are being actively federated. Our overall goal is for the Coral Trait Database to become an open-source, community-led data clearinghouse that accelerates coral reef research.

  15. The Coral Trait Database, a curated database of trait information for coral species from the global oceans.

    Science.gov (United States)

    Madin, Joshua S; Anderson, Kristen D; Andreasen, Magnus Heide; Bridge, Tom C L; Cairns, Stephen D; Connolly, Sean R; Darling, Emily S; Diaz, Marcela; Falster, Daniel S; Franklin, Erik C; Gates, Ruth D; Harmer, Aaron; Hoogenboom, Mia O; Huang, Danwei; Keith, Sally A; Kosnik, Matthew A; Kuo, Chao-Yang; Lough, Janice M; Lovelock, Catherine E; Luiz, Osmar; Martinelli, Julieta; Mizerek, Toni; Pandolfi, John M; Pochon, Xavier; Pratchett, Morgan S; Putnam, Hollie M; Roberts, T Edward; Stat, Michael; Wallace, Carden C; Widman, Elizabeth; Baird, Andrew H

    2016-03-29

    Trait-based approaches advance ecological and evolutionary research because traits provide a strong link to an organism's function and fitness. Trait-based research might lead to a deeper understanding of the functions of, and services provided by, ecosystems, thereby improving management, which is vital in the current era of rapid environmental change. Coral reef scientists have long collected trait data for corals; however, these are difficult to access and often under-utilized in addressing large-scale questions. We present the Coral Trait Database initiative that aims to bring together physiological, morphological, ecological, phylogenetic and biogeographic trait information into a single repository. The database houses species- and individual-level data from published field and experimental studies alongside contextual data that provide important framing for analyses. In this data descriptor, we release data for 56 traits for 1547 species, and present a collaborative platform on which other trait data are being actively federated. Our overall goal is for the Coral Trait Database to become an open-source, community-led data clearinghouse that accelerates coral reef research.

  16. Construction of an Ostrea edulis database from genomic and expressed sequence tags (ESTs) obtained from Bonamia ostreae infected haemocytes: Development of an immune-enriched oligo-microarray.

    Science.gov (United States)

    Pardo, Belén G; Álvarez-Dios, José Antonio; Cao, Asunción; Ramilo, Andrea; Gómez-Tato, Antonio; Planas, Josep V; Villalba, Antonio; Martínez, Paulino

    2016-12-01

    The flat oyster, Ostrea edulis, is one of the main farmed oysters, not only in Europe but also in the United States and Canada. Bonamiosis due to the parasite Bonamia ostreae has been associated with high mortality episodes in this species. This parasite is an intracellular protozoan that infects haemocytes, the main cells involved in oyster defence. Due to the economical and ecological importance of flat oyster, genomic data are badly needed for genetic improvement of the species, but they are still very scarce. The objective of this study is to develop a sequence database, OedulisDB, with new genomic and transcriptomic resources, providing new data and convenient tools to improve our knowledge of the oyster's immune mechanisms. Transcriptomic and genomic sequences were obtained using 454 pyrosequencing and compiled into an O. edulis database, OedulisDB, consisting of two sets of 10,318 and 7159 unique sequences that represent the oyster's genome (WG) and de novo haemocyte transcriptome (HT), respectively. The flat oyster transcriptome was obtained from two strains (naïve and tolerant) challenged with B. ostreae, and from their corresponding non-challenged controls. Approximately 78.5% of 5619 HT unique sequences were successfully annotated by Blast search using public databases. A total of 984 sequences were identified as being related to immune response and several key immune genes were identified for the first time in flat oyster. Additionally, transcriptome information was used to design and validate the first oligo-microarray in flat oyster enriched with immune sequences from haemocytes. Our transcriptomic and genomic sequencing and subsequent annotation have largely increased the scarce resources available for this economically important species and have enabled us to develop an OedulisDB database and accompanying tools for gene expression analysis. This study represents the first attempt to characterize in depth the O. edulis haemocyte transcriptome in

  17. PseudoMLSA: a database for multigenic sequence analysis of Pseudomonas species

    Directory of Open Access Journals (Sweden)

    Lalucat Jorge

    2010-04-01

    Full Text Available Abstract Background The genus Pseudomonas comprises more than 100 species of environmental, clinical, agricultural, and biotechnological interest. Although, the recommended method for discriminating bacterial species is DNA-DNA hybridisation, alternative techniques based on multigenic sequence analysis are becoming a common practice in bacterial species discrimination studies. Since there is not a general criterion for determining which genes are more useful for species resolution; the number of strains and genes analysed is increasing continuously. As a result, sequences of different genes are dispersed throughout several databases. This sequence information needs to be collected in a common database, in order to be useful for future identification-based projects. Description The PseudoMLSA Database is a comprehensive database of multiple gene sequences from strains of Pseudomonas species. The core of the database is composed of selected gene sequences from all Pseudomonas type strains validly assigned to the genus through 2008. The database is aimed to be useful for MultiLocus Sequence Analysis (MLSA procedures, for the identification and characterisation of any Pseudomonas bacterial isolate. The sequences are available for download via a direct connection to the National Center for Biotechnology Information (NCBI. Additionally, the database includes an online BLAST interface for flexible nucleotide queries and similarity searches with the user's datasets, and provides a user-friendly output for easily parsing, navigating, and analysing BLAST results. Conclusions The PseudoMLSA database amasses strains and sequence information of validly described Pseudomonas species, and allows free querying of the database via a user-friendly, web-based interface available at http://www.uib.es/microbiologiaBD/Welcome.html. The web-based platform enables easy retrieval at strain or gene sequence information level; including references to published peer

  18. Databases

    Directory of Open Access Journals (Sweden)

    Nick Ryan

    2004-01-01

    Full Text Available Databases are deeply embedded in archaeology, underpinning and supporting many aspects of the subject. However, as well as providing a means for storing, retrieving and modifying data, databases themselves must be a result of a detailed analysis and design process. This article looks at this process, and shows how the characteristics of data models affect the process of database design and implementation. The impact of the Internet on the development of databases is examined, and the article concludes with a discussion of a range of issues associated with the recording and management of archaeological data.

  19. H2DB: a heritability database across multiple species by annotating trait-associated genomic loci.

    Science.gov (United States)

    Kaminuma, Eli; Fujisawa, Takatomo; Tanizawa, Yasuhiro; Sakamoto, Naoko; Kurata, Nori; Shimizu, Tokurou; Nakamura, Yasukazu

    2013-01-01

    H2DB (http://tga.nig.ac.jp/h2db/), an annotation database of genetic heritability estimates for humans and other species, has been developed as a knowledge database to connect trait-associated genomic loci. Heritability estimates have been investigated for individual species, particularly in human twin studies and plant/animal breeding studies. However, there appears to be no comprehensive heritability database for both humans and other species. Here, we introduce an annotation database for genetic heritabilities of various species that was annotated by manually curating online public resources in PUBMED abstracts and journal contents. The proposed heritability database contains attribute information for trait descriptions, experimental conditions, trait-associated genomic loci and broad- and narrow-sense heritability specifications. Annotated trait-associated genomic loci, for which most are single-nucleotide polymorphisms derived from genome-wide association studies, may be valuable resources for experimental scientists. In addition, we assigned phenotype ontologies to the annotated traits for the purposes of discussing heritability distributions based on phenotypic classifications.

  20. Genome size, cytogenetic data and transferability of EST-SSRs markers in wild and cultivated species of the genus Theobroma L. (Byttnerioideae, Malvaceae)

    Science.gov (United States)

    da Silva, Rangeline Azevedo; Souza, Gustavo; Lemos, Lívia Santos Lima; Lopes, Uilson Vanderlei; Patrocínio, Nara Geórgia Ribeiro Braz; Alves, Rafael Moysés; Marcellino, Lucília Helena; Clement, Didier; Micheli, Fabienne

    2017-01-01

    The genus Theobroma comprises several trees species native to the Amazon. Theobroma cacao L. plays a key economic role mainly in the chocolate industry. Both cultivated and wild forms are described within the genus. Variations in genome size and chromosome number have been used for prediction purposes including the frequency of interspecific hybridization or inference about evolutionary relationships. In this study, the nuclear DNA content, karyotype and genetic diversity using functional microsatellites (EST-SSR) of seven Theobroma species were characterized. The nuclear content of DNA for all analyzed Theobroma species was 1C = ~ 0.46 pg. These species presented 2n = 20 with small chromosomes and only one pair of terminal heterochromatic bands positively stained (CMA+/DAPI− bands). The small size of Theobroma ssp. genomes was equivalent to other Byttnerioideae species, suggesting that the basal lineage of Malvaceae have smaller genomes and that there was an expansion of 2C values in the more specialized family clades. A set of 20 EST-SSR primers were characterized for related species of Theobroma, in which 12 loci were polymorphic. The polymorphism information content (PIC) ranged from 0.23 to 0.65, indicating a high level of information per locus. Combined results of flow cytometry, cytogenetic data and EST-SSRs markers will contribute to better describe the species and infer about the evolutionary relationships among Theobroma species. In addition, the importance of a core collection for conservation purposes is highlighted. PMID:28187131

  1. Genome size, cytogenetic data and transferability of EST-SSRs markers in wild and cultivated species of the genus Theobroma L. (Byttnerioideae, Malvaceae.

    Directory of Open Access Journals (Sweden)

    Rangeline Azevedo da Silva

    Full Text Available The genus Theobroma comprises several trees species native to the Amazon. Theobroma cacao L. plays a key economic role mainly in the chocolate industry. Both cultivated and wild forms are described within the genus. Variations in genome size and chromosome number have been used for prediction purposes including the frequency of interspecific hybridization or inference about evolutionary relationships. In this study, the nuclear DNA content, karyotype and genetic diversity using functional microsatellites (EST-SSR of seven Theobroma species were characterized. The nuclear content of DNA for all analyzed Theobroma species was 1C = ~ 0.46 pg. These species presented 2n = 20 with small chromosomes and only one pair of terminal heterochromatic bands positively stained (CMA+/DAPI- bands. The small size of Theobroma ssp. genomes was equivalent to other Byttnerioideae species, suggesting that the basal lineage of Malvaceae have smaller genomes and that there was an expansion of 2C values in the more specialized family clades. A set of 20 EST-SSR primers were characterized for related species of Theobroma, in which 12 loci were polymorphic. The polymorphism information content (PIC ranged from 0.23 to 0.65, indicating a high level of information per locus. Combined results of flow cytometry, cytogenetic data and EST-SSRs markers will contribute to better describe the species and infer about the evolutionary relationships among Theobroma species. In addition, the importance of a core collection for conservation purposes is highlighted.

  2. Comprehensive database of diameter-based biomass regressions for North American tree species

    Science.gov (United States)

    Jennifer C. Jenkins; David C. Chojnacky; Linda S. Heath; Richard A. Birdsey

    2004-01-01

    A database consisting of 2,640 equations compiled from the literature for predicting the biomass of trees and tree components from diameter measurements of species found in North America. Bibliographic information, geographic locations, diameter limits, diameter and biomass units, equation forms, statistical errors, and coefficients are provided for each equation,...

  3. EST-PAC a web package for EST annotation and protein sequence prediction

    Directory of Open Access Journals (Sweden)

    Strahm Yvan

    2006-10-01

    Full Text Available Abstract With the decreasing cost of DNA sequencing technology and the vast diversity of biological resources, researchers increasingly face the basic challenge of annotating a larger number of expressed sequences tags (EST from a variety of species. This typically consists of a series of repetitive tasks, which should be automated and easy to use. The results of these annotation tasks need to be stored and organized in a consistent way. All these operations should be self-installing, platform independent, easy to customize and amenable to using distributed bioinformatics resources available on the Internet. In order to address these issues, we present EST-PAC a web oriented multi-platform software package for expressed sequences tag (EST annotation. EST-PAC provides a solution for the administration of EST and protein sequence annotations accessible through a web interface. Three aspects of EST annotation are automated: 1 searching local or remote biological databases for sequence similarities using Blast services, 2 predicting protein coding sequence from EST data and, 3 annotating predicted protein sequences with functional domain predictions. In practice, EST-PAC integrates the BLASTALL suite, EST-Scan2 and HMMER in a relational database system accessible through a simple web interface. EST-PAC also takes advantage of the relational database to allow consistent storage, powerful queries of results and, management of the annotation process. The system allows users to customize annotation strategies and provides an open-source data-management environment for research and education in bioinformatics.

  4. Systematic discovery of unannotated genes in 11 yeast species using a database of orthologous genomic segments

    LENUS (Irish Health Repository)

    OhEigeartaigh, Sean S

    2011-07-26

    Abstract Background In standard BLAST searches, no information other than the sequences of the query and the database entries is considered. However, in situations where two genes from different species have only borderline similarity in a BLAST search, the discovery that the genes are located within a region of conserved gene order (synteny) can provide additional evidence that they are orthologs. Thus, for interpreting borderline search results, it would be useful to know whether the syntenic context of a database hit is similar to that of the query. This principle has often been used in investigations of particular genes or genomic regions, but to our knowledge it has never been implemented systematically. Results We made use of the synteny information contained in the Yeast Gene Order Browser database for 11 yeast species to carry out a systematic search for protein-coding genes that were overlooked in the original annotations of one or more yeast genomes but which are syntenic with their orthologs. Such genes tend to have been overlooked because they are short, highly divergent, or contain introns. The key features of our software - called SearchDOGS - are that the database entries are classified into sets of genomic segments that are already known to be orthologous, and that very weak BLAST hits are retained for further analysis if their genomic location is similar to that of the query. Using SearchDOGS we identified 595 additional protein-coding genes among the 11 yeast species, including two new genes in Saccharomyces cerevisiae. We found additional genes for the mating pheromone a-factor in six species including Kluyveromyces lactis. Conclusions SearchDOGS has proven highly successful for identifying overlooked genes in the yeast genomes. We anticipate that our approach can be adapted for study of further groups of species, such as bacterial genomes. More generally, the concept of doing sequence similarity searches against databases to which external

  5. CitEST libraries

    Directory of Open Access Journals (Sweden)

    Maria Luísa P. Natividade Targon

    2007-01-01

    Full Text Available In order to obtain a better understanding of what is citrus, 33 cDNA libraries were constructed from different citrus species and genera. Total RNA was extracted from fruits, leaves, flowers, bark, seeds and roots, and subjected or not to different biotic and abiotic stresses (pathogens and drought and at several developmental stages. To identify putative promoter sequences, as well as molecular markers that could be useful for breeding programs, one shotgun library was prepared from sweet orange (Citrus sinensis var. Olimpia. In addition, EST libraries were also constructed for a citrus pathogen, the oomycete Phythophthora parasitica in either virulent or avirulent form. A total of 286,559 cDNA clones from citrus were sequenced from their 5’ end, generating 242,790 valid reads of citrus. A total of 9,504 sequences were produced in the shotgun library and the valid reads were assembled using CAP3. In this procedure, we obtained 1,131 contigs and 4,083 singletons. A total of 19,200 cDNA clones from P. parasitica were sequenced, resulting in 16,400 valid reads. The number of ESTs generated in this project is, to our knowledge, the largest citrus sequence database in the world.

  6. Assessment of Functional EST-SSR Markers (Sugarcane in Cross-Species Transferability, Genetic Diversity among Poaceae Plants, and Bulk Segregation Analysis

    Directory of Open Access Journals (Sweden)

    Shamshad Ul Haq

    2016-01-01

    Full Text Available Expressed sequence tags (ESTs are important resource for gene discovery, gene expression and its regulation, molecular marker development, and comparative genomics. We procured 10000 ESTs and analyzed 267 EST-SSRs markers through computational approach. The average density was one SSR/10.45 kb or 6.4% frequency, wherein trinucleotide repeats (66.74% were the most abundant followed by di- (26.10%, tetra- (4.67%, penta- (1.5%, and hexanucleotide (1.2% repeats. Functional annotations were done and after-effect newly developed 63 EST-SSRs were used for cross transferability, genetic diversity, and bulk segregation analysis (BSA. Out of 63 EST-SSRs, 42 markers were identified owing to their expansion genetics across 20 different plants which amplified 519 alleles at 180 loci with an average of 2.88 alleles/locus and the polymorphic information content (PIC ranged from 0.51 to 0.93 with an average of 0.83. The cross transferability ranged from 25% for wheat to 97.22% for Schlerostachya, with an average of 55.86%, and genetic relationships were established based on diversification among them. Moreover, 10 EST-SSRs were recognized as important markers between bulks of pooled DNA of sugarcane cultivars through BSA. This study highlights the employability of the markers in transferability, genetic diversity in grass species, and distinguished sugarcane bulks.

  7. Total internal partition sums for molecular species in the 2000 edition of the HITRAN database

    International Nuclear Information System (INIS)

    Fischer, J.; Gamache, R.R.; Goldman, A.; Rothman, L.S.; Perrin, A.

    2003-01-01

    Total internal partition sums (TIPS) are calculated for all molecular species in the 2000 HITRAN database. In addition, the TIPS for 13 other isotopomers/isotopologues of ozone and carbon dioxide are presented. The calculations address the corrections suggested by Goldman et al. (JQSRT 66 (2000) 455). The calculations consider the temperature range 70-3000 K to be applicable to a variety of remote sensing needs. The method of calculation for each molecular species is stated and comparisons with data from the literature are discussed. A new method of recall for the partition sums, Lagrange 4-point interpolation, is developed. This method, unlike previous versions of the TIPS code, allows all molecular species to be considered

  8. Brassica database (BRAD) version 2.0: integrating and mining Brassicaceae species genomic resources.

    Science.gov (United States)

    Wang, Xiaobo; Wu, Jian; Liang, Jianli; Cheng, Feng; Wang, Xiaowu

    2015-01-01

    The Brassica database (BRAD) was built initially to assist users apply Brassica rapa and Arabidopsis thaliana genomic data efficiently to their research. However, many Brassicaceae genomes have been sequenced and released after its construction. These genomes are rich resources for comparative genomics, gene annotation and functional evolutionary studies of Brassica crops. Therefore, we have updated BRAD to version 2.0 (V2.0). In BRAD V2.0, 11 more Brassicaceae genomes have been integrated into the database, namely those of Arabidopsis lyrata, Aethionema arabicum, Brassica oleracea, Brassica napus, Camelina sativa, Capsella rubella, Leavenworthia alabamica, Sisymbrium irio and three extremophiles Schrenkiella parvula, Thellungiella halophila and Thellungiella salsuginea. BRAD V2.0 provides plots of syntenic genomic fragments between pairs of Brassicaceae species, from the level of chromosomes to genomic blocks. The Generic Synteny Browser (GBrowse_syn), a module of the Genome Browser (GBrowse), is used to show syntenic relationships between multiple genomes. Search functions for retrieving syntenic and non-syntenic orthologs, as well as their annotation and sequences are also provided. Furthermore, genome and annotation information have been imported into GBrowse so that all functional elements can be visualized in one frame. We plan to continually update BRAD by integrating more Brassicaceae genomes into the database. Database URL: http://brassicadb.org/brad/. © The Author(s) 2015. Published by Oxford University Press.

  9. Development, cross-species/genera transferability of novel EST-SSR markers and their utility in revealing population structure and genetic diversity in sugarcane

    KAUST Repository

    Singh, Ram K.

    2013-07-01

    Sugarcane (Saccharum spp. hybrid) with complex polyploid genome requires a large number of informative DNA markers for various applications in genetics and breeding. Despite the great advances in genomic technology, it is observed in several crop species, especially in sugarcane, the availability of molecular tools such as microsatellite markers are limited. Now-a-days EST-SSR markers are preferred to genomic SSR (gSSR) as they represent only the functional part of the genome, which can be easily associated with desired trait. The present study was taken up with a new set of 351 EST-SSRs developed from the 4085 non redundant EST sequences of two Indian sugarcane cultivars. Among these EST-SSRs, TNR containing motifs were predominant with a frequency of 51.6%. Thirty percent EST-SSRs showed homology with annotated protein. A high frequency of SSRs was found in the 5\\'UTR and in the ORF (about 27%) and a low frequency was observed in the 3\\'UTR (about 8%). Two hundred twenty-seven EST-SSRs were evaluated, in sugarcane, allied genera of sugarcane and cereals, and 134 of these have revealed polymorphism with a range of PIC value 0.12 to 0.99. The cross transferability rate ranged from 87.0% to 93.4% in Saccharum complex, 80.0% to 87.0% in allied genera, and 76.0% to 80.0% in cereals. Cloning and sequencing of EST-SSR size variant amplicons revealed that the variation in the number of repeat-units was the main source of EST-SSR fragment polymorphism. When 124 sugarcane accessions were analyzed for population structure using model-based approach, seven genetically distinct groups or admixtures thereof were observed in sugarcane. Results of principal coordinate analysis or UPGMA to evaluate genetic relationships delineated also the 124 accessions into seven groups. Thus, a high level of polymorphism adequate genetic diversity and population structure assayed with the EST-SSR markers not only suggested their utility in various applications in genetics and genomics in

  10. BrassicaTED - a public database for utilization of miniature transposable elements in Brassica species.

    Science.gov (United States)

    Murukarthick, Jayakodi; Sampath, Perumal; Lee, Sang Choon; Choi, Beom-Soon; Senthil, Natesan; Liu, Shengyi; Yang, Tae-Jin

    2014-06-20

    MITE, TRIM and SINEs are miniature form transposable elements (mTEs) that are ubiquitous and dispersed throughout entire plant genomes. Tens of thousands of members cause insertion polymorphism at both the inter- and intra- species level. Therefore, mTEs are valuable targets and resources for development of markers that can be utilized for breeding, genetic diversity and genome evolution studies. Taking advantage of the completely sequenced genomes of Brassica rapa and B. oleracea, characterization of mTEs and building a curated database are prerequisite to extending their utilization for genomics and applied fields in Brassica crops. We have developed BrassicaTED as a unique web portal containing detailed characterization information for mTEs of Brassica species. At present, BrassicaTED has datasets for 41 mTE families, including 5894 and 6026 members from 20 MITE families, 1393 and 1639 members from 5 TRIM families, 1270 and 2364 members from 16 SINE families in B. rapa and B. oleracea, respectively. BrassicaTED offers different sections to browse structural and positional characteristics for every mTE family. In addition, we have added data on 289 MITE insertion polymorphisms from a survey of seven Brassica relatives. Genes with internal mTE insertions are shown with detailed gene annotation and microarray-based comparative gene expression data in comparison with their paralogs in the triplicated B. rapa genome. This database also includes a novel tool, K BLAST (Karyotype BLAST), for clear visualization of the locations for each member in the B. rapa and B. oleracea pseudo-genome sequences. BrassicaTED is a newly developed database of information regarding the characteristics and potential utility of mTEs including MITE, TRIM and SINEs in B. rapa and B. oleracea. The database will promote the development of desirable mTE-based markers, which can be utilized for genomics and breeding in Brassica species. BrassicaTED will be a valuable repository for scientists

  11. Framework for Optimizing Selection of Interspecies Correlation Estimation Models to Address Species Diversity and Toxicity Gaps in an Aquatic Database

    Science.gov (United States)

    The Chemical Aquatic Fate and Effects (CAFE) database is a tool that facilitates assessments of accidental chemical releases into aquatic environments. CAFE contains aquatic toxicity data used in the development of species sensitivity distributions (SSDs) and the estimation of ha...

  12. DDMGD: the database of text-mined associations between genes methylated in diseases from different species

    KAUST Repository

    Raies, A. B.

    2014-11-14

    Gathering information about associations between methylated genes and diseases is important for diseases diagnosis and treatment decisions. Recent advancements in epigenetics research allow for large-scale discoveries of associations of genes methylated in diseases in different species. Searching manually for such information is not easy, as it is scattered across a large number of electronic publications and repositories. Therefore, we developed DDMGD database (http://www.cbrc.kaust.edu.sa/ddmgd/) to provide a comprehensive repository of information related to genes methylated in diseases that can be found through text mining. DDMGD\\'s scope is not limited to a particular group of genes, diseases or species. Using the text mining system DEMGD we developed earlier and additional post-processing, we extracted associations of genes methylated in different diseases from PubMed Central articles and PubMed abstracts. The accuracy of extracted associations is 82% as estimated on 2500 hand-curated entries. DDMGD provides a user-friendly interface facilitating retrieval of these associations ranked according to confidence scores. Submission of new associations to DDMGD is provided. A comparison analysis of DDMGD with several other databases focused on genes methylated in diseases shows that DDMGD is comprehensive and includes most of the recent information on genes methylated in diseases.

  13. DDMGD: the database of text-mined associations between genes methylated in diseases from different species.

    Science.gov (United States)

    Bin Raies, Arwa; Mansour, Hicham; Incitti, Roberto; Bajic, Vladimir B

    2015-01-01

    Gathering information about associations between methylated genes and diseases is important for diseases diagnosis and treatment decisions. Recent advancements in epigenetics research allow for large-scale discoveries of associations of genes methylated in diseases in different species. Searching manually for such information is not easy, as it is scattered across a large number of electronic publications and repositories. Therefore, we developed DDMGD database (http://www.cbrc.kaust.edu.sa/ddmgd/) to provide a comprehensive repository of information related to genes methylated in diseases that can be found through text mining. DDMGD's scope is not limited to a particular group of genes, diseases or species. Using the text mining system DEMGD we developed earlier and additional post-processing, we extracted associations of genes methylated in different diseases from PubMed Central articles and PubMed abstracts. The accuracy of extracted associations is 82% as estimated on 2500 hand-curated entries. DDMGD provides a user-friendly interface facilitating retrieval of these associations ranked according to confidence scores. Submission of new associations to DDMGD is provided. A comparison analysis of DDMGD with several other databases focused on genes methylated in diseases shows that DDMGD is comprehensive and includes most of the recent information on genes methylated in diseases. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  14. Making species checklists understandable to machines - a shift from relational databases to ontologies.

    Science.gov (United States)

    Laurenne, Nina; Tuominen, Jouni; Saarenmaa, Hannu; Hyvönen, Eero

    2014-01-01

    The scientific names of plants and animals play a major role in Life Sciences as information is indexed, integrated, and searched using scientific names. The main problem with names is their ambiguous nature, because more than one name may point to the same taxon and multiple taxa may share the same name. In addition, scientific names change over time, which makes them open to various interpretations. Applying machine-understandable semantics to these names enables efficient processing of biological content in information systems. The first step is to use unique persistent identifiers instead of name strings when referring to taxa. The most commonly used identifiers are Life Science Identifiers (LSID), which are traditionally used in relational databases, and more recently HTTP URIs, which are applied on the Semantic Web by Linked Data applications. We introduce two models for expressing taxonomic information in the form of species checklists. First, we show how species checklists are presented in a relational database system using LSIDs. Then, in order to gain a more detailed representation of taxonomic information, we introduce meta-ontology TaxMeOn to model the same content as Semantic Web ontologies where taxa are identified using HTTP URIs. We also explore how changes in scientific names can be managed over time. The use of HTTP URIs is preferable for presenting the taxonomic information of species checklists. An HTTP URI identifies a taxon and operates as a web address from which additional information about the taxon can be located, unlike LSID. This enables the integration of biological data from different sources on the web using Linked Data principles and prevents the formation of information silos. The Linked Data approach allows a user to assemble information and evaluate the complexity of taxonomical data based on conflicting views of taxonomic classifications. Using HTTP URIs and Semantic Web technologies also facilitate the representation of the

  15. MethBank 3.0: a database of DNA methylomes across a variety of species.

    Science.gov (United States)

    Li, Rujiao; Liang, Fang; Li, Mengwei; Zou, Dong; Sun, Shixiang; Zhao, Yongbing; Zhao, Wenming; Bao, Yiming; Xiao, Jingfa; Zhang, Zhang

    2018-01-04

    MethBank (http://bigd.big.ac.cn/methbank) is a database that integrates high-quality DNA methylomes across a variety of species and provides an interactive browser for visualization of methylation data. Here, we present an updated implementation of MethBank (version 3.0) by incorporating more DNA methylomes from multiple species and equipping with more enhanced functionalities for data annotation and more friendly web interfaces for data presentation, search and visualization. MethBank 3.0 features large-scale integration of high-quality methylomes, involving 34 consensus reference methylomes derived from a large number of human samples, 336 single-base resolution methylomes from different developmental stages and/or tissues of five plants, and 18 single-base resolution methylomes from gametes and early embryos at multiple stages of two animals. Additionally, it is enhanced by improving the functionalities for data annotation, which accordingly enables systematic identification of methylation sites closely associated with age, sites with constant methylation levels across different ages, differentially methylated promoters, age-specific differentially methylated cytosines/regions, and methylated CpG islands. Moreover, MethBank provides tools to estimate human methylation age online and to identify differentially methylated promoters, respectively. Taken together, MethBank is upgraded with significant improvements and advances over the previous version, which is of great help for deciphering DNA methylation regulatory mechanisms for epigenetic studies. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. A database of marine phytoplankton abundance, biomass and species composition in Australian waters

    Science.gov (United States)

    Davies, Claire H.; Coughlan, Alex; Hallegraeff, Gustaaf; Ajani, Penelope; Armbrecht, Linda; Atkins, Natalia; Bonham, Prudence; Brett, Steve; Brinkman, Richard; Burford, Michele; Clementson, Lesley; Coad, Peter; Coman, Frank; Davies, Diana; Dela-Cruz, Jocelyn; Devlin, Michelle; Edgar, Steven; Eriksen, Ruth; Furnas, Miles; Hassler, Christel; Hill, David; Holmes, Michael; Ingleton, Tim; Jameson, Ian; Leterme, Sophie C.; Lønborg, Christian; McLaughlin, James; McEnnulty, Felicity; McKinnon, A. David; Miller, Margaret; Murray, Shauna; Nayar, Sasi; Patten, Renee; Pritchard, Tim; Proctor, Roger; Purcell-Meyerink, Diane; Raes, Eric; Rissik, David; Ruszczyk, Jason; Slotwinski, Anita; Swadling, Kerrie M.; Tattersall, Katherine; Thompson, Peter; Thomson, Paul; Tonks, Mark; Trull, Thomas W.; Uribe-Palomino, Julian; Waite, Anya M.; Yauwenas, Rouna; Zammit, Anthony; Richardson, Anthony J.

    2016-06-01

    There have been many individual phytoplankton datasets collected across Australia since the mid 1900s, but most are unavailable to the research community. We have searched archives, contacted researchers, and scanned the primary and grey literature to collate 3,621,847 records of marine phytoplankton species from Australian waters from 1844 to the present. Many of these are small datasets collected for local questions, but combined they provide over 170 years of data on phytoplankton communities in Australian waters. Units and taxonomy have been standardised, obviously erroneous data removed, and all metadata included. We have lodged this dataset with the Australian Ocean Data Network (http://portal.aodn.org.au/) allowing public access. The Australian Phytoplankton Database will be invaluable for global change studies, as it allows analysis of ecological indicators of climate change and eutrophication (e.g., changes in distribution; diatom:dinoflagellate ratios). In addition, the standardised conversion of abundance records to biomass provides modellers with quantifiable data to initialise and validate ecosystem models of lower marine trophic levels.

  17. EST2Prot: Mapping EST sequences to proteins

    Directory of Open Access Journals (Sweden)

    Lin David M

    2006-03-01

    Full Text Available Abstract Background EST libraries are used in various biological studies, from microarray experiments to proteomic and genetic screens. These libraries usually contain many uncharacterized ESTs that are typically ignored since they cannot be mapped to known genes. Consequently, new discoveries are possibly overlooked. Results We describe a system (EST2Prot that uses multiple elements to map EST sequences to their corresponding protein products. EST2Prot uses UniGene clusters, substring analysis, information about protein coding regions in existing DNA sequences and protein database searches to detect protein products related to a query EST sequence. Gene Ontology terms, Swiss-Prot keywords, and protein similarity data are used to map the ESTs to functional descriptors. Conclusion EST2Prot extends and significantly enriches the popular UniGene mapping by utilizing multiple relations between known biological entities. It produces a mapping between ESTs and proteins in real-time through a simple web-interface. The system is part of the Biozon database and is accessible at http://biozon.org/tools/est/.

  18. The African Crane Database (1978-2014): Records of three threatened crane species (Family: Gruidae) from southern and eastern Africa

    Science.gov (United States)

    Smith, Tanya; Page-Nicholson, Samantha; Gibbons, Bradley; Jones, M. Genevieve W.; van Niekerk, Mark; Botha, Bronwyn; Oliver, Kirsten; McCann, Kevin

    2016-01-01

    Abstract Background The International Crane Foundation (ICF) / Endangered Wildlife Trust’s (EWT) African Crane Conservation Programme has recorded 26 403 crane sightings in its database from 1978 to 2014. This sightings collection is currently ongoing and records are continuously added to the database by the EWT field staff, ICF/EWT Partnership staff, various partner organizations and private individuals. The dataset has two peak collection periods: 1994-1996 and 2008-2012. The dataset collection spans five African countries: Kenya, Rwanda, South Africa, Uganda and Zambia; 98% of the data were collected in South Africa. Georeferencing of the dataset was verified before publication of the data. The dataset contains data on three African crane species: Blue Crane Anthropoides paradiseus, Grey Crowned Crane Balearica regulorum and Wattled Crane Bugeranus carunculatus. The Blue and Wattled Cranes are classified by the IUCN Red List of Threatened Species as Vulnerable and the Grey Crowned Crane as Endangered. New information This is the single most comprehensive dataset published on African Crane species that adds new information about the distribution of these three threatened species. We hope this will further aid conservation authorities to monitor and protect these species. The dataset continues to grow and especially to expand in geographic coverage into new countries in Africa and new sites within countries. The dataset can be freely accessed through the Global Biodiversity Information Facility data portal. PMID:27956850

  19. Hawaiʻi Coral Disease database (HICORDIS: species-specific coral health data from across the Hawaiian archipelago

    Directory of Open Access Journals (Sweden)

    Jamie M. Caldwell

    2016-09-01

    Full Text Available The Hawaiʻi Coral Disease database (HICORDIS houses data on colony-level coral health condition observed across the Hawaiian archipelago, providing information to conduct future analyses on coral reef health in an era of changing environmental conditions. Colonies were identified to the lowest taxonomic classification possible (species or genera, measured and assessed for visual signs of health condition. Data were recorded for 286,071 coral colonies surveyed on 1819 transects at 660 sites between 2005 and 2015. The database contains observations for 60 species from 22 genera with 21 different health conditions. The goals of the HICORDIS database are to: i provide open access, quality controlled and validated coral health data assembled from disparate surveys conducted across Hawaiʻi; ii facilitate appropriate crediting of data; and iii encourage future analyses of coral reef health. In this article, we describe and provide data from the HICORDIS database. The data presented in this paper were used in the research article “Satellite SST-based Coral Disease Outbreak Predictions for the Hawaiian Archipelago” (Caldwell et al., 2016 [1]. Keywords: Marine biology, Coral, Reefs, Disease, Hawaii

  20. A two-locus DNA sequence database for typing plant and human pathogens within the Fusarium oxysporum species complex

    DEFF Research Database (Denmark)

    O'Donnell, Kerry; Gueidan, C; Sink, S

    2009-01-01

    We constructed a two-locus database, comprising partial translation elongation factor (EF-1alpha) gene sequences and nearly full-length sequences of the nuclear ribosomal intergenic spacer region (IGS rDNA) for 850 isolates spanning the phylogenetic breadth of the Fusarium oxysporum species compl...... of the IGS rDNA sequences may be non-orthologous. We also evaluated enniatin, fumonisin and moniliformin mycotoxin production in vitro within a phylogenetic framework....

  1. Table of Cluster and Organism Species Number - Gclust Server | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Gclust Server Table of Cluster and Organism Species Number Data detail Data name Table of Cluster and Organism...resentative sequence ID of cluster, its length, the number of sequences contained in the cluster, organism s...pecies, the number of sequences belonging to the cluster for each of 95 organism ...t Us Table of Cluster and Organism Species Number - Gclust Server | LSDB Archive ...

  2. Global coordination and standardisation in marine biodiversity through the World Register of Marine Species (WoRMS and related databases.

    Directory of Open Access Journals (Sweden)

    Mark J Costello

    Full Text Available The World Register of Marine Species is an over 90% complete open-access inventory of all marine species names. Here we illustrate the scale of the problems with species names, synonyms, and their classification, and describe how WoRMS publishes online quality assured information on marine species. Within WoRMS, over 100 global, 12 regional and 4 thematic species databases are integrated with a common taxonomy. Over 240 editors from 133 institutions and 31 countries manage the content. To avoid duplication of effort, content is exchanged with 10 external databases. At present WoRMS contains 460,000 taxonomic names (from Kingdom to subspecies, 368,000 species level combinations of which 215,000 are currently accepted marine species names, and 26,000 related but non-marine species. Associated information includes 150,000 literature sources, 20,000 images, and locations of 44,000 specimens. Usage has grown linearly since its launch in 2007, with about 600,000 unique visitors to the website in 2011, and at least 90 organisations from 12 countries using WoRMS for their data management. By providing easy access to expert-validated content, WoRMS improves quality control in the use of species names, with consequent benefits to taxonomy, ecology, conservation and marine biodiversity research and management. The service manages information on species names that would otherwise be overly costly for individuals, and thus minimises errors in the application of nomenclature standards. WoRMS' content is expanding to include host-parasite relationships, additional literature sources, locations of specimens, images, distribution range, ecological, and biological data. Species are being categorised as introduced (alien, invasive, of conservation importance, and on other attributes. These developments have a multiplier effect on its potential as a resource for biodiversity research and management. As a consequence of WoRMS, we are witnessing improved

  3. Global Coordination and Standardisation in Marine Biodiversity through the World Register of Marine Species (WoRMS) and Related Databases

    Science.gov (United States)

    Bouchet, Philippe; Boxshall, Geoff; Fauchald, Kristian; Gordon, Dennis; Hoeksema, Bert W.; Poore, Gary C. B.; van Soest, Rob W. M.; Stöhr, Sabine; Walter, T. Chad; Vanhoorne, Bart; Decock, Wim

    2013-01-01

    The World Register of Marine Species is an over 90% complete open-access inventory of all marine species names. Here we illustrate the scale of the problems with species names, synonyms, and their classification, and describe how WoRMS publishes online quality assured information on marine species. Within WoRMS, over 100 global, 12 regional and 4 thematic species databases are integrated with a common taxonomy. Over 240 editors from 133 institutions and 31 countries manage the content. To avoid duplication of effort, content is exchanged with 10 external databases. At present WoRMS contains 460,000 taxonomic names (from Kingdom to subspecies), 368,000 species level combinations of which 215,000 are currently accepted marine species names, and 26,000 related but non-marine species. Associated information includes 150,000 literature sources, 20,000 images, and locations of 44,000 specimens. Usage has grown linearly since its launch in 2007, with about 600,000 unique visitors to the website in 2011, and at least 90 organisations from 12 countries using WoRMS for their data management. By providing easy access to expert-validated content, WoRMS improves quality control in the use of species names, with consequent benefits to taxonomy, ecology, conservation and marine biodiversity research and management. The service manages information on species names that would otherwise be overly costly for individuals, and thus minimises errors in the application of nomenclature standards. WoRMS' content is expanding to include host-parasite relationships, additional literature sources, locations of specimens, images, distribution range, ecological, and biological data. Species are being categorised as introduced (alien, invasive), of conservation importance, and on other attributes. These developments have a multiplier effect on its potential as a resource for biodiversity research and management. As a consequence of WoRMS, we are witnessing improved communication within the

  4. Rapid in silico cloning of genes using expressed sequence tags (ESTs).

    Science.gov (United States)

    Gill, R W; Sanseau, P

    2000-01-01

    Expressed sequence tags (ESTs) are short single-pass DNA sequences obtained from either end of cDNA clones. These ESTs are derived from a vast number of cDNA libraries obtained from different species. Human ESTs are the bulk of the data and have been widely used to identify new members of gene families, as markers on the human chromosomes, to discover polymorphism sites and to compare expression patterns in different tissues or pathologies states. Information strategies have been devised to query EST databases. Since most of the analysis is performed with a computer, the term "in silico" strategy has been coined. In this chapter we will review the current status of EST databases, the pros and cons of EST-type data and describe possible strategies to retrieve meaningful information.

  5. Ex-vessel Fish Price Database: Disaggregating Prices for Low-Priced Species from Reduction Fisheries

    Directory of Open Access Journals (Sweden)

    Travis C. Tai

    2017-11-01

    Full Text Available Ex-vessel fish prices are essential for comprehensive fisheries management and socioeconomic analyses for fisheries science. In this paper, we reconstructed a global ex-vessel price database with the following areas of improvement: (1 compiling reported prices explicitly listed as “for reduction to fishmeal and fish oil” to estimate prices separately for catches destined for fishmeal and fish oil production, and other non-direct human consumption purposes; (2 including 95% confidence limit estimates for each price estimation; and (3 increasing the number of input data and the number of price estimates to match the reconstructed Sea Around Us catch database. Our primary focus was to address this first area of improvement as ex-vessel prices for catches destined for non-direct human consumption purposes were substantially overestimated, notably in countries with large reduction fisheries. For example in Peru, 2010 landed values were estimated as 3.8 billion real 2010 USD when using separate prices for reduction fisheries, compared with 5.8 billion using previous methods with only one price for all end-products. This update of the price database has significant global and country-specific impacts on fisheries price and landed value trends over time.

  6. galaxieEST: addressing EST identity through automated phylogenetic analysis.

    Science.gov (United States)

    Nilsson, R Henrik; Rajashekar, Balaji; Larsson, Karl-Henrik; Ursing, Björn M

    2004-07-05

    Research involving expressed sequence tags (ESTs) is intricately coupled to the existence of large, well-annotated sequence repositories. Comparatively complete and satisfactory annotated public sequence libraries are, however, available only for a limited range of organisms, rendering the absence of sequences and gene structure information a tangible problem for those working with taxa lacking an EST or genome sequencing project. Paralogous genes belonging to the same gene family but distinguished by derived characteristics are particularly prone to misidentification and erroneous annotation; high but incomplete levels of sequence similarity are typically difficult to interpret and have formed the basis of many unsubstantiated assumptions of orthology. In these cases, a phylogenetic study of the query sequence together with the most similar sequences in the database may be of great value to the identification process. In order to facilitate this laborious procedure, a project to employ automated phylogenetic analysis in the identification of ESTs was initiated. galaxieEST is an open source Perl-CGI script package designed to complement traditional similarity-based identification of EST sequences through employment of automated phylogenetic analysis. It uses a series of BLAST runs as a sieve to retrieve nucleotide and protein sequences for inclusion in neighbour joining and parsimony analyses; the output includes the BLAST output, the results of the phylogenetic analyses, and the corresponding multiple alignments. galaxieEST is available as an on-line web service for identification of fungal ESTs and for download / local installation for use with any organism group at http://galaxie.cgb.ki.se/galaxieEST.html. By addressing sequence relatedness in addition to similarity, galaxieEST provides an integrative view on EST origin and identity, which may prove particularly useful in cases where similarity searches return one or more pertinent, but not full, matches and

  7. DDMGD: the database of text-mined associations between genes methylated in diseases from different species

    KAUST Repository

    Raies, A. B.; Mansour, H.; Incitti, R.; Bajic, Vladimir B.

    2014-01-01

    ://www.cbrc.kaust.edu.sa/ddmgd/) to provide a comprehensive repository of information related to genes methylated in diseases that can be found through text mining. DDMGD's scope is not limited to a particular group of genes, diseases or species. Using the text mining system DEMGD we

  8. Assumption-versus data-based approaches to summarizing species' ranges.

    Science.gov (United States)

    Peterson, A Townsend; Navarro-Sigüenza, Adolfo G; Gordillo, Alejandro

    2018-06-01

    For conservation decision making, species' geographic distributions are mapped using various approaches. Some such efforts have downscaled versions of coarse-resolution extent-of-occurrence maps to fine resolutions for conservation planning. We examined the quality of the extent-of-occurrence maps as range summaries and the utility of refining those maps into fine-resolution distributional hypotheses. Extent-of-occurrence maps tend to be overly simple, omit many known and well-documented populations, and likely frequently include many areas not holding populations. Refinement steps involve typological assumptions about habitat preferences and elevational ranges of species, which can introduce substantial error in estimates of species' true areas of distribution. However, no model-evaluation steps are taken to assess the predictive ability of these models, so model inaccuracies are not noticed. Whereas range summaries derived by these methods may be useful in coarse-grained, global-extent studies, their continued use in on-the-ground conservation applications at fine spatial resolutions is not advisable in light of reliance on assumptions, lack of real spatial resolution, and lack of testing. In contrast, data-driven techniques that integrate primary data on biodiversity occurrence with remotely sensed data that summarize environmental dimensions (i.e., ecological niche modeling or species distribution modeling) offer data-driven solutions based on a minimum of assumptions that can be evaluated and validated quantitatively to offer a well-founded, widely accepted method for summarizing species' distributional patterns for conservation applications. © 2016 Society for Conservation Biology.

  9. VTCdb: a gene co-expression database for the crop species Vitis vinifera (grapevine).

    Science.gov (United States)

    Wong, Darren C J; Sweetman, Crystal; Drew, Damian P; Ford, Christopher M

    2013-12-16

    Gene expression datasets in model plants such as Arabidopsis have contributed to our understanding of gene function and how a single underlying biological process can be governed by a diverse network of genes. The accumulation of publicly available microarray data encompassing a wide range of biological and environmental conditions has enabled the development of additional capabilities including gene co-expression analysis (GCA). GCA is based on the understanding that genes encoding proteins involved in similar and/or related biological processes may exhibit comparable expression patterns over a range of experimental conditions, developmental stages and tissues. We present an open access database for the investigation of gene co-expression networks within the cultivated grapevine, Vitis vinifera. The new gene co-expression database, VTCdb (http://vtcdb.adelaide.edu.au/Home.aspx), offers an online platform for transcriptional regulatory inference in the cultivated grapevine. Using condition-independent and condition-dependent approaches, grapevine co-expression networks were constructed using the latest publicly available microarray datasets from diverse experimental series, utilising the Affymetrix Vitis vinifera GeneChip (16 K) and the NimbleGen Grape Whole-genome microarray chip (29 K), thus making it possible to profile approximately 29,000 genes (95% of the predicted grapevine transcriptome). Applications available with the online platform include the use of gene names, probesets, modules or biological processes to query the co-expression networks, with the option to choose between Affymetrix or Nimblegen datasets and between multiple co-expression measures. Alternatively, the user can browse existing network modules using interactive network visualisation and analysis via CytoscapeWeb. To demonstrate the utility of the database, we present examples from three fundamental biological processes (berry development, photosynthesis and flavonoid biosynthesis

  10. EUCANEXT: an integrated database for the exploration of genomic and transcriptomic data from Eucalyptus species

    OpenAIRE

    Nascimento, Leandro Costa; Salazar, Marcela Mendes; Lepikson-Neto, Jorge; Camargo, Eduardo Leal Oliveira; Parreiras, Lucas Salera; Pereira, Gonçalo Amarante Guimarães; Carazzolle, Marcelo Falsarella

    2017-01-01

    Abstract Tree species of the genus Eucalyptus are the most valuable and widely planted hardwoods in the world. Given the economic importance of Eucalyptus trees, much effort has been made towards the generation of specimens with superior forestry properties that can deliver high-quality feedstocks, customized to the industrýs needs for both cellulosic (paper) and lignocellulosic biomass production. In line with these efforts, large sets of molecular data have been generated by several scienti...

  11. New gSSR and EST-SSR markers reveal high genetic diversity in the invasive plant Ambrosia artemisiifolia L. and can be transferred to other invasive Ambrosia species.

    Science.gov (United States)

    Meyer, Lucie; Causse, Romain; Pernin, Fanny; Scalone, Romain; Bailly, Géraldine; Chauvel, Bruno; Délye, Christophe; Le Corre, Valérie

    2017-01-01

    Ambrosia artemisiifolia L., (common ragweed), is an annual invasive and highly troublesome plant species originating from North America that has become widespread across Europe. New sets of genomic and expressed sequence tag (EST) based simple sequence repeats (SSRs) markers were developed in this species using three approaches. After validation, 13 genomic SSRs and 13 EST-SSRs were retained and used to characterize the genetic diversity and population genetic structure of Ambrosia artemisiifolia populations from the native (North America) and invasive (Europe) ranges of the species. Analysing the mating system based on maternal families did not reveal any departure from complete allogamy and excess homozygosity was mostly due the presence of null alleles. High genetic diversity and patterns of genetic structure in Europe suggest two main introduction events followed by secondary colonization events. Cross-species transferability of the newly developed markers to other invasive species of the Ambrosia genus was assessed. Sixty-five percent and 75% of markers, respectively, were transferable from A. artemisiifolia to Ambrosia psilostachya and Ambrosia tenuifolia. 40% were transferable to Ambrosia trifida, this latter species being seemingly more phylogenetically distantly related to A. artemisiifolia than the former two.

  12. Cluster based on sequence comparison of homologous proteins of 95 organism species - Gclust Server | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Gclust Server Cluster based on sequence comparison of homologous proteins of 95 organism spe...cies Data detail Data name Cluster based on sequence comparison of homologous proteins of 95 organism specie...istory of This Database Site Policy | Contact Us Cluster based on sequence compariso

  13. Development of EST-derived markers in Dendrobium from EST of related taxa

    Directory of Open Access Journals (Sweden)

    Narisa Juejun

    2013-04-01

    Full Text Available Public databases are useful for molecular marker development. The major aim of this study was to develop expressedsequence tag (EST-derived markers in Dendrobium from available ESTs of Phalaenopsis and Dendrobium. A total of 6063sequences were screened for simple sequence repeats (SSRs and introns. Primers flanking these regions were generated andtested on genomic DNAs of Phalaenopsis and Dendrobium. Twenty-three percent of amplifiable Phalaenopsis EST-derivedmarkers were cross-genera transferable to Dendrobium. Forty-one markers from both Phalaenopsis and Dendrobium thatamplified in Dendrobium were assessed on six commercial cultivars and six wild accessions. All of them were transferableamong Dendrobium species. High polymorphism and heterozygosity were observed within wild accessions. Sixteen polymorphic markers were evaluated for linkage analysis on an F1 segregating population. Seven markers were mapped into threelinkage groups, two of which showed syntenic relationship between dendrobium and rice. This relationship will facilitatefurther quantitative trait loci (QTL mapping and comparative genomic studies of Dendrobium. Our results indicate thatPhalaenopsis EST-derived markers are valuable tools for genetic research and breeding applications in Dendrobium.

  14. Amino acid sequences of predicted proteins and their annotation for 95 organism species. - Gclust Server | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Gclust Server Amino acid sequences of predicted proteins and their annotation for 95 organis...m species. Data detail Data name Amino acid sequences of predicted proteins and their annotation for 95 orga...nism species. DOI 10.18908/lsdba.nbdc00464-001 Description of data contents Amino acid sequences of predicted proteins...Database Description Download License Update History of This Database Site Policy | Contact Us Amino acid sequences of predicted prot...eins and their annotation for 95 organism species. - Gclust Server | LSDB Archive ...

  15. SolRgene: an online database to explore disease resistance genes in tuber-bearing Solanum species

    Directory of Open Access Journals (Sweden)

    Vleeshouwers Vivianne GAA

    2011-08-01

    Full Text Available Abstract Background The cultivated potato (Solanum tuberosum L. is an important food crop, but highly susceptible to many pathogens. The major threat to potato production is the Irish famine pathogen Phytophthora infestans, which causes the devastating late blight disease. Potato breeding makes use of germplasm from wild relatives (wild germplasm to introduce resistances into cultivated potato. The Solanum section Petota comprises tuber-bearing species that are potential donors of new disease resistance genes. The aim of this study was to explore Solanum section Petota for resistance genes and generate a widely accessible resource that is useful for studying and implementing disease resistance in potato. Description The SolRgene database contains data on resistance to P. infestans and presence of R genes and R gene homologues in Solanum section Petota. We have explored Solanum section Petota for resistance to late blight in high throughput disease tests under various laboratory conditions and in field trials. From resistant wild germplasm, segregating populations were generated and assessed for the presence of resistance genes. All these data have been entered into the SolRgene database. To facilitate genetic and resistance gene evolution studies, phylogenetic data of the entire SolRgene collection are included, as well as a tool for generating phylogenetic trees of selected groups of germplasm. Data from resistance gene allele-mining studies are incorporated, which enables detection of R gene homologs in related germplasm. Using these resources, various resistance genes have been detected and some of these have been cloned, whereas others are in the cloning pipeline. All this information is stored in the online SolRgene database, which allows users to query resistance data, sequences, passport data of the accessions, and phylogenic classifications. Conclusion Solanum section Petota forms the basis of the SolRgene database, which contains a

  16. PHI-base: a new interface and further additions for the multi-species pathogen–host interactions database

    Science.gov (United States)

    Urban, Martin; Cuzick, Alayne; Rutherford, Kim; Irvine, Alistair; Pedro, Helder; Pant, Rashmi; Sadanadan, Vidyendra; Khamari, Lokanath; Billal, Santoshkumar; Mohanty, Sagar; Hammond-Kosack, Kim E.

    2017-01-01

    The pathogen–host interactions database (PHI-base) is available at www.phi-base.org. PHI-base contains expertly curated molecular and biological information on genes proven to affect the outcome of pathogen–host interactions reported in peer reviewed research articles. In addition, literature that indicates specific gene alterations that did not affect the disease interaction phenotype are curated to provide complete datasets for comparative purposes. Viruses are not included. Here we describe a revised PHI-base Version 4 data platform with improved search, filtering and extended data display functions. A PHIB-BLAST search function is provided and a link to PHI-Canto, a tool for authors to directly curate their own published data into PHI-base. The new release of PHI-base Version 4.2 (October 2016) has an increased data content containing information from 2219 manually curated references. The data provide information on 4460 genes from 264 pathogens tested on 176 hosts in 8046 interactions. Prokaryotic and eukaryotic pathogens are represented in almost equal numbers. Host species belong ∼70% to plants and 30% to other species of medical and/or environmental importance. Additional data types included into PHI-base 4 are the direct targets of pathogen effector proteins in experimental and natural host organisms. The curation problems encountered and the future directions of the PHI-base project are briefly discussed. PMID:27915230

  17. PHI-base: a new interface and further additions for the multi-species pathogen-host interactions database.

    Science.gov (United States)

    Urban, Martin; Cuzick, Alayne; Rutherford, Kim; Irvine, Alistair; Pedro, Helder; Pant, Rashmi; Sadanadan, Vidyendra; Khamari, Lokanath; Billal, Santoshkumar; Mohanty, Sagar; Hammond-Kosack, Kim E

    2017-01-04

    The pathogen-host interactions database (PHI-base) is available at www.phi-base.org PHI-base contains expertly curated molecular and biological information on genes proven to affect the outcome of pathogen-host interactions reported in peer reviewed research articles. In addition, literature that indicates specific gene alterations that did not affect the disease interaction phenotype are curated to provide complete datasets for comparative purposes. Viruses are not included. Here we describe a revised PHI-base Version 4 data platform with improved search, filtering and extended data display functions. A PHIB-BLAST search function is provided and a link to PHI-Canto, a tool for authors to directly curate their own published data into PHI-base. The new release of PHI-base Version 4.2 (October 2016) has an increased data content containing information from 2219 manually curated references. The data provide information on 4460 genes from 264 pathogens tested on 176 hosts in 8046 interactions. Prokaryotic and eukaryotic pathogens are represented in almost equal numbers. Host species belong ∼70% to plants and 30% to other species of medical and/or environmental importance. Additional data types included into PHI-base 4 are the direct targets of pathogen effector proteins in experimental and natural host organisms. The curation problems encountered and the future directions of the PHI-base project are briefly discussed. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Using the TIGR gene index databases for biological discovery.

    Science.gov (United States)

    Lee, Yuandan; Quackenbush, John

    2003-11-01

    The TIGR Gene Index web pages provide access to analyses of ESTs and gene sequences for nearly 60 species, as well as a number of resources derived from these. Each species-specific database is presented using a common format with a homepage. A variety of methods exist that allow users to search each species-specific database. Methods implemented currently include nucleotide or protein sequence queries using WU-BLAST, text-based searches using various sequence identifiers, searches by gene, tissue and library name, and searches using functional classes through Gene Ontology assignments. This protocol provides guidance for using the Gene Index Databases to extract information.

  19. A dichotomous species of Codium (Bryopsidales, Chlorophyta is colonizing northern Chile Una especie dicotómica de Codium (Bryopsidales, Chlorophyta está colonizando el norte de Chile

    Directory of Open Access Journals (Sweden)

    ALEJANDRA GONZÁLEZ

    2004-06-01

    Full Text Available In late 2001 and early 2002, a dichotomous species of Codium appeared colonizing the low intertidal and shallow subtidal bottoms of Caldera Bay, northern Chile (27° 03’ S, 70° 51’ W. Due to the ecological and economic impact the species is having in Caldera Bay and its potential spread along the Chilean coastline, we studied the taxonomic identity of the species and examined its relationships with other dichotomous species of Codium reported for temperate Pacific South America. Morphological analyses suggest that the seaweeds from Caldera Bay belong to Codium fragile (Suringar Hariot. Not only is there strong agreement in internal and external morphological characters, but among all the species reported for Peru and Chile, this is the only one exhibiting utricles with rounded, apiculate tip terminating in a mucron. This species has a broad geographic distribution in temperate waters. In Chile it was known only from the coasts of Valdivia to the Straits of Magellan (39° 48’ S, 73° 26’ W to 53° 10’ S, 73° 49’ W. This is the first record of C. fragile in northern Chile, and this study discusses several alternative hypotheses for the presence of the species into this area. The morphological characteristics of the material collected in Caldera partially agree with diagnostic characters known for C. fragile subspecies tasmanicum and C. fragile subspecies tomentosoides. However, the rapid population spread of the species in northern Chile, and recent molecular analysis support the identification of this form as the invasive C. fragile subspecies tomentosoidesA fines de 2001 y principio de 2002, apareció en el norte de Chile una especie del género Codium, de morfología dicotómica, colonizando los niveles intermareales y submareales de la Bahía de Caldera (27° 03’ S, 70° 51’ O. Debido al impacto ecológico y económico que ha provocado esta especie en la Bahía de Caldera y a su dispersión potencial a lo largo de la costa de

  20. HIGH IONIZATION SPECIES IN THE NEARBY INTERSTELLAR MEDIUM FROM AN EXHAUSTIVE ANALYSIS OF THE IUE INES DATABASE

    Energy Technology Data Exchange (ETDEWEB)

    Freire Ferrero, R.; Halbwachs, J.-L. [Observatoire Astronomique, Universite de Strasbourg, 11 rue de l' Universite, 67.000 Strasbourg (France); Morales Duran, C.; Cabo Cubeiro, A. M., E-mail: rubens.freire@astro.unistra.fr, E-mail: jean-louis.halbwachs@astro.unistra.fr, E-mail: morales@cab.inta-csic.es, E-mail: amcabo@cab.inta-csic.es [LAEX, Centro de Astrobiologia (CAB, CSIC-INTA), LAEFF, European Space Astronomy Centre, P.O. Box 78, 28691 Villanueva de la Canada, Madrid (Spain)

    2012-02-15

    Main-sequence (MS) and giant late-B and early-A type stars are the best targets for searching for nearby interstellar (IS) Si IV and C IV resonance lines because they are not able to produce them, either in atmospheric layers or in their circumstellar environment, and because many stars of these spectral types are nearby and located in the local interstellar medium (LISM). In addition, the use of certain stars hotter than B6 can lead to misinterpretations (e.g., alpha Arae). This work analyzes the reliable Short-Wavelength Prime high-resolution UV spectra of 558 B6-A9 type stars observed by the International Ultraviolet Explorer at distances lower than 400 pc from the Sun. For the first time, this work utilizes the entire INES database to extract stellar and IS information in a systematic way from homogeneous data. Stars were classified into seven groups: normal (MS and subgiant), giant, peculiar, emission line, Algols, pre-main sequence or Herbig Ae/Be, and shell stars. Only 10 normal stars, located beyond 90 pc, show weak Si IV and C IV absorptions and are clustered around the direction of Sco-Cen, while 85 located closer than 90 pc, as well as another 89 beyond 90 pc, do not show any absorptions at all. We conclude that these highly ionized species originate most probably in the LISM, at 90 pc from the Sun, corresponding to the warm interaction zone (IZ), where the expanding Loop I super-shell and the Local Bubble collide. We base our results on a robust statistical analysis of both spectral types and spatial distributions of the different stellar groups we defined. Our estimated column densities N of both ions are similar to those obtained by other authors for distant stars, suggesting that these absorptions concentrate fundamentally in the IZ. A logarithmic correlation between N and the distance D confirms these results.

  1. Development of EST-derived markers in Dendrobium from EST of related taxa

    OpenAIRE

    Narisa Juejun; Chataporn Chunwongse; Julapark Chunwongse

    2013-01-01

    Public databases are useful for molecular marker development. The major aim of this study was to develop expressedsequence tag (EST)-derived markers in Dendrobium from available ESTs of Phalaenopsis and Dendrobium. A total of 6063sequences were screened for simple sequence repeats (SSRs) and introns. Primers flanking these regions were generated andtested on genomic DNAs of Phalaenopsis and Dendrobium. Twenty-three percent of amplifiable Phalaenopsis EST-derivedmarkers were cross-genera trans...

  2. EST! best practices

    Science.gov (United States)

    2000-10-05

    This booklet presents the eighteen winners in a competition for projects that promote environmentally sustainable transportation (EST). These range from car sharing to hydrogen fueled buses to public transit promotion campaign. Each project is presen...

  3. Wheat EST resources for functional genomics of abiotic stress

    Directory of Open Access Journals (Sweden)

    Links Matthew G

    2006-06-01

    Full Text Available Abstract Background Wheat is an excellent species to study freezing tolerance and other abiotic stresses. However, the sequence of the wheat genome has not been completely characterized due to its complexity and large size. To circumvent this obstacle and identify genes involved in cold acclimation and associated stresses, a large scale EST sequencing approach was undertaken by the Functional Genomics of Abiotic Stress (FGAS project. Results We generated 73,521 quality-filtered ESTs from eleven cDNA libraries constructed from wheat plants exposed to various abiotic stresses and at different developmental stages. In addition, 196,041 ESTs for which tracefiles were available from the National Science Foundation wheat EST sequencing program and DuPont were also quality-filtered and used in the analysis. Clustering of the combined ESTs with d2_cluster and TGICL yielded a few large clusters containing several thousand ESTs that were refractory to routine clustering techniques. To resolve this problem, the sequence proximity and "bridges" were identified by an e-value distance graph to manually break clusters into smaller groups. Assembly of the resolved ESTs generated a 75,488 unique sequence set (31,580 contigs and 43,908 singletons/singlets. Digital expression analyses indicated that the FGAS dataset is enriched in stress-regulated genes compared to the other public datasets. Over 43% of the unique sequence set was annotated and classified into functional categories according to Gene Ontology. Conclusion We have annotated 29,556 different sequences, an almost 5-fold increase in annotated sequences compared to the available wheat public databases. Digital expression analysis combined with gene annotation helped in the identification of several pathways associated with abiotic stress. The genomic resources and knowledge developed by this project will contribute to a better understanding of the different mechanisms that govern stress tolerance in

  4. Shortcomings of of the commercial MALDI-TOF MS database and use of MLSA as an arbiter in the identification of Nocardia species

    Directory of Open Access Journals (Sweden)

    Gema eCarrasco

    2016-04-01

    Full Text Available Nocardia species are difficult to identify, a consequence of the ever increasing number of species known and their homogeneous genetic characteristics. 16S rRNA analysis has been the gold standard for identifying these organisms, but proteomic techniques such as matrix-assisted laser desorption ionization-time of flight (MALDI-TOF MS and housekeeping gene analysis, have also been explored. One hundred high (n=25, intermediate (n=20 and low (n=55 prevalence (for Spain Nocardia strains belonging to 30 species were identified via 16S rRNA and MALDI-TOF MS analysis. The manufacturer-provided database MALDI Biotyper library v4.0 (5.627 entries, Bruker Daltonik was employed. In the high prevalence group (N. farcinica, N. abscessus, N. cyriacigeorgica and N. nova, the 16S rRNA and MALDI-TOF MS methods provided the same identification for 76% of the strains examined. For the intermediate prevalence group (N. brasiliensis, N. carnea, N. otitidiscaviarum and N. transvalensis complex, this figure fell to 45%. In the low-prevalence group (22 species, these two methods were concordant only in six strains at the species level. Tetra-gene multi-locus sequencing analysis (MLSA involving the concatemer gyrB-16S rRNA-hsp65-secA1 was used to arbitrate between discrepant identifications (n=67. Overall, the MLSA confirmed the results provided at species level by 16S rRNA analysis in 34.3% of discrepancies, and those provided by MALDI-TOF MS in 13.4%. MALDI-TOF MS could be a strong candidate for the identification of Nocardia species, but only if its reference spectrum database improves, especially with respect to unusual, recently described species and species included in the described Nocardia complexes.

  5. Shortcomings of the Commercial MALDI-TOF MS Database and Use of MLSA as an Arbiter in the Identification of Nocardia Species

    Science.gov (United States)

    Carrasco, Gema; de Dios Caballero, Juan; Garrido, Noelia; Valdezate, Sylvia; Cantón, Rafael; Sáez-Nieto, Juan A.

    2016-01-01

    Nocardia species are difficult to identify, a consequence of the ever increasing number of species known and their homogeneous genetic characteristics. 16S rRNA analysis has been the gold standard for identifying these organisms, but proteomic techniques such as matrix-assisted laser desorption ionization-time of flight (MALDI-TOF MS) and housekeeping gene analysis, have also been explored. One hundred high (n = 25), intermediate (n = 20), and low (n = 55) prevalence (for Spain) Nocardia strains belonging to 30 species were identified via 16S rRNA and MALDI-TOF MS analysis. The manufacturer-provided database MALDI Biotyper library v4.0 (5.627 entries, Bruker Daltonik) was employed. In the high prevalence group (Nocardia farcinica, N. abscessus, N. cyriacigeorgica and N. nova), the 16S rRNA and MALDI-TOF MS methods provided the same identification for 76% of the strains examined. For the intermediate prevalence group (N. brasiliensis, N. carnea, N. otitidiscaviarum and N. transvalensis complex), this figure fell to 45%. In the low-prevalence group (22 species), these two methods were concordant only in six strains at the species level. Tetra-gene multi-locus sequencing analysis (MLSA) involving the concatemer gyrB-16S rRNA-hsp65-secA1 was used to arbitrate between discrepant identifications (n = 67). Overall, the MLSA confirmed the results provided at species level by 16S rRNA analysis in 34.3% of discrepancies, and those provided by MALDI-TOF MS in 13.4%. MALDI-TOF MS could be a strong candidate for the identification of Nocardia species, but only if its reference spectrum database improves, especially with respect to unusual, recently described species and species included in the described Nocardia complexes. PMID:27148228

  6. DMPD: NF-kappaB activation by reactive oxygen species: fifteen years later. [Dynamic Macrophage Pathway CSML Database

    Lifescience Database Archive (English)

    Full Text Available 16723122 NF-kappaB activation by reactive oxygen species: fifteen years later. Gloi...svg) (.html) (.csml) Show NF-kappaB activation by reactive oxygen species: fifteen years later. PubmedID 167...23122 Title NF-kappaB activation by reactive oxygen species: fifteen years later.

  7. EST Vocabulary Instruction

    Directory of Open Access Journals (Sweden)

    Célia D.S. Bell

    2012-05-01

    Full Text Available This study aims at contributing to the investigation on the instruction of EST (English for Science and Technology vocabulary, in terms of receptive use of the language. It evaluates the effectiveness of two teaching approaches to the acquisition of vocabulary. The first approach consisted of teaching vocabulary through the use of dictionaries, where the words were merely translated into the learners’ L1 or defined in the target language thus promoting superficial level of word processing. The second approach employed activities promoting deep level of word processing. Data were analysed quantitatively. Results indicated that the two approaches seem to have some equipotentiality, as far as EST vocabulary is concerned.

  8. ESTIMA, a tool for EST management in a multi-project environment

    Directory of Open Access Journals (Sweden)

    Lewin Harris A

    2004-11-01

    Full Text Available Abstract Background Single-pass, partial sequencing of complementary DNA (cDNA libraries generates thousands of chromatograms that are processed into high quality expressed sequence tags (ESTs, and then assembled into contigs representative of putative genes. Usually, to be of value, ESTs and contigs must be associated with meaningful annotations, and made available to end-users. Results A web application, Expressed Sequence Tag Information Management and Annotation (ESTIMA, has been created to meet the EST annotation and data management requirements of multiple high-throughput EST sequencing projects. It is anchored on individual ESTs and organized around different properties of ESTs including chromatograms, base-calling quality scores, structure of assembled transcripts, and multiple sources of comparison to infer functional annotation, Gene Ontology associations, and cDNA library information. ESTIMA consists of a relational database schema and a set of interactive query interfaces. These are integrated with a suite of web-based tools that allow a user to query and retrieve information. Further, query results are interconnected among the various EST properties. ESTIMA has several unique features. Users may run their own EST processing pipeline, search against preferred reference genomes, and use any clustering and assembly algorithm. The ESTIMA database schema is very flexible and accepts output from any EST processing and assembly pipeline. ESTIMA has been used for the management of EST projects of many species, including honeybee (Apis mellifera, cattle (Bos taurus, songbird (Taeniopygia guttata, corn rootworm (Diabrotica vergifera, catfish (Ictalurus punctatus, Ictalurus furcatus, and apple (Malus x domestica. The entire resource may be downloaded and used as is, or readily adapted to fit the unique needs of other cDNA sequencing projects. Conclusions The scripts used to create the ESTIMA interface are freely available to academic users in

  9. ESTIMA, a tool for EST management in a multi-project environment.

    Science.gov (United States)

    Kumar, Charu G; LeDuc, Richard; Gong, George; Roinishivili, Levan; Lewin, Harris A; Liu, Lei

    2004-11-04

    Single-pass, partial sequencing of complementary DNA (cDNA) libraries generates thousands of chromatograms that are processed into high quality expressed sequence tags (ESTs), and then assembled into contigs representative of putative genes. Usually, to be of value, ESTs and contigs must be associated with meaningful annotations, and made available to end-users. A web application, Expressed Sequence Tag Information Management and Annotation (ESTIMA), has been created to meet the EST annotation and data management requirements of multiple high-throughput EST sequencing projects. It is anchored on individual ESTs and organized around different properties of ESTs including chromatograms, base-calling quality scores, structure of assembled transcripts, and multiple sources of comparison to infer functional annotation, Gene Ontology associations, and cDNA library information. ESTIMA consists of a relational database schema and a set of interactive query interfaces. These are integrated with a suite of web-based tools that allow a user to query and retrieve information. Further, query results are interconnected among the various EST properties. ESTIMA has several unique features. Users may run their own EST processing pipeline, search against preferred reference genomes, and use any clustering and assembly algorithm. The ESTIMA database schema is very flexible and accepts output from any EST processing and assembly pipeline. ESTIMA has been used for the management of EST projects of many species, including honeybee (Apis mellifera), cattle (Bos taurus), songbird (Taeniopygia guttata), corn rootworm (Diabrotica vergifera), catfish (Ictalurus punctatus, Ictalurus furcatus), and apple (Malus x domestica). The entire resource may be downloaded and used as is, or readily adapted to fit the unique needs of other cDNA sequencing projects. The scripts used to create the ESTIMA interface are freely available to academic users in an archived format from http

  10. A salmonid EST genomic study: genes, duplications, phylogeny and microarrays

    Directory of Open Access Journals (Sweden)

    Brahmbhatt Sonal

    2008-11-01

    Full Text Available Abstract Background Salmonids are of interest because of their relatively recent genome duplication, and their extensive use in wild fisheries and aquaculture. A comprehensive gene list and a comparison of genes in some of the different species provide valuable genomic information for one of the most widely studied groups of fish. Results 298,304 expressed sequence tags (ESTs from Atlantic salmon (69% of the total, 11,664 chinook, 10,813 sockeye, 10,051 brook trout, 10,975 grayling, 8,630 lake whitefish, and 3,624 northern pike ESTs were obtained in this study and have been deposited into the public databases. Contigs were built and putative full-length Atlantic salmon clones have been identified. A database containing ESTs, assemblies, consensus sequences, open reading frames, gene predictions and putative annotation is available. The overall similarity between Atlantic salmon ESTs and those of rainbow trout, chinook, sockeye, brook trout, grayling, lake whitefish, northern pike and rainbow smelt is 93.4, 94.2, 94.6, 94.4, 92.5, 91.7, 89.6, and 86.2% respectively. An analysis of 78 transcript sets show Salmo as a sister group to Oncorhynchus and Salvelinus within Salmoninae, and Thymallinae as a sister group to Salmoninae and Coregoninae within Salmonidae. Extensive gene duplication is consistent with a genome duplication in the common ancestor of salmonids. Using all of the available EST data, a new expanded salmonid cDNA microarray of 32,000 features was created. Cross-species hybridizations to this cDNA microarray indicate that this resource will be useful for studies of all 68 salmonid species. Conclusion An extensive collection and analysis of salmonid RNA putative transcripts indicate that Pacific salmon, Atlantic salmon and charr are 94–96% similar while the more distant whitefish, grayling, pike and smelt are 93, 92, 89 and 86% similar to salmon. The salmonid transcriptome reveals a complex history of gene duplication that is

  11. Potential impacts of OCS oil and gas activities on fisheries. Volume 1. Annotated bibliography and database descriptions for target-species distribution and abundance studies. Section 1, Part 2. Final report

    International Nuclear Information System (INIS)

    Tear, L.M.

    1989-10-01

    The purpose of the volume is to present an annotated bibliography of unpublished and grey literature related to the distribution and abundance of select species of finfish and shellfish along the coasts of the United States. The volume also includes descriptions of databases that contain information related to target species' distribution and abundance. An index is provided at the end of each section to help the reader locate studies or databases related to a particular species

  12. Potential impacts of OCS oil and gas activities on fisheries. Volume 1. Annotated bibliography and database descriptions for target species distribution and abundance studies. Section 1, Part 1. Final report

    International Nuclear Information System (INIS)

    Tear, L.M.

    1989-10-01

    The purpose of the volume is to present an annotated bibliography of unpublished and grey literature related to the distribution and abundance of select species of finfish and shellfish along the coasts of the United States. The volume also includes descriptions of databases that contain information related to target species' distribution and abundance. An index is provided at the end of each section to help the reader locate studies or databases related to a particular species

  13. GeMprospector--online design of cross-species genetic marker candidates in legumes and grasses.

    Science.gov (United States)

    Fredslund, Jakob; Madsen, Lene H; Hougaard, Birgit K; Sandal, Niels; Stougaard, Jens; Bertioli, David; Schauser, Leif

    2006-07-01

    The web program GeMprospector (URL: http://cgi-www.daimi.au.dk/cgi-chili/GeMprospector/main) allows users to automatically design large sets of cross-species genetic marker candidates targeting either legumes or grasses. The user uploads a collection of ESTs from one or more legume or grass species, and they are compared with a database of clusters of homologous EST and genomic sequences from other legumes or grasses, respectively. Multiple sequence alignments between submitted ESTs and their homologues in the appropriate database form the basis of automated PCR primer design in conserved exons such that each primer set amplifies an intron. The only user input is a collection of ESTs, not necessarily from more than one species, and GeMprospector can boost the potential of such an EST collection by combining it with a large database to produce cross-species genetic marker candidates for legumes or grasses.

  14. A Global Database of Field-observed Leaf Area Index in Woody Plant Species, 1932-2011

    Data.gov (United States)

    National Aeronautics and Space Administration — This data set provides global leaf area index (LAI) values for woody species. The data are a compilation of field-observed data from 1,216 locations obtained from...

  15. A Global Database of Field-observed Leaf Area Index in Woody Plant Species, 1932-2011

    Data.gov (United States)

    National Aeronautics and Space Administration — ABSTRACT: This data set provides global leaf area index (LAI) values for woody species. The data are a compilation of field-observed data from 1,216 locations...

  16. EST-SSR

    African Journals Online (AJOL)

    user2

    2013-02-27

    Feb 27, 2013 ... Due to their high abundance, multi- allelic nature ... different root colors and origins, and 10 related species in Brassica were selected for ... analysis and transferability study, genomic DNA of two radish advanced inbred ...

  17. Cyanide and antimony thermodynamic database for the aqueous species and solids for the EPA-MINTEQ geochemical code

    International Nuclear Information System (INIS)

    Sehmel, G.A.

    1989-05-01

    Thermodynamic data for aqueous species and solids that contain cyanide and antimony were tabulated from several commonly accepted, published sources of thermodynamic data and recent journal article. The review does not include gases or organic complexes of either antimony or cyanide, nor does the review include the sulfur compounds of cyanide. The basic thermodynamic data, ΔG/sub f,298//sup o/, ΔH/sub f,298//sup o/, and S/sub f//sup o/ values, were chosen to represent each solid phase and aqueous species for which data were available in the appropriate standard state. From these data the equilibrium constants (log K/sub r,298//sup o/) and enthalpies of reaction (ΔH/sub r,298//sup o/) at 298 K (25/degree/C) were calculated for reactions involving the formation of these aqueous species and solids from the basic components. 34 refs., 14 tabs

  18. annot8r: GO, EC and KEGG annotation of EST datasets

    Directory of Open Access Journals (Sweden)

    Schmid Ralf

    2008-04-01

    Full Text Available Abstract Background The expressed sequence tag (EST methodology is an attractive option for the generation of sequence data for species for which no completely sequenced genome is available. The annotation and comparative analysis of such datasets poses a formidable challenge for research groups that do not have the bioinformatics infrastructure of major genome sequencing centres. Therefore, there is a need for user-friendly tools to facilitate the annotation of non-model species EST datasets with well-defined ontologies that enable meaningful cross-species comparisons. To address this, we have developed annot8r, a platform for the rapid annotation of EST datasets with GO-terms, EC-numbers and KEGG-pathways. Results annot8r automatically downloads all files relevant for the annotation process and generates a reference database that stores UniProt entries, their associated Gene Ontology (GO, Enzyme Commission (EC and Kyoto Encyclopaedia of Genes and Genomes (KEGG annotation and additional relevant data. For each of GO, EC and KEGG, annot8r extracts a specific sequence subset from the UniProt dataset based on the information stored in the reference database. These three subsets are then formatted for BLAST searches. The user provides the protein or nucleotide sequences to be annotated and annot8r runs BLAST searches against these three subsets. The BLAST results are parsed and the corresponding annotations retrieved from the reference database. The annotations are saved both as flat files and also in a relational postgreSQL results database to facilitate more advanced searches within the results. annot8r is integrated with the PartiGene suite of EST analysis tools. Conclusion annot8r is a tool that assigns GO, EC and KEGG annotations for data sets resulting from EST sequencing projects both rapidly and efficiently. The benefits of an underlying relational database, flexibility and the ease of use of the program make it ideally suited for non

  19. Toward fish and seafood traceability: anchovy species determination in fish products by molecular markers and support through a public domain database.

    Science.gov (United States)

    Jérôme, Marc; Martinsohn, Jann Thorsten; Ortega, Delphine; Carreau, Philippe; Verrez-Bagnis, Véronique; Mouchel, Olivier

    2008-05-28

    Traceability in the fish food sector plays an increasingly important role for consumer protection and confidence building. This is reflected by the introduction of legislation and rules covering traceability on national and international levels. Although traceability through labeling is well established and supported by respective regulations, monitoring and enforcement of these rules are still hampered by the lack of efficient diagnostic tools. We describe protocols using a direct sequencing method based on 212-274-bp diagnostic sequences derived from species-specific mitochondria DNA cytochrome b, 16S rRNA, and cytochrome oxidase subunit I sequences which can efficiently be applied to unambiguously determine even closely related fish species in processed food products labeled "anchovy". Traceability of anchovy-labeled products is supported by the public online database AnchovyID ( http://anchovyid.jrc.ec.europa.eu), which provided data obtained during our study and tools for analytical purposes.

  20. PAVE: Program for assembling and viewing ESTs

    Directory of Open Access Journals (Sweden)

    Bomhoff Matthew

    2009-08-01

    Full Text Available Abstract Background New sequencing technologies are rapidly emerging. Many laboratories are simultaneously working with the traditional Sanger ESTs and experimenting with ESTs generated by the 454 Life Science sequencers. Though Sanger ESTs have been used to generate contigs for many years, no program takes full advantage of the 5' and 3' mate-pair information, hence, many tentative transcripts are assembled into two separate contigs. The new 454 technology has the benefit of high-throughput expression profiling, but introduces time and space problems for assembling large contigs. Results The PAVE (Program for Assembling and Viewing ESTs assembler takes advantage of the 5' and 3' mate-pair information by requiring that the mate-pairs be assembled into the same contig and joined by n's if the two sub-contigs do not overlap. It handles the depth of 454 data sets by "burying" similar ESTs during assembly, which retains the expression level information while circumventing time and space problems. PAVE uses MegaBLAST for the clustering step and CAP3 for assembly, however it assembles incrementally to enforce the mate-pair constraint, bury ESTs, and reduce incorrect joins and splits. The PAVE data management system uses a MySQL database to store multiple libraries of ESTs along with their metadata; the management system allows multiple assemblies with variations on libraries and parameters. Analysis routines provide standard annotation for the contigs including a measure of differentially expressed genes across the libraries. A Java viewer program is provided for display and analysis of the results. Our results clearly show the benefit of using the PAVE assembler to explicitly use mate-pair information and bury ESTs for large contigs. Conclusion The PAVE assembler provides a software package for assembling Sanger and/or 454 ESTs. The assembly software, data management software, Java viewer and user's guide are freely available.

  1. PAVE: program for assembling and viewing ESTs.

    Science.gov (United States)

    Soderlund, Carol; Johnson, Eric; Bomhoff, Matthew; Descour, Anne

    2009-08-26

    New sequencing technologies are rapidly emerging. Many laboratories are simultaneously working with the traditional Sanger ESTs and experimenting with ESTs generated by the 454 Life Science sequencers. Though Sanger ESTs have been used to generate contigs for many years, no program takes full advantage of the 5' and 3' mate-pair information, hence, many tentative transcripts are assembled into two separate contigs. The new 454 technology has the benefit of high-throughput expression profiling, but introduces time and space problems for assembling large contigs. The PAVE (Program for Assembling and Viewing ESTs) assembler takes advantage of the 5' and 3' mate-pair information by requiring that the mate-pairs be assembled into the same contig and joined by n's if the two sub-contigs do not overlap. It handles the depth of 454 data sets by "burying" similar ESTs during assembly, which retains the expression level information while circumventing time and space problems. PAVE uses MegaBLAST for the clustering step and CAP3 for assembly, however it assembles incrementally to enforce the mate-pair constraint, bury ESTs, and reduce incorrect joins and splits. The PAVE data management system uses a MySQL database to store multiple libraries of ESTs along with their metadata; the management system allows multiple assemblies with variations on libraries and parameters. Analysis routines provide standard annotation for the contigs including a measure of differentially expressed genes across the libraries. A Java viewer program is provided for display and analysis of the results. Our results clearly show the benefit of using the PAVE assembler to explicitly use mate-pair information and bury ESTs for large contigs. The PAVE assembler provides a software package for assembling Sanger and/or 454 ESTs. The assembly software, data management software, Java viewer and user's guide are freely available.

  2. From biomedicine to natural history research: EST resources for ambystomatid salamanders

    Directory of Open Access Journals (Sweden)

    Bryant Susan V

    2004-08-01

    Full Text Available Abstract Background Establishing genomic resources for closely related species will provide comparative insights that are crucial for understanding diversity and variability at multiple levels of biological organization. We developed ESTs for Mexican axolotl (Ambystoma mexicanum and Eastern tiger salamander (A. tigrinum tigrinum, species with deep and diverse research histories. Results Approximately 40,000 quality cDNA sequences were isolated for these species from various tissues, including regenerating limb and tail. These sequences and an existing set of 16,030 cDNA sequences for A. mexicanum were processed to yield 35,413 and 20,599 high quality ESTs for A. mexicanum and A. t. tigrinum, respectively. Because the A. t. tigrinum ESTs were obtained primarily from a normalized library, an approximately equal number of contigs were obtained for each species, with 21,091 unique contigs identified overall. The 10,592 contigs that showed significant similarity to sequences from the human RefSeq database reflected a diverse array of molecular functions and biological processes, with many corresponding to genes expressed during spinal cord injury in rat and fin regeneration in zebrafish. To demonstrate the utility of these EST resources, we searched databases to identify probes for regeneration research, characterized intra- and interspecific nucleotide polymorphism, saturated a human – Ambystoma synteny group with marker loci, and extended PCR primer sets designed for A. mexicanum / A. t. tigrinum orthologues to a related tiger salamander species. Conclusions Our study highlights the value of developing resources in traditional model systems where the likelihood of information transfer to multiple, closely related taxa is high, thus simultaneously enabling both laboratory and natural history research.

  3. From biomedicine to natural history research: EST resources for ambystomatid salamanders

    Science.gov (United States)

    Putta, Srikrishna; Smith, Jeramiah J; Walker, John A; Rondet, Mathieu; Weisrock, David W; Monaghan, James; Samuels, Amy K; Kump, Kevin; King, David C; Maness, Nicholas J; Habermann, Bianca; Tanaka, Elly; Bryant, Susan V; Gardiner, David M; Parichy, David M; Voss, S Randal

    2004-01-01

    Background Establishing genomic resources for closely related species will provide comparative insights that are crucial for understanding diversity and variability at multiple levels of biological organization. We developed ESTs for Mexican axolotl (Ambystoma mexicanum) and Eastern tiger salamander (A. tigrinum tigrinum), species with deep and diverse research histories. Results Approximately 40,000 quality cDNA sequences were isolated for these species from various tissues, including regenerating limb and tail. These sequences and an existing set of 16,030 cDNA sequences for A. mexicanum were processed to yield 35,413 and 20,599 high quality ESTs for A. mexicanum and A. t. tigrinum, respectively. Because the A. t. tigrinum ESTs were obtained primarily from a normalized library, an approximately equal number of contigs were obtained for each species, with 21,091 unique contigs identified overall. The 10,592 contigs that showed significant similarity to sequences from the human RefSeq database reflected a diverse array of molecular functions and biological processes, with many corresponding to genes expressed during spinal cord injury in rat and fin regeneration in zebrafish. To demonstrate the utility of these EST resources, we searched databases to identify probes for regeneration research, characterized intra- and interspecific nucleotide polymorphism, saturated a human – Ambystoma synteny group with marker loci, and extended PCR primer sets designed for A. mexicanum / A. t. tigrinum orthologues to a related tiger salamander species. Conclusions Our study highlights the value of developing resources in traditional model systems where the likelihood of information transfer to multiple, closely related taxa is high, thus simultaneously enabling both laboratory and natural history research. PMID:15310388

  4. Analysis of SSR information in EST resources of sugarcane

    Science.gov (United States)

    Expressed sequence tags ( ESTs) offer the opportunity to exploit single, low -copy, conserved sequence motifs for the development of simple sequence repeats ( SSRs). The total of 262 113 ESTs of sugarcane (Saccharum officinarum) in the database of NCBI were downloaded and analyzed, which resulted in...

  5. Enabling the hypothesis-driven prioritization of ligand candidates in big databases: Screenlamp and its application to GPCR inhibitor discovery for invasive species control

    Science.gov (United States)

    Raschka, Sebastian; Scott, Anne M.; Liu, Nan; Gunturu, Santosh; Huertas, Mar; Li, Weiming; Kuhn, Leslie A.

    2018-03-01

    While the advantage of screening vast databases of molecules to cover greater molecular diversity is often mentioned, in reality, only a few studies have been published demonstrating inhibitor discovery by screening more than a million compounds for features that mimic a known three-dimensional (3D) ligand. Two factors contribute: the general difficulty of discovering potent inhibitors, and the lack of free, user-friendly software to incorporate project-specific knowledge and user hypotheses into 3D ligand-based screening. The Screenlamp modular toolkit presented here was developed with these needs in mind. We show Screenlamp's ability to screen more than 12 million commercially available molecules and identify potent in vivo inhibitors of a G protein-coupled bile acid receptor within the first year of a discovery project. This pheromone receptor governs sea lamprey reproductive behavior, and to our knowledge, this project is the first to establish the efficacy of computational screening in discovering lead compounds for aquatic invasive species control. Significant enhancement in activity came from selecting compounds based on one of the hypotheses: that matching two distal oxygen groups in the 3D structure of the pheromone is crucial for activity. Six of the 15 most active compounds met these criteria. A second hypothesis—that presence of an alkyl sulfate side chain results in high activity—identified another 6 compounds in the top 10, demonstrating the significant benefits of hypothesis-driven screening.

  6. AcEST: DK948090 [AcEST

    Lifescience Database Archive (English)

    Full Text Available erase A OS=Aspergillus tubingen... 50 1e-05 sp|P0C1S9|DGLB_RAT Sn1-specific diacy...|P47145|YJ77_YEAST Putative lipase YJR107W OS=Saccharomyces ce... 50 8e-06 sp|O42815|FAEA_ASPTU Feruloyl est

  7. Relational databases

    CERN Document Server

    Bell, D A

    1986-01-01

    Relational Databases explores the major advances in relational databases and provides a balanced analysis of the state of the art in relational databases. Topics covered include capture and analysis of data placement requirements; distributed relational database systems; data dependency manipulation in database schemata; and relational database support for computer graphics and computer aided design. This book is divided into three sections and begins with an overview of the theory and practice of distributed systems, using the example of INGRES from Relational Technology as illustration. The

  8. Update History of This Database - ClEST | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available search(/contents-en/) != -1 || url.search(/index-e.html/) != -1 ) { document.getElementById(lang).innerHTML=.../) != -1 ) { url = url.replace(-e.html,.html); document.getElementById(lang).innerHTML=[ Japanese |...en/,/jp/); document.getElementById(lang).innerHTML=[ Japanese | English ]; } else if ( url.search(//contents...//) != -1 ) { url = url.replace(/contents/,/contents-en/); document.getElementById(lang).innerHTML=[ Japanes...e(/contents-en/,/contents/); document.getElementById(lang).innerHTML=[ Japanese | English ]; } else if( url.

  9. AcEST: BP912212 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000016_D11 457 Adiantum capillus-veneris mRNA. clone: YMU001_000016_D11. BP912212... CL1085Contig1 Show BP912212 Clone id YMU001_000016_D11 Library YMU01 Length 457 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000016_D11. Accession BP912212 Tissue type prothallium Developmental stag...f protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912212...|Adiantum capillus-veneris mRNA, clone: YMU001_000016_D11. (457 letters) Database: uniprot_sprot.fasta 412

  10. AcEST: BP912312 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000017_F01 489 Adiantum capillus-veneris mRNA. clone: YMU001_000017_F01. BP912312... CL1779Contig1 Show BP912312 Clone id YMU001_000017_F01 Library YMU01 Length 489 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000017_F01. Accession BP912312 Tissue type prothallium Developmental stag...on of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912312...|Adiantum capillus-veneris mRNA, clone: YMU001_000017_F01. (489 letters) Database: uniprot_sprot.fasta 412

  11. AcEST: BP912128 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000015_D10 477 Adiantum capillus-veneris mRNA. clone: YMU001_000015_D10. BP91212...8 CL2328Contig1 Show BP912128 Clone id YMU001_000015_D10 Library YMU01 Length 477 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000015_D10. Accession BP912128 Tissue type prothallium Developmental stag... protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP91212...8|Adiantum capillus-veneris mRNA, clone: YMU001_000015_D10. (461 letters) Database: uniprot_sprot.fasta 412

  12. AcEST: BP912123 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000015_D05 496 Adiantum capillus-veneris mRNA. clone: YMU001_000015_D05. BP91212...3 CL498Contig1 Show BP912123 Clone id YMU001_000015_D05 Library YMU01 Length 496 Definition Adiantum cap...illus-veneris mRNA. clone: YMU001_000015_D05. Accession BP912123 Tissue type prothallium Developmental stage...n of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP91212...3|Adiantum capillus-veneris mRNA, clone: YMU001_000015_D05. (478 letters) Database: uniprot_sprot.fasta 412

  13. AcEST: BP912912 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000024_C05 413 Adiantum capillus-veneris mRNA. clone: YMU001_000024_C05. BP912912... CL1433Contig1 Show BP912912 Clone id YMU001_000024_C05 Library YMU01 Length 413 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000024_C05. Accession BP912912 Tissue type prothallium Developmental stag... of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912912|Adiantum capillus-ven...eris mRNA, clone: YMU001_000024_C05. (413 letters) Database: uniprot_sprot.fasta 412

  14. AcEST: BP914061 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000039_D09 599 Adiantum capillus-veneris mRNA. clone: YMU001_000039_D09. BP91406...1 CL1730Contig1 Show BP914061 Clone id YMU001_000039_D09 Library YMU01 Length 599 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000039_D09. Accession BP914061 Tissue type prothallium Developmental stag... a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP914061|Adia...database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP914061|Adiantum capillus-veneris mRNA, c

  15. AcEST: DK958406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 8. 5' end sequence. DK958406 CL15Contig1 Show DK958406 Clone id TST39A01NGRL0030_O18 Library TST39 Length 60...7 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0030_O18. 5' end sequence. Accession DK958406...LAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK958406...database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK958406|Adiantum capillus-veneris mRNA, c

  16. AcEST: BP914064 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000039_D12 560 Adiantum capillus-veneris mRNA. clone: YMU001_000039_D12. BP91406...4 CL532Contig1 Show BP914064 Clone id YMU001_000039_D12 Library YMU01 Length 560 Definition Adiantum cap...illus-veneris mRNA. clone: YMU001_000039_D12. Accession BP914064 Tissue type prothallium Developmental stage...f protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP914064|Adiantum capillus-vener...: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP914064|Adi

  17. AcEST: DK950406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 4. 5' end sequence. DK950406 CL45Contig1 Show DK950406 Clone id TST38A01NGRL0008_J04 Library TST38 Length 64...8 Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0008_J04. 5' end sequence. Accession DK950406...SI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK950406... of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK950406

  18. AcEST: BP912406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000018_F09 348 Adiantum capillus-veneris mRNA. clone: YMU001_000018_F09. BP912406... CL1894Contig1 Show BP912406 Clone id YMU001_000018_F09 Library YMU01 Length 348 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000018_F09. Accession BP912406 Tissue type prothallium Developmental stag...in database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912406|Adiantum capillus-veneris mRNA...n of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912406

  19. AcEST: BP916406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000087_D01 556 Adiantum capillus-veneris mRNA. clone: YMU001_000087_D01. BP916406... CL1913Contig1 Show BP916406 Clone id YMU001_000087_D01 Library YMU01 Length 556 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000087_D01. Accession BP916406 Tissue type prothallium Developmental stag...ration of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP916406|Adiantum capill...database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP916406|Adiantum capillus-veneris mRNA, c

  20. AcEST: BP914406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000058_E09 562 Adiantum capillus-veneris mRNA. clone: YMU001_000058_E09. BP914406... CL513Contig1 Show BP914406 Clone id YMU001_000058_E09 Library YMU01 Length 562 Definition Adiantum cap...illus-veneris mRNA. clone: YMU001_000058_E09. Accession BP914406 Tissue type prothallium Developmental stage...tion of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP914406|Adiantum capillus...PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP914406

  1. AcEST: DK951406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 6. 5' end sequence. DK951406 - Show DK951406 Clone id TST38A01NGRL0011_F06 Library TST38 Length 670 Definiti...on Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0011_F06. 5' end sequence. Accession DK951406 Tissue t...AST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK951406...), Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK951406

  2. AcEST: DK957406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 5. 5' end sequence. DK957406 - Show DK957406 Clone id TST39A01NGRL0028_E15 Library TST39 Length 641 Definiti...on Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0028_E15. 5' end sequence. Accession DK957406 Tissue t...ed BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK957406... PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK957406

  3. AcEST: BP921000 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000144_D05 407 Adiantum capillus-veneris mRNA. clone: YMU001_000144_D05. BP921000 - Show BP921000...is mRNA. clone: YMU001_000144_D05. Accession BP921000 Tissue type prothallium Developmental stage - Contig I...eneration of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP921000|Adiantum cap...ein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP921000|Adiantum capillus-veneris mRN

  4. AcEST: BP920140 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000133_E03 489 Adiantum capillus-veneris mRNA. clone: YMU001_000133_E03. BP92014...0 CL2574Contig1 Show BP920140 Clone id YMU001_000133_E03 Library YMU01 Length 489 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000133_E03. Accession BP920140 Tissue type prothallium Developmental stag... database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP920140|Adian... and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP92014

  5. AcEST: BP920996 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000144_D01 496 Adiantum capillus-veneris mRNA. clone: YMU001_000144_D01. BP92099...6 CL262Contig1 Show BP920996 Clone id YMU001_000144_D01 Library YMU01 Length 496 Definition Adiantum cap...illus-veneris mRNA. clone: YMU001_000144_D01. Accession BP920996 Tissue type prothallium Developmental stage...n of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP92099...n of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP920996|Adiantum capillus-ve

  6. AcEST: BP920995 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000144_C12 350 Adiantum capillus-veneris mRNA. clone: YMU001_000144_C12. BP920995 - Show BP92099...is mRNA. clone: YMU001_000144_C12. Accession BP920995 Tissue type prothallium Developmental stage - Contig I...-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP92099...ew generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP92099

  7. AcEST: BP920997 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000144_D02 534 Adiantum capillus-veneris mRNA. clone: YMU001_000144_D02. BP92099...7 CL10Contig1 Show BP920997 Clone id YMU001_000144_D02 Library YMU01 Length 534 Definition Adiantum capi...llus-veneris mRNA. clone: YMU001_000144_D02. Accession BP920997 Tissue type prothallium Developmental stage ...a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP920997|Adian...BLAST: a new generation of protein database search programs, Nucleic Acids Res. 2

  8. AcEST: BP912099 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000015_B05 315 Adiantum capillus-veneris mRNA. clone: YMU001_000015_B05. BP912099 - Show BP912099...is mRNA. clone: YMU001_000015_B05. Accession BP912099 Tissue type prothallium Developmental stage - Contig I...BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912099...of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912099|Adiantum capillus-vene

  9. AcEST: DK948019 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 4. 5' end sequence. DK948019 CL3Contig1 Show DK948019 Clone id TST38A01NGRL0002_D04 Library TST38 Length 716... Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0002_D04. 5' end sequence. Accession DK948019...of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK94801...database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK948019|Adiantum capillus-veneris mRNA, c...th = 396 Score = 313 bits (801), Expect = 9e-84 Identities = 166/213 (77%), Posit

  10. AcEST: DK958801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 2. 5' end sequence. DK958801 CL3693Contig1 Show DK958801 Clone id TST39A01NGRL0002_P12 Library TST39 Length ...581 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0002_P12. 5' end sequence. Accession DK958801...protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK958801|Adiantum capillus-veneris...ration of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK958801|Adiantum capill

  11. AcEST: DK956801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 2. 5' end sequence. DK956801 CL1196Contig1 Show DK956801 Clone id TST39A01NGRL0026_L02 Library TST39 Length ...585 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0026_L02. 5' end sequence. Accession DK956801...apped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK956801...on of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK956801|Adiantum capillus-v

  12. AcEST: BP918013 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000108_F01 490 Adiantum capillus-veneris mRNA. clone: YMU001_000108_F01. BP918013 - Show BP91801...is mRNA. clone: YMU001_000108_F01. Accession BP918013 Tissue type prothallium Developmental stage - Contig I...protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP91801...idylinositol-4-phosphate 5-kinase 1 OS=Oryza sativa subsp. japonica GN=PIPK1 PE=2 SV=2 Length = 801 Score = ..., Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP91801

  13. AcEST: DK948801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 1. 5' end sequence. DK948801 - Show DK948801 Clone id TST38A01NGRL0004_E11 Library TST38 Length 549 Definiti...on Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0004_E11. 5' end sequence. Accession DK948801 Tissue t...new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK948801|Adiantu...AST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK948801

  14. AcEST: DK955801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 3. 5' end sequence. DK955801 CL19Contig1 Show DK955801 Clone id TST39A01NGRL0024_B03 Library TST39 Length 62...2 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0024_B03. 5' end sequence. Accession DK955801...ST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK955801...LAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK955801

  15. An EST dataset for Metasequoia glyptostroboides buds: the first EST resource for molecular genomics studies in Metasequoia.

    Science.gov (United States)

    Zhao, Ying; Thammannagowda, Shivegowda; Staton, Margaret; Tang, Sha; Xia, Xinli; Yin, Weilun; Liang, Haiying

    2013-03-01

    The "living fossil" Metasequoia glyptostroboides Hu et Cheng, commonly known as dawn redwood or Chinese redwood, is the only living species in the genus and is valued for its essential oil and crude extracts that have great potential for anti-fungal activity. Despite its paleontological significance and economical value as a rare relict species, genomic resources of Metasequoia are very limited. In order to gain insight into the molecular mechanisms behind the formation of reproductive buds and the transition from vegetative phase to reproductive phase in Metasequoia, we performed sequencing of expressed sequence tags from Metasequoia vegetative buds and female buds. By using the 454 pyrosequencing technology, a total of 1,571,764 high-quality reads were generated, among which 733,128 were from vegetative buds and 775,636 were from female buds. These EST reads were clustered and assembled into 114,124 putative unique transcripts (PUTs) with an average length of 536 bp. The 97,565 PUTs that were at least 100 bp in length were functionally annotated by a similarity search against public databases and assigned with Gene Ontology (GO) terms. A total of 59 known floral gene families and 190 isotigs involved in hormone regulation were captured in the dataset. Furthermore, a set of PUTs differentially expressed in vegetative and reproductive buds, as well as SSR motifs and high confidence SNPs, were identified. This is the first large-scale expressed sequence tags ever generated in Metasequoia and the first evidence for floral genes in this critically endangered deciduous conifer species.

  16. Biofuel Database

    Science.gov (United States)

    Biofuel Database (Web, free access)   This database brings together structural, biological, and thermodynamic data for enzymes that are either in current use or are being considered for use in the production of biofuels.

  17. Community Database

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — This excel spreadsheet is the result of merging at the port level of several of the in-house fisheries databases in combination with other demographic databases such...

  18. Database Administrator

    Science.gov (United States)

    Moore, Pam

    2010-01-01

    The Internet and electronic commerce (e-commerce) generate lots of data. Data must be stored, organized, and managed. Database administrators, or DBAs, work with database software to find ways to do this. They identify user needs, set up computer databases, and test systems. They ensure that systems perform as they should and add people to the…

  19. The Porcelain Crab Transcriptome and PCAD, the Porcelain Crab Microarray and Sequence Database

    Energy Technology Data Exchange (ETDEWEB)

    Tagmount, Abderrahmane; Wang, Mei; Lindquist, Erika; Tanaka, Yoshihiro; Teranishi, Kristen S.; Sunagawa, Shinichi; Wong, Mike; Stillman, Jonathon H.

    2010-01-27

    Background: With the emergence of a completed genome sequence of the freshwater crustacean Daphnia pulex, construction of genomic-scale sequence databases for additional crustacean sequences are important for comparative genomics and annotation. Porcelain crabs, genus Petrolisthes, have been powerful crustacean models for environmental and evolutionary physiology with respect to thermal adaptation and understanding responses of marine organisms to climate change. Here, we present a large-scale EST sequencing and cDNA microarray database project for the porcelain crab Petrolisthes cinctipes. Methodology/Principal Findings: A set of ~;;30K unique sequences (UniSeqs) representing ~;;19K clusters were generated from ~;;98K high quality ESTs from a set of tissue specific non-normalized and mixed-tissue normalized cDNA libraries from the porcelain crab Petrolisthes cinctipes. Homology for each UniSeq was assessed using BLAST, InterProScan, GO and KEGG database searches. Approximately 66percent of the UniSeqs had homology in at least one of the databases. All EST and UniSeq sequences along with annotation results and coordinated cDNA microarray datasets have been made publicly accessible at the Porcelain Crab Array Database (PCAD), a feature-enriched version of the Stanford and Longhorn Array Databases.Conclusions/Significance: The EST project presented here represents the third largest sequencing effort for any crustacean, and the largest effort for any crab species. Our assembly and clustering results suggest that our porcelain crab EST data set is equally diverse to the much larger EST set generated in the Daphnia pulex genome sequencing project, and thus will be an important resource to the Daphnia research community. Our homology results support the pancrustacea hypothesis and suggest that Malacostraca may be ancestral to Branchiopoda and Hexapoda. Our results also suggest that our cDNA microarrays cover as much of the transcriptome as can reasonably be captured in

  20. Gas Chromatography and Mass Spectrometry Measurements and Protocols for Database and Library Development Relating to Organic Species in Support of the Mars Science Laboratory

    Science.gov (United States)

    Misra, P.; Garcia, R.; Mahaffy, P. R.

    2010-04-01

    An organic contaminant database and library has been developed for use with the Sample Analysis at Mars (SAM) instrumentation utilizing laboratory-based Gas Chromatography-Mass Spectrometry measurements of pyrolyzed and baked material samples.

  1. Judy Estes Hall (1940-2015).

    Science.gov (United States)

    Sammons, Morgan T; Boucher, Andrew

    2016-01-01

    Presents an obituary for Judy Estes Hall, who passed away on November 24, 2015. Hall served as the Executive Officer of the National Register of Health Service Psychologists until her retirement in 2013. She is a recognized expert in the development of education and training standards for the profession of psychology, she also made significant contributions in the field of international psychology, where she was a renowned expert in cross-national credentialing and an advocate for commonality in licensing standards. She was the coauthor of one edited volume and author of more than 60 journal articles, book chapters, and professional publications. A passionate advocate for the advancement of women in psychology, a devoted mother and grandmother, a connoisseur of wine and international traveler extraordinaire, she touched the personal and professional lives of many. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  2. Novel expressed sequence tag- simple sequence repeats (EST ...

    African Journals Online (AJOL)

    Using different bioinformatic criteria, the SUCEST database was used to mine for simple sequence repeat (SSR) markers. Among 42,189 clusters, 1,425 expressed sequence tag- simple sequence repeats (EST-SSRs) were identified in silico. Trinucleotide repeats were the most abundant SSRs detected. Of 212 primer pairs ...

  3. AcEST: BP912124 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000015_D06 531 Adiantum capillus-veneris mRNA. clone: YMU001_000015_D06. BP91212...4 CL2988Contig1 Show BP912124 Clone id YMU001_000015_D06 Library YMU01 Length 531 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000015_D06. Accession BP912124 Tissue type prothallium Developmental stag...n database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912124|Adiantum capillus-veneris mRNA,... clone: YMU001_000015_D06. (531 letters) Database: uniprot_sprot.fasta 412,525 sequences; 148,809,765 total

  4. AcEST: BP912125 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000015_D07 558 Adiantum capillus-veneris mRNA. clone: YMU001_000015_D07. BP912125 - Show BP91212...is mRNA. clone: YMU001_000015_D07. Accession BP912125 Tissue type prothallium Developmental stage - Contig I...ein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912125|Adiantum capillus-veneris mRN...A, clone: YMU001_000015_D07. (558 letters) Database: uniprot_sprot.fasta 412,525 sequences; 148,809,765 tota...copeptide repeat-containing protein At1g08070 OS=Arabidopsis thaliana GN=PCMP-H12

  5. AcEST: BP912120 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000015_D01 500 Adiantum capillus-veneris mRNA. clone: YMU001_000015_D01. BP912120 - Show BP91212...is mRNA. clone: YMU001_000015_D01. Accession BP912120 Tissue type prothallium Developmental stage - Contig I...elated Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum Align length 130 Score (bit) 124.0 E-va...: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP91212...0|Adiantum capillus-veneris mRNA, clone: YMU001_000015_D01. (500 letters) Database: uniprot_sprot.fasta 412

  6. AcEST: BP921212 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000147_A09 361 Adiantum capillus-veneris mRNA. clone: YMU001_000147_A09. BP921212 - Show BP921212...is mRNA. clone: YMU001_000147_A09. Accession BP921212 Tissue type prothallium Developmental stage - Contig I...protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP921212|Adiantum capillus-veneris... mRNA, clone: YMU001_000147_A09. (361 letters) Database: uniprot_sprot.fasta 412,...itol-4,5-bisphosphate 3-ki... 30 2.9 sp|O14338|YB33_SCHPO Uncharacterized serine-rich protein C2F12.0... 29

  7. AcEST: DK961212 [AcEST

    Lifescience Database Archive (English)

    Full Text Available TST39A01NGRL0009_I01 591 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0009_I0...1. 5' end sequence. DK961212 - Show DK961212 Clone id TST39A01NGRL0009_I01 Library TST39 Length 591 Definiti...on Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0009_I01. 5' end sequence. Accession DK961212 Tissue t...eration of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK961212|Adiantum capil...lus-veneris mRNA, clone: TST39A01NGRL0009_I01, 5' (591 letters) Database: uniprot_sprot.fasta 412

  8. AcEST: BP912812 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000023_B07 575 Adiantum capillus-veneris mRNA. clone: YMU001_000023_B07. BP912812... CL2610Contig1 Show BP912812 Clone id YMU001_000023_B07 Library YMU01 Length 575 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000023_B07. Accession BP912812 Tissue type prothallium Developmental stag... new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912812|Adiant...um capillus-veneris mRNA, clone: YMU001_000023_B07. (575 letters) Database: uniprot_sprot.fasta 412,525 sequ

  9. AcEST: BP912127 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000015_D09 582 Adiantum capillus-veneris mRNA. clone: YMU001_000015_D09. BP912127 - Show BP91212...is mRNA. clone: YMU001_000015_D09. Accession BP912127 Tissue type prothallium Developmental stage - Contig I...n database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912127|Adiantum capillus-veneris mRNA,... clone: YMU001_000015_D09. (582 letters) Database: uniprot_sprot.fasta 412,525 sequences; 148,809,765 total ...p|P42825|DNAJ2_ARATH Chaperone protein dnaJ 2 OS=Arabidopsis th... 79 2e-14 sp|Q09912|PSI1_SCHPO Protein psi

  10. AcEST: BP912112 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000015_C08 546 Adiantum capillus-veneris mRNA. clone: YMU001_000015_C08. BP912112 - Show BP912112...is mRNA. clone: YMU001_000015_C08. Accession BP912112 Tissue type prothallium Developmental stage - Contig I...Arabidopsis thaliana Align length 171 Score (bit) 121.0 E-value 3.0e-27 Report BLASTX 2.2.19 [Nov-02-2008] R... protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912112|Adiantum capillus-veneri...s mRNA, clone: YMU001_000015_C08. (546 letters) Database: uniprot_sprot.fasta 412,525 sequences; 148,809,765

  11. AcEST: BP912412 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000018_G03 551 Adiantum capillus-veneris mRNA. clone: YMU001_000018_G03. BP912412... CL4248Contig1 Show BP912412 Clone id YMU001_000018_G03 Library YMU01 Length 551 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000018_G03. Accession BP912412 Tissue type prothallium Developmental stag...tein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912412|Adiantum capillus-veneris mR...NA, clone: YMU001_000018_G03. (551 letters) Database: uniprot_sprot.fasta 412,525 sequences; 148,809,765 tot

  12. AcEST: DK951212 [AcEST

    Lifescience Database Archive (English)

    Full Text Available TST38A01NGRL0010_M18 620 Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0010_M1...8. 5' end sequence. DK951212 CL3520Contig1 Show DK951212 Clone id TST38A01NGRL0010_M18 Library TST38 Length ...620 Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0010_M18. 5' end sequence. Accession DK951212...rotein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK951212|Adiantum capillus-veneris ...mRNA, clone: TST38A01NGRL0010_M18, 5' (620 letters) Database: uniprot_sprot.fasta 412

  13. AcEST: DK954061 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 1. 5' end sequence. DK954061 - Show DK954061 Clone id TST39A01NGRL0019_H01 Library TST39 Length 594 Definiti...on Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0019_H01. 5' end sequence. Accession DK954061 Tissue t...w generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK954061|Adiantum ...T and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK95406...TST39A01NGRL0019_H01 594 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0019_H0

  14. AcEST: DK954069 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 9. 5' end sequence. DK954069 CL33Contig1 Show DK954069 Clone id TST39A01NGRL0019_H09 Library TST39 Length 58...8 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0019_H09. 5' end sequence. Accession DK95406...f protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK954069|Adiantum capillus-vener...T: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK95406...TST39A01NGRL0019_H09 588 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0019_H0

  15. AcEST: DK956406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 3. 5' end sequence. DK956406 CL111Contig1 Show DK956406 Clone id TST39A01NGRL0025_K13 Library TST39 Length 5...91 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0025_K13. 5' end sequence. Accession DK956406...LAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK956406...tr... 231 2e-60 sp|P53445|ALF1_LAMJA Fructose-bisphosphate aldolase, muscle type... 229 6e-60 sp|Q40677|ALFC...ew generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK956406

  16. AcEST: DK946406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 8. 5' end sequence. DK946406 - Show DK946406 Clone id YMU02A01NGRL0012_I08 Library YMU02 Length 540 Definiti...on Adiantum capillus-veneris mRNA. clone: YMU02A01NGRL0012_I08. 5' end sequence. Accession DK946406 Tissue t... protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK946406|Adiantum capillus-veneri...chizosacch... 121 2e-27 sp|O14062|RS12A_SCHPO 40S ribosomal protein S12-A OS=Schizosacch... 121 2e-27 sp|Q54... new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK946406

  17. AcEST: DK944064 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 3. 5' end sequence. DK944064 CL53Contig1 Show DK944064 Clone id YMU02A01NGRL0004_N23 Library YMU02 Length 49...9 Definition Adiantum capillus-veneris mRNA. clone: YMU02A01NGRL0004_N23. 5' end sequence. Accession DK94406...ion of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK944064|Adiantum capillus-...ration of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK944064|Adiantum capill...YMU02A01NGRL0004_N23 499 Adiantum capillus-veneris mRNA. clone: YMU02A01NGRL0004_N2

  18. AcEST: DK943636 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU02A01NGRL0003_H12 321 Adiantum capillus-veneris mRNA. clone: YMU02A01NGRL0003_H1...2. 5' end sequence. DK943636 CL16Contig1 Show DK943636 Clone id YMU02A01NGRL0003_H12 Library YMU02 Length 32...1 Definition Adiantum capillus-veneris mRNA. clone: YMU02A01NGRL0003_H12. 5' end sequence. Accession DK943636...f protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK943636|Adiantum capillus-vener...T and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK943636

  19. AcEST: DK953636 [AcEST

    Lifescience Database Archive (English)

    Full Text Available TST39A01NGRL0018_E20 327 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0018_E2...0. 5' end sequence. DK953636 - Show DK953636 Clone id TST39A01NGRL0018_E20 Library TST39 Length 327 Definiti...on Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0018_E20. 5' end sequence. Accession DK953636 Tissue t...d BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK953636... generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK953636|Adiantum c

  20. AcEST: BP920994 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000144_C10 322 Adiantum capillus-veneris mRNA. clone: YMU001_000144_C10. BP92099...4 CL2871Contig1 Show BP920994 Clone id YMU001_000144_C10 Library YMU01 Length 322 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000144_C10. Accession BP920994 Tissue type prothallium Developmental stag...), Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP92099...ped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP92099

  1. AcEST: BP920990 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000144_C03 445 Adiantum capillus-veneris mRNA. clone: YMU001_000144_C03. BP92099...0 CL4123Contig1 Show BP920990 Clone id YMU001_000144_C03 Library YMU01 Length 445 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000144_C03. Accession BP920990 Tissue type prothallium Developmental stag...97), Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP9209...LAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP92099

  2. AcEST: DK962099 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 6. 5' end sequence. DK962099 CL3534Contig1 Show DK962099 Clone id TST39A01NGRL0011_P06 Library TST39 Length ...641 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0011_P06. 5' end sequence. Accession DK962099... a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK962099...97), Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25...TST39A01NGRL0011_P06 641 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0011_P0

  3. AcEST: BP920999 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000144_D04 588 Adiantum capillus-veneris mRNA. clone: YMU001_000144_D04. BP92099...9 CL317Contig1 Show BP920999 Clone id YMU001_000144_D04 Library YMU01 Length 588 Definition Adiantum cap...illus-veneris mRNA. clone: YMU001_000144_D04. Accession BP920999 Tissue type prothallium Developmental stage...nd PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP92099... and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP92099

  4. AcEST: DK948012 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 1. 5' end sequence. DK948012 CL61Contig1 Show DK948012 Clone id TST38A01NGRL0002_C21 Library TST38 Length 63...2 Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0002_C21. 5' end sequence. Accession DK94801...eneration of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK948012|Adiantum cap...ST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK948012|A...TST38A01NGRL0002_C21 632 Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0002_C2

  5. AcEST: DK958019 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 2. 5' end sequence. DK958019 - Show DK958019 Clone id TST39A01NGRL0029_O12 Library TST39 Length 641 Definiti...on Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O12. 5' end sequence. Accession DK958019 Tissue t...AST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK95801...a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK958019|Adian...TST39A01NGRL0029_O12 641 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O1

  6. AcEST: DK958016 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 9. 5' end sequence. DK958016 - Show DK958016 Clone id TST39A01NGRL0029_O09 Library TST39 Length 624 Definiti...on Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O09. 5' end sequence. Accession DK958016 Tissue t...a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK958016|Adian... a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK958016|Adia...TST39A01NGRL0029_O09 624 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O0

  7. AcEST: DK958013 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 6. 5' end sequence. DK958013 - Show DK958013 Clone id TST39A01NGRL0029_O06 Library TST39 Length 670 Definiti...on Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O06. 5' end sequence. Accession DK958013 Tissue t...7), Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK95801...n database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK958013|Adia...TST39A01NGRL0029_O06 670 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O0

  8. AcEST: BP918011 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000108_E11 519 Adiantum capillus-veneris mRNA. clone: YMU001_000108_E11. BP918011 - Show BP91801...is mRNA. clone: YMU001_000108_E11. Accession BP918011 Tissue type prothallium Developmental stage - Contig I...database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP918011|Adiant...se search programs, Nucleic Acids Res. 25:3389-3402. Query= BP918011|Adiantum cap

  9. School version of ESTE EU

    International Nuclear Information System (INIS)

    Carny, P.; Suchon, D.; Chyly, M.; Smejkalova, E.; Fabova, V.

    2008-01-01

    ESTE EU is information system and software for radiological impacts assessment to the territory of the country in case of radiation accident inside/outside of the country .The program enables to model dispersion of radioactive clouds in small-scale and meso-scale. The system enables the user to estimate prediction of the source term (release to the atmosphere ) for any point of radiation/nuclear accident in Europe (for any point of the release, but especially for the sites of European power reactors ). The system enables to utilize results of real radiological monitoring in the process of source term estimation. Radiological impacts of release to the atmosphere are modelled and calculated across the Europe and displayed in the geographical information system (GIS). The school version of ESTE EU is intended for students of the universities which are interested in or could work in the field of emergency response, radiological and nuclear accidents, dispersion modelling, radiological impacts calculation and urgent or preventive protective measures implementation. The school version of ESTE EU is planned to be donated to specialized departments of faculties in Slovakia, Czech Republic, etc. System can be fully operated in Slovak, Czech or English language. (authors)

  10. School version of ESTE EU

    International Nuclear Information System (INIS)

    Carny, P.; Suchon, D.; Chyly, M.; Smejkalova, E.; Fabova, V.

    2009-01-01

    ESTE EU is information system and software for radiological impacts assessment to the territory of the country in case of radiation accident inside/outside of the country .The program enables to model dispersion of radioactive clouds in small-scale and meso-scale. The system enables the user to estimate prediction of the source term (release to the atmosphere ) for any point of radiation/nuclear accident in Europe (for any point of the release, but especially for the sites of European power reactors ). The system enables to utilize results of real radiological monitoring in the process of source term estimation. Radiological impacts of release to the atmosphere are modelled and calculated across the Europe and displayed in the geographical information system (GIS). The school version of ESTE EU is intended for students of the universities which are interested in or could work in the field of emergency response, radiological and nuclear accidents, dispersion modelling, radiological impacts calculation and urgent or preventive protective measures implementation. The school version of ESTE EU is planned to be donated to specialized departments of faculties in Slovakia, Czech Republic, etc. System can be fully operated in Slovak, Czech or English language. (authors)

  11. Generation and analysis of ESTs from the eastern oyster, Crassostrea virginica Gmelin and identification of microsatellite and SNP markers

    Directory of Open Access Journals (Sweden)

    Wallace Richard

    2007-06-01

    Full Text Available Abstract Background The eastern oyster, Crassostrea virginica (Gmelin 1791, is an economically important species cultured in many areas in North America. It is also ecologically important because of the impact of its filter feeding behaviour on water quality. Populations of C. virginica have been threatened by overfishing, habitat degradation, and diseases. Through genome research, strategies are being developed to reverse its population decline. However, large-scale expressed sequence tag (EST resources have been lacking for this species. Efficient generation of EST resources from this species has been hindered by a high redundancy of transcripts. The objectives of this study were to construct a normalized cDNA library for efficient EST analysis, to generate thousands of ESTs, and to analyze the ESTs for microsatellites and potential single nucleotide polymorphisms (SNPs. Results A normalized and subtracted C. virginica cDNA library was constructed from pooled RNA isolated from hemocytes, mantle, gill, gonad and digestive tract, muscle, and a whole juvenile oyster. A total of 6,528 clones were sequenced from this library generating 5,542 high-quality EST sequences. Cluster analysis indicated the presence of 635 contigs and 4,053 singletons, generating a total of 4,688 unique sequences. About 46% (2,174 of the unique ESTs had significant hits (E-value ≤ 1e-05 to the non-redundant protein database; 1,104 of which were annotated using Gene Ontology (GO terms. A total of 35 microsatellites were identified from the ESTs, with 18 having sufficient flanking sequences for primer design. A total of 6,533 putative SNPs were also identified using all existing and the newly generated EST resources of the eastern oysters. Conclusion A high quality normalized cDNA library was constructed. A total of 5,542 ESTs were generated representing 4,688 unique sequences. Putative microsatellite and SNP markers were identified. These genome resources provide the

  12. AcEST: BP912612 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000020_H07 512 Adiantum capillus-veneris mRNA. clone: YMU001_000020_H07. BP912612 - Show BP912612... Clone id YMU001_000020_H07 Library YMU01 Length 512 Definition Adiantum capillus-vener...is mRNA. clone: YMU001_000020_H07. Accession BP912612 Tissue type prothallium Developmental stage - Contig I...se search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912612|Adiantum cap...illus-veneris mRNA, clone: YMU001_000020_H07. (512 letters) Database: uniprot_sprot.fasta 412,525 sequences;

  13. AcEST: BP912712 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000022_A07 476 Adiantum capillus-veneris mRNA. clone: YMU001_000022_A07. BP912712 - Show BP912712...is mRNA. clone: YMU001_000022_A07. Accession BP912712 Tissue type prothallium Developmental stage - Contig I...cleic Acids Res. 25:3389-3402. Query= BP912712|Adiantum capillus-veneris mRNA, cl...one: YMU001_000022_A07. (476 letters) Database: uniprot_sprot.fasta 412,525 sequences; 148,809,765 total let...8%), Positives = 39/69 (56%), Gaps = 4/69 (5%) Frame = +3 Query: 123 TSRRKSNHDQY--LPNYKVGTVHLLLGVKDQHLVSKIDI

  14. AcEST: BP912126 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000015_D08 484 Adiantum capillus-veneris mRNA. clone: YMU001_000015_D08. BP912126 CL412...4Contig1 Show BP912126 Clone id YMU001_000015_D08 Library YMU01 Length 484 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000015_D08. Accession BP912126 Tissue type prothallium Developmental stage - Contig ID CL412...-BLAST: a new generation of protein database search programs, Nucleic Acids Res. ...25:3389-3402. Query= BP912126|Adiantum capillus-veneris mRNA, clone: YMU001_000015_D08. (484 letters) Databa

  15. AcEST: BP912012 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000012_A06 542 Adiantum capillus-veneris mRNA. clone: YMU001_000012_A06. BP912012... CL2421Contig1 Show BP912012 Clone id YMU001_000012_A06 Library YMU01 Length 542 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000012_A06. Accession BP912012 Tissue type prothallium Developmental stag...rams, Nucleic Acids Res. 25:3389-3402. Query= BP912012|Adiantum capillus-veneris ...mRNA, clone: YMU001_000012_A06. (542 letters) Database: uniprot_sprot.fasta 412,525 sequences; 148,809,765 t

  16. AcEST: BP919406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000124_G04 562 Adiantum capillus-veneris mRNA. clone: YMU001_000124_G04. BP919406 - Show BP919406...is mRNA. clone: YMU001_000124_G04. Accession BP919406 Tissue type prothallium Developmental stage - Contig I...ion of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP919406|Adiantum capillus-...ucleic Acids Res. 25:3389-3402. Query= BP919406|Adiantum capillus-veneris mRNA, c

  17. AcEST: BP914068 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000039_E04 420 Adiantum capillus-veneris mRNA. clone: YMU001_000039_E04. BP914068 - Show BP91406...is mRNA. clone: YMU001_000039_E04. Accession BP914068 Tissue type prothallium Developmental stage - Contig I...PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP91406...se search programs, Nucleic Acids Res. 25:3389-3402. Query= BP914068|Adiantum capillus-veneris mRNA, clone:

  18. AcEST: DK959406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 7. 5' end sequence. DK959406 CL3039Contig1 Show DK959406 Clone id TST39A01NGRL0004_J17 Library TST39 Length ...621 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0004_J17. 5' end sequence. Accession DK959406...ed BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK959406...rch programs, Nucleic Acids Res. 25:3389-3402. Query= DK959406|Adiantum capillus-veneris mRNA, clone: TST39A

  19. AcEST: BP913406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000029_H06 570 Adiantum capillus-veneris mRNA. clone: YMU001_000029_H06. BP913406 - Show BP913406...is mRNA. clone: YMU001_000029_H06. Accession BP913406 Tissue type prothallium Developmental stage - Contig I...arch programs, Nucleic Acids Res. 25:3389-3402. Query= BP913406|Adiantum capillus-veneris mRNA, clone: YMU00...eration of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP913406|Adiantum capil...LDVTRGLVNGARGVVVAFES--GKHG---------------LPH 406 Query: 387 VRFACNRAEIVIGPDRQTVESGGMQVARRIQVPLILAWALSVHKCQGM

  20. AcEST: DK944406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 7. 5' end sequence. DK944406 CL1Contig2 Show DK944406 Clone id YMU02A01NGRL0005_P07 Library YMU02 Length 175... Definition Adiantum capillus-veneris mRNA. clone: YMU02A01NGRL0005_P07. 5' end sequence. Accession DK944406...rams, Nucleic Acids Res. 25:3389-3402. Query= DK944406|Adiantum capillus-veneris mRNA, clone: YMU02A01NGRL00...AST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK944406|

  1. AcEST: DK948406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 8. 5' end sequence. DK948406 CL33Contig1 Show DK948406 Clone id TST38A01NGRL0003_D18 Library TST38 Length 58...1 Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0003_D18. 5' end sequence. Accession DK948406...ic Acids Res. 25:3389-3402. Query= DK948406|Adiantum capillus-veneris mRNA, clone: TST38A01NGRL0003_D18, 5' ... a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK948406|Adia

  2. AcEST: DK952406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 1. 5' end sequence. DK952406 CL1615Contig1 Show DK952406 Clone id TST38A01NGRL0014_A01 Library TST38 Length ...618 Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0014_A01. 5' end sequence. Accession DK952406...ein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK952406|Adiantum capillus-veneris mRN... Acids Res. 25:3389-3402. Query= DK952406|Adiantum capillus-veneris mRNA, clone: TST38A01NGRL0014_A01, 5' (6

  3. AcEST: BP917406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000100_D10 492 Adiantum capillus-veneris mRNA. clone: YMU001_000100_D10. BP917406... CL2033Contig1 Show BP917406 Clone id YMU001_000100_D10 Library YMU01 Length 492 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000100_D10. Accession BP917406 Tissue type prothallium Developmental stag...of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP917406...Nucleic Acids Res. 25:3389-3402. Query= BP917406|Adiantum capillus-veneris mRNA,

  4. AcEST: DK955406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 5. 5' end sequence. DK955406 CL689Contig1 Show DK955406 Clone id TST39A01NGRL0023_A05 Library TST39 Length 5...18 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0023_A05. 5' end sequence. Accession DK955406...Res. 25:3389-3402. Query= DK955406|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0023_A05, 5' (518 lett...-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK955406

  5. AcEST: DK953406 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 4. 5' end sequence. DK953406 CL390Contig1 Show DK953406 Clone id TST39A01NGRL0017_K24 Library TST39 Length 5...83 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0017_K24. 5' end sequence. Accession DK953406...), Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK953406...ids Res. 25:3389-3402. Query= DK953406|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0017_K24, 5' (583

  6. AcEST: BP919999 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000131_G09 554 Adiantum capillus-veneris mRNA. clone: YMU001_000131_G09. BP919999... CL2968Contig1 Show BP919999 Clone id YMU001_000131_G09 Library YMU01 Length 554 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000131_G09. Accession BP919999 Tissue type prothallium Developmental stag...b Miller, and David J. Lipman (1997), Gapped BLAST and PSI-BLAST: a new generatio...n of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP919999|Adiantum capillus-ve

  7. AcEST: BP920147 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000133_F01 365 Adiantum capillus-veneris mRNA. clone: YMU001_000133_F01. BP920147 - Show BP92014...is mRNA. clone: YMU001_000133_F01. Accession BP920147 Tissue type prothallium Developmental stage - Contig I...ch programs, Nucleic Acids Res. 25:3389-3402. Query= BP920147|Adiantum capillus-veneris mRNA, clone: YMU001_...97), Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP9201

  8. AcEST: BP920144 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000133_E09 265 Adiantum capillus-veneris mRNA. clone: YMU001_000133_E09. BP920144 - Show BP92014...is mRNA. clone: YMU001_000133_E09. Accession BP920144 Tissue type prothallium Developmental stage - Contig I...rch programs, Nucleic Acids Res. 25:3389-3402. Query= BP920144|Adiantum capillus-veneris mRNA, clone: YMU001...LAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP92014

  9. AcEST: BP920148 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000133_F02 429 Adiantum capillus-veneris mRNA. clone: YMU001_000133_F02. BP92014...8 CL3819Contig1 Show BP920148 Clone id YMU001_000133_F02 Library YMU01 Length 429 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000133_F02. Accession BP920148 Tissue type prothallium Developmental stag...rograms, Nucleic Acids Res. 25:3389-3402. Query= BP920148|Adiantum capillus-vener...ed BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP92014

  10. AcEST: BP920145 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000133_E11 274 Adiantum capillus-veneris mRNA. clone: YMU001_000133_E11. BP920145 - Show BP92014...is mRNA. clone: YMU001_000133_E11. Accession BP920145 Tissue type prothallium Developmental stage - Contig I..., Nucleic Acids Res. 25:3389-3402. Query= BP920145|Adiantum capillus-veneris mRNA, clone: YMU001_000133_E11.... database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP920145|Adian

  11. AcEST: BP920142 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000133_E05 486 Adiantum capillus-veneris mRNA. clone: YMU001_000133_E05. BP920142 - Show BP92014...is mRNA. clone: YMU001_000133_E05. Accession BP920142 Tissue type prothallium Developmental stage - Contig I...rotein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP920142|Adiantum capillus-veneris ...rograms, Nucleic Acids Res. 25:3389-3402. Query= BP920142|Adiantum capillus-veneris mRNA, clone: YMU001_0001

  12. AcEST: BP920998 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000144_D03 529 Adiantum capillus-veneris mRNA. clone: YMU001_000144_D03. BP92099...8 CL1935Contig1 Show BP920998 Clone id YMU001_000144_D03 Library YMU01 Length 529 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000144_D03. Accession BP920998 Tissue type prothallium Developmental stag...abase search programs, Nucleic Acids Res. 25:3389-3402. Query= BP920998|Adiantum capillus-veneris mRNA, clon... of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP920998|Adiantum capillus-ven

  13. AcEST: BP918801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000117_F03 542 Adiantum capillus-veneris mRNA. clone: YMU001_000117_F03. BP918801 - Show BP918801...is mRNA. clone: YMU001_000117_F03. Accession BP918801 Tissue type prothallium Developmental stage - Contig I...earch programs, Nucleic Acids Res. 25:3389-3402. Query= BP918801|Adiantum capillus-veneris mRNA, clone: YMU0...generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP918801|Adiantum ca

  14. AcEST: BP916801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000091_G06 127 Adiantum capillus-veneris mRNA. clone: YMU001_000091_G06. BP916801... CL2168Contig1 Show BP916801 Clone id YMU001_000091_G06 Library YMU01 Length 127 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000091_G06. Accession BP916801 Tissue type prothallium Developmental stag...ds Res. 25:3389-3402. Query= BP916801|Adiantum capillus-veneris mRNA, clone: YMU0...a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP916801

  15. AcEST: DK952801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 9. 5' end sequence. DK952801 - Show DK952801 Clone id TST38A01NGRL0015_B09 Library TST38 Length 646 Definiti...on Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0015_B09. 5' end sequence. Accession DK952801 Tissue t...n of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK952801|Adiantum capillus-ve...ase search programs, Nucleic Acids Res. 25:3389-3402. Query= DK952801|Adiantum capillus-veneris mRNA, clone:

  16. AcEST: DK961801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 3. 5' end sequence. DK961801 CL4094Contig1 Show DK961801 Clone id TST39A01NGRL0011_C13 Library TST39 Length ...664 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0011_C13. 5' end sequence. Accession DK961801...tein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK961801|A...ms, Nucleic Acids Res. 25:3389-3402. Query= DK961801|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0011

  17. AcEST: BP918015 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000108_F03 437 Adiantum capillus-veneris mRNA. clone: YMU001_000108_F03. BP918015 - Show BP91801...is mRNA. clone: YMU001_000108_F03. Accession BP918015 Tissue type prothallium Developmental stage - Contig I.... 25:3389-3402. Query= BP918015|Adiantum capillus-veneris mRNA, clone: YMU001_000108_F03. (437 letters) Data...eration of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP91801

  18. AcEST: DK943801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 3. 5' end sequence. DK943801 CL1Contig2 Show DK943801 Clone id YMU02A01NGRL0003_P23 Library YMU02 Length 178... Definition Adiantum capillus-veneris mRNA. clone: YMU02A01NGRL0003_P23. 5' end sequence. Accession DK943801...rograms, Nucleic Acids Res. 25:3389-3402. Query= DK943801|Adiantum capillus-veneris mRNA, clone: YMU02A01NGR...-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK943801

  19. AcEST: DK959801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 1. 5' end sequence. DK959801 CL1090Contig1 Show DK959801 Clone id TST39A01NGRL0005_K21 Library TST39 Length ...651 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0005_K21. 5' end sequence. Accession DK959801...ms, Nucleic Acids Res. 25:3389-3402. Query= DK959801|Adiantum capillus-veneris mR...ein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK959801|Adiantum capillus-veneris mRN

  20. AcEST: DK957801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 4. 5' end sequence. DK957801 CL2Contig2 Show DK957801 Clone id TST39A01NGRL0029_F04 Library TST39 Length 752... Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_F04. 5' end sequence. Accession DK957801...of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK957801|Adiantum capillus-vene... search programs, Nucleic Acids Res. 25:3389-3402. Query= DK957801|Adiantum capillus-veneris mRNA, clone: TS

  1. AcEST: BP918018 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000108_F06 436 Adiantum capillus-veneris mRNA. clone: YMU001_000108_F06. BP918018 - Show BP91801...is mRNA. clone: YMU001_000108_F06. Accession BP918018 Tissue type prothallium Developmental stage - Contig I...cids Res. 25:3389-3402. Query= BP918018|Adiantum capillus-veneris mRNA, clone: YM...and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP91801

  2. AcEST: DK954801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 5. 5' end sequence. DK954801 CL674Contig1 Show DK954801 Clone id TST39A01NGRL0021_G15 Library TST39 Length 6...16 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0021_G15. 5' end sequence. Accession DK954801...a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK954801|Adian...earch programs, Nucleic Acids Res. 25:3389-3402. Query= DK954801|Adiantum capillus-veneris mRNA, clone: TST3

  3. AcEST: DK947801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 6. 5' end sequence. DK947801 CL89Contig1 Show DK947801 Clone id YMU02A01NGRM0001_B06 Library YMU02 Length 26...9 Definition Adiantum capillus-veneris mRNA. clone: YMU02A01NGRM0001_B06. 5' end sequence. Accession DK947801...eic Acids Res. 25:3389-3402. Query= DK947801|Adiantum capillus-veneris mRNA, clone: YMU02A01NGRM0001_B06, 5'...LAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK947801

  4. AcEST: BP917801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000105_F04 280 Adiantum capillus-veneris mRNA. clone: YMU001_000105_F04. BP917801 - Show BP917801...is mRNA. clone: YMU001_000105_F04. Accession BP917801 Tissue type prothallium Developmental stage - Contig I...n of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP917801|Adiantum capillus-ve... Nucleic Acids Res. 25:3389-3402. Query= BP917801|Adiantum capillus-veneris mRNA, clone: YMU001_000105_F04.

  5. AcEST: BP918017 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000108_F05 267 Adiantum capillus-veneris mRNA. clone: YMU001_000108_F05. BP918017 - Show BP91801...is mRNA. clone: YMU001_000108_F05. Accession BP918017 Tissue type prothallium Developmental stage - Contig I...otein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP918017|Adiantum capillus-veneris m...cleic Acids Res. 25:3389-3402. Query= BP918017|Adiantum capillus-veneris mRNA, clone: YMU001_000108_F05. (26

  6. AcEST: DK950801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 8. 5' end sequence. DK950801 - Show DK950801 Clone id TST38A01NGRL0009_K18 Library TST38 Length 645 Definiti...on Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0009_K18. 5' end sequence. Accession DK950801 Tissue t...on of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK950801|Adiantum capillus-v...abase search programs, Nucleic Acids Res. 25:3389-3402. Query= DK950801|Adiantum

  7. AcEST: BP911801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000009_C12 487 Adiantum capillus-veneris mRNA. clone: YMU001_000009_C12. BP911801 - Show BP911801...is mRNA. clone: YMU001_000009_C12. Accession BP911801 Tissue type prothallium Developmental stage - Contig I...generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP911801.... 25:3389-3402. Query= BP911801|Adiantum capillus-veneris mRNA, clone: YMU001_000

  8. AcEST: BP915801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000077_B02 555 Adiantum capillus-veneris mRNA. clone: YMU001_000077_B02. BP915801 - Show BP915801...is mRNA. clone: YMU001_000077_B02. Accession BP915801 Tissue type prothallium Developmental stage - Contig I... Nucleic Acids Res. 25:3389-3402. Query= BP915801|Adiantum capillus-veneris mRNA, clone: YMU001_000077_B02. ...ST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP915801|A

  9. AcEST: DK958011 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 4. 5' end sequence. DK958011 - Show DK958011 Clone id TST39A01NGRL0029_O04 Library TST39 Length 670 Definiti...on Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O04. 5' end sequence. Accession DK958011 Tissue t... programs, Nucleic Acids Res. 25:3389-3402. Query= DK958011|Adiantum capillus-ven...ation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK958011|Adiantum capillu...TST39A01NGRL0029_O04 670 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O0

  10. AcEST: BP920801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000141_G10 454 Adiantum capillus-veneris mRNA. clone: YMU001_000141_G10. BP920801... CL819Contig1 Show BP920801 Clone id YMU001_000141_G10 Library YMU01 Length 454 Definition Adiantum cap...illus-veneris mRNA. clone: YMU001_000141_G10. Accession BP920801 Tissue type prothallium Developmental stage... Acids Res. 25:3389-3402. Query= BP920801|Adiantum capillus-veneris mRNA, clone: ...ion of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP920801|Adiantum capillus-

  11. AcEST: BP912801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000023_A07 527 Adiantum capillus-veneris mRNA. clone: YMU001_000023_A07. BP912801 - Show BP912801...is mRNA. clone: YMU001_000023_A07. Accession BP912801 Tissue type prothallium Developmental stage - Contig I...w generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912801...es. 25:3389-3402. Query= BP912801|Adiantum capillus-veneris mRNA, clone: YMU001_0

  12. AcEST: DK951801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 2. 5' end sequence. DK951801 - Show DK951801 Clone id TST38A01NGRL0012_F22 Library TST38 Length 582 Definiti...on Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0012_F22. 5' end sequence. Accession DK951801 Tissue t...25:3389-3402. Query= DK951801|Adiantum capillus-veneris mRNA, clone: TST38A01NGRL0012_F22, 5' (582 letters) ...: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK951801|Adi

  13. Database and Library Development of Organic Species using Gas Chromatography and Mass Spectral Measurements in Support of the Mars Science Laboratory

    Science.gov (United States)

    Garcia, Raul; Mahaffy, Paul; Misra, Prabhakar

    2010-02-01

    Our work involves the development of an organic contaminants database that will allow us to determine which compounds are found here on Earth and would be inadvertently detected in the Mars soil and gaseous samples as impurities. It will be used for the Sample Analysis at Mars (SAM) instrumentation analysis in the Mars Science Laboratory (MSL) rover scheduled for launch in 2011. In order to develop a comprehensive target database, we utilize the NIST Mass Spectral Library, Automated Mass Spectral Deconvolution and Identification System (AMDIS) and Ion Fingerprint Deconvolution (IFD) software to analyze the GC-MS data. We have analyzed data from commercial samples, such as paint and polymers, which have not been implemented into the rover and are now analyzing actual data from pyrolyzation on the rover. We have successfully developed an initial target compound database that will aid SAM in determining whether the components being analyzed come from Mars or are contaminants from either the rover itself or the Earth environment and are continuing to make improvements and adding data to the target contaminants database. )

  14. EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries.

    Science.gov (United States)

    Smith, Robin P; Buchser, William J; Lemmon, Marcus B; Pardinas, Jose R; Bixby, John L; Lemmon, Vance P

    2008-04-10

    Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. We have developed "EST Express", a suite of analytical tools that identify and annotate ESTs originating from specific mRNA populations. The software consists of a user-friendly GUI powered by PHP and MySQL that allows for online collaboration between researchers and continuity with UniGene, Entrez Gene and RefSeq. Two key features of the software include a novel, simplified Entrez Gene parser and tools to manage cDNA library sequencing projects. We have tested the software on a large data set (2,016 samples) produced by subtractive hybridization. EST Express is an open-source, cross-platform web server application that imports sequences from cDNA libraries, such as those generated through subtractive hybridization or yeast two-hybrid screens. It then provides several layers of annotation based on Entrez Gene and RefSeq to allow the user to highlight useful genes and manage cDNA library projects.

  15. EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries

    Directory of Open Access Journals (Sweden)

    Pardinas Jose R

    2008-04-01

    Full Text Available Abstract Background Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. Results We have developed "EST Express", a suite of analytical tools that identify and annotate ESTs originating from specific mRNA populations. The software consists of a user-friendly GUI powered by PHP and MySQL that allows for online collaboration between researchers and continuity with UniGene, Entrez Gene and RefSeq. Two key features of the software include a novel, simplified Entrez Gene parser and tools to manage cDNA library sequencing projects. We have tested the software on a large data set (2,016 samples produced by subtractive hybridization. Conclusion EST Express is an open-source, cross-platform web server application that imports sequences from cDNA libraries, such as those generated through subtractive hybridization or yeast two-hybrid screens. It then provides several layers of annotation based on Entrez Gene and RefSeq to allow the user to highlight useful genes and manage cDNA library projects.

  16. Trimming and clustering sugarcane ESTs

    Directory of Open Access Journals (Sweden)

    Guilherme P. Telles

    2001-12-01

    Full Text Available The original clustering procedure adopted in the Sugarcane Expressed Sequence Tag project (SUCEST had many problems, for instance too many clusters, the presence of ribosomal sequences, etc. We therefore redesigned the clustering procedure entirely, including a much more careful initial trimming of the reads. In this paper the new trimming and clustering strategies are described in detail and we give the new official figures for the project, 237,954 expressed sequence tags and 43,141 clusters.O método de clustering adotado no Projeto SUCEST (Sugarcane EST Project tinha vários problemas (muitos clusters, presença de seqüências de ribossomo etc. Nós assumimos a tarefa de reprojetar todo o processo de clustering, propondo uma "limpeza" inicial mais cuidadosa das seqüências. Neste artigo as estratégias de limpeza das seqüências e de clustering são descritas em detalhe, incluindo os números oficiais do projeto (237,954 ESTs e 43,141 clusters.

  17. Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

    Science.gov (United States)

    Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

    2004-02-01

    To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.

  18. Gene discovery using massively parallel pyrosequencing to develop ESTs for the flesh fly Sarcophaga crassipalpis

    Directory of Open Access Journals (Sweden)

    Hahn Daniel A

    2009-05-01

    Full Text Available Abstract Background Flesh flies in the genus Sarcophaga are important models for investigating endocrinology, diapause, cold hardiness, reproduction, and immunity. Despite the prominence of Sarcophaga flesh flies as models for insect physiology and biochemistry, and in forensic studies, little genomic or transcriptomic data are available for members of this genus. We used massively parallel pyrosequencing on the Roche 454-FLX platform to produce a substantial EST dataset for the flesh fly Sarcophaga crassipalpis. To maximize sequence diversity, we pooled RNA extracted from whole bodies of all life stages and normalized the cDNA pool after reverse transcription. Results We obtained 207,110 ESTs with an average read length of 241 bp. These reads assembled into 20,995 contigs and 31,056 singletons. Using BLAST searches of the NR and NT databases we were able to identify 11,757 unique gene elements (ES. crassipalpis unigenes among GO Biological Process functional groups with that of the Drosophila melanogaster transcriptome suggests that our ESTs are broadly representative of the flesh fly transcriptome. Insertion and deletion errors in 454 sequencing present a serious hurdle to comparative transcriptome analysis. Aided by a new approach to correcting for these errors, we performed a comparative analysis of genetic divergence across GO categories among S. crassipalpis, D. melanogaster, and Anopheles gambiae. The results suggest that non-synonymous substitutions occur at similar rates across categories, although genes related to response to stimuli may evolve slightly faster. In addition, we identified over 500 potential microsatellite loci and more than 12,000 SNPs among our ESTs. Conclusion Our data provides the first large-scale EST-project for flesh flies, a much-needed resource for exploring this model species. In addition, we identified a large number of potential microsatellite and SNP markers that could be used in population and systematic

  19. Federal databases

    International Nuclear Information System (INIS)

    Welch, M.J.; Welles, B.W.

    1988-01-01

    Accident statistics on all modes of transportation are available as risk assessment analytical tools through several federal agencies. This paper reports on the examination of the accident databases by personal contact with the federal staff responsible for administration of the database programs. This activity, sponsored by the Department of Energy through Sandia National Laboratories, is an overview of the national accident data on highway, rail, air, and marine shipping. For each mode, the definition or reporting requirements of an accident are determined and the method of entering the accident data into the database is established. Availability of the database to others, ease of access, costs, and who to contact were prime questions to each of the database program managers. Additionally, how the agency uses the accident data was of major interest

  20. AcEST: BP912122 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000015_D04 544 Adiantum capillus-veneris mRNA. clone: YMU001_000015_D04. BP91212...2 CL3363Contig1 Show BP912122 Clone id YMU001_000015_D04 Library YMU01 Length 544 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000015_D04. Accession BP912122 Tissue type prothallium Developmental stag...obium aromaticivorans (strain DSM 12444) Align length 58 Score (bit) 33.1 E-value 0.89 Report BLASTX 2.2.19 ...w generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912122|Adiantum

  1. AcEST: BP912512 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000019_D01 513 Adiantum capillus-veneris mRNA. clone: YMU001_000019_D01. BP912512... CL17Contig1 Show BP912512 Clone id YMU001_000019_D01 Library YMU01 Length 513 Definition Adiantum capi...llus-veneris mRNA. clone: YMU001_000019_D01. Accession BP912512 Tissue type prothallium Developmental stage ...earch programs, Nucleic Acids Res. 25:3389-3402. Query= BP912512|Adiantum capillus-veneris mRNA, clone: YMU0...01_000019_D01. (489 letters) Database: uniprot_sprot.fasta 412,525 sequences; 148,809,765 total letters Sear

  2. AcEST: BP912129 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000015_D11 268 Adiantum capillus-veneris mRNA. clone: YMU001_000015_D11. BP91212...9 CL691Contig1 Show BP912129 Clone id YMU001_000015_D11 Library YMU01 Length 268 Definition Adiantum cap...illus-veneris mRNA. clone: YMU001_000015_D11. Accession BP912129 Tissue type prothallium Developmental stage...tabase search programs, Nucleic Acids Res. 25:3389-3402. Query= BP912129|Adiantum capillus-veneris mRNA, clo...ne: YMU001_000015_D11. (268 letters) Database: uniprot_sprot.fasta 412,525 sequences; 148,809,765 total lett

  3. AcEST: DK954064 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 4. 5' end sequence. DK954064 CL947Contig1 Show DK954064 Clone id TST39A01NGRL0019_H04 Library TST39 Length 7...69 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0019_H04. 5' end sequence. Accession DK95406... of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK954064|Adiantum capillus-ven...cleic Acids Res. 25:3389-3402. Query= DK954064|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0019_H04, ...TST39A01NGRL0019_H04 769 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0019_H0

  4. AcEST: DK954060 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 4. 5' end sequence. DK954060 CL412Contig1 Show DK954060 Clone id TST39A01NGRL0019_G24 Library TST39 Length 6...26 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0019_G24. 5' end sequence. Accession DK95406...s Res. 25:3389-3402. Query= DK954060|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0019_G24, 5' (626 le...generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK954060|Adiantum ca...TST39A01NGRL0019_G24 626 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0019_G2

  5. AcEST: DK944062 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 0. 5' end sequence. DK944062 CL98Contig1 Show DK944062 Clone id YMU02A01NGRL0004_N20 Library YMU02 Length 42...5 Definition Adiantum capillus-veneris mRNA. clone: YMU02A01NGRL0004_N20. 5' end sequence. Accession DK94406... programs, Nucleic Acids Res. 25:3389-3402. Query= DK944062|Adiantum capillus-ven...ion of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK944062|Adiantum capillus-...YMU02A01NGRL0004_N20 425 Adiantum capillus-veneris mRNA. clone: YMU02A01NGRL0004_N2

  6. AcEST: BP914060 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000039_D08 539 Adiantum capillus-veneris mRNA. clone: YMU001_000039_D08. BP91406...0 CL1835Contig1 Show BP914060 Clone id YMU001_000039_D08 Library YMU01 Length 539 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000039_D08. Accession BP914060 Tissue type prothallium Developmental stag...rotein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP914060|Adiantum capillus-veneris ...Acids Res. 25:3389-3402. Query= BP914060|Adiantum capillus-veneris mRNA, clone: YMU001_000039_D08. (539 lett

  7. AcEST: DK954065 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 5. 5' end sequence. DK954065 CL168Contig1 Show DK954065 Clone id TST39A01NGRL0019_H05 Library TST39 Length 6...25 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0019_H05. 5' end sequence. Accession DK95406... PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK95406...ids Res. 25:3389-3402. Query= DK954065|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0019_H05, 5' (625 ...TST39A01NGRL0019_H05 625 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0019_H0

  8. AcEST: DK961000 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 3. 5' end sequence. DK961000 - Show DK961000 Clone id TST39A01NGRL0008_O23 Library TST39 Length 653 Definiti...on Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0008_O23. 5' end sequence. Accession DK961000 Tissue t... BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK961000...eic Acids Res. 25:3389-3402. Query= DK961000|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0008_O23, 5'...TST39A01NGRL0008_O23 653 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0008_O2

  9. AcEST: DK951000 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 4. 5' end sequence. DK951000 CL11Contig1 Show DK951000 Clone id TST38A01NGRL0010_D14 Library TST38 Length 63...5 Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0010_D14. 5' end sequence. Accession DK951000... of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK951000|Adiantum capillus-ven...25:3389-3402. Query= DK951000|Adiantum capillus-veneris mRNA, clone: TST38A01NGRL0010_D14, 5' (635 letters) ...TST38A01NGRL0010_D14 635 Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0010_D1

  10. AcEST: DK949999 [AcEST

    Lifescience Database Archive (English)

    Full Text Available TST38A01NGRL0007_H20 715 Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0007_H2...0. 5' end sequence. DK949999 CL70Contig1 Show DK949999 Clone id TST38A01NGRL0007_H20 Library TST38 Length 71...5 Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0007_H20. 5' end sequence. Accession DK949999...haffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), Gapped BLAST and PSI-BLAST: a n...ew generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK949999

  11. AcEST: DK943939 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU02A01NGRL0004_H07 354 Adiantum capillus-veneris mRNA. clone: YMU02A01NGRL0004_H0...7. 5' end sequence. DK943939 - Show DK943939 Clone id YMU02A01NGRL0004_H07 Library YMU02 Length 354 Definiti...on Adiantum capillus-veneris mRNA. clone: YMU02A01NGRL0004_H07. 5' end sequence. Accession DK943939 Tissue t... of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK943939|Adiantum capillus-ven... CYA LHPRAVNCRKK CGH+N+LRP KK++ Sbjct: 1 IIEPSLRQLAQKYNCDKMICRKCYARLHPRAVNCRKKKCGHTNNLRPKKKVK 52 TrEMBL (release 39

  12. AcEST: DK953939 [AcEST

    Lifescience Database Archive (English)

    Full Text Available TST39A01NGRL0019_B19 645 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0019_B1...9. 5' end sequence. DK953939 CL12Contig1 Show DK953939 Clone id TST39A01NGRL0019_B19 Library TST39 Length 64...5 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0019_B19. 5' end sequence. Accession DK953939...ase search programs, Nucleic Acids Res. 25:3389-3402. Query= DK953939|Adiantum ca...pillus-veneris mRNA, clone: TST39A01NGRL0019_B19, 5' (645 letters) Database: uniprot_sprot.fasta 412,525 seq

  13. AcEST: BP920146 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000133_E12 401 Adiantum capillus-veneris mRNA. clone: YMU001_000133_E12. BP92014...6 CL388Contig1 Show BP920146 Clone id YMU001_000133_E12 Library YMU01 Length 401 Definition Adiantum cap...illus-veneris mRNA. clone: YMU001_000133_E12. Accession BP920146 Tissue type prothallium Developmental stage...generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP920146|Adiantum ca...rams, Nucleic Acids Res. 25:3389-3402. Query= BP920146|Adiantum capillus-veneris mRNA, clone: YMU001_000133_

  14. AcEST: DK952014 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 6. 5' end sequence. DK952014 CL132Contig1 Show DK952014 Clone id TST38A01NGRL0012_P06 Library TST38 Length 6...73 Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0012_P06. 5' end sequence. Accession DK952014...Acids Res. 25:3389-3402. Query= DK952014|Adiantum capillus-veneris mRNA, clone: T...protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK952014|Adiantum capillus-veneris...TST38A01NGRL0012_P06 673 Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0012_P0

  15. AcEST: BP920149 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000133_F03 624 Adiantum capillus-veneris mRNA. clone: YMU001_000133_F03. BP92014...9 CL2860Contig1 Show BP920149 Clone id YMU001_000133_F03 Library YMU01 Length 624 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000133_F03. Accession BP920149 Tissue type prothallium Developmental stag...ic Acids Res. 25:3389-3402. Query= BP920149|Adiantum capillus-veneris mRNA, clone: YMU001_000133_F03. (624 l...n of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP92014

  16. AcEST: DK944200 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 8. 5' end sequence. DK944200 - Show DK944200 Clone id YMU02A01NGRL0005_E18 Library YMU02 Length 232 Definiti...on Adiantum capillus-veneris mRNA. clone: YMU02A01NGRL0005_E18. 5' end sequence. Accession DK944200 Tissue t...ams, Nucleic Acids Res. 25:3389-3402. Query= DK944200|Adiantum capillus-veneris mRNA, clone: YMU02A01NGRL000...database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK944200|Adiantum capillus-veneris mRNA, c...YMU02A01NGRL0005_E18 232 Adiantum capillus-veneris mRNA. clone: YMU02A01NGRL0005_E1

  17. AcEST: DK952099 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 2. 5' end sequence. DK952099 CL720Contig1 Show DK952099 Clone id TST38A01NGRL0013_C22 Library TST38 Length 5...79 Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0013_C22. 5' end sequence. Accession DK952099..., Nucleic Acids Res. 25:3389-3402. Query= DK952099|Adiantum capillus-veneris mRNA, clone: TST38A01NGRL0013_C...n database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK952099|Adiantum capillus-veneris mRNA,...TST38A01NGRL0013_C22 579 Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0013_C2

  18. AcEST: BP920993 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000144_C06 517 Adiantum capillus-veneris mRNA. clone: YMU001_000144_C06. BP92099...3 CL547Contig1 Show BP920993 Clone id YMU001_000144_C06 Library YMU01 Length 517 Definition Adiantum cap...illus-veneris mRNA. clone: YMU001_000144_C06. Accession BP920993 Tissue type prothallium Developmental stage...rograms, Nucleic Acids Res. 25:3389-3402. Query= BP920993|Adiantum capillus-veneris mRNA, clone: YMU001_0001...w generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP920993|Adiantum

  19. AcEST: BP919801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000129_C11 513 Adiantum capillus-veneris mRNA. clone: YMU001_000129_C11. BP919801... CL1Contig3 Show BP919801 Clone id YMU001_000129_C11 Library YMU01 Length 513 Definition Adiantum capil...lus-veneris mRNA. clone: YMU001_000129_C11. Accession BP919801 Tissue type prothallium Developmental stage -...es. 25:3389-3402. Query= BP919801|Adiantum capillus-veneris mRNA, clone: YMU001_000129_C11. (435 letters) Da...eration of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP919801|Adiantum capil

  20. AcEST: DK958012 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 5. 5' end sequence. DK958012 CL314Contig1 Show DK958012 Clone id TST39A01NGRL0029_O05 Library TST39 Length 6...84 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O05. 5' end sequence. Accession DK95801...s. 25:3389-3402. Query= DK958012|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0029_O05, 5' (684 letter..., Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK95801...TST39A01NGRL0029_O05 684 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O0

  1. AcEST: DK948013 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 2. 5' end sequence. DK948013 - Show DK948013 Clone id TST38A01NGRL0002_C22 Library TST38 Length 680 Definiti...on Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0002_C22. 5' end sequence. Accession DK948013 Tissue t...-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK94801...ms, Nucleic Acids Res. 25:3389-3402. Query= DK948013|Adiantum capillus-veneris mRNA, clone: TST38A01NGRL0002...TST38A01NGRL0002_C22 680 Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0002_C2

  2. AcEST: DK958014 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 7. 5' end sequence. DK958014 CL177Contig1 Show DK958014 Clone id TST39A01NGRL0029_O07 Library TST39 Length 6...74 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O07. 5' end sequence. Accession DK95801...ein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK958014|Ad...binding protein 1 OS=Drosophila melanogaster GN=CG8801 PE=2 SV=1 Length = 652 Sco...ds Res. 25:3389-3402. Query= DK958014|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0029_O07, 5' (674 l

  3. AcEST: DK948015 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 4. 5' end sequence. DK948015 CL1717Contig1 Show DK948015 Clone id TST38A01NGRL0002_C24 Library TST38 Length ...543 Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0002_C24. 5' end sequence. Accession DK94801...ids Res. 25:3389-3402. Query= DK948015|Adiantum capillus-veneris mRNA, clone: TST38A01NGRL0002_C24, 5' (486 ...TST38A01NGRL0002_C24 543 Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0002_C2...997), Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 2

  4. AcEST: BP918016 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000108_F04 434 Adiantum capillus-veneris mRNA. clone: YMU001_000108_F04. BP91801...6 CL3779Contig1 Show BP918016 Clone id YMU001_000108_F04 Library YMU01 Length 434 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000108_F04. Accession BP918016 Tissue type prothallium Developmental stag..., Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP91801...ic Acids Res. 25:3389-3402. Query= BP918016|Adiantum capillus-veneris mRNA, clone: YMU001_000108_F04. (434 l

  5. AcEST: DK948017 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 2. 5' end sequence. DK948017 CL1579Contig1 Show DK948017 Clone id TST38A01NGRL0002_D02 Library TST38 Length ...638 Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0002_D02. 5' end sequence. Accession DK94801...BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK94801...rch programs, Nucleic Acids Res. 25:3389-3402. Query= DK948017|Adiantum capillus-veneris mRNA, clone: TST38A...TST38A01NGRL0002_D02 638 Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0002_D0

  6. AcEST: DK958015 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 8. 5' end sequence. DK958015 CL1173Contig1 Show DK958015 Clone id TST39A01NGRL0029_O08 Library TST39 Length ...239 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O08. 5' end sequence. Accession DK95801... Nucleic Acids Res. 25:3389-3402. Query= DK958015|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0029_O0...d BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK95801...TST39A01NGRL0029_O08 239 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O0

  7. AcEST: DK958010 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 3. 5' end sequence. DK958010 CL175Contig1 Show DK958010 Clone id TST39A01NGRL0029_O03 Library TST39 Length 6...65 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O03. 5' end sequence. Accession DK95801...of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK95801...c Acids Res. 25:3389-3402. Query= DK958010|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0029_O03, 5' (...tive uncharacterized protein OS=Picea... 115 3e-24 tr|A8Y801|A8Y801_ZANAE Rieske iron-sulphur protein OS=Zan

  8. AcEST: DK948011 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 0. 5' end sequence. DK948011 CL3356Contig1 Show DK948011 Clone id TST38A01NGRL0002_C20 Library TST38 Length ...682 Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0002_C20. 5' end sequence. Accession DK94801...cleic Acids Res. 25:3389-3402. Query= DK948011|Adiantum capillus-veneris mRNA, cl...eration of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK948011|Adiantum capil...TST38A01NGRL0002_C20 682 Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0002_C2

  9. AcEST: BP914801 [AcEST

    Lifescience Database Archive (English)

    Full Text Available YMU001_000063_A07 396 Adiantum capillus-veneris mRNA. clone: YMU001_000063_A07. BP914801... CL1121Contig1 Show BP914801 Clone id YMU001_000063_A07 Library YMU01 Length 396 Definition Adiantum ca...pillus-veneris mRNA. clone: YMU001_000063_A07. Accession BP914801 Tissue type prothallium Developmental stag...ped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= BP914801...ase search programs, Nucleic Acids Res. 25:3389-3402. Query= BP914801|Adiantum capillus-veneris mRNA, clone:

  10. AcEST: DK958017 [AcEST

    Lifescience Database Archive (English)

    Full Text Available 0. 5' end sequence. DK958017 CL232Contig1 Show DK958017 Clone id TST39A01NGRL0029_O10 Library TST39 Length 6...34 Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O10. 5' end sequence. Accession DK95801...eic Acids Res. 25:3389-3402. Query= DK958017|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0029_O10, 5'...ew generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Query= DK958017|Adiantum...TST39A01NGRL0029_O10 634 Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0029_O1

  11. Agroforestry Species Switchboard

    DEFF Research Database (Denmark)

    Kindt, R.; John, I.; Ordonez, J.

    2016-01-01

    The current version of the Agroforestry Species Switchboard documents the presence of a total of 26,135 plant species (33,813 species including synonyms) across 19 web-based databases. When available, hyperlinks to information on the selected species in particular databases are provided. In total...

  12. Database Replication

    CERN Document Server

    Kemme, Bettina

    2010-01-01

    Database replication is widely used for fault-tolerance, scalability and performance. The failure of one database replica does not stop the system from working as available replicas can take over the tasks of the failed replica. Scalability can be achieved by distributing the load across all replicas, and adding new replicas should the load increase. Finally, database replication can provide fast local access, even if clients are geographically distributed clients, if data copies are located close to clients. Despite its advantages, replication is not a straightforward technique to apply, and

  13. Refactoring databases evolutionary database design

    CERN Document Server

    Ambler, Scott W

    2006-01-01

    Refactoring has proven its value in a wide range of development projects–helping software professionals improve system designs, maintainability, extensibility, and performance. Now, for the first time, leading agile methodologist Scott Ambler and renowned consultant Pramodkumar Sadalage introduce powerful refactoring techniques specifically designed for database systems. Ambler and Sadalage demonstrate how small changes to table structures, data, stored procedures, and triggers can significantly enhance virtually any database design–without changing semantics. You’ll learn how to evolve database schemas in step with source code–and become far more effective in projects relying on iterative, agile methodologies. This comprehensive guide and reference helps you overcome the practical obstacles to refactoring real-world databases by covering every fundamental concept underlying database refactoring. Using start-to-finish examples, the authors walk you through refactoring simple standalone databas...

  14. A comprehensive DNA barcode database for Central European beetles with a focus on Germany: adding more than 3500 identified species to BOLD.

    Science.gov (United States)

    Hendrich, Lars; Morinière, Jérôme; Haszprunar, Gerhard; Hebert, Paul D N; Hausmann, Axel; Köhler, Frank; Balke, Michael

    2015-07-01

    Beetles are the most diverse group of animals and are crucial for ecosystem functioning. In many countries, they are well established for environmental impact assessment, but even in the well-studied Central European fauna, species identification can be very difficult. A comprehensive and taxonomically well-curated DNA barcode library could remedy this deficit and could also link hundreds of years of traditional knowledge with next generation sequencing technology. However, such a beetle library is missing to date. This study provides the globally largest DNA barcode reference library for Coleoptera for 15 948 individuals belonging to 3514 well-identified species (53% of the German fauna) with representatives from 97 of 103 families (94%). This study is the first comprehensive regional test of the efficiency of DNA barcoding for beetles with a focus on Germany. Sequences ≥500 bp were recovered from 63% of the specimens analysed (15 948 of 25 294) with short sequences from another 997 specimens. Whereas most specimens (92.2%) could be unambiguously assigned to a single known species by sequence diversity at CO1, 1089 specimens (6.8%) were assigned to more than one Barcode Index Number (BIN), creating 395 BINs which need further study to ascertain if they represent cryptic species, mitochondrial introgression, or simply regional variation in widespread species. We found 409 specimens (2.6%) that shared a BIN assignment with another species, most involving a pair of closely allied species as 43 BINs were involved. Most of these taxa were separated by barcodes although sequence divergences were low. Only 155 specimens (0.97%) show identical or overlapping clusters. © 2014 John Wiley & Sons Ltd.

  15. Construction of 12 EST libraries and characterization of a 12,226 EST dataset for chicory (Cichorium intybus root, leaves and nodules in the context of carbohydrate metabolism investigation

    Directory of Open Access Journals (Sweden)

    Boutry Marc

    2009-01-01

    Full Text Available Abstract Background The industrial chicory, Cichorium intybus, is a member of the Asteraceae family that accumulates fructan of the inulin type in its root. Inulin is a low calories sweetener, a texture agent and a health promoting ingredient due to its prebiotic properties. Average inulin chain length is a critical parameter that is genotype and temperature dependent. In the context of the study of carbohydrate metabolism and to get insight into the transcriptome of chicory root and to visualize temporal changes of gene expression during the growing season, we obtained and characterized 10 cDNA libraries from chicory roots regularly sampled in field during a growing season. A leaf and a nodule libraries were also obtained for comparison. Results Approximately 1,000 Expressed Sequence Tags (EST were obtained from each of twelve cDNA libraries resulting in a 12,226 EST dataset. Clustering of these ESTs returned 1,922 contigs and 4,869 singlets for a total of 6,791 putative unigenes. All ESTs were compared to public sequence databases and functionally classified. Data were specifically searched for sequences related to carbohydrate metabolism. Season wide evolution of functional classes was evaluated by comparing libraries at the level of functional categories and unigenes distribution. Conclusion This chicory EST dataset provides a season wide outlook of the genes expressed in the root and to a minor extent in leaves and nodules. The dataset contains more than 200 sequences related to carbohydrate metabolism and 3,500 new ESTs when compared to other recently released chicory EST datasets, probably because of the season wide coverage of the root samples. We believe that these sequences will contribute to accelerate research and breeding of the industrial chicory as well as of closely related species.

  16. Development and Evaluation of a Novel Set of EST-SSR Markers Based on Transcriptome Sequences of Black Locust (Robinia pseudoacacia L.).

    Science.gov (United States)

    Guo, Qi; Wang, Jin-Xing; Su, Li-Zhuo; Lv, Wei; Sun, Yu-Han; Li, Yun

    2017-07-07

    Black locust ( Robinia pseudoacacia L. of the family Fabaceae) is an ecologically and economically important deciduous tree. However, few genomic resources are available for this forest species, and few effective expressed sequence tag-derived simple sequence repeat (EST-SSR) markers have been developed to date. In this study, paired-end sequencing was used to sequence transcriptomes of R. pseudoacacia by the Illumina HiSeq TM2000 platform, and EST-SSR loci were identified by de novo assembly. Furthermore, a total of 1697 primer pairs were successfully designed, from which 286 primers met the selection screening criteria; 94 pairs were randomly selected and tested for validation using polymerase chain reaction amplification. Forty-five primers were verified as polymorphic, with clear bands. The polymorphism information content values were 0.033-0.765, the number of alleles per locus ranged from 2 to 10, and the observed and expected heterozygosities were 0.000-0.931 and 0.035-0.810, respectively, indicating a high level of informativeness. Subsequently, 45 polymorphic EST-SSR loci were tested for amplification efficiency, using the verified primers, in an additional nine species of Leguminosae, 23 loci were amplified in more than three species, of which two loci were amplified successfully in all species. These EST-SSR markers provide a valuable tool for investigating the genetic diversity and population structure of R . pseudoacacia , constructing a DNA fingerprint database, performing quantitative trait locus mapping, and preserving genetic information.

  17. RDD Databases

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — This database was established to oversee documents issued in support of fishery research activities including experimental fishing permits (EFP), letters of...

  18. Snowstorm Database

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The Snowstorm Database is a collection of over 500 snowstorms dating back to 1900 and updated operationally. Only storms having large areas of heavy snowfall (10-20...

  19. Dealer Database

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The dealer reporting databases contain the primary data reported by federally permitted seafood dealers in the northeast. Electronic reporting was implemented May 1,...

  20. National database

    DEFF Research Database (Denmark)

    Kristensen, Helen Grundtvig; Stjernø, Henrik

    1995-01-01

    Artikel om national database for sygeplejeforskning oprettet på Dansk Institut for Sundheds- og Sygeplejeforskning. Det er målet med databasen at samle viden om forsknings- og udviklingsaktiviteter inden for sygeplejen.......Artikel om national database for sygeplejeforskning oprettet på Dansk Institut for Sundheds- og Sygeplejeforskning. Det er målet med databasen at samle viden om forsknings- og udviklingsaktiviteter inden for sygeplejen....

  1. TrED: the Trichophyton rubrum Expression Database

    Directory of Open Access Journals (Sweden)

    Liu Tao

    2007-07-01

    Full Text Available Abstract Background Trichophyton rubrum is the most common dermatophyte species and the most frequent cause of fungal skin infections in humans worldwide. It's a major concern because feet and nail infections caused by this organism is extremely difficult to cure. A large set of expression data including expressed sequence tags (ESTs and transcriptional profiles of this important fungal pathogen are now available. Careful analysis of these data can give valuable information about potential virulence factors, antigens and novel metabolic pathways. We intend to create an integrated database TrED to facilitate the study of dermatophytes, and enhance the development of effective diagnostic and treatment strategies. Description All publicly available ESTs and expression profiles of T. rubrum during conidial germination in time-course experiments and challenged with antifungal agents are deposited in the database. In addition, comparative genomics hybridization results of 22 dermatophytic fungi strains from three genera, Trichophyton, Microsporum and Epidermophyton, are also included. ESTs are clustered and assembled to elongate the sequence length and abate redundancy. TrED provides functional analysis based on GenBank, Pfam, and KOG databases, along with KEGG pathway and GO vocabulary. It is integrated with a suite of custom web-based tools that facilitate querying and retrieving various EST properties, visualization and comparison of transcriptional profiles, and sequence-similarity searching by BLAST. Conclusion TrED is built upon a relational database, with a web interface offering analytic functions, to provide integrated access to various expression data of T. rubrum and comparative results of dermatophytes. It is devoted to be a comprehensive resource and platform to assist functional genomic studies in dermatophytes. TrED is available from URL: http://www.mgc.ac.cn/TrED/.

  2. La parole est à vous

    CERN Multimedia

    Staff Association

    2015-01-01

    Nous avons identifié, ci-dessous, 3 grands chapitres pour lesquelles bon nombre d’entre vous se posent des questions et ont fréquemment sollicités leurs délégués, qui n’ont pas toujours pu donner de réponses : La retraite et sa préparation. L’affiliation aux régimes sécurité sociale de nos états hôtes et ses conséquences au moment de la retraite. La fiscalité en général. Le service juridique, que nous avons interrogé récemment, n’est pas toujours informé de vos interrogations et de ce fait, il n’a pas encore pu investiguer sur ces questions auprès des services compétents. Il nous a suggéré de vous interroger afin de collecter vos questions, de les synthétiser, afin de les transmettre...

  3. ConiferEST: an integrated bioinformatics system for data reprocessing and mining of conifer expressed sequence tags (ESTs

    Directory of Open Access Journals (Sweden)

    Carter Kikia

    2007-05-01

    Full Text Available Abstract Background With the advent of low-cost, high-throughput sequencing, the amount of public domain Expressed Sequence Tag (EST sequence data available for both model and non-model organism is growing exponentially. While these data are widely used for characterizing various genomes, they also present a serious challenge for data quality control and validation due to their inherent deficiencies, particularly for species without genome sequences. Description ConiferEST is an integrated system for data reprocessing, visualization and mining of conifer ESTs. In its current release, Build 1.0, it houses 172,229 loblolly pine EST sequence reads, which were obtained from reprocessing raw DNA sequencer traces using our software – WebTraceMiner. The trace files were downloaded from NCBI Trace Archive. ConiferEST provides biologists unique, easy-to-use data visualization and mining tools for a variety of putative sequence features including cloning vector segments, adapter sequences, restriction endonuclease recognition sites, polyA and polyT runs, and their corresponding Phred quality values. Based on these putative features, verified sequence features such as 3' and/or 5' termini of cDNA inserts in either sense or non-sense strand have been identified in-silico. Interestingly, only 30.03% of the designated 3' ESTs were found to have an authenticated 5' terminus in the non-sense strand (i.e., polyT tails, while fewer than 5.34% of the designated 5' ESTs had a verified 5' terminus in the sense strand. Such previously ignored features provide valuable insight for data quality control and validation of error-prone ESTs, as well as the ability to identify novel functional motifs embedded in large EST datasets. We found that "double-termini adapters" were effective indicators of potential EST chimeras. For all sequences with in-silico verified termini/terminus, we used InterProScan to assign protein domain signatures, results of which are available

  4. ConiferEST: an integrated bioinformatics system for data reprocessing and mining of conifer expressed sequence tags (ESTs).

    Science.gov (United States)

    Liang, Chun; Wang, Gang; Liu, Lin; Ji, Guoli; Fang, Lin; Liu, Yuansheng; Carter, Kikia; Webb, Jason S; Dean, Jeffrey F D

    2007-05-29

    With the advent of low-cost, high-throughput sequencing, the amount of public domain Expressed Sequence Tag (EST) sequence data available for both model and non-model organism is growing exponentially. While these data are widely used for characterizing various genomes, they also present a serious challenge for data quality control and validation due to their inherent deficiencies, particularly for species without genome sequences. ConiferEST is an integrated system for data reprocessing, visualization and mining of conifer ESTs. In its current release, Build 1.0, it houses 172,229 loblolly pine EST sequence reads, which were obtained from reprocessing raw DNA sequencer traces using our software--WebTraceMiner. The trace files were downloaded from NCBI Trace Archive. ConiferEST provides biologists unique, easy-to-use data visualization and mining tools for a variety of putative sequence features including cloning vector segments, adapter sequences, restriction endonuclease recognition sites, polyA and polyT runs, and their corresponding Phred quality values. Based on these putative features, verified sequence features such as 3' and/or 5' termini of cDNA inserts in either sense or non-sense strand have been identified in-silico. Interestingly, only 30.03% of the designated 3' ESTs were found to have an authenticated 5' terminus in the non-sense strand (i.e., polyT tails), while fewer than 5.34% of the designated 5' ESTs had a verified 5' terminus in the sense strand. Such previously ignored features provide valuable insight for data quality control and validation of error-prone ESTs, as well as the ability to identify novel functional motifs embedded in large EST datasets. We found that "double-termini adapters" were effective indicators of potential EST chimeras. For all sequences with in-silico verified termini/terminus, we used InterProScan to assign protein domain signatures, results of which are available for in-depth exploration using our biologist

  5. EST based phylogenomics of Syndermata questions monophyly of Eurotatoria

    Directory of Open Access Journals (Sweden)

    Bucher Gregor

    2008-12-01

    Full Text Available Abstract Background The metazoan taxon Syndermata comprising Rotifera (in the classical sense of Monogononta+Bdelloidea+Seisonidea and Acanthocephala has raised several hypotheses connected to the phylogeny of these animal groups and the included subtaxa. While the monophyletic origin of Syndermata and Acanthocephala is well established based on morphological and molecular data, the phylogenetic position of Syndermata within Spiralia, the monophyletic origin of Monogononta, Bdelloidea, and Seisonidea and the acanthocephalan sister group are still a matter of debate. The comparison of the alternative hypotheses suggests that testing the phylogenetic validity of Eurotatoria (Monogononta+Bdelloidea is the key to unravel the phylogenetic relations within Syndermata. The syndermatan phylogeny in turn is a prerequisite for reconstructing the evolution of the acanthocephalan endoparasitism. Results Here we present our results from a phylogenomic approach studying i the phylogenetic position of Syndermata within Spiralia, ii the monophyletic origin of monogononts and bdelloids and iii the phylogenetic relations of the latter two taxa to acanthocephalans. For this analysis we have generated EST libraries of Pomphorhynchus laevis, Echinorhynchus truttae (Acanthocephala and Brachionus plicatilis (Monogononta. By extending these data with database entries of B. plicatilis, Philodina roseola (Bdelloidea and 25 additional metazoan species, we conducted phylogenetic reconstructions based on 79 ribosomal proteins using maximum likelihood and bayesian approaches. Our findings suggest that the phylogenetic position of Syndermata within Spiralia is close to Platyhelminthes, that Eurotatoria are not monophyletic and that bdelloids are more closely related to acanthocephalans than monogononts. Conclusion Mapping morphological character evolution onto molecular phylogeny suggests the (partial or complete reduction of the corona and the emergence of a retractable

  6. Experiment Databases

    Science.gov (United States)

    Vanschoren, Joaquin; Blockeel, Hendrik

    Next to running machine learning algorithms based on inductive queries, much can be learned by immediately querying the combined results of many prior studies. Indeed, all around the globe, thousands of machine learning experiments are being executed on a daily basis, generating a constant stream of empirical information on machine learning techniques. While the information contained in these experiments might have many uses beyond their original intent, results are typically described very concisely in papers and discarded afterwards. If we properly store and organize these results in central databases, they can be immediately reused for further analysis, thus boosting future research. In this chapter, we propose the use of experiment databases: databases designed to collect all the necessary details of these experiments, and to intelligently organize them in online repositories to enable fast and thorough analysis of a myriad of collected results. They constitute an additional, queriable source of empirical meta-data based on principled descriptions of algorithm executions, without reimplementing the algorithms in an inductive database. As such, they engender a very dynamic, collaborative approach to experimentation, in which experiments can be freely shared, linked together, and immediately reused by researchers all over the world. They can be set up for personal use, to share results within a lab or to create open, community-wide repositories. Here, we provide a high-level overview of their design, and use an existing experiment database to answer various interesting research questions about machine learning algorithms and to verify a number of recent studies.

  7. National Benthic Infaunal Database (NBID)

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The NBID is a quantitative database on abundances of individual benthic species by sample and study region, along with other synoptically measured environmental...

  8. Mining and comparative survey of EST-SSR markers among members of Euphorbiaceae family.

    Science.gov (United States)

    Sen, Surojit; Dehury, Budheswar; Sahu, Jagajjit; Rathi, Sunayana; Yadav, Raj Narain Singh

    2018-04-06

    Euphorbiaceae represents flowering plants family of tropical and sub-tropical region rich in secondary metabolites of economic importance. To understand and assess the genetic makeup among the members, this study was undertaken to characterize and compare SSR markers from publicly available ESTs and GSSs of nine selected species of the family. Mining of SSRs was performed by MISA, primer designing by Primer3, while functional annotation, gene ontology (GO) and enrichment analysis were performed by Blast2GO. A total 12,878 number of SSRs were detected from 101,701 number of EST sequences. SSR density ranged from 1 SSR/3.22 kb to 1 SSR/15.65 kb. A total of 1873 primer pairs were designed for the annotated SSR-Contigs. About 77.07% SSR-ESTs could be assigned a significant match to the protein database. 3037 unique SSR-FDM were assigned and IPR003657 (WRKY Domain) was found to be the most dominant FDM among the members. 1810 unique GO terms obtained were further subjected to enrichment analysis to obtain 513 statistically significant GO terms mapped to the SSR containing ESTs. Most frequent enriched GO terms were, GO:0003824 for molecular function, GO:0006350 for biological process and GO:0005886 for cellular component, justifying the richness of defensive secondary metabolites and phytomedicine within the family. The results from this study provides tangible insight to genetic make-up and distribution of SSRs. Functional annotation corresponded many genes of unknown functions which may be considered as novel genes or genes responsible for stress specific secondary metabolites. Further studies are required to understand stress specific genes accountable for leveraging the synthesis of secondary metabolites.

  9. Leptohyphidae (Insecta, Ephemeroptera do estado do Amazonas, Brasil: novos registros, nova combinação, nova espécie e chave de identificação para estágios ninfais Leptohyphidae (Insecta, Ephemeroptera of Amazonas state, Brazil: new records, new combination, new species and identification key for nymphal stages

    Directory of Open Access Journals (Sweden)

    Enide Luciana Lima Belmont

    2012-09-01

    Full Text Available Leptohyphidae (Insecta, Ephemeroptera do Estado do Amazonas, Brasil: novos registros, nova combinação, nova espécie e chave de identificação para estágios ninfais. Os seguintes gêneros de Leptohyphidae ocorrem no estado do Amazonas: Amanahyphes Salles & Molineri, Leptohyphes Eaton, Tricorythodes Ulmer e Tricorythopsis Traver. A distribuição das espécies de Leptohyphidae no Estado do Amazonas é apresentada. Uma espécie nova, Tricorythodes yapekuna sp. nov., é descrita e pode ser diferenciada de outros Tricorythodes pelas (1 garras tarsais com um par de dentículos submarginais e sem dentículos marginais; (2 palpo maxilar biarticulado; (3 brânquia opercular uniformemente preta com exceção da margem apical; (4 fórmula branquial 2/3/3/3/2; e (5 margem lateral do abdome expandida nos segmentos III_VI. Uma combinação nova, Tricorythopsis rondoniensis (Dias, Cruz & Ferreira, 2009 comb. nov., é proposta e constitui o primeiro registro dessa espécie para o Estado do Amazonas. Uma chave dicotômica ilustrada para identificar ninfas de gêneros e espécies ocorrentes no Amazonas também é apresentada.Leptohyphidae (Insecta, Ephemeroptera of Amazonas state, Brazil: new records, new combination, new species and identification key for nymphal stages. The following genera of Leptohyphidae occur in the Amazonas state: Amanahyphes Salles & Molineri, Leptohyphes Eaton, Tricorythodes Ulmer and Tricorythopsis Traver. Distribution of Leptohyphidae species in Amazonas state is presented. A new species, Tricorythodes yapekuna sp. nov., is described and can be distinguished from other Tricorythodes by: (1 tarsal claws with pair of submarginal denticles and no marginal denticles; (2 bi-articulated maxillary palp; (3 opercular gill black except on apical margin; (4 gill formula 2/3/3/3/2; and (5 expanded lateral abdominal margin of segments III_VI. The new combination, Tricorythopsis rondoniensis (Dias, Cruz & Ferreira, 2009 comb. nov., is proposed

  10. A first step in understanding an invasive weed through its genes: an EST analysis of invasive Centaurea maculosa

    Directory of Open Access Journals (Sweden)

    He Ji

    2007-05-01

    Full Text Available Abstract Background The economic and biological implications of plant invasion are overwhelming; however, the processes by which plants become successful invaders are not well understood. Limited genetic resources are available for most invasive and weedy species, making it difficult to study molecular and genetic aspects that may be associated with invasion. Results As an initial step towards understanding the molecular mechanisms by which plants become invasive, we have generated a normalized Expressed Sequence Tag (EST library comprising seven invasive populations of Centaurea maculosa, an invasive aster in North America. Seventy-seven percent of the 4423 unique transcripts showed significant similarity to existing proteins in the NCBI database and could be grouped based on gene ontology assignments. Conclusion The C. maculosa EST library represents an initial step towards looking at gene-specific expression in this species, and will pave the way for creation of other resources such as microarray chips that can help provide a view of global gene expression in invasive C. maculosa and its native counterparts. To our knowledge, this is the first published set of ESTs derived from an invasive weed that will be targeted to study invasive behavior. Understanding the genetic basis of evolution for increased invasiveness in exotic plants is critical to understanding the mechanisms through which exotic invasions occur.

  11. A first step in understanding an invasive weed through its genes: an EST analysis of invasive Centaurea maculosa.

    Science.gov (United States)

    Broz, Amanda K; Broeckling, Corey D; He, Ji; Dai, Xinbin; Zhao, Patrick X; Vivanco, Jorge M

    2007-05-24

    The economic and biological implications of plant invasion are overwhelming; however, the processes by which plants become successful invaders are not well understood. Limited genetic resources are available for most invasive and weedy species, making it difficult to study molecular and genetic aspects that may be associated with invasion. As an initial step towards understanding the molecular mechanisms by which plants become invasive, we have generated a normalized Expressed Sequence Tag (EST) library comprising seven invasive populations of Centaurea maculosa, an invasive aster in North America. Seventy-seven percent of the 4423 unique transcripts showed significant similarity to existing proteins in the NCBI database and could be grouped based on gene ontology assignments. The C. maculosa EST library represents an initial step towards looking at gene-specific expression in this species, and will pave the way for creation of other resources such as microarray chips that can help provide a view of global gene expression in invasive C. maculosa and its native counterparts. To our knowledge, this is the first published set of ESTs derived from an invasive weed that will be targeted to study invasive behavior. Understanding the genetic basis of evolution for increased invasiveness in exotic plants is critical to understanding the mechanisms through which exotic invasions occur.

  12. Towards the understanding of the cocoa transcriptome: Production and analysis of an exhaustive dataset of ESTs of Theobroma cacao L. generated from various tissues and under various conditions

    Science.gov (United States)

    Argout, Xavier; Fouet, Olivier; Wincker, Patrick; Gramacho, Karina; Legavre, Thierry; Sabau, Xavier; Risterucci, Ange Marie; Da Silva, Corinne; Cascardo, Julio; Allegre, Mathilde; Kuhn, David; Verica, Joseph; Courtois, Brigitte; Loor, Gaston; Babin, Regis; Sounigo, Olivier; Ducamp, Michel; Guiltinan, Mark J; Ruiz, Manuel; Alemanno, Laurence; Machado, Regina; Phillips, Wilberth; Schnell, Ray; Gilmour, Martin; Rosenquist, Eric; Butler, David; Maximova, Siela; Lanaud, Claire

    2008-01-01

    Background Theobroma cacao L., is a tree originated from the tropical rainforest of South America. It is one of the major cash crops for many tropical countries. T. cacao is mainly produced on smallholdings, providing resources for 14 million farmers. Disease resistance and T. cacao quality improvement are two important challenges for all actors of cocoa and chocolate production. T. cacao is seriously affected by pests and fungal diseases, responsible for more than 40% yield losses and quality improvement, nutritional and organoleptic, is also important for consumers. An international collaboration was formed to develop an EST genomic resource database for cacao. Results Fifty-six cDNA libraries were constructed from different organs, different genotypes and different environmental conditions. A total of 149,650 valid EST sequences were generated corresponding to 48,594 unigenes, 12,692 contigs and 35,902 singletons. A total of 29,849 unigenes shared significant homology with public sequences from other species. Gene Ontology (GO) annotation was applied to distribute the ESTs among the main GO categories. A specific information system (ESTtik) was constructed to process, store and manage this EST collection allowing the user to query a database. To check the representativeness of our EST collection, we looked for the genes known to be involved in two different metabolic pathways extensively studied in other plant species and important for T. cacao qualities: the flavonoid and the terpene pathways. Most of the enzymes described in other crops for these two metabolic pathways were found in our EST collection. A large collection of new genetic markers was provided by this ESTs collection. Conclusion This EST collection displays a good representation of the T. cacao transcriptome, suitable for analysis of biochemical pathways based on oligonucleotide microarrays derived from these ESTs. It will provide numerous genetic markers that will allow the construction of a high

  13. Towards the understanding of the cocoa transcriptome: Production and analysis of an exhaustive dataset of ESTs of Theobroma cacao L. generated from various tissues and under various conditions.

    Science.gov (United States)

    Argout, Xavier; Fouet, Olivier; Wincker, Patrick; Gramacho, Karina; Legavre, Thierry; Sabau, Xavier; Risterucci, Ange Marie; Da Silva, Corinne; Cascardo, Julio; Allegre, Mathilde; Kuhn, David; Verica, Joseph; Courtois, Brigitte; Loor, Gaston; Babin, Regis; Sounigo, Olivier; Ducamp, Michel; Guiltinan, Mark J; Ruiz, Manuel; Alemanno, Laurence; Machado, Regina; Phillips, Wilberth; Schnell, Ray; Gilmour, Martin; Rosenquist, Eric; Butler, David; Maximova, Siela; Lanaud, Claire

    2008-10-30

    Theobroma cacao L., is a tree originated from the tropical rainforest of South America. It is one of the major cash crops for many tropical countries. T. cacao is mainly produced on smallholdings, providing resources for 14 million farmers. Disease resistance and T. cacao quality improvement are two important challenges for all actors of cocoa and chocolate production. T. cacao is seriously affected by pests and fungal diseases, responsible for more than 40% yield losses and quality improvement, nutritional and organoleptic, is also important for consumers. An international collaboration was formed to develop an EST genomic resource database for cacao. Fifty-six cDNA libraries were constructed from different organs, different genotypes and different environmental conditions. A total of 149,650 valid EST sequences were generated corresponding to 48,594 unigenes, 12,692 contigs and 35,902 singletons. A total of 29,849 unigenes shared significant homology with public sequences from other species.Gene Ontology (GO) annotation was applied to distribute the ESTs among the main GO categories.A specific information system (ESTtik) was constructed to process, store and manage this EST collection allowing the user to query a database.To check the representativeness of our EST collection, we looked for the genes known to be involved in two different metabolic pathways extensively studied in other plant species and important for T. cacao qualities: the flavonoid and the terpene pathways. Most of the enzymes described in other crops for these two metabolic pathways were found in our EST collection.A large collection of new genetic markers was provided by this ESTs collection. This EST collection displays a good representation of the T. cacao transcriptome, suitable for analysis of biochemical pathways based on oligonucleotide microarrays derived from these ESTs. It will provide numerous genetic markers that will allow the construction of a high density gene map of T. cacao

  14. Towards the understanding of the cocoa transcriptome: Production and analysis of an exhaustive dataset of ESTs of Theobroma cacao L. generated from various tissues and under various conditions

    Directory of Open Access Journals (Sweden)

    Ruiz Manuel

    2008-10-01

    Full Text Available Abstract Background Theobroma cacao L., is a tree originated from the tropical rainforest of South America. It is one of the major cash crops for many tropical countries. T. cacao is mainly produced on smallholdings, providing resources for 14 million farmers. Disease resistance and T. cacao quality improvement are two important challenges for all actors of cocoa and chocolate production. T. cacao is seriously affected by pests and fungal diseases, responsible for more than 40% yield losses and quality improvement, nutritional and organoleptic, is also important for consumers. An international collaboration was formed to develop an EST genomic resource database for cacao. Results Fifty-six cDNA libraries were constructed from different organs, different genotypes and different environmental conditions. A total of 149,650 valid EST sequences were generated corresponding to 48,594 unigenes, 12,692 contigs and 35,902 singletons. A total of 29,849 unigenes shared significant homology with public sequences from other species. Gene Ontology (GO annotation was applied to distribute the ESTs among the main GO categories. A specific information system (ESTtik was constructed to process, store and manage this EST collection allowing the user to query a database. To check the representativeness of our EST collection, we looked for the genes known to be involved in two different metabolic pathways extensively studied in other plant species and important for T. cacao qualities: the flavonoid and the terpene pathways. Most of the enzymes described in other crops for these two metabolic pathways were found in our EST collection. A large collection of new genetic markers was provided by this ESTs collection. Conclusion This EST collection displays a good representation of the T. cacao transcriptome, suitable for analysis of biochemical pathways based on oligonucleotide microarrays derived from these ESTs. It will provide numerous genetic markers that will allow

  15. Threatened & Endangered Species Occurrences

    Data.gov (United States)

    Kansas Data Access and Support Center — The database consists of a single statewide coverage of location records for 54 species contained in the Kansas Natural Heritage Inventory database of the Kansas...

  16. Gene discovery in EST sequences from the wheat leaf rust fungus Puccinia triticina sexual spores, asexual spores and haustoria, compared to other rust and corn smut fungi

    Science.gov (United States)

    2011-01-01

    Background Rust fungi are biotrophic basidiomycete plant pathogens that cause major diseases on plants and trees world-wide, affecting agriculture and forestry. Their biotrophic nature precludes many established molecular genetic manipulations and lines of research. The generation of genomic resources for these microbes is leading to novel insights into biology such as interactions with the hosts and guiding directions for breakthrough research in plant pathology. Results To support gene discovery and gene model verification in the genome of the wheat leaf rust fungus, Puccinia triticina (Pt), we have generated Expressed Sequence Tags (ESTs) by sampling several life cycle stages. We focused on several spore stages and isolated haustorial structures from infected wheat, generating 17,684 ESTs. We produced sequences from both the sexual (pycniospores, aeciospores and teliospores) and asexual (germinated urediniospores) stages of the life cycle. From pycniospores and aeciospores, produced by infecting the alternate host, meadow rue (Thalictrum speciosissimum), 4,869 and 1,292 reads were generated, respectively. We generated 3,703 ESTs from teliospores produced on the senescent primary wheat host. Finally, we generated 6,817 reads from haustoria isolated from infected wheat as well as 1,003 sequences from germinated urediniospores. Along with 25,558 previously generated ESTs, we compiled a database of 13,328 non-redundant sequences (4,506 singlets and 8,822 contigs). Fungal genes were predicted using the EST version of the self-training GeneMarkS algorithm. To refine the EST database, we compared EST sequences by BLASTN to a set of 454 pyrosequencing-generated contigs and Sanger BAC-end sequences derived both from the Pt genome, and to ESTs and genome reads from wheat. A collection of 6,308 fungal genes was identified and compared to sequences of the cereal rusts, Puccinia graminis f. sp. tritici (Pgt) and stripe rust, P. striiformis f. sp. tritici (Pst), and poplar

  17. Gene discovery in EST sequences from the wheat leaf rust fungus Puccinia triticina sexual spores, asexual spores and haustoria, compared to other rust and corn smut fungi

    Directory of Open Access Journals (Sweden)

    Wynhoven Brian

    2011-03-01

    Full Text Available Abstract Background Rust fungi are biotrophic basidiomycete plant pathogens that cause major diseases on plants and trees world-wide, affecting agriculture and forestry. Their biotrophic nature precludes many established molecular genetic manipulations and lines of research. The generation of genomic resources for these microbes is leading to novel insights into biology such as interactions with the hosts and guiding directions for breakthrough research in plant pathology. Results To support gene discovery and gene model verification in the genome of the wheat leaf rust fungus, Puccinia triticina (Pt, we have generated Expressed Sequence Tags (ESTs by sampling several life cycle stages. We focused on several spore stages and isolated haustorial structures from infected wheat, generating 17,684 ESTs. We produced sequences from both the sexual (pycniospores, aeciospores and teliospores and asexual (germinated urediniospores stages of the life cycle. From pycniospores and aeciospores, produced by infecting the alternate host, meadow rue (Thalictrum speciosissimum, 4,869 and 1,292 reads were generated, respectively. We generated 3,703 ESTs from teliospores produced on the senescent primary wheat host. Finally, we generated 6,817 reads from haustoria isolated from infected wheat as well as 1,003 sequences from germinated urediniospores. Along with 25,558 previously generated ESTs, we compiled a database of 13,328 non-redundant sequences (4,506 singlets and 8,822 contigs. Fungal genes were predicted using the EST version of the self-training GeneMarkS algorithm. To refine the EST database, we compared EST sequences by BLASTN to a set of 454 pyrosequencing-generated contigs and Sanger BAC-end sequences derived both from the Pt genome, and to ESTs and genome reads from wheat. A collection of 6,308 fungal genes was identified and compared to sequences of the cereal rusts, Puccinia graminis f. sp. tritici (Pgt and stripe rust, P. striiformis f. sp

  18. Update History of This Database - KAIKOcDNA | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available switchLanguage; BLAST Search Image Search Home About Archive Update History Data ...List Contact us KAIKOcDNA Update History of This Database Date Update contents 2014/10/20 The URL of the dat... database ( http://sgp.dna.affrc.go.jp/EST/ ) is opened. About This Database Database Description Download License Update Hi...story of This Database Site Policy | Contact Us Update History of This Database - KAIKOcDNA | LSDB Archive ... ...abase maintenance site is changed. 2014/10/08 KAIKOcDNA English archive site is opened. 2004/04/12 KAIKOcDNA

  19. Exploring nervous system transcriptomes during embryogenesis and metamorphosis in Xenopus tropicalis using EST analysis

    Directory of Open Access Journals (Sweden)

    Wegnez Maurice

    2007-05-01

    Full Text Available Abstract Background The western African clawed frog Xenopus tropicalis is an anuran amphibian species now used as model in vertebrate comparative genomics. It provides the same advantages as Xenopus laevis but is diploid and has a smaller genome of 1.7 Gbp. Therefore X. tropicalis is more amenable to systematic transcriptome surveys. We initiated a large-scale partial cDNA sequencing project to provide a functional genomics resource on genes expressed in the nervous system during early embryogenesis and metamorphosis in X. tropicalis. Results A gene index was defined and analysed after the collection of over 48,785 high quality sequences. These partial cDNA sequences were obtained from an embryonic head and retina library (30,272 sequences and from a metamorphic brain and spinal cord library (27,602 sequences. These ESTs are estimated to represent 9,693 transcripts derived from an estimated 6,000 genes. Comparison of these cDNA sequences with protein databases indicates that 46% contain their start codon. Further annotation included Gene Ontology functional classification, InterPro domain analysis, alternative splicing and non-coding RNA identification. Gene expression profiles were derived from EST counts and used to define transcripts specific to metamorphic stages of development. Moreover, these ESTs allowed identification of a set of 225 polymorphic microsatellites that can be used as genetic markers. Conclusion These cDNA sequences permit in silico cloning of numerous genes and will facilitate studies aimed at deciphering the roles of cognate genes expressed in the nervous system during neural development and metamorphosis. The genomic resources developed to study X. tropicalis biology will accelerate exploration of amphibian physiology and genetics. In particular, the model will facilitate analysis of key questions related to anuran embryogenesis and metamorphosis and its associated regulatory processes.

  20. Comparison of sequencing the D2 region of the large subunit ribosomal RNA gene (MicroSEQ®) versus the internal transcribed spacer (ITS) regions using two public databases for identification of common and uncommon clinically relevant fungal species.

    Science.gov (United States)

    Arbefeville, S; Harris, A; Ferrieri, P

    2017-09-01

    Fungal infections cause considerable morbidity and mortality in immunocompromised patients. Rapid and accurate identification of fungi is essential to guide accurately targeted antifungal therapy. With the advent of molecular methods, clinical laboratories can use new technologies to supplement traditional phenotypic identification of fungi. The aims of the study were to evaluate the sole commercially available MicroSEQ® D2 LSU rDNA Fungal Identification Kit compared to the in-house developed internal transcribed spacer (ITS) regions assay in identifying moulds, using two well-known online public databases to analyze sequenced data. 85 common and uncommon clinically relevant fungi isolated from clinical specimens were sequenced for the D2 region of the large subunit (LSU) of ribosomal RNA (rRNA) gene with the MicroSEQ® Kit and the ITS regions with the in house developed assay. The generated sequenced data were analyzed with the online GenBank and MycoBank public databases. The D2 region of the LSU rRNA gene identified 89.4% or 92.9% of the 85 isolates to the genus level and the full ITS region (f-ITS) 96.5% or 100%, using GenBank or MycoBank, respectively, when compared to the consensus ID. When comparing species-level designations to the consensus ID, D2 region of the LSU rRNA gene aligned with 44.7% (38/85) or 52.9% (45/85) of these isolates in GenBank or MycoBank, respectively. By comparison, f-ITS possessed greater specificity, followed by ITS1, then ITS2 regions using GenBank or MycoBank. Using GenBank or MycoBank, D2 region of the LSU rRNA gene outperformed phenotypic based ID at the genus level. Comparing rates of ID between D2 region of the LSU rRNA gene and the ITS regions in GenBank or MycoBank at the species level against the consensus ID, f-ITS and ITS2 exceeded performance of the D2 region of the LSU rRNA gene, but ITS1 had similar performance to the D2 region of the LSU rRNA gene using MycoBank. Our results indicated that the MicroSEQ® D2 LSU r

  1. ESAP plus: a web-based server for EST-SSR marker development.

    Science.gov (United States)

    Ponyared, Piyarat; Ponsawat, Jiradej; Tongsima, Sissades; Seresangtakul, Pusadee; Akkasaeng, Chutipong; Tantisuwichwong, Nathpapat

    2016-12-22

    Simple sequence repeats (SSRs) have become widely used as molecular markers in plant genetic studies due to their abundance, high allelic variation at each locus and simplicity to analyze using conventional PCR amplification. To study plants with unknown genome sequence, SSR markers from Expressed Sequence Tags (ESTs), which can be obtained from the plant mRNA (converted to cDNA), must be utilized. With the advent of high-throughput sequencing technology, huge EST sequence data have been generated and are now accessible from many public databases. However, SSR marker identification from a large in-house or public EST collection requires a computational pipeline that makes use of several standard bioinformatic tools to design high quality EST-SSR primers. Some of these computational tools are not users friendly and must be tightly integrated with reference genomic databases. A web-based bioinformatic pipeline, called EST Analysis Pipeline Plus (ESAP Plus), was constructed for assisting researchers to develop SSR markers from a large EST collection. ESAP Plus incorporates several bioinformatic scripts and some useful standard software tools necessary for the four main procedures of EST-SSR marker development, namely 1) pre-processing, 2) clustering and assembly, 3) SSR mining and 4) SSR primer design. The proposed pipeline also provides two alternative steps for reducing EST redundancy and identifying SSR loci. Using public sugarcane ESTs, ESAP Plus automatically executed the aforementioned computational pipeline via a simple web user interface, which was implemented using standard PHP, HTML, CSS and Java scripts. With ESAP Plus, users can upload raw EST data and choose various filtering options and parameters to analyze each of the four main procedures through this web interface. All input EST data and their predicted SSR results will be stored in the ESAP Plus MySQL database. Users will be notified via e-mail when the automatic process is completed and they can

  2. Stackfile Database

    Science.gov (United States)

    deVarvalho, Robert; Desai, Shailen D.; Haines, Bruce J.; Kruizinga, Gerhard L.; Gilmer, Christopher

    2013-01-01

    This software provides storage retrieval and analysis functionality for managing satellite altimetry data. It improves the efficiency and analysis capabilities of existing database software with improved flexibility and documentation. It offers flexibility in the type of data that can be stored. There is efficient retrieval either across the spatial domain or the time domain. Built-in analysis tools are provided for frequently performed altimetry tasks. This software package is used for storing and manipulating satellite measurement data. It was developed with a focus on handling the requirements of repeat-track altimetry missions such as Topex and Jason. It was, however, designed to work with a wide variety of satellite measurement data [e.g., Gravity Recovery And Climate Experiment -- GRACE). The software consists of several command-line tools for importing, retrieving, and analyzing satellite measurement data.

  3. GDR (Genome Database for Rosaceae: integrated web resources for Rosaceae genomics and genetics research

    Directory of Open Access Journals (Sweden)

    Ficklin Stephen

    2004-09-01

    Full Text Available Abstract Background Peach is being developed as a model organism for Rosaceae, an economically important family that includes fruits and ornamental plants such as apple, pear, strawberry, cherry, almond and rose. The genomics and genetics data of peach can play a significant role in the gene discovery and the genetic understanding of related species. The effective utilization of these peach resources, however, requires the development of an integrated and centralized database with associated analysis tools. Description The Genome Database for Rosaceae (GDR is a curated and integrated web-based relational database. GDR contains comprehensive data of the genetically anchored peach physical map, an annotated peach EST database, Rosaceae maps and markers and all publicly available Rosaceae sequences. Annotations of ESTs include contig assembly, putative function, simple sequence repeats, and anchored position to the peach physical map where applicable. Our integrated map viewer provides graphical interface to the genetic, transcriptome and physical mapping information. ESTs, BACs and markers can be queried by various categories and the search result sites are linked to the integrated map viewer or to the WebFPC physical map sites. In addition to browsing and querying the database, users can compare their sequences with the annotated GDR sequences via a dedicated sequence similarity server running either the BLAST or FASTA algorithm. To demonstrate the utility of the integrated and fully annotated database and analysis tools, we describe a case study where we anchored Rosaceae sequences to the peach physical and genetic map by sequence similarity. Conclusions The GDR has been initiated to meet the major deficiency in Rosaceae genomics and genetics research, namely a centralized web database and bioinformatics tools for data storage, analysis and exchange. GDR can be accessed at http://www.genome.clemson.edu/gdr/.

  4. GDR (Genome Database for Rosaceae): integrated web resources for Rosaceae genomics and genetics research.

    Science.gov (United States)

    Jung, Sook; Jesudurai, Christopher; Staton, Margaret; Du, Zhidian; Ficklin, Stephen; Cho, Ilhyung; Abbott, Albert; Tomkins, Jeffrey; Main, Dorrie

    2004-09-09

    Peach is being developed as a model organism for Rosaceae, an economically important family that includes fruits and ornamental plants such as apple, pear, strawberry, cherry, almond and rose. The genomics and genetics data of peach can play a significant role in the gene discovery and the genetic understanding of related species. The effective utilization of these peach resources, however, requires the development of an integrated and centralized database with associated analysis tools. The Genome Database for Rosaceae (GDR) is a curated and integrated web-based relational database. GDR contains comprehensive data of the genetically anchored peach physical map, an annotated peach EST database, Rosaceae maps and markers and all publicly available Rosaceae sequences. Annotations of ESTs include contig assembly, putative function, simple sequence repeats, and anchored position to the peach physical map where applicable. Our integrated map viewer provides graphical interface to the genetic, transcriptome and physical mapping information. ESTs, BACs and markers can be queried by various categories and the search result sites are linked to the integrated map viewer or to the WebFPC physical map sites. In addition to browsing and querying the database, users can compare their sequences with the annotated GDR sequences via a dedicated sequence similarity server running either the BLAST or FASTA algorithm. To demonstrate the utility of the integrated and fully annotated database and analysis tools, we describe a case study where we anchored Rosaceae sequences to the peach physical and genetic map by sequence similarity. The GDR has been initiated to meet the major deficiency in Rosaceae genomics and genetics research, namely a centralized web database and bioinformatics tools for data storage, analysis and exchange. GDR can be accessed at http://www.genome.clemson.edu/gdr/.

  5. Development and characterization of novel EST-SSR markers and their application for genetic diversity analysis of Jerusalem artichoke (Helianthus tuberosus L.).

    Science.gov (United States)

    Mornkham, T; Wangsomnuk, P P; Mo, X C; Francisco, F O; Gao, L Z; Kurzweil, H

    2016-10-24

    Jerusalem artichoke (Helianthus tuberosus L.) is a perennial tuberous plant and a traditional inulin-rich crop in Thailand. It has become the most important source of inulin and has great potential for use in chemical and food industries. In this study, expressed sequence tag (EST)-based simple sequence repeat (SSR) markers were developed from 40,362 Jerusalem artichoke ESTs retrieved from the NCBI database. Among 23,691 non-redundant identified ESTs, 1949 SSR motifs harboring 2 to 6 nucleotides with varied repeat motifs were discovered from 1676 assembled sequences. Seventy-nine primer pairs were generated from EST sequences harboring SSR motifs. Our results show that 43 primers are polymorphic for the six studied populations, while the remaining 36 were either monomorphic or failed to amplify. These 43 SSR loci exhibited a high level of genetic diversity among populations, with allele numbers varying from 2 to 7, with an average of 3.95 alleles per loci. Heterozygosity ranged from 0.096 to 0.774, with an average of 0.536; polymorphic index content ranged from 0.096 to 0.854, with an average of 0.568. Principal component analysis and neighbor-joining analysis revealed that the six populations could be divided into six clusters. Our results indicate that these newly characterized EST-SSR markers may be useful in the exploration of genetic diversity and range expansion of the Jerusalem artichoke, and in cross-species application for the genus Helianthus.

  6. Sugarcane expressed sequences tags (ESTs encoding enzymes involved in lignin biosynthesis pathways

    Directory of Open Access Journals (Sweden)

    Ramos Rose Lucia Braz

    2001-01-01

    Full Text Available Lignins are phenolic polymers found in the secondary wall of plant conductive systems where they play an important role by reducing the permeability of the cell wall to water. Lignins are also responsible for the rigidity of the cell wall and are involved in mechanisms of resistance to pathogens. The metabolic routes and enzymes involved in synthesis of lignins have been largely characterized and representative genes that encode enzymes involved in these processes have been cloned from several plant species. The synthesis of lignins is liked to the general metabolism of the phenylpropanoids in plants, having enzymes (e.g. phenylalanine ammonia-lyase (PAL, cinnamate 4-hydroxylase (C4H and caffeic acid O-methyltransferase (COMT common to other processes as well as specific enzymes such as cinnamoyl-CoA reductase (CCR and cinnamyl alcohol dehydrogenase (CAD. Some maize and sorghum mutants, shown to have defective in CAD and/or COMT activity, are easier to digest because they have a reduced lignin content, something which has motivated different research groups to alter the lignin content and composition of model plants by genetic engineering try to improve, for example, the efficiency of paper pulping and digestibility. In the work reported in this paper, we have made an inventory of the sugarcane expressed sequence tag (EST coding for enzymes involved in lignin metabolism which are present in the sugarcane EST genome project (SUCEST database. Our analysis focused on the key enzymes ferulate-5-hydroxylase (F5H, caffeic acid O-methyltransferase (COMT, caffeoyl CoA O-methyltransferase (CCoAOMT, hydroxycinnamate CoA ligase (4CL, cinnamoyl-CoA reductase (CCR and cinnamyl alcohol dehydrogenase (CAD. The comparative analysis of these genes with those described in other species could be used as molecular markers for breeding as well as for the manipulation of lignin metabolism in sugarcane.

  7. EST-derived SNP discovery and selective pressure analysis in Pacific white shrimp ( Litopenaeus vannamei)

    Science.gov (United States)

    Liu, Chengzhang; Wang, Xia; Xiang, Jianhai; Li, Fuhua

    2012-09-01

    Pacific white shrimp has become a major aquaculture and fishery species worldwide. Although a large scale EST resource has been publicly available since 2008, the data have not yet been widely used for SNP discovery or transcriptome-wide assessment of selective pressure. In this study, a set of 155 411 expressed sequence tags (ESTs) from the NCBI database were computationally analyzed and 17 225 single nucleotide polymorphisms (SNPs) were predicted, including 9 546 transitions, 5 124 transversions and 2 481 indels. Among the 7 298 SNP substitutions located in functionally annotated contigs, 58.4% (4 262) are non-synonymous SNPs capable of introducing amino acid mutations. Two hundred and fifty nonsynonymous SNPs in genes associated with economic traits have been identified as candidates for markers in selective breeding. Diversity estimates among the synonymous nucleotides were on average 3.49 times greater than those in non-synonymous, suggesting negative selection. Distribution of non-synonymous to synonymous substitutions (Ka/Ks) ratio ranges from 0 to 4.01, (average 0.42, median 0.26), suggesting that the majority of the affected genes are under purifying selection. Enrichment analysis identified multiple gene ontology categories under positive or negative selection. Categories involved in innate immune response and male gamete generation are rich in positively selected genes, which is similar to reports in Drosophila and primates. This work is the first transcriptome-wide assessment of selective pressure in a Penaeid shrimp species. The functionally annotated SNPs provide a valuable resource of potential molecular markers for selective breeding.

  8. The characterization of a new set of EST-derived simple sequence repeat (SSR markers as a resource for the genetic analysis of Phaseolus vulgaris

    Directory of Open Access Journals (Sweden)

    Borba Tereza CO

    2011-05-01

    Full Text Available Abstract Background Over recent years, a growing effort has been made to develop microsatellite markers for the genomic analysis of the common bean (Phaseolus vulgaris to broaden the knowledge of the molecular genetic basis of this species. The availability of large sets of expressed sequence tags (ESTs in public databases has given rise to an expedient approach for the identification of SSRs (Simple Sequence Repeats, specifically EST-derived SSRs. In the present work, a battery of new microsatellite markers was obtained from a search of the Phaseolus vulgaris EST database. The diversity, degree of transferability and polymorphism of these markers were tested. Results From 9,583 valid ESTs, 4,764 had microsatellite motifs, from which 377 were used to design primers, and 302 (80.11% showed good amplification quality. To analyze transferability, a group of 167 SSRs were tested, and the results showed that they were 82% transferable across at least one species. The highest amplification rates were observed between the species from the Phaseolus (63.7%, Vigna (25.9%, Glycine (19.8%, Medicago (10.2%, Dipterix (6% and Arachis (1.8% genera. The average PIC (Polymorphism Information Content varied from 0.53 for genomic SSRs to 0.47 for EST-SSRs, and the average number of alleles per locus was 4 and 3, respectively. Among the 315 newly tested SSRs in the BJ (BAT93 X Jalo EEP558 population, 24% (76 were polymorphic. The integration of these segregant loci into a framework map composed of 123 previously obtained SSR markers yielded a total of 199 segregant loci, of which 182 (91.5% were mapped to 14 linkage groups, resulting in a map length of 1,157 cM. Conclusions A total of 302 newly developed EST-SSR markers, showing good amplification quality, are available for the genetic analysis of Phaseolus vulgaris. These markers showed satisfactory rates of transferability, especially between species that have great economic and genomic values. Their diversity

  9. Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication.

    Science.gov (United States)

    Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

    2016-06-04

    Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to "Gopoong" and "K-1" were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information.

  10. First genetic linkage map of Taraxacum koksaghyz Rodin based on AFLP, SSR, COS and EST-SSR markers.

    Science.gov (United States)

    Arias, Marina; Hernandez, Monica; Remondegui, Naroa; Huvenaars, Koen; van Dijk, Peter; Ritter, Enrique

    2016-08-04

    Taraxacum koksaghyz Rodin (TKS) has been studied in many occasions as a possible alternative source for natural rubber production of good quality and for inulin production. Some tire companies are already testing TKS tire prototypes. There are also many investigations on the production of bio-fuels from inulin and inulin applications for health improvement and in the food industry. A limited amount of genomic resources exist for TKS and particularly no genetic linkage map is available in this species. We have constructed the first TKS genetic linkage map based on AFLP, COS, SSR and EST-SSR markers. The integrated linkage map with eight linkage groups (LG), representing the eight chromosomes of Russian dandelion, has 185 individual AFLP markers from parent 1, 188 individual AFLP markers from parent 2, 75 common AFLP markers and 6 COS, 1 SSR and 63 EST-SSR loci. Blasting the EST-SSR sequences against known sequences from lettuce allowed a partial alignment of our TKS map with a lettuce map. Blast searches against plant gene databases revealed some homologies with useful genes for downstream applications in the future.

  11. SNP-finding in pig mitochondrial ESTs

    DEFF Research Database (Denmark)

    Scheibye-Alsing, Karsten; Cirera Salicio, Susanna; Gilchrist, M.J.

    2008-01-01

    The Sino-Danish pig genome project produced 685 851 ESTs (Gorodkin et al. 2007), of which 41 499 originated from the mitochondrial genome. In this study, the mitochondrial ESTs were assembled, and 374 putative SNPs were found. Chromatograms for the ESTs containing SNPs were manually inspected, an......, and 112 total (52 non-synonymous) SNPs were found to be of high confidence (five of them are close to disease-causing SNPs in humans). Nine of the high-confidence SNPs were tested experimentally, and eight were confirmed. The SNPs can be accessed online at http://pigest.ku.dk/more.mito...

  12. Ortodoncia estética invisible

    OpenAIRE

    Chávez Sevillano, Manuel Gustavo; Soldevilla Galarza, Luciano

    2014-01-01

    El adulto suele ofrecer resistencia a los convencionales tratamientos ortodóncicos, debido a la necesidad de llevar brackets visibles, tanto metálicos como estéticos de porcelana. El concepto de Ortodoncia Estética Invisible u Ortodoncia Lingual cubre las expectativas de este tipo de pacientes. La técnica multibrackets con aparatología lingual tiene aproximadamente 25 años de desarrollo y con la experiencia de los casos tratados, se ha llegado a la concepción de una técnica completamente prot...

  13. Linking the Taiwan Fish Database to the Global Database

    Directory of Open Access Journals (Sweden)

    Kwang-Tsao Shao

    2007-03-01

    Full Text Available Under the support of the National Digital Archive Program (NDAP, basic species information about most Taiwanese fishes, including their morphology, ecology, distribution, specimens with photos, and literatures have been compiled into the "Fish Database of Taiwan" (http://fishdb.sinica.edu.tw. We expect that the all Taiwanese fish species databank (RSD, with 2800+ species, and the digital "Fish Fauna of Taiwan" will be completed in 2007. Underwater ecological photos and video images for all 2,800+ fishes are quite difficult to achieve but will be collected continuously in the future. In the last year of NDAP, we have successfully integrated all fish specimen data deposited at 7 different institutes in Taiwan as well as their collection maps on the Google Map and Google Earth. Further, the database also provides the pronunciation of Latin scientific names and transliteration of Chinese common names by referring to the Romanization system for all Taiwanese fishes (2,902 species in 292 families so far. The Taiwanese fish species checklist with Chinese common/vernacular names and specimen data has been updated periodically and provided to the global FishBase as well as the Global Biodiversity Information Facility (GBIF through the national portal of the Taiwan Biodiversity Information Facility (TaiBIF. Thus, Taiwanese fish data can be queried and browsed on the WWW. For contributing to the "Barcode of Life" and "All Fishes" international projects, alcohol-preserved specimens of more than 1,800 species and cryobanking tissues of 800 species have been accumulated at RCBAS in the past two years. Through this close collaboration between local and global databases, "The Fish Database of Taiwan" now attracts more than 250,000 visitors and achieves 5 million hits per month. We believe that this local database is becoming an important resource for education, research, conservation, and sustainable use of fish in Taiwan.

  14. Extending Database Integration Technology

    National Research Council Canada - National Science Library

    Buneman, Peter

    1999-01-01

    Formal approaches to the semantics of databases and database languages can have immediate and practical consequences in extending database integration technologies to include a vastly greater range...

  15. De novo comparative transcriptome analysis of genes involved in fruit morphology of pumpkin cultivars with extreme size difference and development of EST-SSR markers.

    Science.gov (United States)

    Xanthopoulou, Aliki; Ganopoulos, Ioannis; Psomopoulos, Fotis; Manioudaki, Maria; Moysiadis, Theodoros; Kapazoglou, Aliki; Osathanunkul, Maslin; Michailidou, Sofia; Kalivas, Apostolos; Tsaftaris, Athanasios; Nianiou-Obeidat, Irini; Madesis, Panagiotis

    2017-07-30

    The genetic basis of fruit size and shape was investigated for the first time in Cucurbita species and genetic loci associated with fruit morphology have been identified. Although extensive genomic resources are available at present for tomato (Solanum lycopersicum), cucumber (Cucumis sativus), melon (Cucumis melo) and watermelon (Citrullus lanatus), genomic databases for Cucurbita species are limited. Recently, our group reported the generation of pumpkin (Cucurbita pepo) transcriptome databases from two contrasting cultivars with extreme fruit sizes. In the current study we used these databases to perform comparative transcriptome analysis in order to identify genes with potential roles in fruit morphology and fruit size. Differential Gene Expression (DGE) analysis between cv. 'Munchkin' (small-fruit) and cv. 'Big Moose' (large-fruit) revealed a variety of candidate genes associated with fruit morphology with significant differences in gene expression between the two cultivars. In addition, we have set the framework for generating EST-SSR markers, which discriminate different C. pepo cultivars and show transferability to related Cucurbitaceae species. The results of the present study will contribute to both further understanding the molecular mechanisms regulating fruit morphology and furthermore identifying the factors that determine fruit size. Moreover, they may lead to the development of molecular marker tools for selecting genotypes with desired morphological traits. Copyright © 2017. Published by Elsevier B.V.

  16. Turismo y estética formativa

    Directory of Open Access Journals (Sweden)

    Clara Inés Sánchez Arciniegas

    2002-11-01

    Full Text Available ¿Existe alguna armonía entre el conocimiento y la estética? Sin duda alguna el hombre con conocimiento y sin virtud de la estética es un autómata; este necesita de las diferentes manifestaciones en las que se encuentra plasmada y necesita recordarlas continuamente. La estética como doctrina de conocimiento no es reciente, en el siglo xviii cuando por primera vez se define teóricamente hasta nuestros días en la Universidad actual. Esta última aparentemente menos evidente debido a las nuevas formas de aprendizaje con herramientas tecnológicas muy avanzadas y contraria a la imagen del universitario que hace dos siglos como complemento a su educación se distraía a través del Gran Tour, partida que tiene el turismo para fomentar el goce estético. ¿Cuál es entonces la relación entre distracción y formación? Se debe entonces reflexionar y comprender que el conocimiento interdisciplinario y la metodología debe procurar en los hombres actitudes adecuadas frente a la cultura y el arte a través del conocimiento.

  17. The morphology of the foregut of larvae and postlarva of Sesarma curacaoense De Man, 1892: a species with facultative lecithotrophy during larval development A morfologia do estômago de larvas e pós-larvas de Sesarma curacaoense De Man, 1892: uma espécie com desenvolvimento larval lecitotrófico facultativo

    Directory of Open Access Journals (Sweden)

    Marlon Aguiar Melo

    2006-01-01

    Full Text Available Previous study on the resistance of larvae of Sesarma curacaoense submitted to starvation has revealed a facultative lecithotrophy during zoeal stages, but megalopa and first juvenile stages are exclusively feeding stages. In the present study, the gross morphology and fine structure of the foregut of S. curacaoense were investigated during larval, megalopa and first juvenile stages. The foregut of the zoea I show specific setae and a filter press apparently functional. The foregut undergoes changes in the zoea II (last larval stage with increment of setae number, mainly on the cardiopyloric valve and complexity of the filter press. After metamorphosis to megalopa stage the foregut become rather complex, with a gastric mill supporting a medial and two lateral teeth well-developed. The foregut of the first juvenile is more specialized compared to the previous stage, showing similar characteristics of the decapod adults. These results provide further evidence of facultative lecithotrophic development in the larvae of S. curacaoense.Estudo prévio sobre o efeito da inanição em larvas de Sesarma curacaoense propôs que estas larvas apresentam comportamento lecitotrófico facultativo. No presente trabalho a morfologia do estômago de S. curacaoense foi estudada durante os estágios larvais, megalopa e juvenil I. A estrutura do estômago da zoea I possui cerdas específicas e com filtro pilórico aparentemente funcional. Especialização no estômago do zoea II (último estágio larval foi evidenciada pelo incremento do número de cerdas na válvula cárdio-pilórica e pela complexidade do filtro pilórico. Após a metamorfose para o estágio megalopa, o estômago ficou consideravelmente complexo, com o aparecimento de um moinho gástrico contendo um medial e dois laterais dentes bem desenvolvidos. O estômago do juvenil I mostrou-se ainda mais especializado que no estágio anterior, exibindo características morfológicas similares

  18. Development and production of an oligonucleotide MuscleChip: use for validation of ambiguous ESTs

    Directory of Open Access Journals (Sweden)

    Lanfranchi Gerolamo

    2002-10-01

    Full Text Available Abstract Background We describe the development, validation, and use of a highly redundant 120,000 oligonucleotide microarray (MuscleChip containing 4,601 probe sets representing 1,150 known genes expressed in muscle and 2,075 EST clusters from a non-normalized subtracted muscle EST sequencing project (28,074 EST sequences. This set included 369 novel EST clusters showing no match to previously characterized proteins in any database. Each probe set was designed to contain 20–32 25 mer oligonucleotides (10–16 paired perfect match and mismatch probe pairs per gene, with each probe evaluated for hybridization kinetics (Tm and similarity to other sequences. The 120,000 oligonucleotides were synthesized by photolithography and light-activated chemistry on each microarray. Results Hybridization of human muscle cRNAs to this MuscleChip (33 samples showed a correlation of 0.6 between the number of ESTs sequenced in each cluster and hybridization intensity. Out of 369 novel EST clusters not showing any similarity to previously characterized proteins, we focused on 250 EST clusters that were represented by robust probe sets on the MuscleChip fulfilling all stringent rules. 102 (41% were found to be consistently "present" by analysis of hybridization to human muscle RNA, of which 40 ESTs (39% could be genome anchored to potential transcription units in the human genome sequence. 19 ESTs of the 40 ESTs were furthermore computer-predicted as exons by one or more than three gene identification algorithms. Conclusion Our analysis found 40 transcriptionally validated, genome-anchored novel EST clusters to be expressed in human muscle. As most of these ESTs were low copy clusters (duplex and triplex in the original 28,000 EST project, the identification of these as significantly expressed is a robust validation of the transcript units that permits subsequent focus on the novel proteins encoded by these genes.

  19. Database development and management

    CERN Document Server

    Chao, Lee

    2006-01-01

    Introduction to Database Systems Functions of a DatabaseDatabase Management SystemDatabase ComponentsDatabase Development ProcessConceptual Design and Data Modeling Introduction to Database Design Process Understanding Business ProcessEntity-Relationship Data Model Representing Business Process with Entity-RelationshipModelTable Structure and NormalizationIntroduction to TablesTable NormalizationTransforming Data Models to Relational Databases .DBMS Selection Transforming Data Models to Relational DatabasesEnforcing ConstraintsCreating Database for Business ProcessPhysical Design and Database

  20. The COMPADRE Plant Matrix Database

    DEFF Research Database (Denmark)

    2014-01-01

    COMPADRE contains demographic information on hundreds of plant species. The data in COMPADRE are in the form of matrix population models and our goal is to make these publicly available to facilitate their use for research and teaching purposes. COMPADRE is an open-access database. We only request...

  1. The Sequenced Angiosperm Genomes and Genome Databases.

    Science.gov (United States)

    Chen, Fei; Dong, Wei; Zhang, Jiawei; Guo, Xinyue; Chen, Junhao; Wang, Zhengjia; Lin, Zhenguo; Tang, Haibao; Zhang, Liangsheng

    2018-01-01

    Angiosperms, the flowering plants, provide the essential resources for human life, such as food, energy, oxygen, and materials. They also promoted the evolution of human, animals, and the planet earth. Despite the numerous advances in genome reports or sequencing technologies, no review covers all the released angiosperm genomes and the genome databases for data sharing. Based on the rapid advances and innovations in the database reconstruction in the last few years, here we provide a comprehensive review for three major types of angiosperm genome databases, including databases for a single species, for a specific angiosperm clade, and for multiple angiosperm species. The scope, tools, and data of each type of databases and their features are concisely discussed. The genome databases for a single species or a clade of species are especially popular for specific group of researchers, while a timely-updated comprehensive database is more powerful for address of major scientific mysteries at the genome scale. Considering the low coverage of flowering plants in any available database, we propose construction of a comprehensive database to facilitate large-scale comparative studies of angiosperm genomes and to promote the collaborative studies of important questions in plant biology.

  2. The first set of EST resource for gene discovery and marker development in pigeonpea (Cajanus cajan L.

    Directory of Open Access Journals (Sweden)

    Byregowda Munishamappa

    2010-03-01

    Full Text Available Abstract Background Pigeonpea (Cajanus cajan (L. Millsp is one of the major grain legume crops of the tropics and subtropics, but biotic stresses [Fusarium wilt (FW, sterility mosaic disease (SMD, etc.] are serious challenges for sustainable crop production. Modern genomic tools such as molecular markers and candidate genes associated with resistance to these stresses offer the possibility of facilitating pigeonpea breeding for improving biotic stress resistance. Availability of limited genomic resources, however, is a serious bottleneck to undertake molecular breeding in pigeonpea to develop superior genotypes with enhanced resistance to above mentioned biotic stresses. With an objective of enhancing genomic resources in pigeonpea, this study reports generation and analysis of comprehensive resource of FW- and SMD- responsive expressed sequence tags (ESTs. Results A total of 16 cDNA libraries were constructed from four pigeonpea genotypes that are resistant and susceptible to FW ('ICPL 20102' and 'ICP 2376' and SMD ('ICP 7035' and 'TTB 7' and a total of 9,888 (9,468 high quality ESTs were generated and deposited in dbEST of GenBank under accession numbers GR463974 to GR473857 and GR958228 to GR958231. Clustering and assembly analyses of these ESTs resulted into 4,557 unique sequences (unigenes including 697 contigs and 3,860 singletons. BLASTN analysis of 4,557 unigenes showed a significant identity with ESTs of different legumes (23.2-60.3%, rice (28.3%, Arabidopsis (33.7% and poplar (35.4%. As expected, pigeonpea ESTs are more closely related to soybean (60.3% and cowpea ESTs (43.6% than other plant ESTs. Similarly, BLASTX similarity results showed that only 1,603 (35.1% out of 4,557 total unigenes correspond to known proteins in the UniProt database (≤ 1E-08. Functional categorization of the annotated unigenes sequences showed that 153 (3.3% genes were assigned to cellular component category, 132 (2.8% to biological process, and 132 (2

  3. Mathematics for Databases

    NARCIS (Netherlands)

    ir. Sander van Laar

    2007-01-01

    A formal description of a database consists of the description of the relations (tables) of the database together with the constraints that must hold on the database. Furthermore the contents of a database can be retrieved using queries. These constraints and queries for databases can very well be

  4. Databases and their application

    NARCIS (Netherlands)

    Grimm, E.C.; Bradshaw, R.H.W; Brewer, S.; Flantua, S.; Giesecke, T.; Lézine, A.M.; Takahara, H.; Williams, J.W.,Jr; Elias, S.A.; Mock, C.J.

    2013-01-01

    During the past 20 years, several pollen database cooperatives have been established. These databases are now constituent databases of the Neotoma Paleoecology Database, a public domain, multiproxy, relational database designed for Quaternary-Pliocene fossil data and modern surface samples. The

  5. DOT Online Database

    Science.gov (United States)

    Page Home Table of Contents Contents Search Database Search Login Login Databases Advisory Circulars accessed by clicking below: Full-Text WebSearch Databases Database Records Date Advisory Circulars 2092 5 data collection and distribution policies. Document Database Website provided by MicroSearch

  6. Medicago truncatula transporter database: a comprehensive database resource for M. truncatula transporters

    Directory of Open Access Journals (Sweden)

    Miao Zhenyan

    2012-02-01

    Full Text Available Abstract Background Medicago truncatula has been chosen as a model species for genomic studies. It is closely related to an important legume, alfalfa. Transporters are a large group of membrane-spanning proteins. They deliver essential nutrients, eject waste products, and assist the cell in sensing environmental conditions by forming a complex system of pumps and channels. Although studies have effectively characterized individual M. truncatula transporters in several databases, until now there has been no available systematic database that includes all transporters in M. truncatula. Description The M. truncatula transporter database (MTDB contains comprehensive information on the transporters in M. truncatula. Based on the TransportTP method, we have presented a novel prediction pipeline. A total of 3,665 putative transporters have been annotated based on International Medicago Genome Annotated Group (IMGAG V3.5 V3 and the M. truncatula Gene Index (MTGI V10.0 releases and assigned to 162 families according to the transporter classification system. These families were further classified into seven types according to their transport mode and energy coupling mechanism. Extensive annotations referring to each protein were generated, including basic protein function, expressed sequence tag (EST mapping, genome locus, three-dimensional template prediction, transmembrane segment, and domain annotation. A chromosome distribution map and text-based Basic Local Alignment Search Tools were also created. In addition, we have provided a way to explore the expression of putative M. truncatula transporter genes under stress treatments. Conclusions In summary, the MTDB enables the exploration and comparative analysis of putative transporters in M. truncatula. A user-friendly web interface and regular updates make MTDB valuable to researchers in related fields. The MTDB is freely available now to all users at http://bioinformatics.cau.edu.cn/MtTransporter/.

  7. Download - AcEST | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available search(/contents-en/) != -1 || url.search(/index-e.html/) != -1 ) { document.getElementById(lang).innerHTML=.../) != -1 ) { url = url.replace(-e.html,.html); document.getElementById(lang).innerHTML=[ Japanese |...en/,/jp/); document.getElementById(lang).innerHTML=[ Japanese | English ]; } else if ( url.search(//contents...//) != -1 ) { url = url.replace(/contents/,/contents-en/); document.getElementById(lang).innerHTML=[ Japanes...e(/contents-en/,/contents/); document.getElementById(lang).innerHTML=[ Japanese | English ]; } else if( url.

  8. Download - ClEST | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available search(/contents-en/) != -1 || url.search(/index-e.html/) != -1 ) { document.getElementById(lang).innerHTML=.../) != -1 ) { url = url.replace(-e.html,.html); document.getElementById(lang).innerHTML=[ Japanese |...en/,/jp/); document.getElementById(lang).innerHTML=[ Japanese | English ]; } else if ( url.search(//contents...//) != -1 ) { url = url.replace(/contents/,/contents-en/); document.getElementById(lang).innerHTML=[ Japanes...e(/contents-en/,/contents/); document.getElementById(lang).innerHTML=[ Japanese | English ]; } else if( url.

  9. SSR mining in oil palm EST database: application in oil palm ...

    Indian Academy of Sciences (India)

    1Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, P.O. Box 10620,. 50720 Kuala Lumpur .... Genomic DNA was extracted from young leaves. The DNA ..... tries are essential to avoid the risk of genetic erosion. The.

  10. Dietary Supplement Ingredient Database

    Science.gov (United States)

    ... and US Department of Agriculture Dietary Supplement Ingredient Database Toggle navigation Menu Home About DSID Mission Current ... values can be saved to build a small database or add to an existing database for national, ...

  11. Energy Consumption Database

    Science.gov (United States)

    Consumption Database The California Energy Commission has created this on-line database for informal reporting ) classifications. The database also provides easy downloading of energy consumption data into Microsoft Excel (XLSX

  12. Estágio na Compta

    OpenAIRE

    Mendes, Dário Miguel dos Santos

    2017-01-01

    O presente documento descreve o trabalho realizado no segundo ano letivo do Mestrado em Produção de Conteúdos Digitais. Relatando um estágio decorrido na empresa tecnológica Compta S.A. Neste relatório é feita uma apresentação da empresa, bem como os seus objetivos como tal. De seguida, efetua-se um enquadramento do estágio no âmbito contextual dos conteúdos digitais, descrevendo as tecnologias utilizadas, necessárias à criação de aplicações para dispositivos móveis. São ainda apresentados...

  13. Estética, Subjetividad y Conocimiento

    Directory of Open Access Journals (Sweden)

    López Avendaño, Olimpia

    2002-12-01

    Full Text Available Este artículo trata de la tradicional dicotomía conocimiento objetivo-subjetivo. En él se analiza la posición de diferentes autores en búsqueda de los elementos que conducen al nacimiento de un nuevo paradigma, integrador, que supere la antinomia y permita visualizar al conocimiento pero sobre todo al individuo que lo produce, integralmente.

  14. Amar por cartas - este inferno de amara

    Directory of Open Access Journals (Sweden)

    Matildes Demetrio dos Santos

    1999-10-01

    Full Text Available Este trabalho tem por objetivo estudar o amor nas Folhas caídas e nas Cartas de amor à Viscondessa da Luz, de Almeida Garrett.Seus poemas cheios de erotismo e sensua lidade, inspirados numa mulher casada, Rosa Montúrar Infante, escandalizaram a sociedade portuguesado sécu lo XIX. Suas cartas de amor, em segredo, criavam uma ficção de características místicas.

  15. Uncommon Species and Other Features

    Data.gov (United States)

    Vermont Center for Geographic Information — The Vermont Fish and Wildlife Department's Natural Heritage Inventory (NHI) maintains a database of uncommon, rare, threatened and endangered species and natural...

  16. Legume and Lotus japonicus Databases

    DEFF Research Database (Denmark)

    Hirakawa, Hideki; Mun, Terry; Sato, Shusei

    2014-01-01

    Since the genome sequence of Lotus japonicus, a model plant of family Fabaceae, was determined in 2008 (Sato et al. 2008), the genomes of other members of the Fabaceae family, soybean (Glycine max) (Schmutz et al. 2010) and Medicago truncatula (Young et al. 2011), have been sequenced. In this sec....... In this section, we introduce representative, publicly accessible online resources related to plant materials, integrated databases containing legume genome information, and databases for genome sequence and derived marker information of legume species including L. japonicus...

  17. Collecting Taxes Database

    Data.gov (United States)

    US Agency for International Development — The Collecting Taxes Database contains performance and structural indicators about national tax systems. The database contains quantitative revenue performance...

  18. USAID Anticorruption Projects Database

    Data.gov (United States)

    US Agency for International Development — The Anticorruption Projects Database (Database) includes information about USAID projects with anticorruption interventions implemented worldwide between 2007 and...

  19. NoSQL databases

    OpenAIRE

    Mrozek, Jakub

    2012-01-01

    This thesis deals with database systems referred to as NoSQL databases. In the second chapter, I explain basic terms and the theory of database systems. A short explanation is dedicated to database systems based on the relational data model and the SQL standardized query language. Chapter Three explains the concept and history of the NoSQL databases, and also presents database models, major features and the use of NoSQL databases in comparison with traditional database systems. In the fourth ...

  20. Transcriptome sequencing of mung bean (Vigna radiate L.) genes and the identification of EST-SSR markers.

    Science.gov (United States)

    Chen, Honglin; Wang, Lixia; Wang, Suhua; Liu, Chunji; Blair, Matthew Wohlgemuth; Cheng, Xuzhen

    2015-01-01

    Mung bean (Vigna radiate (L.) Wilczek) is an important traditional food legume crop, with high economic and nutritional value. It is widely grown in China and other Asian countries. Despite its importance, genomic information is currently unavailable for this crop plant species or some of its close relatives in the Vigna genus. In this study, more than 103 million high quality cDNA sequence reads were obtained from mung bean using Illumina paired-end sequencing technology. The processed reads were assembled into 48,693 unigenes with an average length of 874 bp. Of these unigenes, 25,820 (53.0%) and 23,235 (47.7%) showed significant similarity to proteins in the NCBI non-redundant protein and nucleotide sequence databases, respectively. Furthermore, 19,242 (39.5%) could be classified into gene ontology categories, 18,316 (37.6%) into Swiss-Prot categories and 10,918 (22.4%) into KOG database categories (E-value SSR), and 2,303 sequences contained more than one SSR together in the same expressed sequence tag (EST). A total of 13,134 EST-SSRs were identified as potential molecular markers, with mono-nucleotide A/T repeats being the most abundant motif class and G/C repeats being rare. In this SSR analysis, we found five main repeat motifs: AG/CT (30.8%), GAA/TTC (12.6%), AAAT/ATTT (6.8%), AAAAT/ATTTT (6.2%) and AAAAAT/ATTTTT (1.9%). A total of 200 SSR loci were randomly selected for validation by PCR amplification as EST-SSR markers. Of these, 66 marker primer pairs produced reproducible amplicons that were polymorphic among 31 mung bean accessions selected from diverse geographical locations. The large number of SSR-containing sequences found in this study will be valuable for the construction of a high-resolution genetic linkage maps, association or comparative mapping and genetic analyses of various Vigna species.

  1. Development and characterization of EST-SSR markers in Bombax ceiba (Malvaceae).

    Science.gov (United States)

    Ju, Miao-Miao; Ma, Huan-Cheng; Xin, Pei-Yao; Zhou, Zhi-Li; Tian, Bin

    2015-04-01

    Bombax ceiba (Malvaceae), commonly known as silk cotton tree, is a multipurpose tree species of tropical forests. Novel expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed and characterized for the species using transcriptome analysis. A total of 33 new EST-SSR markers were developed for B. ceiba, of which 13 showed polymorphisms across the 24 individuals from four distant populations tested in the study. The results showed that the number of alleles per polymorphic locus ranged from two to four, and the expected heterozygosity and observed heterozygosity per locus varied from 0.043 to 0.654 and from 0 to 0.609, respectively. These newly developed EST-SSR markers can be used in phylogeographic and population genetic studies to investigate the origin of B. ceiba populations. Furthermore, these EST-SSR markers could also greatly promote the development of molecular breeding studies pertaining to silk cotton tree.

  2. Mining online genomic resources in Anolis carolinensis facilitates rapid and inexpensive development of cross-species microsatellite markers for the Anolis lizard genus.

    Science.gov (United States)

    Wordley, Claire; Slate, Jon; Stapley, Jessica

    2011-01-01

    Online sequence databases can provide valuable resources for the development of cross-species genetic markers. In particular, mining expressed tag sequences (EST) for microsatellites and developing conserved cross-species microsatellite markers can provide a rapid and relatively inexpensive method to develop new markers for a range of species. Here, we adopt this approach to develop cross-species microsatellite markers in Anolis lizards, which is a model genus in evolutionary biology and ecology. Using EST sequences from Anolis carolinensis, we identified 127 microsatellites that satisfied our criteria, and tested 49 of these in five species of Anolis (carolinensis, distichus, apletophallus, porcatus and sagrei). We identified between 8 and 25 new variable genetic markers for five Anolis species. These markers will be a valuable resource for studies of population genetics, comparative mapping, mating systems, behavioural ecology and adaptive radiations in this diverse lineage. © 2010 Blackwell Publishing Ltd.

  3. REDVET está indexada en Scirus

    OpenAIRE

    veterinaria.org

    2009-01-01

    ResumenA veces, la búsqueda en Internet de artículos científicos se convierte en una pesadilla, por lo que es conveniente recurrir a buscadores temáticos y en este aspecto Scirus es una buena opción ya que es uno de los mejores buscadores de temática científica de la Red pues busca en más de 450 millones de páginas web científicas con una búsqueda muy rápida y con resultados sorprendentes.

  4. Le bonheur est dans les airs

    OpenAIRE

    Robène, Luc; Bodin, Dominique; Héas, Stéphane

    2010-01-01

    A partir des années 1880, en France, la dimension touristique des ascensions en ballon prend une importance croissante. En marge des pratiques strictement utilitaires de l'aérostation s’élabore un univers davantage tourné vers la communion et / ou vers le face-à-face de l'homme avec la nature en altitude. L'observation du milieu aérien, la confrontation avec la sauvagerie des éléments et le bien-être que l'environnement vierge des hauteurs est susceptible de procurer aux aéronautes constituen...

  5. PrimateLit Database

    Science.gov (United States)

    Primate Info Net Related Databases NCRR PrimateLit: A bibliographic database for primatology Top of any problems with this service. We welcome your feedback. The PrimateLit database is no longer being Resources, National Institutes of Health. The database is a collaborative project of the Wisconsin Primate

  6. The PREDICTS database: a global database of how local terrestrial biodiversity responds to human impacts

    Science.gov (United States)

    L.N. Hudson; T. Newbold; S. Contu

    2014-01-01

    Biodiversity continues to decline in the face of increasing anthropogenic pressures such as habitat destruction, exploitation, pollution and introduction of alien species. Existing global databases of species’ threat status or population time series are dominated by charismatic species. The collation of datasets with broad taxonomic and biogeographic extents, and that...

  7. KALIMER database development

    Energy Technology Data Exchange (ETDEWEB)

    Jeong, Kwan Seong; Lee, Yong Bum; Jeong, Hae Yong; Ha, Kwi Seok

    2003-03-01

    KALIMER database is an advanced database to utilize the integration management for liquid metal reactor design technology development using Web applications. KALIMER design database is composed of results database, Inter-Office Communication (IOC), 3D CAD database, and reserved documents database. Results database is a research results database during all phase for liquid metal reactor design technology development of mid-term and long-term nuclear R and D. IOC is a linkage control system inter sub project to share and integrate the research results for KALIMER. 3D CAD database is a schematic overview for KALIMER design structure. And reserved documents database is developed to manage several documents and reports since project accomplishment.

  8. KALIMER database development

    International Nuclear Information System (INIS)

    Jeong, Kwan Seong; Lee, Yong Bum; Jeong, Hae Yong; Ha, Kwi Seok

    2003-03-01

    KALIMER database is an advanced database to utilize the integration management for liquid metal reactor design technology development using Web applications. KALIMER design database is composed of results database, Inter-Office Communication (IOC), 3D CAD database, and reserved documents database. Results database is a research results database during all phase for liquid metal reactor design technology development of mid-term and long-term nuclear R and D. IOC is a linkage control system inter sub project to share and integrate the research results for KALIMER. 3D CAD database is a schematic overview for KALIMER design structure. And reserved documents database is developed to manage several documents and reports since project accomplishment

  9. Logical database design principles

    CERN Document Server

    Garmany, John; Clark, Terry

    2005-01-01

    INTRODUCTION TO LOGICAL DATABASE DESIGNUnderstanding a Database Database Architectures Relational Databases Creating the Database System Development Life Cycle (SDLC)Systems Planning: Assessment and Feasibility System Analysis: RequirementsSystem Analysis: Requirements Checklist Models Tracking and Schedules Design Modeling Functional Decomposition DiagramData Flow Diagrams Data Dictionary Logical Structures and Decision Trees System Design: LogicalSYSTEM DESIGN AND IMPLEMENTATION The ER ApproachEntities and Entity Types Attribute Domains AttributesSet-Valued AttributesWeak Entities Constraint

  10. An Interoperable Cartographic Database

    OpenAIRE

    Slobodanka Ključanin; Zdravko Galić

    2007-01-01

    The concept of producing a prototype of interoperable cartographic database is explored in this paper, including the possibilities of integration of different geospatial data into the database management system and their visualization on the Internet. The implementation includes vectorization of the concept of a single map page, creation of the cartographic database in an object-relation database, spatial analysis, definition and visualization of the database content in the form of a map on t...

  11. Software listing: CHEMTOX database

    International Nuclear Information System (INIS)

    Moskowitz, P.D.

    1993-01-01

    Initially launched in 1983, the CHEMTOX Database was among the first microcomputer databases containing hazardous chemical information. The database is used in many industries and government agencies in more than 17 countries. Updated quarterly, the CHEMTOX Database provides detailed environmental and safety information on 7500-plus hazardous substances covered by dozens of regulatory and advisory sources. This brief listing describes the method of accessing data and provides ordering information for those wishing to obtain the CHEMTOX Database

  12. ICRAF Species Switchboard. Version 1.2

    DEFF Research Database (Denmark)

    Kindt, R.; Ordonez, J.; Smith, E.

    2015-01-01

    The current version of the Agroforestry Species Switchboard documents the presence of a total of 26,135 plant species (33,813 species including synonyms) across 19 web-based databases. When available, hyperlinks to information on the selected species in particular databases are provided. In total...

  13. An algorithm of discovering signatures from DNA databases on a computer cluster.

    Science.gov (United States)

    Lee, Hsiao Ping; Sheu, Tzu-Fang

    2014-10-05

    Signatures are short sequences that are unique and not similar to any other sequence in a database that can be used as the basis to identify different species. Even though several signature discovery algorithms have been proposed in the past, these algorithms require the entirety of databases to be loaded in the memory, thus restricting the amount of data that they can process. It makes those algorithms unable to process databases with large amounts of data. Also, those algorithms use sequential models and have slower discovery speeds, meaning that the efficiency can be improved. In this research, we are debuting the utilization of a divide-and-conquer strategy in signature discovery and have proposed a parallel signature discovery algorithm on a computer cluster. The algorithm applies the divide-and-conquer strategy to solve the problem posed to the existing algorithms where they are unable to process large databases and uses a parallel computing mechanism to effectively improve the efficiency of signature discovery. Even when run with just the memory of regular personal computers, the algorithm can still process large databases such as the human whole-genome EST database which were previously unable to be processed by the existing algorithms. The algorithm proposed in this research is not limited by the amount of usable memory and can rapidly find signatures in large databases, making it useful in applications such as Next Generation Sequencing and other large database analysis and processing. The implementation of the proposed algorithm is available at http://www.cs.pu.edu.tw/~fang/DDCSDPrograms/DDCSD.htm.

  14. Combining next-generation sequencing and online databases for microsatellite development in non-model organisms.

    Science.gov (United States)

    Rico, Ciro; Normandeau, Eric; Dion-Côté, Anne-Marie; Rico, María Inés; Côté, Guillaume; Bernatchez, Louis

    2013-12-03

    Next-generation sequencing (NGS) is revolutionising marker development and the rapidly increasing amount of transcriptomes published across a wide variety of taxa is providing valuable sequence databases for the identification of genetic markers without the need to generate new sequences. Microsatellites are still the most important source of polymorphic markers in ecology and evolution. Motivated by our long-term interest in the adaptive radiation of a non-model species complex of whitefishes (Coregonus spp.), in this study, we focus on microsatellite characterisation and multiplex optimisation using transcriptome sequences generated by Illumina® and Roche-454, as well as online databases of Expressed Sequence Tags (EST) for the study of whitefish evolution and demographic history. We identified and optimised 40 polymorphic loci in multiplex PCR reactions and validated the robustness of our analyses by testing several population genetics and phylogeographic predictions using 494 fish from five lakes and 2 distinct ecotypes.

  15. Marketing! Onde está o Paradigma?

    Directory of Open Access Journals (Sweden)

    Deosir Flávio Lobo de Castro Júnior

    2015-09-01

    Full Text Available O debate quantitativo-qualitativo, não é uma discussão recente. O objetivo desse trabalho, portanto, é verificar por meio do conceito de paradigma, novas perspectivas de compreender as pesquisas acadêmicas em marketing, a evolução do pensamento de marketing e as metodologias utilizadas nos estudos da qualidade de serviço. Sem a pretensão de esgotar este assunto e apresentar uma conclusão final, são apresentadas pesquisas que apontam para a necessidade e a importância da pesquisa qualitativa, uma vez que auxilia ao pesquisador uma melhor compreensão da natureza complexa do mundo social em que vivemos. Segundo Santana e Gomes (2007, após análise da discussão de Hegel e Kant, concluem que a razão e a própria epistemologia são construções históricas e evoluem a partir de contradições. O presente artigo está dividido em cinco momentos. Na primeira parte apresenta-se além da introdução a constituição dos objetivos desse ensaio teórico. Na segunda parte apresenta-se uma breve discussão a respeito do conceito de paradigma e o marketing. Na terceira parte apresenta-se uma retrospectiva histórica do marketing e sua evolução de suas escolas a partir dos estudos de Miranda e Arruda (2004. Na quarta parte, são apresentadas as metodologias dos estudos sobre qualidade de serviços e finalmente na quinta parte apresenta-se as considerações finais.

  16. Integration of Biodiversity Databases in Taiwan and Linkage to Global Databases

    Directory of Open Access Journals (Sweden)

    Kwang-Tsao Shao

    2007-03-01

    Full Text Available The biodiversity databases in Taiwan were dispersed to various institutions and colleges with limited amount of data by 2001. The Natural Resources and Ecology GIS Database sponsored by the Council of Agriculture, which is part of the National Geographic Information System planned by the Ministry of Interior, was the most well established biodiversity database in Taiwan. But thisThis database was, however, mainly collectingcollected the distribution data of terrestrial animals and plants within the Taiwan area. In 2001, GBIF was formed, and Taiwan joined as one of the an Associate Participant and started, starting the establishment and integration of animal and plant species databases; therefore, TaiBIF was able to co-operate with GBIF. The information of Catalog of Life, specimens, and alien species were integrated by the Darwin core. The standard. These metadata standards allowed the biodiversity information of Taiwan to connect with global databases.

  17. Kazusa Marker DataBase: a database for genomics, genetics, and molecular breeding in plants

    Science.gov (United States)

    Shirasawa, Kenta; Isobe, Sachiko; Tabata, Satoshi; Hirakawa, Hideki

    2014-01-01

    In order to provide useful genomic information for agronomical plants, we have established a database, the Kazusa Marker DataBase (http://marker.kazusa.or.jp). This database includes information on DNA markers, e.g., SSR and SNP markers, genetic linkage maps, and physical maps, that were developed at the Kazusa DNA Research Institute. Keyword searches for the markers, sequence data used for marker development, and experimental conditions are also available through this database. Currently, 10 plant species have been targeted: tomato (Solanum lycopersicum), pepper (Capsicum annuum), strawberry (Fragaria × ananassa), radish (Raphanus sativus), Lotus japonicus, soybean (Glycine max), peanut (Arachis hypogaea), red clover (Trifolium pratense), white clover (Trifolium repens), and eucalyptus (Eucalyptus camaldulensis). In addition, the number of plant species registered in this database will be increased as our research progresses. The Kazusa Marker DataBase will be a useful tool for both basic and applied sciences, such as genomics, genetics, and molecular breeding in crops. PMID:25320561

  18. Database on veterinary clinical research in homeopathy.

    Science.gov (United States)

    Clausen, Jürgen; Albrecht, Henning

    2010-07-01

    The aim of the present report is to provide an overview of the first database on clinical research in veterinary homeopathy. Detailed searches in the database 'Veterinary Clinical Research-Database in Homeopathy' (http://www.carstens-stiftung.de/clinresvet/index.php). The database contains about 200 entries of randomised clinical trials, non-randomised clinical trials, observational studies, drug provings, case reports and case series. Twenty-two clinical fields are covered and eight different groups of species are included. The database is free of charge and open to all interested veterinarians and researchers. The database enables researchers and veterinarians, sceptics and supporters to get a quick overview of the status of veterinary clinical research in homeopathy and alleviates the preparation of systematical reviews or may stimulate reproductions or even new studies. 2010 Elsevier Ltd. All rights reserved.

  19. TriMEDB: A database to integrate transcribed markers and facilitate genetic studies of the tribe Triticeae

    Directory of Open Access Journals (Sweden)

    Yoshida Takuhiro

    2008-06-01

    Full Text Available Abstract Background The recent rapid accumulation of sequence resources of various crop species ensures an improvement in the genetics approach, including quantitative trait loci (QTL analysis as well as the holistic population analysis and association mapping of natural variations. Because the tribe Triticeae includes important cereals such as wheat and barley, integration of information on the genetic markers in these crops should effectively accelerate map-based genetic studies on Triticeae species and lead to the discovery of key loci involved in plant productivity, which can contribute to sustainable food production. Therefore, informatics applications and a semantic knowledgebase of genome-wide markers are required for the integration of information on and further development of genetic markers in wheat and barley in order to advance conventional marker-assisted genetic analyses and population genomics of Triticeae species. Description The Triticeae mapped expressed sequence tag (EST database (TriMEDB provides information, along with various annotations, regarding mapped cDNA markers that are related to barley and their homologues in wheat. The current version of TriMEDB provides map-location data for barley and wheat ESTs that were retrieved from 3 published barley linkage maps (the barley single nucleotide polymorphism database of the Scottish Crop Research Institute, the barley transcript map of Leibniz Institute of Plant Genetics and Crop Plant Research, and HarvEST barley ver. 1.63 and 1 diploid wheat map. These data were imported to CMap to allow the visualization of the map positions of the ESTs and interrelationships of these ESTs with public gene models and representative cDNA sequences. The retrieved cDNA sequences corresponding to each EST marker were assigned to the rice genome to predict an exon-intron structure. Furthermore, to generate a unique set of EST markers in Triticeae plants among the public domain, 3472 markers were

  20. Crystal structure of hyperthermophilic esterase EstE1 and the relationship between its dimerization and thermostability properties

    Directory of Open Access Journals (Sweden)

    Koh Eunhee

    2007-07-01

    Full Text Available Abstract Background EstE1 is a hyperthermophilic esterase belonging to the hormone-sensitive lipase family and was originally isolated by functional screening of a metagenomic library constructed from a thermal environmental sample. Dimers and oligomers may have been evolutionally selected in thermophiles because intersubunit interactions can confer thermostability on the proteins. The molecular mechanisms of thermostabilization of this extremely thermostable esterase are not well understood due to the lack of structural information. Results Here we report for the first time the 2.1-Å resolution crystal structure of EstE1. The three-dimensional structure of EstE1 exhibits a classic α/β hydrolase fold with a central parallel-stranded beta sheet surrounded by alpha helices on both sides. The residues Ser154, Asp251, and His281 form the catalytic triad motif commonly found in other α/β hydrolases. EstE1 exists as a dimer that is formed by hydrophobic interactions and salt bridges. Circular dichroism spectroscopy and heat inactivation kinetic analysis of EstE1 mutants, which were generated by structure-based site-directed mutagenesis of amino acid residues participating in EstE1 dimerization, revealed that hydrophobic interactions through Val274 and Phe276 on the β8 strand of each monomer play a major role in the dimerization of EstE1. In contrast, the intermolecular salt bridges contribute less significantly to the dimerization and thermostability of EstE1. Conclusion Our results suggest that intermolecular hydrophobic interactions are essential for the hyperthermostability of EstE1. The molecular mechanism that allows EstE1 to endure high temperature will provide guideline for rational design of a thermostable esterase/lipase using the lipolytic enzymes showing structural similarity to EstE1.

  1. Database Description - PSCDB | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available abase Description General information of database Database name PSCDB Alternative n...rial Science and Technology (AIST) Takayuki Amemiya E-mail: Database classification Structure Databases - Protein structure Database...554-D558. External Links: Original website information Database maintenance site Graduate School of Informat...available URL of Web services - Need for user registration Not available About This Database Database Descri...ption Download License Update History of This Database Site Policy | Contact Us Database Description - PSCDB | LSDB Archive ...

  2. Directory of IAEA databases

    International Nuclear Information System (INIS)

    1991-11-01

    The first edition of the Directory of IAEA Databases is intended to describe the computerized information sources available to IAEA staff members. It contains a listing of all databases produced at the IAEA, together with information on their availability

  3. Native Health Research Database

    Science.gov (United States)

    ... Indian Health Board) Welcome to the Native Health Database. Please enter your search terms. Basic Search Advanced ... To learn more about searching the Native Health Database, click here. Tutorial Video The NHD has made ...

  4. Cell Centred Database (CCDB)

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Cell Centered Database (CCDB) is a web accessible database for high resolution 2D, 3D and 4D data from light and electron microscopy, including correlated imaging.

  5. E3 Staff Database

    Data.gov (United States)

    US Agency for International Development — E3 Staff database is maintained by E3 PDMS (Professional Development & Management Services) office. The database is Mysql. It is manually updated by E3 staff as...

  6. Compressing DNA sequence databases with coil

    Directory of Open Access Journals (Sweden)

    Hendy Michael D

    2008-05-01

    Full Text Available Abstract Background Publicly available DNA sequence databases such as GenBank are large, and are growing at an exponential rate. The sheer volume of data being dealt with presents serious storage and data communications problems. Currently, sequence data is usually kept in large "flat files," which are then compressed using standard Lempel-Ziv (gzip compression – an approach which rarely achieves good compression ratios. While much research has been done on compressing individual DNA sequences, surprisingly little has focused on the compression of entire databases of such sequences. In this study we introduce the sequence database compression software coil. Results We have designed and implemented a portable software package, coil, for compressing and decompressing DNA sequence databases based on the idea of edit-tree coding. coil is geared towards achieving high compression ratios at the expense of execution time and memory usage during compression – the compression time represents a "one-off investment" whose cost is quickly amortised if the resulting compressed file is transmitted many times. Decompression requires little memory and is extremely fast. We demonstrate a 5% improvement in compression ratio over state-of-the-art general-purpose compression tools for a large GenBank database file containing Expressed Sequence Tag (EST data. Finally, coil can efficiently encode incremental additions to a sequence database. Conclusion coil presents a compelling alternative to conventional compression of flat files for the storage and distribution of DNA sequence databases having a narrow distribution of sequence lengths, such as EST data. Increasing compression levels for databases having a wide distribution of sequence lengths is a direction for future work.

  7. NIRS database of the original research database

    International Nuclear Information System (INIS)

    Morita, Kyoko

    1991-01-01

    Recently, library staffs arranged and compiled the original research papers that have been written by researchers for 33 years since National Institute of Radiological Sciences (NIRS) established. This papers describes how the internal database of original research papers has been created. This is a small sample of hand-made database. This has been cumulating by staffs who have any knowledge about computer machine or computer programming. (author)

  8. Scopus database: a review.

    Science.gov (United States)

    Burnham, Judy F

    2006-03-08

    The Scopus database provides access to STM journal articles and the references included in those articles, allowing the searcher to search both forward and backward in time. The database can be used for collection development as well as for research. This review provides information on the key points of the database and compares it to Web of Science. Neither database is inclusive, but complements each other. If a library can only afford one, choice must be based in institutional needs.

  9. Aviation Safety Issues Database

    Science.gov (United States)

    Morello, Samuel A.; Ricks, Wendell R.

    2009-01-01

    The aviation safety issues database was instrumental in the refinement and substantiation of the National Aviation Safety Strategic Plan (NASSP). The issues database is a comprehensive set of issues from an extremely broad base of aviation functions, personnel, and vehicle categories, both nationally and internationally. Several aviation safety stakeholders such as the Commercial Aviation Safety Team (CAST) have already used the database. This broader interest was the genesis to making the database publically accessible and writing this report.

  10. Automated Oracle database testing

    CERN Multimedia

    CERN. Geneva

    2014-01-01

    Ensuring database stability and steady performance in the modern world of agile computing is a major challenge. Various changes happening at any level of the computing infrastructure: OS parameters & packages, kernel versions, database parameters & patches, or even schema changes, all can potentially harm production services. This presentation shows how an automatic and regular testing of Oracle databases can be achieved in such agile environment.

  11. Inleiding database-systemen

    NARCIS (Netherlands)

    Pels, H.J.; Lans, van der R.F.; Pels, H.J.; Meersman, R.A.

    1993-01-01

    Dit artikel introduceert de voornaamste begrippen die een rol spelen rond databases en het geeft een overzicht van de doelstellingen, de functies en de componenten van database-systemen. Hoewel de functie van een database intuitief vrij duidelijk is, is het toch een in technologisch opzicht complex

  12. Development of Novel Polymorphic EST-SSR Markers in Bailinggu (Pleurotus tuoliensis for Crossbreeding

    Directory of Open Access Journals (Sweden)

    Yueting Dai

    2017-11-01

    Full Text Available Identification of monokaryons and their mating types and discrimination of hybrid offspring are key steps for the crossbreeding of Pleurotus tuoliensis (Bailinggu. However, conventional crossbreeding methods are troublesome and time consuming. Using RNA-seq technology, we developed new expressed sequence tag-simple sequence repeat (EST-SSR markers for Bailinggu to easily and rapidly identify monokaryons and their mating types, genetic diversity and hybrid offspring. We identified 1110 potential EST-based SSR loci from a newly-sequenced Bailinggu transcriptome and then randomly selected 100 EST-SSRs for further validation. Results showed that 39, 43 and 34 novel EST-SSR markers successfully identified monokaryons from their parent dikaryons, differentiated two different mating types and discriminated F1 and F2 hybrid offspring, respectively. Furthermore, a total of 86 alleles were detected in 37 monokaryons using 18 highly informative EST-SSRs. The observed number of alleles per locus ranged from three to seven. Cluster analysis revealed that these monokaryons have a relatively high level of genetic diversity. Transfer rates of the EST-SSRs in the monokaryons of closely-related species Pleurotus eryngii var. ferulae and Pleurotus ostreatus were 72% and 64%, respectively. Therefore, our study provides new SSR markers and an efficient method to enhance the crossbreeding of Bailinggu and closely-related species.

  13. Development of Novel Polymorphic EST-SSR Markers in Bailinggu (Pleurotus tuoliensis) for Crossbreeding

    Science.gov (United States)

    Dai, Yueting; Su, Wenying; Song, Bing; Li, Yu; Fu, Yongping

    2017-01-01

    Identification of monokaryons and their mating types and discrimination of hybrid offspring are key steps for the crossbreeding of Pleurotus tuoliensis (Bailinggu). However, conventional crossbreeding methods are troublesome and time consuming. Using RNA-seq technology, we developed new expressed sequence tag-simple sequence repeat (EST-SSR) markers for Bailinggu to easily and rapidly identify monokaryons and their mating types, genetic diversity and hybrid offspring. We identified 1110 potential EST-based SSR loci from a newly-sequenced Bailinggu transcriptome and then randomly selected 100 EST-SSRs for further validation. Results showed that 39, 43 and 34 novel EST-SSR markers successfully identified monokaryons from their parent dikaryons, differentiated two different mating types and discriminated F1 and F2 hybrid offspring, respectively. Furthermore, a total of 86 alleles were detected in 37 monokaryons using 18 highly informative EST-SSRs. The observed number of alleles per locus ranged from three to seven. Cluster analysis revealed that these monokaryons have a relatively high level of genetic diversity. Transfer rates of the EST-SSRs in the monokaryons of closely-related species Pleurotus eryngii var. ferulae and Pleurotus ostreatus were 72% and 64%, respectively. Therefore, our study provides new SSR markers and an efficient method to enhance the crossbreeding of Bailinggu and closely-related species. PMID:29149037

  14. Punta del Este terrane : a better knowledge

    International Nuclear Information System (INIS)

    Preciozzi, F.; Sanchez Bettucci, L; Basei, M.

    2002-01-01

    Field Punta del Este (TPE) is constituted by a series of gneisses and migmatites formed in the range of 1000 and 900 Ma, and have been intensely reworked during Rio Doce orogeny (ca. 600-500 Ma). This cortical segment represents high-grade metamorphic terrain, correlatable with complex gneiss Southwest Africa, particularly with Kibariáno Belt - Namaqualanos recognized in the southwestern portion of Africa (Namibia). U-Pb zircon ages in tonalite granitoids indicate values ​​between 1000 and 900 Ma and were interpreted as indicating the time of generation of these rocks. This should also be the age of high-grade metamorphism that affected a large part of the gneissic rocks of the region. Moreover, the anatectic related mobilized leucosomes ages of migmatites gave ca. 520-540 Ma, indicating that the metamorphic conditions (overlapping) during the Rio Doce orogeny reached at least the amphibolite facies. The metasedimentary cover TPE occurs in the vicinity of the towns of La Paloma Rocha. These are represented by a siliciclastic metasedimentary sequence represented by Rocha Formation. Despite the poly phase deformation and low grade metamorphism affecting this training, primary structures are common such as stratification plano-parallel, cross-bedding, hummock y and massive levels with gradational stratification

  15. Two EST-derived marker systems for cultivar identification in tree peony.

    Science.gov (United States)

    Zhang, J J; Shu, Q Y; Liu, Z A; Ren, H X; Wang, L S; De Keyser, E

    2012-02-01

    Tree peony (Paeonia suffruticosa Andrews), a woody deciduous shrub, belongs to the section Moutan DC. in the genus of Paeonia of the Paeoniaceae family. To increase the efficiency of breeding, two EST-derived marker systems were developed based on a tree peony expressed sequence tag (EST) database. Using target region amplification polymorphism (TRAP), 19 of 39 primer pairs showed good amplification for 56 accessions with amplicons ranging from 120 to 3,000 bp long, among which 99.3% were polymorphic. In contrast, 7 of 21 primer pairs demonstrated adequate amplification with clear bands for simple sequence repeats (SSRs) developed from ESTs, and a total of 33 alleles were found in 56 accessions. The similarity matrices generated by TRAP and EST-SSR markers were compared, and the Mantel test (r = 0.57778, P = 0.0020) showed a moderate correlation between the two types of molecular markers. TRAP markers were suitable for DNA fingerprinting and EST-SSR markers were more appropriate for discriminating synonyms (the same cultivars with different names due to limited information exchanged among different geographic areas). The two sets of EST-derived markers will be used further for genetic linkage map construction and quantitative trait locus detection in tree peony.

  16. Database Description - RMOS | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available base Description General information of database Database name RMOS Alternative nam...arch Unit Shoshi Kikuchi E-mail : Database classification Plant databases - Rice Microarray Data and other Gene Expression Database...s Organism Taxonomy Name: Oryza sativa Taxonomy ID: 4530 Database description The Ric...19&lang=en Whole data download - Referenced database Rice Expression Database (RED) Rice full-length cDNA Database... (KOME) Rice Genome Integrated Map Database (INE) Rice Mutant Panel Database (Tos17) Rice Genome Annotation Database

  17. An Interoperable Cartographic Database

    Directory of Open Access Journals (Sweden)

    Slobodanka Ključanin

    2007-05-01

    Full Text Available The concept of producing a prototype of interoperable cartographic database is explored in this paper, including the possibilities of integration of different geospatial data into the database management system and their visualization on the Internet. The implementation includes vectorization of the concept of a single map page, creation of the cartographic database in an object-relation database, spatial analysis, definition and visualization of the database content in the form of a map on the Internet. 

  18. Keyword Search in Databases

    CERN Document Server

    Yu, Jeffrey Xu; Chang, Lijun

    2009-01-01

    It has become highly desirable to provide users with flexible ways to query/search information over databases as simple as keyword search like Google search. This book surveys the recent developments on keyword search over databases, and focuses on finding structural information among objects in a database using a set of keywords. Such structural information to be returned can be either trees or subgraphs representing how the objects, that contain the required keywords, are interconnected in a relational database or in an XML database. The structural keyword search is completely different from

  19. Nuclear power economic database

    International Nuclear Information System (INIS)

    Ding Xiaoming; Li Lin; Zhao Shiping

    1996-01-01

    Nuclear power economic database (NPEDB), based on ORACLE V6.0, consists of three parts, i.e., economic data base of nuclear power station, economic data base of nuclear fuel cycle and economic database of nuclear power planning and nuclear environment. Economic database of nuclear power station includes data of general economics, technique, capital cost and benefit, etc. Economic database of nuclear fuel cycle includes data of technique and nuclear fuel price. Economic database of nuclear power planning and nuclear environment includes data of energy history, forecast, energy balance, electric power and energy facilities

  20. Onde está a antropologia?

    Directory of Open Access Journals (Sweden)

    Mariza G. S. Peirano

    1997-10-01

    Full Text Available Este artigo examina o cenário da antropologia contemporânea mediante o contraste entre quatro livros publicados nos últimos anos, dois nos Estados Unidos (After the Fact, de Clifford Geertz, e Making PCR, de Paul Rabinow e dois na Índia (Pathways, de T. N. Madan, e Critical Events, de Veena Das. A comparação é realizada no contexto de uma breve etnografia das livrarias norte-americanas, na qual se avaliam algumas publicações recentes, assim como a reclassificação de disciplinas tradicionais. O papel indispensável dos autores e obras considerados clássicos é examinado no contexto atual, finalizando com uma agenda para reflexão sobre as possibilidades de comunicação entre antropólogos de várias origens e vertentes.This article examines the setting of contemporary anthropology by means of the contrast between four books recently published in the United States (After the Fact by Clifford Geertz and Making PCR by Paul Rabinow and India (Pathways by T. N. Madan and Critical Events by Veena Das. The comparison is made in the context of a brief ethnographic study of bookstores in the United States which evaluates several recent publications, as well as the reclassification of traditional disciplines. The indispensable role of the authors in works considered classics is examined in present-day context, concluding with a reflection on the possibility of communication between anthropologists of diverse backgrounds and perspectives.

  1. Identification and Validation of EST-Derived Molecular Markers, TRAP and VNTRs, for Banana Research

    NARCIS (Netherlands)

    Garcia, S.A.L.; Talebi, R.; Ferreira, C.F.; Vroh, B.I.; Paiva, L.V.; Kema, G.H.J.; Souza, M.T.

    2011-01-01

    The advent of high-throughput sequencing technology has generated abundant information on DNA sequences for the genomes of many plant species. Expressed Sequence Tags (ESTs), which are unique DNA sequences derived from a cDNA library and therefore representing genes transcribed in specific tissues

  2. Database Description - RPD | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available ase Description General information of database Database name RPD Alternative name Rice Proteome Database...titute of Crop Science, National Agriculture and Food Research Organization Setsuko Komatsu E-mail: Database... classification Proteomics Resources Plant databases - Rice Organism Taxonomy Name: Oryza sativa Taxonomy ID: 4530 Database... description Rice Proteome Database contains information on protei...and entered in the Rice Proteome Database. The database is searchable by keyword,

  3. Database Description - JSNP | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available base Description General information of database Database name JSNP Alternative nam...n Science and Technology Agency Creator Affiliation: Contact address E-mail : Database...sapiens Taxonomy ID: 9606 Database description A database of about 197,000 polymorphisms in Japanese populat...1):605-610 External Links: Original website information Database maintenance site Institute of Medical Scien...er registration Not available About This Database Database Description Download License Update History of This Database

  4. Database Description - ASTRA | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available abase Description General information of database Database name ASTRA Alternative n...tics Journal Search: Contact address Database classification Nucleotide Sequence Databases - Gene structure,...3702 Taxonomy Name: Oryza sativa Taxonomy ID: 4530 Database description The database represents classified p...(10):1211-6. External Links: Original website information Database maintenance site National Institute of Ad... for user registration Not available About This Database Database Description Dow

  5. Database Description - RED | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available ase Description General information of database Database name RED Alternative name Rice Expression Database...enome Research Unit Shoshi Kikuchi E-mail : Database classification Plant databases - Rice Database classifi...cation Microarray, Gene Expression Organism Taxonomy Name: Oryza sativa Taxonomy ID: 4530 Database descripti... Article title: Rice Expression Database: the gateway to rice functional genomics...nt Science (2002) Dec 7 (12):563-564 External Links: Original website information Database maintenance site

  6. Database Description - PLACE | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available abase Description General information of database Database name PLACE Alternative name A Database...Kannondai, Tsukuba, Ibaraki 305-8602, Japan National Institute of Agrobiological Sciences E-mail : Databas...e classification Plant databases Organism Taxonomy Name: Tracheophyta Taxonomy ID: 58023 Database...99, Vol.27, No.1 :297-300 External Links: Original website information Database maintenance site National In...- Need for user registration Not available About This Database Database Descripti

  7. Database Description - Arabidopsis Phenome Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Arabidopsis Phenome Database Database Description General information of database Database n... BioResource Center Hiroshi Masuya Database classification Plant databases - Arabidopsis thaliana Organism T...axonomy Name: Arabidopsis thaliana Taxonomy ID: 3702 Database description The Arabidopsis thaliana phenome i...heir effective application. We developed the new Arabidopsis Phenome Database integrating two novel database...seful materials for their experimental research. The other, the “Database of Curated Plant Phenome” focusing

  8. ESTE EMO and ESTE EBO - emergency response system for NPP Mochovce and NPP Bohunice V-2

    International Nuclear Information System (INIS)

    Caeny, P.; Chyly, M.; Suchon, D.; Smejkalova, E.; Fabova, V.; Mancikova, M.; Muller, P.

    2009-01-01

    Programs ESTE EMO and ESTE EBO are emergency response systems that help the crisis staff of the NPP in assessing the source term (predicted possible release of radionuclides to the atmosphere ), in assessing the urgent protective measures and sectors under threat, in assessing real release (symptoms of release really detected and observed), in calculating radiological impacts of real release, averted or avertable doses, potential doses and doses during transport or evacuation on specified routes. Both systems serve as instruments in case of severe accident (DBA or BDBA) at NPP Mochovce or NPP Bohunice, accidents with threat of release of radioactivity to the atmosphere. Systems are implemented at emergency centre of Mochovce NPP and Bohunice NPP and connected online to the sources of technological and radiological data from the reactor, primary circuit, confinement, secondary circuit, ventilation stack, from the area of NPP (TDS 1) and from the emergency planning zone (TDS 11). Systems are connected online to the sources of meteorological data, too. (authors)

  9. ESTE EMO and ESTE EBO - emergency response system for NPP Mochovce and NPP Bohunice V-2

    International Nuclear Information System (INIS)

    Caeny, P.; Chyly, M.; Suchon, D.; Smejkalova, E.; Fabova, V.; Mancikova, M.; Muller, P.

    2008-01-01

    Programs ESTE EMO and ESTE EBO are emergency response systems that help the crisis staff of the NPP in assessing the source term (predicted possible release of radionuclides to the atmosphere ), in assessing the urgent protective measures and sectors under threat, in assessing real release (symptoms of release really detected and observed), in calculating radiological impacts of real release, averted or avertable doses, potential doses and doses during transport or evacuation on specified routes. Both systems serve as instruments in case of severe accident (DBA or BDBA) at NPP Mochovce or NPP Bohunice, accidents with threat of release of radioactivity to the atmosphere. Systems are implemented at emergency centre of Mochovce NPP and Bohunice NPP and connected online to the sources of technological and radiological data from the reactor, primary circuit, confinement, secondary circuit, ventilation stack, from the area of NPP (TDS 1) and from the emergency planning zone (TDS 11). Systems are connected online to the sources of meteorological data, too. (authors)

  10. Kohvik-restoran C'est La Vie = Cafe-restaurant C'est La Vie

    Index Scriptorium Estoniae

    2008-01-01

    Kohvik-restorani C'est La Vie (Suur-Karja 5, Tallinn) sisekujundusest. Sisearhitekt: Maile Grünberg, kes kavandas ka valdavalt mööbli. Laevalgusti disainis Tõnis Vellama. Skulptuuride (koopiad) autor on keraamik Ülle Rajasalu. Maile Grünbergist, tema viimased tööd. Keldrikorruse ja I korruse plaan, 8 värv. vaadet, foto M. Grünbergist

  11. Hazard Analysis Database Report

    CERN Document Server

    Grams, W H

    2000-01-01

    The Hazard Analysis Database was developed in conjunction with the hazard analysis activities conducted in accordance with DOE-STD-3009-94, Preparation Guide for U S . Department of Energy Nonreactor Nuclear Facility Safety Analysis Reports, for HNF-SD-WM-SAR-067, Tank Farms Final Safety Analysis Report (FSAR). The FSAR is part of the approved Authorization Basis (AB) for the River Protection Project (RPP). This document describes, identifies, and defines the contents and structure of the Tank Farms FSAR Hazard Analysis Database and documents the configuration control changes made to the database. The Hazard Analysis Database contains the collection of information generated during the initial hazard evaluations and the subsequent hazard and accident analysis activities. The Hazard Analysis Database supports the preparation of Chapters 3 ,4 , and 5 of the Tank Farms FSAR and the Unreviewed Safety Question (USQ) process and consists of two major, interrelated data sets: (1) Hazard Analysis Database: Data from t...

  12. Database Optimizing Services

    Directory of Open Access Journals (Sweden)

    Adrian GHENCEA

    2010-12-01

    Full Text Available Almost every organization has at its centre a database. The database provides support for conducting different activities, whether it is production, sales and marketing or internal operations. Every day, a database is accessed for help in strategic decisions. The satisfaction therefore of such needs is entailed with a high quality security and availability. Those needs can be realised using a DBMS (Database Management System which is, in fact, software for a database. Technically speaking, it is software which uses a standard method of cataloguing, recovery, and running different data queries. DBMS manages the input data, organizes it, and provides ways of modifying or extracting the data by its users or other programs. Managing the database is an operation that requires periodical updates, optimizing and monitoring.

  13. National Database of Geriatrics

    DEFF Research Database (Denmark)

    Kannegaard, Pia Nimann; Vinding, Kirsten L; Hare-Bruun, Helle

    2016-01-01

    AIM OF DATABASE: The aim of the National Database of Geriatrics is to monitor the quality of interdisciplinary diagnostics and treatment of patients admitted to a geriatric hospital unit. STUDY POPULATION: The database population consists of patients who were admitted to a geriatric hospital unit....... Geriatric patients cannot be defined by specific diagnoses. A geriatric patient is typically a frail multimorbid elderly patient with decreasing functional ability and social challenges. The database includes 14-15,000 admissions per year, and the database completeness has been stable at 90% during the past......, percentage of discharges with a rehabilitation plan, and the part of cases where an interdisciplinary conference has taken place. Data are recorded by doctors, nurses, and therapists in a database and linked to the Danish National Patient Register. DESCRIPTIVE DATA: Descriptive patient-related data include...

  14. Tradeoffs in distributed databases

    OpenAIRE

    Juntunen, R. (Risto)

    2016-01-01

    Abstract In a distributed database data is spread throughout the network into separated nodes with different DBMS systems (Date, 2000). According to CAP-theorem three database properties — consistency, availability and partition tolerance cannot be achieved simultaneously in distributed database systems. Two of these properties can be achieved but not all three at the same time (Brewer, 2000). Since this theorem there has b...

  15. Specialist Bibliographic Databases

    OpenAIRE

    Gasparyan, Armen Yuri; Yessirkepov, Marlen; Voronov, Alexander A.; Trukhachev, Vladimir I.; Kostyukova, Elena I.; Gerasimov, Alexey N.; Kitas, George D.

    2016-01-01

    Specialist bibliographic databases offer essential online tools for researchers and authors who work on specific subjects and perform comprehensive and systematic syntheses of evidence. This article presents examples of the established specialist databases, which may be of interest to those engaged in multidisciplinary science communication. Access to most specialist databases is through subscription schemes and membership in professional associations. Several aggregators of information and d...

  16. Supply Chain Initiatives Database

    Energy Technology Data Exchange (ETDEWEB)

    None

    2012-11-01

    The Supply Chain Initiatives Database (SCID) presents innovative approaches to engaging industrial suppliers in efforts to save energy, increase productivity and improve environmental performance. This comprehensive and freely-accessible database was developed by the Institute for Industrial Productivity (IIP). IIP acknowledges Ecofys for their valuable contributions. The database contains case studies searchable according to the types of activities buyers are undertaking to motivate suppliers, target sector, organization leading the initiative, and program or partnership linkages.

  17. Database Description - SAHG | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available base Description General information of database Database name SAHG Alternative nam...h: Contact address Chie Motono Tel : +81-3-3599-8067 E-mail : Database classification Structure Databases - ...e databases - Protein properties Organism Taxonomy Name: Homo sapiens Taxonomy ID: 9606 Database description... Links: Original website information Database maintenance site The Molecular Profiling Research Center for D...stration Not available About This Database Database Description Download License Update History of This Database Site Policy | Contact Us Database Description - SAHG | LSDB Archive ...

  18. Intermodal Passenger Connectivity Database -

    Data.gov (United States)

    Department of Transportation — The Intermodal Passenger Connectivity Database (IPCD) is a nationwide data table of passenger transportation terminals, with data on the availability of connections...

  19. Transporter Classification Database (TCDB)

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Transporter Classification Database details a comprehensive classification system for membrane transport proteins known as the Transporter Classification (TC)...

  20. Residency Allocation Database

    Data.gov (United States)

    Department of Veterans Affairs — The Residency Allocation Database is used to determine allocation of funds for residency programs offered by Veterans Affairs Medical Centers (VAMCs). Information...

  1. Smart Location Database - Service

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Smart Location Database (SLD) summarizes over 80 demographic, built environment, transit service, and destination accessibility attributes for every census block...

  2. Database principles programming performance

    CERN Document Server

    O'Neil, Patrick

    2014-01-01

    Database: Principles Programming Performance provides an introduction to the fundamental principles of database systems. This book focuses on database programming and the relationships between principles, programming, and performance.Organized into 10 chapters, this book begins with an overview of database design principles and presents a comprehensive introduction to the concepts used by a DBA. This text then provides grounding in many abstract concepts of the relational model. Other chapters introduce SQL, describing its capabilities and covering the statements and functions of the programmi

  3. Veterans Administration Databases

    Science.gov (United States)

    The Veterans Administration Information Resource Center provides database and informatics experts, customer service, expert advice, information products, and web technology to VA researchers and others.

  4. IVR EFP Database

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — This database contains trip-level reports submitted by vessels participating in Exempted Fishery projects with IVR reporting requirements.

  5. Towards Sensor Database Systems

    DEFF Research Database (Denmark)

    Bonnet, Philippe; Gehrke, Johannes; Seshadri, Praveen

    2001-01-01

    . These systems lack flexibility because data is extracted in a predefined way; also, they do not scale to a large number of devices because large volumes of raw data are transferred regardless of the queries that are submitted. In our new concept of sensor database system, queries dictate which data is extracted...... from the sensors. In this paper, we define the concept of sensor databases mixing stored data represented as relations and sensor data represented as time series. Each long-running query formulated over a sensor database defines a persistent view, which is maintained during a given time interval. We...... also describe the design and implementation of the COUGAR sensor database system....

  6. Database Publication Practices

    DEFF Research Database (Denmark)

    Bernstein, P.A.; DeWitt, D.; Heuer, A.

    2005-01-01

    There has been a growing interest in improving the publication processes for database research papers. This panel reports on recent changes in those processes and presents an initial cut at historical data for the VLDB Journal and ACM Transactions on Database Systems.......There has been a growing interest in improving the publication processes for database research papers. This panel reports on recent changes in those processes and presents an initial cut at historical data for the VLDB Journal and ACM Transactions on Database Systems....

  7. Smart Location Database - Download

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Smart Location Database (SLD) summarizes over 80 demographic, built environment, transit service, and destination accessibility attributes for every census block...

  8. Database Description - RMG | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available ase Description General information of database Database name RMG Alternative name ...raki 305-8602, Japan National Institute of Agrobiological Sciences E-mail : Database... classification Nucleotide Sequence Databases Organism Taxonomy Name: Oryza sativa Japonica Group Taxonomy ID: 39947 Database...rnal: Mol Genet Genomics (2002) 268: 434–445 External Links: Original website information Database...available URL of Web services - Need for user registration Not available About This Database Database Descri

  9. Database Description - KOME | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available base Description General information of database Database name KOME Alternative nam... Sciences Plant Genome Research Unit Shoshi Kikuchi E-mail : Database classification Plant databases - Rice ...Organism Taxonomy Name: Oryza sativa Taxonomy ID: 4530 Database description Information about approximately ...Hayashizaki Y, Kikuchi S. Journal: PLoS One. 2007 Nov 28; 2(11):e1235. External Links: Original website information Database...OS) Rice mutant panel database (Tos17) A Database of Plant Cis-acting Regulatory

  10. Update History of This Database - Arabidopsis Phenome Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Arabidopsis Phenome Database Update History of This Database Date Update contents 2017/02/27 Arabidopsis Phenome Data...base English archive site is opened. - Arabidopsis Phenome Database (http://jphenom...e.info/?page_id=95) is opened. About This Database Database Description Download License Update History of This Database... Site Policy | Contact Us Update History of This Database - Arabidopsis Phenome Database | LSDB Archive ...

  11. Update History of This Database - SKIP Stemcell Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us SKIP Stemcell Database Update History of This Database Date Update contents 2017/03/13 SKIP Stemcell Database... English archive site is opened. 2013/03/29 SKIP Stemcell Database ( https://www.skip.med.k...eio.ac.jp/SKIPSearch/top?lang=en ) is opened. About This Database Database Description Download License Update History of This Databa...se Site Policy | Contact Us Update History of This Database - SKIP Stemcell Database | LSDB Archive ...

  12. Update History of This Database - Yeast Interacting Proteins Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Yeast Interacting Proteins Database Update History of This Database Date Update contents 201...0/03/29 Yeast Interacting Proteins Database English archive site is opened. 2000/12/4 Yeast Interacting Proteins Database...( http://itolab.cb.k.u-tokyo.ac.jp/Y2H/ ) is released. About This Database Database Description... Download License Update History of This Database Site Policy | Contact Us Update History of This Database... - Yeast Interacting Proteins Database | LSDB Archive ...

  13. Download - Trypanosomes Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Trypanosomes Database Download First of all, please read the license of this database. Data ...1.4 KB) Simple search and download Downlaod via FTP FTP server is sometimes jammed. If it is, access [here]. About This Database Data...base Description Download License Update History of This Database Site Policy | Contact Us Download - Trypanosomes Database | LSDB Archive ...

  14. Database design and database administration for a kindergarten

    OpenAIRE

    Vítek, Daniel

    2009-01-01

    The bachelor thesis deals with creation of database design for a standard kindergarten, installation of the designed database into the database system Oracle Database 10g Express Edition and demonstration of the administration tasks in this database system. The verification of the database was proved by a developed access application.

  15. A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

    Directory of Open Access Journals (Sweden)

    Alamar Santiago

    2009-09-01

    Full Text Available Abstract Background Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. Conclusion The new

  16. A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

    Science.gov (United States)

    Marques, M Carmen; Alonso-Cantabrana, Hugo; Forment, Javier; Arribas, Raquel; Alamar, Santiago; Conejero, Vicente; Perez-Amador, Miguel A

    2009-01-01

    Background Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. Conclusion The new EST collection denotes an

  17. BBGD: an online database for blueberry genomic data

    Directory of Open Access Journals (Sweden)

    Matthews Benjamin F

    2007-01-01

    Full Text Available Abstract Background Blueberry is a member of the Ericaceae family, which also includes closely related cranberry and more distantly related rhododendron, azalea, and mountain laurel. Blueberry is a major berry crop in the United States, and one that has great nutritional and economical value. Extreme low temperatures, however, reduce crop yield and cause major losses to US farmers. A better understanding of the genes and biochemical pathways that are up- or down-regulated during cold acclimation is needed to produce blueberry cultivars with enhanced cold hardiness. To that end, the blueberry genomics database (BBDG was developed. Along with the analysis tools and web-based query interfaces, the database serves both the broader Ericaceae research community and the blueberry research community specifically by making available ESTs and gene expression data in searchable formats and in elucidating the underlying mechanisms of cold acclimation and freeze tolerance in blueberry. Description BBGD is the world's first database for blueberry genomics. BBGD is both a sequence and gene expression database. It stores both EST and microarray data and allows scientists to correlate expression profiles with gene function. BBGD is a public online database. Presently, the main focus of the database is the identification of genes in blueberry that are significantly induced or suppressed after low temperature exposure. Conclusion By using the database, researchers have developed EST-based markers for mapping and have identified a number of "candidate" cold tolerance genes that are highly expressed in blueberry flower buds after exposure to low temperatures.

  18. PCR-Based EST Mapping in Wheat (Triticum aestivum L.

    Directory of Open Access Journals (Sweden)

    J. PERRY GUSTAFSON

    2009-04-01

    Full Text Available Mapping expressed sequence tags (ESTs to hexaploid wheat is aimed to reveal the structure and function of the hexaploid wheat genome. Sixty eight ESTs representing 26 genes were mapped into all seven homologous chromosome groups of wheat (Triticum aestivum L using a polymerase chain reaction technique. The majority of the ESTs were mapped to homologous chromosome group 2, and the least were mapped to homologous chromosome group 6. Comparative analysis between the EST map from this study and the EST map based on RFLPs showed 14 genes that have been mapped by both approaches were mapped to the same arm of the same homologous chromosome, which indicated that using PCR-based ESTs was a reliable approach in mapping ESTs in hexaploid wheat.

  19. Directory of IAEA databases

    International Nuclear Information System (INIS)

    1992-12-01

    This second edition of the Directory of IAEA Databases has been prepared within the Division of Scientific and Technical Information (NESI). Its main objective is to describe the computerized information sources available to staff members. This directory contains all databases produced at the IAEA, including databases stored on the mainframe, LAN's and PC's. All IAEA Division Directors have been requested to register the existence of their databases with NESI. For the second edition database owners were requested to review the existing entries for their databases and answer four additional questions. The four additional questions concerned the type of database (e.g. Bibliographic, Text, Statistical etc.), the category of database (e.g. Administrative, Nuclear Data etc.), the available documentation and the type of media used for distribution. In the individual entries on the following pages the answers to the first two questions (type and category) is always listed, but the answers to the second two questions (documentation and media) is only listed when information has been made available

  20. HIV Structural Database

    Science.gov (United States)

    SRD 102 HIV Structural Database (Web, free access)   The HIV Protease Structural Database is an archive of experimentally determined 3-D structures of Human Immunodeficiency Virus 1 (HIV-1), Human Immunodeficiency Virus 2 (HIV-2) and Simian Immunodeficiency Virus (SIV) Proteases and their complexes with inhibitors or products of substrate cleavage.

  1. Balkan Vegetation Database

    NARCIS (Netherlands)

    Vassilev, Kiril; Pedashenko, Hristo; Alexandrova, Alexandra; Tashev, Alexandar; Ganeva, Anna; Gavrilova, Anna; Gradevska, Asya; Assenov, Assen; Vitkova, Antonina; Grigorov, Borislav; Gussev, Chavdar; Filipova, Eva; Aneva, Ina; Knollová, Ilona; Nikolov, Ivaylo; Georgiev, Georgi; Gogushev, Georgi; Tinchev, Georgi; Pachedjieva, Kalina; Koev, Koycho; Lyubenova, Mariyana; Dimitrov, Marius; Apostolova-Stoyanova, Nadezhda; Velev, Nikolay; Zhelev, Petar; Glogov, Plamen; Natcheva, Rayna; Tzonev, Rossen; Boch, Steffen; Hennekens, Stephan M.; Georgiev, Stoyan; Stoyanov, Stoyan; Karakiev, Todor; Kalníková, Veronika; Shivarov, Veselin; Russakova, Veska; Vulchev, Vladimir

    2016-01-01

    The Balkan Vegetation Database (BVD; GIVD ID: EU-00-019; http://www.givd.info/ID/EU-00- 019) is a regional database that consists of phytosociological relevés from different vegetation types from six countries on the Balkan Peninsula (Albania, Bosnia and Herzegovina, Bulgaria, Kosovo, Montenegro

  2. World Database of Happiness

    NARCIS (Netherlands)

    R. Veenhoven (Ruut)

    1995-01-01

    textabstractABSTRACT The World Database of Happiness is an ongoing register of research on subjective appreciation of life. Its purpose is to make the wealth of scattered findings accessible, and to create a basis for further meta-analytic studies. The database involves four sections:
    1.

  3. Dictionary as Database.

    Science.gov (United States)

    Painter, Derrick

    1996-01-01

    Discussion of dictionaries as databases focuses on the digitizing of The Oxford English dictionary (OED) and the use of Standard Generalized Mark-Up Language (SGML). Topics include the creation of a consortium to digitize the OED, document structure, relational databases, text forms, sequence, and discourse. (LRW)

  4. Fire test database

    International Nuclear Information System (INIS)

    Lee, J.A.

    1989-01-01

    This paper describes a project recently completed for EPRI by Impell. The purpose of the project was to develop a reference database of fire tests performed on non-typical fire rated assemblies. The database is designed for use by utility fire protection engineers to locate test reports for power plant fire rated assemblies. As utilities prepare to respond to Information Notice 88-04, the database will identify utilities, vendors or manufacturers who have specific fire test data. The database contains fire test report summaries for 729 tested configurations. For each summary, a contact is identified from whom a copy of the complete fire test report can be obtained. Five types of configurations are included: doors, dampers, seals, wraps and walls. The database is computerized. One version for IBM; one for Mac. Each database is accessed through user-friendly software which allows adding, deleting, browsing, etc. through the database. There are five major database files. One each for the five types of tested configurations. The contents of each provides significant information regarding the test method and the physical attributes of the tested configuration. 3 figs

  5. Children's Culture Database (CCD)

    DEFF Research Database (Denmark)

    Wanting, Birgit

    a Dialogue inspired database with documentation, network (individual and institutional profiles) and current news , paper presented at the research seminar: Electronic access to fiction, Copenhagen, November 11-13, 1996......a Dialogue inspired database with documentation, network (individual and institutional profiles) and current news , paper presented at the research seminar: Electronic access to fiction, Copenhagen, November 11-13, 1996...

  6. Atomic Spectra Database (ASD)

    Science.gov (United States)

    SRD 78 NIST Atomic Spectra Database (ASD) (Web, free access)   This database provides access and search capability for NIST critically evaluated data on atomic energy levels, wavelengths, and transition probabilities that are reasonably up-to-date. The NIST Atomic Spectroscopy Data Center has carried out these critical compilations.

  7. Consumer Product Category Database

    Science.gov (United States)

    The Chemical and Product Categories database (CPCat) catalogs the use of over 40,000 chemicals and their presence in different consumer products. The chemical use information is compiled from multiple sources while product information is gathered from publicly available Material Safety Data Sheets (MSDS). EPA researchers are evaluating the possibility of expanding the database with additional product and use information.

  8. Database in Artificial Intelligence.

    Science.gov (United States)

    Wilkinson, Julia

    1986-01-01

    Describes a specialist bibliographic database of literature in the field of artificial intelligence created by the Turing Institute (Glasgow, Scotland) using the BRS/Search information retrieval software. The subscription method for end-users--i.e., annual fee entitles user to unlimited access to database, document provision, and printed awareness…

  9. NoSQL database scaling

    OpenAIRE

    Žardin, Norbert

    2017-01-01

    NoSQL database scaling is a decision, where system resources or financial expenses are traded for database performance or other benefits. By scaling a database, database performance and resource usage might increase or decrease, such changes might have a negative impact on an application that uses the database. In this work it is analyzed how database scaling affect database resource usage and performance. As a results, calculations are acquired, using which database scaling types and differe...

  10. The LHCb configuration database

    CERN Document Server

    Abadie, L; Van Herwijnen, Eric; Jacobsson, R; Jost, B; Neufeld, N

    2005-01-01

    The aim of the LHCb configuration database is to store information about all the controllable devices of the detector. The experiment's control system (that uses PVSS ) will configure, start up and monitor the detector from the information in the configuration database. The database will contain devices with their properties, connectivity and hierarchy. The ability to store and rapidly retrieve huge amounts of data, and the navigability between devices are important requirements. We have collected use cases to ensure the completeness of the design. Using the entity relationship modelling technique we describe the use cases as classes with attributes and links. We designed the schema for the tables using relational diagrams. This methodology has been applied to the TFC (switches) and DAQ system. Other parts of the detector will follow later. The database has been implemented using Oracle to benefit from central CERN database support. The project also foresees the creation of tools to populate, maintain, and co...

  11. Mycobacteriophage genome database.

    Science.gov (United States)

    Joseph, Jerrine; Rajendran, Vasanthi; Hassan, Sameer; Kumar, Vanaja

    2011-01-01

    Mycobacteriophage genome database (MGDB) is an exclusive repository of the 64 completely sequenced mycobacteriophages with annotated information. It is a comprehensive compilation of the various gene parameters captured from several databases pooled together to empower mycobacteriophage researchers. The MGDB (Version No.1.0) comprises of 6086 genes from 64 mycobacteriophages classified into 72 families based on ACLAME database. Manual curation was aided by information available from public databases which was enriched further by analysis. Its web interface allows browsing as well as querying the classification. The main objective is to collect and organize the complexity inherent to mycobacteriophage protein classification in a rational way. The other objective is to browse the existing and new genomes and describe their functional annotation. The database is available for free at http://mpgdb.ibioinformatics.org/mpgdb.php.

  12. Surviving extreme polar winters by desiccation: clues from Arctic springtail (Onychiurus arcticus EST libraries

    Directory of Open Access Journals (Sweden)

    Kube Michael

    2007-12-01

    Full Text Available Abstract Background Ice, snow and temperatures of -14°C are conditions which most animals would find difficult, if not impossible, to survive in. However this exactly describes the Arctic winter, and the Arctic springtail Onychiurus arcticus regularly survives these extreme conditions and re-emerges in the spring. It is able to do this by reducing the amount of water in its body to almost zero: a process that is called "protective dehydration". The aim of this project was to generate clones and sequence data in the form of ESTs to provide a platform for the future molecular characterisation of the processes involved in protective dehydration. Results Five normalised libraries were produced from both desiccating and rehydrating populations of O. arcticus from stages that had previously been defined as potentially informative for molecular analyses. A total of 16,379 EST clones were generated and analysed using Blast and GO annotation. 40% of the clones produced significant matches against the Swissprot and trembl databases and these were further analysed using GO annotation. Extraction and analysis of GO annotations proved an extremely effective method for identifying generic processes associated with biochemical pathways, proving more efficient than solely analysing Blast data output. A number of genes were identified, which have previously been shown to be involved in water transport and desiccation such as members of the aquaporin family. Identification of these clones in specific libraries associated with desiccation validates the computational analysis by library rather than producing a global overview of all libraries combined. Conclusion This paper describes for the first time EST data from the arctic springtail (O. arcticus. This significantly enhances the number of Collembolan ESTs in the public databases, providing useful comparative data within this phylum. The use of GO annotation for analysis has facilitated the identification of a

  13. Species - San Diego Co. [ds121

    Data.gov (United States)

    California Natural Resource Agency — This is the Biological Observation Database point layer representing baseline observations of sensitive species (as defined by the MSCP) throughout San Diego County....

  14. Database Description - SKIP Stemcell Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us SKIP Stemcell Database Database Description General information of database Database name SKIP Stemcell Database...rsity Journal Search: Contact address http://www.skip.med.keio.ac.jp/en/contact/ Database classification Human Genes and Diseases Dat...abase classification Stemcell Article Organism Taxonomy Name: Homo sapiens Taxonomy ID: 9606 Database...ks: Original website information Database maintenance site Center for Medical Genetics, School of medicine, ...lable Web services Not available URL of Web services - Need for user registration Not available About This Database Database

  15. Preparing and Analyzing Expressed Sequence Tags (ESTs Library for the Mammary Tissue of Local Turkish Kivircik Sheep

    Directory of Open Access Journals (Sweden)

    Nehir Ozdemir Ozgenturk

    2017-01-01

    Full Text Available Kivircik sheep is an important local Turkish sheep according to its meat quality and milk productivity. The aim of this study was to analyze gene expression profiles of both prenatal and postnatal stages for the Kivircik sheep. Therefore, two different cDNA libraries, which were taken from the same Kivircik sheep mammary gland tissue at prenatal and postnatal stages, were constructed. Total 3072 colonies which were randomly selected from the two libraries were sequenced for developing a sheep ESTs collection. We used Phred/Phrap computer programs for analysis of the raw EST and readable EST sequences were assembled with the CAP3 software. Putative functions of all unique sequences and statistical analysis were determined by Geneious software. Total 422 ESTs have over 80% similarity to known sequences of other organisms in NCBI classified by Panther database for the Gene Ontology (GO category. By comparing gene expression profiles, we observed some putative genes that may be relative to reproductive performance or play important roles in milk synthesis and secretion. A total of 2414 ESTs have been deposited to the NCBI GenBank database (GW996847–GW999260. EST data in this study have provided a new source of information to functional genome studies of sheep.

  16. Hazard Analysis Database Report

    Energy Technology Data Exchange (ETDEWEB)

    GAULT, G.W.

    1999-10-13

    The Hazard Analysis Database was developed in conjunction with the hazard analysis activities conducted in accordance with DOE-STD-3009-94, Preparation Guide for US Department of Energy Nonreactor Nuclear Facility Safety Analysis Reports, for the Tank Waste Remediation System (TWRS) Final Safety Analysis Report (FSAR). The FSAR is part of the approved TWRS Authorization Basis (AB). This document describes, identifies, and defines the contents and structure of the TWRS FSAR Hazard Analysis Database and documents the configuration control changes made to the database. The TWRS Hazard Analysis Database contains the collection of information generated during the initial hazard evaluations and the subsequent hazard and accident analysis activities. The database supports the preparation of Chapters 3,4, and 5 of the TWRS FSAR and the USQ process and consists of two major, interrelated data sets: (1) Hazard Evaluation Database--Data from the results of the hazard evaluations; and (2) Hazard Topography Database--Data from the system familiarization and hazard identification.

  17. Database for propagation models

    Science.gov (United States)

    Kantak, Anil V.

    1991-07-01

    A propagation researcher or a systems engineer who intends to use the results of a propagation experiment is generally faced with various database tasks such as the selection of the computer software, the hardware, and the writing of the programs to pass the data through the models of interest. This task is repeated every time a new experiment is conducted or the same experiment is carried out at a different location generating different data. Thus the users of this data have to spend a considerable portion of their time learning how to implement the computer hardware and the software towards the desired end. This situation may be facilitated considerably if an easily accessible propagation database is created that has all the accepted (standardized) propagation phenomena models approved by the propagation research community. Also, the handling of data will become easier for the user. Such a database construction can only stimulate the growth of the propagation research it if is available to all the researchers, so that the results of the experiment conducted by one researcher can be examined independently by another, without different hardware and software being used. The database may be made flexible so that the researchers need not be confined only to the contents of the database. Another way in which the database may help the researchers is by the fact that they will not have to document the software and hardware tools used in their research since the propagation research community will know the database already. The following sections show a possible database construction, as well as properties of the database for the propagation research.

  18. Plant databases and data analysis tools

    Science.gov (United States)

    It is anticipated that the coming years will see the generation of large datasets including diagnostic markers in several plant species with emphasis on crop plants. To use these datasets effectively in any plant breeding program, it is essential to have the information available via public database...

  19. The UCSC Genome Browser Database: update 2006

    DEFF Research Database (Denmark)

    Hinrichs, A S; Karolchik, D; Baertsch, R

    2006-01-01

    The University of California Santa Cruz Genome Browser Database (GBD) contains sequence and annotation data for the genomes of about a dozen vertebrate species and several major model organisms. Genome annotations typically include assembly data, sequence composition, genes and gene predictions, ...

  20. The UCSC genome browser database: update 2007

    DEFF Research Database (Denmark)

    Kuhn, R M; Karolchik, D; Zweig, A S

    2006-01-01

    The University of California, Santa Cruz Genome Browser Database contains, as of September 2006, sequence and annotation data for the genomes of 13 vertebrate and 19 invertebrate species. The Genome Browser displays a wide variety of annotations at all scales from the single nucleotide level up t...

  1. Atlas of Iberian water beetles (ESACIB database).

    Science.gov (United States)

    Sánchez-Fernández, David; Millán, Andrés; Abellán, Pedro; Picazo, Félix; Carbonell, José A; Ribera, Ignacio

    2015-01-01

    The ESACIB ('EScarabajos ACuáticos IBéricos') database is provided, including all available distributional data of Iberian and Balearic water beetles from the literature up to 2013, as well as from museum and private collections, PhD theses, and other unpublished sources. The database contains 62,015 records with associated geographic data (10×10 km UTM squares) for 488 species and subspecies of water beetles, 120 of them endemic to the Iberian Peninsula and eight to the Balearic Islands. This database was used for the elaboration of the "Atlas de los Coleópteros Acuáticos de España Peninsular". In this dataset data of 15 additional species has been added: 11 that occur in the Balearic Islands or mainland Portugal but not in peninsular Spain and an other four with mainly terrestrial habits within the genus Helophorus (for taxonomic coherence). The complete dataset is provided in Darwin Core Archive format.

  2. An EST-based approach for identifying genes expressed in the intestine and gills of pre-smolt Atlantic salmon (Salmo salar

    Directory of Open Access Journals (Sweden)

    Adzhubei Alexei

    2005-12-01

    Full Text Available Abstract Background The Atlantic salmon is an important aquaculture species and a very interesting species biologically, since it spawns in fresh water and develops through several stages before becoming a smolt, the stage at which it migrates to the sea to feed. The dramatic change of habitat requires physiological, morphological and behavioural changes to prepare the salmon for its new environment. These changes are called the parr-smolt transformation or smoltification, and pre-adapt the salmon for survival and growth in the marine environment. The development of hypo-osmotic regulatory ability plays an important part in facilitating the transition from rivers to the sea. The physiological mechanisms behind the developmental changes are largely unknown. An understanding of the transformation process will be vital to the future of the aquaculture industry. A knowledge of which genes are expressed prior to the smoltification process is an important basis for further studies. Results In all, 2974 unique sequences, consisting of 779 contigs and 2195 singlets, were generated for Atlantic salmon from two cDNA libraries constructed from the gills and the intestine, accession numbers [Genbank: CK877169-CK879929, CK884015-CK886537 and CN181112-CN181464]. Nearly 50% of the sequences were assigned putative functions because they showed similarity to known genes, mostly from other species, in one or more of the databases used. The Swiss-Prot database returned significant hits for 1005 sequences. These could be assigned predicted gene products, and 967 were annotated using Gene Ontology (GO terms for molecular function, biological process and/or cellular component, employing an annotation transfer procedure. Conclusion This paper describes the construction of two cDNA libraries from pre-smolt Atlantic salmon (Salmo salar and the subsequent EST sequencing, clustering and assigning of putative function to 1005 genes expressed in the gills and/or intestine.

  3. Product Licenses Database Application

    CERN Document Server

    Tonkovikj, Petar

    2016-01-01

    The goal of this project is to organize and centralize the data about software tools available to CERN employees, as well as provide a system that would simplify the license management process by providing information about the available licenses and their expiry dates. The project development process is consisted of two steps: modeling the products (software tools), product licenses, legal agreements and other data related to these entities in a relational database and developing the front-end user interface so that the user can interact with the database. The result is an ASP.NET MVC web application with interactive views for displaying and managing the data in the underlying database.

  4. LandIT Database

    DEFF Research Database (Denmark)

    Iftikhar, Nadeem; Pedersen, Torben Bach

    2010-01-01

    and reporting purposes. This paper presents the LandIT database; which is result of the LandIT project, which refers to an industrial collaboration project that developed technologies for communication and data integration between farming devices and systems. The LandIT database in principal is based...... on the ISOBUS standard; however the standard is extended with additional requirements, such as gradual data aggregation and flexible exchange of farming data. This paper describes the conceptual and logical schemas of the proposed database based on a real-life farming case study....

  5. JICST Factual Database(2)

    Science.gov (United States)

    Araki, Keisuke

    The computer programme, which builds atom-bond connection tables from nomenclatures, is developed. Chemical substances with their nomenclature and varieties of trivial names or experimental code numbers are inputted. The chemical structures of the database are stereospecifically stored and are able to be searched and displayed according to stereochemistry. Source data are from laws and regulations of Japan, RTECS of US and so on. The database plays a central role within the integrated fact database service of JICST and makes interrelational retrieval possible.

  6. Pattern analysis approach reveals restriction enzyme cutting abnormalities and other cDNA library construction artifacts using raw EST data

    Directory of Open Access Journals (Sweden)

    Zhou Sun

    2012-05-01

    Full Text Available Abstract Background Expressed Sequence Tag (EST sequences are widely used in applications such as genome annotation, gene discovery and gene expression studies. However, some of GenBank dbEST sequences have proven to be “unclean”. Identification of cDNA termini/ends and their structures in raw ESTs not only facilitates data quality control and accurate delineation of transcription ends, but also furthers our understanding of the potential sources of data abnormalities/errors present in the wet-lab procedures for cDNA library construction. Results After analyzing a total of 309,976 raw Pinus taeda ESTs, we uncovered many distinct variations of cDNA termini, some of which prove to be good indicators of wet-lab artifacts, and characterized each raw EST by its cDNA terminus structure patterns. In contrast to the expected patterns, many ESTs displayed complex and/or abnormal patterns that represent potential wet-lab errors such as: a failure of one or both of the restriction enzymes to cut the plasmid vector; a failure of the restriction enzymes to cut the vector at the correct positions; the insertion of two cDNA inserts into a single vector; the insertion of multiple and/or concatenated adapters/linkers; the presence of 3′-end terminal structures in designated 5′-end sequences or vice versa; and so on. With a close examination of these artifacts, many problematic ESTs that have been deposited into public databases by conventional bioinformatics pipelines or tools could be cleaned or filtered by our methodology. We developed a software tool for Abnormality Filtering and Sequence Trimming for ESTs (AFST, http://code.google.com/p/afst/ using a pattern analysis approach. To compare AFST with other pipelines that submitted ESTs into dbEST, we reprocessed 230,783 Pinus taeda and 38,709 Arachis hypogaea GenBank ESTs. We found 7.4% of Pinus taeda and 29.2% of Arachis hypogaea GenBank ESTs are “unclean” or abnormal, all of which could be cleaned

  7. Genetic diversity of cucumber estimated by morpho-physiological and EST-SSR markers.

    Science.gov (United States)

    Pandey, Sudhakar; Ansari, Waquar Akhter; Pandey, Maneesh; Singh, Bijendra

    2018-02-01

    In the present study, genetic variation among 40 cucumber genotypes was analyzed by means of morpho-physiological traits and 21 EST-SSR markers. Diversity was observed for morpho-physiological characters like days to 50% female flowering (37-46.9, number of fruits/plant (1.33-5.80), average fruit weight (41-333), vine length (36-364), relative water content (58.5-92.7), electrolyte leakage (15.9-37.1), photosynthetic efficiency (0.40-0.75) and chlorophyll concentration index (11.1-28.6). The pair wise Jaccard similarity coefficient ranged from 0.00 to 0.27 for quantitative traits and 0.24 to 0.96 for EST-SSR markers indicating that the accessions represent genetically diverse populations. With twenty-one EST-SSR markers, polymorphism revealed among 40 cucumber genotypes, number of alleles varied 2-6 with an average 3.05. Polymorphism information content varied from 0.002 to 0.989 (mean = 0.308). The number of effective allele (Ne), expected heterozygosity (He) and unbiased expected heterozygosity (uHe) of these EST-SSRs were 1.079-1.753, 0.074-0.428 and 0.074-0.434, respectively. Same 21 EST-SSR markers transferability checked in four other Cucumis species: snapmelon ( Cucumis melo var. momordica ), muskmelon ( Cucumis melo L.), pickling melon ( Cucumis melo var. conomon ) and wild muskmelon ( Cucumis melo var. agrestis ) with frequency of 61.9, 95.2, 76.2, and 76.2%, respectively. Present study provides useful information on variability, which can assist geneticists with desirable traits for cucumber germplasm utilization. Observed physiological parameters may assists in selection of genotype for abiotic stress tolerance also, EST-SSR markers may be useful for genetic studies in related species.

  8. Livestock Anaerobic Digester Database

    Science.gov (United States)

    The Anaerobic Digester Database provides basic information about anaerobic digesters on livestock farms in the United States, organized in Excel spreadsheets. It includes projects that are under construction, operating, or shut down.

  9. Toxicity Reference Database

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Toxicity Reference Database (ToxRefDB) contains approximately 30 years and $2 billion worth of animal studies. ToxRefDB allows scientists and the interested...

  10. Dissolution Methods Database

    Data.gov (United States)

    U.S. Department of Health & Human Services — For a drug product that does not have a dissolution test method in the United States Pharmacopeia (USP), the FDA Dissolution Methods Database provides information on...

  11. OTI Activity Database

    Data.gov (United States)

    US Agency for International Development — OTI's worldwide activity database is a simple and effective information system that serves as a program management, tracking, and reporting tool. In each country,...

  12. ARTI Refrigerant Database

    Energy Technology Data Exchange (ETDEWEB)

    Calm, J.M. [Calm (James M.), Great Falls, VA (United States)

    1994-05-27

    The Refrigerant Database consolidates and facilitates access to information to assist industry in developing equipment using alternative refrigerants. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern.

  13. Marine Jurisdictions Database

    National Research Council Canada - National Science Library

    Goldsmith, Roger

    1998-01-01

    The purpose of this project was to take the data gathered for the Maritime Claims chart and create a Maritime Jurisdictions digital database suitable for use with oceanographic mission planning objectives...

  14. Medicaid CHIP ESPC Database

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Environmental Scanning and Program Characteristic (ESPC) Database is in a Microsoft (MS) Access format and contains Medicaid and CHIP data, for the 50 states and...

  15. Records Management Database

    Data.gov (United States)

    US Agency for International Development — The Records Management Database is tool created in Microsoft Access specifically for USAID use. It contains metadata in order to access and retrieve the information...

  16. Reach Address Database (RAD)

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Reach Address Database (RAD) stores the reach address of each Water Program feature that has been linked to the underlying surface water features (streams,...

  17. Household Products Database: Pesticides

    Science.gov (United States)

    ... of Products Manufacturers Ingredients About the Database FAQ Product ... control bulbs carpenter ants caterpillars crabgrass control deer dogs dogs/cats fertilizer w/insecticide fertilizer w/weed ...

  18. Mouse Phenome Database (MPD)

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Mouse Phenome Database (MPD) has characterizations of hundreds of strains of laboratory mice to facilitate translational discoveries and to assist in selection...

  19. Consumer Product Category Database

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Chemical and Product Categories database (CPCat) catalogs the use of over 40,000 chemicals and their presence in different consumer products. The chemical use...

  20. Drycleaner Database - Region 7

    Data.gov (United States)

    U.S. Environmental Protection Agency — THIS DATA ASSET NO LONGER ACTIVE: This is metadata documentation for the Region 7 Drycleaner Database (R7DryClnDB) which tracks all Region7 drycleaners who notify...

  1. National Assessment Database

    Data.gov (United States)

    U.S. Environmental Protection Agency — The National Assessment Database stores and tracks state water quality assessment decisions, Total Maximum Daily Loads (TMDLs) and other watershed plans designed to...

  2. IVR RSA Database

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — This database contains trip-level reports submitted by vessels participating in Research Set-Aside projects with IVR reporting requirements.

  3. Rat Genome Database (RGD)

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Rat Genome Database (RGD) is a collaborative effort between leading research institutions involved in rat genetic and genomic research to collect, consolidate,...

  4. The CAPEC Database

    DEFF Research Database (Denmark)

    Nielsen, Thomas Lund; Abildskov, Jens; Harper, Peter Mathias

    2001-01-01

    in the compound. This classification makes the CAPEC database a very useful tool, for example, in the development of new property models, since properties of chemically similar compounds are easily obtained. A program with efficient search and retrieval functions of properties has been developed.......The Computer-Aided Process Engineering Center (CAPEC) database of measured data was established with the aim to promote greater data exchange in the chemical engineering community. The target properties are pure component properties, mixture properties, and special drug solubility data....... The database divides pure component properties into primary, secondary, and functional properties. Mixture properties are categorized in terms of the number of components in the mixture and the number of phases present. The compounds in the database have been classified on the basis of the functional groups...

  5. Danish Urogynaecological Database

    DEFF Research Database (Denmark)

    Hansen, Ulla Darling; Gradel, Kim Oren; Larsen, Michael Due

    2016-01-01

    , complications if relevant, implants used if relevant, 3-6-month postoperative recording of symptoms, if any. A set of clinical quality indicators is being maintained by the steering committee for the database and is published in an annual report which also contains extensive descriptive statistics. The database......The Danish Urogynaecological Database is established in order to ensure high quality of treatment for patients undergoing urogynecological surgery. The database contains details of all women in Denmark undergoing incontinence surgery or pelvic organ prolapse surgery amounting to ~5,200 procedures...... has a completeness of over 90% of all urogynecological surgeries performed in Denmark. Some of the main variables have been validated using medical records as gold standard. The positive predictive value was above 90%. The data are used as a quality monitoring tool by the hospitals and in a number...

  6. The Danish Urogynaecological Database

    DEFF Research Database (Denmark)

    Guldberg, Rikke; Brostrøm, Søren; Hansen, Jesper Kjær

    2013-01-01

    in the DugaBase from 1 January 2009 to 31 October 2010, using medical records as a reference. RESULTS: A total of 16,509 urogynaecological procedures were registered in the DugaBase by 31 December 2010. The database completeness has increased by calendar time, from 38.2 % in 2007 to 93.2 % in 2010 for public......INTRODUCTION AND HYPOTHESIS: The Danish Urogynaecological Database (DugaBase) is a nationwide clinical database established in 2006 to monitor, ensure and improve the quality of urogynaecological surgery. We aimed to describe its establishment and completeness and to validate selected variables....... This is the first study based on data from the DugaBase. METHODS: The database completeness was calculated as a comparison between urogynaecological procedures reported to the Danish National Patient Registry and to the DugaBase. Validity was assessed for selected variables from a random sample of 200 women...

  7. Danish Pancreatic Cancer Database

    DEFF Research Database (Denmark)

    Fristrup, Claus; Detlefsen, Sönke; Palnæs Hansen, Carsten

    2016-01-01

    : Death is monitored using data from the Danish Civil Registry. This registry monitors the survival status of the Danish population, and the registration is virtually complete. All data in the database are audited by all participating institutions, with respect to baseline characteristics, key indicators......AIM OF DATABASE: The Danish Pancreatic Cancer Database aims to prospectively register the epidemiology, diagnostic workup, diagnosis, treatment, and outcome of patients with pancreatic cancer in Denmark at an institutional and national level. STUDY POPULATION: Since May 1, 2011, all patients...... with microscopically verified ductal adenocarcinoma of the pancreas have been registered in the database. As of June 30, 2014, the total number of patients registered was 2,217. All data are cross-referenced with the Danish Pathology Registry and the Danish Patient Registry to ensure the completeness of registrations...

  8. Food Habits Database (FHDBS)

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The NEFSC Food Habits Database has two major sources of data. The first, and most extensive, is the standard NEFSC Bottom Trawl Surveys Program. During these...

  9. Functionally Graded Materials Database

    Science.gov (United States)

    Kisara, Katsuto; Konno, Tomomi; Niino, Masayuki

    2008-02-01

    Functionally Graded Materials Database (hereinafter referred to as FGMs Database) was open to the society via Internet in October 2002, and since then it has been managed by the Japan Aerospace Exploration Agency (JAXA). As of October 2006, the database includes 1,703 research information entries with 2,429 researchers data, 509 institution data and so on. Reading materials such as "Applicability of FGMs Technology to Space Plane" and "FGMs Application to Space Solar Power System (SSPS)" were prepared in FY 2004 and 2005, respectively. The English version of "FGMs Application to Space Solar Power System (SSPS)" is now under preparation. This present paper explains the FGMs Database, describing the research information data, the sitemap and how to use it. From the access analysis, user access results and users' interests are discussed.

  10. Tethys Acoustic Metadata Database

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The Tethys database houses the metadata associated with the acoustic data collection efforts by the Passive Acoustic Group. These metadata include dates, locations...

  11. NLCD 2011 database

    Data.gov (United States)

    U.S. Environmental Protection Agency — National Land Cover Database 2011 (NLCD 2011) is the most recent national land cover product created by the Multi-Resolution Land Characteristics (MRLC) Consortium....

  12. Medicare Coverage Database

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Medicare Coverage Database (MCD) contains all National Coverage Determinations (NCDs) and Local Coverage Determinations (LCDs), local articles, and proposed NCD...

  13. Household Products Database

    Data.gov (United States)

    U.S. Department of Health & Human Services — This database links over 4,000 consumer brands to health effects from Material Safety Data Sheets (MSDS) provided by the manufacturers and allows scientists and...

  14. Global Volcano Locations Database

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — NGDC maintains a database of over 1,500 volcano locations obtained from the Smithsonian Institution Global Volcanism Program, Volcanoes of the World publication. The...

  15. 1988 Spitak Earthquake Database

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The 1988 Spitak Earthquake database is an extensive collection of geophysical and geological data, maps, charts, images and descriptive text pertaining to the...

  16. Uranium Location Database

    Data.gov (United States)

    U.S. Environmental Protection Agency — A GIS compiled locational database in Microsoft Access of ~15,000 mines with uranium occurrence or production, primarily in the western United States. The metadata...

  17. INIST: databases reorientation

    International Nuclear Information System (INIS)

    Bidet, J.C.

    1995-01-01

    INIST is a CNRS (Centre National de la Recherche Scientifique) laboratory devoted to the treatment of scientific and technical informations and to the management of these informations compiled in a database. Reorientation of the database content has been proposed in 1994 to increase the transfer of research towards enterprises and services, to develop more automatized accesses to the informations, and to create a quality assurance plan. The catalog of publications comprises 5800 periodical titles (1300 for fundamental research and 4500 for applied research). A science and technology multi-thematic database will be created in 1995 for the retrieval of applied and technical informations. ''Grey literature'' (reports, thesis, proceedings..) and human and social sciences data will be added to the base by the use of informations selected in the existing GRISELI and Francis databases. Strong modifications are also planned in the thematic cover of Earth sciences and will considerably reduce the geological information content. (J.S.). 1 tab

  18. Fine Arts Database (FAD)

    Data.gov (United States)

    General Services Administration — The Fine Arts Database records information on federally owned art in the control of the GSA; this includes the location, current condition and information on artists.

  19. Kansas Cartographic Database (KCD)

    Data.gov (United States)

    Kansas Data Access and Support Center — The Kansas Cartographic Database (KCD) is an exact digital representation of selected features from the USGS 7.5 minute topographic map series. Features that are...

  20. Developments in diffraction databases

    International Nuclear Information System (INIS)

    Jenkins, R.

    1999-01-01

    Full text: There are a number of databases available to the diffraction community. Two of the more important of these are the Powder Diffraction File (PDF) maintained by the International Centre for Diffraction Data (ICDD), and the Inorganic Crystal Structure Database (ICSD) maintained by Fachsinformationzentrum (FIZ, Karlsruhe). In application, the PDF has been used as an indispensable tool in phase identification and identification of unknowns. The ICSD database has extensive and explicit reference to the structures of compounds: atomic coordinates, space group and even thermal vibration parameters. A similar database, but for organic compounds, is maintained by the Cambridge Crystallographic Data Centre. These databases are often used as independent sources of information. However, little thought has been given on how to exploit the combined properties of structural database tools. A recently completed agreement between ICDD and FIZ, plus ICDD and Cambridge, provides a first step in complementary use of the PDF and the ICSD databases. The focus of this paper (as indicated below) is to examine ways of exploiting the combined properties of both databases. In 1996, there were approximately 76,000 entries in the PDF and approximately 43,000 entries in the ICSD database. The ICSD database has now been used to calculate entries in the PDF. Thus, to derive d-spacing and peak intensity data requires the synthesis of full diffraction patterns, i.e., we use the structural data in the ICSD database and then add instrumental resolution information. The combined data from PDF and ICSD can be effectively used in many ways. For example, we can calculate PDF data for an ideally random crystal distribution and also in the absence of preferred orientation. Again, we can use systematic studies of intermediate members in solid solutions series to help produce reliable quantitative phase analyses. In some cases, we can study how solid solution properties vary with composition and