WorldWideScience

Sample records for biomedical information extraction

  1. Extracting laboratory test information from biomedical text

    Directory of Open Access Journals (Sweden)

    Yanna Shen Kang

    2013-01-01

    Background: No previous study has reported the efficacy of current natural language processing (NLP) methods for extracting laboratory test information from narrative documents. This study investigates the pathology informatics question of how accurately such information can be extracted from text with current tools and techniques, especially machine learning and symbolic NLP methods. The study data came from a text corpus maintained by the U.S. Food and Drug Administration, containing a rich set of information on laboratory tests and test devices. Methods: The authors developed a symbolic information extraction (SIE) system to extract device- and test-specific information about four types of laboratory test entities: specimens, analytes, units of measure and detection limits. They compared the performance of SIE and three prominent machine learning based NLP systems, LingPipe, GATE and BANNER, each implementing a distinct supervised machine learning method: hidden Markov models, support vector machines and conditional random fields, respectively. Results: The machine learning systems recognized laboratory test entities with moderately high recall, but low precision rates. Their recall rates were relatively higher when the number of distinct entity values (e.g., the spectrum of specimens) was very limited or when the lexical morphology of the entity was distinctive (as in units of measure), yet SIE outperformed them by statistically significant margins on extracting specimen, analyte and detection limit information in both precision and F-measure. Its high recall performance was statistically significant on analyte information extraction. Conclusions: Despite its shortcomings relative to machine learning methods, a well-tailored symbolic system may better discern relevancy among a pile of information of the same type and may outperform a machine learning system by tapping into lexically non-local contextual information such as the document structure.
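
    To make the symbolic style of extraction concrete, here is a minimal rule-based sketch in the spirit of SIE (not the published system): a single hand-written pattern that pulls detection limits and their units of measure out of device text. The unit list, pattern and sample sentence are all hypothetical.

```python
import re

# Hedged sketch of rule-based (symbolic) extraction in the spirit of SIE;
# the pattern set is illustrative, not the published system.
UNIT = r"(?:ng|µg|mg|g|mIU|IU|copies)/(?:mL|L|dL)"
DETECTION_LIMIT = re.compile(
    rf"(?:detection limit|LoD|sensitivity)\s*(?:of|:)?\s*"
    rf"(?P<value>\d+(?:\.\d+)?)\s*(?P<unit>{UNIT})",
    re.IGNORECASE,
)

def extract_detection_limits(text: str):
    """Return (value, unit) pairs for detection-limit mentions."""
    return [(m.group("value"), m.group("unit"))
            for m in DETECTION_LIMIT.finditer(text)]

sample = "The assay has a detection limit of 0.5 ng/mL in serum specimens."
print(extract_detection_limits(sample))  # [('0.5', 'ng/mL')]
```

    Rules like this encode local lexical context explicitly; the paper's finding is that a well-tailored collection of such rules, together with document-structure cues, can beat statistical taggers on precision.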

  2. NAMED ENTITY RECOGNITION FROM BIOMEDICAL TEXT - AN INFORMATION EXTRACTION TASK

    Directory of Open Access Journals (Sweden)

    N. Kanya

    2016-07-01

    Biomedical text mining targets the extraction of significant information from biomedical archives. Bio-TM encompasses Information Retrieval (IR) and Information Extraction (IE). Information retrieval fetches the relevant biomedical literature from repositories such as PubMed and MEDLINE based on a search query, and ends with the generation of a corpus of relevant documents retrieved from the publication databases. The IE task includes preprocessing of the documents, Named Entity Recognition (NER) and relationship extraction, drawing on natural language processing, data mining techniques and machine learning algorithms. The preprocessing task includes tokenization, stop-word removal, shallow parsing and part-of-speech tagging. The NER phase involves recognition of well-defined objects such as genes, proteins or cell lines. This phase leads to the next one, the extraction of relationships between the recognized entities. The work described here is based on the Conditional Random Field (CRF) machine learning algorithm.
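
    A compact sketch of the preprocessing stage named above (tokenization, stop-word removal, part-of-speech tagging), here using NLTK as a stand-in toolkit; the downstream CRF tagger is not shown. The sample sentence is hypothetical.

```python
# Sketch of the preprocessing stage described above, using NLTK
# (tokenization, stop-word removal, part-of-speech tagging).
# Requires one-time downloads: nltk.download('punkt'),
# nltk.download('stopwords'), nltk.download('averaged_perceptron_tagger')
import nltk
from nltk.corpus import stopwords

def preprocess(sentence: str):
    tokens = nltk.word_tokenize(sentence)
    stop = set(stopwords.words("english"))
    content = [t for t in tokens if t.lower() not in stop]
    return nltk.pos_tag(content)

print(preprocess("The BRCA1 protein interacts with BARD1 in human cells."))
```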

  3. Construction of an annotated corpus to support biomedical information extraction

    Directory of Open Access Journals (Sweden)

    McNaught John

    2009-10-01

    Background: Information Extraction (IE) is a component of text mining that facilitates knowledge discovery by automatically locating instances of interesting biomedical events in huge document collections. As events are usually centred on verbs and nominalised verbs, understanding the syntactic and semantic behaviour of these words is highly important. Corpora annotated with information concerning this behaviour can constitute a valuable resource in the training of IE components and resources. Results: We have defined a new scheme for annotating sentence-bound gene regulation events, centred on both verbs and nominalised verbs. For each event instance, all participants (arguments) in the same sentence are identified and assigned a semantic role from a rich set of 13 roles tailored to biomedical research articles, together with a biological concept type linked to the Gene Regulation Ontology. To our knowledge, our scheme is unique within the biomedical field in terms of the range of event arguments identified. Using the scheme, we have created the Gene Regulation Event Corpus (GREC), consisting of 240 MEDLINE abstracts, in which events relating to gene regulation and expression have been annotated by biologists. A novel method of evaluating various different facets of the annotation task showed that average inter-annotator agreement rates fall within the range of 66%-90%. Conclusion: The GREC is a unique resource within the biomedical field, in that it annotates not only core relationships between entities, but also a range of other important details about these relationships, e.g., location, temporal, manner and environmental conditions. As such, it is specifically designed to support bio-specific tool and resource development. It has already been used to acquire semantic frames for inclusion within the BioLexicon (a lexical, terminological resource to aid biomedical text mining). Initial experiments have also shown that the corpus may...
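
    To make the annotation scheme concrete, a hypothetical GREC-style event record is shown below as a plain data structure: a nominalised verb acts as the event trigger, and each in-sentence participant carries a semantic role and a biological concept type. The role and concept labels here are illustrative placeholders, not the official 13-role set.

```python
# Hypothetical sentence-bound event annotation in the spirit of GREC;
# role and concept labels are illustrative placeholders.
event = {
    "trigger": {"text": "expression", "type": "nominalised_verb"},
    "arguments": [
        {"text": "sigma32", "role": "Agent",
         "concept": "Transcription_Factor"},
        {"text": "groEL", "role": "Theme", "concept": "Gene"},
        {"text": "heat shock", "role": "Condition",
         "concept": "Environmental_Factor"},
    ],
    "sentence_id": "S1",
}
print(event["trigger"]["text"], "has", len(event["arguments"]), "arguments")
```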

  4. Introducing meta-services for biomedical information extraction

    OpenAIRE

    Leitner, Florian; Krallinger, Martin; Johnson, Calvin A.; Saetre, Rune; Yoshida, Kazuhiro; Chen, Yan Hua; Kim, Sun; Shin, Soo-Yong; Zhang, Byoung-Tak; Baumgartner, William A.; Hunter, Lawrence; Haddow, Barry; Rodriguez-Penagos, Carlos; Matthews, Michael; Wang, Xinglong

    2008-01-01

    We introduce the first meta-service for information extraction in molecular biology, the BioCreative MetaServer (BCMS; http://bcms.bioinfo.cnio.es/). This prototype platform is a joint effort of 13 research groups and provides automatically generated annotations for PubMed/Medline abstracts. Annotation types cover gene names, gene IDs, species, and protein-protein interactions. The annotations are distributed by the meta-server in both human and machine readable formats (HTML/XML). This servi...

  5. BioSimplify: an open source sentence simplification engine to improve recall in automatic biomedical information extraction

    CERN Document Server

    Jonnalagadda, Siddhartha

    2011-01-01

    BioSimplify is an open source tool written in Java that introduces and facilitates the use of a novel model for sentence simplification tuned for automatic discourse analysis and information extraction (as opposed to sentence simplification for improving human readability). The model is based on a "shot-gun" approach that produces many different (simpler) versions of the original sentence by combining variants of its constituent elements. This tool is optimized for processing biomedical scientific literature such as the abstracts indexed in PubMed. We tested the impact of our tool on the task of protein-protein interaction (PPI) extraction: it improved the F-score of the PPI tool by around 7%, with an improvement in recall of around 20%. The BioSimplify tool and test corpus can be downloaded from https://biosimplify.sourceforge.net.
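
    As a rough illustration of the "shot-gun" idea (emphatically not the BioSimplify algorithm itself), the sketch below generates several simpler variants of one sentence by dropping parenthetical material and splitting coordinated clauses; a downstream extractor would then run on every variant and the results would be merged.

```python
import re

# Toy illustration of a "shot-gun" style simplification: generate several
# simpler variants of a sentence. This is NOT the BioSimplify algorithm.
def variants(sentence: str):
    out = {sentence.strip()}
    # Variant 1: drop parenthetical material.
    out.add(re.sub(r"\s*\([^)]*\)", "", sentence).strip())
    # Variant 2: split coordinated or relative clauses.
    for base in list(out):
        for part in re.split(r",\s*(?:and|which|but)\s+", base):
            part = part.strip().rstrip(".,")
            if part:
                out.add(part + ".")
    return sorted(out)

s = ("BRCA1 binds BARD1 (a RING-domain protein), "
     "and the complex ubiquitinates ER-alpha.")
for v in variants(s):
    print(v)
```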

  6. Towards Evidence-based Precision Medicine: Extracting Population Information from Biomedical Text using Binary Classifiers and Syntactic Patterns.

    Science.gov (United States)

    Raja, Kalpana; Dasot, Naman; Goyal, Pawan; Jonnalagadda, Siddhartha R

    2016-01-01

    Precision Medicine is an emerging approach for prevention and treatment of disease that considers individual variability in genes, environment, and lifestyle for each person. The dissemination of individualized evidence by automatically identifying population information in literature is key for evidence-based precision medicine at the point-of-care. We propose a hybrid approach using natural language processing techniques to automatically extract the population information from biomedical literature. Our approach first implements a binary classifier to classify sentences with or without population information. A rule-based system based on syntactic-tree regular expressions is then applied to sentences containing population information to extract the population named entities. The proposed two-stage approach achieved an F-score of 0.81 using a MaxEnt classifier and the rule-based system, and an F-score of 0.87 using a Naïve Bayes classifier and the rule-based system, and performed relatively well compared to many existing systems. The system and evaluation dataset are being released as open source. PMID:27570671
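
    A minimal two-stage sketch in the spirit of the paper: a Naïve Bayes sentence classifier followed by a surface regular expression standing in for the syntactic-tree regular expressions of the published rule-based system. The training sentences and the pattern are hypothetical.

```python
import re
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Stage 1: classify sentences as containing population information or not.
# The tiny training set here is hypothetical.
sentences = [
    "We enrolled 120 patients with type 2 diabetes.",
    "A total of 58 children with asthma participated.",
    "The protein was purified by affinity chromatography.",
    "Samples were stored at -80 degrees Celsius.",
]
labels = [1, 1, 0, 0]
clf = make_pipeline(CountVectorizer(), MultinomialNB()).fit(sentences, labels)

# Stage 2: surface-pattern stand-in for the syntactic-tree regexes.
POPULATION = re.compile(r"\b(\d+)\s+(patients|children|adults|subjects|participants)\b")

def extract_population(sentence: str):
    if clf.predict([sentence])[0] == 1:
        m = POPULATION.search(sentence)
        if m:
            return {"count": int(m.group(1)), "group": m.group(2)}
    return None

print(extract_population("We enrolled 120 patients with type 2 diabetes."))
```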

  7. BIMS: Biomedical Information Management System

    OpenAIRE

    Mora, Oscar; Bisbal, Jesús

    2013-01-01

    In this paper, we present BIMS (Biomedical Information Management System). BIMS is a software architecture designed to provide a flexible computational framework to manage the information needs of a wide range of biomedical research projects. The main goal is to facilitate clinicians' work in data entry and researchers' tasks in data management in biomedical research projects demanding high data quality. The BIMS architecture has been designed following the two-level modeling paradigm, a promising...

  8. BIMS: Biomedical Information Management System

    OpenAIRE

    Mora Pérez, Oscar

    2009-01-01

    This final year project presents the design principles and prototype implementation of BIMS (Biomedical Information Management System), a flexible software system which provides an infrastructure to manage all information required by biomedical research projects. The BIMS project was initiated with the motivation to solve several limitations in medical data acquisition of some research projects in which Universitat Pompeu Fabra takes part. These limitations, based on the lack of control mechan...

  9. Enhancing biomedical text summarization using semantic relation extraction.

    Directory of Open Access Journals (Sweden)

    Yue Shang

    Automatic text summarization for a biomedical concept can help researchers to get the key points of a certain topic from the large amount of biomedical literature efficiently. In this paper, we present a method for generating a text summary for a given biomedical concept, e.g., H1N1 disease, from multiple documents based on semantic relation extraction. Our approach includes three stages: 1) we extract semantic relations in each sentence using the semantic knowledge representation tool SemRep; 2) we develop a relation-level retrieval method to select the relations most relevant to each query concept and visualize them in a graphic representation; 3) for relations in the relevant set, we extract informative sentences that can interpret them from the document collection to generate a text summary using an information retrieval based method. Our major focus in this work is to investigate the contribution of semantic relation extraction to the task of biomedical text summarization. The experimental results on summarization for a set of diseases show that the introduction of semantic knowledge improves the performance, and our results are better than those of the MEAD system, a well-known tool for text summarization.
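
    Stage 2 (relation-level retrieval) can be pictured as scoring extracted subject-predicate-object triples against the query concept. The sketch below uses plain string overlap in place of SemRep's UMLS-based matching; the triples are hand-written stand-ins for SemRep output.

```python
# Sketch of relation-level retrieval: rank extracted relations by their
# relevance to a query concept. SemRep output is mimicked with
# hand-written (subject, predicate, object) triples.
relations = [
    ("H1N1 virus", "CAUSES", "influenza"),
    ("oseltamivir", "TREATS", "H1N1 infection"),
    ("aspirin", "TREATS", "headache"),
]

def score(relation, query):
    subj, _, obj = relation
    q = query.lower()
    return sum(q in field.lower() for field in (subj, obj))

query = "H1N1"
ranked = sorted(relations, key=lambda r: score(r, query), reverse=True)
for r in ranked:
    if score(r, query) > 0:
        print(r)
```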

  10. A Unified Active Learning Framework for Biomedical Relation Extraction

    Institute of Scientific and Technical Information of China (English)

    Hong-Tao Zhang; Min-Lie Huang; Xiao-Yan Zhu

    2012-01-01

    Supervised machine learning methods have been employed with great success in the task of biomedical relation extraction. However, existing methods are not practical enough, since manual construction of large training data is very expensive. Therefore, active learning is urgently needed for designing practical relation extraction methods with little human effort. In this paper, we describe a unified active learning framework. Particularly, our framework systematically addresses some practical issues during the active learning process, including a strategy for selecting informative data, a data diversity selection algorithm, an active feature acquisition method, and an informative feature selection algorithm, in order to meet the challenges due to the immense amount of complex and diverse biomedical text. The framework is evaluated on protein-protein interaction (PPI) extraction and is shown to achieve promising results with a significant reduction in editorial effort and labeling time.
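
    The framework combines several selection strategies; the sketch below shows only its most basic ingredient, pool-based uncertainty sampling, with synthetic features and a logistic-regression stand-in for the relation classifier. The diversity, feature acquisition and feature selection components described above are omitted.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Minimal pool-based uncertainty sampling loop (one ingredient of the
# framework above; diversity and feature acquisition are omitted).
rng = np.random.default_rng(0)
X_pool = rng.normal(size=(200, 10))           # stand-in feature vectors
y_pool = (X_pool[:, 0] + X_pool[:, 1] > 0).astype(int)

# Seed the labeled set with one example of each class.
labeled = [int(np.argmax(y_pool)), int(np.argmin(y_pool))]
unlabeled = [i for i in range(200) if i not in labeled]

for _ in range(10):                           # 10 rounds of querying
    clf = LogisticRegression().fit(X_pool[labeled], y_pool[labeled])
    probs = clf.predict_proba(X_pool[unlabeled])[:, 1]
    idx = int(np.argmin(np.abs(probs - 0.5)))  # most uncertain example
    labeled.append(unlabeled.pop(idx))         # "annotate" it

print(f"labeled {len(labeled)} examples after active learning")
```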

  11. Mining biomedical images towards valuable information retrieval in biomedical and life sciences.

    Science.gov (United States)

    Ahmed, Zeeshan; Zeeshan, Saman; Dandekar, Thomas

    2016-01-01

    Biomedical images are helpful sources for scientists and practitioners in drawing significant hypotheses, exemplifying approaches and describing experimental results in published biomedical literature. In recent decades, there has been an enormous increase in the amount of heterogeneous biomedical image production and publication, which has created a need for bioimaging platforms that extract and analyse text and content in biomedical images in order to implement effective information retrieval systems. In this review, we summarize technologies related to data mining of figures. We describe and compare the potential of different approaches in terms of their developmental aspects, used methodologies, produced results, achieved accuracies and limitations. Our comparative conclusions include current challenges for bioimaging software with selective image mining, embedded text extraction and processing of complex natural language queries. PMID:27538578

  12. Document Exploration and Automatic Knowledge Extraction for Unstructured Biomedical Text

    Science.gov (United States)

    Chu, S.; Totaro, G.; Doshi, N.; Thapar, S.; Mattmann, C. A.; Ramirez, P.

    2015-12-01

    We describe our work on building a web-browser based document reader with a built-in exploration tool and automatic concept extraction of medical entities for biomedical text. Vast amounts of biomedical information are offered in unstructured text form through scientific publications and R&D reports. Utilizing text mining can help us to mine information and extract relevant knowledge from a plethora of biomedical text. The ability to employ such technologies to aid researchers in coping with information overload is greatly desirable. In recent years, there has been an increased interest in automatic biomedical concept extraction [1, 2] and intelligent PDF reader tools with the ability to search on content and find related articles [3]. Such reader tools are typically desktop applications and are limited to specific platforms. Our goal is to provide researchers with a simple tool to aid them in finding, reading, and exploring documents. Thus, we propose a web-based document explorer, which we call Shangri-Docs, which combines a document reader with automatic concept extraction and highlighting of relevant terms. Shangri-Docs also provides the ability to evaluate a wide variety of document formats (e.g., PDF, Word, PPT, text) and to exploit the linked nature of the Web and personal content by performing searches on content from public sites (e.g., Wikipedia, PubMed) and private cataloged databases simultaneously. Shangri-Docs utilizes Apache cTAKES (clinical Text Analysis and Knowledge Extraction System) [4] and the Unified Medical Language System (UMLS) to automatically identify and highlight terms and concepts, such as specific symptoms, diseases, drugs, and anatomical sites, mentioned in the text. cTAKES was originally designed specifically to extract information from clinical medical records. Our investigation leads us to extend the automatic knowledge extraction process of cTAKES to the biomedical research domain by improving the ontology-guided information extraction...

  13. Cross language information retrieval for biomedical literature

    NARCIS (Netherlands)

    Schuemie, M.; Trieschnigg, D.; Kraaij, W.

    2007-01-01

    This workshop report discusses the collaborative work of UT, EMC and TNO on the TREC Genomics Track 2007. The biomedical information retrieval task is approached using cross language methods, in which biomedical concept detection is combined with effective IR based on unigram language models. Further...

  14. Enhancing Biomedical Text Summarization Using Semantic Relation Extraction

    OpenAIRE

    Yue Shang; Yanpeng Li; Hongfei Lin; Zhihao Yang

    2011-01-01

    Automatic text summarization for a biomedical concept can help researchers to get the key points of a certain topic from large amount of biomedical literature efficiently. In this paper, we present a method for generating text summary for a given biomedical concept, e.g., H1N1 disease, from multiple documents based on semantic relation extraction. Our approach includes three stages: 1) We extract semantic relations in each sentence using the semantic knowledge representation tool SemRep. 2) W...

  15. A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set.

    Science.gov (United States)

    Muzaffar, Abdul Wahab; Azam, Farooque; Qamar, Usman

    2015-01-01

    Information extraction from unstructured text segments is a complex task. Although manual information extraction often produces the best results, it is harder to manage biomedical data extraction manually because of the exponential increase in data size. Thus, there is a need for automatic tools and techniques for information extraction in biomedical text mining. Relation extraction is a significant area under biomedical information extraction that has gained much importance in the last two decades. A lot of work has been done on biomedical relation extraction focusing on rule-based and machine learning techniques. In the last decade, the focus has changed to hybrid approaches, which show better results. This research presents a hybrid feature set for classification of relations between biomedical entities. The main contribution of this research lies in the semantic feature set, where verb phrases are ranked using the Unified Medical Language System (UMLS) and a ranking algorithm. Support Vector Machine and Naïve Bayes, two effective machine learning techniques, are used to classify these relations. Our approach has been validated on the standard biomedical text corpus obtained from MEDLINE 2001. In conclusion, our framework outperforms all state-of-the-art approaches used for relation extraction on the same corpus.

  16. A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set

    Directory of Open Access Journals (Sweden)

    Abdul Wahab Muzaffar

    2015-01-01

    Information extraction from unstructured text segments is a complex task. Although manual information extraction often produces the best results, it is harder to manage biomedical data extraction manually because of the exponential increase in data size. Thus, there is a need for automatic tools and techniques for information extraction in biomedical text mining. Relation extraction is a significant area under biomedical information extraction that has gained much importance in the last two decades. A lot of work has been done on biomedical relation extraction focusing on rule-based and machine learning techniques. In the last decade, the focus has changed to hybrid approaches, which show better results. This research presents a hybrid feature set for classification of relations between biomedical entities. The main contribution of this research lies in the semantic feature set, where verb phrases are ranked using the Unified Medical Language System (UMLS) and a ranking algorithm. Support Vector Machine and Naïve Bayes, two effective machine learning techniques, are used to classify these relations. Our approach has been validated on the standard biomedical text corpus obtained from MEDLINE 2001. In conclusion, our framework outperforms all state-of-the-art approaches used for relation extraction on the same corpus.
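
    A skeletal version of the classification step: each candidate relation becomes a hybrid feature vector in which a hand-supplied semantic score stands in for the UMLS-based verb-phrase ranking, and an SVM does the classification. The features, sentences and scores are all hypothetical.

```python
import numpy as np
from sklearn.svm import SVC

# Skeleton of the relation classification step. Each candidate is
# represented by a hybrid feature vector; the "semantic score" column
# stands in for the UMLS-based verb-phrase ranking described above.
def features(sentence, verb, semantic_score):
    tokens = sentence.lower().split()
    return [
        len(tokens),                                   # lexical: length
        tokens.index(verb) if verb in tokens else -1,  # verb position
        semantic_score,                                # semantic score
    ]

X = np.array([
    features("aspirin treats headache", "treats", 0.9),
    features("gene expressed in liver tissue", "expressed", 0.7),
    features("samples were frozen overnight", "frozen", 0.1),
])
y = [1, 1, 0]  # 1 = sentence expresses a biomedical relation

clf = SVC(kernel="rbf").fit(X, y)
print(clf.predict([features("ibuprofen treats fever", "treats", 0.85)]))
```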

  17. Electromembrane extraction for pharmaceutical and biomedical analysis

    DEFF Research Database (Denmark)

    Huang, Chuixiu; Seip, Knut Fredrik; Gjelstad, Astrid;

    2015-01-01

    The present paper discusses recent developments in electromembrane extraction (EME). The paper focuses on the principles of EME, and discusses how to optimize operational parameters. In addition, pharmaceutical and biomedical applications of EME are reviewed, with emphasis on basic drugs, acidic drugs, amino acids, and peptides. Finally...

  18. An efficient semi-blind source extraction algorithm and its applications to biomedical signal extraction

    Institute of Scientific and Technical Information of China (English)

    YE YaLan; SHEU Phillip C-Y; ZENG JiaZhi; WANG Gang; LU Ke

    2009-01-01

    In many applications, such as biomedical engineering, it is often required to extract a desired signal instead of all source signals. This can be achieved by blind source extraction (BSE) or semi-blind source extraction, which is a powerful technique emerging from the neural network field. In this paper, we propose an efficient semi-blind source extraction algorithm to extract a desired source signal as its first output signal by using a priori information about its kurtosis range. The algorithm is robust to outliers and spiky noise because it adopts a classical robust contrast function. It is also robust to estimation errors in the kurtosis range of the desired signal, provided the estimation errors are not large. The algorithm has good extraction performance, even in some poor situations when the kurtosis values of some source signals are very close to each other. Its convergence stability and robustness are theoretically analyzed. Simulations and experiments on artificially generated data and real-world data have confirmed these results.
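
    The sketch below is not the authors' robust algorithm; it shows only the core idea behind kurtosis-based extraction of a single source, using the classical fixed-point update on whitened mixtures. The paper adds a robust contrast function and a kurtosis-range prior on top of this idea.

```python
import numpy as np

# Classical kurtosis-based extraction of one source from whitened
# mixtures (FastICA-style fixed point); a sketch of the core idea only.
rng = np.random.default_rng(1)
n = 5000
s1 = rng.laplace(size=n)          # super-Gaussian (high-kurtosis) source
s2 = rng.uniform(-1, 1, size=n)   # sub-Gaussian source
X = np.array([[0.6, 0.4], [0.4, 0.6]]) @ np.vstack([s1, s2])

# Whiten the mixtures.
X = X - X.mean(axis=1, keepdims=True)
d, E = np.linalg.eigh(np.cov(X))
Z = E @ np.diag(d ** -0.5) @ E.T @ X

w = rng.normal(size=2)
w /= np.linalg.norm(w)
for _ in range(100):
    y = w @ Z
    w_new = (Z * y ** 3).mean(axis=1) - 3 * w   # kurtosis fixed point
    w = w_new / np.linalg.norm(w_new)

# The extracted signal has extremal kurtosis; with this mix it is
# typically the Laplacian source (excess kurtosis about 3).
y = w @ Z
print("excess kurtosis of extracted signal:", (y ** 4).mean() - 3)
```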

  1. DeTEXT: A Database for Evaluating Text Extraction from Biomedical Literature Figures.

    Directory of Open Access Journals (Sweden)

    Xu-Cheng Yin

    Hundreds of millions of figures are available in the biomedical literature, representing important biomedical experimental evidence. Since text is a rich source of information in figures, automatically extracting such text may assist in the task of mining figure information. A high-quality ground truth standard can greatly facilitate the development of an automated system. This article describes DeTEXT: a database for evaluating text extraction from biomedical literature figures. It is the first publicly available, human-annotated, high-quality, and large-scale figure-text dataset with 288 full-text articles, 500 biomedical figures, and 9308 text regions. This article describes how figures were selected from open-access full-text biomedical articles and how annotation guidelines and annotation tools were developed. We also discuss the inter-annotator agreement and the reliability of the annotations. We summarize the statistics of the DeTEXT data and make available evaluation protocols for DeTEXT. Finally, we lay out the challenges we observed in the automated detection and recognition of figure text and discuss research directions in this area. DeTEXT is publicly available for downloading at http://prir.ustb.edu.cn/DeTEXT/.

  2. An unsupervised text mining method for relation extraction from biomedical literature.

    Directory of Open Access Journals (Sweden)

    Changqin Quan

    The wealth of interaction information provided in biomedical articles motivated the implementation of text mining approaches to automatically extract biomedical relations. This paper presents an unsupervised method based on pattern clustering and sentence parsing to deal with biomedical relation extraction. The pattern clustering algorithm is based on the polynomial kernel method, which identifies interaction words from unlabeled data; these interaction words are then used in relation extraction between entity pairs. Dependency parsing and phrase structure parsing are combined for relation extraction. Based on the semi-supervised KNN algorithm, we extend the proposed unsupervised approach to a semi-supervised approach by combining pattern clustering, dependency parsing and phrase structure parsing rules. We evaluated the approaches on two different tasks: (1) protein-protein interaction extraction, and (2) gene-suicide association extraction. The evaluation of task (1) on the benchmark dataset (the AImed corpus) showed that our proposed unsupervised approach outperformed three supervised methods, which are rule-based, SVM-based, and kernel-based, respectively. The proposed semi-supervised approach is superior to the existing semi-supervised methods. The evaluation of gene-suicide association extraction on a smaller dataset from the Genetic Association Database and a larger dataset from publicly available PubMed showed that the proposed unsupervised and semi-supervised methods achieved much higher F-scores than the co-occurrence based method.
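
    The clustering step above hinges on a polynomial kernel as the similarity measure between candidate interaction words. A minimal sketch with toy co-occurrence vectors, computing K(x, y) = (x . y + c)^d:

```python
import numpy as np

# Polynomial kernel similarity between context vectors of candidate
# interaction words; the count vectors below are hypothetical.
def poly_kernel(x, y, c=1.0, d=2):
    return (np.dot(x, y) + c) ** d

binds     = np.array([3, 1, 0, 2])   # toy co-occurrence counts
interacts = np.array([2, 1, 0, 3])
frozen    = np.array([0, 0, 4, 0])

print(poly_kernel(binds, interacts))  # high: similar contexts (196.0)
print(poly_kernel(binds, frozen))     # low: dissimilar contexts (1.0)
```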

  3. Information extraction system

    Science.gov (United States)

    Lemmond, Tracy D; Hanley, William G; Guensche, Joseph Wendell; Perry, Nathan C; Nitao, John J; Kidwell, Paul Brandon; Boakye, Kofi Agyeman; Glaser, Ron E; Prenger, Ryan James

    2014-05-13

    An information extraction system and methods of operating the system are provided. In particular, an information extraction system for performing meta-extraction of named entities of people, organizations, and locations, as well as relationships and events, from text documents is described herein.

  4. BRONCO: Biomedical entity Relation ONcology COrpus for extracting gene-variant-disease-drug relations.

    Science.gov (United States)

    Lee, Kyubum; Lee, Sunwon; Park, Sungjoon; Kim, Sunkyu; Kim, Suhkyung; Choi, Kwanghun; Tan, Aik Choon; Kang, Jaewoo

    2016-01-01

    Comprehensive knowledge of genomic variants in a biological context is key for precision medicine. As next-generation sequencing technologies improve, the amount of literature containing genomic variant data, such as new functions or related phenotypes, rapidly increases. Because numerous articles are published every day, it is almost impossible to manually curate all the variant information from the literature. Many researchers focus on creating an improved automated biomedical natural language processing (BioNLP) method that extracts useful variants and their functional information from the literature. However, there is no gold-standard data set that contains texts annotated with variants and their related functions. To overcome these limitations, we introduce a Biomedical entity Relation ONcology COrpus (BRONCO) that contains more than 400 variants and their relations with genes, diseases, drugs and cell lines in the context of cancer and anti-tumor drug screening research. The variants and their relations were manually extracted from 108 full-text articles. BRONCO can be utilized to evaluate and train new methods used for extracting biomedical entity relations from full-text publications, and thus be a valuable resource to the biomedical text mining research community. Using BRONCO, we quantitatively and qualitatively evaluated the performance of three state-of-the-art BioNLP methods. We also identified their shortcomings, and suggested remedies for each method. We implemented post-processing modules for the three BioNLP methods, which improved their performance. Database URL: http://infos.korea.ac.kr/bronco. PMID:27074804

  5. Utilization of ontology look-up services in information retrieval for biomedical literature.

    Science.gov (United States)

    Vishnyakova, Dina; Pasche, Emilie; Lovis, Christian; Ruch, Patrick

    2013-01-01

    With the vast amount of biomedical data, we face the necessity to improve information retrieval processes in the biomedical domain. The use of biomedical ontologies has facilitated the combination of various data sources (e.g., scientific literature, clinical data repositories) by increasing the quality of information retrieval and reducing maintenance efforts. In this context, we developed Ontology Look-up Services (OLS), based on the NEWT and MeSH vocabularies. Our services were involved in information retrieval tasks such as gene/disease normalization. The implementation of the OLS services significantly accelerated the extraction of particular biomedical facts by structuring and enriching the data context. Precision in the normalization tasks was boosted by about 20%.

  6. Classifying and identifying servers for biomedical information retrieval.

    OpenAIRE

    Patrick, T. B.; Springer, G. K.

    1994-01-01

    Useful retrieval of biomedical information from network information sources requires methods for organized access to those information sources. This access must be organized in terms of the information content of information sources and in terms of the discovery of the network location of those information sources. We have developed an approach to providing organized access to information sources based on a scheme of hierarchical classifiers and identifiers of the servers providing access to ...

  7. Multimedia Information Extraction

    CERN Document Server

    Maybury, Mark T

    2012-01-01

    The advent of increasingly large consumer collections of audio (e.g., iTunes), imagery (e.g., Flickr), and video (e.g., YouTube) is driving a need not only for multimedia retrieval but also information extraction from and across media. Furthermore, industrial and government collections fuel requirements for stock media access, media preservation, broadcast news retrieval, identity management, and video surveillance. While significant advances have been made in language processing for information extraction from unstructured multilingual text and extraction of objects from imagery and vid...

  8. Biomedical and Environmental Sciences INFORMATION FOR AUTHORS

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    Biomedical and Environmental Sciences, an international journal with emphasis on scientific findings in China, publishes articles dealing with biologic and toxic effects of environmental pollutants on man and other forms of life. The effects may be measured with pharmacological, biochemical, pathological, and immunological techniques. The journal also publishes reports dealing with the entry, transport, and fate of natural and anthropogenic chemicals in the biosphere, and their impact on human health and well-being. Papers describing biochemical, pharmacological, pathological, toxicological and immunological studies of pharmaceuticals (biotechnological products) are also welcome.

  9. Use of controlled vocabularies to improve biomedical information retrieval tasks.

    Science.gov (United States)

    Pasche, Emilie; Gobeill, Julien; Vishnyakova, Dina; Ruch, Patrick; Lovis, Christian

    2013-01-01

    The high heterogeneity of biomedical vocabulary is a major obstacle for information retrieval in large biomedical collections, so using biomedical controlled vocabularies is crucial for managing these contents. We investigate the impact of query expansion based on controlled vocabularies on the effectiveness of two search engines. Our strategy relies on the enrichment of users' queries with additional terms directly derived from such vocabularies, applied to infectious diseases and chemical patents. We observed that query expansion based on pathogen names improved the top-precision of our first search engine, while the normalization of diseases degraded it. The expansion of chemical entities, performed on the second search engine, positively affected the mean average precision. We have shown that query expansion of some types of biomedical entities has great potential to improve search effectiveness; fine-tuning of query expansion strategies could therefore help improve the performance of search engines.
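
    A minimal sketch of vocabulary-driven query expansion: a user query is enriched with synonyms drawn from a controlled vocabulary before being sent to the search engine. The two-entry vocabulary here is hypothetical.

```python
# Minimal sketch of controlled-vocabulary query expansion; the tiny
# vocabulary below is hypothetical.
VOCAB = {
    "mrsa": ["methicillin-resistant Staphylococcus aureus"],
    "tb":   ["tuberculosis", "Mycobacterium tuberculosis"],
}

def expand(query: str) -> str:
    terms = [query]
    for token in query.lower().split():
        terms.extend(VOCAB.get(token, []))
    return " OR ".join(terms)

print(expand("tb treatment"))
# tb treatment OR tuberculosis OR Mycobacterium tuberculosis
```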

  10. Integration and Querying of Genomic and Proteomic Semantic Annotations for Biomedical Knowledge Extraction.

    Science.gov (United States)

    Masseroli, Marco; Canakoglu, Arif; Ceri, Stefano

    2016-01-01

    Understanding complex biological phenomena involves answering complex biomedical questions about multiple types of biomolecular information simultaneously, which are expressed through multiple genomic and proteomic semantic annotations scattered across many distributed and heterogeneous data sources; such heterogeneity and dispersion hamper biologists' ability to ask global queries and perform global evaluations. To overcome this problem, we developed a software architecture to create and maintain a Genomic and Proteomic Knowledge Base (GPKB), which integrates several of the most relevant sources of such dispersed information (including Entrez Gene, UniProt, IntAct, Expasy Enzyme, GO, GOA, BioCyc, KEGG, Reactome, and OMIM). Our solution is general, as it uses a flexible, modular, and multilevel global data schema based on abstraction and generalization of integrated data features, and a set of automatic procedures for easing data integration and maintenance, also when the integrated data sources evolve in data content, structure, and number. These procedures also assure consistency, quality, and provenance tracking of all integrated data, and perform the semantic closure of the hierarchical relationships of the integrated biomedical ontologies. At http://www.bioinformatics.deib.polimi.it/GPKB/, a Web interface allows easy graphical composition of queries, even complex ones, over the knowledge base, supporting also semantic query expansion and comprehensive explorative search of the integrated data to better sustain biomedical knowledge extraction. PMID:27045824

  11. Early progress of the Biomedical Computing Technology Information Center (BCTIC)

    International Nuclear Information System (INIS)

    Through five years of effort by the Society of Nuclear Medicine Computer Committee, the Biomedical Computing Technology Information Center (BCTIC) was established by the Division of Biomedical and Environmental Research (DBER) of the U.S. Energy Research and Development Administration (ERDA) at the Oak Ridge National Laboratory in July of 1975. BCTIC forged ahead with the tasks of designing guidelines and procedures; acquiring and packaging computer codes, data, and interface designs; building a bibliographic database; and maintaining a directory of the user community. Important contacts were made with societies and individuals involved in biomedical computing, and BCTIC was publicized through news releases, the BCTIC newsletter (bimonthly since October 1975), presentations at meetings, and personal contacts. This paper presents the response BCTIC has received in its initial months, gives a progress report on the developmental phase, and takes a look at the future of BCTIC as a national technology resource in nuclear medicine computing.

  12. PIMiner: A web tool for extraction of protein interactions from biomedical literature

    KAUST Repository

    Chowdhary, Rajesh

    2013-01-01

    Information on Protein Interactions (PIs) is valuable for biomedical research, but often lies buried in the scientific literature and cannot be readily retrieved. While much progress has been made over the years in extracting PIs from the literature using computational methods, there is a lack of free, public, user-friendly tools for the discovery of PIs. We developed an online tool for the extraction of PI relationships from PubMed abstracts, which we name PIMiner. Protein pairs and the words that describe their interactions are reported by PIMiner so that new interactions can be easily detected within text. The interaction likelihood levels are reported too. The option to extract only specific types of interactions is also provided. The PIMiner server can be accessed through a web browser or remotely through a client's command line. PIMiner can process 50,000 PubMed abstracts in approximately 7 min and thus appears suitable for large-scale processing of biological/biomedical literature. Copyright © 2013 Inderscience Enterprises Ltd.

  13. Information Retrieval Systems Adapted to the Biomedical Domain

    CERN Document Server

    Marrero, Mónica; Urbano, Julián; Morato, Jorge; Moreiro, José-Antonio. DOI: 10.3145/epi.2010.may.04

    2012-01-01

    The terminology used in Biomedicine shows lexical peculiarities that have required the elaboration of terminological resources and information retrieval systems with specific functionalities. The main characteristics are the high rates of synonymy and homonymy, due to phenomena such as the proliferation of polysemic acronyms and their interaction with common language. Information retrieval systems in the biomedical domain use techniques oriented to the treatment of these lexical peculiarities. In this paper we review some of the techniques used in this domain, such as the application of Natural Language Processing (BioNLP), the incorporation of lexical-semantic resources, and the application of Named Entity Recognition (BioNER). Finally, we present the evaluation methods adopted to assess the suitability of these techniques for retrieving biomedical resources.

  14. [The system of biomedical scientific information of Serbia].

    Science.gov (United States)

    Dacić, M

    1995-09-01

    Building of the System of Biomedical Scientific Information of Yugoslavia (SBMSI YU) began by the end of 1980, and the system officially became operative in 1986. After the political disintegration of the former Yugoslavia, the SBMSI of Serbia was formed. SBMSI is developed according to the development policy of the System of Scientific and Technologic Information of Serbia (SSTI S), and with the technical support of SSTI S. Reconstruction of the system used the former SBMSI YU as a model. Unlike the former SBMSI YU, SBMSI S holds, besides the database Biomedicina Serbica, three important databases: a database of doctoral dissertations promoted at the University Medical School in Belgrade from 1955-1993; a database of Master's theses promoted at the University School of Medicine in Belgrade from 1965-1993; and a database of foreign biomedical periodicals in the libraries of Serbia.

  15. Challenges in Managing Information Extraction

    Science.gov (United States)

    Shen, Warren H.

    2009-01-01

    This dissertation studies information extraction (IE), the problem of extracting structured information from unstructured data. Example IE tasks include extracting person names from news articles, product information from e-commerce Web pages, street addresses from emails, and names of emerging music bands from blogs. IE is an increasingly…

  16. Popularizing biomedical information on an online health forum

    Directory of Open Access Journals (Sweden)

    Patrizia Anesa

    2015-04-01

    Given the increasing importance assumed by online health forums as a form of doctor-patient communication, this study describes the types of dialogic and polylogic interactions that develop in this type of communicative event, focusing on a corpus of threads drawn from an "Ask a Doctor" forum in the area of cardiology. Adopting a discourse analytical approach, the investigation illustrates how these forums may popularize biomedical knowledge. After describing the shifting conceptualization of popularization, we present a new conceptualization of Web 2.0 popularization which we call "oblique popularization", as it is indirect, user-generated, dialogic and polylogic, and targeted. In particular, this study characterizes the online health forum as a Web 2.0-style popularization tool. It does so not only because biomedical information communicated to a single inquirer is also available to the whole population, but also because the function of the online forum as a medium of popularization is discursively acknowledged by the participants. The study also explores what explanatory tools (such as definitions, analogies, exemplifications, and generalizations) are used by the experts to present complex or technical information. The analysis shows that biomedical information is circulated on a health forum by a complex network of participants, and that e-patients' posts may also serve as prompters of popularization.

  17. Organization and Extraction for Information

    Directory of Open Access Journals (Sweden)

    Kuang-hua Chen

    1997-12-01

    The development of the Internet has brought rapid change to research on information retrieval. In practice, so-called "information retrieval" is "text retrieval": users still have to find the needed information within the retrieved texts. A higher-level task is information extraction, which extracts information based on pre-defined templates. From the viewpoint of library science, these pre-defined templates are the metadata that commonly describe library collections. This paper discusses the relationships between metadata and information extraction and how natural language processing helps the task of information extraction. [Article content in Chinese]

  18. Information-theoretic evaluation for computational biomedical ontologies

    CERN Document Server

    Clark, Wyatt Travis

    2014-01-01

    The development of effective methods for the prediction of ontological annotations is an important goal in computational biology, yet evaluating their performance is difficult due to problems caused by the structure of biomedical ontologies and incomplete annotations of genes. This work proposes an information-theoretic framework to evaluate the performance of computational protein function prediction. A Bayesian network, structured according to the underlying ontology, is used to model the prior probability of a protein's function. The concepts of misinformation and remaining uncertainty are...
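
    For orientation, the central quantities of this style of evaluation (information accretion, remaining uncertainty and misinformation) can be written as follows. This is stated from related published work on the framework, so consult the source for the exact formulation:

```latex
% Remaining uncertainty (ru) and misinformation (mi) for a true annotation
% graph T and a predicted annotation graph P, both consistent subgraphs of
% the ontology; ia(v) is the information accretion of ontology node v.
\[
  ia(v) = -\log \Pr\bigl(v \mid \mathrm{parents}(v)\bigr)
\]
\[
  ru(T, P) = \sum_{v \in T \setminus P} ia(v), \qquad
  mi(T, P) = \sum_{v \in P \setminus T} ia(v)
\]
```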

  19. Knowledge-based method for determining the meaning of ambiguous biomedical terms using information content measures of similarity.

    Science.gov (United States)

    McInnes, Bridget T; Pedersen, Ted; Liu, Ying; Melton, Genevieve B; Pakhomov, Serguei V

    2011-01-01

    In this paper, we introduce a novel knowledge-based word sense disambiguation method that determines the sense of an ambiguous word in biomedical text using semantic similarity or relatedness measures. These measures quantify the degree of similarity between concepts in the Unified Medical Language System (UMLS). The objective of this work was to develop a method that can disambiguate terms in biomedical text by exploiting similarity information extracted from the UMLS and to evaluate the efficacy of information content-based semantic similarity measures, which augment path-based information with probabilities derived from biomedical corpora. We show that information content-based measures obtain a higher disambiguation accuracy than path-based measures because they weight the path based on where it exists in the taxonomy coupled with the probability of the concepts occurring in a corpus of text.
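
    A toy sketch of the knowledge-based disambiguation idea: among candidate senses of an ambiguous term, pick the one most similar to the concepts in the surrounding context. The similarity table below is hypothetical; the paper derives such scores from the UMLS with information content-based measures.

```python
# Similarity-based word sense disambiguation sketch; the similarity
# table is hypothetical (real scores would come from UMLS measures).
SIM = {
    ("cold_temperature", "weather"): 0.8,
    ("cold_temperature", "infection"): 0.1,
    ("common_cold", "weather"): 0.2,
    ("common_cold", "infection"): 0.9,
}

def disambiguate(senses, context_concepts):
    def avg_sim(sense):
        scores = [SIM.get((sense, c), 0.0) for c in context_concepts]
        return sum(scores) / len(scores)
    return max(senses, key=avg_sim)

print(disambiguate(["cold_temperature", "common_cold"], ["infection"]))
# common_cold
```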

  20. Organization and Extraction for Information

    OpenAIRE

    Kuang-hua Chen

    1997-01-01

    The development of Internet makes the researches on information retrieval more changeable. Actually, the so-called "information retrieval" is "text retrieval." It is necessary for users to find out the needed information from the retrieved texts. A higher-level task is information extraction, which extracts the information based on pre-defined templates. From the viewpoint of Library Science, these pre-defined templates are the metadata, which describes the collection of libraries in common. ...

  1. Information bottleneck based incremental fuzzy clustering for large biomedical data.

    Science.gov (United States)

    Liu, Yongli; Wan, Xing

    2016-08-01

    Incremental fuzzy clustering combines the advantages of fuzzy clustering and incremental clustering, and is therefore important for classifying large collections of biomedical literature. Conventional algorithms, suffering from data sparsity and high dimensionality, often fail to produce reasonable results and may even assign all the objects to a single cluster. In this paper, we propose two incremental algorithms based on the information bottleneck, Single-Pass fuzzy c-means (spFCM-IB) and Online fuzzy c-means (oFCM-IB). These two algorithms modify conventional algorithms by considering different weights for each centroid and object and scoring mutual information loss to measure the distance between centroids and objects. spFCM-IB and oFCM-IB are used to group a collection of biomedical text abstracts from the MEDLINE database. Experimental results show that the clustering performance of our approaches is better than that of such prominent counterparts as spFCM, spHFCM, oFCM and oHFCM in terms of accuracy. PMID:27260783

  2. Computing semantic similarity between biomedical concepts using new information content approach.

    Science.gov (United States)

    Ben Aouicha, Mohamed; Hadj Taieb, Mohamed Ali

    2016-02-01

    The exploitation of heterogeneous clinical sources and healthcare records is fundamental in clinical and translational research. The determination of semantic similarity between word pairs is an important component of text understanding that enables the processing and structuring of textual resources. Some of these measures have been adapted to the biomedical field by incorporating domain information extracted from clinical data or from medical ontologies such as MeSH. This study focuses on Information Content (IC) based measures that exploit the topological parameters of the taxonomy to express the semantics of a concept. A new intrinsic IC computing method is proposed that assigns an IC value to a biomedical concept in the "is a" hierarchy based on the taxonomical parameters of its ancestors' subgraph. Moreover, we present a study of the topological parameters of the MeSH taxonomy, treating their semantic interpretation and the different ways of expressing the parameters of depth and the descendants' subgraph. Using MeSH as the input ontology, the accuracy of our proposal is evaluated and compared against other IC-based measures according to several widely-used benchmarks of biomedical terms. The correlation between the results obtained for the evaluated measure using the proposed approach and the ratings of human experts shows that our proposal outperforms the previous measures. PMID:26707454
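
    For contrast, a well-known earlier intrinsic IC formulation (due to Seco and colleagues) depends only on the descendants' subgraph; the measure proposed above additionally exploits the ancestors' subgraph. Shown for orientation only:

```latex
% A classic intrinsic IC formulation driven purely by the descendants'
% subgraph; hypo(c) is the set of descendants of concept c and C is the
% set of all concepts in the taxonomy.
\[
  IC_{\mathrm{intrinsic}}(c)
    = 1 - \frac{\log\bigl(\lvert \mathrm{hypo}(c)\rvert + 1\bigr)}
               {\log\bigl(\lvert \mathcal{C} \rvert\bigr)}
\]
```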

  3. Properties of herbal extracts against Propionibacterium acnes for biomedical application

    Science.gov (United States)

    Lim, Youn-Mook; Kim, Sung Eun; Kim, Yong Soo; Shin, Young Min; Jeong, Sung In; Jo, Sun-Young; Gwon, Hui-Jeong; Park, Jong-seok; Nho, Young-Chang; Kim, Jong-Cheol; Kim, Seong-Jang; Shin, HeungSoo

    2012-10-01

    Propionibacterium acnes (P. acnes), an anaerobic bacterium, causes inflammatory acne. To find a novel medication for treating the inflammation caused by P. acnes, we investigated the anti-bacterial and anti-inflammatory activities of several herbal extracts against P. acnes. The aqueous extracts from five dried herbs, Phellodendron amurense Rupr., Paeonia lactiflora Pallas., Houttuynia cordata Thunb., Agrimonia pilosa Ledeb. and Glycyrrhiza uralensis Fisch., were prepared and mixed. In this experiment, 1 mg/ml of the herbal extract mixture caused a decrease in the growth of P. acnes and reduced the production of the pro-inflammatory cytokines TNF-α, IL-8, IL-1β and IL-6 in human monocytic THP-1 cells treated with heat-killed P. acnes. Therefore, this herbal extract mixture may possess both anti-bacterial and anti-inflammatory activities against P. acnes and can be a novel therapeutic agent for treating inflammatory acne.

  4. Extraction of semantic biomedical relations from text using conditional random fields

    Directory of Open Access Journals (Sweden)

    Stetter Martin

    2008-04-01

    Background: The increasing amount of published literature in biomedicine represents an immense source of knowledge, which can only be accessed efficiently by a new generation of automated information extraction tools. Named entity recognition of well-defined objects, such as genes or proteins, has achieved a sufficient level of maturity such that it can form the basis for the next step: the extraction of relations that exist between the recognized entities. Whereas most early work focused on the mere detection of relations, the classification of the type of relation is also of great importance, and this is the focus of this work. In this paper we describe an approach that extracts both the existence of a relation and its type. Our work is based on Conditional Random Fields, which have been applied with much success to the task of named entity recognition. Results: We benchmark our approach on two different tasks. The first task is the identification of semantic relations between diseases and treatments. The available data set consists of manually annotated PubMed abstracts. The second task is the identification of relations between genes and diseases from a set of concise phrases, so-called GeneRIF (Gene Reference Into Function) phrases. In our experimental setting, we do not assume that the entities are given, as is often the case in previous relation extraction work; rather, the extraction of the entities is solved as a subproblem. Compared with other state-of-the-art approaches, we achieve very competitive results on both data sets. To demonstrate the scalability of our solution, we apply our approach to the complete human GeneRIF database. The resulting gene-disease network contains 34758 semantic associations between 4939 genes and 1745 diseases. The gene-disease network is publicly available as a machine-readable RDF graph. Conclusion: We extend the framework of Conditional Random Fields towards the annotation of semantic relations from...
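
    A small sketch of CRF-based sequence labeling in the spirit of this approach, using the sklearn-crfsuite package. The token features, labels and two toy training sentences are hypothetical; a real system would label relation arguments over fully parsed sentences rather than three-word toys.

```python
# CRF sequence labeling sketch with sklearn-crfsuite; the feature set
# and the toy training data are hypothetical.
import sklearn_crfsuite

def token_features(tokens, i):
    return {
        "word": tokens[i].lower(),
        "is_title": tokens[i].istitle(),
        "prev": tokens[i - 1].lower() if i > 0 else "<s>",
        "next": tokens[i + 1].lower() if i < len(tokens) - 1 else "</s>",
    }

sents = [["Aspirin", "treats", "headache"],
         ["Metformin", "treats", "diabetes"]]
labels = [["DRUG", "REL-TREAT", "DISEASE"],
          ["DRUG", "REL-TREAT", "DISEASE"]]

X = [[token_features(s, i) for i in range(len(s))] for s in sents]
crf = sklearn_crfsuite.CRF(algorithm="lbfgs", max_iterations=50)
crf.fit(X, labels)
print(crf.predict([[token_features(["Ibuprofen", "treats", "fever"], i)
                    for i in range(3)]]))
```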

  5. Extracting information from noise

    OpenAIRE

    Baptie, Brian

    2010-01-01

    Seismic waves created by sources such as wind, the ocean and human activity propagate inside the Earth all the time. Such waves are often regarded as ‘noise’ by seismologists, however, they travel through the Earth in exactly the same way as those waves from earthquakes. Recent advances in theory have shown that ambient noise recorded at two seismic stations can be combined to provide information about the properties of the Earth between the two stations. This is known as seism...

  6. Informed consent among nursing students participating in biomedical research.

    Science.gov (United States)

    Nambiar, Anupama; Christopher, D J; Mammen, Joy; David, Thambu; Kang, Gagandeep; David, Shirley

    2012-01-01

    For consent in biomedical research, it is essential that research participants understand the need for research, the study protocol, the risks and benefits of participation, the freedom to participate or decline, and the right to leave the study at any time. A structured questionnaire was used to assess understanding and knowledge among nursing trainees participating in a cohort study investigating exposure and latent tuberculosis at a tertiary care hospital. Data were collected for 138 participants. While 97% were aware of their enrollment in a research protocol, only 78% could state that it was a study on tuberculosis. Approximately two-thirds were aware of plans for blood collection, but not all of them knew the timing or number of samples. A majority (59%) of participants had consulted others before making the decision to participate, and only 73% felt that their participation was completely voluntary. Even among healthcare trainees, emphasis needs to be placed on testing both the knowledge and understanding of participants to ensure the principle and practice of truly informed consent. PMID:22864079

  7. Resource Disambiguator for the Web: Extracting Biomedical Resources and Their Citations from the Scientific Literature.

    Science.gov (United States)

    Ozyurt, Ibrahim Burak; Grethe, Jeffrey S; Martone, Maryann E; Bandrowski, Anita E

    2016-01-01

    The NIF Registry developed and maintained by the Neuroscience Information Framework is a cooperative project aimed at cataloging research resources, e.g., software tools, databases and tissue banks, funded largely by governments and available as tools to research scientists. Although originally conceived for neuroscience, the NIF Registry has over the years broadened in scope to include research resources of general relevance to biomedical research; this broadening led us to re-christen the NIF Registry platform as SciCrunch. The Registry currently lists over 13,000 research resources. The NIF/SciCrunch Registry has been cataloging the resource landscape since 2006; as such, it serves as a valuable dataset for tracking the breadth, fate and utilization of these resources. Our experience shows that research resources like databases are dynamic objects that can change location and scope over time. Although each record is entered manually and human-curated, the current size of the Registry requires tools that can aid curation efforts to keep content up to date, including when and where such resources are used. To address this challenge, we have developed an open source tool suite, collectively termed RDW: Resource Disambiguator for the Web. RDW is designed to help in the upkeep and curation of the Registry as well as in enhancing its content by automated extraction of resource candidates from the literature. The RDW toolkit includes a URL extractor for papers, a resource candidate screen, a resource URL change tracker, and a resource content change tracker. Curators access these tools via a web-based user interface. Several strategies are used to optimize these tools, including supervised and unsupervised learning algorithms as well as statistical text analysis. The complete tool suite is used to enhance and maintain the resource registry as well as track the usage of individual resources through an...
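
    A toy version of the first RDW component, the URL extractor for papers; the real tool must additionally handle URLs broken across line breaks, trailing punctuation variants and resource-specific heuristics.

```python
import re

# Minimal URL extractor for paper text; deliberately simple compared
# with the RDW component it illustrates.
URL = re.compile(r"https?://[^\s)\]>,;]+")

def extract_urls(text: str):
    return [u.rstrip(".") for u in URL.findall(text)]

paper_text = ("Data are available at http://example-resource.org/db. "
              "See also https://tools.example.edu/registry for details.")
print(extract_urls(paper_text))
```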

  8. Extracting useful information from images

    DEFF Research Database (Denmark)

    Kucheryavskiy, Sergey

    2011-01-01

    The paper presents an overview of methods for extracting useful information from digital images. It covers various approaches that utilize different properties of images, like intensity distribution, spatial frequencies content and several others. A few case studies, including isotropic and heterogeneous, congruent and non-congruent images, are used to illustrate how the described methods work and to compare some of them.

  9. Informed consent in dental extractions.

    Directory of Open Access Journals (Sweden)

    José Luis Capote Femenías

    2009-07-01

    Full Text Available When performing any oral intervention, particularly dental extractions, the specialist should have the oral or written consent of the patient. This consent includes the explanation of all possible complications, whether typical, very serious or personalized ones associated with the previous health condition, age, profession, religion or any other characteristic of the patient, as well as the possible benefits of the intervention. This article addresses the bioethical aspects of dental extractions, in order to determine the main elements that informed consent should include.

  10. Information Extraction and Webpage Understanding

    OpenAIRE

    M. Sharmila Begum; L. Dinesh; P. Aruna

    2011-01-01

    The two most important tasks in information extraction from the Web are webpage structure understanding and natural language sentences processing. However, little work has been done toward an integrated statistical model for understanding webpage structures and processing natural language sentences within the HTML elements. Our recent work on webpage understanding introduces a joint model of Hierarchical Conditional Random Fields (HCRFs) and extended Semi-Markov Conditional Random Fields (Semi-CRFs) ...

  11. Proposal for a new LEIR Slow Extraction Scheme dedicated to Biomedical Research

    CERN Document Server

    Garonna, A; Carli, C

    2014-01-01

    This report presents a proposal for a new slow extraction scheme for the Low Energy Ion Ring (LEIR) in the context of the feasibility study for a biomedical research facility at CERN. LEIR has to be maintained as a heavy ion accumulator ring for LHC and for fixed-target experiments with the SPS. In parallel to this on-going operation for physics experiments, an additional secondary use of LEIR for a biomedical research facility was proposed [Dosanjh2013, Holzscheiter2012, PHE2010]. This facility would complement the existing research beam-time available at other laboratories for studies related to ion beam therapy. The new slow extraction [Abler2013] is based on the third-integer resonance. The reference beam is composed of fully stripped carbon ions with extraction energies of 20-440 MeV/u, transverse physical emittances of 5-25 µm and momentum spreads of ±(2-9)×10⁻⁴. Two resonance driving mechanisms have been studied: the quadrupole-driven method and the RF-knockout technique. Both were made compatible...

  12. Web-Based Information Extraction Technology

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    Information extraction techniques on the Web are a current research hotspot. Many information extraction techniques based on different principles have appeared, with different capabilities. We classify the existing information extraction techniques by the principle of information extraction and analyze the methods and principles of semantic information adding, schema defining, rule expression, semantic item locating and object locating in these approaches. Based on the above survey and analysis, several open problems are discussed.

  13. Information based universal feature extraction

    Science.gov (United States)

    Amiri, Mohammad; Brause, Rüdiger

    2015-02-01

    In many real-world image-based pattern recognition tasks, the extraction and usage of task-relevant features are the most crucial part of the diagnosis. In the standard approach, they mostly remain task-specific, although humans who perform such a task always use the same image features, trained in early childhood. It seems that universal feature sets exist, but they have not yet been systematically found. In our contribution, we tried to find those universal image feature sets that are valuable for most image-related tasks. In our approach, we trained a neural network with natural and non-natural images of objects and background, using a Shannon information-based algorithm and learning constraints. The goal was to extract those features that give the most valuable information for the classification of visual objects such as hand-written digits. This will give a good start and performance increase for all other image learning tasks, implementing a transfer learning approach. As a result, we found that we could indeed extract features which are valid in all three kinds of tasks.
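
    The Shannon information-based training algorithm itself is not reproduced here, but as a rough analogue of scoring image features by the information they carry about class labels, a sketch using scikit-learn on the hand-written digits mentioned above might look as follows (the feature-selection approach is an illustration, not the authors' network):

        import numpy as np
        from sklearn.datasets import load_digits
        from sklearn.feature_selection import mutual_info_classif

        # Hand-written digits as 8x8 = 64 pixel features.
        X, y = load_digits(return_X_y=True)

        # Estimate the mutual information I(pixel; digit class) for every pixel.
        mi = mutual_info_classif(X, y, random_state=0)

        # Keep the most informative pixels as a candidate "universal" feature subset.
        top = np.argsort(mi)[::-1][:16]
        print("most informative pixel indices:", top)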

  14. Down syndrome screening information in midwifery practices in the Netherlands: Strategies to integrate biomedical information.

    Science.gov (United States)

    Rosman, Sophia

    2016-03-01

    The aim of this qualitative study was to analyse counselling with regard to prenatal screening in midwifery consultations in the Netherlands, where a national prenatal screening programme has only existed since 2007, after years of social and political debate. The methodology is based on in situ observations of 25 counselling consultations in four midwifery practices in two main cities in the Netherlands. The results of this study show that, since midwives are obliged to offer information on Down syndrome screening to all pregnant women (2007), they have to deal with the communication of medical screening information using biostatistical concepts to explain risks, calculations, probabilities and chromosomal anomalies. In order to avoid the risk of medicalization of their consultations, midwives develop strategies that allow them to integrate this new biomedical discourse while maintaining their low-medicalization approach of midwife-led care. One of their main strategies is to switch from 'alarming' biomedical messages to 'reassuring words' in order to manage the anxiety induced by the information and to keep control over their low-medicalized consultation. They also tend to distance themselves from the obligation to talk about screening. The way midwives handle these counselling consultations allows them to respect their obligation to propose information, and to remain faithful to their struggle to protect the natural process of pregnancy as well as their professional autonomy. PMID:25504473

  15. Personalized Web Services for Web Information Extraction

    OpenAIRE

    Jarir, Zahi; Quafafou, Mohamed; Erradi, Mahammed

    2011-01-01

    The field of information extraction from the Web emerged with the growth of the Web and the multiplication of online data sources. This paper is an analysis of information extraction methods. It presents a service-oriented approach for web information extraction considering both web data management and extraction services. Then we propose an SOA-based architecture to enhance flexibility and on-the-fly modification of web extraction services. An implementation of the proposed architecture is proposed on the middleware level of Java Enterprise Edition (JEE) servers.

  16. Information Extraction and Webpage Understanding

    Directory of Open Access Journals (Sweden)

    M. Sharmila Begum

    2011-11-01

    Full Text Available The two most important tasks in information extraction from the Web are webpage structure understanding and natural language sentences processing. However, little work has been done toward an integrated statistical model for understanding webpage structures and processing natural language sentences within the HTML elements. Our recent work on webpage understanding introduces a joint model of Hierarchical Conditional Random Fields (HCRFs) and extended Semi-Markov Conditional Random Fields (Semi-CRFs) to leverage the page structure understanding results in free text segmentation and labeling. In this top-down integration model, the decision of the HCRF model could guide the decision making of the Semi-CRF model. However, the drawback of the top-down integration strategy is also apparent, i.e., the decision of the Semi-CRF model could not be used by the HCRF model to guide its decision making. This paper proposes a novel framework called WebNLP, which enables bidirectional integration of page structure understanding and text understanding in an iterative manner. We have applied the proposed framework to local business entity extraction and Chinese person and organization name extraction. Experiments show that the WebNLP framework achieved significantly better performance than existing methods.

  17. TAKES: Two-step Approach for Knowledge Extraction in Biomedical Digital Libraries

    Directory of Open Access Journals (Sweden)

    Song, Min

    2014-03-01

    Full Text Available This paper proposes a novel knowledge extraction system, TAKES (Two-step Approach for Knowledge Extraction System), which integrates advanced techniques from Information Retrieval (IR), Information Extraction (IE), and Natural Language Processing (NLP). In particular, TAKES adopts a novel keyphrase extraction-based query expansion technique to collect promising documents. It also uses a Conditional Random Field-based machine learning technique to extract important biological entities and relations. TAKES is applied to biological knowledge extraction, particularly retrieving promising documents that contain Protein-Protein Interaction (PPI) information and extracting PPI pairs. TAKES consists of two major components: DocSpotter, which is used to query and retrieve promising documents for extraction, and a Conditional Random Field (CRF)-based entity extraction component known as FCRF. The present paper investigated research problems addressing the issues with a knowledge extraction system and conducted a series of experiments to test our hypotheses. The findings from the experiments are as follows: first, the author verified, using three different test collections to measure the performance of the query expansion technique, that DocSpotter is robust and highly accurate when compared to Okapi BM25 and SLIPPER. Second, the author verified that the relation extraction algorithm, FCRF, is highly accurate in terms of F-Measure compared to four other competitive extraction algorithms: Support Vector Machine, Maximum Entropy, Single POS HMM, and Rapier.
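
    DocSpotter itself is not shown here, but the general flavor of keyphrase extraction-based query expansion (pseudo-relevance feedback: pick high TF-IDF terms from top-ranked documents and append them to the query) can be sketched in a few lines; the scoring formula and the toy corpus are assumptions for illustration:

        import math
        import re
        from collections import Counter

        def tfidf_keyphrases(docs: list[str], k: int = 3) -> list[str]:
            """Pick k expansion terms from feedback documents by a toy TF-IDF score."""
            tokenize = lambda d: re.findall(r"[a-z]+", d.lower())
            df = Counter(t for d in docs for t in set(tokenize(d)))
            scores = Counter()
            for d in docs:
                for t, f in Counter(tokenize(d)).items():
                    scores[t] += f * math.log(1.0 + len(docs) / df[t])
            return [t for t, _ in scores.most_common(k)]

        def expand_query(query: str, feedback_docs: list[str]) -> str:
            return query + " " + " ".join(tfidf_keyphrases(feedback_docs))

        docs = ["protein binding interaction assay", "protein interaction network analysis"]
        print(expand_query("protein-protein interaction", docs))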

  18. A Semantics-Based Approach to Retrieving Biomedical Information

    DEFF Research Database (Denmark)

    Andreasen, Troels; Bulskov, Henrik; Zambach, Sine

    2011-01-01

    This paper describes an approach to representing, organising, and accessing conceptual content of biomedical texts using a formal ontology. The ontology is based on UMLS resources supplemented with domain ontologies developed in the project. The approach introduces the notion of 'generative ontologies', i.e., ontologies providing increasingly specialised concepts reflecting the phrase structure of natural language. Furthermore, we propose a novel so-called ontological semantics which maps noun phrases from texts and queries into nodes in the generative ontology. This enables an advanced form of data mining of texts, identifying paraphrases and concept relations and measuring distances between key concepts in texts. Thus, the project is distinct in its attempt to provide a formal underpinning of conceptual similarity or relatedness of meaning.

  19. Generic information can retrieve known biological associations: implications for biomedical knowledge discovery.

    Directory of Open Access Journals (Sweden)

    Herman H H B M van Haagen

    Full Text Available MOTIVATION: Weighted semantic networks built from text-mined literature can be used to retrieve known protein-protein or gene-disease associations, and have been shown to anticipate associations years before they are explicitly stated in the literature. Our text-mining system recognizes over 640,000 biomedical concepts: some are specific (i.e., names of genes or proteins), others generic (e.g., 'Homo sapiens'). Generic concepts may play important roles in automated information retrieval, extraction, and inference, but may also result in concept overload and confound retrieval and reasoning with low-relevance or even spurious links. Here, we attempted to optimize the retrieval performance for protein-protein interactions (PPI) by filtering generic concepts (node filtering) or links to generic concepts (edge filtering) from a weighted semantic network. First, we defined metrics based on network properties that quantify the specificity of concepts. Then, using these metrics, we systematically filtered generic information from the network while monitoring the retrieval performance of known protein-protein interactions. We also systematically filtered specific information from the network (inverse filtering), and assessed the retrieval performance of networks composed of generic information alone. RESULTS: Filtering generic or specific information induced a two-phase response in retrieval performance: initially the effects of filtering were minimal, but beyond a critical threshold network performance suddenly dropped. Contrary to expectations, networks composed exclusively of generic information demonstrated retrieval performance comparable to unfiltered networks that also contain specific concepts. Furthermore, an analysis using individual generic concepts demonstrated that they can effectively support the retrieval of known protein-protein interactions. For instance, the concept "binding" is indicative for PPI retrieval and the concept "mutation abnormality" is ...
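
    As a toy illustration of the node-filtering experiment (not the authors' specificity metrics, which are defined on their full weighted network), one might rank concepts by a degree-based genericity proxy and drop the most generic ones; the graph below is invented:

        import networkx as nx

        # Invented weighted semantic network; edge weights are association strengths.
        G = nx.Graph()
        G.add_weighted_edges_from([
            ("BRCA1", "TP53", 0.7), ("BRCA1", "binding", 0.5),
            ("TP53", "binding", 0.5), ("EGFR", "binding", 0.5),
            ("BRCA1", "Homo sapiens", 0.2), ("TP53", "Homo sapiens", 0.2),
            ("EGFR", "Homo sapiens", 0.2), ("binding", "Homo sapiens", 0.1),
        ])

        # Crude genericity proxy: generic concepts touch a large share of the network.
        genericity = nx.degree_centrality(G)

        def node_filter(graph: nx.Graph, threshold: float) -> nx.Graph:
            """Drop the most generic nodes (genericity above threshold)."""
            return graph.subgraph(n for n in graph if genericity[n] <= threshold).copy()

        print(sorted(node_filter(G, threshold=0.9).nodes()))  # ['BRCA1', 'EGFR', 'TP53']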

  20. Strategies for Disseminating Information on Biomedical Research on Autism to Hispanic Parents

    Science.gov (United States)

    Lajonchere, Clara M.; Wheeler, Barbara Y.; Valente, Thomas W.; Kreutzer, Cary; Munson, Aron; Narayanan, Shrikanth; Kazemzadeh, Abe; Cruz, Roxana; Martinez, Irene; Schrager, Sheree M.; Schweitzer, Lisa; Chklovski, Tara; Hwang, Darryl

    2016-01-01

    Low income Hispanic families experience multiple barriers to accessing evidence-based information on Autism Spectrum Disorders (ASD). This study utilized a mixed-strategy intervention to create access to information in published biomedical research articles on ASD by distilling the content into parent-friendly English- and Spanish-language ASD…

  1. Extracting information from multiplex networks

    Science.gov (United States)

    Iacovacci, Jacopo; Bianconi, Ginestra

    2016-06-01

    Multiplex networks are generalized network structures that are able to describe networks in which the same set of nodes are connected by links that have different connotations. Multiplex networks are ubiquitous since they describe social, financial, engineering, and biological networks as well. Extending our ability to analyze complex networks to multiplex network structures greatly increases the level of information that is possible to extract from big data. For these reasons, characterizing the centrality of nodes in multiplex networks and finding new ways to solve challenging inference problems defined on multiplex networks are fundamental questions of network science. In this paper, we discuss the relevance of the Multiplex PageRank algorithm for measuring the centrality of nodes in multilayer networks and we characterize the utility of the recently introduced indicator function Θ̃^S for describing their mesoscale organization and community structure. As working examples for studying these measures, we consider three multiplex network datasets coming from social science.
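
    The Multiplex PageRank variants studied in the paper are not reproduced here; the sketch below shows only the simplest possible layer coupling in the same spirit, running PageRank on one layer with a teleportation distribution proportional to the PageRank of the other layer (the two random layers are placeholders):

        import networkx as nx

        # Two placeholder layers over the same six nodes, e.g. a "work" layer
        # and a "friendship" layer of one social multiplex network.
        layer_a = nx.gnp_random_graph(6, 0.5, seed=1)
        layer_b = nx.gnp_random_graph(6, 0.5, seed=2)

        # Plain PageRank on layer A.
        x = nx.pagerank(layer_a)

        # Couple the layers: teleport in layer B preferentially to nodes that are
        # already central in layer A (a simplified multiplex coupling, not the
        # full Multiplex PageRank recursion of the paper).
        X = nx.pagerank(layer_b, personalization=x)

        print({n: round(v, 3) for n, v in X.items()})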

  2. Extracting Information from Multiplex Networks

    CERN Document Server

    Iacovacci, Jacopo

    2016-01-01

    Multiplex networks are generalized network structures that are able to describe networks in which the same set of nodes are connected by links that have different connotations. Multiplex networks are ubiquitous since they describe social, financial, engineering and biological networks as well. Extending our ability to analyze complex networks to multiplex network structures greatly increases the level of information that is possible to extract from Big Data. For these reasons characterizing the centrality of nodes in multiplex networks and finding new ways to solve challenging inference problems defined on multiplex networks are fundamental questions of network science. In this paper we discuss the relevance of the Multiplex PageRank algorithm for measuring the centrality of nodes in multilayer networks and we characterize the utility of the recently introduced indicator function $\widetilde{\Theta}^{S}$ for describing their mesoscale organization and community structure. As working examples for studying these measures, we consider three multiplex network datasets coming from social science.

  3. Extraction and Characterization of Collagen from Buffalo Skin for Biomedical Applications

    Directory of Open Access Journals (Sweden)

    Moustafa A. Rizk

    2016-06-01

    Full Text Available Collagen is widely used for biomedical and pharmaceutical applications due to its excellent biocompatibility, biodegradability and weak antigenicity. However, its applicability is limited by high cost and the probability of disease transmission from the current sources, which are bovine and porcine. In the present study, collagen was extracted from the skins of 6-month-old buffaloes as an alternative safe source. The collagen was characterized by different physico-chemical techniques, including ATR-FTIR, Raman spectroscopy, SEM, DSC and amino acid analysis. The proline and hydroxyproline contents of buffalo skin collagen were higher than those of calf skin collagen, and its thermal stability was high with respect to that of calf skin collagen. The obtained buffalo skin collagen also shows higher stiffness upon cross-linking with glutaraldehyde. Thus buffalo skin collagen can be used for the fabrication of high-strength bioactive sponges and sheets for medical applications, such as scaffolds for tissue engineering, drug delivery and wound dressing systems.

  4. Ontology-based retrieval of bio-medical information based on microarray text corpora

    DEFF Research Database (Denmark)

    Hansen, Kim Allan; Zambach, Sine; Have, Christian Theil

    Microarray technology is often used in gene expression experiments. Information retrieval in the context of microarrays has mainly been concerned with the analysis of the numeric data produced; however, the experiments are often annotated with textual metadata. Although biomedical resources ... degree. We explore the possibilities of retrieving biomedical information from microarrays in Gene Expression Omnibus (GEO), of which we have indexed a sample semantically, as a first step towards ontology-based searches. Through an example we argue that it is possible to improve the retrieval ...

  5. Biomedical Information Extraction: Mining Disease Associated Genes from Literature

    Science.gov (United States)

    Huang, Zhong

    2014-01-01

    Disease-associated gene discovery is a critical step towards realizing the future of personalized medicine. However, empirical and clinical validation of disease-associated genes is time consuming and expensive. In silico discovery of disease-associated genes from literature is therefore becoming the first essential step for biomarker discovery to…

  6. Use of systematic review to inform the infection risk for biomedical engineers and technicians servicing biomedical devices

    International Nuclear Information System (INIS)

    Full text: Many microorganisms responsible for hospital-acquired infections are able to stay viable on surfaces with no visible sign of contamination, in dry conditions and on non-porous surfaces. The infection risk to biomedical staff when servicing biomedical devices is not documented. An indirect approach has been used to examine the different aspects that affect the risk of infection, including a systematic review of microbial contamination and transmission relating to biomedical devices. The systematic review found that 58% of biomedical devices have microbial contamination, with 13% carrying at least one pathogenic organism. These microbes can persist for some months. Occupational infections of biomedical service staff are low compared to other healthcare workers. A biomedical device with a contaminated surface or dust was identified as the source of patient outbreaks in 13 papers. The cleaning agent most tested for removal of micro-organisms from devices was alcohol swabs, but sterile water swabs were also effective. However, manufacturers mainly (74%) recommend cleaning devices with water and detergent. Biomedical engineers and technicians have a small risk of being exposed to dangerous micro-organisms on most biomedical devices, but without skin breakage, this exposure is unlikely to cause ill-health. It is recommended that biomedical staff follow good infection control practices, wipe devices with detergent, sterile water or alcohol swabs as recommended by the manufacturer before working on them, and keep alcohol hand rubs accessible at all benches. (author)

  7. An integrated biomedical knowledge extraction and analysis platform: using federated search and document clustering technology.

    Science.gov (United States)

    Taylor, Donald P

    2007-01-01

    High content screening (HCS) requires time-consuming and often complex iterative information retrieval and assessment approaches to optimally conduct drug discovery programs and biomedical research. Pre- and post-HCS experimentation both require the retrieval of information from public as well as proprietary literature in addition to structured information assets such as compound libraries and projects databases. Unfortunately, this information is typically scattered across a plethora of proprietary bioinformatics tools and databases and public domain sources. Consequently, single search requests must be presented to each information repository, forcing the results to be manually integrated for a meaningful result set. Furthermore, these bioinformatics tools and data repositories are becoming increasingly complex to use; typically they fail to allow for more natural query interfaces. Vivisimo has developed an enterprise software platform to bridge disparate silos of information. The platform automatically categorizes search results into descriptive folders without the use of taxonomies to drive the categorization. A new approach to information retrieval for HCS experimentation is proposed. PMID:16988412
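
    Vivisimo's clustering engine is proprietary; the sketch below only illustrates the generic idea of grouping search results into folders labeled by their heaviest terms, using scikit-learn (the result snippets are invented):

        from sklearn.cluster import KMeans
        from sklearn.feature_extraction.text import TfidfVectorizer

        # Invented result snippets returned by a federated search.
        results = [
            "kinase inhibitor screening assay",
            "high content screening image analysis",
            "kinase inhibitor selectivity profile",
            "cell image segmentation pipeline",
        ]

        vec = TfidfVectorizer()
        X = vec.fit_transform(results)
        km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)

        # Label each folder with its two heaviest centroid terms.
        terms = vec.get_feature_names_out()
        for c in range(km.n_clusters):
            top = km.cluster_centers_[c].argsort()[::-1][:2]
            print(f"folder {c}:", ", ".join(terms[i] for i in top))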

  8. An integrated biomedical knowledge extraction and analysis platform: using federated search and document clustering technology.

    Science.gov (United States)

    Taylor, Donald P

    2007-01-01

    High content screening (HCS) requires time-consuming and often complex iterative information retrieval and assessment approaches to optimally conduct drug discovery programs and biomedical research. Pre- and post-HCS experimentation both require the retrieval of information from public as well as proprietary literature in addition to structured information assets such as compound libraries and projects databases. Unfortunately, this information is typically scattered across a plethora of proprietary bioinformatics tools and databases and public domain sources. Consequently, single search requests must be presented to each information repository, forcing the results to be manually integrated for a meaningful result set. Furthermore, these bioinformatics tools and data repositories are becoming increasingly complex to use; typically they fail to allow for more natural query interfaces. Vivisimo has developed an enterprise software platform to bridge disparate silos of information. The platform automatically categorizes search results into descriptive folders without the use of taxonomies to drive the categorization. A new approach to information retrieval for HCS experimentation is proposed.

  9. Personalized Web Services for Web Information Extraction

    CERN Document Server

    Jarir, Zahi; Erradi, Mahammed

    2011-01-01

    The field of information extraction from the Web emerged with the growth of the Web and the multiplication of online data sources. This paper is an analysis of information extraction methods. It presents a service oriented approach for web information extraction considering both web data management and extraction services. Then we propose an SOA based architecture to enhance flexibility and on-the-fly modification of web extraction services. An implementation of the proposed architecture is proposed on the middleware level of Java Enterprise Edition (JEE) servers.

  10. Biosynthesis of silver nanoparticle from leaf extract of Desmodium gangeticum (L.) DC. and its biomedical potential

    Science.gov (United States)

    Thirunavoukkarasu, M.; Balaji, U.; Behera, S.; Panda, P. K.; Mishra, B. K.

    2013-12-01

    An aqueous leaf extract of Desmodium gangeticum was employed to synthesize silver nanoparticles. Rapid formation of stable silver nanoparticles was observed on exposure of the aqueous leaf extract to a solution of silver nitrate. The silver nanoparticles were characterized by UV-visible spectroscopy, scanning electron microscopy (SEM), energy dispersive X-ray analysis (EDAX), transmission electron microscopy (TEM), and Fourier transform infrared spectroscopy (FTIR). The UV-visible spectrum of the aqueous medium peaked at 450 nm, corresponding to the plasmon absorbance of silver nanoparticles. SEM analysis revealed the spherical shape of the particles, with sizes ranging from 18 to 39 nm, and the EDAX spectrum confirmed the presence of silver along with other elements in the plant metabolite. Further, these biologically synthesized nanoparticles were found to be highly toxic against the pathogenic bacterium Escherichia coli, implying the significance of the present study for the production of biomedical products.

  11. Chrysopogon zizanioides aqueous extract mediated synthesis, characterization of crystalline silver and gold nanoparticles for biomedical applications.

    Science.gov (United States)

    Arunachalam, Kantha D; Annamalai, Sathesh Kumar

    2013-01-01

    The exploitation of various plant materials for the biosynthesis of nanoparticles is considered a green technology as it does not involve any harmful chemicals. The aim of this study was to develop a simple biological method for the synthesis of silver and gold nanoparticles using Chrysopogon zizanioides. An aqueous leaf extract of C. zizanioides was used to synthesize silver and gold nanoparticles by the bioreduction of silver nitrate (AgNO3) and chloroauric acid (HAuCl4), respectively. Water-soluble organics present in the plant materials were mainly responsible for reducing silver or gold ions to nanosized Ag or Au particles. The synthesized silver and gold nanoparticles were characterized by ultraviolet (UV)-visible spectroscopy, scanning electron microscopy (SEM), energy dispersive X-ray analysis (EDAX), Fourier transform infrared spectroscopy (FTIR), and X-ray diffraction (XRD) analysis. The kinetics of the reduction of aqueous silver/gold ions by the C. zizanioides crude extract were determined by UV-visible spectroscopy. SEM analysis showed that aqueous gold ions, when exposed to the extract, were reduced, resulting in the biosynthesis of gold nanoparticles in the size range 20-50 nm. This eco-friendly approach for the synthesis of nanoparticles is simple, can be scaled up for large-scale production, and shows powerful bioactivity, as demonstrated by the synthesized silver nanoparticles. The synthesized nanoparticles can have clinical use as antibacterial, antioxidant, as well as cytotoxic agents and can be used for biomedical applications. PMID:23861583

  12. Chrysopogon zizanioides aqueous extract mediated synthesis characterization of crystalline silver and gold nanoparticles for biomedical applications

    Directory of Open Access Journals (Sweden)

    Arunachalam KD

    2013-07-01

    Full Text Available Kantha D Arunachalam, Sathesh Kumar Annamalai Center for Environmental Nuclear Research, Directorate of Research, SRM University, Chennai, Tamil Nadu, India Abstract: The exploitation of various plant materials for the biosynthesis of nanoparticles is considered a green technology as it does not involve any harmful chemicals. The aim of this study was to develop a simple biological method for the synthesis of silver and gold nanoparticles using Chrysopogon zizanioides. An aqueous leaf extract of C. zizanioides was used to synthesize silver and gold nanoparticles by the bioreduction of silver nitrate (AgNO3) and chloroauric acid (HAuCl4), respectively. Water-soluble organics present in the plant materials were mainly responsible for reducing silver or gold ions to nanosized Ag or Au particles. The synthesized silver and gold nanoparticles were characterized by ultraviolet (UV)-visible spectroscopy, scanning electron microscopy (SEM), energy dispersive X-ray analysis (EDAX), Fourier transform infrared spectroscopy (FTIR), and X-ray diffraction (XRD) analysis. The kinetics of the reduction of aqueous silver/gold ions by the C. zizanioides crude extract were determined by UV-visible spectroscopy. SEM analysis showed that aqueous gold ions, when exposed to the extract, were reduced, resulting in the biosynthesis of gold nanoparticles in the size range 20-50 nm. This eco-friendly approach for the synthesis of nanoparticles is simple, can be scaled up for large-scale production, and shows powerful bioactivity, as demonstrated by the synthesized silver nanoparticles. The synthesized nanoparticles can have clinical use as antibacterial, antioxidant, as well as cytotoxic agents and can be used for biomedical applications. Keywords: nanoparticles, bioreduction, SEM, silver, gold

  13. Discovering biomedical semantic relations in PubMed queries for information retrieval and database curation.

    Science.gov (United States)

    Huang, Chung-Chi; Lu, Zhiyong

    2016-01-01

    Identifying relevant papers from the literature is a common task in biocuration. Most current biomedical literature search systems primarily rely on matching user keywords. Semantic search, on the other hand, seeks to improve search accuracy by understanding the entities and contextual relations in user keywords. However, past research has mostly focused on semantically identifying biological entities (e.g. chemicals, diseases and genes), with little effort on discovering semantic relations. In this work, we aim to discover biomedical semantic relations in PubMed queries in an automated and unsupervised fashion. Specifically, we focus on extracting and understanding the contextual information (or context patterns) that is used by PubMed users to represent semantic relations between entities, such as 'CHEMICAL-1 compared to CHEMICAL-2'. With the advances in automatic named entity recognition, we first tag entities in PubMed queries and then use tagged entities as knowledge to recognize pattern semantics. More specifically, we transform PubMed queries into context patterns involving participating entities, which are subsequently projected to latent topics via latent semantic analysis (LSA) to avoid the data sparseness and specificity issues. Finally, we mine semantically similar contextual patterns or semantic relations based on LSA topic distributions. Our two separate evaluation experiments on chemical-chemical (CC) and chemical-disease (CD) relations show that the proposed approach significantly outperforms a baseline method, which simply measures pattern semantics by similarity in participating entities. The highest performance achieved by our approach is nearly 0.9 and 0.85, respectively, for the CC and CD tasks when compared against the ground truth in terms of normalized discounted cumulative gain (nDCG), a standard measure of ranking quality. These results suggest that our approach can effectively identify and return related semantic patterns in a ranked order.
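
    A minimal sketch of the pattern-to-topic step, assuming entities have already been masked in the queries (the patterns below are invented, and the implementation is scikit-learn's LSA, not the authors' exact pipeline):

        from sklearn.decomposition import TruncatedSVD
        from sklearn.feature_extraction.text import TfidfVectorizer
        from sklearn.metrics.pairwise import cosine_similarity

        # Invented PubMed-style queries rewritten as context patterns with entities masked.
        patterns = [
            "CHEMICAL compared to CHEMICAL",
            "CHEMICAL versus CHEMICAL",
            "CHEMICAL induced DISEASE",
            "DISEASE caused by CHEMICAL",
        ]

        tfidf = TfidfVectorizer().fit_transform(patterns)
        topics = TruncatedSVD(n_components=2, random_state=0).fit_transform(tfidf)

        # Patterns close in latent topic space are candidate paraphrases of the
        # same semantic relation (cf. 'compared to' vs 'versus').
        print(cosine_similarity(topics).round(2))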

  14. Discovering biomedical semantic relations in PubMed queries for information retrieval and database curation.

    Science.gov (United States)

    Huang, Chung-Chi; Lu, Zhiyong

    2016-01-01

    Identifying relevant papers from the literature is a common task in biocuration. Most current biomedical literature search systems primarily rely on matching user keywords. Semantic search, on the other hand, seeks to improve search accuracy by understanding the entities and contextual relations in user keywords. However, past research has mostly focused on semantically identifying biological entities (e.g. chemicals, diseases and genes), with little effort on discovering semantic relations. In this work, we aim to discover biomedical semantic relations in PubMed queries in an automated and unsupervised fashion. Specifically, we focus on extracting and understanding the contextual information (or context patterns) that is used by PubMed users to represent semantic relations between entities, such as 'CHEMICAL-1 compared to CHEMICAL-2'. With the advances in automatic named entity recognition, we first tag entities in PubMed queries and then use tagged entities as knowledge to recognize pattern semantics. More specifically, we transform PubMed queries into context patterns involving participating entities, which are subsequently projected to latent topics via latent semantic analysis (LSA) to avoid the data sparseness and specificity issues. Finally, we mine semantically similar contextual patterns or semantic relations based on LSA topic distributions. Our two separate evaluation experiments on chemical-chemical (CC) and chemical-disease (CD) relations show that the proposed approach significantly outperforms a baseline method, which simply measures pattern semantics by similarity in participating entities. The highest performance achieved by our approach is nearly 0.9 and 0.85, respectively, for the CC and CD tasks when compared against the ground truth in terms of normalized discounted cumulative gain (nDCG), a standard measure of ranking quality. These results suggest that our approach can effectively identify and return related semantic patterns in a ranked order.

  15. Toward a fully de-identified biomedical information warehouse.

    Science.gov (United States)

    Liu, Jianhua; Erdal, Selnur; Silvey, Scott A; Ding, Jing; Riedel, John D; Marsh, Clay B; Kamal, Jyoti

    2009-11-14

    The Information Warehouse at the Ohio State University Medical Center is a comprehensive repository of business, clinical, and research data from various source systems. Data collected here is a valuable resource that facilitates both translational research and personalized healthcare. The use of such data in research is governed by federal privacy regulations with oversight by the Institutional Review Board. In 2006, the Information Warehouse was recognized by the OSU IRB as an "Honest Broker" of clinical data, providing investigators with de-identified or limited datasets under stipulations contained in a signed data use agreement. In order to streamline this process even further, the Information Warehouse is developing a de-identified data warehouse that is suitable for direct user access through a controlled query tool that is aimed to support both research and education activities. In this paper we report our findings on performance evaluation of different de-identification schemes that may be used to ensure regulatory compliance while also facilitating practical database updating and querying. We also discuss how date-shifting in the de-identification process can impact other data elements such as diagnosis and procedure codes and consider a possible solution to those problems.
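
    As one common example of the date-shifting idea discussed above, a deterministic per-patient offset keeps intervals such as length of stay intact across a record; the keyed-hash construction below is an assumption for illustration, not the OSU implementation:

        import datetime
        import hashlib
        import hmac

        SECRET = b"institutional-secret"  # hypothetical key held by the honest broker

        def shift_days(patient_id: str, max_shift: int = 365) -> int:
            """Deterministic per-patient offset in [-max_shift, +max_shift] days."""
            digest = hmac.new(SECRET, patient_id.encode(), hashlib.sha256).digest()
            return int.from_bytes(digest[:4], "big") % (2 * max_shift + 1) - max_shift

        def shift_date(patient_id: str, d: datetime.date) -> datetime.date:
            return d + datetime.timedelta(days=shift_days(patient_id))

        admit, discharge = datetime.date(2008, 3, 1), datetime.date(2008, 3, 9)
        # Both dates move by the same offset, so the 8-day stay is preserved.
        print(shift_date("P001", admit), shift_date("P001", discharge))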

  16. Toward a fully de-identified biomedical information warehouse.

    Science.gov (United States)

    Liu, Jianhua; Erdal, Selnur; Silvey, Scott A; Ding, Jing; Riedel, John D; Marsh, Clay B; Kamal, Jyoti

    2009-01-01

    The Information Warehouse at the Ohio State University Medical Center is a comprehensive repository of business, clinical, and research data from various source systems. Data collected here is a valuable resource that facilitates both translational research and personalized healthcare. The use of such data in research is governed by federal privacy regulations with oversight by the Institutional Review Board. In 2006, the Information Warehouse was recognized by the OSU IRB as an "Honest Broker" of clinical data, providing investigators with de-identified or limited datasets under stipulations contained in a signed data use agreement. In order to streamline this process even further, the Information Warehouse is developing a de-identified data warehouse that is suitable for direct user access through a controlled query tool that is aimed to support both research and education activities. In this paper we report our findings on performance evaluation of different de-identification schemes that may be used to ensure regulatory compliance while also facilitating practical database updating and querying. We also discuss how date-shifting in the de-identification process can impact other data elements such as diagnosis and procedure codes and consider a possible solution to those problems. PMID:20351882

  17. [Biomedical information on the internet using search engines. A one-year trial].

    Science.gov (United States)

    Corrao, Salvatore; Leone, Francesco; Arnone, Sabrina

    2004-01-01

    The internet is a communication medium and content distributor that provides information in the general sense, but it can be of great utility for the search and retrieval of biomedical information. Search engines are a great help in rapidly finding information on the net. However, we do not know whether general search engines and meta-search engines are reliable for finding useful and validated biomedical information. The aim of our study was to verify the reproducibility of a search by keywords (pediatric or evidence) using 9 international search engines and 1 meta-search engine at baseline and after a one-year period. We analysed the first 20 citations output by each search. We evaluated the formal quality of websites and their domain extensions. Moreover, we compared the output of each search at the start of this study and after a one-year period, and we considered the number of websites cited again as a criterion of reliability. We found some interesting results, which are reported throughout the text. Our findings point out the extreme dynamicity of information on the Web and, for this reason, we advise great caution when using search and meta-search engines as tools for searching and retrieving reliable biomedical information. On the other hand, some search and meta-search engines could be very useful as a first step, to better define a search and, moreover, to find institutional websites.

  18. Organization of Biomedical Data for Collaborative Scientific Research: A Research Information Management System.

    Science.gov (United States)

    Myneni, Sahiti; Patel, Vimla L

    2010-06-01

    Biomedical researchers often work with massive, detailed and heterogeneous datasets. These datasets raise new challenges of information organization and management for scientific interpretation, as they demand much of the researchers' time and attention. The current study investigated the nature of the problems that researchers face when dealing with such data. Four major problems identified with existing biomedical scientific information management methods were related to data organization, data sharing, collaboration, and publications. Therefore, there is a compelling need to develop an efficient and user-friendly information management system to handle the biomedical research data. This study evaluated the implementation of an information management system, which was introduced as part of the collaborative research to increase scientific productivity in a research laboratory. Laboratory members seemed to exhibit frustration during the implementation process. However, empirical findings revealed that they gained new knowledge and completed specified tasks while working together with the new system. Hence, researchers are urged to persist and persevere when dealing with any new technology, including an information management system in a research laboratory environment. PMID:20543892

  19. Organization of Biomedical Data for Collaborative Scientific Research: A Research Information Management System

    Science.gov (United States)

    Myneni, Sahiti; Patel, Vimla L.

    2010-01-01

    Biomedical researchers often work with massive, detailed and heterogeneous datasets. These datasets raise new challenges of information organization and management for scientific interpretation, as they demand much of the researchers’ time and attention. The current study investigated the nature of the problems that researchers face when dealing with such data. Four major problems identified with existing biomedical scientific information management methods were related to data organization, data sharing, collaboration, and publications. Therefore, there is a compelling need to develop an efficient and user-friendly information management system to handle the biomedical research data. This study evaluated the implementation of an information management system, which was introduced as part of the collaborative research to increase scientific productivity in a research laboratory. Laboratory members seemed to exhibit frustration during the implementation process. However, empirical findings revealed that they gained new knowledge and completed specified tasks while working together with the new system. Hence, researchers are urged to persist and persevere when dealing with any new technology, including an information management system in a research laboratory environment. PMID:20543892

  20. Organization of Biomedical Data for Collaborative Scientific Research: A Research Information Management System.

    Science.gov (United States)

    Myneni, Sahiti; Patel, Vimla L

    2010-06-01

    Biomedical researchers often work with massive, detailed and heterogeneous datasets. These datasets raise new challenges of information organization and management for scientific interpretation, as they demand much of the researchers' time and attention. The current study investigated the nature of the problems that researchers face when dealing with such data. Four major problems identified with existing biomedical scientific information management methods were related to data organization, data sharing, collaboration, and publications. Therefore, there is a compelling need to develop an efficient and user-friendly information management system to handle the biomedical research data. This study evaluated the implementation of an information management system, which was introduced as part of the collaborative research to increase scientific productivity in a research laboratory. Laboratory members seemed to exhibit frustration during the implementation process. However, empirical findings revealed that they gained new knowledge and completed specified tasks while working together with the new system. Hence, researchers are urged to persist and persevere when dealing with any new technology, including an information management system in a research laboratory environment.

  1. Automated Extraction Of Associations Between Methylated Genes and Diseases From Biomedical Literature

    KAUST Repository

    Bin Res, Arwa A.

    2012-12-01

    Associations between methylated genes and diseases have been investigated in several studies, and it is critical to have such information available for better understanding of diseases and clinical decisions. However, such information is scattered across a large number of electronic publications and it is difficult to search for it manually. Therefore, the goal of the project is to develop a machine learning model that can efficiently extract such information. Twelve machine learning algorithms were applied and compared on this problem using three approaches that involve: document-term frequency matrices, position weight matrices, and a hybrid approach that combines the previous two. The best results were obtained by the hybrid approach with a random forest model that, in a 10-fold cross-validation, achieved an F-score and accuracy of nearly 85% and 84%, respectively. On a completely separate testing set, an F-score and accuracy of 89% and 88%, respectively, were obtained. Based on this model, we developed a tool that automates the extraction of associations between methylated genes and diseases from electronic text. Our study contributes an efficient method for extracting specific types of associations from free text, and the methodology developed here can be extended to other similar association extraction problems.
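
    A minimal sketch of the document-term frequency approach with a random forest, on an invented toy corpus (the real system was trained on annotated literature, and the hybrid approach additionally uses position weight matrices):

        from sklearn.ensemble import RandomForestClassifier
        from sklearn.feature_extraction.text import CountVectorizer
        from sklearn.model_selection import cross_val_score

        # Invented sentences, labeled 1 if they assert a methylation-disease association.
        sentences = [
            "promoter hypermethylation of MLH1 is associated with colorectal cancer",
            "CDKN2A methylation was detected in lung cancer patients",
            "the gene encodes a transmembrane receptor protein",
            "samples were stored at low temperature before sequencing",
        ]
        labels = [1, 1, 0, 0]

        X = CountVectorizer().fit_transform(sentences)  # document-term frequency matrix
        clf = RandomForestClassifier(n_estimators=100, random_state=0)
        print(cross_val_score(clf, X, labels, cv=2).mean())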

  2. Enhanced Pattern Representation in Information Extraction

    Institute of Scientific and Technical Information of China (English)

    廖乐健; 曹元大; 张映波

    2004-01-01

    Traditional pattern representations in information extraction lack the ability to represent domain-specific concepts and are therefore devoid of flexibility. To overcome these restrictions, an enhanced pattern representation is designed that includes ontological concepts, neighboring-tree structures and soft constraints. An information-extraction inference engine based on hypothesis generation and conflict resolution is implemented. The proposed technique is successfully applied to an information extraction system for the Chinese-language query front-end of a job-recruitment search engine.

  3. A System for Information Management in BioMedical Studies—SIMBioMS

    Science.gov (United States)

    Krestyaninova, Maria; Zarins, Andris; Viksna, Juris; Kurbatova, Natalja; Rucevskis, Peteris; Neogi, Sudeshna Guha; Gostev, Mike; Perheentupa, Teemu; Knuuttila, Juha; Barrett, Amy; Lappalainen, Ilkka; Rung, Johan; Podnieks, Karlis; Sarkans, Ugis; McCarthy, Mark I; Brazma, Alvis

    2009-01-01

    Summary: SIMBioMS is a web-based open source software system for managing data and information in biomedical studies. It provides a solution for the collection, storage, management and retrieval of information about research subjects and biomedical samples, as well as experimental data obtained using a range of high-throughput technologies, including gene expression, genotyping, proteomics and metabonomics. The system can easily be customized and has proven to be successful in several large-scale multi-site collaborative projects. It is compatible with emerging functional genomics data standards and provides data import and export in accepted standard formats. Protocols for transferring data to durable archives at the European Bioinformatics Institute have been implemented. Availability: The source code, documentation and initialization scripts are available at http://simbioms.org. Contact: support@simbioms.org; mariak@ebi.ac.uk PMID:19633095

  4. Method for Detecting Core Malware Sites Related to Biomedical Information Systems

    Directory of Open Access Journals (Sweden)

    Dohoon Kim

    2015-01-01

    Full Text Available Most advanced persistent threat attacks target web users through malicious code within landing (exploit) or distribution sites. There is an urgent need to block the affected websites. Attacks on biomedical information systems are no exception to this issue. In this paper, we present a method for locating malicious websites that attempt to attack biomedical information systems. Our approach uses malicious code crawling to rearrange websites in the order of their risk index by analyzing the centrality between malware sites and proactively eliminates the root of these sites by finding the core-hub node, thereby reducing unnecessary security policies. In particular, we dynamically estimate the risk index of the affected websites by analyzing various centrality measures and converting them into a single quantified vector. On average, the proactive elimination of core malicious websites results in an improvement in zero-day attack detection of more than 20%.
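
    A toy sketch of the centrality-combination idea on an invented link graph between suspected sites; the specific measures and their aggregation into a single risk index are assumptions for illustration:

        import networkx as nx

        # Invented link graph between suspected malware-distribution sites.
        G = nx.Graph([
            ("hub.example", "landing1.example"), ("hub.example", "landing2.example"),
            ("hub.example", "landing3.example"), ("landing1.example", "landing2.example"),
        ])

        # Combine several centrality measures into one quantified risk index per site.
        measures = [
            nx.degree_centrality(G),
            nx.betweenness_centrality(G),
            nx.closeness_centrality(G),
        ]
        risk = {n: sum(m[n] for m in measures) / len(measures) for n in G}

        # The core-hub node is the highest-risk site to eliminate first.
        print(max(risk, key=risk.get))  # hub.example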

  5. Method for detecting core malware sites related to biomedical information systems.

    Science.gov (United States)

    Kim, Dohoon; Choi, Donghee; Jin, Jonghyun

    2015-01-01

    Most advanced persistent threat attacks target web users through malicious code within landing (exploit) or distribution sites. There is an urgent need to block the affected websites. Attacks on biomedical information systems are no exception to this issue. In this paper, we present a method for locating malicious websites that attempt to attack biomedical information systems. Our approach uses malicious code crawling to rearrange websites in the order of their risk index by analyzing the centrality between malware sites and proactively eliminates the root of these sites by finding the core-hub node, thereby reducing unnecessary security policies. In particular, we dynamically estimate the risk index of the affected websites by analyzing various centrality measures and converting them into a single quantified vector. On average, the proactive elimination of core malicious websites results in an average improvement in zero-day attack detection of more than 20%. PMID:25821511

  6. Information-Theoretic Analysis for the Difficulty of Extracting Hidden Information

    Institute of Scientific and Technical Information of China (English)

    ZHANG Wei-ming; LI Shi-qu; CAO Jia; LIU Jiu-fen

    2005-01-01

    The difficulty of extracting hidden information, which is essentially a kind of secrecy, is analyzed by information-theoretic methods. The relations between key rate, message rate, hiding capacity and the difficulty of extraction are studied in terms of the unicity distance of the stego-key, and the theoretical conclusion is used to analyze an actual extraction attack on Least Significant Bit (LSB) steganographic algorithms.
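
    For reference, the classical Shannon unicity distance that this style of analysis adapts to stego-keys can be stated as follows (textbook form; the paper's exact definitions may differ):

        \[
          n_0 \approx \frac{H(K)}{D}, \qquad D = \log_2 |\mathcal{A}| - H_L,
        \]

    where $H(K)$ is the entropy of the (stego-)key, $|\mathcal{A}|$ the alphabet size, $H_L$ the per-letter entropy of the cover source, and $D$ the per-letter redundancy; roughly $n_0$ observed symbols suffice to determine the key essentially uniquely.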

  7. System for Information Extraction from News Sites

    OpenAIRE

    Stefanov, Tihomir

    2015-01-01

    The present paper deals with a system for crawling and content extraction from news sites. The system of web crawlers extracts textual and graphic information and checks for multimedia content availability. A part of the programming code and the database have been presented.

  8. A New Approach for Web Information Extraction

    Directory of Open Access Journals (Sweden)

    R. Gunasundari

    2012-01-01

    Full Text Available With the exponentially growing amount of information available on the Internet, an effective technique for users to discern useful information from unnecessary information is urgently required. Cleaning web pages before web data extraction is critical for improving the performance of information retrieval and information extraction, so we investigate removing various noise patterns from Web pages in order to isolate the main content. To solve this problem, we put forward a main-content extraction method that first removes the usual noise and the candidate nodes without any main-content information from web pages, and then makes use of the relations between the content text length, the length of anchor text and the number of punctuation marks to extract the main content. In this paper, we focus on noise removal and the utilization of various content characteristics; experiments show that this approach can enhance the universality and accuracy of extracting the body text of web pages.
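
    The scoring below is a hypothetical reading of the described heuristic (text length, anchor-text ratio and punctuation count); the weights and the tag list are assumptions, not the paper's tuned values:

        from bs4 import BeautifulSoup

        def main_content(html: str) -> str:
            """Score block elements by text length, anchor-text ratio and punctuation."""
            soup = BeautifulSoup(html, "html.parser")
            best, best_score = "", 0.0
            for el in soup.find_all(["div", "td", "article", "section", "p"]):
                text = el.get_text(" ", strip=True)
                if not text:
                    continue
                anchor_len = sum(len(a.get_text(strip=True)) for a in el.find_all("a"))
                link_density = anchor_len / len(text)
                punct = sum(text.count(c) for c in ".,;:!?")
                score = len(text) * (1.0 - link_density) + 2.0 * punct  # assumed weights
                if score > best_score:
                    best, best_score = text, score
            return best

        html = "<div><a href='/'>Home</a> <a href='/news'>News</a></div><p>Main story text, with real sentences. And more of them.</p>"
        print(main_content(html))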

  9. Implementation and management of a biomedical observation dictionary in a large healthcare information system

    Science.gov (United States)

    Vandenbussche, Pierre-Yves; Cormont, Sylvie; André, Christophe; Daniel, Christel; Delahousse, Jean; Charlet, Jean; Lepage, Eric

    2013-01-01

    Objective This study shows the evolution of a biomedical observation dictionary within the Assistance Publique Hôpitaux Paris (AP-HP), the largest European university hospital group. The different steps are detailed as follows: the dictionary creation, the mapping to logical observation identifier names and codes (LOINC), the integration into a multiterminological management platform and, finally, the implementation in the health information system. Methods AP-HP decided to create a biomedical observation dictionary named AnaBio, to map it to LOINC and to maintain the mapping. A management platform based on methods used for knowledge engineering has been put in place. It aims at integrating AnaBio within the health information system and improving both the quality and stability of the dictionary. Results This new management platform is now active in AP-HP. The AnaBio dictionary is shared by 120 laboratories and currently includes 50 000 codes. The mapping implementation to LOINC reaches 40% of the AnaBio entries and uses 26% of LOINC records. The results of our work validate the choice made to develop a local dictionary aligned with LOINC. Discussion and Conclusions This work constitutes a first step towards a wider use of the platform. The next step will support the entire biomedical production chain, from the clinician prescription, through laboratory tests tracking in the laboratory information system to the communication of results and the use for decision support and biomedical research. In addition, the increase in the mapping implementation to LOINC ensures the interoperability allowing communication with other international health institutions. PMID:23635601

  10. Battling the biomedical information explosion: a plan for implementing a quality filtered database.

    Science.gov (United States)

    Moore, M

    1989-01-01

    In 1980 an article appeared in the Archives of Internal Medicine stating that two-thirds of the studies appearing in the best medical journals contain unwarranted conclusions. In order to deal with the ever increasing mass of biomedical information (termed a "journalistic blastoma" by one editor), IAIMS has extolled the use of quality filters, devices to sort through the literature and sift the good from the bad. The complexity of the task is staggering. Quality is obviously subjective, and its nature raises the question of whether our users can be taught the necessary skills to systematically evaluate the literature. This report is based on a research project that grew into an elective course for medical students at Texas Tech University School of Medicine on "Critical Appraisal of the Biomedical Literature." PMID:10303951

  11. Language Engineering for Information Extraction

    OpenAIRE

    Schierle, Martin

    2012-01-01

    Accompanied by the cultural development to an information society and knowledge economy and driven by the rapid growth of the World Wide Web and decreasing prices for technology and disk space, the world's knowledge is evolving fast, and humans are challenged with keeping up. Despite all efforts on data structuring, a large part of this human knowledge is still hidden behind the ambiguities and fuzziness of natural language. Especially domain language poses new challenges by having specif...

  12. Application of GIS to Geological Information Extraction

    Institute of Scientific and Technical Information of China (English)

    2000-01-01

    GIS, a powerful tool for processing spatial data, is advantageous in its spatial overlaying capability. In this paper, GIS is applied to the extraction of geological information. Information associated with mineral resources is chosen to delineate geo-anomalies, the basis of ore-forming anomalies and of mineral-deposit location. This application is illustrated with an example from the Weixi area, Yunnan Province.

  13. Semantics-driven modelling of user preferences for information retrieval in the biomedical domain.

    Science.gov (United States)

    Gladun, Anatoly; Rogushina, Julia; Valencia-García, Rafael; Béjar, Rodrigo Martínez

    2013-03-01

    A large amount of biomedical and genomic data are currently available on the Internet. However, data are distributed into heterogeneous biological information sources, with little or even no organization. Semantic technologies provide a consistent and reliable basis with which to confront the challenges involved in the organization, manipulation and visualization of data and knowledge. One of the knowledge representation techniques used in semantic processing is the ontology, which is commonly defined as a formal and explicit specification of a shared conceptualization of a domain of interest. The work presented here introduces a set of interoperable algorithms that can use domain and ontological information to improve information-retrieval processes. This work presents an ontology-based information-retrieval system for the biomedical domain. This system, with which some experiments have been carried out that are described in this paper, is based on the use of domain ontologies for the creation and normalization of lightweight ontologies that represent user preferences in a determined domain in order to improve information-retrieval processes.

  14. A service-oriented distributed semantic mediator: integrating multiscale biomedical information.

    Science.gov (United States)

    Mora, Oscar; Engelbrecht, Gerhard; Bisbal, Jesus

    2012-11-01

    Biomedical research continuously generates large amounts of heterogeneous and multimodal data spread over multiple data sources. These data, if appropriately shared and exploited, could dramatically improve the research practice itself, and ultimately the quality of health care delivered. This paper presents DISMED (DIstributed Semantic MEDiator), an open source semantic mediator that provides a unified view of a federated environment of multiscale biomedical data sources. DISMED is a Web-based software application to query and retrieve information distributed over a set of registered data sources, using semantic technologies. It also offers a user-friendly interface specifically designed to simplify the usage of these technologies by non-expert users. Although the architecture of the software mediator is generic and domain independent, in the context of this paper, DISMED has been evaluated for managing biomedical environments and facilitating research with respect to the handling of scientific data distributed in multiple heterogeneous data sources. As part of this contribution, a quantitative evaluation framework has been developed. It consists of a benchmarking scenario and the definition of five realistic use-cases. This framework, created entirely with public datasets, has been used to compare the performance of DISMED against other available mediators. It is also available to the scientific community in order to evaluate progress in the domain of semantic mediation, in a systematic and comparable manner. The results show an average improvement in the execution time by DISMED of 55% compared to the second best alternative in four out of the five use-cases of the experimental evaluation.

  15. Testbed for Information Extraction from Deep Web

    OpenAIRE

    Yamada, Yasuhiro; Nakatoh, Tetsuya; Hirokawa, Sachio

    2004-01-01

    Search results generated by searchable databases are served dynamically and are far larger than the static documents on the Web. These results pages have been referred to as the Deep Web [1]. We need to extract the target data in results pages to integrate them across different searchable databases. We propose a testbed for information extraction from search results. We chose 100 databases at random from 114,540 pages with search forms, so these databases cover a good variety. We selected 51 da...

  16. Annotation for information extraction from mammography reports.

    Science.gov (United States)

    Bozkurt, Selen; Gulkesen, Kemal Hakan; Rubin, Daniel

    2013-01-01

    Inter- and intra-observer variability in mammographic interpretation is a challenging problem, and decision support systems (DSS) may be helpful to reduce variation in practice. Since radiology reports are created as unstructured free text, natural language processing (NLP) techniques are needed to extract structured information from reports in order to provide the inputs to DSS. Before creating NLP systems, producing a high-quality annotated data set is essential. The goal of this project is to develop an annotation schema to guide the information extraction tasks needed from free-text mammography reports. PMID:23823416

  17. Synonym set extraction from the biomedical literature by lexical pattern discovery

    OpenAIRE

    Collier Nigel; McCrae John

    2008-01-01

    Abstract Background Although there are a large number of thesauri for the biomedical domain, many of them lack coverage in terms and their variant forms. Automatic thesaurus construction based on patterns was first suggested by Hearst [1], but it is still not clear how to automatically construct such patterns for different semantic relations and domains. In particular it is not certain which patterns are useful for capturing synonymy. The assumption of extant resources such as parsers is also a ...

  18. Assessing the impact of case sensitivity and term information gain on biomedical concept recognition.

    Directory of Open Access Journals (Sweden)

    Tudor Groza

    Full Text Available Concept recognition (CR) is a foundational task in the biomedical domain. It supports the important process of transforming unstructured resources into structured knowledge. To date, several CR approaches have been proposed, most of which focus on a particular set of biomedical ontologies. Their underlying mechanisms vary from shallow natural language processing and dictionary lookup to specialized machine learning modules. However, no prior approach considers the case sensitivity characteristics and the term distribution of the underlying ontology in the CR process. This article proposes a framework that models the CR process as an information retrieval task in which both case sensitivity and the information gain associated with tokens in lexical representations (e.g., term labels, synonyms) are central components of a strategy for generating term variants. The case sensitivity of a given ontology is assessed based on the distribution of so-called case-sensitive tokens in its terms, while information gain is modelled using a combination of divergence from randomness and mutual information. An extensive evaluation has been carried out using the CRAFT corpus. Experimental results show that case sensitivity awareness leads to an increase of up to 0.07 F1 against a non-case-sensitive baseline on the Protein Ontology and GO Cellular Component. Similarly, the use of information gain leads to an increase of up to 0.06 F1 against a standard baseline in the case of GO Biological Process and Molecular Function and GO Cellular Component. Overall, subject to the underlying token distribution, these methods lead to valid complementary strategies for augmenting term label sets to improve concept recognition.
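
    The sketch below illustrates, with crude surrogates, the two signals the paper exploits: a token-level case-sensitivity test and a rarity-based information weight computed over ontology labels. The labels are invented, and the paper's actual divergence-from-randomness and mutual-information formulation is not reproduced here.

        import math
        from collections import Counter

        # Invented labels standing in for, e.g., GO or Protein Ontology terms.
        labels = [
            "MAPK1 protein", "cAMP binding", "protein binding",
            "Notch signaling pathway", "signaling receptor binding",
        ]

        def is_case_sensitive(token):
            """Crude surrogate: internal capitals or digits (cAMP, MAPK1)
            suggest that the token's case carries meaning."""
            mixed = token != token.lower() and token != token.capitalize()
            return mixed or any(c.isdigit() for c in token)

        tokens = [t for label in labels for t in label.split()]
        freq = Counter(t.lower() for t in tokens)
        total = sum(freq.values())

        for t in sorted(set(tokens)):
            weight = -math.log(freq[t.lower()] / total)  # rarer -> more informative
            print(f"{t:12s} case_sensitive={is_case_sensitive(t)!s:5s} weight={weight:.2f}")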

  20. Information extraction and CT reconstruction of liver images based on diffraction enhanced imaging

    Institute of Scientific and Technical Information of China (English)

    Chunhong Hu; Tao Zhao; Lu Zhang; Hui Li; Xinyan Zhao; Shuqian Luo

    2009-01-01

    X-ray phase-contrast imaging (PCI) is an emerging imaging technique that yields high spatial resolution and high contrast of biological soft tissues compared to conventional radiography. Herein a biomedical application of diffraction enhanced imaging (DEI) is presented. As one of the PCI methods, DEI derives contrast from several kinds of sample information, such as the sample's X-ray absorption, refraction gradient and ultra-small-angle X-ray scattering (USAXS) properties, and this information is expressed as three parametric images. Combined with computed tomography (CT), DEI-CT can produce 3D volumetric images of the sample and can be used for investigating micro-structures of biomedical samples. Our DEI experiments on liver samples were carried out at the topography station of the Beijing Synchrotron Radiation Facility (BSRF). The results show that, using the proposed information extraction method and DEI-CT reconstruction approach, the obtained parametric images clearly display the inner structures of liver tissues and the morphology of blood vessels. Furthermore, the reconstructed 3D view of the liver blood vessels exhibits micro blood vessels whose minimum diameter is on the order of tens of microns, much better than conventional CT reconstruction at millimeter resolution. In conclusion, both the information extraction method and DEI-CT have the potential for use in the analysis of biomedical micro-structures.
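
    For readers unfamiliar with DEI, the sketch below implements the classic two-image decomposition of Chapman et al., which separates apparent absorption from refraction-angle contrast; recovering the third, USAXS parametric image mentioned in the abstract requires additional analyser positions and is not shown. The rocking curve is assumed Gaussian and all data are synthetic.

        import numpy as np

        sigma = 1e-6                              # rocking-curve width (rad), assumed
        R  = lambda th: np.exp(-th**2 / (2 * sigma**2))   # analyser reflectivity
        dR = lambda th: -th / sigma**2 * R(th)            # its angular derivative

        th_L, th_H = -sigma, +sigma               # settings on the two flanks

        # Synthetic ground truth: absorption image and refraction-angle image.
        rng = np.random.default_rng(0)
        I_true   = rng.uniform(0.5, 1.0, (4, 4))
        dth_true = 0.3 * sigma * rng.uniform(-1.0, 1.0, (4, 4))

        # What a detector would record at the two analyser positions.
        I_L = I_true * R(th_L + dth_true)
        I_H = I_true * R(th_H + dth_true)

        # Chapman's closed-form inversion for the two parametric images,
        # exact to first order in the refraction angle.
        den   = R(th_L) * dR(th_H) - R(th_H) * dR(th_L)
        I_app = (I_L * dR(th_H) - I_H * dR(th_L)) / den
        dth   = (I_H * R(th_L) - I_L * R(th_H)) / (I_L * dR(th_H) - I_H * dR(th_L))

        print("max absorption error:", np.max(np.abs(I_app - I_true)))
        print("max refraction error (rad):", np.max(np.abs(dth - dth_true)))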

  1. Web Information Extraction

    Institute of Scientific and Technical Information of China (English)

    李晶; 陈恩红

    2003-01-01

    With the tremendous amount of information available on the Web, the ability to quickly obtain information has become a crucial problem. It is not enough to acquire information with Web information retrieval technology alone, so more and more attention is being paid to Web information extraction technology. This paper first introduces some concepts of information extraction technology, then introduces and analyzes several typical Web information extraction methods based on the differences in their extraction patterns.

  2. Unsupervised information extraction by text segmentation

    CERN Document Server

    Cortez, Eli

    2013-01-01

    A new unsupervised approach to the problem of Information Extraction by Text Segmentation (IETS) is proposed, implemented and evaluated herein. The authors' approach relies on information available in pre-existing data to learn how to associate segments in the input string with attributes of a given domain, relying on a very effective set of content-based features. The effectiveness of the content-based features is also exploited to directly learn structure-based features from test data, with no previous human-driven training, a feature unique to the presented approach. Based on the approach, a

  3. Supporting effective health and biomedical information retrieval and navigation: a novel facet view interface evaluation.

    Science.gov (United States)

    Mu, Xiangming; Ryu, Hohyon; Lu, Kun

    2011-08-01

    There is a need to provide a more effective user interface to facilitate non-domain experts' health information seeking in authoritative online databases such as MEDLINE. We developed a new topic cluster based information navigation system called SimMed. Instead of offering a list of documents, SimMed presents users with a list of ranked clusters. Topically similar documents are grouped together to provide users with a better overview of the search results and to support exploration of similar literature within a cluster. We conducted an empirical user study to compare SimMed to a traditional document list based search interface. A total of 42 study participants were recruited to use both interfaces for health information exploration search tasks. The results showed that SimMed is more effective in terms of users' perceived topic knowledge changes and their engagement in user-system interactions. We also developed a new metric to assess users' efforts to find relevant citations. On average, users need significantly fewer clicks to find relevant information in SimMed than in the baseline system. Comments from study participants indicated that SimMed is more helpful in finding similar citations, providing related medical terms, and presenting better organized search results, particularly when the initial search is unsatisfactory. Findings from the study shed light on future health and biomedical information retrieval system and interface designs.
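
    A minimal sketch of the cluster-then-present idea: group retrieved documents by topical similarity and show ranked clusters instead of a flat list. SimMed's own clustering algorithm is not described in the abstract; TF-IDF vectors with k-means merely stand in for it, and the document snippets are invented.

        from sklearn.cluster import KMeans
        from sklearn.feature_extraction.text import TfidfVectorizer

        docs = [  # invented snippets of retrieved citations
            "insulin therapy in type 2 diabetes",
            "metformin dosing for diabetes mellitus",
            "asthma inhaler corticosteroid treatment",
            "inhaled corticosteroids in chronic asthma",
        ]

        X = TfidfVectorizer(stop_words="english").fit_transform(docs)
        km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)

        clusters = {}
        for doc, label in zip(docs, km.labels_):
            clusters.setdefault(label, []).append(doc)
        for label, members in sorted(clusters.items()):
            print(f"cluster {label}: {members}")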

  4. Extracting the information backbone in online system

    CERN Document Server

    Zhang, Qian-Ming; Shang, Ming-Sheng

    2013-01-01

    Information overload is a serious problem in modern society and many solutions, such as recommender systems, have been proposed to filter out irrelevant information. In the literature, researchers have mainly been dedicated to improving the recommendation performance (accuracy and diversity) of the algorithms while overlooking the influence of the topology of the online user-object bipartite networks. In this paper, we find that some information provided by the bipartite networks is not only redundant but also misleading. With such "less can be more" feature, we design some algorithms to improve the recommendation performance by eliminating some links from the original networks. Moreover, we propose a hybrid method combining the time-aware and topology-aware link removal algorithms to extract the backbone which contains the essential information for the recommender systems. From the practical point of view, our method can improve the performance and reduce the computational time of the recommendation system, thus improve both of...

  5. From Waste to Healing Biopolymers: Biomedical Applications of Bio-Collagenic Materials Extracted from Industrial Leather Residues in Wound Healing

    Directory of Open Access Journals (Sweden)

    Rafael Luque

    2013-04-01

    Full Text Available The biomedical properties of a porous bio-collagenic polymer extracted from leather industry waste residues have been investigated in wound healing and tissue regeneration in induced wounds in rats. Application of the pure undiluted bio-collagen to induced wounds in rats dramatically improved healing after 7 days in terms of collagen production and wound filling as well as in the migration and differentiation of keratinocytes. The formulation tested was found to be three times more effective than the commercial reference product Catrix®: Heal Progress (HP): 8 ± 1.55 vs. 2.33 ± 0.52, p < 0.001; Formation of Collagen (FC): 7.5 ± 1.05 vs. 2.17 ± 0.75, p < 0.001; Regeneration of Epidermis (RE): 13.33 ± 5.11 vs. 5 ± 5.48, p < 0.05.

  6. CONAN : Text Mining in the Biomedical Domain

    NARCIS (Netherlands)

    Malik, R.

    2006-01-01

    This thesis is about text mining: extracting important information from literature. In recent years, the number of biomedical articles and journals has grown exponentially. Scientists might not find the information they want because of the large number of publications. Therefore a system was cons

  7. DKIE: Open Source Information Extraction for Danish

    DEFF Research Database (Denmark)

    Derczynski, Leon; Field, Camilla Vilhelmsen; Bøgh, Kenneth Sejdenfaden

    2014-01-01

    Danish is a major Scandinavian language spoken daily by around six million people. However, it lacks a unified, open set of NLP tools. This demonstration will introduce DKIE, an extensible open-source toolkit for processing Danish text. We implement an information extraction architecture for Danish within GATE, including integrated third-party tools. This implementation includes the creation of a substantial set of corpus annotations for data-intensive named entity recognition. The final application and dataset are made openly available, and the part-of-speech tagger and NER model also operate...

  8. The caCORE Software Development Kit: Streamlining construction of interoperable biomedical information services

    Directory of Open Access Journals (Sweden)

    Warzel Denise

    2006-01-01

    Full Text Available Abstract Background Robust, programmatically accessible biomedical information services that syntactically and semantically interoperate with other resources are challenging to construct. Such systems require the adoption of common information models, data representations and terminology standards as well as documented application programming interfaces (APIs). The National Cancer Institute (NCI) developed the cancer common ontologic representation environment (caCORE) to provide the infrastructure necessary to achieve interoperability across the systems it develops or sponsors. The caCORE Software Development Kit (SDK) was designed to provide developers both within and outside the NCI with the tools needed to construct such interoperable software systems. Results The caCORE SDK requires a Unified Modeling Language (UML) tool to begin the development workflow with the construction of a domain information model in the form of a UML Class Diagram. Models are annotated with concepts and definitions from a description logic terminology source using the Semantic Connector component. The annotated model is registered in the Cancer Data Standards Repository (caDSR) using the UML Loader component. System software is automatically generated using the Codegen component, which produces middleware that runs on an application server. The caCORE SDK was initially tested and validated using a seven-class UML model, and has been used to generate the caCORE production system, which includes models with dozens of classes. The deployed system supports access through object-oriented APIs with consistent syntax for retrieval of any type of data object across all classes in the original UML model. The caCORE SDK is currently being used by several development teams, including by participants in the cancer biomedical informatics grid (caBIG) program, to create compatible data services. caBIG compatibility standards are based upon caCORE resources, and thus the caCORE SDK has

  9. Extraction of information from unstructured text

    Energy Technology Data Exchange (ETDEWEB)

    Irwin, N.H.; DeLand, S.M.; Crowder, S.V.

    1995-11-01

    Extracting information from unstructured text has become an emphasis in recent years due to the large amount of text now electronically available. This status report describes the findings and work done by the end of the first year of a two-year LDRD. Requirements of the approach included that it model the information in a domain-independent way. This means that it would differ from current systems by not relying on previously built domain knowledge and that it would do more than keyword identification. Three areas that are discussed and expected to contribute to a solution include (1) identifying key entities through document-level profiling and preprocessing, (2) identifying relationships between entities through sentence-level syntax, and (3) combining the first two with semantic knowledge about the terms.

  10. Biomedical Applications of the Information-efficient Spectral Imaging Sensor (ISIS)

    Energy Technology Data Exchange (ETDEWEB)

    Gentry, S.M.; Levenson, R.

    1999-01-21

    The Information-efficient Spectral Imaging Sensor (ISIS) approach to spectral imaging seeks to bridge the gap between tuned multispectral and fixed hyperspectral imaging sensors. By allowing the definition of completely general spectral filter functions, truly optimal measurements can be made for a given task. These optimal measurements significantly improve signal-to-noise ratio (SNR) and speed, minimize data volume and data rate, while preserving classification accuracy. The following paper investigates the application of the ISIS sensing approach in two sample biomedical applications: prostate and colon cancer screening. It is shown that in these applications, two to three optimal measurements are sufficient to capture the majority of classification information for critical sample constituents. In the prostate cancer example, the optimal measurements allow 8% relative improvement in classification accuracy of critical cell constituents over a red, green, blue (RGB) sensor. In the colon cancer example, use of optimal measurements boost the classification accuracy of critical cell constituents by 28% relative to the RGB sensor. In both cases, optimal measurements match the performance achieved by the entire hyperspectral data set. The paper concludes that an ISIS style spectral imager can acquire these optimal spectral images directly, allowing improved classification accuracy over an RGB sensor. Compared to a hyperspectral sensor, the ISIS approach can achieve similar classification accuracy using a significantly lower number of spectral samples, thus minimizing overall sample classification time and cost.

  11. Extracting the information backbone in online system.

    Directory of Open Access Journals (Sweden)

    Qian-Ming Zhang

    Full Text Available Information overload is a serious problem in modern society and many solutions such as recommender systems have been proposed to filter out irrelevant information. In the literature, researchers have been mainly dedicated to improving the recommendation performance (accuracy and diversity) of the algorithms while they have overlooked the influence of the topology of the online user-object bipartite networks. In this paper, we find that some information provided by the bipartite networks is not only redundant but also misleading. With such "less can be more" feature, we design some algorithms to improve the recommendation performance by eliminating some links from the original networks. Moreover, we propose a hybrid method combining the time-aware and topology-aware link removal algorithms to extract the backbone which contains the essential information for the recommender systems. From the practical point of view, our method can improve the performance and reduce the computational time of the recommendation system, thus improving both their effectiveness and efficiency.
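
    A toy sketch of the two link-removal ideas the abstract combines: a time-aware pass that drops the oldest user-object links, and a topology-aware pass that drops links between high-degree endpoints first. The redundancy heuristic and the edge data are illustrative assumptions, not the paper's exact criteria.

        from collections import Counter

        # Invented (user, object, timestamp) links of a bipartite network.
        edges = [
            ("u1", "o1", 1), ("u1", "o2", 2), ("u2", "o1", 3),
            ("u2", "o3", 4), ("u3", "o2", 5), ("u3", "o3", 6), ("u1", "o3", 7),
        ]

        def time_aware(edges, keep_ratio=0.7):
            """Keep only the most recent fraction of links."""
            kept = sorted(edges, key=lambda e: e[2])
            return kept[int(len(kept) * (1 - keep_ratio)):]

        def topology_aware(edges, keep_ratio=0.7):
            """Drop links between the highest-degree endpoints first, on the
            (assumed) heuristic that those links are the most redundant."""
            du = Counter(u for u, _, _ in edges)
            do = Counter(o for _, o, _ in edges)
            kept = sorted(edges, key=lambda e: du[e[0]] * do[e[1]])
            return kept[:int(len(kept) * keep_ratio)]

        backbone = topology_aware(time_aware(edges))
        print(backbone)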

  12. Synonym set extraction from the biomedical literature by lexical pattern discovery

    Science.gov (United States)

    McCrae, John; Collier, Nigel

    2008-01-01

    Background Although there are a large number of thesauri for the biomedical domain, many of them lack coverage in terms and their variant forms. Automatic thesaurus construction based on patterns was first suggested by Hearst [1], but it is still not clear how to automatically construct such patterns for different semantic relations and domains. In particular it is not certain which patterns are useful for capturing synonymy. The assumption of extant resources such as parsers is also a limiting factor for many languages, so it is desirable to find patterns that do not use syntactical analysis. Finally, to give a more consistent and applicable result it is desirable to use these patterns to form synonym sets in a sound way. Results We present a method that automatically generates regular expression patterns by expanding seed patterns in a heuristic search and then develops a feature vector based on the occurrence of term pairs in each developed pattern. This allows for a binary classification of term pairs as synonymous or non-synonymous. We then model this result as a probability graph to find synonym sets, which is equivalent to the well-studied problem of finding an optimal set cover. We achieved 73.2% precision and 29.7% recall with our method, out-performing hand-made resources such as MeSH and Wikipedia. Conclusion We conclude that automatic methods can play a practical role in developing new thesauri or expanding on existing ones, and this can be done with only a small amount of training data and no need for resources such as parsers. We also conclude that the accuracy can be improved by grouping into synonym sets. PMID:18366721
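
    The sketch below illustrates the two ends of this pipeline: lexical patterns that yield candidate synonym pairs from text, and the grouping of accepted pairs into synonym sets. The patterns and sentences are invented rather than learned, and the paper's probability-graph set-cover formulation is replaced here by a simple union-find grouping.

        import re

        text = ("Acetylsalicylic acid, also known as aspirin, lowers fever. "
                "Tumour necrosis factor (TNF) is a cytokine.")

        patterns = [  # illustrative patterns, not the discovered ones
            r"(?P<a>[\w\- ]+?), also known as (?P<b>[\w\- ]+?)[,.]",
            r"(?P<a>[\w\- ]+?) \((?P<b>[A-Z]{2,})\)",
        ]

        pairs = []
        for p in patterns:
            pairs += [(m.group("a").strip(), m.group("b").strip())
                      for m in re.finditer(p, text)]

        # Union-find merges overlapping pairs into synonym sets.
        parent = {}
        def find(x):
            parent.setdefault(x, x)
            while parent[x] != x:
                parent[x] = parent[parent[x]]   # path halving
                x = parent[x]
            return x
        def union(a, b):
            parent[find(a)] = find(b)

        for a, b in pairs:
            union(a, b)

        sets = {}
        for term in parent:
            sets.setdefault(find(term), set()).add(term)
        print(list(sets.values()))

    Union-find only merges transitively; the paper's set-cover treatment can instead keep ambiguous terms in more than one set, which is why the authors prefer it.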

  13. Synonym set extraction from the biomedical literature by lexical pattern discovery

    Directory of Open Access Journals (Sweden)

    Collier Nigel

    2008-03-01

    Full Text Available Abstract Background Although there are a large number of thesauri for the biomedical domain, many of them lack coverage in terms and their variant forms. Automatic thesaurus construction based on patterns was first suggested by Hearst [1], but it is still not clear how to automatically construct such patterns for different semantic relations and domains. In particular it is not certain which patterns are useful for capturing synonymy. The assumption of extant resources such as parsers is also a limiting factor for many languages, so it is desirable to find patterns that do not use syntactical analysis. Finally, to give a more consistent and applicable result it is desirable to use these patterns to form synonym sets in a sound way. Results We present a method that automatically generates regular expression patterns by expanding seed patterns in a heuristic search and then develops a feature vector based on the occurrence of term pairs in each developed pattern. This allows for a binary classification of term pairs as synonymous or non-synonymous. We then model this result as a probability graph to find synonym sets, which is equivalent to the well-studied problem of finding an optimal set cover. We achieved 73.2% precision and 29.7% recall with our method, out-performing hand-made resources such as MeSH and Wikipedia. Conclusion We conclude that automatic methods can play a practical role in developing new thesauri or expanding on existing ones, and this can be done with only a small amount of training data and no need for resources such as parsers. We also conclude that the accuracy can be improved by grouping into synonym sets.

  14. System architecture and information model for integrated access to distributed biomedical information

    Science.gov (United States)

    Kim, Dongkyu; Alaoui, Adil; Levine, Betty; Leondaridis, Leonidas; Shields, Peter; Byers, Steve; Cleary, Kevin

    2009-02-01

    The current trend towards systems medicine will rely heavily on computational and bioinformatics capabilities to collect, integrate, and analyze massive amounts of data from disparate sources. The objective is to use this information to make medical decisions that improve patient care. At Georgetown University Medical Center, we are developing an informatics capability to integrate several research and clinical databases. Our long-term goal is to provide researchers at Georgetown's Lombardi Comprehensive Cancer Center better access to aggregated molecular and clinical information, facilitating the investigation of new hypotheses that impact patient care. We also recognize the need for data mining tools and intelligent agents to help researchers in these efforts. This paper describes our initial work to create a flexible platform for researchers and physicians that provides access to information sources including clinical records, medical images, and genomic, epigenomic, proteomic and metabolomic data. We describe the data sources selected for this pilot project and possible approaches to integrating these databases, present the different database integration models that we considered, and conclude by outlining the proposed Information Model for the project.

  15. Information sources in biomedical science and medical journalism: methodological approaches and assessment.

    Science.gov (United States)

    Miranda, Giovanna F; Vercellesi, Luisa; Bruno, Flavia

    2004-09-01

    Throughout the world the public is showing increasing interest in medical and scientific subjects, and journalists largely spread this information, with an important impact on knowledge and health. Clearly, therefore, the relationship between the journalist and his sources is delicate: freedom and independence of information depend on the independence and truthfulness of the sources. The new "precision journalism" holds that scientific methods should be applied to journalism, so authoritative sources are a common need for journalists and scientists. We therefore compared the classifications and methods of assessing sources in biomedical science and medical journalism to try to extrapolate scientific methods of evaluation to journalism. In journalism and science, the terms used to classify sources of information show some similarities, but their meanings are different. In science, primary and secondary classes of information refer, for instance, to the levels of processing, but in journalism to the official nature of the source itself. Scientists and journalists must both always consult as many sources as possible and check their authoritativeness, reliability, completeness, up-to-dateness and balance. In journalism, however, there are some important differences and limits: too many sources can sometimes diminish the quality of the information. The sources serve as a first filter between the event and the journalist, who is not providing the reader with the fact, but with its projection. Journalists have time constraints and lack the objective criteria for searching, the specific background knowledge, and the expertise to fully assess sources. To assist in understanding the wealth of sources of information in journalism, we have prepared a checklist of items and questions. There are at least four fundamental points that a good journalist, like any scientist, should know: how to find the latest information (the sources), how to assess it (the quality and

  16. Chaotic spectra: How to extract dynamic information

    International Nuclear Information System (INIS)

    Nonlinear dynamics is applied to chaotic unassignable atomic and molecular spectra with the aim of extracting detailed information about regular dynamic motions that exist over short intervals of time. It is shown how this motion can be extracted from high resolution spectra by doing low resolution studies or by Fourier transforming limited regions of the spectrum. These motions mimic those of periodic orbits (PO) and are inserts into the dominant chaotic motion. Considering these inserts and the PO as a dynamically decoupled region of space, resonant scattering theory and stabilization methods enable us to compute ladders of resonant states which interact with the chaotic quasi-continuum computed in principle from basis sets placed off the PO. The interaction of the resonances with the quasicontinuum explains the low resolution spectra seen in such experiments. It also allows one to associate low resolution features with a particular PO. The motion on the PO thereby supplies the molecular movements whose quantization causes the low resolution spectra. Characteristic properties of the periodic orbit based resonances are discussed. The method is illustrated on the photoabsorption spectrum of the hydrogen atom in a strong magnetic field and on the photodissociation spectrum of H3+. Other molecular systems which are currently under investigation using this formalism are also mentioned. 53 refs., 10 figs., 2 tabs

  17. Chaotic spectra: How to extract dynamic information

    Energy Technology Data Exchange (ETDEWEB)

    Taylor, H.S.; Gomez Llorente, J.M.; Zakrzewski, J.; Kulander, K.C.

    1988-10-01

    Nonlinear dynamics is applied to chaotic unassignable atomic and molecular spectra with the aim of extracting detailed information about regular dynamic motions that exist over short intervals of time. It is shown how this motion can be extracted from high resolution spectra by doing low resolution studies or by Fourier transforming limited regions of the spectrum. These motions mimic those of periodic orbits (PO) and are inserts into the dominant chaotic motion. Considering these inserts and the PO as a dynamically decoupled region of space, resonant scattering theory and stabilization methods enable us to compute ladders of resonant states which interact with the chaotic quasi-continuum computed in principle from basis sets placed off the PO. The interaction of the resonances with the quasicontinuum explains the low resolution spectra seen in such experiments. It also allows one to associate low resolution features with a particular PO. The motion on the PO thereby supplies the molecular movements whose quantization causes the low resolution spectra. Characteristic properties of the periodic orbit based resonances are discussed. The method is illustrated on the photoabsorption spectrum of the hydrogen atom in a strong magnetic field and on the photodissociation spectrum of H3+. Other molecular systems which are currently under investigation using this formalism are also mentioned. 53 refs., 10 figs., 2 tabs.

  18. Extraction of quantifiable information from complex systems

    CERN Document Server

    Dahmen, Wolfgang; Griebel, Michael; Hackbusch, Wolfgang; Ritter, Klaus; Schneider, Reinhold; Schwab, Christoph; Yserentant, Harry

    2014-01-01

    In April 2007, the Deutsche Forschungsgemeinschaft (DFG) approved the Priority Program 1324 “Mathematical Methods for Extracting Quantifiable Information from Complex Systems.” This volume presents a comprehensive overview of the most important results obtained over the course of the program. Mathematical models of complex systems provide the foundation for further technological developments in science, engineering and computational finance. Motivated by the trend toward steadily increasing computer power, ever more realistic models have been developed in recent years. These models have also become increasingly complex, and their numerical treatment poses serious challenges. Recent developments in mathematics suggest that, in the long run, much more powerful numerical solution strategies could be derived if the interconnections between the different fields of research were systematically exploited at a conceptual level. Accordingly, a deeper understanding of the mathematical foundations as w...

  19. Using Biomedical Technologies to Inform Economic Modeling: Challenges and Opportunities for Improving Analysis of Environmental Policies

    OpenAIRE

    Roe, Brian E.; Haab, Timothy C.

    2009-01-01

    Advances in biomedical technology have irrevocably jarred open the black box of human decision making, offering social scientists the potential to validate, reject, refine and redefine the individual models of resource allocation that form the foundation of modern economics. In this paper we (1) provide a comprehensive overview of the biomedical methods that may be harnessed by economists and other social scientists to better understand the economic decision making process; (2) review researc...

  20. A Novel Feature Selection Strategy for Enhanced Biomedical Event Extraction Using the Turku System

    Directory of Open Access Journals (Sweden)

    Jingbo Xia

    2014-01-01

    Full Text Available Feature selection is of paramount importance for text-mining classifiers with high-dimensional features. The Turku Event Extraction System (TEES) is the best performing tool in the GENIA BioNLP 2009/2011 shared tasks, and it relies heavily on high-dimensional features. This paper describes research which, based on an implementation of an accumulated effect evaluation (AEE) algorithm applying the greedy search strategy, analyses the contribution of every single feature class in TEES with a view to identifying important features and modifying the feature set accordingly. With the updated feature set, a new system is acquired with enhanced performance, achieving an F-score of 53.27%, up from 51.21%, for Task 1 under strict evaluation criteria and 57.24% according to the approximate span and recursive criterion.
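
    As an illustration of the greedy strategy described, the sketch below performs forward selection over feature classes (groups of columns), repeatedly adding the class that most improves cross-validated F-score. The data are synthetic and the classifier generic; TEES's real feature classes and the AEE algorithm's exact accounting are not reproduced.

        from sklearn.datasets import make_classification
        from sklearn.linear_model import LogisticRegression
        from sklearn.model_selection import cross_val_score

        X, y = make_classification(n_samples=300, n_features=12,
                                   n_informative=4, random_state=0)
        # Pretend the 12 columns fall into 4 feature classes of 3 columns each.
        classes = {f"class{i}": list(range(3 * i, 3 * i + 3)) for i in range(4)}

        selected, best = [], 0.0
        while True:
            scores = {}
            for name in classes:
                if name in selected:
                    continue
                cols = sorted(c for s in selected + [name] for c in classes[s])
                scores[name] = cross_val_score(
                    LogisticRegression(max_iter=1000), X[:, cols], y,
                    cv=5, scoring="f1").mean()
            if not scores:
                break
            name, score = max(scores.items(), key=lambda kv: kv[1])
            if score <= best:        # stop when no remaining class helps
                break
            selected.append(name)
            best = score
            print(f"added {name}: cross-validated F1 = {score:.3f}")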

  1. SEMANTIC INFORMATION EXTRACTION IN UNIVERSITY DOMAIN

    Directory of Open Access Journals (Sweden)

    Swathi

    2012-07-01

    Full Text Available Today's conventional search engines rarely provide the essential content relevant to the user's search query, because the context and semantics of the request are not analyzed to the full extent. Hence the need for semantic web search (SWS), an emerging area of web search that combines natural language processing and artificial intelligence. The objective of the work described here is to design, develop and implement a semantic search engine, SIEU (Semantic Information Extraction in University Domain), confined to the university domain. SIEU uses an ontology as a knowledge base for the information retrieval process. It is not a mere keyword search; it works one layer above what Google or other search engines retrieve by analyzing just the keywords, since the query is analyzed both syntactically and semantically. The developed system retrieves web results more relevant to the user query through keyword expansion, and the semantic analysis makes the results accurate enough to satisfy the user's request. The system will be of great use to developers and researchers who work on the web. The Google results are re-ranked and optimized to provide the relevant links, using a ranking algorithm that fetches more apt results for the user query.

  2. Enhancing retinal images by extracting structural information

    Science.gov (United States)

    Molodij, G.; Ribak, E. N.; Glanc, M.; Chenegros, G.

    2014-02-01

    High-resolution imaging of the retina has significant importance for science: physics and optics, biology, and medicine. The enhancement of images with poor contrast and the detection of faint structures require objective methods for assessing perceptual image quality. Under the assumption that human visual perception is highly adapted for extracting structural information from a scene, we introduce a framework for quality assessment based on the degradation of structural information. We implemented a new processing technique on a long sequence of retinal images of subjects with normal vision. We were able to perform a precise shift-and-add at the sub-pixel level in order to resolve structures the size of single cells in the living human retina. Lastly, we quantified the restoration reliability of the distorted images using an improved quality assessment. For that purpose, we used the single-image restoration method based on the ergodic principle, which originated in solar astronomy, to deconvolve aberrations after adaptive optics compensation.
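
    A sketch of the sub-pixel shift-and-add step: each frame's translation is estimated by phase correlation with quadratic peak interpolation, the frame is realigned with a sub-pixel shift, and the stack is averaged. The frames are synthetic, and the ergodic-principle deconvolution mentioned in the abstract is not shown.

        import numpy as np
        from scipy.ndimage import shift as subpixel_shift

        def estimate_shift(ref, img):
            """Translation of img relative to ref via phase correlation,
            refined to sub-pixel accuracy by quadratic peak interpolation."""
            cps = np.fft.fft2(img) * np.conj(np.fft.fft2(ref))
            corr = np.real(np.fft.ifft2(cps / (np.abs(cps) + 1e-12)))
            peak = np.unravel_index(np.argmax(corr), corr.shape)
            offs = []
            for ax, p in enumerate(peak):
                n = corr.shape[ax]
                at = lambda d: tuple(np.mod(p + d if a == ax else q, corr.shape[a])
                                     for a, q in enumerate(peak))
                ym, y0, yp = corr[at(-1)], corr[at(0)], corr[at(+1)]
                frac = 0.5 * (ym - yp) / (ym - 2 * y0 + yp)
                off = p + frac
                offs.append(off if off <= n / 2 else off - n)  # wrap to signed
            return np.array(offs)

        rng = np.random.default_rng(1)
        truth = rng.random((64, 64))
        frames = [subpixel_shift(truth, s, order=3)
                  for s in [(0.0, 0.0), (1.3, -0.7), (-2.1, 0.4)]]

        stack = np.zeros_like(truth)
        for f in frames:
            d = estimate_shift(frames[0], f)
            print("estimated shift:", np.round(d, 2))
            stack += subpixel_shift(f, -d, order=3)
        average = stack / len(frames)
        print("averaged frame shape:", average.shape)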

  3. Respiratory Information Extraction from Electrocardiogram Signals

    KAUST Repository

    Amin, Gamal El Din Fathy

    2010-12-01

    The Electrocardiogram (ECG) is a tool measuring the electrical activity of the heart, and it is extensively used for diagnosis and monitoring of heart diseases. The ECG signal reflects not only the heart activity but also many other physiological processes. The respiratory activity is a prominent process that affects the ECG signal due to the close proximity of the heart and the lungs. In this thesis, several methods for the extraction of respiratory process information from the ECG signal are presented. These methods allow an estimation of the lung volume and the lung pressure from the ECG signal. The potential benefit of this is to eliminate the corresponding sensors used to measure the respiration activity. A reduction of the number of sensors connected to patients will increase patients' comfort and reduce the costs associated with healthcare. As a further result, the efficiency of diagnosing respiratory disorders will increase, since the respiration activity can be monitored with a common, widely available method. The developed methods can also improve the detection of respiratory disorders that occur while patients are sleeping. Such disorders are commonly diagnosed in sleeping laboratories where the patients are connected to a number of different sensors. Any reduction of these sensors will result in a more natural sleeping environment for the patients and hence a higher sensitivity of the diagnosis.
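
    One classic family of such methods, often called ECG-derived respiration (EDR), exploits the respiratory modulation of R-peak amplitudes. The sketch below detects R peaks in a crude synthetic ECG and interpolates their heights into a surrogate respiratory waveform; the thesis's own estimators of lung volume and pressure are presumably more elaborate than this.

        import numpy as np
        from scipy.signal import find_peaks

        fs = 250                                    # sampling rate (Hz)
        t = np.arange(0, 30, 1 / fs)
        resp = 0.2 * np.sin(2 * np.pi * 0.25 * t)   # 15 breaths/min modulation

        # Crude synthetic ECG: ~1 Hz spikes whose height carries respiration.
        ecg = 0.01 * np.random.default_rng(2).standard_normal(t.size)
        for bt in np.arange(0.5, 30, 1.0):
            ecg[int(bt * fs)] = 1.0 + resp[int(bt * fs)]

        # 1) Detect R peaks; 2) interpolate their amplitudes over time.
        peaks, _ = find_peaks(ecg, height=0.5, distance=int(0.4 * fs))
        edr = np.interp(t, t[peaks], ecg[peaks])

        corr = np.corrcoef(edr, 1.0 + resp)[0, 1]
        print(f"correlation of EDR with true respiratory signal: {corr:.3f}")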

  4. Early Steps Towards Web Scale Information Extraction with LODIE

    OpenAIRE

    Gentile, Anna Lisa; The University of Sheffield; Zhang, Ziqi; The University of Sheffield; Ciravegna, Fabio; The University of Sheffield

    2015-01-01

    Information extraction (IE) is the technique for transforming unstructured textual data into structured representation that can be understood by machines. The exponential growth of the Web generates an exceptional quantity of data for which automatic knowledge capture is essential. This work describes the methodology for web scale information extraction in the LODIE project (linked open data information extraction) and highlights results from the early experiments carried out in the initial p...

  5. Information Extraction on the Web with Credibility Guarantee

    OpenAIRE

    Nguyen, Thanh Tam

    2015-01-01

    The Web has become the central medium for valuable sources of information extraction applications. However, such user-generated resources are often plagued by inaccuracies and misinformation due to the inherent openness and uncertainty of the Web. In this work we study the problem of extracting structured information out of Web data with a credibility guarantee: not only should as much structured information as possible be extracted, but its credibility should also be high. ...

  6. [Application of the life sciences platform based on Oracle to biomedical information].

    Science.gov (United States)

    Zhao, Zhi-Yun; Li, Tai-Huan; Yang, Hong-Qiao

    2008-03-01

    The life sciences platform based on Oracle database technology is introduced in this paper. By providing powerful data access, integrating a variety of data types, and managing vast quantities of data, the software presents a flexible, safe and scalable management platform for biomedical data processing. PMID:18581881

  7. A Study of the Information Literacy of Biomedical Graduate Students: Based on the Thesis Topic Discovery Process in Molecular Biology Research

    Directory of Open Access Journals (Sweden)

    Jhao-Yen Huang

    2014-06-01

    Full Text Available The biomedical information environment is in a state of constant and rapid change due to the increase in research data and rapid technological advances. In Taiwan, little research has investigated the information literacy of biomedical graduate students. This exploratory study examined the information literacy abilities and training of biomedical graduate students in Taiwan. Semi-structured interviews based on the Association of College and Research Libraries Information Literacy Competency Standards for Science and Engineering/Technology were conducted with 20 molecular biology graduate students. The interviews inquired about their information-seeking channels and information literacy education. The findings show that the biomedical graduate students developed a workable thesis topic with their advisors. Through various information-seeking channels and retrieval strategies, they obtained and critically evaluated information to address different information needs for their thesis research. Through seminars, annual conferences and papers, the interviewees were informed of current developments in their field. Subsequently, through written or oral communications, they were able to integrate and exchange the information. Most interviewees cared about the social, economic, legal, and ethical issues surrounding the use of information. College courses and labs were the main information literacy education environments in which they learned research skills and knowledge. The study identifies four areas to address in the information literacy of biomedical graduate students: using professional information, keeping up with current information, efficiently assessing domain information, and using diverse information channels. Currently, the interviewees showed rather low usage of library resources, which is a concern for biomedical educators and libraries. [Article content in Chinese]

  8. Re-ranking with context for high-performance biomedical information retrieval.

    Science.gov (United States)

    Yin, Xiaoshi; Huang, Jimmy Xiangji; Li, Zhoujun

    2012-01-01

    In this paper, we present a context-sensitive approach to re-ranking retrieved documents for further improving the effectiveness of high-performance biomedical literature retrieval systems. For each topic, a two-dimensional positive context is learnt from the top N retrieved documents and a group of negative contexts are learnt from the last N' documents in the initial ranked list. The contextual space contains lexical context and conceptual context. The probabilities that retrieved documents are generated within the contextual space are then computed for document re-ranking. Empirical evaluation on the TREC Genomics full-text collection and three high-performance biomedical literature retrieval runs demonstrates that the context-sensitive re-ranking approach yields better retrieval performance.
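
    A bare-bones sketch of the re-ranking idea: build a unigram positive context from the top N documents and a negative context from the last N', then re-score every document by the log-odds of being generated by the positive rather than the negative model. The paper's two-dimensional lexical and conceptual contexts are richer than this toy version, and the documents are invented.

        import math
        from collections import Counter

        ranked = [  # invented initial ranking, best first
            "gene expression in tumor cells",
            "tumor suppressor gene mutation",
            "protein folding dynamics",
            "weather patterns and climate",
            "stock market climate outlook",
        ]
        N = N_neg = 2
        pos = Counter(w for d in ranked[:N] for w in d.split())
        neg = Counter(w for d in ranked[-N_neg:] for w in d.split())
        vocab = set(pos) | set(neg)

        def log_odds(doc):
            """Add-one-smoothed log-odds of the document under the positive
            versus the negative unigram context model."""
            score = 0.0
            for w in doc.split():
                p = (pos[w] + 1) / (sum(pos.values()) + len(vocab))
                q = (neg[w] + 1) / (sum(neg.values()) + len(vocab))
                score += math.log(p / q)
            return score

        for doc in sorted(ranked, key=log_odds, reverse=True):
            print(f"{log_odds(doc):+.2f}  {doc}")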

  9. Turning Informal Thesauri Into Formal Ontologies: A Feasibility Study on Biomedical Knowledge re-Use

    OpenAIRE

    Udo Hahn

    2003-01-01

    This paper reports a large-scale knowledge conversion and curation experiment. Biomedical domain knowledge from a semantically weak and shallow terminological resource, the UMLS, is transformed into a rigorous description logics format. This way, the broad coverage of the UMLS is combined with inference mechanisms for consistency and cycle checking. They are the key to proper cleansing of the knowledge directly imported from the UMLS, as well as subsequent updating, maintenance an...

  10. Research on extraction of aeromagnetic weak information based on GIS

    International Nuclear Information System (INIS)

    This paper introduces the meaning of weak information extraction, discusses the theoretical foundation of aeromagnetic weak information extraction and expounds the principles of the selected mathematical-physical methods. A data visualization method favorable for displaying weak information, "1/4 standard deviation", is proposed on the GIS software platform. A validity test of weak information extraction has been completed on the GIS platform in the Dongsheng area. Results indicate that the weak anomaly enhancement technique may effectively suppress interfering noise, improve the signal-to-noise ratio and further delineate the prospective targets of sandstone uranium deposits.

  11. The Agent of extracting Internet Information with Lead Order

    Science.gov (United States)

    Mo, Zan; Huang, Chuliang; Liu, Aijun

    In order to carry out e-commerce better, advanced technologies to access business information are urgently needed. An agent is described to deal with the problems of extracting Internet information caused by the non-standard and heterogeneous structure of Chinese websites. The agent comprises three modules, each handling one stage of the extraction process. An HTTP-tree method and a Lead algorithm are proposed to generate a lead order, with which the required web pages can be retrieved easily. How to transform the extracted natural-language information into a structured form is also discussed.

  12. Information Extraction Using Distant Supervision and Semantic Similarities

    Directory of Open Access Journals (Sweden)

    PARK, Y.

    2016-02-01

    Full Text Available Information extraction is one of the main research tasks in natural language processing and text mining that extracts useful information from unstructured sentences. Information extraction techniques include named entity recognition, relation extraction, and co-reference resolution. Among them, relation extraction refers to a task that extracts semantic relations between entities, such as personal and geographic names, in documents. This is an important research area, used in knowledge base construction and question answering systems. This study presents relation extraction using distant supervision, a semi-supervised learning technique that has been in the spotlight in recent years because it reduces the manual work and costs required for supervised learning. Specifically, this study proposes a method that improves relation extraction by refining the distant supervision technique: a clustering method is applied to create the learning corpus, and semantic analysis is used for relations that are difficult to identify with existing distant supervision. Through comparative experiments with various semantic similarity measures, the similarity calculation methods useful for relation extraction under distant supervision are identified, and a large number of accurate relation triples can be extracted using the proposed structural advantages and semantic similarity comparison.
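
    The distant-supervision step itself can be sketched in a few lines: any sentence mentioning both arguments of a known knowledge-base triple is auto-labelled with that relation to form a training corpus. The triples and sentences below are invented; the second sentence shows exactly the noisy labelling that the paper's clustering and semantic-similarity filtering are designed to mitigate.

        kb = {  # invented knowledge-base triples: (arg1, arg2) -> relation
            ("Paris", "France"): "capital_of",
            ("Berlin", "Germany"): "capital_of",
        }

        sentences = [
            "Paris is the capital of France.",
            "Paris and France signed the accord.",   # co-mention, wrong relation
            "Berlin lies in Germany.",
        ]

        # Distant supervision: co-occurrence of both arguments labels the sentence.
        training = [(s, e1, e2, rel)
                    for s in sentences
                    for (e1, e2), rel in kb.items()
                    if e1 in s and e2 in s]

        for row in training:
            print(row)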

  13. Linking genes to literature: text mining, information extraction, and retrieval applications for biology.

    Science.gov (United States)

    Krallinger, Martin; Valencia, Alfonso; Hirschman, Lynette

    2008-01-01

    Efficient access to information contained in online scientific literature collections is essential for life science research, playing a crucial role from the initial stage of experiment planning to the final interpretation and communication of the results. The biological literature also constitutes the main information source for manual literature curation used by expert-curated databases. Following the increasing popularity of web-based applications for analyzing biological data, new text-mining and information extraction strategies are being implemented. These systems exploit existing regularities in natural language to extract biologically relevant information from electronic texts automatically. The aim of the BioCreative challenge is to promote the development of such tools and to provide insight into their performance. This review presents a general introduction to the main characteristics and applications of currently available text-mining systems for life sciences in terms of the following: the type of biological information demands being addressed; the level of information granularity of both user queries and results; and the features and methods commonly exploited by these applications. The current trend in biomedical text mining points toward an increasing diversification in terms of application types and techniques, together with integration of domain-specific resources such as ontologies. Additional descriptions of some of the systems discussed here are available on the internet http://zope.bioinfo.cnio.es/bionlp_tools/. PMID:18834499

  14. Information Extraction from Unstructured Text for the Biodefense Knowledge Center

    Energy Technology Data Exchange (ETDEWEB)

    Samatova, N F; Park, B; Krishnamurthy, R; Munavalli, R; Symons, C; Buttler, D J; Cottom, T; Critchlow, T J; Slezak, T

    2005-04-29

    The Bio-Encyclopedia at the Biodefense Knowledge Center (BKC) is being constructed to allow early detection of emerging biological threats to homeland security. It requires highly structured information extracted from a variety of data sources. However, the quantity of new and vital information available from everyday sources cannot be assimilated by hand, and therefore reliable high-throughput information extraction techniques are much anticipated. In support of the BKC, Lawrence Livermore National Laboratory and Oak Ridge National Laboratory, together with the University of Utah, are developing an information extraction system built around the bioterrorism domain. This paper reports two important pieces of our effort integrated in the system: key phrase extraction and semantic tagging. Whereas the two key phrase extraction technologies developed during the course of the project help identify relevant texts, our state-of-the-art semantic tagging system can pinpoint phrases related to emerging biological threats. Also we are enhancing and tailoring the Bio-Encyclopedia by augmenting semantic dictionaries and extracting details of important events, such as suspected disease outbreaks. Some of these technologies have already been applied to large corpora of free text sources vital to the BKC mission, including ProMED-mail, PubMed abstracts, and the DHS's Information Analysis and Infrastructure Protection (IAIP) news clippings. In order to address the challenges involved in incorporating such large amounts of unstructured text, the overall system is focused on precise extraction of the most relevant information for inclusion in the BKC.

  15. Extracting information from free-text mammography reports

    OpenAIRE

    Esuli, Andrea; Marcheggiani, Diego; Sebastiani, Fabrizio

    2010-01-01

    Researchers from ISTI-CNR, Pisa, aim at effectively and efficiently extracting information from free-text mammography reports, as a step towards the automatic transformation of unstructured medical documentation into structured data.

  16. Rank Based Clustering For Document Retrieval From Biomedical Databases

    Directory of Open Access Journals (Sweden)

    Jayanthi Manicassamy

    2009-09-01

    Full Text Available Nowadays, search engines are the most widely used means of extracting information from various resources throughout the world, and a majority of searches lie in the biomedical field, retrieving related documents from various biomedical databases. Current search engines, however, lack document clustering and do not represent the degree of relatedness of the documents extracted from the databases. In order to overcome these pitfalls, a text-based search engine has been developed for retrieving documents from the Medline and PubMed biomedical databases. The search engine incorporates a page-ranking-based clustering concept which automatically represents relatedness on a clustering basis. In addition, a graph tree is constructed to represent the level of relatedness of the documents that are networked together. This advanced functionality for a biomedical document-based search engine is found to provide better results when reviewing related documents based on relatedness.

  17. Efficient Retrieval of Text for Biomedical Domain using Expectation Maximization Algorithm

    Directory of Open Access Journals (Sweden)

    Sumit Vashishtha

    2011-11-01

    Full Text Available Data mining, a branch of computer science [1], is the process of extracting patterns from large data sets by combining methods from statistics and artificial intelligence with database management. Data mining is seen as an increasingly important tool by modern business to transform data into business intelligence giving an informational advantage. Biomedical text retrieval refers to text retrieval techniques applied to biomedical resources and literature available in the biomedical and molecular biology domains. The volume of published biomedical research, and therefore the underlying biomedical knowledge base, is expanding at an increasing rate. Biomedical text retrieval is a way to aid researchers in coping with information overload. By discovering predictive relationships between different pieces of extracted data, data-mining algorithms can be used to improve the accuracy of information extraction. However, textual variation due to typos, abbreviations, and other sources can prevent the productive discovery and utilization of hard-matching rules. Recent methods of soft clustering can exploit predictive relationships in textual data. This paper presents a technique for using a soft-clustering data mining algorithm to increase the accuracy of biomedical text extraction. Experimental results demonstrate that this approach improves text extraction more effectively than hard keyword-matching rules.
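
    A small sketch of soft clustering over textual variants, in the spirit of the approach: character n-gram TF-IDF vectors of noisy strings are clustered with a Gaussian mixture fitted by Expectation Maximization, and predict_proba yields the soft memberships. The strings, vectorization and mixture settings are illustrative assumptions; the paper's exact model is not specified in the abstract.

        from sklearn.feature_extraction.text import TfidfVectorizer
        from sklearn.mixture import GaussianMixture

        terms = ["haemoglobin", "hemoglobin", "hemogloben",   # spelling variants
                 "creatinine", "creatinin", "creatnine"]

        X = TfidfVectorizer(analyzer="char_wb", ngram_range=(2, 3)).fit_transform(terms)
        gm = GaussianMixture(n_components=2, covariance_type="diag",
                             reg_covar=1e-3, random_state=0).fit(X.toarray())

        for term, probs in zip(terms, gm.predict_proba(X.toarray())):
            print(f"{term:12s} soft memberships: {probs.round(2)}")

    Character n-grams, rather than whole words, are what let typo variants such as "hemogloben" land near their correct spelling in vector space.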

  18. Mining knowledge from text repositories using information extraction: A review

    Indian Academy of Sciences (India)

    Sandeep R Sirsat; Dr Vinay Chavan; Dr Shrinivas P Deshpande

    2014-02-01

    There are two approaches to mining text from online repositories. First, when the knowledge to be discovered is expressed directly in the documents to be mined, Information Extraction (IE) alone can serve as an effective tool for such text mining. Second, when the documents contain concrete data in unstructured form rather than abstract knowledge, Information Extraction (IE) can be used to first transform the unstructured data in the document corpus into a structured database, and then some state-of-the-art data mining algorithms/tools can be used to identify abstract patterns in this extracted data. This paper presents a review of several methods related to these two approaches.

  19. Addressing Information Proliferation: Applications of Information Extraction and Text Mining

    Science.gov (United States)

    Li, Jingjing

    2013-01-01

    The advent of the Internet and the ever-increasing capacity of storage media have made it easy to store, deliver, and share enormous volumes of data, leading to a proliferation of information on the Web, in online libraries, on news wires, and almost everywhere in our daily lives. Since our ability to process and absorb this information remains…

  20. Fine-grained information extraction from German transthoracic echocardiography reports

    OpenAIRE

    Toepfer, Martin; Corovic, Hamo; Fette, Georg; Klügl, Peter; Störk, Stefan; Puppe, Frank

    2015-01-01

    Background Information extraction techniques that get structured representations out of unstructured data make a large amount of clinically relevant information about patients accessible for semantic applications. These methods typically rely on standardized terminologies that guide this process. Many languages and clinical domains, however, lack appropriate resources and tools, as well as evaluations of their applications, especially if detailed conceptualizations of the domain are required....

  1. The study of the extraction of 3-D informations

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Min Ki [Korea Univ., Seoul (Korea); Kim, Jin Hun; Kim, Hui Yung; Lee, Gi Sik; Lee, Yung Shin [Sokyung Univ., Seoul (Korea)

    1998-04-01

    To extract three-dimensional information from the 3-D real world, two methods are applied (a stereo image method and a virtual reality environment method). 1. Stereo image method: from the pairs of stereo images, matching methods are applied to find the corresponding points in the two images; various methods are applied to solve this problem. 2. Virtual reality environment method: as an alternative way to extract 3-D information, a virtual reality environment is used. It is very useful for finding the 6 DOF of some given target points in 3-D space. We considered the accuracy and reliability of the 3-D information. 34 figs., 4 tabs. (Author)

  2. Linguistic information extraction for job ads (SIRE project)

    OpenAIRE

    Loth, Romain; Battistelli, Delphine; Chaumartin, François-Régis; De Mazancourt, Hugues; Minel, Jean-Luc; Vinckx, Axelle

    2010-01-01

    As a text, each job advertisement expresses rich information about the occupation at hand, such as competence needs (i.e. required degrees, field knowledge, task expertise or technical skills). To facilitate access to this information, the SIRE project conducted a corpus-based study of how to articulate HR expert ontologies with modern semi-supervised information extraction techniques. An adaptive semantic labeling framework is developed through a parallel work o...

  3. Extracting and summarizing information from large data repositories

    OpenAIRE

    Albanese, Massimiliano

    2006-01-01

    Information retrieval from large data repositories has become an important area of computer science. Research in this field is highly encouraged by the ever-increasing rate with which today's society is able to produce digital data. Unfortunately most of such data (e.g. video recordings, plain text documents) are unstructured. Two major issues thus arise in this scenario: i) extracting structured data -- information -- from unstructured data; ii) summarizing information, i.e. reducing large v...

  4. Automated extraction of change information from multispectral satellite imagery

    International Nuclear Information System (INIS)

    Given the expected technical improvements in spatial and spectral resolution, satellite imagery could increasingly provide a basis for complex information systems for recognizing and monitoring even small-scale and short-term structural features of interest within nuclear facilities, for instance construction of buildings, plant expansion, changes of operational status, underground activities, etc. The analysis of large volumes of multisensor satellite data will then definitely require a high degree of automation for (pre-)processing, analysis and interpretation in order to extract the features of interest. Against this background, the present paper focuses on the automated extraction of change information from multispectral satellite imagery.

  5. Towards Automated Education Demand-Offer Information Monitoring: the Information Extraction

    OpenAIRE

    Rudzājs, P

    2012-01-01

    Dynamically changing work environment in knowledge economy causes the changes in knowledge requirements for labor. Therefore it becomes more and more important to be constantly aware of what education is currently demanded and what education is currently offered. The IT solution is vital to process various information sources, extract education information, and provide analysis mechanisms in automated manner. The education information extraction is detailed in this paper in the context of Edu...

  6. Biomedical image representation approach using visualness and spatial information in a concept feature space for interactive region-of-interest-based retrieval.

    Science.gov (United States)

    Rahman, Md Mahmudur; Antani, Sameer K; Demner-Fushman, Dina; Thoma, George R

    2015-10-01

    This article presents an approach to biomedical image retrieval by mapping image regions to local concepts where images are represented in a weighted entropy-based concept feature space. The term "concept" refers to perceptually distinguishable visual patches that are identified locally in image regions and can be mapped to a glossary of imaging terms. Further, the visual significance (e.g., visualness) of concepts is measured as the Shannon entropy of pixel values in image patches and is used to refine the feature vector. Moreover, the system can assist the user in interactively selecting a region-of-interest (ROI) and searching for similar image ROIs. Further, a spatial verification step is used as a postprocessing step to improve retrieval results based on location information. The hypothesis that such approaches would improve biomedical image retrieval is validated through experiments on two different data sets, which are collected from open access biomedical literature.
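
    The "visualness" weight described above is the Shannon entropy of pixel values within a patch. A minimal sketch of that computation on synthetic 8-bit grayscale patches:

```python
# Sketch: Shannon entropy of pixel values in an image patch,
# the "visualness" weight described in the record above.
import numpy as np

def patch_entropy(patch: np.ndarray) -> float:
    """Entropy in bits of the gray-level histogram of an 8-bit patch."""
    hist = np.bincount(patch.ravel(), minlength=256).astype(float)
    p = hist / hist.sum()
    p = p[p > 0]                      # ignore empty histogram bins
    return float(-(p * np.log2(p)).sum())

rng = np.random.default_rng(0)
textured = rng.integers(0, 256, (16, 16), dtype=np.uint8)   # high entropy
flat = np.full((16, 16), 128, dtype=np.uint8)               # zero entropy
print(patch_entropy(textured), patch_entropy(flat))
```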

  7. The CRISP system: an untapped resource for biomedical research project information.

    OpenAIRE

    Collins, K. A.

    1989-01-01

    CRISP (Computer Retrieval of Information on Scientific Projects) is a large database maintained and operated by the National Institutes of Health (NIH). It contains comprehensive scientific and selected administrative data on research carried out by the U.S. Public Health Service (PHS) or supported by PHS grants and contracts. Developed originally to meet the needs of NIH, it is an excellent, largely untapped resource for health information professionals at large, revealing new trends, method...

  8. Ethics Review Committee approval and informed consent: an analysis of biomedical publications originating from Sri Lanka

    OpenAIRE

    Siriwardhana Chesmal; Athukorale Manjula; Lekamwattage Manura; Hewege Suwin; Siribaddana Sisira; Sumathipala Athula; Murray Joanna; Prince Martin

    2008-01-01

    Abstract Background International guidelines on research have focused on protecting research participants. Ethical Research Committee (ERC) approval and informed consent are the cornerstones. Externally sponsored research requires approval through ethical review in both the host and the sponsoring country. This study aimed to determine to what extent ERC approval and informed consent procedures are documented in locally and internationally published human subject research carried out in Sri L...

  9. Coordination of Two On-Line Information Retrieval Services At the University of Minnesota Bio-Medical Library.

    Science.gov (United States)

    Egeland, Jan; Foreman, Gertrude

    The Bio-Medical Library at the University of Minnesota has access to two on-line computer systems: the MEDLINE system (MEDLARS ON-LINE) of the National Library of Medicine and the State University of New York (SUNY) Biomedical Communication Network. MEDLINE consists of two files which provide coverage of the "Index Medicus" from 1970 to the…

  10. THE METHODS OF EXTRACTING WATER INFORMATION FROM SPOT IMAGE

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    Some techniques and methods for deriving water information from SPOT-4 (XI) imagery were investigated and discussed in this paper. An algorithm of decision-tree (DT) classification, which includes several classifiers based on the spectral response characteristics of water bodies and other objects, was developed and put forward to delineate water bodies. Another algorithm of decision-tree classification based on both spectral characteristics and auxiliary information of DEM and slope (DTDS) was also designed for water body extraction. In addition, the supervised classification method of maximum-likelihood classification (MLC) and the unsupervised method of the interactive self-organizing data analysis technique (ISODATA) were used to extract water bodies for comparison purposes. An index was designed and used to assess the accuracy of the different methods adopted in the research. Results have shown that water extraction accuracy varied with respect to the various techniques applied: it was low using ISODATA, very high using the DT algorithm and much higher using both DTDS and MLC.
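
    The decision-tree idea can be illustrated with a couple of chained spectral rules. The bands, the NDWI-style index, and the thresholds below are illustrative stand-ins, since the paper's actual classifiers for SPOT-4 bands are not reproduced here.

```python
# Sketch: decision-tree style water delineation from spectral bands.
# Band arrays and thresholds are illustrative, not the paper's actual rules.
import numpy as np

green = np.array([[0.10, 0.30], [0.12, 0.28]])   # reflectance, green-like band
nir   = np.array([[0.02, 0.35], [0.03, 0.33]])   # near-infrared band

ndwi = (green - nir) / (green + nir + 1e-9)      # water brightens green vs NIR

water = (ndwi > 0.3) & (nir < 0.1)               # two chained decision nodes
print(water)
```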

  11. A Lip Extraction Algorithm by Using Color Information Considering Obscurity

    Science.gov (United States)

    Shirasawa, Yoichi; Nishida, Makoto

    This paper proposes a method for extracting lip shape and its location from sequential facial images by using color information. The proposed method requires no prior information about position or shape. It is also carried out without special conditions such as lipstick or lighting. Psychometric quantities of a metric hue angle, a metric hue difference and rectangular coordinates, which are defined in the CIE 1976 L*a*b* color space, are used for the extraction. The method employs fuzzy reasoning in order to handle obscurity in image data such as shade on the face. The experimental results indicate the effectiveness of the proposed method; a lip position was estimated for 100 percent of the facial image data, and the lip shape was extracted for about 94 percent.

  12. Tumor information extraction in radiology reports for hepatocellular carcinoma patients

    Science.gov (United States)

    Yim, Wen-wai; Denman, Tyler; Kwan, Sharon W.; Yetisgen, Meliha

    2016-01-01

    Hepatocellular carcinoma (HCC) is a deadly disease affecting the liver for which there are many available therapies. Targeting treatments towards specific patient groups necessitates defining patients by stage of disease. Criteria for such stagings include information on tumor number, size, and anatomic location, typically found only in narrative clinical text in the electronic medical record (EMR). Natural language processing (NLP) offers an automatic and scalable means to extract this information, which can further evidence-based research. In this paper, we created a corpus of 101 radiology reports annotated for tumor information. Afterwards we applied machine learning algorithms to extract tumor information. Our inter-annotator partial match agreement scored at 0.93 and 0.90 F1 for entities and relations, respectively. Based on the annotated corpus, our sequential labeling entity extraction achieved 0.87 F1 partial match, and our maximum entropy classification relation extraction achieved scores of 0.89 and 0.74 F1 with gold and system entities, respectively. PMID:27570686
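
    A rough sketch of the sequential-labeling component is shown below: it trains a conditional random field on toy token sequences using the sklearn-crfsuite package. The feature set, label scheme details, and two-sentence training data are assumptions for the example, not the paper's annotated corpus.

```python
# Sketch: CRF sequence labeling for tumor mentions, in the spirit of the
# record above. Toy data and features; requires `pip install sklearn-crfsuite`.
import sklearn_crfsuite

def features(sent, i):
    w = sent[i]
    return {"lower": w.lower(), "is_digit": w.isdigit(),
            "prev": sent[i - 1].lower() if i else "<s>"}

train = [(["Two", "lesions", "in", "right", "hepatic", "lobe"],
          ["B-NUM", "B-TUMOR", "O", "B-LOC", "I-LOC", "I-LOC"]),
         (["1.2", "cm", "mass", "in", "segment", "IV"],
          ["B-SIZE", "I-SIZE", "B-TUMOR", "O", "B-LOC", "I-LOC"])]

X = [[features(s, i) for i in range(len(s))] for s, _ in train]
y = [labels for _, labels in train]

crf = sklearn_crfsuite.CRF(algorithm="lbfgs", max_iterations=50)
crf.fit(X, y)
print(crf.predict([[features(["mass", "in", "segment", "IV"], i) for i in range(4)]]))
```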

  14. Study of query expansion techniques and their application in the biomedical information retrieval.

    Science.gov (United States)

    Rivas, A R; Iglesias, E L; Borrajo, L

    2014-01-01

    Information Retrieval focuses on finding documents whose content matches a user query within a large document collection. As formulating well-designed queries is difficult for most users, it is necessary to use query expansion to retrieve relevant information. Query expansion techniques are widely applied for improving the efficiency of textual information retrieval systems. These techniques help to overcome vocabulary mismatch issues by expanding the original query with additional relevant terms and reweighting the terms in the expanded query. In this paper, different text preprocessing and query expansion approaches are combined to improve the set of documents initially retrieved by a query in a scientific document database. A corpus belonging to MEDLINE, called Cystic Fibrosis, is used as a knowledge source. Experimental results show that the proposed combinations of techniques greatly enhance the efficiency obtained by traditional queries.
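
    One common expansion strategy, pseudo-relevance feedback, enriches the original query with the highest-weighted terms of the top-ranked documents. Below is a minimal sketch under assumed parameters (toy corpus, top-2 feedback documents, 3 expansion terms); the paper evaluates several combinations of preprocessing and expansion techniques, not this exact recipe.

```python
# Sketch: pseudo-relevance feedback query expansion over a toy corpus.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

docs = ["cystic fibrosis lung infection treatment",
        "antibiotic therapy for chronic lung infection",
        "gene mutation analysis in cystic fibrosis patients"]
query = "cystic fibrosis therapy"

vec = TfidfVectorizer()
D = vec.fit_transform(docs)
q = vec.transform([query])

top = cosine_similarity(q, D)[0].argsort()[::-1][:2]    # top-2 feedback docs
centroid = np.asarray(D[top].mean(axis=0)).ravel()
terms = np.array(vec.get_feature_names_out())
expansion = terms[centroid.argsort()[::-1][:3]]         # 3 strongest terms
print("expanded query:", query, "+", " ".join(expansion))
```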

  15. The Study on Information Extraction Technology of Seismic Damage

    Directory of Open Access Journals (Sweden)

    Huang Zuo-wei

    2013-01-01

    Full Text Available In order to improve information extraction technology for seismic damage assessment and the publishing of earthquake damage information, and based on past earthquake experience, a technical workflow for rapid earthquake damage assessment was constructed. Taking the Yushu earthquake as an example, this study examines the framework and establishment of an information service system by means of ArcIMS and distributed database technology. It analyzes some key technologies and builds a web publishing architecture for massive remote sensing images. The system implements the joint application of remote sensing image processing technology, database technology and WebGIS technology; the results can provide an important basis for earthquake damage assessment, emergency management and rescue missions.

  16. Extracting Semantic Information from Visual Data: A Survey

    Directory of Open Access Journals (Sweden)

    Qiang Liu

    2016-03-01

    Full Text Available The traditional environment maps built by mobile robots include both metric ones and topological ones. These maps are navigation-oriented and not adequate for service robots to interact with or serve human users who normally rely on the conceptual knowledge or semantic contents of the environment. Therefore, the construction of semantic maps becomes necessary for building an effective human-robot interface for service robots. This paper reviews recent research and development in the field of visual-based semantic mapping. The main focus is placed on how to extract semantic information from visual data in terms of feature extraction, object/place recognition and semantic representation methods.

  17. Ethics Review Committee approval and informed consent: an analysis of biomedical publications originating from Sri Lanka

    Directory of Open Access Journals (Sweden)

    Siriwardhana Chesmal

    2008-02-01

    Full Text Available Abstract Background International guidelines on research have focused on protecting research participants. Ethical Research Committee (ERC) approval and informed consent are the cornerstones. Externally sponsored research requires approval through ethical review in both the host and the sponsoring country. This study aimed to determine to what extent ERC approval and informed consent procedures are documented in locally and internationally published human subject research carried out in Sri Lanka. Methods We obtained ERC approval in Sri Lanka and the United Kingdom. Theses from 1985 to 2005 available at the Postgraduate Institute of Medicine (PGIM) library affiliated to the University of Colombo were scrutinized using checklists agreed in consultation with senior research collaborators. A Medline search was carried out with the MeSH major and minor heading 'Sri Lanka' as the search term for international publications originating in Sri Lanka during 1999 to 2004. All research publications from the Ceylon Medical Journal (CMJ) during 1999 to 2005 were also scrutinized. Results Of 291 theses, 34% documented ERC approval and 61% documented obtaining consent. From the international journal survey, 250 publications originated from Sri Lanka, of which only 79 full-text original research publications could be accessed electronically. Of these, 38% documented ERC approval and 39% documented obtaining consent. In the Ceylon Medical Journal, 36% documented ERC approval and 37% documented obtaining consent. Conclusion Only one third of the publications scrutinized recorded ERC approval and procurement of informed consent. However, there is a positive trend in documenting these ethical requirements in local postgraduate research and in the local medical journal.

  18. Advanced applications of natural language processing for performing information extraction

    CERN Document Server

    Rodrigues, Mário

    2015-01-01

    This book explains how to create information extraction (IE) applications that are able to tap the vast amount of relevant information available in natural language sources: Internet pages, official documents such as laws and regulations, books and newspapers, and the social web. Readers are introduced to the problem of IE and its current challenges and limitations, supported with examples. The book discusses the need to fill the gap between documents, data, and people, and provides a broad overview of the technology supporting IE. The authors present a generic architecture for developing systems that are able to learn how to extract relevant information from natural language documents, and illustrate how to implement working systems using state-of-the-art and freely available software tools. The book also discusses concrete applications illustrating IE uses. · Provides an overview of state-of-the-art technology in information extraction (IE), discussing achievements and limitations for t...

  19. Most information feature extraction (MIFE) approach for face recognition

    Science.gov (United States)

    Zhao, Jiali; Ren, Haibing; Wang, Haitao; Kee, Seokcheol

    2005-03-01

    We present a MIFE (Most Information Feature Extraction) approach, which extracts as much information as possible for the face classification task. In the MIFE approach, a facial image is separated into sub-regions, and each sub-region makes an individual contribution to face recognition. Specifically, each sub-region is subjected to a sub-region-based adaptive gamma (SadaGamma) correction or sub-region-based histogram equalization (SHE) in order to account for different illuminations and expressions. Experimental results show that the proposed SadaGamma/SHE correction approach provides an efficient de-lighting solution for face recognition. MIFE together with SadaGamma/SHE correction achieves a lower error ratio in face recognition under different illumination and expression.

  20. Biomedical photonics handbook biomedical diagnostics

    CERN Document Server

    Vo-Dinh, Tuan

    2014-01-01

    Shaped by Quantum Theory, Technology, and the Genomics Revolution. The integration of photonics, electronics, biomaterials, and nanotechnology holds great promise for the future of medicine. This topic has recently experienced an explosive growth due to the noninvasive or minimally invasive nature and the cost-effectiveness of photonic modalities in medical diagnostics and therapy. The second edition of the Biomedical Photonics Handbook presents fundamental developments as well as important applications of biomedical photonics of interest to scientists, engineers, manufacturers, teachers, studen

  1. ONTOGRABBING: Extracting Information from Texts Using Generative Ontologies

    DEFF Research Database (Denmark)

    Nilsson, Jørgen Fischer; Szymczak, Bartlomiej Antoni; Jensen, P.A.

    2009-01-01

    We describe principles for extracting information from texts using a so-called generative ontology in combination with syntactic analysis. Generative ontologies are introduced as semantic domains for natural language phrases. Generative ontologies extend ordinary finite ontologies with rules for ...... analysis is primarily to identify paraphrases, thereby achieving a search functionality beyond mere keyword search with synsets. We further envisage use of the generative ontology as a phrase-based rather than word-based browser into text corpora....

  2. Extraction of hidden information by efficient community detection in networks

    CERN Document Server

    Lee, Juyong; Lee, Jooyoung

    2012-01-01

    Currently, we are overwhelmed by a deluge of experimental data, and network physics has the potential to become an invaluable method to increase our understanding of large interacting datasets. However, this potential is often unrealized for two reasons: uncovering the hidden community structure of a network, known as community detection, is difficult, and further, even if one has an idea of this community structure, it is not a priori obvious how to efficiently use this information. Here, to address both of these issues, we, first, identify optimal community structure of given networks in terms of modularity by utilizing a recently introduced community detection method. Second, we develop an approach to use this community information to extract hidden information from a network. When applied to a protein-protein interaction network, the proposed method outperforms current state-of-the-art methods that use only the local information of a network. The method is generally applicable to networks from many areas.
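
    The two-step recipe, detecting communities by modularity and then exploiting them to suggest hidden links, can be sketched with standard tooling. NetworkX's greedy modularity algorithm below is a stand-in for the paper's own detection method, and the same-community link suggestion is a simplified analogue of its protein-interaction use case.

```python
# Sketch: modularity-based community detection on a small network, then use
# the communities to suggest candidate links for a node (a simple stand-in
# for the hidden-information extraction the record describes).
import networkx as nx
from networkx.algorithms.community import greedy_modularity_communities

g = nx.karate_club_graph()
communities = greedy_modularity_communities(g)

# Candidate hidden links for node 0: same-community members not yet tied to it.
own = next(c for c in communities if 0 in c)
candidates = sorted(own - set(g[0]) - {0})
print(f"{len(communities)} communities; candidate partners of node 0: {candidates}")
```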

  3. Using Ontology Fingerprints to disambiguate gene name entities in the biomedical literature

    OpenAIRE

    Chen, Guocai; Zhao, Jieyi; Cohen, Trevor; Tao, Cui; Sun, Jingchun; Xu, Hua; Bernstam, Elmer V.; Lawson, Andrew; Zeng, Jia; Johnson, Amber M.; Holla, Vijaykumar; Bailey, Ann M.; Lara-Guerra, Humberto; Litzenburger, Beate; Meric-Bernstam, Funda

    2015-01-01

    Ambiguous gene names in the biomedical literature are a barrier to accurate information extraction. To overcome this hurdle, we generated Ontology Fingerprints for selected genes that are relevant for personalized cancer therapy. These Ontology Fingerprints were used to evaluate the association between genes and biomedical literature to disambiguate gene names. We obtained 93.6% precision for the test gene set and 80.4% for the area under a receiver-operating characteristics curve for gene an...

  4. Ranking Biomedical Annotations with Annotator’s Semantic Relevancy

    Directory of Open Access Journals (Sweden)

    Aihua Wu

    2014-01-01

    Full Text Available Biomedical annotation is a common and effective artifact for researchers to discuss, show opinions, and share discoveries. It is becoming increasingly popular in many online research communities and carries much useful information. Ranking biomedical annotations is a critical problem for data users trying to get information efficiently. As the annotator's knowledge about the annotated entity normally determines the quality of the annotations, we evaluate that knowledge, that is, the semantic relationship between annotator and entity, in two ways. The first is extracting relational information from credible websites by mining association rules between an annotator and a biomedical entity. The second is frequent pattern mining from historical annotations, which reveals common features of biomedical entities that an annotator can annotate with high quality. We propose a weighted and concept-extended RDF model to represent an annotator, a biomedical entity, and their background attributes, and merge information from the two ways as the context of an annotator. Based on that, we present a method to rank the annotations by evaluating their correctness according to users' votes and the semantic relevancy between the annotator and the annotated entity. The experimental results show that the approach is applicable and efficient even when the data set is large.

  6. Extraction of spatial information for low-bandwidth telerehabilitation applications

    Directory of Open Access Journals (Sweden)

    Kok Kiong Tan, PhD

    2014-09-01

    Full Text Available Telemedicine applications, based on two-dimensional (2D) video conferencing technology, have been around for the past 15 to 20 years. They have been demonstrated to be acceptable for face-to-face consultations and useful for visual examination of wounds and abrasions. However, certain telerehabilitation assessments need the use of spatial information in order to accurately assess the patient's condition, and sending three-dimensional video data over low-bandwidth networks is extremely challenging. This article proposes an innovative way of extracting the key spatial information from the patient's movement during telerehabilitation assessment based on 2D video, and then presenting the extracted data as graph plots alongside the video to help physicians in assessments, with minimal burden on existing video data transfer. Some common rehabilitation scenarios are chosen for illustration, and experiments are conducted based on skeletal tracking and color detection algorithms using the Microsoft Kinect sensor. Extracted data are analyzed in detail and their usability discussed.

  7. Transliteration normalization for Information Extraction and Machine Translation

    Directory of Open Access Journals (Sweden)

    Yuval Marton

    2014-12-01

    Full Text Available Foreign name transliterations typically include multiple spelling variants. These variants cause data sparseness and inconsistency problems, increase the Out-of-Vocabulary (OOV) rate, and present challenges for Machine Translation, Information Extraction and other natural language processing (NLP) tasks. This work aims to identify and cluster name spelling variants using a Statistical Machine Translation method: word alignment. The variants are identified by being aligned to the same “pivot” name in another language (the source language in Machine Translation settings). Based on word-to-word translation and transliteration probabilities, as well as the string edit distance metric, names with similar spellings in the target language are clustered and then normalized to a canonical form. With this approach, tens of thousands of high-precision name transliteration spelling variants are extracted from sentence-aligned bilingual corpora in Arabic and English (in both languages). When these normalized name spelling variants are applied to Information Extraction tasks, improvements over strong baseline systems are observed. When applied to Machine Translation tasks, a large improvement potential is shown.
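
    The string-edit-distance part of the pipeline can be sketched compactly: variants whose normalized edit distance to a canonical form falls under a threshold are merged. The name list, the 0.4 threshold, and the choice of the most frequent variant as canonical are assumptions for the example; the word-alignment probabilities from bilingual corpora are out of scope here.

```python
# Sketch: cluster name spelling variants by normalized edit distance and
# normalize them to a canonical form (here, the most frequent variant).
from collections import Counter

def edit_distance(a: str, b: str) -> int:
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[-1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]

names = ["Gaddafi", "Qaddafi", "Gadhafi", "Kadafi", "Gaddafi", "Qadhafi"]
canonical = Counter(names).most_common(1)[0][0]

cluster = [n for n in set(names)
           if edit_distance(n.lower(), canonical.lower())
              / max(len(n), len(canonical)) < 0.4]
print(f"normalize {sorted(cluster)} -> {canonical}")
```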

  8. Extracting Backbones from Weighted Complex Networks with Incomplete Information

    Directory of Open Access Journals (Sweden)

    Liqiang Qian

    2015-01-01

    Full Text Available The backbone is a natural abstraction of a complex network, which can help people understand a networked system in a more simplified form. Traditional backbone extraction methods tend to include many outliers in the backbone. What is more, they often suffer from computational inefficiency: the exhaustive search of all nodes or edges is often prohibitively expensive. In this paper, we propose a backbone extraction heuristic with incomplete information (BEHwII) to find the backbone in a complex weighted network. First, a strict filtering rule is carefully designed to determine which edges are to be preserved or discarded. Second, we present a local search model that examines part of the edges in an iterative way, relying only on local/incomplete knowledge rather than a global view of the network. Experimental results on four real-life networks demonstrate the advantage of BEHwII over the classic disparity filter method in terms of effectiveness and efficiency.
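
    For reference, the classic disparity filter that the record uses as its baseline can be written in a few lines: an edge survives when its weight is statistically significant given the strength and degree of at least one endpoint. This is a sketch of the baseline, not of the proposed BEHwII heuristic; the toy graph and the alpha level are illustrative.

```python
# Sketch: the classic disparity filter used as the baseline in the record
# above (not the proposed BEHwII heuristic). An edge survives if it is
# significant for at least one endpoint: p = (1 - w/s)^(k-1) < alpha.
import networkx as nx

def disparity_backbone(g: nx.Graph, alpha: float = 0.05) -> nx.Graph:
    keep = []
    for u, v, w in g.edges(data="weight"):
        significant = False
        for node in (u, v):
            k = g.degree(node)                       # degree of endpoint
            s = g.degree(node, weight="weight")      # strength of endpoint
            if k > 1 and (1 - w / s) ** (k - 1) < alpha:
                significant = True
        if significant:
            keep.append((u, v, w))
    bb = nx.Graph()
    bb.add_weighted_edges_from(keep)
    return bb

g = nx.Graph()
g.add_weighted_edges_from([("a", "b", 10), ("a", "c", 1), ("a", "d", 1),
                           ("b", "c", 1), ("c", "d", 8)])
print(disparity_backbone(g, alpha=0.3).edges(data=True))
```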

  9. Knowledge discovery: Extracting usable information from large amounts of data

    Energy Technology Data Exchange (ETDEWEB)

    Whiteson, R.

    1998-12-31

    The threat of nuclear weapons proliferation is a problem of worldwide concern. Safeguards are the key to nuclear nonproliferation, and data is the key to safeguards. The safeguards community has access to a huge and steadily growing volume of data. The advantages of this data-rich environment are obvious: there is a great deal of information which can be utilized. The challenge is to effectively apply proven and developing technologies to find and extract usable information from that data. That information must then be assessed and evaluated to produce the knowledge needed for crucial decision making. Efficient and effective analysis of safeguards data will depend on utilizing technologies to interpret the large, heterogeneous data sets that are available from diverse sources. With an order-of-magnitude increase in the amount of data from a wide variety of technical, textual, and historical sources, there is a vital need to apply advanced computer technologies to support all-source analysis. There are techniques of data warehousing, data mining, and data analysis that can provide analysts with tools that will expedite the extraction of usable information from the huge amounts of data to which they have access. Computerized tools can aid analysts by integrating heterogeneous data, evaluating diverse data streams, automating retrieval of database information, prioritizing inputs, reconciling conflicting data, doing preliminary interpretations, discovering patterns or trends in data, and automating some of the simpler prescreening tasks that are time consuming and tedious. Thus knowledge discovery technologies can provide a foundation of support for the analyst. Rather than spending time sifting through often irrelevant information, analysts could use their specialized skills in a focused, productive fashion. This would allow them to make their analytical judgments with more confidence and spend more of their time doing what they do best.

  11. miRSel: Automated extraction of associations between microRNAs and genes from the biomedical literature

    Directory of Open Access Journals (Sweden)

    Zimmer Ralf

    2010-03-01

    Full Text Available Abstract Background MicroRNAs have been discovered as important regulators of gene expression. To identify the target genes of microRNAs, several databases and prediction algorithms have been developed. Only a few experimentally confirmed microRNA targets are available in databases. Many of the microRNA targets stored in databases were derived from large-scale experiments that are considered not very reliable. We propose to use text mining of publication abstracts for extracting microRNA-gene associations, including microRNA-target relations, to complement current repositories. Results The microRNA-gene association database miRSel combines text-mining results with existing databases and computational predictions. Text mining enables the reliable extraction of microRNA, gene and protein occurrences as well as their relationships from texts. Thereby, we increased the number of human, mouse and rat miRNA-gene associations by at least three-fold as compared to, e.g., TarBase, a resource for miRNA-gene associations. Conclusions Our database miRSel offers the currently largest collection of literature-derived miRNA-gene associations. Comprehensive collections of miRNA-gene associations are important for the development of miRNA target prediction tools and the analysis of regulatory networks. miRSel is updated daily and can be queried using a web-based interface via microRNA identifiers, gene and protein names, PubMed queries as well as gene ontology (GO) terms. miRSel is freely available online at http://services.bio.ifi.lmu.de/mirsel.

  12. [Study on Information Extraction of Clinic Expert Information from Hospital Portals].

    Science.gov (United States)

    Zhang, Yuanpeng; Dong, Jiancheng; Qian, Danmin; Geng, Xingyun; Wu, Huiqun; Wang, Li

    2015-12-01

    Clinic expert information provides important references for residents in need of hospital care. Usually, such information is hidden in the deep web and cannot be directly indexed by search engines. To extract clinic expert information from the deep web, the first challenge is to make a judgment on forms. This paper proposes a novel method based on a domain model, which is a tree structure constructed by the attributes of search interfaces. With this model, search interfaces can be classified to a domain and filled in with domain keywords. Another challenge is to extract information from the returned web pages indexed by search interfaces. To filter the noise information on a web page, a block importance model is proposed. The experiment results indicated that the domain model yielded a precision 10.83% higher than that of the rule-based method, whereas the block importance model yielded an F₁ measure 10.5% higher than that of the XPath method. PMID:27079096

  13. Biomedical Engineering

    CERN Document Server

    Suh, Sang C; Tanik, Murat M

    2011-01-01

    Biomedical Engineering: Health Care Systems, Technology and Techniques is an edited volume with contributions from world experts. It provides readers with unique contributions related to current research and future healthcare systems. Practitioners and researchers focused on computer science, bioinformatics, engineering and medicine will find this book a valuable reference.

  14. A High Accuracy Method for Semi-supervised Information Extraction

    Energy Technology Data Exchange (ETDEWEB)

    Tratz, Stephen C.; Sanfilippo, Antonio P.

    2007-04-22

    Customization to specific domains of discourse and/or user requirements is one of the greatest challenges for today's Information Extraction (IE) systems. While demonstrably effective, both rule-based and supervised machine learning approaches to IE customization pose too high a burden on the user. Semi-supervised learning approaches may in principle offer a more resource-effective solution but are still insufficiently accurate to grant realistic application. We demonstrate that this limitation can be overcome by integrating fully-supervised learning techniques within a semi-supervised IE approach, without increasing resource requirements.

  15. Using XBRL Technology to Extract Competitive Information from Financial Statements

    Directory of Open Access Journals (Sweden)

    Dominik Ditter

    2011-12-01

    Full Text Available The eXtensible Business Reporting Language, or XBRL, is a reporting format for the automatic and electronic exchange of business and financial data. In XBRL every single reported fact is marked with a unique tag, enabling a full computer-based readout of financial data. It has the potential to improve the collection and analysis of financial data for Competitive Intelligence (e.g., the profiling of publicly available financial statements. The article describes how easily information from XBRL reports can be extracted.
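
    Because every XBRL fact is a tagged XML element carrying a contextRef attribute, pulling facts out of an instance document is a short parsing exercise. Below is a minimal sketch using only the Python standard library; the instance snippet and tag names are simplified assumptions, as real filings declare many more namespaces and contexts.

```python
# Sketch: read tagged facts from a (toy) XBRL instance document. In XBRL,
# each reported business fact is an XML element with a contextRef attribute.
import xml.etree.ElementTree as ET

instance = """<xbrl xmlns:us-gaap="http://fasb.org/us-gaap/2023">
  <us-gaap:Revenues contextRef="FY2023" unitRef="USD">1200000</us-gaap:Revenues>
  <us-gaap:NetIncomeLoss contextRef="FY2023" unitRef="USD">150000</us-gaap:NetIncomeLoss>
</xbrl>"""

root = ET.fromstring(instance)
for elem in root.iter():
    if "contextRef" in elem.attrib:          # every XBRL fact carries one
        name = elem.tag.split("}")[-1]       # strip the namespace URI
        print(name, elem.attrib["contextRef"], elem.text)
```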

  16. Automated extraction of chemical structure information from digital raster images

    Directory of Open Access Journals (Sweden)

    Shedden Kerby A

    2009-02-01

    Full Text Available Abstract Background To search for chemical structures in research articles, diagrams or text representing molecules need to be translated into a standard chemical file format compatible with cheminformatic search engines. Nevertheless, chemical information contained in research articles is often referenced as analog diagrams of chemical structures embedded in digital raster images. To automate analog-to-digital conversion of chemical structure diagrams in scientific research articles, several software systems have been developed, but their algorithmic performance and utility in cheminformatic research have not been investigated. Results This paper aims to provide critical reviews of these systems and also reports our recent development of ChemReader, a fully automated tool for extracting chemical structure diagrams from research articles and converting them into standard, searchable chemical file formats. Basic algorithms for recognizing lines and letters representing bonds and atoms in chemical structure diagrams can be independently run in sequence from a graphical user interface, and the algorithm parameters can be readily changed, to facilitate additional development specifically tailored to a chemical database annotation scheme. Compared with existing software programs such as OSRA, Kekule, and CLiDE, our results indicate that ChemReader outperforms other software systems on several sets of sample images from diverse sources in terms of the rate of correct outputs and the accuracy of extracting molecular substructure patterns. Conclusion The availability of ChemReader as a cheminformatic tool for extracting chemical structure information from digital raster images allows research and development groups to enrich their chemical structure databases by annotating the entries with links

  17. Biomedical Materials

    Institute of Scientific and Technical Information of China (English)

    CHANG Jiang; ZHOU Yanling

    2011-01-01

    Biomedical materials, biomaterials for short, are regarded as "any substance or combination of substances, synthetic or natural in origin, which can be used for any period of time, as a whole or as part of a system which treats, augments, or replaces any tissue, organ or function of the body" (Vonrecum & Laberge, 1995). Biomaterials can save lives, relieve suffering and enhance the quality of life for human beings.

  18. Computational intelligence in biomedical imaging

    CERN Document Server

    2014-01-01

    This book provides a comprehensive overview of state-of-the-art computational intelligence research and technologies in biomedical images, with emphasis on biomedical decision making. Biomedical imaging offers useful information on patients' medical conditions and clues to causes of their symptoms and diseases. Biomedical imaging, however, produces a large number of images that physicians must interpret. Therefore, computer aids are in demand and have become indispensable in physicians' decision making. This book discusses major technical advancements and research findings in the field of computational intelligence in biomedical imaging, for example, computational intelligence in computer-aided diagnosis for breast cancer, prostate cancer, and brain disease, in lung function analysis, and in radiation therapy. The book examines technologies and studies that have reached the practical level, and those technologies that are becoming available in clinical practices in hospitals rapidly, such as computational inte...

  19. Event extraction for DNA methylation

    OpenAIRE

    Ohta Tomoko; Pyysalo Sampo; Miwa Makoto; Tsujii Jun’ichi

    2011-01-01

    Abstract Background We consider the task of automatically extracting DNA methylation events from the biomedical domain literature. DNA methylation is a key mechanism of epigenetic control of gene expression and implicated in many cancers, but there has been little study of automatic information extraction for DNA methylation. Results We present an annotation scheme for DNA methylation following the representation of the BioNLP shared task on event extraction, select a set of 200 abstracts inc...

  20. Research of information classification and strategy intelligence extract algorithm based on military strategy hall

    Science.gov (United States)

    Chen, Lei; Li, Dehua; Yang, Jie

    2007-12-01

    Constructing a virtual international strategy environment needs many kinds of information, such as economics, politics, military affairs, diplomacy, culture, science, etc. It is therefore very important to build a highly efficient automated system for information extraction, classification, recombination and analysis management as the foundation and a component of the military strategy hall. This paper first uses an improved boosting algorithm to classify the obtained initial information, and then uses a strategy intelligence extraction algorithm to extract strategic intelligence from the initial information to help strategists analyze it.
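
    The classification step can be illustrated with an off-the-shelf boosting classifier over TF-IDF features. The labels and snippets below are invented for the example, and plain AdaBoost stands in for the paper's improved boosting variant.

```python
# Sketch: boosting-based classification of collected texts into strategy
# domains. Labels and snippets are invented; plain AdaBoost stands in for
# the paper's improved boosting variant.
from sklearn.ensemble import AdaBoostClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline

texts = ["central bank raises interest rates",
         "naval exercises held in the strait",
         "trade agreement signed between ministries",
         "new missile system enters service"]
labels = ["economy", "military", "economy", "military"]

clf = make_pipeline(TfidfVectorizer(), AdaBoostClassifier(n_estimators=50))
clf.fit(texts, labels)
print(clf.predict(["joint naval missile exercises announced"]))
```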

  1. Text mining patents for biomedical knowledge.

    Science.gov (United States)

    Rodriguez-Esteban, Raul; Bundschus, Markus

    2016-06-01

    Biomedical text mining of scientific knowledge bases, such as Medline, has received much attention in recent years. Given that text mining is able to automatically extract biomedical facts that revolve around entities such as genes, proteins, and drugs, from unstructured text sources, it is seen as a major enabler to foster biomedical research and drug discovery. In contrast to the biomedical literature, research into the mining of biomedical patents has not reached the same level of maturity. Here, we review existing work and highlight the associated technical challenges that emerge from automatically extracting facts from patents. We conclude by outlining potential future directions in this domain that could help drive biomedical research and drug discovery.

  3. An Extraction Method of an Informative DOM Node from a Web Page by Using Layout Information

    Science.gov (United States)

    Tsuruta, Masanobu; Masuyama, Shigeru

    We propose a method for extracting the informative DOM node from a Web page as preprocessing for Web content mining. Our proposed method, LM, uses layout data of DOM nodes generated by a generic Web browser; the learning set consists of hundreds of Web pages and annotations of the informative DOM nodes of those pages. Our method does not require large-scale crawling of the whole Web site to which the target Web page belongs. We design LM so that it uses the information in the learning set more efficiently than the existing method that uses the same learning set. In experiments, we evaluate methods obtained by combining a method for extracting the informative DOM node (either the proposed method or existing methods) with the existing noise elimination methods: Heur, which removes advertisements and link lists by heuristics, and CE, which removes DOM nodes that also occur in other Web pages of the Web site to which the target Web page belongs. Experimental results show that 1) LM outperforms the other methods for extracting the informative DOM node, and 2) the combination method (LM, {CE(10), Heur}) based on LM (precision: 0.755, recall: 0.826, F-measure: 0.746) outperforms the other combination methods.
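
    A simplified flavor of informative-node extraction can be shown with a text-versus-link-density heuristic over the DOM. This is a generic baseline in the spirit of Heur, not the layout-based LM method of the record, which requires rendered layout data; the toy page and the scoring weights are assumptions.

```python
# Sketch: pick the "informative" DOM node by text-vs-link density. A generic
# heuristic baseline, not the layout-based LM method of the record above.
# Requires `pip install beautifulsoup4`.
from bs4 import BeautifulSoup

html = """<html><body>
  <div id="nav"><a href="/">Home</a><a href="/news">News</a></div>
  <div id="story"><p>Long article text with the actual content of the page,
  which a content miner would like to keep.</p><a href="/more">more</a></div>
</body></html>"""

soup = BeautifulSoup(html, "html.parser")

def score(node):
    text = len(node.get_text(strip=True))
    link_text = sum(len(a.get_text(strip=True)) for a in node.find_all("a"))
    return text - 2 * link_text       # penalize link-heavy (boilerplate) nodes

best = max(soup.find_all("div"), key=score)
print(best.get("id"), "->", best.get_text(" ", strip=True)[:60])
```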

  4. Searching Biomedical Text: Towards Maximum Relevant Results

    OpenAIRE

    Galde, Ola; Sevaldsen, John Harald

    2006-01-01

    The amount of biomedical information available to users today is large and increasing. The ability to precisely retrieve desired information is vital in order to utilize available knowledge. In this work we investigated how to improve the relevance of biomedical search results. Using the Lucene Java API we applied a series of information retrieval techniques to search in biomedical data. The techniques ranged from basic stemming and stop-word removal to more advanced methods like user relevan...

  5. Spectra in the chaotic region: Methods for extracting dynamic information

    Energy Technology Data Exchange (ETDEWEB)

    Gomez Llorente, J.M.; Zakrzewski, J.; Taylor, H.S.; Kulander, K.C.

    1989-02-01

    Nonlinear dynamics is applied to chaotic unassignable atomic and molecular spectra with the aim of extracting detailed information about regular dynamic motions that exist over short intervals of time. It is shown how this motion can be extracted from high resolution spectra by doing low resolution studies or by Fourier transforming limited regions of the spectrum. These motions mimic those of periodic orbits (PO) and are inserts into the dominant chaotic motion. Considering these inserts and the PO as a dynamically decoupled region of space, resonant scattering theory and stabilization methods enable us to compute ladders of resonant states which interact with the chaotic quasicontinuum computed in principle from basis sets placed off the PO. The interaction of the resonances with the quasicontinuum explains the low resolution spectra seen in such experiments. It also allows one to associate low resolution features with a particular PO. The motion on the PO thereby supplies the molecular movements whose quantization causes the low resolution spectra. Characteristic properties of the periodic orbit based resonances are discussed. The method is illustrated on the photoabsorption spectrum of the hydrogen atom in a strong magnetic field and on the photodissociation spectrum of H3+. Other molecular systems which are currently under investigation using this formalism are also mentioned.

  6. A nonparametric approach to extract information from interspike interval data.

    Science.gov (United States)

    Rossoni, Enrico; Feng, Jianfeng

    2006-01-15

    In this work we develop an approach to extracting information from neural spike trains. Using the expectation-maximization (EM) algorithm, interspike interval data from experiments and simulations are fitted by mixtures of distributions, including Gamma, inverse Gaussian, log-normal, and the distribution of the interspike intervals of the leaky integrate-and-fire model. In terms of the Kolmogorov-Smirnov test for goodness-of-fit, our approach is proved successful (P>0.05) in fitting benchmark data for which a classical parametric approach has been shown to fail before. In addition, we present a novel method to fit mixture models to censored data, and discuss two examples of the application of such a method, which correspond to the case of multiple-trial and multielectrode array data. A MATLAB implementation of the algorithm is available for download from .
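
    One convenient member of the model family mentioned above is a mixture of log-normals, which reduces to EM for a Gaussian mixture after a log transform. The sketch below fits such a mixture to synthetic interspike intervals and checks goodness-of-fit with a Kolmogorov-Smirnov test; the data, component count, and distribution choice are assumptions for the example.

```python
# Sketch: fit a two-component log-normal mixture to synthetic interspike
# intervals via EM (a Gaussian mixture on log-intervals), then check the
# fit with a Kolmogorov-Smirnov test as in the record above.
import numpy as np
from scipy import stats
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(1)
isi = np.concatenate([rng.lognormal(-3.0, 0.3, 700),    # burst intervals
                      rng.lognormal(-1.0, 0.4, 300)])   # quiescent intervals

gmm = GaussianMixture(n_components=2, random_state=0).fit(np.log(isi)[:, None])

def mixture_cdf(x):
    w = gmm.weights_
    mu = gmm.means_.ravel()
    sd = np.sqrt(gmm.covariances_).ravel()
    return sum(wk * stats.norm.cdf(np.log(x), m, s) for wk, m, s in zip(w, mu, sd))

print(stats.kstest(isi, mixture_cdf))   # P > 0.05 indicates an acceptable fit
```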

  7. Domain-independent information extraction in unstructured text

    Energy Technology Data Exchange (ETDEWEB)

    Irwin, N.H. [Sandia National Labs., Albuquerque, NM (United States). Software Surety Dept.

    1996-09-01

    Extracting information from unstructured text has become an important research area in recent years due to the large amount of text now electronically available. This status report describes the findings and work done during the second year of a two-year Laboratory Directed Research and Development Project. Building on the first year's work of identifying important entities, this report details techniques used to group words into semantic categories and to output templates containing selective document content. Using word profiles and category clustering derived during a training run, the time-consuming knowledge-building task can be avoided. Though the output still lacks in completeness when compared to systems with domain-specific knowledge bases, the results do look promising. The two approaches are compatible and could complement each other within the same system. Domain-independent approaches retain appeal as a system that adapts and learns will soon outpace a system with any amount of a priori knowledge.

  8. Extracting kinetic information from literature with KineticRE.

    Science.gov (United States)

    Freitas, Ana Alão; Costa, Hugo; Rocha, Miguel; Rocha, Isabel

    2015-01-01

    To better understand the dynamic behavior of metabolic networks under a wide variety of conditions, the field of Systems Biology has increased its interest in the use of kinetic models. The databases available these days do not contain enough data on this topic. Given that a significant part of the relevant information for the development of such models is still spread widely across the literature, it becomes essential to develop specific and powerful text mining tools to collect these data. In this context, the main objective of this work is the development of a text mining tool to extract, from the scientific literature, kinetic parameters, their respective values, and their relations with enzymes and metabolites. The proposed approach integrates the development of a novel plug-in over the text mining framework @Note2. In the end, the pipeline developed was validated with a case study on Kluyveromyces lactis, spanning the analysis and results of 20 full-text documents. PMID:26673933

  9. Querying and Extracting Timeline Information from Road Traffic Sensor Data.

    Science.gov (United States)

    Imawan, Ardi; Indikawati, Fitri Indra; Kwon, Joonho; Rao, Praveen

    2016-08-23

    The escalation of traffic congestion in urban cities has urged many countries to use intelligent transportation system (ITS) centers to collect historical traffic sensor data from multiple heterogeneous sources. By analyzing historical traffic data, we can obtain valuable insights into traffic behavior. Many existing applications have been proposed with limited analysis results because of the inability to cope with several types of analytical queries. In this paper, we propose the QET (querying and extracting timeline information) system-a novel analytical query processing method based on a timeline model for road traffic sensor data. To address query performance, we build a TQ-index (timeline query-index) that exploits spatio-temporal features of timeline modeling. We also propose an intuitive timeline visualization method to display congestion events obtained from specified query parameters. In addition, we demonstrate the benefit of our system through a performance evaluation using a Busan ITS dataset and a Seattle freeway dataset.

  10. Querying and Extracting Timeline Information from Road Traffic Sensor Data

    Directory of Open Access Journals (Sweden)

    Ardi Imawan

    2016-08-01

    Full Text Available The escalation of traffic congestion in urban cities has urged many countries to use intelligent transportation system (ITS) centers to collect historical traffic sensor data from multiple heterogeneous sources. By analyzing historical traffic data, we can obtain valuable insights into traffic behavior. Many existing applications have been proposed with limited analysis results because of the inability to cope with several types of analytical queries. In this paper, we propose the QET (querying and extracting timeline information) system, a novel analytical query processing method based on a timeline model for road traffic sensor data. To address query performance, we build a TQ-index (timeline query-index) that exploits spatio-temporal features of timeline modeling. We also propose an intuitive timeline visualization method to display congestion events obtained from specified query parameters. In addition, we demonstrate the benefit of our system through a performance evaluation using a Busan ITS dataset and a Seattle freeway dataset.

  11. Querying and Extracting Timeline Information from Road Traffic Sensor Data.

    Science.gov (United States)

    Imawan, Ardi; Indikawati, Fitri Indra; Kwon, Joonho; Rao, Praveen

    2016-01-01

    The escalation of traffic congestion in urban cities has urged many countries to use intelligent transportation system (ITS) centers to collect historical traffic sensor data from multiple heterogeneous sources. By analyzing historical traffic data, we can obtain valuable insights into traffic behavior. Many existing applications have offered limited analysis results because of the inability to cope with several types of analytical queries. In this paper, we propose the QET (querying and extracting timeline information) system, a novel analytical query processing method based on a timeline model for road traffic sensor data. To address query performance, we build a TQ-index (timeline query-index) that exploits the spatio-temporal features of timeline modeling. We also propose an intuitive timeline visualization method to display congestion events obtained from specified query parameters. In addition, we demonstrate the benefit of our system through a performance evaluation using a Busan ITS dataset and a Seattle freeway dataset. PMID:27563900

  12. Enriching a biomedical event corpus with meta-knowledge annotation

    OpenAIRE

    Thompson Paul; Nawaz Raheel; McNaught John; Ananiadou Sophia

    2011-01-01

    Abstract Background Biomedical papers contain rich information about entities, facts and events of biological relevance. To discover these automatically, we use text mining techniques, which rely on annotated corpora for training. In order to extract protein-protein interactions, genotype-phenotype/gene-disease associations, etc., we rely on event corpora that are annotated with classified, structured representations of important facts and findings contained within text. These provide an impo...

  13. [The Internet and clinical medicine. An introduction to the biomedical information services available in electronic form on the "net of all nets"].

    Science.gov (United States)

    Tringali, M; Iannucci, P; Zani, M

    1996-05-01

    Clinical users' expectations of medical informatics are evolving due to the wide availability of biomedical information services on the Internet. Thanks to hypertext and advanced browsing tools, users with no informatics expertise can travel the Internet and easily gain access to textual databases. With a multimedia computer, other kinds of information can be retrieved: images, sounds and audiovisual documents. Basic Internet services (electronic mail, discussion lists, file transfer protocol, terminal emulation) can be accessed from a wide range of hardware. However, the real power of a world-scale computer network like the Internet will be unleashed only when its global connectivity is linked to the powerful retrieval capabilities of existing clinical and related databases. While biomedical journal editors and other medical information providers are on their way to offering electronic versions of their paper-based products, at the leading edge of this world-scale process is the USA National Library of Medicine, with the Internet-compatible version of its Grateful Med software expected to be launched during 1996.

  14. Multilingual Biomedical Dictionary

    OpenAIRE

    Daumke, Philipp; Markó, Kornél; Poprat, Michael; Schulz, Stefan

    2005-01-01

    We present a unique technique to create a multilingual biomedical dictionary, based on a methodology called Morpho-Semantic indexing. Our approach closes a gap caused by the absence of freely available multilingual medical dictionaries and the lack of accuracy of non-medical electronic translation tools. We first explain the underlying technology, followed by a description of the dictionary interface, which makes use of a multilingual subword thesaurus and of statistical inform...

  15. Multilingual biomedical dictionary.

    Science.gov (United States)

    Daumke, Philipp; Markó, Kornél; Poprat, Michael; Schulz, Stefan

    2005-01-01

    We present a unique technique to create a multilingual biomedical dictionary, based on a methodology called Morpho-Semantic indexing. Our approach closes a gap caused by the absence of freely available multilingual medical dictionaries and the lack of accuracy of non-medical electronic translation tools. We first explain the underlying technology, followed by a description of the dictionary interface, which makes use of a multilingual subword thesaurus and of statistical information from a domain-specific, multilingual corpus.
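
    A toy sketch of the morpho-semantic indexing idea: words in different languages are segmented into subwords that map to shared semantic identifiers, so cross-language matches fall out of the identifier sequences. The subword inventory below is invented and far smaller than the authors' thesaurus:

```python
# Toy subword thesaurus: surface morphemes in two languages mapped to
# shared semantic identifiers (the real multilingual inventory is far larger).
SUBWORDS = {
    "de": {"herz": "#heart", "entzuend": "#inflamm", "ung": "#process"},
    "en": {"card": "#heart", "itis": "#inflamm"},
}

def segment(word, lexicon):
    """Greedy longest-match segmentation of a word into known subwords."""
    word, ids, i = word.lower(), [], 0
    while i < len(word):
        for j in range(len(word), i, -1):  # longest match first
            if word[i:j] in lexicon:
                ids.append(lexicon[word[i:j]])
                i = j
                break
        else:
            i += 1  # skip a character with no known subword
    return ids

# 'Herzentzuendung' (German) and 'carditis' (English) share identifiers,
# so the dictionary can link them as translation candidates.
print(segment("Herzentzuendung", SUBWORDS["de"]))  # ['#heart', '#inflamm', '#process']
print(segment("carditis", SUBWORDS["en"]))         # ['#heart', '#inflamm']
```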

  16. Network fingerprint: a knowledge-based characterization of biomedical networks

    Science.gov (United States)

    Cui, Xiuliang; He, Haochen; He, Fuchu; Wang, Shengqi; Li, Fei; Bo, Xiaochen

    2015-01-01

    It can be difficult for biomedical researchers to understand complex molecular networks due to their unfamiliarity with the mathematical concepts employed. To represent molecular networks with clear meanings and familiar forms for biomedical researchers, we introduce a knowledge-based computational framework to decipher biomedical networks by making systematic comparisons to well-studied “basic networks”. A biomedical network is characterized as a spectrum-like vector called “network fingerprint”, which contains similarities to basic networks. This knowledge-based multidimensional characterization provides a more intuitive way to decipher molecular networks, especially for large-scale network comparisons and clustering analyses. As an example, we extracted network fingerprints of 44 disease networks in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. The comparisons among the network fingerprints of disease networks revealed informative disease-disease and disease-signaling pathway associations, illustrating that the network fingerprinting framework will lead to new approaches for better understanding of biomedical networks. PMID:26307246
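
    A hedged sketch of the fingerprint idea: characterize a target network as a vector of similarities to well-studied basic network models. The similarity used below (cosine similarity of degree histograms, via networkx) is a simple stand-in, not the paper's measure:

```python
import networkx as nx
import numpy as np

def degree_profile(g, max_deg=20):
    """Normalized degree histogram, truncated/padded to a fixed length."""
    hist = np.zeros(max_deg)
    counts = nx.degree_histogram(g)[:max_deg]
    hist[: len(counts)] = counts
    return hist / hist.sum()

def cosine(u, v):
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

# Well-studied "basic networks" serving as the reference spectrum.
basics = {
    "random": nx.erdos_renyi_graph(200, 0.05, seed=1),
    "scale-free": nx.barabasi_albert_graph(200, 5, seed=1),
    "small-world": nx.watts_strogatz_graph(200, 10, 0.1, seed=1),
}

target = nx.barabasi_albert_graph(200, 5, seed=7)  # stand-in for a disease network
fingerprint = {name: round(cosine(degree_profile(target), degree_profile(g)), 3)
               for name, g in basics.items()}
print(fingerprint)  # highest similarity expected for the scale-free model
```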

  17. Extracting information in spike time patterns with wavelets and information theory.

    Science.gov (United States)

    Lopes-dos-Santos, Vítor; Panzeri, Stefano; Kayser, Christoph; Diamond, Mathew E; Quian Quiroga, Rodrigo

    2015-02-01

    We present a new method to assess the information carried by temporal patterns in spike trains. The method first performs a wavelet decomposition of the spike trains, then uses Shannon information to select a subset of coefficients carrying information, and finally assesses timing information in terms of decoding performance: the ability to identify the presented stimuli from spike train patterns. We show that the method allows: 1) a robust assessment of the information carried by spike time patterns even when this is distributed across multiple time scales and time points; 2) an effective denoising of the raster plots that improves the estimate of stimulus tuning of spike trains; and 3) an assessment of the information carried by temporally coordinated spikes across neurons. Using simulated data, we demonstrate that the Wavelet-Information (WI) method performs better and is more robust to spike time-jitter, background noise, and sample size than well-established approaches, such as principal component analysis, direct estimates of information from digitized spike trains, or a metric-based method. Furthermore, when applied to real spike trains from monkey auditory cortex and from rat barrel cortex, the WI method allows extracting larger amounts of spike timing information. Importantly, the fact that the WI method incorporates multiple time scales makes it robust to the choice of partly arbitrary parameters such as temporal resolution, response window length, number of response features considered, and the number of available trials. These results highlight the potential of the proposed method for accurate and objective assessments of how spike timing encodes information.
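
    A rough sketch of the pipeline under stated assumptions: binned spike trains are wavelet-decomposed, coefficients are ranked by a mutual-information score, and timing information is assessed as decoding accuracy. The wavelet family, selection rule, and decoder below are simplified stand-ins for the paper's exact choices, and the spike data are synthetic:

```python
import numpy as np
import pywt
from sklearn.feature_selection import mutual_info_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)

# Synthetic data: 200 trials x 64 time bins; the stimulus shifts spike timing.
n_trials, n_bins = 200, 64
y = rng.integers(0, 2, n_trials)  # two stimuli
X = rng.poisson(0.2, (n_trials, n_bins)).astype(float)
for i, stim in enumerate(y):
    X[i, 10 + 20 * stim] += rng.poisson(3)  # stimulus-locked spikes

# 1) Wavelet decomposition of each trial's binned spike train.
coeffs = np.array([np.concatenate(pywt.wavedec(x, "haar", level=4)) for x in X])

# 2) Keep the coefficients carrying the most information about the stimulus.
mi = mutual_info_classif(coeffs, y, random_state=0)
selected = coeffs[:, np.argsort(mi)[-10:]]

# 3) Assess timing information as decoding performance.
acc = cross_val_score(LogisticRegression(max_iter=1000), selected, y, cv=5)
print(f"decoding accuracy: {acc.mean():.2f}")
```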

  18. ANALYSIS OF INTELLIGENT DATA MINING FOR INFORMATION EXTRACTION USING JAVA AGENT DEVELOPMENT ENVIRONMENT PLATFORM

    Directory of Open Access Journals (Sweden)

    M. Vinoth Kumar

    2013-01-01

    Full Text Available In this study, the problem of unstructured information extraction is analyzed and the need for a new information extraction algorithm is justified. We propose Intelligent Information Extraction using the Java Agent Development Environment (JADE) as an optimal solution for intelligent information extraction. The proposed algorithm first assigns intelligent agents to gather data, which they then classify, match and organize to construct sequential information for a user query. It improves the efficiency and performance of retrieving proper information results as a sequence that satisfies the user's needs. It gives the user the needed documents based on a similarity mechanism that matches the query against relevant documents. The results obtained from the Intelligent Information Extraction are optimal.

  19. a Probability-Based Statistical Method to Extract Water Body of TM Images with Missing Information

    Science.gov (United States)

    Lian, Shizhong; Chen, Jiangping; Luo, Minghai

    2016-06-01

    Water information cannot be accurately extracted from TM images in which true information is lost to blocking clouds and missing data stripes. Since water is continuously distributed under natural conditions, this paper proposes a new method of water body extraction based on probability statistics to improve the accuracy of water information extraction from TM images with missing information. Different kinds of disturbance from clouds and missing data stripes are simulated, and water information is extracted from the simulated images using global histogram matching, local histogram matching, and the probability-based statistical method. Experiments show that a smaller Areal Error and higher Boundary Recall can be obtained using this method compared with the conventional methods.
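
    One of the baselines named above, global histogram matching, can be sketched directly: map a degraded band's values onto a reference band's distribution through their empirical CDFs (the probability-based statistical method itself is not detailed enough here to reproduce):

```python
import numpy as np

def histogram_match(source, reference):
    """Map source pixel values so their distribution matches the reference."""
    src = source.ravel()
    # Empirical CDF position of each source pixel.
    src_sorted = np.sort(src)
    src_cdf = np.searchsorted(src_sorted, src, side="right") / src.size
    # Reference quantile function evaluated at those CDF positions.
    ref_sorted = np.sort(reference.ravel())
    ref_quantiles = np.linspace(0, 1, ref_sorted.size)
    matched = np.interp(src_cdf, ref_quantiles, ref_sorted)
    return matched.reshape(source.shape)

# Toy example: a dim, noisy "stripe-damaged" band matched to a clean band.
rng = np.random.default_rng(0)
clean = rng.normal(120, 25, (100, 100))
damaged = rng.normal(60, 10, (100, 100))
fixed = histogram_match(damaged, clean)
print(round(fixed.mean(), 1), round(fixed.std(), 1))  # close to 120, 25
```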

  20. Automatic Data Extraction from Websites for Generating Aquatic Product Market Information

    Institute of Scientific and Technical Information of China (English)

    YUAN Hong-chun; CHEN Ying; SUN Yue-fu

    2006-01-01

    The massive web-based information resources have led to an increasing demand for effective automatic retrieval of target information for web applications. This paper introduces a web-based data extraction tool that deploys various algorithms to locate, extract and filter tabular data from HTML pages and to transform them into new web-based representations. The tool has been applied in an aquaculture web application platform for extracting and generating aquatic product market information. Results prove that this tool is very effective in extracting the required data from web pages.
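
    The paper's tool is not available here, but the general task it describes, locating and extracting tabular data from HTML pages, can be illustrated with pandas; the HTML fragment and product names are invented:

```python
import io

import pandas as pd  # read_html needs an HTML parser such as lxml installed

# Invented HTML fragment standing in for an aquatic-product market page.
html = """
<html><body><h1>Daily market prices</h1>
<table>
  <tr><th>Product</th><th>Market</th><th>Price (CNY/kg)</th></tr>
  <tr><td>Crucian carp</td><td>Shanghai</td><td>14.2</td></tr>
  <tr><td>Shrimp</td><td>Qingdao</td><td>52.8</td></tr>
</table></body></html>
"""

# read_html scans the page, parses every <table>, and returns DataFrames.
tables = pd.read_html(io.StringIO(html))
prices = tables[0]
print(prices)
# Transform into a new representation, e.g. records for a web service.
print(prices.to_dict(orient="records"))
```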

  1. Semantator: semantic annotator for converting biomedical text to linked data.

    Science.gov (United States)

    Tao, Cui; Song, Dezhao; Sharma, Deepak; Chute, Christopher G

    2013-10-01

    More than 80% of biomedical data is embedded in plain text. The unstructured nature of these text-based documents makes it challenging to easily browse and query the data of interest in them. One approach to facilitate browsing and querying biomedical text is to convert the plain text to a linked web of data, i.e., converting data originally in free text to structured formats with defined meta-level semantics. In this paper, we introduce Semantator (Semantic Annotator), a semantic-web-based environment for annotating data of interest in biomedical documents, browsing and querying the annotated data, and interactively refining annotation results if needed. Through Semantator, information of interest can be annotated either manually or semi-automatically using plug-in information extraction tools. The annotated results are stored in RDF and can be queried using the SPARQL query language. In addition, semantic reasoners can be directly applied to the annotated data for consistency checking and knowledge inference. Semantator has been released online and has been used by the biomedical ontology community, which provided positive feedback. Our evaluation results indicated that (1) Semantator can perform the annotation functionalities as designed; (2) Semantator can be adopted in real applications in clinical and translational research; and (3) the annotated results using Semantator can be easily used in Semantic-web-based reasoning tools for further inference.
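
    A small sketch of the annotate-store-query cycle described above, using rdflib; the namespace, class names, and annotated sentence are illustrative rather than Semantator's actual schema:

```python
from rdflib import Graph, Literal, Namespace, RDF

EX = Namespace("http://example.org/annotation#")  # hypothetical schema
g = Graph()
g.bind("ex", EX)

# Annotate a span of text as a Disease mention (hand-made triples).
g.add((EX.ann1, RDF.type, EX.Disease))
g.add((EX.ann1, EX.coversText, Literal("type 2 diabetes")))
g.add((EX.ann1, EX.foundInDocument, Literal("report-42.txt")))

# Query the annotated data back with SPARQL.
q = """
PREFIX ex: <http://example.org/annotation#>
SELECT ?text ?doc WHERE {
  ?a a ex:Disease ; ex:coversText ?text ; ex:foundInDocument ?doc .
}
"""
for text, doc in g.query(q):
    print(text, "--", doc)
```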

  2. Biomedical engineering and nanotechnology

    International Nuclear Information System (INIS)

    This book is predominantly a compilation of papers presented at the conference, which focused on developments in biomedical materials, biomedical devices and instrumentation, biomedical effects of electromagnetic radiation, electrotherapy, radiotherapy, biosensors, biotechnology, bioengineering, tissue engineering, clinical engineering and surgical planning, medical imaging, hospital system management, biomedical education, biomedical industry and society, bioinformatics, structured nanomaterials for biomedical applications, nano-composites, nano-medicine, synthesis of nanomaterials, and nano science and technology development. The papers presented herein contain scientific substance serving researchers from the fields of biomedicine, biomedical engineering, materials science and nanotechnology. Papers relevant to INIS are indexed separately

  3. Semantic Preview Benefit in English: Individual Differences in the Extraction and Use of Parafoveal Semantic Information

    Science.gov (United States)

    Veldre, Aaron; Andrews, Sally

    2016-01-01

    Although there is robust evidence that skilled readers of English extract and use orthographic and phonological information from the parafovea to facilitate word identification, semantic preview benefits have been elusive. We sought to establish whether individual differences in the extraction and/or use of parafoveal semantic information could…

  4. Information Extraction, Data Integration, and Uncertain Data Management: The State of The Art

    NARCIS (Netherlands)

    Habib, Mena B.; Keulen, van Maurice

    2011-01-01

    Information extraction, data integration, and uncertain data management are different areas of research that have received great attention in the last two decades. Much research has tackled these areas individually. However, information extraction systems should be integrated with data integration meth

  5. Tagline: Information Extraction for Semi-Structured Text Elements in Medical Progress Notes

    Science.gov (United States)

    Finch, Dezon Kile

    2012-01-01

    Text analysis has become an important research activity in the Department of Veterans Affairs (VA). Statistical text mining and natural language processing have been shown to be very effective for extracting useful information from medical documents. However, neither of these techniques is effective at extracting the information stored in…

  6. Population information extraction in Chaohu watershed based on RS and GIS

    Institute of Scientific and Technical Information of China (English)

    2003-01-01

    It is always difficult to extract population information for a small watershed during regional environmental assessment. This report adopted compound areal interpolation to study the population of the Chaohu watershed using GIS techniques and land-use interpretation data from remote sensing. The result indicated that the method is effective for extracting population information for a small watershed.
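
    A toy sketch of the compound areal interpolation idea: disaggregate an administrative unit's population across land-use classes with class-specific weights, then reaggregate the shares falling inside the watershed. All numbers and weights below are invented:

```python
# Land-use areas (km^2) of one administrative unit, from RS interpretation.
unit_landuse = {"urban": 12.0, "rural settlement": 30.0, "cropland": 80.0,
                "forest": 60.0, "water": 8.0}
unit_population = 150_000

# Relative population densities per land-use class (invented weights).
weights = {"urban": 40.0, "rural settlement": 10.0, "cropland": 1.0,
           "forest": 0.0, "water": 0.0}

# Share of each land-use patch inside the watershed (from a GIS overlay).
inside_share = {"urban": 0.5, "rural settlement": 0.7, "cropland": 0.9,
                "forest": 1.0, "water": 1.0}

# Disaggregate: population per class proportional to weight * area.
total_weight = sum(weights[c] * a for c, a in unit_landuse.items())
pop_by_class = {c: unit_population * weights[c] * a / total_weight
                for c, a in unit_landuse.items()}

# Reaggregate onto the watershed.
watershed_pop = sum(pop_by_class[c] * inside_share[c] for c in unit_landuse)
print(f"estimated watershed population: {watershed_pop:,.0f}")
```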

  7. A Framework For Extracting Information From Web Using VTD-XML‘s XPath

    OpenAIRE

    C. Subhashini; Dr.Arti Arya

    2012-01-01

    The exponential growth of the WWW (World Wide Web) is the cause of a vast pool of information as well as several challenges posed by it, such as extracting potentially useful and unknown information from the WWW. Many websites are built with HTML; because of its unstructured layout, it is difficult to obtain effective and precise data from the web using HTML. The advent of XML (Extensible Markup Language) offers a better solution for extracting useful knowledge from the WWW. Web Data Extraction based on XML Tec...

  8. Medicaid Analytic eXtract (MAX) General Information

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Medicaid Analytic eXtract (MAX) data is a set of person-level data files on Medicaid eligibility, service utilization, and payments. The MAX data are created to...

  9. Biomedical Literature Exploration through Latent Semantics

    Directory of Open Access Journals (Sweden)

    Hugo ARAÚJO

    2013-08-01

    Full Text Available The fast-increasing number of articles published in the biomedical field is making it difficult for researchers to efficiently exploit this wealth of information. As a way of overcoming these limitations and enabling a more efficient use of the literature, we propose an approach for structuring the results of a literature search based on the latent semantic information extracted from a corpus. Moreover, we show how the results of the Latent Semantic Analysis method can be adapted so as to evidence differences between the results of different searches. We also propose different visualization techniques that can be applied to explore these results. Used in combination, these techniques could empower users with tools for literature-guided knowledge exploration and discovery.
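
    The core step, latent semantic analysis over a search-result corpus, can be sketched with TF-IDF plus truncated SVD; the toy abstracts are invented and the visualization layers described above are omitted:

```python
from sklearn.decomposition import TruncatedSVD
from sklearn.feature_extraction.text import TfidfVectorizer

# Invented stand-ins for abstracts returned by a literature search.
docs = [
    "insulin receptor signaling in type 2 diabetes",
    "beta cell insulin secretion and glucose metabolism",
    "tau protein aggregation in alzheimer disease",
    "amyloid plaques and neurodegeneration in alzheimer",
]

tfidf = TfidfVectorizer(stop_words="english")
X = tfidf.fit_transform(docs)

# Latent semantic space: documents projected onto 2 latent topics.
lsa = TruncatedSVD(n_components=2, random_state=0)
Z = lsa.fit_transform(X)
for doc, coords in zip(docs, Z.round(2)):
    print(coords, doc[:45])
```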

  10. Extracting the symmetry energy information with heavy ion collisions

    International Nuclear Information System (INIS)

    Nuclear symmetry energy plays an important role in the properties of nuclei and neutron stars. Since theoretical predictions of the density dependence of the symmetry energy S(ρ) from microscopic nucleon-nucleon interactions show large uncertainties, especially in the region of suprasaturation density, constraining the density dependence of the symmetry energy has become one of the main goals in nuclear physics and has stimulated many theoretical and experimental studies. In this paper, we review the ImQMD05 code and its description of charge distribution, collective flow and nuclear-nuclear stopping. By comparing ImQMD05 predictions with data, the isoscalar part of the nucleonic mean field and the energy and density dependence of the in-medium NN cross section in ImQMD05 were determined. In order to extract the symmetry energy information by comparing ImQMD05 calculations with the data for 112,124Sn + 112,124Sn at Ebeam = 50 MeV/u, the influence of the symmetry potential and the in-medium NN cross section on the isospin-sensitive observables of intermediate-energy heavy-ion collisions was investigated. Focusing on the region above the Fermi energy, our results show that the symmetry potential plays a more important role in the experimental observables, such as the double neutron-to-proton ratio and the isospin transport ratio Ri, than the in-medium nucleon-nucleon cross section does. Since the copious production of intermediate mass fragments is a distinguishing feature of intermediate-energy heavy-ion collisions, we also examined the influence of cluster emission on the isospin transport ratio using different isospin tracers. The values of the isospin transport ratios with the tracer defined by the isospin asymmetry of the heaviest fragments with Z≥20 in the projectile region are greater than those obtained from projectile residues (emitting source). This phenomenon can be tested experimentally. By comparing the ImQMD05 predictions with the data for three observables, the

  11. Frontiers in biomedical engineering and biotechnology.

    Science.gov (United States)

    Liu, Feng; Goodarzi, Ali; Wang, Haifeng; Stasiak, Joanna; Sun, Jianbo; Zhou, Yu

    2014-01-01

    The 2nd International Conference on Biomedical Engineering and Biotechnology (iCBEB 2013), held in Wuhan on 11–13 October 2013, is an annual conference that aims to provide an opportunity for international and national researchers and practitioners to present the most recent advances and future challenges in the fields of biomedical information, biomedical engineering and biotechnology. The papers published in this issue were selected from this conference; they reflect the frontier of the field of biomedical engineering and biotechnology, which in particular has helped improve the level of clinical diagnosis in medical work.

  12. Telemedicine optoelectronic biomedical data processing system

    Science.gov (United States)

    Prosolovska, Vita V.

    2010-08-01

    The telemedicine optoelectronic biomedical data processing system was created to share medical information in support of health care oversight and timely, rapid response to crises. The system includes the following main blocks: a bioprocessor, an analog-digital converter for biomedical images, an optoelectronic module for image processing, an optoelectronic module for parallel recording and storage of biomedical images, and a matrix screen display for biomedical images. The temporal characteristics of the blocks are determined by the triggering optoelectronic couple in the analog-digital converters and by the imaging time of the matrix screen. The element base for the hardware implementation of the developed matrix screen is integrated optoelectronic couples produced by selective epitaxy.

  13. Extraction of spatio-temporal information of earthquake event based on semantic technology

    Science.gov (United States)

    Fan, Hong; Guo, Dan; Li, Huaiyuan

    2015-12-01

    In this paper a web information extraction method is presented that identifies a variety of thematic events using an event knowledge framework derived from text training, and then applies syntactic analysis to extract the key information of each event. By combining textual semantic information with domain knowledge of the event, the method makes extraction of the information people are interested in more accurate. In this paper, web-based earthquake news extraction is taken as an example. The paper first outlines the overall approach and then details the key algorithms and experiments of seismic event extraction. Finally, accuracy analysis and evaluation experiments are conducted, demonstrating that the proposed method is a promising way of mining hot events.

  14. Semantic information extracting system for classification of radiological reports in radiology information system (RIS)

    Science.gov (United States)

    Shi, Liehang; Ling, Tonghui; Zhang, Jianguo

    2016-03-01

    Radiologists currently use a variety of terminologies and standards in most hospitals in China, and there may even be multiple terminologies in use for different sections of one department. In this presentation, we introduce a medical semantic comprehension system (MedSCS) to extract semantic information about clinical findings and conclusions from free-text radiology reports so that the reports can be classified correctly based on medical term indexing standards such as RadLex or SNOMED CT. Our system (MedSCS) is based on both rule-based methods and statistics-based methods, which improves the performance and scalability of MedSCS. In order to evaluate the overall performance of the system and measure the accuracy of the outcomes, we developed computation methods to calculate precision rate, recall rate, F-score and exact confidence intervals.
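
    For reference, the first three evaluation measures named above reduce to simple counts over extracted items; a minimal sketch with invented gold and predicted findings (the exact confidence interval computation is omitted):

```python
def prf(gold, predicted):
    """Precision, recall and F-score from sets of extracted items."""
    gold, predicted = set(gold), set(predicted)
    tp = len(gold & predicted)
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(gold) if gold else 0.0
    f = (2 * precision * recall / (precision + recall)
         if precision + recall else 0.0)
    return precision, recall, f

# Invented example: findings extracted from one radiology report.
gold = {"pleural effusion", "cardiomegaly", "atelectasis"}
pred = {"pleural effusion", "cardiomegaly", "pneumothorax"}
p, r, f = prf(gold, pred)
print(f"precision={p:.2f} recall={r:.2f} F={f:.2f}")  # 0.67 0.67 0.67
```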

  15. Study on methods and techniques of aeroradiometric weak information extraction for sandstone-hosted uranium deposits based on GIS

    International Nuclear Information System (INIS)

    Weak information extraction is one of the important research topics in current sandstone-type uranium prospecting in China. This paper introduces the connotation of aeroradiometric weak information extraction, discusses the formation theories of aeroradiometric weak information, and establishes some effective mathematical models for weak information extraction. Models for weak information extraction are realized on a GIS software platform, and application tests of weak information extraction are completed in known uranium mineralized areas. Research results prove that the prospective areas of sandstone-type uranium deposits can be rapidly delineated by extracting aeroradiometric weak information. (authors)

  16. Vision Based Extraction of Nutrition Information from Skewed Nutrition Labels

    OpenAIRE

    Zaman, Tanwir

    2016-01-01

    An important component of a healthy diet is the comprehension and retention of nutritional information and understanding of how different food items and nutritional constituents affect our bodies. In the U.S. and many other countries, nutritional information is primarily conveyed to consumers through nutrition labels (NLs) which can be found in all packaged food products. However, sometimes it becomes really challenging to utilize all this information available in these NLs even for consumers...

  17. Extracting local information from crowds through betting markets

    Science.gov (United States)

    Weijs, Steven

    2015-04-01

    In this research, a set-up is considered in which users can bet against a forecasting agency to challenge their probabilistic forecasts. From an information theory standpoint, a reward structure is considered that either provides the forecasting agency with better information, paying the successful providers of information for their winning bets, or funds excellent forecasting agencies through users that think they know better. Especially for local forecasts, the approach may help to diagnose model biases and to identify local predictive information that can be incorporated in the models. The challenges and opportunities for implementing such a system in practice are also discussed.

  18. BIG: a Grid Portal for Biomedical Data and Images

    Directory of Open Access Journals (Sweden)

    Giovanni Aloisio

    2004-06-01

    Full Text Available Modern management of biomedical systems involves the use of many distributed resources, such as high-performance computational resources to analyze biomedical data, mass storage systems to store them, medical instruments (microscopes, tomographs, etc.), and advanced visualization and rendering tools. Grids offer the computational power, security and availability needed by such novel applications. This paper presents BIG (Biomedical Imaging Grid), a Web-based Grid portal for the management of biomedical information (data and images) in a distributed environment. BIG is an interactive environment that deals with complex user requests regarding the acquisition of biomedical data and the "processing" and "delivering" of biomedical images, using the power and security of Computational Grids.

  19. Environmental/Biomedical Terminology Index

    Energy Technology Data Exchange (ETDEWEB)

    Huffstetler, J.K.; Dailey, N.S.; Rickert, L.W.; Chilton, B.D.

    1976-12-01

    The Information Center Complex (ICC), a centrally administered group of information centers, provides information support to environmental and biomedical research groups and others within and outside Oak Ridge National Laboratory. In-house data base building and development of specialized document collections are important elements of the ongoing activities of these centers. ICC groups must be concerned with language which will adequately classify and insure retrievability of document records. Language control problems are compounded when the complexity of modern scientific problem solving demands an interdisciplinary approach. Although there are several word lists, indexes, and thesauri specific to various scientific disciplines usually grouped as Environmental Sciences, no single generally recognized authority can be used as a guide to the terminology of all environmental science. If biomedical terminology for the description of research on environmental effects is also needed, the problem becomes even more complex. The building of a word list which can be used as a general guide to the environmental/biomedical sciences has been a continuing activity of the Information Center Complex. This activity resulted in the publication of the Environmental Biomedical Terminology Index (EBTI).

  20. Environmental/Biomedical Terminology Index

    International Nuclear Information System (INIS)

    The Information Center Complex (ICC), a centrally administered group of information centers, provides information support to environmental and biomedical research groups and others within and outside Oak Ridge National Laboratory. In-house data base building and development of specialized document collections are important elements of the ongoing activities of these centers. ICC groups must be concerned with language which will adequately classify and insure retrievability of document records. Language control problems are compounded when the complexity of modern scientific problem solving demands an interdisciplinary approach. Although there are several word lists, indexes, and thesauri specific to various scientific disciplines usually grouped as Environmental Sciences, no single generally recognized authority can be used as a guide to the terminology of all environmental science. If biomedical terminology for the description of research on environmental effects is also needed, the problem becomes even more complex. The building of a word list which can be used as a general guide to the environmental/biomedical sciences has been a continuing activity of the Information Center Complex. This activity resulted in the publication of the Environmental Biomedical Terminology Index

  1. NEW METHOD OF EXTRACTING WEAK FAILURE INFORMATION IN GEARBOX BY COMPLEX WAVELET DENOISING

    Institute of Scientific and Technical Information of China (English)

    CHEN Zhixin; XU Jinwu; YANG Debin

    2008-01-01

    Because the extraction of weak failure information is always the difficulty and focus of fault detection, and aiming at the specific statistical properties of the complex wavelet coefficients of gearbox vibration signals, a new signal-denoising method that uses a local adaptive algorithm based on the dual-tree complex wavelet transform (DT-CWT) is introduced to extract weak failure information in gears, especially impulse components. By taking into account the non-Gaussian probability distribution and the statistical dependencies among the wavelet coefficients of such signals, and by taking advantage of the near shift-invariance of the DT-CWT, a higher signal-to-noise ratio (SNR) than common wavelet denoising methods can be obtained. Experiments on extracting periodic impulses from gearbox vibration signals indicate that the method can extract incipient fault features and hidden information from heavy noise, and it has an excellent effect on identifying weak feature signals in gearbox vibration signals.
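
    The paper's locally adaptive DT-CWT thresholding is not reproduced here; as a simpler stand-in illustrating wavelet denoising of an impulsive gear signature, the sketch below soft-thresholds ordinary DWT coefficients with PyWavelets, using the common universal-threshold rule (the signal and parameters are invented):

```python
import numpy as np
import pywt

rng = np.random.default_rng(0)

# Synthetic gearbox-like signal: periodic impulses buried in heavy noise.
n = 2048
signal = np.zeros(n)
signal[::256] = 5.0  # fault impulses every 256 samples
noisy = signal + rng.normal(0, 1.0, n)

# Decompose, soft-threshold the detail coefficients, reconstruct.
coeffs = pywt.wavedec(noisy, "db4", level=5)
sigma = np.median(np.abs(coeffs[-1])) / 0.6745  # noise estimate, finest scale
thr = sigma * np.sqrt(2 * np.log(n))            # universal threshold
denoised_coeffs = [coeffs[0]] + [pywt.threshold(c, thr, mode="soft")
                                 for c in coeffs[1:]]
denoised = pywt.waverec(denoised_coeffs, "db4")[:n]

print("residual noise std:", round((denoised - signal).std(), 2))
```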

  2. Biomedical Imaging Principles and Applications

    CERN Document Server

    Salzer, Reiner

    2012-01-01

    This book presents and describes imaging technologies that can be used to study chemical processes and structural interactions in dynamic systems, principally in biomedical systems. The imaging technologies, largely biomedical imaging technologies such as MRT, fluorescence mapping, Raman mapping, nanoESCA, and CARS microscopy, have been selected according to their application range and to the chemical information content of their data. These technologies allow for the analysis and evaluation of delicate biological samples, which must not be disturbed during the process. Ultimately, this may me

  3. Biomedical image understanding methods and applications

    CERN Document Server

    Lim, Joo-Hwee; Xiong, Wei

    2015-01-01

    A comprehensive guide to understanding and interpreting digital images in medical and functional applications Biomedical Image Understanding focuses on image understanding and semantic interpretation, with clear introductions to related concepts, in-depth theoretical analysis, and detailed descriptions of important biomedical applications. It covers image processing, image filtering, enhancement, de-noising, restoration, and reconstruction; image segmentation and feature extraction; registration; clustering, pattern classification, and data fusion. With contributions from ex

  4. Biomedical engineering fundamentals

    CERN Document Server

    Bronzino, Joseph D

    2014-01-01

    Known as the bible of biomedical engineering, The Biomedical Engineering Handbook, Fourth Edition, sets the standard against which all other references of this nature are measured. As such, it has served as a major resource for both skilled professionals and novices to biomedical engineering.Biomedical Engineering Fundamentals, the first volume of the handbook, presents material from respected scientists with diverse backgrounds in physiological systems, biomechanics, biomaterials, bioelectric phenomena, and neuroengineering. More than three dozen specific topics are examined, including cardia

  5. Photo-mediated green synthesis of silver and zinc oxide nanoparticles using aqueous extracts of two mangrove plant species, Heritiera fomes and Sonneratia apetala and investigation of their biomedical applications.

    Science.gov (United States)

    Thatoi, Priyabrata; Kerry, Rout George; Gouda, Sushanto; Das, Gitishree; Pramanik, Krishna; Thatoi, Hrudayanath; Patra, Jayanta Kumar

    2016-10-01

    Green synthesis using biological agents has been a simple and effective approach for the synthesis of various forms of nanoparticles. The present investigation was intended to synthesize Ag-NPs and ZnO-NPs under photo-conditions using the aqueous extracts of two mangrove plants, namely Heritiera fomes and Sonneratia apetala, and to evaluate their potential biomedical applications. The formation of nanoparticles in the aqueous solutions of H. fomes and S. apetala under exposure to sunlight was validated by a change in color and the formation of monodispersed NPs with a narrow particle size distribution. Fourier transform infrared spectroscopy (FT-IR) revealed oximes and other heterocyclic compounds as those most probably responsible for the reduction and stability of the nanoparticles in the solutions. The synthesized NPs displayed moderate free radical scavenging properties. The anti-inflammatory potential of the ZnO-NPs was comparatively higher than that of the Ag-NPs, at 79% and 69.1% respectively. The Ag-NPs showed significantly high α-amylase inhibition (91.14% and 89.16%), indicating antidiabetic properties. The synthesized NPs showed varied zones of inhibition (9-16 mm) against the tested microbial pathogens. The synthesized nanoparticles possess strong biological activities in terms of antioxidant, anti-inflammatory, antidiabetic and antibacterial potential, which could be utilized in various biological applications by the cosmetic, food and biomedical industries. PMID:27611454

  6. Spintronic platforms for biomedical applications.

    Science.gov (United States)

    Freitas, P P; Cardoso, F A; Martins, V C; Martins, S A M; Loureiro, J; Amaral, J; Chaves, R C; Cardoso, S; Fonseca, L P; Sebastião, A M; Pannetier-Lecoeur, M; Fermon, C

    2012-02-01

    Since the fundamental discovery of the giant magnetoresistance, many spintronic devices have been developed and implemented in our daily life (e.g. information storage and the automotive industry). Lately, advances in sensor technology (higher sensitivity, smaller size) have enabled other applications, namely in the biological area, leading to the emergence of novel biomedical platforms. In particular, the investigation of spintronics and its application to the development of magnetoresistive (MR) biomolecular and biomedical platforms are giving rise to a new class of biomedical diagnostic devices, suitable for bench-top bioassays as well as point-of-care and point-of-use devices. Herein, integrated spintronic biochip platforms for diagnostic and cytometric applications, hybrid systems incorporating magnetoresistive sensors applied to neuroelectronic studies, and biomedical imaging, namely magneto-encephalography and magneto-cardiography, are reviewed. Also, lab-on-a-chip MR-based platforms to perform biological studies at the single-molecule level are discussed. Overall, the potential and main characteristics of such MR-based biomedical devices are addressed, comparing them to existing technologies while giving particular examples of targeted applications. PMID:22146898

  7. Pathophysiologic mechanisms of biomedical nanomaterials.

    Science.gov (United States)

    Wang, Liming; Chen, Chunying

    2016-05-15

    Nanomaterials (NMs) have been widely used in biomedical fields, everyday consumer products, and even the food industry. It is crucial to understand the safety and biomedical efficacy of NMs. In this review, we summarize the recent progress on the physiological and pathological effects of NMs at several levels: the protein-nano interface, NM-subcellular structures, and cell-cell interaction. We focus on the details of nano-bio interactions, especially protein adsorption, intracellular trafficking, biological barriers, and signaling pathways, as well as the associated mechanisms mediated by nanomaterials. We also introduce related analytical methods that are meaningful and helpful for biomedical effect studies in the future. We believe that knowledge about the pathophysiologic effects of NMs is not only significant for the rational design of medical NMs but also helps predict their safety and further improve their applications in the future.

  8. Advanced remote sensing terrestrial information extraction and applications

    CERN Document Server

    Liang, Shunlin; Wang, Jindi

    2012-01-01

    Advanced Remote Sensing is an application-based reference that provides a single source of mathematical concepts necessary for remote sensing data gathering and assimilation. It presents state-of-the-art techniques for estimating land surface variables from a variety of data types, including optical, RADAR, and LIDAR sensors. Scientists in a number of different fields including geography, geology, atmospheric science, environmental science, planetary science and ecology will have access to critically important data extraction techniques and their virtually unlimited application

  9. Spoken Language Understanding Systems for Extracting Semantic Information from Speech

    CERN Document Server

    Tur, Gokhan

    2011-01-01

    Spoken language understanding (SLU) is an emerging field in between speech and language processing, investigating human/ machine and human/ human communication by leveraging technologies from signal processing, pattern recognition, machine learning and artificial intelligence. SLU systems are designed to extract the meaning from speech utterances and its applications are vast, from voice search in mobile devices to meeting summarization, attracting interest from both commercial and academic sectors. Both human/machine and human/human communications can benefit from the application of SLU, usin

  10. Extracting short distance information from b -> s l+ l- effectively

    OpenAIRE

    Lee, Keith S. M.; Ligeti, Zoltan; Stewart, Iain W.; Tackmann, Frank J.

    2006-01-01

    We point out that in inclusive B -> Xs l+ l- decay an angular decomposition provides a third (q^2 dependent) observable sensitive to a different combination of Wilson coefficients than the rate and the forward-backward asymmetry. Since a precise measurement of q^2 dependence requires large data sets, it is important to consider the data integrated over regions of q^2. We develop a strategy to extract all measurable Wilson coefficients in B -> Xs l+ l- from a few simple integrated rates in the...

  11. Extraction of information about periodic orbits from scattering functions

    CERN Document Server

    Bütikofer, T; Seligman, T H; Bütikofer, Thomas; Jung, Christof; Seligman, Thomas H.

    1999-01-01

    As a contribution to the inverse scattering problem for classical chaotic systems, we show that one can select sequences of intervals of continuity, each of which yields the information about period, eigenvalue and symmetry of one unstable periodic orbit.

  12. Semantic relations extraction from unstructured information for domain ontologies enrichment

    OpenAIRE

    Paiva, Luis Miguel Sintra Salvo

    2015-01-01

    With the growth of the Internet and the Semantic Web, together with improvements in communication speed and the fast development of storage capacity, the volume of data and information rises considerably every day. Because of this, in the last few years there has been growing interest in structures for formal representation with suitable characteristics, such as the ability to organize data and information, as well as the reuse of their contents, aimed at the generation of new knowledge. Controlled 

  13. Parallelization in Extracting Fresh Information from Online Social Network

    OpenAIRE

    Guo, Rui; Wang, Hongzhi; Chen, Mengwen; Li, Jianzhong; Gao, Hong

    2013-01-01

    Online Social Network (OSN) is one of the hottest services of the past years. It preserves the lives of users and provides great potential for journalists, sociologists and business analysts. Crawling data from social networks is a basic step in social network information analysis and processing. As the networks become huge and information on them updates faster than web pages do, crawling is more difficult because of the limitations of bandwidth, politeness etiquette and computation...

  14. Extracting information masked by the chaotic signal of a time-delay system.

    Science.gov (United States)

    Ponomarenko, V I; Prokhorov, M D

    2002-08-01

    We further develop the method proposed by Bezruchko et al. [Phys. Rev. E 64, 056216 (2001)] for the estimation of the parameters of time-delay systems from time series. Using this method we demonstrate a possibility of message extraction for a communication system with nonlinear mixing of information signal and chaotic signal of the time-delay system. The message extraction procedure is illustrated using both numerical and experimental data and different kinds of information signals.

  15. Constructing a semantic predication gold standard from the biomedical literature

    Directory of Open Access Journals (Sweden)

    Kilicoglu Halil

    2011-12-01

    Full Text Available Abstract Background Semantic relations increasingly underpin biomedical text mining and knowledge discovery applications. The success of such practical applications crucially depends on the quality of extracted relations, which can be assessed against a gold standard reference. Most such references in biomedical text mining focus on narrow subdomains and adopt different semantic representations, rendering them difficult to use for benchmarking independently developed relation extraction systems. In this article, we present a multi-phase gold standard annotation study, in which we annotated 500 sentences randomly selected from MEDLINE abstracts on a wide range of biomedical topics with 1371 semantic predications. The UMLS Metathesaurus served as the main source for conceptual information and the UMLS Semantic Network for relational information. We measured interannotator agreement and analyzed the annotations closely to identify some of the challenges in annotating biomedical text with relations based on an ontology or a terminology. Results We obtain fair to moderate interannotator agreement in the practice phase (0.378-0.475). With improved guidelines and additional semantic equivalence criteria, the agreement increases by 12% (0.415 to 0.536) in the main annotation phase. In addition, we find that agreement increases to 0.688 when the agreement calculation is limited to those predications that are based only on the explicitly provided UMLS concepts and relations. Conclusions While interannotator agreement in the practice phase confirms that conceptual annotation is a challenging task, the increasing agreement in the main annotation phase points out that an acceptable level of agreement can be achieved in multiple iterations, by setting stricter guidelines and establishing semantic equivalence criteria. Mapping text to ontological concepts emerges as the main challenge in conceptual annotation. Annotating predications involving biomolecular

  16. Information Extraction of High Resolution Remote Sensing Images Based on the Calculation of Optimal Segmentation Parameters.

    Science.gov (United States)

    Zhu, Hongchun; Cai, Lijie; Liu, Haiying; Huang, Wei

    2016-01-01

    Multi-scale image segmentation and the selection of optimal segmentation parameters are the key processes in the object-oriented information extraction of high-resolution remote sensing images. The accuracy of remote sensing thematic information depends on this extraction. On the basis of WorldView-2 high-resolution data, this study examined methods for determining optimal segmentation parameters for object-oriented image segmentation and high-resolution image information extraction through the following processes. Firstly, the best combination of bands and weights was determined for the information extraction of the high-resolution remote sensing image. An improved weighted mean-variance method was proposed and used to calculate the optimal segmentation scale. Thereafter, the best shape factor and compactness factor parameters were computed with the use of control variables and a combination of heterogeneity and homogeneity indexes. Different types of image segmentation parameters were obtained according to the surface features. The high-resolution remote sensing images were multi-scale segmented with the optimal segmentation parameters. A hierarchical network structure was established by setting information extraction rules to achieve object-oriented information extraction. This study presents an effective and practical method that can explain expert input judgment through reproducible quantitative measurements. Furthermore, the results of this procedure may be incorporated into a classification scheme.

  17. TIGER5---Extraction of geographic information from the TIGER system

    Energy Technology Data Exchange (ETDEWEB)

    Gryder, R.K.

    1992-03-01

    The need for geographic information in digital form is becoming apparent in many areas, including Emergency Response Planning and Management. The Bureau of Census recognized this need and developed an automated geographic data base, known as Topologically Integrated Geographic Encoding and Referencing (TIGER) System, to produce the geographic products for the 1990 census. The Bureau makes the information available to the public on CD-ROM disks. As distributed, the geographic information is not directly visible, and a program is needed to display the information in a graphic form. MapInfo is a commercially available program that has the capability to display maps and allows the user to perform certain geographic analyses. MapInfo runs on a variety of low-cost platforms that may be found in an average office environment, as well as on high-performance work stations. The TIGER5 program developed by the Emergency Management Information Systems (EMIS) project at Oak Ridge National Laboratory reads files directly from the Census Bureau TIGER/Line CD-ROM and creates MapInfo Exchange Format Files that can be directly imported into MapInfo. The standard default values for scaling parameters are used, and the resulting map is in the same coordinate system as the world and state maps distributed with MapInfo.

  18. Extracting short distance information from b -> s l+ l- effectively

    CERN Document Server

    Lee, K S M; Stewart, I W; Tackmann, F J; Lee, Keith S. M.; Ligeti, Zoltan; Stewart, Iain W.; Tackmann, Frank J.

    2007-01-01

    We point out that in inclusive B -> Xs l+ l- decay an angular decomposition provides a third (q^2-dependent) observable, in addition to the rate and the forward-backward asymmetry, which is sensitive to a different combination of Wilson coefficients. Since a precise measurement of q^2 dependence requires large data sets, it is important to consider the data integrated over regions of q^2. We develop a strategy to extract all measurable Wilson coefficients in B -> Xs l+ l- from a few simple integrated rates in the low q^2 region. A similar decomposition in B -> K* l+ l-, together with the B -> K* \\gamma rate, also provides a model independent determination of the Wilson coefficients, without reliance on form factor models and without having to measure the zero of the forward-backward asymmetry.

  19. Advanced Extraction of Spatial Information from High Resolution Satellite Data

    Science.gov (United States)

    Pour, T.; Burian, J.; Miřijovský, J.

    2016-06-01

    In this paper the authors processed five satellite images of five different Middle-European cities, taken by five different sensors. The aim of the paper was to find methods and approaches for evaluating and extracting spatial data from areas of interest. For this reason, the data were first pre-processed using image fusion, mosaicking and segmentation processes. The results going into the next step were two polygon layers: the first representing single objects and the second representing city blocks. In the second step, the polygon layers were classified and exported into Esri shapefile format. Classification was partly hierarchical expert-based and partly based on the SEaTH tool, used for separability distinction and thresholding. Final results along with visual previews were attached to the original thesis. The results are evaluated visually and statistically in the last part of the paper. In the discussion the authors describe the difficulties of working with data of large size, taken by different sensors and differing thematically.

  20. Extracting Conflict-free Information from Multi-labeled Trees

    CERN Document Server

    Deepak, Akshay; McMahon, Michelle M

    2012-01-01

    A multi-labeled tree, or MUL-tree, is a phylogenetic tree where two or more leaves share a label, e.g., a species name. A MUL-tree can imply multiple conflicting phylogenetic relationships for the same set of taxa, but can also contain conflict-free information that is of interest and yet is not obvious. We define the information content of a MUL-tree T as the set of all conflict-free quartet topologies implied by T, and define the maximal reduced form of T as the smallest tree that can be obtained from T by pruning leaves and contracting edges while retaining the same information content. We show that any two MUL-trees with the same information content exhibit the same reduced form. This introduces an equivalence relation in MUL-trees with potential applications to comparing MUL-trees. We present an efficient algorithm to reduce a MUL-tree to its maximally reduced form and evaluate its performance on empirical datasets in terms of both quality of the reduced tree and the degree of data reduction achieved.

  1. On Depth Information Extraction from Metal Detector Signals

    NARCIS (Netherlands)

    Schoolderman, A.J.; Wolf, F.J. de; Merlat, L.

    2003-01-01

    Information on the depth of objects detected with the help of a metal detector is useful for the safe excavation of these objects in demining operations. Apart from that, depth information may be used in advanced sensor fusion algorithms for a detection system where a metal detector is combined with, e.g.,

  2. Extraction of quadrature phase information from multiple pulse NMR signals

    Science.gov (United States)

    Rhim, W.-K.; Burum, D. P.; Vaughan, R. W.

    1976-01-01

    A multiple pulse sequence (8-pulse sequence) used for high-resolution solid state NMR is analyzed with regard to the information available from each of the four wide sampling windows. It is demonstrated that full quadrature phase information can be obtained using only a single phase detector and that, for the commonly encountered situation where the spectral width is much less than the folding frequency, the signals from the various windows can be combined easily using standard complex Fourier transform software. An improvement in the signal-to-noise ratio equal to the square root of 3 is obtained over either standard single or quadrature phase detection schemes. Procedures for correcting spectral distortions are presented.

  3. Using XBRL Technology to Extract Competitive Information from Financial Statements

    OpenAIRE

    Dominik Ditter; Klaus Henselmann; Elisabeth Scherr

    2011-01-01

    The eXtensible Business Reporting Language, or XBRL, is a reporting format for the automatic and electronic exchange of business and financial data. In XBRL every single reported fact is marked with a unique tag, enabling a full computer-based readout of financial data. It has the potential to improve the collection and analysis of financial data for Competitive Intelligence (e.g., the profiling of publicly available financial statements). The article describes how easily information from XBR...

  4. ADVANCED EXTRACTION OF SPATIAL INFORMATION FROM HIGH RESOLUTION SATELLITE DATA

    Directory of Open Access Journals (Sweden)

    T. Pour

    2016-06-01

    Full Text Available In this paper the authors processed five satellite images of five different Middle-European cities, taken by five different sensors. The aim of the paper was to find methods and approaches for evaluating and extracting spatial data from areas of interest. For this reason, the data were first pre-processed using image fusion, mosaicking and segmentation processes. The results going into the next step were two polygon layers: the first representing single objects and the second representing city blocks. In the second step, the polygon layers were classified and exported into Esri shapefile format. Classification was partly hierarchical expert-based and partly based on the SEaTH tool, used for separability distinction and thresholding. Final results along with visual previews were attached to the original thesis. The results are evaluated visually and statistically in the last part of the paper. In the discussion the authors describe the difficulties of working with data of large size, taken by different sensors and differing thematically.

  5. Applications of computational intelligence in biomedical technology

    CERN Document Server

    Majernik, Jaroslav; Pancerz, Krzysztof; Zaitseva, Elena

    2016-01-01

    This book presents the latest results and selected applications of computational intelligence in biomedical technologies. Most of the contributions deal with problems of biomedical and medical informatics, ranging from theoretical considerations to practical applications. Various aspects of development methods and algorithms in biomedical and medical informatics, algorithms for medical image processing, and modeling methods are discussed. Individual contributions also cover medical decision-making support, estimation of treatment risks, reliability of medical systems, problems of practical clinical applications and many other topics. This book is intended for scientists interested in problems of biomedical technologies, for researchers and academic staff, for all those dealing with biomedical and medical informatics, as well as PhD students. Useful information is offered also to IT companies, developers of equipment and/or software for medicine, and medical professionals.

  6. EXTRACT

    DEFF Research Database (Denmark)

    Pafilis, Evangelos; Buttigieg, Pier Luigi; Ferrell, Barbra;

    2016-01-01

    therefore developed an interactive annotation tool, EXTRACT, which helps curators identify and extract standard-compliant terms for annotation of metagenomic records and other samples. Behind its web-based user interface, the system combines published methods for named entity recognition of environment...... and text-mining-assisted curation revealed that EXTRACT speeds up annotation by 15-25% and helps curators to detect terms that would otherwise have been missed.Database URL: https://extract.hcmr.gr/....

  7. Data-Driven Information Extraction from Chinese Electronic Medical Records.

    Directory of Open Access Journals (Sweden)

    Dong Xu

    Full Text Available This study aims to propose a data-driven framework that takes unstructured free-text narratives in Chinese Electronic Medical Records (EMRs) as input and converts them into structured time-event-description triples, where the description is either an elaboration or an outcome of the medical event. Our framework uses a hybrid approach. It consists of constructing cross-domain core medical lexica, an unsupervised, iterative algorithm to accrue more accurate terms into the lexica, rules to address Chinese writing conventions and temporal descriptors, and a Support Vector Machine (SVM) algorithm that innovatively utilizes Normalized Google Distance (NGD) to estimate the correlation between medical events and their descriptions. The effectiveness of the framework was demonstrated with a dataset of 24,817 de-identified Chinese EMRs. The cross-domain medical lexica were capable of recognizing terms with an F1-score of 0.896. 98.5% of recorded medical events were linked to temporal descriptors. The NGD SVM description-event matching achieved an F1-score of 0.874. The end-to-end time-event-description extraction of our framework achieved an F1-score of 0.846. In terms of named entity recognition, the proposed framework outperforms state-of-the-art supervised learning algorithms (F1-score: 0.896 vs. 0.886). In event-description association, the NGD SVM is superior to an SVM using only local context and semantic features (F1-score: 0.874 vs. 0.838). The framework is data-driven, weakly supervised, and robust against the variations and noise that tend to occur in a large corpus. It addresses Chinese medical writing conventions and variations in writing styles through patterns used for discovering new terms and rules for updating the lexica.
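
    The NGD matching step reduces to a closed-form co-occurrence statistic. A minimal sketch, assuming document-frequency counts are already available from the corpus (the counts below are invented for illustration):

    ```python
    import math

    def ngd(fx: int, fy: int, fxy: int, n: int) -> float:
        """Normalized Google Distance between terms x and y.
        fx, fy: documents containing x (resp. y); fxy: documents
        containing both; n: corpus size. Smaller means more related."""
        if fxy == 0:
            return float("inf")  # terms never co-occur
        lx, ly, lxy = math.log(fx), math.log(fy), math.log(fxy)
        return (max(lx, ly) - lxy) / (math.log(n) - min(lx, ly))

    # Hypothetical event term and description term in 24,817 records.
    print(ngd(fx=900, fy=450, fxy=120, n=24817))
    ```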

  8. Omnidirectional vision systems calibration, feature extraction and 3D information

    CERN Document Server

    Puig, Luis

    2013-01-01

    This work focuses on central catadioptric systems, from the early step of calibration to high-level tasks such as 3D information retrieval. The book opens with a thorough introduction to the sphere camera model, along with an analysis of the relation between this model and actual central catadioptric systems. Then, a new approach to calibrate any single-viewpoint catadioptric camera is described.  This is followed by an analysis of existing methods for calibrating central omnivision systems, and a detailed examination of hybrid two-view relations that combine images acquired with uncalibrated

  9. Post-processing of Deep Web Information Extraction Based on Domain Ontology

    Directory of Open Access Journals (Sweden)

    PENG, T.

    2013-11-01

    Full Text Available Many methods are used to extract and process query results in the deep Web, relying on the different structures of Web pages and the various design modes of databases. However, some semantic meanings and relations are ignored. In this paper, we therefore present an approach for post-processing deep Web query results based on a domain ontology, which can exploit these semantic meanings and relations. A block identification model (BIM) based on node similarity is defined to extract data blocks that are relevant to a specific domain after reducing noisy nodes. A feature vector of domain books is obtained by a result set extraction model (RSEM) based on the vector space model (VSM). RSEM, in combination with BIM, builds a domain ontology on books which not only removes the limitations of Web page structures when extracting data information, but also makes use of the semantic meanings of the domain ontology. After extracting the basic information of Web pages, a ranking algorithm is adopted to offer an ordered list of data records to users. Experimental results show that BIM and RSEM extract data blocks and build the domain ontology accurately. In addition, relevant data records and basic information are extracted and ranked. The precision and recall results show that our proposed method is feasible and efficient.
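
    RSEM's vector space model boils down to TF-IDF vectors compared by cosine similarity. A minimal sketch with scikit-learn; the block texts and the domain reference are placeholders, not data from the paper:

    ```python
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.metrics.pairwise import cosine_similarity

    blocks = [
        "title author publisher ISBN price hardcover edition",
        "home login cart checkout shipping policy contact",
    ]
    domain_reference = "book title author publisher ISBN edition"

    matrix = TfidfVectorizer().fit_transform(blocks + [domain_reference])

    # Rank candidate blocks by similarity to the domain reference.
    scores = cosine_similarity(matrix[:-1], matrix[-1]).ravel()
    for text, score in sorted(zip(blocks, scores), key=lambda p: -p[1]):
        print(f"{score:.3f}  {text}")
    ```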

  10. STUDY ON EXTRACTING METHODS OF BURIED GEOLOGICAL INFORMATION IN HUAIBEI COAL FIELD

    Institute of Scientific and Technical Information of China (English)

    王四龙; 赵学军; 凌贻棕; 刘玉荣; 宁书年; 侯德文

    1999-01-01

    The features and producing mechanism of buried geological information in the geological, geophysical and remote sensing data of the Huaibei coal field are discussed, and methods of extracting buried tectonic and igneous rock information from various geological data using digital image processing techniques are studied.

  11. Analysis of Automated Modern Web Crawling and Testing Tools and Their Possible Employment for Information Extraction

    Directory of Open Access Journals (Sweden)

    Tomas Grigalis

    2012-04-01

    Full Text Available The World Wide Web has become an enormously big repository of data. Extracting, integrating and reusing this kind of data has a wide range of applications, including meta-searching, comparison shopping, business intelligence tools and security analysis of information in websites. However, reaching information in modern Web 2.0 pages, where the HTML tree is often dynamically modified by various JavaScript codes, new data are added by asynchronous requests to the web server, and elements are positioned with the help of cascading style sheets, is a difficult task. The article reviews automated web testing tools for information extraction tasks. Article in Lithuanian.
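
    Reaching such JavaScript-built content usually requires driving a real browser. A minimal sketch with Selenium, one of the browser-automation tools such reviews cover; the URL, wait time and CSS selector are placeholders:

    ```python
    from selenium import webdriver
    from selenium.webdriver.common.by import By

    driver = webdriver.Chrome()  # assumes a local ChromeDriver setup
    try:
        driver.get("https://example.com/listings")  # placeholder URL
        driver.implicitly_wait(5)  # allow asynchronous content to load
        # Extract text from elements inserted by JavaScript.
        for row in driver.find_elements(By.CSS_SELECTOR, "div.listing"):
            print(row.text)
    finally:
        driver.quit()
    ```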

  12. Question Processing and Clustering in INDOC: A Biomedical Question Answering System

    Directory of Open Access Journals (Sweden)

    Ankush Mittal

    2007-12-01

    Full Text Available The exponential growth in the volume of publications in the biomedical domain has made it impossible for an individual to keep pace with the advances. Even though evidence-based medicine has gained wide acceptance, physicians are unable to access the relevant information in the required time, leaving most of their questions unanswered. This accentuates the need for fast and accurate biomedical question answering systems. In this paper we introduce INDOC—a biomedical question answering system based on novel ideas of indexing and extracting the answer to the questions posed. INDOC displays the results in clusters to help the user arrive at the most relevant set of documents quickly. Evaluation was done against the standard OHSUMED test collection. Our system achieves high accuracy and minimizes user effort.

  13. Question Processing and Clustering in INDOC: A Biomedical Question Answering System

    Directory of Open Access Journals (Sweden)

    Sondhi Parikshit

    2007-01-01

    Full Text Available The exponential growth in the volume of publications in the biomedical domain has made it impossible for an individual to keep pace with the advances. Even though evidence-based medicine has gained wide acceptance, physicians are unable to access the relevant information in the required time, leaving most of their questions unanswered. This accentuates the need for fast and accurate biomedical question answering systems. In this paper we introduce INDOC—a biomedical question answering system based on novel ideas of indexing and extracting the answer to the questions posed. INDOC displays the results in clusters to help the user arrive at the most relevant set of documents quickly. Evaluation was done against the standard OHSUMED test collection. Our system achieves high accuracy and minimizes user effort.
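
    The result clustering INDOC performs can be approximated with off-the-shelf text clustering. A minimal sketch, k-means over TF-IDF vectors of retrieved abstracts (toy strings; not the system's own algorithm):

    ```python
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.cluster import KMeans

    retrieved = [
        "insulin therapy in type 2 diabetes",
        "metformin dosing and renal function in diabetes",
        "beta blockers after myocardial infarction",
        "statin therapy for secondary prevention of infarction",
    ]
    X = TfidfVectorizer(stop_words="english").fit_transform(retrieved)
    labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
    for doc, label in zip(retrieved, labels):
        print(label, doc)  # documents grouped by topic cluster
    ```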

  14. Abstract Information Extraction From Consumer's Comments On Internet Media

    Directory of Open Access Journals (Sweden)

    Kadriye Ergün

    2013-01-01

    Full Text Available In this study, a system developed to summarize comments about a product by automatically evaluating them with text mining techniques is described. Because the comments are texts written in natural language, the data first go through a morphological analysis process. Words and adjectives with positive or negative meaning, which indicate product features in the texts, are determined. A tree structure is established according to Turkish grammar rules, with subordinate and modifying words designated. Software that uses the depth-first search algorithm on the tree structure was developed, and its results are stored in an SQL database. When these data are queried for any property of the product, numerical information indicating the degree of satisfaction with that property is obtained.
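
    The depth-first pass over the grammar tree can be illustrated compactly. A minimal sketch, assuming the parsed comment is a nested dict with an optional product feature and sentiment score per node (structure and scores are invented):

    ```python
    def dfs_collect(node, scores):
        """Depth-first traversal accumulating sentiment per feature."""
        if node.get("feature") is not None:
            scores.setdefault(node["feature"], []).append(node["sentiment"])
        for child in node.get("children", []):
            dfs_collect(child, scores)
        return scores

    tree = {  # "battery excellent, screen poor"
        "word": "phone", "feature": None, "children": [
            {"word": "battery", "feature": "battery", "sentiment": 1,
             "children": []},
            {"word": "screen", "feature": "screen", "sentiment": -1,
             "children": []},
        ],
    }
    print(dfs_collect(tree, {}))  # {'battery': [1], 'screen': [-1]}
    ```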

  15. The Technology of Extracting Content Information from Web Page Based on DOM Tree

    Science.gov (United States)

    Yuan, Dingrong; Mo, Zhuoying; Xie, Bing; Xie, Yangcai

    There are huge amounts of information on Web pages, including content information and other useless information such as navigation, advertisements and Flash animations. To reduce the toil of Web users, we established a technique to extract the content information from a web page. First, we analyzed the semantics of web documents with Google's V8 engine and parsed each web document into a DOM tree. We then traversed the DOM tree and pruned it in light of the characteristics of the Web page's markup language. Finally, we extracted the content information from the Web page. Theory and experiments showed that the technique can simplify a web page, present the content information to web users, and supply clean data for application areas such as retrieval, KDD and DM on the web.
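
    A minimal sketch of this pruning idea, using BeautifulSoup instead of the authors' V8-based parsing; the tag and class heuristics are illustrative:

    ```python
    from bs4 import BeautifulSoup

    html = """<html><body>
      <nav>Home | Products | About</nav>
      <div id="content"><p>The actual article text lives here.</p></div>
      <div class="ad">Buy now!</div>
    </body></html>"""

    soup = BeautifulSoup(html, "html.parser")
    # Prune nodes that typically carry boilerplate rather than content.
    for tag in soup.find_all(["nav", "script", "style", "iframe"]):
        tag.decompose()
    for tag in soup.find_all(class_="ad"):
        tag.decompose()
    print(soup.get_text(separator=" ", strip=True))
    ```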

  16. What do professional forecasters' stock market expectations tell us about herding, information extraction and beauty contests?

    DEFF Research Database (Denmark)

    Rangvid, Jesper; Schmeling, M.; Schrimpf, A.

    2013-01-01

    We study how professional forecasters form equity market expectations based on a new micro-level dataset which includes rich cross-sectional information about individual characteristics. We focus on testing whether agents rely on the beliefs of others, i.e., consensus expectations, when forming t...... that neither information extraction to incorporate dispersed private information, nor herding for reputational reasons can fully explain these results, leaving Keynes' beauty contest argument as a potential candidate for explaining forecaster behavior....

  17. Extraction of Hidden Social Networks from Wiki-Environment Involved in Information Conflict

    OpenAIRE

    Alguliyev, Rasim M.; Aliguliyev, Ramiz M.; Irada Y. Alakbarova

    2016-01-01

    Social network analysis is a widely used technique to analyze relationships among wiki-users in Wikipedia. In this paper a method to identify hidden social networks participating in information conflicts in a wiki-environment is proposed. In particular, we describe how text clustering techniques can be used for the extraction of hidden social networks of wiki-users involved in an information conflict. By clustering unstructured text articles that caused an information conflict we ...

  18. Surface tailoring of inorganic materials for biomedical applications

    CERN Document Server

    Rimondini, Lia; Vernè, Enrica

    2012-01-01

    This e-book provides comprehensive information on technologies for development and characterization of successful functionalized materials for biomedical applications relevant to surface modification.

  19. Extracting information from the data flood of new solar telescopes. Brainstorming

    CERN Document Server

    Ramos, A Asensio

    2012-01-01

    Extracting magnetic and thermodynamic information from spectropolarimetric observations is a difficult and time consuming task. The amount of science-ready data that will be generated by the new family of large solar telescopes is so large that we will be forced to modify the present approach to inference. In this contribution, I propose several possible ways that might be useful for extracting the thermodynamic and magnetic properties of solar plasmas from such observations quickly.

  20. Extracting Information from the Data Flood of New Solar Telescopes: Brainstorming

    Science.gov (United States)

    Asensio Ramos, A.

    2012-12-01

    Extracting magnetic and thermodynamic information from spectropolarimetric observations is a difficult and time consuming task. The amount of science-ready data that will be generated by the new family of large solar telescopes is so large that we will be forced to modify the present approach to inference. In this contribution, I propose several possible ways that might be useful for extracting the thermodynamic and magnetic properties of solar plasmas from such observations quickly.

  1. Biomedical optical imaging

    CERN Document Server

    Fujimoto, James G

    2009-01-01

    Biomedical optical imaging is a rapidly emerging research area with widespread fundamental research and clinical applications. This book gives an overview of biomedical optical imaging with contributions from leading international research groups who have pioneered many of these techniques and applications. A unique research field spanning the microscopic to the macroscopic, biomedical optical imaging allows both structural and functional imaging. Techniques such as confocal and multiphoton microscopy provide cellular level resolution imaging in biological systems. The integration of this tech

  2. Introduction to biomedical engineering

    CERN Document Server

    Enderle, John

    2011-01-01

    Introduction to Biomedical Engineering is a comprehensive survey text for biomedical engineering courses. It is the most widely adopted text across the BME course spectrum, valued by instructors and students alike for its authority, clarity and encyclopedic coverage in a single volume. Biomedical engineers need to understand the wide range of topics that are covered in this text, including basic mathematical modeling; anatomy and physiology; electrical engineering, signal processing and instrumentation; biomechanics; biomaterials science and tissue engineering; and medical and engineering e

  3. Biomedical engineering principles

    CERN Document Server

    Ritter, Arthur B; Valdevit, Antonio; Ascione, Alfred N

    2011-01-01

    Introduction: Modeling of Physiological Processes; Cell Physiology and Transport; Principles and Biomedical Applications of Hemodynamics; A Systems Approach to Physiology; The Cardiovascular System; Biomedical Signal Processing; Signal Acquisition and Processing; Techniques for Physiological Signal Processing; Examples of Physiological Signal Processing; Principles of Biomechanics; Practical Applications of Biomechanics; Biomaterials; Principles of Biomedical Capstone Design; Unmet Clinical Needs; Entrepreneurship: Reasons why Most Good Designs Never Get to Market; An Engineering Solution in Search of a Biomedical Problem

  4. Fundamental of biomedical engineering

    CERN Document Server

    Sawhney, GS

    2007-01-01

    About the Book: A well set out textbook explains the fundamentals of biomedical engineering in the areas of biomechanics, biofluid flow, biomaterials, bioinstrumentation and use of computing in biomedical engineering. All these subjects form a basic part of an engineer's education. The text is admirably suited to meet the needs of the students of mechanical engineering, opting for the elective of Biomedical Engineering. Coverage of bioinstrumentation, biomaterials and computing for biomedical engineers can meet the needs of the students of Electronic & Communication, Electronic & Instrumenta

  5. An automatic abrupt information extraction method based on singular value decomposition and higher-order statistics

    International Nuclear Information System (INIS)

    One key aspect of local fault diagnosis is how to effectively extract abrupt features from the vibration signals. This paper proposes a method to automatically extract abrupt information based on singular value decomposition and higher-order statistics. In order to observe the distribution law of singular values, a numerical analysis to simulate the noise, periodic signal, abrupt signal and singular value distribution is conducted. Based on higher-order statistics and spectrum analysis, a method to automatically choose the upper and lower borders of the singular value interval reflecting the abrupt information is built. And the selected singular values derived from this method are used to reconstruct abrupt signals. It is proven that the method is able to obtain accurate results by processing the rub-impact fault signal measured from the experiments. The analytical and experimental results indicate that the proposed method is feasible for automatically extracting abrupt information caused by faults like the rotor–stator rub-impact. (paper)
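
    A minimal sketch of the SVD step, with a simplified selection rule standing in for the paper's higher-order-statistics criterion for choosing the singular-value interval (signal and kept indices are illustrative):

    ```python
    import numpy as np
    from scipy.linalg import hankel
    from scipy.stats import kurtosis

    def reconstruct(signal, keep):
        """Embed in a Hankel matrix, zero unselected singular values,
        reconstruct by averaging anti-diagonals."""
        n = len(signal)
        m = n // 2
        H = hankel(signal[:m], signal[m - 1:])
        U, s, Vt = np.linalg.svd(H, full_matrices=False)
        s[[i for i in range(len(s)) if i not in keep]] = 0.0
        R = (U * s) @ Vt
        return np.array([R[::-1, :].diagonal(i).mean()
                         for i in range(-R.shape[0] + 1, R.shape[1])])

    rng = np.random.default_rng(0)
    t = np.linspace(0, 1, 512)
    sig = np.sin(2 * np.pi * 10 * t) + 0.2 * rng.standard_normal(512)
    sig[250:256] += 3.0  # simulated rub-impact burst
    rec = reconstruct(sig, keep=set(range(2, 10)))
    print(kurtosis(rec))  # impulsiveness of the extracted component
    ```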

  6. Real-time traffic information extraction based on compressed video with interframe motion vector

    Institute of Scientific and Technical Information of China (English)

    黄庆明; 王聪

    2003-01-01

    Extraction of traffic information from images or video sequences is a hot research topic in intelligent transportation systems and computer vision. A real-time traffic information extraction method based on compressed video with interframe motion vectors, for speed, density and flow detection, has been proposed for extraction of traffic information under a fixed camera setting and a well-defined environment. The motion vectors are first separated from the compressed video streams, and then filtered to eliminate incorrect and noisy vectors using the well-defined environmental knowledge. By applying the projective transform and using the filtered motion vectors, speed can be calculated from motion vector statistics, density can be estimated using the motion vector occupancy, and flow can be detected using the combination of speed and density. A prototype system for sky camera traffic monitoring using MPEG video has been implemented, and experimental results proved the effectiveness of the proposed method.

  7. International symposium on Biomedical Data Infrastructure (BDI 2013)

    CERN Document Server

    Dhillon, Sarinder; Advances in biomedical infrastructure 2013

    2013-01-01

    Current Biomedical Databases are independently administered in geographically distinct locations, lending them almost ideally to adoption of intelligent data management approaches. This book focuses on research issues, problems and opportunities in Biomedical Data Infrastructure identifying new issues and directions for future research in Biomedical Data and Information Retrieval, Semantics in Biomedicine, and Biomedical Data Modeling and Analysis. The book will be a useful guide for researchers, practitioners, and graduate-level students interested in learning state-of-the-art development in biomedical data management.

  8. Ultrasonic Signal Processing Algorithm for Crack Information Extraction on the Keyway of Turbine Rotor Disk

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Hong Kyu; Seo, Won Chan; Park, Chan [Pukyong National University, Busan (Korea, Republic of); Lee, Jong O; Son, Young Ho [KIMM, Daejeon (Korea, Republic of)

    2009-10-15

    An ultrasonic signal processing algorithm was developed for extracting information about cracks generated around the keyway of a turbine rotor disk. B-scan images were obtained by using keyway specimens and an ultrasonic scan system with an x-y position controller. The B-scan images were used as input images for two-dimensional signal processing, and the algorithm was constructed with four processing stages: pre-processing, crack candidate region detection, crack region classification and crack information extraction. It is confirmed by experiments that the developed algorithm is effective for the quantitative evaluation of cracks generated around the keyway of a turbine rotor disk.

  9. Ultrasonic Signal Processing Algorithm for Crack Information Extraction on the Keyway of Turbine Rotor Disk

    International Nuclear Information System (INIS)

    An ultrasonic signal processing algorithm was developed for extracting information about cracks generated around the keyway of a turbine rotor disk. B-scan images were obtained by using keyway specimens and an ultrasonic scan system with an x-y position controller. The B-scan images were used as input images for two-dimensional signal processing, and the algorithm was constructed with four processing stages: pre-processing, crack candidate region detection, crack region classification and crack information extraction. It is confirmed by experiments that the developed algorithm is effective for the quantitative evaluation of cracks generated around the keyway of a turbine rotor disk.

  10. Information Extraction of High-Resolution Remotely Sensed Image Based on Multiresolution Segmentation

    Directory of Open Access Journals (Sweden)

    Peng Shao

    2014-08-01

    Full Text Available The principle of multiresolution segmentation is presented in detail in this study, and the Canny algorithm is applied for edge detection of a remotely sensed image based on this principle. The target image was divided into regions based on object-oriented multiresolution segmentation and edge detection. Furthermore, an object hierarchy was created, and a series of features (water bodies, vegetation, roads, residential areas, bare land and other information) were extracted using spectral and geometrical features. The results indicate that edge detection has a positive effect on multiresolution segmentation, and the overall accuracy of information extraction reaches 94.6% according to the confusion matrix.
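
    The edge-detection step maps directly onto OpenCV's Canny operator. A minimal sketch; the file name, blur kernel and thresholds are placeholders, and the multiresolution segmentation itself is left to dedicated segmentation software:

    ```python
    import cv2

    # One band of the remotely sensed image, loaded as grayscale.
    img = cv2.imread("scene.tif", cv2.IMREAD_GRAYSCALE)  # placeholder file

    # Suppress speckle before detecting edges, then apply Canny.
    blurred = cv2.GaussianBlur(img, (5, 5), 0)
    edges = cv2.Canny(blurred, threshold1=50, threshold2=150)

    cv2.imwrite("edges.png", edges)  # feed into the segmentation step
    ```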

  11. Information extraction for legal knowledge representation – a review of approaches and trends

    Directory of Open Access Journals (Sweden)

    Denis Andrei de Araujo

    2014-11-01

    Full Text Available This work presents an introduction to Information Extraction systems and a survey of the known approaches to Information Extraction in the legal area. It analyzes with particular attention the techniques that rely on the representation of legal knowledge as a means to achieve better performance, with emphasis on those techniques that include ontologies and linguistic support. Some details of the systems' implementations are presented, followed by an analysis of the positive and negative points of each approach, aiming to give the reader a critical perspective on the solutions studied.

  12. Extraction of Informative Blocks from Deep Web Page Using Similar Layout Feature

    OpenAIRE

    Zeng, Jun; Flanagan, Brendan; Hirokawa, Sachio

    2013-01-01

    Due to the explosive growth and popularity of the deep web, information extraction from deep web pages has gained more and more attention. However, the HTML structure of web pages has become more complicated, making it difficult to recognize target content by analyzing the HTML source code alone. In this paper, we propose a method to extract the informative blocks from a deep web page using layout features. We consider the visual rectangular region of an HTML element as a visual block in the web page....

  13. The BioLexicon: a large-scale terminological resource for biomedical text mining

    Directory of Open Access Journals (Sweden)

    Thompson Paul

    2011-10-01

    Full Text Available Abstract Background Due to the rapidly expanding body of biomedical literature, biologists require increasingly sophisticated and efficient systems to help them to search for relevant information. Such systems should account for the multiple written variants used to represent biomedical concepts, and allow the user to search for specific pieces of knowledge (or events) involving these concepts, e.g., protein-protein interactions. Such functionality requires access to detailed information about words used in the biomedical literature. Existing databases and ontologies often have a specific focus and are oriented towards human use. Consequently, biological knowledge is dispersed amongst many resources, which often do not attempt to account for the large and frequently changing set of variants that appear in the literature. Additionally, such resources typically do not provide information about how terms relate to each other in texts to describe events. Results This article provides an overview of the design, construction and evaluation of a large-scale lexical and conceptual resource for the biomedical domain, the BioLexicon. The resource can be exploited by text mining tools at several levels, e.g., part-of-speech tagging, recognition of biomedical entities, and the extraction of events in which they are involved. As such, the BioLexicon must account for real usage of words in biomedical texts. In particular, the BioLexicon gathers together different types of terms from several existing data resources into a single, unified repository, and augments them with new term variants automatically extracted from biomedical literature. Extraction of events is facilitated through the inclusion of biologically pertinent verbs (around which events are typically organized) together with information about typical patterns of grammatical and semantic behaviour, which are acquired from domain-specific texts. In order to foster interoperability, the BioLexicon is

  14. Biomedical signal and image processing

    CERN Document Server

    Najarian, Kayvan

    2012-01-01

    Introduction to Digital Signal and Image Processing. Signals and Biomedical Signal Processing: Introduction and Overview; What is a "Signal"?; Analog, Discrete, and Digital Signals; Processing and Transformation of Signals; Signal Processing for Feature Extraction; Some Characteristics of Digital Images; Summary; Problems. Fourier Transform: Introduction and Overview; One-Dimensional Continuous Fourier Transform; Sampling and Nyquist Rate; One-Dimensional Discrete Fourier Transform; Two-Dimensional Discrete Fourier Transform; Filter Design; Summary; Problems. Image Filtering, Enhancement, and Restoration: Introduction and Overview

  15. Extraction Method for Earthquake-Collapsed Building Information Based on High-Resolution Remote Sensing

    International Nuclear Information System (INIS)

    At present, the extraction of earthquake disaster information from remote sensing data relies on visual interpretation. However, this technique cannot effectively and quickly obtain precise and efficient information for earthquake relief and emergency management. Collapsed buildings in the town of Zipingpu after the Wenchuan earthquake were used as a case study to validate two kinds of rapid extraction methods for earthquake-collapsed building information, based on pixel-oriented and object-oriented theories. The pixel-oriented method is based on multi-layer regional segments that embody the core layers and segments of the object-oriented method. The key idea is to mask, layer by layer, all image information, including that on the collapsed buildings. Compared with traditional techniques, the pixel-oriented method is innovative because it allows considerably faster computer processing. As for the object-oriented method, a multi-scale segmentation algorithm was applied to build a three-layer hierarchy. By analyzing the spectrum, texture, shape, location, and context of individual object classes in different layers, a fuzzy rule system was established for the extraction of earthquake-collapsed building information. We compared the two sets of results using three criteria: precision assessment, visual effect, and principle. Both methods can extract earthquake-collapsed building information quickly and accurately. The object-oriented method successfully overcomes the salt-and-pepper noise caused by the spectral diversity of high-resolution remote sensing data and solves the problems of "same object, different spectra" and "same spectrum, different objects". With an overall accuracy of 90.38%, the method achieves more scientific and accurate results than the pixel-oriented method (76.84%). The object-oriented image analysis method can be extensively applied in the extraction of earthquake disaster information based on high-resolution remote sensing

  16. Science gateways for biomedical big data analysis

    OpenAIRE

    Kampen, van, PJW; Olabarriaga, S.D.; Shahand, S.

    2015-01-01

    Biomedical researchers are facing data deluge challenges such as dealing with large volumes of complex heterogeneous data and with complex, computationally demanding data processing methods. Such scale and complexity of biomedical research require multi-disciplinary collaboration between scientists from different organizations. Data-driven or e-Science methods are defined as a combination of Information Technology (IT) and science that enables scientists to tackle the data deluge challenges. Th...

  17. Learning the Structure of Biomedical Relationships from Unstructured Text.

    Directory of Open Access Journals (Sweden)

    Bethany Percha

    2015-07-01

    Full Text Available The published biomedical research literature encompasses most of our understanding of how drugs interact with gene products to produce physiological responses (phenotypes). Unfortunately, this information is distributed throughout the unstructured text of over 23 million articles. The creation of structured resources that catalog the relationships between drugs and genes would accelerate the translation of basic molecular knowledge into discoveries of genomic biomarkers for drug response and prediction of unexpected drug-drug interactions. Extracting these relationships from natural language sentences on such a large scale, however, requires text mining algorithms that can recognize when different-looking statements are expressing similar ideas. Here we describe a novel algorithm, Ensemble Biclustering for Classification (EBC), that learns the structure of biomedical relationships automatically from text, overcoming differences in word choice and sentence structure. We validate EBC's performance against manually-curated sets of (1) pharmacogenomic relationships from PharmGKB and (2) drug-target relationships from DrugBank, and use it to discover new drug-gene relationships for both knowledge bases. We then apply EBC to map the complete universe of drug-gene relationships based on their descriptions in Medline, revealing unexpected structure that challenges current notions about how these relationships are expressed in text. For instance, we learn that newer experimental findings are described in consistently different ways than established knowledge, and that seemingly pure classes of relationships can exhibit interesting chimeric structure. The EBC algorithm is flexible and adaptable to a wide range of problems in biomedical text mining.

  18. Learning the Structure of Biomedical Relationships from Unstructured Text.

    Science.gov (United States)

    Percha, Bethany; Altman, Russ B

    2015-07-01

    The published biomedical research literature encompasses most of our understanding of how drugs interact with gene products to produce physiological responses (phenotypes). Unfortunately, this information is distributed throughout the unstructured text of over 23 million articles. The creation of structured resources that catalog the relationships between drugs and genes would accelerate the translation of basic molecular knowledge into discoveries of genomic biomarkers for drug response and prediction of unexpected drug-drug interactions. Extracting these relationships from natural language sentences on such a large scale, however, requires text mining algorithms that can recognize when different-looking statements are expressing similar ideas. Here we describe a novel algorithm, Ensemble Biclustering for Classification (EBC), that learns the structure of biomedical relationships automatically from text, overcoming differences in word choice and sentence structure. We validate EBC's performance against manually-curated sets of (1) pharmacogenomic relationships from PharmGKB and (2) drug-target relationships from DrugBank, and use it to discover new drug-gene relationships for both knowledge bases. We then apply EBC to map the complete universe of drug-gene relationships based on their descriptions in Medline, revealing unexpected structure that challenges current notions about how these relationships are expressed in text. For instance, we learn that newer experimental findings are described in consistently different ways than established knowledge, and that seemingly pure classes of relationships can exhibit interesting chimeric structure. The EBC algorithm is flexible and adaptable to a wide range of problems in biomedical text mining. PMID:26219079

  19. Biomedical applications engineering tasks

    Science.gov (United States)

    Laenger, C. J., Sr.

    1976-01-01

    The engineering tasks performed in response to needs articulated by clinicians are described. Initial contacts were made with these clinician-technology requestors by the Southwest Research Institute NASA Biomedical Applications Team. The basic purpose of the program was to effectively transfer aerospace technology into functional hardware to solve real biomedical problems.

  20. Terrain Extraction by Integrating Terrestrial Laser Scanner Data and Spectral Information

    Science.gov (United States)

    Lau, C. L.; Halim, S.; Zulkepli, M.; Azwan, A. M.; Tang, W. L.; Chong, A. K.

    2015-10-01

    The extraction of true terrain points from unstructured laser point cloud data is an important process in producing an accurate digital terrain model (DTM). However, most spatial filtering methods utilize only the geometrical data to discriminate terrain points from non-terrain points. Point cloud filtering can also be improved by using the spectral information available with some scanners. The objective of this study is therefore to investigate the effectiveness of using the three channels (red, green and blue) of the colour image captured by the built-in digital camera available in some Terrestrial Laser Scanners (TLS) for terrain extraction. In this study, data acquisition was conducted at a mini replica landscape at the Universiti Teknologi Malaysia (UTM) Skudai campus using a Leica ScanStation C10. The spectral information of the coloured point clouds from selected sample classes was extracted for spectral analysis. Coloured points that fall within the corresponding preset spectral threshold are identified as belonging to that specific feature class. This terrain extraction process was implemented in Matlab. The results demonstrate that a passive image of higher spectral resolution is required to improve the output, because the low quality of the colour images captured by the sensor contributes to low separability in spectral reflectance. In conclusion, this study shows that spectral information can be used as a parameter for terrain extraction.
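
    A minimal sketch of the spectral thresholding, assuming the scanned points form an N x 6 array of x, y, z, r, g, b values; the colour band for terrain is invented for illustration:

    ```python
    import numpy as np

    # Hypothetical coloured point cloud: columns x, y, z, r, g, b.
    cloud = np.array([
        [1.0, 2.0, 0.1, 120,  95, 70],   # brownish: candidate terrain
        [1.1, 2.1, 1.5,  40, 140, 60],   # green: vegetation
        [1.2, 2.2, 0.0, 130, 100, 80],   # brownish: candidate terrain
    ])

    r, g, b = cloud[:, 3], cloud[:, 4], cloud[:, 5]
    # Keep points whose colour falls inside the preset terrain band.
    mask = (r > 100) & (r < 160) & (g > 70) & (g < 120) & (b < 100)
    terrain = cloud[mask]
    print(terrain[:, :3])  # x, y, z of the extracted terrain points
    ```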

  1. The Ontology for Biomedical Investigations

    Science.gov (United States)

    Bandrowski, Anita; Brinkman, Ryan; Brochhausen, Mathias; Brush, Matthew H.; Chibucos, Marcus C.; Clancy, Kevin; Courtot, Mélanie; Derom, Dirk; Dumontier, Michel; Fan, Liju; Fostel, Jennifer; Fragoso, Gilberto; Gibson, Frank; Gonzalez-Beltran, Alejandra; Haendel, Melissa A.; He, Yongqun; Heiskanen, Mervi; Hernandez-Boussard, Tina; Jensen, Mark; Lin, Yu; Lister, Allyson L.; Lord, Phillip; Malone, James; Manduchi, Elisabetta; McGee, Monnie; Morrison, Norman; Overton, James A.; Parkinson, Helen; Peters, Bjoern; Rocca-Serra, Philippe; Ruttenberg, Alan; Sansone, Susanna-Assunta; Scheuermann, Richard H.; Schober, Daniel; Smith, Barry; Soldatova, Larisa N.; Stoeckert, Christian J.; Taylor, Chris F.; Torniai, Carlo; Turner, Jessica A.; Vita, Randi; Whetzel, Patricia L.; Zheng, Jie

    2016-01-01

    The Ontology for Biomedical Investigations (OBI) is an ontology that provides terms with precisely defined meanings to describe all aspects of how investigations in the biological and medical domains are conducted. OBI re-uses ontologies that provide a representation of biomedical knowledge from the Open Biological and Biomedical Ontologies (OBO) project and adds the ability to describe how this knowledge was derived. We here describe the state of OBI and several applications that are using it, such as adding semantic expressivity to existing databases, building data entry forms, and enabling interoperability between knowledge resources. OBI covers all phases of the investigation process, such as planning, execution and reporting. It represents information and material entities that participate in these processes, as well as roles and functions. Prior to OBI, it was not possible to use a single internally consistent resource that could be applied to multiple types of experiments for these applications. OBI has made this possible by creating terms for entities involved in biological and medical investigations and by importing parts of other biomedical ontologies such as GO, Chemical Entities of Biological Interest (ChEBI) and Phenotype Attribute and Trait Ontology (PATO) without altering their meaning. OBI is being used in a wide range of projects covering genomics, multi-omics, immunology, and catalogs of services. OBI has also spawned other ontologies (Information Artifact Ontology) and methods for importing parts of ontologies (Minimum information to reference an external ontology term (MIREOT)). The OBI project is an open cross-disciplinary collaborative effort, encompassing multiple research communities from around the globe. To date, OBI has created 2366 classes and 40 relations along with textual and formal definitions. The OBI Consortium maintains a web resource (http://obi-ontology.org) providing details on the people, policies, and issues being addressed

  2. The Ontology for Biomedical Investigations.

    Science.gov (United States)

    Bandrowski, Anita; Brinkman, Ryan; Brochhausen, Mathias; Brush, Matthew H; Bug, Bill; Chibucos, Marcus C; Clancy, Kevin; Courtot, Mélanie; Derom, Dirk; Dumontier, Michel; Fan, Liju; Fostel, Jennifer; Fragoso, Gilberto; Gibson, Frank; Gonzalez-Beltran, Alejandra; Haendel, Melissa A; He, Yongqun; Heiskanen, Mervi; Hernandez-Boussard, Tina; Jensen, Mark; Lin, Yu; Lister, Allyson L; Lord, Phillip; Malone, James; Manduchi, Elisabetta; McGee, Monnie; Morrison, Norman; Overton, James A; Parkinson, Helen; Peters, Bjoern; Rocca-Serra, Philippe; Ruttenberg, Alan; Sansone, Susanna-Assunta; Scheuermann, Richard H; Schober, Daniel; Smith, Barry; Soldatova, Larisa N; Stoeckert, Christian J; Taylor, Chris F; Torniai, Carlo; Turner, Jessica A; Vita, Randi; Whetzel, Patricia L; Zheng, Jie

    2016-01-01

    The Ontology for Biomedical Investigations (OBI) is an ontology that provides terms with precisely defined meanings to describe all aspects of how investigations in the biological and medical domains are conducted. OBI re-uses ontologies that provide a representation of biomedical knowledge from the Open Biological and Biomedical Ontologies (OBO) project and adds the ability to describe how this knowledge was derived. We here describe the state of OBI and several applications that are using it, such as adding semantic expressivity to existing databases, building data entry forms, and enabling interoperability between knowledge resources. OBI covers all phases of the investigation process, such as planning, execution and reporting. It represents information and material entities that participate in these processes, as well as roles and functions. Prior to OBI, it was not possible to use a single internally consistent resource that could be applied to multiple types of experiments for these applications. OBI has made this possible by creating terms for entities involved in biological and medical investigations and by importing parts of other biomedical ontologies such as GO, Chemical Entities of Biological Interest (ChEBI) and Phenotype Attribute and Trait Ontology (PATO) without altering their meaning. OBI is being used in a wide range of projects covering genomics, multi-omics, immunology, and catalogs of services. OBI has also spawned other ontologies (Information Artifact Ontology) and methods for importing parts of ontologies (Minimum information to reference an external ontology term (MIREOT)). The OBI project is an open cross-disciplinary collaborative effort, encompassing multiple research communities from around the globe. To date, OBI has created 2366 classes and 40 relations along with textual and formal definitions. The OBI Consortium maintains a web resource (http://obi-ontology.org) providing details on the people, policies, and issues being addressed

  3. The Ontology for Biomedical Investigations.

    Directory of Open Access Journals (Sweden)

    Anita Bandrowski

    Full Text Available The Ontology for Biomedical Investigations (OBI) is an ontology that provides terms with precisely defined meanings to describe all aspects of how investigations in the biological and medical domains are conducted. OBI re-uses ontologies that provide a representation of biomedical knowledge from the Open Biological and Biomedical Ontologies (OBO) project and adds the ability to describe how this knowledge was derived. We here describe the state of OBI and several applications that are using it, such as adding semantic expressivity to existing databases, building data entry forms, and enabling interoperability between knowledge resources. OBI covers all phases of the investigation process, such as planning, execution and reporting. It represents information and material entities that participate in these processes, as well as roles and functions. Prior to OBI, it was not possible to use a single internally consistent resource that could be applied to multiple types of experiments for these applications. OBI has made this possible by creating terms for entities involved in biological and medical investigations and by importing parts of other biomedical ontologies such as GO, Chemical Entities of Biological Interest (ChEBI) and Phenotype Attribute and Trait Ontology (PATO) without altering their meaning. OBI is being used in a wide range of projects covering genomics, multi-omics, immunology, and catalogs of services. OBI has also spawned other ontologies (Information Artifact Ontology) and methods for importing parts of ontologies (Minimum information to reference an external ontology term (MIREOT)). The OBI project is an open cross-disciplinary collaborative effort, encompassing multiple research communities from around the globe. To date, OBI has created 2366 classes and 40 relations along with textual and formal definitions. The OBI Consortium maintains a web resource (http://obi-ontology.org) providing details on the people, policies, and issues being

  4. Education of biomedical engineering in Taiwan.

    Science.gov (United States)

    Lin, Kang-Ping; Kao, Tsair; Wang, Jia-Jung; Chen, Mei-Jung; Su, Fong-Chin

    2014-01-01

    Biomedical Engineers (BME) play an important role in the medical and healthcare community. Good educational programs are important to support healthcare systems, including hospitals, long-term care organizations, manufacturers of medical devices/instrumentation/systems, and sales/service companies for medical devices/instrumentation/systems. Over the past 30-plus years, the biomedical engineering community in Taiwan has accumulated thousands of people who hold a biomedical engineering degree and work as biomedical engineers. Most BME students are trained in biomedical engineering departments with at least one specialty in bioelectronics, bio-information, biomaterials or biomechanics. Students are required to complete 320 hours of internship training in related institutions off campus before graduating. Almost all the biomedical engineering departments are certified by IEET (Institute of Engineering Education Taiwan) and meet the IEET requirements for mathematics and fundamental engineering courses. After graduation, the Taiwanese Society of Biomedical Engineering (TSBME) provides many continuing-education programs and certificates for members who wish to hold the certification as a professional credential in their workplace. Currently, many engineering departments in universities are asked to provide joint programs with BME departments to train better-quality students. BME is one of the growing fields in Taiwan.

  5. Autoradiography: Biomedical applications. January, 1975-August, 1981 (citations from the International Information Service for the Physics and Engineering Communities Data Base). Report for January 75-August 81

    Energy Technology Data Exchange (ETDEWEB)

    1981-08-01

    This bibliography contains citations concerning the methods, equipment, and biomedical applications of autoradiography. Electron microscopy (EM) is discussed at length, and sample preparation for EM is detailed. Numerous biological, biochemical and biophysical studies are discussed. (Contains 84 citations fully indexed and including a title list.)

  6. Information retrieval and terminology extraction in online resources for patients with diabetes.

    Science.gov (United States)

    Seljan, Sanja; Baretić, Maja; Kucis, Vlasta

    2014-06-01

    Terminology use, as a means for information retrieval or document indexing, plays an important role in health literacy. Specific types of users, i.e. patients with diabetes, need access to various online resources (in foreign and/or native languages) when searching for information on basic diabetic knowledge, on self-care activities such as the importance of dietetic food, medications and physical exercise, and on self-management of insulin pumps. Automatic extraction of corpus-based terminology from online texts, manuals or professional papers can help in building terminology lists or lists of "browsing phrases" useful in information retrieval or document indexing. Specific terminology lists represent an intermediate step between free-text search and controlled vocabulary, between users' demands and existing online resources in native and foreign languages. The research, aiming to detect the role of terminology in online resources, is conducted on English and Croatian manuals and Croatian online texts, and divided into three interrelated parts: i) comparison of professional and popular terminology use; ii) evaluation of automatic statistically-based terminology extraction on English and Croatian texts; and iii) comparison and evaluation of extracted terminology performed on an English manual using statistical and hybrid approaches. Extracted terminology candidates are evaluated by comparison with three types of reference lists: a list created by a professional medical person, a list of highly professional vocabulary contained in MeSH, and a list created by non-medical persons, made as the intersection of 15 lists. Results report on the use of popular and professional terminology in online diabetes resources, on the evaluation of automatically extracted terminology candidates in English and Croatian texts, and on the comparison of statistical and hybrid extraction methods in an English text. Evaluation of automatic and semi-automatic terminology extraction methods is performed by recall
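
    Statistically-based extraction of the kind evaluated here can be approximated by ranking candidate n-grams by corpus frequency. A minimal sketch with scikit-learn; the three sentences stand in for the diabetes manuals:

    ```python
    from sklearn.feature_extraction.text import CountVectorizer

    corpus = [
        "Patients with diabetes should monitor blood glucose daily.",
        "An insulin pump delivers insulin; blood glucose control improves.",
        "Self-care includes dietetic food choices and physical exercise.",
    ]

    # Candidate terms: unigrams and bigrams ranked by frequency.
    vec = CountVectorizer(ngram_range=(1, 2), stop_words="english")
    counts = vec.fit_transform(corpus).sum(axis=0).A1
    ranked = sorted(zip(vec.get_feature_names_out(), counts),
                    key=lambda p: -p[1])
    print(ranked[:10])  # top terminology candidates
    ```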

  7. OpenCV-Based Nanomanipulation Information Extraction and the Probe Operation in SEM

    Directory of Open Access Journals (Sweden)

    Dongjie Li

    2015-02-01

    Full Text Available Aimed at an established telenanomanipulation system, the method of extracting location information and the strategies of probe operation were studied in this paper. First, a machine learning algorithm from OpenCV was used to extract location information from SEM images, so that the nanowires and probe in SEM images can be automatically tracked and the region of interest (ROI) marked quickly. The locations of the nanowire and the probe can then be extracted from the ROI. To study the probe operation strategy, the Van der Waals force between the probe and a nanowire was computed to obtain the relevant operating parameters. With these operating parameters, the nanowire can be pre-operated in a 3D virtual environment and an optimal path for the probe obtained. The actual probe then runs automatically under the telenanomanipulation system's control. Finally, experiments were carried out to verify the above methods, and the results show that the designed methods achieve the expected effect.
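
    OpenCV's learning-based trackers vary across versions, so as a simple stand-in for the paper's tracker, here is a minimal template-matching sketch that locates the probe and marks its ROI (file names are placeholders):

    ```python
    import cv2

    frame = cv2.imread("sem_frame.png", cv2.IMREAD_GRAYSCALE)     # placeholder
    template = cv2.imread("probe_tip.png", cv2.IMREAD_GRAYSCALE)  # placeholder

    # Slide the template over the frame and take the best match.
    result = cv2.matchTemplate(frame, template, cv2.TM_CCOEFF_NORMED)
    _, max_val, _, max_loc = cv2.minMaxLoc(result)

    h, w = template.shape
    x, y = max_loc
    roi = frame[y:y + h, x:x + w]  # region of interest around the probe
    print(f"probe located at {max_loc}, score {max_val:.2f}")
    ```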

  8. [Advances in researches on hyperspectral remote sensing forestry information-extracting technology].

    Science.gov (United States)

    Wu, Jian; Peng, Dao-Li

    2011-09-01

    Hyperspectral remote sensing technology has become one of the leading technologies in the forestry remote sensing domain. In the present review paper, advances in research on hyperspectral remote sensing technology for forestry information extraction both at home and abroad are reviewed, and five main research aspects are summarized: hyperspectral classification and recognition of forest tree species, hyperspectral inversion and extraction of forest ecological physical parameters, hyperspectral monitoring and diagnosis of forest nutrient elements, forest crown density information extraction, and hyperspectral monitoring of forest disasters. The unresolved problems of hyperspectral technology in forestry remote sensing applications are pointed out, possible ways to solve them are expounded, and the application prospects of hyperspectral remote sensing technology in forestry are analyzed. PMID:22097816

  9. A method of building information extraction based on mathematical morphology and multiscale

    Science.gov (United States)

    Li, Jing-wen; Wang, Ke; Zhang, Zi-ping; Xue, Long-li; Yin, Shou-qiang; Zhou, Song

    2015-12-01

    To monitor changes of buildings on the Earth's surface, this paper analyzes the distribution characteristics of buildings in remote sensing images and, combining multi-scale image segmentation with the advantages of mathematical morphology, proposes a segmentation method for high resolution remote sensing images that couples multiple scales with mathematical morphology. Building information is then extracted using a multiple fuzzy classification method and a shadow-based auxiliary method. Compared with k-means classification and the traditional maximum likelihood classification method, the experimental results show that the object-based, multi-scale segmentation and extraction method combined with mathematical morphology can accurately extract building structure information and produce clearer classification data, providing a basis and theoretical support for intelligent monitoring of Earth data.
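
    A minimal sketch of the morphological clean-up, assuming a binary building mask has already been produced by classification; the file name and kernel size are placeholders:

    ```python
    import cv2
    import numpy as np

    mask = cv2.imread("building_mask.png", cv2.IMREAD_GRAYSCALE)  # placeholder

    kernel = np.ones((5, 5), np.uint8)
    # Opening removes small noisy blobs; closing fills holes in roofs.
    opened = cv2.morphologyEx(mask, cv2.MORPH_OPEN, kernel)
    cleaned = cv2.morphologyEx(opened, cv2.MORPH_CLOSE, kernel)

    cv2.imwrite("building_mask_clean.png", cleaned)
    ```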

  10. Extraction and Network Sharing of Forest Vegetation Information based on SVM

    Directory of Open Access Journals (Sweden)

    Zhang Hannv

    2013-05-01

    Full Text Available The support vector machine (SVM) is a new method of data mining which can deal with regression problems (time series analysis), pattern recognition (classification, discriminant analysis) and many other issues very well. In recent years, SVM has been widely used in computer-based classification and recognition of remote sensing images. Based on Landsat TM image data, this paper uses a classification method based on the support vector machine to extract forest cover information for the Dahuanggou tree farm in the Changbai Mountain area, and compares it with conventional maximum likelihood classification. The results show that the extraction accuracy of forest information based on the support vector machine (Kappa values of 0.9810, 0.9716 and 0.9753) exceeds that of the maximum likelihood method (MLC, Kappa value of 0.9634); the method is practical and easy to operate.
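
    A minimal sketch of pixel-wise SVM classification in the spirit of this study, using scikit-learn; the six-band training pixels and labels are invented, not Landsat TM measurements:

    ```python
    import numpy as np
    from sklearn.svm import SVC

    # Rows: band values of training pixels; labels: 1=forest, 0=non-forest.
    X_train = np.array([
        [30, 25, 20, 90, 60, 25],
        [32, 27, 22, 95, 58, 24],
        [80, 75, 70, 60, 90, 80],
        [70, 68, 66, 55, 85, 75],
    ])
    y_train = np.array([1, 1, 0, 0])

    clf = SVC(kernel="rbf", gamma="scale").fit(X_train, y_train)

    # Classify unseen pixels of the image.
    pixels = np.array([[31, 26, 21, 92, 59, 25],
                       [75, 70, 68, 58, 88, 78]])
    print(clf.predict(pixels))  # expected: [1 0]
    ```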

  11. Services for annotation of biomedical text

    OpenAIRE

    Hakenberg, Jörg

    2008-01-01

    Motivation: Text mining in the biomedical domain in recent years has focused on the development of tools for recognizing named entities and extracting relations. Such research resulted from the need for such tools as basic components for more advanced solutions. Named entity recognition, entity mention normalization, and relationship extraction now have reached a stage where they perform comparably to human annotators (considering inter-annotator agreement, measured in many studies to be aro...

  12. An Information Extraction Core System for Real World German Text Processing

    CERN Document Server

    Neumann, G; Baur, J; Becker, M; Braun, C

    1997-01-01

    This paper describes SMES, an information extraction core system for real-world German text processing. The basic design criterion of the system is to provide a set of basic, powerful, robust, and efficient natural language components and generic linguistic knowledge sources which can easily be customized for processing different tasks in a flexible manner.

  13. Application of high spatial resolution airborne hyperspectral remote sensing data in thematic information extraction

    Science.gov (United States)

    Xu, Hong-gen; Ma, Hong-chao; Li, De-ren; Song, Yan

    2006-10-01

    Airborne hyperspectral remote sensing data, such as PHI and OMIS, have the virtues of high spatial and spectral resolution. Hence, from the viewpoint of target classification, such data can discriminate targets in more detail than other data, so it is important to extract thematic information and update databases using them. However, because hyperspectral data have abundant bands and high between-band correlation, traditional classification methods such as the maximum likelihood classifier (MLC) and the spectral angle mapper (SAM) have performed poorly in thematic information extraction. For this reason, we present a new method for thematic information extraction from hyperspectral remote sensing data. We perform classification by combining the self-organizing map (SOM) neural network, a full-pixel technique, with linear spectral mixture analysis (LSMA), a mixed-pixel technique. The SOM neural network is improved in several aspects to classify the pure pixels and find the mixed pixels, and the mixed pixels are then unmixed and classified by LSMA. The experimental results show that this approach performs better in thematic information extraction with PHI data.

  15. Web信息抽取系统的设计%Design of Web Information Extraction System

    Institute of Scientific and Technical Information of China (English)

    刘斌; 张晓婧

    2013-01-01

    In order to obtain the scattered information hidden in Web pages, a Web information extraction system was designed. The system first uses a modified HITS topic-selection algorithm for information collection; it then preprocesses the HTML document structure of the Web pages; finally, an XPath absolute-path generation algorithm based on the DOM tree obtains XPath expressions for the annotated nodes, and extraction rules are written in the XPath language combined with XSLT, producing a structured database or XML file and thereby locating and extracting the Web information. An extraction experiment on a shopping site shows that the system works well and can batch-extract similar Web pages.
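
    A minimal sketch of the XPath extraction step described above, using the lxml library (an assumption; the record's own toolchain pairs XPath with XSLT); the sample markup and expressions are hypothetical.

      from lxml import html

      page = """<html><body>
        <div class="item"><span class="name">Phone</span><span class="price">199</span></div>
        <div class="item"><span class="name">Laptop</span><span class="price">899</span></div>
      </body></html>"""

      tree = html.fromstring(page)
      for item in tree.xpath('//div[@class="item"]'):
          name = item.xpath('./span[@class="name"]/text()')[0]
          price = item.xpath('./span[@class="price"]/text()')[0]
          print(name, price)  # structured records ready for a database or XML file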

  16. Information extraction with object based support vector machines and vegetation indices

    Science.gov (United States)

    Ustuner, Mustafa; Abdikan, Saygin; Balik Sanli, Fusun

    2016-07-01

    Information extraction from remote sensing data is important for policy and decision makers, as the extracted information provides base layers for many real-world applications. Classification of remotely sensed data is one of the most common methods of extracting information; however, it is still challenging because several factors affect classification accuracy: the resolution of the imagery, the number and homogeneity of land cover classes, the purity of the training data and the characteristics of the adopted classifiers are just some of them. Object-based image classification has advantages over pixel-based classification for high-resolution images, since it uses geometry and structure information besides spectral information. Vegetation indices are also commonly used in the classification process, since they provide additional spectral information for vegetation, forestry and agricultural areas. In this study, the impacts of the Normalized Difference Vegetation Index (NDVI) and Normalized Difference Red Edge Index (NDRE) on the classification accuracy of RapidEye imagery were investigated. Object-based Support Vector Machines were implemented for the classification of crop types in the study area, located in the Aegean region of Turkey. Results demonstrated that the incorporation of NDRE increases the overall classification accuracy from 79.96% to 86.80%, whereas NDVI decreases it from 79.96% to 78.90%. Moreover, object-based classification with RapidEye data gives promising results for crop type mapping and analysis.
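
    A minimal sketch of the two band ratios named above, NDVI = (NIR - Red) / (NIR + Red) and NDRE = (NIR - RedEdge) / (NIR + RedEdge), computed with NumPy; the band arrays are placeholders for RapidEye red, red-edge and near-infrared rasters.

      import numpy as np

      red, rededge, nir = (np.random.rand(100, 100) for _ in range(3))  # placeholder bands

      ndvi = (nir - red) / (nir + red + 1e-9)          # (NIR - Red) / (NIR + Red)
      ndre = (nir - rededge) / (nir + rededge + 1e-9)  # (NIR - RedEdge) / (NIR + RedEdge)

      # stacked with the original bands, the indices become additional
      # per-object features for an SVM classifier
      features = np.dstack([red, rededge, nir, ndvi, ndre])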

  17. Identifying and extracting patient smoking status information from clinical narrative texts in Spanish.

    Science.gov (United States)

    Figueroa, Rosa L; Soto, Diego A; Pino, Esteban J

    2014-01-01

    In this work we present a system to identify and extract patients' smoking status from clinical narrative text in Spanish. The clinical narrative text was processed using natural language processing techniques and annotated by four people with a biomedical background. The dataset used for classification had 2,465 documents, each annotated with one of four smoking status categories. We used two feature representations: single word tokens and bigrams. The classification problem was divided into two levels: first, distinguishing smokers (S) from non-smokers (NS); second, distinguishing current smokers (CS) from past smokers (PS). For each feature representation and classification level, we used two classifiers: Support Vector Machines (SVM) and Bayesian Networks (BN). We split our dataset into a training set containing 66% of the available documents, used to build the classifiers, and a test set containing the remaining 34%, used to test and evaluate the model. Our results show that SVM together with the bigram representation performed better at both classification levels. For the S vs. NS level the performance measures were ACC=85%, Precision=85% and Recall=90%; for the CS vs. PS level they were ACC=87%, Precision=91% and Recall=94%. PMID:25570550
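
    A minimal sketch of one level of the setup described above (S vs. NS) with bigram features and a linear SVM in scikit-learn; the Spanish note snippets and labels are toy placeholders, and the 66/34 split mirrors the record.

      from sklearn.model_selection import train_test_split
      from sklearn.pipeline import make_pipeline
      from sklearn.feature_extraction.text import CountVectorizer
      from sklearn.svm import LinearSVC

      docs = ["paciente fuma a diario", "niega consumo de tabaco",
              "fumador activo hace 10 anos", "no fuma ni bebe"]  # placeholder notes
      labels = ["S", "NS", "S", "NS"]

      X_tr, X_te, y_tr, y_te = train_test_split(
          docs, labels, test_size=0.34, random_state=0, stratify=labels)
      model = make_pipeline(CountVectorizer(ngram_range=(2, 2)), LinearSVC())
      model.fit(X_tr, y_tr)
      print(model.predict(X_te))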

  18. A novel lossless robust watermarking method for copyright protection of biomedical image.

    Science.gov (United States)

    Chen, Zhigang; Deng, Xiaohong; Zeng, Feng; Liu, Huiwen; Chen, Liang

    2013-03-01

    Nanoscience and nanotechnology promote the development of biomedical imaging devices; more and more valuable biomedical images are transmitted over open networks, and their copyright faces a major challenge. This paper proposes a new lossless robust watermarking method for copyright protection of biomedical images. For high-resolution biomedical images, redundancy theory and a histogram-shifting method based on inter-block differences are used to repeatedly embed the copyright watermark into different color channels. In the extraction procedure, each layer's watermark information is extracted from the watermarked image separately, and the final watermark is decided by voting theory. The redundancy scheme and voting theory ensure that the watermark resists random-noise attacks well. Besides, to reduce the watermark overhead in the embedding procedure, a histogram narrowing-down technique handles those image blocks that would overflow or underflow. Experimental results showed that the proposed method outperformed an existing semi-fragile reversible watermarking scheme and provided great robustness against moderate JPEG compression and salt-and-pepper and Gaussian noise.
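
    A simplified, hypothetical sketch of the histogram-shifting idea behind such reversible watermarking (not the authors' multi-channel, block-difference scheme): bits are embedded at the histogram peak after shifting the bins between the peak and an empty bin; capacity is assumed sufficient for the payload.

      import numpy as np

      def embed(img, bits):
          h = np.bincount(img.ravel(), minlength=256)
          peak = int(h.argmax())                        # most frequent gray level
          zero = int(h[peak + 1:].argmin()) + peak + 1  # empty bin to the right of the peak
          out = img.copy()
          shift = (out > peak) & (out < zero)
          out[shift] += 1                               # open a gap next to the peak
          flat = out.ravel()
          carriers = np.flatnonzero(flat == peak)[:len(bits)]
          flat[carriers] += np.asarray(bits, dtype=out.dtype)  # peak -> peak+1 encodes a 1
          return flat.reshape(img.shape), peak, zero    # peak/zero needed for extraction

      img = np.random.randint(0, 200, (64, 64), dtype=np.uint8)  # placeholder image
      marked, peak, zero = embed(img, [1, 0, 1, 1])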

  19. Semi-automatic building extraction in informal settlements from high-resolution satellite imagery

    Science.gov (United States)

    Mayunga, Selassie David

    The extraction of man-made features from digital remotely sensed images is considered an important step underpinning management of human settlements in any country. Man-made features, and buildings in particular, are required for a variety of applications such as urban planning and the creation of geographic information system (GIS) databases and urban city models. Traditional man-made feature extraction methods are very expensive in terms of equipment, are labour intensive, need well-trained personnel and cannot cope with changing environments, particularly in dense urban settlement areas. This research presents an approach for extracting buildings in dense informal settlement areas using high-resolution satellite imagery. The proposed system uses a novel strategy of extracting a building by measuring a single point at the approximate centre of the building; the fine measurement of the building outline is then effected using a modified snake model. The original snake model on which this framework is based incorporates an external constraint energy term tailored to preserving the convergence properties of the snake model; applying it to unstructured objects would negatively affect their actual shapes. The external constraint energy term was therefore removed from the original snake model formulation, giving the model the ability to cope with the high variability of building shapes in informal settlement areas. The proposed building extraction system was tested on two areas with different situations. The first was Tungi in Dar Es Salaam, Tanzania, where three sites were tested; this area is characterized by informal settlements formed illegally within the city boundaries. The second was Oromocto in New Brunswick, Canada, where two sites were tested; the Oromocto area is mostly flat and the buildings are constructed using similar materials. Qualitative and quantitative measures were employed to evaluate the accuracy of the results as well as the performance

  1. Studies on methods and techniques of weak information extraction and integrated evaluation for sandstone-type uranium deposits

    International Nuclear Information System (INIS)

    Weak information extraction and integrated evaluation for sandstone-type uranium deposits are currently among the important research topics in uranium exploration. Through several years of research, the authors define the meaning of aeromagnetic and aeroradioactive weak information, study the formation theories of such weak information and establish effective mathematical models for weak information extraction. Based on GIS software, the weak information extraction models are implemented and an expert-grading model for integrated evaluation is developed. Trials of aeromagnetic and aeroradioactive weak information extraction and integrated evaluation of uranium resources were completed with GIS software in the study area. The research results prove that these techniques can rapidly delineate prospective areas of sandstone-type uranium deposits and improve predictive precision. (authors)

  2. PALM-IST: Pathway Assembly from Literature Mining - an Information Search Tool

    OpenAIRE

    Mandloi, Sapan; Chakrabarti, Saikat

    2015-01-01

    Manual curation of biomedical literature has become an extremely tedious process due to the literature's exponential growth in recent years. To extract meaningful information from such large and unstructured text, newer and more efficient mining tools are required. Here, we introduce PALM-IST, a computational platform that not only allows users to explore biomedical abstracts using keyword-based text mining but also extracts biological entities (e.g., gene/protein, drug, disease, biological processes, cellular c...

  3. Automatic Extraction of Spatio-Temporal Information from Arabic Text Documents

    Directory of Open Access Journals (Sweden)

    Abdelkoui Feriel

    2015-10-01

    Full Text Available Unstructured Arabic text documents are an important source of geographical and temporal information. The possibility of automatically tracking spatio-temporal information and capturing changes relating to events from text documents is a new challenge in the fields of geographic information retrieval (GIR), temporal information retrieval (TIR) and natural language processing (NLP). Much work has been done on information extraction for languages that use the Latin alphabet, such as English, French or Spanish; by contrast, Arabic is still not well supported in GIR and TIR and needs more research. In this paper, we present an approach that supports automated exploration and extraction of spatio-temporal information from Arabic text documents, in order to capture and model such information before it is utilized in search and exploration tasks. The system was successfully tested on 50 documents that include a mixture of types of spatial/temporal information, achieving 91.01% recall and 80% precision. This illustrates that our approach is effective and its performance is satisfactory.

  4. A Framework For Extracting Information From Web Using VTD-XML‘s XPath

    Directory of Open Access Journals (Sweden)

    C. Subhashini

    2012-03-01

    Full Text Available The exponential growth of the WWW (World Wide Web) is the cause of a vast pool of information as well as several challenges posed by it, such as extracting potentially useful and unknown information from the WWW. Many websites are built with HTML; because of its unstructured layout, it is difficult to obtain effective and precise data from the Web using HTML. The advent of XML (Extensible Markup Language) offers a better solution for extracting useful knowledge from the WWW, and Web data extraction based on XML technology addresses this problem because XML is a general-purpose specification for exchanging data over the Web. In this paper, a framework is suggested to extract data from the Web. The semi-structured data in a Web page is transformed into well-structured data using standard XML technologies, and a new parsing technique called extended VTD-XML (Virtual Token Descriptor for XML), together with an XPath implementation, is used to extract data from the well-structured XML document.

  5. Framework for automatic information extraction from research papers on nanocrystal devices

    Directory of Open Access Journals (Sweden)

    Thaer M. Dieb

    2015-09-01

    Full Text Available To support nanocrystal device development, we have been working on a computational framework to utilize information in research papers on nanocrystal devices. We developed an annotated corpus called “NaDev” (Nanocrystal Device Development) for this purpose. We also proposed an automatic information extraction system called “NaDevEx” (Nanocrystal Device Automatic Information Extraction Framework). NaDevEx aims at extracting information from research papers on nanocrystal devices using the NaDev corpus and machine-learning techniques. However, the characteristics of NaDevEx were not examined in detail. In this paper, we conduct system evaluation experiments for NaDevEx using the NaDev corpus. We discuss three main issues: system performance compared with human annotators; the effect of paper type (synthesis or characterization) on system performance; and the effects of domain knowledge features (e.g., a chemical named entity recognition system and a list of names of physical quantities) on system performance. We found that overall system performance was 89% in precision and 69% in recall. If we consider identification of terms that intersect with correct terms for the same information category as correct identification, i.e., loose agreement (in many cases, appropriate head nouns such as temperature or pressure loosely match between two terms), the overall performance is 95% in precision and 74% in recall. The system performance is almost comparable with the results of human annotators for information categories with rich domain knowledge information (source material). However, for other information categories, given the relatively large number of terms that exist in only one paper, recall of individual information categories is not high (39-73%); precision, however, is better (75-97%). The average performance for synthesis papers is better than that for

  6. Framework for automatic information extraction from research papers on nanocrystal devices.

    Science.gov (United States)

    Dieb, Thaer M; Yoshioka, Masaharu; Hara, Shinjiro; Newton, Marcus C

    2015-01-01

    To support nanocrystal device development, we have been working on a computational framework to utilize information in research papers on nanocrystal devices. We developed an annotated corpus called "NaDev" (Nanocrystal Device Development) for this purpose. We also proposed an automatic information extraction system called "NaDevEx" (Nanocrystal Device Automatic Information Extraction Framework). NaDevEx aims at extracting information from research papers on nanocrystal devices using the NaDev corpus and machine-learning techniques. However, the characteristics of NaDevEx were not examined in detail. In this paper, we conduct system evaluation experiments for NaDevEx using the NaDev corpus. We discuss three main issues: system performance compared with human annotators; the effect of paper type (synthesis or characterization) on system performance; and the effects of domain knowledge features (e.g., a chemical named entity recognition system and a list of names of physical quantities) on system performance. We found that overall system performance was 89% in precision and 69% in recall. If we consider identification of terms that intersect with correct terms for the same information category as correct identification, i.e., loose agreement (in many cases, appropriate head nouns such as temperature or pressure loosely match between two terms), the overall performance is 95% in precision and 74% in recall. The system performance is almost comparable with the results of human annotators for information categories with rich domain knowledge information (source material). However, for other information categories, given the relatively large number of terms that exist in only one paper, recall of individual information categories is not high (39-73%); precision, however, is better (75-97%). The average performance for synthesis papers is better than that for characterization papers

  7. Clinic expert information extraction based on domain model and block importance model.

    Science.gov (United States)

    Zhang, Yuanpeng; Wang, Li; Qian, Danmin; Geng, Xingyun; Yao, Dengfu; Dong, Jiancheng

    2015-11-01

    To extract expert clinic information from the Deep Web, two challenges must be faced. The first is making judgments on forms: a novel method is proposed based on a domain model, a tree structure constructed from the attributes of query interfaces. With this model, query interfaces can be classified to a domain and filled in with domain keywords. The other challenge is extracting information from the response Web pages indexed by the query interfaces. To filter the noisy information on a Web page, a block importance model is proposed in which both content and spatial features are taken into account. The experimental results indicate that the domain model yields a precision 4.89% higher than that of the rule-based method, whereas the block importance model yields an F1 measure 10.5% higher than that of the XPath method. PMID:26231612

  8. Handbook of biomedical optics

    CERN Document Server

    Boas, David A

    2011-01-01

    Biomedical optics holds tremendous promise to deliver effective, safe, non- or minimally invasive diagnostics and targeted, customizable therapeutics. Handbook of Biomedical Optics provides an in-depth treatment of the field, including coverage of applications for biomedical research, diagnosis, and therapy. It introduces the theory and fundamentals of each subject, ensuring accessibility to a wide multidisciplinary readership. It also offers a view of the state of the art and discusses advantages and disadvantages of various techniques.Organized into six sections, this handbook: Contains intr

  9. Biomedical engineering fundamentals

    CERN Document Server

    Bronzino, Joseph D; Bronzino, Joseph D

    2006-01-01

    Over the last century, medicine has come out of the "black bag" and emerged as one of the most dynamic and advanced fields of development in science and technology. Today, biomedical engineering plays a critical role in patient diagnosis, care, and rehabilitation. As such, the field encompasses a wide range of disciplines, from biology and physiology to informatics and signal processing. Reflecting the enormous growth and change in biomedical engineering during the infancy of the 21st century, The Biomedical Engineering Handbook enters its third edition as a set of three carefully focused and

  10. Biomedical Engineering Desk Reference

    CERN Document Server

    Ratner, Buddy D; Schoen, Frederick J; Lemons, Jack E; Dyro, Joseph; Martinsen, Orjan G; Kyle, Richard; Preim, Bernhard; Bartz, Dirk; Grimnes, Sverre; Vallero, Daniel; Semmlow, John; Murray, W Bosseau; Perez, Reinaldo; Bankman, Isaac; Dunn, Stanley; Ikada, Yoshito; Moghe, Prabhas V; Constantinides, Alkis

    2009-01-01

    A one-stop desk reference for biomedical engineers involved in this ever-expanding and very fast-moving area; this is a book that will not gather dust on the shelf. It brings together the essential professional reference content from leading international contributors in the biomedical engineering field. Material covers a broad range of topics including biomechanics and biomaterials, tissue engineering, and biosignal processing. * A hard-working desk reference providing all the essential material needed by biomedical and clinical engineers on a day-to-day basis * Fundamentals, key techniques,

  11. Biomedical applications of polymers

    CERN Document Server

    Gebelein, C G

    1991-01-01

    The biomedical applications of polymers span an extremely wide spectrum of uses, including artificial organs, skin and soft tissue replacements, orthopaedic applications, dental applications, and controlled release of medications. No single, short review can possibly cover all these items in detail, and dozens of books and hundreds of reviews exist on biomedical polymers. Only a few relatively recent examples will be cited here; additional reviews are listed under most of the major topics in this book. We will consider each of the major classifications of biomedical polymers to some extent, inclu

  12. Powering biomedical devices

    CERN Document Server

    Romero, Edwar

    2013-01-01

    From exoskeletons to neural implants, biomedical devices are no less than life-changing. Compact and constant power sources are necessary to keep these devices running efficiently. Edwar Romero's Powering Biomedical Devices reviews the background, current technologies, and possible future developments of these power sources, examining not only the types of biomedical power sources available (macro, mini, MEMS, and nano), but also what they power (such as prostheses, insulin pumps, and muscular and neural stimulators), and how they work (covering batteries, biofluids, kinetic and ther

  13. Extraction of Hidden Social Networks from Wiki-Environment Involved in Information Conflict

    Directory of Open Access Journals (Sweden)

    Rasim M. Alguliyev

    2016-03-01

    Full Text Available Social network analysis is a widely used technique to analyze relationships among wiki-users in Wikipedia. In this paper a method to identify hidden social networks participating in information conflicts in a wiki-environment is proposed. In particular, we describe how text clustering techniques can be used to extract the hidden social networks of wiki-users involved in an information conflict. By clustering the unstructured text of the articles that caused the information conflict, we create the social network of wiki-users. For clustering of the conflict articles, a hybrid weighted fuzzy-c-means method is proposed.
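
    A compact, illustrative fuzzy c-means loop in NumPy (the record proposes a hybrid weighted variant; this is only the standard algorithm on placeholder article vectors such as tf-idf rows).

      import numpy as np

      def fuzzy_cmeans(X, c=2, m=2.0, iters=100, seed=0):
          rng = np.random.default_rng(seed)
          U = rng.random((len(X), c))
          U /= U.sum(axis=1, keepdims=True)       # fuzzy memberships, rows sum to 1
          for _ in range(iters):
              W = U ** m
              centers = (W.T @ X) / W.sum(axis=0)[:, None]
              d = np.linalg.norm(X[:, None, :] - centers[None], axis=2) + 1e-9
              U = 1.0 / (d ** (2 / (m - 1)))      # standard membership update
              U /= U.sum(axis=1, keepdims=True)
          return U, centers

      X = np.random.rand(50, 10)                  # placeholder article feature vectors
      U, centers = fuzzy_cmeans(X)
      print(U.argmax(axis=1))                     # hard cluster of each article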

  14. Extracting directed information flow networks: an application to genetics and semantics

    CERN Document Server

    Masucci, A P; Hernández-García, E; Kalampokis, A

    2010-01-01

    We introduce a general method to infer the directional information flow between populations whose elements are described by n-dimensional vectors of symbolic attributes. The method is based on the Jensen-Shannon divergence and on the Shannon entropy and has a wide range of applications. We show here the results of two applications: first we extract the network of genetic flow between the meadows of the seagrass Poseidonia Oceanica, where the meadow elements are specified by sets of microsatellite markers; then we extract the semantic flow network from a set of Wikipedia pages, showing the semantic channels between different areas of knowledge.
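
    A minimal sketch of the symmetric ingredient of the method, the Jensen-Shannon divergence between two attribute distributions, using SciPy (directionality in the paper comes from additional entropy terms; the distributions here are placeholders).

      import numpy as np
      from scipy.spatial.distance import jensenshannon

      p = np.array([0.5, 0.3, 0.2])   # attribute frequencies in population A
      q = np.array([0.2, 0.3, 0.5])   # attribute frequencies in population B
      jsd = jensenshannon(p, q, base=2) ** 2   # SciPy returns the square root
      print(jsd)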

  15. A theoretical extraction scheme of transport information based on exclusion models

    Institute of Scientific and Technical Information of China (English)

    Chen Hua; Du Lei; Qu Cheng-Li; Li Wei-Hua; He Liang; Chen Wen-Hao; Sun Peng

    2010-01-01

    In order to explore how to extract more transport information from current fluctuations, a theoretical extraction scheme is presented for a single-barrier structure based on exclusion models, which include the counter-flows model and the tunnel model. The first four cumulants of these two exclusion models are computed in a single-barrier structure, and their characteristics are obtained. A scheme using the first three cumulants is devised to check whether a transport process follows the counter-flows model, the tunnel model or neither of them. Time series generated by Monte Carlo techniques are adopted to validate the extraction procedure, and the result is reasonable.
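
    A minimal sketch of estimating the first four cumulants of a counting time series; scipy.stats.kstat gives unbiased k-statistic estimators of the cumulants, and the Poisson series is a placeholder for the simulated transfer counts.

      import numpy as np
      from scipy.stats import kstat

      counts = np.random.poisson(5.0, size=10_000)   # placeholder transfer counts
      c1, c2, c3, c4 = (kstat(counts, n) for n in range(1, 5))
      print(c1, c2, c3, c4)   # for a Poisson process all four are roughly equal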

  16. Sensors for biomedical applications

    NARCIS (Netherlands)

    Bergveld, Piet

    1986-01-01

    This paper considers the impact during the last decade of modern IC technology, microelectronics, thin- and thick-film technology, fibre optic technology, etc. on the development of sensors for biomedical applications.

  17. Semantic preview benefit in English: Individual differences in the extraction and use of parafoveal semantic information.

    Science.gov (United States)

    Veldre, Aaron; Andrews, Sally

    2016-06-01

    Although there is robust evidence that skilled readers of English extract and use orthographic and phonological information from the parafovea to facilitate word identification, semantic preview benefits have been elusive. We sought to establish whether individual differences in the extraction and/or use of parafoveal semantic information could account for this discrepancy. Ninety-nine adult readers who were assessed on measures of reading and spelling ability read sentences while their eye movements were recorded. The gaze-contingent boundary paradigm was used to manipulate the availability of relevant semantic and orthographic information in the parafovea. On average, readers showed a benefit from previews high in semantic feature overlap with the target. However, reading and spelling ability yielded opposite effects on semantic preview benefit. High reading ability was associated with a semantic preview benefit that was equivalent to an identical preview on first-pass reading. High spelling ability was associated with a reduced semantic preview benefit despite an overall higher rate of skipping. These results suggest that differences in the magnitude of semantic preview benefits in English reflect constraints on extracting semantic information from the parafovea and competition between the orthographic features of the preview and the target. (PsycINFO Database Record) PMID:26595070

  18. Extraction of palaeochannel information from remote sensing imagery in the east of Chaohu Lake, China

    Institute of Scientific and Technical Information of China (English)

    Xinyuan WANG; Zhenya GUO; Li WU; Cheng ZHU; Hui HE

    2012-01-01

    Palaeochannels are deposits of unconsolidated sediments or semi-consolidated sedimentary rocks deposited in ancient, currently inactive river and stream channel systems. They are distinct from the overbank deposits of currently active river channels, including ephemeral water courses which do not regularly flow. We introduce a spectral-characteristics-based palaeochannel information extraction model for SPOT-5 imagery of a specific time phase, built through an analysis of the remote sensing mechanism and the spectral characteristics of palaeochannels, combined with their distinction from the spatial distribution and spectral features of currently active river channels, and with the establishment of features for judging palaeochannels in remote sensing images. The model follows the process: supervised classification → farmland masking and principal component analysis → underground palaeochannel information extraction → information combination → palaeochannel system image. The Zhegao River Valley east of Chaohu Lake was selected as the study area, and SPOT-5 imagery was used as the data source. The result was satisfactory when this method was applied to extract the palaeochannel information, which can provide a good reference for regional remote sensing archaeology and neotectonic research. However, the applicability of this method needs to be tested further in other areas, as the spatial characteristics and spectral response of palaeochannels might differ.

  19. Biomedical signal analysis

    CERN Document Server

    Rangayyan, Rangaraj M

    2015-01-01

    The book assists the reader in developing techniques for the analysis of biomedical signals and computer-aided diagnosis, with a pedagogical examination of basic and advanced topics accompanied by over 350 figures and illustrations. A wide range of filtering techniques is presented to address various applications, along with 800 mathematical expressions and equations; practical questions, problems and laboratory exercises; and coverage of fractals and chaos theory with biomedical applications.

  20. Biomedical signal processing

    CERN Document Server

    Akay, Metin

    1994-01-01

    Sophisticated techniques for signal processing are now available to the biomedical specialist! Written in an easy-to-read, straightforward style, Biomedical Signal Processing presents techniques to eliminate background noise, enhance signal detection, and analyze computer data, making results easy to comprehend and apply. In addition to examining techniques for electrical signal analysis, filtering, and transforms, the author supplies an extensive appendix with several computer programs that demonstrate techniques presented in the text.

  1. Sharing big biomedical data

    OpenAIRE

    Toga, Arthur W.; Dinov, Ivo D.

    2015-01-01

    Background The promise of Big Biomedical Data may be offset by the enormous challenges in handling, analyzing, and sharing it. In this paper, we provide a framework for developing practical and reasonable data sharing policies that incorporate the sociological, financial, technical and scientific requirements of a sustainable Big Data dependent scientific community. Findings Many biomedical and healthcare studies may be significantly impacted by using large, heterogeneous and incongruent data...

  2. Comparison of edge detection techniques for the automatic information extraction of Lidar data

    Science.gov (United States)

    Li, H.; di, L.; Huang, X.; Li, D.

    2008-05-01

    In recent years, there has been much interest in information extraction from Lidar point cloud data. Many automatic edge detection algorithms have been applied to extracting information from Lidar data; generally they can be divided into three major categories: early-vision gradient operators, optimal detectors and operators using parametric fitting models. A Lidar point cloud includes both intensity information and geographic information, so traditional edge detectors used on remotely sensed images can take advantage of the coordinate information provided by the point data. However, derivation of complex terrain features from Lidar data points depends on the intensity properties and topographic relief of each scene. Take roads for example: in some urban areas, roads have intensity similar to that of buildings, but their topographic relationship is distinct, so the edge detector for roads in urban areas differs from the detector for buildings. Therefore, in Lidar extraction, each kind of scene has its own suitable edge detector. This paper compares the application of different edge detectors to various terrain areas in order to determine the proper algorithm for each terrain type. The Canny, EDISON and SUSAN algorithms were applied to data points with the intensity character and topographic relationship of Lidar data. The test Lidar data cover different terrain areas, such as an urban area with a mass of buildings, a rural area with vegetation, an area with slope, and an area with a bridge. Results using these edge detectors are compared to determine which algorithm is suitable for a specific terrain area.
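
    A minimal sketch of one of the compared detectors (Canny) applied to a rasterized Lidar intensity image with OpenCV; the raster is a random placeholder for a gridded point cloud, and the thresholds are illustrative.

      import numpy as np
      import cv2

      intensity = (np.random.rand(256, 256) * 255).astype(np.uint8)  # placeholder grid
      blurred = cv2.GaussianBlur(intensity, (5, 5), 1.4)  # suppress speckle first
      edges = cv2.Canny(blurred, threshold1=50, threshold2=150)
      print(int(edges.sum()) // 255, "edge pixels")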

  3. Using Local Grammar for Entity Extraction from Clinical Reports

    Directory of Open Access Journals (Sweden)

    Aicha Ghoulam

    2015-06-01

    Full Text Available Information Extraction (IE) is a natural language processing (NLP) task whose aim is to analyze texts written in natural language to extract structured and useful information such as named entities and semantic relations linking these entities. Information extraction is an important task for many applications such as biomedical literature mining, customer care, community websites, and personal information management. The increasing information available in patient clinical reports is difficult to access; as it is often in unstructured text form, doctors need tools that enable them to access and search this information. Hence, a system for extracting this information in a structured form can benefit healthcare professionals. The work presented in this paper uses a local grammar approach to extract medical named entities from French patient clinical reports. Experimental results show that the proposed approach achieved an F-measure of 90.06%.

  4. Automated recognition of malignancy mentions in biomedical literature

    Directory of Open Access Journals (Sweden)

    Liberman Mark Y

    2006-11-01

    Full Text Available Abstract Background The rapid proliferation of biomedical text makes it increasingly difficult for researchers to identify, synthesize, and utilize developed knowledge in their fields of interest. Automated information extraction procedures can assist in the acquisition and management of this knowledge. Previous efforts in biomedical text mining have focused primarily upon named entity recognition of well-defined molecular objects such as genes, but less work has been performed to identify disease-related objects and concepts. Furthermore, promise has been tempered by an inability to efficiently scale approaches in ways that minimize manual efforts and still perform with high accuracy. Here, we have applied a machine-learning approach previously successful for identifying molecular entities to a disease concept, to determine whether the underlying probabilistic model effectively generalizes to unrelated concepts with minimal manual intervention for model retraining. Results We developed a named entity recognizer (MTag), an entity tagger for recognizing clinical descriptions of malignancy presented in text. The application uses the machine-learning technique Conditional Random Fields with additional domain-specific features. MTag was tested with 1,010 training and 432 evaluation documents pertaining to cancer genomics. Overall, our experiments resulted in 0.85 precision, 0.83 recall, and 0.84 F-measure on the evaluation set. Compared with a baseline system using string matching of text with a neoplasm term list, MTag performed with a much higher recall rate (92.1% vs. 42.1% recall) and demonstrated the ability to learn new patterns. Application of MTag to all MEDLINE abstracts yielded the identification of 580,002 unique and 9,153,340 overall mentions of malignancy. Significantly, addition of an extensive lexicon of malignancy mentions as a feature set for extraction had minimal impact on performance. Conclusion Together, these results suggest that the
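
    A minimal sketch of a CRF tagger for malignancy mentions in the spirit of MTag, using the sklearn-crfsuite package (an assumption; the paper's own feature set is richer); tokens, features and BIO labels are toy placeholders.

      import sklearn_crfsuite

      def features(sent, i):
          w = sent[i]
          return {"lower": w.lower(), "is_title": w.istitle(), "suffix3": w[-3:]}

      train = [(["metastatic", "breast", "carcinoma", "was", "found"],
                ["B-MAL", "I-MAL", "I-MAL", "O", "O"])]
      X = [[features(s, i) for i in range(len(s))] for s, _ in train]
      y = [tags for _, tags in train]

      crf = sklearn_crfsuite.CRF(algorithm="lbfgs", c1=0.1, c2=0.1, max_iterations=50)
      crf.fit(X, y)
      print(crf.predict(X))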

  5. A Study on Lip Extraction due to Fuzzy Reasoning by Using Color Information

    Science.gov (United States)

    Shirasawa, Yoichi; Nishida, Makoto; Nishi, Kenji

    This paper proposes a method for extracting the lip shape from the region around the lip, carried out without restrictive conditions such as lipstick or controlled lighting. The proposed method uses color information for the lip shape extraction: the psychometric quantities of the metric hue angle (hab) and the rectangular coordinate (a*), which are defined in the CIE 1976 L*a*b* color space. The method employs fuzzy reasoning in order to handle obscurity in the image data, such as shade on the face. The condition-part membership function for the characteristics of each class was defined as a triangular membership function. To reduce the effect of data acquisition conditions, an extraction method for the lip shape from the region around the lip based on fuzzy reasoning is presented here. We studied the setup of the condition-part membership functions: the proposed method uses the a* color histogram and the hab color histogram of the region around the lip when setting them up. This paper shows that the lip can be extracted without special conditions such as lipstick or controlled lighting; the lip shape was correctly extracted for about 98.7 percent of the facial image data. The experimental results indicate the effectiveness of the proposed method.
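
    A minimal sketch of the two color cues named above, computed with scikit-image: a* from CIE 1976 L*a*b* and the metric hue angle hab = atan2(b*, a*); the input face-region image is a placeholder.

      import numpy as np
      from skimage.color import rgb2lab

      rgb = np.random.rand(64, 64, 3)        # placeholder face-region image in [0, 1]
      lab = rgb2lab(rgb)
      a_star, b_star = lab[..., 1], lab[..., 2]
      h_ab = np.degrees(np.arctan2(b_star, a_star)) % 360   # metric hue angle
      # a_star and h_ab would feed the triangular membership functions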

  6. Extracting information from two-dimensional electrophoresis gels by partial least squares regression

    DEFF Research Database (Denmark)

    Jessen, Flemming; Lametsch, R.; Bendixen, E.

    2002-01-01

    Two-dimensional gel electrophoresis (2-DE) produces large amounts of data, and extraction of relevant information from these data demands a cautious and time-consuming process of spot pattern matching between gels. The classical approach of data analysis is to detect protein markers that appear or disappear depending on the experimental conditions. Such biomarkers are found by comparing the relative volumes of individual spots in the individual gels. Multivariate statistical analysis and modelling of 2-DE data for comparison and classification is an alternative approach utilising the combination of all proteins/spots in the gels. In the present study it is demonstrated how information can be extracted by multivariate data analysis. The strategy is based on partial least squares regression followed by variable selection to find proteins that individually or in combination with other proteins vary
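
    A minimal sketch of the strategy described above, partial least squares regression on spot volumes followed by a crude variable ranking, using scikit-learn; the spot-volume matrix and response are synthetic placeholders.

      import numpy as np
      from sklearn.cross_decomposition import PLSRegression

      X = np.random.rand(20, 500)        # 20 gels x 500 matched spot volumes
      y = X[:, :3].sum(axis=1) + 0.1 * np.random.rand(20)   # placeholder response

      pls = PLSRegression(n_components=2).fit(X, y)
      loadings = np.abs(pls.x_weights_[:, 0])
      print("most informative spots:", loadings.argsort()[-5:])  # selection candidates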

  7. Road Extraction from High-resolution Remote Sensing Images Based on Multiple Information Fusion

    Directory of Open Access Journals (Sweden)

    LI Xiao-feng

    2016-02-01

    Full Text Available Road extraction from high-resolution remote sensing images is considered a significant but very difficult task. In particular, the spectrum of some buildings is similar to that of roads, which leaves their surfaces connected with each other after classification and difficult to distinguish. Based on the cooperation between road surfaces and edges, this paper presents an approach to purify roads from high-resolution remote sensing images. Firstly, we try to improve the extraction accuracy of road surfaces and edges respectively; the logical cooperation between these two binary images is then used to separate road and non-road objects. The road objects are confirmed by the cooperation between surfaces and edges, and effective shape indices (e.g., polar moment of inertia and narrow extent index) are applied to eliminate non-road objects, so the road information is refined. The experiments indicate that the proposed approach is efficient for eliminating non-road information and extracting road information from high-resolution remote sensing images.

  8. Semi-Automatic Mapping Generation for the DBpedia Information Extraction Framework

    Directory of Open Access Journals (Sweden)

    Arup Sarkar, Ujjal Marjit, Utpal Biswas

    2013-03-01

    Full Text Available DBpedia is one of the very well known live projects from the Semantic Web. It is like a mirror version of the Wikipedia site in the Semantic Web: it publishes the information collected from Wikipedia, but only the part which is relevant to the Semantic Web. Collecting information for the Semantic Web from Wikipedia is treated as the extraction of structured data. DBpedia normally does this by using a specially designed framework called the DBpedia Information Extraction Framework, which works through the evaluation of similar properties from the DBpedia ontology and the Wikipedia templates; this step is known as DBpedia mapping. At present most of the mapping jobs are done completely manually. In this paper a new framework is introduced that considers the issues related to template-to-ontology mapping. A semi-automatic mapping tool for the DBpedia project is proposed, with the capability of generating automatic suggestions for end users so that they can identify similar ontology and template properties. The proposed framework is useful since, after selection of similar properties, the necessary code to maintain the mapping between ontology and template is generated automatically.
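
    A minimal sketch of an automatic suggestion step of the kind described above, matching template properties to ontology properties by string similarity with difflib (an assumption; the record does not specify its similarity measure, and the property names are hypothetical).

      import difflib

      template_props = ["birthPlace", "dateOfBirth", "occupation"]
      ontology_props = ["birthPlace", "birthDate", "profession"]

      for t in template_props:
          # suggest the closest ontology property, if any, for the user to confirm
          print(t, "->", difflib.get_close_matches(t, ontology_props, n=1, cutoff=0.4))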

  9. MEDSYNDIKATE--a natural language system for the extraction of medical information from findings reports.

    Science.gov (United States)

    Hahn, Udo; Romacker, Martin; Schulz, Stefan

    2002-12-01

    MEDSYNDIKATE is a natural language processor which automatically acquires medical information from findings reports. In the course of text analysis, their contents are transferred to conceptual representation structures which constitute a corresponding text knowledge base. MEDSYNDIKATE is particularly adapted to deal properly with text structures, such as the various forms of anaphoric reference relations spanning several sentences. The strong demands MEDSYNDIKATE poses on the availability of expressive knowledge sources are accounted for by two alternative approaches to acquiring medical domain knowledge (semi)automatically. We also present data on the information extraction performance of MEDSYNDIKATE in terms of the semantic interpretation of three major syntactic patterns in medical documents. PMID:12460632

  10. Multiple Features Based Approach to Extract Bio-molecular Event Triggers Using Conditional Random Field

    Directory of Open Access Journals (Sweden)

    Amit Majumder

    2012-11-01

    Full Text Available The purpose of Biomedical Natural Language Processing (BioNLP) is to capture biomedical phenomena from textual data by extracting relevant entities, information and relations between biomedical entities (i.e., proteins and genes). In general, most published work extracted only binary relations; recently, the focus has shifted towards extracting more complex relations in the form of bio-molecular events that may include several entities or other relations. In this paper we propose an approach that enables trigger extraction for relatively complex bio-molecular events. We approach this problem as the detection of bio-molecular event triggers using the well-known algorithm Conditional Random Fields (CRF). Experiments on the development set show overall average recall, precision and F-measure values of 64.27504%, 69.97559% and 67.00429%, respectively, for event detection.

  11. Automated Building Extraction from High-Resolution Satellite Imagery in Urban Areas Using Structural, Contextual, and Spectral Information

    Directory of Open Access Journals (Sweden)

    Jin Xiaoying

    2005-01-01

    Full Text Available High-resolution satellite imagery provides an important new data source for building extraction. We demonstrate an integrated strategy for identifying buildings in 1-meter resolution satellite imagery of urban areas. Buildings are extracted using structural, contextual, and spectral information. First, a series of geodesic opening and closing operations are used to build a differential morphological profile (DMP) that provides image structural information. Building hypotheses are generated and verified through shape analysis applied to the DMP. Second, shadows are extracted using the DMP to provide reliable contextual information with which to hypothesize the position and size of adjacent buildings. Seed building rectangles are verified and grown on a finely segmented image. Next, bright buildings are extracted using spectral information. The extraction results from the different information sources are combined after independent extraction. A performance evaluation of the building extraction on an urban test site using IKONOS satellite imagery of the City of Columbia, Missouri, is reported; with the combination of structural, contextual, and spectral information, building areas are extracted with a high quality percentage.
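
    A minimal sketch of a morphological profile in the spirit of the DMP step, using plain openings from scikit-image rather than the paper's geodesic operators (an assumption); the panchromatic image and the structuring-element radii are placeholders.

      import numpy as np
      from skimage.morphology import disk, opening

      img = (np.random.rand(128, 128) * 255).astype(np.uint8)  # placeholder pan image
      radii = [2, 4, 8, 16]
      opened = [opening(img, disk(r)).astype(np.int16) for r in radii]
      # differences between successive scales: one structural band per scale step
      levels = [img.astype(np.int16)] + opened
      dmp = np.stack([np.abs(a - b) for a, b in zip(levels, opened)])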

  12. Effective use of Latent Semantic Indexing and Computational Linguistics in Biological and Biomedical Applications

    Directory of Open Access Journals (Sweden)

    Hongyu Chen

    2013-01-01

    Full Text Available Text mining is rapidly becoming an essential technique for the annotation and analysis of large biological data sets. Biomedical literature currently increases at a rate of several thousand papers per week, making automated information retrieval methods the only feasible method of managing this expanding corpus. With the increasing prevalence of open-access journals and constant growth of publicly-available repositories of biomedical literature, literature mining has become much more effective with respect to the extraction of biomedically-relevant data. In recent years, text mining of popular databases such as MEDLINE has evolved from basic term-searches to more sophisticated natural language processing techniques, indexing and retrieval methods, structural analysis and integration of literature with associated metadata. In this review, we will focus on Latent Semantic Indexing (LSI), a computational linguistics technique increasingly used for a variety of biological purposes. It is noted for its ability to consistently outperform benchmark Boolean text searches and co-occurrence models at information retrieval and its power to extract indirect relationships within a data set. LSI has been used successfully to formulate new hypotheses, generate novel connections from existing data, and validate empirical data.
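
    A minimal sketch of Latent Semantic Indexing with scikit-learn: a tf-idf term-document matrix reduced by truncated SVD, with latent-space cosine similarity exposing indirect relationships; the mini-corpus stands in for MEDLINE abstracts.

      from sklearn.feature_extraction.text import TfidfVectorizer
      from sklearn.decomposition import TruncatedSVD
      from sklearn.metrics.pairwise import cosine_similarity

      corpus = ["insulin regulates glucose metabolism",
                "glucose uptake is mediated by insulin signaling",
                "the moon orbits the earth"]
      X = TfidfVectorizer().fit_transform(corpus)          # term-document matrix
      Z = TruncatedSVD(n_components=2, random_state=0).fit_transform(X)
      print(cosine_similarity(Z))   # documents 0 and 1 score high, document 2 low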

  13. Effective use of latent semantic indexing and computational linguistics in biological and biomedical applications.

    Science.gov (United States)

    Chen, Hongyu; Martin, Bronwen; Daimon, Caitlin M; Maudsley, Stuart

    2013-01-01

    Text mining is rapidly becoming an essential technique for the annotation and analysis of large biological data sets. Biomedical literature currently increases at a rate of several thousand papers per week, making automated information retrieval methods the only feasible method of managing this expanding corpus. With the increasing prevalence of open-access journals and constant growth of publicly-available repositories of biomedical literature, literature mining has become much more effective with respect to the extraction of biomedically-relevant data. In recent years, text mining of popular databases such as MEDLINE has evolved from basic term-searches to more sophisticated natural language processing techniques, indexing and retrieval methods, structural analysis and integration of literature with associated metadata. In this review, we will focus on Latent Semantic Indexing (LSI), a computational linguistics technique increasingly used for a variety of biological purposes. It is noted for its ability to consistently outperform benchmark Boolean text searches and co-occurrence models at information retrieval and its power to extract indirect relationships within a data set. LSI has been used successfully to formulate new hypotheses, generate novel connections from existing data, and validate empirical data.

  14. Integrating systems biology models and biomedical ontologies

    Directory of Open Access Journals (Sweden)

    de Bono Bernard

    2011-08-01

    Full Text Available Abstract Background Systems biology is an approach to biology that emphasizes the structure and dynamic behavior of biological systems and the interactions that occur within them. To succeed, systems biology crucially depends on the accessibility and integration of data across domains and levels of granularity. Biomedical ontologies were developed to facilitate such an integration of data and are often used to annotate biosimulation models in systems biology. Results We provide a framework to integrate representations of in silico systems biology with those of in vivo biology as described by biomedical ontologies and demonstrate this framework using the Systems Biology Markup Language. We developed the SBML Harvester software that automatically converts annotated SBML models into OWL and we apply our software to those biosimulation models that are contained in the BioModels Database. We utilize the resulting knowledge base for complex biological queries that can bridge levels of granularity, verify models based on the biological phenomenon they represent and provide a means to establish a basic qualitative layer on which to express the semantics of biosimulation models. Conclusions We establish an information flow between biomedical ontologies and biosimulation models and we demonstrate that the integration of annotated biosimulation models and biomedical ontologies enables the verification of models as well as expressive queries. Establishing a bi-directional information flow between systems biology and biomedical ontologies has the potential to enable large-scale analyses of biological systems that span levels of granularity from molecules to organisms.

  15. The Feature Extraction Based on Texture Image Information for Emotion Sensing in Speech

    Directory of Open Access Journals (Sweden)

    Kun-Ching Wang

    2014-09-01

    Full Text Available In this paper, we present a novel texture image feature for Emotion Sensing in Speech (ESS). The idea is based on the fact that texture images carry emotion-related information. The feature extraction is derived from the time-frequency representation of spectrogram images. First, we transform the spectrogram into a recognizable image. Next, we use a cubic curve to enhance the image contrast. Then, the texture image information (TII) derived from the spectrogram image is extracted using Laws' masks to characterize the emotional state. In order to evaluate the effectiveness of the proposed emotion recognition in different languages, we use two open emotional databases, the Berlin Emotional Speech Database (EMO-DB) and the eNTERFACE corpus, and one self-recorded database (KHUSC-EmoDB), to evaluate the performance cross-corpora. The results of the proposed ESS system are presented using a support vector machine (SVM) as the classifier. Experimental results show that the proposed TII-based feature extraction, inspired by visual perception, can provide significant classification for ESS systems. The two-dimensional (2-D) TII features can discriminate between different emotions in visual expressions beyond what pitch and formant tracks convey. In addition, de-noising in 2-D images can be completed more easily than de-noising in 1-D speech.
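
    A minimal sketch of Laws' texture-energy filtering applied to a spectrogram image; L5, E5 and S5 are the standard Laws vectors, the masks are their outer products, and the contrast-enhanced spectrogram is a placeholder array.

      import numpy as np
      from scipy.signal import convolve2d

      L5 = np.array([1, 4, 6, 4, 1])     # level
      E5 = np.array([-1, -2, 0, 2, 1])   # edge
      S5 = np.array([-1, 0, 2, 0, -1])   # spot

      spec = np.random.rand(128, 128)    # placeholder contrast-enhanced spectrogram
      feats = []
      for a in (L5, E5, S5):
          for b in (L5, E5, S5):
              mask = np.outer(a, b)                  # 5x5 Laws mask
              resp = convolve2d(spec, mask, mode="same")
              feats.append(np.abs(resp).mean())      # texture energy feature
      print(feats)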

  16. An Accurate Integral Method for Vibration Signal Based on Feature Information Extraction

    Directory of Open Access Journals (Sweden)

    Yong Zhu

    2015-01-01

    Full Text Available After summarizing the advantages and disadvantages of current integral methods, a novel vibration-signal integration method based on feature information extraction is proposed. This method takes full advantage of the self-adaptive filter characteristic and waveform-correction feature of ensemble empirical mode decomposition in dealing with nonlinear and nonstationary signals, and merges the strengths of kurtosis, mean square error, energy, and singular value decomposition for signal feature extraction. The values of these four indexes are combined into a feature vector. The characteristic components contained in the vibration signal are then accurately extracted by Euclidean distance search, and the desired integral signals are precisely reconstructed. With this method, the interference from invalid signal components such as trend items and noise, which plagues traditional methods, is satisfactorily removed. The large cumulative error of the traditional time-domain integral is effectively overcome, and the large low-frequency error of the traditional frequency-domain integral is successfully avoided. Compared with the traditional integral methods, this method is outstanding at removing noise while retaining useful feature information, and shows higher accuracy and superiority.

  17. Advances in biomedical engineering

    CERN Document Server

    Brown, J H U

    1976-01-01

    Advances in Biomedical Engineering, Volume 6, is a collection of papers that discusses the role of integrated electronics in medical systems and the usage of biological mathematical models in biological systems. Other papers deal with the health care systems, the problems and methods of approach toward rehabilitation, as well as the future of biomedical engineering. One paper discusses the use of system identification as it applies to biological systems to estimate the values of a number of parameters (for example, resistance, diffusion coefficients) by indirect means. More particularly, the i

  18. Advances in biomedical engineering

    CERN Document Server

    Brown, J H U

    1976-01-01

    Advances in Biomedical Engineering, Volume 5, is a collection of papers that deals with application of the principles and practices of engineering to basic and applied biomedical research, development, and the delivery of health care. The papers also describe breakthroughs in health improvements, as well as basic research that have been accomplished through clinical applications. One paper examines engineering principles and practices that can be applied in developing therapeutic systems by a controlled delivery system in drug dosage. Another paper examines the physiological and materials vari

  19. Biomedical enhancements as justice.

    Science.gov (United States)

    Nam, Jeesoo

    2015-02-01

    Biomedical enhancements, the applications of medical technology to make better those who are neither ill nor deficient, have made great strides in the past few decades. Using Amartya Sen's capability approach as my framework, I argue in this article that far from being simply permissible, we have a prima facie moral obligation to use these new developments for the end goal of promoting social justice. In terms of both range and magnitude, the use of biomedical enhancements will mark a radical advance in how we compensate the most disadvantaged members of society.

  20. Note: Sound recovery from video using SVD-based information extraction

    Science.gov (United States)

    Zhang, Dashan; Guo, Jie; Lei, Xiujun; Zhu, Chang'an

    2016-08-01

    This note reports an efficient singular value decomposition (SVD)-based vibration extraction approach that recovers sound information from silent high-speed video. A high-speed camera with frame rates in the range of 2 kHz-10 kHz is used to film the vibrating objects. Sub-images cut from the video frames are transformed into column vectors and then assembled into a new matrix. The SVD of this matrix produces orthonormal image bases (OIBs), and image projections onto a specific OIB can be recovered as understandable acoustic signals. The standard frequencies of 256 Hz and 512 Hz tuning forks are extracted offline from their vibrating surfaces, and a 3.35 s speech signal is recovered online from a piece of paper stimulated by sound waves within 1 min.
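
    A minimal sketch of the SVD step described above: sub-images become column vectors of a matrix whose singular decomposition yields orthonormal image bases (OIBs), and the projection onto one basis is read out as a 1-D signal sampled at the frame rate. The frames are synthetic placeholders.

      import numpy as np

      frames = np.random.rand(2000, 32, 32)    # placeholder high-speed video ROI
      M = frames.reshape(len(frames), -1).T    # pixels x frames matrix
      U, s, Vt = np.linalg.svd(M - M.mean(axis=1, keepdims=True), full_matrices=False)
      signal = Vt[0]                           # projection onto the first OIB
      # 'signal', one sample per frame, is the candidate sound trace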

  1. National information service in mining, mineral processing and extractive metallurgy. [MINTEC]

    Energy Technology Data Exchange (ETDEWEB)

    Romaniuk, A.S.; MacDonald, R.J.C.

    1979-03-01

    More than a decade ago, CANMET management recognized the need to make better use of existing technological information in mining and extractive metallurgy, two fields basic to the economic well-being of Canada. There were at that time no indexes or files dedicated to disseminating technical information for the many minerals mined and processed in Canada, including coal. CANMET, with the nation's largest research and library resources in the minerals field, was in a unique position to fill this need. Initial efforts were concentrated on building a mining file, beginning with identification of world sources of published information, development of a special thesaurus of terms for language control, and adoption of a manual indexing/retrieval system. By early 1973, this file held 8,300 references, with source, abstract and keywords given for each reference. In mid-1973, operations were computerized. Software for indexing and retrieval in batch mode was written by CANMET staff to utilize the hardware facilities of EMR's Computer Science Center. The resulting MINTEC file, one of the few files of technological information produced in Canada, is the basis for the national literature search service in mining offered by CANMET. Attention is now focused on building a sister file in extractive metallurgy using the system already developed. Published information sources have been identified, and a thesaurus of terms is being compiled and tested. The software developed for CANMET's file-building operations has several features, including selective dissemination of information and production from magnetic tape of photo-ready copy for publication, as in a bi-monthly abstracts journal.

  2. Automated DICOM metadata and volumetric anatomical information extraction for radiation dosimetry

    Science.gov (United States)

    Papamichail, D.; Ploussi, A.; Kordolaimi, S.; Karavasilis, E.; Papadimitroulas, P.; Syrgiamiotis, V.; Efstathopoulos, E.

    2015-09-01

    Patient-specific dosimetry calculations based on simulation techniques have as a prerequisite the modeling of the modality system and the creation of voxelized phantoms. This procedure requires knowledge of the scanning parameters and patient information included in a DICOM file, as well as image segmentation. However, the extraction of this information is complicated and time-consuming. The objective of this study was to develop a simple graphical user interface (GUI) to (i) automatically extract metadata from every slice image of a DICOM file in a single query and (ii) interactively specify the regions of interest (ROIs) without explicit access to the radiology information system. The user-friendly application was developed in the Matlab environment. The user can select a series of DICOM files and manage their text and graphical data. The metadata are automatically formatted and presented to the user as a Microsoft Excel file. The volumetric maps are formed by interactively specifying the ROIs and by assigning a specific value to every ROI. The result is stored in DICOM format for data and trend analysis. The developed GUI is easy and fast to use, and constitutes a very useful tool for individualized dosimetry. One future goal is to incorporate remote access to a PACS server.
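
    A comparable single-pass metadata dump can be sketched in Python with the pydicom library; the original tool is a Matlab GUI with Excel output, so this CSV-writing analogue, with an assumed tag list, only mirrors its first function (the single-query metadata extraction).

```python
import csv
import pydicom  # pip install pydicom

# Assumed tag list; extend with whatever the dose model needs.
TAGS = ["PatientID", "Modality", "KVP", "SliceThickness",
        "PixelSpacing", "ExposureTime", "XRayTubeCurrent"]

def dump_metadata(dicom_paths, out_csv):
    """Read every slice of a series in one pass and tabulate the
    scanning parameters, skipping the pixel data for speed."""
    with open(out_csv, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["file"] + TAGS)
        for path in dicom_paths:
            ds = pydicom.dcmread(path, stop_before_pixels=True)
            writer.writerow([path] + [getattr(ds, t, "") for t in TAGS])
```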

  3. Status of Research in Biomedical Engineering 1968.

    Science.gov (United States)

    National Inst. of General Medical Sciences (NIH), Bethesda, MD.

    This status report is divided into eight sections. The first four represent the classical engineering or building aspects of bioengineering and deal with biomedical instrumentation, prosthetics, man-machine systems and computer and information systems. The next three sections are related to the scientific, intellectual and academic influence of…

  4. Capturing the Value of Biomedical Research.

    Science.gov (United States)

    Bertuzzi, Stefano; Jamaleddine, Zeina

    2016-03-24

    Assessing the real-world impact of biomedical research is notoriously difficult. Here, we present the framework for building a prospective science-centered information system from scratch that has been afforded by the Sidra Medical and Research Center in Qatar. This experiment is part of the global conversation on maximizing returns on research investment.

  5. Application of a biomedical research ethics review information system

    Institute of Scientific and Technical Information of China (English)

    彭智才; 尚政琴

    2016-01-01

    Objective: To discuss the application of a biomedical research ethics review information system in the hospital ethical review of biomedical research. Methods: The system was designed and developed according to the International Conference on Harmonisation (ICH) guidelines, Good Clinical Practice (GCP), the WHO/TDR Operational Guidelines for Ethics Committees That Review Biomedical Research, and the WHO SIDCER (Strategic Initiative for Developing Capacity in Ethical Review) recognition requirements for ethics committee operating procedures. Results: The application of the new system enables the ethics committee to conduct ethical review pragmatically and efficiently and helps investigators follow ethical principles in their research; it turns the operating procedures of application, acceptance, review, communication of decisions and follow-up review into electronic workflows. Conclusion: The system makes the ethical review process more reasonable and its operation simpler, greatly improves the efficiency of hospital ethical review, facilitates and standardizes document management for ethical review, and enhances the quality management of ethical review; it is of great significance for the ethical review of biomedical research.

  6. Chitosan: A Promising Marine Polysaccharide for Biomedical Research.

    Science.gov (United States)

    Periayah, Mercy Halleluyah; Halim, Ahmad Sukari; Saad, Arman Zaharil Mat

    2016-01-01

    Biomaterials created 50 years ago are still receiving considerable attention for their potential to support development in the biomedical field. Diverse naturally obtained polysaccharides supply a broad range of resources applicable in the biomedical field. Lately, chitosan, a marine polysaccharide derived from chitins, which are extracted from the shells of arthropods such as crab, shrimp, and lobster, is becoming the most sought-after biopolymer for use in therapeutic interventions. This is a short general review of chitosan, highlighting the history, properties, chemical structure, processing method, and factors influencing the usage of chitosan derivatives in the biomedical field.

  7. Chitosan: A promising marine polysaccharide for biomedical research

    Directory of Open Access Journals (Sweden)

    Mercy Halleluyah Periayah

    2016-01-01

    Full Text Available Biomaterials created 50 years ago are still receiving considerable attention for their potential to support development in the biomedical field. Diverse naturally obtained polysaccharides supply a broad range of resources applicable in the biomedical field. Lately, chitosan, a marine polysaccharide derived from chitins, which are extracted from the shells of arthropods such as crab, shrimp, and lobster, is becoming the most sought-after biopolymer for use in therapeutic interventions. This is a short general review of chitosan, highlighting the history, properties, chemical structure, processing method, and factors influencing the usage of chitosan derivatives in the biomedical field.

  8. Biomedical Engineering in Modern Society

    Science.gov (United States)

    Attinger, E. O.

    1971-01-01

    Considers definition of biomedical engineering (BME) and how biomedical engineers should be trained. State of the art descriptions of BME and BME education are followed by a brief look at the future of BME. (TS)

  9. Biomedical applications of photochemistry

    OpenAIRE

    Chan, BP

    2010-01-01

    Photochemistry is the study of photochemical reactions between light and molecules. Recently, there have been increasing interests in using photochemical reactions in the fields of biomaterials and tissue engineering. This work revisits the components and mechanisms of photochemistry and reviews biomedical applications of photochemistry in various disciplines, including oncology, molecular biology, and biosurgery, with particular emphasis on tissue engineering. Finally, potential toxicities a...

  10. Anatomy for Biomedical Engineers

    Science.gov (United States)

    Carmichael, Stephen W.; Robb, Richard A.

    2008-01-01

    There is a perceived need for anatomy instruction for graduate students enrolled in a biomedical engineering program. This appeared especially important for students interested in and using medical images. These students typically did not have a strong background in biology. The authors arranged for students to dissect regions of the body that…

  11. Implantable CMOS Biomedical Devices

    Directory of Open Access Journals (Sweden)

    Toshihiko Noda

    2009-11-01

    Full Text Available The results of recent research on our implantable CMOS biomedical devices are reviewed. Topics include retinal prosthesis devices and deep-brain implantation devices for small animals. Fundamental device structures and characteristics as well as in vivo experiments are presented.

  12. Bevalac biomedical facility

    International Nuclear Information System (INIS)

    This paper describes the physical layout of the Bevalac Facility and the research programs carried out at the facility. Beam time on the Bevalac is divided between two disciplines: one-third for biomedical research and two-thirds for nuclear science studies. The remainder of the paper discusses the beam delivery system including dosimetry, beam sharing and beam scanning

  13. Biomedical applications in EELA.

    Science.gov (United States)

    Cardenas, Miguel; Hernández, Vicente; Mayo, Rafael; Blanquer, Ignacio; Perez-Griffo, Javier; Isea, Raul; Nuñez, Luis; Mora, Henry Ricardo; Fernández, Manuel

    2006-01-01

    The current demand for Grid infrastructures to link collaborating groups in Latin America and Europe has created the EELA project. This e-infrastructure is used by biomedical groups in Latin America and Europe for studies of oncological analysis, neglected diseases, sequence alignment and computational phylogenetics. PMID:16823158

  14. Analysis on health information extracted from an urban professional population in Beijing

    Institute of Scientific and Technical Information of China (English)

    ZHANG Tie-mei; ZHANG Yan; LIU Bin; JIA Hong-bo; LIU Yun-jie; ZHU Ling; LUO Sen-lin; HAN Yi-wen; ZHANG Yan; YANG Shu-wen; LIU An-nan; MA Lan-jun; ZHAO Yan-yan

    2011-01-01

    Background The assembled data from a population can provide information on health trends within that population. The aim of this research was to extract and summarize basic health information from an urban professional population in Beijing. Methods Data analysis was carried out on a population who underwent a routine medical check-up and were aged >20 years, comprising 30 058 individuals. General information, data from physical examinations and blood samples were collected by the same method. Health status was separated into three groups by the criteria generated in this study, i.e., people with common chronic diseases, people in a sub-clinical condition, and healthy people. The proportion of common diseases suffered and the distribution of health risks across age groups were also analyzed. Results The proportions of people with common chronic diseases, in the sub-clinical group and in the healthy group were 28.6%, 67.8% and 3.6% respectively. There were significant differences in health status among age groups. Hypertension was at the top of the list of self-reported diseases. The proportion of chronic diseases increased significantly in people over 35 years of age, while the proportion of sub-clinical conditions decreased at the same rate. The complex risk factors to health in this population were metabolic disturbances (61.3%), risk of tumor (2.7%), abnormal results of morphological examination (8.2%) and abnormal results of laboratory tests of serum (27.8%). Conclusions Health information can be extracted from a complex data set from the health check-ups of a general population. This information should be applied to support the prevention and control of chronic diseases as well as to direct interventions for patients with risk factors for disease.

  15. A Methodology to Extract Information from the Brain Waves in a Multi-Core Environment

    OpenAIRE

    Krishnakumar Venkatasubramanian; M. Rajasekhara Babu

    2013-01-01

    Brain Computer Interface (BCI) is one of the most dynamic fields in the world of Artificial Intelligence. It involves a direct neural interface between the subject and the machine, and the information is extracted from the brain in the form of waves. This raw brain-wave needs to be classified, de-noised, translated and suitably stored. These steps involve the use of a number of techniques: for example, the de-noising involves the use of the Discrete Wavelet Transform (DWT) [1] or Recurrent Quantum Neural ...

  16. Multi-Paradigm and Multi-Lingual Information Extraction as Support for Medical Web Labelling Authorities

    Directory of Open Access Journals (Sweden)

    Martin Labsky

    2010-10-01

    Full Text Available Until recently, quality labelling of medical web content has been a predominantly manual activity. However, advances in automated text processing have opened the way to computerized support of this activity. The core enabling technology is information extraction (IE). However, the heterogeneity of websites offering medical content imposes particular requirements on the IE techniques to be applied. In the paper we discuss these requirements and describe a multi-paradigm approach to IE addressing them. Experiments on multi-lingual data are reported. The research has been carried out within the EU MedIEQ project.

  17. Red Tide Information Extraction Based on Multi-source Remote Sensing Data in Haizhou Bay

    Institute of Scientific and Technical Information of China (English)

    2011-01-01

    [Objective] The aim was to extract red tide information in Haizhou Bay on the basis of multi-source remote sensing data. [Method] Red tide in Haizhou Bay was studied based on multi-source remote sensing data, such as IRS-P6 data of October 8, 2005, Landsat 5-TM data of May 20, 2006, MODIS 1B data of October 6, 2006 and HY-1B second-grade data of April 22, 2009, which were first preprocessed through geometric correction, atmospheric correction, image resizing and so on. At the same time, the synchronous environment mon...

  18. An Unsupervised Graph Based Continuous Word Representation Method for Biomedical Text Mining.

    Science.gov (United States)

    Jiang, Zhenchao; Li, Lishuang; Huang, Degen

    2016-01-01

    In biomedical text mining tasks, distributed word representation has succeeded in capturing semantic regularities, but most existing models are shallow-window based, which is not sufficient for expressing the meaning of words. To represent words using deeper information, we make explicit the semantic regularities that emerge in word relations, including dependency relations and context relations, and propose a novel architecture for computing continuous vector representations by leveraging those relations. The performance of our model is measured on a word analogy task and a Protein-Protein Interaction Extraction (PPIE) task. Experimental results show that our method performs better overall than other word representation models on the word analogy task and has many advantages for biomedical text mining.

  19. NASA Biomedical Informatics Capabilities and Needs

    Science.gov (United States)

    Johnson-Throop, Kathy A.

    2009-01-01

    To improve on-orbit clinical capabilities by developing and providing operational support for intelligent, robust, reliable, and secure, enterprise-wide and comprehensive health care and biomedical informatics systems with increasing levels of autonomy, for use on Earth, low Earth orbit & exploration class missions. Biomedical Informatics is an emerging discipline that has been defined as the study, invention, and implementation of structures and algorithms to improve communication, understanding and management of medical information. The end objective of biomedical informatics is the coalescing of data, knowledge, and the tools necessary to apply that data and knowledge in the decision-making process, at the time and place that a decision needs to be made.

  20. Developing an Approach to Prioritize River Restoration using Data Extracted from Flood Risk Information System Databases.

    Science.gov (United States)

    Vimal, S.; Tarboton, D. G.; Band, L. E.; Duncan, J. M.; Lovette, J. P.; Corzo, G.; Miles, B.

    2015-12-01

    Prioritizing river restoration requires information on river geometry. In many US states, detailed river geometry has been collected for floodplain mapping and is available in Flood Risk Information Systems (FRIS). In particular, North Carolina has, for its 100 counties, developed a database of numerous HEC-RAS models which are available through its Flood Risk Information System (FRIS). These models, which include over 260 variables, were developed and updated by numerous contractors. They contain detailed surveyed or LiDAR-derived cross-sections and modeled flood extents for different extreme-event return periods. In this work, data from over 4700 HEC-RAS models were integrated and upscaled to utilize detailed cross-section information and 100-year modeled flood extent information to enable river restoration prioritization for the entire state of North Carolina. We developed procedures to extract geomorphic properties such as entrenchment ratio and incision ratio from these models. The entrenchment ratio quantifies the vertical containment of rivers, and thereby their vulnerability to flooding; the incision ratio quantifies depth per unit width. A map of entrenchment ratio for the whole state was derived by linking these model results to a geodatabase. A ranking of highly entrenched counties, enabling prioritization for flood allowance and mitigation, was obtained. The results were shared through HydroShare, and web maps were developed for their visualization using the Google Maps Engine API.
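
    As a worked illustration of the two indices named above; note that the flood-prone-width convention here is the standard Rosgen definition, which the abstract itself does not spell out, so treat it as an assumption.

```python
def entrenchment_ratio(flood_prone_width, bankfull_width):
    """Flood-prone width (conventionally measured at twice the maximum
    bankfull depth) divided by bankfull width; smaller values mean a
    more entrenched, vertically contained channel."""
    return flood_prone_width / bankfull_width

def incision_ratio(depth, width):
    """Depth per unit width, as described in the abstract."""
    return depth / width

# A cross-section with a 120 m flood-prone width and a 40 m bankfull
# width gives an entrenchment ratio of 3.0 (only slightly entrenched).
print(entrenchment_ratio(120.0, 40.0))
```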

  1. Audio-Visual Speech Recognition Using Lip Information Extracted from Side-Face Images

    Directory of Open Access Journals (Sweden)

    Koji Iwano

    2007-03-01

    Full Text Available This paper proposes an audio-visual speech recognition method using lip information extracted from side-face images as an attempt to increase noise robustness in mobile environments. Our proposed method assumes that lip images can be captured using a small camera installed in a handset. Two different kinds of lip features, lip-contour geometric features and lip-motion velocity features, are used individually or jointly, in combination with audio features. Phoneme HMMs modeling the audio and visual features are built based on the multistream HMM technique. Experiments conducted using Japanese connected digit speech contaminated with white noise in various SNR conditions show effectiveness of the proposed method. Recognition accuracy is improved by using the visual information in all SNR conditions. These visual features were confirmed to be effective even when the audio HMM was adapted to noise by the MLLR method.

  2. Feature extraction and learning using context cue and Rényi entropy based mutual information

    DEFF Research Database (Denmark)

    Pan, Hong; Olsen, Søren Ingvor; Zhu, Yaping

    2015-01-01

    Feature extraction and learning play a critical role for visual perception tasks. We focus on improving the robustness of the kernel descriptors (KDES) by embedding context cues and further learning a compact and discriminative feature codebook for feature reduction using Rényi entropy based mutual information … improving the robustness of CKD. For feature learning and reduction, we propose a novel codebook learning method, based on a Rényi quadratic entropy based mutual information measure called Cauchy-Schwarz Quadratic Mutual Information (CSQMI), to learn a compact and discriminative CKD codebook. Projecting the … original full-dimensional CKD onto the codebook, we reduce the dimensionality of CKD while preserving its discriminability. Moreover, the latent connection between Rényi quadratic entropy and the mapping data in kernel feature space further facilitates us to capture the geometric structure as well as the …

  3. Detailed design specification for the ALT Shuttle Information Extraction Subsystem (SIES)

    Science.gov (United States)

    Clouette, G. L.; Fitzpatrick, W. N.

    1976-01-01

    The approach and landing test (ALT) shuttle information extraction system (SIES) is described in terms of general requirements and system characteristics output products and processing options, output products and data sources, and system data flow. The ALT SIES is a data reduction system designed to satisfy certain data processing requirements for the ALT phase of the space shuttle program. The specific ALT SIES data processing requirements are stated in the data reduction complex approach and landing test data processing requirements. In general, ALT SIES must produce time correlated data products as a result of standardized data reduction or special purpose analytical processes. The main characteristics of ALT SIES are: (1) the system operates in a batch (non-interactive) mode; (2) the processing is table driven; (3) it is data base oriented; (4) it has simple operating procedures; and (5) it requires a minimum of run time information.

  4. Foreground and Background Lexicons and Word Sense Disambiguation for Information Extraction

    CERN Document Server

    Kilgarriff, A

    1999-01-01

    Lexicon acquisition from machine-readable dictionaries and corpora is currently a dynamic field of research, yet it is often not clear how lexical information so acquired can be used, or how it relates to structured meaning representations. In this paper I look at this issue in relation to Information Extraction (hereafter IE), and one subtask for which both lexical and general knowledge are required, Word Sense Disambiguation (WSD). The analysis is based on the widely-used, but little-discussed distinction between an IE system's foreground lexicon, containing the domain's key terms which map onto the database fields of the output formalism, and the background lexicon, containing the remainder of the vocabulary. For the foreground lexicon, human lexicography is required. For the background lexicon, automatic acquisition is appropriate. For the foreground lexicon, WSD will occur as a by-product of finding a coherent semantic interpretation of the input. WSD techniques as discussed in recent literature are suit...

  5. Audio-Visual Speech Recognition Using Lip Information Extracted from Side-Face Images

    Directory of Open Access Journals (Sweden)

    Iwano Koji

    2007-01-01

    Full Text Available This paper proposes an audio-visual speech recognition method using lip information extracted from side-face images as an attempt to increase noise robustness in mobile environments. Our proposed method assumes that lip images can be captured using a small camera installed in a handset. Two different kinds of lip features, lip-contour geometric features and lip-motion velocity features, are used individually or jointly, in combination with audio features. Phoneme HMMs modeling the audio and visual features are built based on the multistream HMM technique. Experiments conducted using Japanese connected digit speech contaminated with white noise in various SNR conditions show effectiveness of the proposed method. Recognition accuracy is improved by using the visual information in all SNR conditions. These visual features were confirmed to be effective even when the audio HMM was adapted to noise by the MLLR method.

  6. Solution of Multiple——Point Statistics to Extracting Information from Remotely Sensed Imagery

    Institute of Scientific and Technical Information of China (English)

    Ge Yong; Bai Hexiang; Cheng Qiuming

    2008-01-01

    The two phenomena of similar objects having different spectra and of different objects having similar spectra often make it difficult to separate and identify all types of geographical objects using spectral information alone. There is therefore a need to incorporate the spatial structural and spatial association properties of object surfaces into image processing to improve the accuracy of classification of remotely sensed imagery. In the current article, a new method is proposed on the basis of the principle of multiple-point statistics for combining spectral information and spatial information for image classification. The method was validated by applying it to a case study on road extraction based on Landsat TM imagery taken over the Chinese Yellow River delta on August 8, 1999. The classification results show that this new method provides overall better results than traditional methods such as the maximum likelihood classifier (MLC).

  7. Extracting Urban Ground Object Information from Images and LiDAR Data

    Science.gov (United States)

    Yi, Lina; Zhao, Xuesheng; Li, Luan; Zhang, Guifeng

    2016-06-01

    To deal with the problem of urban ground object information extraction, the paper proposes an object-oriented classification method using aerial imagery and LiDAR data. First, we select the optimal segmentation scales for different ground objects and synthesize them to get accurate object boundaries. Then, the ReliefF algorithm is used to select the optimal feature combination and eliminate the Hughes phenomenon. Finally, a multiple-classifier combination method is applied to obtain the classification result. To validate the feasibility of this method, two experimental regions in Stuttgart, Germany were selected (Regions A and B, covering 0.21 km2 and 1.1 km2 respectively). The aim of the first experiment, on Region A, was to determine the optimal segmentation scales and classification features; the overall classification accuracy reaches 93.3%. The purpose of the experiment on Region B was to validate the applicability of the method to a large area, for which the overall accuracy reaches 88.4%. The paper concludes that the proposed method performs accurately and efficiently for urban ground object information extraction and is of high application value.

  8. [An object-based information extraction technology for dominant tree species group types].

    Science.gov (United States)

    Tian, Tian; Fan, Wen-yi; Lu, Wei; Xiao, Xiang

    2015-06-01

    Information extraction for dominant tree species group types is difficult in remote sensing image classification; however, object-oriented classification using high-spatial-resolution remote sensing data is a new way to realize accurate type information extraction. In this paper, taking the Jiangle Forest Farm in Fujian Province as the research area and based on Quickbird image data from 2013, the object-oriented method was adopted to identify farmland, shrub-herbaceous plant, young afforested land, Pinus massoniana, Cunninghamia lanceolata and broad-leaved tree types. Three types of classification factors, including spectral, texture, and different vegetation indices, were used to establish a class hierarchy. At the different levels, membership functions and decision-tree classification rules were adopted. The results showed that the object-oriented method using texture, spectrum and the vegetation indices achieved a classification accuracy of 91.3%, an increase of 5.7% compared with using only texture and spectrum.

  9. Overview of image processing tools to extract physical information from JET videos

    International Nuclear Information System (INIS)

    In magnetic confinement nuclear fusion devices such as JET, the last few years have witnessed a significant increase in the use of digital imagery, not only for the surveying and control of experiments, but also for the physical interpretation of results. More than 25 cameras are routinely used for imaging on JET in the infrared (IR) and visible spectral regions. These cameras can produce up to tens of Gbytes per shot and their information content can be very different, depending on the experimental conditions. However, the relevant information about the underlying physical processes is generally of much reduced dimensionality compared to the recorded data. The extraction of this information, which allows full exploitation of these diagnostics, is a challenging task. The image analysis consists, in most cases, of inverse problems which are typically ill-posed mathematically. The typology of objects to be analysed is very wide, and usually the images are affected by noise, low levels of contrast, low grey-level in-depth resolution, reshaping of moving objects, etc. Moreover, the plasma events have time constants of ms or tens of ms, which imposes tough conditions for real-time applications. On JET, in the last few years new tools and methods have been developed for physical information retrieval. The methodology of optical flow has allowed, under certain assumptions, the derivation of information about the dynamics of video objects associated with different physical phenomena, such as instabilities, pellets and filaments. The approach has been extended in order to approximate the optical flow within the MPEG compressed domain, allowing the manipulation of the large JET video databases and, in specific cases, even real-time data processing. The fast visible camera may provide new information that is potentially useful for disruption prediction. A set of methods, based on the extraction of structural information from the visual scene, have been developed for the

  10. Overview of image processing tools to extract physical information from JET videos

    Science.gov (United States)

    Craciunescu, T.; Murari, A.; Gelfusa, M.; Tiseanu, I.; Zoita, V.; EFDA Contributors, JET

    2014-11-01

    In magnetic confinement nuclear fusion devices such as JET, the last few years have witnessed a significant increase in the use of digital imagery, not only for the surveying and control of experiments, but also for the physical interpretation of results. More than 25 cameras are routinely used for imaging on JET in the infrared (IR) and visible spectral regions. These cameras can produce up to tens of Gbytes per shot and their information content can be very different, depending on the experimental conditions. However, the relevant information about the underlying physical processes is generally of much reduced dimensionality compared to the recorded data. The extraction of this information, which allows full exploitation of these diagnostics, is a challenging task. The image analysis consists, in most cases, of inverse problems which are typically ill-posed mathematically. The typology of objects to be analysed is very wide, and usually the images are affected by noise, low levels of contrast, low grey-level in-depth resolution, reshaping of moving objects, etc. Moreover, the plasma events have time constants of ms or tens of ms, which imposes tough conditions for real-time applications. On JET, in the last few years new tools and methods have been developed for physical information retrieval. The methodology of optical flow has allowed, under certain assumptions, the derivation of information about the dynamics of video objects associated with different physical phenomena, such as instabilities, pellets and filaments. The approach has been extended in order to approximate the optical flow within the MPEG compressed domain, allowing the manipulation of the large JET video databases and, in specific cases, even real-time data processing. The fast visible camera may provide new information that is potentially useful for disruption prediction. A set of methods, based on the extraction of structural information from the visual scene, have been developed for the

  11. Biomedical signals, imaging, and informatics

    CERN Document Server

    Bronzino, Joseph D

    2014-01-01

    Known as the bible of biomedical engineering, The Biomedical Engineering Handbook, Fourth Edition, sets the standard against which all other references of this nature are measured. As such, it has served as a major resource for both skilled professionals and novices to biomedical engineering.Biomedical Signals, Imaging, and Informatics, the third volume of the handbook, presents material from respected scientists with diverse backgrounds in biosignal processing, medical imaging, infrared imaging, and medical informatics.More than three dozen specific topics are examined, including biomedical s

  12. THE STUDY ON HEIGHT INFORMATION EXTRACTION OF CULTURAL FEATURES IN REMOTE SENSING IMAGES BASED ON SHADOW AREAS

    Directory of Open Access Journals (Sweden)

    Z. Bao-Ming

    2012-09-01

    Full Text Available Cultural features are important elements of the geospatial information library, and height is an important attribute of cultural features. The existence and precision of height information directly influence the quality of topographic maps, especially large- and medium-scale topographic maps, and the level of surveying and mapping support. There are many methods for height information extraction, the main ones being ground survey (direct field measurement), spatial sensors and photogrammetric methods; automatic extraction, however, is very difficult. This paper emphasizes a segmentation algorithm for shadow areas under multiple constraints and realizes automatic extraction of height information from shadows. A binarized image is obtained using a gray threshold estimated under the multiple constraints. Within the area of interest, spot elimination and region splitting are performed. After region labeling and elimination of non-shadowed regions, the shadow areas of cultural features are found. The height of a cultural feature is then calculated from the shadow length, the sun altitude and azimuth angles, and the sensor altitude and azimuth angles. Extensive experiments have shown that the mean square error of the extracted height information is close to 2 m and the automatic extraction rate is close to 70%.
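
    For a near-nadir image the height computation reduces to simple trigonometry, h = L·tan(α), with L the shadow length and α the sun altitude angle; the paper's full method also corrects for the sensor altitude and azimuth angles, which this sketch omits.

```python
import math

def height_from_shadow(shadow_len_m, sun_altitude_deg):
    """Near-nadir case: an object of height h casts a shadow of length
    L = h / tan(alpha), so h = L * tan(alpha)."""
    return shadow_len_m * math.tan(math.radians(sun_altitude_deg))

# A 12 m shadow under a sun altitude of 40 degrees implies ~10 m height.
print(round(height_from_shadow(12.0, 40.0), 2))  # 10.07
```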

  13. Extract the Relational Information of Static Features and Motion Features for Human Activities Recognition in Videos

    Science.gov (United States)

    2016-01-01

    Both static features and motion features have shown promising performance in human activity recognition tasks. However, the information included in these features is insufficient for complex human activities. In this paper, we propose extracting the relational information between static features and motion features for human activity recognition. The videos are represented by the classical Bag-of-Words (BoW) model, which is useful in many works. To get a compact and discriminative codebook of small dimension, we employ a divisive algorithm based on KL-divergence to reconstruct the codebook. After that, to further capture strong relational information, we construct a bipartite graph to model the relationship between the words of the different feature sets. We then use a k-way partition to create a new codebook in which similar words are grouped together. With this new codebook, videos can be represented by a new BoW vector with strong relational information. Moreover, we propose a method to compute new clusters from the divisive algorithm's projective function. We test our work on several datasets and obtain very promising results.
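
    The bipartite-graph partition step can be approximated with off-the-shelf spectral co-clustering, which jointly partitions the rows (e.g. static-feature words) and columns (e.g. motion-feature words) of a co-occurrence matrix. This is a stand-in for the paper's k-way partition, not its exact algorithm.

```python
import numpy as np
from sklearn.cluster import SpectralCoclustering

# C[i, j]: how often static-feature word i co-occurs with motion-feature
# word j in the same clip (Poisson toy data; real counts would be used).
rng = np.random.default_rng(0)
C = rng.poisson(1.0, size=(50, 40)).astype(float) + 1e-6  # no zero rows

model = SpectralCoclustering(n_clusters=5, random_state=0)
model.fit(C)
static_groups = model.row_labels_     # cluster id for each static word
motion_groups = model.column_labels_  # cluster id for each motion word
```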

  14. Extracting duration information in a picture category decoding task using hidden Markov Models

    Science.gov (United States)

    Pfeiffer, Tim; Heinze, Nicolai; Frysch, Robert; Deouell, Leon Y.; Schoenfeld, Mircea A.; Knight, Robert T.; Rose, Georg

    2016-04-01

    Objective. Adapting classifiers for the purpose of brain signal decoding is a major challenge in brain-computer-interface (BCI) research. In a previous study we showed in principle that hidden Markov models (HMM) are a suitable alternative to the well-studied static classifiers. However, since we investigated a rather straightforward task, advantages from modeling of the signal could not be assessed. Approach. Here, we investigate a more complex data set in order to find out to what extent HMMs, as a dynamic classifier, can provide useful additional information. We show for a visual decoding problem that besides category information, HMMs can simultaneously decode picture duration without an additional training required. This decoding is based on a strong correlation that we found between picture duration and the behavior of the Viterbi paths. Main results. Decoding accuracies of up to 80% could be obtained for category and duration decoding with a single classifier trained on category information only. Significance. The extraction of multiple types of information using a single classifier enables the processing of more complex problems, while preserving good training results even on small databases. Therefore, it provides a convenient framework for online real-life BCI utilizations.
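
    The core idea, a single HMM whose Viterbi path carries duration information as a by-product of category decoding, can be sketched with the hmmlearn package; the data below is random filler and the state semantics are assumptions rather than details taken from the paper.

```python
import numpy as np
from hmmlearn import hmm  # pip install hmmlearn

rng = np.random.default_rng(0)
X_train = rng.normal(size=(500, 8))   # stand-in single-trial features

# Train on category information only; no duration labels are used.
model = hmm.GaussianHMM(n_components=3, covariance_type="diag", n_iter=20)
model.fit(X_train)

# Decoding a trial yields the Viterbi state path: its log-likelihood
# supports the category decision, while the dwell time per state is the
# extra, training-free duration readout.
X_trial = rng.normal(size=(120, 8))
logprob, path = model.decode(X_trial, algorithm="viterbi")
dwell = np.bincount(path, minlength=3)  # samples spent in each state
```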

  15. Classification and Extraction of Urban Land-Use Information from High-Resolution Image Based on Object Multi-features

    Institute of Scientific and Technical Information of China (English)

    Kong Chunfang; Xu Kai; Wu Chonglong

    2006-01-01

    Urban land provides a suitable location for various economic activities which affect the development of surrounding areas. With rapid industrialization and urbanization, the contradictions in land-use become more noticeable. Urban administrators and decision-makers seek modern methods and technology to provide information support for urban growth. Recently, with the fast development of high-resolution sensor technology, more relevant data can be obtained, which is an advantage in studying the sustainable development of urban land-use. However, these data are only information sources and are a mixture of "information" and "noise". Processing, analysis and information extraction from remote sensing data is necessary to provide useful information. This paper extracts urban land-use information from a high-resolution image by using the multi-feature information of the image objects, and adopts an object-oriented image analysis approach and multi-scale image segmentation technology. A classification and extraction model is set up based on the multi-features of the image objects, in order to contribute to information for reasonable planning and effective management. This new image analysis approach offers a satisfactory solution for extracting information quickly and efficiently.

  16. Optical Polarization in Biomedical Applications

    CERN Document Server

    Tuchin, Valery V; Zimnyakov, Dmitry A

    2006-01-01

    Optical Polarization in Biomedical Applications introduces key developments in optical polarization methods for quantitative studies of tissues, while presenting the theory of polarization transfer in a random medium as a basis for the quantitative description of polarized light interaction with tissues. This theory uses the modified transfer equation for Stokes parameters and predicts the polarization structure of multiple scattered optical fields. The backscattering polarization matrices (Jones matrix and Mueller matrix) important for noninvasive medical diagnostic are introduced. The text also describes a number of diagnostic techniques such as CW polarization imaging and spectroscopy, polarization microscopy and cytometry. As a new tool for medical diagnosis, optical coherent polarization tomography is analyzed. The monograph also covers a range of biomedical applications, among them cataract and glaucoma diagnostics, glucose sensing, and the detection of bacteria.

  17. Toxicology of Biomedical Polymers

    Directory of Open Access Journals (Sweden)

    P. V. Vedanarayanan

    1987-04-01

    Full Text Available This paper deals with the various types of polymers used in the fabrication of medical devices, their diversity of applications, and the toxic hazards which may arise from their application. The potential toxicity of monomers and of the various additives used in the manufacture of biomedical polymers has been discussed, along with hazards which may arise from the processing of devices, such as sterilization. The importance of quality control and stringent toxicity evaluation methods has been emphasised since, at present, there are no regulations in our country covering the manufacture and marketing of medical devices. Finally, the question of the general and subtle long-term systemic toxicity of biomedical polymers has been brought to attention, with the suggestion that it needs to be resolved permanently by appropriate studies.

  18. Transforming a research-oriented dataset for evaluation of tactical information extraction technologies

    Science.gov (United States)

    Roy, Heather; Kase, Sue E.; Knight, Joanne

    2016-05-01

    The most representative and accurate data for testing and evaluating information extraction technologies is real-world data. Real-world operational data can provide important insights into human and sensor characteristics, interactions, and behavior. However, several challenges limit the feasibility of experimentation with real-world operational data. Real-world data lacks the precise knowledge of a "ground truth," a critical factor for benchmarking progress in developing automated information processing technologies. Additionally, the use of real-world data is often limited by classification restrictions due to the methods of collection, procedures for processing, and tactical sensitivities related to the sources, events, or objects of interest. These challenges, along with an increase in the development of automated information extraction technologies, are fueling an emerging demand for operationally realistic datasets for benchmarking. An approach to meet this demand is to create synthetic datasets which are operationally realistic yet unclassified in content. The unclassified nature of these synthetic datasets facilitates the sharing of data between military and academic researchers, thus increasing coordinated testing efforts. This paper describes the expansion and augmentation of two synthetic text datasets, one initially developed through academic research collaborations with the Army. Both datasets feature simulated tactical intelligence reports regarding fictitious terrorist activity occurring within a counterinsurgency (COIN) operation. The datasets were expanded and augmented to create two militarily relevant datasets. The first resulting dataset was created by augmenting and merging the two into a single larger dataset containing ground truth. The second resulting dataset was restructured to more realistically represent the format and content of intelligence reports. The dataset transformation effort, the final datasets, and their

  19. Toxicology of Biomedical Polymers

    OpenAIRE

    P. V. Vedanarayanan; A. C. Fernandez

    1987-01-01

    This paper deals with the various types of polymers, used in the fabrication of medical devices, their diversity of applications and toxic hazards which may arise out of their application. The potential toxicity of monomers and the various additives used in the manufacture of biomedical polymers have been discussed along with hazards which may arise out of processing of devices such as sterilization. The importance of quality control and stringent toxicity evaluation methods have been emphasi...

  20. Recent Advances and Emerging Applications in Text and Data Mining for Biomedical Discovery.

    Science.gov (United States)

    Gonzalez, Graciela H; Tahsin, Tasnia; Goodale, Britton C; Greene, Anna C; Greene, Casey S

    2016-01-01

    Precision medicine will revolutionize the way we treat and prevent disease. A major barrier to the implementation of precision medicine that clinicians and translational scientists face is understanding the underlying mechanisms of disease. We are starting to address this challenge through automatic approaches for information extraction, representation and analysis. Recent advances in text and data mining have been applied to a broad spectrum of key biomedical questions in genomics, pharmacogenomics and other fields. We present an overview of the fundamental methods for text and data mining, as well as recent advances and emerging applications toward precision medicine. PMID:26420781

  1. Analysis Methods for Extracting Knowledge from Large-Scale WiFi Monitoring to Inform Building Facility Planning

    DEFF Research Database (Denmark)

    Ruiz-Ruiz, Antonio; Blunck, Henrik; Prentow, Thor Siiger;

    2014-01-01

    realistic data to inform facility planning. In this paper, we propose analysis methods to extract knowledge from large sets of network collected WiFi traces to better inform facility management and planning in large building complexes. The analysis methods, which build on a rich set of temporal and spatial...

  2. Wireless tuning fork gyroscope for biomedical applications

    Science.gov (United States)

    Abraham, Jose K.; Varadan, Vijay K.; Whitchurch, Ashwin K.; Sarukesi, K.

    2003-07-01

    This paper presents the development of a Bluetooth-enabled wireless tuning fork gyroscope for biomedical applications, including gait phase detection, human motion analysis and physical therapy. The gyroscope is capable of measuring rotation rates between -90 and 90, and its rotation information can be read with a computer. Currently, the information from a gyroscope can trigger automobile airbag deployment during rollover, improve the accuracy and reliability of GPS navigation systems, and stabilize moving platforms such as automobiles, airplanes, robots, antennas, and industrial equipment. Adding wireless capability to the existing gyroscope could help to expand its applications in many areas, particularly in biomedical applications, where continuous patient monitoring is otherwise quite difficult. This wireless system provides information on several aspects of patients' activities for real-time monitoring in hospitals.

  3. Biomedical Terminology Mapper for UML projects.

    Science.gov (United States)

    Thibault, Julien C; Frey, Lewis

    2013-01-01

    As the biomedical community collects and generates more and more data, the need to describe these datasets for exchange and interoperability becomes crucial. This paper presents a mapping algorithm that can help developers expose local implementations described with UML through standard terminologies. The input UML class or attribute name is first normalized and tokenized, then lookups in a UMLS-based dictionary are performed. For the evaluation of the algorithm, 142 UML projects were extracted from caGrid and automatically mapped to National Cancer Institute (NCI) terminology concepts. The resulting mappings at the UML class and attribute levels were compared to the manually curated annotations provided in caGrid. Results are promising and show that this type of algorithm could speed up the tedious process of mapping local implementations to standard biomedical terminologies.
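
    The normalize-tokenize-lookup core of such a mapper is small. Here is a sketch with a toy two-entry dictionary standing in for the UMLS-based one; the concept identifiers are illustrative placeholders, not verified concept IDs.

```python
import re

def tokenize(uml_name):
    """Normalize a UML class/attribute name: split CamelCase and
    snake_case, then lowercase the tokens."""
    spaced = re.sub(r"([a-z0-9])([A-Z])", r"\1 \2", uml_name)
    return [t.lower() for t in re.split(r"[\s_]+", spaced) if t]

# Toy stand-in for the UMLS-based dictionary: phrase -> concept id.
DICTIONARY = {"patient identifier": "C-0001", "gene": "C-0002"}

def map_to_concept(uml_name):
    return DICTIONARY.get(" ".join(tokenize(uml_name)))

print(map_to_concept("PatientIdentifier"))   # C-0001
print(map_to_concept("patient_identifier"))  # C-0001
```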

  4. Extraction of Face Contour Information

    Institute of Scientific and Technical Information of China (English)

    原瑾

    2011-01-01

    Edge extraction has important research value in the fields of pattern recognition, machine vision, image analysis and image coding. Face detection technology is a prerequisite for face recognition technology. For face localization within face detection, this paper proposes a face contour information extraction technique that identifies the main face region of an image. The article first introduces several edge detection operators, and then proposes a dynamic threshold method to improve the image threshold, which increases the edge detection accuracy.
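
    One common way to realize a dynamic threshold for edge detection is to derive the Canny thresholds per image from Otsu's method instead of hard-coding them; whether this matches the paper's exact scheme is an assumption. Below is a sketch with OpenCV, using a hypothetical file name.

```python
import cv2  # pip install opencv-python

def dynamic_edges(gray):
    """Edge map with image-adaptive thresholds: Otsu's method picks the
    high Canny threshold per image; the low one is half of it."""
    high, _ = cv2.threshold(gray, 0, 255,
                            cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    return cv2.Canny(gray, 0.5 * high, high)

gray = cv2.imread("face.jpg", cv2.IMREAD_GRAYSCALE)  # hypothetical file
edges = dynamic_edges(gray)
```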

  5. Developing a Process Model for the Forensic Extraction of Information from Desktop Search Applications

    Directory of Open Access Journals (Sweden)

    Timothy Pavlic

    2008-03-01

    Full Text Available Desktop search applications can contain cached copies of files that were deleted from the file system. Forensic investigators see this as a potential source of evidence, as documents deleted by suspects may still exist in the cache. Whilst there have been attempts at recovering data collected by desktop search applications, there is no methodology governing the process, nor discussion on the most appropriate means to do so. This article seeks to address this issue by developing a process model that can be applied when developing an information extraction application for desktop search applications, discussing preferred methods and the limitations of each. This work represents a more structured approach than other forms of current research.

  6. Optimal Extraction of Cosmological Information from Supernova Data in the Presence of Calibration Uncertainties

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Alex G.; Miquel, Ramon

    2005-09-26

    We present a new technique to extract the cosmological information from high-redshift supernova data in the presence of calibration errors and extinction due to dust. While in the traditional technique the distance modulus of each supernova is determined separately, in our approach we determine all distance moduli at once, in a process that achieves a significant degree of self-calibration. The result is a much reduced sensitivity of the cosmological parameters to the calibration uncertainties. As an example, for a strawman mission similar to that outlined in the SNAP satellite proposal, the increased precision obtained with the new approach is roughly equivalent to a factor of five decrease in the calibration uncertainty.

  7. Metaproteomics: extracting and mining proteome information to characterize metabolic activities in microbial communities

    Energy Technology Data Exchange (ETDEWEB)

    Abraham, Paul E [ORNL; Giannone, Richard J [ORNL; Xiong, Weili [ORNL; Hettich, Robert {Bob} L [ORNL

    2014-01-01

    Contemporary microbial ecology studies usually employ one or more omics approaches to investigate the structure and function of microbial communities. Among these, metaproteomics aims to characterize the metabolic activities of the microbial membership, providing a direct link between the genetic potential and functional metabolism. The successful deployment of metaproteomics research depends on the integration of high-quality experimental and bioinformatic techniques for uncovering the metabolic activities of a microbial community in a way that is complementary to other meta-omic approaches. The essential, quality-defining informatics steps in metaproteomics investigations are: (1) construction of the metagenome, (2) functional annotation of predicted protein-coding genes, (3) protein database searching, (4) protein inference, and (5) extraction of metabolic information. In this article, we provide an overview of current bioinformatic approaches and software implementations in metaproteome studies in order to highlight the key considerations needed for successful implementation of this powerful community-biology tool.

  8. Metaproteomics: extracting and mining proteome information to characterize metabolic activities in microbial communities.

    Science.gov (United States)

    Abraham, Paul E; Giannone, Richard J; Xiong, Weili; Hettich, Robert L

    2014-01-01

    Contemporary microbial ecology studies usually employ one or more "omics" approaches to investigate the structure and function of microbial communities. Among these, metaproteomics aims to characterize the metabolic activities of the microbial membership, providing a direct link between the genetic potential and functional metabolism. The successful deployment of metaproteomics research depends on the integration of high-quality experimental and bioinformatic techniques for uncovering the metabolic activities of a microbial community in a way that is complementary to other "meta-omic" approaches. The essential, quality-defining informatics steps in metaproteomics investigations are: (1) construction of the metagenome, (2) functional annotation of predicted protein-coding genes, (3) protein database searching, (4) protein inference, and (5) extraction of metabolic information. In this article, we provide an overview of current bioinformatic approaches and software implementations in metaproteome studies in order to highlight the key considerations needed for successful implementation of this powerful community-biology tool.

  9. GPS positioning information extraction and application

    Institute of Scientific and Technical Information of China (English)

    李冰; 曾连荪

    2012-01-01

    This article describes a method of serial-port communication between a GPS receiver and a handheld terminal, and the extraction of GPS positioning information under the WinCE 6.0 operating system. The system parses the RMC data format of the NMEA-0183 communication protocol. The article also briefly introduces the concept of GIS and describes applications of GPS in the GIS field.
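
    Parsing an RMC sentence is mostly a matter of splitting on commas and converting the ddmm.mmmm latitude/longitude fields to decimal degrees. A minimal sketch (checksum verification omitted):

```python
def parse_rmc(sentence):
    """Parse a $GPRMC sentence (NMEA-0183) into (lat, lon) in decimal
    degrees, or return None when the fix is flagged invalid."""
    f = sentence.split(",")
    if not f[0].endswith("RMC") or f[2] != "A":  # 'A' marks a valid fix
        return None

    def dm_to_deg(dm, hemisphere):
        d, m = divmod(float(dm), 100.0)          # ddmm.mmmm -> deg, min
        deg = d + m / 60.0
        return -deg if hemisphere in ("S", "W") else deg

    return dm_to_deg(f[3], f[4]), dm_to_deg(f[5], f[6])

print(parse_rmc("$GPRMC,123519,A,4807.038,N,01131.000,E,"
                "022.4,084.4,230394,003.1,W*6A"))
# (48.1173, 11.516666...)
```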

  10. Extracting spatial information from noise measurements of multi-spatial-mode quantum states

    Science.gov (United States)

    Marino, A. M.; Clark, J. B.; Glorieux, Q.; Lett, P. D.

    2012-11-01

    We show that it is possible to use the spatial quantum correlations present in twin beams to extract information about the shape of a binary amplitude mask in the path of one of the beams. The scheme, based on noise measurements through homodyne detection, is useful in the regime where the number of photons is low enough that direct detection with a photodiode is difficult but high enough that photon counting is not an option. We find that under some conditions the use of quantum states of light leads to an enhancement of the sensitivity in the estimation of the shape of the mask over what can be achieved with a classical state with equivalent properties (mean photon flux and noise properties). In addition, we show that the level of enhancement that is obtained is a result of the quantum correlations and cannot be explained with only classical correlations.

  11. EnvMine: A text-mining system for the automatic extraction of contextual information

    Directory of Open Access Journals (Sweden)

    de Lorenzo Victor

    2010-06-01

    Full Text Available Background For ecological studies, it is crucial to count on adequate descriptions of the environments and samples being studied. Such a description must be given in terms of their physicochemical characteristics, allowing a direct comparison between different environments that would otherwise be difficult. The characterization must also include the precise geographical location, to make possible the study of geographical distributions and biogeographical patterns. Currently, there is no schema for annotating these environmental features, and these data have to be extracted from textual sources (published articles). So far, this had to be performed by manual inspection of the corresponding documents. To facilitate this task, we have developed EnvMine, a set of text-mining tools devoted to retrieving contextual information (physicochemical variables and geographical locations) from textual sources of any kind. Results EnvMine is capable of retrieving the physicochemical variables cited in the text by means of the accurate identification of their associated units of measurement. In this task, the system achieves a recall (percentage of items retrieved) of 92% with less than 1% error. A Bayesian classifier was also tested for distinguishing parts of the text describing environmental characteristics from others dealing with, for instance, experimental settings. Regarding the identification of geographical locations, the system takes advantage of existing databases such as GeoNames to achieve 86% recall with 92% precision. The identification of a location also includes the determination of its exact coordinates (latitude and longitude), thus allowing the calculation of the distance between individual locations. Conclusion EnvMine is a very efficient method for extracting contextual information from different text sources, such as published articles or web pages. This tool can help in determining the precise location and physicochemical
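
    The unit-anchored identification of physicochemical variables can be illustrated with a small regular expression that keeps a number only when a known unit of measurement follows it. This toy pattern covers a handful of units and is far simpler than the system's actual machinery.

```python
import re

# A value is kept only when a known physicochemical unit follows it.
UNITS = r"(?:°C|mM|µM|mg/L|g/L|km|m|%)"
PATTERN = re.compile(rf"(-?\d+(?:\.\d+)?)\s*({UNITS})(?![A-Za-z/])")

text = "Samples were taken at 35 m depth and 12.5 °C, salinity 3.4 %."
print(PATTERN.findall(text))
# [('35', 'm'), ('12.5', '°C'), ('3.4', '%')]
```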

  12. Intelligent information extraction to aid science decision making in autonomous space exploration

    Science.gov (United States)

    Merényi, Erzsébet; Tasdemir, Kadim; Farrand, William H.

    2008-04-01

    Effective scientific exploration of remote targets such as solar system objects increasingly calls for autonomous data analysis and decision making on-board. Today, robots in space missions are programmed to traverse from one location to another without regard to what they might be passing by. By not processing data as they travel, they can miss important discoveries, or will need to travel back if scientists on Earth find the data warrant backtracking. This is a suboptimal use of resources even on relatively close targets such as the Moon or Mars. The farther mankind ventures into space, the longer the delay in communication, due to which interesting findings from data sent back to Earth are made too late to command a (roving, floating, or orbiting) robot to further examine a given location. However, autonomous commanding of robots in scientific exploration can only be as reliable as the scientific information extracted from the data that is collected and provided for decision making. In this paper, we focus on the discovery scenario, where information extraction is accomplished with unsupervised clustering. For high-dimensional data with complicated structure, detailed segmentation that identifies all significant groups and discovers the small, surprising anomalies in the data, is a challenging task at which conventional algorithms often fail. We approach the problem with precision manifold learning using self-organizing neural maps with non-standard features developed in the course of our research. We demonstrate the effectiveness and robustness of this approach on multi-spectral imagery from the Mars Exploration Rovers Pancam, and on synthetic hyperspectral imagery.
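
    The unsupervised clustering step can be illustrated with a toy self-organizing map. The sketch below is a plain SOM in NumPy under illustrative assumptions (grid size, learning schedule, synthetic 6-band "spectra"); it is not the authors' precision manifold learning variant with non-standard features, only a demonstration of how an anomalous sample separates from the main cluster on the map.

    ```python
    import numpy as np

    rng = np.random.default_rng(0)

    def train_som(data, rows=5, cols=5, epochs=20, lr0=0.5, sigma0=2.0):
        """Train a small 2-D self-organizing map on data (n_samples x n_features)."""
        weights = rng.random((rows, cols, data.shape[1]))
        grid = np.stack(np.meshgrid(np.arange(rows), np.arange(cols),
                                    indexing="ij"), axis=-1)
        step, n_steps = 0, epochs * len(data)
        for _ in range(epochs):
            for x in rng.permutation(data):
                frac = step / n_steps
                lr = lr0 * (1 - frac)
                sigma = sigma0 * (1 - frac) + 0.5
                # best-matching unit: the map cell closest to the sample
                bmu = np.unravel_index(
                    np.argmin(((weights - x) ** 2).sum(-1)), (rows, cols))
                # pull the BMU and its grid neighbours toward the sample
                h = np.exp(-((grid - np.array(bmu)) ** 2).sum(-1)
                           / (2 * sigma ** 2))[..., None]
                weights += lr * h * (x - weights)
                step += 1
        return weights

    # Toy 6-band "spectra": one dense cluster plus a single anomalous sample.
    data = np.vstack([rng.normal(0.2, 0.02, (99, 6)), np.full((1, 6), 0.9)])
    w = train_som(data)
    bmu = lambda x: np.unravel_index(np.argmin(((w - x) ** 2).sum(-1)), w.shape[:2])
    print(bmu(data[0]), bmu(data[-1]))  # the anomaly maps to a different cell
    ```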

  13. Facilitating biomedical researchers' interrogation of electronic health record data: Ideas from outside of biomedical informatics.

    Science.gov (United States)

    Hruby, Gregory W; Matsoukas, Konstantina; Cimino, James J; Weng, Chunhua

    2016-04-01

    Electronic health records (EHR) are a vital data resource for research uses, including cohort identification, phenotyping, pharmacovigilance, and public health surveillance. To realize the promise of EHR data for accelerating clinical research, it is imperative to enable efficient and autonomous EHR data interrogation by end users such as biomedical researchers. This paper surveys state-of-the-art approaches and key methodological considerations for this purpose. We adapted a previously published conceptual framework for interactive information retrieval, which defines three entities: user, channel, and source, by elaborating on channels for query formulation in the context of facilitating end users to interrogate EHR data. We show that current progress in biomedical informatics mainly lies in support for query execution and information modeling, primarily due to an emphasis on infrastructure development for data integration and data access via self-service query tools, but that the field has neglected the user support needed during the iterative query formulation process, which can be costly and error-prone. In contrast, the information science literature has offered elaborate theories and methods for user modeling and query formulation support. The two bodies of literature are complementary, implying opportunities for cross-disciplinary idea exchange. On this basis, we outline directions for future informatics research to improve our understanding of user needs and requirements for facilitating autonomous interrogation of EHR data by biomedical researchers. We suggest that cross-disciplinary translational research between biomedical informatics and information science can benefit research on facilitating efficient data access in the life sciences.

  14. The Ontology for Biomedical Investigations

    OpenAIRE

    Anita Bandrowski; Ryan Brinkman; Mathias Brochhausen; Brush, Matthew H.; Bill Bug; Chibucos, Marcus C.; Kevin Clancy; Mélanie Courtot; Dirk Derom; Michel Dumontier; Liju Fan; Jennifer Fostel; Gilberto Fragoso; Frank Gibson; Alejandra Gonzalez-Beltran

    2016-01-01

    The Ontology for Biomedical Investigations (OBI) is an ontology that provides terms with precisely defined meanings to describe all aspects of how investigations in the biological and medical domains are conducted. OBI re-uses ontologies that provide a representation of biomedical knowledge from the Open Biological and Biomedical Ontologies (OBO) project and adds the ability to describe how this knowledge was derived. We here describe the state of OBI and several applications that are using it.

  15. Generating Explanations for Biomedical Queries

    OpenAIRE

    Erdem, Esra; Oztok, Umut

    2013-01-01

    We introduce novel mathematical models and algorithms to generate (shortest or k different) explanations for biomedical queries, using answer set programming. We implement these algorithms and integrate them in BIOQUERY-ASP. We illustrate the usefulness of these methods with some complex biomedical queries related to drug discovery, over the biomedical knowledge resources PHARMGKB, DRUGBANK, BIOGRID, CTD, SIDER, DISEASE ONTOLOGY and ORPHADATA. To appear in Theory and Practice of Logic Programming.

  16. Principles of Biomedical Engineering

    CERN Document Server

    Madihally, Sundararajan V

    2010-01-01

    Describing the role of engineering in medicine today, this comprehensive volume covers a wide range of the most important topics in this burgeoning field. Supported with over 145 illustrations, the book discusses bioelectrical systems, mechanical analysis of biological tissues and organs, biomaterial selection, compartmental modeling, and biomedical instrumentation. Moreover, it offers a thorough treatment of the concept of using living cells in various therapeutics and diagnostics. Structured as a complete text for students with some engineering background, the book also makes a valuable reference.

  17. Statistics in biomedical research

    Directory of Open Access Journals (Sweden)

    González-Manteiga, Wenceslao

    2007-06-01

    Full Text Available The discipline of biostatistics is nowadays a fundamental scientific component of biomedical, public health and health services research. Traditional and emerging areas of application include clinical trials research, observational studies, physiology, imaging, and genomics. The present article reviews the current situation of biostatistics, considering the statistical methods traditionally used in biomedical research, as well as the ongoing development of new methods in response to the new problems arising in medicine. Clearly, the successful application of statistics in biomedical research requires appropriate training of biostatisticians. This training should aim to give due consideration to emerging new areas of statistics, while at the same time retaining full coverage of the fundamentals of statistical theory and methodology. In addition, it is important that students of biostatistics receive formal training in relevant biomedical disciplines, such as epidemiology, clinical trials, molecular biology, genetics, and neuroscience.

  18. Advances in biomedical engineering

    CERN Document Server

    Brown, J H U

    1974-01-01

    Advances in Biomedical Engineering, Volume 4, is a collection of papers that deals with gas chromatography, mass spectroscopy and the analysis of minute samples, as well as the role of the government in regulating the production, usage, safety, and efficacy of medical devices. One paper reviews the use of mass spectrometry and computer technology in relation to gas-phase analytical methods based on gas chromatograph-mass spectrometer instruments and gas chromatograph-mass spectrometer-computer analytical systems. Many health practitioners, government and private health agencies, and the legal profession...

  19. Biomedical Sensors and Instruments

    CERN Document Server

    Tagawa, Tatsuo

    2011-01-01

    The living body is a difficult object to measure: accurate measurements of physiological signals require sensors and instruments capable of high specificity and selectivity that do not interfere with the systems under study. As a result, detailed knowledge of sensor and instrument properties is required to be able to select the "best" sensor from one of the many designed to meet these challenges. From the underlying principles to practical applications, this updated edition of Biomedical Sensors and Instruments provides an easy-to-understand introduction to the various kinds of biomedical sensors and instruments.

  20. Advances in biomedical engineering

    CERN Document Server

    Brown, J H U

    1973-01-01

    Advances in Biomedical Engineering, Volume 3, is a collection of papers that discusses circulatory system models, linguistics in computer usage, and clinical applications in patient monitoring. One paper describes the use of comparative models of overall circulatory mechanics that include models of the cardiac pump, of the vascular systems, and of the overall systems behavior. Another paper describes a model for processing medical language data that employs an explicit semantic structure, becoming the basis for the computer-based, artificial intelligence of the system. One paper cites studies...

  1. Biomedical photonics handbook

    CERN Document Server

    Vo-Dinh, Tuan

    2003-01-01

    Contents include: 1. Biomedical Photonics: A Revolution at the Interface of Science and Technology (T. Vo-Dinh). Photonics and Tissue Optics: 2. Optical Properties of Tissues (J. Mobley and T. Vo-Dinh); 3. Light-Tissue Interactions (V.V. Tuchin); 4. Theoretical Models and Algorithms in Optical Diffusion Tomography (S.J. Norton and T. Vo-Dinh). Photonic Devices: 5. Laser Light in Biomedicine and the Life Sciences: From the Present to the Future (V.S. Letokhov); 6. Basic Instrumentation in Photonics (T. Vo-Dinh); 7. Optical Fibers and Waveguides for Medical Applications (I. Gannot and ...)

  2. Biomedical signals and systems

    CERN Document Server

    Tranquillo, Joseph V

    2013-01-01

    Biomedical Signals and Systems is meant to accompany a one-semester undergraduate signals and systems course. It may also serve as a quick-start for graduate students or faculty interested in how signals and systems techniques can be applied to living systems. The biological nature of the examples allows for systems thinking to be applied to electrical, mechanical, fluid, chemical, thermal and even optical systems. Each chapter focuses on a topic from classic signals and systems theory: system block diagrams, mathematical models, transforms, stability, feedback, system response, control, time...

  3. Advances in biomedical engineering

    CERN Document Server

    Brown, J H U

    1973-01-01

    Advances in Biomedical Engineering, Volume 2, is a collection of papers that discusses the basic sciences, the applied sciences of engineering, the medical sciences, and the delivery of health services. One paper discusses models of adrenal cortical control, including the secretion and metabolism of cortisol (the controlled process), as well as the initiation and modulation of secretion of ACTH (the controller). Another paper discusses hospital computer systems: application problems, objective evaluation of technology, and multiple pathways for future hospital computer applications.

  4. A methodology for the extraction of quantitative information from electron microscopy images at the atomic level

    Science.gov (United States)

    Galindo, P. L.; Pizarro, J.; Guerrero, E.; Guerrero-Lebrero, M. P.; Scavello, G.; Yáñez, A.; Núñez-Moraleda, B. M.; Maestre, J. M.; Sales, D. L.; Herrera, M.; Molina, S. I.

    2014-06-01

    In this paper we describe a methodology developed at the University of Cadiz (Spain) in the past few years for the extraction of quantitative information from electron microscopy images at the atomic level. This work is based on a coordinated and synergic activity of several research groups that have been working together over the last decade in two different and complementary fields: Materials Science and Computer Science. The aim of our joint research has been to develop innovative high-performance computing techniques and simulation methods in order to address computationally challenging problems in the analysis, modelling and simulation of materials at the atomic scale, providing significant advances with respect to existing techniques. The methodology involves several fundamental areas of research including the analysis of high resolution electron microscopy images, materials modelling, image simulation and 3D reconstruction using quantitative information from experimental images. These techniques for the analysis, modelling and simulation allow optimizing the control and functionality of devices developed using materials under study, and have been tested using data obtained from experimental samples.

  5. A comparison of techniques for extracting emissivity information from thermal infrared data for geologic studies

    Science.gov (United States)

    Hook, Simon J.; Gabell, A. R.; Green, A. A.; Kealy, P. S.

    1992-01-01

    This article evaluates three techniques developed to extract emissivity information from multispectral thermal infrared data. The techniques are the assumed Channel 6 emittance model, thermal log residuals, and alpha residuals. These techniques were applied to calibrated, atmospherically corrected thermal infrared multispectral scanner (TIMS) data acquired over Cuprite, Nevada in September 1990. Results indicate that the two new techniques (thermal log residuals and alpha residuals) provide two distinct advantages over the assumed Channel 6 emittance model. First, they permit emissivity information to be derived from all six TIMS channels. The assumed Channel 6 emittance model only permits emissivity values to be derived from five of the six TIMS channels. Second, both techniques are less susceptible to noise than the assumed Channel 6 emittance model. The disadvantage of both techniques is that laboratory data must be converted to thermal log residuals or alpha residuals to facilitate comparison with similarly processed image data. An additional advantage of the alpha residual technique is that the processed data are scene-independent unlike those obtained with the other techniques.

  6. How to extract clinically useful information from large amount of dialysis related stored data.

    Science.gov (United States)

    Vito, Domenico; Casagrande, Giustina; Bianchi, Camilla; Costantino, Maria L

    2015-01-01

    The storage infrastructure used to gather data in the healthcare field, driven by technological evolution, is leading to ever larger quantities of data related to patients and their pathological evolution being stored in public or private repositories. Big data techniques are spreading in medical research as well. With these techniques it is possible to extract information from complex heterogeneous sources and to carry out longitudinal studies that correlate patient status with biometric parameters. In our work we developed a common data infrastructure involving four clinical dialysis centres in Lombardy and Switzerland. The common platform was built to store a large amount of clinical data related to 716 dialysis sessions of 70 patients. The platform is made up of a combination of a MySQL(®) database (Dialysis Database) and a MATLAB-based mining library (Dialysis MATlib). A statistical analysis was performed on the data gathered. These analyses led to the development of two clinical indexes, representing an example of the transformation of big data into clinical information. PMID:26737858
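
    As a toy illustration of the platform's division of labour (a database plus a mining library), the sketch below uses SQLite in place of the MySQL(®) Dialysis Database; the schema, the measurements and the derived per-patient index are all hypothetical stand-ins, not those of the actual system.

    ```python
    import sqlite3
    import statistics

    # Hypothetical mini-schema standing in for the Dialysis Database.
    con = sqlite3.connect(":memory:")
    con.execute("""CREATE TABLE sessions (
        patient_id INTEGER, session_id INTEGER, weight_loss_kg REAL)""")
    con.executemany(
        "INSERT INTO sessions VALUES (?, ?, ?)",
        [(1, 1, 2.1), (1, 2, 2.4), (1, 3, 1.9), (2, 1, 3.0), (2, 2, 2.8)])

    # Aggregate stored sessions into a per-patient summary, mimicking the
    # transformation of raw session data into a clinical index.
    for (pid,) in con.execute("SELECT DISTINCT patient_id FROM sessions"):
        losses = [w for (w,) in con.execute(
            "SELECT weight_loss_kg FROM sessions WHERE patient_id = ?", (pid,))]
        # Example "clinical index": mean intradialytic weight loss.
        print(pid, round(statistics.mean(losses), 2))
    ```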

  7. Enriching a biomedical event corpus with meta-knowledge annotation

    Directory of Open Access Journals (Sweden)

    Thompson Paul

    2011-10-01

    Full Text Available Abstract Background Biomedical papers contain rich information about entities, facts and events of biological relevance. To discover these automatically, we use text mining techniques, which rely on annotated corpora for training. In order to extract protein-protein interactions, genotype-phenotype/gene-disease associations, etc., we rely on event corpora that are annotated with classified, structured representations of important facts and findings contained within text. These provide an important resource for the training of domain-specific information extraction (IE) systems, to facilitate semantic-based searching of documents. Correct interpretation of these events is not possible without additional information, e.g., does an event describe a fact, a hypothesis, an experimental result or an analysis of results? How confident is the author about the validity of her analyses? These and other types of information, which we collectively term meta-knowledge, can be derived from the context of the event. Results We have designed an annotation scheme for meta-knowledge enrichment of biomedical event corpora. The scheme is multi-dimensional, in that each event is annotated for 5 different aspects of meta-knowledge that can be derived from the textual context of the event. Textual clues used to determine the values are also annotated. The scheme is intended to be general enough to allow integration with different types of bio-event annotation, whilst being detailed enough to capture important subtleties in the nature of the meta-knowledge expressed in the text. We report here on both the main features of the annotation scheme, as well as its application to the GENIA event corpus (1000 abstracts with 36,858 events). High levels of inter-annotator agreement have been achieved, falling in the range of 0.84-0.93 Kappa. Conclusion By augmenting event annotations with meta-knowledge, more sophisticated IE systems can be trained, which allow interpretative...
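
    The inter-annotator agreement figures quoted above are Cohen's kappa values. A minimal sketch of the computation on toy labels for a single meta-knowledge dimension (the label set below is a hypothetical stand-in):

    ```python
    from collections import Counter

    def cohens_kappa(ann1, ann2):
        """Cohen's kappa: chance-corrected agreement between two annotators."""
        assert len(ann1) == len(ann2)
        n = len(ann1)
        observed = sum(a == b for a, b in zip(ann1, ann2)) / n
        c1, c2 = Counter(ann1), Counter(ann2)
        expected = sum(c1[l] * c2[l] for l in set(c1) | set(c2)) / n ** 2
        return (observed - expected) / (1 - expected)

    a = ["Fact", "Fact", "Analysis", "Hypothesis", "Fact", "Analysis"]
    b = ["Fact", "Analysis", "Analysis", "Hypothesis", "Fact", "Analysis"]
    print(round(cohens_kappa(a, b), 3))  # 0.739 on this toy example
    ```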

  8. Information Extraction and Dependency on Open Government Data (ogd) for Environmental Monitoring

    Science.gov (United States)

    Abdulmuttalib, Hussein

    2016-06-01

    Environmental monitoring practices support decision makers in government and private institutions, as well as environmentalists and planners, helping them act towards the sustainability of our environment and take efficient measures to protect human beings; however, it is difficult to extract useful information from Open Government Data ('OGD') and to assure its quality for this purpose. Monitoring itself comprises detecting changes as they happen, or within the mitigation period, which means that any data source used for monitoring should reflect the information for the period of environmental monitoring; otherwise it is of little use, or merely history. In this paper, the extraction and structuring of information from OGD that can be useful for environmental monitoring is assessed, looking into its availability, its usefulness for a given type of environmental monitoring, its repetition period and its dependences. The assessment is performed on a small sample selected from OGD, bearing in mind the type of environmental change monitored, such as the increase and concentration of built-up areas, the reduction of green areas, or the change of temperature in a specific area. The World Bank mentioned in its blog that data is open if it satisfies both conditions of being technically open and legally open. The use of open data is thus regulated by published terms of use, or an agreement which implies some conditions without violating the two conditions above. Within the scope of the paper I wish to share the experience of using some OGD to support an environmental monitoring effort performed to mitigate the production of carbon dioxide, by regulating energy consumption and by properly designing the test area's landscapes using Geodesign tactics, and wish to add to the results achieved by many...

  9. Biomedical applications of nisin.

    Science.gov (United States)

    Shin, J M; Gwak, J W; Kamarajan, P; Fenno, J C; Rickard, A H; Kapila, Y L

    2016-06-01

    Nisin is a bacteriocin produced by a group of Gram-positive bacteria that belongs to Lactococcus and Streptococcus species. Nisin is classified as a Type A (I) lantibiotic that is synthesized from mRNA and the translated peptide contains several unusual amino acids due to post-translational modifications. Over the past few decades, nisin has been used widely as a food biopreservative. Since then, many natural and genetically modified variants of nisin have been identified and studied for their unique antimicrobial properties. Nisin is FDA approved and generally regarded as a safe peptide with recognized potential for clinical use. Over the past two decades the application of nisin has been extended to biomedical fields. Studies have reported that nisin can prevent the growth of drug-resistant bacterial strains, such as methicillin-resistant Staphylococcus aureus, Streptococcus pneumoniae, Enterococci and Clostridium difficile. Nisin has now been shown to have antimicrobial activity against both Gram-positive and Gram-negative disease-associated pathogens. Nisin has been reported to have anti-biofilm properties and can work synergistically in combination with conventional therapeutic drugs. In addition, like host-defence peptides, nisin may activate the adaptive immune response and have an immunomodulatory role. Increasing evidence indicates that nisin can influence the growth of tumours and exhibit selective cytotoxicity towards cancer cells. Collectively, the application of nisin has advanced beyond its role as a food biopreservative. Thus, this review will describe and compare studies on nisin and provide insight into its future biomedical applications.

  10. Sentence Simplification Aids Protein-Protein Interaction Extraction

    OpenAIRE

    Jonnalagadda, Siddhartha; Gonzalez, Graciela

    2010-01-01

    Accurate systems for extracting Protein-Protein Interactions (PPIs) automatically from biomedical articles can help accelerate biomedical research. Biomedical informatics researchers are collaborating to provide metaservices and advance the state of the art in PPI extraction. One problem often neglected by current Natural Language Processing systems is the characteristic complexity of the sentences in biomedical literature. In this paper, we report on the impact that automatic simplification of sentences has on PPI extraction.

  11. Overview of BioCreAtIvE: critical assessment of information extraction for biology

    Directory of Open Access Journals (Sweden)

    Hirschman Lynette

    2005-05-01

    Full Text Available Abstract Background The goal of the first BioCreAtIvE challenge (Critical Assessment of Information Extraction in Biology) was to provide a set of common evaluation tasks to assess the state of the art for text mining applied to biological problems. The results were presented in a workshop held in Granada, Spain, March 28–31, 2004. The articles collected in this BMC Bioinformatics supplement entitled "A critical assessment of text mining methods in molecular biology" describe the BioCreAtIvE tasks, systems, results and their independent evaluation. Results BioCreAtIvE focused on two tasks. The first dealt with extraction of gene or protein names from text, and their mapping into standardized gene identifiers for three model organism databases (fly, mouse, yeast). The second task addressed issues of functional annotation, requiring systems to identify specific text passages that supported Gene Ontology annotations for specific proteins, given full text articles. Conclusion The first BioCreAtIvE assessment achieved a high level of international participation (27 groups from 10 countries). The assessment provided state-of-the-art performance results for a basic task (gene name finding and normalization), where the best systems achieved a balanced 80% precision / recall or better, which potentially makes them suitable for real applications in biology. The results for the advanced task (functional annotation from free text) were significantly lower, demonstrating the current limitations of text-mining approaches where knowledge extrapolation and interpretation are required. In addition, an important contribution of BioCreAtIvE has been the creation and release of training and test data sets for both tasks. There are 22 articles in this special issue, including six that provide analyses of results or data quality for the data sets, including a novel inter-annotator consistency assessment for the test set used in task 2.
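
    The "balanced 80% precision / recall" figure refers to entity-level scoring against a gold standard. A minimal sketch of the metric on toy gene-name/identifier pairs (the identifiers below are made up for illustration):

    ```python
    def prf(gold, predicted):
        """Entity-level precision, recall and balanced F-measure."""
        tp = len(gold & predicted)
        precision = tp / len(predicted) if predicted else 0.0
        recall = tp / len(gold) if gold else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        return precision, recall, f1

    # Hypothetical gold and system output: (mention, normalized identifier).
    gold = {("p53", "FLYBASE:1"), ("Adh", "FLYBASE:2"), ("white", "FLYBASE:3")}
    pred = {("p53", "FLYBASE:1"), ("Adh", "FLYBASE:2"), ("rosy", "FLYBASE:9")}
    print(prf(gold, pred))  # ≈ (0.667, 0.667, 0.667): precision = recall here
    ```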

  12. A Knowledge-Driven Approach to Extract Disease-Related Biomarkers from the Literature

    OpenAIRE

    À. Bravo; Cases, M.; Queralt-Rosinach, N.; Sanz, F.; Furlong, L. I.

    2014-01-01

    The biomedical literature represents a rich source of biomarker information. However, both the size of literature databases and their lack of standardization hamper the automatic exploitation of the information contained in these resources. Text mining approaches have proven to be useful for the exploitation of information contained in scientific publications. Here, we show that a knowledge-driven text mining approach can exploit a large literature database to extract a dataset of biomarkers.

  13. Trends in modeling Biomedical Complex Systems

    OpenAIRE

    Remondini Daniel; Castellani Gastone; Romano Paolo; Milanesi Luciano; Liò Petro

    2009-01-01

    Abstract In this paper we provide an introduction to the techniques for multi-scale complex biological systems, from the single bio-molecule to the cell, combining theoretical modeling, experiments, informatics tools and technologies suitable for biological and biomedical research, which are becoming increasingly multidisciplinary, multidimensional and information-driven. The most important concepts on mathematical modeling methodologies and statistical inference, bioinformatics and standards...

  14. BioLemmatizer: a lemmatization tool for morphological processing of biomedical text

    Directory of Open Access Journals (Sweden)

    Liu Haibin

    2012-04-01

    Full Text Available Abstract Background The wide variety of morphological variants of domain-specific technical terms contributes to the complexity of performing natural language processing of the scientific literature related to molecular biology. For morphological analysis of these texts, lemmatization has been actively applied in the recent biomedical research. Results In this work, we developed a domain-specific lemmatization tool, BioLemmatizer, for the morphological analysis of biomedical literature. The tool focuses on the inflectional morphology of English and is based on the general English lemmatization tool MorphAdorner. The BioLemmatizer is further tailored to the biological domain through incorporation of several published lexical resources. It retrieves lemmas based on the use of a word lexicon, and defines a set of rules that transform a word to a lemma if it is not encountered in the lexicon. An innovative aspect of the BioLemmatizer is the use of a hierarchical strategy for searching the lexicon, which enables the discovery of the correct lemma even if the input Part-of-Speech information is inaccurate. The BioLemmatizer achieves an accuracy of 97.5% in lemmatizing an evaluation set prepared from the CRAFT corpus, a collection of full-text biomedical articles, and an accuracy of 97.6% on the LLL05 corpus. The contribution of the BioLemmatizer to accuracy improvement of a practical information extraction task is further demonstrated when it is used as a component in a biomedical text mining system. Conclusions The BioLemmatizer outperforms other tools when compared with eight existing lemmatizers. The BioLemmatizer is released as an open source software and can be downloaded from http://biolemmatizer.sourceforge.net.
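
    The lookup-then-rules strategy, including the POS-tolerant hierarchical search, can be sketched in a few lines. The lexicon entries and suffix rules below are tiny illustrative stand-ins, not BioLemmatizer's actual resources:

    ```python
    # Hypothetical mini-lexicon keyed by (word, POS), as in the lookup stage.
    LEXICON = {
        ("mice", "NNS"): "mouse",
        ("analyses", "NNS"): "analysis",
        ("bound", "VBD"): "bind",
    }
    # Ordered fallback rules applied only when the lexicon fails.
    SUFFIX_RULES = [("ies", "y"), ("es", "e"), ("s", "")]

    def lemmatize(word, pos=None):
        # Hierarchical search: exact (word, POS) first, then ignore POS,
        # so an inaccurate POS tag does not block lemma discovery.
        if (word, pos) in LEXICON:
            return LEXICON[(word, pos)]
        for (w, _), lemma in LEXICON.items():
            if w == word:
                return lemma
        for suffix, repl in SUFFIX_RULES:
            if word.endswith(suffix):
                return word[: -len(suffix)] + repl
        return word

    print(lemmatize("mice", "NN"))      # 'mouse' despite the wrong POS tag
    print(lemmatize("kinases", "NNS"))  # 'kinase' via the fallback rules
    ```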

  15. Multiscale computer modeling in biomechanics and biomedical engineering

    CERN Document Server

    2013-01-01

    This book reviews the state-of-the-art in multiscale computer modeling, in terms of both accomplishments and challenges. The information in the book is particularly useful for biomedical engineers, medical physicists and researchers in systems biology, mathematical biology, micro-biomechanics and biomaterials who are interested in how to bridge between traditional biomedical engineering work at the organ and tissue scales, and the newer arenas of cellular and molecular bioengineering.

  16. Professional Identification for Biomedical Engineers

    Science.gov (United States)

    Long, Francis M.

    1973-01-01

    Discusses four methods of professional identification in biomedical engineering including registration, certification, accreditation, and possible membership qualification of the societies. Indicates that the destiny of the biomedical engineer may be under the control of a new profession, neither the medical nor the engineering. (CC)

  17. In-shore ship extraction from HR optical remote sensing image via salience structure and GIS information

    Science.gov (United States)

    Ren, Xiaoyuan; Jiang, Libing; Tang, Xiao-an

    2015-12-01

    In order to solve the problem of in-shore ship extraction from remote sensing images, a novel method for in-shore ship extraction from high resolution (HR) optical remote sensing images is proposed using salient structure features and GIS information. Firstly, the berth ROI is located in the image with the aid of prior GIS auxiliary information. Secondly, the salient corner features at the ship bow are extracted precisely from the berth ROI. Finally, a recursive algorithm exploiting the symmetric geometry of the ship target is applied to separate multiple docked in-shore targets into individual in-shore ships. The experimental results show that the proposed method can detect the majority of large and medium scale in-shore ships in optical remote sensing images, in both the single and the multiple adjacently docked in-shore ship cases.

  18. A Semantics and Data-Driven Biomedical Multimedia Software System

    Directory of Open Access Journals (Sweden)

    Shih-Hsi Liu

    2010-08-01

    Full Text Available Developing a large-scale biomedical multimedia software system is always a challenging task: satisfaction of sundry and stringent biomedical multimedia related requirements and standards; heterogeneous software deployment and communication environments; and tangled correlations between data/contents and software functionalities, among others. This paper introduces a novel biomedical multimedia software system developed under a Service-Oriented Architecture (SOA). Such a system takes advantage of the interoperability of SOA to solve the heterogeneity and correlation problems. The paper also classifies the system into services, annotation, ontologies, semantics matching, and QoS optimization aspects, which may potentially solve the requirements problem: by establishing a data ontology with respect to data properties, contents, QoS, and biomedical regulations, and by expanding the service ontology to describe further functional and QoS specifications supported by services, appropriate services for processing biomedical multimedia data may be discovered, performed, tuned up or replaced as needed. Lastly, a biomedical education project that improves the performance of subsequent feature extraction and classification is introduced to illustrate the advantages of our software system developed under SOA.

  19. Extracting conformational structure information of benzene molecules via laser-induced electron diffraction

    Directory of Open Access Journals (Sweden)

    Yuta Ito

    2016-05-01

    Full Text Available We have measured the angular distributions of high energy photoelectrons of benzene molecules generated by intense infrared femtosecond laser pulses. These electrons arise from elastic collisions between the benzene ions and previously tunnel-ionized electrons that have been driven back by the laser field. Theory shows that laser-free elastic differential cross sections (DCSs) can be extracted from these photoelectrons, and the DCS can be used to retrieve the bond lengths of gas-phase molecules, similar to the conventional electron diffraction method. From our experimental results, we have obtained the C-C and C-H bond lengths of benzene with a spatial resolution of about 10 pm. Our results demonstrate that laser-induced electron diffraction (LIED) experiments can already be carried out with present-day ultrafast intense lasers. Looking ahead, with aligned or oriented molecules, more complete spatial information about the molecule can be obtained from LIED, and by applying LIED to probe photo-excited molecules, a "molecular movie" of the dynamic system may be created with sub-Ångström spatial and few-ten-femtosecond temporal resolutions.

  20. MEMS biomedical implants

    Institute of Scientific and Technical Information of China (English)

    Tai Yuchong

    2012-01-01

    The field of micro-electro-mechanical systems (MEMS) has advanced tremendously over the last 20 years. Most noticeably in commercial terms, the field has successfully advanced from pressure sensors to micro physical sensors, such as accelerometers and gyros, for handheld electronics applications. In parallel, MEMS has also advanced into micro total analysis systems (µTAS) and/or lab-on-a-chip applications. This article discusses a relatively new but promising future direction towards MEMS biomedical implants. Specifically, Parylene C has been explored as a good MEMS implant material and is discussed in detail. Demonstrated implant devices, such as retinal and spinal cord implants, are presented in this article.

  1. Checklists in biomedical publications

    Directory of Open Access Journals (Sweden)

    Pardal-Refoyo JL

    2013-12-01

    Full Text Available Introduction and objectives: authors, reviewers, editors and readers must have specific tools that help them in the process of drafting, reviewing, or reading articles. Objective: to offer a summary of the major checklists for different types of biomedical research articles. Material and method: review of the literature and resources of the EQUATOR Network and of the adaptations in Spanish published by the Medicina Clínica and Evidencias en Pediatría journals. Results: checklists have been elaborated by various working groups for experimental studies (CONSORT and TREND), observational studies (STROBE), diagnostic accuracy studies (STARD), systematic reviews and meta-analyses (PRISMA), and studies to improve quality (SQUIRE). Conclusions: the use of checklists helps to improve the quality of articles and helps authors, reviewers, editors and readers in the development and understanding of their content.

  3. GDRMS: a system for automatic extraction of the disease-centre relation

    Science.gov (United States)

    Yang, Ronggen; Zhang, Yue; Gong, Lejun

    2012-01-01

    With the rapid increase of biomedical literature, the deluge of new articles is leading to information overload. Extracting the available knowledge from the huge amount of biomedical literature has become a major challenge. GDRMS is developed as a tool that extracts relationships between diseases and genes, and between genes, from the biomedical literature using text mining technology. It is a rule-based system which also provides disease-centred network visualization, constructs a disease-gene database, and offers a gene engine for understanding gene function. The main focus of GDRMS is to provide a valuable opportunity for the research community to explore the relationship between diseases and genes in the etiology of disease.

  4. Sequential Data Mining for Information Extraction from Texts

    Directory of Open Access Journals (Sweden)

    Thierry Charnois

    2010-09-01

    Full Text Available This paper shows the benefit of using data mining methods for biological natural language processing. A method for discovering linguistic patterns based on recursive sequential pattern mining is proposed. It requires neither sentence parsing nor any resource other than a training data set. It produces understandable results, and we show its interest in the extraction of relations between named entities. For the named entity recognition problem, we propose a method based on a new kind of pattern that takes into account the sequence and its context.
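
    A minimal sketch of sequential pattern mining over tokenized sentences is given below; it is a plain PrefixSpan-style search under illustrative assumptions (toy sentences, a support threshold of 2), not the authors' recursive algorithm, but it shows how a linguistic pattern such as ('positively', 'regulates') emerges from a training set alone.

    ```python
    def projections(sequences, prefix):
        """Suffixes of the sequences that contain prefix as a subsequence."""
        out = []
        for seq in sequences:
            i = 0
            for tok in prefix:
                try:
                    i = seq.index(tok, i) + 1
                except ValueError:
                    i = None
                    break
            if i is not None:
                out.append(seq[i:])
        return out

    def mine(sequences, min_support, prefix=(), results=None):
        """Grow frequent sequential patterns one token at a time."""
        if results is None:
            results = {}
        proj = projections(sequences, prefix)
        for tok in sorted({t for suffix in proj for t in suffix}):
            support = sum(tok in suffix for suffix in proj)
            if support >= min_support:
                results[prefix + (tok,)] = support
                mine(sequences, min_support, prefix + (tok,), results)
        return results

    sentences = [
        "RAD53 positively regulates DBF4".split(),
        "GerE negatively regulates cotA".split(),
        "sigmaK positively regulates gerE".split(),
    ]
    patterns = mine(sentences, min_support=2)
    print({p: s for p, s in patterns.items() if len(p) > 1})
    # {('positively', 'regulates'): 2}
    ```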

  5. Biomedical engineering frontier research and converging technologies

    CERN Document Server

    Jun, Ho-Wook; Shin, Jennifer; Lee, SangHoon

    2016-01-01

    This book provides readers with an integrative overview of the latest research and developments in the broad field of biomedical engineering. Each of the chapters offers a timely review written by leading biomedical engineers and aims at showing how the convergence of scientific and engineering fields with medicine has created a new basis for practically solving problems concerning human health, wellbeing and disease. While some of the latest frontiers of biomedicine, such as neuroscience and regenerative medicine, are becoming increasingly dependent on new ideas and tools from other disciplines, the paradigm shift caused by technological innovations in the fields of information science, nanotechnology, and robotics is opening new opportunities in healthcare, besides dramatically changing the ways we actually practice science. At the same time, a new generation of engineers, fluent in many different scientific “languages,” is creating entirely new fields of research that approach the “old” questions f...

  6. 3D building reconstruction based on given ground plan information and surface models extracted from spaceborne imagery

    Science.gov (United States)

    Tack, Frederik; Buyuksalih, Gurcan; Goossens, Rudi

    2012-01-01

    3D surface models have gained ground as an important tool for urban planning and mapping. However, urban environments are complex to model, and they provide a challenge for investigating the current limits of automatic digital surface modeling from high resolution satellite imagery. An approach is introduced to improve a 3D surface model, extracted photogrammetrically from satellite imagery, based on the geometric building information embodied in existing 2D ground plans. First, buildings are clipped from the extracted DSM based on the 2D polygonal building ground plans. To generate prismatic shaped structures with vertical walls and flat roofs, building shape is retrieved from the cadastre database while elevation information is extracted from the DSM. Within each 2D building boundary, a constant roof height is extracted based on statistical calculations over the height values. After the buildings are extracted from the initial surface model, the remaining DSM is further processed and simplified to a smooth DTM that reflects bare ground, without artifacts, local relief, vegetation, cars and city furniture. In a next phase, both models are merged to yield an integrated city model or generalized DSM. The accuracy of the generalized surface model is assessed through a quantitative-statistical analysis by comparison with two different types of reference data.
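
    The prismatic-building step can be illustrated compactly: clip the DSM with a footprint mask and assign one statistically derived roof height per building. The sketch below uses a synthetic DSM and a rectangular footprint as stand-ins for real imagery and cadastre polygons, and the median as the height statistic (an assumption; the paper only states that the constant height is computed statistically):

    ```python
    import numpy as np

    rng = np.random.default_rng(1)
    dsm = rng.normal(2.0, 0.3, (100, 100))   # synthetic bare ground (metres)
    dsm[20:40, 30:60] += 15.0                # one building rising above it

    footprint = np.zeros_like(dsm, dtype=bool)  # stand-in for a cadastre polygon
    footprint[20:40, 30:60] = True

    # One constant roof height per footprint, from the heights inside it.
    roof_height = np.median(dsm[footprint])
    prism = np.where(footprint, roof_height, np.nan)  # flat-roofed prism
    print(round(float(roof_height), 2))  # ≈ 17.0 m
    ```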

  7. Usability survey of biomedical question answering systems

    Directory of Open Access Journals (Sweden)

    Bauer Michael A

    2012-09-01

    Full Text Available Abstract We live in an age of access to more information than ever before. This can be a double-edged sword. Increased access to information allows for more informed and empowered researchers, while information overload becomes an increasingly serious risk. Thus, there is a need for intelligent information retrieval systems that can summarize relevant and reliable textual sources to satisfy a user's query. Question answering is a specialized type of information retrieval with the aim of returning precise short answers to queries posed as natural language questions. We present a review and comparison of three biomedical question answering systems: askHERMES (http://www.askhermes.org/), EAGLi (http://eagl.unige.ch/EAGLi/), and HONQA (http://services.hon.ch/cgi-bin/QA10/qa.pl).

  8. Terahertz Imaging for Biomedical Applications Pattern Recognition and Tomographic Reconstruction

    CERN Document Server

    Yin, Xiaoxia; Abbott, Derek

    2012-01-01

    Terahertz Imaging for Biomedical Applications: Pattern Recognition and Tomographic Reconstruction presents the necessary algorithms needed to assist screening, diagnosis, and treatment, and these algorithms will play a critical role in the accurate detection of abnormalities present in biomedical imaging. Terahertz biomedical imaging has become an area of interest due to its ability to simultaneously acquire both image and spectral information. Terahertz imaging systems are being commercialized with an increasing number of trials performed in a biomedical setting. Terahertz tomographic imaging and detection technology contributes to the ability to identify opaque objects with clear boundaries,and would be useful to both in vivo and ex vivo environments. This book also: Introduces terahertz radiation techniques and provides a number of topical examples of signal and image processing, as well as machine learning Presents the most recent developments in an emerging field, terahertz radiation Utilizes new methods...

  9. Handbook on advanced design and manufacturing technologies for biomedical devices

    CERN Document Server

    2013-01-01

    The last decades have seen remarkable advances in computer-aided design, engineering and manufacturing technologies, multi-variable simulation tools, medical imaging, biomimetic design, rapid prototyping, micro- and nanomanufacturing methods and information management resources, all of which provide new horizons for the biomedical engineering fields and the medical device industry. Handbook on Advanced Design and Manufacturing Technologies for Biomedical Devices covers such topics in depth, with an applied perspective, and provides several case studies that help to analyze and understand the key factors of the different stages linked to the development of a novel biomedical device, from the conceptual and design steps to the prototyping and industrialization phases. Main research challenges and future potentials are also discussed, taking into account relevant social demands and a growing market already exceeding billions of dollars. In time, advanced biomedical devices will decisively change methods and results...

  10. Advancement of Women in the Biomedical Workforce: Insights for Success.

    Science.gov (United States)

    Barfield, Whitney L; Plank-Bazinet, Jennifer L; Austin Clayton, Janine

    2016-08-01

    Women continue to face unique barriers in the biomedical workforce that affect their advancement and retention in this field. The National Institutes of Health (NIH) formed the Working Group on Women in Biomedical Careers to address these issues. Through the efforts of the working group, the NIH funded 14 research grants to identify barriers or to develop and/or test interventions to support women in the biomedical workforce. The grantees that were funded through this endeavor later established the grassroots Research Partnership on Women in Biomedical Careers, and they continue to conduct research and disseminate information on the state of women in academic medicine. This Commentary explores the themes introduced in a collection of articles organized by the research partnership and published in this issue of Academic Medicine. The authors highlight the role that government plays in the advancement of women in academic medicine and highlight the findings put forward in this collection.

  11. Extraction as a source of additional information when concentrations in multicomponent systems are simultaneously determined

    International Nuclear Information System (INIS)

    Using the photometric determination of Nd and Sm in each other's presence as an example, the possibility of exploiting the influence of extraction to increase the analytical signal is considered. It is shown that interligand exchange in extracts, combined with the simultaneous determination of concentrations, can be used as a simple means of increasing the accuracy of determination. 5 refs.; 2 figs.; 3 tabs
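
    The "simultaneous determination of concentrations" rests on the linearity of Beer's law: absorbances measured at two wavelengths give a linear system A = E c that can be solved for both concentrations at once. A minimal sketch with made-up molar absorptivities (not data for Nd and Sm):

    ```python
    import numpy as np

    # Hypothetical absorptivity matrix: rows = wavelengths, cols = components.
    E = np.array([[120.0, 15.0],    # absorptivities at wavelength 1
                  [20.0, 95.0]])    # absorptivities at wavelength 2
    c_true = np.array([0.002, 0.004])
    A = E @ c_true                  # simulated absorbance measurements

    # Solving the two-wavelength system recovers both concentrations.
    c = np.linalg.solve(E, A)
    print(c)  # [0.002 0.004]
    ```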

  12. Cluster-Based Query Expansion Using Language Modeling for Biomedical Literature Retrieval

    Science.gov (United States)

    Xu, Xuheng

    2011-01-01

    The tremendously large volume of biomedical literature, scientists' specific information needs, long terms of multiple words, and the fundamental problems of synonymy and polysemy have been challenging issues facing researchers in the biomedical information retrieval community. Search engines have significantly improved the efficiency and effectiveness of…
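
    The core idea of cluster-based expansion with a unigram language model can be sketched briefly. The documents, query and smoothing below are illustrative assumptions, not the dissertation's exact model: terms are scored by their smoothed probability in a cluster of pseudo-relevant documents, and the best ones are appended to the query.

    ```python
    from collections import Counter

    # Hypothetical cluster of pseudo-relevant documents for the query.
    cluster = [
        "myocardial infarction risk factors in diabetic patients",
        "acute myocardial infarction treatment and risk reduction",
        "risk of infarction after coronary intervention",
    ]
    query = ["myocardial", "infarction"]

    tf = Counter(tok for doc in cluster for tok in doc.split())
    total = sum(tf.values())
    # P(term | cluster) with add-one smoothing; exclude existing query terms.
    scores = {t: (tf[t] + 1) / (total + len(tf)) for t in tf if t not in query}
    expansion = sorted(scores, key=scores.get, reverse=True)[:2]
    print(query + expansion)  # e.g. ['myocardial', 'infarction', 'risk', 'factors']
    ```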

  13. Extracting respiratory information from seismocardiogram signals acquired on the chest using a miniature accelerometer

    International Nuclear Information System (INIS)

    Seismocardiography (SCG) is a non-invasive measurement of the vibrations of the chest caused by the heartbeat. SCG signals can be measured using a miniature accelerometer attached to the chest, and are thus well-suited for unobtrusive and long-term patient monitoring. Additionally, SCG contains information relating to both cardiovascular and respiratory systems. In this work, algorithms were developed for extracting three respiration-dependent features of the SCG signal: intensity modulation, timing interval changes within each heartbeat, and timing interval changes between successive heartbeats. Simultaneously with a reference respiration belt, SCG signals were measured from 20 healthy subjects and a respiration rate was estimated using each of the three SCG features and the reference signal. The agreement between each of the three accelerometer-derived respiration rate measurements was computed with respect to the respiration rate derived from the reference respiration belt. The respiration rate obtained from the intensity modulation in the SCG signal was found to be in closest agreement with the respiration rate obtained from the reference respiration belt: the bias was found to be 0.06 breaths per minute with a 95% confidence interval of −0.99 to 1.11 breaths per minute. The limits of agreement between the respiration rates estimated using SCG (intensity modulation) and the reference were within the clinically relevant ranges given in existing literature, demonstrating that SCG could be used for both cardiovascular and respiratory monitoring. Furthermore, phases of each of the three SCG parameters were investigated at four instances of a respiration cycle—start inspiration, peak inspiration, start expiration, and peak expiration—and during breath hold (apnea). The phases of the three SCG parameters observed during the respiration cycle were congruent with existing literature and physiologically expected trends. (paper)
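
    Of the three features, intensity modulation is the simplest to illustrate: the per-beat SCG amplitude rises and falls with respiration, so the dominant frequency of the beat-amplitude series estimates the respiration rate. The sketch below uses a synthetic amplitude series (one value per heartbeat) rather than real SCG data:

    ```python
    import numpy as np

    fs = 2.0       # assume beats at ~2 Hz (120 bpm): one amplitude per beat
    f_resp = 0.25  # true respiration rate: 0.25 Hz = 15 breaths per minute
    t = np.arange(0, 60, 1 / fs)
    beat_amplitude = 1.0 + 0.3 * np.sin(2 * np.pi * f_resp * t)  # modulated

    # Dominant frequency of the (de-meaned) beat-amplitude series.
    x = beat_amplitude - beat_amplitude.mean()
    spectrum = np.abs(np.fft.rfft(x))
    freqs = np.fft.rfftfreq(len(x), d=1 / fs)
    rate_hz = freqs[np.argmax(spectrum)]
    print(round(rate_hz * 60, 1), "breaths per minute")  # ≈ 15.0
    ```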

  14. Mining and Utilization of Biomedical Information in the Era of Big Data

    Institute of Scientific and Technical Information of China (English)

    时钢; 王兴梅; 黄志民; 洪松林; 闫妍; 高伟伟; 门天男

    2014-01-01

    With the development of hospital information systems, progress in medical diagnostics and the use of high-throughput experimental equipment, medical data are growing geometrically and exhibit the characteristics of big data. How to make use of existing medical information systems, and of future biomedical information construction, poses huge challenges for medical research, specimen library construction, clinical care, and medical and health regulation; it also brings unprecedented opportunities for biomedical research. Carrying out big data research is meaningful for hospital informatization and for the construction of biological specimen information databases. The application of these research techniques will become the trend in the development of biomedical science and technology, and will be a core technology of future bioinformatics research. It is therefore necessary to acquire the relevant technical knowledge, meet the infrastructure requirements and train the necessary personnel. Big data will permeate the medical field, changing medical research, clinical practice and medical management.

  15. A COMPARATIVE ANALYSIS OF WEB INFORMATION EXTRACTION TECHNIQUES DEEP LEARNING vs. NAÏVE BAYES vs. BACK PROPAGATION NEURAL NETWORKS IN WEB DOCUMENT EXTRACTION

    Directory of Open Access Journals (Sweden)

    J. Sharmila

    2016-01-01

    Full Text Available Web mining related research is becoming more important these days because a large amount of information is managed through the web, and web usage is expanding in an uncontrolled way; a particular framework is required for controlling such a large amount of information in the web space. Web mining is classified into three major divisions: web content mining, web usage mining and web structure mining. Tak-Lam Wong proposed a web content mining methodology based on Bayesian Networks (BN), learning to extract web data and discover attributes following the Bayesian approach. Motivated by that investigation, we propose a web content mining methodology based on a deep learning algorithm; deep learning is preferred over BN because BN is not set in a learning architecture of the kind planned for the proposed system. The main objective of this investigation is web document extraction utilizing different classification algorithms and their analysis. This work extracts data from web URLs and compares three classification algorithms: a deep learning algorithm, a Naive Bayes algorithm and a BPNN algorithm. Deep learning is a powerful set of techniques for learning in neural networks, applied in areas such as computer vision, speech recognition, natural language processing and biometric systems; it is a simple classification technique, is used for subsets of large fields, and requires less time for classification. Naive Bayes classifiers are a family of simple probabilistic classifiers based on applying Bayes' theorem with strong independence assumptions between the features. The BPNN algorithm is then used for classification. Initially, the training and testing dataset contains many URLs, from which the content is extracted.
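
    As a concrete reference point for one of the three classifiers compared, the sketch below trains a Naive Bayes text classifier with scikit-learn on toy documents and labels (illustrative stand-ins for the URL-extracted web content used in the paper):

    ```python
    from sklearn.feature_extraction.text import CountVectorizer
    from sklearn.naive_bayes import MultinomialNB
    from sklearn.pipeline import make_pipeline

    # Hypothetical training documents and labels.
    train_docs = [
        "gene expression analysis in cancer cells",
        "protein binding sites and molecular pathways",
        "football match results and league standings",
        "championship scores and player transfers",
    ]
    train_labels = ["biomedical", "biomedical", "sports", "sports"]

    # Bag-of-words features feeding a multinomial Naive Bayes classifier.
    model = make_pipeline(CountVectorizer(), MultinomialNB())
    model.fit(train_docs, train_labels)
    print(model.predict(["molecular pathways in cancer"]))  # ['biomedical']
    ```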

  16. Automatic Extraction of Femur Contours from Calibrated X-Ray Images using Statistical Information

    Directory of Open Access Journals (Sweden)

    Xiao Dong

    2007-09-01

    Full Text Available Automatic identification and extraction of bone contours from x-ray images is an essential first step for further medical image analysis. In this paper we propose a 3D statistical model based framework for proximal femur contour extraction from calibrated x-ray images. The automatic initialization to align the 3D model with the x-ray images is solved by an Estimation of Bayesian Network Algorithm that fits a simplified multiple-component geometrical model of the proximal femur to the x-ray data. Landmarks can be extracted from the geometrical model for the initialization of the 3D statistical model. The contour extraction is then accomplished by a joint registration and segmentation procedure. We iteratively update the extracted bone contours and an instanced 3D model to fit the x-ray images. Taking the projected silhouettes of the instanced 3D model on the registered x-ray images as templates, bone contours can be extracted by graphical model based Bayesian inference. The 3D model can then be updated by a non-rigid 2D/3D registration between the 3D statistical model and the extracted bone contours. Preliminary experiments on clinical data sets verified its validity.

  17. Molecular Biomedical Imaging Laboratory (MBIL)

    Data.gov (United States)

    Federal Laboratory Consortium — The Molecular Biomedical Imaging Laboratory (MBIL) is adjacent to, and has access to, the Department of Radiology and Imaging Sciences clinical imaging facilities. MBIL...

  18. New Directions for Biomedical Engineering

    Science.gov (United States)

    Plonsey, Robert

    1973-01-01

    Discusses the definition of "biomedical engineering" and the development of educational programs in the field. Includes detailed descriptions of the roles of bioengineers, medical engineers, and chemical engineers. (CC)

  19. John Glenn Biomedical Engineering Consortium

    Science.gov (United States)

    Nall, Marsha

    2004-01-01

    The John Glenn Biomedical Engineering Consortium is an inter-institutional research and technology development effort, beginning with ten projects in FY02, that is aimed at applying GRC expertise in fluid physics and sensor development, together with local biomedical expertise, to mitigate the risks of space flight to the health, safety, and performance of astronauts. It is anticipated that several new technologies will be developed that are applicable to both medical needs in space and on earth.

  20. Modified chitosans for biomedical applications

    OpenAIRE

    Yalınca, Zülal

    2013-01-01

    ABSTRACT: The subject of this thesis is the exploration of the suitability of chitosan and some of its derivatives for chosen biomedical applications. Chitosan-graft-poly(N-vinyl imidazole), chitosan-tripolyphosphate and ascorbyl chitosan were synthesized and characterized for specific biomedical applications in line with their chemical functionalities. Chitosan-graft-poly(N-vinyl imidazole), Chi-graft-PNVI, was synthesized by two methods: via an N-protection route and without N-pr...

  1. Biomedical engineer: an international job.

    Science.gov (United States)

    Crolet, Jean-Marie

    2007-01-01

    Biomedical engineering is an international profession for several reasons, which means that knowledge of at least one foreign language is a necessity. A geographical and structural analysis of the biomedical sector leads to the conclusion that a second foreign language should be taught. However, despite the availability of adequate means, it is not currently possible for us to set up such teaching. This paper presents the solution we have chosen in the framework of Erasmus exchanges.

  2. Hydroxyapatite coatings for biomedical applications

    CERN Document Server

    Zhang, Sam

    2013-01-01

    Hydroxyapatite coatings are of great importance in the biological and biomedical coatings fields, especially in the current era of nanotechnology and bioapplications. With a bonelike structure that promotes osseointegration, hydroxyapatite coating can be applied to otherwise bioinactive implants to make their surface bioactive, thus achieving faster healing and recovery. In addition to applications in orthopedic and dental implants, this coating can also be used in drug delivery. Hydroxyapatite Coatings for Biomedical Applications explores developments in the processing and property characteri

  3. Post-processing of Deep Web Information Extraction Based on Domain Ontology

    OpenAIRE

    Peng, T.; L. Liu

    2013-01-01

    Many methods are used to extract and process query results in the deep Web; they rely on the different structures of Web pages and the various design modes of databases, but some semantic meanings and relations are ignored. In this paper, we present an approach for post-processing deep Web query results based on domain ontology, which can exploit those semantic meanings and relations. A block identification model (BIM) based on node similarity is defined to extract data blocks that ...

  4. Biomedical Applications of NASA Science and Technology

    Science.gov (United States)

    Brown, James N., Jr.

    1968-01-01

    During the period 15 September 1968 to 14 December 1968, the NASA supported Biomedical Application Team at the Research Triangle Institute has identified 6 new problems, performed significant activities on 15 of the active problems identified previously, performed 5 computer searches of the NASA aerospace literature, and maintained one current awareness search. As a partial result of these activities, one technology transfer was accomplished. As a part of continuing problem review, 13 problems were classified inactive. Activities during the quarter involved all phases of team activity with respect to biomedical problems. As has been observed in preceding years, it has been exceedingly difficult to arrange meetings with medical investigators during the fourth quarter of the calendar year. This is a result of a combination of factors. Teaching requirements, submission of grant applications and holidays are the most significant factors involved. As a result, the numbers of new problems identified and of transfers and potential transfers are relatively low during this quarter. Most of our activities have thus been directed toward obtaining information related to problems already identified. Consequently, during the next quarter we will follow up on these activities with the expectation that transfers will be accomplished on a number of them. In addition, the normal availability of researchers to the team is expected to be restored during this quarter, permitting an increase in new problem identification activities as well as follow-up with other researchers on old problems. Another activity scheduled for the next quarter is consultation with several interested biomedical equipment manufacturers to explore means of effective interaction between the Biomedical Application Team and these companies.

  5. Information Extraction of Web Document Based on Htmlparser%基于HTMLParser的Web文献信息提取

    Institute of Scientific and Technical Information of China (English)

    龚真平

    2011-01-01

    Web page analysis based on HTMLParser can extract the link, image, meta and title information of a page. In this work, HTMLParser is used to extract the title, keywords, abstract, author, source and other information from Web documents; after cleaning, the records are stored in a MySQL database for use in subsequent data mining. This process is discussed in detail.
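
    The record describes the Java HTMLParser library; as a loose, language-neutral illustration of the same extraction step, here is a minimal sketch using Python's standard html.parser module (the sample page and field choices are hypothetical).

        from html.parser import HTMLParser

        class DocInfoParser(HTMLParser):
            """Collects title, meta fields and links from an HTML document."""
            def __init__(self):
                super().__init__()
                self.title, self.meta, self.links = "", {}, []
                self._in_title = False

            def handle_starttag(self, tag, attrs):
                attrs = dict(attrs)
                if tag == "title":
                    self._in_title = True
                elif tag == "meta" and "name" in attrs:
                    self.meta[attrs["name"]] = attrs.get("content", "")
                elif tag == "a" and "href" in attrs:
                    self.links.append(attrs["href"])

            def handle_endtag(self, tag):
                if tag == "title":
                    self._in_title = False

            def handle_data(self, data):
                if self._in_title:
                    self.title += data

        page = ('<html><head><title>Sample</title>'
                '<meta name="keywords" content="web mining"></head>'
                '<body><a href="http://example.org">x</a></body></html>')
        p = DocInfoParser()
        p.feed(page)
        # The extracted fields could then be cleaned and stored in MySQL.
        print(p.title, p.meta, p.links)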

  6. Network and Ensemble Enabled Entity Extraction in Informal Text (NEEEEIT) final report.

    Energy Technology Data Exchange (ETDEWEB)

    Kegelmeyer, W. Philip,; Shead, Timothy M. [Sandia National Laboratories, Albuquerque, NM; Dunlavy, Daniel M. [Sandia National Laboratories, Albuquerque, NM

    2013-09-01

    This SAND report summarizes the activities and outcomes of the Network and Ensemble Enabled Entity Extraction in Informal Text (NEEEEIT) LDRD project, which addressed improving the accuracy of conditional random fields for named entity recognition through the use of ensemble methods. Conditional random fields (CRFs) are powerful, flexible probabilistic graphical models often used in supervised machine learning prediction tasks associated with sequence data. Specifically, they are currently the best known option for named entity recognition (NER) in text. NER is the process of labeling words in sentences with semantic identifiers such as "person", "date", or "organization". Ensembles are a powerful statistical inference meta-method that can make most supervised machine learning methods more accurate, faster, or both. Ensemble methods are normally best suited to "unstable" classification methods with high variance error. CRFs applied to NER are very stable classifiers and, as such, would initially seem to be resistant to the benefits of ensembles. The NEEEEIT project nonetheless worked out how to generalize ensemble methods to CRFs, demonstrated that accuracy can indeed be improved by proper use of ensemble techniques, and generated a new CRF code, "pyCrust", and a surrounding application environment, "NEEEEIT", which implement those improvements. The summary practical advice that results from this work, then, is: when making use of CRFs for label prediction tasks in machine learning, use the pyCrust CRF base classifier with NEEEEIT's bagging ensemble implementation. (If those codes are not available, then de-stabilize your CRF code via every means available, and generate the bagged training sets by hand.) If you have ample pre-processing computational time, do "forward feature selection" to find and remove counter-productive feature classes. Conversely
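
    Since pyCrust and NEEEEIT themselves may not be available, the following sketch shows the general bagged-CRF idea using the third-party sklearn-crfsuite package (an assumption, not the project's code); the feature functions and training data are toy placeholders. Several CRFs are trained on bootstrap resamples and their per-token labels are combined by majority vote.

        import random
        from collections import Counter
        import sklearn_crfsuite  # assumed installed: pip install sklearn-crfsuite

        def word2features(sent, i):
            w = sent[i]
            return {"lower": w.lower(), "istitle": w.istitle(), "isdigit": w.isdigit()}

        def featurize(sent):
            return [word2features(sent, i) for i in range(len(sent))]

        # Toy token sequences with NER-style labels.
        sents = [["Alice", "works", "at", "Sandia"], ["Bob", "visited", "Paris"]]
        labels = [["PER", "O", "O", "ORG"], ["PER", "O", "LOC"]]
        X = [featurize(s) for s in sents]

        def bagged_crfs(X, y, n_models=5, seed=0):
            rng = random.Random(seed)
            models = []
            for _ in range(n_models):
                idx = [rng.randrange(len(X)) for _ in range(len(X))]  # bootstrap resample
                crf = sklearn_crfsuite.CRF(algorithm="lbfgs", max_iterations=50)
                crf.fit([X[i] for i in idx], [y[i] for i in idx])
                models.append(crf)
            return models

        def vote(models, xseq):
            # Majority vote over the ensemble, token by token.
            preds = [m.predict_single(xseq) for m in models]
            return [Counter(col).most_common(1)[0][0] for col in zip(*preds)]

        models = bagged_crfs(X, labels)
        print(vote(models, featurize(["Carol", "joined", "Sandia"])))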

  7. RPCs in biomedical applications

    Energy Technology Data Exchange (ETDEWEB)

    Belli, G. [Dipartimento di Fisica Nucleare e Teorica and Sezione INFN, via A. Bassi 6, 27100 Pavia (Italy); De Vecchi, C. [Dipartimento di Fisica Nucleare e Teorica and Sezione INFN, via A. Bassi 6, 27100 Pavia (Italy); Giroletti, E. [Dipartimento di Fisica Nucleare e Teorica and Sezione INFN, via A. Bassi 6, 27100 Pavia (Italy); Guida, R. [Dipartimento di Fisica Nucleare e Teorica and Sezione INFN, via A. Bassi 6, 27100 Pavia (Italy); Musitelli, G. [Dipartimento di Fisica Nucleare e Teorica and Sezione INFN, via A. Bassi 6, 27100 Pavia (Italy); Nardo, R. [Dipartimento di Fisica Nucleare e Teorica and Sezione INFN, via A. Bassi 6, 27100 Pavia (Italy); Necchi, M.M. [Dipartimento di Fisica Nucleare e Teorica and Sezione INFN, via A. Bassi 6, 27100 Pavia (Italy); Pagano, D. [Dipartimento di Fisica Nucleare e Teorica and Sezione INFN, via A. Bassi 6, 27100 Pavia (Italy); Ratti, S.P. [Dipartimento di Fisica Nucleare e Teorica and Sezione INFN, via A. Bassi 6, 27100 Pavia (Italy); Sani, G. [Dipartimento di Fisica Nucleare e Teorica and Sezione INFN, via A. Bassi 6, 27100 Pavia (Italy); Vicini, A. [Dipartimento di Fisica Nucleare e Teorica and Sezione INFN, via A. Bassi 6, 27100 Pavia (Italy); Vitulo, P. [Dipartimento di Fisica Nucleare e Teorica and Sezione INFN, via A. Bassi 6, 27100 Pavia (Italy); Viviani, C. [Dipartimento di Fisica Nucleare e Teorica and Sezione INFN, via A. Bassi 6, 27100 Pavia (Italy)

    2006-08-15

    We are studying possible applications of Resistive Plate Chambers (RPCs) in the biomedical domain, such as Positron Emission Tomography (PET). The use of RPCs in PET can provide several improvements over the usual scintillation-based detectors. The most striking features are the extremely good spatial and time resolutions, which can be as low as 50 μm and 25 ps respectively, compared to the much higher intrinsic limits of bulk detectors. Much effort has been made to investigate suitable materials to make RPCs sensitive to 511 keV photons. For this reason, we are studying different types of coating employing high-Z materials with suitable electrical resistivity. Later investigations explored the possibility of coating glass electrodes by means of serigraphy techniques, employing oxide-based mixtures with a high density of high-Z materials; the efficiency depends strongly on the coating thickness and reaches a maximum at a characteristic value that is a function of the compound (usually a few hundred microns). The most promising mixtures seem to be PbO, Bi2O3 and Tl2O. Preliminary gamma efficiency measurements for a Multigap RPC (MRPC) prototype are presented, as well as simulations using a GEANT4-based framework. The MRPC has 5 gas gaps; their spacings are maintained by 0.3 mm diameter nylon fishing line, and the electrodes are made of thin glasses (1 mm for the outer electrodes, 0.15-0.4 mm for the inner ones). The detector is enclosed in a metallic gas-tight box, filled with a mixture of 92.5% C2H2F4, 2.5% SF6 and 5% C4H10. Different gas mixtures with increased SF6 percentages are being studied, and efficiency results as a function of the new mixtures will be presented.

  8. RPCs in biomedical applications

    Science.gov (United States)

    Belli, G.; De Vecchi, C.; Giroletti, E.; Guida, R.; Musitelli, G.; Nardò, R.; Necchi, M. M.; Pagano, D.; Ratti, S. P.; Sani, G.; Vicini, A.; Vitulo, P.; Viviani, C.

    2006-08-01

    We are studying possible applications of Resistive Plate Chambers (RPCs) in the biomedical domain, such as Positron Emission Tomography (PET). The use of RPCs in PET can provide several improvements over the usual scintillation-based detectors. The most striking features are the extremely good spatial and time resolutions, which can be as low as 50 μm and 25 ps respectively, compared to the much higher intrinsic limits of bulk detectors. Much effort has been made to investigate suitable materials to make RPCs sensitive to 511 keV photons. For this reason, we are studying different types of coating employing high-Z materials with suitable electrical resistivity. Later investigations explored the possibility of coating glass electrodes by means of serigraphy techniques, employing oxide-based mixtures with a high density of high-Z materials; the efficiency depends strongly on the coating thickness and reaches a maximum at a characteristic value that is a function of the compound (usually a few hundred microns). The most promising mixtures seem to be PbO, Bi2O3 and Tl2O. Preliminary gamma efficiency measurements for a Multigap RPC (MRPC) prototype are presented, as well as simulations using a GEANT4-based framework. The MRPC has 5 gas gaps; their spacings are maintained by 0.3 mm diameter nylon fishing line, and the electrodes are made of thin glasses (1 mm for the outer electrodes, 0.15-0.4 mm for the inner ones). The detector is enclosed in a metallic gas-tight box, filled with a mixture of 92.5% C2H2F4, 2.5% SF6 and 5% C4H10. Different gas mixtures with increased SF6 percentages are being studied, and efficiency results as a function of the new mixtures will be presented.

  9. Bio-SCoRes: A Smorgasbord Architecture for Coreference Resolution in Biomedical Text.

    Science.gov (United States)

    Kilicoglu, Halil; Demner-Fushman, Dina

    2016-01-01

    Coreference resolution is one of the fundamental and challenging tasks in natural language processing. Resolving coreference successfully can have a significant positive effect on downstream natural language processing tasks, such as information extraction and question answering. The importance of coreference resolution for biomedical text analysis applications has increasingly been acknowledged. One of the difficulties in coreference resolution stems from the fact that distinct types of coreference (e.g., anaphora, appositive) are expressed with a variety of lexical and syntactic means (e.g., personal pronouns, definite noun phrases), and that resolution of each combination often requires a different approach. In the biomedical domain, it is common for coreference annotation and resolution efforts to focus on specific subcategories of coreference deemed important for the downstream task. In the current work, we aim to address some of these concerns regarding coreference resolution in biomedical text. We propose a general, modular framework underpinned by a smorgasbord architecture (Bio-SCoRes), which incorporates a variety of coreference types, their mentions and allows fine-grained specification of resolution strategies to resolve coreference of distinct coreference type-mention pairs. For development and evaluation, we used a corpus of structured drug labels annotated with fine-grained coreference information. In addition, we evaluated our approach on two other corpora (i2b2/VA discharge summaries and protein coreference dataset) to investigate its generality and ease of adaptation to other biomedical text types. Our results demonstrate the usefulness of our novel smorgasbord architecture. The specific pipelines based on the architecture perform successfully in linking coreferential mention pairs, while we find that recognition of full mention clusters is more challenging. The corpus of structured drug labels (SPL) as well as the components of Bio-SCoRes and
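
    The "smorgasbord" idea of pairing each coreference type and mention type with its own resolution strategy can be pictured as a simple dispatch table; everything below (type names, strategies, data) is an invented illustration, not the actual Bio-SCoRes design.

        # Hypothetical fine-grained strategy dispatch over
        # (coreference type, mention type) pairs.

        def resolve_pronoun_anaphora(mention, candidates):
            # Toy rule: prefer the nearest preceding candidate.
            return candidates[-1] if candidates else None

        def resolve_definite_np(mention, candidates):
            # Toy rule: prefer the nearest candidate sharing a head word.
            for c in reversed(candidates):
                if c.split()[-1] == mention.split()[-1]:
                    return c
            return None

        STRATEGIES = {
            ("anaphora", "personal_pronoun"): resolve_pronoun_anaphora,
            ("anaphora", "definite_np"): resolve_definite_np,
            # ("appositive", ...): further type-specific strategies
        }

        def resolve(coref_type, mention_type, mention, candidates):
            strategy = STRATEGIES.get((coref_type, mention_type))
            return strategy(mention, candidates) if strategy else None

        print(resolve("anaphora", "definite_np", "the drug",
                      ["aspirin", "this drug", "the patient"]))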

  10. Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project

    Science.gov (United States)

    Taylor, Chris F; Field, Dawn; Sansone, Susanna-Assunta; Aerts, Jan; Apweiler, Rolf; Ashburner, Michael; Ball, Catherine A; Binz, Pierre-Alain; Bogue, Molly; Booth, Tim; Brazma, Alvis; Brinkman, Ryan R; Clark, Adam Michael; Deutsch, Eric W; Fiehn, Oliver; Fostel, Jennifer; Ghazal, Peter; Gibson, Frank; Gray, Tanya; Grimes, Graeme; Hancock, John M; Hardy, Nigel W; Hermjakob, Henning; Julian, Randall K; Kane, Matthew; Kettner, Carsten; Kinsinger, Christopher; Kolker, Eugene; Kuiper, Martin; Le Novère, Nicolas; Leebens-Mack, Jim; Lewis, Suzanna E; Lord, Phillip; Mallon, Ann-Marie; Marthandan, Nishanth; Masuya, Hiroshi; McNally, Ruth; Mehrle, Alexander; Morrison, Norman; Orchard, Sandra; Quackenbush, John; Reecy, James M; Robertson, Donald G; Rocca-Serra, Philippe; Rodriguez, Henry; Rosenfelder, Heiko; Santoyo-Lopez, Javier; Scheuermann, Richard H; Schober, Daniel; Smith, Barry; Snape, Jason; Stoeckert, Christian J; Tipton, Keith; Sterk, Peter; Untergasser, Andreas; Vandesompele, Jo; Wiemann, Stefan

    2009-01-01

    The Minimum Information for Biological and Biomedical Investigations (MIBBI) project provides a resource for those exploring the range of extant minimum information checklists and fosters coordinated development of such checklists. PMID:18688244

  11. Biomedical discovery acceleration, with applications to craniofacial development.

    Directory of Open Access Journals (Sweden)

    Sonia M Leach

    2009-03-01

    Full Text Available The profusion of high-throughput instruments and the explosion of new results in the scientific literature, particularly in molecular biomedicine, is both a blessing and a curse to the bench researcher. Even knowledgeable and experienced scientists can benefit from computational tools that help navigate this vast and rapidly evolving terrain. In this paper, we describe a novel computational approach to this challenge, a knowledge-based system that combines reading, reasoning, and reporting methods to facilitate analysis of experimental data. Reading methods extract information from external resources, either by parsing structured data or using biomedical language processing to extract information from unstructured data, and track knowledge provenance. Reasoning methods enrich the knowledge that results from reading by, for example, noting two genes that are annotated to the same ontology term or database entry. Reasoning is also used to combine all sources into a knowledge network that represents the integration of all sorts of relationships between a pair of genes, and to calculate a combined reliability score. Reporting methods combine the knowledge network with a congruent network constructed from experimental data and visualize the combined network in a tool that facilitates the knowledge-based analysis of that data. An implementation of this approach, called the Hanalyzer, is demonstrated on a large-scale gene expression array dataset relevant to craniofacial development. The use of the tool was critical in the creation of hypotheses regarding the roles of four genes never previously characterized as involved in craniofacial development; each of these hypotheses was validated by further experimental work.

  12. Biomedical discovery acceleration, with applications to craniofacial development.

    Science.gov (United States)

    Leach, Sonia M; Tipney, Hannah; Feng, Weiguo; Baumgartner, William A; Kasliwal, Priyanka; Schuyler, Ronald P; Williams, Trevor; Spritz, Richard A; Hunter, Lawrence

    2009-03-01

    The profusion of high-throughput instruments and the explosion of new results in the scientific literature, particularly in molecular biomedicine, is both a blessing and a curse to the bench researcher. Even knowledgeable and experienced scientists can benefit from computational tools that help navigate this vast and rapidly evolving terrain. In this paper, we describe a novel computational approach to this challenge, a knowledge-based system that combines reading, reasoning, and reporting methods to facilitate analysis of experimental data. Reading methods extract information from external resources, either by parsing structured data or using biomedical language processing to extract information from unstructured data, and track knowledge provenance. Reasoning methods enrich the knowledge that results from reading by, for example, noting two genes that are annotated to the same ontology term or database entry. Reasoning is also used to combine all sources into a knowledge network that represents the integration of all sorts of relationships between a pair of genes, and to calculate a combined reliability score. Reporting methods combine the knowledge network with a congruent network constructed from experimental data and visualize the combined network in a tool that facilitates the knowledge-based analysis of that data. An implementation of this approach, called the Hanalyzer, is demonstrated on a large-scale gene expression array dataset relevant to craniofacial development. The use of the tool was critical in the creation of hypotheses regarding the roles of four genes never previously characterized as involved in craniofacial development; each of these hypotheses was validated by further experimental work.

  13. Author Keywords in Biomedical Journal Articles

    OpenAIRE

    Névéol, Aurélie; Doğan, Rezarta Islamaj; Lu, Zhiyong

    2010-01-01

    As an information retrieval system, PubMed® aims at providing efficient access to documents cited in MEDLINE®. For this purpose, it relies on matching representations of documents, as provided by authors and indexers to user queries. In this paper, we describe the growth of author keywords in biomedical journal articles and present a comparative study of author keywords and MeSH® indexing terms assigned by MEDLINE indexers to PubMed Central Open Access articles. A similarity metric is used to...

  14. All India Seminar on Biomedical Engineering 2012

    CERN Document Server

    Bhatele, Mukta

    2013-01-01

    This book is a collection of articles presented by researchers and practitioners, including engineers, biologists, health professionals and informatics/computer scientists, interested in both theoretical advances and applications of information systems, artificial intelligence, signal processing, electronics and other engineering tools in areas related to biology and medicine in the All India Seminar on Biomedical Engineering 2012 (AISOBE 2012), organized by The Institution of Engineers (India), Jabalpur Local Centre, Jabalpur, India during November 3-4, 2012. The content of the book is useful to doctors, engineers, researchers and academicians as well as industry professionals.

  15. The biomedical discourse relation bank

    Directory of Open Access Journals (Sweden)

    Joshi Aravind

    2011-05-01

    Full Text Available Background: Identification of discourse relations, such as causal and contrastive relations, between situations mentioned in text is an important task for biomedical text-mining. A biomedical text corpus annotated with discourse relations would be very useful for developing and evaluating methods for biomedical discourse processing. However, little effort has been made to develop such an annotated resource. Results: We have developed the Biomedical Discourse Relation Bank (BioDRB), in which we have annotated explicit and implicit discourse relations in 24 open-access full-text biomedical articles from the GENIA corpus. Guidelines for the annotation were adapted from the Penn Discourse TreeBank (PDTB), which has discourse relations annotated over open-domain news articles. We introduced new conventions and modifications to the sense classification. We report reliable inter-annotator agreement of over 80% for all sub-tasks. Experiments for identifying the sense of explicit discourse connectives show the connective itself as a highly reliable indicator for coarse sense classification (accuracy 90.9% and F1 score 0.89). These results are comparable to results obtained with the same classifier on the PDTB data. With more refined sense classification, there is degradation in performance (accuracy 69.2% and F1 score 0.28), mainly due to sparsity in the data. The size of the corpus was found to be sufficient for identifying the sense of explicit connectives, with classifier performance stabilizing at about 1900 training instances. Finally, the classifier performs poorly when trained on PDTB and tested on BioDRB (accuracy 54.5% and F1 score 0.57). Conclusion: Our work shows that discourse relations can be reliably annotated in biomedical text. Coarse sense disambiguation of explicit connectives can be done with high reliability by using just the connective as a feature, but more refined sense classification requires either richer features or more
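
    The finding that the connective itself is a strong feature for coarse sense classification suggests a trivially simple baseline: memorize each connective's most frequent sense in the training data. A hedged sketch (toy data; the sense labels are illustrative, not BioDRB's exact tag set):

        from collections import Counter, defaultdict

        # Toy (connective, coarse sense) training pairs; real labels would
        # come from a corpus such as BioDRB or PDTB.
        train = [("because", "CONTINGENCY"), ("because", "CONTINGENCY"),
                 ("but", "COMPARISON"), ("but", "COMPARISON"),
                 ("then", "TEMPORAL"), ("but", "EXPANSION")]

        by_conn = defaultdict(Counter)
        for conn, sense in train:
            by_conn[conn][sense] += 1

        # Most frequent sense per connective.
        most_frequent = {c: cnt.most_common(1)[0][0] for c, cnt in by_conn.items()}

        def classify(connective, default="EXPANSION"):
            return most_frequent.get(connective, default)

        print(classify("but"))  # -> COMPARISON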

  16. Biomedical journals: keeping up and reading critically.

    Science.gov (United States)

    Chase, Karen L; DiGiacomo, Ronald F; Van Hoosier, Gerald L

    2006-09-01

    By extrapolation from studies of physicians, knowledge and practice of laboratory animal medicine and science are expected to become progressively more outdated the longer practitioners are out of school. Keeping up with current literature and practice is a challenge that necessitates the use of many different sources of continuing education. Both veterinarians and physicians consistently list journals as the most beneficial source of new information. Accordingly, they must select from the veterinary and biomedical literature articles that report original studies and systematic reviews and recognize and respond to valid new knowledge to improve diagnostic and therapeutic approaches and maintain consistent clinical skills. Other objectives include selecting journals for general information and for information relevant or specific to one's field of research. Lastly, candidates for board certification need to read articles from journals that potentially provide the basis for questions on the examination. 'High-impact' journals should be identified, and articles should be reviewed critically. In a survey of recent candidates for laboratory animal medicine board examination, these journals included Contemporary Topics (now JAALAS), Comparative Medicine, ILAR Journal, and Laboratory Animals. Strategies for coping with the challenge of staying current with the literature include wise use of technology, journal clubs, and consultation with colleagues. A laboratory animal practitioner can become a better scientist and clinician by evaluating the research performed by others. Thorough, critical review of biomedical literature is paramount to these goals. PMID:16995641

  17. Biomedical journals: keeping up and reading critically.

    Science.gov (United States)

    Chase, Karen L; DiGiacomo, Ronald F; Van Hoosier, Gerald L

    2006-09-01

    By extrapolation from studies of physicians, knowledge and practice of laboratory animal medicine and science are expected to become progressively more outdated the longer practitioners are out of school. Keeping up with current literature and practice is a challenge that necessitates the use of many different sources of continuing education. Both veterinarians and physicians consistently list journals as the most beneficial source of new information. Accordingly, they must select from the veterinary and biomedical literature articles that report original studies and systematic reviews and recognize and respond to valid new knowledge to improve diagnostic and therapeutic approaches and maintain consistent clinical skills. Other objectives include selecting journals for general information and for information relevant or specific to one's field of research. Lastly, candidates for board certification need to read articles from journals that potentially provide the basis for questions on the examination. 'High-impact' journals should be identified, and articles should be reviewed critically. In a survey of recent candidates for laboratory animal medicine board examination, these journals included Contemporary Topics (now JAALAS), Comparative Medicine, ILAR Journal, and Laboratory Animals. Strategies for coping with the challenge of staying current with the literature include wise use of technology, journal clubs, and consultation with colleagues. A laboratory animal practitioner can become a better scientist and clinician by evaluating the research performed by others. Thorough, critical review of biomedical literature is paramount to these goals.

  18. How to Select a Journal to Submit and Publish Your Biomedical Paper?

    Directory of Open Access Journals (Sweden)

    Farhad Shokraneh

    2012-03-01

    Full Text Available Introduction: The selection of a journal for publication is one of the concerns of biomedical researchers, who apply various criteria to choose an appropriate journal. Here, we have tried to collect the main criteria biomedical researchers use to select a journal to which to submit their work. Methods: We collected these criteria through focus-group conversations with researchers during our careers, feedback from participants in our scientific writing workshops, and a non-systematic review of related literature. Results: We present summative and informative guidance on the selection of biomedical journals for paper submission and publication. Conclusion: Categorized criteria, arranged as a mnemonic tool, may help authors in the journal selection process.

  19. Unstable markup: A template-based information extraction from web sites with unstable markup

    OpenAIRE

    Kolchin, Maxim; Kozlov, Fedor

    2014-01-01

    This paper presents the results of work on crawling the CEUR Workshop proceedings web site into a Linked Open Data (LOD) dataset, in the framework of the ESWC 2014 Semantic Publishing Challenge. Our approach is based on an extensible, template-dependent crawler, with DBpedia used for linking extracted entities such as the names of universities and countries.

  20. Comparison of Qinzhou bay wetland landscape information extraction by three methods

    Directory of Open Access Journals (Sweden)

    X. Chang

    2014-04-01

    and OO is 219 km2, 193.70 km2 and 217.40 km2, respectively. The result indicates that SC ranks first, followed by the OO approach, with the DT method third, when used to extract Qinzhou Bay coastal wetland.

  1. Road Extraction and Network Building from Synthetic Aperture Radar Images using A-Priori Information

    NARCIS (Netherlands)

    Dekker, R.J.

    2008-01-01

    This paper describes a method for the extraction of road networks from radar images. Three phases can be distinguished: (1) detection of road lines, (2) network building, and (3) network fusion. The method has been demonstrated on two radar images, one urban and one rural. Despite the differences, t

  2. Functionalized carbon nanotubes: biomedical applications

    Directory of Open Access Journals (Sweden)

    Vardharajula S

    2012-10-01

    Full Text Available Sandhya Vardharajula,1 Sk Z Ali,2 Pooja M Tiwari,1 Erdal Eroğlu,1 Komal Vig,1 Vida A Dennis,1 Shree R Singh1 1Center for NanoBiotechnology and Life Sciences Research, Alabama State University, Montgomery, AL, USA; 2Department of Microbiology, Osmania University, Hyderabad, India. Abstract: Carbon nanotubes (CNTs) are emerging as novel nanomaterials for various biomedical applications. CNTs can be used to deliver a variety of therapeutic agents, including biomolecules, to the target disease sites. In addition, their unparalleled optical and electrical properties make them excellent candidates for bioimaging and other biomedical applications. However, the high cytotoxicity of CNTs limits their use in humans and many biological systems. The biocompatibility and low cytotoxicity of CNTs are attributed to size, dose, duration, testing systems, and surface functionalization. The functionalization of CNTs improves their solubility and biocompatibility and alters their cellular interaction pathways, resulting in much-reduced cytotoxic effects. Functionalized CNTs are promising novel materials for a variety of biomedical applications. These potential applications are particularly enhanced by their ability to penetrate biological membranes with relatively low cytotoxicity. This review is directed towards the overview of CNTs and their functionalization for biomedical applications with minimal cytotoxicity. Keywords: carbon nanotubes, cytotoxicity, functionalization, biomedical applications

  3. Introductory Course in Biomedical Ethics in the Obstetrics-Gynecology Residency.

    Science.gov (United States)

    Elkins, Thomas E.

    1988-01-01

    Information used in a brief lecture that introduces a biomedical ethics curriculum in an obstetrics and gynecology residency is described. Major components include theories of philosophic ethics (formalist and consequentialist) and principles of biomedical ethics (honesty, contract-keeping, nonmaleficence, justice, autonomy, beneficence,…

  4. Implantable biomedical devices on bioresorbable substrates

    Science.gov (United States)

    Rogers, John A; Kim, Dae-Hyeong; Omenetto, Fiorenzo; Kaplan, David L; Litt, Brian; Viventi, Jonathan; Huang, Yonggang; Amsden, Jason

    2014-03-04

    Provided herein are implantable biomedical devices, methods of administering implantable biomedical devices, methods of making implantable biomedical devices, and methods of using implantable biomedical devices to actuate a target tissue or sense a parameter associated with the target tissue in a biological environment. Each implantable biomedical device comprises a bioresorbable substrate, an electronic device having a plurality of inorganic semiconductor components supported by the bioresorbable substrate, and a barrier layer encapsulating at least a portion of the inorganic semiconductor components. Upon contact with a biological environment the bioresorbable substrate is at least partially resorbed, thereby establishing conformal contact between the implantable biomedical device and the target tissue in the biological environment.

  5. Extracting depth information of 3-dimensional structures from a single-view X-ray Fourier-transform hologram.

    Science.gov (United States)

    Geilhufe, J; Tieg, C; Pfau, B; Günther, C M; Guehrs, E; Schaffert, S; Eisebitt, S

    2014-10-20

    We demonstrate how information about the three-dimensional structure of an object can be extracted from a single Fourier-transform X-ray hologram. In contrast to lens-based 3D imaging approaches that provide depth information of a specimen utilizing several images from different angles or via adjusting the focus to different depths, our method capitalizes on the use of the holographically encoded phase and amplitude information of the object's wavefield. It enables single-shot measurements of 3D objects at coherent X-ray sources. As the ratio of longitudinal resolution over transverse resolution scales proportional to the diameter of the reference beam aperture over the X-ray wavelength, we expect the approach to be particularly useful in the extreme ultraviolet and soft-X-ray regime.
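
    Stated compactly (notation introduced here for illustration, not taken from the record), the scaling claim in the last sentence reads:

        \frac{\delta_{\mathrm{long}}}{\delta_{\mathrm{trans}}} \propto \frac{D_{\mathrm{ref}}}{\lambda}

    where \delta_{\mathrm{long}} and \delta_{\mathrm{trans}} are the longitudinal and transverse resolutions, D_{\mathrm{ref}} is the reference-beam aperture diameter, and \lambda is the X-ray wavelength.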

  6. Extracting Information about the Initial State from the Black Hole Radiation.

    Science.gov (United States)

    Lochan, Kinjalk; Padmanabhan, T

    2016-02-01

    The crux of the black hole information paradox is related to the fact that the complete information about the initial state of a quantum field in a collapsing spacetime is not available to future asymptotic observers, belying the expectations from a unitary quantum theory. We study the imprints of the initial quantum state contained in a specific class of distortions of the black hole radiation and identify the classes of in states that can be partially or fully reconstructed from the information contained within. Even for the general in state, we can uncover some specific information. These results suggest that a classical collapse scenario ignores this richness of information in the resulting spectrum and a consistent quantum treatment of the entire collapse process might allow us to retrieve much more information from the spectrum of the final radiation.

  7. Extracting Information about the Initial State from the Black Hole Radiation.

    Science.gov (United States)

    Lochan, Kinjalk; Padmanabhan, T

    2016-02-01

    The crux of the black hole information paradox is related to the fact that the complete information about the initial state of a quantum field in a collapsing spacetime is not available to future asymptotic observers, belying the expectations from a unitary quantum theory. We study the imprints of the initial quantum state contained in a specific class of distortions of the black hole radiation and identify the classes of in states that can be partially or fully reconstructed from the information contained within. Even for the general in state, we can uncover some specific information. These results suggest that a classical collapse scenario ignores this richness of information in the resulting spectrum and a consistent quantum treatment of the entire collapse process might allow us to retrieve much more information from the spectrum of the final radiation. PMID:26894699

  8. A New Paradigm for the Extraction of Information:Application to Enhancement of Visual Information in a Medical Application

    Institute of Scientific and Technical Information of China (English)

    V. Courboulay; A. Histace; M. Ménard; C.Cavaro-Menard

    2004-01-01

    The noninvasive evaluation of cardiac function is of great interest for the diagnosis of cardiovascular diseases. Tagged cardiac MRI allows the measurement of anatomical and functional myocardial parameters. This protocol generates a dark grid which is deformed with the myocardium displacement on both Short-Axis (SA) and Long-Axis (LA) frames in a time sequence. Visual evaluation of the grid deformation allows the estimation of the displacement inside the myocardium. The work described in this paper aims to make the visual enhancement of the grid tags on cardiac MRI sequences robust and reliable, thanks to an informational formalism based on Extreme Physical Information (EPI). This approach leads to the development of an original diffusion pre-processing that improves the robustness of the visual detection and tracking of the grid of tags.
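
    The EPI-derived diffusion scheme itself is not given in the record; as a stand-in, here is a minimal sketch of classic Perona-Malik anisotropic diffusion, the generic kind of edge-preserving pre-processing such work builds on (numpy only; all parameters are arbitrary).

        import numpy as np

        def perona_malik(img, n_iter=20, kappa=15.0, lam=0.2):
            """Generic anisotropic diffusion: smooth homogeneous regions
            while preserving strong edges (such as tag lines). Not the
            EPI-based scheme of the record."""
            u = img.astype(float).copy()
            for _ in range(n_iter):
                # Differences toward the four neighbours (wrap-around
                # borders, acceptable for a sketch).
                grads = [np.roll(u, s, axis=a) - u for a in (0, 1) for s in (-1, 1)]
                # Edge-stopping conductance: small across strong gradients.
                u = u + lam * sum(np.exp(-(d / kappa) ** 2) * d for d in grads)
            return u

        smoothed = perona_malik(np.random.rand(64, 64) * 255)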

  9. Flexible sensors for biomedical technology.

    Science.gov (United States)

    Vilela, Diana; Romeo, Agostino; Sánchez, Samuel

    2016-02-01

    Flexible sensing devices have gained a great deal of attention among the scientific community in recent years. The application of flexible sensors spans over several fields, including medicine, industrial automation, robotics, security, and human-machine interfacing. In particular, non-invasive health-monitoring devices are expected to play a key role in the improvement of patient life and in reducing costs associated with clinical and biomedical diagnostic procedures. Here, we focus on recent advances achieved in flexible devices applied on the human skin for biomedical and healthcare purposes. PMID:26675174

  10. Flexible sensors for biomedical technology.

    Science.gov (United States)

    Vilela, Diana; Romeo, Agostino; Sánchez, Samuel

    2016-02-01

    Flexible sensing devices have gained a great deal of attention among the scientific community in recent years. The application of flexible sensors spans over several fields, including medicine, industrial automation, robotics, security, and human-machine interfacing. In particular, non-invasive health-monitoring devices are expected to play a key role in the improvement of patient life and in reducing costs associated with clinical and biomedical diagnostic procedures. Here, we focus on recent advances achieved in flexible devices applied on the human skin for biomedical and healthcare purposes.

  11. TennisSense: a platform for extracting semantic information from multi-camera tennis data

    OpenAIRE

    Ó Conaire, Ciarán; Kelly, Philip; Connaghan, Damien; O'Connor, Noel E.

    2009-01-01

    In this paper, we introduce TennisSense, a technology platform for the digital capture, analysis and retrieval of tennis training and matches. Our algorithms for extracting useful metadata from the overhead court camera are described and evaluated. We track the tennis ball using motion images for ball candidate detection and then link ball candidates into locally linear tracks. From these tracks we can infer when serves and rallies take place. Using background subtraction and hysteresis-type ...
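
    A bare-bones version of the motion-image ball-candidate step might look like the following OpenCV sketch (the file name and size thresholds are hypothetical); linking candidates into locally linear tracks would follow as a separate step.

        import cv2  # OpenCV 4.x

        cap = cv2.VideoCapture("match.avi")  # hypothetical overhead-camera clip
        subtractor = cv2.createBackgroundSubtractorMOG2(history=200)

        candidates_per_frame = []
        while True:
            ok, frame = cap.read()
            if not ok:
                break
            mask = subtractor.apply(frame)  # motion image
            contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL,
                                           cv2.CHAIN_APPROX_SIMPLE)
            # Small moving blobs are kept as ball candidates.
            candidates = [cv2.boundingRect(c) for c in contours
                          if 5 < cv2.contourArea(c) < 150]
            candidates_per_frame.append(candidates)
        cap.release()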

  12. Ontology-based information extraction from the configuration command line of network routers

    OpenAIRE

    Martínez Manzanilla, Anny Gabriela; Yannuzzi, Marcelo; Serral Gracià, René; Ramirez Almonte, Wilson

    2014-01-01

    Knowledge extraction is increasingly attracting the attention of researchers from different disciplines, as a means to automate complex tasks that rely on bulk textual resources. However, the configuration of many devices in the networking field continues to be a labor intensive task, based on the human interpretation and manual entry of commands through a text-based user interface. Typically, these Command-Line Interfaces (CLIs) are both device and vendor-specific, and thus, commands differ ...

  13. Amplitude extraction in pseudoscalar-meson photoproduction: towards a situation of complete information

    OpenAIRE

    Nys, Jannes; Vrancx, Tom; Ryckebusch, Jan

    2015-01-01

    A complete set for pseudoscalar-meson photoproduction is a minimum set of observables from which one can determine the underlying reaction amplitudes unambiguously. The complete sets considered in this work involve single- and double-polarization observables. It is argued that for extracting amplitudes from data, the transversity representation of the reaction amplitudes offers advantages over alternate representations. It is shown that with the available single-polarization data for the p({\\...

  14. Extracting information from non adiabatic dynamics: excited symmetric states of the Bose-Hubbard model

    OpenAIRE

    Lacki, M.; Delande, D; Zakrzewski, J.

    2011-01-01

    Using a Fourier transform of a time series generated by unitary evolution, we extract many-body eigenstates of the Bose-Hubbard model corresponding to the low-energy excitations that are generated when the insulator-superfluid phase transition is realized in a typical experiment. The analysis is conducted for a symmetric external potential, both with and without disorder. A simple classification of excitations in the absence of disorder is provided. The evolution is performed assuming the presence

  15. Biomedical Application of Laser

    Institute of Scientific and Technical Information of China (English)

    K. X. He; Alan Chow; Jiada Mo; Wang Zhuo

    2004-01-01

    INTRODUCTION: Lasers have revolutionized research and development in medicine and dentistry. They have led to the development and production of many new products. Laser applications in diagnosis, treatment and surgery are enormous and have led to speedier and more efficient results, as well as better and quicker healing processes. The applications can be classified in terms of areas of use or in terms of instruments/products. In this paper, discussions will not be grouped in a particular fashion, but will address specific applications. A great deal of information on these applications can be found on the Internet; such information will be mentioned in the related discussions and given in the appendix.

  16. Extracting principles for information management adaptability during crisis response: A dynamic capability view

    NARCIS (Netherlands)

    Bharosa, N.; Janssen, M.F.W.H.A.

    2010-01-01

    During crises, relief agency commanders have to make decisions in a complex and uncertain environment, requiring them to continuously adapt to unforeseen environmental changes. In the process of adaptation, the commanders depend on information management systems for information. Yet there are still

  17. A new fast method for earthquake-triggered secondary geological hazard information extraction from high-resolution remote sensing imagery

    International Nuclear Information System (INIS)

    Earthquake-triggered secondary geological hazards usually significantly augment the destruction caused by an earthquake due to their rapid impact, long duration and high repeatability. Seismic disaster losses are often exacerbated and can cause substantial difficulties for rescue, relief and post-disaster reconstruction. The quick acquisition of disaster information is important to aid relief efforts in the 72 critical hours after an earthquake occurs. Currently, computer-aided interpretation of remotely sensed imagery of a single time phase plays a central role in information extraction for geological hazards, whereas automated interpretation is relatively limited. Because high-resolution remotely sensed imagery is rich in textures and spatial details of the imaged objects, such as their size, shape and neighborhood relationship, geologic hazard investigations employing such imagery can offer more accurate results. In this paper, a rapid method for earthquake-triggered secondary geological hazard information extraction from high-resolution remotely sensed imagery is proposed and is demonstrated to be a fast and automatic method that uses multiple image features of a hazard

  18. Sentence Simplification Aids Protein-Protein Interaction Extraction

    CERN Document Server

    Jonnalagadda, Siddhartha

    2010-01-01

    Accurate systems for extracting Protein-Protein Interactions (PPIs) automatically from biomedical articles can help accelerate biomedical research. Biomedical Informatics researchers are collaborating to provide metaservices and advance the state-of-art in PPI extraction. One problem often neglected by current Natural Language Processing systems is the characteristic complexity of the sentences in biomedical literature. In this paper, we report on the impact that automatic simplification of sentences has on the performance of a state-of-art PPI extraction system, showing a substantial improvement in recall (8%) when the sentence simplification method is applied, without significant impact to precision.
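
    As an illustration of the general idea (not the authors' simplifier), a crude pre-processing pass might split coordinated clauses before handing sentences to a PPI extractor; the splitting rule and sentences below are toy examples.

        import re

        def simplify(sentence):
            """Naive simplification: split on clause-level coordination so a
            downstream PPI extractor sees shorter sentences. Toy rule only."""
            parts = re.split(r",\s*(?:and|but|while)\s+|;\s*", sentence)
            return [p.strip().rstrip(".") + "." for p in parts if p.strip()]

        sent = ("RAD51 interacts with BRCA2, and p53 suppresses MDM2 "
                "under stress conditions.")
        for s in simplify(sent):
            print(s)
        # A PPI extractor is then run on each simplified sentence separately.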

  19. Biomedical Engineering: A Compendium of Research Training Programs.

    Science.gov (United States)

    National Inst. of General Medical Sciences (NIH), Bethesda, MD.

    This document was prepared to provide a comprehensive view of the programs in biomedical engineering in existence in 1969. These programs are supported by the National Institute of General Medical Sciences and are located at 18 universities. This compendium provides information as to the intent and content of these programs from data provided by…

  20. Schroedinger Eigenmaps for the Analysis of Bio-Medical Data

    CERN Document Server

    Czaja, Wojciech

    2011-01-01

    We introduce Schroedinger Eigenmaps, a new semi-supervised manifold learning and recovery technique. This method is based on an implementation of graph Schroedinger operators with appropriately constructed barrier potentials as carriers of labeled information. We apply it to analyze two complex bio-medical datasets: multispectral retinal images and microarray gene expressions.
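
    In the spirit of the description above, a toy version of a graph Schroedinger operator can be formed by adding a diagonal barrier potential at labeled points to a graph Laplacian and taking the low eigenvectors as an embedding; the data, affinity kernel and potential strength below are invented, and the published method's potential construction is richer than this sketch.

        import numpy as np

        # Toy data: 2-D points, two of which carry (hypothetical) labels.
        X = np.random.RandomState(0).rand(30, 2)
        labeled = [0, 29]  # indices of labeled points

        # Gaussian affinity graph and unnormalized Laplacian L = D - W.
        d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
        W = np.exp(-d2 / 0.1)
        np.fill_diagonal(W, 0.0)
        L = np.diag(W.sum(1)) - W

        # Diagonal barrier potential carrying the labeled information.
        V = np.zeros(len(X))
        V[labeled] = 10.0  # barrier strength (arbitrary)
        S = L + np.diag(V)  # graph Schroedinger operator

        # Low eigenvectors of S give the embedding (skip the first).
        vals, vecs = np.linalg.eigh(S)
        embedding = vecs[:, 1:3]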

  1. Use of dual isotope tracers in biomedical research

    NARCIS (Netherlands)

    Stellaard, F

    2005-01-01

    Biomedical stable isotope studies involve administration of tracer and measurement of isotope enrichment in blood, urine, feces or breath. The aim of the studies is to gather quantitative information about a specific metabolic function. However, the measured isotope enrichment may be affected by oth

  2. Linking attentional processes and conceptual problem solving: visual cues facilitate the automaticity of extracting relevant information from diagrams.

    Science.gov (United States)

    Rouinfar, Amy; Agra, Elise; Larson, Adam M; Rebello, N Sanjay; Loschky, Lester C

    2014-01-01

    This study investigated links between visual attention processes and conceptual problem solving by overlaying visual cues on conceptual physics problem diagrams to direct participants' attention to relevant areas and facilitate problem solving. Participants (N = 80) individually worked through four problem sets, each containing a diagram, while their eye movements were recorded. Each diagram contained regions that were relevant to solving the problem correctly and separate regions related to common incorrect responses. Problem sets contained an initial problem, six isomorphic training problems, and a transfer problem. The cued condition saw visual cues overlaid on the training problems. Participants' verbal responses were used to determine their accuracy. This study produced two major findings. First, short-duration visual cues that draw attention to solution-relevant information, and aid in organizing and integrating it, facilitate both immediate problem solving and the generalization of that ability to new problems; thus, visual cues can facilitate re-representing a problem and overcoming impasse, enabling a correct solution. Importantly, these cueing effects on problem solving did not require the solvers' attention to embody the solution to the problem; instead, they were caused by solvers attending to and integrating relevant information in the problems into a solution path. Second, this study demonstrates that when such cues are used across multiple problems, solvers can automatize the extraction of problem-relevant information. These results suggest that low-level attentional selection processes provide a necessary gateway for relevant information to be used in problem solving, but are generally not sufficient for correct problem solving; instead, factors that help a solver move past an impasse and organize and integrate problem information also greatly facilitate arriving at correct solutions.

  3. Science gateways for biomedical big data analysis

    NARCIS (Netherlands)

    S. Shahand

    2015-01-01

    Biomedical researchers are facing data deluge challenges such as dealing with large volume of complex heterogeneous data and complex and computationally demanding data processing methods. Such scale and complexity of biomedical research requires multi-disciplinary collaboration between scientists fr

  4. Functional network and its application to extract information from chaotic communication

    Institute of Scientific and Technical Information of China (English)

    李卫斌; 焦李成

    2004-01-01

    In a chaotic communication system, the useful signal is hidden in the chaotic signal, so general methods do not work well. Given the random features of chaotic signals, a functional-network-based method is presented. In this method, the neural functions are selected from a complete function set for the functional network to reconstruct the chaotic signal, so that the useful signal hidden in the chaotic background can be extracted. In addition, the learning algorithm is presented, and an example demonstrates its good performance.

  5. SEMANTIC INFORMATION EXTRACTION FROM MULTISPECTRAL GEOSPATIAL IMAGERY VIA A FLEXIBLE FRAMEWORK

    Energy Technology Data Exchange (ETDEWEB)

    Gleason, Shaun Scott [ORNL; Ferrell, Regina Kay [ORNL; Cheriyadat, Anil M [ORNL; Vatsavai, Raju [ORNL; De, Soumya [Missouri University of Science and Technology

    2010-01-01

    Identification and automatic labeling of facilities in high-resolution satellite images is a challenging task as the current thematic classification schemes and the low-level image features are not good enough to capture complex objects and their spatial relationships. In this paper we present a novel algorithm framework for automated semantic labeling of large image collections. The framework consists of various segmentation, feature extraction, vector quantization, and Latent Dirichlet Allocation modules. Initial experimental results show promise as well as the challenges in semantic classification technology development for nuclear proliferation monitoring.
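
    A stripped-down version of the vector-quantization-plus-LDA portion of such a pipeline could look like the following scikit-learn sketch (the patch features, vocabulary size and topic count are arbitrary stand-ins for the framework's modules):

        import numpy as np
        from sklearn.cluster import KMeans
        from sklearn.decomposition import LatentDirichletAllocation

        rng = np.random.RandomState(0)
        # Stand-in for low-level features: one descriptor per image patch.
        patches_per_image = [rng.rand(50, 8) for _ in range(20)]

        # Vector quantization: build a visual-word vocabulary over all patches.
        kmeans = KMeans(n_clusters=16, n_init=10, random_state=0)
        kmeans.fit(np.vstack(patches_per_image))

        # Bag-of-visual-words counts per image.
        counts = np.array([np.bincount(kmeans.predict(p), minlength=16)
                           for p in patches_per_image])

        # Latent Dirichlet Allocation over visual words -> latent semantic topics.
        lda = LatentDirichletAllocation(n_components=4, random_state=0)
        topic_mix = lda.fit_transform(counts)
        print(topic_mix[0])  # topic proportions for the first image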

  6. Integrated Biomaterials for Biomedical Technology

    CERN Document Server

    Ramalingam, Murugan; Ramakrishna, Seeram; Kobayashi, Hisatoshi

    2012-01-01

    This cutting edge book provides all the important aspects dealing with the basic science involved in materials in biomedical technology, especially structure and properties, techniques and technological innovations in material processing and characterizations, as well as the applications. The volume consists of 12 chapters written by acknowledged experts of the biomaterials field and covers a wide range of topics and applications.

  7. Biomedical Engineering Education in Perspective

    Science.gov (United States)

    Gowen, Richard J.

    1973-01-01

    Discusses recent developments in the health care industry and their impact on the future of biomedical engineering education. Indicates that a more thorough understanding of the complex functions of the living organism can be acquired through the application of engineering techniques to problems of life sciences. (CC)

  8. Mathematical modeling in biomedical imaging

    CERN Document Server

    2009-01-01

    This volume gives an introduction to a fascinating research area to applied mathematicians. It is devoted to providing the exposition of promising analytical and numerical techniques for solving challenging biomedical imaging problems, which trigger the investigation of interesting issues in various branches of mathematics.

  9. Mathematical modeling in biomedical imaging

    CERN Document Server

    2012-01-01

    This volume reports on recent mathematical and computational advances in optical, ultrasound, and opto-acoustic tomographies. It outlines the state-of-the-art and future directions in these fields and provides readers with the most recently developed mathematical and computational tools.  It is particularly suitable for researchers and graduate students in applied mathematics and biomedical engineering.

  10. Biomedical applications of magnesium alloys

    NARCIS (Netherlands)

    Sillekens, W.H.; Bormann, D.

    2012-01-01

    This chapter deals with the emerging field of biomedical applications for magnesium-based materials, envisioning degradable implants that dissolve in the human body after having cured a particular medical condition. After outlining the background of this interest, some major aspects concerning degra

  11. The New Biomedical Technology

    OpenAIRE

    Scoggin, Charles H.

    1985-01-01

    New methods for studying the genetic information of humans in health and disease are emerging from basic science laboratories. Because these approaches are yielding fundamental insights for diagnosing and treating disease, it is important that practitioners begin to understand these methods and how they are used. Methods for genetic analysis using recombinant DNA techniques consist of isolation, separation, propagation in microorganisms and molecular hybridization of DNA. The study of RNA all...

  12. Hydrography, Spring locations in Arizona incorporating information extracted from both the USGS Geonames database and from USGS Digital Line Graphs (DLG's)., Published in 1993, Arizona State Land Department.

    Data.gov (United States)

    NSGIC GIS Inventory (aka Ramona) — This Hydrography dataset as of 1993. It is described as 'Spring locations in Arizona incorporating information extracted from both the USGS Geonames database and...

  13. New roles & responsibilities of hospital biomedical engineering.

    Science.gov (United States)

    Frisch, P H; Stone, B; Booth, P; Lui, W

    2014-01-01

    Over the last decade the changing healthcare environment has required hospitals, and specifically Biomedical Engineering, to critically evaluate, optimize, and adapt their operations. The focus is now on new technologies, changes to the environment of care, support requirements, and financial constraints. Memorial Sloan Kettering Cancer Center (MSKCC), an NIH-designated comprehensive cancer center, has been transitioning to an increasingly outpatient care environment. This transition is driving an increase in patient acuity, coupled with added urgency in support and response times. New technologies, regulatory requirements, and financial constraints have impacted operating budgets and, in some cases, resulted in reduced staffing. Specific initiatives, such as the Joint Commission's National Patient Safety Goals and the requirements for an electronic medical record, meaningful use, and ICD-10, have caused institutions to reevaluate their operations and processes, including requiring Biomedical Engineering to manage new technologies, integrations, and changes in the electromagnetic environment while optimizing operational workflow and resource utilization. This paper addresses the new and expanding responsibilities and approach of Biomedical Engineering organizations, specifically at MSKCC; it is suggested that our experience may be a template for other organizations facing similar problems. Increasing support is necessary for medical software and medical device data systems in the evolving wireless environment, including RTLS and RFID. It will be necessary to evaluate the potential impact on the growing electromagnetic environment and on connectivity, resulting in the need for dynamic and interactive testing, and to establish new operational synergies with Information Technology operations and other groups within the institution, such as nursing, facilities management, central supply, and the user departments.

  14. Investigation of the Impact of Extracting and Exchanging Health Information by Using Internet and Social Networks

    Science.gov (United States)

    Pistolis, John; Zimeras, Stelios; Chardalias, Kostas; Roupa, Zoe; Fildisis, George; Diomidous, Marianna

    2016-01-01

    Introduction: Social networks (1) have been embedded in our daily lives for a long time. They constitute a powerful tool, used nowadays for both searching for and exchanging information on different issues through Internet search engines (Google, Bing, etc.) and social networks (Facebook, Twitter, etc.). This paper presents the results of a study of the frequency and type of Internet and social network usage by the general public and by health professionals. Objectives: The objectives of the research were focused on investigating how frequently both individuals and health practitioners seek and meticulously search for health information in social media. Exchanging information is a procedure that raises issues of the reliability and quality of information. Methods: Using advanced statistical techniques, this research investigates participants' profiles in using social networks for searching and exchanging information on health issues. Results: Based on the answers, 93% of the people use the Internet to find information on health subjects. In the principal component analysis, the most important health subjects were nutrition (0.719), respiratory issues (0.790), cardiological issues (0.777) and psychological issues (0.667), with a total of 73.8%. Conclusions: The research results, based on different statistical techniques, revealed that 61.2% of the males and 56.4% of the females intended to use social networks for searching for medical information. Based on the principal component analysis, the most important sources the participants mentioned were the Internet and social networks for exchanging information on health issues. These sources proved to be of paramount importance to the participants of the study. The same holds for nursing, medical and administrative staff in hospitals. PMID:27482135
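
    The component loadings reported above can be reproduced with standard statistical tooling. The following is a minimal, hypothetical sketch (not the authors' code) of deriving such loadings from Likert-style survey responses with scikit-learn; the data, topic names and component count are invented for illustration.

    ```python
    # Hypothetical sketch: principal component loadings from survey responses.
    import numpy as np
    from sklearn.decomposition import PCA
    from sklearn.preprocessing import StandardScaler

    # Invented Likert-style responses: rows = participants, columns = topics.
    rng = np.random.default_rng(0)
    responses = rng.integers(1, 6, size=(200, 4)).astype(float)
    topics = ["nutrition", "respiratory", "cardiological", "psychological"]

    # Standardize, then fit PCA; loadings are the correlations between the
    # original items and the principal components.
    z = StandardScaler().fit_transform(responses)
    pca = PCA(n_components=2).fit(z)
    loadings = pca.components_.T * np.sqrt(pca.explained_variance_)

    for topic, load in zip(topics, loadings[:, 0]):
        print(f"{topic}: loading on PC1 = {load:+.3f}")
    print(f"total variance explained: {pca.explained_variance_ratio_.sum():.1%}")
    ```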

  16. Extracting protein dynamics information from overlapped NMR signals using relaxation dispersion difference NMR spectroscopy

    Energy Technology Data Exchange (ETDEWEB)

    Konuma, Tsuyoshi [Icahn School of Medicine at Mount Sinai, Department of Structural and Chemical Biology (United States); Harada, Erisa [Suntory Foundation for Life Sciences, Bioorganic Research Institute (Japan); Sugase, Kenji, E-mail: sugase@sunbor.or.jp, E-mail: sugase@moleng.kyoto-u.ac.jp [Kyoto University, Department of Molecular Engineering, Graduate School of Engineering (Japan)

    2015-12-15

    Protein dynamics plays important roles in many biological events, such as ligand binding and enzyme reactions. NMR is mostly used for investigating such protein dynamics in a site-specific manner. Recently, NMR has been actively applied to large proteins and intrinsically disordered proteins, which are attractive research targets. However, signal overlap, which is often observed for such proteins, hampers accurate analysis of NMR data. In this study, we have developed a new methodology called relaxation dispersion difference that can extract conformational exchange parameters from overlapped NMR signals measured using relaxation dispersion spectroscopy. In relaxation dispersion measurements, the signal intensities of fluctuating residues vary according to the Carr-Purcell-Meiboom-Gill pulsing interval, whereas those of non-fluctuating residues are constant. Therefore, subtraction of each relaxation dispersion spectrum from that with the highest signal intensities, measured at the shortest pulsing interval, leaves only the signals of the fluctuating residues. This is the principle of the relaxation dispersion difference method. This new method enabled us to extract exchange parameters from overlapped signals of heme oxygenase-1, which is a relatively large protein. The results indicate that the structural flexibility of a kink in the heme-binding site is important for efficient heme binding. Relaxation dispersion difference requires neither selectively labeled samples nor modification of pulse programs; thus it will have wide applications in protein dynamics analysis.
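
    The subtraction scheme at the core of the method is simple enough to sketch. Below is a minimal illustration (not the authors' implementation), assuming each relaxation dispersion spectrum is available as a 2D intensity array keyed by its Carr-Purcell-Meiboom-Gill pulsing interval: subtracting each spectrum from the shortest-interval reference cancels non-fluctuating residues and leaves only fluctuating ones. All array shapes and peak positions are invented.

    ```python
    # Minimal sketch of the relaxation dispersion difference idea.
    import numpy as np

    def dispersion_difference(spectra: dict[float, np.ndarray]) -> dict[float, np.ndarray]:
        """spectra maps CPMG pulsing interval (s) -> 2D spectrum array."""
        reference_interval = min(spectra)          # shortest pulsing interval
        reference = spectra[reference_interval]    # highest-intensity spectrum
        return {
            interval: reference - spectrum
            for interval, spectrum in spectra.items()
            if interval != reference_interval
        }

    # Hypothetical example: a non-fluctuating peak cancels, a fluctuating one remains.
    shape = (64, 64)
    flat = np.zeros(shape)
    flat[10, 10] = 1.0                             # non-fluctuating residue
    spectra = {}
    for tau, attenuation in [(0.5e-3, 1.0), (2e-3, 0.8), (8e-3, 0.5)]:
        s = flat.copy()
        s[30, 30] = attenuation                    # intensity varies with tau
        spectra[tau] = s

    for tau, diff in dispersion_difference(spectra).items():
        print(f"tau={tau:.4f}s: residual peaks at {np.argwhere(diff > 1e-9).tolist()}")
    ```

    Only the peak at (30, 30) survives the subtraction, which is the method's point: the difference spectra isolate residues undergoing conformational exchange.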

  17. Information Extraction Techniques Applied to B2C E-Commerce (B2C电子商务中的信息抽取技术)

    Institute of Scientific and Technical Information of China (English)

    于琨; 蔡智; 糜仲春; 蔡庆生

    2002-01-01

    After emerging from the trough of the Internet economy, B2C e-commerce is about to enter a new climax of development. Information extraction techniques are set to become one of the most important factors promoting B2C e-commerce. In this paper, we review recent progress in information extraction techniques applied to B2C e-commerce and appraise the characteristics of each technique.

  18. Architecture and data processing alternatives for the TSE computer. Volume 2: Extraction of topological information from an image by the Tse computer

    Science.gov (United States)

    Jones, J. R.; Bodenheimer, R. E.

    1976-01-01

    A simple programmable Tse processor organization and the arithmetic operations necessary for extraction of the desired topological information are described. Hardware additions to this organization are discussed, along with trade-offs peculiar to the Tse computing concept. An improved organization is presented along with the complementary software for the various arithmetic operations. The performance of the two organizations is compared in terms of speed, power, and cost. Software routines developed to extract the desired information from an image are included.
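
    The report itself concerns special-purpose Tse hardware, but the flavor of extracting topological information from an image can be conveyed with a conventional software analogue. The sketch below is a hypothetical illustration, not anything from the report: it computes a classic topological invariant, the Euler number (connected components minus holes) of a binary image, using Gray's bit-quad counting over 2x2 windows.

    ```python
    # Hypothetical analogue: Euler number of a binary image via bit-quad counts.
    import numpy as np

    def euler_number(image: np.ndarray, connectivity: int = 4) -> int:
        """Euler number (components minus holes) of a binary image."""
        img = np.pad(image.astype(bool), 1)        # pad so border quads exist
        # Foreground count inside every 2x2 window.
        q = (img[:-1, :-1].astype(int) + img[:-1, 1:]
             + img[1:, :-1] + img[1:, 1:])
        # Diagonal quads: exactly two foreground pixels on a diagonal.
        diag = ((img[:-1, :-1] & img[1:, 1:] & ~img[:-1, 1:] & ~img[1:, :-1])
                | (img[:-1, 1:] & img[1:, :-1] & ~img[:-1, :-1] & ~img[1:, 1:]))
        q1 = int(np.sum(q == 1))                   # quads with one foreground pixel
        q3 = int(np.sum(q == 3))                   # quads with three
        qd = int(np.sum(diag))                     # diagonal quads
        sign = 2 if connectivity == 4 else -2      # 4- vs 8-connectivity
        return (q1 - q3 + sign * qd) // 4

    ring = np.ones((5, 5), dtype=int)
    ring[2, 2] = 0                                 # one component, one hole
    print(euler_number(ring))                      # -> 0
    ```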

  19. Advanced Biomedical Computing Center (ABCC) | DSITP

    Science.gov (United States)

    The Advanced Biomedical Computing Center (ABCC), located in Frederick, Maryland (MD), provides HPC resources for both NIH/NCI intramural scientists and the extramural biomedical research community. Its mission is to provide HPC support, to engage in collaborative research, and to conduct in-house research in various areas of computational biology and biomedical research.

  20. Electrophysiology for biomedical engineering students: a practical and theoretical course in animal electrocorticography.

    Science.gov (United States)

    Albarracín, Ana L; Farfán, Fernando D; Coletti, Marcos A; Teruya, Pablo Y; Felice, Carmelo J

    2016-09-01

    The major challenge in laboratory teaching is applying abstract concepts in simple, direct practical lessons. However, students rarely have the opportunity to participate in a laboratory that combines practical learning with a realistic research experience. In the Biomedical Engineering program, we offer short optional courses that complement students' studies as they begin their graduation projects. The objective of these theoretical and practical courses is to introduce students to the topics of their projects. The present work describes an experience in electrophysiology that teaches undergraduate students how to extract cortical information using electrocorticographic techniques. Students actively participate in parts of the experiment and then process and analyze the data obtained with different signal-processing tools. In post-laboratory evaluations, students described the course as an exceptional opportunity for those interested in pursuing a postgraduate science program, and they fully appreciated its content.
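
    As an illustration of the kind of signal-processing exercise such a course might include (a hypothetical sketch, not the course's actual material), the following estimates power in the standard frequency bands of an electrocorticographic-style trace using Welch's method; the sampling rate, band edges and synthetic signal are all assumptions.

    ```python
    # Hypothetical sketch: band-power analysis of an ECoG-like trace.
    import numpy as np
    from scipy.signal import welch

    fs = 1000.0                                    # sampling rate in Hz (assumed)
    t = np.arange(0, 10, 1 / fs)
    rng = np.random.default_rng(0)
    # Synthetic trace: a 12 Hz rhythm buried in broadband noise.
    trace = np.sin(2 * np.pi * 12 * t) + 0.5 * rng.standard_normal(t.size)

    # Welch's method: average periodograms over overlapping segments.
    freqs, psd = welch(trace, fs=fs, nperseg=2048)

    bands = {"delta": (1, 4), "theta": (4, 8), "alpha": (8, 13),
             "beta": (13, 30), "gamma": (30, 80)}
    bin_width = freqs[1] - freqs[0]
    for name, (lo, hi) in bands.items():
        mask = (freqs >= lo) & (freqs < hi)
        power = psd[mask].sum() * bin_width        # approximate band power
        print(f"{name:>5}: {power:.4f}")
    ```

    On this synthetic input, the alpha band (8-13 Hz) dominates, since the injected rhythm sits at 12 Hz.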