WorldWideScience

Sample records for data dictionaries

  1. From Data to Dictionary

    DEFF Research Database (Denmark)

    Nielsen, Sandro; Almind, Richard

    2011-01-01

    Most online dictionaries are based on printed dictionaries or specially developed databases. However, these dictionaries do not fully satisfy the needs for help and knowledge users have, so a re-assessment of the practical and theoretical foundation is necessary. Based on the work on the Accounting... definitions, thereby creating both monolingual and bilingual dictionaries. Users access the data through online dictionaries that allow them to make structured searches. The dictionaries mainly provide help in communicative situations such as understanding, producing and translating accounting texts, but also help users acquire knowledge about general or specific accounting matters in cognitive user situations. This theoretical foundation allows lexicographers to develop dictionaries that search in structured data sets and then present data types selected because they provide help in specific situations...

  2. The ABCs of Data Dictionaries

    Science.gov (United States)

    Gould, Tate; Nicholas, Amy; Blandford, William; Ruggiero, Tony; Peters, Mary; Thayer, Sara

    2014-01-01

    This overview of the basic components of a data dictionary is designed to educate and inform IDEA Part C and Part B 619 state staff about the purpose and benefits of having up-to-date data dictionaries for their data systems. This report discusses the following topics: (1) What Is a Data Dictionary?; (2) Why Is a Data Dictionary Needed and How Can…

  3. Data-Dictionary-Editing Program

    Science.gov (United States)

    Cumming, A. P.

    1989-01-01

    Access to data-dictionary relations and attributes made more convenient. Data Dictionary Editor (DDE) application program provides more convenient read/write access to data-dictionary table ("descriptions table") via data screen using SMARTQUERY function keys. Provides three main advantages: (1) User works with table names and field names rather than with table numbers and field numbers, (2) Provides online access to definitions of data-dictionary keys, and (3) Provides displayed summary list that shows, for each datum, which data-dictionary entries currently exist for any specific relation or attribute. Computer program developed to give developers of data bases more convenient access to the OMNIBASE VAX/IDM data-dictionary relations and attributes.

  4. Oracle Data Dictionary Pocket Reference

    CERN Document Server

    Kreines, David

    2003-01-01

    If you work with Oracle, then you don't need to be told that the data dictionary is large and complex, and grows larger with each new Oracle release. It's one of the basic elements of the Oracle database you interact with regularly, but the sheer number of tables and views makes it difficult to remember which view you need, much less the name of the specific column. Want to make it simpler? The Oracle Data Dictionary Pocket Reference puts all the information you need right at your fingertips. Its handy and compact format lets you locate the table and view you need effortlessly without stoppin

  5. Data Presentation Structures in Specialised Dictionaries: Law Dictionaries with Communicative Functions

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2015-01-01

    Theoretical lexicographers have developed a range of elaborate structures to describe the arrangement of data inside dictionaries, in particular in dictionary articles. However, most of these structures have been developed on the basis of detailed analyses of print dictionaries and relatively little has been said about the arrangement of data in e-dictionaries. The relevant data types are lexicographical data providing help concerning the function(s) and use of dictionaries on search results pages. In order to create a visual hierarchy on screen that makes the most important search result data stand out, lexicographers should prioritize functional data that are directly related to and support the function(s) of dictionaries on a need-to-have/nice-to-have basis, because data presentation structures with functional focus may better help users achieve their intended goals, i.e. finding...

  6. Standardized Representation of Clinical Study Data Dictionaries with CIMI Archetypes.

    Science.gov (United States)

    Sharma, Deepak K; Solbrig, Harold R; Prud'hommeaux, Eric; Pathak, Jyotishman; Jiang, Guoqian

    2016-01-01

    Researchers commonly use a tabular format to describe and represent clinical study data. The lack of standardization of data dictionary metadata elements presents challenges for their harmonization across similar studies and impedes interoperability outside the local context. We propose that representing data dictionaries in the form of standardized archetypes can help to overcome this problem. The Archetype Modeling Language (AML) as developed by the Clinical Information Modeling Initiative (CIMI) can serve as a common format for the representation of data dictionary models. We mapped three different data dictionaries (identified from dbGAP, PheKB and TCGA) onto AML archetypes by aligning dictionary variable definitions with the AML archetype elements. The near-complete alignment of data dictionaries helped map them into valid AML models that captured all data dictionary model metadata. The outcome of the work would help subject matter experts harmonize data models for quality, semantic interoperability and better downstream data integration.

  7. MARKAL-QUEBEC dictionary and data base

    Energy Technology Data Exchange (ETDEWEB)

    Haurie, A

    1985-07-01

    This two-part volume contains the data base of the MARKAL-QUEBEC energy modeling program. The first part, which constitutes the MARKAL dictionary, contains the list of all classes and their members. The second part contains the complete data set, presented as input tables to meet the requirements of the model. The tables are regrouped in sections in accordance with the class definition: in this way, one can find successively the tables SRC, CON, PRC, DMD, and ADRATIO (one table TCH is associated with each of the first four classes). At the beginning of each section, a brief description of the input data is given.

  8. The athena data dictionary and description language

    International Nuclear Information System (INIS)

    Bazan, A.; Ghez, P.; Le Flour, T.; Lieunard, S.; Tull, C.

    2001-01-01

    The authors have developed a data object description tool suite and service for Athena consisting of: a language grammar based upon an extended proper subset of IDL 2.0, a compiler front end based upon this language grammar, JavaCC, and a Java Reflection API-like interface, and several compiler back ends which meet specific needs in ATLAS such as automatic generation of object converters, and data object scripting interfaces. The authors present here details of the work and experience to date on the Athena Definition Language and Athena Data Dictionary.

  9. A Compact In-Memory Dictionary for RDF data

    NARCIS (Netherlands)

    Bazoubandi, Hamid R.; de Rooij, Steven; Urbani, Jacopo; ten Teije, Annette; van Harmelen, Frank; Bal, Henri

    2015-01-01

    While almost all dictionary compression techniques focus on static RDF data, we present a compact in-memory RDF dictionary for dynamic and streaming data. To do so, we analysed the structure of terms in real-world datasets and observed a high degree of common prefixes. We studied the applicability
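
    The observation about shared prefixes can be illustrated with a minimal sketch. It is not the authors' data structure: the class name `PrefixDictionary`, the rule of splitting a term after its last `/` or `#`, and the example IRIs are all assumptions made for illustration. Each distinct prefix is stored once, and a term is kept as a (prefix ID, suffix) pair mapped to an integer ID.

```python
class PrefixDictionary:
    """Maps RDF terms to integer IDs, storing each shared prefix only once.

    Hypothetical illustration; not the structure described in the paper.
    """

    def __init__(self):
        self._prefix_ids = {}   # prefix string -> prefix ID
        self._prefixes = []     # prefix ID -> prefix string
        self._term_ids = {}     # (prefix ID, suffix) -> term ID
        self._terms = []        # term ID -> (prefix ID, suffix)

    @staticmethod
    def _split(term):
        # Assumed heuristic: cut after the last '/' or '#', a common
        # namespace boundary in IRIs. Terms without either get an empty prefix.
        cut = max(term.rfind("/"), term.rfind("#")) + 1
        return term[:cut], term[cut:]

    def encode(self, term):
        """Return the integer ID for a term, adding it if it is new."""
        prefix, suffix = self._split(term)
        prefix_id = self._prefix_ids.get(prefix)
        if prefix_id is None:
            prefix_id = len(self._prefixes)
            self._prefix_ids[prefix] = prefix_id
            self._prefixes.append(prefix)
        key = (prefix_id, suffix)
        term_id = self._term_ids.get(key)
        if term_id is None:
            term_id = len(self._terms)
            self._term_ids[key] = term_id
            self._terms.append(key)
        return term_id

    def decode(self, term_id):
        """Rebuild the original term from its ID."""
        prefix_id, suffix = self._terms[term_id]
        return self._prefixes[prefix_id] + suffix

    def prefix_count(self):
        """Number of distinct prefixes stored."""
        return len(self._prefixes)
```

    Because new terms can be added at any time, a structure of this kind also accepts streaming input; the memory saving depends on how many terms share each prefix.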

  10. Detailed Facility Report Data Dictionary | ECHO | US EPA

    Science.gov (United States)

    The Detailed Facility Report Data Dictionary provides users with a list of the variables and definitions that have been incorporated into the Detailed Facility Report. The Detailed Facility Report provides a concise enforcement and compliance history for a facility.

  11. The Athena data dictionary and description language

    International Nuclear Information System (INIS)

    Bazan, Alain; Bouedo, Thierry; Ghez, Philippe; Marino, Massimo; Tull, Craig

    2003-01-01

    Athena is the ATLAS off-line software framework, based upon the GAUDI architecture from LHCb. As part of ATLAS' continuing efforts to enhance and customize the architecture to meet our needs, we have developed a data object description tool suite and service for Athena. The aim is to provide a set of tools to describe, manage, integrate and use the Event Data Model at a design level according to the concepts of the Athena framework (use of patterns, relationships,...). Moreover, to ensure stability and reusability this must be fully independent from the implementation details. After an extensive investigation into the many options, we have developed a language grammar based upon a description language (IDL, ODL) to provide support for object integration in Athena. We have then developed a compiler front end based upon this language grammar, JavaCC, and a Java Reflection API-like interface. We have then used these tools to develop several compiler back ends which meet specific needs in ATLAS such as automatic generation of object converters, and data object scripting interfaces. We present here details of our work and experience to date on the Athena Definition Language and Athena Data Dictionary. (authors)

  12. Classification of Polarimetric SAR Data Using Dictionary Learning

    DEFF Research Database (Denmark)

    Vestergaard, Jacob Schack; Nielsen, Allan Aasbjerg; Dahl, Anders Lindbjerg

    2012-01-01

    This contribution deals with classification of multilook fully polarimetric synthetic aperture radar (SAR) data by learning a dictionary of crop types present in the Foulum test site. The Foulum test site contains a large number of agricultural fields, as well as lakes, forests, natural vegetation, grasslands and urban areas, which make it ideally suited for evaluation of classification algorithms. Dictionary learning centers around building a collection of image patches typical for the classification problem at hand. This requires initial manual labeling of the classes present in the data and is thus a method for supervised classification. Sparse coding of these image patches aims to maintain a proficient number of typical patches and associated labels. Data is consecutively classified by a nearest neighbor search of the dictionary elements and labeled with probabilities of each class. Each dictionary...
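
    The nearest neighbor step described above can be sketched as follows. This is a toy illustration under stated assumptions, not the authors' implementation: the layout of a dictionary element (a `patch` vector plus a `probabilities` mapping), the class labels, and the use of squared Euclidean distance are all hypothetical.

```python
def squared_distance(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def classify_patch(patch, dictionary):
    """Return the class probabilities of the dictionary element nearest to patch."""
    nearest = min(
        dictionary,
        key=lambda element: squared_distance(patch, element["patch"]),
    )
    return nearest["probabilities"]


# Hypothetical two-element dictionary with flattened 2x2 patches.
toy_dictionary = [
    {"patch": [0.9, 0.8, 0.9, 0.7], "probabilities": {"wheat": 0.8, "grass": 0.2}},
    {"patch": [0.1, 0.2, 0.1, 0.3], "probabilities": {"wheat": 0.1, "grass": 0.9}},
]

result = classify_patch([0.8, 0.9, 0.8, 0.8], toy_dictionary)
```

    Here `result` is the probability mapping of the first element, since the query patch is closer to it than to the second.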

  13. The Planetary Data System (PDS) Data Dictionary Tool (LDDTool)

    Science.gov (United States)

    Raugh, Anne C.; Hughes, John S.

    2017-10-01

    One of the major design goals of the PDS4 development effort was to provide an avenue for discipline specialists and large data preparers such as mission archivists to extend the core PDS4 Information Model (IM) to include metadata definitions specific to their own contexts. This capability is critical for the Planetary Data System - an archive that deals with a data collection that is diverse along virtually every conceivable axis. Amid such diversity, it is in the best interests of the PDS archive and its users that all extensions to the core IM follow the same design techniques, conventions, and restrictions as the core implementation itself. Notwithstanding, expecting all mission and discipline archivists seeking to define metadata for a new context to acquire expertise in information modeling, model-driven design, ontology, schema formulation, and PDS4 design conventions and philosophy is unrealistic, to say the least. To bridge that expertise gap, the PDS Engineering Node has developed the data dictionary creation tool known as “LDDTool”. This tool incorporates the same software used to maintain and extend the core IM, packaged with an interface that enables a developer to create his contextual information model using the same, open standards-based metadata framework PDS itself uses. Through this interface, the novice dictionary developer has immediate access to the common set of data types and unit classes for defining attributes, and a straightforward method for constructing classes. The more experienced developer, using the same tool, has access to more sophisticated modeling methods like abstraction and extension, and can define very sophisticated validation rules. We present the key features of the PDS Local Data Dictionary Tool, which both supports the development of extensions to the PDS4 IM, and ensures their compatibility with the IM.

  14. A Common Postsecondary Data Dictionary for Perkins Accountability

    Science.gov (United States)

    Kotamraju, Pradeep; Richards, Amanda; Wun, Jolene; Klein, Steven G.

    2010-01-01

    This project assesses the feasibility of creating a voluntary, nationwide data dictionary that can be used to standardize the reporting of postsecondary accountability reporting requirements for the Carl D. Perkins Career and Technical Education Act of 2006 (otherwise known as Perkins IV). Variables, field codes, and programming instructions,…

  15. Data dictionary services in XNAT and the Human Connectome Project

    Science.gov (United States)

    Herrick, Rick; McKay, Michael; Olsen, Timothy; Horton, William; Florida, Mark; Moore, Charles J.; Marcus, Daniel S.

    2014-01-01

    The XNAT informatics platform is an open source data management tool used by biomedical imaging researchers around the world. An important feature of XNAT is its highly extensible architecture: users of XNAT can add new data types to the system to capture the imaging and phenotypic data generated in their studies. Until recently, XNAT has had limited capacity to broadcast the meaning of these data extensions to users, other XNAT installations, and other software. We have implemented a data dictionary service for XNAT, which is currently being used on ConnectomeDB, the Human Connectome Project (HCP) public data sharing website. The data dictionary service provides a framework to define key relationships between data elements and structures across the XNAT installation. This includes not just core data representing medical imaging data or subject or patient evaluations, but also taxonomical structures, security relationships, subject groups, and research protocols. The data dictionary allows users to define metadata for data structures and their properties, such as value types (e.g., textual, integers, floats) and valid value templates, ranges, or field lists. The service provides compatibility and integration with other research data management services by enabling easy migration of XNAT data to standards-based formats such as the Resource Description Framework (RDF), JavaScript Object Notation (JSON), and Extensible Markup Language (XML). It also facilitates the conversion of XNAT's native data schema into standard neuroimaging vocabularies and structures. PMID:25071542
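
    As a rough illustration of what a data dictionary entry with a value type, a valid range, and a field list might look like, here is a minimal sketch. It does not use or reproduce XNAT's actual service or schema: the `FieldDefinition` class, its attribute names, and the example fields are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Any, List, Optional, Sequence


@dataclass
class FieldDefinition:
    """One data dictionary entry (hypothetical structure, not XNAT's schema)."""
    name: str
    value_type: type
    description: str = ""
    minimum: Optional[float] = None
    maximum: Optional[float] = None
    allowed_values: Optional[Sequence[Any]] = None

    def validate(self, value: Any) -> List[str]:
        """Return a list of problems; an empty list means the value is valid."""
        if not isinstance(value, self.value_type):
            return [
                f"{self.name}: expected {self.value_type.__name__}, "
                f"got {type(value).__name__}"
            ]
        problems = []
        if self.minimum is not None and value < self.minimum:
            problems.append(f"{self.name}: {value} is below minimum {self.minimum}")
        if self.maximum is not None and value > self.maximum:
            problems.append(f"{self.name}: {value} is above maximum {self.maximum}")
        if self.allowed_values is not None and value not in self.allowed_values:
            problems.append(f"{self.name}: {value!r} is not an allowed value")
        return problems


# Hypothetical entries for an imaging study.
age = FieldDefinition("age", int, "Age in years", minimum=0, maximum=120)
handedness = FieldDefinition("handedness", str, allowed_values=["left", "right"])
```

    Calling `age.validate(150)` returns one message about the maximum, while `handedness.validate("left")` returns an empty list. Keeping the definitions as plain data also makes them easy to serialize to formats such as JSON or XML.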

  16. Data Dictionary Services in XNAT and the Human Connectome Project

    Directory of Open Access Journals (Sweden)

    Rick Herrick

    2014-07-01

    The XNAT informatics platform is an open source data management tool used by biomedical imaging researchers around the world. An important feature of XNAT is its highly extensible architecture: users of XNAT can add new data types to the system to capture the imaging and phenotypic data generated in their studies. Until recently, XNAT has had limited capacity to broadcast the meaning of these data extensions to users, other XNAT installations, and other software. We have implemented a data dictionary service for XNAT, which is currently being used on ConnectomeDB, the Human Connectome Project (HCP) public data sharing website. The data dictionary service provides a framework to define key relationships between data elements and structures across the XNAT installation. This includes not just core data representing medical imaging data or subject or patient evaluations, but also taxonomical structures, security relationships, subject groups, and research protocols. The data dictionary allows users to define metadata for data structures and their properties, such as value types (e.g. textual, integers, floats) and valid value templates, ranges, or field lists. The service provides compatibility and integration with other research data management services by enabling easy migration of XNAT data to standards-based formats such as RDF, JSON, and XML. It also facilitates the conversion of XNAT’s native data schema into standard neuroimaging ontology structures and provenances.

  17. Knowledge Dictionary for Information Extraction on the Arabic Text Data

    Directory of Open Access Journals (Sweden)

    Wahyu Jauharis Saputra

    2013-04-01

    Information extraction is an early stage of textual data analysis. It is required to obtain information from textual data that can be used for analysis processes such as classification and categorization. Textual data are strongly influenced by language. Arabic is gaining significant attention in many studies because the Arabic language is very different from others and, in contrast to other languages, tools and research on the Arabic language are still lacking. The information extracted using the knowledge dictionary is a concept of expression. A knowledge dictionary is usually constructed manually by an expert, which takes a long time and is specific to a single problem. This paper proposes a method for automatically building a knowledge dictionary. The knowledge dictionary is formed by classifying sentences having the same concept, assuming that they will have a high similarity value. The extracted concepts can be used as features for subsequent computational processes such as classification or categorization. The dataset used in this paper was an Arabic text dataset. The extraction result was tested using a decision tree classification engine; the highest precision value obtained was 71.0% and the highest recall value was 75.0%.

  18. An Online Dictionary Learning-Based Compressive Data Gathering Algorithm in Wireless Sensor Networks.

    Science.gov (United States)

    Wang, Donghao; Wan, Jiangwen; Chen, Junying; Zhang, Qiang

    2016-09-22

    To adapt to sense signals of enormous diversities and dynamics, and to decrease the reconstruction errors caused by ambient noise, a novel online dictionary learning method-based compressive data gathering (ODL-CDG) algorithm is proposed. The proposed dictionary is learned from a two-stage iterative procedure, alternately changing between a sparse coding step and a dictionary update step. The self-coherence of the learned dictionary is introduced as a penalty term during the dictionary update procedure. The dictionary is also constrained with sparse structure. It's theoretically demonstrated that the sensing matrix satisfies the restricted isometry property (RIP) with high probability. In addition, the lower bound of necessary number of measurements for compressive sensing (CS) reconstruction is given. Simulation results show that the proposed ODL-CDG algorithm can enhance the recovery accuracy in the presence of noise, and reduce the energy consumption in comparison with other dictionary based data gathering methods.

  19. An Online Dictionary Learning-Based Compressive Data Gathering Algorithm in Wireless Sensor Networks

    Directory of Open Access Journals (Sweden)

    Donghao Wang

    2016-09-01

    To adapt to sense signals of enormous diversities and dynamics, and to decrease the reconstruction errors caused by ambient noise, a novel online dictionary learning method-based compressive data gathering (ODL-CDG) algorithm is proposed. The proposed dictionary is learned from a two-stage iterative procedure, alternately changing between a sparse coding step and a dictionary update step. The self-coherence of the learned dictionary is introduced as a penalty term during the dictionary update procedure. The dictionary is also constrained with sparse structure. It’s theoretically demonstrated that the sensing matrix satisfies the restricted isometry property (RIP) with high probability. In addition, the lower bound of necessary number of measurements for compressive sensing (CS) reconstruction is given. Simulation results show that the proposed ODL-CDG algorithm can enhance the recovery accuracy in the presence of noise, and reduce the energy consumption in comparison with other dictionary based data gathering methods.

  20. Tank Characterization Database (TCD) Data Dictionary: Version 4.0

    International Nuclear Information System (INIS)

    1996-04-01

    This document is the data dictionary for the tank characterization database (TCD) system and contains information on the data model and SYBASE® database structure. The first two parts of this document are subject areas based on the two different areas of the TCD database: sample analysis and waste inventory. Within each subject area is an alphabetical list of all the database tables contained in the subject area. Within each table definition is a brief description of the table and a list of field names and attributes. The third part, Field Descriptions, lists all field names in the database alphabetically.

  1. Iterative dictionary construction for compression of large DNA data sets.

    Science.gov (United States)

    Kuruppu, Shanika; Beresford-Smith, Bryan; Conway, Thomas; Zobel, Justin

    2012-01-01

    Genomic repositories increasingly include individual as well as reference sequences, which tend to share long identical and near-identical strings of nucleotides. However, the sequential processing used by most compression algorithms, and the volumes of data involved, mean that these long-range repetitions are not detected. An order-insensitive, disk-based dictionary construction method can detect this repeated content and use it to compress collections of sequences. We explore a dictionary construction method that improves repeat identification in large DNA data sets. Our adaptation, COMRAD, of an existing disk-based method identifies exact repeated content in collections of sequences with similarities within and across the set of input sequences. COMRAD compresses the data over multiple passes, which is an expensive process, but allows COMRAD to compress large data sets within reasonable time and space. COMRAD allows for random access to individual sequences and subsequences without decompressing the whole data set. COMRAD has no competitor in terms of the size of data sets that it can compress (extending to many hundreds of gigabytes) and, even for smaller data sets, the results are competitive compared to alternatives; as an example, 39 S. cerevisiae genomes compressed to 0.25 bits per base.

  2. Recent research in data description of the measurement property resource on common data dictionary

    Science.gov (United States)

    Lu, Tielin; Fan, Zitian; Wang, Chunxi; Liu, Xiaojing; Wang, Shuo; Zhao, Hua

    2018-03-01

    A method for describing measurement equipment data is proposed, based on property resource analysis. The common data dictionary (CDD) is applied to devices and equipment mainly in the digital factory, to improve management not only within an enterprise but also across different enterprises that share the same data environment. In this paper, we briefly describe the data flow across the whole manufacturing enterprise and the automatic triggering of the data exchange process. Furthermore, the data dictionary can be applied to measurement and control equipment, and can also be used in other industries in smart manufacturing.

  3. Dictionary learning for data recovery in positron emission tomography

    International Nuclear Information System (INIS)

    Valiollahzadeh, SeyyedMajid; Clark, John W Jr; Mawlawi, Osama

    2015-01-01

    Compressed sensing (CS) aims to recover images from fewer measurements than that governed by the Nyquist sampling theorem. Most CS methods use analytical predefined sparsifying domains such as total variation, wavelets, curvelets, and finite transforms to perform this task. In this study, we evaluated the use of dictionary learning (DL) as a sparsifying domain to reconstruct PET images from partially sampled data, and compared the results to the partially and fully sampled image (baseline). A CS model based on learning an adaptive dictionary over image patches was developed to recover missing observations in PET data acquisition. The recovery was done iteratively in two steps: a dictionary learning step and an image reconstruction step. Two experiments were performed to evaluate the proposed CS recovery algorithm: an IEC phantom study and five patient studies. In each case, 11% of the detectors of a GE PET/CT system were removed and the acquired sinogram data were recovered using the proposed DL algorithm. The recovered images (DL) as well as the partially sampled images (with detector gaps) for both experiments were then compared to the baseline. Comparisons were done by calculating RMSE, contrast recovery and SNR in ROIs drawn in the background, and spheres of the phantom as well as patient lesions. For the phantom experiment, the RMSE for the DL recovered images was 5.8% when compared with the baseline images while it was 17.5% for the partially sampled images. In the patients’ studies, RMSE for the DL recovered images was 3.8%, while it was 11.3% for the partially sampled images. Our proposed CS with DL is a good approach to recover partially sampled PET data. This approach has implications toward reducing scanner cost while maintaining accurate PET image quantification. (paper)

  4. Human Spaceflight Architecture Model (HSFAM) Data Dictionary

    Science.gov (United States)

    Shishko, Robert

    2016-01-01

    HSFAM is a data model based on the DoDAF 2.02 data model with some for purpose extensions. These extensions are designed to permit quantitative analyses regarding stakeholder concerns about technical feasibility, configuration and interface issues, and budgetary and/or economic viability.

  5. Dictionaries and distributions: Combining expert knowledge and large scale textual data content analysis: Distributed dictionary representation.

    Science.gov (United States)

    Garten, Justin; Hoover, Joe; Johnson, Kate M; Boghrati, Reihane; Iskiwitch, Carol; Dehghani, Morteza

    2018-02-01

    Theory-driven text analysis has made extensive use of psychological concept dictionaries, leading to a wide range of important results. These dictionaries have generally been applied through word count methods which have proven to be both simple and effective. In this paper, we introduce Distributed Dictionary Representations (DDR), a method that applies psychological dictionaries using semantic similarity rather than word counts. This allows for the measurement of the similarity between dictionaries and spans of text ranging from complete documents to individual words. We show how DDR enables dictionary authors to place greater emphasis on construct validity without sacrificing linguistic coverage. We further demonstrate the benefits of DDR on two real-world tasks and finally conduct an extensive study of the interaction between dictionary size and task performance. These studies allow us to examine how DDR and word count methods complement one another as tools for applying concept dictionaries and where each is best applied. Finally, we provide references to tools and resources to make this method both available and accessible to a broad psychological audience.
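
    The idea of comparing a dictionary with a span of text by semantic similarity can be sketched as below. This is a simplified reading, assuming that the dictionary and the text are each represented by the mean of their word vectors and compared by cosine similarity; the function names and the two-dimensional toy embeddings are hypothetical, and a real application would load pretrained word vectors instead.

```python
import math


def mean_vector(words, embeddings):
    """Average the vectors of the words that have an embedding."""
    vectors = [embeddings[w] for w in words if w in embeddings]
    if not vectors:
        raise ValueError("none of the words has an embedding")
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def dictionary_similarity(dictionary_words, text_words, embeddings):
    """Similarity between a concept dictionary and a span of text."""
    return cosine(
        mean_vector(dictionary_words, embeddings),
        mean_vector(text_words, embeddings),
    )


# Toy embeddings, invented for illustration only.
toy_embeddings = {
    "kind": [1.0, 0.0],
    "caring": [0.9, 0.1],
    "helpful": [0.8, 0.2],
    "cruel": [0.1, 0.9],
}
care_dictionary = ["kind", "caring"]
close = dictionary_similarity(care_dictionary, ["helpful"], toy_embeddings)
far = dictionary_similarity(care_dictionary, ["cruel"], toy_embeddings)
```

    With these toy vectors `close` is larger than `far`. Unlike a word count, the text does not need to contain any dictionary word to receive a score, which is why a short, focused dictionary can still cover varied wording.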

  6. Sparsity-promoting orthogonal dictionary updating for image reconstruction from highly undersampled magnetic resonance data

    International Nuclear Information System (INIS)

    Huang, Jinhong; Guo, Li; Feng, Qianjin; Chen, Wufan; Feng, Yanqiu

    2015-01-01

    Image reconstruction from undersampled k-space data accelerates magnetic resonance imaging (MRI) by exploiting image sparseness in certain transform domains. Employing image patch representation over a learned dictionary has the advantage of being adaptive to local image structures and thus can better sparsify images than using fixed transforms (e.g. wavelets and total variations). Dictionary learning methods have recently been introduced to MRI reconstruction, and these methods demonstrate significantly reduced reconstruction errors compared to sparse MRI reconstruction using fixed transforms. However, the synthesis sparse coding problem in dictionary learning is NP-hard and computationally expensive. In this paper, we present a novel sparsity-promoting orthogonal dictionary updating method for efficient image reconstruction from highly undersampled MRI data. The orthogonality imposed on the learned dictionary enables the minimization problem in the reconstruction to be solved by an efficient optimization algorithm which alternately updates representation coefficients, orthogonal dictionary, and missing k-space data. Moreover, both sparsity level and sparse representation contribution using updated dictionaries gradually increase during iterations to recover more details, assuming the progressively improved quality of the dictionary. Simulation and real data experimental results both demonstrate that the proposed method is approximately 10 to 100 times faster than the K-SVD-based dictionary learning MRI method and simultaneously improves reconstruction accuracy. (paper)

  7. Sparsity-promoting orthogonal dictionary updating for image reconstruction from highly undersampled magnetic resonance data.

    Science.gov (United States)

    Huang, Jinhong; Guo, Li; Feng, Qianjin; Chen, Wufan; Feng, Yanqiu

    2015-07-21

    Image reconstruction from undersampled k-space data accelerates magnetic resonance imaging (MRI) by exploiting image sparseness in certain transform domains. Employing image patch representation over a learned dictionary has the advantage of being adaptive to local image structures and thus can better sparsify images than using fixed transforms (e.g. wavelets and total variations). Dictionary learning methods have recently been introduced to MRI reconstruction, and these methods demonstrate significantly reduced reconstruction errors compared to sparse MRI reconstruction using fixed transforms. However, the synthesis sparse coding problem in dictionary learning is NP-hard and computationally expensive. In this paper, we present a novel sparsity-promoting orthogonal dictionary updating method for efficient image reconstruction from highly undersampled MRI data. The orthogonality imposed on the learned dictionary enables the minimization problem in the reconstruction to be solved by an efficient optimization algorithm which alternately updates representation coefficients, orthogonal dictionary, and missing k-space data. Moreover, both sparsity level and sparse representation contribution using updated dictionaries gradually increase during iterations to recover more details, assuming the progressively improved quality of the dictionary. Simulation and real data experimental results both demonstrate that the proposed method is approximately 10 to 100 times faster than the K-SVD-based dictionary learning MRI method and simultaneously improves reconstruction accuracy.

  8. XML Flight/Ground Data Dictionary Management

    Science.gov (United States)

    Wright, Jesse; Wiklow, Colette

    2007-01-01

    A computer program generates Extensible Markup Language (XML) files that effect coupling between the command- and telemetry-handling software running aboard a spacecraft and the corresponding software running in ground support systems. The XML files are produced by use of information from the flight software and from flight-system engineering. The XML files are converted to legacy ground-system data formats for command and telemetry, transformed into Web-based and printed documentation, and used in developing new ground-system data-handling software. Previously, the information about telemetry and command was scattered in various paper documents that were not synchronized. The process of searching and reading the documents was time-consuming and introduced errors. In contrast, the XML files contain all of the information in one place. XML structures can evolve in such a manner as to enable the addition, to the XML files, of the metadata necessary to track the changes and the associated documentation. The use of this software has reduced the extent of manual operations in developing a ground data system, thereby saving considerable time and removing errors that previously arose in the translation and transcription of software information from the flight to the ground system.

  9. The Athena Data Dictionary and Description Language

    CERN Document Server

    Bazan, A; Ghez, P; Marino, M; Tull, C

    2003-01-01

    Athena is the ATLAS off-line software framework, based upon the GAUDI architecture from LHCb. As part of ATLAS' continuing efforts to enhance and customise the architecture to meet our needs, we have developed a data object description tool suite and service for Athena. The aim is to provide a set of tools to describe, manage, integrate and use the Event Data Model at a design level according to the concepts of the Athena framework (use of patterns, relationships, ...). Moreover, to ensure stability and reusability this must be fully independent from the implementation details. After an extensive investigation into the many options, we have developed a language grammar based upon a description language (IDL, ODL) to provide support for object integration in Athena. We have then developed a compiler front end based upon this language grammar, JavaCC, and a Java Reflection API-like interface. We have then used these tools to develop several compiler back ends which meet specific needs in ATLAS such as automatic...

  10. 75 FR 22805 - Federal Travel Regulation; Relocation Allowances; Standard Data Dictionary for Collection of...

    Science.gov (United States)

    2010-04-30

    ... GENERAL SERVICES ADMINISTRATION [Proposed GSA Bulletin FTR 10-XXX; Docket 2010-0009; Sequence 1] Federal Travel Regulation; Relocation Allowances; Standard Data Dictionary for Collection of Transaction... GSA is posting online a proposed FTR bulletin that contains the data dictionary that large Federal...

  11. Dictionaries for text production

    DEFF Research Database (Denmark)

    Fuertes-Olivera, Pedro; Bergenholtz, Henning

    2018-01-01

    Dictionaries for Text Production are information tools that are designed and constructed for helping users to produce (i.e. encode) texts, both oral and written texts. These can be broadly divided into two groups: (a) specialized text production dictionaries, i.e., dictionaries that only offer...... a small amount of lexicographic data, most or all of which are typically used in a production situation, e.g. synonym dictionaries, grammar and spelling dictionaries, collocation dictionaries, concept dictionaries such as the Longman Language Activator, which is advertised as the World’s First Production...... Dictionary; (b) general text production dictionaries, i.e., dictionaries that offer all or most of the lexicographic data that are typically used in a production situation. A review of existing production dictionaries reveals that there are many specialized text production dictionaries but only a few general...

  12. Terminological reference of a knowledge-based system: the data dictionary.

    Science.gov (United States)

    Stausberg, J; Wormek, A; Kraut, U

    1995-01-01

    The development of open and integrated knowledge bases places new demands on the definition of the terminology used. The definition should be realized in a data dictionary separate from the knowledge base. As part of work on a reference model of medical knowledge, a data dictionary has been developed and used in different applications: a term definition shell, a documentation tool and a knowledge base. The data dictionary includes the part of the terminology that is largely independent of any particular knowledge model. For that reason, the data dictionary can be used as a basis for integrating knowledge bases into information systems, for knowledge sharing and reuse and for modular development of knowledge-based systems.

  13. Data base dictionary for the Oak Ridge Reservation Hydrology and Geology Study Groundwater Data Base

    Energy Technology Data Exchange (ETDEWEB)

    Thompson, B.K.

    1993-04-01

    The Oak Ridge Reservation Hydrology and Geology Study (ORRHAGS) Groundwater Data Base has been compiled to consolidate groundwater data from the three US Department of Energy facilities located on the Oak Ridge Reservation: the Oak Ridge K-25 Site, the Oak Ridge National Laboratory, and the Oak Ridge Y-12 Plant. Each of these facilities maintains its own groundwater and well construction data bases. Data were extracted from the existing data bases, converted to a consistent format, and integrated into the ORRHAGS Groundwater Data Base structures. This data base dictionary describes the data contained in the ORRHAGS Groundwater Data Base and contains information on data base structure, conventions, contents, and use.

  14. Joint seismic data denoising and interpolation with double-sparsity dictionary learning

    Science.gov (United States)

    Zhu, Lingchen; Liu, Entao; McClellan, James H.

    2017-08-01

    Seismic data quality is vital to geophysical applications, so that methods of data recovery, including denoising and interpolation, are common initial steps in the seismic data processing flow. We present a method to perform simultaneous interpolation and denoising, which is based on double-sparsity dictionary learning. This extends previous work that was for denoising only. The original double-sparsity dictionary learning algorithm is modified to track the traces with missing data by defining a masking operator that is integrated into the sparse representation of the dictionary. A weighted low-rank approximation algorithm is adopted to handle the dictionary updating as a sparse recovery optimization problem constrained by the masking operator. Compared to traditional sparse transforms with fixed dictionaries that lack the ability to adapt to complex data structures, the double-sparsity dictionary learning method learns the signal adaptively from selected patches of the corrupted seismic data, while preserving compact forward and inverse transform operators. Numerical experiments on synthetic seismic data indicate that this new method preserves more subtle features in the data set without introducing pseudo-Gibbs artifacts when compared to other directional multi-scale transform methods such as curvelets.

  15. Evaluation of the Clinical Data Dictionary (CiDD)

    Science.gov (United States)

    Lee, Myung Kyung; Min, Yul Ha; Kim, Younglan; Min, Hyo Ki; Ham, Sung Woo

    2010-01-01

    Objectives The purpose of the study was to evaluate content coverage and data quality of the Clinical Data Dictionary (CiDD) developed by the Center for Interoperable EHR (CiEHR). Methods A total of 12,994 terms were collected from 98 clinical forms of a tertiary cancer center hospital with 500 beds. After data cleaning, 9,418 terms were mapped with the data items of the CiDD by the research team, and validated by 30 doctors and nurses at the research hospital. Results Mapping results were classified into five categories: lexically mapped; semantically mapped; mapped to either a broader term or a narrower term; mapped to more than one term; and not mapped. In terms of coverage, out of 9,418 terms, 6,750 (71.7%) terms were mapped; 4,319 (45.9%) terms were lexically mapped; 2,431 (25.8%) were semantically mapped; 281 (3.0%) terms were mapped to a broader term; 43 (0.5%) were mapped to a narrower term; and 550 (5.8%) were mapped to more than one term. In terms of data quality, the CiDD has problems such as errors in concept naming and representation, redundancy in synonyms, inadequate synonyms, and ambiguity in meaning. Conclusions Although the CiDD has terms covering 72% of local clinical terms, the CiDD can be improved by cleaning up errors and redundancies, adding textual definitions or use cases of the concept, and arranging the concepts in a hierarchy. PMID:21818428
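
    The coverage percentages reported above can be re-derived from the raw counts. The short Python check below uses only the numbers stated in the abstract; the variable and function names are illustrative. It also shows that the 6,750 "mapped" terms are the sum of the lexically and semantically mapped terms.

```python
TOTAL_TERMS = 9418  # terms remaining after data cleaning, per the abstract

# Counts per mapping category, as reported in the abstract.
counts = {
    "lexically mapped": 4319,
    "semantically mapped": 2431,
    "mapped to a broader term": 281,
    "mapped to a narrower term": 43,
    "mapped to more than one term": 550,
}


def share(count, total=TOTAL_TERMS):
    """Percentage of the cleaned term set, rounded to one decimal place."""
    return round(100.0 * count / total, 1)


shares = {label: share(n) for label, n in counts.items()}
mapped = counts["lexically mapped"] + counts["semantically mapped"]
mapped_share = share(mapped)

for label, pct in shares.items():
    print(f"{label}: {pct}%")
print(f"mapped (lexical + semantic): {mapped} terms, {mapped_share}%")
```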

  16. Implementing the Freight Transportation Data Architecture : Data Element Dictionary

    Science.gov (United States)

    2015-01-01

    NCFRP Report 9: Guidance for Developing a Freight Data Architecture articulates the value of establishing architecture for linking data across modes, subjects, and levels of geography to obtain essential information for decision making. Central to th...

  17. Dictionary criticism

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2018-01-01

    Dictionary criticism is part of the lexicographical universe and reviewing of electronic and printed dictionaries is not an exercise in linguistics or in subject fields but an exercise in lexicography. It does not follow from this that dictionary reviews should not be based on a linguistic approach......, but that the linguistic approach is only one of several approaches to dictionary reviewing. Similarly, the linguistic and factual competences of reviewers should not be relegated to an insignificant position in the review process. Moreover, reviewers should define the object of their reviews, the dictionary, as a complex...... information tool with several components and in terms of significant lexicographical features: lexicographical functions, data and structures. This emphasises the fact that dictionaries are much more than mere vessels of linguistic categories, namely lexicographical tools that have been developed to fulfil...

  18. Extending the data dictionary for data/knowledge management

    Science.gov (United States)

    Hydrick, Cecile L.; Graves, Sara J.

    1988-01-01

    Current relational database technology provides the means for efficiently storing and retrieving large amounts of data. By combining techniques learned from the field of artificial intelligence with this technology, it is possible to expand the capabilities of such systems. This paper suggests using the expanded domain concept, an object-oriented organization, and the storing of knowledge rules within the relational database as a solution to the unique problems associated with CAD/CAM and engineering data.

  19. An analysis dictionary learning algorithm under a noisy data model with orthogonality constraint.

    Science.gov (United States)

    Zhang, Ye; Yu, Tenglong; Wang, Wenwu

    2014-01-01

    Two common problems are often encountered in analysis dictionary learning (ADL) algorithms. The first one is that the original clean signals for learning the dictionary are assumed to be known, which otherwise need to be estimated from noisy measurements. This, however, renders a computationally slow optimization process and potentially unreliable estimation (if the noise level is high), as represented by the Analysis K-SVD (AK-SVD) algorithm. The other problem is the trivial solution to the dictionary, for example, the null dictionary matrix that may be given by a dictionary learning algorithm, as discussed in the learning overcomplete sparsifying transform (LOST) algorithm. Here we propose a novel optimization model and an iterative algorithm to learn the analysis dictionary, where we directly employ the observed data to compute the approximate analysis sparse representation of the original signals (leading to a fast optimization procedure) and enforce an orthogonality constraint on the optimization criterion to avoid the trivial solutions. Experiments demonstrate the competitive performance of the proposed algorithm as compared with three baselines, namely, the AK-SVD, LOST, and NAAOLA algorithms.

  20. An Analysis Dictionary Learning Algorithm under a Noisy Data Model with Orthogonality Constraint

    Directory of Open Access Journals (Sweden)

    Ye Zhang

    2014-01-01

    Two common problems are often encountered in analysis dictionary learning (ADL) algorithms. The first one is that the original clean signals for learning the dictionary are assumed to be known, which otherwise need to be estimated from noisy measurements. This, however, renders a computationally slow optimization process and potentially unreliable estimation (if the noise level is high), as represented by the Analysis K-SVD (AK-SVD) algorithm. The other problem is the trivial solution to the dictionary, for example, the null dictionary matrix that may be given by a dictionary learning algorithm, as discussed in the learning overcomplete sparsifying transform (LOST) algorithm. Here we propose a novel optimization model and an iterative algorithm to learn the analysis dictionary, where we directly employ the observed data to compute the approximate analysis sparse representation of the original signals (leading to a fast optimization procedure) and enforce an orthogonality constraint on the optimization criterion to avoid the trivial solutions. Experiments demonstrate the competitive performance of the proposed algorithm as compared with three baselines, namely, the AK-SVD, LOST, and NAAOLA algorithms.

  1. The PDS4 Data Dictionary Tool - Metadata Design for Data Preparers

    Science.gov (United States)

    Raugh, A.; Hughes, J. S.

    2017-12-01

    One of the major design goals of the PDS4 development effort was to create an extendable Information Model (IM) for the archive, and to allow mission data designers/preparers to create extensions for metadata definitions specific to their own contexts. This capability is critical for the Planetary Data System - an archive that deals with a data collection that is diverse along virtually every conceivable axis. Amid such diversity in the data itself, it is in the best interests of the PDS archive and its users that all extensions to the IM follow the same design techniques, conventions, and restrictions as the core implementation itself. But it is unrealistic to expect mission data designers to acquire expertise in information modeling, model-driven design, ontology, schema formulation, and PDS4 design conventions and philosophy in order to define their own metadata. To bridge that expertise gap and bring the power of information modeling to the data label designer, the PDS Engineering Node has developed the data dictionary creation tool known as "LDDTool". This tool incorporates the same software used to maintain and extend the core IM, packaged with an interface that enables a developer to create his extension to the IM using the same, standards-based metadata framework PDS itself uses. Through this interface, the novice dictionary developer has immediate access to the common set of data types and unit classes for defining attributes, and a straight-forward method for constructing classes. The more experienced developer, using the same tool, has access to more sophisticated modeling methods like abstraction and extension, and can define context-specific validation rules. We present the key features of the PDS Local Data Dictionary Tool, which both supports the development of extensions to the PDS4 IM, and ensures their compatibility with the IM.

  2. Technical Profile of Seven Data Element Dictionary/Directory Systems. Computer Science & Technology Series.

    Science.gov (United States)

    Leong-Hong, Belkis; Marron, Beatrice

    A Data Element Dictionary/Directory (DED/D) is a software tool that is used to control and manage data elements in a uniform manner. It can serve data base administrators, systems analysts, software designers, and programmers by providing a central repository for information about data resources across organization and application lines. This…

  3. Evaluation of a data dictionary system. [information dissemination and computer systems programs

    Science.gov (United States)

    Driggers, W. G.

    1975-01-01

    The usefulness of a data dictionary/directory system was investigated for achieving optimum benefits from existing and planned investments in computer data files in the Data Systems Development Branch and the Institutional Data Systems Division. Potential applications of the data catalogue system are discussed along with an evaluation of the system. Other topics discussed include data description, data structure, programming aids, programming languages, program networks, and test data.

  4. Rdesign: A data dictionary with relational database design capabilities in Ada

    Science.gov (United States)

    Lekkos, Anthony A.; Kwok, Teresa Ting-Yin

    1986-01-01

    A data dictionary is defined to be the set of all data attributes, which describe data objects in terms of their intrinsic attributes, such as name, type, size, format and definition. It is recognized as the data base for Information Resource Management, facilitating understanding and communication about the relationship between systems applications and systems data usage and helping to achieve data independence by permitting systems applications to access data without knowledge of the location or storage characteristics of the data in the system. A research and development effort to use Ada has produced a data dictionary with data base design capabilities. This project supports data specification and analysis and offers a choice of the relational, network, and hierarchical model for logical data base design. It provides a highly integrated set of analysis and design transformation tools which range from templates for data element definition and a spreadsheet for defining functional dependencies to normalization and a logical design generator.
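
    Normalization from functional dependencies, mentioned above, usually rests on computing the closure of an attribute set. The sketch below shows that standard textbook step in Python. It is not the Rdesign tool (which was written in Ada), and the attribute names and dependencies are hypothetical examples.

```python
def attribute_closure(attributes, dependencies):
    """Return every attribute determined by `attributes` under the given FDs.

    `dependencies` is a list of (lhs, rhs) pairs of attribute sets, read lhs -> rhs.
    """
    closure = set(attributes)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in dependencies:
            # Apply an FD when its left side is covered and it adds something new.
            if set(lhs) <= closure and not set(rhs) <= closure:
                closure |= set(rhs)
                changed = True
    return closure


def is_superkey(attributes, all_attributes, dependencies):
    """A set is a superkey when its closure covers the whole relation."""
    return attribute_closure(attributes, dependencies) >= set(all_attributes)


# Hypothetical relation describing data elements in a dictionary.
ALL_ATTRIBUTES = {"element", "type", "size", "owner"}
DEPENDENCIES = [
    ({"element"}, {"type"}),
    ({"type"}, {"size"}),
]

print(attribute_closure({"element"}, DEPENDENCIES))
print(is_superkey({"element"}, ALL_ATTRIBUTES, DEPENDENCIES))
print(is_superkey({"element", "owner"}, ALL_ATTRIBUTES, DEPENDENCIES))
```

    In this example `type -> size` is a transitive dependency through a non-key attribute, which is the kind of pattern a normalization step would split into its own relation.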

  5. Building a RAPPOR with the Unknown: Privacy-Preserving Learning of Associations and Data Dictionaries

    Directory of Open Access Journals (Sweden)

    Fanti Giulia

    2016-07-01

    Techniques based on randomized response enable the collection of potentially sensitive data from clients in a privacy-preserving manner with strong local differential privacy guarantees. A recent such technology, RAPPOR [12], enables estimation of the marginal frequencies of a set of strings via privacy-preserving crowdsourcing. However, this original estimation process relies on a known dictionary of possible strings; in practice, this dictionary can be extremely large and/or unknown. In this paper, we propose a novel decoding algorithm for the RAPPOR mechanism that enables the estimation of “unknown unknowns,” i.e., strings we do not know we should be estimating. To enable learning without explicit dictionary knowledge, we develop methodology for estimating the joint distribution of multiple variables collected with RAPPOR. Our contributions are not RAPPOR-specific, and can be generalized to other local differential privacy mechanisms for learning distributions of string-valued random variables.
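
    The randomized-response idea underlying this line of work can be illustrated with the classical single-bit mechanism: each client reports its true bit with probability p and the flipped bit otherwise, and the collector inverts the known noise to estimate the population frequency. The Python sketch below shows only that textbook mechanism. It is not RAPPOR's encoding and not the decoding algorithm proposed in the paper, and the parameter values and names are assumptions chosen for illustration.

```python
import random


def randomize(bit, p_truth, rng):
    """Report the true bit with probability p_truth, otherwise the flipped bit."""
    return bit if rng.random() < p_truth else 1 - bit


def estimate_frequency(reports, p_truth):
    """Unbiased estimate of the true fraction of 1s from the noisy reports.

    E[observed] = p * f + (1 - p) * (1 - f), so f = (observed - (1 - p)) / (2p - 1).
    """
    observed = sum(reports) / len(reports)
    return (observed - (1.0 - p_truth)) / (2.0 * p_truth - 1.0)


rng = random.Random(0)   # fixed seed so the run is repeatable
TRUE_FREQUENCY = 0.30    # assumed fraction of clients holding the sensitive value
P_TRUTH = 0.75           # assumed probability of answering truthfully

true_bits = [1 if rng.random() < TRUE_FREQUENCY else 0 for _ in range(100_000)]
reports = [randomize(bit, P_TRUTH, rng) for bit in true_bits]
estimate = estimate_frequency(reports, P_TRUTH)
print(f"estimated frequency: {estimate:.3f}")
```

    No individual report reveals a client's bit with certainty, yet the aggregate estimate converges on the true frequency as the number of clients grows.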

  6. Basis Expansion Approaches for Regularized Sequential Dictionary Learning Algorithms With Enforced Sparsity for fMRI Data Analysis.

    Science.gov (United States)

    Seghouane, Abd-Krim; Iqbal, Asif

    2017-09-01

    Sequential dictionary learning algorithms have been successfully applied to functional magnetic resonance imaging (fMRI) data analysis. fMRI data sets are, however, structured data matrices with the notions of temporal smoothness in the column direction. This prior information, which can be converted into a constraint of smoothness on the learned dictionary atoms, has seldom been included in classical dictionary learning algorithms when applied to fMRI data analysis. In this paper, we tackle this problem by proposing two new sequential dictionary learning algorithms dedicated to fMRI data analysis by accounting for this prior information. These algorithms differ from the existing ones in their dictionary update stage. The steps of this stage are derived as a variant of the power method for computing the SVD. The proposed algorithms generate regularized dictionary atoms via the solution of a left regularized rank-one matrix approximation problem where temporal smoothness is enforced via regularization through basis expansion and sparse basis expansion in the dictionary update stage. Applications on synthetic data experiments and real fMRI data sets illustrating the performance of the proposed algorithms are provided.

  7. Online dictionaries

    DEFF Research Database (Denmark)

    Tarp, Sven

    2012-01-01

    This article initially provides a panoramic overview and a preliminary typologization of present and future online dictionaries based upon their application of the available technologies and suggests that the future of lexicography will be the development of highly sophisticated tools which may......, need, consultation, and data. The article then proceeds to the discussion of some advanced information science techniques that may contribute to the desired individualization. Upon this basis, it finally discusses the interaction between online dictionaries and external sources like the Internet...

  8. A Relational Data Dictionary Compatible with the National Bureau of Standards Information Resource Dictionary System.

    Science.gov (United States)

    1985-12-01

    IRDS DATA ARCHITECTURE This section presents an overview of the framework in which IRDS data is organized and presented to the user. ...system-standard schema contains twelve entity-types that conceptually can be grouped into three categories, Data, Process, and External. [Ref. 32]

  9. The GLAS Standard Data Products Specification-Data Dictionary, Version 1.0. Volume 15

    Science.gov (United States)

    Lee, Jeffrey E.

    2013-01-01

    The Geoscience Laser Altimeter System (GLAS) is the primary instrument for the ICESat (Ice, Cloud and Land Elevation Satellite) laser altimetry mission. ICESat was the benchmark Earth Observing System (EOS) mission for measuring ice sheet mass balance, cloud and aerosol heights, as well as land topography and vegetation characteristics. From 2003 to 2009, the ICESat mission provided multi-year elevation data needed to determine ice sheet mass balance as well as cloud property information, especially for stratospheric clouds common over polar areas. It also provided topography and vegetation data around the globe, in addition to the polar-specific coverage over the Greenland and Antarctic ice sheets. This document contains the data dictionary for the GLAS standard data products. It details the parameters present on GLAS standard data products. Each parameter is defined with a short name, a long name, units on product, type of variable, a long description and products that contain it. The term standard data products refers to those EOS instrument data that are routinely generated for public distribution. These products are distributed by the National Snow and Ice Data Center (NSIDC).

  10. Dictionary Snakes

    DEFF Research Database (Denmark)

    Dahl, Anders Bjorholm; Dahl, Vedrana Andersen

    2014-01-01

    for image segmentation that operates without training data. Our method is based on a probabilistic dictionary of image patches coupled with a deformable model inspired by snakes and active contours without edges. We separate the image into two classes based on the information provided by the evolving curve......, which moves according to the probabilistic information obtained from the dictionary. Initially, the image patches are assigned to the nearest dictionary element, where the image is sampled at each pixel such that patches overlap. The curve divides the image into an inside and an outside region allowing...... us to estimate the pixel-wise probability of the dictionary elements. In each iteration we evolve the curve and update the probabilities, which merges similar texture patterns and pulls dissimilar patterns apart. We experimentally evaluate our approach, and show how textured objects are precisely...

  11. Discipline, Dilemmas, Decisions and Data Distribution in the Planning and Compilation of Monolingual Dictionaries

    Directory of Open Access Journals (Sweden)

    Rufus H Gouws

    2011-10-01

    Abstract: Bilingual dictionaries play an important role in the standardisation of a language and are often the first dictionary type to be compiled for a given speech community. However, this should never lead to an underestimation of the role and importance of monolingual descriptive dictionaries in the early lexicographic development of a language. In the planning of first descriptive dictionaries the choice of the proper subtype and a consistent application of theoretical principles should be regarded as of extreme importance. Even the compilation of a restricted descriptive dictionary should be done according to similar theoretical principles as those applying to comprehensive dictionaries. This contribution indicates a number of dilemmas confronting the lexicographer during the compilation of restricted monolingual descriptive dictionaries. Attention is given to the role of lexicographic functions and the choice and presentation of lexicographic data, with special reference to the presentation of certain types of polysemous senses which are subjected to frequency of use restrictions. Emphasis is placed on the value of a heterogeneous article structure and a micro-architecture in the articles of restricted dictionaries.

    Keywords: ACCESS STRUCTURE, DATA DISTRIBUTION, FRAME STRUCTURE, FREQUENCY OF USE, HETEROGENEOUS ARTICLE STRUCTURE, LEXICOGRAPHIC FUNCTIONS, LEXICOGRAPHIC PROCESS, MICRO-ARCHITECTURE, MONOLINGUAL DICTIONARY, POLYSEMY, SEMANTIC DATA, TEXT BLOCK, USER-FRIENDLINESS, USER-PERSPECTIVE, VERTICAL ARCHITECTONIC EXTENSION

    Summary: Discipline, dilemmas, decisions and data distribution in the planning and compilation of monolingual dictionaries. Bilingual dictionaries play an important role in the standardisation of language and are often the first dictionary type to be compiled for a particular speech community. However, this should not lead to an underestimation of the role and value of monolingual descriptive dictionaries in the

  12. Towards a supervised rescoring system for unstructured data bases used to build specialized dictionaries

    Directory of Open Access Journals (Sweden)

    Antonio Rico-Sulayes

    2014-12-01

    This article proposes the architecture for a system that uses previously learned weights to sort query results from unstructured data bases when building specialized dictionaries. A common resource in the construction of dictionaries, unstructured data bases have been especially useful in providing information about lexical item frequencies and examples in use. However, when building specialized dictionaries, whose selection of lexical items does not rely on frequency, the use of these data bases is restricted to that of a simple provider of examples. Even in this task, the information unstructured data bases provide may not be very useful when looking for specialized uses of lexical items with various meanings and very long lists of results. In the face of this problem, long lists of hits can be rescored based on a supervised learning model that relies on previously helpful results. The allocation of a vast set of high quality training data for this rescoring system is reported here. Finally, the architecture of such a system, an unprecedented tool in specialized lexicography, is proposed.

  13. Three-dimensional dictionary-learning reconstruction of (23)Na MRI data.

    Science.gov (United States)

    Behl, Nicolas G R; Gnahm, Christine; Bachert, Peter; Ladd, Mark E; Nagel, Armin M

    2016-04-01

    To reduce noise and artifacts in (23)Na MRI with a Compressed Sensing reconstruction and a learned dictionary as sparsifying transform. A three-dimensional dictionary-learning compressed sensing reconstruction algorithm (3D-DLCS) for the reconstruction of undersampled 3D radial (23)Na data is presented. The dictionary used as the sparsifying transform is learned with a K-singular-value-decomposition (K-SVD) algorithm. The reconstruction parameters are optimized on simulated data, and the quality of the reconstructions is assessed with peak signal-to-noise ratio (PSNR) and structural similarity (SSIM). The performance of the algorithm is evaluated in phantom and in vivo (23)Na MRI data of seven volunteers and compared with nonuniform fast Fourier transform (NUFFT) and other Compressed Sensing reconstructions. The reconstructions of simulated data have maximal PSNR and SSIM for an undersampling factor (USF) of 10 with numbers of averages equal to the USF. For 10-fold undersampling, the PSNR is increased by 5.1 dB compared with the NUFFT reconstruction, and the SSIM by 24%. These results are confirmed by phantom and in vivo (23)Na measurements in the volunteers that show markedly reduced noise and undersampling artifacts in the case of 3D-DLCS reconstructions. The 3D-DLCS algorithm enables precise reconstruction of undersampled (23)Na MRI data with markedly reduced noise and artifact levels compared with NUFFT reconstruction. Small structures are well preserved. © 2015 Wiley Periodicals, Inc.
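
    Peak signal-to-noise ratio, one of the two quality measures named above, is defined as 10 log10(peak^2 / MSE), so a gain of 5.1 dB corresponds to roughly a 3.2-fold reduction in mean squared error at a fixed peak. The Python sketch below computes PSNR for flat lists of values. Taking the peak from the reference signal is an assumption, since the abstract does not state which peak convention the authors used, and the sample values are made up.

```python
import math


def psnr(reference, reconstruction, peak=None):
    """Peak signal-to-noise ratio in dB between two equally sized value lists."""
    if len(reference) != len(reconstruction):
        raise ValueError("inputs must have the same length")
    mse = sum((r - x) ** 2 for r, x in zip(reference, reconstruction)) / len(reference)
    if mse == 0.0:
        return math.inf  # identical signals
    if peak is None:
        # Assumed convention: largest magnitude in the reference signal.
        peak = max(abs(value) for value in reference)
    return 10.0 * math.log10(peak ** 2 / mse)


# Made-up example: a reference signal and two reconstructions of it.
reference = [0.0, 0.5, 1.0, 0.5]
noisy = [0.1, 0.6, 1.1, 0.6]           # error of 0.1 everywhere
cleaner = [0.05, 0.55, 1.05, 0.55]     # error of 0.05 everywhere

gain_db = psnr(reference, cleaner) - psnr(reference, noisy)
print(f"PSNR noisy: {psnr(reference, noisy):.2f} dB, gain: {gain_db:.2f} dB")
```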

  14. Data base dictionary for the Oak Ridge Reservation Hydrology and Geology Study Groundwater Data Base. Environmental Restoration Program

    Energy Technology Data Exchange (ETDEWEB)

    Thompson, B.K.

    1993-04-01

    The Oak Ridge Reservation Hydrology and Geology Study (ORRHAGS) Groundwater Data Base has been compiled to consolidate groundwater data from the three US Department of Energy facilities located on the Oak Ridge Reservation: the Oak Ridge K-25 Site, the Oak Ridge National Laboratory, and the Oak Ridge Y-12 Plant. Each of these facilities maintains its own groundwater and well construction data bases. Data were extracted from the existing data bases, converted to a consistent format, and integrated into the ORRHAGS Groundwater Data Base structures. This data base dictionary describes the data contained in the ORRHAGS Groundwater Data Base and contains information on data base structure, conventions, contents, and use.

  15. Extended dynamic mode decomposition with dictionary learning: A data-driven adaptive spectral decomposition of the Koopman operator.

    Science.gov (United States)

    Li, Qianxiao; Dietrich, Felix; Bollt, Erik M; Kevrekidis, Ioannis G

    2017-10-01

    Numerical approximation methods for the Koopman operator have advanced considerably in the last few years. In particular, data-driven approaches such as dynamic mode decomposition (DMD) and its generalization, the extended-DMD (EDMD), are becoming increasingly popular in practical applications. The EDMD improves upon the classical DMD by the inclusion of a flexible choice of dictionary of observables which spans a finite dimensional subspace on which the Koopman operator can be approximated. This enhances the accuracy of the solution reconstruction and broadens the applicability of the Koopman formalism. Although the convergence of the EDMD has been established, applying the method in practice requires a careful choice of the observables to improve convergence with just a finite number of terms. This is especially difficult for high dimensional and highly nonlinear systems. In this paper, we employ ideas from machine learning to improve upon the EDMD method. We develop an iterative approximation algorithm which couples the EDMD with a trainable dictionary represented by an artificial neural network. Using the Duffing oscillator and the Kuramoto-Sivashinsky partial differential equation as examples, we show that our algorithm can effectively and efficiently adapt the trainable dictionary to the problem at hand to achieve good reconstruction accuracy without the need to choose a fixed dictionary a priori. Furthermore, to obtain a given accuracy, we require fewer dictionary terms than EDMD with fixed dictionaries. This alleviates an important shortcoming of the EDMD algorithm and enhances the applicability of the Koopman framework to practical problems.
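
    For orientation, the fixed-dictionary EDMD baseline that the paper improves on can be sketched in a few lines: given snapshot pairs (x, y) with y the successor of x, it finds the matrix K that best satisfies psi(y) ≈ psi(x) K in the least-squares sense. The Python sketch below uses a hand-picked two-function dictionary and a scalar linear map so that the answer is known in closed form. The dictionary, the data, and the names are illustrative assumptions; the neural-network dictionary described in the abstract is not implemented here.

```python
def dictionary(x):
    """Fixed dictionary of observables psi(x) = (x, x^2); an illustrative choice."""
    return (x, x * x)


def edmd_matrix(xs, ys):
    """Least-squares K with psi(y) ~= psi(x) K, i.e. K = G^{-1} A.

    G = sum psi(x)^T psi(x) and A = sum psi(x)^T psi(y) over the snapshot pairs.
    The 2x2 inverse is written out by hand to keep the sketch dependency-free.
    """
    n = 2
    G = [[0.0] * n for _ in range(n)]
    A = [[0.0] * n for _ in range(n)]
    for x, y in zip(xs, ys):
        px, py = dictionary(x), dictionary(y)
        for i in range(n):
            for j in range(n):
                G[i][j] += px[i] * px[j]
                A[i][j] += px[i] * py[j]
    det = G[0][0] * G[1][1] - G[0][1] * G[1][0]
    G_inv = [[G[1][1] / det, -G[0][1] / det],
             [-G[1][0] / det, G[0][0] / det]]
    return [[sum(G_inv[i][k] * A[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


# Snapshot pairs from the scalar linear map x -> 0.9 x (assumed toy system).
xs = [0.5, 1.0, 1.5, 2.0, -1.0]
ys = [0.9 * x for x in xs]

K = edmd_matrix(xs, ys)
print(K)
```

    For this linear map the observables x and x^2 are exact Koopman eigenfunctions, so K comes out diagonal with entries close to 0.9 and 0.81. For nonlinear systems a fixed dictionary rarely spans an invariant subspace so neatly, which is the difficulty a trainable dictionary is meant to address.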

  16. Database Dictionary for Ethiopian National Ground-Water DAtabase (ENGDA) Data Fields

    Science.gov (United States)

    Kuniansky, Eve L.; Litke, David W.; Tucci, Patrick

    2007-01-01

    Introduction This document describes the data fields that are used for both field forms and the Ethiopian National Ground-water Database (ENGDA) tables associated with information stored about production wells, springs, test holes, test wells, and water level or water-quality observation wells. Several different words are used in this database dictionary and in the ENGDA database to describe a narrow shaft constructed in the ground. The most general term is borehole, which is applicable to any type of hole. A well is a borehole specifically constructed to extract water from the ground; however, for this data dictionary and for the ENGDA database, the words well and borehole are used interchangeably. A production well is defined as any well used for water supply and includes hand-dug wells, small-diameter bored wells equipped with hand pumps, or large-diameter bored wells equipped with large-capacity motorized pumps. Test holes are borings made to collect information about the subsurface with continuous core or non-continuous core and/or where geophysical logs are collected. Test holes are not converted into wells. A test well is a well constructed for hydraulic testing of an aquifer in order to plan a larger ground-water production system. A water-level or water-quality observation well is a well that is used to collect information about an aquifer and not used for water supply. A spring is any naturally flowing, local, ground-water discharge site. The database dictionary is designed to help define all fields on both field data collection forms (provided in attachment 2 of this report) and for the ENGDA software screen entry forms (described in Litke, 2007). The data entered into each screen entry field are stored in relational database tables within the computer database. The organization of the database dictionary is designed based on field data collection and the field forms, because this is what the majority of people will use. After each field, however, the

  17. Toward better public health reporting using existing off the shelf approaches: The value of medical dictionaries in automated cancer detection using plaintext medical data.

    Science.gov (United States)

    Kasthurirathne, Suranga N; Dixon, Brian E; Gichoya, Judy; Xu, Huiping; Xia, Yuni; Mamlin, Burke; Grannis, Shaun J

    2017-05-01

    Existing approaches to derive decision models from plaintext clinical data frequently depend on medical dictionaries as the sources of potential features. Prior research suggests that decision models developed using non-dictionary based feature sourcing approaches and "off the shelf" tools could predict cancer with performance metrics between 80% and 90%. We sought to compare non-dictionary based models to models built using features derived from medical dictionaries. We evaluated the detection of cancer cases from free text pathology reports using decision models built with combinations of dictionary or non-dictionary based feature sourcing approaches, 4 feature subset sizes, and 5 classification algorithms. Each decision model was evaluated using the following performance metrics: sensitivity, specificity, accuracy, positive predictive value, and area under the receiver operating characteristics (ROC) curve. Decision models parameterized using dictionary and non-dictionary feature sourcing approaches produced performance metrics between 70 and 90%. The source of features and feature subset size had no impact on the performance of a decision model. Our study suggests there is little value in leveraging medical dictionaries for extracting features for decision model building. Decision models built using features extracted from the plaintext reports themselves achieve comparable results to those built using medical dictionaries. Overall, this suggests that existing "off the shelf" approaches can be leveraged to perform accurate cancer detection using less complex Named Entity Recognition (NER) based feature extraction, automated feature selection and modeling approaches.
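
    The threshold-based metrics named in this abstract can be illustrated with a minimal sketch. The function name and the confusion-matrix counts below are hypothetical and are not taken from the study; the area under the ROC curve is left out because it needs per-report scores rather than counts.

```python
def classification_metrics(tp, fp, tn, fn):
    """Compute four standard metrics from confusion-matrix counts.

    tp/fp/tn/fn are true/false positives/negatives. The counts used
    below are illustrative only, not results from the paper.
    """
    return {
        "sensitivity": tp / (tp + fn),               # true positive rate
        "specificity": tn / (tn + fp),               # true negative rate
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "ppv": tp / (tp + fp),                       # positive predictive value
    }


# Hypothetical counts for 200 pathology reports.
metrics = classification_metrics(tp=80, fp=10, tn=90, fn=20)
```

    With these made-up counts, sensitivity is 80/100, specificity 90/100, accuracy 170/200 and positive predictive value 80/90.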

  18. Data quality system using reference dictionaries and edit distance algorithms

    Science.gov (United States)

    Karbarz, Radosław; Mulawka, Jan

    2015-09-01

    In management, making smart decisions is essential, yet in most cases it is not a trivial task. Such decisions may determine production levels, the allocation of funds for investments, etc. Most of the parameters in the decision-making process, such as interest rates, the value of goods, or exchange rates, may change. It is well known that these parameters are based on the data contained in data marts or data warehouses. However, if the information derived from the processed data sets is the basis for the most important management decisions, the data must be accurate, complete and current. To achieve high-quality data and gain measurable business benefits from them, a data quality system should be used. The article describes the approach to the problem and shows the algorithms in detail along with their usage. Finally, the test results are provided. The test results show the best algorithms (in terms of quality and quantity) for different parameters and data distributions.
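
    As a rough illustration of how a reference dictionary and an edit distance can be combined for data cleaning, the sketch below uses the Levenshtein distance. The abstract does not say which edit distance algorithms the article evaluates, so the choice of Levenshtein, the function names, the `max_distance` threshold and the sample city names are all assumptions made for this example.

```python
def edit_distance(a, b):
    """Levenshtein distance using a two-row dynamic programme."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,           # delete ca
                            curr[j - 1] + 1,       # insert cb
                            prev[j - 1] + cost))   # substitute or match
        prev = curr
    return prev[-1]


def closest_entry(value, reference, max_distance=2):
    """Return the nearest reference entry, or None if none is close enough."""
    if not reference:
        return None
    best = min(reference, key=lambda entry: edit_distance(value, entry))
    return best if edit_distance(value, best) <= max_distance else None


# Hypothetical reference dictionary of valid values.
cities = ["Warsaw", "Krakow", "Gdansk"]
corrected = closest_entry("Warsw", cities)
unmatched = closest_entry("Berlin", cities)
```

    Here the misspelled value is one insertion away from a dictionary entry and is mapped to it, while a value far from every entry is left unmatched rather than forced onto a wrong correction.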

  19. EXFOR/CINDA dictionaries. Edited on behalf of the cooperating data centres

    International Nuclear Information System (INIS)

    Schwerer, O.; Lemmel, H.D.

    1996-01-01

    EXFOR is the agreed exchange format for the computerized exchange of nuclear reaction data between national and international nuclear data centers for the benefit of nuclear data users in all countries. CINDA is the computerized index to literature and computer files of neutron nuclear data. This document contains the EXFOR/CINDA dictionaries, that is the collection of all agreed terms, keywords, codes, abbreviations used in EXFOR and CINDA, in particular codes for defining the nuclear data categories (nuclear reactions, cross-sections, angular distributions, energy-spectra, resonance-parameters, etc); headings and units of data columns; bibliographic references; institutions, laboratories, countries; measurement facilities and methods; system-identifier and information-identifier keywords; etc. (author)

  20. User's guide and data dictionary for Kenai Lakes Investigation Project

    International Nuclear Information System (INIS)

    Newell, A.D.; Mitch, M.E.

    1992-03-01

    In 1984, the U.S. Environmental Protection Agency (EPA) implemented the National Surface Water Survey (NSWS) as part of the Aquatic Effects Research Program (AERP). The AERP conducted several integrated studies in areas containing surface waters considered potentially sensitive to change as a result of acidic deposition. The NSWS focused its assessment on lakes and streams located in the contiguous United States. Since the majority of the systems examined in the NSWS receive moderate to high levels of acidic deposition, it is difficult to evaluate the role of natural factors in controlling the chemistry of aquatic ecosystems. Therefore, the EPA implemented a project to collect data on lakes in the Kenai Peninsula of Alaska, an area expected to receive low levels of acidic deposition. The database guide provides a brief overview of the survey and the KLIP database. Detailed information on KLIP results is found in Eilers et al. The document also summarizes the sampling and analytical methods, sources of geographic information, and precision and accuracy results from quality assurance (QA) analysis. The datasets are described in Section 3 and their formats in Section 6. The variables are defined in Section 5, and Appendix A contains a list of the lakes and their chemistry. Appendix B provides reference values taken from the Long Range Transport of Airborne Pollutants (LRTAP) Project audit samples

  1. LZ-Compressed String Dictionaries

    OpenAIRE

    Arz, Julian; Fischer, Johannes

    2013-01-01

    We show how to compress string dictionaries using the Lempel-Ziv (LZ78) data compression algorithm. Our approach is validated experimentally on dictionaries of up to 1.5 GB of uncompressed text. We achieve compression ratios often outperforming the existing alternatives, especially on dictionaries containing many repeated substrings. Our query times remain competitive.
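
    For orientation, the LZ78 factorisation that the abstract builds on can be sketched as follows. This shows only the textbook parse and its inverse; it is not the authors' compressed dictionary structure, which must also support queries, and the function names are hypothetical.

```python
def lz78_parse(text):
    """Factorise text into LZ78 pairs (index of longest known prefix, next char)."""
    phrase_ids = {"": 0}   # phrase -> index; index 0 is the empty phrase
    output = []
    current = ""
    for ch in text:
        if current + ch in phrase_ids:
            current += ch  # keep extending the current match
        else:
            output.append((phrase_ids[current], ch))
            phrase_ids[current + ch] = len(phrase_ids)
            current = ""
    if current:
        # The input ended inside a known phrase: emit it with no new character.
        output.append((phrase_ids[current], ""))
    return output


def lz78_decode(pairs):
    """Rebuild the original text from LZ78 pairs."""
    phrases = [""]
    for index, ch in pairs:
        phrases.append(phrases[index] + ch)
    return "".join(phrases[1:])


pairs = lz78_parse("abababab")
restored = lz78_decode(pairs)
```

    Repeated substrings are replaced by references to earlier phrases, which is consistent with the abstract's remark that the method does best on dictionaries containing many repeated substrings.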

  2. Special Report: Conflicting Data on Spanish Intransitive Verbs in Two Leading Dictionaries.

    Science.gov (United States)

    Teschner, Richard V.; Flemming, Jennifer

    1996-01-01

    Presents a conflation of and a comparison between the 1,646 verbs the Royal Academy's "Diccionario de la lengua espanola" (Dictionary of the Spanish Language) classifies as solely or partly intransitive and the 1,382 verbs that are so classified by the "Pequeno Larousse ilustrado" (Illustrated Larousse Small Dictionary).…

  3. Many general language dictionaries contain specialized terms

    African Journals Online (AJOL)

    user

    Lexikos 25 (AFRILEX-reeks/series 25: 2015): 246-261 ... attention to the link between dictionary functions, corpora and the data presented in dictionaries, ... technical words) in general language dictionaries is sparse and concerns terms .... civil procedure terms to focus on and in which dictionaries to look, I will go on.

  4. Core Standards of the EUBIROD Project. Defining a European Diabetes Data Dictionary for Clinical Audit and Healthcare Delivery.

    Science.gov (United States)

    Cunningham, S G; Carinci, F; Brillante, M; Leese, G P; McAlpine, R R; Azzopardi, J; Beck, P; Bratina, N; Bocquet, V; Doggen, K; Jarosz-Chobot, P K; Jecht, M; Lindblad, U; Moulton, T; Metelko, Ž; Nagy, A; Olympios, G; Pruna, S; Skeie, S; Storms, F; Di Iorio, C T; Massi Benedetti, M

    2016-01-01

    A set of core diabetes indicators were identified in a clinical review of current evidence for the EUBIROD project. In order to allow accurate comparisons of diabetes indicators, a standardised currency for data storage and aggregation was required. We aimed to define a robust European data dictionary with appropriate clinical definitions that can be used to analyse diabetes outcomes and provide the foundation for data collection from existing electronic health records for diabetes. Existing clinical datasets used by 15 partner institutions across Europe were collated and common data items analysed for consistency in terms of recording, data definition and units of measurement. Where necessary, data mappings and algorithms were specified in order to allow partners to meet the standard definitions. A series of descriptive elements were created to document metadata for each data item, including recording, consistency, completeness and quality. While datasets varied in terms of consistency, it was possible to create a common standard that could be used by all. The minimum dataset defined 53 data items that were classified according to their feasibility and validity. Mappings and standardised definitions were used to create an electronic directory for diabetes care, providing the foundation for the EUBIROD data analysis repository, also used to implement the diabetes registry and model of care for Cyprus. The development of data dictionaries and standards can be used to improve the quality and comparability of health information. A data dictionary has been developed to be compatible with other existing data sources for diabetes, within and beyond Europe.

  5. Robust Multimodal Dictionary Learning

    Science.gov (United States)

    Cao, Tian; Jojic, Vladimir; Modla, Shannon; Powell, Debbie; Czymmek, Kirk; Niethammer, Marc

    2014-01-01

    We propose a robust multimodal dictionary learning method for multimodal images. Joint dictionary learning for both modalities may be impaired by lack of correspondence between image modalities in training data, for example due to areas of low quality in one of the modalities. Dictionaries learned with such non-corresponding data will induce uncertainty about image representation. In this paper, we propose a probabilistic model that accounts for image areas that are poorly corresponding between the image modalities. We cast the problem of learning a dictionary in presence of problematic image patches as a likelihood maximization problem and solve it with a variant of the EM algorithm. Our algorithm iterates identification of poorly corresponding patches and refinements of the dictionary. We tested our method on synthetic and real data. We show improvements in image prediction quality and alignment accuracy when using the method for multimodal image registration. PMID:24505674

  6. The chemical component dictionary: complete descriptions of constituent molecules in experimentally determined 3D macromolecules in the Protein Data Bank

    OpenAIRE

    Westbrook, John D.; Shao, Chenghua; Feng, Zukang; Zhuravleva, Marina; Velankar, Sameer; Young, Jasmine

    2014-01-01

    Summary: The Chemical Component Dictionary (CCD) is a chemical reference data resource that describes all residue and small molecule components found in Protein Data Bank (PDB) entries. The CCD contains detailed chemical descriptions for standard and modified amino acids/nucleotides, small molecule ligands and solvent molecules. Each chemical definition includes descriptions of chemical properties such as stereochemical assignments, chemical descriptors, systematic chemical names and idealize...

  7. Specialised Translation Dictionaries for Learners

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2010-01-01

    Specialised translation dictionaries for learners are reference tools that can help users with domain discourse in a foreign language in connection with translation. The most common type is the business dictionary covering several more or less related subject fields. However, business dictionaries...... the needs of learners, it is proposed that specialised translation dictionaries should be designed as augmented reference tools. It is argued that electronic and printed dictionaries should include sections or CD-ROMs with syntactic, translation etc. data as well as exercises and illustrative documents...

  8. A semantic data dictionary method for database schema integration in CIESIN

    Science.gov (United States)

    Hinds, N.; Huang, Y.; Ravishankar, C.

    1993-08-01

    CIESIN (Consortium for International Earth Science Information Network) is funded by NASA to investigate the technology necessary to integrate and facilitate the interdisciplinary use of Global Change information. A clear part of this mission includes providing a link between the various global change data sets, in particular the physical sciences and the human (social) sciences. The typical scientist using the CIESIN system will want to know how phenomena in an outside field affect his/her work. For example, a medical researcher might ask: how does air quality affect emphysema? This and many similar questions will require sophisticated semantic data integration. The researcher who raised the question may be familiar with medical data sets containing emphysema occurrences. But this same investigator may know little, if anything, about the existence or location of air-quality data. It is easy to envision a system which would allow that investigator to locate and perform a "join" on two data sets, one containing emphysema cases and the other containing air-quality levels. No such system exists today. One major obstacle to providing such a system will be overcoming heterogeneity, which falls into two broad categories. "Database system" heterogeneity involves differences in data models and packages. "Data semantic" heterogeneity involves differences in terminology between disciplines, which translate into data semantic issues, and varying levels of data refinement, from raw to summary. Our work investigates a global data dictionary mechanism to facilitate a merged data service. Specifically, we propose using a semantic tree during schema definition to aid in locating and integrating heterogeneous databases.

  9. The U.S. national nuclear forensics library, nuclear materials information program, and data dictionary

    International Nuclear Information System (INIS)

    Lamont, Stephen Philip; Brisson, Marcia; Curry, Michael

    2011-01-01

    Nuclear forensics assessments to determine material process history require careful comparison of sample data to both measured and modeled nuclear material characteristics. Developing centralized databases, or nuclear forensics libraries, to house this information is an important step to ensure all relevant data will be available for comparison during a nuclear forensics analysis and help expedite the assessment of material history. The approach most widely accepted by the international community at this time is the implementation of National Nuclear Forensics libraries, which would be developed and maintained by individual nations. This is an attractive alternative to an international database since it provides an understanding that each country has data on materials produced and stored within their borders, but eliminates the need to reveal any proprietary or sensitive information to other nations. To support the concept of National Nuclear Forensics libraries, the United States Department of Energy has developed a model library, based on a data dictionary, or set of parameters designed to capture all nuclear forensic relevant information about a nuclear material. Specifically, information includes material identification, collection background and current location, analytical laboratories where measurements were made, material packaging and container descriptions, physical characteristics including mass and dimensions, chemical and isotopic characteristics, particle morphology or metallurgical properties, process history including facilities, and measurement quality assurance information. While not necessarily required, it may also be valuable to store modeled data sets including reactor burn-up or enrichment cascade data for comparison. It is fully expected that only a subset of this information is available or relevant to many materials, and much of the data populating a National Nuclear Forensics library would be process analytical or material accountability

  10. From data to knowledge through concept-oriented terminologies: experience with the Medical Entities Dictionary.

    Science.gov (United States)

    Cimino, J J

    2000-01-01

    Knowledge representation involves enumeration of conceptual symbols and arrangement of these symbols into some meaningful structure. Medical knowledge representation has traditionally focused more on the structure than the symbols. Several significant efforts are under way, at local, national, and international levels, to address the representation of the symbols through the creation of high-quality terminologies that are themselves knowledge based. This paper reviews these efforts, including the Medical Entities Dictionary (MED) in use at Columbia University and the New York Presbyterian Hospital. A decade's experience with the MED is summarized to serve as a proof-of-concept that knowledge-based terminologies can support the use of coded patient data for a variety of knowledge-based activities, including the improved understanding of patient data, the access of information sources relevant to specific patient care problems, the application of expert systems directly to the care of patients, and the discovery of new medical knowledge. The terminological knowledge in the MED has also been used successfully to support clinical application development and maintenance, including that of the MED itself. On the basis of this experience, current efforts to create standard knowledge-based terminologies appear to be justified.

  11. RUNTIME DICTIONARIES FOR ROOT

    CERN Document Server

    Wind, David Kofoed

    2013-01-01

    ROOT is the LHC physicists' common tool for data analysis; almost all data is stored using ROOT's I/O system. This system benefits from a custom description of types (a so-called dictionary) that is optimised for the I/O. Until now, the dictionary cannot be provided at run-time; it needs to be prepared in a separate prerequisite step. This project will move the generation of the dictionary to run-time, making use of ROOT 6's new just-in-time compiler. It allows a more dynamic and natural access to ROOT's I/O features especially for user code.

  12. Dictionary learning in visual computing

    CERN Document Server

    Zhang, Qiang

    2015-01-01

    The last few years have witnessed fast development on dictionary learning approaches for a set of visual computing tasks, largely due to their utilization in developing new techniques based on sparse representation. Compared with conventional techniques employing manually defined dictionaries, such as Fourier Transform and Wavelet Transform, dictionary learning aims at obtaining a dictionary adaptively from the data so as to support optimal sparse representation of the data. In contrast to conventional clustering algorithms like K-means, where a data point is associated with only one cluster c

  13. Using the LOINC Semantic Structure to Integrate Community-based Survey Items into a Concept-based Enterprise Data Dictionary to Support Comparative Effectiveness Research.

    Science.gov (United States)

    Co, Manuel C; Boden-Albala, Bernadette; Quarles, Leigh; Wilcox, Adam; Bakken, Suzanne

    2012-01-01

    In designing informatics infrastructure to support comparative effectiveness research (CER), it is necessary to implement approaches for integrating heterogeneous data sources such as clinical data typically stored in clinical data warehouses and those that are normally stored in separate research databases. One strategy to support this integration is the use of a concept-oriented data dictionary with a set of semantic terminology models. The aim of this paper is to illustrate the use of the semantic structure of Clinical LOINC (Logical Observation Identifiers, Names, and Codes) in integrating community-based survey items into the Medical Entities Dictionary (MED) to support the integration of survey data with clinical data for CER studies.

  14. The semantics of Chemical Markup Language (CML): dictionaries and conventions

    Science.gov (United States)

    2011-01-01

    The semantic architecture of CML consists of conventions, dictionaries and units. The conventions conform to a top-level specification and each convention can constrain compliant documents through machine-processing (validation). Dictionaries conform to a dictionary specification which also imposes machine validation on the dictionaries. Each dictionary can also be used to validate data in a CML document, and provide human-readable descriptions. An additional set of conventions and dictionaries are used to support scientific units. All conventions, dictionaries and dictionary elements are identifiable and addressable through unique URIs. PMID:21999509

  15. The semantics of Chemical Markup Language (CML): dictionaries and conventions.

    Science.gov (United States)

    Murray-Rust, Peter; Townsend, Joe A; Adams, Sam E; Phadungsukanan, Weerapong; Thomas, Jens

    2011-10-14

    The semantic architecture of CML consists of conventions, dictionaries and units. The conventions conform to a top-level specification and each convention can constrain compliant documents through machine-processing (validation). Dictionaries conform to a dictionary specification which also imposes machine validation on the dictionaries. Each dictionary can also be used to validate data in a CML document, and provide human-readable descriptions. An additional set of conventions and dictionaries are used to support scientific units. All conventions, dictionaries and dictionary elements are identifiable and addressable through unique URIs.

  16. TERRESTRIAL LASER SCANNER DATA DENOISING BY DICTIONARY LEARNING OF SPARSE CODING

    Directory of Open Access Journals (Sweden)

    E. Smigiel

    2013-07-01

    Point cloud processing is basically a signal processing issue. The huge amount of data collected with Terrestrial Laser Scanners or photogrammetry techniques raises the classical questions linked with signal or image processing. Among others, denoising and compression are questions which have to be addressed in this context. That is why one has to turn attention to signal theory, because it is likely to guide good practices or to inspire new ideas from the latest developments in this field. The literature has shown for decades how strong and dynamic the theoretical field is and how efficient the derived algorithms have become. For about ten years, a new technique has been available: known as compressive sensing or compressive sampling, it is based first on sparsity, which is an interesting characteristic of many natural signals. Based on this concept, many denoising and compression techniques have shown their efficiency. Sparsity can also be seen as redundancy removal of natural signals. Combined with incoherent measurements, it gives rise to compressive sensing, which uses the idea that redundancy could be removed at the very early stage of sampling. Hence, instead of sampling the signal at a high sampling rate and removing redundancy as a second stage, the acquisition stage itself may be run with redundancy removal. This paper gives some theoretical aspects of these ideas with simple mathematics first. Then, the idea of compressive sensing for a Terrestrial Laser Scanner is examined as a potential research question and finally, a denoising scheme based on dictionary learning of sparse coding is tested. Both the theoretical discussion and the obtained results show that it is worth staying close to signal processing theory and its community to benefit from its latest developments.

  17. Weakly Supervised Dictionary Learning

    Science.gov (United States)

    You, Zeyu; Raich, Raviv; Fern, Xiaoli Z.; Kim, Jinsub

    2018-05-01

    We present a probabilistic modeling and inference framework for discriminative analysis dictionary learning under a weak supervision setting. Dictionary learning approaches have been widely used for tasks such as low-level signal denoising and restoration as well as high-level classification tasks, which can be applied to audio and image analysis. Synthesis dictionary learning aims at jointly learning a dictionary and corresponding sparse coefficients to provide accurate data representation. This approach is useful for denoising and signal restoration, but may lead to sub-optimal classification performance. By contrast, analysis dictionary learning provides a transform that maps data to a sparse discriminative representation suitable for classification. We consider the problem of analysis dictionary learning for time-series data under a weak supervision setting in which signals are assigned with a global label instead of an instantaneous label signal. We propose a discriminative probabilistic model that incorporates both label information and sparsity constraints on the underlying latent instantaneous label signal using cardinality control. We present the expectation maximization (EM) procedure for maximum likelihood estimation (MLE) of the proposed model. To facilitate a computationally efficient E-step, we propose both a chain and a novel tree graph reformulation of the graphical model. The performance of the proposed model is demonstrated on both synthetic and real-world data.

  18. A Dictionary Learning Approach for Signal Sampling in Task-Based fMRI for Reduction of Big Data.

    Science.gov (United States)

    Ge, Bao; Li, Xiang; Jiang, Xi; Sun, Yifei; Liu, Tianming

    2018-01-01

    The exponential growth of fMRI big data offers researchers an unprecedented opportunity to explore functional brain networks. However, this opportunity has not been fully explored yet due to the lack of effective and efficient tools for handling such fMRI big data. One major challenge is that computing capabilities still lag behind the growth of large-scale fMRI databases, e.g., it takes many days to perform dictionary learning and sparse coding of whole-brain fMRI data for an fMRI database of average size. Therefore, how to reduce the data size but without losing important information becomes a more and more pressing issue. To address this problem, we propose a signal sampling approach for significant fMRI data reduction before performing structurally-guided dictionary learning and sparse coding of whole brain's fMRI data. We compared the proposed structurally guided sampling method with no sampling, random sampling and uniform sampling schemes, and experiments on the Human Connectome Project (HCP) task fMRI data demonstrated that the proposed method can achieve more than 15 times speed-up without sacrificing the accuracy in identifying task-evoked functional brain networks.

  19. A Dictionary Learning Approach for Signal Sampling in Task-Based fMRI for Reduction of Big Data

    Science.gov (United States)

    Ge, Bao; Li, Xiang; Jiang, Xi; Sun, Yifei; Liu, Tianming

    2018-01-01

    The exponential growth of fMRI big data offers researchers an unprecedented opportunity to explore functional brain networks. However, this opportunity has not been fully explored yet due to the lack of effective and efficient tools for handling such fMRI big data. One major challenge is that computing capabilities still lag behind the growth of large-scale fMRI databases, e.g., it takes many days to perform dictionary learning and sparse coding of whole-brain fMRI data for an fMRI database of average size. Therefore, how to reduce the data size but without losing important information becomes a more and more pressing issue. To address this problem, we propose a signal sampling approach for significant fMRI data reduction before performing structurally-guided dictionary learning and sparse coding of whole brain's fMRI data. We compared the proposed structurally guided sampling method with no sampling, random sampling and uniform sampling schemes, and experiments on the Human Connectome Project (HCP) task fMRI data demonstrated that the proposed method can achieve more than 15 times speed-up without sacrificing the accuracy in identifying task-evoked functional brain networks. PMID:29706880

  20. Multilingualism and Dictionaries

    Directory of Open Access Journals (Sweden)

    Wojciech Paweł Sosnowski

    2015-12-01

    The Russian-Bulgarian-Polish dictionary that we (Wojciech Sosnowski, Violetta Koseska-Toszewa and Anna Kisiel) are currently developing has no precedent as far as its theoretical foundations and its structure are concerned. The dictionary offers a unique combination of three Slavic languages that belong to three different groups: a West Slavic language (Polish), a South Slavic language (Bulgarian) and an East Slavic language (Russian). The dictionary describes semantic and syntactic equivalents of words between the languages. When completed, the dictionary will contain around 30,000 entries. The principle we build the dictionary on is that every language should be given equal status. Many of our data come from the Parallel Polish-Bulgarian-Russian corpus developed by us as part of the CLARIN-PL initiative. In the print version, the entries come in the order of the Cyrillic alphabet and they are not numbered (except for homonyms, which are disambiguated with Roman numerals). We selected the lemmas for the dictionary on the basis of their frequency in the corpus. Our dictionary is the first dictionary to include forms of address and the most recent neologisms in the three languages. Faithful to the recent developments in contrastive linguistics, we begin with a form from the dictionary’s primary language and we define it in Polish. Subsequently, based on this definition, we try to find an equivalent in the second and the third language. Therefore, the meaning comes first and only then do we look for the form (i.e. the equivalent) that corresponds to this meaning. This principle, outlined in Gramatyka konfrontatywna języków polskiego i bułgarskiego (GKBP), allows us to treat data from multiple languages as equal. In the dictionary, we draw attention to the correct choice of equivalents in translation; we also provide categorisers that indicate the meaning of verbal tenses and aspects. The definitions of states, events and

  1. Learning Category-Specific Dictionary and Shared Dictionary for Fine-Grained Image Categorization.

    Science.gov (United States)

    Gao, Shenghua; Tsang, Ivor Wai-Hung; Ma, Yi

    2014-02-01

    This paper targets fine-grained image categorization by learning a category-specific dictionary for each category and a shared dictionary for all the categories. Such category-specific dictionaries encode subtle visual differences among different categories, while the shared dictionary encodes common visual patterns among all the categories. To this end, we impose incoherence constraints among the different dictionaries in the objective of feature coding. In addition, to make the learnt dictionary stable, we also impose the constraint that each dictionary should be self-incoherent. Our proposed dictionary learning formulation not only applies to fine-grained classification, but also improves conventional basic-level object categorization and other tasks such as event recognition. Experimental results on five data sets show that our method can outperform the state-of-the-art fine-grained image categorization frameworks as well as sparse coding based dictionary learning frameworks. All these results demonstrate the effectiveness of our method.

  2. How Dictionary Users Choose Senses in Bilingual Dictionary Entries ...

    African Journals Online (AJOL)

    advanced Polish learners of English, consulted 26 Polish-to-English dictionary pages prompted with a sentence translation task. ... structural involvedness of dictionaries themselves, the quality of the data returned is questionable. In contrast ...... scans patterned differently. They tended to be more rapid and the landing.

  3. Insurance dictionary

    International Nuclear Information System (INIS)

    Mueller-Lutz, H.L.

    1984-01-01

    Special technical terms used in the world of insurance can hardly be found in general dictionaries. This is a gap which the 'Insurance dictionary' now presented is designed to fill. In view of its supplementary function, the number of terms covered is limited to 1200. To make this dictionary especially convenient for ready reference, only the most commonly used translations are given for each key word in any of the four languages. This dictionary is subdivided into four parts, each containing the translation of the selected terms in the three other languages. To further facilitate the use of the booklet, paper of different colours was used for the printing of the German, English, French and Greek sections. The present volume was developed from a Swedish insurance dictionary (Fickordbok Foersaekring), published in 1967, which - with Swedish as the key language - offers English, French and German translations of the basic insurance terms. (orig./HP) [de

  4. Martin Benjamin (EPFL), The Particles of Language: "The Dictionary" as elemental data for 7000 languages across time and space

    CERN Multimedia

    CERN. Geneva

    2015-01-01

    How can we document detailed data about all the world's languages in a consistent, unified source, in a way that can serve knowledge and technology needs for people and their machines around the globe? Dictionaries have historically presented selective information about words and their meanings within a language, or translation equivalents between languages, in idiosyncratic, incommensurable formats with little basis in data science. The Kamusi Project introduces a new approach, conceiving of language as a matrix of interrelated data elements. By documenting these elements within each language, and linking elements at conceptual and functional nodes across languages, Kamusi aims toward an elusive Big Data goal: "every word in every language." If successful, the results will run the gamut from preserving the human heritage embedded in endangered languages, to providing international vocabularies for students to succeed in science, to a Star Trek-...

  5. Improving Long-term Quality and Continuity of Landsat-7 Data Through Inpainting of Lost Data Based on the Nonconvex Model of Dynamic Dictionary Learning

    Science.gov (United States)

    Miao, J.; Zhou, Z.; Zhou, X.; Huang, T.

    2017-12-01

    On May 31, 2003, the scan line corrector (SLC) of the Enhanced Thematic Mapper Plus (ETM+) on board the Landsat-7 satellite failed, resulting in stripes of lost data in the Landsat-7 images, which seriously affected the quality and continuous application of the ETM+ data for space and earth science. This paper proposes a new inpainting method for repairing the Landsat-7 ETM+ images that takes into account the physical characteristics and geometric features of the ground area for which data are missing. Firstly, the two geometric slopes of the boundaries of each missing stripe of the georeferenced ETM+ image are calculated by the Hough transform, ignoring the slope of the parts of the missing stripe that lie on the same edges of the whole image. Secondly, an adaptive dictionary is developed and trained using a large number of Landsat-7 ETM+ SLC-ON images. When the adaptive dictionary is used to restore an image with missing data, the dictionary is actually dynamic. Then the data-missing stripes are repaired along their slope directions by using the logdet(·) low-rank non-convex model along with the dynamic dictionary. Imperfect points are defined as pixels whose values are quite different from the surrounding pixel values. They can be real values but are most likely noise. Lastly, the imperfect points remaining after the second step are replaced by using the method of sparse restoration of overlapping groups. We take the Landsat ETM+ images of June 10, 2002 as the test images for our algorithm evaluation. No data are missing in these images. Therefore we extract the missing stripes of images of the same WRS path and WRS row as the 2002 image, but acquired after 2003, to form the missing-stripe model. Then we overlay the missing-stripe model on the 2002 image to get the simulated missing image. Fig.1(a)-(c) show the simulated missing images of Bands 1, 3, and 5 of the 2002 ETM+ image data. We apply the algorithm to restore the missing stripes. Fig.1(d

  6. Mediostructures in bilingual LSP dictionaries

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2003-01-01

    This paper argues that the lexicographic mediostructure is a network structure that deals with a set or sets of relations that exist between different parts of data by way of cross-referencing, dictionary-internal as well as dictionary-external. The abstract mediostructure consists of all...... the possible sets of cross-referential relations, whether realised by concrete sets or not in the dictionary. The actual realisation of these referential networks may be function-related and the primary function of the dictionary may then be given priority. The actual cross-references at this level...... are then the concrete sets of relations depending on the function of the dictionary, the distribution structure and the search path involved in retrieving the information. The paper introduces a distinction between use-related and function-related cross-references and focuses on cross-references supporting...

  7. Incorporating High-Frequency Physiologic Data Using Computational Dictionary Learning Improves Prediction of Delayed Cerebral Ischemia Compared to Existing Methods.

    Science.gov (United States)

    Megjhani, Murad; Terilli, Kalijah; Frey, Hans-Peter; Velazquez, Angela G; Doyle, Kevin William; Connolly, Edward Sander; Roh, David Jinou; Agarwal, Sachin; Claassen, Jan; Elhadad, Noemie; Park, Soojin

    2018-01-01

    Accurate prediction of delayed cerebral ischemia (DCI) after subarachnoid hemorrhage (SAH) can be critical for planning interventions to prevent poor neurological outcome. This paper presents a model using convolution dictionary learning to extract features from physiological data available from bedside monitors. We develop and validate a prediction model for DCI after SAH, demonstrating improved precision over standard methods alone. 488 consecutive SAH admissions from 2006 to 2014 to a tertiary care hospital were included. Models were trained on 80%, while 20% were set aside for validation testing. Modified Fisher Scale was considered the standard grading scale in clinical use; baseline features also analyzed included age, sex, Hunt-Hess, and Glasgow Coma Scales. An unsupervised approach using convolution dictionary learning was used to extract features from physiological time series (systolic blood pressure and diastolic blood pressure, heart rate, respiratory rate, and oxygen saturation). Classifiers (partial least squares and linear and kernel support vector machines) were trained on feature subsets of the derivation dataset. Models were applied to the validation dataset. The performances of the best classifiers on the validation dataset are reported by feature subset. Standard grading scale (mFS): AUC 0.54. Combined demographics and grading scales (baseline features): AUC 0.63. Kernel derived physiologic features: AUC 0.66. Combined baseline and physiologic features with redundant feature reduction: AUC 0.71 on derivation dataset and 0.78 on validation dataset. Current DCI prediction tools rely on admission imaging and are advantageously simple to employ. However, using an agnostic and computationally inexpensive learning approach for high-frequency physiologic time series data, we demonstrated that we could incorporate individual physiologic data to achieve higher classification accuracy.
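
    The AUC values reported in this abstract measure how well a classifier ranks positive cases above negative ones. As a hedged illustration only (this is the standard pairwise definition of AUC, not the study's code, and the labels and scores are made up), it can be computed as follows:

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    Counts the fraction of (positive, negative) pairs in which the positive
    case receives the higher score; ties count as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))

# Hypothetical validation labels (1 = event occurred) and classifier scores.
example_auc = auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
```

    An AUC of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which gives a scale for reading the reported values of 0.54 to 0.78.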

  8. Incorporating High-Frequency Physiologic Data Using Computational Dictionary Learning Improves Prediction of Delayed Cerebral Ischemia Compared to Existing Methods

    Directory of Open Access Journals (Sweden)

    Murad Megjhani

    2018-03-01

    Full Text Available Purpose: Accurate prediction of delayed cerebral ischemia (DCI) after subarachnoid hemorrhage (SAH) can be critical for planning interventions to prevent poor neurological outcome. This paper presents a model using convolution dictionary learning to extract features from physiological data available from bedside monitors. We develop and validate a prediction model for DCI after SAH, demonstrating improved precision over standard methods alone. Methods: 488 consecutive SAH admissions from 2006 to 2014 to a tertiary care hospital were included. Models were trained on 80%, while 20% were set aside for validation testing. Modified Fisher Scale was considered the standard grading scale in clinical use; baseline features also analyzed included age, sex, Hunt–Hess, and Glasgow Coma Scales. An unsupervised approach using convolution dictionary learning was used to extract features from physiological time series (systolic blood pressure and diastolic blood pressure, heart rate, respiratory rate, and oxygen saturation). Classifiers (partial least squares and linear and kernel support vector machines) were trained on feature subsets of the derivation dataset. Models were applied to the validation dataset. Results: The performances of the best classifiers on the validation dataset are reported by feature subset. Standard grading scale (mFS): AUC 0.54. Combined demographics and grading scales (baseline features): AUC 0.63. Kernel derived physiologic features: AUC 0.66. Combined baseline and physiologic features with redundant feature reduction: AUC 0.71 on derivation dataset and 0.78 on validation dataset. Conclusion: Current DCI prediction tools rely on admission imaging and are advantageously simple to employ. However, using an agnostic and computationally inexpensive learning approach for high-frequency physiologic time series data, we demonstrated that we could incorporate individual physiologic data to achieve higher classification accuracy.

  9. Electrochemical dictionary

    CERN Document Server

    Bard, Allen J; Scholz, Fritz

    2014-01-01

    This comprehensive dictionary includes some 3000 common terms in electrochemistry and energy research, and related fields. Offers clear, precise definitions, references and more than 600 illustrations. The new edition adds more than 300 new and revised terms.

  10. IAA Space Terminological Multilingual Data Bank Towards an On- Line Dictionary with Definitions in French and in English

    Science.gov (United States)

    Bensaid, R.

    2002-01-01

    It has been emphasized in previous papers that the bilingual "basic list" of the IAA multilingual terminological data bank (MTDB) needed improvement before work on definitions could begin. In the first part of this communication, we report on the work (corrections and additions) done to improve the scope of the "basic list". This work has yet to be done by the coordinators for the other twelve languages covered by the IAA MTDB. In the second part, following the decision of the IAA MTDB committee to complete the MTDB with definitions in French and in English, we describe the methodology adopted and the problems encountered in elaborating a mock-up of a space dictionary that includes, as a first step, definitions in English and in French of the English terms and expressions beginning with the letter "A" in the basic list.

  11. Comparative Nivkh Dictionary

    DEFF Research Database (Denmark)

    Fortescue, Michael David

    This dictionary undertakes to reconstruct the lexis and morphology of the Nivkh proto-language by marshaling and organizing all the data available in published form on the contemporary dialects. It builds upon a considerable body of descriptive and comparative work carried out by scholars who have...... World is a subject of continuing interest to both linguists and anthropologists. The dictionary does not address this question directly. Reconstructing the proto-language is an essential step, however, to any further comparative work – in particular to sorting out the relationship between Nivkh...

  12. The chemical component dictionary: complete descriptions of constituent molecules in experimentally determined 3D macromolecules in the Protein Data Bank.

    Science.gov (United States)

    Westbrook, John D; Shao, Chenghua; Feng, Zukang; Zhuravleva, Marina; Velankar, Sameer; Young, Jasmine

    2015-04-15

    The Chemical Component Dictionary (CCD) is a chemical reference data resource that describes all residue and small molecule components found in Protein Data Bank (PDB) entries. The CCD contains detailed chemical descriptions for standard and modified amino acids/nucleotides, small molecule ligands and solvent molecules. Each chemical definition includes descriptions of chemical properties such as stereochemical assignments, chemical descriptors, systematic chemical names and idealized coordinates. The content, preparation, validation and distribution of this CCD chemical reference dataset are described. The CCD is updated regularly in conjunction with the scheduled weekly release of new PDB structure data. The CCD and amino acid variant reference datasets are hosted in the public PDB ftp repository at ftp://ftp.wwpdb.org/pub/pdb/data/monomers/components.cif.gz, ftp://ftp.wwpdb.org/pub/pdb/data/monomers/aa-variants-v1.cif.gz, and its mirror sites, and can be accessed from http://wwpdb.org. Contact: jwest@rcsb.rutgers.edu. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.

  13. Internet accounting dictionaries

    DEFF Research Database (Denmark)

    Nielsen, Sandro; Mourier, Lise

    2005-01-01

    An examination of existing accounting dictionaries on the Internet reveals a general need for a new type of dictionary. In contrast to the dictionaries now accessible, the future accounting dictionaries should be designed as proper Internet dictionaries based on a functional approach so they can...

  14. Seismic Signal Compression Using Nonparametric Bayesian Dictionary Learning via Clustering

    Directory of Open Access Journals (Sweden)

    Xin Tian

    2017-06-01

    Full Text Available We introduce a seismic signal compression method based on nonparametric Bayesian dictionary learning via clustering. The seismic data are compressed patch by patch, and the dictionary is learned online. Clustering is introduced for dictionary learning. A set of dictionaries is generated, and each dictionary is used for the sparse coding of one cluster. In this way, the signals in one cluster can be well represented by their corresponding dictionary. A nonparametric Bayesian dictionary learning method is used to learn the dictionaries, which naturally infers an appropriate dictionary size for each cluster. A uniform quantizer and an adaptive arithmetic coding algorithm are adopted to code the sparse coefficients. Comparisons with other state-of-the-art approaches in the experiments validate the effectiveness of the proposed method.
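
    The uniform quantizer mentioned in this abstract maps each sparse coefficient to an integer index before entropy coding. The sketch below is a generic mid-tread uniform quantizer with a hypothetical step size; it is not taken from the paper and leaves out the arithmetic coding stage.

```python
import math

def quantize(coefficients, step):
    """Map each coefficient to the index of the nearest multiple of `step`."""
    if step <= 0:
        raise ValueError("step must be positive")
    # floor(x + 0.5) rounds to the nearest integer, with halves rounded up.
    return [math.floor(c / step + 0.5) for c in coefficients]

def dequantize(indices, step):
    """Reconstruct approximate coefficients from their integer indices."""
    return [i * step for i in indices]

# Hypothetical sparse coefficients and step size.
coefficients = [0.0, 0.26, -0.74, 1.1]
indices = quantize(coefficients, 0.5)
restored = dequantize(indices, 0.5)
```

    The reconstruction error of each coefficient is at most half the step size, so the step size trades compression ratio against fidelity.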

  15. Dictionary Based Segmentation in Volumes

    DEFF Research Database (Denmark)

    Emerson, Monica Jane; Jespersen, Kristine Munk; Jørgensen, Peter Stanley

    2015-01-01

    We present a method for supervised volumetric segmentation based on a dictionary of small cubes composed of pairs of intensity and label cubes. Intensity cubes are small image volumes where each voxel contains an image intensity. Label cubes are volumes with voxelwise probabilities for a given...... label. The segmentation process is done by matching a cube from the volume, of the same size as the dictionary intensity cubes, to the most similar intensity dictionary cube, and from the associated label cube we get voxel-wise label probabilities. Probabilities from overlapping cubes are averaged...... and hereby we obtain a robust label probability encoding. The dictionary is computed from labeled volumetric image data based on weighted clustering. We experimentally demonstrate our method using two data sets from material science – a phantom data set of a solid oxide fuel cell simulation for detecting...
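
    The matching and averaging steps described in this abstract can be sketched in one dimension. The code below is a simplified illustration, not the authors' implementation: it uses 1-D windows instead of cubes, a hand-written dictionary instead of one learned by weighted clustering, and squared Euclidean distance as an assumed similarity measure.

```python
def segment(signal, dictionary, size):
    """Assign a label probability to every sample of a 1-D signal.

    `dictionary` is a list of (intensity_patch, label_patch) pairs, each of
    length `size`. Every window of the signal is matched to its most similar
    intensity patch, and the label probabilities of overlapping windows are
    averaged.
    """
    if len(signal) < size:
        raise ValueError("signal is shorter than the dictionary patches")
    sums = [0.0] * len(signal)
    counts = [0] * len(signal)
    for start in range(len(signal) - size + 1):
        window = signal[start:start + size]
        # Nearest intensity patch under squared Euclidean distance.
        _, labels = min(
            dictionary,
            key=lambda entry: sum((a - b) ** 2 for a, b in zip(window, entry[0])),
        )
        for offset, probability in enumerate(labels):
            sums[start + offset] += probability
            counts[start + offset] += 1
    return [s / c for s, c in zip(sums, counts)]

# Hypothetical dictionary: dark samples are background (0), bright ones object (1).
toy_dictionary = [
    ([0, 0, 0], [0, 0, 0]),
    ([1, 1, 1], [1, 1, 1]),
    ([0, 0, 1], [0, 0, 1]),
    ([0, 1, 1], [0, 1, 1]),
]
probabilities = segment([0, 0, 0, 1, 1, 1], toy_dictionary, 3)
```

    Averaging over overlapping windows is what makes the label estimate robust: a single poor match affects only a fraction of the votes for each sample.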

  16. Dictionary Management

    DEFF Research Database (Denmark)

    Bergenholtz, Henning

    2018-01-01

    in different projects. The same steps can be applied to lexicographic projects. In this field, by looking at finished and not finished dictionary projects, we find that: many are started but never finished; and many are planned to be carried out within a certain time frame, but it takes much longer than...... anticipated until the project is completed with the publication of one or more dictionaries. The reason for that is normally an unrealistic and much too optimistic planning. But it is also due to a missing knowledge about management planning according to a relevant overall lexicographic theory....... There is a long tradition of understanding lexicography as the compiling of dictionaries, especially among British scholars. But there is also a tradition of focusing on theoretical lexicography, especially among German scholars. In this contribution, I consider lexicography a discipline with two legs: (1...

  17. Subject-field components as integrated parts of LSP dictionaries

    DEFF Research Database (Denmark)

    Bergenholtz, Henning; Nielsen, Sandro

    2006-01-01

    The dividing line between specialised lexicography and terminography is non-existent. The focus of preparing dictionaries for a particular subject-field should be the needs of its user group in specific situations. This is catered for by the modern theory of dictionary functions and includes...... the introduction of subject-field components in dictionaries. Dictionary functions are communication-orientated or cognition-orientated, and the lexicographers must identify the relevant functions and select and present the data so that the dictionary satisfies the needs of the users. The optimal dictionary...

  18. Learning Dictionaries of Discriminative Image Patches

    DEFF Research Database (Denmark)

    Dahl, Anders Lindbjerg; Larsen, Rasmus

    2011-01-01

    using dictionaries of image patches with associated label data. The approach is based on ideas from sparse generative image models and texton based texture modeling. The intensity and label dictionaries are learned from training images with associated label information of (a subset of) the pixels based...... on a modified vector quantization approach. For new images the intensity dictionary is used to encode the image data and the label dictionary is used to build a segmentation of the image. We demonstrate the algorithm on composite and real texture images and show how successful training is possible even...

  19. Compiling Dictionaries

    African Journals Online (AJOL)

    Information Technology

    This method results in a classified word list that can be efficiently ... standardized list of domains to classify multiple dictionaries opens up possibilities for cross-lin- .... part of speech, noun class, the plural form of each noun, and a simple gloss. ... But these mental links tend to cluster around a ... group, duet, trio, ensemble.

  20. Using Bilingual Dictionaries.

    Science.gov (United States)

    Thompson, Geoff

    1987-01-01

    Monolingual dictionaries have serious disadvantages in many language teaching situations; bilingual dictionaries are potentially more efficient and more motivating sources of information for language learners. (Author/CB)

  1. Dictionary-Based Tensor Canonical Polyadic Decomposition

    Science.gov (United States)

    Cohen, Jeremy Emile; Gillis, Nicolas

    2018-04-01

    To ensure interpretability of extracted sources in tensor decomposition, we introduce in this paper a dictionary-based tensor canonical polyadic decomposition which enforces one factor to belong exactly to a known dictionary. A new formulation of sparse coding is proposed which enables dictionary-based canonical polyadic decomposition of high-dimensional tensors. The benefits of using a dictionary in tensor decomposition models are explored both in terms of parameter identifiability and estimation accuracy. The performance of the proposed algorithms is evaluated on the decomposition of simulated data and the unmixing of hyperspectral images.

  2. The compilation of electronic dictionaries for the African languages ...

    African Journals Online (AJOL)

    Lexicographers increasingly acknowledge the enormous potential of electronic dictionaries. The great capacity and speed characteristic of electronic products, combined with enhanced query and data retrieval technology, pave the way to a new generation of dictionaries unimagined in the paper-dictionary era. It is amazing ...

  3. Effective Look-up Techniques to Approach a Monolingual Dictionary

    Directory of Open Access Journals (Sweden)

    Nauman Al Amin Ali El Sayed

    2013-05-01

    Full Text Available A dictionary is a learning tool that can help the language learner acquire great knowledge of and about a foreign language. Almost all language learners buy or at least possess, at one time, a monolingual or bilingual dictionary, to which they may refer to look up the meaning of words. Unfortunately, looking up the meaning of words seems to be regarded as the most important service that a dictionary is expected to provide to language learners. In fact, a dictionary provides its readers with much more data about language, such as word spelling, phonology, phonetics, etymology, stylistics and definitions, among other aspects. This paper sheds light on how the dictionary can teach its readers, with special focus on the monolingual dictionary. Hence, the discussion in this paper centres on how dictionaries can teach students rather than on how students can learn from them.

  4. A Study of Comparatively Low Achievement Students' Bilingualized Dictionary Use and Their English Learning

    Science.gov (United States)

    Chen, Szu-An

    2016-01-01

    This study investigates bilingualized dictionary use of Taiwanese university students. It aims to examine EFL learners' overall dictionary use behavior and their perspectives on book dictionary as well as the necessity of advance guidance in using dictionaries. Data was collected through questionnaires and analyzed by SPSS 15.0. Findings indicate…

  5. Tensor Dictionary Learning for Positive Definite Matrices.

    Science.gov (United States)

    Sivalingam, Ravishankar; Boley, Daniel; Morellas, Vassilios; Papanikolopoulos, Nikolaos

    2015-11-01

    Sparse models have proven to be extremely successful in image processing and computer vision. However, a majority of the effort has been focused on sparse representation of vectors and low-rank models for general matrices. The success of sparse modeling, along with popularity of region covariances, has inspired the development of sparse coding approaches for these positive definite descriptors. While in earlier work, the dictionary was formed from all, or a random subset of, the training signals, it is clearly advantageous to learn a concise dictionary from the entire training set. In this paper, we propose a novel approach for dictionary learning over positive definite matrices. The dictionary is learned by alternating minimization between sparse coding and dictionary update stages, and different atom update methods are described. A discriminative version of the dictionary learning approach is also proposed, which simultaneously learns dictionaries for different classes in classification or clustering. Experimental results demonstrate the advantage of learning dictionaries from data both from reconstruction and classification viewpoints. Finally, a software library is presented comprising C++ binaries for all the positive definite sparse coding and dictionary learning approaches presented here.
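
    The alternation between sparse coding and dictionary update described in this abstract can be sketched for ordinary vectors. This is a deliberately simplified illustration, not the paper's method: it works on plain vectors rather than positive definite matrices, restricts each code to a single atom (sparsity 1), and all function names are hypothetical.

```python
import math
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def normalize(v):
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v] if norm > 0 else list(v)

def sparse_code(signals, atoms):
    """Sparse coding stage: pick the single best unit-norm atom per signal."""
    codes = []
    for x in signals:
        k = max(range(len(atoms)), key=lambda j: abs(dot(atoms[j], x)))
        codes.append((k, dot(atoms[k], x)))  # (atom index, coefficient)
    return codes

def update_dictionary(signals, codes, atoms):
    """Dictionary update stage: least-squares refit of every atom in use."""
    new_atoms = []
    for j, old in enumerate(atoms):
        acc = [0.0] * len(old)
        for x, (k, c) in zip(signals, codes):
            if k == j:
                acc = [a + c * xi for a, xi in zip(acc, x)]
        # Unused atoms are left unchanged.
        new_atoms.append(normalize(acc) if any(acc) else old)
    return new_atoms

def residual(signals, codes, atoms):
    """Total squared reconstruction error for the given codes and atoms."""
    return sum(
        sum((xi - c * di) ** 2 for xi, di in zip(x, atoms[k]))
        for x, (k, c) in zip(signals, codes)
    )

def learn_dictionary(signals, n_atoms, n_iter=10, seed=0):
    """Alternate the two stages, starting from randomly chosen signals."""
    rng = random.Random(seed)
    atoms = [normalize(x) for x in rng.sample(signals, n_atoms)]
    history = []
    for _ in range(n_iter):
        codes = sparse_code(signals, atoms)
        history.append(residual(signals, codes, atoms))
        atoms = update_dictionary(signals, codes, atoms)
    return atoms, history

# Toy data lying exactly on two directions in the plane.
signals = [[s, 0.1 * s] for s in (1, 2, 3)] + [[0.1 * s, s] for s in (1, 2, 3)]
atoms, history = learn_dictionary(signals, n_atoms=2)
```

    Each stage can only lower the reconstruction error, so `history` is non-increasing; on this toy data the two learned atoms end up aligned with the two underlying directions.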

  6. Alternatively Constrained Dictionary Learning For Image Superresolution.

    Science.gov (United States)

    Lu, Xiaoqiang; Yuan, Yuan; Yan, Pingkun

    2014-03-01

    Dictionaries are crucial in sparse coding-based algorithms for image superresolution. Sparse coding is a typical unsupervised learning method to study the relationship between the patches of high- and low-resolution images. However, most of the sparse coding methods for image superresolution fail to simultaneously consider the geometrical structure of the dictionary and the corresponding coefficients, which may result in noticeable superresolution reconstruction artifacts. In other words, when a low-resolution image and its corresponding high-resolution image are represented in their feature spaces, the two sets of dictionaries and the obtained coefficients have intrinsic links, which have not yet been well studied. Motivated by developments in nonlocal self-similarity and manifold learning, a novel sparse coding method is reported to preserve the geometrical structure of the dictionary and the sparse coefficients of the data. Moreover, the proposed method can preserve the incoherence of dictionary entries and provide the sparse coefficients and learned dictionary from a new perspective, which have both reconstruction and discrimination properties to enhance the learning performance. Furthermore, to utilize the model of the proposed method more effectively for single-image superresolution, this paper also proposes a novel dictionary-pair learning method, named two-stage dictionary training. Extensive experiments are carried out on a large set of images, comparing with other popular algorithms for the same purpose, and the results clearly demonstrate the effectiveness of the proposed sparse representation model and the corresponding dictionary learning algorithm.

  7. Dictionary of materials testing

    International Nuclear Information System (INIS)

    Goedecke, W.

    1992-01-01

    This trilingual dictionary contains about 12000 terms from the field of non-destructive and destructive materials testing; the English and French terms can be looked up in two separate, alphabetical indexes. The compilation also presents terms from related fields such as quality control, production control, environmental protection and radiological protection, and wherever appropriate in the context from the fields of physics, chemistry, mathematics and electronic data processing. (HP) [de

  8. Unsupervised behaviour-specific dictionary learning for abnormal event detection

    DEFF Research Database (Denmark)

    Ren, Huamin; Liu, Weifeng; Olsen, Søren Ingvor

    2015-01-01

    the training data is only a small proportion of the surveillance data. Therefore, we propose behavior-specific dictionaries (BSD) through unsupervised learning, pursuing atoms from the same type of behavior to represent one behavior dictionary. To further improve the dictionary by introducing information from...... potential infrequent normal patterns, we refine the dictionary by searching ‘missed atoms’ that have compact coefficients. Experimental results show that our BSD algorithm outperforms state-of-the-art dictionaries in abnormal event detection on the public UCSD dataset. Moreover, BSD has fewer false alarms...

  9. The Construction of Online Specialized Dictionaries

    DEFF Research Database (Denmark)

    Nielsen, Sandro; Fuertes-Olivera, Pedro A.; Bergenholtz, Henning

    2013-01-01

    the needs of translators (primary user group), accountants and financial experts (secondary user group), and students of accountancy, students of translation, journalism and interested laypersons (tertiary user group). It addresses the issue as a lexicographical problem and makes comments on the decisions...... laypersons, and use situations, typically cognitive-oriented and communicative-oriented types (Bergenholtz/Tarp 2003, 2004). This paper follows suit and elaborates on the selection of Spanish lemmas in a particular dictionary project: the Accounting Dictionaries. This dictionary project aims to satisfy...... taken by a lemma selection team who based their decisions on the principle of relevance. This principle states that the selection and treatment of dictionary data are directly related to the nature of the data to be included, the function(s) of the dictionary and the user situation in which...

  10. 2008-09 National Rivers and Streams Assessment Fish Tissue Data Dictionary

    Science.gov (United States)

    The Office of Science and Technology (OST) is providing the fish tissue results from the 2008-09 National Rivers and Streams Assessment (NRSA). This document includes the “data dictionary” for Mercury, Selenium, PBDEs, PCBs, Pesticides and PFCs.

  11. 2010 Great Lakes Human Health Fish Tissue Study Fish Tissue Data Dictionary

    Science.gov (United States)

    The Office of Science and Technology (OST) is providing the fish tissue results from the 2010 Great Lakes Human Health Fish Tissue Study (GLHHFTS). This document includes the “data dictionary” for Mercury, PFC, PBDE and PCBs.

  12. Making the Dictionary of the Frisian Language available in the Dutch historical dictionary portal

    NARCIS (Netherlands)

    Depuydt, Katrien; de Does, Jesse; Duijff, P.; Sijens, H.; Odijk, Jan; van Hessen, Arjan

    2017-01-01

    The main goal of the GTB-WFT project was to publish the monumental Dictionary of the Frisian Language GTB-WFT (Wurdboek fan 'e Fryske Taal, WFT) in the CLARIN research infrastructure, according to open, CLARIN-compliant standards. This has been achieved by 1) curation of the dictionary data,

  13. Greedy Deep Dictionary Learning

    OpenAIRE

    Tariyal, Snigdha; Majumdar, Angshul; Singh, Richa; Vatsa, Mayank

    2016-01-01

    In this work we propose a new deep learning tool called deep dictionary learning. Multi-level dictionaries are learnt in a greedy fashion, one layer at a time. This requires solving a simple (shallow) dictionary learning problem, the solution to which is well known. We apply the proposed technique to some benchmark deep learning datasets. We compare our results with other deep learning tools such as stacked autoencoders and deep belief networks, and with state-of-the-art supervised dictionary learning t...

  14. NHD Event Data Dictionary

    Science.gov (United States)

    The Reach Address Database (RAD) stores reach address information for each Water Program feature that has been linked to the underlying surface water features (streams, lakes, etc.) in the National Hydrography Dataset Plus (NHDPlus) dataset.

  15. Discriminative Bayesian Dictionary Learning for Classification.

    Science.gov (United States)

    Akhtar, Naveed; Shafait, Faisal; Mian, Ajmal

    2016-12-01

    We propose a Bayesian approach to learn discriminative dictionaries for sparse representation of data. The proposed approach infers probability distributions over the atoms of a discriminative dictionary using a finite approximation of the Beta Process. It also computes sets of Bernoulli distributions that associate class labels with the learned dictionary atoms. This association signifies the selection probabilities of the dictionary atoms in the expansion of class-specific data. Furthermore, the non-parametric character of the proposed approach allows it to infer the correct size of the dictionary. We exploit the aforementioned Bernoulli distributions in separately learning a linear classifier. The classifier uses the same hierarchical Bayesian model as the dictionary, which we present along with the analytical inference solution for Gibbs sampling. For classification, a test instance is first sparsely encoded over the learned dictionary and the codes are fed to the classifier. We performed experiments on face and action recognition and on object and scene-category classification using five public datasets, and compared the results with state-of-the-art discriminative sparse representation approaches. Experiments show that the proposed Bayesian approach consistently outperforms the existing approaches.

  16. Dictionary Based Segmentation in Volumes

    DEFF Research Database (Denmark)

    Emerson, Monica Jane; Jespersen, Kristine Munk; Jørgensen, Peter Stanley

    Method for supervised segmentation of volumetric data. The method is trained from manual annotations, and these annotations make the method very flexible, which we demonstrate in our experiments. Our method infers label information locally by matching the pattern in a neighborhood around a voxel ...... to a dictionary, and hereby accounts for the volume texture....

  17. Dictionary of distances

    CERN Document Server

    Deza, Michel-Marie

    2006-01-01

    This book comes out of need and urgency (expressed especially in areas of Information Retrieval with respect to Image, Audio, Internet and Biology) to have a working tool to compare data. The book will provide a powerful resource for all researchers using Mathematics as well as for mathematicians themselves. In the time when over-specialization and terminology fences isolate researchers, this Dictionary tries to be "centripedal" and "oikoumeni", providing some access and altitude of vision but without taking the route of scientific vulgarisation. This attempted balance is the main philosophy

  18. Dictionary Visions, Research and Practice

    DEFF Research Database (Denmark)

    This book is about dictionaries and dictionary making. In six thematic sections it presents nineteen contributions covering a wide field within lexicography: Online Lexicography, Dictionary Structure, Phraseology in Dictionaries, LSP Lexicography, Dictionaries and the User, plus Etymology, History...... and Culture in Lexicography. Some chapters focus on theoretical aspects, others report on dictionary work in the making, and still others compare and analyze existing dictionaries. Common to all authors, however, is the concern for the dictionary user. Trivial as it may seem, the fact that dictionaries...

  19. Teaching Dictionary Skills through a Slang Dictionary.

    Science.gov (United States)

    Steed, Stanley M.

    A unit for teaching dictionary skills through the compilation of a slang dictionary was written with the purpose of providing an inductive learning situation. The students are to begin by defining slang usage and bringing in slang words and definitions on cards. Small groups are to be formed to evaluate the definitions and make additions. In…

  20. French Dictionaries. Series: Specialised Bibliographies.

    Science.gov (United States)

    Klaar, R. M.

    This is a list of French monolingual, French-English and English-French dictionaries available in December 1975. Dictionaries of etymology, phonetics, place names, proper names, and slang are included, as well as dictionaries for children and dictionaries of Belgian, Canadian, and Swiss French. Most other specialized dictionaries, encyclopedias,…

  1. The role of linguists in planning and making dictionaries in modern information society

    DEFF Research Database (Denmark)

    Bergenholtz, Henning

    2013-01-01

    , but not necessarily for all, e.g. not for meaning items, collocations or synonym items. This will be discussed starting from the description of a database and the concept for one polyfunctional and five monofunctional monolingual general dictionaries. The first monofunctional dictionary is a reception dictionary...... type of expert is best suited to make modern dictionaries in the information age? This question is quite complex. We have different kinds of dictionaries, e.g. general language and special language dictionaries. And we have different kinds of lexicographers, e.g. 1. metalexicographer, 2. practical...... lexicographer making the concept for a planned dictionary, 3. lexicographer making the concrete dictionary articles or parts of them. For (2) a linguist is of course not the natural choice. For (3) we need linguists for certain kinds of dictionaries and certain data types in general languages...

  2. Specification of Data Dictionary Definition of National Water Resources Monitoring and Controlling Capacity-Building

    Institute of Scientific and Technical Information of China (English)

    徐荣嵘; 吴永祥; 雷四华; 王高旭

    2015-01-01

    To standardize the field definitions in the data dictionary of the national water resources monitoring capacity-building project, eliminate ambiguity and improve reliability, this paper draws on the theory of definition in logic and on the practical problems encountered in writing data dictionary field definitions. It sorts the field definitions along two dimensions, the type and the manner of definition, discusses the applicable conditions and limitations of each type and method, summarizes the regularities of writing field definitions, and sets out a standard procedure for writing them, as a reference for future revisions of the data dictionary.

  3. Bilingual Dictionaries for Communication in the Domain of Economics: Function-Based Translation Dictionaries

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2015-01-01

    With their focus on terms, bilingual dictionaries are important tools for translating texts on economics. The most common type is the multi-field dictionary covering several related subject fields; however, multi-field dictionaries treat one or few fields extensively thereby neglecting other fields...... in contrast to single-field and sub-field dictionaries. Furthermore, recent research shows that economic translation is not limited to terms so lexicographers who identify and analyse the needs of translators, usage situations and stages in translating economic texts will have a sound basis...... for designing their lexicographic tools. The function theory allows lexicographers to study these basics so that they can offer translation tools to the domain of economics. Dictionaries should include data about terms, their grammatical properties, and their combinatorial potential as well as language...

  4. Fast Low-Rank Shared Dictionary Learning for Image Classification.

    Science.gov (United States)

    Tiep Huu Vu; Monga, Vishal

    2017-11-01

    Despite the fact that different objects possess distinct class-specific features, they also usually share common patterns. This observation has been exploited partially in a recently proposed dictionary learning framework by separating the particularity and the commonality (COPAR). Inspired by this, we propose a novel method to explicitly and simultaneously learn a set of common patterns as well as class-specific features for classification with more intuitive constraints. Our dictionary learning framework is hence characterized by both a shared dictionary and particular (class-specific) dictionaries. For the shared dictionary, we enforce a low-rank constraint, i.e., claim that its spanning subspace should have low dimension and the coefficients corresponding to this dictionary should be similar. For the particular dictionaries, we impose on them the well-known constraints stated in the Fisher discrimination dictionary learning (FDDL). Furthermore, we develop new fast and accurate algorithms to solve the subproblems in the learning step, accelerating its convergence. The said algorithms could also be applied to FDDL and its extensions. The efficiencies of these algorithms are theoretically and experimentally verified by comparing their complexities and running time with those of other well-known dictionary learning methods. Experimental results on widely used image data sets establish the advantages of our method over the state-of-the-art dictionary learning methods.

  5. Women in the Dictionary of Danish Insular Dialects

    DEFF Research Database (Denmark)

    Hovmark, Henrik

    In this presentation, I discuss the representation of female domains in the Dictionary of Danish Insular Dialects (DID; Ømålsordbogen), an historical dictionary giving thorough descriptions of the dialects on the Danish isles 1750-1945. First...... volume appeared in 1992 but data collection and structure of the dictionary date back to the 1920s. It has been pointed out that the language, thoughts and domains of women until recently have been strongly neglected in for instance literary studies and history – and that the representations have been...... characterised by stereotypical images. This point has also been made as regards dictionaries (Hageberg 1990, choice of vocabulary; Mattisson 2006, data and examples). As for DID, however, female domains (‘food’, ‘clothes’, ‘milk’ etc.) are thoroughly registered in the collections and described in the dictionary...

  6. Toward better public health reporting using existing off the shelf approaches: A comparison of alternative cancer detection approaches using plaintext medical data and non-dictionary based feature selection.

    Science.gov (United States)

    Kasthurirathne, Suranga N; Dixon, Brian E; Gichoya, Judy; Xu, Huiping; Xia, Yuni; Mamlin, Burke; Grannis, Shaun J

    2016-04-01

    Increased adoption of electronic health records has resulted in increased availability of free text clinical data for secondary use. A variety of approaches to obtain actionable information from unstructured free text data exist. These approaches are resource intensive, inherently complex and rely on structured clinical data and dictionary-based approaches. We sought to evaluate the potential to obtain actionable information from free text pathology reports using routinely available tools and approaches that do not depend on dictionary-based approaches. We obtained pathology reports from a large health information exchange and evaluated the capacity to detect cancer cases from these reports using 3 non-dictionary feature selection approaches, 4 feature subset sizes, and 5 clinical decision models: simple logistic regression, naïve Bayes, k-nearest neighbor, random forest, and J48 decision tree. The performance of each decision model was evaluated using sensitivity, specificity, accuracy, positive predictive value, and area under the receiver operating characteristic (ROC) curve. Decision models parameterized using automated, informed, and manual feature selection approaches yielded similar results. Furthermore, non-dictionary classification approaches identified cancer cases present in free text reports with evaluation measures approaching and exceeding 80-90% for most metrics. Our methods are feasible and practical approaches for extracting substantial information value from free text medical data, and the results suggest that these methods can perform on par with, if not better than, existing dictionary-based approaches. Given that public health agencies are often under-resourced and lack the technical capacity for more complex methodologies, these results represent potentially significant value to the public health field.
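
    As an illustration of the evaluation measures named in this abstract, the following is a minimal sketch of how sensitivity, specificity, accuracy and positive predictive value can be derived from confusion-matrix counts. The function name `confusion_metrics` and the example counts are hypothetical and are not taken from the study.

```python
import math


def _ratio(numerator, denominator):
    """Return numerator / denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else math.nan


def confusion_metrics(tp, fp, tn, fn):
    """Compute four standard binary-classification measures.

    tp, fp, tn, fn are counts of true positives, false positives,
    true negatives and false negatives (hypothetical inputs).
    """
    return {
        # Share of actual positives that were detected.
        "sensitivity": _ratio(tp, tp + fn),
        # Share of actual negatives that were correctly rejected.
        "specificity": _ratio(tn, tn + fp),
        # Share of all cases that were classified correctly.
        "accuracy": _ratio(tp + tn, tp + fp + tn + fn),
        # Share of predicted positives that were truly positive.
        "ppv": _ratio(tp, tp + fp),
    }


if __name__ == "__main__":
    # Made-up counts, used only to show the call.
    for name, value in confusion_metrics(tp=80, fp=10, tn=90, fn=20).items():
        print(f"{name}: {value:.3f}")
```

    The area under the ROC curve is left out of this sketch because it needs ranked prediction scores rather than a single set of counts.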

  7. The SMAP Dictionary Management System

    Science.gov (United States)

    Smith, Kevin A.; Swan, Christoper A.

    2014-01-01

    The Soil Moisture Active Passive (SMAP) Dictionary Management System is a web-based tool to develop and store a mission dictionary. A mission dictionary defines the interface between a ground system and a spacecraft. In recent years, mission dictionaries have grown in size and scope, making it difficult for engineers across multiple disciplines to coordinate the dictionary development effort. The Dictionary Management System addresses these issues by placing all dictionary information in one place, taking advantage of the efficiencies inherent in co-locating what were once disparate dictionary development efforts.

  8. Weighted Discriminative Dictionary Learning based on Low-rank Representation

    International Nuclear Information System (INIS)

    Chang, Heyou; Zheng, Hao

    2017-01-01

    Low-rank representation has been widely used in the field of pattern classification, especially when both training and testing images are corrupted with large noise. The dictionary plays an important role in low-rank representation. With respect to the semantic dictionary, the optimal representation matrix should be block-diagonal. However, traditional low-rank representation based dictionary learning methods cannot effectively exploit the discriminative information between data and dictionary. To address this problem, this paper proposes weighted discriminative dictionary learning based on low-rank representation, where a weighted representation regularization term is constructed. The regularization associates label information of both training samples and dictionary atoms, and encourages a discriminative representation with class-wise block-diagonal structure, which can further improve the classification performance where both training and testing images are corrupted with large noise. Experimental results demonstrate advantages of the proposed method over the state-of-the-art methods.

  9. An augmented Lagrangian multi-scale dictionary learning algorithm

    Directory of Open Access Journals (Sweden)

    Ye Meng

    2011-01-01

    Learning overcomplete dictionaries for sparse signal representation has become a hot topic among researchers in recent years, while most of the existing approaches have a serious problem in that they always lead to local minima. In this article, we present a novel augmented Lagrangian multi-scale dictionary learning algorithm (ALM-DL), which is achieved by first recasting the constrained dictionary learning problem into an AL scheme, and then updating the dictionary after each inner iteration of the scheme, during which a majorization-minimization technique is employed for solving the inner subproblem. Refining the dictionary from low scale to high makes the proposed method less dependent on the initial dictionary, hence avoiding local optima. Numerical tests for synthetic data and denoising applications on real images demonstrate the superior performance of the proposed approach.

  10. Monolingual accounting dictionaries for EFL text production

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2006-01-01

    Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types...... text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items...... of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL...

  11. Monolingual Accounting Dictionaries for EFL Text Production

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2009-01-01

    Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types...... text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items...... of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL...

  12. Which Dictionary? A Review of the Leading Learners' Dictionaries.

    Science.gov (United States)

    Nesi, Hilary

    Three major dictionaries designed for learners of English as a second language are reviewed, their elements and approaches compared and evaluated, their usefulness for different learners discussed, and recommendations for future dictionary improvement made. The dictionaries in question are the "Oxford Advanced Learner's Dictionary," the…

  13. Do Dictionaries Help Students Write?

    Science.gov (United States)

    Nesi, Hilary

    Examples are given of real lexical errors made by learner writers, and consideration is given to the way in which three learners' dictionaries could deal with the lexical items that were misused. The dictionaries were the "Oxford Advanced Learner's Dictionary," the "Longman Dictionary of Contemporary English," and the "Chambers Universal Learners'…

  14. Dictionary of nuclear power

    International Nuclear Information System (INIS)

    Koelzer, W.

    2012-04-01

    The updated version (April 2012) of the dictionary on nuclear power includes all updates and new entries since the last version of 2001. The original publication dates from 1980. The dictionary includes definitions, terms, measuring units and helpful information on the current state of knowledge concerning nuclear power, nuclear facilities, and radiation protection.

  15. Dictionary as Database.

    Science.gov (United States)

    Painter, Derrick

    1996-01-01

    Discussion of dictionaries as databases focuses on the digitizing of The Oxford English Dictionary (OED) and the use of Standard Generalized Markup Language (SGML). Topics include the creation of a consortium to digitize the OED, document structure, relational databases, text forms, sequence, and discourse. (LRW)

  16. Dictionary of Multicultural Education.

    Science.gov (United States)

    Grant, Carl A., Ed.; Ladson-Billings, Gloria, Ed.

    The focus of this dictionary is the meanings and perspectives of various terms that are used in multicultural education. Contributors have often addressed the literal meanings of words and terms as well as contextual meanings and examples that helped create those meanings. Like other dictionaries, this one is arranged alphabetically, but it goes…

  17. Dictionary of Marketing Terms.

    Science.gov (United States)

    Everhardt, Richard M.

    A listing of words and definitions compiled from more than 10 college and high school textbooks is presented in this dictionary of marketing terms. Over 1,200 entries of terms used in retailing, wholesaling, economics, and investments are included. This dictionary was designed to aid both instructors and students to better understand the…

  18. Design of a Function-Based Internet Accounting Dictionary

    DEFF Research Database (Denmark)

    Nielsen, Sandro; Mourier, Lise

    2007-01-01

    The traditional definition of a dictionary needs to be replaced by one that defines the dictionary in terms of lexicographic functions, data and structures. These must be linked to the intended user groups, the users’ linguistic and factual competences and their needs in the relevant situations o...... to the user in communication-oriented situations within a register-specific context such as accounting....

  19. L2 writing assistants and context-aware dictionaries: New ...

    African Journals Online (AJOL)

    Dictionaries are increasingly integrated into other tools designed to assist the reading, writing and translation of texts. Write Assistant is a newly developed tool aimed at assisting people writing in a second language. It feeds on big data taken in from corpora and digital dictionaries. The paper discusses the philosophy ...

  20. Revisiting lemma lists in Swahili dictionaries | Wójtowicz | Lexikos

    African Journals Online (AJOL)

    When compiling a dictionary, a lexicographer has a set of decisions to make, ranging from drawing up a lemma list to such issues as formatting a dictionary entry. Relying on corpus data while designing a lemma list and describing entries is standard in present-day lexicography, but there are still decisions, like the choice of ...

  1. Usage Notes in the Oxford American Dictionary.

    Science.gov (United States)

    Berner, R. Thomas

    1981-01-01

    Compares the "Oxford American Dictionary" with the "American Heritage Dictionary." Examines the dictionaries' differences in philosophies of language, introductory essays, and usage notes. Concludes that the "Oxford American Dictionary" is too conservative, paternalistic, and dogmatic for the 1980s. (DMM)

  2. Dictionary of electrochemistry. Lexikon Elektrochemie

    Energy Technology Data Exchange (ETDEWEB)

    Hibbert, D B; James, A M

    1987-01-01

    Electrochemistry, officially a branch of physical chemistry, is an interdisciplinary field bordering on biology, physics, metallurgy and other fields of engineering. This glossary and dictionary presents information and basic knowledge on recent developments in electrochemistry, i.e. fuel cells, corrosion, energy conversion, electrode kinetics, ion-selective electrodes, and bioelectrochemistry. The user is given a short and precise definition of each term, its importance in different fields of science and, in the case of measuring units, a description of the method of measurement. Electrochemical and thermodynamic equations are presented without formal proof, but with an indication of their applications and limitations. Access to relevant information is facilitated by drawings and tables. Bibliographic references are plentiful, and SI units are used throughout the book. A dictionary in the annex makes it easier for the user to find English-language literature. The book may be a useful reference book for biologists, microbiologists, biochemists, chemists, pharmacists, geologists, physicists, technicians, and especially metallurgists.

  3. Explanatory Notes in LSP Dictionaries

    DEFF Research Database (Denmark)

    Laursen, Anne Lise

    2010-01-01

    Translators of LSP texts often have to face the problem of not being subject-field experts and not being able to do sufficient research into the domain in question because of short delivery deadlines. The common denominator often seems to be cognitive shortcomings. Rita Temmerman (2000......) challenges the traditional terminology concept of 'definition' and suggests 'templates of understanding' with a varying number of modules of information for different units of understanding and different perspectives. The theory of lexicographic functions likewise operates with flexible dictionary concepts...... for specific users and user situations (Tarp 2004) advocating user-oriented subject-matter information as a natural category of translation dictionaries. Via further operational tools for selection and specification of subject-matter data, like the recommendations for adapting definitions to non-experts made...

  4. Dictionaries: British and American. The Language Library.

    Science.gov (United States)

    Hulbert, James Root

    An account of the dictionaries, great and small, of the English-speaking world is given in this book. Subjects covered include the origin of English dictionaries, early dictionaries, Noah Webster and his successors to the present, abridged dictionaries, "The Oxford English Dictionary" and later dictionaries patterned after it, the…

  5. Bayesian nonparametric dictionary learning for compressed sensing MRI.

    Science.gov (United States)

    Huang, Yue; Paisley, John; Lin, Qin; Ding, Xinghao; Fu, Xueyang; Zhang, Xiao-Ping

    2014-12-01

    We develop a Bayesian nonparametric model for reconstructing magnetic resonance images (MRIs) from highly undersampled k-space data. We perform dictionary learning as part of the image reconstruction process. To this end, we use the beta process as a nonparametric dictionary learning prior for representing an image patch as a sparse combination of dictionary elements. The size of the dictionary and patch-specific sparsity pattern are inferred from the data, in addition to other dictionary learning variables. Dictionary learning is performed directly on the compressed image, and so is tailored to the MRI being considered. In addition, we investigate a total variation penalty term in combination with the dictionary learning model, and show how the denoising property of dictionary learning removes dependence on regularization parameters in the noisy setting. We derive a stochastic optimization algorithm based on Markov chain Monte Carlo for the Bayesian model, and use the alternating direction method of multipliers for efficiently performing total variation minimization. We present empirical results on several MRIs, which show that the proposed regularization framework can improve reconstruction accuracy over other methods.

  6. Multi-instance dictionary learning via multivariate performance measure optimization

    KAUST Repository

    Wang, Jim Jing-Yan

    2016-12-29

    The multi-instance dictionary plays a critical role in multi-instance data representation. Meanwhile, different multi-instance learning applications are evaluated by specific multivariate performance measures. For example, multi-instance ranking reports the precision and recall. It is not difficult to see that to obtain different optimal performance measures, different dictionaries are needed. This observation motivates us to learn performance-optimal dictionaries for this problem. In this paper, we propose a novel joint framework for learning the multi-instance dictionary and the classifier to optimize a given multivariate performance measure, such as the F1 score and precision at rank k. We propose to represent the bags as bag-level features via the bag-instance similarity, and learn a classifier in the bag-level feature space to optimize the given performance measure. We propose to minimize the upper bound of a multivariate loss corresponding to the performance measure, the complexity of the classifier, and the complexity of the dictionary, simultaneously, with regard to both the dictionary and the classifier parameters. In this way, the dictionary learning is regularized by the performance optimization, and a performance-optimal dictionary is obtained. We develop an iterative algorithm to solve this minimization problem efficiently using a cutting-plane algorithm and a coordinate descent method. Experiments on multi-instance benchmark data sets show its advantage over both traditional multi-instance learning and performance optimization methods.
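
    The two multivariate performance measures named in this abstract, the F1 score and precision at rank k, can be sketched in a few lines. This is only an illustration of the measures themselves, assuming binary 0/1 labels; it is not the authors' learning framework, and the function names and toy data are hypothetical.

```python
def f1_score(y_true, y_pred):
    """F1 score for binary labels (1 = positive, 0 = negative)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denominator = 2 * tp + fp + fn
    # By convention here, return 0.0 when there are no positives at all.
    return 2 * tp / denominator if denominator else 0.0


def precision_at_k(y_true, scores, k):
    """Fraction of true positives among the k highest-scored items."""
    if not 0 < k <= len(scores):
        raise ValueError("k must be between 1 and the number of items")
    # Indices of the items sorted by descending score, cut off at rank k.
    top_k = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    return sum(y_true[i] for i in top_k) / k


if __name__ == "__main__":
    # Toy labels, predictions and scores, used only to show the calls.
    labels = [1, 0, 1, 1, 0]
    predictions = [1, 1, 0, 1, 0]
    scores = [0.9, 0.8, 0.1, 0.7, 0.2]
    print(f1_score(labels, predictions))
    print(precision_at_k(labels, scores, k=3))
```

    Both measures depend on the whole set of predictions at once rather than on each example separately, which is why the abstract treats them as multivariate losses.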

  7. Multi-instance dictionary learning via multivariate performance measure optimization

    KAUST Repository

    Wang, Jim Jing-Yan; Tsang, Ivor Wai-Hung; Cui, Xuefeng; Lu, Zhiwu; Gao, Xin

    2016-01-01

    The multi-instance dictionary plays a critical role in multi-instance data representation. Meanwhile, different multi-instance learning applications are evaluated by specific multivariate performance measures. For example, multi-instance ranking reports the precision and recall. It is not difficult to see that to obtain different optimal performance measures, different dictionaries are needed. This observation motivates us to learn performance-optimal dictionaries for this problem. In this paper, we propose a novel joint framework for learning the multi-instance dictionary and the classifier to optimize a given multivariate performance measure, such as the F1 score and precision at rank k. We propose to represent the bags as bag-level features via the bag-instance similarity, and learn a classifier in the bag-level feature space to optimize the given performance measure. We propose to minimize the upper bound of a multivariate loss corresponding to the performance measure, the complexity of the classifier, and the complexity of the dictionary, simultaneously, with regard to both the dictionary and the classifier parameters. In this way, the dictionary learning is regularized by the performance optimization, and a performance-optimal dictionary is obtained. We develop an iterative algorithm to solve this minimization problem efficiently using a cutting-plane algorithm and a coordinate descent method. Experiments on multi-instance benchmark data sets show its advantage over both traditional multi-instance learning and performance optimization methods.

  8. EXFOR Dictionaries

    International Nuclear Information System (INIS)

    Schwerer, O.; Attree, P.M.; Lemmel, H.D.; Smith, P.M.

    1979-06-01

    A collection of all agreed-upon terms, keywords, codes, and abbreviations used in the exchange format (EXFOR) for the magnetic-tape exchange of nuclear reaction data between national and international nuclear data centres. In particular, the codes for defining the nuclear data categories (nuclear reactions, cross-sections, angular distributions, energy-spectra, resonance-parameters, etc.); headings and units of data columns; bibliographic references; institutions, laboratories, countries; measurement facilities and methods; system-identifier and information-identifier keywords; etc. are included.

  9. Trials for product's data management through RosettaNet using RosettaNet Technical Dictionary (RNTD) and Partner Interface Processes (PIP) 2A10

    Energy Technology Data Exchange (ETDEWEB)

    Shinya, H. [NEC Electronics Corp. (Japan); Katsumi, S. [Sony Corporation Corp. (Japan); Seigo, I. [Toshiba Corp. (Japan); Eita, I. [Fujitsu LTD. (Japan); Hisashi, F.; Mackin, J.W. [RosettaNet (Japan)

    2004-07-01

    We face a major challenge in identifying and tracking the amount of hazardous materials contained in electric and electronic (EE) products in accordance with various changing laws or customers' demands. Hence, the Material Composition Milestone Program (MatComp) was established in RosettaNet (USA) in January 2003. Many major electronics companies including NOKIA, Sony, NEC Electronics, Toshiba, Fujitsu, etc. are involved in the MatComp program. In addition, RosettaNet Japan has established an environmental information team to develop dictionaries for chemicals listed in the Joint Industry Guide (draft), which was published in September 2003. The RosettaNet RNTD and PIP 2A10 enable product material composition notification between trading partners. 2A10 allows us to describe the product in a hierarchical structure along with the product itself in a method similar to IMDS's requirements. In this paper, we will demonstrate the advantage of using the RosettaNet protocol for product data exchange between trading partners. Our goal is to complete a fully automated transaction to a design for environment (DfE).

  10. Macmillan English Dictionary: The End of Print?

    Directory of Open Access Journals (Sweden)

    Michael Rundell

    2014-12-01

    This paper reports on the Macmillan English Dictionary (MED) and its transition from printed book to digital-only resource. The background to this decision is explained in terms of changes both in technology and in dictionary-users’ behaviour: was this move inevitable, and will other dictionary publishers follow (sooner or later)? The possible downsides of abandoning print are discussed, alongside the advantages of digital media. As well as offering great opportunities (many still unexplored), being online also creates new demands. With easy access to numerous free reference sites, users searching for lexical information have a huge variety of options. Consequently, publishers are under pressure to continually broaden the range of content they supply, to improve the quality of the design and “user experience”, and above all to stay abreast of language change. And, it will be shown, there is much more to keeping a dictionary up to date than simply adding new words as they emerge. The imperative of moving to digital has generated a good deal of turbulence in the world of dictionary publishing (especially for commercial publishers who cannot run at a loss), and there is considerable uncertainty around the long-term survival of “the dictionary” as the autonomous object we are all familiar with. But humans’ communicative needs should ensure a continued demand for high-quality lexical data – even if this data is delivered and accessed in new and different ways.

  11. Historical Dictionaries and Historical Dictionary Research: Papers

    African Journals Online (AJOL)

    Resensies / Reviews. 309 ... and Historical Dictionary Research: Papers from the International ... "Cambridge, Trinity College Library MS 0.5.4: A Fifteenth-century ... There are among others ten types of manuscript collections that need attention, ..... The collection is rounded off by a selective index, supplementing the Table.

  12. Few-view image reconstruction with dual dictionaries

    International Nuclear Information System (INIS)

    Lu Yang; Zhao Jun; Wang Ge

    2012-01-01

    In this paper, we formulate the problem of computed tomography (CT) under sparsity and few-view constraints, and propose a novel algorithm for image reconstruction from few-view data utilizing the simultaneous algebraic reconstruction technique (SART) coupled with dictionary learning, sparse representation and total variation (TV) minimization on two interconnected levels. The main feature of our algorithm is the use of two dictionaries: a transitional dictionary for atom matching and a global dictionary for image updating. The atoms in the global and transitional dictionaries represent the image patches from high-quality and low-quality CT images, respectively. Experiments with simulated and real projections were performed to evaluate and validate the proposed algorithm. The results reconstructed using the proposed approach are significantly better than those using either SART or SART–TV. (paper)

  13. ENCYCLOPEDIC DEFINITIONS IN LANGUAGE DICTIONARIES – A TREASURY OF CULTURE

    Directory of Open Access Journals (Sweden)

    Valentyna Skybina

    2015-10-01

    Full Text Available This paper discusses the encyclopedic module of definitions in language dictionaries as a source of historical and cultural information. The main aim of the study is to reveal and compare the encyclopedic modules of definitions in early dictionaries of Australian and Indian English. The method applied consists of an analysis of the definitions and a review of the citations. The data were selected from two dictionaries on historical principles – Austral English (Morris, 1898) and Hobson-Jobson (Yule and Burnell, 1886). The corpus consists of 320 and 292 articles respectively. The study showed that in both dictionaries the encyclopedic module of the definitions overshadows the linguistic one. At the same time, the specificity of the nascent varieties of English and the particularities of the linguistic situation in Australia and India determined the framework of these dictionaries, mainly the criteria for selecting entries and, as a consequence, the lexical domains covered by the encyclopedic modules of the definitions.

  14. Navajo-English Dictionary.

    Science.gov (United States)

    Wall, Leon; Morgan, William

    A brief summary of the sound system of the Navajo language introduces this Navajo-English dictionary. Diacritical markings and an English definition are given for each Navajo word. Words are listed alphabetically by Navajo sound. (VM)

  15. Online Law Dictionaries

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2012-01-01

    Online dictionaries that assist users in writing legal texts in English as a foreign language are important lexicographic tools. They can help law students bridge the factual and linguistic gaps between the two legal universes involved. However, existing online law dictionaries with English...... as the target language primarily focus on terms, but students also need to write the remainder of the texts in factually and linguistically correct English. It is therefore important to have a sound theoretical foundation before embarking on a dictionary project that aims to help law students communicate...... in English as a foreign language. The function theory of lexicography offers an appropriate basis as it focuses on three key concepts: user needs, user competences, and user situations. It is proposed that online dictionaries should be designed to satisfy the lexicographically relevant user needs...

  16. A timeless music dictionary

    African Journals Online (AJOL)

    R.B. Ruthven

    Abstract: A music dictionary for the Internet fulfils the same functions as printed .... This does not mean that there cannot be a cognitive gain through a communi- ... 2006, and has, since its completion in August 2006, been accessible free of.

  17. Dictionary of nuclear power

    International Nuclear Information System (INIS)

    Koelzer, W.

    2012-06-01

    The updated version (June 2012) of the dictionary on nuclear power includes all updates and new entries since the last version of 2001. The original publication dates from 1980. The dictionary includes definitions, terms, measuring units and helpful information on the current state of knowledge concerning nuclear power, the nuclear fuel cycle, nuclear facilities, radioactive waste management, nuclear physics, reactor physics, isotope production, biological radiation effects, and radiation protection.

  18. Dictionary descent in optimization

    OpenAIRE

    Temlyakov, Vladimir

    2015-01-01

    The problem of convex optimization is studied. Usually in convex optimization the minimization is over a d-dimensional domain. Very often the convergence rate of an optimization algorithm depends on the dimension d. The algorithms studied in this paper utilize dictionaries instead of a canonical basis used in the coordinate descent algorithms. We show how this approach allows us to reduce dimensionality of the problem. Also, we investigate which properties of a dictionary are beneficial for t...
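
    The contrast drawn above between coordinate descent (canonical basis) and descent over a dictionary can be illustrated with a small sketch. This is not the algorithm from the paper: the quadratic objective, the greedy selection rule, the exact line search, and the example dictionary are all assumptions chosen for illustration.

```python
# Illustrative sketch only: greedy descent over a dictionary of directions.
# The objective, selection rule, and dictionary below are assumptions,
# not the method analysed in the paper.

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def objective(x, target):
    """f(x) = 0.5 * ||x - target||^2, a simple convex test function."""
    return 0.5 * sum((xi - ti) ** 2 for xi, ti in zip(x, target))

def gradient(x, target):
    """Gradient of the quadratic objective above."""
    return [xi - ti for xi, ti in zip(x, target)]

def dictionary_descent(target, dictionary, steps):
    """Greedy descent: at each step move along the unit-norm dictionary
    element most correlated with the current gradient."""
    x = [0.0] * len(target)
    for _ in range(steps):
        grad = gradient(x, target)
        best = max(dictionary, key=lambda g: abs(dot(grad, g)))
        slope = dot(grad, best)
        if abs(slope) < 1e-12:
            break  # no dictionary direction decreases f any further
        # For this quadratic and a unit-norm direction, the exact
        # line-search step length equals the slope.
        x = [xi - slope * gi for xi, gi in zip(x, best)]
    return x

s = 2 ** -0.5
target = [3.0, 3.0, 0.0, 0.0]
canonical = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
# Hypothetical dictionary: the canonical basis plus one diagonal direction.
dictionary = canonical + [[s, s, 0.0, 0.0]]

x_dict = dictionary_descent(target, dictionary, steps=1)
x_coord = dictionary_descent(target, canonical, steps=1)
print(objective(x_dict, target), objective(x_coord, target))
```

    In this toy case a single step along the diagonal element reaches the minimizer, whereas one coordinate step fixes only one of the two nonzero coordinates. The point is only that a well-chosen dictionary can make progress independent of the ambient dimension; which dictionary properties guarantee this is the subject of the paper, not of this sketch.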

  19. Dictionary quality and dictionary design: a methodology for ...

    African Journals Online (AJOL)

    Although recent dictionaries for the ESL market have been praised for their innovative design features, the prime concern of users, lexicographers and metalexicographers is the functional quality of the dictionary products provided for the market. The functional quality of dictionaries and the scientific assessment thereof ...

  20. Reviewing printed and electronic dictionaries

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2009-01-01

    Dictionary reviewing is an integral part of the lexicographic universe. However, lexicographers have called for generally applicable principles embracing both printed and electronic dictionaries. I propose that scholarly reviews contain information that is useful to their intended audiences...

  1. PHRASEOLOGY IN CHAKAVIAN DIALECTOLOGICAL DICTIONARIES

    Directory of Open Access Journals (Sweden)

    Sanja Bogović

    1997-01-01

    Full Text Available The paper analyses the presentation and processing of idioms in eight dictionaries of Chakavian organic language systems. According to the systematicality of their presentation and elaboration, the dictionaries are divided into three groups: systematic, partially systematic and non-systematic. Their analysis has shown the prevalence of dictionaries with partially systematic presentation and processing of idioms. Based on the results of the analysis, the paper presents procedures for a systematic presentation of idioms in organic language dictionaries.

  2. The concept of 'dictionary usage'

    DEFF Research Database (Denmark)

    Bergenholtz, Henning; Tarp, Sven

    2004-01-01

    that users that might have a bad dictionary culture feel that the dictionaries meet their needs. In doing so, you generate inbreeding and block the necessary innovation. This is the unavoidable result of a practice that pays excessive attention to the study of existing dictionaries and doesn't endeavour...... to produce new concepts and to introduce a new dictionary culture. It is, in other words, a poor lexicography....

  3. Dictionaries of Canadian English

    Directory of Open Access Journals (Sweden)

    John Considine

    2011-10-01

    Full Text Available

    Abstract: The lexicographical record of English in Canada began with wordlists of the late eighteenth, nineteenth, and early twentieth centuries. From the beginning of the twentieth century onwards, the general vocabulary of English in Canada has been represented in bilingual and monolingual dictionaries, often adapted from American or British dictionaries. In the 1950s, several important projects were initiated, resulting in the publication of general dictionaries of English in Canada, and of dictionaries of Canadianisms and of the vocabulary of particular regions of Canada. This article gives an overview of these dictionaries and of their reception, contextualizing them in the larger picture of the lexicography of Canada's other official language, French, and of a number of its non-official languages. It concludes by looking at the future of English-language lexicography in Canada, and by observing that although it has, at its best, reached a high degree of sophistication, there are still major opportunities waiting to be taken.

    Keywords: DICTIONARY, LEXICOGRAPHY, CANADIAN ENGLISH, CANADIANISMS, NATIONAL DICTIONARIES, CANADIAN FRENCH, CANADIAN FIRST NATIONS LANGUAGES, BILINGUAL DICTIONARIES, REGIONAL DICTIONARIES, UNFINISHED DICTIONARY PROJECTS

    Opsomming: Woordeboeke van Kanadese Engels. Die leksikografiese optekening van Engels in Kanada begin met woordelyste van die laat agtiende, neëntiende en vroeë twintigste eeue. Van die begin van die twintigste eeu af en verder, is die algemene woordeskat van Engels weergegee in tweetalige en eentalige woordeboeke, dikwels met wysiginge ontleen aan Amerikaanse en Britse woordeboeke. In die 1950's is verskeie belangrike projekte onderneem wat gelei het tot die publikasie van algemene woordeboeke van Engels in Kanada, en van woordeboeke van Kanadeïsmes en van die woordeskat van bepaalde streke van Kanada. Hierdie artikel gee 'n oorsig van dié woordeboeke, en van hul ontvangs, deur

  4. Learning multimodal dictionaries.

    Science.gov (United States)

    Monaci, Gianluca; Jost, Philippe; Vandergheynst, Pierre; Mailhé, Boris; Lesage, Sylvain; Gribonval, Rémi

    2007-09-01

    Real-world phenomena involve complex interactions between multiple signal modalities. As a consequence, humans are used to integrating, at each instant, perceptions from all their senses in order to enrich their understanding of the surrounding world. This paradigm can also be extremely useful in many signal processing and computer vision problems involving mutually related signals. The simultaneous processing of multimodal data can, in fact, reveal information that is otherwise hidden when considering the signals independently. However, in natural multimodal signals, the statistical dependencies between modalities are in general not obvious. Learning fundamental multimodal patterns could offer deep insight into the structure of such signals. In this paper, we present a novel model of multimodal signals based on their sparse decomposition over a dictionary of multimodal structures. An algorithm for iteratively learning multimodal generating functions that can be shifted to all positions in the signal is also proposed. The learning is defined in such a way that it can be accomplished by iteratively solving a generalized eigenvector problem, which makes the algorithm fast, flexible, and free of user-defined parameters. The proposed algorithm is applied to audiovisual sequences and is able to discover underlying structures in the data. The detection of such audio-video patterns in audiovisual clips makes it possible to effectively localize the sound source in the video in the presence of substantial acoustic and visual distractors, outperforming state-of-the-art audiovisual localization algorithms.

  5. OSD CALS Architecture Master Plan Study. Data Dictionary. Concept Paper. Draft Version 1.2. Volume 29

    Science.gov (United States)

    1989-10-01

    Rapid advances in information technology are changing the way technical data is created, stored, and used. These advances have created opportunities to reduce costs and improve productivity in both the administration of data and in the acquisition an...

  6. Kirkeby's English–Swahili Dictionary

    African Journals Online (AJOL)

    largest Swahili dictionary is the Swahili–French dictionary of Sacleux (1939) with 1 112 pages. Kirkeby ... An entry in this dictionary could be a basic form, a derived or inflectional form of the ...... Cf. cook, boil, fry, roast, bake, etc. (cookery); ugali ...

  7. Improving Dictionary Skills in Ndebele

    African Journals Online (AJOL)

    Abstract: This article proposes ways of improving dictionary skills amongst the Ndebele. One way of accomplishing this is incorporating the teaching of dictionary skills into teacher training syllabi. Teachers can impart their knowledge to students and a dictionary culture can develop for enhancing effective use of current ...

  8. Adaptive Greedy Dictionary Selection for Web Media Summarization.

    Science.gov (United States)

    Cong, Yang; Liu, Ji; Sun, Gan; You, Quanzeng; Li, Yuncheng; Luo, Jiebo

    2017-01-01

    Initializing an effective dictionary is an indispensable step for sparse representation. In this paper, we focus on the dictionary selection problem, with the objective of selecting a compact subset of basis elements from the original training data instead of learning a new dictionary matrix as dictionary learning models do. We first design a new dictionary selection model via the $\ell_{2,0}$ norm. For model optimization, we propose two methods: one is the standard forward-backward greedy algorithm, which is not suitable for large-scale problems; the other is based on the gradient cues at each forward iteration and speeds up the process dramatically. In comparison with the state-of-the-art dictionary selection models, our model is not only more effective and efficient, but can also control the sparsity. To evaluate the performance of our new model, we select two practical web media summarization problems: 1) we build a new data set consisting of around 500 users, 3000 albums, and 1 million images, and achieve effective assisted albuming based on our model and 2) by formulating the video summarization problem as a dictionary selection issue, we employ our model to extract keyframes from a video sequence in a more flexible way. Generally, our model outperforms the state-of-the-art methods in both tasks.

  9. Compilation of Cooperative Data Element Dictionary of Five Federal Agencies’ Systems for Processing of Technical Report Literature

    Science.gov (United States)

    1983-03-01

    investigator participated with Library of Congress (LC) Staff and Government Printing Office (GPO) catalogers in identifying which data elements...National Standards Institute Standard format, ANSI Z.39-2). This standard includes identification of specific data elements by means of 3-digit tags...the closer they can come to efficient exchange. But tags assigned to the core data elements differ: they are all 3-digit tags, but the same 3 digits

  10. Dictionary of microelectronics and microcomputer technology

    International Nuclear Information System (INIS)

    Attiyate, Y.H.; Shah, R.R.

    1984-01-01

    This bilingual dictionary (German-English and English-German) is to give the general public a clearer idea of the terminology of microelectronics, microcomputers, data processing, and computer science. Each part contains about 7500 terms frequently encountered in practice, about 2000 of which are supplemented by precise explanations. (orig./HP)

  11. The "New Oxford English Dictionary" Project.

    Science.gov (United States)

    Fawcett, Heather

    1993-01-01

    Describes the conversion of the 22,000-page Oxford English Dictionary to an electronic version incorporating a modified Standard Generalized Markup Language (SGML) syntax. Explains that the database designers chose structured markup because it supports users' data searching needs, allows textual components to be extracted or modified, and allows…

  12. The Role of Dictionaries in Language Learning.

    Science.gov (United States)

    White, Philip A.

    1997-01-01

    Examines assumptions about dictionaries, especially the bilingual dictionary, and suggests ways of integrating the monolingual dictionary into the second-language instructional process. Findings indicate that the monolingual dictionary can coexist with bilingual dictionaries within a foreign-language course if the latter are appropriately used as…

  13. Dictionaries of Canadian English | Considine | Lexikos

    African Journals Online (AJOL)

    ... its best, reached a high degree of sophistication, there are still major opportunities waiting to be taken. keywords: dictionary, lexicography, canadian english, canadianisms, national dictionaries, canadian french, canadian first nations languages, bilingual dictionaries, regional dictionaries, unfinished dictionary projects ...

  14. An Online Spanish Learners’ Dictionary: The DAELE Project

    Directory of Open Access Journals (Sweden)

    Blanca Arias-Badia

    2014-12-01

    Full Text Available Current online dictionaries for learners of Spanish as a second language largely just reproduce their printed counterparts. This report summarizes the advances made in the DAELE, a prototype dictionary for learners of Spanish as a second language that has been designed solely for online publication. The purpose and main features of the macro- and microstructure of the dictionary are briefly described, as well as the methodology whereby the data are collected and the first steps taken to include the most significant collocations by applying a method based on collocational networks.

  15. Fast Dictionary-Based Reconstruction for Diffusion Spectrum Imaging

    Science.gov (United States)

    Bilgic, Berkin; Chatnuntawech, Itthi; Setsompop, Kawin; Cauley, Stephen F.; Yendiki, Anastasia; Wald, Lawrence L.; Adalsteinsson, Elfar

    2015-01-01

    Diffusion Spectrum Imaging (DSI) reveals detailed local diffusion properties at the expense of substantially long imaging times. It is possible to accelerate acquisition by undersampling in q-space, followed by image reconstruction that exploits prior knowledge on the diffusion probability density functions (pdfs). Previously proposed methods impose this prior in the form of sparsity under wavelet and total variation (TV) transforms, or under adaptive dictionaries that are trained on example datasets to maximize the sparsity of the representation. These compressed sensing (CS) methods require full-brain processing times on the order of hours using Matlab running on a workstation. This work presents two dictionary-based reconstruction techniques that use analytical solutions, and are two orders of magnitude faster than the previously proposed dictionary-based CS approach. The first method generates a dictionary from the training data using Principal Component Analysis (PCA), and performs the reconstruction in the PCA space. The second proposed method applies reconstruction using the pseudoinverse with Tikhonov regularization with respect to a dictionary. This dictionary can either be obtained using the K-SVD algorithm, or it can simply be the training dataset of pdfs without any training. All of the proposed methods achieve reconstruction times on the order of seconds per imaging slice, and have reconstruction quality comparable to that of the dictionary-based CS algorithm. PMID:23846466
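
    The second method in the abstract above, a Tikhonov-regularized pseudoinverse with respect to a dictionary, has a closed-form solution, which may explain the reported speed. The sketch below is a toy illustration under strong assumptions: undersampling is modelled as keeping a subset of rows of the dictionary (the actual q-space sampling operator is not reproduced), and the dictionary, sample indices, and the names `tikhonov_reconstruct` and `lam` are hypothetical.

```python
# Toy sketch of Tikhonov-regularized reconstruction over a dictionary.
# Assumption: measurements are a subset of the rows of the full signal,
# which simplifies the sampling operator used in the actual DSI setting.

def transpose(m):
    return [list(col) for col in zip(*m)]

def matvec(m, v):
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def matmul(a, b):
    bt = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in bt] for row in a]

def solve(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    aug = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]

def tikhonov_reconstruct(dictionary, sampled_rows, y, lam):
    """Estimate coefficients c = (A^T A + lam I)^{-1} A^T y, where A holds
    the sampled rows of the dictionary, then return the full signal D c."""
    a = [dictionary[i] for i in sampled_rows]
    at = transpose(a)
    gram = matmul(at, a)
    for i in range(len(gram)):
        gram[i][i] += lam  # Tikhonov (ridge) term
    coeffs = solve(gram, matvec(at, y))
    return matvec(dictionary, coeffs)

# Hypothetical dictionary with 2 atoms (columns) and 4 signal samples (rows).
dictionary = [[1.0, 0.0], [1.0, 1.0], [0.0, 1.0], [1.0, -1.0]]
full_signal = matvec(dictionary, [2.0, 1.0])   # [2.0, 3.0, 1.0, 1.0]
sampled_rows = [0, 2, 3]                       # sample index 1 is not measured
y = [full_signal[i] for i in sampled_rows]

estimate = tikhonov_reconstruct(dictionary, sampled_rows, y, lam=1e-9)
print(estimate)
```

    Because the solution is a single linear solve, the regularized inverse could in principle be precomputed once per sampling pattern and reused for every voxel. Larger values of `lam` trade fidelity to the measurements for stability against noise.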

  16. Sparse dictionary for synthetic transmit aperture medical ultrasound imaging.

    Science.gov (United States)

    Wang, Ping; Jiang, Jin-Yang; Li, Na; Luo, Han-Wu; Li, Fang; Cui, Shi-Gang

    2017-07-01

    It is possible to recover a signal below the Nyquist sampling limit using a compressive sensing technique in ultrasound imaging. However, the reconstruction enabled by common sparse transform approaches does not achieve satisfactory results. Considering the ultrasound echo signal's features of attenuation, repetition, and superposition, a sparse dictionary with the emission pulse signal is proposed. Sparse coefficients in the proposed dictionary have high sparsity. Images reconstructed with this dictionary were compared with those obtained with the three other common transforms, namely, discrete Fourier transform, discrete cosine transform, and discrete wavelet transform. The performance of the proposed dictionary was analyzed via a simulation and experimental data. The mean absolute error (MAE) was used to quantify the quality of the reconstructions. Experimental results indicate that the MAE associated with the proposed dictionary was always the smallest, the reconstruction time required was the shortest, and the lateral resolution and contrast of the reconstructed images were also the closest to the original images. The proposed sparse dictionary performed better than the other three sparse transforms. With the same sampling rate, the proposed dictionary achieved excellent reconstruction quality.
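
    The mean absolute error used above to compare reconstructions can be computed as follows. This is a minimal sketch assuming images are stored as equally sized nested lists of pixel values; the function name and the sample values are illustrative and do not come from the paper.

```python
# Minimal sketch of the mean absolute error (MAE) between two images.
# Assumption: both images are equally sized lists of rows of numbers.

def mean_absolute_error(reference, reconstruction):
    """Average of |reference - reconstruction| over all pixels."""
    if len(reference) != len(reconstruction):
        raise ValueError("images must have the same number of rows")
    total = 0.0
    count = 0
    for ref_row, rec_row in zip(reference, reconstruction):
        if len(ref_row) != len(rec_row):
            raise ValueError("rows must have the same length")
        for ref_px, rec_px in zip(ref_row, rec_row):
            total += abs(ref_px - rec_px)
            count += 1
    return total / count

# Illustrative 2x2 images (made-up values).
reference = [[0.0, 1.0], [2.0, 3.0]]
reconstruction = [[0.0, 1.5], [2.0, 2.0]]
mae = mean_absolute_error(reference, reconstruction)
print(mae)
```

    A lower MAE means the reconstruction is closer to the reference on average; it does not by itself capture resolution or contrast, which the abstract reports separately.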

  17. Fast dictionary-based reconstruction for diffusion spectrum imaging.

    Science.gov (United States)

    Bilgic, Berkin; Chatnuntawech, Itthi; Setsompop, Kawin; Cauley, Stephen F; Yendiki, Anastasia; Wald, Lawrence L; Adalsteinsson, Elfar

    2013-11-01

    Diffusion spectrum imaging reveals detailed local diffusion properties at the expense of substantially long imaging times. It is possible to accelerate acquisition by undersampling in q-space, followed by image reconstruction that exploits prior knowledge on the diffusion probability density functions (pdfs). Previously proposed methods impose this prior in the form of sparsity under wavelet and total variation transforms, or under adaptive dictionaries that are trained on example datasets to maximize the sparsity of the representation. These compressed sensing (CS) methods require full-brain processing times on the order of hours using MATLAB running on a workstation. This work presents two dictionary-based reconstruction techniques that use analytical solutions, and are two orders of magnitude faster than the previously proposed dictionary-based CS approach. The first method generates a dictionary from the training data using principal component analysis (PCA), and performs the reconstruction in the PCA space. The second proposed method applies reconstruction using the pseudoinverse with Tikhonov regularization with respect to a dictionary. This dictionary can either be obtained using the K-SVD algorithm, or it can simply be the training dataset of pdfs without any training. All of the proposed methods achieve reconstruction times on the order of seconds per imaging slice, and have reconstruction quality comparable to that of the dictionary-based CS algorithm.

  18. Oxford dictionary of Physics

    Science.gov (United States)

    Isaacs, Alan

    The dictionary is derived from the Concise Science Dictionary, first published by Oxford University Press in 1984 (third edition, 1996). It consists of all the entries relating to physics in that dictionary, together with some of those entries relating to astronomy that are required for an understanding of astrophysics and many entries that relate to physical chemistry. It also contains a selection of the words used in mathematics that are relevant to physics, as well as the key words in metal science, computing, and electronics. For this third edition a number of words from quantum field physics and statistical mechanics have been added. Cosmology and particle physics have been updated and a number of general entries have been expanded.

  19. Clinical dictionary of MRT

    International Nuclear Information System (INIS)

    Herborn, C.U.; Zink, C.

    2007-01-01

    MRT is the method of choice in many clinical problems. It is also one of the most complex imaging procedures, requiring much technical and physical knowledge and also detailed knowledge of the signal response of physiological and pathological tissues in the many variants of contrast enhancement, image reconstruction and image processing. In 2006, the pocket dictionary of MRT was published by the same publisher. It met with much positive response, which induced the publication of this new dictionary. With more than 3000 terms and definitions elaborated in cooperation with an excellent practitioner, the clinical applications of MRT are illustrated by typical findings and examples. (orig.)

  20. Radiological sciences dictionary

    CERN Document Server

    Dowsett, David

    2009-01-01

    The Radiological Sciences Dictionary is a rapid reference guide for all hospital staff employed in diagnostic imaging, providing definitions of over 3000 keywords as applied to the technology of diagnostic radiology. Written in a concise and easy to digest form, the dictionary covers a wide variety of subject matter, including: · radiation legislation and measurement · computing and digital imaging terminology · nuclear medicine radionuclides and radiopharmaceuticals · radiographic contrast agents (x-ray, MRI and ultrasound) · definitions used in ultrasound and MRI technology · statistical exp

  1. A dictionary of astronomy

    CERN Document Server

    2003-01-01

    This revised edition contains 4,000 up-to-date entries written by an expert team of contributors, under the editorship of Ian Ridpath, renowned author and broadcaster. Covering the most recent space exploration missions and latest technological development, this authoritative dictionary covers everything from astrophysics to galaxies and time. World-wide coverage of observatories and telescopes, and major entries on supernova, Big Bang theory, and stellar evolution, make this an invaluable reference source for students, professionals, and amateur astronomers. Appendices include tables of Apollo lunar landing missions and the constellations. The entries are supported by numerous tables and diagrams, and the dictionary also features biographical entries on eminent astronomers.

  2. Monolingual accounting dictionaries for EFL text production

    Directory of Open Access Journals (Sweden)

    Sandro Nielsen

    2006-10-01

    Full Text Available Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items that deal with these aspects are necessary for the international user group as they produce subject-field specific and register-specific texts in a foreign language, and the data items are relevant for the various stages in text production: draft writing, copyediting, stylistic editing and proofreading.

  3. The Making of the "Oxford English Dictionary."

    Science.gov (United States)

    Winchester, Simon

    2003-01-01

    Summarizes remarks made to open the Gallaudet University conference on Dictionaries and the Standardization of languages. It concerns the making of what is arguably the world's greatest dictionary, "The Oxford English Dictionary." (VWL)

  4. Example sentences in bilingual specialised dictionaries assisting ...

    African Journals Online (AJOL)

    Keywords: Specialised lexicography, online dictionaries, printed dictionaries, technical dictionaries, specialised communication, examples, lexicographical functions, text production, user needs, writing, translation. Voorbeeldsinne in tweetalige vakwoordeboeke help met kommunikasie in 'n vreemde taal. Praktisyns ...

  5. Dr.Johnson's Dictionary in Miniature

    OpenAIRE

    Imazato, Chiaki

    1988-01-01

    More than a hundred 'Johnson's' dictionaries have so far been published not only in English but in other countries, and there are numerous books and articles on Johnson's Dictionary. But few have referred to Johnson's Dictionary in Miniature; nor were there any books or articles on it. Fortunately, however, I've got one copy of Johnson's Dictionary in Miniature, which was published in 1806. Johnson's Dictionary (1755) has 41,677 entries, whereas Johnson's Dictionary in Miniature 23,439 entr...

  6. Supervised dictionary learning for inferring concurrent brain networks.

    Science.gov (United States)

    Zhao, Shijie; Han, Junwei; Lv, Jinglei; Jiang, Xi; Hu, Xintao; Zhao, Yu; Ge, Bao; Guo, Lei; Liu, Tianming

    2015-10-01

    Task-based fMRI (tfMRI) has been widely used to explore functional brain networks via a predefined stimulus paradigm in the fMRI scan. Traditionally, the general linear model (GLM) has been a dominant approach to detect task-evoked networks. However, GLM focuses on task-evoked or event-evoked brain responses and possibly ignores the intrinsic brain functions. In comparison, dictionary learning and sparse coding methods have attracted much attention recently, and these methods have shown the promise of automatically and systematically decomposing fMRI signals into meaningful task-evoked and intrinsic concurrent networks. Nevertheless, two notable limitations of current data-driven dictionary learning methods are that the prior knowledge of the task paradigm is not sufficiently utilized and that the establishment of correspondences among dictionary atoms in different brains has been challenging. In this paper, we propose a novel supervised dictionary learning and sparse coding method for inferring functional networks from tfMRI data, which combines the advantages of model-driven and data-driven methods. The basic idea is to fix the task stimulus curves as predefined model-driven dictionary atoms and only optimize the other portion of data-driven dictionary atoms. Application of this novel methodology on the publicly available human connectome project (HCP) tfMRI datasets has achieved promising results.

  7. On A Nonlinear Generalization of Sparse Coding and Dictionary Learning.

    Science.gov (United States)

    Xie, Yuchen; Ho, Jeffrey; Vemuri, Baba

    2013-01-01

    Existing dictionary learning algorithms are based on the assumption that the data are vectors in a Euclidean vector space $\mathbb{R}^d$, and the dictionary is learned from the training data using the vector space structure of $\mathbb{R}^d$ and its Euclidean $L^2$-metric. However, in many applications, features and data often originate from a Riemannian manifold that does not support a global linear (vector space) structure. Furthermore, the extrinsic viewpoint of existing dictionary learning algorithms becomes inappropriate for modeling and incorporating the intrinsic geometry of the manifold that is potentially important and critical to the application. This paper proposes a novel framework for sparse coding and dictionary learning for data on a Riemannian manifold, and it shows that the existing sparse coding and dictionary learning methods can be considered as special (Euclidean) cases of the more general framework proposed here. We show that both the dictionary and sparse coding can be effectively computed for several important classes of Riemannian manifolds, and we validate the proposed method using two well-known classification problems in computer vision and medical imaging analysis.

  8. Compiling a corpus-based dictionary grammar: an example for ...

    African Journals Online (AJOL)

    In this article it is shown how a corpus-based dictionary grammar may be compiled, that is, a mini-grammar fully based on corpus data and specifically written for use in and integrated with a dictionary. Such an effort is, to the best of our knowledge, a world's first. We exemplify our approach for a Northern Sotho ...

  9. Translation Dictionaries and Bilingual Dictionaries. Two Different Concepts

    DEFF Research Database (Denmark)

    Tarp, Sven

    2002-01-01

    of dictionaries - and in some cases even not the best ones - to assist the translator who runs into problems in the translation process. In my paper, I will argue that monolingual dictionaries - together with bilingual dictionaries «the other way around», e.g. L2-L1 dictionaries when translating from L1 into L2...... in relation to translation and what types of problems pop up during the translation process in order to clarify up to which point lexicography can assist the translator. Finally, I will discuss in which types of dictionary (monolingual or bilingual) the assistance to the translator should be provided and......The starting point in any scientific process is always the formulation of the problem and then the search for a solution. In my opinion the question on the relation between lexicography and translation should be put in this way: How can dictionaries assist translators in finding solutions...

  10. L2 write assistants and context-aware dictionaries: New challenges to lexicography

    DEFF Research Database (Denmark)

    Tarp, Sven; Fisker, Kasper; Sepstrup, Peter

    2017-01-01

    Dictionaries are increasingly integrated into other tools designed to assist the reading, writing and translation of texts. Write Assistant is a newly developed tool aimed at assisting people writing in a second language. It feeds on big data taken in from corpora and digital dictionaries...... dictionaries need to be conceptionally adapted to the specific tool in order to optimize the service. All this poses new challenges to lexicography....

  11. Stochastic Learning of Multi-Instance Dictionary for Earth Mover's Distance based Histogram Comparison

    OpenAIRE

    Fan, Jihong; Liang, Ru-Ze

    2016-01-01

    A dictionary plays an important role in multi-instance data representation: it maps bags of instances to histograms. Earth mover's distance (EMD) is the most effective histogram distance metric for the application of multi-instance retrieval. However, up to now, there are no existing multi-instance dictionary learning methods designed for EMD-based histogram comparison. To fill this gap, we develop the first EMD-optimal dictionary learning method using a stochastic optimization method. In the stoc...
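
    The two building blocks named here, mapping a bag to a histogram and comparing histograms with EMD, can be illustrated with a toy Python sketch. It assumes scalar instances, a non-empty bag, and a one-dimensional ground distance |i − j| between bins, in which case EMD reduces to a running sum of differences. The function names are hypothetical and nothing here reproduces the paper's learning method.

```python
def bag_to_histogram(bag, codewords):
    """Map a non-empty bag of scalar instances to a normalized histogram over codewords."""
    counts = [0] * len(codewords)
    for inst in bag:
        nearest = min(range(len(codewords)), key=lambda j: abs(inst - codewords[j]))
        counts[nearest] += 1
    total = sum(counts)
    return [c / total for c in counts]


def emd_1d(h1, h2):
    """EMD between equal-mass histograms with ground distance |i - j| between bins."""
    work, carried = 0.0, 0.0
    for a, b in zip(h1, h2):
        carried += a - b      # mass that still has to move to later bins
        work += abs(carried)  # moving it one bin further costs |carried|
    return work
```

    For example, `emd_1d([1, 0, 0], [0, 0, 1])` moves one unit of mass across two bins. A general ground distance between codewords would require a transportation solver instead of this shortcut.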

  12. Using Dictionary Pair Learning for Seizure Detection.

    Science.gov (United States)

    Ma, Xin; Yu, Nana; Zhou, Weidong

    2018-02-13

    Automatic seizure detection is extremely important in the monitoring and diagnosis of epilepsy. The paper presents a novel method based on dictionary pair learning (DPL) for seizure detection in long-term intracranial electroencephalogram (EEG) recordings. First, wavelet filtering and differential filtering are applied to the EEG data, and a kernel function is applied to make the signal linearly separable. In DPL, the synthesis dictionary and analysis dictionary are learned jointly from the original training samples with an alternating minimization method, and sparse coefficients are obtained by linear projection instead of costly ℓ0-norm or ℓ1-norm optimization. Finally, the reconstructed residuals associated with the seizure and nonseizure sub-dictionary pairs are calculated as the decision values, and postprocessing is performed to improve the recognition rate and reduce the false detection rate of the system. A total of 530 h from 20 patients with 81 seizures were used to evaluate the system. Our proposed method has achieved an average segment-based sensitivity of 93.39%, specificity of 98.51%, and event-based sensitivity of 96.36% with a false detection rate of 0.236/h.
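
    The decision rule described above (code by linear projection, then compare class-wise reconstruction residuals) can be sketched as follows. This is a toy illustration only: the dictionary pairs are hand-written 2-D matrices rather than learned ones, and the names `dpl_residual` and `classify` are hypothetical.

```python
import math


def matvec(M, v):
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def dpl_residual(x, D, P):
    """Reconstruction residual ||x - D (P x)||_2 for one class's dictionary pair."""
    code = matvec(P, x)      # analysis dictionary: coefficients by linear projection
    recon = matvec(D, code)  # synthesis dictionary: reconstruct from the coefficients
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, recon)))


def classify(x, pairs):
    """Return the label whose (D, P) pair reconstructs x with the smallest residual."""
    residuals = {label: dpl_residual(x, D, P) for label, (D, P) in pairs.items()}
    return min(residuals, key=residuals.get), residuals


# Hypothetical toy pairs: each class "owns" one coordinate axis.
pairs = {
    "seizure": ([[1.0], [0.0]], [[1.0, 0.0]]),
    "nonseizure": ([[0.0], [1.0]], [[0.0, 1.0]]),
}
label, residuals = classify([0.9, 0.1], pairs)
```

    Because coding is a single matrix-vector product per class, no iterative sparse solver is needed at test time, which is the cost advantage the abstract points to.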

  13. Nuclear medicine imaging. An encyclopedic dictionary

    International Nuclear Information System (INIS)

    Thie, Joseph A.

    2012-01-01

    The rapidly growing and somewhat complex area of nuclear medicine imaging receives only limited attention in broad-based medical dictionaries. This encyclopedic dictionary is intended to fill the gap. More than 400 entries of between one and three paragraphs are included, defining and carefully explaining terms in an appropriate degree of detail. The dictionary encompasses concepts used in planar, SPECT, and PET imaging protocols and covers both scanner operations and popular data analysis approaches. In spite of the mathematical complexities in the acquisition and analysis of images, the explanations given are kept simple and easy to understand; in addition, many helpful concrete examples are provided. Nuclear Medicine Imaging: An Encyclopedic Dictionary will be ideal for those who wish to obtain a rapid grasp of a concept beyond a definition of a few words but do not want to resort to a time-consuming search of the reference literature. The almost tutorial-like style accommodates the needs of students, nuclear medicine technologists, and varieties of other medical professionals who interface with specialists within nuclear medicine.

  14. Ecological Concerns Data Dictionary - Ecological Concerns data dictionary

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — Evaluating the status of threatened and endangered salmonid populations requires information on the current status of the threats (e.g., habitat, hatcheries,...

  15. Dictionary Based Image Segmentation

    DEFF Research Database (Denmark)

    Dahl, Anders Bjorholm; Dahl, Vedrana Andersen

    2015-01-01

    We propose a method for weakly supervised segmentation of natural images, which may contain both textured and non-textured regions. Our texture representation is based on a dictionary of image patches. To divide an image into separated regions with similar texture we use an implicit level sets...

  16. Dictionary of machine terms

    International Nuclear Information System (INIS)

    1990-06-01

    This book contains an introduction to the dictionary of machine terms, a list of the compilation committee, and introductory remarks. It describes the machine terms in alphabetical order from A to Z and also includes abbreviations of machine terms, a symbol table, a guide to reading mathematical symbols, and abbreviations and terms used in drawings.

  17. MANDARIN CHINESE DICTIONARY.

    Science.gov (United States)

    WANG, FRED FANGYU

    IN RESPONSE TO THE NEEDS OF THE GROWING NUMBER OF AMERICAN HIGH SCHOOL AND COLLEGE STUDENTS LEARNING CHINESE, SETON HALL UNIVERSITY UNDERTOOK A CONTRACT WITH THE U.S. OFFICE OF EDUCATION TO COMPILE A BILINGUAL POCKET-SIZE DICTIONARY FOR BEGINNING STUDENTS OF SPOKEN MANDARIN CHINESE. THE PRESENT WORK IS THE CHINESE TO ENGLISH SECTION IN PRELIMINARY…

  18. Hashing, Randomness and Dictionaries

    DEFF Research Database (Denmark)

    Pagh, Rasmus

    to the similarity to a bookshelf dictionary, which contains a set of words and has an explanation associated with each word. In the static version of the problem the set is fixed, whereas in the dynamic version, insertions and deletions of elements are possible. The approach taken is that of the theoretical...
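
    As a concrete picture of the dynamic version of the problem (lookups plus insertions and deletions), here is a minimal Python sketch of a dictionary built on hashing with chaining. It is a textbook illustration and is not claimed to be one of the constructions studied in the thesis; the class name and the bucket count are hypothetical.

```python
class ChainedDictionary:
    """Dynamic dictionary: a set of keys, each with associated information."""

    def __init__(self, n_buckets=8):
        self._buckets = [[] for _ in range(n_buckets)]

    def _bucket(self, key):
        # The hash function decides which chain a key lives in.
        return self._buckets[hash(key) % len(self._buckets)]

    def insert(self, key, value):
        bucket = self._bucket(key)
        for i, (k, _) in enumerate(bucket):
            if k == key:
                bucket[i] = (key, value)  # overwrite an existing entry
                return
        bucket.append((key, value))

    def lookup(self, key):
        for k, v in self._bucket(key):
            if k == key:
                return v
        return None

    def delete(self, key):
        bucket = self._bucket(key)
        for i, (k, _) in enumerate(bucket):
            if k == key:
                del bucket[i]
                return True
        return False
```

    In the static version only `lookup` would be needed after construction, which is what allows more specialized layouts.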

  19. Dictionary of Cotton

    Science.gov (United States)

    The Dictionary of Cotton has over 2,000 terms and definitions that were compiled by 33 researchers. It reflects the ongoing commitment of the International Cotton Advisory Committee, through its Technical Information Section, to the spread of knowledge about cotton to all those who have an interest ...

  20. Accessing the ANW dictionary

    NARCIS (Netherlands)

    Moerdijk, F.; Tiberius, C.; Niestadt, J.; Zock, M.; Huang, C.-R.

    2008-01-01

    This paper describes the functional design of an interface for an online scholarly dictionary of contemporary standard Dutch, the ANW. One of the main innovations of the ANW is a twofold meaning description: definitions are accompanied by ‘semagrams’. In this paper we focus on the strategies that

  1. Dictionary of Black Culture.

    Science.gov (United States)

    Baskin, Wade; Runes, Richard N.

    This dictionary is an encyclopedic survey of the cultural background and development of the black American, covering the basic issues, events, contributions and biographies germane to the subject. The author-compiler is Chairman of Classical Languages Department at Southeastern State College, Durant, Oklahoma. Richard Runes is practicing law as a…

  2. Historical dictionary of librarianship

    CERN Document Server

    Quinn, Mary Ellen

    2014-01-01

    The Historical Dictionary of Librarianship focuses on librarianship as a modern, organized profession, emphasizing the period beginning in the mid-nineteenth century. Author Mary Ellen Quinn relates the history of this profession through a chronology, an introductory essay, appendixes, and an extensive bibliography.

  3. Dictionary of Telecommunications.

    Science.gov (United States)

    Bones, R. A.

    A wide range of terms used in the telecommunications industry are defined in this dictionary. Many of the terms and definitions are either reproduced from, or based on, the "Glossary of Terms Used in Telecommunications (including Radio) and Electronics" prepared by the British Standards Institute. The principal entry for each term is found under…

  4. Topological structure of dictionary graphs

    International Nuclear Information System (INIS)

    Fuks, Henryk; Krzeminski, Mark

    2009-01-01

    We investigate the topological structure of the subgraphs of dictionary graphs constructed from WordNet and Moby thesaurus data. In the process of learning a foreign language, the learner knows only a subset of all words of the language, corresponding to a subgraph of a dictionary graph. When this subgraph grows with time, its topological properties change. We introduce the notion of the pseudocore and argue that the growth of the vocabulary roughly follows decreasing pseudocore numbers; that is, one first learns words with a high pseudocore number followed by smaller pseudocores. We also propose an alternative strategy for vocabulary growth, involving decreasing core numbers as opposed to pseudocore numbers. We find that as the core or pseudocore grows in size, the clustering coefficient first decreases, then reaches a minimum and starts increasing again. The minimum occurs when the vocabulary reaches a size between 10^3 and 10^4. A simple model exhibiting similar behavior is proposed. The model is based on a generalized geometric random graph. Possible implications for language learning are discussed.
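
    The two graph quantities used in this abstract, core numbers and the clustering coefficient, can be computed on a small undirected graph with the standard peeling procedure sketched below. The pseudocore is the paper's own notion and is not reproduced here; the function names and the toy graph are hypothetical.

```python
def core_numbers(adj):
    """Core number of every vertex, by repeatedly removing a minimum-degree vertex."""
    degree = {v: len(ns) for v, ns in adj.items()}
    remaining = set(adj)
    core, k = {}, 0
    while remaining:
        v = min(remaining, key=degree.get)
        k = max(k, degree[v])  # the core level never decreases while peeling
        core[v] = k
        remaining.remove(v)
        for u in adj[v]:
            if u in remaining:
                degree[u] -= 1
    return core


def average_clustering(adj):
    """Mean local clustering coefficient; vertices of degree < 2 count as 0."""
    total = 0.0
    for v, ns in adj.items():
        d = len(ns)
        if d < 2:
            continue
        links = sum(1 for a in ns for b in ns if a < b and b in adj[a])
        total += 2 * links / (d * (d - 1))
    return total / len(adj)


# Toy graph: a triangle a-b-c with a pendant vertex d attached to c.
adj = {"a": {"b", "c"}, "b": {"a", "c"}, "c": {"a", "b", "d"}, "d": {"c"}}
cores = core_numbers(adj)
```

    Sorting words by decreasing core number and adding them in that order gives a growing subgraph on which the clustering coefficient can be tracked, which is the kind of curve the abstract describes.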

  5. Legal terms in general dictionaries of English: The civil procedure mystery

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2015-01-01

    examines four general dictionaries of English to see how they treat civil procedure terms used in England and Wales in the light of the change of structure of and terminology used in civil proceedings that took place in 1999. Despite being based on large, up-to-date corpora the dictionaries contain some......Many general language dictionaries contain specialized terms, including legal terms relating to civil lawsuits. The existing literature provides general discussions of scientific and technical terms in ordinary dictionaries but does not specifically address the inclusion of legal terms. This study...... of the old terms but fail to include the new terms that have been in use for more than 15 years. Why this is the case is a mystery. However, some clues indicate that if they pay more attention to the link between dictionary functions, corpora and the data presented in dictionaries, lexicographers may be able...

  6. A novel structured dictionary for fast processing of 3D medical images, with application to computed tomography restoration and denoising

    Science.gov (United States)

    Karimi, Davood; Ward, Rabab K.

    2016-03-01

    Sparse representation of signals in learned overcomplete dictionaries has proven to be a powerful tool with applications in denoising, restoration, compression, reconstruction, and more. Recent research has shown that learned overcomplete dictionaries can lead to better results than analytical dictionaries such as wavelets in almost all image processing applications. However, a major disadvantage of these dictionaries is that their learning and usage is very computationally intensive. In particular, finding the sparse representation of a signal in these dictionaries requires solving an optimization problem that leads to very long computational times, especially in 3D image processing. Moreover, the sparse representation found by greedy algorithms is usually sub-optimal. In this paper, we propose a novel two-level dictionary structure that improves the performance and the speed of standard greedy sparse coding methods. The first (i.e., the top) level in our dictionary is a fixed orthonormal basis, whereas the second level includes the atoms that are learned from the training data. We explain how such a dictionary can be learned from the training data and how the sparse representation of a new signal in this dictionary can be computed. As an application, we use the proposed dictionary structure for removing the noise and artifacts in 3D computed tomography (CT) images. Our experiments with real CT images show that the proposed method achieves results that are comparable with standard dictionary-based methods while substantially reducing the computational time.

  7. Making a dictionary without words

    DEFF Research Database (Denmark)

    Kristoffersen, Jette Hedegaard; Troelsgård, Thomas

    2010-01-01

    This paper addresses some of the particular problems connected with lemma representation and lemmatization in a sign language dictionary. The paper is mainly based on the authors' work experience from the Danish Sign Language Dictionary project. In a sign language dictionary sign representation...... constitutes a problem, as there is - at least for Danish Sign Language - no conventional notation used by native signers and the various other sign user groups. We look into the different possibilities of representing signs and present the solution that we chose for the Danish Sign Language Dictionary....... Defining the criteria for lemmatization is another area where sign language dictionaries differ from written language dictionaries. The criteria should obviously include the manual expression of the signs, but a sign's manual expression has features from several categories (e.g. handshape, place...

  8. Changes in Dictionary Subject Matter

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2003-01-01

    The general content of the three editions of the Duden dictionary has undergone few changes. The most substantial changes are the addition of syllabification and the deletion of antonymy in respect of lemmata in the second and third editions. The concept of dictionary subject matter is questioned......, and it is argued that it is more appropriate to consider how the relationships between the classes of items interact with the function of the dictionary....

  9. The Compilation of Electronic Dictionaries for the African Languages

    Directory of Open Access Journals (Sweden)

    D.J. Prinsloo

    2011-10-01

    Abstract: Lexicographers increasingly acknowledge the enormous potential of electronic dictionaries. The great capacity and speed characteristic of electronic products, combined with enhanced query and data retrieval technology, pave the way to a new generation of dictionaries unimagined in the paper-dictionary era. It is amazing to see how many of the lexicographer's greatest obstacles disappear in the electronic dictionary. This article will, firstly, attempt to give a perspective on typical features of electronic dictionaries. Secondly, electronic-dictionary entries will be designed as a solution to some of the most burning lemmatization problems encountered by lexicographers for African languages in paper dictionaries.

    Keywords: ELECTRONIC DICTIONARY, LEXICOGRAPHY, DATA RETRIEVAL, LEMMATIZATION, CD-ROM, ACCESS ROUTE, POP-UP FUNCTIONALITIES, POP-UP SCREENS, EDUTAINMENT, CROSS-REFERENCING, INFORMATION RETRIEVAL, ENCODING, DECODING, AFRICAN LANGUAGES, NAVIGATION BAR.

    Opsomming: Die samestelling van elektroniese woordeboeke vir die Afrikatale. Leksikograwe erken in toenemende mate die enorme moontlikhede van elektroniese woordeboeke. Die groot vermoë en spoed wat kenmerkend is van elektroniese produkte, tesame met die bykomende tegnologie van soektogte en dataopsporing, berei die weg voor na 'n nuwe generasie woordeboeke wat ondenkbaar was in die era van gedrukte woordeboeke. Dit is verbasend om te sien hoeveel van die leksikograaf se grootste struikelblokke verdwyn in die elektroniese woordeboek. Hierdie artikel sal eerstens probeer om 'n oorsig te gee oor die kenmerkende eienskappe van elektroniese woordeboeke. Tweedens sal inskrywings vir die elektroniese woordeboek ontwerp word as oplossing vir sommige van die dringendste lemmatiseringsprobleme wat leksikograwe van Afrikatale by gedrukte woordeboeke teëkom.

    Sleutelwoorde: ELEKTRONIESE WOORDEBOEK, LEKSIKOGRAFIE, DATAOPSPORING, LEMMATISERING, CD-ROM, TOEGANGSROETE

  10. The INL Dictionary Writing System

    Directory of Open Access Journals (Sweden)

    Carole Tiberius

    2014-12-01

    The INL-DWS is a Dictionary Writing System (DWS) for compiling monolingual and bilingual dictionaries. It has been developed at the Institute of Dutch Lexicology (INL) since 2007 and is now being used for the production of a monolingual dictionary at INL and a bilingual dictionary at the Fryske Akademy. This paper describes the functionalities of the system, on the one hand, from a lexicographical point of view, and on the other hand, from a more technical perspective. The paper concludes with a short evaluation of the advantages and disadvantages of in-house systems versus off-the-shelf systems.

  11. Spatial data content standards for Africa

    CSIR Research Space (South Africa)

    Cooper, Antony K

    2005-11-01

    ..., they selected 14 standards containing data dictionaries or feature catalogues, and compared their feature types. They have also provided some advice and recommendations on data content standards (particularly for data dictionaries and feature catalogues...

  12. Defining datasets and creating data dictionaries for quality improvement and research in chronic disease using routinely collected data: an ontology-driven approach

    Directory of Open Access Journals (Sweden)

    Simon de Lusignan

    2011-06-01

    Conclusion: Adopting an ontology-driven approach to case finding could improve the quality of disease registers and of research based on routine data. It would offer considerable advantages over using limited datasets to define cases. This approach should be considered by those involved in research and quality improvement projects which utilise routine data.

  13. Fiber optics standard dictionary

    CERN Document Server

    Weik, Martin H

    1997-01-01

    Fiber Optics Vocabulary Development. In 1979, the National Communications System published Technical Information Bulletin TB 79-1, Vocabulary for Fiber Optics and Lightwave Communications, written by this author. Based on a draft prepared by this author, the National Communications System published Federal Standard FED-STD-1037, Glossary of Telecommunications Terms, in 1980 with no fiber optics terms. In 1981, the first edition of this dictionary was published under the title Fiber Optics and Lightwave Communications Standard Dictionary. In 1982, the then National Bureau of Standards, now the National Institute of Standards and Technology, published NBS Handbook 140, Optical Waveguide Communications Glossary, which was also published by the General Services Administration as PB82-166257 under the same title. Also in 1982, Dynamic Systems, Inc., published the Fiberoptic Sensor Technology Handbook, co-authored and edited by this author, with an extensive Fiberoptic Sensors Glossary. In 1989, the handbook w...

  14. Energy dictionary. 1992 ed.

    International Nuclear Information System (INIS)

    1992-01-01

    This dictionary seeks to link the definitions of the various forms of energy in the different languages but makes no effort to be exhaustive, notably in the field of economics, where only concepts related to energy have been included. This edition contains nearly 2000 defined concepts and an index of several thousand keywords selected from the concept definitions. Whether used as a dictionary or a glossary, it is given in four languages: English, French, German and Spanish. This new edition is based on the one published in 1986, which has been considerably expanded by including observations and suggestions received since 1986. Two sections have been added: one dealing with "district heating" and one on "units". Internationally recognized, officially adopted and accepted definitions have been used in this edition. A short introduction to each section specifies the scope of its contents.

  15. Manifold optimization-based analysis dictionary learning with an ℓ1∕2-norm regularizer.

    Science.gov (United States)

    Li, Zhenni; Ding, Shuxue; Li, Yujie; Yang, Zuyuan; Xie, Shengli; Chen, Wuhui

    2018-02-01

    Recently there has been increasing attention towards analysis dictionary learning. In analysis dictionary learning, it is an open problem to obtain strongly sparsity-promoting solutions efficiently while simultaneously avoiding trivial solutions of the dictionary. In this paper, to obtain strongly sparsity-promoting solutions, we employ the ℓ1/2 norm as a regularizer. A very recent study on ℓ1/2 norm regularization theory in compressive sensing shows that its solutions can be sparser than those obtained with the ℓ1 norm. We transform a complex nonconvex optimization into a number of one-dimensional minimization problems, for which closed-form solutions can be obtained efficiently. To avoid trivial solutions, we apply manifold optimization to update the dictionary directly on the manifold satisfying the orthonormality constraint, so that the dictionary avoids trivial solutions while simultaneously capturing the intrinsic properties of the dictionary. Experiments with synthetic and real-world data verify that the proposed algorithm for analysis dictionary learning can not only obtain strongly sparsity-promoting solutions efficiently, but also learn a more accurate dictionary in terms of dictionary recovery and image processing than the state-of-the-art algorithms. Copyright © 2017 Elsevier Ltd. All rights reserved.

  16. Improving dictionary skills in Ndebele | Hadebe | Lexikos

    African Journals Online (AJOL)

    This article proposes ways of improving dictionary skills amongst the Ndebele. One way of accomplishing this is incorporating the teaching of dictionary skills into teacher training syllabi. Teachers can impart their knowledge to students and a dictionary culture can develop for enhancing effective use of current dictionaries ...

  17. Nuclear energy dictionary

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1978-03-15

    This book is a dictionary for nuclear energy which lists the technical terms in alphabetical order. It adds four appendixes. The first appendix is about people involved with nuclear energy. The second one is a bibliography and the third one is a checklist of German, English and Korean. The last one has an index. This book gives explanations on technical terms of nuclear energy such as nuclear reaction and atomic disintegration.

  18. Nuclear energy dictionary

    International Nuclear Information System (INIS)

    1978-03-01

    This book is a dictionary for nuclear energy which lists the technical terms in alphabetical order. It adds four appendixes. The first appendix is about people involved with nuclear energy. The second one is a bibliography and the third one is a checklist of German, English and Korean. The last one has an index. This book gives explanations on technical terms of nuclear energy such as nuclear reaction and atomic disintegration.

  19. Nuclear operations dictionary

    International Nuclear Information System (INIS)

    1990-01-01

    In less than fifty years, a sophisticated technical language has developed worldwide around broad-ranging operations in the field of nuclear technology. In South Africa the need to adapt this new technical language in an orderly and acceptable manner for common use was identified. The aim of this dictionary is to promote the use of correct nuclear terminology in both English and Afrikaans, and to aid in the translation of nuclear terms from English into Afrikaans

  20. Nuclear operations dictionary

    International Nuclear Information System (INIS)

    1990-01-01

    In less than fifty years, a sophisticated technical language has developed worldwide around broad-ranging operations in the field of nuclear technology. In South Africa the need to adapt this new technical language in an orderly and acceptable manner for common use was identified. The aim of this dictionary is to promote the use of correct nuclear terminology in both Afrikaans and English, and to aid in the translation of nuclear terms from Afrikaans into English

  1. Dictionary of Microscopy

    Science.gov (United States)

    Heath, Julian

    2005-10-01

    The past decade has seen huge advances in the application of microscopy in all areas of science. This welcome development in microscopy has been paralleled by an expansion of the vocabulary of technical terms used in microscopy: terms have been coined for new instruments and techniques and, as microscopes reach even higher resolution, the use of terms that relate to the optical and physical principles underpinning microscopy is now commonplace. The Dictionary of Microscopy was compiled to meet this challenge and provides concise definitions of over 2,500 terms used in the fields of light microscopy, electron microscopy, scanning probe microscopy, x-ray microscopy and related techniques. Written by Dr Julian P. Heath, Editor of Microscopy and Analysis, the dictionary is intended to provide easy navigation through the microscopy terminology and to be a first point of reference for definitions of new and established terms. The Dictionary of Microscopy is an essential, accessible resource for: students who are new to the field and are learning about microscopes; equipment purchasers who want an explanation of the terms used in manufacturers' literature; scientists who are considering using a new microscopical technique; experienced microscopists, as an aide mémoire or quick source of reference; and librarians, the press and marketing personnel who require definitions for technical reports.

  2. Sparse decompositions in 'incoherent' dictionaries

    DEFF Research Database (Denmark)

    Gribonval, R.; Nielsen, Morten

    2003-01-01

    a unique sparse representation in such a dictionary. In particular, it is proved that the result of Donoho and Huo, concerning the replacement of a combinatorial optimization problem with a linear programming problem when searching for sparse representations, has an analog for dictionaries that may...
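
    For readers unfamiliar with the setting, the two optimization problems being compared in this line of work are usually written as below. The symbols are assumptions (signal y, dictionary D with unit-norm atoms d_i, coefficient vector x, coherence μ), and the threshold shown is the commonly cited sufficient condition for general dictionaries rather than a quotation from the truncated abstract.

```latex
\begin{aligned}
(P_0):\quad & \min_{x}\ \|x\|_0 \quad \text{s.t.}\quad y = Dx
  && \text{(combinatorial)} \\
(P_1):\quad & \min_{x}\ \|x\|_1 \quad \text{s.t.}\quad y = Dx
  && \text{(a linear program)} \\
\mu(D) &= \max_{i \neq j}\ \bigl|\langle d_i, d_j \rangle\bigr|,
  \qquad
  \|x\|_0 < \tfrac{1}{2}\Bigl(1 + \frac{1}{\mu(D)}\Bigr)
  \ \Longrightarrow\ x \text{ is the unique solution of both } (P_0) \text{ and } (P_1)
\end{aligned}
```

    The smaller the coherence μ(D), that is, the more "incoherent" the dictionary, the more nonzero coefficients such a guarantee can tolerate.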

  3. Trying Out a New Dictionary.

    Science.gov (United States)

    Benson, Morton; Benson, Evelyn

    1988-01-01

    Describes the BBI Combinatory Dictionary of English and demonstrates its usefulness for advanced learners of English by administering a monolingual completion test, first without a dictionary and then with the BBI, to Hungarian and Russian English teachers. Both groups' scores improved dramatically on the posttest. (LMO)

  4. Notes on Compiling a Corpus-Based Dictionary

    Directory of Open Access Journals (Sweden)

    František Čermák

    2011-10-01

    ABSTRACT: On the basis of sample analysis of a Czech adjective, a definition based on the data drawn from the Czech National Corpus (cf. Čermák and Schmiedtová 2003) is gradually compiled and finally offered, pointing at the drawbacks of definitions found in traditional dictionaries. Steps undertaken here are then generalized and used, in an ordered sequence (similar to a work-flow ordering), as topics, briefly discussed in the second part to which lexicographers of monolingual dictionaries should pay attention. These are supplemented by additional remarks and caveats useful in the compilation of a dictionary. Thus, a brief survey of some of the major steps of dictionary compilation is presented here, supplemented by the original Czech data, analyzed in their raw, though semiotically classified form.

    OPSOMMING: Aantekeninge oor die samestelling van 'n korpusgebaseerde woordeboek. Op grond van 'n steekproefontleding van 'n Tsjeggiese adjektief, word 'n definisie gebaseer op data ontleen aan die Tsjeggiese Nasionale Korpus (cf. Čermák en Schmiedtová 2003) geleidelik saamgestel en uiteindelik aangebied wat wys op die gebreke van definisies aangetref in tradisionele woordeboeke. Stappe wat hier onderneem word, word dan veralgemeen en gebruik in 'n geordende reeks (soortgelyk aan 'n werkvloeiordening), as onderwerpe, kortliks bespreek in die tweede deel, waaraan leksikograwe van eentalige woordeboeke aandag behoort te gee. Hulle word aangevul deur bykomende opmerkings en waarskuwings wat nuttig is vir die samestelling van 'n woordeboek. Op dié manier word 'n kort oorsig van sommige van die hoofstappe van woordeboeksamestelling hier aangebied, aangevul deur die oorspronklike Tsjeggiese data, ontleed in hul onbewerkte, alhoewel semioties geklassifiseerde vorm.

    Sleutelwoorde: EENTALIGE WOORDEBOEKE, KORPUSLEKSIKOGRAFIE, SINTAGMATIEK EN PARADIGMATIEK IN WOORDEBOEKE, WOORDEBOEKINSKRYWING, SOORTE LEMMAS, PRAGMATIEK, BEHANDELING VAN

  5. Proposals for Upgrading the Lexicographical Treatment of Prepositions in Bilingual Dictionaries for Business Translation

    DEFF Research Database (Denmark)

    Nielsen, Sandro; Fuertes-Olivera, Pedro

    2010-01-01

    dictionaries. The theory of lexicographical functions is used to determine which grammatical data types are needed by specific user types. The analysis focuses on a bidirectional Spanish-English business dictionary, its treatment of prepositions in entries and cross-references to the middle matter texts......A good lexicographical basis is needed for designing bilingual dictionaries that help users translate business texts. Many approaches have been suggested for including grammatical data in general dictionaries, but few have analysed the types of grammatical data relevant for bilingual specialised....... The findings show that the treatment is inconsistent and may mislead users, because their business-language competence is insufficient for producing correct translations. Bilingual dictionaries should offer a systematic treatment of prepositions and cross-reference users from entries to contrastive middle...

  6. Classification of multiple sclerosis lesions using adaptive dictionary learning.

    Science.gov (United States)

    Deshpande, Hrishikesh; Maurel, Pierre; Barillot, Christian

    2015-12-01

    This paper presents a sparse representation and adaptive dictionary learning based method for automated classification of multiple sclerosis (MS) lesions in magnetic resonance (MR) images. Manual delineation of MS lesions is a time-consuming task, requiring neuroradiology experts to analyze huge volumes of MR data. This, in addition to the high intra- and inter-observer variability, necessitates automated MS lesion classification methods. Among the many image representation models and classification methods that can be used for this purpose, we investigate the use of sparse modeling. In recent years, sparse representation has evolved as a tool for modeling data using a few basis elements of an over-complete dictionary and has found applications in many image processing tasks, including classification. We propose a supervised classification approach by learning dictionaries specific to the lesions and individual healthy brain tissues, which include white matter (WM), gray matter (GM) and cerebrospinal fluid (CSF). The size of the dictionaries learned for each class plays a major role in data representation, but it is an even more crucial element in the case of competitive classification. Our approach adapts the size of the dictionary for each class, depending on the complexity of the underlying data. The algorithm is validated using 52 multi-sequence MR images acquired from 13 MS patients. The results demonstrate the effectiveness of our approach in MS lesion classification. Copyright © 2015 Elsevier Ltd. All rights reserved.

  7. Efficient Sum of Outer Products Dictionary Learning (SOUP-DIL) and Its Application to Inverse Problems.

    Science.gov (United States)

    Ravishankar, Saiprasad; Nadakuditi, Raj Rao; Fessler, Jeffrey A

    2017-12-01

    The sparsity of signals in a transform domain or dictionary has been exploited in applications such as compression, denoising and inverse problems. More recently, data-driven adaptation of synthesis dictionaries has shown promise compared to analytical dictionary models. However, dictionary learning problems are typically non-convex and NP-hard, and the usual alternating minimization approaches for these problems are often computationally expensive, with the computations dominated by the NP-hard synthesis sparse coding step. This paper exploits the ideas that drive algorithms such as K-SVD, and investigates in detail efficient methods for aggregate sparsity penalized dictionary learning by first approximating the data with a sum of sparse rank-one matrices (outer products) and then using a block coordinate descent approach to estimate the unknowns. The resulting block coordinate descent algorithms involve efficient closed-form solutions. Furthermore, we consider the problem of dictionary-blind image reconstruction, and propose novel and efficient algorithms for adaptive image reconstruction using block coordinate descent and sum of outer products methodologies. We provide a convergence study of the algorithms for dictionary learning and dictionary-blind image reconstruction. Our numerical experiments show the promising performance and speedups provided by the proposed methods over previous schemes in sparse data representation and compressed sensing-based image reconstruction.
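
    The block coordinate descent idea described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the objective ||Y - D C^T||_F^2 + lam^2 ||C||_0 with unit-norm atoms, and the function name `soup_dil_l0`, the random initialisation and the fallback atom are assumptions made for this example. Each atom and its coefficient vector are updated in closed form while all other outer products are held fixed.

```python
import numpy as np

def soup_dil_l0(Y, n_atoms, lam, n_iter=20, seed=0):
    """Hypothetical sketch: approximate Y (n x N) by a sum of sparse
    rank-one terms d_j c_j^T using block coordinate descent."""
    rng = np.random.default_rng(seed)
    n, N = Y.shape
    D = rng.standard_normal((n, n_atoms))
    D /= np.linalg.norm(D, axis=0)          # unit-norm atoms
    C = np.zeros((N, n_atoms))              # column j = coefficients of atom j
    for _ in range(n_iter):
        for j in range(n_atoms):
            # Residual with the j-th outer product removed.
            E = Y - D @ C.T + np.outer(D[:, j], C[:, j])
            # Closed-form sparse coding: hard-threshold the correlations.
            b = E.T @ D[:, j]
            c = np.where(np.abs(b) >= lam, b, 0.0)
            # Closed-form atom update: normalised E c (fallback if c == 0).
            h = E @ c
            norm = np.linalg.norm(h)
            if norm > 0:
                d = h / norm
            else:
                d = np.zeros(n)
                d[0] = 1.0
            D[:, j], C[:, j] = d, c
    return D, C
```

    Because every block update is an exact minimiser of the assumed objective, the objective value cannot increase from one update to the next, which is the property that makes this kind of scheme cheap compared with a full sparse coding solve.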

  8. PURE OR HYBRID? THE DEVELOPMENT OF MIXED DICTIONARY GENRES

    OpenAIRE

    Reinhard R. K. Hartmann

    2005-01-01

    This paper explores 'hybrid' genres of dictionaries and other reference works. Against the tradition of general dictionaries becoming ever more specialised, there has also been a growing trend of mixing two or more 'pure' dictionary types for achieving specific purposes, e.g. the combination of alphabetic and thematic dictionary, general dictionary and technical glossary, dictionary and thesaurus, dictionary and encyclopedia, monolingual and bilingual dictionary, etc. Examples of these various...

  9. Dictionary-based fiber orientation estimation with improved spatial consistency.

    Science.gov (United States)

    Ye, Chuyang; Prince, Jerry L

    2018-02-01

    Diffusion magnetic resonance imaging (dMRI) has enabled in vivo investigation of white matter tracts. Fiber orientation (FO) estimation is a key step in tract reconstruction and has been a popular research topic in dMRI analysis. In particular, the sparsity assumption has been used in conjunction with a dictionary-based framework to achieve reliable FO estimation with a reduced number of gradient directions. Because image noise can have a deleterious effect on the accuracy of FO estimation, previous works have incorporated spatial consistency of FOs in the dictionary-based framework to improve the estimation. However, because FOs are only indirectly determined from the mixture fractions of dictionary atoms and not modeled as variables in the objective function, these methods do not incorporate FO smoothness directly, and their ability to produce smooth FOs could be limited. In this work, we propose an improvement to Fiber Orientation Reconstruction using Neighborhood Information (FORNI), which we call FORNI+; this method estimates FOs in a dictionary-based framework where FO smoothness is better enforced than in FORNI alone. We describe an objective function that explicitly models the actual FOs and the mixture fractions of dictionary atoms. Specifically, it consists of data fidelity between the observed signals and the signals represented by the dictionary, pairwise FO dissimilarity that encourages FO smoothness, and weighted ℓ1-norm terms that ensure the consistency between the actual FOs and the FO configuration suggested by the dictionary representation. The FOs and mixture fractions are then jointly estimated by minimizing the objective function using an iterative alternating optimization strategy. FORNI+ was evaluated on a simulation phantom, a physical phantom, and real brain dMRI data. In particular, in the real brain dMRI experiment, we have qualitatively and quantitatively evaluated the reproducibility of the proposed method. Results demonstrate that

  10. A lexicographic approach to language policy and recommendations for future dictionaries

    DEFF Research Database (Denmark)

    Tarp, Sven; Gouws, Rufus H.

    2008-01-01

    Language policy prevails at different levels and its formulation typically results in a prescriptive presentation of data. In their dictionaries, lexicographers have to respond to the decisions of language policy makers. In this regard dictionaries can adhere to a strict prescriptive policy...... by including only the prescribed forms. Dictionaries can also give a descriptive account of language use without making any recommendations or claims of correctness. Thirdly, dictionaries can be proscriptive by recommending certain forms, even if such a recommendation goes against the prescribed forms....... This article offers an overview of different levels of language policy and the principles of prescription, description and proscription. Examples are given to illustrate certain lexicographic applications of prescription. It is emphasised that access to relevant data is important to dictionary users...

  11. Denoising of gravitational wave signals via dictionary learning algorithms

    Science.gov (United States)

    Torres-Forné, Alejandro; Marquina, Antonio; Font, José A.; Ibáñez, José M.

    2016-12-01

    Gravitational wave astronomy has become a reality after the historical detections accomplished during the first observing run of the two advanced LIGO detectors. In the following years, the number of detections is expected to increase significantly with the full commissioning of the advanced LIGO, advanced Virgo and KAGRA detectors. The development of sophisticated data analysis techniques to improve the opportunities of detection for low signal-to-noise-ratio events is, hence, a most crucial effort. In this paper, we present one such technique, dictionary-learning algorithms, which have been extensively developed in the last few years and successfully applied mostly in the context of image processing. However, to the best of our knowledge, such algorithms have not yet been employed to denoise gravitational wave signals. By building dictionaries from numerical relativity templates of both binary black holes mergers and bursts of rotational core collapse, we show how machine-learning algorithms based on dictionaries can also be successfully applied for gravitational wave denoising. We use a subset of signals from both catalogs, embedded in nonwhite Gaussian noise, to assess our techniques with a large sample of tests and to find the best model parameters. The application of our method to the actual signal GW150914 shows promising results. Dictionary-learning algorithms could be a complementary addition to the gravitational wave data analysis toolkit. They may be used to extract signals from noise and to infer physical parameters if the data are in good enough agreement with the morphology of the dictionary atoms.

  12. Developing a National-Level Concept Dictionary for EHR Implementations in Kenya.

    Science.gov (United States)

    Keny, Aggrey; Wanyee, Steven; Kwaro, Daniel; Mulwa, Edwin; Were, Martin C

    2015-01-01

    The increasing adoption of Electronic Health Records (EHR) by developing countries comes with the need to develop common terminology standards to assure semantic interoperability. In Kenya, where the Ministry of Health has rolled out an EHR at 646 sites, several challenges have emerged including variable dictionaries across implementations, inability to easily share data across systems, lack of expertise in dictionary management, lack of central coordination and custody of a terminology service, inadequately defined policies and processes, insufficient infrastructure, among others. A Concept Working Group was constituted to address these challenges. The country settled on a common Kenya data dictionary, initially derived as a subset of the Columbia International eHealth Laboratory (CIEL)/Millennium Villages Project (MVP) dictionary. The initial dictionary scope largely focuses on clinical needs. Processes and policies around dictionary management are being guided by the framework developed by Bakhshi-Raiez et al. Technical and infrastructure-based approaches are also underway to streamline workflow for dictionary management and distribution across implementations. Kenya's approach on comprehensive common dictionary can serve as a model for other countries in similar settings.

  13. Reconstruction of magnetic resonance imaging by three-dimensional dual-dictionary learning.

    Science.gov (United States)

    Song, Ying; Zhu, Zhen; Lu, Yang; Liu, Qiegen; Zhao, Jun

    2014-03-01

    To improve the magnetic resonance imaging (MRI) data acquisition speed while maintaining the reconstruction quality, a novel method is proposed for multislice MRI reconstruction from undersampled k-space data based on compressed-sensing theory using dictionary learning. There are two aspects to improve the reconstruction quality. One is that spatial correlation among slices is used by extending the atoms in dictionary learning from patches to blocks. The other is that the dictionary-learning scheme is used at two resolution levels; i.e., a low-resolution dictionary is used for sparse coding and a high-resolution dictionary is used for image updating. Numerical experiments are carried out on in vivo 3D MR images of brains and abdomens with a variety of undersampling schemes and ratios. The proposed method (dual-DLMRI) achieves better reconstruction quality than conventional reconstruction methods, with the peak signal-to-noise ratio being 7 dB higher. The advantages of the dual dictionaries are obvious compared with the single dictionary. Parameter variations ranging from 50% to 200% only bias the image quality within 15% in terms of the peak signal-to-noise ratio. Dual-DLMRI effectively uses the a priori information in the dual-dictionary scheme and provides dramatically improved reconstruction quality. Copyright © 2013 Wiley Periodicals, Inc.

  14. Dictionary of dissuasion

    International Nuclear Information System (INIS)

    Wodka-Gallien, P.

    2011-09-01

    With more than 300 head words, this dictionary covers, at the worldwide scale, the history of men (from Curie and Einstein to Barack Obama), the issues (secrecy and information, weapons rush, proliferation and counter-proliferation..), the strategies (massive or gradual counter-attacks), the organisations (IAEA, CEA, Greenpeace etc.) and the equipment related to nuclear dissuasion. If the main part of the book is devoted to military equipment and topics, some other aspects of the nuclear domain are also presented, like radioactivity, civil nuclear accidents, thermonuclear fusion, laboratory equipment, disarmament and others. (J.S.)

  15. Modern dictionary of electronics

    CERN Document Server

    Graf, Rudolf F

    1999-01-01

    Included in this fully revised classic are well over 28,000 terms, phrases, acronyms, and abbreviations from the ever-expanding worlds of consumer electronics, optics, microelectronics, computers, communications, and medical electronics. From the basic elements of theory to the most cutting-edge circuit technology, this book explains it all in both words and pictures. For easy reference, the author has provided definitions for standard abbreviations and equations as well as tables of SI (International System of Units) units, measurements, and schematic symbols. Modern Dictionary of Electronics is

  16. Compiling the First Monolingual Lusoga Dictionary

    Directory of Open Access Journals (Sweden)

    Minah Nabirye

    2011-10-01

    Full Text Available

    Abstract: In this research article a study is made of the approach followed to compile the first-ever monolingual dictionary for Lusoga. Lusoga is a Bantu language spoken in Uganda by slightly over two million people. Being an under-resourced language, the Lusoga orthography had to be designed, a grammar written, and a corpus built, before embarking on the compilation of the dictionary. This compilation was aimed at attaining an academic degree, hence requiring a rigorous research methodology. Firstly, the prevailing methods for compiling dictionaries were mainly practical and insufficient in explaining the theoretical linguistic basis for dictionary compilation. Since dictionaries are based on meaning, the theory of meaning was used to account for all linguistic data considered in dictionaries. However, meaning is considered at a very abstract level, far removed from the process of compiling dictionaries. Another theory, the theory of modularity, was used to bridge the gap between the theory of meaning and the compilation process. The modular theory explains how the different modules of a language contribute information to the different parts of the dictionary article or dictionary information in general. Secondly, the research also had to contend with the different approaches for analysing Bantu languages for Bantu and European audiences. A description of the Bantu- and European-centred approaches to Bantu studies was undertaken in respect of (a) the classification of Lusoga words, and (b) the specification of their citations. As a result, Lusoga lexicography deviates from the prevailing Bantu classification and citation of nouns, adjectives and verbs in particular. The dictionary was tested on two separate occasions and all the feedback was considered in the compilation process. This article, then, gives an overall summary of all the steps involved in the compilation of the Eiwanika ly'Olusoga, i.e. the Monolingual Lusoga Dictionary

  17. System semantics of explanatory dictionaries

    Directory of Open Access Journals (Sweden)

    Volodymyr Shyrokov

    2015-11-01

    Full Text Available Some semantic properties of language that follow from the structure of the lexicographical systems of large explanatory dictionaries are considered. Hyperchains and hypercycles are defined as a particular kind of automorphism of the lexicographical system of an explanatory dictionary. Some semantic consequences following from the principles of lexicographic closure and lexicographic completeness are investigated using the hyperchain and hypercycle formalism. The connection between the hypercycle properties of the lexicographical system semantics and Goedel’s incompleteness theorem is discussed.

  18. Seismic classification through sparse filter dictionaries

    Energy Technology Data Exchange (ETDEWEB)

    Hickmann, Kyle Scott [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Srinivasan, Gowri [Los Alamos National Lab. (LANL), Los Alamos, NM (United States)

    2017-09-13

    We tackle a multi-label classification problem involving the relation between acoustic-profile features and the measured seismogram. To isolate components of the seismograms unique to each class of acoustic profile we build dictionaries of convolutional filters. The convolutional-filter dictionaries for the individual classes are then combined into a large dictionary for the entire seismogram set. A given seismogram is classified by computing its representation in the large dictionary and then comparing reconstruction accuracy with this representation using each of the sub-dictionaries. The sub-dictionary with the minimal reconstruction error identifies the seismogram class.
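
    The decision rule in this abstract (code the signal in the combined dictionary, then pick the class whose sub-dictionary reconstructs it best) can be sketched as below. This is a simplified illustration under stated assumptions: plain matrix dictionaries and ordinary least squares stand in for the convolutional filter dictionaries and sparse coding used in the report, and `classify_by_subdictionary` is a hypothetical name.

```python
import numpy as np

def classify_by_subdictionary(x, sub_dicts):
    """Return (class index, per-class errors) for signal x.

    sub_dicts is a list of (n x k_c) matrices, one per class. Least squares
    is an assumed stand-in for the sparse convolutional coding step.
    """
    D = np.hstack(sub_dicts)                      # combined "large" dictionary
    coef, *_ = np.linalg.lstsq(D, x, rcond=None)  # representation in D
    errors, start = [], 0
    for Dk in sub_dicts:
        stop = start + Dk.shape[1]
        # Reconstruct using only the coefficients that belong to class k.
        errors.append(float(np.linalg.norm(x - Dk @ coef[start:stop])))
        start = stop
    return int(np.argmin(errors)), errors
```

    For example, with two sub-dictionaries spanning disjoint coordinate axes, a signal that lies in the span of the second one is assigned class index 1 because its class-wise reconstruction error is zero there.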

  19. Robust Visual Tracking via Online Discriminative and Low-Rank Dictionary Learning.

    Science.gov (United States)

    Zhou, Tao; Liu, Fanghui; Bhaskar, Harish; Yang, Jie

    2017-09-12

    In this paper, we propose a novel and robust tracking framework based on online discriminative and low-rank dictionary learning. The primary aim of this paper is to obtain compact and low-rank dictionaries that can provide good discriminative representations of both target and background. We accomplish this by exploiting the recovery ability of low-rank matrices. That is, if we assume that the data from the same class are linearly correlated, then the corresponding basis vectors learned from the training set of each class shall render the dictionary to become approximately low-rank. The proposed dictionary learning technique incorporates a reconstruction error that improves the reliability of classification. Also, a multiconstraint objective function is designed to enable active learning of a discriminative and robust dictionary. Further, an optimal solution is obtained by iteratively computing the dictionary, coefficients, and by simultaneously learning the classifier parameters. Finally, a simple yet effective likelihood function is implemented to estimate the optimal state of the target during tracking. Moreover, to make the dictionary adaptive to the variations of the target and background during tracking, an online update criterion is employed while learning the new dictionary. Experimental results on a publicly available benchmark dataset have demonstrated that the proposed tracking algorithm performs better than other state-of-the-art trackers.

  20. Online multi-modal robust non-negative dictionary learning for visual tracking.

    Science.gov (United States)

    Zhang, Xiang; Guan, Naiyang; Tao, Dacheng; Qiu, Xiaogang; Luo, Zhigang

    2015-01-01

    Dictionary learning is a method of acquiring a collection of atoms for subsequent signal representation. Due to its excellent representation ability, dictionary learning has been widely applied in multimedia and computer vision. However, conventional dictionary learning algorithms fail to deal with multi-modal datasets. In this paper, we propose an online multi-modal robust non-negative dictionary learning (OMRNDL) algorithm to overcome this deficiency. Notably, OMRNDL casts visual tracking as a dictionary learning problem under the particle filter framework and captures the intrinsic knowledge about the target from multiple visual modalities, e.g., pixel intensity and texture information. To this end, OMRNDL adaptively learns an individual dictionary, i.e., template, for each modality from available frames, and then represents new particles over all the learned dictionaries by minimizing the fitting loss of data based on M-estimation. The resultant representation coefficient can be viewed as the common semantic representation of particles across multiple modalities, and can be utilized to track the target. OMRNDL incrementally learns the dictionary and the coefficient of each particle by using multiplicative update rules to respectively guarantee their non-negativity constraints. Experimental results on a popular challenging video benchmark validate the effectiveness of OMRNDL for visual tracking in both quantity and quality.
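
    To illustrate how multiplicative update rules keep a dictionary and its coefficients non-negative, the sketch below uses the classical Lee-Seung updates for the Frobenius-norm factorisation V ≈ W H. This is a generic stand-in, not the OMRNDL update (which is online, multi-modal and based on M-estimation); the function name and parameters are assumptions for this example.

```python
import numpy as np

def nmf_multiplicative(V, rank, n_iter=200, eps=1e-9, seed=0):
    """Batch non-negative factorisation V ~ W @ H via multiplicative updates.

    W plays the role of the dictionary and H the coefficients. Each update
    multiplies by a non-negative ratio, so non-negative starting values
    stay non-negative without any explicit projection.
    """
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], rank)) + eps
    H = rng.random((rank, V.shape[1])) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)   # coefficient update
        W *= (V @ H.T) / (W @ H @ H.T + eps)   # dictionary update
    return W, H
```

    The design point is that the constraint is enforced by the form of the update itself rather than by clipping, which is why multiplicative rules are a common choice for non-negative dictionary learning.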

  1. A tensor-based dictionary learning approach to tomographic image reconstruction

    DEFF Research Database (Denmark)

    Soltani, Sara; Kilmer, Misha E.; Hansen, Per Christian

    2016-01-01

    We consider tomographic reconstruction using priors in the form of a dictionary learned from training images. The reconstruction has two stages: first we construct a tensor dictionary prior from our training data, and then we pose the reconstruction problem in terms of recovering the expansion...... coefficients in that dictionary. Our approach differs from past approaches in that (a) we use a third-order tensor representation for our images and (b) we recast the reconstruction problem using the tensor formulation. The dictionary learning problem is presented as a non-negative tensor factorization problem...... with sparsity constraints. The reconstruction problem is formulated in a convex optimization framework by looking for a solution with a sparse representation in the tensor dictionary. Numerical results show that our tensor formulation leads to very sparse representations of both the training images...

  2. Tensor-Dictionary Learning with Deep Kruskal-Factor Analysis

    Energy Technology Data Exchange (ETDEWEB)

    Stevens, Andrew J.; Pu, Yunchen; Sun, Yannan; Spell, Gregory; Carin, Lawrence

    2017-04-20

    We introduce new dictionary learning methods for tensor-variate data of any order. We represent each data item as a sum of Kruskal decomposed dictionary atoms within the framework of beta-process factor analysis (BPFA). Our model is nonparametric and can infer the tensor-rank of each dictionary atom. This Kruskal-Factor Analysis (KFA) is a natural generalization of BPFA. We also extend KFA to a deep convolutional setting and develop online learning methods. We test our approach on image processing and classification tasks achieving state of the art results for 2D & 3D inpainting and Caltech 101. The experiments also show that atom-rank impacts both overcompleteness and sparsity.

  3. Bootstrapping pronunciation dictionaries: practical issues

    CSIR Research Space (South Africa)

    Davel, MH

    2005-09-01

    Full Text Available Bootstrapping techniques are an efficient way to develop electronic pronunciation dictionaries, but require fast system response to be practical for medium-to-large lexicons. In addition, user errors are inevitable during this process...

  4. A Dictionary of Disaster Management

    DEFF Research Database (Denmark)

    Rubin, Olivier; Dahlberg, Rasmus

    A Dictionary of Disaster Management offers over 200 terms covering different disasters from a social science perspective, bringing together insights from many different disciplines including sociology, political science, history, anthropology, and natural science. It also features practical terms...

  5. NCI Dictionary of Genetics Terms

    Science.gov (United States)

    A dictionary of more than 150 genetics-related terms written for healthcare professionals. This resource was developed to support the comprehensive, evidence-based, peer-reviewed PDQ cancer genetics information summaries.

  6. Kirkeby's English–Swahili Dictionary

    Directory of Open Access Journals (Sweden)

    James S. Mdee

    2011-10-01

    Full Text Available

    Abstract: Kirkeby's English–Swahili Dictionary is a bilingual dictionary of more than 50 000 entries. The most laudable feature of the dictionary is its attempt to be user-friendly especially in the way the entry words have been arranged and the amount of information given. However, a clear objective for the compilation of the dictionary is lacking. The compilers do not seem to know the lexicographical gap they want to fill, the users they are targeting, and their dictionary-using skills. In discussing the strong and weak points of the dictionary, the article will refer to theories of dictionary criticism. Three criteria set by McMillan (1949) will guide this review article: (1) the quantity of the information in the dictionary; (2) the quality of the information presented; and (3) the effectiveness of the presentation of the information. Questions posed in the course of this article will include: Does the dictionary give the information required by the user? Is the information transparently accessible? How is the information presented?

    Keywords: DICTIONARY EVALUATION, USER-FRIENDLY, DICTIONARY-USING SKILLS, LEXICOGRAPHICAL ENTRIES, GRAMMATICAL CATEGORIES, SUBGRAMMATICAL CATEGORIES, WORD COMBINATIONS, COLLOCATIONS, TRANSLATION EQUIVALENTS

    Summary: Kirkeby's English–Swahili Dictionary. Kirkeby's English–Swahili Dictionary is a bilingual dictionary with more than 50 000 entries. The most praiseworthy feature of the dictionary is its attempt to be user-friendly, especially in the way the headwords are arranged and the amount of information provided. However, a clear objective for the compilation of the dictionary is lacking. The compilers are apparently uncertain about the lexicographical gap they want to fill, the users for whom it is intended, and their dictionary-using skills. In the discussion of the strong and weak points of the dictionary, the article will refer to theories of

  7. SDL: Saliency-Based Dictionary Learning Framework for Image Similarity.

    Science.gov (United States)

    Sarkar, Rituparna; Acton, Scott T

    2018-02-01

    In image classification, obtaining adequate data to learn a robust classifier has often proven to be difficult in several scenarios. Classification of histological tissue images for health care analysis is a notable application in this context due to the necessity of surgery, biopsy or autopsy. To adequately exploit limited training data in classification, we propose a saliency guided dictionary learning method and subsequently an image similarity technique for histo-pathological image classification. Salient object detection from images aids in the identification of discriminative image features. We leverage the saliency values for the local image regions to learn a dictionary and respective sparse codes for an image, such that the more salient features are reconstructed with smaller error. The dictionary learned from an image gives a compact representation of the image itself and is capable of representing images with similar content, with comparable sparse codes. We employ this idea to design a similarity measure between a pair of images, where local image features of one image are encoded with the dictionary learned from the other and vice versa. To effectively utilize the learned dictionary, we take into account the contribution of each dictionary atom in the sparse codes to generate a global image representation for image comparison. The efficacy of the proposed method was evaluated using three tissue data sets that consist of mammalian kidney, lung and spleen tissue, breast cancer, and colon cancer tissue images. From the experiments, we observe that our methods outperform the state of the art with an increase of 14.2% in the average classification accuracy over all data sets.

  8. Customized Dictionary Learning for Subdatasets with Fine Granularity

    Directory of Open Access Journals (Sweden)

    Lei Ye

    2016-01-01

    Full Text Available Sparse models have a wide range of applications in machine learning and computer vision. Using a learned dictionary instead of an “off-the-shelf” one can dramatically improve performance on a particular dataset. However, learning a new one for each subdataset (subject) with fine granularity may be unwarranted or impractical, due to the restricted availability of subdataset samples and the tremendous number of subjects. To remedy this, we consider the dictionary customization problem, that is, specializing an existing global dictionary corresponding to the total dataset, with the aid of auxiliary samples obtained from the target subdataset. Inspired by observation and then deduced from theoretical analysis, a regularizer is employed penalizing the difference between the global and the customized dictionary. By minimizing the sum of the reconstruction errors and the above regularizer under sparsity constraints, we exploit the characteristics of the target subdataset contained in the auxiliary samples while maintaining the basic sketches stored in the global dictionary. An efficient algorithm is presented and validated with experiments on real-world data.
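
    The trade-off described above can be made concrete with a small sketch. It assumes a squared Frobenius penalty, i.e. minimising ||X - D A||_F^2 + lam * ||D - D0||_F^2 over D with the sparse codes A held fixed; the exact regularizer and algorithm of the paper are not given in the abstract, so this form, the name `customize_dictionary` and its parameters are assumptions. Under that assumption the dictionary step has the closed form D = (X A^T + lam D0)(A A^T + lam I)^(-1).

```python
import numpy as np

def customize_dictionary(X, A, D0, lam):
    """Closed-form dictionary step for an assumed objective
    ||X - D A||_F^2 + lam * ||D - D0||_F^2 with codes A fixed.

    X: (n x N) auxiliary samples, A: (k x N) codes, D0: (n x k) global dictionary.
    """
    k = A.shape[0]
    G = A @ A.T + lam * np.eye(k)      # symmetric positive definite for lam > 0
    R = X @ A.T + lam * D0
    # Solve D G = R without forming an explicit inverse (G is symmetric).
    return np.linalg.solve(G, R.T).T
```

    A large `lam` keeps the result close to the global dictionary `D0`, while a small `lam` lets the auxiliary samples dominate, which mirrors the balance the abstract describes between subdataset characteristics and the global dictionary.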

  9. Dictionary Writing System (DWS) + Corpus Query Package (CQP ...

    African Journals Online (AJOL)

    In this article the integrated corpus query functionality of the dictionary compilation software TshwaneLex is analysed. Attention is given to the handling of both raw corpus data and annotated corpus data. With regard to the latter it is shown how, with a minimum of human effort, machine learning techniques can be employed ...

  10. Improved Bounds for Dictionary Look-up with One Error

    DEFF Research Database (Denmark)

    Brodal, Gerth Stølting; Srinivasan, Venkatesh

    2000-01-01

    Given a dictionary S of n binary strings each of length m, we consider the problem of designing a data structure for S that supports d-queries; given a binary query string q of length m, a d-query reports if there exists a string in S within Hamming distance d of q. We construct a data...
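
    To make the query problem concrete, here is a naive baseline for the case d = 1. It is not the data structure constructed in the paper (whose description is truncated above); it simply stores S in a hash set and probes the query plus each of its m single-bit flips. The function names are hypothetical.

```python
def build_table(strings):
    """Store each binary string (e.g. '0101') as an integer in a set."""
    return {int(s, 2) for s in strings}

def query_within_one(table, q):
    """Report whether some stored string is within Hamming distance 1 of q.

    Naive baseline: one probe for q itself plus one probe per flipped bit,
    i.e. m + 1 set lookups for a query of length m.
    """
    x = int(q, 2)
    if x in table:
        return True
    # Flip bit i with XOR and probe the set.
    return any((x ^ (1 << i)) in table for i in range(len(q)))
```

    With S = {0000, 1111}, the query 0001 succeeds (one flip away from 0000) while 0011 fails, since it is at distance 2 from both stored strings.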

  11. Synopsis articles in the planning of a trilingual dictionary: Yilumbu ...

    African Journals Online (AJOL)

    A distinction is often drawn between single articles and synopsis articles. A single article is the so-called default article. It does not deviate from the traditional microstructural approach of the dictionary because it presents the minimum data for each lemma sign treated, while a synopsis article gives additional data for each ...

  12. Regularized spherical polar fourier diffusion MRI with optimal dictionary learning.

    Science.gov (United States)

    Cheng, Jian; Jiang, Tianzi; Deriche, Rachid; Shen, Dinggang; Yap, Pew-Thian

    2013-01-01

    Compressed Sensing (CS) takes advantage of signal sparsity or compressibility and allows superb signal reconstruction from relatively few measurements. Based on CS theory, a suitable dictionary for sparse representation of the signal is required. In diffusion MRI (dMRI), CS methods proposed for reconstruction of diffusion-weighted signal and the Ensemble Average Propagator (EAP) utilize two kinds of Dictionary Learning (DL) methods: 1) Discrete Representation DL (DR-DL), and 2) Continuous Representation DL (CR-DL). DR-DL is susceptible to numerical inaccuracy owing to interpolation and regridding errors in a discretized q-space. In this paper, we propose a novel CR-DL approach, called Dictionary Learning - Spherical Polar Fourier Imaging (DL-SPFI) for effective compressed-sensing reconstruction of the q-space diffusion-weighted signal and the EAP. In DL-SPFI, a dictionary that sparsifies the signal is learned from the space of continuous Gaussian diffusion signals. The learned dictionary is then adaptively applied to different voxels using a weighted LASSO framework for robust signal reconstruction. Compared with the state-of-the-art CR-DL and DR-DL methods proposed by Merlet et al. and Bilgic et al., respectively, our work offers the following advantages. First, the learned dictionary is proved to be optimal for Gaussian diffusion signals. Second, to our knowledge, this is the first work to learn a voxel-adaptive dictionary. The importance of the adaptive dictionary in EAP reconstruction will be demonstrated theoretically and empirically. Third, optimization in DL-SPFI is only performed in a small subspace resided by the SPF coefficients, as opposed to the q-space approach utilized by Merlet et al. We experimentally evaluated DL-SPFI with respect to L1-norm regularized SPFI (L1-SPFI), which uses the original SPF basis, and the DR-DL method proposed by Bilgic et al. The experimental results on synthetic and real data indicate that the learned dictionary produces

  13. Digital Lexicography: Research of Usage of Different Dictionary Types at Vilnius Gediminas Technical University

    Directory of Open Access Journals (Sweden)

    Daiva Nomicienė

    2017-05-01

    Full Text Available The current research investigates the attitudes of VGTU students and foreign language teachers towards three types of dictionaries: printed dictionaries, electronic versions of printed dictionaries, and dictionaries available online. The aim of the research was to establish which type of dictionary is used most by the students and foreign language teachers at VGTU. 296 students from different study programmes and 12 foreign language teachers were selected to participate in the investigation. The research data were analysed from two perspectives: qualitatively and quantitatively. Qualitatively, the data were analysed by means of a literature review, a questionnaire and a comparative analysis. Quantitatively, the data were analysed using descriptive statistics. The collected data revealed differences between the students’ and the foreign language teachers’ attitudes. The majority of the students preferred online dictionaries to printed and electronic ones, whereas the foreign language teachers’ choice of dictionary type was limited to the printed version. The same tendency could be observed in the answers to the question about searching for special terms: the majority of the students would rely on dictionaries available on the Internet, whereas the teachers would select printed ones. Based on the research, it could be concluded that foreign language teachers should take students’ attitudes into account when preparing study material and recommendations for the students.

  14. Accurate classification of brain gliomas by discriminate dictionary learning based on projective dictionary pair learning of proton magnetic resonance spectra.

    Science.gov (United States)

    Adebileje, Sikiru Afolabi; Ghasemi, Keyvan; Aiyelabegan, Hammed Tanimowo; Saligheh Rad, Hamidreza

    2017-04-01

    Proton magnetic resonance spectroscopy is a powerful noninvasive technique that complements the structural images of cMRI, which aids biomedical and clinical research, by identifying and visualizing the compositions of various metabolites within the tissues of interest. However, accurate classification of proton magnetic resonance spectroscopy is still a challenging issue in clinics due to low signal-to-noise ratio, overlapping peaks of metabolites, and the presence of background macromolecules. This paper evaluates the performance of a discriminative dictionary learning classifier based on the projective dictionary pair learning method for the task of classifying proton magnetic resonance spectroscopy spectra of brain gliomas, and the results were compared with the sub-dictionary learning methods. The proton magnetic resonance spectroscopy data contain a total of 150 spectra (74 healthy, 23 grade II, 23 grade III, and 30 grade IV) from two databases. The datasets from both databases were first coupled together, followed by column normalization. The Kennard-Stone algorithm was used to split the datasets into training and test sets. Performance comparison based on the overall accuracy, sensitivity, specificity, and precision was conducted. Based on the overall accuracy of our classification scheme, the dictionary pair learning method was found to outperform the sub-dictionary learning methods, with 97.78% compared with 68.89%. Copyright © 2016 John Wiley & Sons, Ltd.
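
    The Kennard-Stone split mentioned in the abstract above can be illustrated with a minimal sketch. It is not the authors' code: the function name `kennard_stone`, the Euclidean distance, and the toy points are assumptions made for illustration. The rule starts from the two samples that are farthest apart and then repeatedly adds the sample whose nearest already-selected neighbour is farthest away, so the training set spreads over the data.

```python
import math


def kennard_stone(samples, n_train):
    """Return (train_indices, test_indices) chosen by the Kennard-Stone rule.

    `samples` is a sequence of equal-length numeric tuples (hypothetical input
    format); Euclidean distance is an assumption of this sketch.
    """
    n = len(samples)
    if not 2 <= n_train <= n:
        raise ValueError("n_train must be between 2 and len(samples)")
    dist = [[math.dist(a, b) for b in samples] for a in samples]

    # Seed the training set with the two samples that are farthest apart.
    first, second = max(
        ((i, j) for i in range(n) for j in range(i + 1, n)),
        key=lambda pair: dist[pair[0]][pair[1]],
    )
    selected = [first, second]
    remaining = [k for k in range(n) if k not in (first, second)]

    while len(selected) < n_train:
        # Add the sample whose closest selected neighbour is farthest away.
        nxt = max(remaining, key=lambda k: min(dist[k][s] for s in selected))
        selected.append(nxt)
        remaining.remove(nxt)
    return selected, remaining


# Toy one-dimensional "spectra" embedded in 2D, purely illustrative.
points = [(0, 0), (1, 0), (2, 0), (10, 0), (5, 0)]
train_idx, test_idx = kennard_stone(points, 3)
```

    Whatever is left over after the selection serves as the test set; in the toy data the two extreme points and the midpoint end up in training.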

  15. The concept of a bilingual dictionary

    DEFF Research Database (Denmark)

    Tarp, Sven

    2005-01-01

    The term bilingual dictionary is widely used, not only by librarians and dictionary users in general but also by professional lexicographers dedicated to the theory and practice of dictionary making. For this reason it should be expected that there would be a common and well-established definition...... of the concept of a bilingual dictionary. It is evident that most people have an intuitive idea of what is meant by «bilingual dictionary». But science-based lexicographic theory - at least if it wants to be considered as such - must go beyond intuition and furnish precise definitions of the concepts used...... chapters, various definitions will be discussed and related to dictionary practice and, subsequently, the very concept of a bilingual dictionary will be examined in the light of a dictionary typology based upon the modern theory of lexicographic functions....

  16. The Pocket Dictionary: A Textbook for Spelling.

    Science.gov (United States)

    Doggett, Maran

    1982-01-01

    Reports on a productive approach to secondary-school spelling instruction--one that emphasizes how and when to use the dictionary. Describes two of the many class activities that cultivate student use of the dictionary. (RL)

  17. Cheap Words: A Paperback Dictionary Roundup.

    Science.gov (United States)

    Kister, Ken

    1979-01-01

    Surveys currently available paperback editions in three classes of dictionaries: collegiate, abridged, and pocket. A general discussion distinguishes among the classes and offers seven consumer tips, followed by an annotated listing of dictionaries now available. (SW)

  18. Namibian University Entrants' Concepts of 'a Dictionary'*

    African Journals Online (AJOL)

    rbr

    , nor is it ..... Rather, it indicates the effect of OUP's marketing efforts ..... tionary pedagogy it might not be sufficient to merely state that a dictionary ... To determine if the variables of exposure to dictionary pedagogy at school, fre-.

  19. Neologisms in bilingual digital dictionaries (on the example of Bulgarian-Polish dictionary)

    Directory of Open Access Journals (Sweden)

    Ludmila Dimitrova

    2015-11-01

    The paper discusses the presentation of neologisms in the recent version of the Bulgarian-Polish digital dictionary. We also continue the discussion of important problems related to the classifiers of the verbs as headwords of the digital dictionary entries. We analyze some examples from the ongoing experimental version of the Bulgarian-Polish digital dictionary.

  20. 3D Reconstruction of human bones based on dictionary learning.

    Science.gov (United States)

    Zhang, Binkai; Wang, Xiang; Liang, Xiao; Zheng, Jinjin

    2017-11-01

    An effective method for reconstructing a 3D model of human bones from computed tomography (CT) image data based on dictionary learning is proposed. In this study, the dictionary comprises the vertices of triangular meshes, and the sparse coefficient matrix indicates the connectivity information. For better reconstruction performance, we proposed a balance coefficient between the approximation and regularisation terms and a method for optimisation. Moreover, we applied a local updating strategy and a mesh-optimisation method to update the dictionary and the sparse matrix, respectively. The two updating steps are iterated alternately until the objective function converges. Thus, a reconstructed mesh could be obtained with high accuracy and regularisation. The experimental results show that the proposed method has the potential to obtain high precision and high-quality triangular meshes for rapid prototyping, medical diagnosis, and tissue engineering. Copyright © 2017 IPEM. Published by Elsevier Ltd. All rights reserved.

  1. Loops and Self-Reference in the Construction of Dictionaries

    Science.gov (United States)

    Levary, David; Eckmann, Jean-Pierre; Moses, Elisha; Tlusty, Tsvi

    2012-07-01

    Dictionaries link a given word to a set of alternative words (the definition) which in turn point to further descendants. Iterating through definitions in this way, one typically finds that definitions loop back upon themselves. We demonstrate that such definitional loops are created in order to introduce new concepts into a language. In contrast to the expectations for a random lexical network, in graphs of the dictionary, meaningful loops are quite short, although they are often linked to form larger, strongly connected components. These components are found to represent distinct semantic ideas. This observation can be quantified by a singular value decomposition, which uncovers a set of conceptual relationships arising in the global structure of the dictionary. Finally, we use etymological data to show that elements of loops tend to be added to the English lexicon simultaneously and incorporate our results into a simple model for language evolution that falls within the “rich-get-richer” class of network growth.
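
    The notion of definitional loops and strongly connected components described above can be sketched on a toy graph. This is an illustration only, not the authors' analysis: the miniature `toy_definitions` mapping and the function name are hypothetical, and Tarjan's algorithm is used here simply as one standard way to find strongly connected components.

```python
def strongly_connected_components(graph):
    """Tarjan's algorithm; `graph` maps each word to the words in its definition."""
    index, low = {}, {}
    stack, on_stack = [], set()
    components = []
    counter = 0

    def visit(v):
        nonlocal counter
        index[v] = low[v] = counter
        counter += 1
        stack.append(v)
        on_stack.add(v)
        for w in graph.get(v, ()):
            if w not in index:
                visit(w)
                low[v] = min(low[v], low[w])
            elif w in on_stack:
                low[v] = min(low[v], index[w])
        if low[v] == index[v]:
            # v is the root of a component: pop the stack down to v.
            component = set()
            while True:
                w = stack.pop()
                on_stack.discard(w)
                component.add(w)
                if w == v:
                    break
            components.append(component)

    for v in list(graph):
        if v not in index:
            visit(v)
    return components


# Hypothetical miniature dictionary: headword -> words used in its definition.
toy_definitions = {
    "big": ["large"],
    "large": ["big"],
    "hot": ["warm"],
    "warm": ["hot"],
    "giant": ["big", "person"],
    "person": [],
}
# Components with more than one word correspond to definitional loops.
loops = [c for c in strongly_connected_components(toy_definitions) if len(c) > 1]
```

    In this toy graph, "big"/"large" and "hot"/"warm" each form a loop, while "giant" only points into a loop without belonging to one.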

  2. TWRS privatization support project waste characterization resource dictionary

    International Nuclear Information System (INIS)

    Patello, G.K.; Wiemers, K.D.

    1996-09-01

    A single estimate of waste characteristics for each underground storage tank at the Hanford Site is not available. The information that is available was developed for specific programmatic objectives and varies in format and level of descriptive detail, depending on the intended application. This dictionary reflects an attempt to define what waste characterization information is available. It shows the relationship between the identified resource and the original data source and the inter-relationships among the resources; it also provides a brief description of each resource. Developed as a general dictionary for waste characterization information, this document is intended to make the user aware of potentially useful resources.

  3. Dictionary of energy

    Energy Technology Data Exchange (ETDEWEB)

    Counihan, M

    1981-01-01

    Every aspect of energy - production, conversion and use - is discussed and explained. Comprehensive and well-illustrated entries cover fossil and other types of chemical fuel; hydroelectric and nuclear power; energy conservation; solar energy of every kind; wind, wave and tidal power. Every type of nuclear reactor is described, with emphasis on the energy technologies that have the greatest present relevance and future promise. The first section is devoted to an explanation of the units used, with conversion tables; key concepts are defined. The closing sections comprise tables of international energy statistics and a short bibliography. This dictionary is an introduction and reference book for general readers, students and all workers in energy and energy-related fields. It is fully metricated.

  4. Dictionary of nuclear engineering

    International Nuclear Information System (INIS)

    Sube, R.

    1985-01-01

    This dictionary covers nuclear engineering defined in its general sense as applied nuclear physics: industrial and other applications of nuclear power, isotopes and ionizing radiation, nuclear materials, nuclear facilities and nuclear weapons together with their scientific and technological fundamentals. During the compilation of terms, great attention was only given to generally valid basic expressions and to special terms where these occurred in all four languages. A great number of textbooks and monographs, as well as specialist journals covering many years, have been evaluated. Detailed attention has been paid to standards. Of importance in nuclear engineering are the international standards of the International Atomic Energy Organization (including the terminology employed by the International Nuclear Information System INIS), the International Organization of Standardization, the Council for Mutual Economic Assistance, the World Energy Conference, the International Electrical Engineering Commission, and also a great many national standards which, unfortunately, frequently deviate from one another as regards definition and, in particular, designation. (orig.)

  5. Dictionary Learning on the Manifold of Square Root Densities and Application to Reconstruction of Diffusion Propagator Fields*

    Science.gov (United States)

    Sun, Jiaqi; Xie, Yuchen; Ye, Wenxing; Ho, Jeffrey; Entezari, Alireza; Blackband, Stephen J.

    2013-01-01

    In this paper, we present a novel dictionary learning framework for data lying on the manifold of square root densities and apply it to the reconstruction of diffusion propagator (DP) fields given a multi-shell diffusion MRI data set. Unlike most of the existing dictionary learning algorithms which rely on the assumption that the data points are vectors in some Euclidean space, our dictionary learning algorithm is designed to incorporate the intrinsic geometric structure of manifolds and performs better than traditional dictionary learning approaches when applied to data lying on the manifold of square root densities. Non-negativity as well as smoothness across the whole field of the reconstructed DPs is guaranteed in our approach. We demonstrate the advantage of our approach by comparing it with an existing dictionary based reconstruction method on synthetic and real multi-shell MRI data. PMID:24684004

  6. 3D dictionary learning based iterative cone beam CT reconstruction

    Directory of Open Access Journals (Sweden)

    Ti Bai

    2014-03-01

    Purpose: This work is to develop a 3D dictionary learning based cone beam CT (CBCT) reconstruction algorithm on graphic processing units (GPU) to improve the quality of sparse-view CBCT reconstruction with high efficiency. Methods: A 3D dictionary containing 256 small volumes (atoms) of 3 × 3 × 3 was trained from a large number of blocks extracted from a high quality volume image. On this basis, we utilized a Cholesky decomposition based orthogonal matching pursuit algorithm to find the sparse representation of each block. To accelerate the time-consuming sparse coding in the 3D case, we implemented the sparse coding in a parallel fashion by taking advantage of the tremendous computational power of GPU. Conjugate gradient least square algorithm was adopted to minimize the data fidelity term. Evaluations are performed based on a head-neck patient case. FDK reconstruction with full dataset of 364 projections is used as the reference. We compared the proposed 3D dictionary learning based method with tight frame (TF) by performing reconstructions on a subset data of 121 projections. Results: Compared to TF based CBCT reconstruction that shows good overall performance, our experiments indicated that 3D dictionary learning based CBCT reconstruction is able to recover finer structures, remove more streaking artifacts and also induce less blocky artifacts. Conclusion: 3D dictionary learning based CBCT reconstruction algorithm is able to sense the structural information while suppressing the noise, and hence to achieve high quality reconstruction in the sparse-view case. The GPU realization of the whole algorithm offers a significant efficiency enhancement, making this algorithm more feasible for potential clinical application. Cite this article as: Bai T, Yan H, Shi F, Jia X, Lou Y, Xu Q, Jiang S, Mou X. 3D dictionary learning based iterative cone beam CT reconstruction. Int J Cancer Ther Oncol 2014; 2(2):020240. DOI: 10
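
    The sparse coding step named in the abstract above, orthogonal matching pursuit (OMP), can be sketched as follows. This is a minimal CPU illustration, not the authors' GPU implementation: it refits the selected atoms with NumPy's generic least-squares solver instead of the Cholesky-based update the abstract mentions, and the function name `omp` and the tiny 2 × 3 dictionary are assumptions made for the example.

```python
import numpy as np


def omp(D, y, n_nonzero):
    """Greedy OMP: approximate y using at most n_nonzero columns (atoms) of D.

    Assumes the columns of D have unit norm. Uses np.linalg.lstsq for the
    refit step (an assumption of this sketch, not a Cholesky update).
    """
    y = np.asarray(y, dtype=float)
    residual = y.copy()
    support = []
    coef = np.zeros(D.shape[1])
    sol = np.zeros(0)
    for _ in range(n_nonzero):
        # Choose the atom most correlated with the current residual.
        k = int(np.argmax(np.abs(D.T @ residual)))
        if k in support:
            break  # no new atom improves the fit
        support.append(k)
        # Re-fit all selected atoms jointly (the "orthogonal" step).
        sol, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ sol
    if support:
        coef[support] = sol
    return coef


# Toy overcomplete dictionary with three unit-norm atoms in 2D.
s = 1.0 / np.sqrt(2.0)
D = np.array([[1.0, 0.0, s],
              [0.0, 1.0, s]])
y = 3.0 * D[:, 2]          # signal built from the third atom only
code = omp(D, y, n_nonzero=1)
```

    Because each block is coded independently, the per-block calls are naturally parallel, which is the property the abstract exploits on the GPU.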

  7. The Oxford English Dictionary: A Brief History.

    Science.gov (United States)

    Fritze, Ronald H.

    1989-01-01

    Reviews the development of English dictionaries in general and the Oxford English Dictionary (OED) in particular. The discussion covers the decision by the Philological Society to create the dictionary, the principles that guided its development, the involvement of James Augustus Henry Murray, the magnitude and progress of the project, and the…

  8. THE PROPOSED NDEBELE-SHONA DICTIONARY: PROSPECTS ...

    African Journals Online (AJOL)

    R.B. Ruthven

    Ndebele and Shona reflect the intentions of Zimbabwean language planners from different periods. .... The inclusion of the bilingual dictionary in the ALLEX master plan for dictionaries implies that the importance of the dictionary was already felt at that very early .... among others, patriotism, moral values and national unity.

  9. Elaboration of a dictionary for radiographic findings

    International Nuclear Information System (INIS)

    Rocha, Roberto A.; Huff, Stanley M.; Haug, Peter J.

    1996-01-01

    The process for creating a dictionary to represent chest radiologic findings is summarized. The dictionary is built from different sources of terms, including medical vocabularies and chest X-ray reports. The relevance of each source is estimated using the proportions with which they can be found in the final edition of the dictionary

  10. The New Unabridged English-Persian Dictionary.

    Science.gov (United States)

    Aryanpur, Abbas; Saleh, Jahan Shah

    This five-volume English-Persian dictionary is based on Webster's International Dictionary (1960 and 1961) and The Shorter Oxford English Dictionary (1959); it attempts to provide Persian equivalents of all the words of Oxford and all the key-words of Webster. Pronunciation keys for the English phonetic transcription and for the difficult Persian…

  11. Methods in Lexicography and Dictionary Research | Schierholz ...

    African Journals Online (AJOL)

    Methods are used in every stage of dictionary-making and in every scientific analysis which is carried out in the field of dictionary research. This article presents some general considerations on methods in philosophy of science, gives an overview of many methods used in linguistics, in lexicography, dictionary research as ...

  12. Expectation Levels in Dictionary Consultation and Compilation ...

    African Journals Online (AJOL)

    Dictionary consultation and compilation is a two-way engagement between two parties, namely a dictionary user and a lexicographer. How well users cope with looking up words in a Bantu language dictionary and to what extent their expectations are met, depends on their consultation skills, their knowledge of the structure ...

  13. Klein Woordeboek / Little Dictionary | Louw | Lexikos

    African Journals Online (AJOL)

    Bilingual translation dictionaries play an important part in modern user orientated lexicography in South Africa. An affordable bidirectional pocket translation dictionary, such as Klein Woordeboek/Little Dictionary, with English and Afrikaans as language pair, is growing in value as a carrier of necessary everyday linguistic ...

  14. Expectation Levels in Dictionary Consultation and Compilation*

    African Journals Online (AJOL)

    Abstract: Dictionary consultation and compilation is a two-way engagement between two par- ties, namely a dictionary user and a lexicographer. How well users cope with looking up words in a Bantu language dictionary and to what extent their expectations are met, depends on their con- sultation skills, their knowledge of ...

  15. Dictionary Pair Learning on Grassmann Manifolds for Image Denoising.

    Science.gov (United States)

    Zeng, Xianhua; Bian, Wei; Liu, Wei; Shen, Jialie; Tao, Dacheng

    2015-11-01

    Image denoising is a fundamental problem in computer vision and image processing that holds considerable practical importance for real-world applications. The traditional patch-based and sparse coding-driven image denoising methods convert 2D image patches into 1D vectors for further processing. Thus, these methods inevitably break down the inherent 2D geometric structure of natural images. To overcome this limitation pertaining to the previous image denoising methods, we propose a 2D image denoising model, namely, the dictionary pair learning (DPL) model, and we design a corresponding algorithm called the DPL on the Grassmann-manifold (DPLG) algorithm. The DPLG algorithm first learns an initial dictionary pair (i.e., the left and right dictionaries) by employing a subspace partition technique on the Grassmann manifold, wherein the refined dictionary pair is obtained through a sub-dictionary pair merging. The DPLG obtains a sparse representation by encoding each image patch only with the selected sub-dictionary pair. The non-zero elements of the sparse representation are further smoothed by the graph Laplacian operator to remove the noise. Consequently, the DPLG algorithm not only preserves the inherent 2D geometric structure of natural images but also performs manifold smoothing in the 2D sparse coding space. Experimental evaluations on benchmark images and the Berkeley segmentation data sets demonstrate that the DPLG algorithm also improves the structural SIMilarity values, a measure of perceptual visual quality, for denoised images. Moreover, the DPLG produces peak signal-to-noise ratio values that are competitive with those of popular image denoising algorithms.

  16. The use of examples in polyfunctional dictionaries | Prinsloo | Lexikos

    African Journals Online (AJOL)

    ... evaluate some current approaches towards the handling of examples of usage as a data category in modern dictionaries and to suggest ways in which this information category can be improved by compiling, selecting and shaping examples to render optimal transfer of information and to enhance information retrieval.

  17. From print to digital: implications for dictionary policy and ...

    African Journals Online (AJOL)

    user

    data, while at the same time making a large quantity of information easily accessible. The risk of .... for a smaller segment of total traffic to most dictionary sites. .... Atwood, discussing the concept of freedom in the age of the Internet, provides.

  18. PURE OR HYBRID? THE DEVELOPMENT OF MIXED DICTIONARY GENRES

    Directory of Open Access Journals (Sweden)

    Reinhard R. K. Hartmann

    2005-11-01

    This paper explores 'hybrid' genres of dictionaries and other reference works. Against the tradition of general dictionaries becoming ever more specialised, there has also been a growing trend of mixing two or more 'pure' dictionary types for achieving specific purposes, e.g. the combination of alphabetic and thematic dictionary, general dictionary and technical glossary, dictionary and thesaurus, dictionary and encyclopedia, monolingual and bilingual dictionary, etc. Examples of these various sub-types are discussed (admitting that dictionary research has neglected their study), with the aim of determining overall trends and implications, particularly with regard to the possibility of their further development with the means of information technology.

  19. Dictionary-enhanced imaging cytometry

    Science.gov (United States)

    Orth, Antony; Schaak, Diane; Schonbrun, Ethan

    2017-02-01

    State-of-the-art high-throughput microscopes are now capable of recording image data at a phenomenal rate, imaging entire microscope slides in minutes. In this paper we investigate how a large image set can be used to perform automated cell classification and denoising. To this end, we acquire an image library consisting of over one quarter-million white blood cell (WBC) nuclei together with CD15/CD16 protein expression for each cell. We show that the WBC nucleus images alone can be used to replicate CD expression-based gating, even in the presence of significant imaging noise. We also demonstrate that accurate estimates of white blood cell images can be recovered from extremely noisy images by comparing with a reference dictionary. This has implications for dose-limited imaging when samples belong to a highly restricted class such as a well-studied cell type. Furthermore, large image libraries may endow microscopes with capabilities beyond their hardware specifications in terms of sensitivity and resolution. We call for researchers to crowd source large image libraries of common cell lines to explore this possibility.

  20. On the timelessness of music dictionaries

    DEFF Research Database (Denmark)

    Bergenholtz, Henning; Bergenholtz, Inger

    2007-01-01

    A music dictionary for the Internet serves the same functions as printed music dictionaries. An old music dictionary is as useful as a new one if its information is correct. But the fact that an Internet dictionary can at any time be corrected according to modern practices makes it, if not timeless...... reception rather than translation or text production. The starting point of the dictionary is described, along with the way in which the possibilities of the Internet have influenced the concept and the content of the articles and the outer texts....

  1. Compressed Sensing with Rank Deficient Dictionaries

    DEFF Research Database (Denmark)

    Hansen, Thomas Lundgaard; Johansen, Daniel Højrup; Jørgensen, Peter Bjørn

    2012-01-01

    In compressed sensing it is generally assumed that the dictionary matrix constitutes a (possibly overcomplete) basis of the signal space. In this paper we consider dictionaries that do not span the signal space, i.e. rank deficient dictionaries. We show that in this case the signal-to-noise ratio...... (SNR) in the compressed samples can be increased by selecting the rows of the measurement matrix from the column space of the dictionary. As an example application of compressed sensing with a rank deficient dictionary, we present a case study of compressed sensing applied to the Coarse Acquisition (C...

  2. On the timelessness of music dictionaries

    DEFF Research Database (Denmark)

    Bergenholtz, Henning; Bergenholtz, Inger

    2007-01-01

    A music dictionary for the Internet serves the same functions as printed music dictionaries. An old music dictionary is as useful as a new one if its information is correct. But the fact that an Internet dictionary can at any time be corrected according to modern practices makes it, if not timeless......, at any rate more up to date. Besides, the possibilities of illustrating with picture and sound open a wide field of usefulness. Nevertheless the lexicographer has to be aware of the different needs for different user types in different user situations. The dictionary in question is made for text...

  3. Bilingualized Dictionaries with Special Reference to the Chinese ...

    African Journals Online (AJOL)

    As a type of dictionary with huge popularity among EFL learners in China, the bilingualized dictionary (BLD) deserves more academic and pedagogical attention than it receives nowadays. This article gives an overview of the BLD within the framework of dictionary research, including dictionary history, dictionary typology, ...

  4. Learning Low-Rank Class-Specific Dictionary and Sparse Intra-Class Variant Dictionary for Face Recognition.

    Science.gov (United States)

    Tang, Xin; Feng, Guo-Can; Li, Xiao-Xin; Cai, Jia-Xin

    2015-01-01

    Face recognition is challenging especially when the images from different persons are similar to each other due to variations in illumination, expression, and occlusion. If we have sufficient training images of each person which can span the facial variations of that person under testing conditions, sparse representation based classification (SRC) achieves very promising results. However, in many applications, face recognition often encounters the small sample size problem arising from the small number of available training images for each person. In this paper, we present a novel face recognition framework by utilizing low-rank and sparse error matrix decomposition, and sparse coding techniques (LRSE+SC). Firstly, the low-rank matrix recovery technique is applied to decompose the face images per class into a low-rank matrix and a sparse error matrix. The low-rank matrix of each individual is a class-specific dictionary and it captures the discriminative feature of this individual. The sparse error matrix represents the intra-class variations, such as illumination, expression changes. Secondly, we combine the low-rank part (representative basis) of each person into a supervised dictionary and integrate all the sparse error matrix of each individual into a within-individual variant dictionary which can be applied to represent the possible variations between the testing and training images. Then these two dictionaries are used to code the query image. The within-individual variant dictionary can be shared by all the subjects and only contribute to explain the lighting conditions, expressions, and occlusions of the query image rather than discrimination. At last, a reconstruction-based scheme is adopted for face recognition. Since the within-individual dictionary is introduced, LRSE+SC can handle the problem of the corrupted training data and the situation that not all subjects have enough samples for training. Experimental results show that our method achieves the

  5. Learning Low-Rank Class-Specific Dictionary and Sparse Intra-Class Variant Dictionary for Face Recognition.

    Directory of Open Access Journals (Sweden)

    Xin Tang

    Face recognition is challenging especially when the images from different persons are similar to each other due to variations in illumination, expression, and occlusion. If we have sufficient training images of each person which can span the facial variations of that person under testing conditions, sparse representation based classification (SRC) achieves very promising results. However, in many applications, face recognition often encounters the small sample size problem arising from the small number of available training images for each person. In this paper, we present a novel face recognition framework by utilizing low-rank and sparse error matrix decomposition, and sparse coding techniques (LRSE+SC). Firstly, the low-rank matrix recovery technique is applied to decompose the face images per class into a low-rank matrix and a sparse error matrix. The low-rank matrix of each individual is a class-specific dictionary and it captures the discriminative feature of this individual. The sparse error matrix represents the intra-class variations, such as illumination, expression changes. Secondly, we combine the low-rank part (representative basis) of each person into a supervised dictionary and integrate all the sparse error matrix of each individual into a within-individual variant dictionary which can be applied to represent the possible variations between the testing and training images. Then these two dictionaries are used to code the query image. The within-individual variant dictionary can be shared by all the subjects and only contribute to explain the lighting conditions, expressions, and occlusions of the query image rather than discrimination. At last, a reconstruction-based scheme is adopted for face recognition. Since the within-individual dictionary is introduced, LRSE+SC can handle the problem of the corrupted training data and the situation that not all subjects have enough samples for training. Experimental results show that our

  6. Learning Low-Rank Class-Specific Dictionary and Sparse Intra-Class Variant Dictionary for Face Recognition

    Science.gov (United States)

    Tang, Xin; Feng, Guo-can; Li, Xiao-xin; Cai, Jia-xin

    2015-01-01

    Face recognition is challenging especially when the images from different persons are similar to each other due to variations in illumination, expression, and occlusion. If we have sufficient training images of each person which can span the facial variations of that person under testing conditions, sparse representation based classification (SRC) achieves very promising results. However, in many applications, face recognition often encounters the small sample size problem arising from the small number of available training images for each person. In this paper, we present a novel face recognition framework by utilizing low-rank and sparse error matrix decomposition, and sparse coding techniques (LRSE+SC). Firstly, the low-rank matrix recovery technique is applied to decompose the face images per class into a low-rank matrix and a sparse error matrix. The low-rank matrix of each individual is a class-specific dictionary and it captures the discriminative feature of this individual. The sparse error matrix represents the intra-class variations, such as illumination, expression changes. Secondly, we combine the low-rank part (representative basis) of each person into a supervised dictionary and integrate all the sparse error matrix of each individual into a within-individual variant dictionary which can be applied to represent the possible variations between the testing and training images. Then these two dictionaries are used to code the query image. The within-individual variant dictionary can be shared by all the subjects and only contribute to explain the lighting conditions, expressions, and occlusions of the query image rather than discrimination. At last, a reconstruction-based scheme is adopted for face recognition. Since the within-individual dictionary is introduced, LRSE+SC can handle the problem of the corrupted training data and the situation that not all subjects have enough samples for training. Experimental results show that our method achieves the

  7. PDBML: the representation of archival macromolecular structure data in XML.

    Science.gov (United States)

    Westbrook, John; Ito, Nobutoshi; Nakamura, Haruki; Henrick, Kim; Berman, Helen M

    2005-04-01

    The Protein Data Bank (PDB) has recently released versions of the PDB Exchange dictionary and the PDB archival data files in XML format collectively named PDBML. The automated generation of these XML files is driven by the data dictionary infrastructure in use at the PDB. The correspondences between the PDB dictionary and the XML schema metadata are described as well as the XML representations of PDB dictionaries and data files.

  8. Dictionary criticism and lexicographical function theory

    DEFF Research Database (Denmark)

    Tarp, Sven

    2017-01-01

    This contribution discusses dictionary criticism in the light of the function theory. It starts analyzing the objective of dictionary criticism and lists eight of the most important purposes with which criticism has been made by supporters of the function theory. It then discusses the two main...... types of dictionary criticism, namely criticism of other authors’ dictionaries and self-criticism of one’s own dictionaries. Based on this discussion, it proceeds to a definition of the concept of dictionary criticism which is above all considered a theory-based activity, the outcome of which may...... by the supporters of the function theory, and the way it could be presented in order to create debate. Finally, the contribution indicates the important role dictionary criticism has had in the development of the function theory and endorses an open and critical discussion culture within lexicography....

  9. Sentiment Polarity Analysis based multi-dictionary

    Science.gov (United States)

    Jiao, Jian; Zhou, Yanquan

    This paper presents a novel algorithm that identifies the sentiment polarity of Chinese online reviews. To determine whether a sentence is negative or positive, we extracted opinion words and identified their opinion targets with CRFs, and established the absolute emotional dictionary (AbED), the relative emotional dictionary (ReED), the field of emotional dictionary (FiED) and the field of targets and opinion words dictionary (TfED). With these emotional dictionaries, a negative dictionary and a modified dictionary, we obtained an effective algorithm that discriminates sentiment polarity by multi-string pattern matching. For evaluation, we used online car, hotel and computer reviews that were annotated as positive or negative. Experimental results show that the proposed method achieves higher precision and recall.
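
    A dictionary lookup of this kind can be illustrated with a minimal sketch. The word lists, the negation rule and the function name `sentence_polarity` are assumptions for illustration; they are not the authors' AbED/ReED/FiED/TfED dictionaries, and plain token lookup stands in for their multi-string pattern matching.

```python
# Hypothetical toy dictionaries (English tokens used for readability).
POSITIVE = {"good", "comfortable", "quiet"}
NEGATIVE = {"bad", "noisy", "expensive"}
NEGATORS = {"not", "never"}


def sentence_polarity(sentence):
    """Return 'positive', 'negative' or 'neutral' for a whitespace-tokenised sentence."""
    score = 0
    negate = False
    for token in sentence.lower().split():
        if token in NEGATORS:
            negate = True                      # flips the next opinion word found
            continue
        value = (token in POSITIVE) - (token in NEGATIVE)   # +1, -1 or 0
        if value:
            score += -value if negate else value
            negate = False
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


print(sentence_polarity("the engine is not good"))
```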

  10. Cognitive aspects of problem solving using dictionaries in L2 writing

    Directory of Open Access Journals (Sweden)

    Inna Kozlova

    2015-11-01

    Full Text Available This article reports on the use of dictionaries for L2 text production purposes by first-year ESP students. Research into dictionary use and cognitive studies of L2 writing are combined in this paper to outline the cognitive dimension of a dictionary consultation. Our objective is to focus on the situation in which an information need occurs, with a freshman ESP student as a specific user in mind. In an attempt to guarantee the relevance of consulting a dictionary, for the purposes of our study we separated the draft stage of a composition from that of its revision. In the latter stage, external resources like dictionaries were made available. Our data suggest that our students were able to detect problems in their writing and reported having improved their compositions after having had the chance to consult dictionaries. The corrections were nonetheless implemented only in one-third of all the problems detected. It was also found that the tentative solution in L2 allowed for monolingual dictionary consultation but students often opted for generating another access key in their native language.

  11. Developing a hybrid dictionary-based bio-entity recognition technique

    Science.gov (United States)

    2015-01-01

    Background: Bio-entity extraction is a pivotal component for information extraction from biomedical literature. The dictionary-based bio-entity extraction is the first generation of Named Entity Recognition (NER) techniques. Methods: This paper presents a hybrid dictionary-based bio-entity extraction technique. The approach expands the bio-entity dictionary by combining different data sources and improves the recall rate through the shortest path edit distance algorithm. In addition, the proposed technique adopts text mining techniques in the merging stage of similar entities such as Part of Speech (POS) expansion, stemming, and the exploitation of the contextual cues to further improve the performance. Results: The experimental results show that the proposed technique achieves the best or at least equivalent performance among compared techniques, GENIA, MESH, UMLS, and combinations of these three resources in F-measure. Conclusions: The results imply that the performance of dictionary-based extraction techniques is largely influenced by information resources used to build the dictionary. In addition, the edit distance algorithm shows steady performance with three different dictionaries in precision whereas the context-only technique achieves a high-end performance with three different dictionaries in recall. PMID:26043907

  12. Multi-level discriminative dictionary learning with application to large scale image classification.

    Science.gov (United States)

    Shen, Li; Sun, Gang; Huang, Qingming; Wang, Shuhui; Lin, Zhouchen; Wu, Enhua

    2015-10-01

    The sparse coding technique has shown flexibility and capability in image representation and analysis. It is a powerful tool in many visual applications. Some recent work has shown that incorporating the properties of the task (such as discrimination for a classification task) into dictionary learning is effective for improving the accuracy. However, the traditional supervised dictionary learning methods suffer from high computational complexity when dealing with a large number of categories, making them less satisfactory in large scale applications. In this paper, we propose a novel multi-level discriminative dictionary learning method and apply it to large scale image classification. Our method takes advantage of hierarchical category correlation to encode multi-level discriminative information. Each internal node of the category hierarchy is associated with a discriminative dictionary and a classification model. The dictionaries at different layers are learnt to capture the information of different scales. Moreover, each node at lower layers also inherits the dictionary of its parent, so that the categories at lower layers can be described with multi-scale information. The learning of dictionaries and associated classification models is jointly conducted by minimizing an overall tree loss. The experimental results on challenging data sets demonstrate that our approach achieves excellent accuracy and competitive computation cost compared with other sparse coding methods for large scale image classification.
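
    The inheritance of parent dictionaries along the category hierarchy can be pictured with a small data-structure sketch. The `Node` class, the atom labels and the concatenation order are assumptions for illustration; the learning step (minimising the tree loss) is not shown.

```python
class Node:
    """A category-hierarchy node holding its own (hypothetical) dictionary atoms."""

    def __init__(self, name, atoms, parent=None):
        self.name = name
        self.atoms = list(atoms)
        self.parent = parent

    def effective_dictionary(self):
        """Atoms inherited from all ancestors (coarse scales first) plus the node's own."""
        inherited = self.parent.effective_dictionary() if self.parent else []
        return inherited + self.atoms


root = Node("object", ["coarse_0", "coarse_1"])
animal = Node("animal", ["mid_0"], parent=root)
dog = Node("dog", ["fine_0", "fine_1"], parent=animal)
print(dog.effective_dictionary())
```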

  13. Developing a hybrid dictionary-based bio-entity recognition technique.

    Science.gov (United States)

    Song, Min; Yu, Hwanjo; Han, Wook-Shin

    2015-01-01

    Bio-entity extraction is a pivotal component for information extraction from biomedical literature. The dictionary-based bio-entity extraction is the first generation of Named Entity Recognition (NER) techniques. This paper presents a hybrid dictionary-based bio-entity extraction technique. The approach expands the bio-entity dictionary by combining different data sources and improves the recall rate through the shortest path edit distance algorithm. In addition, the proposed technique adopts text mining techniques in the merging stage of similar entities such as Part of Speech (POS) expansion, stemming, and the exploitation of the contextual cues to further improve the performance. The experimental results show that the proposed technique achieves the best or at least equivalent performance among compared techniques, GENIA, MESH, UMLS, and combinations of these three resources in F-measure. The results imply that the performance of dictionary-based extraction techniques is largely influenced by information resources used to build the dictionary. In addition, the edit distance algorithm shows steady performance with three different dictionaries in precision whereas the context-only technique achieves a high-end performance with three different dictionaries in recall.
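
    Approximate dictionary lookup by edit distance can be sketched as below. The abstract does not specify the "shortest path edit distance algorithm", so the classic Levenshtein distance is used here as a stand-in; the function names, the toy term list and the `max_distance` threshold are assumptions.

```python
def edit_distance(a, b):
    """Levenshtein distance between strings a and b (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,                 # deletion
                            curr[j - 1] + 1,             # insertion
                            prev[j - 1] + (ca != cb)))   # substitution or match
        prev = curr
    return prev[-1]


def lookup(mention, dictionary, max_distance=2):
    """Return the closest dictionary term within max_distance edits, else None."""
    if not dictionary:
        return None
    best = min(dictionary, key=lambda term: edit_distance(mention.lower(), term.lower()))
    if edit_distance(mention.lower(), best.lower()) <= max_distance:
        return best
    return None


print(lookup("interleukin2", ["interleukin-2", "insulin"]))
```

    Tolerating a small number of edits lets spelling variants of an entity name match a dictionary entry, which is one way a dictionary-based extractor can trade a little precision for recall.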

  14. The Danish Sign Language Dictionary

    DEFF Research Database (Denmark)

    Kristoffersen, Jette Hedegaard; Troelsgård, Thomas

    2010-01-01

    The entries of the Danish Sign Language Dictionary have four sections: • Entry header: In this section the sign headword is shown as a photo and a gloss. The first occurring location and handshape of the sign are shown as icons. • Video window: By default the base form of the sign headword...... forms of the sign (only for classifier entries). In addition to this, frequent co-occurrences with the sign are shown in this section. The signs in the Danish Sign Language Dictionary can be looked up through: • Handshape: Particular handshapes for the active and the passive hand can be specified...... to find signs that are not themselves lemmas in the dictionary, but appear in example sentences. • Topic: Topics can be chosen as search criteria from a list of 70 topics....

  15. The Latent Structure of Dictionaries.

    Science.gov (United States)

    Vincent-Lamarre, Philippe; Massé, Alexandre Blondin; Lopes, Marcos; Lord, Mélanie; Marcotte, Odile; Harnad, Stevan

    2016-07-01

    How many words, and which ones, are sufficient to define all other words? When dictionaries are analyzed as directed graphs with links from defining words to defined words, they reveal a latent structure. Recursively removing all words that are reachable by definition but that do not define any further words reduces the dictionary to a Kernel of about 10% of its size. This is still not the smallest number of words that can define all the rest. About 75% of the Kernel turns out to be its Core, a "Strongly Connected Subset" of words with a definitional path to and from any pair of its words and no word's definition depending on a word outside the set. But the Core cannot define all the rest of the dictionary. The 25% of the Kernel surrounding the Core consists of small strongly connected subsets of words: the Satellites. The size of the smallest set of words that can define all the rest (the graph's "minimum feedback vertex set" or MinSet) is about 1% of the dictionary, about 15% of the Kernel, and part-Core/part-Satellite. But every dictionary has a huge number of MinSets. The Core words are learned earlier, more frequent, and less concrete than the Satellites, which are in turn learned earlier, more frequent, but more concrete than the rest of the Dictionary. In principle, only one MinSet's words would need to be grounded through the sensorimotor capacity to recognize and categorize their referents. In a dual-code sensorimotor/symbolic model of the mental lexicon, the symbolic code could do all the rest through recombinatory definition. Copyright © 2016 Cognitive Science Society, Inc.
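
    The recursive pruning that yields the Kernel can be sketched on a toy graph. The five-word dictionary and the function name `kernel` are assumptions for illustration; the abstract does not describe the authors' preprocessing, and the Core, Satellites and MinSets are not computed here.

```python
def kernel(definitions):
    """Prune words that define no remaining word until none are left to prune.

    `definitions` maps each word to the set of words used in its definition.
    """
    remaining = {word: set(used) for word, used in definitions.items()}
    while True:
        # Words that still appear in at least one remaining definition.
        used_somewhere = set().union(*remaining.values())
        unused = [word for word in remaining if word not in used_somewhere]
        if not unused:
            return set(remaining)
        for word in unused:
            del remaining[word]        # may expose further unused words next round


toy = {
    "good": {"bad", "not"},
    "bad": {"good", "not"},
    "not": {"bad"},
    "nice": {"good"},
    "lovely": {"nice"},
}
print(sorted(kernel(toy)))
```

    In this toy graph "lovely" is pruned first, which leaves "nice" defining nothing, so it is pruned in the next round; the three mutually defining words remain.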

  16. Monolingual and Bilingual Learners' Dictionaries*

    Directory of Open Access Journals (Sweden)

    Rufus H. Gouws

    2011-10-01

    Full Text Available

    Abstract: When deciding on the best learners' dictionary for a specific user and a specific situation of usage one often has to make a choice between a monolingual and a bilingual learners' dictionary. This article discusses some aspects of the user-driven approach so prevalent in modern-day lexicographic thought, focuses broadly on dictionary typology and takes a closer look at monolingual and bilingual learners' dictionaries. Some problems users experience when learning a new language, e.g. language distortion and problems related to the phenomenon of false friends, especially in closely related languages, are mentioned. It is indicated that a typological hybrid dictionary could assist certain users. The importance of an unambiguous identification of the relevant lexicographic functions is emphasised and the notions of function condensation and function merging are introduced. It is shown that the typological choice should be determined by a function-based approach to dictionary usage.

    Keywords: BILINGUAL DICTIONARY, FALSE FRIENDS, FUNCTION CONDENSATION, FUNCTION MERGING, GENUINE PURPOSE, LEARNERS' DICTIONARY, LEXICOGRAPHIC FUNCTIONS, MONOLINGUAL DICTIONARY, TEXT PRODUCTION, TEXT RECEPTION, TYPOLOGICAL HYBRID, TYPOLOGY.

    Opsomming: Eentalige en tweetalige aanleerderwoordeboeke. Wanneer besluit moet word oor die beste aanleerderwoordeboek vir 'n spesifieke gebruiker en 'n spesifieke gebruiksituasie moet daar dikwels gekies word tussen 'n eentalige en 'n tweetalige aanleerderwoordeboek. Hierdie artikel bespreek bepaalde aspekte van die gebruikersgedrewe benadering wat kenmerkend is van die moderne leksikografiese denke, fokus breedweg op woordeboektipologie en gee in meer besonderhede aandag aan sekere aspekte van eentalige en tweetalige aanleerderwoordeboeke. Bepaalde probleme wat gebruikers ervaar by die aanleer van 'n vreemde taal, bv. taalversteuring en probleme verwant aan die verskynsel van valse vriende, veral in nou verwante tale, kry aandag

  17. Legal Translation Dictionaries for Learners

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2010-01-01

    in conditional clauses. When translating into languages not allowing such structures, for instance, English and French, learners need their legal translation dictionaries to help them with both the legal terms and the syntactic structures. The uses of textual conventions that characterise the legal genre vary....... Lexicographers should therefore design their dictionaries so that they contain intra-lingual or contrastive descriptions of the relevant genre conventions. As illustrated in Nielsen (2000) whether the best solution is to retain the genre conventions found in the SL text or to adopt the conventions used in TL...

  18. A Translation Dictionary of Phrasal Verbs: An Ongoing Project ...

    African Journals Online (AJOL)

    A Translation Dictionary of Phrasal Verbs: An Ongoing Project. ... Abstract. The paper centres on a plan for an English-Arabic phrasal verb dictionary for Arab trainee translators. Such a dictionary ...

  19. South Africa's new African language dictionaries and their use for ...

    African Journals Online (AJOL)

    Riette Ruthven

    printed dictionaries, electronic dictionaries, online and cell phone ... which "25 are living languages, 3 are second languages without mother-tongue speakers ..... planners to distribute dictionaries easily, as they could, like ring tones or games.

  20. South Africa's new African language dictionaries and their use for ...

    African Journals Online (AJOL)

    Riette Ruthven

    Having dictionaries, and especially technical, online or cell phone dic- ..... guage. Nevertheless, this can be a disadvantage as well, if the dictionary con- ... planners to distribute dictionaries easily, as they could, like ring tones or games.

  1. Hand Depth Image Denoising and Superresolution via Noise-Aware Dictionaries

    Directory of Open Access Journals (Sweden)

    Huayang Li

    2016-01-01

    Full Text Available This paper proposes a two-stage method for hand depth image denoising and superresolution, using bilateral filters and dictionaries learned via noise-aware orthogonal matching pursuit (NAOMP) based K-SVD. The bilateral filtering phase recovers singular points and removes artifacts on silhouettes by averaging depth data using neighborhood pixels on which both depth difference and RGB similarity restrictions are imposed. The dictionary learning phase uses NAOMP to train dictionaries that separate faithful depth from noisy data. Compared with traditional OMP, NAOMP adds a residual reduction step which effectively weakens the noise term within the residual during the decomposition of the residual in terms of atoms. Experimental results demonstrate that the bilateral filtering phase and the NAOMP-based dictionary learning phase cooperatively denoise both virtual and real depth images effectively.
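
    For readers unfamiliar with greedy pursuit over a dictionary of atoms, the sketch below shows plain matching pursuit, the simpler precursor of OMP. It is neither OMP nor the paper's NAOMP: there is no least-squares re-fit and no residual reduction step, since the abstract does not specify it. The function names and the toy atoms are assumptions, and atoms are assumed to have unit norm.

```python
def dot(u, v):
    return sum(x * y for x, y in zip(u, v))


def matching_pursuit(signal, atoms, n_iter=10, tol=1e-9):
    """Greedily decompose `signal` over unit-norm `atoms`; returns (coeffs, residual)."""
    residual = list(signal)
    coeffs = [0.0] * len(atoms)
    for _ in range(n_iter):
        correlations = [dot(residual, atom) for atom in atoms]
        # Pick the atom most correlated with what is still unexplained.
        k = max(range(len(atoms)), key=lambda i: abs(correlations[i]))
        if abs(correlations[k]) < tol:
            break
        coeffs[k] += correlations[k]
        # Subtract that atom's contribution from the residual.
        residual = [r - correlations[k] * a for r, a in zip(residual, atoms[k])]
    return coeffs, residual


atoms = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
coeffs, residual = matching_pursuit([3.0, 0.0, -2.0], atoms)
print(coeffs, residual)
```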

  2. The dictionary of lexicography and dictionary research | Gouws ...

    African Journals Online (AJOL)

    This article presents a brief explanation of the current state of the work on the Wörterbuch zur Lexikographie und Wörterbuchforschung. It is done in a way that gives a general impression of the structure of this multivolume specialised dictionary. Keywords: alphabetical equivalent index, functional cross-reference article ...

  3. A Weighted Block Dictionary Learning Algorithm for Classification

    OpenAIRE

    Shi, Zhongrong

    2016-01-01

    Discriminative dictionary learning, playing a critical role in sparse representation based classification, has led to state-of-the-art classification results. Among the existing discriminative dictionary learning methods, two different approaches, shared dictionary and class-specific dictionary, which associate each dictionary atom to all classes or a single class, have been studied. The shared dictionary is a compact method but with lack of discriminative information; the class-specific dict...

  4. Exploiting Attribute Correlations: A Novel Trace Lasso-Based Weakly Supervised Dictionary Learning Method.

    Science.gov (United States)

    Wu, Lin; Wang, Yang; Pan, Shirui

    2017-12-01

    It is now well established that sparse representation models work effectively for many visual recognition tasks, and have pushed forward the success of dictionary learning therein. Recent studies of dictionary learning focus on learning discriminative atoms instead of purely reconstructive ones. However, the existence of intraclass diversities (i.e., data objects within the same category that exhibit large visual dissimilarities) and interclass similarities (i.e., data objects from distinct classes that share many visual similarities) makes it challenging to learn effective recognition models. To this end, a large number of labeled data objects are required to learn models which can effectively characterize these subtle differences. However, labeled data objects are often of limited availability, making it difficult to learn a monolithic dictionary that can be discriminative enough. To address the above limitations, in this paper, we propose a weakly-supervised dictionary learning method to automatically learn a discriminative dictionary by fully exploiting visual attribute correlations rather than label priors. In particular, the intrinsic attribute correlations are deployed as a critical cue to guide the process of object categorization, and then a set of subdictionaries are jointly learned with respect to each category. The resulting dictionary is highly discriminative and leads to intraclass diversity aware sparse representations. Extensive experiments on image classification and object recognition are conducted to show the effectiveness of our approach.

  5. Lexicographic Approaches to Sense Disambiguation in Monolingual Dictionaries and Equivalent Differentiation in Bilingual Dictionaries

    Directory of Open Access Journals (Sweden)

    Marjeta Vrbinc

    2011-05-01

    Full Text Available The article discusses methods of sense disambiguation in monolingual dictionaries and equivalent differentiation in bilingual dictionaries. In current dictionaries, sense disambiguation and equivalent differentiation are presented in the form of specifiers or glosses, collocators or indications of context, (domain) labels, metalinguistic and encyclopaedic information. Each method is presented and illustrated by actual samples of dictionary articles taken from monolingual and bilingual dictionaries. The last part of the article is devoted to equivalent differentiation in bilingual decoding dictionaries. In bilingual dictionaries, equivalent differentiation is often needed to describe the lack of agreement between the source language (SL) and target language (TL). The article concludes by stating that equivalent differentiation should be written in the native language of the target audience and sense indicators in a monolingual learner’s dictionary should be words that the users are most familiar with.

  6. Fast dictionary generation and searching for magnetic resonance fingerprinting.

    Science.gov (United States)

    Jun Xie; Mengye Lyu; Jian Zhang; Hui, Edward S; Wu, Ed X; Ze Wang

    2017-07-01

    A super-fast dictionary generation and searching (DGS) algorithm was developed for MR parameter quantification using magnetic resonance fingerprinting (MRF). MRF is a new technique for simultaneously quantifying multiple MR parameters using one temporally resolved MR scan. But it has a multiplicative computational complexity, resulting in a heavy burden of dictionary generation, storage, and retrieval, which can easily become intractable for any state-of-the-art computer. Based on retrospective analysis of the dictionary matching objective function, a multi-scale ZOOM-like DGS algorithm, dubbed MRF-ZOOM, was proposed. MRF-ZOOM is quasi-parameter-separable, so the multiplicative computational complexity is broken into an additive one. Evaluations showed that MRF-ZOOM was hundreds or thousands of times faster than the original MRF parameter quantification method even without counting the dictionary generation time. Using real data, it yielded nearly the same results as produced by the original method. MRF-ZOOM provides a super-fast solution for MR parameter quantification.
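
    For context, the baseline that such an algorithm accelerates is an exhaustive dictionary search. The sketch below assumes matching by the largest normalised inner product, a common choice in MRF that the abstract itself does not spell out; the toy signals, the parameter tuples and the name `best_match` are hypothetical, and all signals are assumed to be non-zero. The multi-scale MRF-ZOOM search is not shown.

```python
import math


def normalised(v):
    """Scale a non-zero vector to unit Euclidean norm."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def best_match(signal, dictionary):
    """Return the parameter tuple whose simulated signal best matches `signal`.

    `dictionary` maps a parameter tuple, e.g. (T1, T2), to a simulated signal.
    """
    s = normalised(signal)

    def similarity(params):
        d = normalised(dictionary[params])
        return abs(sum(a * b for a, b in zip(s, d)))

    return max(dictionary, key=similarity)      # exhaustive search over all entries


toy_dictionary = {
    (800, 60): [1.0, 0.5, 0.25],
    (1200, 90): [1.0, 0.8, 0.6],
    (1500, 120): [1.0, 0.9, 0.8],
}
print(best_match([2.0, 1.6, 1.2], toy_dictionary))
```

    The cost of this search grows with the product of the number of values sampled per parameter, which is the multiplicative complexity the abstract refers to.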

  7. Self-expressive Dictionary Learning for Dynamic 3D Reconstruction.

    Science.gov (United States)

    Zheng, Enliang; Ji, Dinghuang; Dunn, Enrique; Frahm, Jan-Michael

    2017-08-22

    We target the problem of sparse 3D reconstruction of dynamic objects observed by multiple unsynchronized video cameras with unknown temporal overlap. To this end, we develop a framework to recover the unknown structure without sequencing information across video sequences. Our proposed compressed sensing framework poses the estimation of 3D structure as the problem of dictionary learning, where the dictionary is defined as an aggregation of the temporally varying 3D structures. Given the smooth motion of dynamic objects, we observe any element in the dictionary can be well approximated by a sparse linear combination of other elements in the same dictionary (i.e. self-expression). Our formulation optimizes a biconvex cost function that leverages a compressed sensing formulation and enforces both structural dependency coherence across video streams, as well as motion smoothness across estimates from common video sources. We further analyze the reconstructability of our approach under different capture scenarios, and its comparison and relation to existing methods. Experimental results on large amounts of synthetic data as well as real imagery demonstrate the effectiveness of our approach.

  8. Dynamic Textures Modeling via Joint Video Dictionary Learning.

    Science.gov (United States)

    Wei, Xian; Li, Yuanxiang; Shen, Hao; Chen, Fang; Kleinsteuber, Martin; Wang, Zhongfeng

    2017-04-06

    Video representation is an important and challenging task in the computer vision community. In this paper, we consider the problem of modeling and classifying video sequences of dynamic scenes which could be modeled in a dynamic textures (DT) framework. First, we assume that image frames of a moving scene can be modeled as a Markov random process. We propose a sparse coding framework, named joint video dictionary learning (JVDL), to model a video adaptively. By treating the sparse coefficients of image frames over a learned dictionary as the underlying "states", we learn an efficient and robust linear transition matrix between two adjacent frames of sparse events in time series. Hence, a dynamic scene sequence is represented by an appropriate transition matrix associated with a dictionary. In order to ensure the stability of JVDL, we impose several constraints on such transition matrix and dictionary. The developed framework is able to capture the dynamics of a moving scene by exploring both sparse properties and the temporal correlations of consecutive video frames. Moreover, such learned JVDL parameters can be used for various DT applications, such as DT synthesis and recognition. Experimental results demonstrate the strong competitiveness of the proposed JVDL approach in comparison with state-of-the-art video representation methods. In particular, it performs significantly better in dealing with DT synthesis and recognition on heavily corrupted data.

  9. Concepts for monofunctional accounting dictionaries

    DEFF Research Database (Denmark)

    Bergenholtz, Henning

    2012-01-01

    up to now. They are normally constructed as polyfunctional tools trying to give help by different kind of cognitive and communicative problems. Outgoing from one database I discuss the conception for this database and for 22 different accounting dictionaries with the languages Danish, English...

  10. Comprehensive dictionary of electrical engineering

    CERN Document Server

    Laplante, Philip A

    1998-01-01

    The Comprehensive Dictionary of Electrical Engineering is a complete lexicon covering all the fields of electrical engineering. Areas examined include applied electrical engineering, microwave engineering, control engineering, power engineering, digital systems engineering, device electronics, and much more! The book provides workable definitions for practicing engineers, serves as a reference and research tool for students, and offers practical information for scientists and engineers in other disciplines.

  11. Dictionary of the fuel trade

    Energy Technology Data Exchange (ETDEWEB)

    1982-01-01

    A dictionary of liquid and solid fuels and their applications in thermal engineering and heating, offering understandable terms and explanations with a broad range of terminology, special aspects and definitions. Annex: 1. International trade conditions, 2. tables of conversion relations, net calorific value, division of solid fuels etc.

  12. Pocket dictionary of laboratory equipment

    International Nuclear Information System (INIS)

    Junge, H.D.

    1987-01-01

    This pocket dictionary contains the 2500 most common terms for scientific and technical equipment in chemical laboratories. It is a useful tool for those who are used to communicating in German and English, but have to learn the special terminology in this field. (orig.) [de]

  13. English-French Cognate Dictionary.

    Science.gov (United States)

    Hammer, Petra; Monod, Madeleine

    This dictionary contains a word list of 10,993 English-French cognates (words with the same or similar spelling and meaning in both languages), including some loan words from other languages. A systematic review of the Larousse "Dictionnaire Moderne Francais--Anglais" (1960) provided this list of cognates. Deceptive cognates, or words…

  14. Marketing and Communications Media Dictionary.

    Science.gov (United States)

    Vigrolio, Tom; Zahler, Jack

    The authors have compiled a dictionary of terms used in marketing, advertising, public relations, and radio/television, photography/filmmaking, and graphics. Included in the volume are articles of a general and historical interest regarding the various media covered in the definitions. A list of trade publications is appended. (JY)

  15. Kokugo Dictionaries as Tools for Learners: Problems and Potential

    Directory of Open Access Journals (Sweden)

    Tom GALLY

    2012-10-01

    Full Text Available For second-language learners, monolingual dictionaries can be useful tools because they often provide more detailed explanations of meanings and more extensive vocabulary coverage than bilingual dictionaries do. While learners of English have access to many monolingual dictionaries designed specifically to meet their needs, learners of Japanese must make do with Kokugo dictionaries, that is, monolingual dictionaries intended for native Japanese speakers. This paper, after briefly describing Kokugo dictionaries in general, analyzes a typical entry from such a dictionary to illustrate the advantages and challenges of the use of Kokugo dictionaries by learners of Japanese.

  16. Size-Dictionary Interpolation for Robot's Adjustment

    Directory of Open Access Journals (Sweden)

    Morteza Daneshmand

    2015-05-01

    Full Text Available This paper describes the classification and size-dictionary interpolation of the three-dimensional data obtained by a laser scanner to be used in a realistic virtual fitting room, where automatic activation of the chosen mannequin robot, while several mannequin robots of different genders and sizes are simultaneously connected to the same computer, is also considered to make it mimic the body shapes and sizes instantly. The classification process consists of two layers, dealing, respectively, with gender and size. The interpolation procedure tries to find out which set of the positions of the biologically-inspired actuators for activation of the mannequin robots could lead to the closest possible resemblance of the shape of the body of the person having been scanned, through linearly mapping the distances between the subsequent size-templates and the corresponding position set of the bioengineered actuators, and subsequently, calculating the control measures that could maintain the same distance proportions, where minimizing the Euclidean distance between the size-dictionary template vectors and that of the desired body sizes determines the mathematical description. In this research work, the experimental results of the implementation of the proposed method on Fits.me's mannequin robots are visually illustrated, and explanation of the remaining steps towards completion of the whole realistic online fitting package is provided.
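
    One way to read the interpolation step is sketched below: find the two size-dictionary templates closest to the scanned body in Euclidean distance and blend their actuator positions in proportion to those distances. This reading, the function name `interpolate_actuators`, the two-template blend and the toy numbers are all assumptions; the abstract does not give the authors' exact formulation.

```python
import math


def interpolate_actuators(body, size_dictionary):
    """Blend actuator positions of the two templates nearest to `body`.

    `size_dictionary` is a list of (template_measurements, actuator_positions) pairs.
    """
    ranked = sorted(size_dictionary, key=lambda entry: math.dist(body, entry[0]))
    (t1, a1), (t2, a2) = ranked[0], ranked[1]
    d1, d2 = math.dist(body, t1), math.dist(body, t2)
    # Weight 0 keeps the nearest template; 0.5 is halfway between the two.
    w = d1 / (d1 + d2) if d1 + d2 else 0.0
    return [p + w * (q - p) for p, q in zip(a1, a2)]


# Hypothetical templates: (chest, waist) in cm -> two actuator positions.
toy_sizes = [
    ([90.0, 70.0], [0.0, 0.0]),
    ([100.0, 80.0], [10.0, 20.0]),
    ([110.0, 90.0], [20.0, 40.0]),
]
print(interpolate_actuators([95.0, 75.0], toy_sizes))
```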

  17. The peace and nuclear war dictionary

    International Nuclear Information System (INIS)

    Ali, S.R.

    1989-01-01

    The Peace and Nuclear War Dictionary is organized so that entries and supplementary data can be located easily and quickly. Items are arranged alphabetically throughout, rather than grouped into chapters. When doubtful about how to locate an entry, consult the general index. Page numbers for terms appear in the index in heavy black type; subsidiary concepts discussed within entries can be found in the index, identified by page numbers in regular type. For study purposes, numerous entries have also been subsumed under major topical headings in the index, affording the reader access to broad classes of related information. The reader can also fully explore a topic by employing the extensive cross-references included in all entries. Many entries can be found as subsidiary terms, but in each case the concept is related to the main entry. The author has adopted the format of this book to provide the reader a variety of useful applications. These include its use as a dictionary and ready reference guide to the global language of peace and nuclear war; a study guide for introductory courses in Nuclear War and Peace or International Relations, or for any specialized course in the area; a supplement to a textbook or a group of paperback monographs adopted for use in these courses; a source of review material for the political science major enrolled in advanced courses; and a social science aid for use in business, education, government, policy sciences, and journalism.

  18. Students' Understanding of Dictionary Entries: A Study with Respect to Four Learners' Dictionaries.

    Science.gov (United States)

    Jana, Abhra; Amritavalli, Vijaya; Amritavalli, R.

    2003-01-01

    Investigates the effects of definitional information in the form of dictionary entries, on second language learners' vocabulary learning in an instructed setting. Indian students (Native Hindi speakers) of English received monolingual English dictionary entries of five previously unknown words from four different learner's dictionaries. Results…

  19. Sentiment analysis of political communication: combining a dictionary approach with crowdcoding.

    Science.gov (United States)

    Haselmayer, Martin; Jenny, Marcelo

    2017-01-01

    Sentiment is important in studies of news values, public opinion, negative campaigning or political polarization and an explosive expansion of digital textual data and fast progress in automated text analysis provide vast opportunities for innovative social science research. Unfortunately, tools currently available for automated sentiment analysis are mostly restricted to English texts and require considerable contextual adaption to produce valid results. We present a procedure for collecting fine-grained sentiment scores through crowdcoding to build a negative sentiment dictionary in a language and for a domain of choice. The dictionary enables the analysis of large text corpora that resource-intensive hand-coding struggles to cope with. We calculate the tonality of sentences from dictionary words and we validate these estimates with results from manual coding. The results show that the crowdbased dictionary provides efficient and valid measurement of sentiment. Empirical examples illustrate its use by analyzing the tonality of party statements and media reports.
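
    The step "we calculate the tonality of sentences from dictionary words" can be sketched as follows. This is a minimal illustration, not the authors' procedure: the word scores, the 0 to 4 negativity scale, the English example words, and the averaging rule are all assumptions made for the example.

```python
import re

# Hypothetical negative-sentiment dictionary: word -> mean crowd-coded score
# on an assumed scale from 0 (neutral) to 4 (very negative).
NEG_DICT = {"scandal": 3.5, "failure": 3.0, "weak": 2.0, "criticize": 2.5}


def sentence_tonality(sentence, dictionary=NEG_DICT):
    """Return the mean dictionary score over all word tokens of a sentence.

    Tokens that are not in the dictionary count as 0 (neutral), so a long
    sentence with few negative words receives a low score. This averaging
    rule is an assumption, not taken from the abstract.
    """
    tokens = re.findall(r"[a-z]+", sentence.lower())
    if not tokens:
        return 0.0
    return sum(dictionary.get(tok, 0.0) for tok in tokens) / len(tokens)


if __name__ == "__main__":
    for text in ("The scandal was a failure", "The budget was approved"):
        print(text, "->", round(sentence_tonality(text), 2))
```

    Scores computed this way could then be compared with manually coded sentences, which is the kind of validation the abstract describes.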

  20. Dictionary Networking in an LSP Learning Context

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2007-01-01

    text production, but discusses an individual dictionary for a particular function. It is shown that in a general context of learning accounting and its relevant LSP with a view to writing or translating financial reporting texts, the modern theory of dictionary functions provides a good theoretical...... and usage of a subject-field, particularly when they have to read, write or translate domain-specific texts. The modern theory of dictionary functions presented in Bergenholtz and Tarp (2002) opens up exciting new possibilities for theoretical and practical lexicography and encourages lexicographers......-lexicographic environment, i.e. what happens outside the dictionary when users write or translate texts, and relate these findings to the lexicographic environment represented by the theoretical basis and the dictionary itself. Nielsen (2006) gives a preliminary discussion of monolingual accounting dictionaries for EFL...

  1. Dictionary Approaches to Image Compression and Reconstruction

    Science.gov (United States)

    Ziyad, Nigel A.; Gilmore, Erwin T.; Chouikha, Mohamed F.

    1998-01-01

    This paper proposes using a collection of parameterized waveforms, known as a dictionary, for the purpose of medical image compression. These waveforms, denoted as phi(sub gamma), are discrete time signals, where gamma represents the dictionary index. A dictionary with a collection of these waveforms is typically complete or overcomplete. Given such a dictionary, the goal is to obtain a representation image based on the dictionary. We examine the effectiveness of applying Basis Pursuit (BP), Best Orthogonal Basis (BOB), Matching Pursuits (MP), and the Method of Frames (MOF) methods for the compression of digitized radiological images with a wavelet-packet dictionary. The performance of these algorithms is studied for medical images with and without additive noise.
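
    Of the four methods named, Matching Pursuit is the simplest to sketch. The example below is a generic greedy Matching Pursuit over a tiny overcomplete dictionary in two dimensions. It is not the paper's implementation: the atoms here are toy vectors rather than wavelet packets, and the function name and parameters are illustrative assumptions.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def matching_pursuit(signal, atoms, n_iter=10, tol=1e-10):
    """Greedy Matching Pursuit.

    `atoms` is a list of unit-norm vectors (the dictionary). At each step the
    atom most correlated with the current residual is selected, its
    coefficient is increased by that correlation, and its contribution is
    subtracted from the residual. Returns (coefficients, residual).
    """
    residual = list(signal)
    coeffs = [0.0] * len(atoms)
    for _ in range(n_iter):
        corr = [dot(residual, atom) for atom in atoms]
        k = max(range(len(atoms)), key=lambda i: abs(corr[i]))
        if abs(corr[k]) < tol:
            break  # residual is (numerically) orthogonal to every atom
        coeffs[k] += corr[k]
        residual = [r - corr[k] * a for r, a in zip(residual, atoms[k])]
    return coeffs, residual


if __name__ == "__main__":
    s = 1.0 / math.sqrt(2.0)
    # Overcomplete dictionary: three unit-norm atoms in a 2-D space.
    atoms = [[1.0, 0.0], [0.0, 1.0], [s, s]]
    coeffs, residual = matching_pursuit([1.0, 1.0], atoms)
    print("coefficients:", coeffs)
    print("residual norm:", math.sqrt(dot(residual, residual)))
```

    Because the dictionary is overcomplete, the signal `[1, 1]` is captured by the single diagonal atom instead of two axis-aligned ones, which is the kind of compact representation compression relies on.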

  2. Cultural notions in Spanish Dictionaries for Foreigners

    Directory of Open Access Journals (Sweden)

    Luis Pablo-Núñez

    2017-11-01

    Although later than in English, Linguistics applied to the teaching of Spanish language has produced several didactic dictionaries for foreigners in the last two decades. These dictionaries include grammatical information in order to facilitate pronunciation, and morphological or syntactical comprehension; cultural notions, however, are more difficult to include because they go beyond the scope of the lexicon. Through the analysis of some terms related to folk music and gastronomy, we analyse the inclusion of Spanish and Latin American cultural notions in the three main dictionaries of Spanish for foreigners: the dictionary for the teaching of the Spanish language published by Vox-Alcalá University (Diccionario para la enseñanza de la lengua española), the Salamanca Dictionary (Diccionario Salamanca de la lengua española) and the Spanish dictionary for foreigners of SM publishing house (Diccionario de español para extranjeros).

  3. Booksearch: What Dictionary (General or Specialized) Do You Find Useful or Interesting for Students?

    Science.gov (United States)

    English Journal, 1988

    1988-01-01

    Presents classroom teachers' recommendations for a variety of dictionaries that may heighten students' interest in language: a reverse dictionary, a visual dictionary, WEIGHTY WORD BOOK, a collegiate desk dictionary, OXFORD ENGLISH DICTIONARY, DICTIONARY OF AMERICAN REGIONAL ENGLISH, and a dictionary of idioms. (ARH)

  4. Encyclopaedic dictionary on archaeology of Tatarstan:conceptual problems

    Directory of Open Access Journals (Sweden)

    Abdullin Khalim M.

    2015-09-01

    This article considers theoretical and methodological problems in compiling the glossary for an encyclopedic dictionary of the archaeology of the Republic of Tatarstan. Producing generalizing editions of this kind marks an important new stage in the development of the discipline and of its theoretical and methodological basis. Encyclopedias and dictionaries act as terminological thesauri and as a source of norms. They form a uniform, unifying and conventional approach to archaeological definitions and their content. They can also give accessible insight into basic archaeological concepts, characterize the archaeological monuments on the territory of the Republic, introduce the archaeologists who have worked in Tatarstan, present the latest archaeological discoveries, and popularize the achievements of the Kazan school of archaeology. The encyclopedic dictionary on the archaeology of Tatarstan is intended to contain complete information about archaeology in the Republic, with special attention to the conceptual system of archaeology, monuments and antiquities, objects and monuments of historical and archaeological heritage, biographical data on all archaeologists who have worked in Tatarstan, and information about all organizations related to archaeology in the region. There is every reason to claim that a considerable source-study and theoretical base for creating the dictionary already exists. Significant experience has been gathered in the complex study and generalization of a large volume of material on the ancient and medieval history of the region, and in the research and ordering of archaeological monuments. It is suggested that a glossary be created in the first phase of the investigation, after which the collective of authors can pass...

  5. Online Dictionary Learning Aided Target Recognition In Cognitive GPR

    OpenAIRE

    Giovanneschi, Fabio; Mishra, Kumar Vijay; Gonzalez-Huici, Maria Antonia; Eldar, Yonina C.; Ender, Joachim H. G.

    2017-01-01

    Sparse decomposition of ground penetration radar (GPR) signals facilitates the use of compressed sensing techniques for faster data acquisition and enhanced feature extraction for target classification. In this paper, we investigate the application of an online dictionary learning (ODL) technique in the context of GPR to bring down the learning time as well as improve identification of abandoned anti-personnel landmines. Our experimental results using real data from an L-band GPR for PMN/PMA2...

  6. Dictionary Usage in English Language Learning

    OpenAIRE

    Rohmatillah, Rohmatillah

    2016-01-01

    This article examines the importance of using a dictionary in English language learning. We cannot deny that in learning a foreign language we need to consult a dictionary. This is supported by Laufer (cited in Koca), who believes that when a word looks familiar but the sentence in which it is found or its wider context makes no sense at all, the learner should be encouraged to consult a dictionary. Sometimes learners are reluctant to look up the other meanings of a word in a dictionary; as a result the mea...

  7. System specifications for the NDS Dictionary System

    International Nuclear Information System (INIS)

    Attree, P.M.; Smith, P.M.

    1979-09-01

    The NDS Dictionary System is a computerized system for maintaining and distributing the EXFOR dictionaries and for preparing internal versions of these dictionaries for use in the NDS EXFOR System and other NDS systems. This document is an internal manual for the system specifications of the NDS Dictionary System. It includes flow charts, system and program summaries, input and output specifications and file and record descriptions. This manual is updated from time to time when system modifications are made; this is the version of January 1979

  8. Data Element Registry Services

    Data.gov (United States)

    U.S. Environmental Protection Agency — Data Element Registry Services (DERS) is a resource for information about value lists (aka code sets / pick lists), data dictionaries, data elements, and EPA data...

  9. Travel time tomography with local image regularization by sparsity constrained dictionary learning

    Science.gov (United States)

    Bianco, M.; Gerstoft, P.

    2017-12-01

    We propose a regularization approach for 2D seismic travel time tomography which models small rectangular groups of slowness pixels, within an overall or 'global' slowness image, as sparse linear combinations of atoms from a dictionary. The groups of slowness pixels are referred to as patches and a dictionary corresponds to a collection of functions or 'atoms' describing the slowness in each patch. These functions could for example be wavelets. The patch regularization is incorporated into the global slowness image. The global image models the broad features, while the local patch images incorporate prior information from the dictionary. Further, high resolution slowness within patches is permitted if the travel times from the global estimates support it. The proposed approach is formulated as an algorithm, which is repeated until convergence is achieved: 1) From travel times, find the global slowness image with a minimum energy constraint on the pixel variance relative to a reference. 2) Find the patch level solutions to fit the global estimate as a sparse linear combination of dictionary atoms. 3) Update the reference as the weighted average of the patch level solutions. This approach relies on the redundancy of the patches in the seismic image. Redundancy means that the patches are repetitions of a finite number of patterns, which are described by the dictionary atoms. Redundancy in the earth's structure was demonstrated in previous works in seismics where dictionaries of wavelet functions regularized inversion. We further exploit redundancy of the patches by using dictionary learning algorithms, a form of unsupervised machine learning, to estimate optimal dictionaries from the data in parallel with the inversion. We demonstrate our approach on densely, but irregularly sampled synthetic seismic images.

  10. Accelerating the reconstruction of magnetic resonance imaging by three-dimensional dual-dictionary learning using CUDA.

    Science.gov (United States)

    Jiansen Li; Jianqi Sun; Ying Song; Yanran Xu; Jun Zhao

    2014-01-01

    An effective way to improve the data acquisition speed of magnetic resonance imaging (MRI) is to use under-sampled k-space data, and a dictionary learning method can be used to maintain the reconstruction quality. A three-dimensional dictionary trains its atoms in the form of blocks, which can exploit the spatial correlation among slices. The dual-dictionary learning method includes a low-resolution dictionary and a high-resolution dictionary, for sparse coding and image updating respectively. However, the amount of data is huge for three-dimensional reconstruction, especially when the number of slices is large. Thus, the procedure is time-consuming. In this paper, we first utilize the NVIDIA Corporation's compute unified device architecture (CUDA) programming model to design the parallel algorithms on a graphics processing unit (GPU) to accelerate the reconstruction procedure. The main optimizations operate in the dictionary learning algorithm and the image updating part, such as the orthogonal matching pursuit (OMP) algorithm and the k-singular value decomposition (K-SVD) algorithm. Then we develop another version of the CUDA code with algorithmic optimization. Experimental results show that a speedup of more than 324 times is achieved compared with the CPU-only codes when the number of MRI slices is 24.

  11. Dictionary construction and identification of possible adverse drug events in Danish clinical narrative text.

    Science.gov (United States)

    Eriksson, Robert; Jensen, Peter Bjødstrup; Frankild, Sune; Jensen, Lars Juhl; Brunak, Søren

    2013-01-01

    Drugs have tremendous potential to cure and relieve disease, but the risk of unintended effects is always present. Healthcare providers increasingly record data in electronic patient records (EPRs), in which we aim to identify possible adverse events (AEs) and, specifically, possible adverse drug events (ADEs). Based on the undesirable effects section from the summary of product characteristics (SPC) of 7446 drugs, we have built a Danish ADE dictionary. Starting from this dictionary we have developed a pipeline for identifying possible ADEs in unstructured clinical narrative text. We use a named entity recognition (NER) tagger to identify dictionary matches in the text and post-coordination rules to construct ADE compound terms. Finally, we apply post-processing rules and filters to handle, for example, negations and sentences about subjects other than the patient. Moreover, this method allows synonyms to be identified and anatomical location descriptions can be merged to allow appropriate grouping of effects in the same location. The method identified 1 970 731 (35 477 unique) possible ADEs in a large corpus of 6011 psychiatric hospital patient records. Validation was performed through manual inspection of possible ADEs, resulting in precision of 89% and recall of 75%. The presented dictionary-building method could be used to construct other ADE dictionaries. The complication of compound words in Germanic languages was addressed. Additionally, the synonym and anatomical location collapse improve the method. The developed dictionary and method can be used to identify possible ADEs in Danish clinical narratives.
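
    The dictionary-matching and negation-filtering stages of such a pipeline can be illustrated with a deliberately simplified sketch. Everything in it is an assumption made for the example: the terms are English stand-ins for the Danish dictionary, plain token lookup replaces the NER tagger, and the negation rule (a cue word earlier in the same sentence) is far cruder than the post-processing rules the abstract mentions.

```python
import re

# Hypothetical mini-dictionary of adverse-event terms (English stand-ins).
ADE_TERMS = {"nausea", "headache", "rash", "dizziness"}
# Hypothetical negation cues; a cue negates every later term in its sentence.
NEGATION_CUES = {"no", "not", "denies", "without"}


def find_possible_ades(text):
    """Return dictionary terms that occur without a preceding negation cue."""
    found = set()
    for sentence in re.split(r"[.!?]+", text.lower()):
        negated = False
        for token in re.findall(r"[a-z]+", sentence):
            if token in NEGATION_CUES:
                negated = True
            elif token in ADE_TERMS and not negated:
                found.add(token)
    return found


if __name__ == "__main__":
    note = "Patient reports nausea and headache. No rash observed."
    print(sorted(find_possible_ades(note)))
```

    In this toy note, "rash" is dropped because it follows a negation cue in the same sentence, while the two terms in the first sentence are kept.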

  12. Dictionary Learning Based on Nonnegative Matrix Factorization Using Parallel Coordinate Descent

    Directory of Open Access Journals (Sweden)

    Zunyi Tang

    2013-01-01

    Sparse representation of signals via an overcomplete dictionary has recently received much attention as it has produced promising results in various applications. Since the nonnegativities of the signals and the dictionary are required in some applications, for example, multispectral data analysis, the conventional dictionary learning methods imposed simply with nonnegativity may become inapplicable. In this paper, we propose a novel method for learning a nonnegative, overcomplete dictionary for such a case. This is accomplished by posing the sparse representation of nonnegative signals as a problem of nonnegative matrix factorization (NMF) with a sparsity constraint. By employing the coordinate descent strategy for optimization and extending it to the multivariable case for processing in parallel, we develop a so-called parallel coordinate descent dictionary learning (PCDDL) algorithm, which is structured by iteratively solving the two optimal problems, the learning process of the dictionary and the estimating process of the coefficients for constructing the signals. Numerical experiments demonstrate that the proposed algorithm performs better than the conventional nonnegative K-SVD (NN-KSVD) algorithm and several other algorithms for comparison. What is more, its computational consumption is remarkably lower than that of the compared algorithms.
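
    The coefficient-estimation half of such a scheme can be sketched with ordinary (sequential) coordinate descent. This is not the PCDDL algorithm: it updates one coefficient at a time rather than in parallel, keeps the dictionary fixed, and assumes the objective 0.5*||y - Dx||^2 + lam*sum(x) with x >= 0, which the abstract does not spell out. The function name and parameters are illustrative.

```python
def nn_sparse_code(atoms, y, lam=0.1, n_sweeps=50):
    """Nonnegative sparse coding by cyclic coordinate descent.

    `atoms` is a list of dictionary columns, each a list with len(y) entries.
    Minimizes 0.5*||y - D x||^2 + lam*sum(x) subject to x >= 0 (an assumed
    objective) by solving exactly for one coefficient at a time.
    """
    x = [0.0] * len(atoms)
    residual = list(y)  # equals y - D x, and x starts at zero
    for _ in range(n_sweeps):
        for j, atom in enumerate(atoms):
            norm2 = sum(a * a for a in atom)
            if norm2 == 0.0:
                continue
            # Correlation with the residual that excludes atom j's own term.
            rho = sum(a * r for a, r in zip(atom, residual)) + norm2 * x[j]
            new_xj = max(0.0, (rho - lam) / norm2)
            delta = new_xj - x[j]
            if delta != 0.0:
                residual = [r - delta * a for r, a in zip(residual, atom)]
                x[j] = new_xj
    return x


if __name__ == "__main__":
    atoms = [[1.0, 0.0], [0.0, 1.0]]
    print(nn_sparse_code(atoms, [3.0, -1.0], lam=1.0))
```

    With orthonormal atoms, each coefficient reduces to a shifted, clipped correlation, which makes the effect of the nonnegativity and sparsity terms easy to check by hand.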

  13. The pocket dictionary of energy

    International Nuclear Information System (INIS)

    Ahlhaus, O.; Boldt, G.; Gonsior, B.; Klein, K.; Ziburske, H.

    1981-01-01

    The pocket dictionary of energy does not only address the interested amateur but also students, pupils, teachers, scientists, technicians, and politicians in like manner. The dictionary contains ca. 900 key-words from the fields of energy, consumption, energy types, energy deposits, energy programmes, energy industry, thermal insulation, governmental aids for energy conservation measures, heating cost calculation, energy utilization and energy conservation. The problems of the costs and efficiency of energy conversion, energy pricing, the promotion of research projects, the rentability of heating devices or insulation, the sanitation of old buildings, governmental aids by subsidies or tax abatement according to the modernization and energy conservation law etc., as well as the problem of pollution and the endangering of the environment by exhaust air, waste heat, ash and litter are emphasized particularly. Considering the space available the criterion for the selection of the key-words was not a scientific completeness but the provision of a fundamental understanding of the matter. (orig.)

  14. Information on Quantifiers and Argument Structure in English Learner's Dictionaries.

    Science.gov (United States)

    Lee, Thomas Hun-tak

    1993-01-01

    Lexicographers have been arguing for the inclusion of abstract and complex grammatical information in dictionaries. This paper examines the extent to which information about quantifiers and the argument structure of verbs is encoded in English learner's dictionaries. The Oxford Advanced Learner's Dictionary (1989), the Longman Dictionary of…

  15. Dictionary of applied energy conservation

    Energy Technology Data Exchange (ETDEWEB)

    Kut, D

    1982-01-01

    The escalating cost of energy is drawing an ever increasing number of people into the planning and execution of energy conservation measures and programs and confronts them with the specialist terminology of the conservationist. The object of this illustrated dictionary is to list the generality of terms employed in energy conservation practice and to explain, with the aid of appropriate illustrations, the basic definitions and underlying techniques.

  16. Dictionary materials engineering, materials testing

    International Nuclear Information System (INIS)

    1994-01-01

    This dictionary contains about 9,500 entries in each part, covering the following fields: 1) materials use and selection; 2) mechanical engineering materials (metallic materials, non-metallic inorganic materials, plastics, composites, materials damage and protection); 3) electrical and electronics materials (conductor materials, semiconductors, magnetic materials, dielectric materials, non-conducting materials); 4) materials testing (mechanical methods, analytical methods, structure investigation, complex methods, measurement of physical properties, non-destructive testing). (orig.)

  17. One Database, Four Monofunctional Dictionaries

    DEFF Research Database (Denmark)

    Bergenholtz, Inger; Bergenholtz, Henning

    2013-01-01

    on musical terms mainly from the world of classical music, but also from commercial music and the so-called world music. The music dictionaries intend to be tools for music students in universities and music schools, for both amateurs and professional musicians and for every interested person who wants aid...... when reading texts on music or who wishes to get further information on musical terms and topics....

  18. Construction of FuzzyFind Dictionary using Golay Coding Transformation for Searching Applications

    Science.gov (United States)

    Kowsari, Kamram

    2015-03-01

    Searching through a large volume of data is critical for companies, scientists, and search engine applications because of its time and memory complexity. In this paper, a new technique for generating a FuzzyFind Dictionary for text mining is introduced. We map the English alphabet into the 23 bits of a FuzzyFind Dictionary, or into more than 23 bits by using more than one FuzzyFind Dictionary, with the bits reflecting the presence or absence of particular letters. This representation preserves the closeness of word distortions in terms of the closeness of the created binary vectors, within a Hamming distance of 2. The paper describes the Golay Coding Transformation Hash Table and how it can be used with a FuzzyFind Dictionary as a new technology for searching through big data. The method generates the dictionary in linear time and accesses the data in constant time; updating with new data sets takes time linear in the number of new data points. The technique is based on searching only for English letters, with each segment having 23 bits; it can also work with more than 23 bits by using more segments as a reference table.
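
    The letter-presence encoding and the Hamming-distance notion of closeness can be illustrated with a small sketch. It makes several assumptions that go beyond the abstract: it uses one bit per letter of a to z (26 bits) because the reduction to 23 bits is not described, it omits the Golay coding step entirely, and it scans the vocabulary linearly instead of using a hash table with constant-time access.

```python
def letter_mask(word):
    """Encode a word as an integer whose bit i is set when letter i of a-z occurs."""
    mask = 0
    for ch in word.lower():
        if "a" <= ch <= "z":
            mask |= 1 << (ord(ch) - ord("a"))
    return mask


def hamming(a, b):
    """Number of bit positions in which two masks differ."""
    return bin(a ^ b).count("1")


def fuzzy_lookup(query, vocabulary, max_dist=2):
    """Return vocabulary words whose letter masks lie within max_dist of the query's."""
    q = letter_mask(query)
    return [w for w in vocabulary if hamming(q, letter_mask(w)) <= max_dist]


if __name__ == "__main__":
    vocabulary = ["dictionary", "golay", "data"]
    print(fuzzy_lookup("dictionery", vocabulary))
```

    A single substituted letter changes at most two bits of the mask (one letter disappears, another appears), which is why a distance threshold of 2 tolerates this kind of distortion.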

  19. Log files as a tool for improving Internet dictionaries

    DEFF Research Database (Denmark)

    Bergenholtz, Henning; Johnsen, Mia

    2005-01-01

    are not related to concrete examples of dictionary use. The surveys, which have always been concerned with printed dictionaries, have therefore not contributed to substantial improvements of dictionary conception. In the case of internet dictionaries, on the other hand, technical possibilities enable...... in the dictionary. Furthermore, log files allow lexicographers to see the types of information which have not, or not yet, been searched for. All in all, log files may thus be used as a tool for improving internet dictionaries - and perhaps also printed dictionaries - quite considerably....

  20. Stochastic learning of multi-instance dictionary for earth mover’s distance-based histogram comparison

    KAUST Repository

    Fan, Jihong; Liang, Ru-Ze

    2016-01-01

    Dictionary plays an important role in multi-instance data representation. It maps bags of instances to histograms. Earth mover’s distance (EMD) is the most effective histogram distance metric for the application of multi-instance retrieval. However

  1. Getting the Most out of the Dictionary

    Science.gov (United States)

    Marckwardt, Albert H.

    2012-01-01

    The usefulness of the dictionary as a reliable source of information for word meanings, spelling, and pronunciation is widely recognized. But even in these obvious matters, the information that the dictionary has to offer is not always accurately interpreted. With respect to pronunciation there seem to be two general pitfalls: (1) the…

  2. Review of "A Dictionary of Global Huayu"

    Science.gov (United States)

    Li, Rui

    2016-01-01

    As the first Huayu dictionary published by the Commercial Press, "A Dictionary of Global Huayu" (Chinese Language) did pioneering work in many respects. It expanded the influence of Chinese and provided Chinese speakers abroad with a valuable reference book for study and communication. Nevertheless, it still has some shortcomings. First of all,…

  3. Dictionary criticism in Scandinavian lexicographic journals

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2017-01-01

    Dictionary criticism has a long tradition in the Nordic countries in for instance academic journals and newspapers. In order to limit the scope of the paper, the examination of dictionary reviews in the Nordic countries is restricted to the Nordic lexicographic journal LexicoNordica. The examination shows that there is great variation in the way reviewers write their critiques, and reviewers address many relevant topics and do not focus on one or two topics. Another characteristic of dictionary criticism in LexicoNordica is that reviewers often compare the dictionary under review with other dictionaries and previous editions and where relevant compare print and online editions. Furthermore, many reviewers address outside matter and meta-texts, thereby treating dictionaries as complex lexicographic products that consist of several distinct but interrelated components. A final characteristic...

  4. Bilingualised Dictionaries: How Learners Really Use Them.

    Science.gov (United States)

    Laufer, Batia; Kimmel, Michal

    1997-01-01

    Seventy native Hebrew-speaking English-as-a-Second-Language students participated in a study that investigated what part of an entry second-language learners read when they look up an unfamiliar word in a bilingualised dictionary: the monolingual, the bilingual, or both. Results suggest the bilingualised dictionary is very effective because it is…

  5. Linguistic and Cultural Strategies in ELT Dictionaries

    Science.gov (United States)

    Corrius, Montse; Pujol, Didac

    2010-01-01

    There are three main types of ELT dictionaries: monolingual, bilingual, and bilingualized. Each type of dictionary, while having its own advantages, also hinders the learning of English as a foreign language and culture in so far as it is written from a homogenizing (linguistic- and culture-centric) perspective. This paper presents a new type of…

  6. Dictionaries of African Sign Languages: An Overview

    Science.gov (United States)

    Schmaling, Constanze H.

    2012-01-01

    This article gives an overview of dictionaries of African sign languages that have been published to date, most of which have not been widely distributed. After an introduction to the field of sign language lexicography and a discussion of some of the obstacles that authors of sign language dictionaries face in general, I will show problems…

  7. The large dictionary on chemical engineering

    International Nuclear Information System (INIS)

    1995-03-01

    This book is a large dictionary of chemical engineering. It opens with a preface, an introduction to the publishing committee with the committee's signatures, and explanatory notes. The main body gives descriptions of chemical engineering terms in seven parts, followed by appendixes and an index. It was written by the chemical engineering dictionary publishing committee.

  8. The Oxford Picture Dictionary. Beginning Workbook.

    Science.gov (United States)

    Fuchs, Marjorie

    The beginning workbook of the Oxford Picture Dictionary is in full color and offers vocabulary reinforcement activities that correspond page for page with the dictionary. Clear and simple instructions with examples make it suitable for independent use in the classroom or at home. The workbook has up-to-date art and graphics, explaining over 3700…

  9. Monolingual and bilingual learners' dictionaries | Gouws | Lexikos

    African Journals Online (AJOL)

    The importance of an unambiguous identification of the relevant lexicographic functions is emphasised and the notions of function condensation and function merging are introduced. It is shown that the typological choice should be determined by a function-based approach to dictionary usage. Keywords: bilingual dictionary ...

  10. Translating Culture in Bilingual Dictionaries | Kotze | Lexikos

    African Journals Online (AJOL)

    In addition to the act of translation, transculturalisation and strategies of textuality come into play as interacting factors in the compilation of interlingual dictionaries. In this article, some conclusions resulting from a comparative survey of some South African dictionaries are drawn, specifically with regard to bi- and trilingual ...

  11. The Dictionary On Environment and Pollution

    International Nuclear Information System (INIS)

    1989-01-01

    This book is a dictionary of environment and pollution terms, arranged in alphabetical order. It includes words such as street refuse, powdered soap, sodium hydroxide, waste caustics, causticization, vibration acceleration level, gasoline, and processed fuel. The dictionary gives a description of each word related to the environment and pollution.

  12. Dictionary of nuclear power. upd. ed.

    International Nuclear Information System (INIS)

    Koelzer, W.

    2010-10-01

    The dictionary on nuclear energy is the revised version (status 20110) of the dictionary edited in 1980, with revision in 1997 and 2001 with alphabetically ordered descriptions and definitions of nuclear energy and radioactivity related terms, items, units, institutions, notions etc.

  13. Bilingual Dictionaries, the Lexicographer and the Translator

    African Journals Online (AJOL)

    rbr

    Abstract: This article focuses on the problems, and advantages and disadvantages of the bilingual dictionary from both the lexicographer's and the translator's point of view, with specific reference to bilingual Zulu dictionaries. It is shown that there are many and varying problems the lexicographer has to deal with and ...

  14. On development of “smart” dictionaries

    Directory of Open Access Journals (Sweden)

    Mark Kit

    2015-11-01

    The paper discusses the need for development of intelligent dictionaries that allow for two-way interaction with their users. A theoretical ground for such development is suggested. A practical implementation as the LexSite lexical resource is shown, and concepts for further improvement of the efficiency are proposed.

  15. From Polyfunctional to Monofunctional Accounting Dictionaries

    African Journals Online (AJOL)

    This article describes the theoretical foundation of the accounting dictionaries as well as its practical results. Furthermore, the implementation of the project shows how the constant interaction of lexicographical theory with practical dictionary work over a period of 10 years has led to lexicographical development and ...

  16. The Etymological Dictionary as a Teaching Device

    Science.gov (United States)

    Becker, Donald

    1977-01-01

    The etymological dictionary is a useful tool for language students; this article presents three series of homework problems whose solutions are to be found in a German etymological dictionary. Regular assignment of such problems will teach students both vocabulary skills and knowledge and curiosity about the German language. (CHK)

  17. Implementation of dictionary pair learning algorithm for image quality improvement

    Science.gov (United States)

    Vimala, C.; Aruna Priya, P.

    2018-04-01

    This paper proposes an image denoising method based on a dictionary pair learning algorithm. Transmitting visual information in the form of digital images is becoming a major method of communication in the modern age, but the image obtained after transmission is often corrupted with noise. The received image needs processing before it can be used in applications. Image denoising involves manipulating the image data to produce a visually high-quality image.

  18. Sparsity and Nullity: Paradigm for Analysis Dictionary Learning

    Science.gov (United States)

    2016-08-09

    applied mathematics, on account of its theoretical complexity, and its high relevance to big data problems. Dictionary learning has been one of the key... and hence rank(Ni ⊕ ni) = rank(Ni) + 1. There are two inherent difficulties in this formulation. First, ‖ · ‖0 is of combinatorial nature, hence the... from incomplete and inaccurate measurements, Communications on Pure and Applied Mathematics, 59 (2006), pp. 1207–1223. [8] Thomas F Coleman and Alex

  19. Sensitivity computation of the l1 minimization problem and its application to dictionary design of ill-posed problems

    International Nuclear Information System (INIS)

    Horesh, L; Haber, E

    2009-01-01

    The ℓ1 minimization problem has been studied extensively in the past few years. Recently, there has been a growing interest in its application for inverse problems. Most studies have concentrated on devising ways for sparse representation of a solution using a given prototype dictionary. Very few studies have addressed the more challenging problem of optimal dictionary construction, and even these were primarily devoted to the simplistic sparse coding application. In this paper, sensitivity analysis of the inverse solution with respect to the dictionary is presented. This analysis reveals some of the salient features and intrinsic difficulties which are associated with the dictionary design problem. Equipped with these insights, we propose an optimization strategy that alleviates these hurdles while utilizing the derived sensitivity relations for the design of a locally optimal dictionary. Our optimality criterion is based on local minimization of the Bayesian risk, given a set of training models. We present a mathematical formulation and an algorithmic framework to achieve this goal. The proposed framework offers the design of dictionaries for inverse problems that incorporate non-trivial, non-injective observation operators, where the data and the recovered parameters may reside in different spaces. We test our algorithm and show that it yields improved dictionaries for a diverse set of inverse problems in geophysics and medical imaging.

  20. Sensitivity computation of the ℓ1 minimization problem and its application to dictionary design of ill-posed problems

    Science.gov (United States)

    Horesh, L.; Haber, E.

    2009-09-01

    The ℓ1 minimization problem has been studied extensively in the past few years. Recently, there has been a growing interest in its application for inverse problems. Most studies have concentrated on devising ways for sparse representation of a solution using a given prototype dictionary. Very few studies have addressed the more challenging problem of optimal dictionary construction, and even these were primarily devoted to the simplistic sparse coding application. In this paper, sensitivity analysis of the inverse solution with respect to the dictionary is presented. This analysis reveals some of the salient features and intrinsic difficulties which are associated with the dictionary design problem. Equipped with these insights, we propose an optimization strategy that alleviates these hurdles while utilizing the derived sensitivity relations for the design of a locally optimal dictionary. Our optimality criterion is based on local minimization of the Bayesian risk, given a set of training models. We present a mathematical formulation and an algorithmic framework to achieve this goal. The proposed framework offers the design of dictionaries for inverse problems that incorporate non-trivial, non-injective observation operators, where the data and the recovered parameters may reside in different spaces. We test our algorithm and show that it yields improved dictionaries for a diverse set of inverse problems in geophysics and medical imaging.

  1. Stochastic learning of multi-instance dictionary for earth mover’s distance-based histogram comparison

    KAUST Repository

    Fan, Jihong

    2016-09-17

    A dictionary plays an important role in multi-instance data representation: it maps bags of instances to histograms. Earth mover’s distance (EMD) is the most effective histogram distance metric for the application of multi-instance retrieval. However, up to now, there have been no multi-instance dictionary learning methods designed for EMD-based histogram comparison. To fill this gap, we develop the first EMD-optimal dictionary learning method using a stochastic optimization method. In the stochastic learning framework, we have one triplet of bags, including one basic bag, one positive bag, and one negative bag. These bags are mapped to histograms using a multi-instance dictionary. We argue that the EMD between the basic histogram and the positive histogram should be smaller than that between the basic histogram and the negative histogram. Based on this condition, we design a hinge loss. By minimizing this hinge loss and some regularization terms of the dictionary, we update the dictionary instances. Experiments on multi-instance retrieval applications show its effectiveness compared with other dictionary learning methods on the problems of medical image retrieval and natural language relation classification. © 2016 The Natural Computing Applications Forum
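
    The triplet condition described in this abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes equal-mass histograms over ordered, unit-spaced bins (so that EMD reduces to a sum of cumulative differences), and the names emd_1d, triplet_hinge_loss and the margin parameter are hypothetical.

```python
from itertools import accumulate


def emd_1d(p, q):
    """EMD between two equal-mass histograms over ordered, unit-spaced bins.

    Assumption: in this 1-D special case the EMD equals the sum of absolute
    differences between the two cumulative histograms.
    """
    if len(p) != len(q):
        raise ValueError("histograms must have the same number of bins")
    return sum(abs(a - b) for a, b in zip(accumulate(p), accumulate(q)))


def triplet_hinge_loss(basic, positive, negative, margin=1.0):
    """Hinge loss that is zero once EMD(basic, positive) + margin <= EMD(basic, negative).

    The margin value is an illustrative assumption, not taken from the abstract.
    """
    return max(0.0, margin + emd_1d(basic, positive) - emd_1d(basic, negative))


basic = [1.0, 0.0, 0.0]
positive = [0.0, 1.0, 0.0]   # one bin away from the basic histogram
negative = [0.0, 0.0, 1.0]   # two bins away from the basic histogram
loss = triplet_hinge_loss(basic, positive, negative, margin=1.0)
```

    In this toy triplet the positive histogram is already closer than the negative one by exactly the margin, so the loss is zero; swapping the positive and negative bags makes it positive.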

  2. Terminology and Labelling Words by Subject in Monolingual Dictionaries – What Do Domain Labels Say to Dictionary Users?

    Directory of Open Access Journals (Sweden)

    Nová Jana

    2017-12-01

    The paper focuses on labelling words by subject in a non-specialized dictionary. We compare the existing monolingual dictionaries of Czech and their ways of labelling terms of medicine and related fields; besides apparent differences between dictionaries, there are also inconsistencies within one dictionary. We consider pros and cons of domain labels as such and their usability in the light of needs and limits of dictionary users, with the aim to motivate further discussion on related issues.

  3. Tensor-based Dictionary Learning for Dynamic Tomographic Reconstruction

    Science.gov (United States)

    Tan, Shengqi; Zhang, Yanbo; Wang, Ge; Mou, Xuanqin; Cao, Guohua; Wu, Zhifang; Yu, Hengyong

    2015-01-01

    In dynamic computed tomography (CT) reconstruction, the data acquisition speed limits the spatio-temporal resolution. Recently, compressed sensing theory has been instrumental in improving CT reconstruction from far few-view projections. In this paper, we present an adaptive method to train a tensor-based spatio-temporal dictionary for sparse representation of an image sequence during the reconstruction process. The correlations among atoms and across phases are considered to capture the characteristics of an object. The reconstruction problem is solved by the alternating direction method of multipliers. To recover fine or sharp structures such as edges, the nonlocal total variation is incorporated into the algorithmic framework. Preclinical examples including a sheep lung perfusion study and a dynamic mouse cardiac imaging demonstrate that the proposed approach outperforms the vectorized dictionary-based CT reconstruction in the case of few-view reconstruction. PMID:25779991

  4. Tensor-based dictionary learning for dynamic tomographic reconstruction

    International Nuclear Information System (INIS)

    Tan, Shengqi; Wu, Zhifang; Zhang, Yanbo; Mou, Xuanqin; Wang, Ge; Cao, Guohua; Yu, Hengyong

    2015-01-01

    In dynamic computed tomography (CT) reconstruction, the data acquisition speed limits the spatio-temporal resolution. Recently, compressed sensing theory has been instrumental in improving CT reconstruction from far few-view projections. In this paper, we present an adaptive method to train a tensor-based spatio-temporal dictionary for sparse representation of an image sequence during the reconstruction process. The correlations among atoms and across phases are considered to capture the characteristics of an object. The reconstruction problem is solved by the alternating direction method of multipliers. To recover fine or sharp structures such as edges, the nonlocal total variation is incorporated into the algorithmic framework. Preclinical examples including a sheep lung perfusion study and a dynamic mouse cardiac imaging demonstrate that the proposed approach outperforms the vectorized dictionary-based CT reconstruction in the case of few-view reconstruction. (paper)

  5. The Effect of Lexicographical Information Costs on Dictionary Making and Use

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2008-01-01

    distinction is proposed between two general types of lexicographical information costs. Firstly, search-related costs are the efforts required by the look-up activities users have to perform when consulting a dictionary to find access to the data they are searching for. It is argued that the access route, article...... structure, data distribution and cross-references may influence search-related information costs positively as well as negatively. Secondly, comprehension-related costs are the efforts connected to the user's ability to understand and interpret the data presented in a dictionary. In other words: How easy...... or difficult is it for users to understand the data presented? Examples show how textual condensation, dictionary functions and use-situations may impact on the level of comprehension-related information costs. It is thus possible to establish a framework for evaluating lexicographical information costs...

  6. Linguistic Features of English and Russian Dictionaries (A Comparative Study)

    Directory of Open Access Journals (Sweden)

    Robert Leščinskij

    2013-06-01

    The purpose of this study is to establish differences and similarities between linguistic characteristics of English and Russian dictionaries. Two dictionaries were selected for the study: the electronic version of the 8th edition of the Oxford Advanced Learner’s Dictionary (OALD) and the online version of Ozhegov’s explanatory dictionary. The methods chosen for the study were descriptive, comparative and contrastive analysis. Linguistic characteristics of the dictionaries were analysed and compared. The research showed that the two reference books provided different linguistic information on the headwords. OALD provided exhaustive phonetic information, which Ozhegov’s dictionary lacked. The two dictionaries provided different orthographic information. OALD disclosed semantic information via various tools available in the electronic version; these were unavailable in Ozhegov’s dictionary. Both dictionaries used similar stylistic labels.

  7. Lamjung Yolmo - Nepali - English Dictionary

    OpenAIRE

    Gawne, Lauren

    2011-01-01

    Lamjung Yolmo is a Tibeto-Burman language spoken in Nepal. It is closely related to Helambu Sherpa and Kyirong. This small dictionary is part of an ongoing project to document the language of Lamjung Yolmo. This is the first time that Lamjung Yolmo has appeared in print, and is accessible to Lamjung Yolmo, Nepali and English speakers alike. The hardcopy is retailing at the Uni Bookshop: http://www.bookshop.unimelb.edu.au/cbc/?IS.9781921775697 (accessed week of 21...

  8. Dictionary corrosion and corrosion control

    International Nuclear Information System (INIS)

    1985-01-01

    This dictionary has 13,000 entries in both languages. Keywords and extensive accompanying information simplify the choice of word for the user. The following topics are covered: theoretical principles of corrosion; corrosion of the metals and alloys most frequently used in engineering; types of corrosion (chemical, electro-chemical, biological corrosion); forms of corrosion (superficial, pitting, selective, intercrystalline and stress corrosion; vibrational corrosion cracking); erosion and cavitation; methods of corrosion control (material selection, temporary corrosion protection media, paint and plastics coatings, electro-chemical coatings, corrosion prevention by treatment of the corrosive media); corrosion testing methods. (orig./HP) [de]

  9. Emo, love and god: making sense of Urban Dictionary, a crowd-sourced online dictionary.

    Science.gov (United States)

    Nguyen, Dong; McGillivray, Barbara; Yasseri, Taha

    2018-05-01

    The Internet facilitates large-scale collaborative projects and the emergence of Web 2.0 platforms, where producers and consumers of content unify, has drastically changed the information market. On the one hand, the promise of the 'wisdom of the crowd' has inspired successful projects such as Wikipedia, which has become the primary source of crowd-based information in many languages. On the other hand, the decentralized and often unmonitored environment of such projects may make them susceptible to low-quality content. In this work, we focus on Urban Dictionary, a crowd-sourced online dictionary. We combine computational methods with qualitative annotation and shed light on the overall features of Urban Dictionary in terms of growth, coverage and types of content. We measure a high presence of opinion-focused entries, as opposed to the meaning-focused entries that we expect from traditional dictionaries. Furthermore, Urban Dictionary covers many informal, unfamiliar words as well as proper nouns. Urban Dictionary also contains offensive content, but highly offensive content tends to receive lower scores through the dictionary's voting system. The low threshold to include new material in Urban Dictionary enables quick recording of new words and new meanings, but the resulting heterogeneous content can pose challenges in using Urban Dictionary as a source to study language innovation.

  10. Sinogram denoising via simultaneous sparse representation in learned dictionaries

    International Nuclear Information System (INIS)

    Karimi, Davood; Ward, Rabab K

    2016-01-01

    Reducing the radiation dose in computed tomography (CT) is highly desirable but it leads to excessive noise in the projection measurements. This can significantly reduce the diagnostic value of the reconstructed images. Removing the noise in the projection measurements is, therefore, essential for reconstructing high-quality images, especially in low-dose CT. In recent years, two new classes of patch-based denoising algorithms proved superior to other methods in various denoising applications. The first class is based on sparse representation of image patches in a learned dictionary. The second class is based on the non-local means method. Here, the image is searched for similar patches and the patches are processed together to find their denoised estimates. In this paper, we propose a novel denoising algorithm for cone-beam CT projections. The proposed method has similarities to both these algorithmic classes but is more effective and much faster. In order to exploit both the correlation between neighboring pixels within a projection and the correlation between pixels in neighboring projections, the proposed algorithm stacks noisy cone-beam projections together to form a 3D image and extracts small overlapping 3D blocks from this 3D image for processing. We propose a fast algorithm for clustering all extracted blocks. The central assumption in the proposed algorithm is that all blocks in a cluster have a joint-sparse representation in a well-designed dictionary. We describe algorithms for learning such a dictionary and for denoising a set of projections using this dictionary. We apply the proposed algorithm on simulated and real data and compare it with three other algorithms. Our results show that the proposed algorithm outperforms some of the best denoising algorithms, while also being much faster. (paper)

  11. Image fusion using sparse overcomplete feature dictionaries

    Science.gov (United States)

    Brumby, Steven P.; Bettencourt, Luis; Kenyon, Garrett T.; Chartrand, Rick; Wohlberg, Brendt

    2015-10-06

    Approaches for deciding what individuals in a population of visual system "neurons" are looking for using sparse overcomplete feature dictionaries are provided. A sparse overcomplete feature dictionary may be learned for an image dataset and a local sparse representation of the image dataset may be built using the learned feature dictionary. A local maximum pooling operation may be applied on the local sparse representation to produce a translation-tolerant representation of the image dataset. An object may then be classified and/or clustered within the translation-tolerant representation of the image dataset using a supervised classification algorithm and/or an unsupervised clustering algorithm.
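
    The local maximum pooling step mentioned in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented method: the sparse representation is taken to be a 2-D grid of coefficients for a single dictionary atom, pooling is over non-overlapping windows, and coefficient magnitudes are pooled. The function name local_max_pool and the window parameter are hypothetical.

```python
def local_max_pool(feature_map, window=2):
    """Max-pool coefficient magnitudes over non-overlapping window x window blocks.

    feature_map is a list of equal-length rows holding the sparse coefficients
    of one dictionary atom at each image location (an assumed layout).
    """
    rows, cols = len(feature_map), len(feature_map[0])
    pooled = []
    for r in range(0, rows, window):
        pooled_row = []
        for c in range(0, cols, window):
            block = [
                abs(feature_map[i][j])
                for i in range(r, min(r + window, rows))
                for j in range(c, min(c + window, cols))
            ]
            pooled_row.append(max(block))
        pooled.append(pooled_row)
    return pooled


# The same activation at two nearby positions inside one pooling window.
activation_a = [[0, 0, 0, 0], [0, 3, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0]]
activation_b = [[3, 0, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0]]
pooled_a = local_max_pool(activation_a)
pooled_b = local_max_pool(activation_b)
```

    Both inputs pool to the same grid, which is the sense in which the pooled representation tolerates small translations.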

  12. A General Framework for Reviewing Dictionaries

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2013-01-01

    in specific types of situations in the real (extra-lexicographic) world. I propose a basis for a framework that contains an outline of general theoretical and practical principles that underlie the true nature of dictionary reviews, and places the reviews in a lexicographic universe with the dictionary...... and lexicography at its centre. This seems to be in line with the modern understanding of lexicography as a separate academic discipline concerned with the compilation, design, evaluation and use of dictionaries. Moreover, a set of generally applicable principles may lead the discourse community to accept...

  13. RCRAInfo Download Summary and Data Element Dictionary ...

    Science.gov (United States)

    ECHO, Enforcement and Compliance History Online, provides compliance and enforcement information for approximately 800,000 EPA-regulated facilities nationwide. ECHO includes permit, inspection, violation, enforcement action, and penalty information about facilities regulated under the Clean Air Act (CAA) Stationary Source Program, Clean Water Act (CWA) National Pollutant Discharge Elimination System (NPDES), and/or Resource Conservation and Recovery Act (RCRA). Information also is provided on surrounding demographics when available.

  14. FRS Download Summary and Data Element Dictionary ...

    Science.gov (United States)

    ECHO, Enforcement and Compliance History Online, provides compliance and enforcement information for approximately 800,000 EPA-regulated facilities nationwide. ECHO includes permit, inspection, violation, enforcement action, and penalty information about facilities regulated under the Clean Air Act (CAA) Stationary Source Program, Clean Water Act (CWA) National Pollutant Discharge Elimination System (NPDES), and/or Resource Conservation and Recovery Act (RCRA). Information also is provided on surrounding demographics when available.

  15. ITS logical architecture : volume 3, data dictionary.

    Science.gov (United States)

    1981-01-01

    The objective of the research effort was to develop an empirically and experientially based model pedestrian safety program which cities can use as guidelines for pedestrian safety program planning, implementation, and evaluation. The basis of these ...

  16. Real-time variables dictionary (RTVD), and expert system for development of real-time applications in nuclear power plants

    International Nuclear Information System (INIS)

    Senra Martinez, A.; Schirru, R.; Dutra Thome Filho, Z.

    1990-01-01

    This paper presents a computerized methodology based on a data dictionary managed by an expert system, called the Real-Time Variables Dictionary (RTVD). The system is very useful for the development of real-time applications in nuclear power plants. The RTVD functions and their implementation on a VAX 8600 computer are described in detail. The artificial intelligence concepts used in the RTVD are also pointed out.

  17. Patient-Specific Seizure Detection in Long-Term EEG Using Signal-Derived Empirical Mode Decomposition (EMD)-based Dictionary Approach.

    Science.gov (United States)

    Kaleem, Muhammad; Gurve, Dharmendra; Guergachi, Aziz; Krishnan, Sridhar

    2018-06-25

    The objective of the work described in this paper is the development of a computationally efficient methodology for patient-specific automatic seizure detection in long-term multi-channel EEG recordings. Approach: A novel patient-specific seizure detection approach based on a signal-derived Empirical Mode Decomposition (EMD)-based dictionary is proposed. For this purpose, we use an empirical framework for EMD-based dictionary creation and learning, inspired by traditional dictionary learning methods, in which the EMD-based dictionary is learned from the multi-channel EEG data being analyzed for automatic seizure detection. We present the algorithm for dictionary creation and learning, whose purpose is to learn dictionaries with a small number of atoms. Using training signals belonging to seizure and non-seizure classes, an initial dictionary, termed the raw dictionary, is formed. The atoms of the raw dictionary are composed of intrinsic mode functions obtained after decomposition of the training signals using the empirical mode decomposition algorithm. The raw dictionary is then trained using a learning algorithm, resulting in a substantial decrease in the number of atoms in the trained dictionary. The trained dictionary is then used for automatic seizure detection, such that coefficients of orthogonal projections of test signals against the trained dictionary form the features used for classification of test signals into seizure and non-seizure classes. Thus no hand-engineered features have to be extracted from the data as in traditional seizure detection approaches. Main results: The performance of the proposed approach is validated using the CHB-MIT benchmark database, and averaged accuracy, sensitivity and specificity values of 92.9%, 94.3% and 91.5%, respectively, are obtained using support vector machine classifier and five-fold cross-validation method. These results are compared with other approaches using the same database, and the suitability

  18. What Dictionary to Use? A Closer Look at the "Oxford Advanced Learner's Dictionary," the "Longman Dictionary of Contemporary English" and the "Longman Lexicon of Contemporary English."

    Science.gov (United States)

    Shaw, A. M.

    1983-01-01

    Three dictionaries are compared for their usefulness to teachers of English as a foreign language, teachers in training, students, and other users of English as a foreign language. The issue of monolingual versus bilingual dictionary format is discussed, and a previous analysis of the two bilingual dictionaries is summarized. Pronunciation…

  19. A Cluster-based Approach Towards Detecting and Modeling Network Dictionary Attacks

    Directory of Open Access Journals (Sweden)

    A. Tajari Siahmarzkooh

    2016-12-01

    In this paper, we provide an approach to detect network dictionary attacks using a data set collected as flows, from which a clustered graph is built. These flows provide an aggregated view of the network traffic in which the packets exchanged in the network are considered, so that more internally connected nodes are clustered together. We show that dictionary attacks can be detected through parameters such as the number and the weight of clusters in time series and their evolution over time. Additionally, a Markov model based on the average weight of clusters is created. Finally, by means of our suggested model, we demonstrate that artificial clusters of the flows are created for normal and malicious traffic. The results of the proposed approach on the CAIDA 2007 data set suggest a high accuracy for the model and, therefore, it provides a proper method for detecting the dictionary attack.

  20. Parsing and Tagging of Bilingual Dictionary

    National Research Council Canada - National Science Library

    Ma, Huanfeng; Karagol-Ayan, Burcu; Doermann, David S; Oard, Doug; Wang, Jianqiang

    2003-01-01

    Bilingual dictionaries hold great potential as a source of lexical resources for training and testing automated systems for optical character recognition, machine translation, and cross-language information retrieval...

  1. Parsing and Tagging of Bilingual Dictionary

    National Research Council Canada - National Science Library

    Ma, Huanfeng; Karagol-Ayan, Burcu; Doermann, David S; Oard, Doug; Wang, Jianqiang

    2003-01-01

    Bilingual dictionaries hold great potential as a source of lexical resources for training and testing automated systems for optical character recognition, machine translation, and cross-language information retrieval...

  2. Dictionnaires et encyclopedies (Dictionaries and Encyclopedias).

    Science.gov (United States)

    Ferran, Pierre

    1988-01-01

    Eight French dictionaries and encyclopedic reference books are reviewed, focusing on their formats, characteristics, and intended uses. They include references for language, geopolitics and economics, economic history, signs and symbols, and an almanac. (MSE)

  3. Dictionary of nuclear power. upd. ed.

    International Nuclear Information System (INIS)

    Koelzer, W.

    2011-10-01

    The updated dictionary on nuclear power contains definitions and explanations on nuclear physics, nuclear engineering, nuclear power, radiation effects and radiation protection in alphabetic order. Attachments on units, their conversion and physical constants are included.

  4. Verifying pronunciation dictionaries using conflict analysis

    CSIR Research Space (South Africa)

    Davel, MH

    2010-09-01

    The authors describe a new language-independent technique for automatically identifying errors in an electronic pronunciation dictionary by analyzing the source of conflicting patterns directly. They evaluate the effectiveness of the technique in two...

  5. Hyperspectral Image Classification Using Discriminative Dictionary Learning

    International Nuclear Information System (INIS)

    Zongze, Y; Hao, S; Kefeng, J; Huanxin, Z

    2014-01-01

    The hyperspectral image (HSI) processing community has witnessed a surge of papers focusing on the utilization of sparse priors for effective HSI classification. In sparse representation based HSI classification, there are two phases: sparse coding with an over-complete dictionary, and classification. In this paper, we first apply a novel Fisher discriminative dictionary learning method, which captures the relative difference between classes. The competitive selection strategy ensures that atoms in the resulting over-complete dictionary are the most discriminative. Secondly, motivated by the assumption that spatially adjacent samples are statistically related and even belong to the same materials (same class), we propose a majority voting scheme incorporating contextual information to predict the category label. Experimental results show that the proposed method can effectively strengthen the relative discrimination of the constructed dictionary, and that incorporating the majority voting scheme generally achieves improved prediction performance.
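
    The idea of a majority vote over spatially adjacent samples can be sketched as below. This is only an illustration of neighborhood voting in general, not the scheme evaluated in the paper: the square neighborhood, the radius parameter and the function name majority_vote_labels are assumptions.

```python
from collections import Counter


def majority_vote_labels(label_map, radius=1):
    """Relabel each pixel with the most common label in its square neighborhood.

    label_map is a list of equal-length rows of per-pixel class labels, for
    example the output of a pixel-wise sparse-representation classifier.
    Ties are resolved by Counter.most_common ordering (an arbitrary choice).
    """
    rows, cols = len(label_map), len(label_map[0])
    smoothed = []
    for r in range(rows):
        smoothed_row = []
        for c in range(cols):
            votes = Counter(
                label_map[i][j]
                for i in range(max(0, r - radius), min(rows, r + radius + 1))
                for j in range(max(0, c - radius), min(cols, c + radius + 1))
            )
            smoothed_row.append(votes.most_common(1)[0][0])
        smoothed.append(smoothed_row)
    return smoothed


# A single isolated label in a homogeneous region is voted away.
noisy = [[1, 1, 1], [1, 2, 1], [1, 1, 1]]
cleaned = majority_vote_labels(noisy)
```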

  6. Using dictionaries to study the mental lexicon.

    Science.gov (United States)

    Anshen, F; Aronoff, M

    The notion of a mental lexicon has its historical roots in practical reference dictionaries. The distributional analysis of dictionaries provides one means of investigating the structure of the mental lexicon. We review our earlier work with dictionaries, based on a three-way horserace model of lexical access and production, and then present the most recent results of our ongoing analysis of the Oxford English Dictionary, Second Edition on CD-ROM, which traces changes in productivity over time of the English suffixes -ment and -ity, both of which originate in French borrowings. Our results lead us to question the validity of automatic analogy from a set of existing words as the driving force behind morphological productivity. Copyright 1999 Academic Press.

  7. Phonemic Transcriptions in British and American Dictionaries

    Directory of Open Access Journals (Sweden)

    Rastislav Šuštaršič

    2005-06-01

    In view of recent criticisms concerning vowel symbols in some British English dictionaries (in particular by J. Windsor Lewis in JIPA (Windsor Lewis, 2003), with regard to the Oxford Dictionary of Pronunciation (Upton, 2001)), this article extends the discussion on English phonemic transcriptions by including those that typically occur in standard American dictionaries, and by comparing the most common conventions of British and American dictionaries. In addition to symbols for both vowels and consonants, the paper also deals with the different representations of word accentuation and the issue of consistency regarding application of phonemic (systemic, broad), rather than phonetic (allophonic, narrow), transcription. The different transcriptions are assessed from the points of view of their departures from the International Phonetic Alphabet, their overlapping with orthographic representation (spelling) and their appropriateness in terms of reflecting actual pronunciation in standard British and/or American pronunciation.

  8. A critical analysis of multilingual dictionaries

    African Journals Online (AJOL)

    user

    (ANNA) attempts to make a case for the value of reading a dictionary for recreation ... This assumption, however, should be supported by actual user studies and ...... inexperienced user can incorrectly conclude that ka is the word for power in.

  9. (CEPTSA) — Translating and Explanatory Dictionaries

    African Journals Online (AJOL)

    The phases of the project, consisting of different translating and explanatory versions, are discussed. ... already published, namely the Dictionary category of the South African Translators' Institute (SATI) (2003), ...

  10. Some Features of Monolingual LSP Dictionaries

    African Journals Online (AJOL)

    rbr

    119 tion. Therefore, an important product of lexicography is a dictionary or word- .... graphic language description, focusing on the set of eventual questions from ..... (3) "Absence" (Matrimonial Causes Act 1950 (c. 25), s. 14 (3)) means "physi-.

  11. Approximate dictionary queries

    DEFF Research Database (Denmark)

    Brodal, Gerth Stølting; Gasieniec, Leszek

    1996-01-01

    Given a set of n binary strings of length m each, we consider the problem of answering d-queries. Given a binary query string of length m, a d-query is to report if there exists a string in the set within Hamming distance d of the query string. We present a data structure of size O(nm) supporting 1-queries in ti...
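
    The d-query defined in this abstract can be stated directly in code. The sketch below is a brute-force baseline that scans all n strings in O(nm) time per query; it is not the data structure the paper presents, and the function names hamming and d_query are hypothetical.

```python
def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("strings must have the same length")
    return sum(x != y for x, y in zip(a, b))


def d_query(strings, query, d):
    """Report whether some string in the set is within Hamming distance d of query."""
    return any(hamming(s, query) <= d for s in strings)


dictionary = ["0000", "1111", "1010"]
hit = d_query(dictionary, "0001", 1)    # "0000" differs in one position
miss = d_query(dictionary, "0110", 1)   # every string differs in two positions
```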

  12. Routledge French technical dictionary

    CERN Document Server

    1994-01-01

    The French-English volume of this highly acclaimed set consists of some 100,000 keywords in both French and English, drawn from the whole range of modern applied science and technical terminology. Covers over 70 subject areas, from engineering and chemistry to packaging, transportation, data processing and much more.

  13. An electronic dictionary of Danish Sign Language

    DEFF Research Database (Denmark)

    Kristoffersen, Jette Hedegaard; Troelsgård, Thomas

    2008-01-01

    Compiling sign language dictionaries has in the last 15 years changed from most often being simply collecting and presenting signs for a given gloss in the surrounding vocal language to being a complicated lexicographic task including all parts of linguistic analysis, i.e. phonology, phonetics......, morphology, syntax and semantics. In this presentation we will give a short overview of the Danish Sign Language dictionary project. We will further focus on lemma selection and some of the problems connected with lemmatisation....

  14. Oxford Dictionary of Sports Science and Medicine

    OpenAIRE

    2007-01-01

    DESCRIPTION The Oxford Dictionary of Sports Science and Medicine provides reliable definitions of sports science and medicine terms. It provides an invaluable reference book for anyone interested in the captivating subject of sport. PURPOSE This dictionary aims to include almost every sports science, anatomy, physiology, biomechanical, injuries description, and psychological term as related to sports medicine and science and support the explanations by illustrations wherever necessary. AUDIEN...

  15. Dictionary of nuclear power. January 2013 ed.

    International Nuclear Information System (INIS)

    Koelzer, Winfried

    2013-01-01

    The updated version (January 2013) of the dictionary on nuclear power includes all updates and new entries since the last version of 2001. The original publication dates from 1980. The dictionary includes definitions, terms, measuring units and helpful information on the current state of knowledge concerning nuclear power, nuclear fuel cycle, nuclear facilities, radioactive waste management, nuclear physics, reactor physics, isotope production, biological radiation effects, and radiation protection.

  16. The need for an electronic multilingual dictionary

    Directory of Open Access Journals (Sweden)

    Anna Kisiel

    2014-09-01

    The paper analyses the issue of providing adequate equivalents in multilingual dictionaries. If equivalents are adequate, it means that: (1) the scope of meaning of one item is identical to its equivalent (cf. drive: drive a nail vs. drive a car); and (2) the collocations of the equivalents overlap. Two significant problems arise when searching for adequate equivalents: the lack of equivalents whose meanings are identical (narrower/wider meanings, partial overlap of meanings, more than one equally good equivalent), and equivalents with homographs in a given language. Because such issues are difficult to resolve in a printed dictionary, we put forward some methods of addressing the problems in an electronic dictionary. The paper offers an example entry from such a dictionary, which presents a suggestion of a layout. We also took into consideration the potential problems which may appear if the entry is presented in this manner: first, one must set a limit for the description (a defined number of lexical units); second, one must avoid circularity, but at the same time also strive for an exhaustive description. Electronic dictionaries offer greater possibilities of presenting modern vocabulary and adding new classifiers (e.g. a classifier of politeness).

  17. Emo, love and god: making sense of Urban Dictionary, a crowd-sourced online dictionary

    Science.gov (United States)

    McGillivray, Barbara

    2018-01-01

    The Internet facilitates large-scale collaborative projects and the emergence of Web 2.0 platforms, where producers and consumers of content unify, has drastically changed the information market. On the one hand, the promise of the ‘wisdom of the crowd’ has inspired successful projects such as Wikipedia, which has become the primary source of crowd-based information in many languages. On the other hand, the decentralized and often unmonitored environment of such projects may make them susceptible to low-quality content. In this work, we focus on Urban Dictionary, a crowd-sourced online dictionary. We combine computational methods with qualitative annotation and shed light on the overall features of Urban Dictionary in terms of growth, coverage and types of content. We measure a high presence of opinion-focused entries, as opposed to the meaning-focused entries that we expect from traditional dictionaries. Furthermore, Urban Dictionary covers many informal, unfamiliar words as well as proper nouns. Urban Dictionary also contains offensive content, but highly offensive content tends to receive lower scores through the dictionary’s voting system. The low threshold to include new material in Urban Dictionary enables quick recording of new words and new meanings, but the resulting heterogeneous content can pose challenges in using Urban Dictionary as a source to study language innovation. PMID:29892417

  18. The Compilation of Electronic Dictionaries for the African Languages

    African Journals Online (AJOL)

    rbr

    English language on an ongoing basis" (OED Online, Introduction). ... games. In LIED and LIAD the user searching one section of the database will also be .... Some electronic dictionaries such as the Oxford English Dictionary (second.

  19. Use of Monolingual and Bilingual Dictionaries among Students of English

    Directory of Open Access Journals (Sweden)

    Monika Kavalir

    2010-12-01

    The study of dictionary use in 32 first-year students of English at the University of Ljubljana in the academic year 2009/2010 shows that students use a variety of dictionaries with a slight preponderance of monolingual dictionaries over bilingual ones. The bilingual dictionaries listed do not include some of the most recent and most comprehensive dictionaries, while some of the most frequently used resources are quite modestly sized. The students are already predominantly users of electronic and online dictionaries with a lower frequency of printed resources, a trend which is only likely to accelerate with the advent of new bilingual online dictionaries. These results have practical relevance for teachers in all sectors, from primary and secondary schools to universities, as they point towards a need for additional training in the use of bilingual dictionaries. The transition from printed to electronic and online resources can also be expected to induce changes in EFL methodology at all levels.

  20. Making an Online Dictionary of New Zealand Sign Language ...

    African Journals Online (AJOL)

    ... is an example of a contemporary sign language dictionary that leverages the 21st ... informed development of this bilingual, bi-directional, multimedia dictionary. ... and dealing with sociolinguistic variation in the selection and performance of ...

  1. Sparse representation and dictionary learning penalized image reconstruction for positron emission tomography

    International Nuclear Information System (INIS)

    Chen, Shuhang; Liu, Huafeng; Shi, Pengcheng; Chen, Yunmei

    2015-01-01

    Accurate and robust reconstruction of the radioactivity concentration is of great importance in positron emission tomography (PET) imaging. Given the Poisson nature of photon-counting measurements, we present a reconstruction framework that integrates a sparsity penalty on a dictionary into a maximum likelihood estimator. Patch-sparsity on a dictionary provides the regularization for our effort, and iterative procedures are used to solve the maximum likelihood function formulated on Poisson statistics. Specifically, in our formulation, a dictionary could be trained on CT images, to provide intrinsic anatomical structures for the reconstructed images, or adaptively learned from the noisy measurements of PET. The accuracy of the strategy is demonstrated with very promising results from Monte Carlo simulations and real data.

  2. Recent Development of Dual-Dictionary Learning Approach in Medical Image Analysis and Reconstruction

    Science.gov (United States)

    Wang, Bigong; Li, Liang

    2015-01-01

    As an implementation of compressive sensing (CS), the dual-dictionary learning (DDL) method provides an ideal way to restore signals using two related dictionaries and sparse representation. It has been proven that this method performs well in medical image reconstruction with highly undersampled data, especially for multimodality imaging like CT-MRI hybrid reconstruction. Because of its outstanding strength, short signal acquisition time, and low radiation dose, DDL has attracted broad interest in both academic and industrial fields. In this review article, we summarize DDL's development history, review the latest advances, and discuss its role in future directions and potential applications in medical imaging. This paper also points out that DDL is still at an initial stage and that further studies are needed to improve the method, especially in dictionary training. PMID:26089956

  3. Compiling the Dictionary of Word Associations in Estonian: From scratch to the database

    Directory of Open Access Journals (Sweden)

    Ene Vainik

    2018-04-01

    The present paper describes the project titled “The Dictionary of Word Associations in Estonian” undertaken by the author at the Institute of the Estonian Language. The general aim of the Dictionary is to provide insights into Estonians’ common-sense mind. It is meant to be a tool of self-reflection for Estonian native speakers and a guide for the foreigners who are eager enough to make themselves familiar with the Estonian cultural patterns of thought. The Dictionary will be published online. The number of keywords was initially limited to approximately 800. Specific emphasis is given to the stage of data collection by implementing the principles of citizen science.

  5. Overcoming complexities: Damage detection using dictionary learning framework

    Science.gov (United States)

    Alguri, K. Supreet; Melville, Joseph; Deemer, Chris; Harley, Joel B.

    2018-04-01

    For in situ damage detection, guided wave structural health monitoring systems have been widely researched due to their ability to evaluate large areas and their ability to detect many types of damage. These systems often evaluate structural health by recording initial baseline measurements from a pristine (i.e., undamaged) test structure and then comparing later measurements with that baseline. Yet, it is not always feasible to have a pristine baseline. As an alternative, substituting the baseline with data from a surrogate (nearly identical and pristine) structure is a logical option. While effective in some circumstances, surrogate data is often still a poor substitute for pristine baseline measurements due to minor differences between the structures. To overcome this challenge, we present a dictionary learning framework to adapt surrogate baseline data to better represent an undamaged test structure. We compare the performance of our framework with two other surrogate-based damage detection strategies: (1) using raw surrogate data for comparison and (2) using sparse wavenumber analysis, a precursor to our framework for improving the surrogate data. We apply our framework to guided wave data from two 108 mm by 108 mm aluminum plates. With 20 measurements, we show that our dictionary learning framework achieves 98% accuracy, raw surrogate data achieves 92% accuracy, and sparse wavenumber analysis achieves 57% accuracy.

  6. Den Engelske Regnskabsordbog/English Dictionary of Accounting

    DEFF Research Database (Denmark)

    Nielsen, Sandro; Mourier, Lise; Bergenholtz, Henning

    The English Dictionary of Accounting contains about 5,600 accounting terms: British, American and international (IFRS). The terms are defined and the dictionary gives language information about the terms. The dictionary can be used when writing and reading English accounting texts and when y...... want to learn more about accounting and financial reporting. The dictionary is designed for accountants, auditors, translators, students, communication officers and others interested in financial reporting....

  7. Dictionary of radiology. Radiologisches Woerterbuch

    Energy Technology Data Exchange (ETDEWEB)

    Freye, K; Lammers, W

    1982-01-01

    The dictionary of radiology is based on practical experience in diagnostic radiology. Following a brief clinical introduction, radiological methods including nuclear medicine and the increasingly important field of sonography are presented in alphabetic order, each term with a short definition. The most favourable order of application is determined by the diagnostic value, technical requirements and discomfort of the various methods. Preparative measures, the duration of the examinations, and problems of radiation hygiene are discussed. Illustrative drawings supplement the text. The fields of application given for the various methods are based on the latest state of knowledge. Other methods, e.g. endoscopy in all its variants and thermography, are mentioned wherever they are of diagnostic value. The book has a brief appendix in which the fundamental physical and technical context is explained, also in alphabetic order. Detailed cross-references establish a connection between diseases and diagnostic methods, thus facilitating access to the desired information.

  8. Dictionary of Minor Planet Names

    CERN Document Server

    Schmadel, Lutz D

    2007-01-01

    Dictionary of Minor Planet Names, Fifth Edition, is the official reference for the field of the IAU, which serves as the internationally recognised authority for assigning designations to celestial bodies and any surface features on them. The accelerating rate of the discovery of minor planets has not only made a new edition of this established compendium necessary but has also significantly altered its scope: this thoroughly revised edition concentrates on the approximately 10,000 minor planets that carry a name. It provides authoritative information about the basis for all names of minor planets. In addition to being of practical value for identification purposes, this collection provides a most interesting historical insight into the work of those astronomers who over two centuries vested their affinities in a rich and colorful variety of ingenious names, from heavenly goddesses to more prosaic constructions. The fifth edition serves as the primary reference, with plans for complementary booklets with newl...

  9. Compiling Dictionaries Using Semantic Domains*

    Directory of Open Access Journals (Sweden)

    Ronald Moe

    2011-10-01

    Abstract: The task of providing dictionaries for all the world's languages is prodigious, requiring efficient techniques. The text corpus method cannot be used for minority languages lacking texts. To meet the need, the author has constructed a list of 1 600 semantic domains, which he has successfully used to collect words. In a workshop setting, a group of speakers can collect as many as 17 000 words in ten days. This method results in a classified word list that can be efficiently expanded into a full dictionary. The method works because the mental lexicon is a giant web organized around key concepts. A semantic domain can be defined as an important concept together with the words directly related to it by lexical relations. A person can utilize the mental web to quickly jump from word to word within a domain. The author is developing a template for each domain to aid in collecting words and in describing their semantics. Investigating semantics within the context of a domain yields many insights. The method permits the production of both alphabetically and semantically organized dictionaries. The list of domains is intended to be universal in scope and applicability. Perhaps due to universals of human experience and universals of linguistic competence, there are striking similarities in various lists of semantic domains developed for languages around the world. Using a standardized list of domains to classify multiple dictionaries opens up possibilities for cross-linguistic research into semantic and lexical universals.

    Keywords: SEMANTIC DOMAINS, SEMANTIC FIELDS, SEMANTIC CATEGORIES, LEXICAL RELATIONS, SEMANTIC PRIMITIVES, DOMAIN TEMPLATES, MENTAL LEXICON, SEMANTIC UNIVERSALS, MINORITY LANGUAGES, LEXICOGRAPHY

    Summary: Compiling dictionaries using semantic domains. The task of providing dictionaries for all the world's languages is enormous and requires efficient techniques. The

  10. Compiling a Sign Language Dictionary

    DEFF Research Database (Denmark)

    Kristoffersen, Jette Hedegaard; Troelsgård, Thomas

    2010-01-01

    As we began working on the Danish Sign Language (DTS) Dictionary, we soon realised the truth in the statement that a lexicographer has to deal with problems within almost any linguistic discipline. Most of these problems come down to establishing simple rules, rules that can easily be applied every...... – or are they homonyms?" and so on. Very often such questions demand further research and can't be answered sufficiently through a simple standard formula. Therefore lexicographic work often seems like an endless series of compromises. Another source of compromise arises when you set out to decide which information...... this dilemma, as we see DTS learners and teachers as well as native DTS signers as our target users. In the following we will focus on four problem areas with particular relevance for the sign language lexicographer: sign representation; spoken language equivalents and mouth movements; example sentences; partial...

  11. Sparse and Adaptive Diffusion Dictionary (SADD) for recovering intra-voxel white matter structure.

    Science.gov (United States)

    Aranda, Ramon; Ramirez-Manzanares, Alonso; Rivera, Mariano

    2015-12-01

    In the analysis of diffusion-weighted magnetic resonance images, multi-compartment models overcome the limitations of the well-known Diffusion Tensor model for fitting in vivo brain axonal orientations at voxels with fiber crossings, branching, kissing or bifurcations. Some successful multi-compartment methods are based on diffusion dictionaries. The diffusion dictionary-based methods assume that the observed magnetic resonance signal at each voxel is a linear combination of the fixed dictionary elements (dictionary atoms). The atoms are fixed along different orientations and diffusivity profiles. In this work, we present a sparse and adaptive diffusion dictionary method based on the Diffusion Basis Functions Model to estimate in vivo brain axonal fiber populations. Our proposal overcomes the following limitations of the diffusion dictionary-based methods: the limited angular resolution and the fixed shapes for the atom set. We propose to iteratively re-estimate the orientations and the diffusivity profile of the atoms independently at each voxel by using a simplified and easier-to-solve mathematical approach. As a result, we improve the fitting of the diffusion-weighted magnetic resonance signal. The advantages with respect to the former Diffusion Basis Functions method are demonstrated on the synthetic data set used in the 2012 HARDI Reconstruction Challenge and on in vivo human data. We demonstrate that improvements obtained in the intra-voxel fiber structure estimations benefit brain research by allowing better tractography estimations. Hence, these improvements result in an accurate computation of the brain connectivity patterns.

  12. The Use of Pocket Electronic Dictionaries by Thai University Students

    African Journals Online (AJOL)

    rbr

    Ed.). 1998. Using Dictionaries. Studies of Dictionary Use by Language Learners and Translators: 83-122. Lexicographica. Series Maior 88. Tübingen: Max Niemeyer. Boonmoh, A. and H. Nesi. 2008. A Survey of Dictionary Use by Thai ...

  13. Implementing a dictionary culture for South Africa: attempt at a ...

    African Journals Online (AJOL)

    R.B. Ruthven

    This information should mainly be distributed through dictionary aware- .... could co-operate and share the costs of developing a marketing strategy which could be ... Pro (Sesotho sa Leboa Dictionary Project) online dictionary in South African ... Examples of teaching lexicographic theory are workshops at certain lexi-.

  14. Research Timeline: Dictionary Use by English Language Learners

    Science.gov (United States)

    Nesi, Hilary

    2014-01-01

    The history of research into dictionary use tends to be characterised by small-scale studies undertaken in a variety of different contexts, rather than larger-scale, longer-term funded projects. The research conducted by dictionary publishers is not generally made public, because of its commercial sensitivity, yet because dictionary production is…

  15. Evaluating Bilingual and Monolingual Dictionaries for L2 Learners.

    Science.gov (United States)

    Hunt, Alan

    1997-01-01

    A discussion of dictionaries and their use for second language (L2) learning suggests that lack of computerized modern language corpora can adversely affect bilingual dictionaries, commonly used by L2 learners, and shows how use of such corpora has benefitted two contemporary monolingual L2 learner dictionaries (1995 editions of the Longman…

  16. Online English-English Learner Dictionaries Boost Word Learning

    Science.gov (United States)

    Nurmukhamedov, Ulugbek

    2012-01-01

    Learners of English might be familiar with several online monolingual dictionaries that are not necessarily the best choices for the English as Second/Foreign Language (ESL/EFL) context. Although these monolingual online dictionaries contain definitions, pronunciation guides, and other elements normally found in general-use dictionaries, they are…

  17. The Use of Hyper-Reference and Conventional Dictionaries.

    Science.gov (United States)

    Aust, Ronald; And Others

    1993-01-01

    Describes a study of 80 undergraduate foreign language learners that compared the use of a hyper-reference source incorporating an electronic dictionary and a conventional paper dictionary. Measures of consultation frequency, study time, efficiency, and comprehension are examined; bilingual and monolingual dictionary use is compared; and further…

  18. Innovation and compromise in K. Endemann's dictionary of the ...

    African Journals Online (AJOL)

    This article seeks to highlight some lexicographic features of the dictionary of the Sotho language by K. Endemann against the backdrop of best practices of dictionary compilation today. Over the last century the dictionary, entitled Wörterbuch der Sotho-Sprache (1911), has elicited mixed reactions from Bantu scholars, some ...

  19. South Africa's new African language dictionaries and their use for ...

    African Journals Online (AJOL)

    Dictionaries are useful tools for language documentation and standardization, as they try to cover and document the general vocabulary (general dictionaries) or the specialized vocabulary (technical dictionaries). They empower the language users because they help to improve communication by providing users with the ...

  20. Learner features in a New Corpus-based Swahili dictionary ...

    African Journals Online (AJOL)

    As far as traditionally published Swahili language dictionaries are concerned, throughout the long history of Swahili lexicography, most new dictionaries were based on their predecessors. Thus far the only innovative traditionally printed corpus-based dictionary has been published by Finnish scholars (Abdulla et al. 2002).

  1. Language learner's use of a bilingual dictionary: a comparative ...

    African Journals Online (AJOL)

    This paper compares and contrasts dictionary use and needs of language learners at the University of York in the United Kingdom and at the University of Dar es Salaam in Tanzania. Five aspects are discussed in this study viz. dictionaries used, instructions and guidance on dictionary use, the functions for which students ...

  2. Thoughts and views on the compilation of monolingual dictionaries ...

    African Journals Online (AJOL)

    The end-products should be of a high lexicographic standard, well-balanced in terms of lemma selection, length of the articles, maximum utilisation of available dictionary space etc. They should also be planned and compiled in such a way that the transition from paper dictionaries to electronic dictionaries could be easily ...

  3. Two Recent Major Afrikaans–English/English–Afrikaans Dictionaries ...

    African Journals Online (AJOL)

    Abstract: When Pharos Dictionaries was established in 1996, its first order of business was to develop a comprehensive Afrikaans–English/English–Afrikaans dictionary that could succeed the standard-bearing but ageing TW (Tweetalige Woordeboek/Bilingual Dictionary by Bosman, Van der Merwe and Hiemstra).

  4. OXFORD DICTIONARY OF SPORTS SCIENCE AND MEDICINE

    Directory of Open Access Journals (Sweden)

    Michael Kent

    2007-03-01

    DESCRIPTION The Oxford Dictionary of Sports Science and Medicine provides reliable definitions of sports science and medicine terms. It provides an invaluable reference book for anyone interested in the captivating subject of sport. PURPOSE This dictionary aims to include almost every sports science, anatomy, physiology, biomechanical, injuries description, and psychological term as related to sports medicine and science and support the explanations by illustrations wherever necessary. AUDIENCE As a comprehensive dictionary of sports science and medicine, it will be of particular help to medical specialists and general practitioners, as well as students of PE, coaches, and athletes who need to understand the anatomical structures and physiological processes which affect athletic performance. Any member of the public who is interested in health and fitness, exercise and sport, or who wants to understand what obscure terms such as jogger's nipple, social loafing, and Zatopek phenomenon mean, will also benefit from this book. FEATURES The Oxford Dictionary of Sports Science and Medicine features terms in A to Z fashion from all the major areas of sports science and medicine including: anatomy, physiology/exercise physiology, biomechanics, training principles and techniques, nutrition, sports psychology and sociology, sports injuries and rehabilitation. A team of prominent contributors and advisers put together this dictionary in the first edition. The third edition includes around 8000 cross-referenced terms which have been updated or added since the first edition. There are plenty of illustrations wherever appropriate to make the terms easily understandable. ASSESSMENT A must-have dictionary for all medics practising in sports and exercise medicine, as well as students of medicine, physical education, nursing and physiotherapy. Coaches, trainers, biomechanical experts, and in fact anyone who has a special interest in this area will find this dictionary useful.

  5. Dictionary of control technology. Pneumatics, hydraulics, electronics. English-German, German-English. Woerterbuch der Steuerungstechnik. Pneumatik, Hydraulik, Elektronik. Deutsch-Englisch, Englisch-Deutsch

    Energy Technology Data Exchange (ETDEWEB)

    Budd, F

    1988-01-01

    The English-German/German-English dictionary covers the complete field of control technology present in industry today. The subjects represent appropriate terms from hydraulics, pneumatics, electrical engineering, electronics, data processing, administration, and training. (DG).

  6. Dictionary construction and identification of possible adverse drug events in Danish clinical narrative text

    Science.gov (United States)

    Eriksson, Robert; Jensen, Peter Bjødstrup; Frankild, Sune; Jensen, Lars Juhl; Brunak, Søren

    2013-01-01

    Objective: Drugs have tremendous potential to cure and relieve disease, but the risk of unintended effects is always present. Healthcare providers increasingly record data in electronic patient records (EPRs), in which we aim to identify possible adverse events (AEs) and, specifically, possible adverse drug events (ADEs). Materials and methods: Based on the undesirable effects section from the summary of product characteristics (SPC) of 7446 drugs, we have built a Danish ADE dictionary. Starting from this dictionary we have developed a pipeline for identifying possible ADEs in unstructured clinical narrative text. We use a named entity recognition (NER) tagger to identify dictionary matches in the text and post-coordination rules to construct ADE compound terms. Finally, we apply post-processing rules and filters to handle, for example, negations and sentences about subjects other than the patient. Moreover, this method allows synonyms to be identified and anatomical location descriptions can be merged to allow appropriate grouping of effects in the same location. Results: The method identified 1 970 731 (35 477 unique) possible ADEs in a large corpus of 6011 psychiatric hospital patient records. Validation was performed through manual inspection of possible ADEs, resulting in precision of 89% and recall of 75%. Discussion: The presented dictionary-building method could be used to construct other ADE dictionaries. The complication of compound words in Germanic languages was addressed. Additionally, the synonym and anatomical location collapse improve the method. Conclusions: The developed dictionary and method can be used to identify possible ADEs in Danish clinical narratives. PMID:23703825
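
    The general idea of dictionary matching with a negation filter, followed by a precision/recall check, can be sketched as below. This is a toy illustration under stated assumptions, not the authors' pipeline: the English sample terms, the one-token negation rule, and the function names `find_matches` and `precision_recall` are all hypothetical, and the real system uses an NER tagger and post-coordination rules on Danish text.

```python
def find_matches(text, dictionary, negations=("no", "not", "without")):
    """Greedy longest-match lookup of dictionary terms in tokenized text.

    A match is dropped when the token directly before it is a negation
    word (a deliberately crude stand-in for real negation handling).
    """
    tokens = text.lower().replace(".", " ").replace(",", " ").split()
    max_len = max((len(term.split()) for term in dictionary), default=0)
    hits = []
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest candidate phrase first, then shorter ones.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            candidate = " ".join(tokens[i:i + n])
            if candidate in dictionary:
                negated = i > 0 and tokens[i - 1] in negations
                if not negated:
                    hits.append(candidate)
                i += n
                matched = True
                break
        if not matched:
            i += 1
    return hits


def precision_recall(predicted, gold):
    """Precision and recall of predicted terms against a gold-standard set."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


ade_terms = {"nausea", "dry mouth", "headache"}  # hypothetical sample dictionary
note = "Patient reports dry mouth and nausea, no headache."
found = find_matches(note, ade_terms)
print(found)
print(precision_recall(found, {"dry mouth", "nausea", "dizziness"}))
```

    In this toy run the negated term is filtered out, and recall falls below precision because the gold set contains a term missing from the dictionary, the same kind of gap that manual validation is meant to measure.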

  7. Metadata Dictionary Database: A Proposed Tool for Academic Library Metadata Management

    Science.gov (United States)

    Southwick, Silvia B.; Lampert, Cory

    2011-01-01

    This article proposes a metadata dictionary (MDD) be used as a tool for metadata management. The MDD is a repository of critical data necessary for managing metadata to create "shareable" digital collections. An operational definition of metadata management is provided. The authors explore activities involved in metadata management in…

  8. Dictionary of microelectronics and microcomputer technology. Woerterbuch der Mikroelektronik und Mikrorechnertechnik

    Energy Technology Data Exchange (ETDEWEB)

    Attiyate, Y H; Shah, R R

    1984-01-01

    This bilingual dictionary (German-English and English-German) is to give the general public a clearer idea of the terminology of microelectronics, microcomputers, data processing, and computer science. Each part contains about 7500 terms frequently encountered in practice, about 2000 of which are supplemented by precise explanations.

  9. The Effect of Lexicographical Information Costs on Dictionary Making and Use

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2008-01-01

    ...distinction is proposed between two general types of lexicographical information costs. Firstly, search-related costs are the efforts required by the look-up activities users have to perform when consulting a dictionary to find access to the data they are searching for. It is argued that the access route, article...

  10. The Dictionary Unit for South African English. South African Concise Oxford Dictionary

    Directory of Open Access Journals (Sweden)

    Rajend Mesthrie

    2011-10-01

    The South African Concise Oxford Dictionary (henceforth SACOD) is a South African version of the Concise Oxford Dictionary, the first time that this particular hybrid has been prepared. It is testimony to the enduring success of the work of the Dictionary Unit for South African English at Rhodes University, headed by teams that included Jean and William Branford in the 1970s, Penny Silva in the 1990s and now, Kathryn Kavanagh. The lexicographical work from the unit saw the publication of four editions of the Dictionary of Southern African English (1978, 1980, 1987, 1991), a South African Pocket Oxford Dictionary (SAPOD) and the Dictionary of South African English on Historical Principles (DOSAEHP) (1995). SACOD differs from the rest in several ways. It is larger in scope than SAPOD, smaller than DOSAEHP, and unlike DOSAE and DOSAEHP, does not deal with South African words alone. Based on the 10th edition of the Concise Oxford Dictionary, SACOD has excised some words from the parent, whilst adding many new words of general English as well as of South Africa.

  11. Compiling a Monolingual Dictionary for Native Speakers

    Directory of Open Access Journals (Sweden)

    Patrick Hanks

    2011-10-01

    ABSTRACT: This article gives a survey of the main issues confronting the compilers of monolingual dictionaries in the age of the Internet. Among others, it discusses the relationship between a lexical database and a monolingual dictionary, the role of corpus evidence, historical principles in lexicography vs. synchronic principles, the instability of word meaning, the need for full vocabulary coverage, principles of definition writing, the role of dictionaries in society, and the need for dictionaries to give guidance on matters of disputed word usage. It concludes with some questions about the future of dictionary publishing.

    SUMMARY (translated from the Afrikaans): Compiling a monolingual dictionary for native speakers. This article gives an overview of the main issues that the compilers of monolingual dictionaries have to contend with in the age of the Internet. It discusses, among other things, the relationship between a lexical database and a monolingual dictionary, the role of corpus evidence, historical versus synchronic principles in lexicography, the instability of word meaning, the need for full vocabulary coverage, principles of definition writing, the role of dictionaries in society, and the need for dictionaries to give guidance on matters of disputed word usage. It concludes with a number of questions about the future of dictionary publishing.

    Keywords: MONOLINGUAL DICTIONARIES, LEXICAL DATABASE, DICTIONARY STRUCTURE, WORD MEANING, MEANING CHANGE, USAGE, USAGE NOTES, HISTORICAL PRINCIPLES OF LEXICOGRAPHY, SYNCHRONIC PRINCIPLES OF LEXICOGRAPHY, REGISTER, SLANG, STANDARD ENGLISH, VOCABULARY COVERAGE, CONSISTENCY OF SETS, PHRASEOLOGY, SYNTAGMATIC PATTERNS, PROBLEMS OF COMPOSITIONALITY, LINGUISTIC PRESCRIPTIVISM, LEXICAL EVIDENCE

  12. ENGLISH JURIDICAL TERMINOLOGY IN INDIAN DICTIONARIES

    Directory of Open Access Journals (Sweden)

    Bytko Nataliya Sergeevna

    2015-03-01

    The lexicography of India has a long and fruitful history, with Sanskrit being the central object of lexicographic description. However, the development of the linguistic and sociocultural situation in the region brought other languages, including English, to the attention of lexicographers. In this article, one of the first dictionaries representing the use of English in India, A Glossary of Judicial and Revenue Terms and Useful Words, is studied. The methodology consists in a complex lexicographic analysis of the dictionary in the context of the linguistic and sociocultural situation in India. The research revealed a correlation between the macro- and micro-parameters of the dictionary and the linguistic and sociocultural situation of the time. At the macro-level the correlation manifests itself in the fact that the dictionary parameters and the content were determined by the board of directors of the East India Company; the board's recommendations were based on the necessities of the Company's employees around the Raj territory. The exigency of better understanding of the terms' cultural components required the substantial use of encyclopedic information. As a result, the dictionary's typological characteristics were changed. At the micro-level the correlation reveals itself in the unification of the entries' orthography and in the combination of alphabetical and net word ordering.

  13. Terms in the Language of Culture-Dependent LSP Dictionaries

    DEFF Research Database (Denmark)

    Bergenholtz, Henning; Nielsen, Sandro

    2003-01-01

    Many dictionaries covering the cultural and social sciences are monolingual, but bi- and polylingual dictionaries should also be included in this category, in particular bilingual dictionaries with a monolingual dimension. These are culture-dependent dictionaries that can only be analysed...... and designed with a view to their genuine purpose. First, this concerns the components of the dictionary, as the inclusion of an encyclopaedic section may facilitate cross-references from individual articles to a systematic and general presentation of a specialist field. Second, it concerns the lemma selection...

  14. Effective Look-up Techniques to Approach a Monolingual Dictionary

    OpenAIRE

    Nauman Al Amin Ali El Sayed; Ahmed Gumaa Siddiek

    2013-01-01

    A dictionary is a learning tool that can help the language learner acquire great knowledge of and about a foreign language. Almost all language learners buy or at least possess, at one time, a monolingual or bilingual dictionary, to which the learner may refer to look up the meaning of words. Unfortunately, using a dictionary to look up the meaning of words seems to be the most important service which a dictionary is expected to provide to language learners. In fact, a dictionary provid...

  15. The Dictionary Unit for South African English. South African Concise Oxford Dictionary

    OpenAIRE

    Rajend Mesthrie

    2011-01-01

    The South African Concise Oxford Dictionary (henceforth SACOD) is a South African version of the Concise Oxford Dictionary, the first time that this particular hybrid has been prepared. It is testimony to the enduring success of the work of the Dictionary Unit for South African English at Rhodes University, headed by teams that included Jean and William Branford in the 1970s, Penny Silva in the 1990s and now, Kathryn Kavanagh. The lexicographical work from the unit saw the publication of fou...

  16. Coupled dictionary learning for joint MR image restoration and segmentation

    Science.gov (United States)

    Yang, Xuesong; Fan, Yong

    2018-03-01

    To achieve better segmentation of MR images, image restoration is typically used as a preprocessing step, especially for low-quality MR images. Recent studies have demonstrated that dictionary learning methods could achieve promising performance for both image restoration and image segmentation. These methods typically learn paired dictionaries of image patches from different sources and use a common sparse representation to characterize paired image patches, such as low-quality image patches and their corresponding high quality counterparts for the image restoration, and image patches and their corresponding segmentation labels for the image segmentation. Since learning these dictionaries jointly in a unified framework may improve the image restoration and segmentation simultaneously, we propose a coupled dictionary learning method to concurrently learn dictionaries for joint image restoration and image segmentation based on sparse representations in a multi-atlas image segmentation framework. Particularly, three dictionaries, including a dictionary of low quality image patches, a dictionary of high quality image patches, and a dictionary of segmentation label patches, are learned in a unified framework so that the learned dictionaries of image restoration and segmentation can benefit each other. Our method has been evaluated for segmenting the hippocampus in MR T1 images collected with scanners of different magnetic field strengths. The experimental results have demonstrated that our method achieved better image restoration and segmentation performance than state of the art dictionary learning and sparse representation based image restoration and image segmentation methods.

  17. The ideal number of lemmas in an ideal accounting dictionary

    DEFF Research Database (Denmark)

    Bergenholtz, Henning; Norddahl, Bjarni

    2014-01-01

    no clear rules or methods to avoid lemma flooding. Now we will try the same kind of analysis of log files for the English-Danish and the Danish-English Accounting Dictionaries. We see here that there are differences between different dictionaries (monolingual for English and Danish and bilingual......Lemma lacunas in dictionaries are a traditional focus area for lexicographers, but the opposite problem, which we choose to call lemma flooding, has received very little attention. The study of this flooding could be relevant in order to save lexicographers spending thousands of hours producing...... dictionary entries which nobody reads. In Bergenholtz/Norddahl (2012) we showed that during a three-year period less than 33% of all dictionary articles out of 18 million dictionary consultations were consulted in a dictionary with 111.000 entries. We examined nine possible reasons why a given word might...

  18. Dictionary-driven protein annotation.

    Science.gov (United States)

    Rigoutsos, Isidore; Huynh, Tien; Floratos, Aris; Parida, Laxmi; Platt, Daniel

    2002-09-01

    Computational methods seeking to automatically determine the properties (functional, structural, physicochemical, etc.) of a protein directly from the sequence have long been the focus of numerous research groups. With the advent of advanced sequencing methods and systems, the number of amino acid sequences that are being deposited in the public databases has been increasing steadily. This has in turn generated a renewed demand for automated approaches that can annotate individual sequences and complete genomes quickly, exhaustively and objectively. In this paper, we present one such approach that is centered around and exploits the Bio-Dictionary, a collection of amino acid patterns that completely covers the natural sequence space and can capture functional and structural signals that have been reused during evolution, within and across protein families. Our annotation approach also makes use of a weighted, position-specific scoring scheme that is unaffected by the over-representation of well-conserved proteins and protein fragments in the databases used. For a given query sequence, the method permits one to determine, in a single pass, the following: local and global similarities between the query and any protein already present in a public database; the likeness of the query to all available archaeal/ bacterial/eukaryotic/viral sequences in the database as a function of amino acid position within the query; the character of secondary structure of the query as a function of amino acid position within the query; the cytoplasmic, transmembrane or extracellular behavior of the query; the nature and position of binding domains, active sites, post-translationally modified sites, signal peptides, etc. In terms of performance, the proposed method is exhaustive, objective and allows for the rapid annotation of individual sequences and full genomes. Annotation examples are presented and discussed in Results, including individual queries and complete genomes that were

  19. CHEMICAL EFFECTS IN BIOLOGICAL SYSTEMS – DATA DICTIONARY (CEBS-DD): A COMPENDIUM OF TERMS FOR THE CAPTURE AND INTEGRATION OF BIOLOGICAL STUDY DESIGN DESCRIPTION, CONVENTIONAL PHENOTYPES AND ‘OMICS’ DATA

    Science.gov (United States)

    A critical component in the design of the Chemical Effects in Biological Systems (CEBS) Knowledgebase is a strategy to capture toxicogenomics study protocols and the toxicity endpoint data (clinical pathology and histopathology). A Study is generally an experiment carried out du...

  20. DOLPHIn—Dictionary Learning for Phase Retrieval

    Science.gov (United States)

    Tillmann, Andreas M.; Eldar, Yonina C.; Mairal, Julien

    2016-12-01

    We propose a new algorithm to learn a dictionary for reconstructing and sparsely encoding signals from measurements without phase. Specifically, we consider the task of estimating a two-dimensional image from squared-magnitude measurements of a complex-valued linear transformation of the original image. Several recent phase retrieval algorithms exploit underlying sparsity of the unknown signal in order to improve recovery performance. In this work, we consider such a sparse signal prior in the context of phase retrieval, when the sparsifying dictionary is not known in advance. Our algorithm jointly reconstructs the unknown signal - possibly corrupted by noise - and learns a dictionary such that each patch of the estimated image can be sparsely represented. Numerical experiments demonstrate that our approach can obtain significantly better reconstructions for phase retrieval problems with noise than methods that cannot exploit such "hidden" sparsity. Moreover, on the theoretical side, we provide a convergence result for our method.
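
    As a rough illustration of the measurement model described in this record (squared magnitudes of a complex-valued linear transformation), the sketch below shows why the phase is lost: a signal and its sign-flipped copy yield identical measurements. It is not the DOLPHIn algorithm. The unitary DFT matrix, the 1-D toy signal (the abstract considers 2-D images), and the function names are assumptions made for this example only.

```python
import cmath

def dft_matrix(n):
    """Unitary DFT matrix, used here as an assumed stand-in for the
    complex-valued linear transformation mentioned in the abstract."""
    scale = 1.0 / n ** 0.5
    return [[scale * cmath.exp(-2j * cmath.pi * r * c / n) for c in range(n)]
            for r in range(n)]

def squared_magnitudes(a, x):
    """Phaseless measurements y_r = |sum_c a[r][c] * x[c]|^2."""
    return [abs(sum(a_rc * x_c for a_rc, x_c in zip(row, x))) ** 2 for row in a]

# Hypothetical toy signal (1-D for brevity).
x = [1.0, 0.0, 2.0, -1.0]
A = dft_matrix(len(x))

y = squared_magnitudes(A, x)
y_flipped = squared_magnitudes(A, [-v for v in x])

# x and -x are indistinguishable from the measurements alone, which is why
# phase retrieval needs extra structure such as a sparsity prior.
same = all(abs(p - q) < 1e-12 for p, q in zip(y, y_flipped))
```

    The ambiguity shown here is the generic one for phaseless measurements; how the learned dictionary is used to resolve it is described in the paper itself, not in this sketch.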

  1. Dictionary of pressure vessel and piping technology

    International Nuclear Information System (INIS)

    Schmitz, H.P.

    1987-01-01

    This dictionary is the result of many years of evaluation of technical terminology taken from the salient non-German rules, regulations, standards and specifications such as ANSI, API, ASME, ASNT, ASTM, BSI, EJMA, TEMA, and WRC (see bibliography) and of comparing these with the corresponding German rules, regulations, etc., as well as examining relevant technical documentation. This dictionary fills the gap left by existing dictionaries. The following specialized factors are given special attention: pressure vessels, tanks, heat exchangers, piping, valves and fittings, expansion joints, flanges, giving particular consideration to the fields of materials, welding, strength calculation, design and construction, fracture mechanics, destructive and non-destructive testing, as well as heat and mass transfer.

  2. Dictionary Indexing of Electron Channeling Patterns.

    Science.gov (United States)

    Singh, Saransh; De Graef, Marc

    2017-02-01

    The dictionary-based approach to the indexing of diffraction patterns is applied to electron channeling patterns (ECPs). The main ingredients of the dictionary method are introduced, including the generalized forward projector (GFP), the relevant detector model, and a scheme to uniformly sample orientation space using the "cubochoric" representation. The GFP is used to compute an ECP "master" pattern. Derivative free optimization algorithms, including the Nelder-Mead simplex and the bound optimization by quadratic approximation are used to determine the correct detector parameters and to refine the orientation obtained from the dictionary approach. The indexing method is applied to poly-silicon and shows excellent agreement with the calibrated values. Finally, it is shown that the method results in a mean disorientation error of 1.0° with 0.5° SD for a range of detector parameters.

  3. Pharos Woordeboeke / Dictionaries 5 in 1

    Directory of Open Access Journals (Sweden)

    Phillip Louw

    2011-10-01

    In 2000 a product from Pharos appeared that radically changed the landscape of Afrikaans lexicography in particular. Pharos Woordeboeke/Dictionaries 5 in 1 (henceforth Pharos 5 in 1) offers, for the first time, an integrated digital library of dictionaries with Afrikaans as a treated language, which can also be linked to a larger digital library on the Logos platform. For the first time the user is offered a commercially successful translation dictionary on CD-ROM with Afrikaans and English as the treated language pair, namely Groot Woordeboek/Major Dictionary, supported by Pharos's Nuwe Woorde/New Words and Tweetalige Frasewoordeboek/Bilingual Phrase Dictionary. Afrikaans explanatory lexicography also benefits from a supplement to the ELHAT in the form of the Verklarende Afrikaanse Woordeboek. Finally, the Groot Tesourus van Afrikaans is made available to round off the library.

  4. Nuclear engineering dictionary. Woerterbuch Kerntechnik

    Energy Technology Data Exchange (ETDEWEB)

    Sube, R

    1985-01-01

    This dictionary treats the subject field of nuclear engineering as a field of applied nuclear physics: industrial and other applications of nuclear energy, isotopes and ionizing radiation, and their scientific-technical bases. Emphasis is placed on the terminology of the nuclear fuel cycle. Other applications of nuclear energy include military applications, nuclear fusion technology, and plasma physics, as well as methods and equipment of isotope and radiation technology, without the aspects of biological applications. High-energy physics is also excluded. The terminology presented primarily covers general and basic concepts; special terms have been included as far as available and ascertainable in all four languages. For the selection of terms, numerous textbooks and monographs have been searched and compared, as well as various subject-related journals which have been regularly scanned for years. Standards have been a main source of information, e.g. the international standards of the IAEA (including the INIS terminology), of the ISO, of the COMECON, and of the World Energy Conference and the IEC. Numerous national standards have been evaluated in search of definitions and designations. Users will appreciate the introduction of subject-field codes indicating the main field of usage of a term. Explanations and other hints are numerous and extensive in order to clearly distinguish the chosen terms from other, similar terms, and in order to show homonyms.

  5. On the Timelessness of Music Dictionaries

    Directory of Open Access Journals (Sweden)

    Henning Bergenholtz

    2011-10-01

    Abstract: A music dictionary for the Internet fulfils the same functions as printed music dictionaries. An earlier music dictionary is as useful as a new one if its information is correct. But the fact that an Internet dictionary can at any time be corrected according to modern practices makes it, if not timeless, at least more up to date. Furthermore, the possibilities of illustrating with picture and sound open a wide field of usefulness. Nevertheless the lexicographer has to be aware of the different needs of different user types in different user situations. The dictionary being discussed, Musikordbogen, has been designed for text reception rather than translation or text production. After the inception of the dictionary has been described, the way the possibilities of the Internet have influenced the concept and the content of the articles and the outer texts is discussed.

    Keywords: MUSIC DICTIONARY, DICTIONARY FUNCTIONS, SPECIALIZED LEXICOGRAPHY, INTERNET LEXICOGRAPHY, TRANSLATION, TEXT RECEPTION, TEXT PRODUCTION, MUSICAL KNOWLEDGE

    Summary (translated from the Afrikaans): On the timelessness of music dictionaries. A music dictionary for the Internet fulfils the same function as printed music dictionaries. An earlier music dictionary is just as useful as a new one if its information is correct. But the fact that an Internet dictionary can be corrected at any time according to modern practices makes it, if not timeless, at least more up to date. Furthermore, the possibilities of illustration with picture and sound open a wide field of usefulness. Nevertheless the lexicographer must be aware of the different needs of different user types in different user situations. The dictionary under discussion, Musikordbogen, was planned for text reception rather than translation or text production. After the origin of the dictionary has been described, the way in which the possibilities of the Internet have influenced the design and content of the articles and

  6. How to do language policy with dictionaries

    DEFF Research Database (Denmark)

    Bergenholtz, Henning

    2006-01-01

    The lexicographic presentation of terms from the field of language planning often lacks a clear and unambiguous distinction and a proper explanation. Too often dictionaries even fail to include these terms in the lemma list and some central terms have not been treated in any general or special...... and the motivation for the introduction of the notion of a communication policy should help lexicographers to give a more comprehensive account of terms from this field and it will also benefit scholars in the field of language policy. A second aspect of this paper is the discussion of ways in which dictionaries...

  7. Non-classical continuum mechanics a dictionary

    CERN Document Server

    Maugin, Gérard A

    2017-01-01

    This dictionary offers clear and reliable explanations of over 100 keywords covering the entire field of non-classical continuum mechanics and generalized mechanics, including the theory of elasticity, heat conduction, thermodynamic and electromagnetic continua, as well as applied mathematics. Every entry includes the historical background and the underlying theory, basic equations and typical applications. The reference list for each entry provides a link to the original articles and the most important in-depth theoretical works. Last but not least, every entry is followed by a cross-reference to other related subject entries in the dictionary.

  8. Dictionary of pressure vessel and piping technology

    International Nuclear Information System (INIS)

    Jentgen, L.; Schmitz, H.P.

    1986-01-01

    A specialised dictionary has been compiled containing the appropriate English and German terms in the following technical fields: materials science, welding, destructive and non-destructive testing, thermal and mass transfer, the design and construction in particular of pressure vessels, tanks, heat exchangers, piping, expansion joints, valves, and components associated with the above fields. This dictionary is the result of many years spent in evaluating technical terminology from the relevant American and British regulations, technical rules, standards, and specifications (see bibliography) and correlating these with the terminology of comparable German regulations, rules and standards, together with the essential technical literature.

  9. The Routledge Dictionary of English Language Studies

    CERN Document Server

    Pearce, Michael

    2012-01-01

    Filled with real examples of the way people use English in different contexts, The Routledge Dictionary of English Language Studies is an indispensable guide to the richness and variety of the English language for both students and the general reader. From abbreviation to zero-article, via fricative and slang, the Dictionary contains over 600 wide-ranging and informative entries covering: the core areas of language description and analysis (phonetics and phonology, grammar, lexis, semantics, pragmatics and discourse); sociolinguistics, including entries on social and regional variation, stylistic v

  10. Form, its meaning, and dictionary entries

    Directory of Open Access Journals (Sweden)

    Violetta Koseska-Toszewa

    2015-11-01

    It is worth stressing that distinguishing between the form and its meaning in comparing the material of 6 languages belonging to three different groups of Slavic languages (as is the case in the MONDILEX Project) will allow us to avoid numerous substantive mistakes and erroneous conclusions. Hence dictionary entries should be verified and made uniform in that respect before they are “digitalized”... Distinction between the form and its meaning in a dictionary entry is fully possible, as shown by the works of Z. Saloni (2002) and A. Przepiórkowski (2008).

  11. Lower Bounds for External Memory Dictionaries

    DEFF Research Database (Denmark)

    Brodal, Gerth Stølting; Fagerberg, Rolf

    2003-01-01

    We study trade-offs between the update time and the query time for comparison based external memory dictionaries. The main contributions of this paper are two lower bound trade-offs between the I/O complexity of member queries and insertions: If N < M insertions perform at most δ · N/B I/Os, then...

  12. Multilingual Aeronautical Dictionary (Dictionnaire Aeronautique Multilingue)

    Science.gov (United States)

    1980-01-01

    See ’aerofoil profile’ DE Bord’Boden-Funkverkehr (ili 20~ AGARD MULTILINGUAL AERONAUTICAL DICTIONARY 10318 air mileage indicator (AMI) ES comunicacion ...Autogenschweissen (nil ES sistema fml autom~tico de comunicacion NE automatische besturing ES soldadura MI autdgena aire-tierra P otooWatmtc FR soudage Wm autogene...AERONAUTICAL DICTIONARY DE Fernmeldesatellit [m) RU 1. maPXWbPOBK& ff1 OTcOKOB RU onPe~ene~me Wn Aesma84HN Komnaca ES satelite Wm do comunicaciones 2

  13. An efficient dictionary learning algorithm and its application to 3-D medical image denoising.

    Science.gov (United States)

    Li, Shutao; Fang, Leyuan; Yin, Haitao

    2012-02-01

    In this paper, we propose an efficient dictionary learning algorithm for sparse representation of given data and suggest a way to apply this algorithm to 3-D medical image denoising. Our learning approach is composed of two main parts: sparse coding and dictionary updating. In the sparse coding stage, an efficient algorithm named multiple clusters pursuit (MCP) is proposed. The MCP first applies a dictionary structuring strategy to cluster the atoms with high coherence together, and then employs a multiple-selection strategy to select several competitive atoms at each iteration. These two strategies can greatly reduce the computational complexity of the MCP and help it obtain a better sparse solution. In the dictionary updating stage, an alternating optimization that efficiently approximates the singular value decomposition is introduced. Furthermore, in the 3-D medical image denoising application, a joint 3-D operation is proposed that uses the learning capabilities of the presented algorithm to simultaneously capture the correlations within each slice and the correlations across nearby slices, thereby obtaining better denoising results. The experiments on both synthetically generated data and real 3-D medical images demonstrate that the proposed approach has superior performance compared to some well-known methods. © 2011 IEEE

  14. A Weighted Two-Level Bregman Method with Dictionary Updating for Nonconvex MR Image Reconstruction

    Directory of Open Access Journals (Sweden)

    Qiegen Liu

    2014-01-01

    Nonconvex optimization has shown that it needs substantially fewer measurements than l1 minimization for exact recovery under fixed transform/overcomplete dictionary. In this work, two efficient numerical algorithms which are unified by the method named weighted two-level Bregman method with dictionary updating (WTBMDU) are proposed for solving lp optimization under the dictionary learning model and subjecting the fidelity to the partial measurements. By incorporating the iteratively reweighted norm into the two-level Bregman iteration method with dictionary updating scheme (TBMDU), the modified alternating direction method (ADM) solves the model of pursuing the approximated lp-norm penalty efficiently. Specifically, the algorithms converge after a relatively small number of iterations, under the formulation of iteratively reweighted l1 and l2 minimization. Experimental results on MR image simulations and real MR data, under a variety of sampling trajectories and acceleration factors, consistently demonstrate that the proposed method can efficiently reconstruct MR images from highly undersampled k-space data and presents advantages over the current state-of-the-art reconstruction approaches, in terms of higher PSNR and lower HFEN values.

  15. The Effectiveness of Using Contextual Clues, Dictionary Strategy and Computer Assisted Language Learning (CALL) in Learning Vocabulary

    Directory of Open Access Journals (Sweden)

    Zuraina Ali

    2013-07-01

    This study investigates the effectiveness of three vocabulary learning methods, namely Contextual Clues, Dictionary Strategy, and Computer Assisted Language Learning (CALL), in learning vocabulary among ESL learners. First, it aims at finding which of the three methods results in the highest number of words learnt in the immediate and delayed recall tests. Second, it compares the results of the Pre-test and the Delayed Recall Post-test to determine the differences in learning vocabulary using the methods. A quasi-experiment that tested the effectiveness of learning vocabulary using Dictionary Strategy, Contextual Clues, and CALL involved 123 first-year university students. Qualitative procedures included the collection of data from interviews, which were conducted to triangulate the data obtained from the quantitative inquiries. Findings from the study using ANOVA revealed that there were significant differences when students were exposed to Dictionary Strategy, Contextual Clues and CALL in the immediate recall tests but not in the Delayed Recall Post-test. Also, there were significant differences when a t-test was used to compare the scores between the Pre-test and the Delayed Recall Post-test in using the three methods of vocabulary learning. Although many researchers have advocated the relative effectiveness of Dictionary Strategy, Contextual Clues, and CALL in learning vocabulary, the study is still important since no study has ever empirically investigated the relative efficacy of these three methods in a single study.

  16. On the Application of Joint-Domain Dictionary Mapping for Multiple Power Disturbance Assessment

    Directory of Open Access Journals (Sweden)

    Delong Cai

    2018-02-01

    This paper proposes a joint-domain dictionary mapping method to obtain high assessment accuracy of multiple power disturbances. Firstly, in order to achieve resolutions in both the time and frequency domains, a joint-domain dictionary is proposed which consists of a discrete Hartley base and an identity matrix. Due to the low correlation between the discrete Hartley base and the identity matrix, the joint-domain dictionary mapping can separately capture the approximations of the sinusoidal components and transients. Since the mapping coefficients contain the physical quantities, the eigenvalues of each component can be effectively estimated. A quantified eigenvalue classifier was designed for identifying power disturbances using the estimated eigenvalues. The proposed method was compared with several advanced methods through simulated power disturbances under different noise conditions, and actual data from the Institute of Electrical and Electronics Engineers Power and Energy Society database. The results reveal that the joint-domain dictionary mapping technique shows good performance on parameter estimation and recognition precision, even when dealing with complicated multiple power disturbances.
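
    The dictionary structure named in this record (a discrete Hartley basis next to an identity matrix) can be sketched as follows. This is only a minimal construction under stated assumptions, not the authors' mapping method: the orthonormal scaling, the function names, and the size n = 64 are choices made for this example. It builds D = [H | I] and measures the largest inner product between a Hartley atom and an identity atom, which is one way to read the "low correlation" the abstract refers to.

```python
import math

def hartley_basis(n):
    """Orthonormal discrete Hartley basis: H[i][k] = cas(2*pi*i*k/n) / sqrt(n),
    where cas(t) = cos(t) + sin(t)."""
    scale = 1.0 / math.sqrt(n)
    return [[scale * (math.cos(2 * math.pi * i * k / n)
                      + math.sin(2 * math.pi * i * k / n))
             for k in range(n)]
            for i in range(n)]

def joint_dictionary(n):
    """Concatenate the Hartley basis with the identity: D = [H | I] (n rows, 2n columns)."""
    h = hartley_basis(n)
    return [h[i] + [1.0 if j == i else 0.0 for j in range(n)] for i in range(n)]

def mutual_coherence(n):
    """Largest |<h_k, e_j>| over Hartley atoms h_k and identity atoms e_j.
    Because e_j picks out a single sample, this is the largest |H[j][k]|."""
    h = hartley_basis(n)
    return max(abs(v) for row in h for v in row)

# Hypothetical size chosen for illustration.
n = 64
D = joint_dictionary(n)
coherence = mutual_coherence(n)  # bounded by sqrt(2 / n) since |cas(t)| <= sqrt(2)
```

    The bound sqrt(2 / n) shrinks as the window grows, so sinusoidal components (sparse in H) and short transients (sparse in I) are represented by nearly separate groups of atoms.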

  17. A tool to facilitate clinical biomarker studies - a tissue dictionary based on the Human Protein Atlas

    Directory of Open Access Journals (Sweden)

    Kampf Caroline

    2012-09-01

    The complexity of tissue and the alterations that distinguish normal from cancer remain a challenge for translating results from tumor biological studies into clinical medicine. This has generated an unmet need to exploit the findings from studies based on cell lines and model organisms to develop, validate and clinically apply novel diagnostic, prognostic and treatment predictive markers. As one step to meet this challenge, the Human Protein Atlas project has been set up to produce antibodies towards human protein targets corresponding to all human protein coding genes and to map protein expression in normal human tissues, cancer and cells. Here, we present a dictionary based on microscopy images created as an amendment to the Human Protein Atlas. The aim of the dictionary is to facilitate the interpretation and use of the image-based data available in the Human Protein Atlas, but also to serve as a tool for training and understanding tissue histology, pathology and cell biology. The dictionary contains three main parts, normal tissues, cancer tissues and cells, and is based on high-resolution images at different magnifications of full tissue sections stained with H & E. The cell atlas is centered on immunofluorescence and confocal microscopy images, using different color channels to highlight the organelle structure of a cell. Here, we explain how this dictionary can be used as a tool to aid clinicians and scientists in understanding the use of tissue histology and cancer pathology in diagnostics and biomarker studies.

  18. Building a protein name dictionary from full text: a machine learning term extraction approach

    Directory of Open Access Journals (Sweden)

    Campagne Fabien

    2005-04-01

    Full Text Available Background: The majority of information in the biological literature resides in full text articles, instead of abstracts. Yet, abstracts remain the focus of many publicly available literature data mining tools. Most literature mining tools rely on pre-existing lexicons of biological names, often extracted from curated gene or protein databases. This is a limitation, because such databases have low coverage of the many name variants which are used to refer to biological entities in the literature. Results: We present an approach to recognize named entities in full text. The approach collects high frequency terms in an article, and uses support vector machines (SVM) to identify biological entity names. It is also computationally efficient and robust to noise commonly found in full text material. We use the method to create a protein name dictionary from a set of 80,528 full text articles. Only 8.3% of the names in this dictionary match SwissProt description lines. We assess the quality of the dictionary by studying its protein name recognition performance in full text. Conclusion: This dictionary term lookup method compares favourably to other published methods, supporting the significance of our direct extraction approach. The method is strong in recognizing name variants not found in SwissProt.

  19. Nonlinear approximation with dictionaries I. Direct estimates

    DEFF Research Database (Denmark)

    Gribonval, Rémi; Nielsen, Morten

    2004-01-01

    We study various approximation classes associated with m-term approximation by elements from a (possibly) redundant dictionary in a Banach space. The standard approximation class associated with the best m-term approximation is compared to new classes defined by considering m-term approximation w...

  20. Gabonese French Dictionaries: Survey and Perspectives*

    African Journals Online (AJOL)

    Gabonese French Dictionaries: Survey and Perspectives. 257 ... be regarded, according to Gouws (2007: 314), as externally motivated products. .... Fair attention is given to different style or normative levels, i.e. .... differs enough from standard French to be regarded as a new emerging lan- ..... Tous les leaders des par-.

  1. Some Dictionary Descriptions of Grammatical Structure

    African Journals Online (AJOL)

    : ... cept of valency: ''Valency is the term used in dependency theory to refer to the ... were a major strength of Johnson's Dictionary, making it, like Oxford English ... this has to be gleaned from the illustrative quotations and is not spelled out.

  2. Meaning discrimination in bilingual Venda dictionaries | Mafela ...

    African Journals Online (AJOL)

    In most cases, the equivalents of the entry-words are provided without giving meaning discrimination. Without a good command of Venda and the provision of meaning discrimination, users will find it difficult to make a correct choice of the equivalent for which they are looking. Bilingual Venda dictionaries are therefore not ...

  3. Monolingual Dictionary Use in an EFL Context

    Science.gov (United States)

    Ali, Holi Ibrahim Holi

    2012-01-01

    Caledonian College of Engineering, Oman, has been encouraging its students to use monolingual dictionaries rather than bilingual or bilingualized ones in the classroom and during exams. This policy has been received with mixed feelings and attitudes. Therefore, this study strives to explore teachers' and students' attitudes about the use of…

  4. Dictionary of chemistry. English/German

    International Nuclear Information System (INIS)

    Wenske, G.

    1992-01-01

    This English/German dictionary covers more than 100,000 terms from chemistry, chemical engineering and related fields. It also contains molecular formulas, as well as numerous synonyms and areas of application. IUPAC terminology is emphasized, and outdated or rare terminology is indicated. (MM) [de]

  5. Bikol Dictionary. PALI Language Texts: Philippines.

    Science.gov (United States)

    Mintz, Malcolm W.

    The Bikol language of the Philippines, spoken in the southernmost peninsula of Luzon Island and extending into the island provinces of Catanduanes and Masbate, is presented in this bilingual dictionary. An introduction explains the Bikol alphabet, orthographic representation (including policies adopted in writing Spanish and English loan words),…

  6. Methods in Lexicography and Dictionary Research

    African Journals Online (AJOL)

    Access Structures in Printed Dictionaries. Gouws,. Rufus H., Ulrich Heid, Wolfgang Schweickard and Herbert Ernst Wiegand (Eds.). 2013: 110-149. Wiegand, Herbert Ernst and Mª Teresa Fuentes Morán. 2010. Estructuras lexicográficas. Aspectos centrales de una teoría de la forma del diccionario. Colección Lexicografía 2.

  7. Culinary Arts Dictionary 1. Project HIRE.

    Science.gov (United States)

    Gardner, David C.; And Others

    Designed as supplemental material to on-going instruction in the vocational program, this first of three picture dictionary booklets in the Culinary Arts series is intended to assist the learning handicapped student to master the core vocabulary taught in the trade. Intended for individual or small group instruction with minimal supervision, this…

  8. Culinary Arts Dictionary 3. Project HIRE.

    Science.gov (United States)

    Gardner, David C.; And Others

    Designed as supplemental material to on-going instruction in the vocational program, this third of three picture dictionary booklets in the Culinary Arts series is intended to assist the learning handicapped student to master the core vocabulary taught in the trade. Intended for individual or small group instruction with minimal supervision, this…

  9. Culinary Arts Dictionary 2. Project HIRE.

    Science.gov (United States)

    Gardner, David C.; And Others

    Designed as supplemental material to on-going instruction in the vocational program, this second of three picture dictionary booklets in the Culinary Arts series is intended to assist the learning handicapped student to master the core vocabulary taught in the trade. Intended for individual or small group instruction with minimal supervision, this…

  10. Bilingual Dictionary and Meaning Discrimination in Venda*

    African Journals Online (AJOL)

    Riette Ruthven

    The translation equivalents of entry-words in a bilingual dictionary are usually of two types, i.e. translational and explanatory. A translational equivalent is a lexical unit which can immediately be ... to express themselves in or translate into the foreign language. Venda, one of the languages which were previously ...

  11. Categorising Example Sentences in Dictionaries for Research ...

    African Journals Online (AJOL)

    able contextual or grammatical support. I have constructed a table to classify example sentences according to different criteria. I filled in this table with randomly selected words and their examples which have been taken from five different South African school dictionaries. The goal of this research is to present characteristics ...

  12. Some Dictionary Descriptions of Grammatical Structure | Branford ...

    African Journals Online (AJOL)

    This paper examines some points in the treatment of grammatical structure in four recent dictionaries of English as L1. These are viewed against the background concepts of "lexicogrammar" (Halliday 1978) and of the interdependence of lexicographical and syntactic descriptions of language. Its scope is necessarily ...

  13. Dictionary of Large Hadron Collider signatures

    Indian Academy of Sciences (India)

    We report on a plan to establish a 'Dictionary of LHC Signatures', an initiative that started at the WHEPP-X workshop in Chennai, January 2008. This study aims at the strategy of distinguishing 3 classes of dark matter motivated scenarios such as R-parity conserved supersymmetry, little Higgs models with T-parity ...

  14. Bilingual Dictionaries and Communicative Equivalence for a ...

    African Journals Online (AJOL)

    This implies that a bilingual dictionary becomes a poly functional instrument, presenting more information than just translation equivalents. ... With the emphasis on the user perspective, metalexicographical criteria are used to investigate problems regarding the access structure and the addressing procedures in Afrikaans ...

  15. Comparative Eskimo Dictionary with Aleut Cognates.

    Science.gov (United States)

    Fortescue, Michael, Ed.; And Others

    This dictionary covers 10 Eskimo dialects (Alutiiq, Central Alaskan Yupik, Naukan, Central Siberian Yupik, Sirenik, Seward Peninsula Inuit, North Alaskan Inuit, Western Canadian Inuit, Eastern Canadian Inuit, Greenlandic Inuit). An introductory section details the classification of languages and dialects and their phonologies, and discusses the…

  16. Children's Dictionary of Occupations. Third Edition.

    Science.gov (United States)

    Parramore, Barbara M.; Hopke, William E.; Drier, Harry N.

    About 300 job titles are listed and defined with an illustration of a child working in each one in this specialized dictionary. Approximate phonetic pronunciations are given. Both girls and boys of various racial or ethnic backgrounds are used in the illustrations. Discussion of the world of work, getting a job, kinds of jobs, careers, and the…

  17. Many general language dictionaries contain specialized terms

    African Journals Online (AJOL)

    used in England and Wales in the light of the change of structure of and terminology ... Legal Terms in General Dictionaries of English: The Civil Procedure Mystery ... (2015: 8), between 1.4 million and 2.1 million cases annually were brought.

  18. Book Review: Public Administration Dictionary | Marais | Lexikos

    African Journals Online (AJOL)

    Book Title: Public Administration Dictionary. Book Author: William Fox & Ivan H. Meyer. 1995. viii + 139 pp. ISBN 0 70213219 5. Juta.

  19. SYNONYMS IN GERMAN ONLINE MONOLINGUAL DICTIONARIES

    Directory of Open Access Journals (Sweden)

    Paloma Sánchez Hernández

    2017-03-01

    Full Text Available This study includes both theoretical and qualitative research and falls within the framework of semantics and lexicography. It is based on work conducted as a part of the COMBIDIGILEX research project: MINECO-FEDER FFI2015-64476-P. The lexicographical description proposed in the COMBIDIGILEX project is based on the foundations of bilingual lexicography from an onomasiological perspective, including paradigmatic information and syntagmatic analysis, which is useful to users creating texts for students at an advanced level. The project analyses verbal lexemes in German and Spanish based on a paradigmatic, syntagmatic, orthographic and morphological perspective (among others). Subsequently, a contrastive analysis was conducted between both languages. In this contribution, we first analyse what paradigmatic information is, including its relevance to a dictionary. Paradigmatic information includes not only synonyms and antonyms but also hyperonyms and hyponyms, which often complete the lexicographical article in a general dictionary. Paradigmatic relations can be observed in light of semantic definitions or may independently become part of the lexical entry. Forming the paradigmatic information of an entry in an independent manner is known as “intentionelle Paradigmatik”, and it offers a series of advantages in the dictionary (Hausmann 1991b: 2794). This type of information aids the processes of production and expands vocabulary. Next, we examine the appearance of synonyms in three German online monolingual dictionaries (DWDS, WORTSCHATZLEXIKON and DUDEN ONLINE) from the semantic perspective of cognition verbs. The primary objective of the study is to demonstrate the relevance of this type of information as well as the needs it covers from a user’s perspective. Offering the user a series of lexical elements along with information on semantic relations of a paradigmatic nature thus addresses the issue of users having an array of

  20. Convolutional Dictionary Learning: Acceleration and Convergence

    Science.gov (United States)

    Chun, Il Yong; Fessler, Jeffrey A.

    2018-04-01

    Convolutional dictionary learning (CDL or sparsifying CDL) has many applications in image processing and computer vision. There has been growing interest in developing efficient algorithms for CDL, mostly relying on the augmented Lagrangian (AL) method or its variant, the alternating direction method of multipliers (ADMM). When their parameters are properly tuned, AL methods have shown fast convergence in CDL. However, the parameter tuning process is not trivial due to its data dependence and, in practice, the convergence of AL methods depends on the AL parameters for nonconvex CDL problems. To moderate these problems, this paper proposes a new practically feasible and convergent Block Proximal Gradient method using a Majorizer (BPG-M) for CDL. The BPG-M-based CDL is investigated with different block updating schemes and majorization matrix designs, and further accelerated by incorporating some momentum coefficient formulas and restarting techniques. All of the methods investigated incorporate a boundary artifacts removal (or, more generally, sampling) operator in the learning model. Numerical experiments show that, without needing any parameter tuning process, the proposed BPG-M approach converges more stably to desirable solutions of lower objective values than the existing state-of-the-art ADMM algorithm and its memory-efficient variant do. Compared to the ADMM approaches, the BPG-M method using a multi-block updating scheme is particularly useful in a single-threaded CDL algorithm handling large datasets, due to its lower memory requirement and no polynomial computational complexity. Image denoising experiments show that, for relatively strong additive white Gaussian noise, the filters learned by BPG-M-based CDL outperform those trained by the ADMM approach.

  1. The Ideology of the Perfect Dictionary: How Efficient Can a Dictionary Be?

    Directory of Open Access Journals (Sweden)

    Michaël Abecassis

    2011-10-01

    Full Text Available

    Abstract: Dictionaries have become essential tools of the modern world. Not only have dictionary sales dramatically increased, but the variety of dictionaries and the competition between editors are also very much on the rise. Monolingual dictionaries attract native speakers for several reasons. Some wish to capture the subtleties of their own language, others to speak the 'standard' language, and an 'ideologically' politically correct variety devoid of colloquialisms, hence the crucial role played by style labels. Furthermore, a large number of word enthusiasts enjoy linguistic curiosities, archaisms and other vestiges from the past conserved in dictionaries. Is the concept of a perfect dictionary a reality or an ideal? There is no perfect student. Language learners, for whom dictionaries are of great importance, seek user-friendly material which will improve both their fluency in and understanding of the target language, and embed acquired lexis in their long-term memory. Lexicographers, in their search for perfection and in compliance with users' wishes, are constantly innovating, and every dictionary hopes to become a landmark in lexicography and in second language acquisition. This article aims to look at the way dictionaries have evolved and assess the latest generation of computer-based dictionaries, as well as consider possible developments which will contribute to the compilation of future dictionaries.

    Keywords: LEXICOGRAPHY, LEXICAL ACQUISITION, VOCABULARY, STYLE LABELS, CORPUS/CORPORA, DICTIONARIES, IDEOLOGY, STANDARD, LANGUAGE LEARNING, FRENCH MONOLINGUAL DICTIONARIES, ELECTRONIC DICTIONARIES, CD-ROMS

    Summary (from the Afrikaans): The ideology of the perfect dictionary: How efficient can a dictionary be? Dictionaries have become essential tools of the modern world. Not only have dictionary sales increased dramatically, but the variety of dictionaries and the competition between editors are also considerably on the

  2. Classification of multispectral or hyperspectral satellite imagery using clustering of sparse approximations on sparse representations in learned dictionaries obtained using efficient convolutional sparse coding

    Science.gov (United States)

    Moody, Daniela; Wohlberg, Brendt

    2018-01-02

    An approach for land cover classification, seasonal and yearly change detection and monitoring, and identification of changes in man-made features may use a clustering of sparse approximations (CoSA) on sparse representations in learned dictionaries. The learned dictionaries may be derived using efficient convolutional sparse coding to build multispectral or hyperspectral, multiresolution dictionaries that are adapted to regional satellite image data. Sparse image representations of images over the learned dictionaries may be used to perform unsupervised k-means clustering into land cover categories. The clustering process behaves as a classifier in detecting real variability. This approach may combine spectral and spatial textural characteristics to detect geologic, vegetative, hydrologic, and man-made features, as well as changes in these features over time.

  3. Dictionary Based Machine Translation from Kannada to Telugu

    Science.gov (United States)

    Sindhu, D. V.; Sagar, B. M.

    2017-08-01

    Machine translation is the task of translating from one language to another. For languages with few linguistic resources, such as Kannada and Telugu, a dictionary-based approach is the best approach. This paper mainly focuses on dictionary-based machine translation from Kannada to Telugu. The proposed methodology uses a dictionary to translate word by word, without much correlation of semantics between the words. The dictionary-based machine translation process has the following sub-processes: morph analyzer, dictionary, transliteration, transfer grammar and morph generator. As part of this work, a bilingual dictionary with 8000 entries was developed and a suffix mapping table at the tag level was built. The system was tested on children's stories. In the near future, the system can be further improved by defining transfer grammar rules.
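
    The word-by-word pipeline named above (morph analyzer, dictionary lookup, suffix mapping at the tag level, morph generator, transliteration fallback) might be sketched roughly as follows. Every entry, tag and function name here is an invented placeholder rather than real Kannada or Telugu data or the authors' implementation, and the transfer grammar step is omitted.

```python
# Toy placeholder data (assumption): not real Kannada or Telugu entries.
BILINGUAL_DICT = {"src_boy": "tgt_boy", "src_book": "tgt_book", "src_read": "tgt_read"}
SOURCE_SUFFIXES = {"+src_pl": "PL", "+src_past": "PAST"}  # source suffix -> tag
SUFFIX_MAP = {"PL": "+tgt_pl", "PAST": "+tgt_past"}       # tag -> target suffix

def analyze(word):
    """Morph analyzer stand-in: split a word into (stem, suffix tag or None)."""
    for suffix, tag in SOURCE_SUFFIXES.items():
        if word.endswith(suffix):
            return word[: -len(suffix)], tag
    return word, None

def transliterate(word):
    """Transliteration stand-in: mark the word as carried over untranslated."""
    return f"<{word}>"

def translate_word(word):
    """Look up the stem, then regenerate the word with the mapped suffix."""
    stem, tag = analyze(word)
    target_stem = BILINGUAL_DICT.get(stem)
    if target_stem is None:
        return transliterate(word)  # out-of-dictionary fallback
    return target_stem + (SUFFIX_MAP[tag] if tag else "")

def translate(sentence):
    """Translate word by word, with no reordering or semantic disambiguation."""
    return " ".join(translate_word(w) for w in sentence.split())

if __name__ == "__main__":
    print(translate("src_boy+src_pl src_book src_read+src_past"))
```

    Keeping the suffix table keyed by tag rather than by surface form means the stem dictionary and the inflection mapping can grow independently, which is one plausible reason for building the mapping at the tag level.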

  4. Polarimetric SAR image classification based on discriminative dictionary learning model

    Science.gov (United States)

    Sang, Cheng Wei; Sun, Hong

    2018-03-01

    Polarimetric SAR (PolSAR) image classification is one of the important applications of PolSAR remote sensing. It is a difficult high-dimensional nonlinear mapping problem, and sparse representations based on a learned overcomplete dictionary have shown great potential for solving it. The overcomplete dictionary plays an important role in PolSAR image classification. However, in complex PolSAR scenes, features shared by different classes weaken the discrimination of the learned dictionary and thus degrade classification performance. In this paper, we propose a novel overcomplete dictionary learning model to enhance the discrimination of the dictionary. The overcomplete dictionary learned by the proposed model is more discriminative and very suitable for PolSAR classification.

  5. Z-Index Parameterization for Volumetric CT Image Reconstruction via 3-D Dictionary Learning.

    Science.gov (United States)

    Bai, Ti; Yan, Hao; Jia, Xun; Jiang, Steve; Wang, Ge; Mou, Xuanqin

    2017-12-01

    Despite the rapid development of X-ray cone-beam CT (CBCT), image noise remains a major issue for low-dose CBCT. To suppress the noise effectively while retaining the structures well in low-dose CBCT images, in this paper a sparse constraint based on a 3-D dictionary is incorporated into a regularized iterative reconstruction framework, defining the 3-D dictionary learning (3-DDL) method. In addition, by analyzing the sparsity level curve associated with different regularization parameters, a new adaptive parameter selection strategy is proposed to facilitate our 3-DDL method. To justify the proposed method, we first analyze the distributions of the representation coefficients associated with the 3-D dictionary and the conventional 2-D dictionary to compare their efficiencies in representing volumetric images. Then, multiple real data experiments are conducted for performance validation. Based on these results, we found: 1) the 3-D dictionary-based sparse coefficients have a Laplacian distribution three orders narrower than that of the 2-D dictionary, suggesting the higher representation efficiency of the 3-D dictionary; 2) the sparsity level curve demonstrates a clear Z-shape, and is hence referred to as the Z-curve in this paper; 3) the parameter associated with the maximum curvature point of the Z-curve suggests a good parameter choice, which can be adaptively located with the proposed Z-index parameterization (ZIP) method; 4) the proposed 3-DDL algorithm equipped with the ZIP method delivers reconstructions with the lowest root mean squared errors and the highest structural similarity index compared with the competing methods; 5) noise performance similar to that of the regular-dose FDK reconstruction, in terms of the standard deviation metric, can be achieved with the proposed method using (1/2)/(1/4)/(1/8) dose level projections. The contrast-noise ratio is improved by ~2.5/3.5 times with respect to two different cases under the (1/8) dose level compared
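
    Finding 3) above selects the parameter at the maximum curvature point of a sampled curve. A minimal sketch of that idea, assuming the curve is available as sampled (x, y) pairs and using the standard plane-curve curvature |y''| / (1 + y'^2)^(3/2) with finite differences, is given below. The function name is hypothetical and this is not the paper's ZIP method.

```python
import numpy as np

def max_curvature_index(x, y):
    """Index of the sample where the curve y(x) bends most sharply."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    dy = np.gradient(y, x)    # finite-difference first derivative
    d2y = np.gradient(dy, x)  # finite-difference second derivative
    curvature = np.abs(d2y) / (1.0 + dy ** 2) ** 1.5
    return int(np.argmax(curvature))

if __name__ == "__main__":
    # Sanity check on a parabola, whose curvature peaks at its vertex.
    x = np.linspace(-1.0, 1.0, 201)
    print(x[max_curvature_index(x, x ** 2)])
```

    For a real sparsity level curve, the samples would typically need smoothing first, and the axis scaling (for example a logarithmic parameter axis) changes where the curvature peaks, so both are choices to state explicitly.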

  6. Nonparametric Coupled Bayesian Dictionary and Classifier Learning for Hyperspectral Classification.

    Science.gov (United States)

    Akhtar, Naveed; Mian, Ajmal

    2017-10-03

    We present a principled approach to learn a discriminative dictionary along with a linear classifier for hyperspectral classification. Our approach places Gaussian Process priors over the dictionary to account for the relative smoothness of the natural spectra, whereas the classifier parameters are sampled from multivariate Gaussians. We employ two Beta-Bernoulli processes to jointly infer the dictionary and the classifier. These processes are coupled under the same sets of Bernoulli distributions. In our approach, these distributions signify the frequency of the dictionary atom usage in representing class-specific training spectra, which also makes the dictionary discriminative. Due to the coupling between the dictionary and the classifier, the popularity of the atoms for representing different classes gets encoded into the classifier. This helps in predicting the class labels of test spectra that are first represented over the dictionary by solving a simultaneous sparse optimization problem. The labels of the spectra are predicted by feeding the resulting representations to the classifier. Our approach exploits the nonparametric Bayesian framework to automatically infer the dictionary size, the key parameter in discriminative dictionary learning. Moreover, it also has the desirable property of adaptively learning the association between the dictionary atoms and the class labels by itself. We use Gibbs sampling to infer the posterior probability distributions over the dictionary and the classifier under the proposed model, for which we derive analytical expressions. To establish the effectiveness of our approach, we test it on benchmark hyperspectral images. The classification performance is compared with the state-of-the-art dictionary learning-based classification methods.

  7. Criteria for Selecting a Monolingual Dictionary for Learners

    OpenAIRE

    Colin, Rogers

    2003-01-01

    There are a bewildering number of monolingual dictionaries on the market in Japan, including new learner dictionaries which give students a great deal of potentially useful information about how to use words. However, it is essential to evaluate dictionaries carefully to ensure that they meet the needs of the learners who will use them. This article sets out some criteria to help make such decisions.

  8. English-Chinese Cross-Language IR Using Bilingual Dictionaries

    Science.gov (United States)

    2006-01-01

    specialized dictionaries together contain about two million entries [6]. 4 Monolingual Experiment The Chinese documents and the Chinese translations of... monolingual performance. The main performance-limiting factor is the limited coverage of the dictionary used in query translation. Some of the key con...English-Chinese Cross-Language IR using Bilingual Dictionaries Aitao Chen , Hailing Jiang , and Fredric Gey School of Information Management

  9. Word Similarity from Dictionaries: Inferring Fuzzy Measures from Fuzzy Graphs

    Directory of Open Access Journals (Sweden)

    Vicenc Torra

    2008-01-01

    Full Text Available The computation of similarities between words is a basic element of information retrieval systems when retrieval is not based solely on word matching. In this work we consider a measure between words based on dictionaries. This is achieved by assuming that a dictionary is formalized as a fuzzy graph. We show that the approach permits computing measures not only for pairs of words but also for sets of them.

  10. The New Etymological Dictionary of Hungarian Idioms and Proverbs

    Directory of Open Access Journals (Sweden)

    Bárdosi Vilmos

    2017-01-01

    Full Text Available After briefly surveying the research carried out in Hungary on the origin of sayings, proverbs and adages, this paper introduces and exemplifies the way the new Hungarian phraseological etymological dictionary has been compiled. It subsequently presents excerpts from the dictionary that will expound on the origin of 1800 set phrases and statistically analyses the linguistic, cultural-historical, historical, literary, ethnographic and intercultural background of the Hungarian set phrases included in the dictionary.

  11. Technical Features of the Architecture of an Electronic Trilingual Dictionary

    Directory of Open Access Journals (Sweden)

    Grygorii Chetverikov

    2016-12-01

    Full Text Available This article is devoted to the development of the software system used to create an English-Russian-Ukrainian terminological dictionary. Scanned and recognized documents in MSWord format were the input data for the dictionary. Issues which appeared during the parsing of the input data are analyzed and solutions using regular expressions are identified. This article also describes the scheme of the dictionary’s lexicographical database, and its classes of models, views and view models. In addition, a detailed description of the software system from a user’s perspective is included, the prospects for the usage of the dictionary are discussed, and the methods used during the development of the system are described. The software system is built using the Model-View-View-Model design pattern. Through the use of this pattern, internal logic is separated from the user interface, so changes made in different parts of the software can be independent. The developed software system allows users to edit and populate, and thus to create, new thematic transferable electronic dictionaries. The main advantage of the system is the equality of languages, i.e. each user can decide which language is to be the primary one. Summary (from the Polish): Development of software for a trilingual electronic dictionary. The article is devoted to the development of software for a Russian-Ukrainian-English terminological dictionary. The authors took scanned and recognized documents in MSWord format as input data. Errors arising during the parsing of the input data were analyzed, and the authors indicated how to eliminate them using regular expressions. The paper describes the dictionary's lexicographical database in detail, and describes the data model classes and the presentation model classes of the system. The software is built in such a way that the template can be used

  12. Low-Dose X-ray CT Reconstruction via Dictionary Learning

    Science.gov (United States)

    Xu, Qiong; Zhang, Lei; Hsieh, Jiang; Wang, Ge

    2013-01-01

    Although diagnostic medical imaging provides enormous benefits in the early detection and accurate diagnosis of various diseases, there are growing concerns about the potential side effects of radiation-induced genetic, cancerous and other diseases. How to reduce radiation dose while maintaining diagnostic performance is a major challenge in the computed tomography (CT) field. Inspired by compressive sensing theory, the sparse constraint in terms of total variation (TV) minimization has already led to promising results for low-dose CT reconstruction. Compared to the discrete gradient transform used in the TV method, dictionary learning is proven to be an effective way for sparse representation. On the other hand, it is important to consider the statistical property of projection data in the low-dose CT case. Recently, we have developed a dictionary learning based approach for low-dose X-ray CT. In this paper, we present this method in detail and evaluate it in experiments. In our method, the sparse constraint in terms of a redundant dictionary is incorporated into an objective function in a statistical iterative reconstruction framework. The dictionary can be either predetermined before an image reconstruction task or adaptively defined during the reconstruction process. An alternating minimization scheme is developed to minimize the objective function. Our approach is evaluated with low-dose X-ray projections collected in animal and human CT studies, and the improvement associated with dictionary learning is quantified relative to filtered backprojection and TV-based reconstructions. The results show that the proposed approach might produce better images with lower noise and more detailed structural features in our selected cases. However, there is no proof that this is true for all kinds of structures. PMID:22542666

  13. Low-dose X-ray CT reconstruction via dictionary learning.

    Science.gov (United States)

    Xu, Qiong; Yu, Hengyong; Mou, Xuanqin; Zhang, Lei; Hsieh, Jiang; Wang, Ge

    2012-09-01

    Although diagnostic medical imaging provides enormous benefits in the early detection and accurate diagnosis of various diseases, there are growing concerns about the potential side effects of radiation-induced genetic, cancerous and other diseases. How to reduce radiation dose while maintaining diagnostic performance is a major challenge in the computed tomography (CT) field. Inspired by compressive sensing theory, the sparse constraint in terms of total variation (TV) minimization has already led to promising results for low-dose CT reconstruction. Compared to the discrete gradient transform used in the TV method, dictionary learning is proven to be an effective way for sparse representation. On the other hand, it is important to consider the statistical property of projection data in the low-dose CT case. Recently, we have developed a dictionary learning based approach for low-dose X-ray CT. In this paper, we present this method in detail and evaluate it in experiments. In our method, the sparse constraint in terms of a redundant dictionary is incorporated into an objective function in a statistical iterative reconstruction framework. The dictionary can be either predetermined before an image reconstruction task or adaptively defined during the reconstruction process. An alternating minimization scheme is developed to minimize the objective function. Our approach is evaluated with low-dose X-ray projections collected in animal and human CT studies, and the improvement associated with dictionary learning is quantified relative to filtered backprojection and TV-based reconstructions. The results show that the proposed approach might produce better images with lower noise and more detailed structural features in our selected cases. However, there is no proof that this is true for all kinds of structures.
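
    The "sparse constraint in terms of a redundant dictionary" mentioned above rests on a sparse coding step: representing a signal with a few dictionary atoms. The sketch below uses orthogonal matching pursuit (OMP), a standard greedy sparse coder chosen here for illustration. It is not the statistical iterative reconstruction of the paper, and the small dictionary in the usage example is made up.

```python
import numpy as np

def omp(D, y, k):
    """Greedy sparse coding: approximate y with at most k columns (atoms) of D."""
    y = np.asarray(y, dtype=float)
    residual = y.copy()
    support = []
    coef = np.zeros(0)
    for _ in range(k):
        correlations = np.abs(D.T @ residual)
        if support:
            correlations[support] = 0.0  # never pick the same atom twice
        support.append(int(np.argmax(correlations)))
        # Least-squares refit on all atoms selected so far.
        coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ coef
        if np.linalg.norm(residual) < 1e-12:
            break
    x = np.zeros(D.shape[1])
    x[support] = coef
    return x

if __name__ == "__main__":
    # Redundant toy dictionary: identity atoms plus one unit-norm constant atom.
    D = np.hstack([np.eye(4), np.full((4, 1), 0.5)])
    y = 2.0 * D[:, 0] + 3.0 * D[:, 4]
    print(omp(D, y, 2))
```

    In a dictionary learning loop, a step like this typically alternates with an update of the dictionary atoms, which corresponds to the "alternating minimization" pattern the abstract refers to.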

  14. A New Way to Lemmatize Adjectives in a User-friendly Zulu–English Dictionary

    Directory of Open Access Journals (Sweden)

    Gilles-Maurice de Schryver

    2011-10-01

    Full Text Available

    Abstract: Traditionally, Zulu adjectives have been lemmatized under their stems only. In this research article, an in-depth analysis is undertaken to make a case for lemmatizing all frequent adjectival forms together with their adjective concords instead. It is shown that the supposed explosion in size of the dictionary may be contained within a corpus-driven Sinclairian framework. The advantages of such a word-like treatment far outnumber the generalizations that have hitherto characterized the lexicographic treatment of adjectives in Zulu. The study is supported by ample dictionary extracts from a Zulu–English dictionary project aimed at junior users. Comparisons with existing dictionaries and textbook data are also made.

    Keywords: LEXICOGRAPHY, LINGUISTICS, GRAMMAR, DICTIONARY, BILINGUAL, CORPUS, LEMMATIZATION, FREQUENCY, ZULU (ISIZULU), ENGLISH, ADJECTIVE, ADJECTIVE STEM, QUALIFICATIVE ADJECTIVE, COPULATIVE ADJECTIVE, USER-FRIENDLY, REAL EXAMPLE, COLLOCATION, COMBINATION, DERIVATION, IDIOMATIC USE, SEMANTIC PROSODY

    Samenvatting (translated from Dutch): A new way to lemmatize adjectives in a user-friendly Zulu–English dictionary. Traditionally, adjectives in Zulu are lemmatized under their stem only. In this research article, a thorough analysis is carried out with a view to introducing a new method in which all frequent adjectives are placed in the dictionary together with their adjective concord. It is shown that the presumed explosion in the size of the dictionary can be contained within a corpus-driven Sinclairian framework. The advantages of such a word-like treatment far outweigh the generalizations that have so far characterized the lexicographic treatment of adjectives in Zulu. The study is supported by a large number of extracts from a Zulu–English dictionary project aimed at young users. Comparisons with existing dictionaries, as well as textbooks

  15. Danish Lexicography with Special Reference to LSP Dictionaries

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2011-01-01

    liberal and professional areas such as music, business and technical subjects. The number of LSP dictionaries increased at a slow but steady pace until the early 1900s, when the pace went up, and from around 1970 the number of new LSP dictionaries increased significantly. This indicates that lexicography...... business. This also reflects the trend of making printed dictionaries available in electronic form, either on CD-ROMs or on the Internet. The traditional publishers offer most of their LSP dictionaries in electronic form on a subscription basis, and new actors, such as IT companies, have entered the scene...

  16. Implementation of the Bulgarian-Polish online dictionary

    Directory of Open Access Journals (Sweden)

    Ludmila Dimitrova

    2015-11-01

    Full Text Available The paper describes the implementation of an online Bulgarian-Polish dictionary as a technological tool for applications in digital humanities. This bilingual digital dictionary is developed in the frame of the joint research project “Semantics and Contrastive Linguistics with a focus on a bilingual electronic dictionary” between IMI-BAS and ISS-PAS, supervised by L. Dimitrova (IMI-BAS) and V. Koseska-Toszewa (ISS-PAS). In addition, the main software tools for web-presentation of the dictionary are described briefly.

  17. Dealing with phraseology in business dictionaries: focus on dictionary functions – not phrases

    Directory of Open Access Journals (Sweden)

    Leroyer, Patrick

    2006-01-01

    Full Text Available The language of written business communication is characterised by the extensive use of phraseology, not only in terms of collocations and idiomatic expressions, but also of standard phrases in prototypical business genres. In any case, the phraseological information should be included in business dictionaries (in the following referred to as BDs) in accordance with the planned dictionary functions. Hence, the selection and presentation of the phraseological information should be decided by the lexicographer on the basis of the user needs alone and not on the recommendations of the phraseological literature about lexicographical practice. In this paper, I will firstly explain why lexicography and phraseology, although closely associated in a large number of studies, are quite different disciplines, and how their shared interest in dictionary practice in general is based on radically different views. I will then discuss the dictionary functions of BDs and focus on a number of concepts featuring extensive phraseological solutions to show and argue that dealing with phraseology in BDs should always keep the focus on dictionary functions.

  18. Dictionaries of Mexican Sexual Slang for NLP

    Directory of Open Access Journals (Sweden)

    Roberto Villarejo-Martínez

    2018-04-01

    Full Text Available Abstract: This paper describes the creation of two relevant resources for the double entendre and humour recognition problem in Mexican Spanish: a morphological dictionary and a semantic dictionary. These were created from two sources: a corpus of albures (drawn from the book “Antología del albur”) and a Mexican slang dictionary (“El chilangonario”). The morphological dictionary consists of 410 word forms that correspond to 350 lemmas. The semantic dictionary consists of 27 synsets that are associated with lemmas of the morphological dictionary. Since both resources are based on the Freeling library, they are easy to use for tasks in Natural Language Processing. The motivation for this work comes from the need to address problems such as double entendre and computational humour. The usefulness of these disciplines has been discussed many times and it has been shown that they have a direct impact on user interfaces and, mainly, on human-computer interaction. This work aims to encourage the scientific community to generate more resources on informal language in Spanish and other languages.  Spanish Abstract (translated): This article describes the creation of two relevant resources for the recognition of double entendre and humour in Mexican Spanish: a morphological dictionary and a semantic dictionary. These were created from two sources: a corpus of albures (taken from the book "Antología del albur") and a dictionary of Mexican slang ("El chilangonario"). The morphological dictionary consists of 410 word forms that correspond to 350 lemmas. The semantic dictionary consists of 27 synsets that are associated with lemmas of the morphological dictionary. Since both resources are based on the Freeling library, they are easy to use in Natural Language Processing tasks. The motivation for this work comes from the need to address problems such as double entendre and humour
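
    The relationship between the two resources (word forms map to lemmas, and lemmas are grouped into synsets) can be pictured with plain lookup tables. The sketch below is purely illustrative: the entries are ordinary Spanish words chosen as placeholders and do not come from the published dictionaries, the synset identifier is invented, and the names `FORM_TO_LEMMA`, `SYNSETS` and `senses_for` are hypothetical. It does not reproduce the Freeling file formats.

```python
from collections import defaultdict

# Morphological dictionary: surface form -> lemma (placeholder entries).
FORM_TO_LEMMA = {
    "cantó": "cantar",
    "cantan": "cantar",
    "casas": "casa",
}

# Semantic dictionary: synset id -> lemmas sharing a sense (placeholder entry).
SYNSETS = {
    "syn-001": {"cantar", "entonar"},
}

# Invert the synsets once so each lemma can be looked up directly.
LEMMA_TO_SYNSETS = defaultdict(set)
for synset_id, lemmas in SYNSETS.items():
    for lemma in lemmas:
        LEMMA_TO_SYNSETS[lemma].add(synset_id)

def senses_for(form):
    """Return (lemma, synset ids) for a surface form, or (None, empty set) if unknown."""
    lemma = FORM_TO_LEMMA.get(form.lower())
    if lemma is None:
        return None, set()
    return lemma, set(LEMMA_TO_SYNSETS.get(lemma, set()))

print(senses_for("Cantó"))
print(senses_for("casas"))
```

    With several forms per lemma, the form table is naturally larger than the lemma inventory, which matches the 410 forms to 350 lemmas ratio reported in the abstract.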

  19. SU-E-I-41: Dictionary Learning Based Quantitative Reconstruction for Low-Dose Dual-Energy CT (DECT)

    International Nuclear Information System (INIS)

    Xu, Q; Xing, L; Xiong, G; Elmore, K; Min, J

    2015-01-01

    Purpose: DECT collects two sets of projection data under higher and lower energies. With appropriate decomposition methods applied to the linear attenuation coefficients, quantitative information about the object, such as density, can be obtained. In practice, one of the important problems in DECT is the radiation dose due to the doubled scans. This work is aimed at establishing a dictionary learning based reconstruction framework for DECT for improved image quality while reducing the imaging dose. Methods: In our method, two dictionaries were learned in advance from the high-energy and low-energy image datasets, respectively, of similar objects under normal dose. The linear attenuation coefficient was decomposed into two basis components with a material-based decomposition method. An iterative reconstruction framework was employed. The two basis components were alternately updated with the DECT datasets and dictionary learning based sparse constraints. After one updating step under the dataset fidelity constraints, both high-energy and low-energy images can be obtained from the two basis components. Sparse constraints based on the learned dictionaries were applied to the high- and low-energy images to update the two basis components. The iterative calculation continues until a pre-set number of iterations is reached. Results: We evaluated the proposed dictionary learning method with dual-energy images collected using a DECT scanner. We re-projected the projection data with added Poisson noise to reflect the low-dose situation. The results obtained by the proposed method were compared with those obtained using an FBP-based method and a TV-based method. It was found that the proposed approach yields better results than the other methods, with higher resolution and less noise. Conclusion: The use of dictionaries learned from DECT images under normal dose is valuable and leads to improved results with much lower imaging dose.
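
    The step in which the high- and low-energy images are obtained from the two basis components, and vice versa, amounts to a per-pixel 2 x 2 linear map. The following is a minimal sketch of that idea only: the matrix `M` holds made-up attenuation values that are not physical constants and are not taken from the abstract, and the function names are hypothetical.

```python
import numpy as np

# Hypothetical attenuation of each basis material at each energy (1/cm).
# Rows: (high energy, low energy); columns: (basis 1, basis 2).
M = np.array([[0.20, 0.40],
              [0.25, 0.80]])

def to_energy_images(basis):
    """Map basis-component images, shape (2, H, W), to (high, low) energy images."""
    return np.tensordot(M, basis, axes=([1], [0]))

def to_basis_images(energy):
    """Invert the per-pixel 2 x 2 system to recover the two basis components."""
    return np.tensordot(np.linalg.inv(M), energy, axes=([1], [0]))

# Toy basis images: uniform 2 x 2 images with weights 1.0 and 0.5.
basis = np.stack([np.full((2, 2), 1.0), np.full((2, 2), 0.5)])
energy = to_energy_images(basis)      # high = 0.40, low = 0.65 in every pixel
recovered = to_basis_images(energy)   # round trip back to the basis components
print(energy[0, 0, 0], energy[1, 0, 0])
```

    Because the map is linear and invertible when the two basis materials differ enough between energies, sparse constraints applied to the energy images can be carried back to the basis components.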

  20. SU-E-I-41: Dictionary Learning Based Quantitative Reconstruction for Low-Dose Dual-Energy CT (DECT)

    Energy Technology Data Exchange (ETDEWEB)

    Xu, Q [School of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an, Shaanxi 710049 (China); Department of Radiation Oncology, Stanford University, Stanford, CA 94305 (United States); Xing, L [Department of Radiation Oncology, Stanford University, Stanford, CA 94305 (United States); Xiong, G; Elmore, K; Min, J [Dalio Institute of Cardiovascular Imaging, New York-Presbyterian Hospital and Weill Cornell Medical College, New York, NY (United States)

    2015-06-15

    Purpose: DECT collects two sets of projection data under higher and lower energies. With appropriate decomposition methods applied to the linear attenuation coefficients, quantitative information about the object, such as density, can be obtained. In practice, one of the important problems in DECT is the radiation dose due to the doubled scans. This work is aimed at establishing a dictionary learning based reconstruction framework for DECT for improved image quality while reducing the imaging dose. Methods: In our method, two dictionaries were learned in advance from the high-energy and low-energy image datasets, respectively, of similar objects under normal dose. The linear attenuation coefficient was decomposed into two basis components with a material-based decomposition method. An iterative reconstruction framework was employed. The two basis components were alternately updated with the DECT datasets and dictionary learning based sparse constraints. After one updating step under the dataset fidelity constraints, both high-energy and low-energy images can be obtained from the two basis components. Sparse constraints based on the learned dictionaries were applied to the high- and low-energy images to update the two basis components. The iterative calculation continues until a pre-set number of iterations is reached. Results: We evaluated the proposed dictionary learning method with dual-energy images collected using a DECT scanner. We re-projected the projection data with added Poisson noise to reflect the low-dose situation. The results obtained by the proposed method were compared with those obtained using an FBP-based method and a TV-based method. It was found that the proposed approach yields better results than the other methods, with higher resolution and less noise. Conclusion: The use of dictionaries learned from DECT images under normal dose is valuable and leads to improved results with much lower imaging dose.
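
    Adding Poisson noise to projection data to mimic a low-dose scan is commonly done in the photon-count domain. The sketch below shows one such approach under stated assumptions: a monoenergetic Beer-Lambert model, a hypothetical incident photon count per ray, and a hypothetical function name `simulate_low_dose`. The abstract does not give the authors' exact noise model or dose level.

```python
import numpy as np

def simulate_low_dose(line_integrals, incident_photons, rng):
    """Add Poisson counting noise to noise-free line integrals (a sinogram)."""
    # Expected detected photons per ray under the Beer-Lambert law.
    expected = incident_photons * np.exp(-line_integrals)
    counts = rng.poisson(expected)
    # Guard against zero counts before taking the logarithm.
    counts = np.maximum(counts, 1)
    return -np.log(counts / incident_photons)

rng = np.random.default_rng(0)
clean = np.full((4, 8), 1.0)   # toy sinogram: every ray has line integral 1.0
noisy = simulate_low_dose(clean, incident_photons=1e6, rng=rng)
print(noisy.shape, float(np.abs(noisy - clean).max()))
```

    Lowering `incident_photons` increases the relative noise in the returned line integrals, which is how a reduced imaging dose would be emulated in this simplified model.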