WorldWideScience

Sample records for machine translation phrases

  1. Exact Decoding for Phrase-Based Statistical Machine Translation

    NARCIS (Netherlands)

    Aziz, W.; Dymetman, M.; Specia, L.

    2014-01-01

    The combinatorial space of translation derivations in phrase-based statistical machine translation is given by the intersection between a translation lattice and a target language model. We replace this intractable intersection by a tractable relaxation which incorporates a low-order upperbound on t

  2. Integrating source-language context into phrase-based statistical machine translation

    NARCIS (Netherlands)

    Haque, R.; Kumar Naskar, S.; Bosch, A.P.J. van den; Way, A.

    2011-01-01

    The translation features typically used in Phrase-Based Statistical Machine Translation (PB-SMT) model dependencies between the source and target phrases, but not among the phrases in the source language themselves. A swathe of research has demonstrated that integrating source context modelling dire

  3. Syntactic Reordering for Arabic- English Phrase-Based Machine Translation

    Science.gov (United States)

    Hatem, Arwa; Omar, Nazlia

    Machine Translation (MT) refers to the use of a machine for performing translation task which converts text or speech in one Natural Language (Source Language (SL)) into another Natural Language (Target Language (TL)). The translation from Arabic to English is difficult task due to the Arabic languages are highly inflectional, rich morphology and relatively free word order. Word ordering plays an important part in the translation process. The paper proposes a transfer-based approach in Arabic to English MT to handle the word ordering problem. Preliminary tested indicate that our system, AE-TBMT is competitive when compared against other approaches from the literature.

  4. Machine Translation of Noun Phrases from English to Igala using ...

    African Journals Online (AJOL)

    PROF. OLIVER OSUAGWA

    2014-06-01

    Jun 1, 2014 ... for easier processing of concepts This clearly shows that ... research organizations and government agencies to .... first hybrid machine translation in 2009. English and ... sentence structure in the target language. There is no ...

  5. Advances in Fully-Automatic and Interactive Phrase-Based Statistical Machine Translation

    OpenAIRE

    Ortiz Martínez, Daniel

    2011-01-01

    This thesis presents different contributions in the fields of fully-automatic statistical machine translation and interactive statistical machine translation. In the field of statistical machine translation there are three problems that are to be addressed, namely, the modelling problem, the training problem and the search problem. In this thesis we present contributions regarding these three problems. Regarding the modelling problem, an alternative derivation of phrase-based s...

  6. Pre-editing and Recursive-Phrase Composites for a Better English-to-Arabic Machine Translation

    Directory of Open Access Journals (Sweden)

    Mansoor Al-A'ali

    2007-01-01

    Full Text Available This research presents an approach for an English-to-Arabic Machine Translation System based on Building correct grammar and phrase structures first and then automatically deriving Translation Rules for phrase translation. For every English phrase, the grammar is first analysed and then a corresponding Arabic translation is given which would be used by the machine learning system to produce a translation rule with the help of a dictionary and the user. These same derived rules can partially be used for other phrase sequences especially in the case of a phrase consisting of a number of smaller phrases and thus implemeting the idea of recusive phrase strucutres. The approach was implemented and tested on simple cases and the results are given which indicate that this approach is successful for small to medium phrases. Our approach is an enhancement on existing phrase translation techniques because it analyses the source language grammar first, then builds a syntactic structure before proceeding with the machine learning process of learning the translation rules. Our approach is enhancement on existing phrase based translations in two directions: the grammar editing before the translation rules and the derived translation rules can be complete for complete phrases or are rules for translating smaller phrases which are subsets of larger phrases. The approach has improved the speed and correctness of phrase translations.

  7. Translation of Untranslatable Words — Integration of Lexical Approximation and Phrase-Table Extension Techniques into Statistical Machine Translation

    Science.gov (United States)

    Paul, Michael; Arora, Karunesh; Sumita, Eiichiro

    This paper proposes a method for handling out-of-vocabulary (OOV) words that cannot be translated using conventional phrase-based statistical machine translation (SMT) systems. For a given OOV word, lexical approximation techniques are utilized to identify spelling and inflectional word variants that occur in the training data. All OOV words in the source sentence are then replaced with appropriate word variants found in the training corpus, thus reducing the number of OOV words in the input. Moreover, in order to increase the coverage of such word translations, the SMT translation model is extended by adding new phrase translations for all source language words that do not have a single-word entry in the original phrase-table but only appear in the context of larger phrases. The effectiveness of the proposed methods is investigated for the translation of Hindi to English, Chinese, and Japanese.

  8. Rule Based Machine Translation of Noun Phrases from Punjabi to English

    Directory of Open Access Journals (Sweden)

    Kamaljeet Kaur Batra

    2010-09-01

    Full Text Available The paper presents automatic translation of noun phrases from Punjabi to English using transfer approach. The system has analysis, translation and synthesis component. The steps involved are pre processing, tagging, ambiguity resolution, translation and synthesis of words in target language. The accuracy is calculated for each step and the overall accuracy of the system is calculated to be about 85% for a particular type of noun phrases.

  9. Javanese Phrase Construction in Classical Books Translation

    Directory of Open Access Journals (Sweden)

    Moh. Masrukhi

    2016-07-01

    Full Text Available When the system or structure of language is used and influenced by other languages, it is called interference, and it may ruin the concept of structures.   This research is about Arabic phrase construction that influenced Javanese phrase construction in classical books translation (TKK. This involvement appears when the concept of Arabic’s construction is translated literally into Javanese.   The research was carried out by applying theories of interference and translation. It’s analysis approach with contrastive analysis which is allegedly enabled errors to be predicted from a comparison between Arabic’s and Javanese’s phrase construction. The data were found and collected from several classical books in Arabic language (KKbA translated by different writers. The results show that Javanese phrase construction (as the target language, particularly, with noun phrases,  with adjectival phrases, with numeral phrases, and with prepositional phrases was influenced by Arabic’s murakkab or Arabic phrase construction (as the source language. Arabic has its own concept of phrase construction.  The phrase construction cannot be translated into Javanese directly through word-for-word translation or literal translation. Thus, Javanese in TKK became inconvenient and ungrammatical. Arabic phrase construction is flipped around (with noun phrases and adjectival phrases and prepositions are used and translated improperly or in the wrong position (with prepositional phrases. This research has many implications for further use, such as: for the identification and description of the deviation of  Javanese phrase construction which has been affected by direct translation from the Arabic language,  and furthermore, to increase the knowledge of those who are learning by increasing the realization and awareness in writing and translating (especially from Arabic to Javanese about the fact that  Arabic and Javanese  have their own regulations or

  10. Machine Translation

    Institute of Scientific and Technical Information of China (English)

    张严心

    2015-01-01

    As a kind of ancillary translation tool, Machine Translation has been paid increasing attention to and received different kinds of study by a great deal of researchers and scholars for a long time. To know the definition of Machine Translation and to analyse its benefits and problems are significant for translators in order to make good use of Machine Translation, and helpful to develop and consummate Machine Translation Systems in the future.

  11. Phrase translation probabilities with ITG priors and smoothing as learning objective

    NARCIS (Netherlands)

    Mylonakis, M.; Sima'an, K.

    2008-01-01

    The conditional phrase translation probabilities constitute the principal components of phrase-based machine translation systems. These probabilities are estimated using a heuristic method that does not seem to optimize any reasonable objective function of the word-aligned, parallel training corpus.

  12. On Using Very Large Target Vocabulary for Neural Machine Translation

    OpenAIRE

    Jean, Sébastien; Cho, Kyunghyun; Memisevic, Roland; Bengio, Yoshua

    2014-01-01

    Neural machine translation, a recently proposed approach to machine translation based purely on neural networks, has shown promising results compared to the existing approaches such as phrase-based statistical machine translation. Despite its recent success, neural machine translation has its limitation in handling a larger vocabulary, as training complexity as well as decoding complexity increase proportionally to the number of target words. In this paper, we propose a method that allows us ...

  13. Machine Translation and Other Translation Technologies.

    Science.gov (United States)

    Melby, Alan

    1996-01-01

    Examines the application of linguistic theory to machine translation and translator tools, discusses the use of machine translation and translator tools in the real world of translation, and addresses the impact of translation technology on conceptions of language and other issues. Findings indicate that the human mind is flexible and linguistic…

  14. Research on joint Chinese-Japanese word segmentation for phrase-based statistical machine translation%面向短语统计机器翻译的汉日联合分词研究

    Institute of Scientific and Technical Information of China (English)

    吴培昊; 徐金安; 张玉洁

    2015-01-01

    Unknown words and word segmentation granularity are two main problems for Chinese-Japanese machine translation. Word segmentation is the first important step for Chinese and Japanese natural language processing. As Chi-nese and Japanese word segmentation is processed with different tagging system and semantic performance, the granularity of word segmentation results should be readjusted to improve the performance of Statistical Machine Translation(SMT). This paper proposes an approach to adjust the word segmentation granularity for improving the performance of SMT, which combines Hanzi-Kanji comparison table and Japanese-Chinese dictionary. Experimental results express that the pro-posed method could adjust the granularity between Chinese and Japanese effectively and improve the performance of SMT. This paper analyses the experimental results and discusses the effect of joint Chinese-Japanese word segmentation granularity for phrase-based SMT.%未登录词与分词粒度是汉日日汉机器翻译研究的两个主要问题。与英语等西方语言不同,汉语与日语词语间不存在空格,分词为汉日双语处理的重要工作。由于词性标注体系、文法及语义表现上的差异,分词结果的粒度需要进一步调整,以改善统计机器翻译系统的性能。提出了面向统计机器翻译的基于汉日汉字对照表及日汉词典信息的汉语与日语的分词粒度调整方法。实验结果表明,该方法能有效地调节源语言和目标语言端的分词粒度,提高统计机器翻译系统的性能。通过对比实验结果,分析探讨分词粒度对汉日双语统计系统性能的影响。

  15. Chunk Alignment for Corpus-Based Machine Translation

    Science.gov (United States)

    Kim, Jae Dong

    2011-01-01

    Since sub-sentential alignment is critically important to the translation quality of an Example-Based Machine Translation (EBMT) system, which operates by finding and combining phrase-level matches against the training examples, we developed a new alignment algorithm for the purpose of improving the EBMT system's performance. This new…

  16. Countability and Number in Japanese-to-English Machine Translation

    CERN Document Server

    Bond, F; Ikehara, S; Bond, Francis; Ogura, Kentaro; Ikehara, Satoru

    1994-01-01

    This paper presents a heuristic method that uses information in the Japanese text along with knowledge of English countability and number stored in transfer dictionaries to determine the countability and number of English noun phrases. Incorporating this method into the machine translation system ALT-J/E, helped to raise the percentage of noun phrases generated with correct use of articles and number from 65% to 73%.

  17. Machine Translation Effect on Communication

    DEFF Research Database (Denmark)

    Jensen, Mika Yasuoka; Bjørn, Pernille

    2011-01-01

    Intercultural collaboration facilitated by machine translation has gradually spread in various settings. Still, little is known as for the practice of machine-translation mediated communication. This paper investigates how machine translation affects intercultural communication in practice. Based...... determines communication process largely, our data indicates communication relies more on a dynamic process where participants establish common ground than on reproducibility and grammatical accuracy.......Intercultural collaboration facilitated by machine translation has gradually spread in various settings. Still, little is known as for the practice of machine-translation mediated communication. This paper investigates how machine translation affects intercultural communication in practice. Based...... on communication in which multilingual communication system is applied, we identify four communication types and its’ influences on stakeholders’ communication process, especially focusing on establishment and maintenance of common ground. Different from our expectation that quality of machine translation results...

  18. Automatic Evaluation of Machine Translation

    DEFF Research Database (Denmark)

    Martinez, Mercedes Garcia; Koglin, Arlene; Mesa-Lao, Bartolomé

    2015-01-01

    The availability of systems capable of producing fairly accurate translations has increased the popularity of machine translation (MT). The translation industry is steadily incorporating MT in their workflows engaging the human translator to post-edit the raw MT output in order to comply with a set...

  19. Parsing statistical machine translation output

    NARCIS (Netherlands)

    Carter, S.; Monz, C.; Vetulani, Z.

    2009-01-01

    Despite increasing research into the use of syntax during statistical machine translation, the incorporation of syntax into language models has seen limited success. We present a study of the discriminative abilities of generative syntax-based language models, over and above standard n-gram models,

  20. Automatic Evaluation of Machine Translation

    DEFF Research Database (Denmark)

    Martinez, Mercedes Garcia; Koglin, Arlene; Mesa-Lao, Bartolomé

    2015-01-01

    The availability of systems capable of producing fairly accurate translations has increased the popularity of machine translation (MT). The translation industry is steadily incorporating MT in their workflows engaging the human translator to post-edit the raw MT output in order to comply with a set...... of post-editing effort, namely i) temporal (time), ii) cognitive (mental processes) and iii) technical (keyboard activity). For the purposes of this research, TER scores were correlated with two different indicators of post-editing effort as computed in the CRITT Translation Process Database (TPR......-DB) *. On the one hand, post-editing temporal effort was measured using FDur values (duration of segment production time excluding keystroke pauses >_ 200 seconds) and KDur values (duration of coherent keyboard activity excluding keystroke pauses >_ 5 seconds). On the other hand, post-editing technical effort...

  1. On automatic machine translation evaluation

    Directory of Open Access Journals (Sweden)

    Darinka Verdonik

    2013-05-01

    Full Text Available An important task of developing machine translation (MT is evaluating system performance. Automatic measures are most commonly used for this task, as manual evaluation is time-consuming and costly. However, to perform an objective evaluation is not a trivial task. Automatic measures, such as BLEU, TER, NIST, METEOR etc., have their own weaknesses, while manual evaluations are also problematic since they are always to some extent subjective. In this paper we test the influence of a test set on the results of automatic MT evaluation for the subtitling domain. Translating subtitles is a rather specific task for MT, since subtitles are a sort of summarization of spoken text rather than a direct translation of (written text. Additional problem when translating language pair that does not include English, in our example Slovene-Serbian, is that commonly the translations are done from English to Serbian and from English to Slovenian, and not directly, since most of the TV production is originally filmed in English. All this poses additional challenges to MT and consequently to MT evaluation. Automatic evaluation is based on a reference translation, which is usually taken from an existing parallel corpus and marked as a test set. In our experiments, we compare the evaluation results for the same MT system output using three types of test set. In the first round, the test set are 4000 subtitles from the parallel corpus of subtitles SUMAT. These subtitles are not direct translations from Serbian to Slovene or vice versa, but are based on an English original. In the second round, the test set are 1000 subtitles randomly extracted from the first test set and translated anew, from Serbian to Slovenian, based solely on the Serbian written subtitles. In the third round, the test set are the same 1000 subtitles, however this time the Slovene translations were obtained by manually correcting the Slovene MT outputs so that they are correct translations of the

  2. Evaluating Arabic to English Machine Translation

    Directory of Open Access Journals (Sweden)

    Laith S. Hadla

    2014-11-01

    Full Text Available Online text machine translation systems are widely used throughout the world freely. Most of these systems use statistical machine translation (SMT that is based on a corpus full with translation examples to learn from them how to translate correctly. Online text machine translation systems differ widely in their effectiveness, and therefore we have to fairly evaluate their effectiveness. Generally the manual (human evaluation of machine translation (MT systems is better than the automatic evaluation, but it is not feasible to be used. The distance or similarity of MT candidate output to a set of reference translations are used by many MT evaluation approaches. This study presents a comparison of effectiveness of two free online machine translation systems (Google Translate and Babylon machine translation system to translate Arabic to English. There are many automatic methods used to evaluate different machine translators, one of these methods; Bilingual Evaluation Understudy (BLEU method. BLEU is used to evaluate translation quality of two free online machine translation systems under consideration. A corpus consists of more than 1000 Arabic sentences with two reference English translations for each Arabic sentence is used in this study. This corpus of Arabic sentences and their English translations consists of 4169 Arabic words, where the number of unique Arabic words is 2539. This corpus is released online to be used by researchers. These Arabic sentences are distributed among four basic sentence functions (declarative, interrogative, exclamatory, and imperative. The experimental results show that Google machine translation system is better than Babylon machine translation system in terms of precision of translation from Arabic to English.

  3. Topic-aware pivot language approach for statistical machine translation

    Institute of Scientific and Technical Information of China (English)

    Jin-song SU; Xiao-dong SHI; Yan-zhou HUANG; Yang LIU; Qing-qiang WU; Yi-dong CHEN; Huai-lin DONG

    2014-01-01

    The pivot language approach for statistical machine translation (SMT) is a good method to break the resource bottleneck for certain language pairs. However, in the implementation of conventional approaches, pivot-side context information is far from fully utilized, resulting in erroneous estimations of translation probabilities. In this study, we propose two topic-aware pivot language approaches to use different levels of pivot-side context. The fi rst method takes advantage of document-level context by assuming that the bridged phrase pairs should be similar in the document-level topic distributions. The second method focuses on the effect of local context. Central to this approach are that the phrase sense can be refl ected by local context in the form of probabilistic topics, and that bridged phrase pairs should be compatible in the latent sense distributions. Then, we build an interpolated model bringing the above methods together to further enhance the system performance. Experimental results on French-Spanish and French-German translations using English as the pivot language demonstrate the effectiveness of topic-based context in pivot-based SMT.

  4. Machine Translation in Post-Contemporary Era

    Science.gov (United States)

    Lin, Grace Hui Chin

    2010-01-01

    This article focusing on translating techniques via personal computer or laptop reports updated artificial intelligence progresses before 2010. Based on interpretations and information for field of MT [Machine Translation] by Yorick Wilks' book, "Machine Translation, Its scope and limits," this paper displays understandable theoretical frameworks…

  5. Machine Translation Approaches and Survey for Indian Languages

    OpenAIRE

    Khan, Nadeem Jadoon; Anwar, Waqas; Durrani, Nadir

    2017-01-01

    In this study, we present an analysis regarding the performance of the state-of-art Phrase-based Statistical Machine Translation (SMT) on multiple Indian languages. We report baseline systems on several language pairs. The motivation of this study is to promote the development of SMT and linguistic resources for these language pairs, as the current state-of-the-art is quite bleak due to sparse data resources. The success of an SMT system is contingent on the availability of a large parallel c...

  6. Classifiers in Japanese-to-English Machine Translation

    CERN Document Server

    Bond, F; Ikehara, S; Bond, Francis; Ogura, Kentaro; Ikehara, Satoru

    1996-01-01

    This paper proposes an analysis of classifiers into four major types: UNIT, METRIC, GROUP and SPECIES, based on properties of both Japanese and English. The analysis makes possible a uniform and straightforward treatment of noun phrases headed by classifiers in Japanese-to-English machine translation, and has been implemented in the MT system ALT-J/E. Although the analysis is based on the characteristics of, and differences between, Japanese and English, it is shown to be also applicable to the unrelated language Thai.

  7. Machine Translation Tools - Tools of The Translator's Trade

    DEFF Research Database (Denmark)

    Kastberg, Peter

    2012-01-01

    In this article three of the more common types of translation tools are presented, discussed and critically evaluated. The types of translation tools dealt with in this article are: Fully Automated Machine Translation (or FAMT), Human Aided Machine Translation (or HAMT) and Machine Aided Human...... Translation (or MAHT). The strengths and weaknesses of the different types of tools are discussed and evaluated by means of a number of examples. The article aims at two things: at presenting a sort of state of the art of what is commonly referred to as “machine translation” as well as at providing the reader...... with a sound basis for considering what translation tool (if any) is the most appropriate in order to meet his or her specific translation needs....

  8. Marker-based filtering of bilingual phrase pairs for SMT

    OpenAIRE

    Sánchez-Martínez, Felipe; Way, Andy

    2009-01-01

    State-of-the-art statistical machine translation systems make use of a large translation table obtained after scoring a set of bilingual phrase pairs automatically extracted from a parallel corpus. The number of bilingual phrase pairs extracted from a pair of aligned sentences grows exponentially as the length of the sentences increases; therefore, the number of entries in the phrase table used to carry out the translation may become unmanageable, especially when online, ‘on demand’ translati...

  9. Adding Statistical Machine Translation Adaptation to Computer-Assisted Translation

    Science.gov (United States)

    2013-09-01

    on Telecommunications. Tehran, 2012, 822–826. Bertoldi, N.; Federico, M. Domain Adaptation for Statistical Machine Translation with Monolingual ...for Interactive Machine Translation. ICMI’11. Alicante, Spain: ACM, 2011, 197–200. 14 Haffari, G.; Sarkar, A. Active Learning for Multilingual

  10. RM-structure alignment based statistical machine translation model

    Institute of Scientific and Technical Information of China (English)

    Sun Jiadong; Zhao Tiejun

    2008-01-01

    A novel model based on structure alignments is proposed for statistical machine translation in this paper.Meta-structure and sequence of meta-structure for a parse tree are defined.During the translation process, a parse tree is decomposed to deal with the structure divergence and the alignments can be constructed at different levels of recombination of meta-structure (RM).This method can perform the structure mapping across the sub-tree structure between languages.As a result, we get not only the translation for the target language, but sequence of meta-structure of its parse tree at the same time.Experiments show that the model in the framework of log-linear model has better generative ability and significantly outperforms Pharaoh, a phrase-based system.

  11. LOOSE PHRASE EXTRACTION WITH n-BEST ALIGNMENTS

    Institute of Scientific and Technical Information of China (English)

    Xue Yongzeng; Li Sheng

    2007-01-01

    Loose phrase extraction method is proposed and applied for phrase-based statistical machine translation. The method extracts phrase pairs that are not strictly consistent with word alignments. Two types of constraints on word positions are investigated for this method. Furthermore,n-best alignments are introduced for phrase extraction instead of the one-best. Experimental results show that the proposed approach outperforms the baseline system, Pharaoh system, for both one-best and n-best alignments.

  12. Machine Translation Based on Translation Corresponding Tree Structure

    Institute of Scientific and Technical Information of China (English)

    2006-01-01

    A representation schema called translation corresponding tree (TCT) has been applied to a Portuguese to Chinese example-based machine translation system. The translation examples are annotated by the representation of the TCT structure. Each TCT describes not only the syntactic structure of the source sentence (i.e., Portuguese in our system) but also the translation correspondences (i.e., Chinese translation). In addition, the TCT nodes describe the corresponding linguistic relationships between the source and target languages. The translation examples can be effectively represented with this annotation schema and organized in the bilingual knowledge database or example base. In the real machine translation process, the target language is synthesized with higher quality by referring to the TCT translation information.

  13. Discriminative syntactic reranking for statistical machine translation

    NARCIS (Netherlands)

    Carter, S.; Monz, C.

    2010-01-01

    This paper describes a method that successfully exploits simple syntactic features for n-best translation candidate reranking using perceptrons. Our approach uses discriminative language modelling to rerank the n-best translations generated by a statistical machine translation system. The performanc

  14. Possessive Pronouns as Determiners in Japanese-to-English Machine Translation

    CERN Document Server

    Bond, F; Ikehara, S; Bond, Francis; Ogura, Kentaro; Ikehara, Satoru

    1996-01-01

    Possessive pronouns are used as determiners in English when no equivalent would be used in a Japanese sentence with the same meaning. This paper proposes a heuristic method of generating such possessive pronouns even when there is no equivalent in the Japanese. The method uses information about the use of possessive pronouns in English treated as a lexical property of nouns, in addition to contextual information about noun phrase referentiality and the subject and main verb of the sentence that the noun phrase appears in. The proposed method has been implemented in NTT Communication Science Laboratories' Japanese-to-English machine translation system ALT-J/E. In a test set of 6,200 sentences, the proposed method increased the number of noun phrases with appropriate possessive pronouns generated, by 263 to 609, at the cost of generating 83 noun phrases with inappropriate possessive pronouns.

  15. Fundamentals of Machine Learning for Neural Machine Translation

    OpenAIRE

    Kelleher, John

    2016-01-01

    This paper presents a short introduction to neural networks and how they are used for machine translation and concludes with some discussion on the current research challenges being addressed by neural machine translation (NMT) research. The primary goal of this paper is to give a no-tears introduction to NMT to readers that do not have a computer science or mathematical background. The secondary goal is to provide the reader with a deep enough understanding of NMT that they can appreciate th...

  16. Urdu to Punjabi Machine Translation: An Incremental Training Approach

    Directory of Open Access Journals (Sweden)

    Umrinderpal Singh

    2016-04-01

    Full Text Available The statistical machine translation approach is highly popular in automatic translation research area and promising approach to yield good accuracy. Efforts have been made to develop Urdu to Punjabi statistical machine translation system. The system is based on an incremental training approach to train the statistical model. In place of the parallel sentences corpus has manually mapped phrases which were used to train the model. In preprocessing phase, various rules were used for tokenization and segmentation processes. Along with these rules, text classification system was implemented to classify input text to predefined classes and decoder translates given text according to selected domain by the text classifier. The system used Hidden Markov Model(HMM for the learning process and Viterbi algorithm has been used for decoding. Experiment and evaluation have shown that simple statistical model like HMM yields good accuracy for a closely related language pair like Urdu-Punjabi. The system has achieved 0.86 BLEU score and in manual testing and got more than 85% accuracy.

  17. Using example-based machine translation to translate DVD subtitles

    DEFF Research Database (Denmark)

    Flanagan, Marian

    Audiovisual Translation (AVT), and in particular subtitling, has been recognised as an area that could potentially benefit from the introduction of machine translation (followed by post-editing). In recent years the demands on subtitlers have increased, while the payment to subtitlers and time al...

  18. Project-specific Machine Translation

    Science.gov (United States)

    2011-12-01

    Linguistic Data Consortium’s Reflex Pack for Less Commonly Taught Languages (LCTL) (7). The corpora labeled Sada -e-Azadi (SeA) (11) and Afghan Recovery...Linguistics, Philadelphia, PA, 2002, 311–318. 11. Sada -e Azadi. http://www.sada-e-azadi.net (accessed 2011). 12. Stolcke, A. SRILM - An Extensible...Translation Center QAMO Qamoosuna (dictionaries) RH Ranger Handbook SCFG Synchronous Context Free Grammar SeA Sada -e-Azadi SME subject matter expert SMT

  19. Rule-based machine translation for Aymara

    NARCIS (Netherlands)

    Coler, Matthew; Homola, Petr; Jones, Mari

    2014-01-01

    This paper presents the ongoing result of an approach developed by the collaboration of a computational linguist with a field linguist that addresses one of the oft-overlooked keys to language maintenance: the development of modern language-learning tools. Although machine translation isn’t commonly

  20. Findings of the 2010 Joint Workshop on Statistical Machine Translation and Metrics for Machine Translation

    NARCIS (Netherlands)

    Callison-Burch, C.; Koehn, P.; Monz, C.; Peterson, K.; Przybocki, M.; Zaidan, O.F.

    2010-01-01

    This paper presents the results of the WMT10 and MetricsMATR10 shared tasks, which included a translation task, a system combination task, and an evaluation task. We conducted a large-scale manual evaluation of 104 machine translation systems and 41 system combination entries. We used the ranking of

  1. Unraveling the Contribution of Image Captioning and Neural Machine Translation for Multimodal Machine Translation

    Directory of Open Access Journals (Sweden)

    Lala Chiraag

    2017-06-01

    Full Text Available Recent work on multimodal machine translation has attempted to address the problem of producing target language image descriptions based on both the source language description and the corresponding image. However, existing work has not been conclusive on the contribution of visual information. This paper presents an in-depth study of the problem by examining the differences and complementarities of two related but distinct approaches to this task: textonly neural machine translation and image captioning. We analyse the scope for improvement and the effect of different data and settings to build models for these tasks. We also propose ways of combining these two approaches for improved translation quality.

  2. Resolution of Unidentified Words in Machine Translation

    CERN Document Server

    Ullah, Sana; Kwak, Kyung Sup

    2009-01-01

    This paper presents a mechanism of resolving unidentified lexical units in text-based machine translation (TBMT). In machine translation system it is unlikely to have a complete MT lexicon and hence there is a need of a mechanism to handle the problem of unidentified words. These unknown words could be abbreviations, names, acronyms and newly introduced terms. We have proposed an algorithm for the resolution of the unidentified words. This algorithm takes discourse unit (primitive discourse) as a unit of analysis and provides real time updates to the lexicon. We have manually applied the algorithm to news paper fragments. Along with anaphora and cataphora1 resolution, many unknown words especially names and abbreviations were updated to the lexicon. Moreover flowchart of the proposed algorithm is also presented.

  3. An Evaluation of Output Quality of Machine Translation (Padideh Software vs. Google Translate

    Directory of Open Access Journals (Sweden)

    Haniyeh Sadeghi Azer

    2015-08-01

    Full Text Available This study aims to evaluate the translation quality of two machine translation systems in translating six different text-types, from English to Persian. The evaluation was based on criteria proposed by Van Slype (1979. The proposed model for evaluation is a black-box type, comparative and adequacy-oriented evaluation. To conduct the evaluation, a questionnaire was assigned to end-users to evaluate the outputs to examine and determine, if the machine-generated translations are intelligible and acceptable from their point of view and which one of the machine-generated translations produced by Padideh software and Google Translate is more acceptable and useful from the end-users point of view. The findings indicate that, the machine-generated translations are intelligible and acceptable in translating certain text-types, for end-users and Google Translate is more acceptable from end-users point of view. Keywords: Machine Translation, Machine Translation Evaluation, Translation Quality

  4. Findings of the 2014 Workshop on Statistical Machine Translation

    NARCIS (Netherlands)

    Bojar, O.; Buck, C.; Federmann, C.; Haddow, B.; Koehn, P.; Leveling, J.; Monz, C.; Pecina, P.; Post, M.; Saint-Amand, H.; Soricut, R.; Specia, L.; Tamchyna, A.

    2014-01-01

    This paper presents the results of the WMT14 shared tasks, which included a standard news translation task, a separate medical translation task, a task for run-time estimation of machine translation quality, and a metrics task. This year, 143 machine translation systems from 23 institutions were sub

  5. Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives.

    Science.gov (United States)

    Gobbel, Glenn T; Reeves, Ruth; Jayaramaraja, Shrimalini; Giuse, Dario; Speroff, Theodore; Brown, Steven H; Elkin, Peter L; Matheny, Michael E

    2014-04-01

    Rapid, automated determination of the mapping of free text phrases to pre-defined concepts could assist in the annotation of clinical notes and increase the speed of natural language processing systems. The aim of this study was to design and evaluate a token-order-specific naïve Bayes-based machine learning system (RapTAT) to predict associations between phrases and concepts. Performance was assessed using a reference standard generated from 2860 VA discharge summaries containing 567,520 phrases that had been mapped to 12,056 distinct Systematized Nomenclature of Medicine - Clinical Terms (SNOMED CT) concepts by the MCVS natural language processing system. It was also assessed on the manually annotated, 2010 i2b2 challenge data. Performance was established with regard to precision, recall, and F-measure for each of the concepts within the VA documents using bootstrapping. Within that corpus, concepts identified by MCVS were broadly distributed throughout SNOMED CT, and the token-order-specific language model achieved better performance based on precision, recall, and F-measure (0.95±0.15, 0.96±0.16, and 0.95±0.16, respectively; mean±SD) than the bag-of-words based, naïve Bayes model (0.64±0.45, 0.61±0.46, and 0.60±0.45, respectively) that has previously been used for concept mapping. Precision, recall, and F-measure on the i2b2 test set were 92.9%, 85.9%, and 89.2% respectively, using the token-order-specific model. RapTAT required just 7.2ms to map all phrases within a single discharge summary, and mapping rate did not decrease as the number of processed documents increased. The high performance attained by the tool in terms of both accuracy and speed was encouraging, and the mapping rate should be sufficient to support near-real-time, interactive annotation of medical narratives. These results demonstrate the feasibility of rapidly and accurately mapping phrases to a wide range of medical concepts based on a token-order-specific naïve Bayes model and

  6. Findings of the 2011 workshop on statistical machine translation

    NARCIS (Netherlands)

    Callison-Burch, C.; Koehn, P.; Monz, C.; Zaidan, O.F.

    2011-01-01

    This paper presents the results of the WMT11 shared tasks, which included a translation task, a system combination task, and a task for machine translation evaluation metrics. We conducted a large-scale manual evaluation of 148 machine translation systems and 41 system combination entries. We used t

  7. Machine translation of TV subtitles for large scale production

    OpenAIRE

    Volk, Martin; Sennrich, Rico; Hardmeier, Christian; Tidström, Frida

    2010-01-01

    This paper describes our work on building and employing Statistical Machine Translation systems for TV subtitles in Scandinavia. We have built translation systems for Danish, English, Norwegian and Swedish. They are used in daily subtitle production and translate large volumes. As an example we report on our evaluation results for three TV genres. We discuss our lessons learned in the system development process which shed interesting light on the practical use of Machine Translation technology.

  8. Evaluation of Hindi to Punjabi Machine Translation System

    CERN Document Server

    Goyal, Vishal

    2009-01-01

    Machine Translation in India is relatively young. The earliest efforts date from the late 80s and early 90s. The success of every system is judged from its evaluation experimental results. Number of machine translation systems has been started for development but to the best of author knowledge, no high quality system has been completed which can be used in real applications. Recently, Punjabi University, Patiala, India has developed Punjabi to Hindi Machine translation system with high accuracy of about 92%. Both the systems i.e. system under question and developed system are between same closely related languages. Thus, this paper presents the evaluation results of Hindi to Punjabi machine translation system. It makes sense to use same evaluation criteria as that of Punjabi to Hindi Punjabi Machine Translation System. After evaluation, the accuracy of the system is found to be about 95%.

  9. An Overall Perspective of Machine Translation with its Shortcomings

    Directory of Open Access Journals (Sweden)

    Alireza Akbari

    2014-01-01

    Full Text Available The petition for language translation has strikingly augmented recently due to cross-cultural communication and exchange of information. In order to communicate well, text should be translated correctly and completely in each field such as legal documents, technical texts, scientific texts, publicity leaflets, and instructional materials. In this connection, Machine translation is of great importance in translation. The term “Machine Translation” was first proposed by George Artsrouni and Smirnov Troyanski (1933 to design a storage design on paper tape. This paper sought to investigate an overall perspective of Machine Translation models and its metrics in detail. Finally, it scrutinized the ins and outs shortcomings of Machine Translation

  10. Quantum Neural Network Based Machine Translator for Hindi to English

    OpenAIRE

    Ravi Narayan; V. P. Singh; S. Chakraverty

    2014-01-01

    This paper presents the machine learning based machine translation system for Hindi to English, which learns the semantically correct corpus. The quantum neural based pattern recognizer is used to recognize and learn the pattern of corpus, using the information of part of speech of individual word in the corpus, like a human. The system performs the machine translation using its knowledge gained during the learning by inputting the pair of sentences of Devnagri-Hindi and English. To analyze t...

  11. Quantum Neural Network Based Machine Translator for Hindi to English

    OpenAIRE

    Ravi Narayan; Singh, V. P.; S. Chakraverty

    2014-01-01

    This paper presents the machine learning based machine translation system for Hindi to English, which learns the semantically correct corpus. The quantum neural based pattern recognizer is used to recognize and learn the pattern of corpus, using the information of part of speech of individual word in the corpus, like a human. The system performs the machine translation using its knowledge gained during the learning by inputting the pair of sentences of Devnagri-Hindi and English. To analyze t...

  12. LSTM Neural Reordering Feature for Statistical Machine Translation

    OpenAIRE

    Cui, Yiming; Wang, Shijin; Li, Jianfeng

    2015-01-01

    Artificial neural networks are powerful models, which have been widely applied into many aspects of machine translation, such as language modeling and translation modeling. Though notable improvements have been made in these areas, the reordering problem still remains a challenge in statistical machine translations. In this paper, we present a novel neural reordering model that directly models word pairs and alignment. By utilizing LSTM recurrent neural networks, much longer context could be ...

  13. Machine Translation as a Complex System: The Role of Esperanto

    Directory of Open Access Journals (Sweden)

    Federic Gobbo

    2015-04-01

    Full Text Available The history of machine translation and the history of Esperanto have long been connected, as they are two different ways to deal with the same problem: the problem of communication across language barriers. Language can be considered a Complex Adaptive System (CAS, and machine translation too. In fact, there are multiple agents (both natural and artificial involved, interacting with one another and committed to achieve a common goal, i.e., the machine translation task. The main characteristics of language as a CAS are also shared in machine translation, especially if we consider the example-based, statistical approach, which is nowadays paradigmatic and unavoidable. In fact, control is distributed, there is no ideal representing agent (intrinsic diversity, there are perpetual dynamics in performance, adapted through amplification and competition of new examples from the crowd of users. On the other hand, Esperanto, being a living language, can be considered a CAS, but of a special kind, because its intrinsic regularity in structure simplifies the task of machine translation, at least up to a certain level. This paper reviews how Esperanto has enhanced the development of human-machine communication in general and within machine translation in particular, tracing some prospects for further development of machine translation, where Esperanto could play a key role.

  14. Machine Translation in the German Classroom: Detection, Reaction, Prevention

    Science.gov (United States)

    Steding, Soren

    2009-01-01

    There are many websites today that offer free machine translations and although beginning students of German are not always proficient enough to judge the quality of these translations or to fully understand certain translation results, they use these services nonetheless for their assignments. The problem for the educator is to distinguish…

  15. Exploration and exploitation of multilingual data for statistical machine translation

    NARCIS (Netherlands)

    Carter, S.C.

    2012-01-01

    Shortly after the birth of computer science, researchers realised the importance of machine translation as a task worth of concentrated effort, but it is only recently that algorithms are able to provide automatic translations usable by the masses. Modern translation systems are dependent on bilingu

  16. Quantum Neural Network Based Machine Translator for Hindi to English

    Directory of Open Access Journals (Sweden)

    Ravi Narayan

    2014-01-01

    Full Text Available This paper presents the machine learning based machine translation system for Hindi to English, which learns the semantically correct corpus. The quantum neural based pattern recognizer is used to recognize and learn the pattern of corpus, using the information of part of speech of individual word in the corpus, like a human. The system performs the machine translation using its knowledge gained during the learning by inputting the pair of sentences of Devnagri-Hindi and English. To analyze the effectiveness of the proposed approach, 2600 sentences have been evaluated during simulation and evaluation. The accuracy achieved on BLEU score is 0.7502, on NIST score is 6.5773, on ROUGE-L score is 0.9233, and on METEOR score is 0.5456, which is significantly higher in comparison with Google Translation and Bing Translation for Hindi to English Machine Translation.

  17. Quantum neural network based machine translator for Hindi to English.

    Science.gov (United States)

    Narayan, Ravi; Singh, V P; Chakraverty, S

    2014-01-01

    This paper presents the machine learning based machine translation system for Hindi to English, which learns the semantically correct corpus. The quantum neural based pattern recognizer is used to recognize and learn the pattern of corpus, using the information of part of speech of individual word in the corpus, like a human. The system performs the machine translation using its knowledge gained during the learning by inputting the pair of sentences of Devnagri-Hindi and English. To analyze the effectiveness of the proposed approach, 2600 sentences have been evaluated during simulation and evaluation. The accuracy achieved on BLEU score is 0.7502, on NIST score is 6.5773, on ROUGE-L score is 0.9233, and on METEOR score is 0.5456, which is significantly higher in comparison with Google Translation and Bing Translation for Hindi to English Machine Translation.

  18. RuLearn: an Open-source Toolkit for the Automatic Inference of Shallow-transfer Rules for Machine Translation

    Directory of Open Access Journals (Sweden)

    Sánchez-Cartagena Víctor M.

    2016-10-01

    Full Text Available This paper presents ruLearn, an open-source toolkit for the automatic inference of rules for shallow-transfer machine translation from scarce parallel corpora and morphological dictionaries. ruLearn will make rule-based machine translation a very appealing alternative for under-resourced language pairs because it avoids the need for human experts to handcraft transfer rules and requires, in contrast to statistical machine translation, a small amount of parallel corpora (a few hundred parallel sentences proved to be sufficient. The inference algorithm implemented by ruLearn has been recently published by the same authors in Computer Speech & Language (volume 32. It is able to produce rules whose translation quality is similar to that obtained by using hand-crafted rules. ruLearn generates rules that are ready for their use in the Apertium platform, although they can be easily adapted to other platforms. When the rules produced by ruLearn are used together with a hybridisation strategy for integrating linguistic resources from shallow-transfer rule-based machine translation into phrase-based statistical machine translation (published by the same authors in Journal of Artificial Intelligence Research, volume 55, they help to mitigate data sparseness. This paper also shows how to use ruLearn and describes its implementation.

  19. Adaptation of machine translation for multilingual information retrieval in the medical domain.

    Science.gov (United States)

    Pecina, Pavel; Dušek, Ondřej; Goeuriot, Lorraine; Hajič, Jan; Hlaváčová, Jaroslava; Jones, Gareth J F; Kelly, Liadh; Leveling, Johannes; Mareček, David; Novák, Michal; Popel, Martin; Rosa, Rudolf; Tamchyna, Aleš; Urešová, Zdeňka

    2014-07-01

    We investigate machine translation (MT) of user search queries in the context of cross-lingual information retrieval (IR) in the medical domain. The main focus is on techniques to adapt MT to increase translation quality; however, we also explore MT adaptation to improve effectiveness of cross-lingual IR. Our MT system is Moses, a state-of-the-art phrase-based statistical machine translation system. The IR system is based on the BM25 retrieval model implemented in the Lucene search engine. The MT techniques employed in this work include in-domain training and tuning, intelligent training data selection, optimization of phrase table configuration, compound splitting, and exploiting synonyms as translation variants. The IR methods include morphological normalization and using multiple translation variants for query expansion. The experiments are performed and thoroughly evaluated on three language pairs: Czech-English, German-English, and French-English. MT quality is evaluated on data sets created within the Khresmoi project and IR effectiveness is tested on the CLEF eHealth 2013 data sets. The search query translation results achieved in our experiments are outstanding - our systems outperform not only our strong baselines, but also Google Translate and Microsoft Bing Translator in direct comparison carried out on all the language pairs. The baseline BLEU scores increased from 26.59 to 41.45 for Czech-English, from 23.03 to 40.82 for German-English, and from 32.67 to 40.82 for French-English. This is a 55% improvement on average. In terms of the IR performance on this particular test collection, a significant improvement over the baseline is achieved only for French-English. For Czech-English and German-English, the increased MT quality does not lead to better IR results. Most of the MT techniques employed in our experiments improve MT of medical search queries. Especially the intelligent training data selection proves to be very successful for domain adaptation of

  20. ARABIC-MALAY MACHINE TRANSLATION USING RULE-BASED APPROACH

    Directory of Open Access Journals (Sweden)

    Ahmed Jumaa Alsaket

    2014-01-01

    Full Text Available Arabic machine translation has been taking place in machine translation projects in recent years. This study concentrates on the translation of Arabic text to its equivalent in Malay language. The problem of this research is the syntactic and morphological differences between Arabic and Malay adjective sentences. The main aim of this study is to design and develop Arabic-Malay machine translation model. First, we analyze the adjective role in the Arabic and Malay languages. Based on this analysis, we identify the transfer bilingual rules form source language to target language so that the translation of source language to target language can be performed by computers successfully. Then, we build and implement a machine translation prototype called AMTS to translate from Arabic to Malay based on rule based approach. The system is evaluated on set of simple Arabic sentences. The techniques used to evaluate the correctness of the system translation are the BLEU metric algorithm and the human judgment. The results of the BLEU algorithm show that the AMTS system performs better than Google in the translation of Arabic sentences into Malay. In addition, the average accuracy given by human judges is 92.3% for our system and 75.3% for Google.

  1. An Evaluation of Output Quality of Machine Translation (Padideh Software vs. Google Translate)

    Science.gov (United States)

    Azer, Haniyeh Sadeghi; Aghayi, Mohammad Bagher

    2015-01-01

    This study aims to evaluate the translation quality of two machine translation systems in translating six different text-types, from English to Persian. The evaluation was based on criteria proposed by Van Slype (1979). The proposed model for evaluation is a black-box type, comparative and adequacy-oriented evaluation. To conduct the evaluation, a…

  2. Bean Soup Translation: Flexible, Linguistically-Motivated Syntax for Machine Translation

    Science.gov (United States)

    Mehay, Dennis Nolan

    2012-01-01

    Machine translation (MT) systems attempt to translate texts from one language into another by translating words from a "source language" and rearranging them into fluent utterances in a "target language." When the two languages organize concepts in very different ways, knowledge of their general sentence structure, or…

  3. Machine Translation-Assisted Language Learning: Writing for Beginners

    Science.gov (United States)

    Garcia, Ignacio; Pena, Maria Isabel

    2011-01-01

    The few studies that deal with machine translation (MT) as a language learning tool focus on its use by advanced learners, never by beginners. Yet, freely available MT engines (i.e. Google Translate) and MT-related web initiatives (i.e. Gabble-on.com) position themselves to cater precisely to the needs of learners with a limited command of a…

  4. Machine Translation with Many Manually Labeled Discourse Connectives

    OpenAIRE

    Meyer, Thomas; Polakova, Lucie

    2013-01-01

    The paper presents machine translation experiments from English to Czech with a large amount of manually annotated discourse connectives. The gold-standard discourse relation annotation leads to better translation performance in ranges of 4–60% for some ambiguous English connectives and helps to find correct syntactical constructs in Czech for less ambiguous connectives. Automatic scoring confirms the stability of the newly built discourse-aware translation systems. Error analysis and human t...

  5. Approaches to handle scarce resources for Bengali statistical machine translation

    OpenAIRE

    Roy, Maxim

    2010-01-01

    Machine translation (MT) is a hard problem because of the highly complex, irregular and diverse nature of natural language. MT refers to computerized systems that utilize software to translate text from one natural language into another with or without human assistance. It is impossible to accurately model all the linguistic rules and relationships that shape the translation process, and therefore MT has to make decisions based on incomplete data. In order to handle this incomplete data, a pr...

  6. Integrating Automatic Speech Recognition and Machine Translation for Better Translation Outputs

    DEFF Research Database (Denmark)

    Liyanapathirana, Jeevanthi

    than typing, making the translation process faster. The spoken translation is analyzed and combined with the machine translation output of the same sentence using different methods. We study a number of different translation models in the context of n-best list rescoring methods. As an alternative...... to the n-best list rescoring, we also use word graphs with the expectation of arriving at a tighter integration of ASR and MT models. Integration methods include constraining ASR models using language and translation models of MT, and vice versa. We currently develop and experiment different methods...... on the Danish – English language pair, with the use of a speech corpora and parallel text. The methods are investigated to check ways that the accuracy of the spoken translation of the translator can be increased with the use of machine translation outputs, which would be useful for potential computer...

  7. Linguistically motivated statistical machine translation models and algorithms

    CERN Document Server

    Xiong, Deyi

    2015-01-01

    This book provides a wide variety of algorithms and models to integrate linguistic knowledge into Statistical Machine Translation (SMT). It helps advance conventional SMT to linguistically motivated SMT by enhancing the following three essential components: translation, reordering and bracketing models. It also serves the purpose of promoting the in-depth study of the impacts of linguistic knowledge on machine translation. Finally it provides a systematic introduction of Bracketing Transduction Grammar (BTG) based SMT, one of the state-of-the-art SMT formalisms, as well as a case study of linguistically motivated SMT on a BTG-based platform.

  8. Domain adaptation of statistical machine translation with domain-focused web crawling.

    Science.gov (United States)

    Pecina, Pavel; Toral, Antonio; Papavassiliou, Vassilis; Prokopidis, Prokopis; Tamchyna, Aleš; Way, Andy; van Genabith, Josef

    In this paper, we tackle the problem of domain adaptation of statistical machine translation (SMT) by exploiting domain-specific data acquired by domain-focused crawling of text from the World Wide Web. We design and empirically evaluate a procedure for automatic acquisition of monolingual and parallel text and their exploitation for system training, tuning, and testing in a phrase-based SMT framework. We present a strategy for using such resources depending on their availability and quantity supported by results of a large-scale evaluation carried out for the domains of environment and labour legislation, two language pairs (English-French and English-Greek) and in both directions: into and from English. In general, machine translation systems trained and tuned on a general domain perform poorly on specific domains and we show that such systems can be adapted successfully by retuning model parameters using small amounts of parallel in-domain data, and may be further improved by using additional monolingual and parallel training data for adaptation of language and translation models. The average observed improvement in BLEU achieved is substantial at 15.30 points absolute.

  9. Precise machine translation of computer science study

    CSIR Research Space (South Africa)

    Marais, L

    2015-07-01

    Full Text Available mobile (Android) application for translating discrete mathematics definitions between English and Afrikaans. The main component of the system is a Grammatical Framework (GF) application grammar which produces syntactically and semantically accurate...

  10. Machine Translation in the Hands of Trainee Translators – an Empirical Study

    Directory of Open Access Journals (Sweden)

    Sycz-Opoń Joanna

    2017-03-01

    Full Text Available Automated translation (machine translation, MT is systematically gaining popularity among professional translators, who claim that editing MT output requires less time and effort than translating from scratch. MT technology is also offered in leading translator’s workstations, e.g., SDL Trados Studio, memoQ, Déjà Vu and Wordfast. Therefore, the dilemma arises: should MT be introduced into formal translation training? In order to answer this question, first, it is necessary to understand how trainee translators actually use MT.

  11. Machine Translation Using Constraint-Based Synchronous Grammar

    Institute of Scientific and Technical Information of China (English)

    WONG Fai; DONG Mingchui; HU Dongcheng

    2006-01-01

    A synchronous grammar based on the formalism of context-free grammar was developed by generalizing the first component of production that models the source text. Unlike other synchronous grammars,the grammar allows multiple target productions to be associated to a single production rule which can be used to guide a parser to infer different possible translational equivalences for a recognized input string according to the feature constraints of symbols in the pattern. An extended generalized LR algorithm was adapted to the parsing of the proposed formalism to analyze the syntactic structure of a language. The grammar was used as the basis for building a machine translation system for Portuguese to Chinese translation. The empirical results show that the grammar is more expressive when modeling the translational equivalences of parallel texts for machine translation and grammar rewriting applications.

  12. English to Sanskrit Machine Translation Using Transfer Based approach

    Science.gov (United States)

    Pathak, Ganesh R.; Godse, Sachin P.

    2010-11-01

    Translation is one of the needs of global society for communicating thoughts and ideas of one country with other country. Translation is the process of interpretation of text meaning and subsequent production of equivalent text, also called as communicating same meaning (message) in another language. In this paper we gave detail information on how to convert source language text in to target language text using Transfer Based Approach for machine translation. Here we implemented English to Sanskrit machine translator using transfer based approach. English is global language used for business and communication but large amount of population in India is not using and understand the English. Sanskrit is ancient language of India most of the languages in India are derived from Sanskrit. Sanskrit can be act as an intermediate language for multilingual translation.

  13. English to Tamil machine translation system using universal networking language

    Indian Academy of Sciences (India)

    RAJESWARI SRIDHAR; PAVITHRA SETHURAMAN; KASHYAP KRISHNAKUMAR

    2016-06-01

    This paper proposes English to Tamil machine translation system, using the universal networking language (UNL) as the intermediate representation. The UNL approach is a hybrid approach of the rule and knowledge-based approaches to machine translation. UNL is a declarative formal language, specifically designed to represent semantic data extracted from a natural language text. The input English sentence is converted to UNL (enconversion), which is then converted to a Tamil sentence (deconversion) by ensuring thatthe meaning of the input sentence is preserved. The representation of UNL was modified to suit the translation process. A new sentence formation algorithm was also proposed to rearrange the translated Tamil words to sentences. The translation system was evaluated using bilingual evaluation understudy (BLEU) score. A BLEU score of 0.581 was achieved, which is an indication that most of the information in the input sentence is retained in the translated sentence. The scores obtained using the UNL based approach were compared with existingapproaches to translation, and it can be concluded that the UNL is a more suited approach to machine translation.

  14. Using example-based machine translation to translate DVD subtitles

    DEFF Research Database (Denmark)

    Flanagan, Marian

    allotted to produce the subtitles have both decreased. Therefore, this market is recognised as a potential real-world application of MT. Recent publications have introduced Corpus-Based MT approaches to translate subtitles. An SMT system has been implemented in a Swedish subtitling company to translate...... between Swedish and Danish and Swedish and Norwegian subtitles, with the company already reporting a successful return on their investment. The hybrid EBMT/SMT system used in the current research, on the other hand, remains within the confines of academic research, and the real potential of the system...... more accessible for use within the AVT industry, and to put the system in a position where it could compete commercially with the SMT approach....

  15. TRANSLATOR OF FINITE STATE MACHINE MODEL PARAMETERS FROM MATLAB ENVIRONMENT INTO HUMAN-MACHINE INTERFACE APPLICATION

    OpenAIRE

    2012-01-01

    Technology and means for automatic translation of FSM model parameters from Matlab application to human-machine interface application is proposed. The example of technology application to the electric apparatus model is described.

  16. Machine translation with minimal reliance on parallel resources

    CERN Document Server

    Tambouratzis, George; Sofianopoulos, Sokratis

    2017-01-01

    This book provides a unified view on a new methodology for Machine Translation (MT). This methodology extracts information from widely available resources (extensive monolingual corpora) while only assuming the existence of a very limited parallel corpus, thus having a unique starting point to Statistical Machine Translation (SMT). In this book, a detailed presentation of the methodology principles and system architecture is followed by a series of experiments, where the proposed system is compared to other MT systems using a set of established metrics including BLEU, NIST, Meteor and TER. Additionally, a free-to-use code is available, that allows the creation of new MT systems. The volume is addressed to both language professionals and researchers. Prerequisites for the readers are very limited and include a basic understanding of the machine translation as well as of the basic tools of natural language processing.

  17. Good Applications for Crummy Machine Translation

    Science.gov (United States)

    1991-07-01

    multilingual 16 computational linguistics __.__n _____ODE 16. PRICE CODE 17. SECURITY CLASSIFICTION 18. SECURITY CLASSIFICATION 19. SECURITY...relatively few keystrokes. Peter Brown (personal communication) once remarked that such a super-fast typewriter ought to be possible in the monolingual case... Multilingual System under Development, Computational Linguistics, 11:2-3, pp. 155-169. [81 Kay, M. (1980) "The Proper Place of Men and Machines in Language

  18. Translating DVD Subtitles English-German, English-Japanese, Using Example-based Machine Translation

    DEFF Research Database (Denmark)

    Armstrong, Stephen; Caffrey, Colm; Flanagan, Marian

    2006-01-01

    Due to limited budgets and an ever-diminishing time-frame for the production of subtitles for movies released in cinema and DVD, there is a compelling case for a technology-based translation solution for subtitles. In this paper we describe how an Example-Based Machine Translation (EBMT) approach...

  19. Transliteration normalization for Information Extraction and Machine Translation

    Directory of Open Access Journals (Sweden)

    Yuval Marton

    2014-12-01

    Full Text Available Foreign name transliterations typically include multiple spelling variants. These variants cause data sparseness and inconsistency problems, increase the Out-of-Vocabulary (OOV rate, and present challenges for Machine Translation, Information Extraction and other natural language processing (NLP tasks. This work aims to identify and cluster name spelling variants using a Statistical Machine Translation method: word alignment. The variants are identified by being aligned to the same “pivot” name in another language (the source-language in Machine Translation settings. Based on word-to-word translation and transliteration probabilities, as well as the string edit distance metric, names with similar spellings in the target language are clustered and then normalized to a canonical form. With this approach, tens of thousands of high-precision name transliteration spelling variants are extracted from sentence-aligned bilingual corpora in Arabic and English (in both languages. When these normalized name spelling variants are applied to Information Extraction tasks, improvements over strong baseline systems are observed. When applied to Machine Translation tasks, a large improvement potential is shown.

  20. Statistical Machine Translation Features with Multitask Tensor Networks

    OpenAIRE

    Setiawan, Hendra; Huang, Zhongqiang; Devlin, Jacob; Lamar, Thomas; Zbib, Rabih; Schwartz, Richard; Makhoul, John

    2015-01-01

    We present a three-pronged approach to improving Statistical Machine Translation (SMT), building on recent success in the application of neural networks to SMT. First, we propose new features based on neural networks to model various non-local translation phenomena. Second, we augment the architecture of the neural network with tensor layers that capture important higher-order interaction among the network units. Third, we apply multitask learning to estimate the neural network parameters joi...

  1. Neural Networks Classifier for Data Selection in Statistical Machine Translation

    OpenAIRE

    Peris, Álvaro; Chinea-Rios, Mara; Casacuberta, Francisco

    2016-01-01

    We address the data selection problem in statistical machine translation (SMT) as a classification task. The new data selection method is based on a neural network classifier. We present a new method description and empirical results proving that our data selection method provides better translation quality, compared to a state-of-the-art method (i.e., Cross entropy). Moreover, the empirical results reported are coherent across different language pairs.

  2. On the Properties of Neural Machine Translation: Encoder-Decoder Approaches

    OpenAIRE

    Cho, Kyunghyun; van Merrienboer, Bart; Bahdanau, Dzmitry; Bengio, Yoshua

    2014-01-01

    Neural machine translation is a relatively new approach to statistical machine translation based purely on neural networks. The neural machine translation models often consist of an encoder and a decoder. The encoder extracts a fixed-length representation from a variable-length input sentence, and the decoder generates a correct translation from this representation. In this paper, we focus on analyzing the properties of the neural machine translation using two models; RNN Encoder--Decoder and...

  3. Complex Networks Analysis of Manual and Machine Translations

    Science.gov (United States)

    Amancio, Diego R.; Antiqueira, Lucas; Pardo, Thiago A. S.; da F. Costa, Luciano; Oliveira, Osvaldo N.; Nunes, Maria G. V.

    Complex networks have been increasingly used in text analysis, including in connection with natural language processing tools, as important text features appear to be captured by the topology and dynamics of the networks. Following previous works that apply complex networks concepts to text quality measurement, summary evaluation, and author characterization, we now focus on machine translation (MT). In this paper we assess the possible representation of texts as complex networks to evaluate cross-linguistic issues inherent in manual and machine translation. We show that different quality translations generated by MT tools can be distinguished from their manual counterparts by means of metrics such as in- (ID) and out-degrees (OD), clustering coefficient (CC), and shortest paths (SP). For instance, we demonstrate that the average OD in networks of automatic translations consistently exceeds the values obtained for manual ones, and that the CC values of source texts are not preserved for manual translations, but are for good automatic translations. This probably reflects the text rearrangements humans perform during manual translation. We envisage that such findings could lead to better MT tools and automatic evaluation metrics.

  4. Foreign Developments in Information Processing and Machine Translation

    Science.gov (United States)

    2007-11-02

    on developments in the following fields of language data processing: Machine translation studies; questions on structural linguistics, phonological ...34katakana" or in phonetic syllables without the use of any "kanji" or Chinese characters, the "kun"- pronunciation of Chinese characters was adopted as

  5. Human in the Loop Machine Translation of Medical Terminology

    Science.gov (United States)

    2010-04-01

    March 30–31, 2009, http://www.aclweb.org/anthology/W/W09/W09-0x24. 3. Chiang, D.; Lopez, A.; Madnani, N.; Monz, C.; Resnik , P.; Subotin, M. The Hiero... Resnik , P. S. Machine Translation by Pattern Matching; University of Maryland at College Park, College Park, MD, 2008, ISBN: 978-0-549-57255-8

  6. Translating DVD Subtitles English-German, English-Japanese, Using Example-based Machine Translation

    DEFF Research Database (Denmark)

    Armstrong, Stephen; Caffrey, Colm; Flanagan, Marian

    2006-01-01

    Due to limited budgets and an ever-diminishing time-frame for the production of subtitles for movies released in cinema and DVD, there is a compelling case for a technology-based translation solution for subtitles. In this paper we describe how an Example-Based Machine Translation (EBMT) approach...... to the translation of English DVD subtitles into German and Japanese can aid the subtitler. Our research focuses on an EBMT tool that produces fully automated translations, which in turn can be edited if required. To our knowledge this is the first time that any EBMT approach has been used with DVD subtitle...

  7. JTEC panel report on machine translation in Japan

    Science.gov (United States)

    Carbonell, Jaime; Rich, Elaine; Johnson, David; Tomita, Masaru; Vasconcellos, Muriel; Wilks, Yorick

    1992-01-01

    The goal of this report is to provide an overview of the state of the art of machine translation (MT) in Japan and to provide a comparison between Japanese and Western technology in this area. The term 'machine translation' as used here, includes both the science and technology required for automating the translation of text from one human language to another. Machine translation is viewed in Japan as an important strategic technology that is expected to play a key role in Japan's increasing participation in the world economy. MT is seen in Japan as important both for assimilating information into Japanese as well as for disseminating Japanese information throughout the world. Most of the MT systems now available in Japan are transfer-based systems. The majority of them exploit a case-frame representation of the source text as the basis of the transfer process. There is a gradual movement toward the use of deeper semantic representations, and some groups are beginning to look at interlingua-based systems.

  8. A Study on Automatic Scoring for Machine Translation Systems

    Institute of Scientific and Technical Information of China (English)

    Yao Jianmin(姚建民); Zhang Jing; Zhao Tiejun; Li Sheng

    2004-01-01

    String similarity measures of edit distance, cosine correlation and Dice coefficient are adopted to evaluate machine translation results. Experiment shows that the evaluation method distinguishes well between "good" and "bad" translations. Another experiment manifests a consistency between human and automatic scorings of 6 general-purpose MT systems. Equational analysis validates the experimental results. Although the data and graphs are very promising, correlation coefficient and significance tests at 0.01 level are made to ensure the reliability of the results. Linear regression is made to map the automatic scoring results to human scorings.

  9. Empirical Investigation of Optimization Algorithms in Neural Machine Translation

    Directory of Open Access Journals (Sweden)

    Bahar Parnia

    2017-06-01

    Full Text Available Training neural networks is a non-convex and a high-dimensional optimization problem. In this paper, we provide a comparative study of the most popular stochastic optimization techniques used to train neural networks. We evaluate the methods in terms of convergence speed, translation quality, and training stability. In addition, we investigate combinations that seek to improve optimization in terms of these aspects. We train state-of-the-art attention-based models and apply them to perform neural machine translation. We demonstrate our results on two tasks: WMT 2016 En→Ro and WMT 2015 De→En.

  10. INTEGRATING MACHINE TRANSLATION AND SPEECH SYNTHESIS COMPONENT FOR ENGLISH TO DRAVIDIAN LANGUAGE SPEECH TO SPEECH TRANSLATION SYSTEM

    Directory of Open Access Journals (Sweden)

    J. SANGEETHA

    2015-02-01

    Full Text Available This paper provides an interface between the machine translation and speech synthesis system for converting English speech to Tamil text in English to Tamil speech to speech translation system. The speech translation system consists of three modules: automatic speech recognition, machine translation and text to speech synthesis. Many procedures for incorporation of speech recognition and machine translation have been projected. Still speech synthesis system has not yet been measured. In this paper, we focus on integration of machine translation and speech synthesis, and report a subjective evaluation to investigate the impact of speech synthesis, machine translation and the integration of machine translation and speech synthesis components. Here we implement a hybrid machine translation (combination of rule based and statistical machine translation and concatenative syllable based speech synthesis technique. In order to retain the naturalness and intelligibility of synthesized speech Auto Associative Neural Network (AANN prosody prediction is used in this work. The results of this system investigation demonstrate that the naturalness and intelligibility of the synthesized speech are strongly influenced by the fluency and correctness of the translated text.

  11. Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation

    DEFF Research Database (Denmark)

    Liyanapathirana, Jeevanthi; Popescu-Belis, Andrei

    2016-01-01

    This paper presents a solution to evaluate spoken post-editing of imperfect machine translation output by a human translator. We compare two approaches to the combination of machine translation (MT) and automatic speech recognition (ASR): a heuristic algorithm and a machine learning method...

  12. Building a Large-Scale Knowledge Base for Machine Translation

    CERN Document Server

    Knight, K; Knight, Kevin; Luk, Steve K.

    1994-01-01

    Knowledge-based machine translation (KBMT) systems have achieved excellent results in constrained domains, but have not yet scaled up to newspaper text. The reason is that knowledge resources (lexicons, grammar rules, world models) must be painstakingly handcrafted from scratch. One of the hypotheses being tested in the PANGLOSS machine translation project is whether or not these resources can be semi-automatically acquired on a very large scale. This paper focuses on the construction of a large ontology (or knowledge base, or world model) for supporting KBMT. It contains representations for some 70,000 commonly encountered objects, processes, qualities, and relations. The ontology was constructed by merging various online dictionaries, semantic networks, and bilingual resources, through semi-automatic methods. Some of these methods (e.g., conceptual matching of semantic taxonomies) are broadly applicable to problems of importing/exporting knowledge from one KB to another. Other methods (e.g., bilingual match...

  13. A Reordering Model Using a Source-Side Parse-Tree for Statistical Machine Translation

    Science.gov (United States)

    Hashimoto, Kei; Yamamoto, Hirofumi; Okuma, Hideo; Sumita, Eiichiro; Tokuda, Keiichi

    This paper presents a reordering model using a source-side parse-tree for phrase-based statistical machine translation. The proposed model is an extension of IST-ITG (imposing source tree on inversion transduction grammar) constraints. In the proposed method, the target-side word order is obtained by rotating nodes of the source-side parse-tree. We modeled the node rotation, monotone or swap, using word alignments based on a training parallel corpus and source-side parse-trees. The model efficiently suppresses erroneous target word orderings, especially global orderings. Furthermore, the proposed method conducts a probabilistic evaluation of target word reorderings. In English-to-Japanese and English-to-Chinese translation experiments, the proposed method resulted in a 0.49-point improvement (29.31 to 29.80) and a 0.33-point improvement (18.60 to 18.93) in word BLEU-4 compared with IST-ITG constraints, respectively. This indicates the validity of the proposed reordering model.

  14. Analysis of MultiWord Expression Translation Errors in Statistical Machine Translation

    DEFF Research Database (Denmark)

    Klyueva, Natalia; Liyanapathirana, Jeevanthi

    2015-01-01

    In this paper, we analyse the usage of multiword expressions (MWE) in Statistical Machine Translation (SMT). We exploit the Moses SMT toolkit to train models for French-English and Czech-Russian language pairs. For each language pair, two models were built: a baseline model without additional MWE...

  15. Statistical translation with scarce resources: a South African case study

    CSIR Research Space (South Africa)

    Ronald, K

    2006-11-01

    Full Text Available , applying the algorithm in section 2. 4. Statistical Phrase-based Translation We evaluated the performance of our approach using the phrased-based translation framework. In this framework the task is to find that English sentence e given a foreign...- tax and morphology in the target language for example noun-phrase agreement and subject-verb agreement. Fur- ther more we plan to investigate the benefits of merging phrase-based and syntax-based systems to build a hybrid machine translation system...

  16. Word Transition Entropy as an Indicator for Expected Machine Translation Quality

    DEFF Research Database (Denmark)

    Carl, Michael; Schaeffer, Moritz

    2014-01-01

    While most machine translation evaluation techniques (BLEU, NIST, TER, METEOR) assess translation quality based on a set of reference translations, we suggest to evaluate the literality of a set of (human or machine generated) translations to infer their potential quality. We provide evidence which...... suggests that more literal translations are produced more easily, by humans and machine, and are also less error prone. Literal translations may not be appropriate or even possible for all languages, types of texts, and translation purposes. However, in this paper we show that an assessment...... of the literality of translations allows us to (1) evaluate human and machine translations in a similar fashion and (2) may be instrumental to predict machine translation quality scores,...

  17. ScaleMT: a free/open-source framework for building scalable machine translation web services

    OpenAIRE

    Sánchez-Cartagena, Víctor M.; Pérez-Ortiz, Juan Antonio

    2009-01-01

    Machine translation web services usage is growing amazingly mainly because of the translation quality and reliability of the service provided by the Google Ajax Language API. To allow the open-source machine ranslation projects to compete with Google’s one and gain visibility on the internet, we have developed ScaleMT: a free/open-source framework that exposes existing machine translation engines as public web services. This framework is highly scalable as it can run coordinately on many serv...

  18. Quantitative evaluation of English-Japanese machine translation of medical literature.

    Science.gov (United States)

    Kiuchi, T; Kaihara, S

    1991-08-01

    Although many machine-translation programs are currently available, few evaluation methods of such translation exist for any given application area. It is difficult to evaluate machine-translation systems objectively because the quality of a translation depends on the combination of three factors: the translation program, the dictionary, and the original document. In this study, we developed a quantitative evaluation method for assessing machine translation, which evaluates these three factors separately. We applied this method to the translation of English to Japanese for medical literature and the method proved to be a good indicator for further system improvement. Using this method we also discovered other important points for machine translation, such as the examination of target documents for the construction of a better application dictionary.

  19. A discourse based approach in text-based machine translation

    CERN Document Server

    Ullah, Sana; Kwak, Kyung Sup

    2009-01-01

    This paper presents a theoretical research based approach to ellipsis resolution in machine translation. Moreover, the formula of discourse is applied in order to resolve ellipses. The validity of the discourse formula is analyzed by applying it to the real world text i.e. newspaper fragments. The source text is converted into mono-sentential discourses where complex discourses require further dissection either directly into primitive discourses or first into compound discourses and later into primitive ones. The procedure of dissection needs further improvement i.e. discovering as many primitive discourse forms as possible [1]. This work is further improvement to the concepts presented by Khan (Khan, 1995). Likewise, an attempt has been made to investigate new primitive discourses i.e. patterns from the given text.

  20. Machine Translation as a complex system, and the phenomenon of Esperanto

    NARCIS (Netherlands)

    Gobbo, F.

    2015-01-01

    The history of machine translation and the history of Esperanto have long been connected, as they are two different ways to deal with the same problem: the problem of communication across language barriers. Language can be considered a Complex Adaptive System (CAS), and machine translation too. In f

  1. Exploring the Further Integration of Machine Translation in English-Chinese Cross Language Information Access

    Science.gov (United States)

    Wu, Dan; He, Daqing

    2012-01-01

    Purpose: This paper seeks to examine the further integration of machine translation technologies with cross language information access in providing web users the capabilities of accessing information beyond language barriers. Machine translation and cross language information access are related technologies, and yet they have their own unique…

  2. Impact of Machine-Translated Text on Entity and Relationship Extraction

    Science.gov (United States)

    2014-12-01

    Impact of Machine-Translated Text on Entity and Relationship Extraction by Mark R Mittrick and John T Richardson ARL-TN-0649 December...2014 Impact of Machine-Translated Text on Entity and Relationship Extraction Mark R Mittrick and John T Richardson Computational and...

  3. Machine Translation as a complex system, and the phenomenon of Esperanto

    NARCIS (Netherlands)

    Gobbo, F.

    2015-01-01

    The history of machine translation and the history of Esperanto have long been connected, as they are two different ways to deal with the same problem: the problem of communication across language barriers. Language can be considered a Complex Adaptive System (CAS), and machine translation too. In f

  4. Analysing Quality of English-Hindi Machine Translation Engine outputs using Baysian Classification

    OpenAIRE

    Rashmi Gupta; Nisheeth Joshi; Iti Mathur

    2013-01-01

    This paper considers the problem for estimating the quality of machine translation outputs which are independent of human intervention and are generally addressed using machine learning techniques.There are various measures through which a machine learns translations quality. Automatic Evaluation metrics produce good co-relation at corpus level but cannot produce the same results at the same segment or sentence level. In this paper 16 features are extracted from the input sentence...

  5. An Efficient Machine Translation System for English to Indian Languages Using Hybrid Mechanism

    Directory of Open Access Journals (Sweden)

    J. Sangeetha

    2014-08-01

    Full Text Available Machine Translation is an essential approach for localization, and is especially appropriate in a linguistically diversenation like India. Automatic translation between languages which are morphologically rich and syntactically different is generally regarded as a complex task. A number of machine translation systems have been proposed in literature. But, conventional rule-based machine translation system is costly in terms of formulating rules. It introduces inconsistencies, and it is inflexible to be robust. Statistical MT is an approach that automatically attains knowledge from a vast amount of training data. This approach is characterized by the use of machine learning techniques. But, still there is scope for better performance of the system. In this paper, a Hybrid Machine Translation (HMT approach is proposed which is the combination of rule based and statistical technique for translating text from English to Indian languages such as Tamil, Malayalam and Hindi. The rule based machine translation technique, involves the formation of rules which helps to re-order the syntactic structures of the source language sentence along with its dependency information which brings close to the structure of the target sentence. The parser identifies the syntactical elements in English sentences and suggests its Indian languages translation taking into account various grammatical forms of those Indian languages. Context Free Grammars (CFG is used in generation of the language structures, and then the errors in the translated sentences are corrected by applying a statistical technique. Simplifying and segmenting an input language text becomes mandatory in order to improve the machine translation quality. The experimental results show that the proposed approach competes with the machine translation methods reported in the literature and it provides the best translated output in each language.

  6. Domain Adaptation for Machine Translation with Instance Selection

    Directory of Open Access Journals (Sweden)

    Biçici Ergun

    2015-04-01

    Full Text Available Domain adaptation for machine translation (MT can be achieved by selecting training instances close to the test set from a larger set of instances. We consider 7 different domain adaptation strategies and answer 7 research questions, which give us a recipe for domain adaptation in MT. We perform English to German statistical MT (SMT experiments in a setting where test and training sentences can come from different corpora and one of our goals is to learn the parameters of the sampling process. Domain adaptation with training instance selection can obtain 22% increase in target 2-gram recall and can gain up to 3:55 BLEU points compared with random selection. Domain adaptation with feature decay algorithm (FDA not only achieves the highest target 2-gram recall and BLEU performance but also perfectly learns the test sample distribution parameter with correlation 0:99. Moses SMT systems built with FDA selected 10K training sentences is able to obtain F1 results as good as the baselines that use up to 2M sentences. Moses SMT systems built with FDA selected 50K training sentences is able to obtain F1 point better results than the baselines.

  7. Can multilingual machine translation help make medical record content more comprehensible to patients?

    Science.gov (United States)

    Zeng-Treitler, Qing; Kim, Hyeoneui; Rosemblat, Graciela; Keselman, Alla

    2010-01-01

    With the development of electronic personal health records, more patients are gaining access to their own medical records. However, comprehension of medical record content remains difficult for many patients. Because each record is unique, it is also prohibitively costly to employ human translators to solve this problem. In this study, we investigated whether multilingual machine translation could help make medical record content more comprehensible to patients who lack proficiency in the language of the records. We used a popular general-purpose machine translation tool called Babel Fish to translate 213 medical record sentences from English into Spanish, Chinese, Russian and Korean. We evaluated the comprehensibility and accuracy of the translation. The text characteristics of the incorrectly translated sentences were also analyzed. In each language, the majority of the translations were incomprehensible (76% to 92%) and/or incorrect (77% to 89%). The main causes of the translation are vocabulary difficulty and syntactical complexity. A general-purpose machine translation tool like the Babel Fish is not adequate for the translation of medical records; however, a machine translation tool can potentially be improved significantly, if it is trained to target certain narrow domains in medicine.

  8. Quantifying the Efficiency of a Translator: The Effect of Syntactical and Literal Written Translations on Language Comprehension Using the Machine Translation System FALCon

    Science.gov (United States)

    McCulloh, Ian A.; Morton, Jillian; Jantzi, Jennifer K.; Rodriguez, Amy M.; Graham, John

    2008-01-01

    This study introduces a new method of evaluating human comprehension in the context of machine translation using a language translation program known as the FALCon (Forward Area Language Converter). The participants include 48 freshmen from the United States Military Academy enrolled in the General Psychology course, PL100. Results of this study…

  9. Quantifying the Efficiency of a Translator: The Effect of Syntactical and Literal Written Translations on Language Comprehension Using the Machine Translation System FALCon (Foreign Area Language Converter)

    Science.gov (United States)

    McCulloh, Ian A.; Morton, Jillian; Jantzi, Jennifer K.; Rodriguez, Amy M.; Graham, John

    2008-01-01

    The purpose of this study is to introduce a new method of evaluating human comprehension in the context of machine translation using a language translation program known as the FALCon (Forward Area Language Converter). The FALCon works by converting documents into digital images via scanner, and then converting those images to electronic text by…

  10. Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation

    DEFF Research Database (Denmark)

    Liyanapathirana, Jeevanthi; Popescu-Belis, Andrei

    2016-01-01

    This paper presents a solution to evaluate spoken post-editing of imperfect machine translation output by a human translator. We compare two approaches to the combination of machine translation (MT) and automatic speech recognition (ASR): a heuristic algorithm and a machine learning method....... To obtain a data set with spoken post-editing information, we use the French version of TED talks as the source texts submitted to MT, and the spoken English counterparts as their corrections, which are submitted to an ASR system. We experiment with various levels of artificial ASR noise and also...

  11. Breaking the language barrier: machine assisted diagnosis using the medical speech translator.

    Science.gov (United States)

    Starlander, Marianne; Bouillon, Pierrette; Rayner, Manny; Chatzichrisafis, Nikos; Hockey, Beth Ann; Isahara, Hitoshi; Kanzaki, Kyoko; Nakao, Yukie; Santaholma, Marianne

    2005-01-01

    In this paper, we describe and evaluate an Open Source medical speech translation system (MedSLT) intended for safety-critical applications. The aim of this system is to eliminate the language barriers in emergency situation. It translates spoken questions from English into French, Japanese and Finnish in three medical subdomains (headache, chest pain and abdominal pain), using a vocabulary of about 250-400 words per sub-domain. The architecture is a compromise between fixed-phrase translation on one hand and complex linguistically-based systems on the other. Recognition is guided by a Context Free Grammar Language Model compiled from a general unification grammar, automatically specialised for the domain. We present an evaluation of this initial prototype that shows the advantages of this grammar-based approach for this particular translation task in term of both reliability and use.

  12. Machine Transliteration

    CERN Document Server

    Knight, K; Knight, Kevin; Graehl, Jonathan

    1997-01-01

    It is challenging to translate names and technical terms across languages with different alphabets and sound inventories. These items are commonly transliterated, i.e., replaced with approximate phonetic equivalents. For example, "computer" in English comes out as "konpyuutaa" in Japanese. Translating such items from Japanese back to English is even more challenging, and of practical interest, as transliterated items make up the bulk of text phrases not found in bilingual dictionaries. We describe and evaluate a method for performing backwards transliterations by machine. This method uses a generative model, incorporating several distinct stages in the transliteration process.

  13. The Dostoevsky Machine in Georgetown: scientific translation in the Cold War.

    Science.gov (United States)

    Gordin, Michael D

    2016-04-01

    Machine Translation (MT) is now ubiquitous in discussions of translation. The roots of this phenomenon - first publicly unveiled in the so-called 'Georgetown-IBM Experiment' on 9 January 1954 - displayed not only the technological utopianism still associated with dreams of a universal computer translator, but was deeply enmeshed in the political pressures of the Cold War and a dominating conception of scientific writing as both the goal of machine translation as well as its method. Machine translation was created, in part, as a solution to a perceived crisis sparked by the massive expansion of Soviet science. Scientific prose was also perceived as linguistically simpler, and so served as the model for how to turn a language into a series of algorithms. This paper follows the rise of the Georgetown program - the largest single program in the world - from 1954 to the (as it turns out, temporary) collapse of MT in 1964.

  14. ADAPTING HYBRID MACHINE TRANSLATION TECHNIQUES FOR CROSS-LANGUAGE TEXT RETRIEVAL SYSTEM

    Directory of Open Access Journals (Sweden)

    P. ISWARYA

    2017-03-01

    Full Text Available This research work aims in developing Tamil to English Cross - language text retrieval system using hybrid machine translation approach. The hybrid machine translation system is a combination of rule based and statistical based approaches. In an existing word by word translation system there are lot of issues and some of them are ambiguity, Out-of-Vocabulary words, word inflections, and improper sentence structure. To handle these issues, proposed architecture is designed in such a way that, it contains Improved Part-of-Speech tagger, machine learning based morphological analyser, collocation based word sense disambiguation procedure, semantic dictionary, and tense markers with gerund ending rules, and two pass transliteration algorithm. From the experimental results it is clear that the proposed Tamil Query based translation system achieves significantly better translation quality over existing system, and reaches 95.88% of monolingual performance.

  15. An Evaluative Study of Machine Translation in the EFL Scenario of Saudi Arabia

    Directory of Open Access Journals (Sweden)

    Raneem Khalid Al-Tuwayrish

    2016-02-01

    Full Text Available Artificial Intelligence or AI as it is popularly known and its corollary, Machine Translation (MT have long engaged scientists, thinkers and linguists alike in the twenty first century. However, the wider question that lies in the relation between technology and translation is, What does technology do to language? This is an important question in the current paradigm because new translation technologies, such as, translation memories, data-based machine translation, and collaborative translation, far from being just additional tools, are changing the very nature of the translators’ cognitive activity, social relations, and professional standing. In fact, in some translation situations such as when translating technical materials or subject matter that are not a specialization with human translators, one potentially needs technology.  The purview of this paper, however, is limited to the role of MT in day to day situations where the generic MT tools like Google Translate or Bing Translator are encouraged. Further, it endeavours to weigh and empirically demonstrate the pros and cons of MT with a view to recommending measures for better communication training in the EFL set up of Saudi Arabia. Keywords: AI, MT, translation, technology, necessity, communication

  16. Advancement in Productivity of Arabic into English Machine Translation Systems from 2008 to 2013

    Science.gov (United States)

    Abu-Al-Sha'r, Awatif M.; AbuSeileek, Ali F.

    2013-01-01

    This paper attempts to compare between the advancements in the productivity of Arabic into English Machine Translation Systems between two years, 2008 and 2013. It also aims to evaluate the progress achieved by various systems of Arabic into English electronic translation between the two years. For tracing such advancement, a comparative analysis…

  17. The Integration of Project-Based Methodology into Teaching in Machine Translation

    Science.gov (United States)

    Madkour, Magda

    2016-01-01

    This quantitative-qualitative analytical research aimed at investigating the effect of integrating project-based teaching methodology into teaching machine translation on students' performance. Data was collected from the graduate students in the College of Languages and Translation, at Imam Muhammad Ibn Saud Islamic University, Riyadh, Saudi…

  18. Recycling Texts: Human evaluation of example-based machine translation subtitles for DVD

    DEFF Research Database (Denmark)

    Flanagan, Marian

    2009-01-01

    This project focuses on translation reusability in audiovisual contexts. Specifically, the project seeks to establish (1) whether target language subtitles produced by an Example-Based Machine Translation (EBMT) system are considered intelligible and acceptable by viewers of movies on DVD, and (2...

  19. A Linguistic Evaluation of Rule-Based, Phrase-Based, and Neural MT Engines

    Directory of Open Access Journals (Sweden)

    Burchardt Aljoscha

    2017-06-01

    Full Text Available In this paper, we report an analysis of the strengths and weaknesses of several Machine Translation (MT engines implementing the three most widely used paradigms. The analysis is based on a manually built test suite that comprises a large range of linguistic phenomena. Two main observations are on the one hand the striking improvement of an commercial online system when turning from a phrase-based to a neural engine and on the other hand that the successful translations of neural MT systems sometimes bear resemblance with the translations of a rule-based MT system.

  20. Machine-Aided Translation: From Terminology Banks to Interactive Translation Systems.

    Science.gov (United States)

    Greenfield, Concetta C.; Serain, Daniel

    The rapid growth of the need for technical translations in recent years has led specialists to utilize computer technology to improve the efficiency and quality of translation. The two approaches considered were automatic translation and terminology banks. Since the results of fully automatic translation were considered unsatisfactory by various…

  1. Statistical Sign Language Machine Translation: from English written text to American Sign Language Gloss

    CERN Document Server

    Othman, Achraf

    2011-01-01

    This works aims to design a statistical machine translation from English text to American Sign Language (ASL). The system is based on Moses tool with some modifications and the results are synthesized through a 3D avatar for interpretation. First, we translate the input text to gloss, a written form of ASL. Second, we pass the output to the WebSign Plug-in to play the sign. Contributions of this work are the use of a new couple of language English/ASL and an improvement of statistical machine translation based on string matching thanks to Jaro-distance.

  2. English-Latvian SMT: the challenge of translating into a free word order language

    NARCIS (Netherlands)

    Khalilov, M.; Fonollosa, J.A.R.; Skadiņa, I.; Brālītis, E.; Pretkalniņa, L.

    2010-01-01

    This paper presents a comparative study of two approaches to statistical machine translation (SMT) and their application to a task of English-to-Latvian translation, which is still an open research line in the field of automatic translation. We consider a state-of-the-art phrase-based SMT and an alt

  3. 互联网机器翻译%Web-based Machine Translation

    Institute of Scientific and Technical Information of China (English)

    王海峰; 吴华; 刘占一

    2011-01-01

    该文在回顾机器翻译发展的基础上,总结了主要的机器翻译方法,并主要阐述互联网机器翻译的特点及面临的挑战.面向互联网机器翻译的应用需求,并针对互联网资源具有海量、高噪声、时效性、稀疏的特点,提出了多策略混合翻译方法、资源挖掘和过滤以及分布式处理技术、领域自适应技术,针对数据稀疏论述枢轴语言技术和新语种快速部署技术;然后结合翻译与搜索技术,阐述翻译个性化特点和方案.最后,论述机器翻译技术和产品的应用.%This paper digs into the characteristics and challenges of web-based machine translation, and proposes possible solutions. First of all, we look back on the history of machine translation and summarize its methods. Next, we analyze the characteristics of internet bilingual corpora and monolingual corpora as: large scale, with lots of noise, real-time and sometimes sparse. Based on the features described above, we propose the hybrid machine translation method, corpus mining and filtering methods, and distributed computing methods. Furthermore, the pivot language approach is adopted to tackle the data sparseness problem, thus enabling the quick development of multilingual machine translation systems. We then discuss the approach to support the personalization of machine translation via the combination of translation technology and search technology. Finally the applications and products of machine translation technology are presented.

  4. Predicting post-translational lysine acetylation using support vector machines

    DEFF Research Database (Denmark)

    Gnad, Florian; Ren, Shubin; Choudhary, Chunaram

    2010-01-01

    spectrometry to identify 3600 lysine acetylation sites on 1750 human proteins covering most of the previously annotated sites and providing the most comprehensive acetylome so far. This dataset should provide an excellent source to train support vector machines (SVMs) allowing the high accuracy in silico...

  5. Evaluation of Machine Translation Errors in English and Iraqi Arabic

    Science.gov (United States)

    2010-05-01

    are made. Llitjós, Carbonell & Lavie (2005) created a hierarchical taxonomy of errors for use in refining rules of transfer-based MT systems...Translation. Proceedings of MT Summit XII. Llitjós, A., Carbonell , J., and Lavie, A. (2005). A framework for interactive and automatic refinement of

  6. A survey of statistical machine translation using paraphrasing technology%引入复述技术的统计机器翻译研究综述

    Institute of Scientific and Technical Information of China (English)

    2013-01-01

      基于对引入复述技术的统计机器翻译研究现状的分析,提出具有研究价值的课题方向。首先归纳了复述的概念,总结了引入复述技术的统计机器翻译各类方法。然后对复述知识在统计机器翻译中的模型训练、参数调整、待译语句改写和机器翻译自动评测等方面应用的主流方法进行了概括、比较和分析,说明了复述与统计机器翻译是紧密相关的,强调了复述在统计机器翻译应用中的关键问题是复述的正确性和多样性。最后指出提高复述资源的精确度、建立复述与机器翻译的联合模型、采用新方法解决稀疏问题等是有待进一步研究的课题。%In this paper, the research team discussed possible new prospective research directions of paraphrasing technology in statistical machine translation (SMT), based on reviews of state-of-the-art technology.First the re-search team introduced the concept of paraphrases , and next a summarization of the latest progress utilizing para-phrasing technology in SMT was conducted.Finally, conclusions were drawn, data was compared and an analysis of the main issues of incorporating paraphrases into SMT , including translation model training, parameter tuning, in-put sentences rewriting and machine translation evaluation was performed .The results proved that there is an inher-ent connection between paraphrasing and SMT .The results also point out that the correctness and diversity of para-phrasing are the key issues to apply paraphrasing to SMT .It was highly noted that the improvement in the quality of paraphrasing resource, the establishment of a joint model of paraphrasing and machine translation and the new pro -posed approach to solve data sparseness are problems which need further study .

  7. On the Chinese Set Phrases in the Translation of English Lyrics:From the Perspective of Cross-cultural Communication%跨文化交际视域下英文歌词汉译的成语运用

    Institute of Scientific and Technical Information of China (English)

    王艳艳

    2014-01-01

    As an integral part of Anglo-American culture, English songs make the best combination of music and language. Against the background of globalization, the translation of English lyrics for the purpose of cross-cultural communication turns out to be a hot research topic. This paper intends to discuss the effective application of Chinese set phrases and idioms in the translation of English lyrics via analyzing the stylistic features of the set phrases and idioms, and summarizing the translation criteria and strategies in the hope of facilitating both the practice and theory concerning English lyrics translation.%作为英美文化的重要组成部分,英文歌曲是音乐和语言的最佳结合体。在全球化大语境下,以跨文化交流为导向的英语歌词翻译是一个值得研究的话题。本文立足于该语境,通过分析英文歌曲翻译的标准和方法,结合汉语成语的文体特征,旨在讨论汉语成语和词组在英文歌曲翻译中的实际效用,以便助益跨文化交际目的下英文歌词汉译的实践活动和理论研究。

  8. Self-Organizing Machine Translation Example-Driven Induction of Transfer Functions

    CERN Document Server

    Juola, P

    1994-01-01

    With the advent of faster computers, the notion of doing machine translation from a huge stored database of translation examples is no longer unreasonable. This paper describes an attempt to merge the Example-Based Machine Translation (EBMT) approach with psycholinguistic principles. A new formalism for context- free grammars, called *marker-normal form*, is demonstrated and used to describe language data in a way compatible with psycholinguistic theories. By embedding this formalism in a standard multivariate optimization framework, a system can be built that infers correct transfer functions for a set of bilingual sentence pairs and then uses those functions to translate novel sentences. The validity of this line of reasoning has been tested in the development of a system called METLA-1. This system has been used to infer English->French and English->Urdu transfer functions from small corpora. The results of those experiments are examined, both in engineering terms as well as in more linguistic terms. In ge...

  9. A Lexical Conceptual Approach to Generation for Machine Translation

    Science.gov (United States)

    1988-01-01

    etc.). Once a verb has been selected to translate the predicate, the semantic arguments of the deep structure are filled with the instantiated...8217strid iii ts oft it, argumtents. Two examples are t he E nglish words shish and stil ilr: (4) (i) He slashed the woman * ’Dio cuchilladas a la mujer ...are simplified once rules and general inferencing are eliminated: LCS and $-role mappings obviate the need for complicated network searches and rule

  10. Climbing the Tower of Babel: Perfecting Machine Translation

    Science.gov (United States)

    2011-02-16

    Peter Norvig , and Fernando Pereira, "The Unreasonable Effectiveness of Data," IEEE Intelligence Systems, (2009): 8. So, this [trillion-word] corpus...Alon Halevy, Peter Norvig , and Fernando Pereira, Google assisted translation? Some experts believe technology can develop MT capabilities with...recognition.htm (accessed 28 September 2010). Halevy, Alon, Peter Norvig , and Fernando Pereira. "The Unreasonable Effectiveness of Data." IEEE

  11. Hindi to English Transfer Based Machine Translation System

    Directory of Open Access Journals (Sweden)

    Shashi Pal Singh

    2015-06-01

    Full Text Available In large societies like India there is a huge demand to convert one human language into another. Lots of work has been done in this area. Many transfer based MTS have developed for English to other languages, as MANTRA CDAC Pune, MATRA CDAC Pune, SHAKTI IISc Bangalore and IIIT Hyderabad. Still there is a little work done for Hindi to other languages. Currently we are working on it. In this paper we focus on designing a system, that translate the document from Hindi to English by using transfer based approach. This system takes an input text check its structure through parsing. Reordering rules are used to generate the text in target language. It is better than Corpus Based MTS because Corpus Based MTS require large amount of word aligned data for translation that is not available for many languages while Transfer Based MTS requires only knowledge of both the languages (source language and target language to make transfer rules. We get correct translation for simple assertive sentences and almost correct for complex and compound sentences.

  12. English-Lithuanian-English Machine Translation lexicon and engine: current state and future work

    CERN Document Server

    Barisevičius, G

    2011-01-01

    This article overviews the current state of the English-Lithuanian-English machine translation system. The first part of the article describes the problems that system poses today and what actions will be taken to solve them in the future. The second part of the article tackles the main issue of the translation process. Article briefly overviews the word sense disambiguation for MT technique using Google.

  13. An open-source highly scalable web service architecture for the Apertium machine translation engine

    OpenAIRE

    Sánchez-Cartagena, Víctor M.; Pérez-Ortiz, Juan Antonio

    2009-01-01

    Some machine translation services like Google Ajax Language API have become very popular as they make the collaboratively created contents of the web 2.0 available to speakers of many languages. One of the keys of its success is its clear and easy-to-use application programming interface (API) and a scalable and reliable service. This paper describes a highly scalable implementation of an Apertium-based translation web service, that aims to make contents available to speakers of lesser resour...

  14. ANALYTICAL MODEL OF CALCULUS FOR INFLUENCE THE TRANSLATION GUIDE WEAR OVER THE MACHINING ACCURACY ON THE MACHINE TOOL

    Directory of Open Access Journals (Sweden)

    Ivona PETRE

    2010-10-01

    Full Text Available The wear of machine tools guides influences favorably to vibrations. As a result of guides wear, the initial trajectory of cutting tools motion will be modified, the generating dimensional accuracy discrepancies and deviations of geometrical shape of the work pieces. As it has already been known, the wear of mobile and rigid guides is determined by many parameters (pressure, velocity, friction length, lubrication, material. The choice of one or another analytic model and/or the experimental model of the wear is depending by the working conditions, assuming that the coupling material is known.The present work’s goal is to establish an analytic model of calculus showing the influence of the translation guides wear over the machining accuracy on machine-tools.

  15. Web-Based Machine Translation as a Tool for Promoting Electronic Literacy and Language Awareness

    Science.gov (United States)

    Williams, Lawrence

    2006-01-01

    This article addresses a pervasive problem of concern to teachers of many foreign languages: the use of Web-Based Machine Translation (WBMT) by students who do not understand the complexities of this relatively new tool. Although networked technologies have greatly increased access to many language and communication tools, WBMT is still…

  16. Four Generations of Machine Translation Research and Prospects for the Future.

    Science.gov (United States)

    Wilks, Yorick

    This paper begins with a description of four generations of research in machine translation: the original efforts of 1957 to 1965 and three types of surviving and sometimes competing present projects. The three types of present projects include those relying on "brute force" methods involving larger and faster computers; those based on a…

  17. Improving the quality of automated DVD subtitles via example-based machine translation

    DEFF Research Database (Denmark)

    Armstrong, Stephen; Caffrey, Colm; Flanagan, Marian

    Denoual (2005) discovered that, contrary to popular belief, an Example-Based Machine Translation system trained on heterogeneous data produced significantly better results than a system trained on homogeneous data. Using similar evaluation metrics and a few additional ones, in this paper we show ...

  18. Development of Chinese-English Machine Translation System. Fnal Technical Report.

    Science.gov (United States)

    Wang, William S-Y; Chan, Stephen W.

    The report documents progress and results of a 2-1/3 year effort to further the prototype Chinese-English Machine Translation System. Additional rules were incorporated into the existing grammar for Chinese analysis and interlingual transfer, with emphasis on the latter. CHIDIC was updated and revised. Approximately 16,000 new entries were added…

  19. Improvement of Machine Translation Evaluation by Simple Linguistically Motivated Features

    Institute of Scientific and Technical Information of China (English)

    Mu-Yun Yang; Shu-Qi Sun; Jun-Guo Zhu; Sheng Li; Tie-Jun Zhao; Xiao-Ning Zhu

    2011-01-01

    Adopting the regression SVM framework, this paper proposes a linguistically motivated feature engineering strategy to develop an MT evaluation metric with a better correlation with human assessments. In contrast to current practices of "greedy" combination of all available features, six features are suggested according to the human intuition for translation quality. Then the contribution of linguistic features is examined and analyzed via a hill-climbing strategy. Experiments indicate that, compared to either the SVM-ranking model or the previous attempts on exhaustive linguistic features, the regression SVM model with six linguistic information based features generalizes across different datasets better, and augmenting these linguistic features with proper non-linguistic metrics can achieve additional improvements.

  20. Finding Translation Examples for Under-Resourced Language Pairs or for Narrow Domains; the Case for Machine Translation

    Directory of Open Access Journals (Sweden)

    Dan Tufis

    2012-07-01

    Full Text Available The cyberspace is populated with valuable information sources, expressed in about 1500 different languages and dialects. Yet, for the vast majority of WEB surfers this wealth of information is practically inaccessible or meaningless. Recent advancements in cross-lingual information retrieval, multilingual summarization, cross-lingual question answering and machine translation promise to narrow the linguistic gaps and lower the communication barriers between humans and/or software agents. Most of these language technologies are based on statistical machine learning techniques which require large volumes of cross lingual data. The most adequate type of cross-lingual data is represented by parallel corpora, collection of reciprocal translations. However, it is not easy to find enough parallel data for any language pair might be of interest. When required parallel data refers to specialized (narrow domains, the scarcity of data becomes even more acute. Intelligent information extraction techniques from comparable corpora provide one of the possible answers to this lack of translation data.

  1. An overview of the EtsaTrans machine translation system: compilation of an administrative domain

    Directory of Open Access Journals (Sweden)

    L. Ehlers

    2008-07-01

    Full Text Available The EtsaTrans machine translation system has been in development at the University of the Free State for the last four years and is currently the only machine translation system being developed in South Africa for specialised and nongeneral translation needs. The purpose of this exposition is to present the program through its phases of development, and to report on current levels of performance. We analyse the output, the size of the database, and then propose the future implementation of a part of speech tagger and word stemmer into the program to improve its linguistic performance. Our goal with the system is not to translate all types of document, but to work in a specialised domain that will allow the system to translate documents that are repetitive in nature. This will enable translators to spend more time on non-repetitive subject matter. By capturing the nature of the language of such repetitive documents in the database, we are able to create a standardised language usage for the specialised domain.

  2. On the application of AntConc in pre-translation of machine translation

    Institute of Scientific and Technical Information of China (English)

    王培; 赵茫茫

    2016-01-01

    This article introduces the application of a green software of corpus tool named AntConc. The article mainly focuses on its glossary function, analysis of frequency and concordance of lexical chunks, which makes the translation work more formal and standard.

  3. NPtool, a detector of English noun phrases

    CERN Document Server

    Voutilainen, A

    1995-01-01

    NPtool is a fast and accurate system for extracting noun phrases from English texts for the purposes of e.g. information retrieval, translation unit discovery, and corpus studies. After a general introduction, the system architecture is presented in outline. Then follows an examination of a recently written Constraint Syntax. An evaluation report concludes the paper.

  4. Handbook of natural language processing and machine translation DARPA global autonomous language exploitation

    CERN Document Server

    Olive, Joseph P; McCary, John

    2011-01-01

    This comprehensive handbook, written by leading experts in the field, details the groundbreaking research conducted under the breakthrough GALE program - The Global Autonomous Language Exploitation within the Defense Advanced Research Projects Agency (DARPA), while placing it in the context of previous research in the fields of natural language and signal processing, artificial intelligence and machine translation. The most fundamental contrast between GALE and its predecessor programs was its holistic integration of previously separate or sequential processes. In earlier language research pro

  5. Myanmar-English Bidirectional Machine Translation System with Numerical Particles Identification

    Directory of Open Access Journals (Sweden)

    Yin Yin Win

    2016-06-01

    Full Text Available This paper the development of MyanmarEnglish bidirectional machine translation system is implemented applying Rule based machine translation approach. Stanford and ML2KR parsers are used for preprocessing step. From this step, parsers generate corresponding parse tree structures. Used parsers generate corresponding CFG rules which are collected and created as synchronous context free grammar SCFG rules. Myanmar language can be written free order style, but it must be verb final structure. Therefore, CFG rules are required for reordering the structure of the two languages. After that tree to tree transformation is carried on the source tree structure which corresponds with used parser (Stanford parser or ML2KR‟s parser. When source parse tree is transformed as target parse tree, it is changed according to the SCFG rules. And then system carries out the morphological synthesis. In this stage, we need to solve only for English to Myanmar machine translation because Myanmar language is morphologically rich language. Therefore, particles for Myanmar language can be solved in this system by proposed algorithm. After finishing morphological synthesis, this system generates meaningful and appropriate smoothing sentences

  6. Toward an example-based machine translation from written text to ASL using virtual agent animation

    CERN Document Server

    Boulares, Mehrez

    2012-01-01

    Modern computational linguistic software cannot produce important aspects of sign language translation. Using some researches we deduce that the majority of automatic sign language translation systems ignore many aspects when they generate animation; therefore the interpretation lost the truth information meaning. Our goals are: to translate written text from any language to ASL animation; to model maximum raw information using machine learning and computational techniques; and to produce a more adapted and expressive form to natural looking and understandable ASL animations. Our methods include linguistic annotation of initial text and semantic orientation to generate the facial expression. We use the genetic algorithms coupled to learning/recognized systems to produce the most natural form. To detect emotion we are based on fuzzy logic to produce the degree of interpolation between facial expressions. Roughly, we present a new expressive language Text Adapted Sign Modeling Language TASML that describes all ...

  7. 1001 easy French phrases

    CERN Document Server

    McCoy, Heather

    2010-01-01

    The perfect companion for tourists and business travelers in France and other places where the French language is spoken, this book offers fast, effective communication. More than 1,000 basic words, phrases, and sentences cover everything from asking directions and renting a car to ordering dinner and finding a bank.Designed as a quick reference tool and an easy study guide, this inexpensive and easy-to-use book offers completely up-to-date terms for modern telecommunications, idioms, and slang. The contents are arranged for quick access to phrases related to greetings, transportation, shoppin

  8. A Novel Approach for English to South Dravidian Language Statistical Machine Translation System

    Directory of Open Access Journals (Sweden)

    Unnikrishnan P

    2010-11-01

    Full Text Available Development of a well fledged bilingual machine translation (MT system for any two natural languages with limited electronic resources and tools is a challenging and demanding task. This paper presents the development of a statistical machine translation (SMT system for English to South Dravidian languages like Malayalam and Kannada by incorporating syntactic and morphological information. SMT is a data oriented statistical framework for translating text from one natural language to another based on the knowledge extracted from bilingual corpus. Even though there are efforts towards building such an English to South Dravidian translation system ,unfortunately we do not have an efficient translation system till now. The first and most important step in SMT is creating a well aligned parallel corpus for training the system. Experimental research shows that the existing methodology for bilingual parallel corpus creation is not efficient for English to South Dravidian language in the SMT system. In order toincrease the performance of the translation system, we have introduced a new approach in creating parallel corpus. The mainideas which we have implemented and proven very effective forEnglish to south Dravidian languages SMT system are: (i reordering the English source sentence according to Dravidian syntax, (ii using the root suffix separation on both English and Dravidian words and iii use of morphological information which substantially reduce the corpus size required for training the system. Since the navailability of full fledged parsing and morphological tools for Malayalam and Kannada languages, sentence synthesis was done both manually and existing morph analyzer created by Amrita university. From the experiment we found that the performance of our systems are significantly well and achieves a very competitive accuracy for small sized bilingual corpora. The proposed ideas can be directly used for other south Dravidian languages like Tamil

  9. Unsupervised Key-phrase Extraction using Noun Phrases

    National Research Council Canada - National Science Library

    Shailendra Singh Kathait; Shubhrita Tiwari; Anubha Varshney; Ajit Sharma

    2017-01-01

    .... The automated key phrase generation of resources helps in the information retrieval. To generate the key phrases for texts from all possible domains, the need is an automated approach that would extract the key ideas directly from the text itself...

  10. Telugu Bigram Splitting using Consonant-based and Phrase-based Splitting

    Directory of Open Access Journals (Sweden)

    T. Kameswara Rao

    2014-06-01

    Full Text Available Splitting is a conventional process in most of Indian languages according to their grammar rules. It is called ‘pada vicchEdanam’ (a Sanskrit term for word splitting and is widely used by most of the Indian languages. Splitting plays a key role in Machine Translation (MT particularly when the source language (SL is an Indian language. Though this splitting may not succeed completely in extracting the root words of which the compound is formed, but it shows considerable impact in Natural Language Processing (NLP as an important phase. Though there are many types of splitting, this paper considers only consonant based and phrase based splitting.

  11. Easy French phrase book over 700 phrases for everyday use

    CERN Document Server

    McCoy, Heather

    2012-01-01

    Over 700 French phrases and sentences include terms for modern telecommunications, idioms, and slang. Designed as a quick reference and study guide, this up-to-date volume is organized for quick access to phrases related to greetings, transportation, shopping, emergencies, and other common circumstances. A phonetic pronunciation accompanies each phrase.

  12. Our Policies, Their Text: German Language Students' Strategies with and Beliefs about Web-Based Machine Translation

    Science.gov (United States)

    White, Kelsey D.; Heidrich, Emily

    2013-01-01

    Most educators are aware that some students utilize web-based machine translators for foreign language assignments, however, little research has been done to determine how and why students utilize these programs, or what the implications are for language learning and teaching. In this mixed-methods study we utilized surveys, a translation task,…

  13. Discriminative feature-rich models for syntax-based machine translation.

    Energy Technology Data Exchange (ETDEWEB)

    Dixon, Kevin R.

    2012-12-01

    This report describes the campus executive LDRD %E2%80%9CDiscriminative Feature-Rich Models for Syntax-Based Machine Translation,%E2%80%9D which was an effort to foster a better relationship between Sandia and Carnegie Mellon University (CMU). The primary purpose of the LDRD was to fund the research of a promising graduate student at CMU; in this case, Kevin Gimpel was selected from the pool of candidates. This report gives a brief overview of Kevin Gimpel's research.

  14. Improving a Japanese-Spanish Machine Translation System Using Wikipedia Medical Articles

    Directory of Open Access Journals (Sweden)

    Jessica C. Ramirez

    2015-02-01

    Full Text Available The quality, length and coverage of a parallel corp us are fundamental features in the performance of a Statistical Machine Translation Sy stem (SMT. For some pair of languages there is a considerable lack of resources suitable for Natural Language Processing tasks. This paper introduces a technique for extracting medical information from the Wikipedia page. Using a medical ontological dictionary and then we evaluate on a Japanese-Spanish SMT system. The study shows an increment in the BLEU sc ore.

  15. 15000 useful phrases

    CERN Document Server

    Kleiser, Grenville

    1900-01-01

    This incredibly useful book will appeal to writers, public speakers and anyone else who would like to improve their vocabulary. It contains a huge number of phrases which you can use in everyday speech and at special occasions, which will help you express your thoughts, ideas and feelings in a brand new way. A classic reference work, this edition has been specially formatted for today's e-readers.

  16. Preliminary study of online machine translation use of nursing literature: quality evaluation and perceived usability

    Directory of Open Access Journals (Sweden)

    Anazawa Ryoko

    2012-11-01

    Full Text Available Abstract Background Japanese nurses are increasingly required to read published international research in clinical, educational, and research settings. Language barriers are a significant obstacle, and online machine translation (MT is a tool that can be used to address this issue. We examined the quality of Google Translate® (English to Japanese and Korean to Japanese, which is a representative online MT, using a previously verified evaluation method. We also examined the perceived usability and current use of online MT among Japanese nurses. Findings Randomly selected nursing abstracts were translated and then evaluated for intelligibility and usability by 28 participants, including assistants and research associates from nursing universities throughout Japan. They answered a questionnaire about their online MT use. From simple comparison of mean scores between two language pairs, translation quality was significantly better, with respect to both intelligibility and usability, for Korean-Japanese than for English-Japanese. Most respondents perceived a language barrier. Online MT had been used by 61% of the respondents and was perceived as not useful enough. Conclusion Nursing articles translated from Korean into Japanese by an online MT system could be read at an acceptable level of comprehension, but the same could not be said for English-Japanese translations. Respondents with experience using online MT used it largely to grasp the overall meanings of the original text. Enrichment in technical terms appeared to be the key to better usability. Users will be better able to use MT outputs if they improve their foreign language proficiency as much as possible. Further research is being conducted with a larger sample size and detailed analysis.

  17. A Conjoint Analysis Framework for Evaluating User Preferences in Machine Translation.

    Science.gov (United States)

    Kirchhoff, Katrin; Capurro, Daniel; Turner, Anne M

    2014-03-01

    Despite much research on machine translation (MT) evaluation, there is surprisingly little work that directly measures users' intuitive or emotional preferences regarding different types of MT errors. However, the elicitation and modeling of user preferences is an important prerequisite for research on user adaptation and customization of MT engines. In this paper we explore the use of conjoint analysis as a formal quantitative framework to assess users' relative preferences for different types of translation errors. We apply our approach to the analysis of MT output from translating public health documents from English into Spanish. Our results indicate that word order errors are clearly the most dispreferred error type, followed by word sense, morphological, and function word errors. The conjoint analysis-based model is able to predict user preferences more accurately than a baseline model that chooses the translation with the fewest errors overall. Additionally we analyze the effect of using a crowd-sourced respondent population versus a sample of domain experts and observe that main preference effects are remarkably stable across the two samples.

  18. Extracting Date/Time Expressions in Super-Function Based Japanese-English Machine Translation

    Science.gov (United States)

    Sasayama, Manabu; Kuroiwa, Shingo; Ren, Fuji

    Super-Function Based Machine Translation(SFBMT) which is a type of Example-Based Machine Translation has a feature which makes it possible to expand the coverage of examples by changing nouns into variables, however, there were problems extracting entire date/time expressions containing parts-of-speech other than nouns, because only nouns/numbers were changed into variables. We describe a method for extracting date/time expressions for SFBMT. SFBMT uses noun determination rules to extract nouns and a bilingual dictionary to obtain correspondence of the extracted nouns between the source and the target languages. In this method, we add a rule to extract date/time expressions and then extract date/time expressions from a Japanese-English bilingual corpus. The evaluation results shows that the precision of this method for Japanese sentences is 96.7%, with a recall of 98.2% and the precision for English sentences is 94.7%, with a recall of 92.7%.

  19. Clinical utility of machine learning approaches in schizophrenia: Improving diagnostic confidence for translational neuroimaging

    Directory of Open Access Journals (Sweden)

    Sarina eIwabuchi

    2013-08-01

    Full Text Available Machine-learning approaches are becoming commonplace in the neuroimaging literature as potential diagnostic and prognostic tools for the study of clinical populations. However, very few studies provide clinically informative measures to aid in decision-making and resource allocation. Head-to-head comparison of neuroimaging-based multivariate classifiers is an essential first step to promote translation of these tools to clinical practice. We systematically evaluated the classifier performance using back-to-back structural MRI in two field strengths (3-Tesla and 7-Tesla to discriminate patients with schizophrenia (n=19 from healthy controls (n=20. Grey (GM and white matter (WM images were used as inputs into a support vector machine (SVM to classify patients and control subjects. 7T classifiers outperformed the 3T classifiers with accuracy reaching as high as 77% for the 7T GM classifier compared to 66.6% for the 3T GM classifier. Furthermore, diagnostic odds ratio (a measure that is not affected by variations in sample characteristics and number needed to predict (a measure based on Bayesian certainty of a test result indicated superior performance of the 7T classifiers, whereby for each correct diagnosis made, the number of patients that need to be examined using the 7T GM classifier was one less than the number that need to be examined if a different classifier was used. Using a hypothetical example, we highlight how these findings could have significant implications for clinical decision-making. We encourage the reporting of measures proposed here in future studies utilizing machine-learning approaches. This will not only promote the search for an optimum diagnostic tool but also aid in the translation of neuroimaging to clinical use.

  20. The Noun Phrase

    DEFF Research Database (Denmark)

    Rijkhoff, Jan

    it shows that the noun phrase word order patterns of any human language can be derived from three universal ordering principles and, furthermore, that these are all elaborations of one general iconic ordering strategy according to which elements that belong together semantically tend to occur together...... the occurrence of adjectives as a major word class in the part-of-speech system of a language. The book aims for accessibility and assumes no knowledge of a particular formal or functional theory of grammar. It addresses professional linguists and students of linguistics of all theoretical persuasions...

  1. The Noun Phrase

    DEFF Research Database (Denmark)

    Rijkhoff, Jan

    it shows that the noun phrase word order patterns of any human language can be derived from three universal ordering principles and, furthermore, that these are all elaborations of one general iconic ordering strategy according to which elements that belong together semantically tend to occur together...... the occurrence of adjectives as a major word class in the part-of-speech system of a language. The book aims for accessibility and assumes no knowledge of a particular formal or functional theory of grammar. It addresses professional linguists and students of linguistics of all theoretical persuasions...

  2. Offering a New Approach for Approximate Pattern Matching in Example-Based Machine Translation

    Directory of Open Access Journals (Sweden)

    Reza Akbari

    2015-01-01

    Full Text Available In this article, a new model is proposed in order to measure the degree of similarity between two sentences in machine translation based on example. The proposed model has applied genetic algorithm beside a new fitness function which is based on semantic load matching between the two sentences. Here, verbs are considered as the heart of a sentence because they are the main part of a sentence and carry the major part of the semantic load in the sentence; therefore more attention is paid to the verbs in the fitness function. It is noteworthy that the proposed model is largely dependent on the verbal part and the extracted synonyms from WordNet as well as the arrangement of words. The results are promising by precision and recall, indicating that the proposed method improves the quality of the retrieved matched sentences.

  3. Comparison of Parallel Kinematic Machines with Three Translational Degrees of Freedom and Linear Actuation

    Institute of Scientific and Technical Information of China (English)

    PRAUSE Isabel; CHARAF EDDINE Sami; CORVES Burkhard

    2015-01-01

    The development of new robot structures, in particular of parallel kinematic machines(PKM), is widely systematized by different structure synthesis methods. Recent research increasingly focuses on PKM with less than six degrees of freedom(DOF). However, an overall comparison and evaluation of these structures is missing. In order to compare symmetrical PKM with three translational DOF, different evaluation criteria are used. Workspace, maximum actuation forces and velocities, power, actuator stiffness, accuracy and transmission behavior are taken into account to investigate strengths and weaknesses of the PKMs. A selection scheme based on possible configurations of translational PKM including different frame configurations is presented. Moreover, an optimization method based on a genetic algorithm is described to determine the geometric parameters of the selected PKM for an exemplary load case and a prescribed workspace. The values of the mentioned criteria are determined for all considered PKM with respect to certain boundary conditions. The distribution and spreading of these values within the prescribed workspace is presented by using box plots for each criterion. Thereby, the performance characteristics of the different structures can be compared directly. The results show that there is no“best”PKM. Further inquiries such as dynamic or stiffness analysis are necessary to extend the comparison and to finally select a PKM.

  4. OxLM: A Neural Language Modelling Framework for Machine Translation

    Directory of Open Access Journals (Sweden)

    Paul Baltescu

    2014-09-01

    Full Text Available This paper presents an open source implementation1 of a neural language model for machine translation. Neural language models deal with the problem of data sparsity by learning distributed representations for words in a continuous vector space. The language modelling probabilities are estimated by projecting a word's context in the same space as the word representations and by assigning probabilities proportional to the distance between the words and the context's projection. Neural language models are notoriously slow to train and test. Our framework is designed with scalability in mind and provides two optional techniques for reducing the computational cost: the so-called class decomposition trick and a training algorithm based on noise contrastive estimation. Our models may be extended to incorporate direct n-gram features to learn weights for every n-gram in the training data. Our framework comes with wrappers for the cdec and Moses translation toolkits, allowing our language models to be incorporated as normalized features in their decoders (inside the beam search.

  5. Toward an example-based machine translation from written text to ASL using virtual agent animation

    Directory of Open Access Journals (Sweden)

    Mehrez Boulares

    2012-01-01

    Full Text Available Modern computational linguistic software cannot produce important aspects of sign language translation. Using some researches we deduce that the majority of automatic sign language translation systems ignore many aspects when they generate animation; therefore the interpretation lost the truth information meaning. This problem is due to sign language consideration as a derivative language, but it is a complete language with its own unique grammar. This grammar is related to semantic-cognitive models of spatially, time, action and facial expression to represent complex information to make sign interpretation more efficiently, smooth, expressive and natural-looking human gestures. All this aspects give us useful insights into the design principles that have evolved in natural communication between people. In this work we are interested in American Sign Language, because it is the simplest and most standardized sign language. Our goals are: to translate written text from any language to ASL animation; to model maximum raw information using machine learning and computational techniques; and to produce a more adapted and expressive form to natural looking and understandable ASL animations. Our methods include linguistic annotation of initial text and semantic orientation to generate the facial expression. We use genetic algorithms coupled to learning/recognized systems to produce the most natural form. To detect emotion we based on fuzzy logic to produce the degree of interpolation between facial expressions. Roughly, we present a new expressive language Text Adapted Sign Modeling Language TASML that describes all maximum aspects related to a good sign language interpretation. This paper is organized as follow: the next section is devoted to present the comprehension effect of using Space/Time/SVO form in ASL animation based on experimentation. In section 3, we describe our technical considerations. We present the general approach we adopted to

  6. On search guide phrase compilation for recommending home medical products.

    Science.gov (United States)

    Luo, Gang

    2010-01-01

    To help people find desired home medical products (HMPs), we developed an intelligent personal health record (iPHR) system that can automatically recommend HMPs based on users' health issues. Using nursing knowledge, we pre-compile a set of "search guide" phrases that provides semantic translation from words describing health issues to their underlying medical meanings. Then iPHR automatically generates queries from those phrases and uses them and a search engine to retrieve HMPs. To avoid missing relevant HMPs during retrieval, the compiled search guide phrases need to be comprehensive. Such compilation is a challenging task because nursing knowledge updates frequently and contains numerous details scattered in many sources. This paper presents a semi-automatic tool facilitating such compilation. Our idea is to formulate the phrase compilation task as a multi-label classification problem. For each newly obtained search guide phrase, we first use nursing knowledge and information retrieval techniques to identify a small set of potentially relevant classes with corresponding hints. Then a nurse makes the final decision on assigning this phrase to proper classes based on those hints. We demonstrate the effectiveness of our techniques by compiling search guide phrases from an occupational therapy textbook.

  7. Multilingual online machine translation research%多语言在线机器翻译研究

    Institute of Scientific and Technical Information of China (English)

    董兴华; 徐春; 王磊; 周喜

    2012-01-01

    This paper gives a description of implementing an online, high-powered multilingual translation engine, which consists of three language pairs, that is, Uyghur-Chinese, Kazakh-Chinese and Kyrgyz-Chinese, based on additional knowledge base and multithreading and task distributing technology. The translation engine is easy to be extended to other language pairs, which has an ability of translating words, phrases, sentences, files and webs in one language to those of another.%描述了通过使用外部知识库和基于短语的翻译模型,利用多线程、任务分发的技术实现了一个在线的、高性能的多语言翻译引擎,已初步实现了维汉、哈汉、柯汉三种语言间的翻译.翻译引擎很容易扩展到其他语言对,具有翻译词、短语、句子、文件和网页的功能.

  8. Prediction of post-translational modification sites using multiple kernel support vector machine

    Directory of Open Access Journals (Sweden)

    BingHua Wang

    2017-04-01

    Full Text Available Protein post-translational modification (PTM is an important mechanism that is involved in the regulation of protein function. Considering the high-cost and labor-intensive of experimental identification, many computational prediction methods are currently available for the prediction of PTM sites by using protein local sequence information in the context of conserved motif. Here we proposed a novel computational method by using the combination of multiple kernel support vector machines (SVM for predicting PTM sites including phosphorylation, O-linked glycosylation, acetylation, sulfation and nitration. To largely make use of local sequence information and site-modification relationships, we developed a local sequence kernel and Gaussian interaction profile kernel, respectively. Multiple kernels were further combined to train SVM for efficiently leveraging kernel information to boost predictive performance. We compared the proposed method with existing PTM prediction methods. The experimental results revealed that the proposed method performed comparable or better performance than the existing prediction methods, suggesting the feasibility of the developed kernels and the usefulness of the proposed method in PTM sites prediction.

  9. CloudLM: a Cloud-based Language Model for Machine Translation

    Directory of Open Access Journals (Sweden)

    Ferrández-Tordera Jorge

    2016-04-01

    Full Text Available Language models (LMs are an essential element in statistical approaches to natural language processing for tasks such as speech recognition and machine translation (MT. The advent of big data leads to the availability of massive amounts of data to build LMs, and in fact, for the most prominent languages, using current techniques and hardware, it is not feasible to train LMs with all the data available nowadays. At the same time, it has been shown that the more data is used for a LM the better the performance, e.g. for MT, without any indication yet of reaching a plateau. This paper presents CloudLM, an open-source cloud-based LM intended for MT, which allows to query distributed LMs. CloudLM relies on Apache Solr and provides the functionality of state-of-the-art language modelling (it builds upon KenLM, while allowing to query massive LMs (as the use of local memory is drastically reduced, at the expense of slower decoding speed.

  10. Noun Phrase in Bahasa Indonesia

    National Research Council Canada - National Science Library

    Rahmi Yulia Ningsih; Chandra Kurniawan Wiharja

    2017-01-01

    ...). The aim of this article was identifying elements of forming noun phrases as the participants which represented processes in clauses, functions, as well as the distribution of its position in the clause...

  11. Apertium goes SOA: an efficient and scalable service based on the Apertium rule-based machine translation platform

    OpenAIRE

    Minervini, Pasquale

    2009-01-01

    Service Oriented Architecture (SOA) is a paradigm for organising and using distributed services that may be under the control of different ownership domains and implemented using various technology stacks. In some contexts, an organisation using an IT infrastructure implementing the SOA paradigm can take a great benefit from the integration, in its business processes, of efficient machine translation (MT) services to overcome language barriers. This paper describes the architecture and the de...

  12. Devised Guidelines of Rapid Post-Editing in Machine Translation Out⁃put of Mark on Some Aspects

    Institute of Scientific and Technical Information of China (English)

    屈亚媛; 王庆怡

    2012-01-01

      One of uses of machine translation (MT), is helping readers to read for the gist of a foreign text through a draft translation produced by MT engines. Rapid post-editing, as Jeffrey Allen defines it as a“strictly minimal editing on texts in order to remove blatant and significant errors without considering stylistic issues”, can help present the reader with a roughly comprehensible translation as quickly as possible. The purpose of this article is on a proposed set of rapid post-editing guidelines for Biblical Chinese-English MT, with its application on editing the English MT version of Chapter one of Mark (马尔谷福音) of the Chinese Catholic Bible (天主教思高本圣经) as an example.

  13. Word Sense Disambiguation Using English-Spanish Aligned Phrases over Comparable Corpora

    CERN Document Server

    Fernandez-Amoros, David

    2009-01-01

    In this paper we describe a WSD experiment based on bilingual English-Spanish comparable corpora in which individual noun phrases have been identified and aligned with their respective counterparts in the other language. The evaluation of the experiment has been carried out against SemCor. We show that, with the alignment algorithm employed, potential precision is high (74.3%), however the coverage of the method is low (2.7%), due to alignments being far less frequent than we expected. Contrary to our intuition, precision does not rise consistently with the number of alignments. The coverage is low due to several factors; there are important domain differences, and English and Spanish are too close languages for this approach to be able to discriminate efficiently between senses, rendering it unsuitable for WSD, although the method may prove more productive in machine translation.

  14. Noun Phrase in Bahasa Indonesia

    Directory of Open Access Journals (Sweden)

    Rahmi Yulia Ningsih

    2017-01-01

    Full Text Available This article contained research about the analysis of the text of Bahasa Indonesia speech with the Systemic Functional Linguistic approach (LFS. The aim of this article was identifying elements of forming noun phrases as the participants which represented processes in clauses, functions, as well as the distribution of its position in the clause. The research method used was qualitative research with the method of data analysis in the form of 60 clauses through the approach of LFS. The 60 clauses data were taken from the text of the speech of UNJ Rector, which was then analyzed into 100 phrases in the noun. According to the LFS method, it finds that the noun phrase is formed by elements of the noun classes. From the 100 noun phrases analyzed, there are 100 nouns that have different functions in each process. There are 86 nouns represent the material process, 5 nouns represent the relational process, 4 nouns represent the verbal process, 3 nouns represent mental processes, and 2 nouns represent the existential process. The study also finds that in the form of the lexicon, the same noun with the different distribution of its position will result in different functions in the representation process.

  15. Conception d'une methodologie generale d'evaluation de la traduction automatique (Conception of a General Methodology for Evaluating Machine Translation).

    Science.gov (United States)

    van Slype, Georges

    1982-01-01

    It is proposed that assessment of human translation versus machine translation programs use methods and criteria that measure efficiency and cost effectiveness and are efficient and cost-effective in themselves. A variety of methods and criteria are evaluated and discussed. (MSE)

  16. 中日两国机器翻译研究进展及比较%Machine Translation Research in China and Japan: Advances and Comparison

    Institute of Scientific and Technical Information of China (English)

    张均胜; 何彦青; 李颖; 王惠临

    2011-01-01

    Machine translation investigates the use of computer software to translate text or speech from one natural language to another. Since the first computer was invented, people have been studying and exploring high quality and high efficiency of machine translation technology. Recently, rule-based machine translation, example-based machine translation and statistical translation are the main three translation patterns. There are some approaches ofsystem combination for better machine translation results. With the development of science, technology, economy and culture, machine translation has become more important in breaking the language barrier between Chinese and Japanese for promoting China-Japanese exchanges and cooperation. Machine translation researchers in China and Japan have carried out a large number of Chinese-Japanese/Japanese-Chinese machine translation of theoretical research and system development They have achieved a lot of effective results, however, it is still far from the practical translation application of large-scale and high quality. Therefore, it is necessary for researchers in China and Japan to cooperate in machine translation technology and system development for Ch inese-to-Japanese and Japanese-to-Chinese, especially in the parallel corpus, dictionary, terminology, syntactic analysis and so on. This paper presents an overview of the China-Japanese machine translation research and rampares machine translation research in China and Japan. We also discuss the prospects of China-Japanese cooperation in machine translation research.%机器翻译研究用计算机实现不同自然语言之间的翻译.自第一台计算机诞生开始,人们一直在研究和探索高质量高效率的机器翻译技术.近年来,基于规则的机器翻译、基于实例的机器翻译和基于统计的机器翻译这几种主要的翻译模式共同存在且相互补充,并有不断融合之势.随着中国和日本在科技、经济和文化交流的不

  17. Progress of Automatic Extraction of Uyghur Phrases%维吾尔语短语自动抽取研究进展

    Institute of Scientific and Technical Information of China (English)

    张海军

    2015-01-01

    Phrase extraction, which is the research basis of machine translation and information retrieval, plays a very important role in natural language processing. This paper puts the emphasis on the research progress of of Uyghur phrase extraction. To make convenience for discussion, this paper studies the linguistic features of Uyghur phrases and analyzes the impacts of these features on the phrase extraction. This paper mainly summarizes the philological theories of phrase identification in Uyghur and discusses the technologies of automatic extraction of Uyghur phrases. There has made great progress on the extraction of Uyghur phrases in both theory and technology. However, there are still lots of work to be carried out, such as to formulate tagging standard, study tagged corpus and expend research domains etc. It is hoped that this paper can give some references to the research on phrase extraction in Uyghur.%短语识别是机器翻译与信息检索的技术基础,具有重要的研究价值.围绕维吾尔语短语识别的研究进展,阐述了维吾尔语的语言特点,分析了这些特点对维吾尔语短语识别的影响,总结了近年来维吾尔语短语识别的有关语言学研究成果,重点梳理了维吾尔语短语自动抽取的相关研究方法.通过总结和梳理,发现目前维吾尔语短语自动抽取研究在理论和实现技术方面取得了较大进展,但在短语标注标准、研究语料及研究领域等方面还有大量工作尚未有效开展,需要予以关注.希望该文能为维吾尔语短语抽取相关研究提供借鉴和参考.

  18. Translation

    OpenAIRE

    2005-01-01

    "Translation" is a life narrative about the ways in which cultural histories shape personal stories, and the capacity of the imagination to develop alternative narratives about oneself and the world. It can also be read a way of addressing the effects of what Ato Quayson calls the global process of postcolonializing. Quaysons critical perspective might be used as an interpretive lens for seeing some of the ways in which  this autobiographical narrative complicates the jargon of race, cl...

  19. Translation

    OpenAIRE

    2005-01-01

    "Translation" is a life narrative about the ways in which cultural histories shape personal stories, and the capacity of the imagination to develop alternative narratives about oneself and the world. It can also be read a way of addressing the effects of what Ato Quayson calls the global process of postcolonializing. Quaysons critical perspective might be used as an interpretive lens for seeing some of the ways in which  this autobiographical narrative complicates the jargon of race, class, ...

  20. Improving Statistical Machine Translation Through N-best List Re-ranking and Optimization

    Science.gov (United States)

    2014-03-27

    Language Processing, 1352–1362. Association for Computational Linguistics, Edinburgh, Scotland , UK., July 2011. URL http://www.aclweb.org/anthology/D11-1125...Josef. “Minimum Error Rate Training in Statistical Machine Translation”. Erhard Hinrichs and Dan Roth (editors), Proceedings of the 41st Annual Meeting of

  1. Learning to Translate: A Statistical and Computational Analysis

    Directory of Open Access Journals (Sweden)

    Marco Turchi

    2012-01-01

    Full Text Available We present an extensive experimental study of Phrase-based Statistical Machine Translation, from the point of view of its learning capabilities. Very accurate Learning Curves are obtained, using high-performance computing, and extrapolations of the projected performance of the system under different conditions are provided. Our experiments confirm existing and mostly unpublished beliefs about the learning capabilities of statistical machine translation systems. We also provide insight into the way statistical machine translation learns from data, including the respective influence of translation and language models, the impact of phrase length on performance, and various unlearning and perturbation analyses. Our results support and illustrate the fact that performance improves by a constant amount for each doubling of the data, across different language pairs, and different systems. This fundamental limitation seems to be a direct consequence of Zipf law governing textual data. Although the rate of improvement may depend on both the data and the estimation method, it is unlikely that the general shape of the learning curve will change without major changes in the modeling and inference phases. Possible research directions that address this issue include the integration of linguistic rules or the development of active learning procedures.

  2. Reading Strategies in a L2: A Study on Machine Translation

    Science.gov (United States)

    Karnal, Adriana Riess; Pereira, Vera Vanmacher

    2015-01-01

    This article aims at understanding cognitive strategies which are involved in reading academic texts in English as a L2/FL. Specifically, we focus on reading comprehension when a text is read either using Google translator or not. From this perspective we must consider the reading process in its complexity not only as a decoding process. We follow…

  3. A Lexicalist Approach to the Translation of Colloquial Text

    CERN Document Server

    Popowich, F; Laurens, O; McFetridge, P; Nicholson, J D; McGivern, P; Peña, M C; Pidruchney, L; MacDonald, S; Popowich, Fred; Turcato, Davide; Laurens, Olivier; Fetridge, Paul Mc; Givern, Patrick Mc; Pena, Maricela Corzo; Pidruchney, Lisa; Donald, Scott Mac

    1997-01-01

    Colloquial English (CE) as found in television programs or typical conversations is different than text found in technical manuals, newspapers and books. Phrases tend to be shorter and less sophisticated. In this paper, we look at some of the theoretical and implementational issues involved in translating CE. We present a fully automatic large-scale multilingual natural language processing system for translation of CE input text, as found in the commercially transmitted closed-caption television signal, into simple target sentences. Our approach is based on the Whitelock's Shake and Bake machine translation paradigm, which relies heavily on lexical resources. The system currently translates from English to Spanish with the translation modules for Brazilian Portuguese under development.

  4. Exeter at CLEF 2003: Experiments with machine translation for monolingual, bilingual and multilingual retrieval

    OpenAIRE

    Lam-Adesina, Adenike M.; Jones, Gareth J.F.

    2004-01-01

    The University of Exeter group participated in the monolingual, bilingual and multilingual-4 retrieval tasks this year. The main focus of our investigation this year was the small multilingual task comprising four languages, French, German, Spanish and English. We adopted a document translation strategy and tested four merging techniques to combine results from the separate document collections, as well as a merged collection strategy. For both the monolingual and bilingual tasks we explored ...

  5. A nonparametric Bayesian method of translating machine learning scores to probabilities in clinical decision support.

    Science.gov (United States)

    Connolly, Brian; Cohen, K Bretonnel; Santel, Daniel; Bayram, Ulya; Pestian, John

    2017-08-07

    Probabilistic assessments of clinical care are essential for quality care. Yet, machine learning, which supports this care process has been limited to categorical results. To maximize its usefulness, it is important to find novel approaches that calibrate the ML output with a likelihood scale. Current state-of-the-art calibration methods are generally accurate and applicable to many ML models, but improved granularity and accuracy of such methods would increase the information available for clinical decision making. This novel non-parametric Bayesian approach is demonstrated on a variety of data sets, including simulated classifier outputs, biomedical data sets from the University of California, Irvine (UCI) Machine Learning Repository, and a clinical data set built to determine suicide risk from the language of emergency department patients. The method is first demonstrated on support-vector machine (SVM) models, which generally produce well-behaved, well understood scores. The method produces calibrations that are comparable to the state-of-the-art Bayesian Binning in Quantiles (BBQ) method when the SVM models are able to effectively separate cases and controls. However, as the SVM models' ability to discriminate classes decreases, our approach yields more granular and dynamic calibrated probabilities comparing to the BBQ method. Improvements in granularity and range are even more dramatic when the discrimination between the classes is artificially degraded by replacing the SVM model with an ad hoc k-means classifier. The method allows both clinicians and patients to have a more nuanced view of the output of an ML model, allowing better decision making. The method is demonstrated on simulated data, various biomedical data sets and a clinical data set, to which diverse ML methods are applied. Trivially extending the method to (non-ML) clinical scores is also discussed.

  6. Based on HNC Theory of Machine Translation Research ---English-Chinese Translation%基于HNC理论机助人译--英汉翻译对比研究

    Institute of Scientific and Technical Information of China (English)

    陆小鹿

    2014-01-01

    [Abstract]Machine Translation is indispensable to modern information era to overcome barriers of the language communication . HNC theory by using the method of primitives, hierarchical, network, formal, integrates the sentence structures with semantics, Based on the HNC theory, interprets the conversion of sentence category and patterns between English and Chinese and Contrast between English and Chinese.%机器翻译是现代信息时代克服语言交流障碍不可缺少的的手段。 HNC理论采用基元化、层次化、网络化、形式化的方法,通过句类精妙地把自然语言的表层结构和深层语义联系起来。通过机器翻译,对比研究英汉翻译中的句类句式转换的问题。

  7. NASA's online machine aided indexing system

    Science.gov (United States)

    Silvester, June P.; Genuardi, Michael T.; Klingbiel, Paul H.

    1993-01-01

    This report describes the NASA Lexical Dictionary, a machine aided indexing system used online at the National Aeronautics and Space Administration's Center for Aerospace Information (CASI). This system is comprised of a text processor that is based on the computational, non-syntactic analysis of input text, and an extensive 'knowledge base' that serves to recognize and translate text-extracted concepts. The structure and function of the various NLD system components are described in detail. Methods used for the development of the knowledge base are discussed. Particular attention is given to a statistically-based text analysis program that provides the knowledge base developer with a list of concept-specific phrases extracted from large textual corpora. Production and quality benefits resulting from the integration of machine aided indexing at CASI are discussed along with a number of secondary applications of NLD-derived systems including on-line spell checking and machine aided lexicography.

  8. Integrated Features by Administering the Support Vector Machine (SVM of Translational Initiations Sites in Alternative Polymorphic Contex

    Directory of Open Access Journals (Sweden)

    Nurul Arneida Husin

    2012-04-01

    Full Text Available Many algorithms and methods have been proposed for classification problems in bioinformatics. In this study, the discriminative approach in particular support vector machines (SVM is employed to recognize the studied TIS patterns. The applied discriminative approach is used to learn about some discriminant functions of samples that have been labelled as positive or negative. After learning, the discriminant functions are employed to decide whether a new sample is true or false. In this study, support vector machines (SVM is employed to recognize the patterns for studied translational initiation sites in alternative weak context. The method has been optimized with the best parameters selected; c=100, E=10-6 and ex=2 for non linear kernel function. Results show that with top 5 features and non linear kernel, the best prediction accuracy achieved is 95.8%. J48 algorithm is applied to compare with SVM with top 15 features and the results show a good prediction accuracy of 95.8%. This indicates that the top 5 features selected by the IGR method and that are performed by SVM are sufficient to use in the prediction of TIS in weak contexts.

  9. Qualitative: Open Source Python Tool for Quality Estimation over Multiple Machine Translation Outputs

    Directory of Open Access Journals (Sweden)

    Eleftherios Avramidis

    2014-09-01

    Full Text Available “Qualitative” is a python toolkit for ranking and selection of sentence-level output by different MT systems using Quality Estimation. The toolkit implements a basic pipeline for annotating the given sentences with black-box features. Consequently, it applies a machine learning mechanism in order to rank data based on models pre-trained on human preferences. The preprocessing pipeline includes support for language models, PCFG parsing, language checking tools and various other pre-processors and feature generators. The code follows the principles of object-oriented programming to allow modularity and extensibility. The tool can operate by processing both batch-files and single sentences. An XML-RPC interface is provided for hooking up with web-services and a graphical animated web-based interface demonstrates its potential on-line use.

  10. *En-phrases and Their Morphosyntactic and Semantic Particulars

    Directory of Open Access Journals (Sweden)

    Olga Thomason

    2011-08-01

    Full Text Available Prepositions constitute a problematic category because they tend to have complex semantic and syntagmatic properties, vary in case governance and are frequently in variation with each other. Reflexes of the Indo-European *en are well attested and remain productive in Greek, Classical Armenian, Gothic and Old Church Slavic among other languages. Correspondences of Greek en/eis with Gothic in, Armenian i and Old Church Slavic vŭ occur in many instances in the canonical gospels of the New Testament. However, Greek en/eis is frequently translated with other constructions in these languages that range from prepositional phrases that contain prepositions non-cognate with *en and nominal constructions to clausal structures. This investigation examines such correlations and points out morphological, syntactic, semantic and pragmatic factors that promote these correspondences. Case syncretism and changes in the prepositional governance are among leading reasons that prompt translators to look for translational means other than the cognate constructions. As expected, differences in the inventory of prepositions available in the languages being examined and diversity in division of semantic space by the prepositional phrases also add to the variety of possible renditions of Greek en/eis. Among pragmatic factors that influence the translation are a compositional marking of a certain concept, complexity of an event/situation being marked, and pragmatic appropriateness of a certain formation.

  11. Optimal Growth in Phrase Structure

    Directory of Open Access Journals (Sweden)

    David P. Medeiros

    2008-09-01

    Full Text Available This article claims that some familiar properties of phrase structure reflect laws of form. It is shown that optimal sequencing of recursive Merge operations so as to dynamically minimize c-command and containment relations in unlabeled branching forms leads to structural correlates of projection. Thus, a tendency for syntactic structures to pattern according to the X-bar schema (or other shapes exhibiting endocentricity and maximality of ‘non-head daughters’ is plausibly an emergent epiphenomenon of efficient computation. The specifier-head-complement configuration of X-bar theory is shown to be intimately connected to the Fibonacci sequence, suggesting connections with similar mathematical properties in optimal arboration and optimal packing elsewhere in nature.

  12. Determination of referential property and number of nouns in Japanese sentences for machine translation into English

    CERN Document Server

    Murata, M; Murata, Masaki; Nagao, Makoto

    1994-01-01

    When translating Japanese nouns into English, we face the problem of articles and numbers which the Japanese language does not have, but which are necessary for the English composition. To solve this difficult problem we classified the referential property and the number of nouns into three types respectively. This paper shows that the referential property and the number of nouns in a sentence can be estimated fairly reliably by the words in the sentence. Many rules for the estimation were written in forms similar to rewriting rules in expert systems. We obtained the correct recognition scores of 85.5\\% and 89.0\\% in the estimation of the referential property and the number respectively for the sentences which were used for the construction of our rules. We tested these rules for some other texts, and obtained the scores of 68.9\\% and 85.6\\% respectively.

  13. Exclusives, equatives and prosodic phrases in Samoan

    Directory of Open Access Journals (Sweden)

    Sasha Calhoun

    2017-02-01

    Full Text Available This paper investigates the functions of prosodic phrasing in the Austronesian VSO language Samoan. Two types of sentences are investigated, exclusives (involving the particle 'na’o' ‘only’ and equatives. Two complementary methodologies were used, a production study and an acceptability judgment study, to examine the prosodic realisation and relative naturalness of different word orderings of the two sentence types. The particle 'na’o' has an unusual distribution: preceding the initial constituent, be it the verb or a fronted noun phrase; or following the verb, but only modifying the absolutive (object. It was found that post-verbal absolutives modified by 'na’o' are usually not preceded by a phrase boundary, unlike unmodified absolutives which are consistently preceded by a high phrase tone (H- (cf. Yu 2009. Equatives in Samoan involve clauses which are the juxtaposition of two noun phrases, one the rheme (focus and the other the theme (topic. It was found that rhemes are usually followed by a phrase break, while for themes this is optional. Rheme-theme order was strongly preferred to theme-rheme order. These findings are argued to show a close relationship between information structure, constituent ordering and prosodic phrasing in Samoan. The preferred order of constituents in Samoan is rheme-theme, with a high phrase tone marking the end of the rheme. The absolutive argument is strongly preferred to be at the start of the theme. This article is part of theSpecial Collection: Prosody and constituent structure

  14. The Arabic noun phrase : a minimalist approach

    NARCIS (Netherlands)

    Kremers, Joost Merijn

    2003-01-01

    The aim of this study is to develop a model of the Arabic noun phrase from a minimalist perspective. It provides an analysis of many phenomena in the Arabic noun phrase, such as the construct state, adjectival agreement, definiteness inheritance, the formation of deverbal nouns and participles, etc.

  15. Explaining word order in the noun phrase

    DEFF Research Database (Denmark)

    Rijkhoff, Jan

    1990-01-01

    This article argues that word order in the noun phrase is largely determined by three iconic principles of constituent ordering. The patterns that these principles predict for simple noun phrases are tested against data from various existing samples. It appears that the predicted patterns are all...

  16. Generic Adaptively Secure Searchable Phrase Encryption

    Directory of Open Access Journals (Sweden)

    Kissel Zachary A.

    2017-01-01

    Full Text Available In recent years searchable symmetric encryption has seen a rapid increase in query expressiveness including keyword, phrase, Boolean, and fuzzy queries. With this expressiveness came increasingly complex constructions. Having these facts in mind, we present an efficient and generic searchable symmetric encryption construction for phrase queries. Our construction is straightforward to implement, and is proven secure under adaptively chosen query attacks (CQA2 in the random oracle model with an honest-but-curious adversary. To our knowledge, this is the first encrypted phrase search system that achieves CQA2 security. Moreover, we demonstrate that our document collection preprocessing algorithm allows us to extend a dynamic SSE construction so that it supports phrase queries. We also provide a compiler theorem which transforms any CQA2-secure SSE construction for keyword queries into a CQA2-secure SSE construction that supports phrase queries.

  17. Explaining word order in the noun phrase

    DEFF Research Database (Denmark)

    Rijkhoff, Jan

    1990-01-01

    This article argues that word order in the noun phrase is largely determined by three iconic principles of constituent ordering. The patterns that these principles predict for simple noun phrases are tested against data from various existing samples. It appears that the predicted patterns are all...... attested, with the exception of those stated to be nonexistent (or rare) by Greenberg's Universal 18. Some apparent counterexamples are examined in some detail and it is shown that they involve constituents that are not an integral part of the simple noun phrase....

  18. The Noun Phrase in Functional Discourse Grammar

    DEFF Research Database (Denmark)

    (last section from 'Introduction' of The Noun Phrase in Functional Discourse Grammar) The previous sections have shown that there are basically three proposals concerning the way NPs should be analysed in F(D)G: 1. the analysis according to “classical FG” (Dik 1997); 2. the analysis proposed in FDG...... treatment of noun phrases. Hengeveld takes the prototypical noun phrase as his point of departure, which he defines as an NP with a nominal head that denotes a concrete, first-order entity by lexical means and is used referentially rather than ascriptively. He then discusses examples of non-prototypical NPs...... earlier studies. The authors assume a process interpretation of the F(D)G model in which matters of form and order are interlaced. Their contribution is particularly concerned with agreement phenomena in the German noun phrase, which is noted for its morphological complexity. They show that their model...

  19. Translation Procedures (a Case Study)

    OpenAIRE

    Silalahi, Roswita; Lestari, Disa Evawani

    2012-01-01

    Roswita Silalahi This research is entitled Translation Procedures (A Case Study) which analysis translation procedures used in translating economic terms from English language book Principles of Economics by N. Gregory Mankiw into Bahasa Indonesia book Pengantar Ekonomi by Haris Munandar. The improvement of economic science which creates new economic terms challenges translator especially in translating a word or phrase, which in some cases, does not exist in the target l...

  20. Semantic Multisense Features and Translation of English Preposition OF

    Institute of Scientific and Technical Information of China (English)

    张招霞

    2009-01-01

    The linguist often claims,"English is a language of the preposition".It is a difficult point to understand and translate it though the preposition is simple in form.The preposition OF is such a simple preposition.On the surface,it seems very simple,indeed insufficient for talk,to OF which belongs to the simple preposition and prepositional phrases formed by it,but there are many kinds of its usages.This article help readers further understanding of semantic multisense features and translation of English prepositions and offer a kind of research for the fact that the English-Chinese machine translation system choose for its model.

  1. Cultural Connotations of Metaphorical Animal Phrases and Their Translation

    Institute of Scientific and Technical Information of China (English)

    雷慧娟; 王珊珊

    2011-01-01

    Communication cross cultures is becoming more and more frequent and extensive. In our case, cross-eultural communication refers to that between English and Chinese-both in culture and language. Due to different traditions, cultures, social mentality, ther

  2. morphogen: Translation into Morphologically Rich Languages with Synthetic Phrases

    Science.gov (United States)

    2013-10-01

    challenging, due to lex - ical sparsity on account of grammatical features being expressed with morphology. In this paper, we present an open-source Python tool...a stem σ is represented as a tuple of a lemma and its inflectional class. 52 E. Schlinger, V . Chahuneau, C. Dyer morphogen (51–62) она пыталась...sional morphology feature vector function, W is an m× n parameter matrix, and V is an n× n parameter matrix. In our implementation,φ andψ return

  3. Arabic Phrase-Level Contextual Polarity Recognition to Enhance Sentiment Arabic Lexical Semantic Database Generation

    Directory of Open Access Journals (Sweden)

    Samir E. Abdelrahman

    2014-10-01

    Full Text Available Most of opinion mining works need lexical resources for opinion which recognize the polarity of words (positive/ negative regardless their contexts which called prior polarity. The word prior polarity may be changed when it is considered in its contexts, for example, positive words may be used in phrases expressing negative sentiments, or vice versa. In this paper, we aim at generating sentiment Arabic lexical semantic database having the word prior coupled with its contextual polarities and the related phrases. To do that, we study first the prior polarity effects of each word using our Sentiment Arabic Lexical Semantic Database on the sentence-level subjectivity and Support Vector Machine classifier. We then use the seminal English two-step contextual polarity phrase-level recognition approach to enhance word polarities within its contexts. Our results achieve significant improvement over baselines.

  4. Translating poetry: Creative art or semantic science? A case study

    Directory of Open Access Journals (Sweden)

    C. Beckett

    2000-05-01

    Full Text Available There is an on-going debate as to the real value of translation: is it an art or a science? Is the translator engaged in genuine creation or is she merely transliterating the creation of someone else? In order to attempt to resolve this long-standing and thorny problem, this article examines the poet’s understanding of the “logos", the creative force of the word and the relationship which exists between the “signifiant" and the “signifié”. Extracts from poems by Alan Paton, Victor Hugo and Pablo Neruda serve to illustrate that poetic words not only transmit the poet’s experience but actually create it. If the poet is sensitive to the creative nature of language, as these two extracts suggest he should be, it follows logically that a good translator too must be aware of the degree to which language can create, and this perception must be implemented in the subsequent translation. Because only human beings and not machines possess sensitivity, it stands to reason that a machine is incapable of effectively translating the most emotional of literary genres: poetry. So as to illustrate this fact, this article compares and contrasts a computer-generated translation of Paul Verlaine's poem “Chanson d’automne" with three “human-generated” translations. In my own translation, comments and justifications are made as to the choice of a particular word or phrase proposed as translation. The conclusion is reached that translation implies a high degree of sensitivity towards the poet’s original intention as well as a collaborative process between poet and translator which results in an entirely new poem which involves as much, but different creativity as the original writing of the poem.

  5. Locative prepositional phrases in severe aphasia.

    Science.gov (United States)

    Weinrich, M; McCall, D; Shoosmith, L; Thomas, K; Katzenberger, K; Weber, C

    1993-07-01

    Six severely aphasic patients were trained on C-VIC, a computerized alternative communication system. Patients with both global and Broca's aphasia were able to produce and comprehend locative prepositions using C-VIC. The Broca's aphasic patients were able to produce and interpret symbol order correctly in C-VIC locative prepositional phrases, despite their difficulties in performing the same tasks using English. One patient's verbal production of locative prepositional phrases improved markedly after C-VIC training. These results suggest that the "mapping" impairment suffered by some Broca's aphasics may be ameliorated by use of alternative communication system, such as C-VIC.

  6. Grammatical Gender in Translation

    Science.gov (United States)

    Bordag, Denisa; Pechmann, Thomas

    2008-01-01

    In three experiments native speakers of Czech translated bare nouns and gender-marked adjective + noun phrases into German, their second language (L2). In Experiments 1-3 we explored the so-called gender interference effect from first language (L1) as observed in previous picture naming studies (naming latencies were longer when the L1 noun and…

  7. Translation of Idioms in Chinese Literature

    Institute of Scientific and Technical Information of China (English)

    赵丹丹

    2009-01-01

    <正>This paper explains two approaches to the translation of idioms in Chinese literature: free translation and literal translation.Chinese idioms,in its broad sense including set phrases,proverbs and two-part allegorical,have the specific features and rich in their content.By analyzing the basic principles of translation,we can better understand how to make full use of various translation methods,which can lay a good foundation for the translation of Chinese idioms.

  8. Phrase Completions: An Alternative to Likert Scales.

    Science.gov (United States)

    Hodge, David R.; Gillespie, David

    2003-01-01

    Delineates the problems with Likert scales, with a particular emphasis on multidimensionality and coarse response categories, and proposes a new measurement method called "phrase completions," which has been designed to circumvent the problems inherent in Likert scales. Also conducts an exploratory test, in which Likert items were adapted to…

  9. On Directionality of Phrase Structure Building

    Science.gov (United States)

    Chesi, Cristiano

    2015-01-01

    Minimalism in grammatical theorizing (Chomsky in "The minimalist program." MIT Press, Cambridge, 1995) led to simpler linguistic devices and a better focalization of the core properties of the structure building engine: a lexicon and a free (recursive) phrase formation operation, dubbed Merge, are the basic components that serve in…

  10. On Directionality of Phrase Structure Building

    Science.gov (United States)

    Chesi, Cristiano

    2015-01-01

    Minimalism in grammatical theorizing (Chomsky in "The minimalist program." MIT Press, Cambridge, 1995) led to simpler linguistic devices and a better focalization of the core properties of the structure building engine: a lexicon and a free (recursive) phrase formation operation, dubbed Merge, are the basic components that serve in…

  11. A Lexical Phrase Grammar for ESL.

    Science.gov (United States)

    Nattinger, James R.

    1980-01-01

    Lexical phrases such as deictic locutions, phrasal constraints, sentence builders, and situational utterances are examined using categories from artificial intelligence. It is argued that these and other types of patterned speech should be carefully organized and given a greater place in English as a Second Language curricula than at present. (PMJ)

  12. On degree phrases and result clauses

    NARCIS (Netherlands)

    Rijkhoek, Paulien Dea

    1998-01-01

    This dissertation consists of two parts, one on degree phrases and one on result clauses. Sentence (1) below exemplifies a result clause construction: (1) Zij had zo mooi gezongen dat het publiek er stil van was she has so beautifully sang that the audience there silent of was ‘She had sung so beaut

  13. Noun-Phrase Indexing, Pronouns, and the "Definite Article."

    Science.gov (United States)

    Sampson, Geoffrey

    Chomsky has suggested that certain lexical items, which he calls "referential items," should be given integer markers (or "indices") and that the noun-phrase deletion transformation which creates reflexive pronouns should be limited to cases where the noun-phrase to be deleted is fully identical to the antecedent noun-phrase,…

  14. Can Intonational Phrase Structure Be Primed (Like Syntactic Structure)?

    Science.gov (United States)

    Tooley, Kristen M.; Konopka, Agnieszka E.; Watson, Duane G.

    2014-01-01

    In 3 experiments, we investigated whether intonational phrase structure can be primed. In all experiments, participants listened to sentences in which the presence and location of intonational phrase boundaries were manipulated such that the recording included either no intonational phrase boundaries, a boundary in a structurally dispreferred…

  15. A Comparative Study on the Effectiveness of Two Song-Teaching Methods: Holistic vs. Phrase-by-Phrase

    Science.gov (United States)

    Persellin, Diane; Bateman, Laura

    2009-01-01

    The purpose of our study was to compare the effectiveness of two song-teaching methods: holistic and phrase-by-phrase. Thirty-two first-grade children (n = 32) from two music classes in an urban elementary school were taught two folksongs. The first class (n = 16) was taught one song through the phrase-by-phrase method and another song through the…

  16. A Comparative Study on the Effectiveness of Two Song-Teaching Methods: Holistic vs. Phrase-by-Phrase

    Science.gov (United States)

    Persellin, Diane; Bateman, Laura

    2009-01-01

    The purpose of our study was to compare the effectiveness of two song-teaching methods: holistic and phrase-by-phrase. Thirty-two first-grade children (n = 32) from two music classes in an urban elementary school were taught two folksongs. The first class (n = 16) was taught one song through the phrase-by-phrase method and another song through the…

  17. Machine-translatability and post-editing effort: an empirical study using translog and choice network analysis

    OpenAIRE

    O'Brien, Sharon

    2006-01-01

    Studies on Controlled Language (CL) suggest that by removing features that are known to be problematic for MT (termed here “negative translatability indicators”, or “NTIs”), the MT output can be improved. It is assumed that an improvement in the output will result in lower post-editing effort. This study tests that assumption by measuring the technical, temporal and cognitive post-editing effort (Krings 2001) for English sentences in a user manual that have been translated into German using a...

  18. Translation Techniques

    Directory of Open Access Journals (Sweden)

    Marcia Pinheiro

    2015-05-01

    Full Text Available In this paper, we discuss three translation techniques: literal, cultural, and artistic. Literal translation is a well-known technique, which means that it is quite easy to find sources on the topic. Cultural and artistic translation may be new terms. Whilst cultural translation focuses on matching contexts, artistic translation focuses on matching reactions. Because literal translation matches only words, it is not hard to find situations in which we should not use this technique.  Because artistic translation focuses on reactions, judging the quality of an artistic translation work is one of the most difficult things one can do. We end up having a score of complexity and humanity for each one of the mentioned techniques: Literal translation would be the closest thing we have to the machines world and artistic translation would be the closest thing we have to the purely human world. By creating these classifications and studying the subtleties of each one of them, we are adding degrees of quality to our courses and to translation as a professional field. The main contribution of this paper is then the formalization of such a piece of knowledge. We, however, also lay the foundations for studies of this type.

  19. Phrase frequency effects in language production.

    Directory of Open Access Journals (Sweden)

    Niels Janssen

    Full Text Available A classic debate in the psychology of language concerns the question of the grain-size of the linguistic information that is stored in memory. One view is that only morphologically simple forms are stored (e.g., 'car', 'red', and that more complex forms of language such as multi-word phrases (e.g., 'red car' are generated on-line from the simple forms. In two experiments we tested this view. In Experiment 1, participants produced noun+adjective and noun+noun phrases that were elicited by experimental displays consisting of colored line drawings and two superimposed line drawings. In Experiment 2, participants produced noun+adjective and determiner+noun+adjective utterances elicited by colored line drawings. In both experiments, naming latencies decreased with increasing frequency of the multi-word phrase, and were unaffected by the frequency of the object name in the utterance. These results suggest that the language system is sensitive to the distribution of linguistic information at grain-sizes beyond individual words.

  20. Lost in Translation

    Institute of Scientific and Technical Information of China (English)

    2011-01-01

    The intricacies of language still eludes even the most sophisticated technology IT is no surprise Jost Zetzsche,an English-to-German translator,raised the question of whether machine translation would ever replace the human variety in front of 700 interpreters and translators who gathered in San Francisco to discuss topical issues in the translation industry.

  1. Translators and Tools

    Institute of Scientific and Technical Information of China (English)

    2011-01-01

    Technology is not yet advanced enough to capture the nuances of language It is no surprise Jost Zetzsche,an English-to-German translator,raised the question of whether machine translation would ever replace the human variety in front of 700 interpreters and translators who gathered in San Francisco to discuss topical issues in the translation industry.

  2. Lost in Translation

    Institute of Scientific and Technical Information of China (English)

    Ding Zhitao; Chen Wen

    2011-01-01

    IT is no surprise Jost Zetzsche,an English-to-German translator,raised the question of whether machine translation would ever replace the human variety in front of 700 interpreters and translators who gathered in San Francisco to discuss topical issues in the translation industry.

  3. Translation: Aids, Robots, and Automation.

    Science.gov (United States)

    Andreyewsky, Alexander

    1981-01-01

    Examines electronic aids to translation both as ways to automate it and as an approach to solve problems resulting from shortage of qualified translators. Describes the limitations of robotic MT (Machine Translation) systems, viewing MAT (Machine-Aided Translation) as the only practical solution and the best vehicle for further automation. (MES)

  4. Towards a French/French Sign Language Machine-Translation System (TLF Vers un système de traduction automatique : français/langue des signes française (TLF

    Directory of Open Access Journals (Sweden)

    Gaspard Breton

    2009-04-01

    Full Text Available We present a work in the framewok of machine translation from French to French Sign Language (FSL with synthesis of gestures through a virtual agent. We first give some descriptive and theoretical elements about the FSL. Then we propose a formalization of the standard part of the FSL lexicon as well as propositions concerning some morpho-syntactic phenomena. Then we present the machine translation system based on interlingua representation system "TiLT" developed at France Telecom R&D and the adaptation of its generation module to FSL. We end with the avatar technology also developed at France Telecom R&D and its FSL treatment.

  5. Effects on Text Simplification: Evaluation of Splitting Up Noun Phrases.

    Science.gov (United States)

    Leroy, Gondy; Kauchak, David; Hogue, Alan

    2016-01-01

    To help increase health literacy, we are developing a text simplification tool that creates more accessible patient education materials. Tool development is guided by a data-driven feature analysis comparing simple and difficult text. In the present study, we focus on the common advice to split long noun phrases. Our previous corpus analysis showed that easier texts contained shorter noun phrases. Subsequently, we conducted a user study to measure the difficulty of sentences containing noun phrases of different lengths (2-gram, 3-gram, and 4-gram); noun phrases of different conditions (split or not); and, to simulate unknown terms, pseudowords (present or not). We gathered 35 evaluations for 30 sentences in each condition (3 × 2 × 2 conditions) on Amazon's Mechanical Turk (N = 12,600). We conducted a 3-way analysis of variance for perceived and actual difficulty. Splitting noun phrases had a positive effect on perceived difficulty but a negative effect on actual difficulty. The presence of pseudowords increased perceived and actual difficulty. Without pseudowords, longer noun phrases led to increased perceived and actual difficulty. A follow-up study using the phrases (N = 1,350) showed that measuring awkwardness may indicate when to split noun phrases. We conclude that splitting noun phrases benefits perceived difficulty but hurts actual difficulty when the phrasing becomes less natural.

  6. Generic noun phrases in mother-child conversations.

    Science.gov (United States)

    Pappas, A; Gelman, S A

    1998-02-01

    Generic noun phrases (e.g. Tigers are fierce) are of interest for their semantic properties: they capture 'essential' properties, are timeless, and are context-free. The present study examines use of generic noun phrases by preschool children and their mothers. Mother-child pairs were videotaped while looking through a book of animal pictures. Each page depicted either a single instance of a particular category (e.g. one crab) or multiple instances of a particular category (e.g. many crabs). The results indicated a striking difference in how generics vs. non-generics were distributed, both in the speech of mothers and in the speech of preschool children. Whereas the form of non-generic noun phrases was closely linked to the structure of the page (i.e. singular noun phrases were used more often when a single instance was presented; plural noun phrases were used more often when multiple instances were presented), the form of generic noun phrases was independent of the information depicted (e.g. plural noun phrases were as frequent when only one instance was presented as when multiple instances were presented). We interpret the data as providing evidence that generic noun phrases differ in their semantics and conceptual organization from non-generic noun phrases, both in the input to young children and in children's own speech. Thus, this simple linguistic device may provide input to, and a reflection of, children's early developing notion of 'kinds'.

  7. Automatic translation among spoken languages

    Science.gov (United States)

    Walter, Sharon M.; Costigan, Kelly

    1994-01-01

    The Machine Aided Voice Translation (MAVT) system was developed in response to the shortage of experienced military field interrogators with both foreign language proficiency and interrogation skills. Combining speech recognition, machine translation, and speech generation technologies, the MAVT accepts an interrogator's spoken English question and translates it into spoken Spanish. The spoken Spanish response of the potential informant can then be translated into spoken English. Potential military and civilian applications for automatic spoken language translation technology are discussed in this paper.

  8. Rejoindre une phrase, s’y poursuivre

    OpenAIRE

    2014-01-01

    Comment vivons-nous avec les phrases (et pas nécessairement avec les récits synthétiques) que nous lisons ou que nous avons lues ? Comment les citations littéraires deviennent-elles l’occasion d’une orientation intérieure de notre expérience ? Cet article souligne le mouvement esthétique par lequel nous pouvons être attirés vers des possibilités authentiques de nous-mêmes dans l’acte même de la lecture. Pour ainsi faire, il offre une vision dynamique du « style » et explore la logique de la l...

  9. The Axial Part Phrase in Japanese

    Directory of Open Access Journals (Sweden)

    Kaori Takamine

    2007-01-01

    Full Text Available In this paper, I investigate the categorial status of spatial terms in locative/directional expressions in Japanese. I will show that a certain class of spatial terms have a distinct categorial status from both regular postpositions and nouns. On one hand, syntactic diagnostics such as doubling, coordination by to, and co-occurrence with demonstratives indicate that these spatial terms belong to a nominal category rather than to a postpositional category. On the other hand, the fact that these spatial terms are modified by range modifiers indicates that they are more similar to regular postpositions than to nouns. On the basis of these diagnostics, I will argue that spatial terms in Japanese need to be assigned a new category Axial Part Phrase which is proposed by Svenonius 2006.

  10. Distance and Defamiliarisation: Translation as Philosophical Method

    Science.gov (United States)

    Ruitenberg, Claudia W.

    2009-01-01

    In this article I posit translation as philosophical operation that disrupts commonsense meaning and understanding. By defamiliarising language, translation can arrest thinking about a text in a way that assumes the language is understood. In recent work I have grappled with the phrase "ways of knowing", which, for linguistic and conceptual…

  11. Literary authorship attribution with phrase-structure fragments

    NARCIS (Netherlands)

    van Cranenburgh, Andreas

    2012-01-01

    We present a method of authorship attribution and stylometry that exploits hierarchical information in phrase-structures. Contrary to much previous work in stylometry, we focus on content words rather than function words. Texts are parsed to obtain phrase-structures, and compared with texts to be an

  12. The Dependency Structure of Coordinate Phrases: A Corpus Approach

    Science.gov (United States)

    Temperley, David

    2005-01-01

    Hudson (1990) proposes that each conjunct in a coordinate phrase forms dependency relations with heads or dependents outside the coordinate phrase (the "multi-head" view). This proposal is tested through corpus analysis of Wall Street Journal text. For right-branching constituents (such as direct-object NPs), a short-long preference for conjunct…

  13. On the Translation Methods of Biblical Idioms

    Institute of Scientific and Technical Information of China (English)

    杜丽芬

    2016-01-01

    Biblical idioms are widely used in daily communication in English-speaking countries. In order to prevent cross-cultural communication barriers, it is important for non-native English speakers to understand the background information of biblical idioms, such as religious stories. Biblical idioms can be classified into two categories: phrases originated from Christian stories and quotations extracted from the Bible. Different biblical idioms requires different translation methods. Three main translation methods, literal translation, free translation, and corresponding translation, are discussed in this paper with detailed case analysis. The three translation methods have their own advantages and are applied in different circumstances.

  14. Implementation of the common phrase index method on the phrase query for information retrieval

    Science.gov (United States)

    Fatmawati, Triyah; Zaman, Badrus; Werdiningsih, Indah

    2017-08-01

    As the development of technology, the process of finding information on the news text is easy, because the text of the news is not only distributed in print media, such as newspapers, but also in electronic media that can be accessed using the search engine. In the process of finding relevant documents on the search engine, a phrase often used as a query. The number of words that make up the phrase query and their position obviously affect the relevance of the document produced. As a result, the accuracy of the information obtained will be affected. Based on the outlined problem, the purpose of this research was to analyze the implementation of the common phrase index method on information retrieval. This research will be conducted in English news text and implemented on a prototype to determine the relevance level of the documents produced. The system is built with the stages of pre-processing, indexing, term weighting calculation, and cosine similarity calculation. Then the system will display the document search results in a sequence, based on the cosine similarity. Furthermore, system testing will be conducted using 100 documents and 20 queries. That result is then used for the evaluation stage. First, determine the relevant documents using kappa statistic calculation. Second, determine the system success rate using precision, recall, and F-measure calculation. In this research, the result of kappa statistic calculation was 0.71, so that the relevant documents are eligible for the system evaluation. Then the calculation of precision, recall, and F-measure produces precision of 0.37, recall of 0.50, and F-measure of 0.43. From this result can be said that the success rate of the system to produce relevant documents is low.

  15. Semantic Translation and Communicative Translation:A Case Study in My Chinese Translation of Alnwick’s Inspired Heritage

    Institute of Scientific and Technical Information of China (English)

    叶谋锦

    2013-01-01

      This essay deals with my Chinese translation of a tour brochure and highlights some issues in translation. The translation strategies semantic translation and communicative translation are famous approaches proposed by Peter Newmark in his book Approaches to Translation. He puts forward his text typology and categorizes all text types into three types, i.e. expressive text, informative text and vocative text. In order to solve problems emerging in different types, he presents two dominant translation approaches, namely, semantic translation and communicative translation (Newmark, 2001). These translation methods are regarded as a milestone in translation theory and highly quoted by many scholars. Semantic translation emphasizes the thinking process of the author and represents the origin meanings of the source text. In this case, translator makes little interference with the harmony of the source text. Semantic translation can preserve the characteristics and expression patterns of the source language, forcing readers to follow the thinking pattern of the author. While communicative translation indicates that the influence of translation on target readers should be identical to the effect of original article on source readers. Communicative translation is target-language-orientated and focuses on transference of information. This translation strategy allows translator to reorganize sentences with classical phrases in the target language to make his translation accessible to target readers. Of course, these different translation strategies often com⁃plement with each other, and should be combined together in translation.

  16. 理性主义与经验主义相结合的机器翻译研究策略%Rationalism and Empiricism on the Combination in Machine Translation

    Institute of Scientific and Technical Information of China (English)

    徐金安

    2011-01-01

    主要介绍了基于规则、基于实例和基于统计等3种主流机器翻译方法,探讨了自然语言处理技术和机器翻译中基于规则的理性主义方法和基于统计的经验主义方法的优缺点,结合机器翻译研究的现状和发展方向,提出了规则和统计相结合的机器翻译方法的基本思路,阐述了词义消歧中的理性主义方法和经验主义方法相结合的发展方向,对机器翻译的发展趋势进行了探讨.%This paper firstly discussed the development of machine translation, secondly described the advantages and disadvantages of the rationalism and empiricism in natural language processing and machine translation, and thirdly introduced our basic ideas of combination of the rule-based approach and statistical approach.Furthermore, the paper deliberated the tendency of combination of the rationalism and empiricism in word sense disambiguation, finally summarized the developmental tendency of machine translation.

  17. On directionality of phrase structure building.

    Science.gov (United States)

    Chesi, Cristiano

    2015-02-01

    Minimalism in grammatical theorizing (Chomsky in The minimalist program. MIT Press, Cambridge, 1995) led to simpler linguistic devices and a better focalization of the core properties of the structure building engine: a lexicon and a free (recursive) phrase formation operation, dubbed Merge, are the basic components that serve in building syntactic structures. Here I suggest that by looking at the elementary restrictions that apply to Merge (i.e., selection and licensing of functional features), we could conclude that a re-orientation of the syntactic derivation (from bottom-up/right-left to top-down/left-right) is necessary to make the theory simpler, especially for long-distance (filler-gap) dependencies, and is also empirically more adequate. If the structure building operations would assemble lexical items in the order they are pronounced (Phillips in Order and structure. PhD thesis, MIT, 1996; Chesi in Phases and cartography in linguistic computation: Toward a cognitively motivated computational model of linguistic competence. PhD thesis, Università di Siena, 2004; Chesi in Competence and computation: Toward a processing friendly minimalist grammar. Unipress, Padova, 2012), on-line performance data could better fit the grammatical model, without resorting to external "performance factors." The phase-based, top-down (and, as a consequence, left-right) Minimalist Grammar here discussed goes in this direction, ultimately showing how strong Islands (Huang in Logical relations in Chinese and the theory of grammar. PhD thesis, MIT, 1982) and intervention effects (Gordon et al. in J Exp Psychol Learn Mem Cogn 27:1411-1423, 2001, Gordon et al. in J Mem Lang 51:97-114, 2004) could be better explained in structural terms assuming this unconventional derivational direction.

  18. Noun-Phrase Analysis in Unrestricted Text for Information Retrieval

    CERN Document Server

    Evans, D A; Evans, David A.; Zhai, Chengxiang

    1996-01-01

    Information retrieval is an important application area of natural-language processing where one encounters the genuine challenge of processing large quantities of unrestricted natural-language text. This paper reports on the application of a few simple, yet robust and efficient noun-phrase analysis techniques to create better indexing phrases for information retrieval. In particular, we describe a hybrid approach to the extraction of meaningful (continuous or discontinuous) subcompounds from complex noun phrases using both corpus statistics and linguistic heuristics. Results of experiments show that indexing based on such extracted subcompounds improves both recall and precision in an information retrieval system. The noun-phrase analysis techniques are also potentially useful for book indexing and automatic thesaurus extraction.

  19. A Rule-Based Approach To Prepositional Phrase Attachment Disambiguation

    CERN Document Server

    Brill, E; Brill, Eric; Resnik, Philip

    1994-01-01

    In this paper, we describe a new corpus-based approach to prepositional phrase attachment disambiguation, and present results comparing performance of this algorithm with other corpus-based approaches to this problem.

  20. Identifying synonymy between relational phrases using word embeddings.

    Science.gov (United States)

    Nguyen, Nhung T H; Miwa, Makoto; Tsuruoka, Yoshimasa; Tojo, Satoshi

    2015-08-01

    Many text mining applications in the biomedical domain benefit from automatic clustering of relational phrases into synonymous groups, since it alleviates the problem of spurious mismatches caused by the diversity of natural language expressions. Most of the previous work that has addressed this task of synonymy resolution uses similarity metrics between relational phrases based on textual strings or dependency paths, which, for the most part, ignore the context around the relations. To overcome this shortcoming, we employ a word embedding technique to encode relational phrases. We then apply the k-means algorithm on top of the distributional representations to cluster the phrases. Our experimental results show that this approach outperforms state-of-the-art statistical models including latent Dirichlet allocation and Markov logic networks.

  1. Statistical Machine Translation of Japanese

    Science.gov (United States)

    2007-03-01

    spoken, it may be phonologically displayed as /W AA T AA SH IY/, where each of the six elements between the slashes is a written representation of...phonemes. A chart showing the possible assignment of kana to phonemes is in Figure 3.1. The complete phonetic symbol set in Table 6.4 is the set of...sequences output by the recognizer. 6.6 Acoustic Model 66 The acoustic model, P(O|S) from Bayes’ rule, maps acoustic features to distinct phonetic

  2. Noun-Phrase Analysis in Unrestricted Text for Information Retrieval

    OpenAIRE

    Evans, David A.; Zhai, Chengxiang

    1996-01-01

    Information retrieval is an important application area of natural-language processing where one encounters the genuine challenge of processing large quantities of unrestricted natural-language text. This paper reports on the application of a few simple, yet robust and efficient noun-phrase analysis techniques to create better indexing phrases for information retrieval. In particular, we describe a hybrid approach to the extraction of meaningful (continuous or discontinuous) subcompounds from ...

  3. Assigning phrase breaks from part-of-speech sequences.

    OpenAIRE

    Black, Alan W.; Taylor, Paul A.

    1997-01-01

    One of the important stages in the process of turning unmarked text into speech is the assignment of appropriate phrase break boundaries. Phrase break boundaries are important to later modules including accent assignment, duration control and pause insertion. A number of different algorithms have been proposed for such a task, ranging from the simple to the complex. These different algorithms require different information such as part of speech tags, syntax and even ...

  4. Verbal behavior in Alzheimer disease patients: Analysis of phrase repetition

    Directory of Open Access Journals (Sweden)

    Juliana Francisca Cecato

    Full Text Available Abstract Language problems in the elderly with AD are due to the fact that deterioration occurs not only in semantic memory, but in a group of cognitive factors, evidenced by a deficiency in search strategies for linguistic information. Objectives: To evaluate phrase repetition in two cognitive tests, the MMSE and MoCA, in a group of Alzheimer disease patients (AD and normal controls. Methods: A Cross-sectional study was conducted involving 20 patients who sought medical assistance at a geriatric institute in Jundiaí, São Paulo. The subjects underwent a detailed clinical examination and neuropsychometric evaluation. All subjects with AD met DSM-IV and NINCDS-ADRDA criteria. Ten patients received a diagnosis of AD and 10 were healthy subjects, forming the control group (CG. Results: All participants correctly answered the phrase from the MMSE (phrase 1. The MoCA phrases (phrases 2 and 3 were correct in 80% and 90%, respectively in the CG and in 40% and 50%, respectively in the AD group. Conclusions: The MoCA test proved more effective in evaluating the echoic behavior in AD patients compared to the MMSE. The simpler phrase repetition task in the MMSE was found to be less sensitive in detecting mild language decline in AD patients.

  5. Translator awareness Translator awareness

    Directory of Open Access Journals (Sweden)

    Wolfram Wilss

    2008-04-01

    Full Text Available If we want to encompass adequately the wide-ranging field of human translation, it is necessary to include in translation studies (TS the concept of translator awareness (or translator consciousness, for that matter. However, this is more easily said than done, because this concept does not easily lend itself to definition, let alone to measurement, e. g., by investigating translator behaviour. To put it bluntly: Translator awareness is a fuzzy concept. Like many obviously difficult-to-define concepts, with which dialogue in TS is burdened, translator awareness lacks an articulated theory within which different forms of translator behaviour can be convincingly related to, or distinguished from, one another. Hence, TS has so far not tackled, at least not systematically, the issue of translator awareness. If we want to encompass adequately the wide-ranging field of human translation, it is necessary to include in translation studies (TS the concept of translator awareness (or translator consciousness, for that matter. However, this is more easily said than done, because this concept does not easily lend itself to definition, let alone to measurement, e. g., by investigating translator behaviour. To put it bluntly: Translator awareness is a fuzzy concept. Like many obviously difficult-to-define concepts, with which dialogue in TS is burdened, translator awareness lacks an articulated theory within which different forms of translator behaviour can be convincingly related to, or distinguished from, one another. Hence, TS has so far not tackled, at least not systematically, the issue of translator awareness.

  6. Portmanteau Constructions, Phrase Structure, and Linearization.

    Science.gov (United States)

    Chan, Brian Hok-Shing

    2015-01-01

    In bilingual code-switching which involves language-pairs with contrasting head-complement orders (i.e., head-initial vs. head-final), a head may be lexicalized from both languages with its complement sandwiched in the middle. These so-called "portmanteau" sentences (Nishimura, 1985, 1986; Sankoff et al., 1990, etc.) have been attested for decades, but they had never received a systematic, formal analysis in terms of current syntactic theory before a few recent attempts (Hicks, 2010, 2012). Notwithstanding this lack of attention, these structures are in fact highly relevant to theories of linearization and phrase structure. More specifically, they challenge binary-branching (Kayne, 1994, 2004, 2005) as well as the Antisymmetry hypothesis (ibid.). Not explained by current grammatical models of code-switching, including the Equivalence Constraint (Poplack, 1980), the Matrix Language Frame Model (Myers-Scotton, 1993, 2002, etc.), and the Bilingual Speech Model (Muysken, 2000, 2013), the portmanteau construction indeed looks uncommon or abnormal, defying any systematic account. However, the recurrence of these structures in various datasets and constraints on them do call for an explanation. This paper suggests an account which lies with syntax and also with the psycholinguistics of bilingualism. Assuming that linearization is a process at the Sensori-Motor (SM) interface (Chomsky, 2005, 2013), this paper sees that word order is not fixed in a syntactic tree but it is set in the production process, and much information of word order rests in the processor, for instance, outputting a head before its complement (i.e., head-initial word order) or the reverse (i.e., head-final word order). As for the portmanteau construction, it is the output of bilingual speakers co-activating two sets of head-complement orders which summon the phonetic forms of the same word in both languages. Under this proposal, the underlying structure of a portmanteau construction is as simple as an

  7. Portmanteau constructions, phrase structure and linearization

    Directory of Open Access Journals (Sweden)

    Brian Hok-Shing Chan

    2015-12-01

    Full Text Available In bilingual code-switching which involves language-pairs with contrasting head-complement orders (i.e. head-initial vs head-final, a head may be lexicalized from both languages with its complement sandwiched in the middle. These so-called portmanteau sentences (Nishimura, 1985, 1986; Sankoff, Poplack, and Vanniarajan, 1990, etc. have been attested for decades, but they had never received a systematic, formal analysis in terms of current syntactic theory before a few recent attempts (Hicks, 2010, 2012. Notwithstanding this lack of attention, these structures are in fact highly relevant to theories of linearization and phrase structure. More specifically, they challenge binary-branching (Kayne, 1994, 2004, 2005 as well as the Antisymmetry hypothesis (ibid.. Not explained by current grammatical models of code-switching, including the Equivalence Constraint (Poplack, 1980, the Matrix Language Frame Model (Myers-Scotton, 1993, 2002, etc., and the Bilingual Speech Model (Muysken, 2000, 2013, the portmanteau construction indeed looks uncommon or abnormal, defying any systematic account. However, the recurrence of these structures in various datasets and constraints on them do call for an explanation. This paper suggests an account which lies with syntax and also with the psycholinguistics of bilingualism. Assuming that linearization is a process at the Sensori-Motor (SM interface (Chomsky, 2005; 2013, this paper sees that word order is not fixed in a syntactic tree but it is set in the production process, and much information of word order rests in the processor, for instance, outputting a head before its complement (i.e. head-initial word order or the reverse (i.e. head-final word order. As for the portmanteau construction, it is the output of bilingual speakers co-activating two sets of head-complement orders which summon the phonetic forms of the same word in both languages. Under this proposal, the underlying structure of a portmanteau

  8. Bilinguals' Plausibility Judgments for Phrases with a Literal vs. Non-literal Meaning: The Influence of Language Brokering Experience

    Directory of Open Access Journals (Sweden)

    Belem G. López

    2017-09-01

    Full Text Available Previous work has shown that prior experience in language brokering (informal translation may facilitate the processing of meaning within and across language boundaries. The present investigation examined the influence of brokering on bilinguals' processing of two word collocations with either a literal or a figurative meaning in each language. Proficient Spanish-English bilinguals classified as brokers or non-brokers were asked to judge if adjective+noun phrases presented in each language made sense or not. Phrases with a literal meaning (e.g., stinging insect were interspersed with phrases with a figurative meaning (e.g., stinging insult and non-sensical phrases (e.g., stinging picnic. It was hypothesized that plausibility judgments would be facilitated for literal relative to figurative meanings in each language but that experience in language brokering would be associated with a more equivalent pattern of responding across languages. These predictions were confirmed. The findings add to the body of empirical work on individual differences in language processing in bilinguals associated with prior language brokering experience.

  9. The Syntax and Semantics of Phonological Phrasing in Shanghai and Hokkien.

    Science.gov (United States)

    Soh, Hooi Ling

    2001-01-01

    It has been observed that phonological phrasing in Shanghai Chinese distinguishes certain determiners from others and wh-quantifier phrases from non-wh-quantifier phrases. This article shows that such phonological phrasing distinctions are also found in Hokkien Chinese but in a more restricted environment. (Author/VWL)

  10. Case Markers in Mongolian: A Means for Encoding Null Constituents in Noun Phrase and Relative Clause

    OpenAIRE

    Tseden Otgonsuren

    2017-01-01

    This paper focuses on the capacity of the case markers in the Mongolian language, as a relative element, to generate any finite noun phrase or relative clause based on their syntactic function or relationship. In Mongolian, there are two different approaches to generate noun phrases: parataxis and hypotaxis. According to my early observation, if the noun phrase generated through the parataxis, is the complement of the postpositional phrase, the head word of the relevant noun phrase can be tru...

  11. On the Translation of English-Chinese Idioms

    Institute of Scientific and Technical Information of China (English)

    崔建设

    2013-01-01

    The idiomatic phrase is equal to the phrase that people use frequently language in the world that had long and splendid history all contains large quantities of idiom. They form regular phrases and short sentences through long use and refinement in the development of language. The idiomatic phrases contain varied cultural information, embody a lot of cultural feature, and cul⁃tural background. The idiomatic phrases display distinct image and figure of speech, and they also have bright national characteris⁃tic and regional (local) color. The paper includes four parts. The first chapter is to introduce the generalization of idioms and trans⁃lation, and the second part is to list four different definitions of idiom according to some translational experts, next step leads in classification and sources of idiom. The third chapter introduces some common strategies of idiomatic translation such as literal translation, literal translation plus notes, free translation, and the combination of literal and free translation. At last sum up the pa⁃per.

  12. Learning Noun Phrase Anaphoricity in Coreference Resolution via Label Propagation

    Institute of Scientific and Technical Information of China (English)

    Guo-Dong Zhou; Fang Kong

    2011-01-01

    Knowledge of noun phrase anaphoricity might be profitably exploited in coreference resolution to bypass the resolution of non-anaphoric noun phrases. However, it is surprising to notice that recent attempts to incorporate automatically acquired anaphoricity information into coreference resolution systems have been far from expectation. This paper proposes a global learning method in determining the anaphoricity of noun phrases via a label propagation algorithm to improve learning-based coreference resolution. In order to eliminate the huge computational burden in the label propagation algorithm, we employ the weighted support vectors as the critical instances to represent all the anaphoricity-labeled NP instances in the training texts. In addition, two kinds of kernels, i.e., the feature-based RBF (Radial Basis Function) kernel and the convolution tree kernel with approximate matching, are explored to compute the anaphoricity similarity between two noun phrases. Experiments on the ACE2003 corpus demonstrate the great effectiveness of our method in anaphoricity determination of noun phrases and its application in learning-based coreference resolution.

  13. Measuring Cognitive Translation Effort with Activity Units

    DEFF Research Database (Denmark)

    Schaeffer, Moritz; Carl, Michael; Lacruz, Isabel

    2016-01-01

    Despite the increased quality of Machine Translation output, human interaction will remain a crucial activity to guarantee the quality of the final translation products. Human-computer interaction in translation will likely be the more successful the more we understand the properties and compleme...... methods in empirical translation process research and suggests ngrams of Activity Units for measuring the translation process....

  14. Pattern recognition & machine learning

    CERN Document Server

    Anzai, Y

    1992-01-01

    This is the first text to provide a unified and self-contained introduction to visual pattern recognition and machine learning. It is useful as a general introduction to artifical intelligence and knowledge engineering, and no previous knowledge of pattern recognition or machine learning is necessary. Basic for various pattern recognition and machine learning methods. Translated from Japanese, the book also features chapter exercises, keywords, and summaries.

  15. On Yao's method of translation

    NARCIS (Netherlands)

    Liu, X.; Hoede, C.

    2002-01-01

    Machine Translation, i.e., translating one kind of natural language to another kind of natural language by using a computer system, is a very important research branch in Artificial Intelligence. Yao developed a method of translation that he called ``Lexical-Semantic Driven". In his system he introd

  16. Introduction (of 'The Noun Phrase in Functional Discourse Grammar')

    DEFF Research Database (Denmark)

    García Velasco, Daniel; Rijkhoff, Jan

    2008-01-01

    The articles in this volume analyse the noun phrase within the framework of Functional Discourse Grammar (FDG), the successor to Simon C. Dik’s Functional Grammar (FG). The Noun Phrase was the main conference theme of the 11th International Conference on Functional Grammar, which took place......, the most recent treatment of NPs by Dik in terms of “classical FG” was published (posthumously) in 1997, in the first volume of The Theory of Functional Grammar. Given the fact that FDG presents a strongly revised version of Dikkian FG with respect to rules, variables, representations and overall design...

  17. Translations and Translators.

    Science.gov (United States)

    Nida, Eugene A.

    1979-01-01

    The necessity for stylistic appropriateness in translation as well as correct content is discussed. To acquire this skill, translators must be trained in stylistics through close examination of their own language and must have practice in translating for different audiences at different levels. (PMJ)

  18. Translating Words, Translating Cultures

    Directory of Open Access Journals (Sweden)

    Richard Whitaker

    2012-03-01

    Full Text Available What exactly does (or should translation from one language into another try to do? Attempt to convey to readers of the target language (the language into which one is translating something of the strangeness, difference and historicity of the original in the source language (the language from which one is translating? Or must translation try to bridge the gap between source and target language, by rendering the original in a thoroughly contemporary style and diction, as if this were a work being written now for the first time? And related to these the further questions: how closely should a translation render the genre, language, metre, style and content of the original? How far can a translation depart from the original without ceasing to be a translation – in other words, where is one to situate the border between “translation”, “version” and “adaptation”?

  19. The Numerical Semigroup of Phrases' Lengths in a Simple Alphabet

    Directory of Open Access Journals (Sweden)

    Aureliano M. Robles-Pérez

    2013-01-01

    Full Text Available Let be an alphabet with two elements. Considering a particular class of words (the phrases over such an alphabet, we connect with the theory of numerical semigroups. We study the properties of the family of numerical semigroups which arise from this starting point.

  20. The numerical semigroup of phrases' lengths in a simple alphabet.

    Science.gov (United States)

    Robles-Pérez, Aureliano M; Rosales, José Carlos

    2013-01-01

    Let A be an alphabet with two elements. Considering a particular class of words (the phrases) over such an alphabet, we connect with the theory of numerical semigroups. We study the properties of the family of numerical semigroups which arise from this starting point.

  1. Relevance and Phrase Searching in Summon: looking under the hood

    Directory of Open Access Journals (Sweden)

    Thomas Hodge

    2013-07-01

    Full Text Available This article briefly examines the mechanisms behind seemingly counter-intuitive phrase search results in Serials Solutions’ discovery platform Summon. The authors use the platform’s search API to explain why users sometimes encounter greater numbers of results when typically they would expect fewer. The article explores the reasons behind the search results and the implications for library instruction.

  2. To Weaken the Language Barriers for the Chinese Minorities by the Machine Translation Technology%利用机器翻译技术消弱少数民族语言障碍

    Institute of Scientific and Technical Information of China (English)

    吴凤娟

    2011-01-01

    As a multi-ethnic region, there are both advantages and disadvantages for China. The language barrier is always a great obstruction for the development of the minority nationalities. The machine translation specific to the languages of Chinese minorities contribute to advance the culture communication, to drive the economic growth and to build a harmonious society. By now some development has been made in this field. But if we want to get the actual applying effect, there is a long road before us. To advance the machine translation specific to the languages of Chinese minorities is of great significance.%我国多民族聚居的局面有利有弊,语言障碍一直是少数民族地区发展的巨大阻力.针对少数民族语言的机器翻译有助于推进文化交流、带动经济发展、建立和谐社会.目前该领域的研究已取得了一定进展,但要达到实际应用效果仍有较长的路要走.大力推进针对少数民族的机器翻译工作意义重大.

  3. A Comparative Study of "Google Translate" Translations: An Error Analysis of English-to-Persian and Persian-to-English Translations

    Science.gov (United States)

    Ghasemi, Hadis; Hashemian, Mahmood

    2016-01-01

    Both lack of time and the need to translate texts for numerous reasons brought about an increase in studying machine translation with a history spanning over 65 years. During the last decades, Google Translate, as a statistical machine translation (SMT), was in the center of attention for supporting 90 languages. Although there are many studies on…

  4. 面向移动终端的统计机器翻译解码定点化方法%A Fixed Point Decoding Approach for Statistical Machine Translation on Mobile Terminals

    Institute of Scientific and Technical Information of China (English)

    李响; 徐金安; 姜文斌; 吕雅娟; 刘群

    2011-01-01

    面向移动终端的统计机器翻译需求越来越多,但无浮点运算单元的处理器限制了翻译速度.该文提出了一种对统计机器翻译解码运算的定点化运算方法,缓解了无浮点运算单元的处理器对翻译速度的影响.基于PC和移动终端的实验表明,在保证翻译质量的情况下,利用定点处理浮点运算的解码器的运算速度较编译器模拟的浮点运算速度提高135.6%.因此,该方法可以有效地提高浮点运算能力薄弱的移动终端统计机器翻译速度.%The demand for statistical machine translation (SMT) on mobile terminals is increasing, but the processor without floating point unit (FPU) restricts the translation speed. This paper proposes an approach to switch floating point operation to fixed point operation for decoder of SMT system on mobile terminals, and increase the translation speed on the processor without FPU. The experiments based on PC and mobile terminal show while this approach assures the quality of translation, the speed of our approach is 135.6% faster than the speed of floating point operation emulated by compiler. Therefore, this approach can efficiently increase the translation speed of SMT system on mobile terminals with weak ability in floating point operation.

  5. Completeness of Compositional Translation for Context-Free Grammars

    CERN Document Server

    Huijsen, W O

    1996-01-01

    A machine translation system is said to be *complete* if all expressions that are correct according to the source-language grammar can be translated into the target language. This paper addresses the completeness issue for compositional machine translation in general, and for compositional machine translation of context-free grammars in particular. Conditions that guarantee translation completeness of context-free grammars are presented.

  6. 从机器翻译历程看自然语言处理研究的发展策略%The Development Strategy for Natural Language Processing Research Inspired from a Historical View on Machine Translation

    Institute of Scientific and Technical Information of China (English)

    孙茂松; 周建设

    2016-01-01

    Machine translation (MT) is one of the major research fi elds of natural language processing (NLP), and it always spearheads the research frontier in NLP. In this paper, after a systematic survey of the development history of MT from a macroscopic perspective, with particular emphasis on the main development path of underlying methodologies and core technologies in MT, we drew a general picture of the milestones that marked the key points of a long journey for both theoretical study and practical accomplishment for the past seven decades. The latest fruitful development achieved in the area of MT application shows that, the paradigm shift from the tradi-tional linguistic rule-based approaches to the so-called empirical approach, based on increasingly available amounts of “raw data” in the form of massive collections of texts and their translations, compounded by the phenomenal advancement of computer technology, will become the driving force that will potentially lead to the breakthrough in MT. Based on the above observation and analysis, some sug-gestions on the short-term development strategy for machine translation as well as natural language processing in China are proposed.%本文试图从超脱细节的宏观角度,对机器翻译的发展历程进行扼要的总结和深刻的评介,着重于刻画各个时期在基本方法和核心技术上的主要特征,从而勾勒出机器翻译的全过程演进脉络。在上述考察和分析的基础上,文章对国内机器翻译乃至自然语言处理研究的近期发展策略提出了若干建议。

  7. A Taxonomy of Human Translation Styles

    DEFF Research Database (Denmark)

    Carl, Michael; Dragsted, Barbara; Lykke Jakobsen, Arnt

    2011-01-01

    While the translation profession becomes increasingly technological, we are still far from understanding how humans actually translate and how they could be best supported by machines. In this paper we outline a method which helps to uncover characteristics of human translation processes. Based o...... on the translators' activity data, we develop a taxonomy of translation styles. The taxonomy could serve to inform the development of advanced translation assistance tools and provide a basis for a felicitous and grounded integration of human machine interaction in translation....

  8. Cueing the Virtual Storyteller: Analysis of cue phrase usage in fairy tales

    NARCIS (Netherlands)

    Penning, Manon; Theune, Mariët; Busemann, S.

    2007-01-01

    An existing taxonomy of Dutch cue phrases, designed for use in story generation, was validated by analysing cue phrase usage in a corpus of classical fairy tales. The analysis led to some adaptations of the original taxonomy.

  9. BattRAE: Bidimensional Attention-Based Recursive Autoencoders for Learning Bilingual Phrase Embeddings

    OpenAIRE

    Zhang, Biao; Xiong, Deyi; Su, Jinsong

    2016-01-01

    In this paper, we propose a bidimensional attention based recursive autoencoder (BattRAE) to integrate clues and sourcetarget interactions at multiple levels of granularity into bilingual phrase representations. We employ recursive autoencoders to generate tree structures of phrases with embeddings at different levels of granularity (e.g., words, sub-phrases and phrases). Over these embeddings on the source and target side, we introduce a bidimensional attention network to learn their interac...

  10. Strategy Of Translating Gadget Brochure

    Directory of Open Access Journals (Sweden)

    Deny Kusuma

    2015-10-01

    Full Text Available The title of this writing is strategy of translating gadget brochure. There were two problems discussed in this thesis, namely (1 terms found in the gadget manual book and its equivalence in Indonesia, (2 strategies applied in translating gadget brochure. Based on the analysis result, it was found that the terms and its equivalent words found in the gadget brochure were classified based on: 1 simple words or compound words and terminology forming phrase. 2 words category found are: noun and verb. The recommended pattern to determine the equivalent word was pure borrowing strategy, not adaptation borrowing strategy. The adaptation borrowing in this context was related to the spellings, the pronunciation or sound adaptation in the TL rather than adaptation for the cultural substitutes that conceptually mismatches with the standardized terminology in the SL. In addition, the result of the analysis that there were seven strategies found in gadget brochure, they were translation by more general word (Superordinate, translation by a more neutral/expressive word, translation by cultural substitution, translation by using loan word or loan word plus explanation, translation by paraphrase using related word, translation by omission and translation by illustration.

  11. Translation Method and Computer Programme for Assisting the Same

    DEFF Research Database (Denmark)

    2013-01-01

    The present invention relates to a translation method comprising the steps of: a translator speaking a translation of a written source text in a target language, an automatic speech recognition system converting the spoken translation into a set of phone and word hypotheses in the target language......, a machine translation system translating the written source text into a set of translations hypotheses in the target language, and an integration module combining the set of spoken word hypotheses and the set of machine translation hypotheses obtaining a text in the target language. Thereby obtaining...... a significantly improved and faster translation compared to what is achieved by known translation methods....

  12. Prepositional Phrase Attachment through a Backed-Off Model

    CERN Document Server

    Collins, M; Collins, Michael; Brooks, James

    1995-01-01

    Recent work has considered corpus-based or statistical approaches to the problem of prepositional phrase attachment ambiguity. Typically, ambiguous verb phrases of the form {v np1 p np2} are resolved through a model which considers values of the four head words (v, n1, p and n2). This paper shows that the problem is analogous to n-gram language models in speech recognition, and that one of the most common methods for language modeling, the backed-off estimate, is applicable. Results on Wall Street Journal data of 84.5% accuracy are obtained using this method. A surprising result is the importance of low-count events - ignoring events which occur less than 5 times in training data reduces performance to 81.6%.

  13. Finite-State Approximation of Phrase-Structure Grammars

    CERN Document Server

    Pereira, F C N; Pereira, Fernando C. N.; Wright, Rebecca N.

    1996-01-01

    Phrase-structure grammars are effective models for important syntactic and semantic aspects of natural languages, but can be computationally too demanding for use as language models in real-time speech recognition. Therefore, finite-state models are used instead, even though they lack expressive power. To reconcile those two alternatives, we designed an algorithm to compute finite-state approximations of context-free grammars and context-free-equivalent augmented phrase-structure grammars. The approximation is exact for certain context-free grammars generating regular languages, including all left-linear and right-linear context-free grammars. The algorithm has been used to build finite-state language models for limited-domain speech recognition tasks.

  14. Some aspects of agreement with numeral phrases in Polish

    Directory of Open Access Journals (Sweden)

    Jacek Witkoś

    2016-12-01

    Full Text Available The aim of this article is to briefly analyze the agreement patterns in Polish constructions with quantified subjects and adjectival predicates/participles, and propose an account built on the nanosyntactic ideas regarding the nature of case, i.e., split Kase Phrase (Caha 2009, 2010. In the analysis we address the troublesome issues regarding the Genitive of Quantification, i.e., the source of Genitive on the nominal complement in structural contexts, and the optionality in agreement in case between the adjectival predicate/participle and the numeral (≥5, or the noun of the quantified subject. The essential part of our proposal is based on the articulated Kase Phrase in the functional sequence of the extended nominal projection and its role in the syntactic derivation of case in the spirit of nanosyntactic approach.

  15. MT In Business English Translation

    Institute of Scientific and Technical Information of China (English)

    张志新

    2009-01-01

    In this article the operational principles of MT in business English translation is briefly introduced with an aim to point out that to improve the MT quality machine study is a key factor to work on.

  16. The Point Approach and the Phrase Approach to Vocabulary Learning

    Institute of Scientific and Technical Information of China (English)

    刘梦媛

    2013-01-01

      As is known to all, vocabulary acquisition plays an essential role in English learning. However, it was supposed very dif⁃ficult to many Chinese learners. For the reason that so many kinds of approaches exists in the real life, English learners are always do not know which one is suitable and more effective. To solve this problem, the paper will analyze two approaches (point approach and phrase approach) for you.

  17. Learning theories for noun-phrase sentiment composition

    OpenAIRE

    Petrakis, S.; Klenner, M

    2011-01-01

    The work presented here is an approach to Sentiment Analy- sis from a rule-based, compositional perspective. The proposed approach is characterized by three major points: (a) rules are automatically learned from annotated corpora using Inductive Logic Programming and represented as Prolog sets of clauses, (b) the focus is on the noun-phrase (NP) level, and (c) learning is performed on deep-parsed structures. We describe the process of annotating a collection of some 3000 German NPs of medi...

  18. Automatic Titling of Electronic Documents with Noun Phrase Extraction

    OpenAIRE

    LOPEZ, Cédric; Prince, Violaine; Roche, Mathieu

    2011-01-01

    International audience; Automatic titling (i.e. providing titles) is one of key domains of Web site accessibility. This paper provides an approach allowing the automatic titling of texts (e.g. emails, fora, etc.) relying on the morphosyntactic study of human written titles in a corpus of various texts. The method is developed in four stages: Corpus acquisition, candidate sentences determination for titling, noun phrase extraction in the candidate sentences, and finally, selecting a particular...

  19. Translating China

    Institute of Scientific and Technical Information of China (English)

    2011-01-01

    sidney Shapiro, an American-born translator famous for his translation of Chinese literary works, received the Lifetime Achievement Award in Translation by the Translators Association of China on December 2, 2010.

  20. Translation Theory 'Translated'

    DEFF Research Database (Denmark)

    Wæraas, Arild; Nielsen, Jeppe

    2016-01-01

    Translation theory has proved to be a versatile analytical lens used by scholars working from different traditions. On the basis of a systematic literature review, this study adds to our understanding of the ‘translations’ of translation theory by identifying the distinguishing features of the mo......, but also overlapping. We discuss the ways in which the three versions of translation theory may be combined and enrich each other so as to inform future research, thereby offering a more complete understanding of translation in and across organizational settings.......Translation theory has proved to be a versatile analytical lens used by scholars working from different traditions. On the basis of a systematic literature review, this study adds to our understanding of the ‘translations’ of translation theory by identifying the distinguishing features of the most...... common theoretical approaches to translation within the organization and management discipline: actor-network theory, knowledge-based theory, and Scandinavian institutionalism. Although each of these approaches already has borne much fruit in research, the literature is diverse and somewhat fragmented...

  1. Understanding Translation

    DEFF Research Database (Denmark)

    Schjoldager, Anne Gram; Gottlieb, Henrik; Klitgård, Ida

    Understanding Translation is designed as a textbook for courses on the theory and practice of translation in general and of particular types of translation - such as interpreting, screen translation and literary translation. The aim of the book is to help you gain an in-depth understanding...... - translators, language teachers, translation users and literary, TV and film critics, for instance. Discussions focus on translation between Danish and English....

  2. Understanding Translation

    DEFF Research Database (Denmark)

    Schjoldager, Anne Gram; Gottlieb, Henrik; Klitgård, Ida

    Understanding Translation is designed as a textbook for courses on the theory and practice of translation in general and of particular types of translation - such as interpreting, screen translation and literary translation. The aim of the book is to help you gain an in-depth understanding of the...... - translators, language teachers, translation users and literary, TV and film critics, for instance. Discussions focus on translation between Danish and English....

  3. Case Markers in Mongolian: A Means for Encoding Null Constituents in Noun Phrase and Relative Clause

    Directory of Open Access Journals (Sweden)

    Tseden Otgonsuren

    2017-02-01

    Full Text Available This paper focuses on the capacity of the case markers in the Mongolian language, as a relative element, to generate any finite noun phrase or relative clause based on their syntactic function or relationship. In Mongolian, there are two different approaches to generate noun phrases: parataxis and hypotaxis. According to my early observation, if the noun phrase generated through the parataxis, is the complement of the postpositional phrase, the head word of the relevant noun phrase can be truncated. In other words, since this head noun is governed by case marker in its null form to generate the postpositional phrase, the head noun can be encoded.  The second approach generates two different types of noun phrases in their structures: free structured and non-free structured noun phrases. Of them, the free structured noun phrase allows any syntactic transformations in their internal structure based on the senses of the case markers which denote a relation. That is to say, the null constituents in this type of noun phrases can be encoded to generate an extended alternative of the noun phrase and a relative clause.

  4. Synonymy and translation

    NARCIS (Netherlands)

    Jong, de Franciska; Appelo, Lisette

    1987-01-01

    This paper is meant to give some insight into the interaction between on the one hand theoretical concepts in the field of formal semantics, and on the other hand linguistic research directed towards an application, more specifically, the research in the machine translation project Rosetta. The cent

  5. Synonymy and Translation

    NARCIS (Netherlands)

    de Jong, Franciska M.G.; Appelo, Lisette

    1987-01-01

    This paper is meant to give some insight into the interaction between on the one hand theoretical concepts in the field of formal semantics, and on the other hand linguistic research directed towards an application, more specifically, the research in the machine translation project Rosetta. The

  6. Evaluative Meaning in Translation

    Directory of Open Access Journals (Sweden)

    Primož Jurko

    2015-12-01

    Full Text Available Evaluative meaning is frequently understood as a form of connotation, pertaining to single lexical items. However, the term has recently been applied also to longer stretches of text, spanning several words. The paper generally deals with the latter aspect and provides a corpus-driven analysis of several Slovene and English phrases that may appear semantically neutral, yet turn out to be heavily non-neutral in terms of their semantic properties. After a short introduction of the main approaches to the topic, including its dilemmas, the paper focuses on the L1-into-L2 translation of such strings, with particular emphasis on proper rendering of the evaluative meaning. While further research is warranted into the matter, the work presented here attests to the complex and ubiquitous nature of the Sinclairian extended unit of meaning.

  7. Translating Means Translating Meaning

    Institute of Scientific and Technical Information of China (English)

    李海燕

    2000-01-01

    美国著名翻译理论家尤金·奈达说 :“翻译即译意 (Translating m eans translating m eaning)。”就实质而言 ,翻译即译意。就是把一种语言表达的意义用另一种语言表达出来。翻译分理解与表达两个步骤。理解是翻译的基础 ,表达直接决定译文的成败与优劣 ,两者缺一不可

  8. A Sign-Based Phrase Structure Grammar for Turkish

    CERN Document Server

    Sehitoglu, O T

    1996-01-01

    This study analyses Turkish syntax from an informational point of view. Sign based linguistic representation and principles of HPSG (Head-driven Phrase Structure Grammar) theory are adapted to Turkish. The basic informational elements are nested and inherently sorted feature structures called signs. In the implementation, logic programming tool ALE (Attribute Logic Engine) which is primarily designed for implementing HPSG grammars is used. A type and structure hierarchy of Turkish language is designed. Syntactic phenomena such a s subcategorization, relative clauses, constituent order variation, adjuncts, nomina l predicates and complement-modifier relations in Turkish are analyzed. A parser is designed and implemented in ALE.

  9. Key-phrase based classification of public health web pages.

    Science.gov (United States)

    Dolamic, Ljiljana; Boyer, Célia

    2013-01-01

    This paper describes and evaluates the public health web pages classification model based on key phrase extraction and matching. Easily extendible both in terms of new classes as well as the new language this method proves to be a good solution for text classification faced with the total lack of training data. To evaluate the proposed solution we have used a small collection of public health related web pages created by a double blind manual classification. Our experiments have shown that by choosing the adequate threshold value the desired value for either precision or recall can be achieved.

  10. Human Translator and Translation Technology

    Institute of Scientific and Technical Information of China (English)

    李辰

    2016-01-01

    With the great development of technology, translation technology exerts great influence on human translators because during their translation process, they may use many computer-aided translation tools, such as TRADOS, Snowman, WordFisher and etc. However, they always misunderstand the concept of computer-aided translation, so this thesis managed to providedetails about some translation technology and human translators' strengths so as to help them improve the productivity and the quality of theirtranslation works effectively and efficiently.

  11. Lexical, conceptual and motor information in memory for action phrases: a multi-system account.

    Science.gov (United States)

    Engelkamp, Johannes; Jahn, Petra

    2003-06-01

    According to the multi-system account [J. Engelkamp, H.D. Zimmer, The Human Memory: A Multi-Modal Approach, Hogrefe & Huber, Seattle, 1994], the enactment effect in free recall of action phrases (e.g. break the stick) is independent of inter- and intra-phrase associations because it emerges from the nonverbal encoding processes under enactment. Two experiments have been reported which tested these and further assumptions of the multi-system account of the enactment effect. In both experiments, inter- and intra-phrase (verb-object) associations were varied simultaneously in addition to enactment. In Experiment 1, the memory test was free recall, and in Experiment 2, it was cued recall. Independent effects of all three factors were observed in free recall. Enacted phrases were recalled better than phrases learned only verbally. Phrases with high-associated objects and verbs were better recalled than phrases with low-associated objects and verbs, and categorically related phrases were better recalled than unrelated phrases. In cued recall, there was no effect of categorical list structure. The effects of intra-phrase structure and enactment corresponded to those in free recall. All interactions were nonsignificant. The findings were interpreted as support for the multi-system account.

  12. When Machines Design Machines!

    DEFF Research Database (Denmark)

    2011-01-01

    Until recently we were the sole designers, alone in the driving seat making all the decisions. But, we have created a world of complexity way beyond human ability to understand, control, and govern. Machines now do more trades than humans on stock markets, they control our power, water, gas...... and food supplies, manage our elevators, microclimates, automobiles and transport systems, and manufacture almost everything. It should come as no surprise that machines are now designing machines. The chips that power our computers and mobile phones, the robots and commercial processing plants on which we...... depend, all are now largely designed by machines. So what of us - will be totally usurped, or are we looking at a new symbiosis with human and artificial intelligences combined to realise the best outcomes possible. In most respects we have no choice! Human abilities alone cannot solve any of the major...

  13. Phrase Length and Prosody in On-Line Ambiguity Resolution.

    Science.gov (United States)

    Webman-Shafran, Ronit; Fodor, Janet Dean

    2016-06-01

    We investigated the processing of ambiguous double-PP constructions in Hebrew. Selection restrictions forced the first prepositional phrase (PP1) to attach low, but PP2 could attach maximally high to VP or maximally low to the NP inside PP1. A length contrast in PP2 was also examined. This construction affords more potential locations for prosodic boundaries, and has a sharper structural contrast between the two attachment sites, than the single-PP construction which has yielded mixed results in previous work. A combined production-comprehension task showed more productions of pre-PP2 prosodic boundaries for long-PP2 than short-PP2. In comprehension, high PP2-attachment was favored by a prosodic boundary before PP2, regardless of PP2 length. This study provides performance data supporting the interplay of phrase lengths with structure-sensitivity as posited in the linguistics literature on the syntax-prosody interface, and supports the claim that readers are sensitive to the structural implications of the prosody they project onto sentences.

  14. Interacting effects of syllable and phrase position on consonant articulation

    Science.gov (United States)

    Byrd, Dani; Lee, Sungbok; Riggs, Daylen; Adams, Jason

    2005-12-01

    The complexities of how prosodic structure, both at the phrasal and syllable levels, shapes speech production have begun to be illuminated through studies of articulatory behavior. The present study contributes to an understanding of prosodic signatures on articulation by examining the joint effects of phrasal and syllable position on the production of consonants. Articulatory kinematic data were collected for five subjects using electromagnetic articulography (EMA) to record target consonants (labial, labiodental, and tongue tip), located in (1) either syllable final or initial position and (2) either at a phrase edge or phrase medially. Spatial and temporal characteristics of the consonantal constriction formation and release were determined based on kinematic landmarks in the articulator velocity profiles. The results indicate that syllable and phrasal position consistently affect the movement duration; however, effects on displacement were more variable. For most subjects, the boundary-adjacent portions of the movement (constriction release for a preboundary coda and constriction formation for a postboundary onset) are not differentially affected in terms of phrasal lengthening-both lengthen comparably.

  15. Neural Correlates of Phrase Rhythm: An EEG Study of Bipartite vs. Rondo Sonata Form

    Directory of Open Access Journals (Sweden)

    Antonio Fernández-Caballero

    2017-04-01

    Full Text Available This paper introduces the neural correlates of phrase rhythm. In short, phrase rhythm is the rhythmic aspect of phrase construction and the relationships between phrases. For the sake of establishing the neural correlates, a musical experiment has been designed to induce music-evoked stimuli related to phrase rhythm. Brain activity is monitored through electroencephalography (EEG by using a brain–computer interface. The power spectral value of each EEG channel is estimated to obtain how power variance distributes as a function of frequency. Our experiment shows statistical differences in theta and alpha bands in the phrase rhythm variations of two classical sonatas, one in bipartite form and the other in rondo form.

  16. Multi-function radar emitter identification based on stochastic syntax-directed translation schema

    Institute of Scientific and Technical Information of China (English)

    Liu Haijun Diao; Yu Hongqi; Sun Zhaolin; Jietao

    2014-01-01

    To cope with the problem of emitter identification caused by the radar words’ uncer-tainty of measured multi-function radar emitters, this paper proposes a new identification method based on stochastic syntax-directed translation schema (SSDTS). This method, which is deduced from the syntactic modeling of multi-function radars, considers the probabilities of radar phrases appearance in different radar modes as well as the probabilities of radar word errors occurrence in different radar phrases. It concludes that the proposed method can not only correct the defective radar words by using the stochastic translation schema, but also identify the real radar phrases and working modes of measured emitters concurrently. Furthermore, a number of simulations are presented to demonstrate the identification capability and adaptability of the SSDTS algorithm. The results show that even under the condition of the defective radar words distorted by noise, the proposed algorithm can infer the phrases, work modes and types of measured emitters correctly.

  17. Modeling of meaning acquisition based on NL phrases using state transitions

    Directory of Open Access Journals (Sweden)

    Sergiu Cretu

    2015-05-01

    Full Text Available This study aims to develop techniques for the meaning acquisition of the NL phrases. Our approach is based on modeling meaning extraction by analyzing lexical component, syntactic component and semantic component of the phrases written or spoken in NL. To assure necessary semantic precision we adopt a double-level interpretation of the NL phrases using formal language with well defined semantics. This fact is essential for our approach.

  18. Installing the ARL Phrase Book Android Application and Configuring its Dependencies

    Science.gov (United States)

    2015-02-01

    Installing the ARL Phrase Book Android Application and Configuring its Dependencies by Michael H Lee ARL-TN-0664 February 2015...February 2015 Installing the ARL Phrase Book Android Application and Configuring its Dependencies Michael H Lee Computational and...COVERED (From - To) 03/2014–05/2014 4. TITLE AND SUBTITLE Installing the ARL Phrase Book Android Application and Configuring its Dependencies 5a

  19. Computer-aided Translation Technology and Its Applications

    Institute of Scientific and Technical Information of China (English)

    汪美侠; 何大顺

    2014-01-01

    This article begins with a brief analysis of the significance of translation technology in different spheres of modern life, followed by a distinction between machine translation (MT) and computer-aided translation (CAT). It then describes some trans-lation resources and tools and examines the negative and positive aspects of computer-aided translations. Finally it comes to a conclusion that it would be greatly efficient and productive for the translators to acquire the new skills in the translation work-place.

  20. On the Systematicity of Human Translation Processes

    DEFF Research Database (Denmark)

    Carl, Michael; Dragsted, Barbara; Lykke Jakobsen, Arnt

    While translation careers and the translation profession become more globalised and more technological, we are still far from understanding how humans actually translate and how they could be best supported by machines. In this paper we attempt to outline a method which helps to uncover character...

  1. 基于有限状态自动机的远程命令识别与解析%Recognition & Translation of Remote Commands Based on Finite State Machine

    Institute of Scientific and Technical Information of China (English)

    毛凤翔; 韩冰

    2013-01-01

    远程命令识别与解析是嵌入式环境中终端-控制台和上-下位机模式实现远程管控的基础和关键.文中分析水下探测智能终端的工作过程,提出了一种基于有限状态自动机的远程命令识别与解析方法,智能终端可以根据工作状态自动机模型对远程命令进行快速、准确地响应,避免了复杂的计算和繁琐的决策过程.实验发现,水下探测智能终端及时识别出控制台发送的管控指令,按要求转入相应的工作状态,该方法有效地提高了水下探测智能终端机的工作性能.%It is a fundamental and key problem of how to correctly recognize and translate remote commands for terminal-console and upper-low computer to conduct remote management and control in embedded systems.By carefully analyzing working procedure of underwater probe terminal in this paper,an effective method based on finite state machine model is put forward,which can quickly and accurately respond the remote instructions,and avoid complicated computation and decision.Applied practices show the underwater probe intelligent terminal can understand kinds of management-control instructions sent from console machine,and properly tune to corresponding working mode,which can greatly improve the performance of underwater probe intelligent terminal.

  2. MEDLINE Abstracts Classification Based on Noun Phrases Extraction

    Science.gov (United States)

    Ruiz-Rico, Fernando; Vicedo, José-Luis; Rubio-Sánchez, María-Consuelo

    Many algorithms have come up in the last years to tackle automated text categorization. They have been exhaustively studied, leading to several variants and combinations not only in the particular procedures but also in the treatment of the input data. A widely used approach is representing documents as Bag-Of-Words (BOW) and weighting tokens with the TFIDF schema. Many researchers have thrown into precision and recall improvements and classification time reduction enriching BOW with stemming, n-grams, feature selection, noun phrases, metadata, weight normalization, etc. We contribute to this field with a novel combination of these techniques. For evaluation purposes, we provide comparisons to previous works with SVM against the simple BOW. The well known OHSUMED corpus is exploited and different sets of categories are selected, as previously done in the literature. The conclusion is that the proposed method can be successfully applied to existing binary classifiers such as SVM outperforming the mixture of BOW and TFIDF approaches.

  3. You had me at hello: How phrasing affects memorability

    CERN Document Server

    Danescu-Niculescu-Mizil, Cristian; Kleinberg, Jon; Lee, Lillian

    2012-01-01

    Understanding the ways in which information achieves widespread public awareness is a research question of significant interest. We consider whether, and how, the way in which the information is phrased --- the choice of words and sentence structure --- can affect this process. To this end, we develop an analysis framework and build a corpus of movie quotes, annotated with memorability information, in which we are able to control for both the speaker and the setting of the quotes. We find significant differences between memorable and non-memorable quotes in several key dimensions. One is lexical distinctiveness: in aggregate, memorable quotes use less common word choices, but at the same time are built upon a scaffolding of common syntactic patterns; another is that memorable quotes tend to be more general in ways that make them easy to apply in new contexts. We also show how the concept of "memorable language" can be extended across domains.

  4. Literary Translation: The Pleasure Principle

    Institute of Scientific and Technical Information of China (English)

    杜博妮

    2007-01-01

    Chinese to English literary translation (CELT) has brought enormous reading pleasure to large numbers of people. These readers are not necessarily committed to China or Chinese literature as a cause or profession: they are disinterested readers. Publishers today, such as the Foreign Languages Press, can significantly increase their publication figures and strengthen their impact by going beyond captive readers to reach this wider audience. Guidelines that may lead to such a result can be summed up in the phrase: trust the reader. Readers of literary translations already belong to a limited class: they are risk-takers; they are curious and knowledgeable about other cultures; and they are skilled literary readers in their own language.

  5. Translational Creativity

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2010-01-01

    A long-established approach to legal translation focuses on terminological equivalence making translators strictly follow the words of source texts. Recent research suggests that there is room for some creativity allowing translators to deviate from the source texts. However, little attention...... is given to genre conventions in source texts and the ways in which they can best be translated. I propose that translators of statutes with an informative function in expert-to-expert communication may be allowed limited translational creativity when translating specific types of genre convention....... This creativity is a result of translators adopting either a source-language or a target-language oriented strategy and is limited by the pragmatic principle of co-operation. Examples of translation options are provided illustrating the different results in target texts. The use of a target-language oriented...

  6. Machine aided indexing from natural language text

    Science.gov (United States)

    Silvester, June P.; Genuardi, Michael T.; Klingbiel, Paul H.

    1993-01-01

    The NASA Lexical Dictionary (NLD) Machine Aided Indexing (MAI) system was designed to (1) reuse the indexing of the Defense Technical Information Center (DTIC); (2) reuse the indexing of the Department of Energy (DOE); and (3) reduce the time required for original indexing. This was done by automatically generating appropriate NASA thesaurus terms from either the other agency's index terms, or, for original indexing, from document titles and abstracts. The NASA STI Program staff devised two different ways to generate thesaurus terms from text. The first group of programs identified noun phrases by a parsing method that allowed for conjunctions and certain prepositions, on the assumption that indexable concepts are found in such phrases. Results were not always satisfactory, and it was noted that indexable concepts often occurred outside of noun phrases. The first method also proved to be too slow for the ultimate goal of interactive (online) MAI. The second group of programs used the knowledge base (KB), word proximity, and frequency of word and phrase occurrence to identify indexable concepts. Both methods are described and illustrated. Online MAI has been achieved, as well as several spinoff benefits, which are also described.

  7. Frases en ingles para defenderse (Phrases for Getting Along in English).

    Science.gov (United States)

    Butovsky, Lillian; McHugh, John

    This book of Spanish-English phrases intended for Spanish-speaking immigrants is divided into two main sections. Part One contains the grammatical section, a pronunciation guide, and basic words and phrases a newly arrived Spanish-speaking person would need to get along in an English speaking environment. Grammatical notes in Part One indicate…

  8. Frases en ingles para defenderse (Phrases for Getting Along in English).

    Science.gov (United States)

    Butovsky, Lillian; McHugh, John

    This book of Spanish-English phrases intended for Spanish-speaking immigrants is divided into two main sections. Part One contains the grammatical section, a pronunciation guide, and basic words and phrases a newly arrived Spanish-speaking person would need to get along in an English speaking environment. Grammatical notes in Part One indicate…

  9. An "Alms-Basket" of "Bric-a-Brac": "Brewer's Dictionary of Phrase and Fable".

    Science.gov (United States)

    Bunge, Charles A.

    1999-01-01

    Describes the development and history of "Brewer's Dictionary of Phrase and Fable," a reference source first published in 1870 that includes the etymology of phrases, allusions and words. Discusses reviews that reflected and shaped its status as a standard reference book, describes the current edition, and considers its enduring value. (LRW)

  10. The Effect of Anaphoric References and Noun Phrase Organizers on Paragraph Comprehension.

    Science.gov (United States)

    Fishman, Anne Stevens

    1978-01-01

    According to a theory of textual cohesion, anaphoric references and noun phrase organizers may serve as cohesive ties linking separate sentences into unified paragraphs. This study revealed that only the noun phrase organizer condition was significant in promoting comprehension and indicated no difference in time scores by paragraph condition.…

  11. Blue Car, Red Car: Developing Efficiency in Online Interpretation of Adjective-Noun Phrases

    Science.gov (United States)

    Fernald, Anne; Thorpe, Kirsten; Marchman, Virginia A.

    2010-01-01

    Two experiments investigated the development of fluency in interpreting adjective-noun phrases in 30- and 36-month-old English-learning children. Using online processing measures, children's gaze patterns were monitored as they heard the familiar adjective-noun phrases (e.g. "blue car") in visual contexts where the adjective was either informative…

  12. Dealing with Phrase Level Co-Articulation (PLC) in speech recognition: a first approach

    NARCIS (Netherlands)

    Ordelman, Roeland J.F.; van Hessen, Adrianus J.; van Leeuwen, David A.; Robinson, Tony; Renals, Steve

    1999-01-01

    Whereas nowadays within-word co-articulation effects are usually sufficiently dealt with in automatic speech recognition, this is not always the case with phrase level co-articulation effects (PLC). This paper describes a first approach in dealing with phrase level co-articulation by applying these

  13. The word order change in Dutch directional phrases: from the perspective of language typology

    NARCIS (Netherlands)

    Lin, J.

    2012-01-01

    The current study provides an alternative explanation of the word order change observed in Dutch directional phrases from the perspective of typology. This change took place around the end of the 19th century and concerns directional phrases such as 'in de kamer' (‘in(to) the room’) and 'de kamer in

  14. Case Markers in Mongolian: A Means for Encoding Null Constituents in Noun Phrase and Relative Clause

    Science.gov (United States)

    Otgonsuren, Tseden

    2017-01-01

    This paper focuses on the capacity of the case markers in the Mongolian language, as a relative element, to generate any finite noun phrase or relative clause based on their syntactic function or relationship. In Mongolian, there are two different approaches to generate noun phrases: parataxis and hypotaxis. According to my early observation, if…

  15. An "Alms-Basket" of "Bric-a-Brac": "Brewer's Dictionary of Phrase and Fable".

    Science.gov (United States)

    Bunge, Charles A.

    1999-01-01

    Describes the development and history of "Brewer's Dictionary of Phrase and Fable," a reference source first published in 1870 that includes the etymology of phrases, allusions and words. Discusses reviews that reflected and shaped its status as a standard reference book, describes the current edition, and considers its enduring value.…

  16. Phrase-Level Sentiment Polarity Classification Using Rule-Based Typed Dependencies and Additional Complex Phrases Consideration

    Institute of Scientific and Technical Information of China (English)

    Luke Kien-Weng Tan; Jin-Cheon Na; Yin-Leng Theng; Kuiyu Chang

    2012-01-01

    The advent of Web 2.0 has led to an increase in user-generated content on the Web. This has provided an extensive collection of free-style texts with opinion expressions that could influence the decisions and actions of their readters.Providers of such content exert a certain level of influence on the receivers and this is evident from blog sites having effect on their readers' purchase decisions,political view points,financial planning,and others.By detecting the opinion expressed,we can identify the sentiments on the topics discussed and the influence exerted on the readers.In this paper,we introduce an automatic approach in deriving polarity pattern rules to detect sentiment polarity at the phrase level,and in addition consider the effects of the more complex relationships found between words in sentiment polarity classification.Recent sentiment analysis research has focused on the functional relations of words using typed dependency parsing,providing a refined analysis on the grammar and semantics of textual data.Heuristics are typically used to determine the typed dependency polarity patterns,which may not comprehensively identify all possible rules.We study the use of class sequential rules (CSRs) to automatically learn the typed dependency patterns,and benchmark the performance of CSR against a heuristic method.Preliminary results show CSR leads to further improvements in classification performance achieving over 80% F1 scores in the test cases.In addition,we observe more complex relationships between words that could influence phrase sentiment polarity,and further discuss on possible approaches to handle the effects of these complex relationships.

  17. English Translation of Chinese Personal Names

    Institute of Scientific and Technical Information of China (English)

    徐洁

    2008-01-01

    This paper talks about the translation of personal names from Chinese into English, which is complicated by different factors, including orthographic, phonetic, geographic and social ones. To translate personal names appropriately, a wide range of knowledge is required. Although translation machine is more often used nowadays, it cannot take place of person in the end. Several types of name translation will be talked about in this paper.

  18. The Temple Translator’s Workstation Project

    Science.gov (United States)

    1996-01-01

    allows both translator/analysts and monolingual analysts to use the machine- translation function for assessing the relevance of a translated...using a multilingual editor (Figure2). Source documents and their translations are managed using the Tipster Document Manager developed at CRL...an English morphological generator [Penman 88]. • A multilingual document editor (the Tipster Editor for Documents developed at CRL under the

  19. Semantic Vector Machines

    CERN Document Server

    Vincent, Etter

    2011-01-01

    We first present our work in machine translation, during which we used aligned sentences to train a neural network to embed n-grams of different languages into an $d$-dimensional space, such that n-grams that are the translation of each other are close with respect to some metric. Good n-grams to n-grams translation results were achieved, but full sentences translation is still problematic. We realized that learning semantics of sentences and documents was the key for solving a lot of natural language processing problems, and thus moved to the second part of our work: sentence compression. We introduce a flexible neural network architecture for learning embeddings of words and sentences that extract their semantics, propose an efficient implementation in the Torch framework and present embedding results comparable to the ones obtained with classical neural language models, while being more powerful.

  20. Machine Learning and Radiology

    Science.gov (United States)

    Wang, Shijun; Summers, Ronald M.

    2012-01-01

    In this paper, we give a short introduction to machine learning and survey its applications in radiology. We focused on six categories of applications in radiology: medical image segmentation, registration, computer aided detection and diagnosis, brain function or activity analysis and neurological disease diagnosis from fMR images, content-based image retrieval systems for CT or MRI images, and text analysis of radiology reports using natural language processing (NLP) and natural language understanding (NLU). This survey shows that machine learning plays a key role in many radiology applications. Machine learning identifies complex patterns automatically and helps radiologists make intelligent decisions on radiology data such as conventional radiographs, CT, MRI, and PET images and radiology reports. In many applications, the performance of machine learning-based automatic detection and diagnosis systems has shown to be comparable to that of a well-trained and experienced radiologist. Technology development in machine learning and radiology will benefit from each other in the long run. Key contributions and common characteristics of machine learning techniques in radiology are discussed. We also discuss the problem of translating machine learning applications to the radiology clinical setting, including advantages and potential barriers. PMID:22465077

  1. Neural Features of Processing the Enforcement Phrases Used during Occupational Health and Safety Inspections: An ERP Study

    Science.gov (United States)

    Ma, Qingguo; Shi, Liping; Hu, Linfeng; Liu, Qiang; Yang, Zheng; Wang, Qiuzhen

    2016-01-01

    The appropriate enforcement phrases used during occupational health and safety (OHS) inspection activities is a crucial factor to guarantee the compliance with OHS regulations in enterprises. However, few researchers have empirically investigated the issue of how enforcement phrases are processed. The present study explored the neural features of processing two types of enforcement phrases (severe-and-deterrent vs. mild-and-polite phrases) used during OHS inspections by applying event-related potentials (ERP) method. Electroencephalogram data were recorded while the participants distinguished between severe-and-deterrent phrases and mild-and-polite phrases depicted in written Chinese words. The ERP results showed that severe-and-deterrent phrases elicited significantly augmented P300 amplitude with a central-parietal scalp distribution compared with mild-and-polite phrases, indicating the allocation of more attention resources to and elaborate processing of the severe-and-deterrent phrases. It reveals that humans may consider the severe-and-deterrent phrases as more motivationally significant and elaborately process the severity and deterrence information contained in the enforcement phrases for the adaptive protection. The current study provides an objective and supplementary way to measure the efficiency of different enforcement phrases at neural level, which may help generate appropriate enforcement phrases and improve the performance of OHS inspections. PMID:27807404

  2. Effects of musical expertise and boundary markers on phrase perception in music.

    Science.gov (United States)

    Neuhaus, Christiane; Knösche, Thomas R; Friederici, Angela D

    2006-03-01

    A neural correlate for phrase boundary perception in music has recently been identified in musicians. It is called music closure positive shift ("music CPS") and has an equivalent in the perception of speech ("language CPS"). The aim of the present study was to investigate the influence of musical expertise and different phrase boundary markers on the music CPS, using event-related brain potentials (ERPs) and event-related magnetic fields (ERFs). Musicians and nonmusicians were tested while listening to binary phrased melodies. ERPs and ERFs of both subject groups differed considerably from each other. Phrased melody versions evoked an electric CPS and a magnetic CPSm in musicians, but an early negativity and a less pronounced CPSm in nonmusicians, suggesting different perceptual strategies for both subject groups. Musicians seem to process musical phrases in a structured manner similar to language. Nonmusicians, in contrast, are thought to detect primarily discontinuity in the melodic input. Variations of acoustic cues in the vicinity of the phrase boundary reveal that the CPS is influenced by a number of parameters that are considered to indicate phrasing in melodies: pause length, length of the last tone preceding the pause, and harmonic function of this last tone. This is taken as evidence that the CPS mainly reflects higher cognitive processing of phrasing, rather than mere perception of pauses. Furthermore, results suggest that the ERP and MEG methods are sensitive to different aspects within phrase perception. For both subject groups, qualitatively different ERP components (CPS and early negativity) seem to reflect a top-down activation of general but different phrasing schemata, whereas quantitatively differing MEG signals appear to reflect gradual differences in the bottom-up processing of acoustic boundary markers.

  3. Leveraging syntax to better capture the semantics of elliptical coordinated compound noun phrases.

    Science.gov (United States)

    Blake, Catherine; Rindflesch, Tom

    2017-08-01

    Full-text scientific articles are increasingly available, but capturing the meaning conveyed within an article automatically remains a bottleneck for semantic search and reasoning systems. In this paper we consider elliptical coordinated compound noun phrases that authors use to save space in an article. Systems that do not attend to coordination would incorrectly interpret "breast or lung cancer" as a body part (breast) and a disease (lung cancer) rather than two diseases. The algorithmic approach introduced in this paper uses a generate-and-test strategy where candidate expansions for forward, backward and complex ellipses are generated from syntactic dependencies. Dependencies are also used to create a dictionary of non-coordinated noun phrases that is used during the test phrase. Experiments on 21,280 full-text articles show that more than a million noun phrases were impacted by coordinated ellipses. The system achieves 73.07% precision, 75.38% recall, 74.23% F-score and 94.72% accuracy for new noun phrases in the development set. The precision was higher for backward (82.62 vs. 78.63) and forward expansions (64.82 vs. 60.17) and lower for complex expansions (63.41 vs. 72.59) in a test set. On average 10.79% of all noun phrases would be missed if coordination were not resolved, which corresponds to 48 new noun phrases per article in the journal Carcinogenesis, 52 new phrases per article in Diabetes, and 56 new phrases per article in Endocrinology. Results also show coordinated ellipses are more prevalent in abstracts (12.31% of all noun phrases) than in the body of an article (10.70%). To further test the generalizability of this approach the system (without modification) was used on two new collections. Copyright © 2017 Elsevier Inc. All rights reserved.

  4. Translation Nation

    Institute of Scientific and Technical Information of China (English)

    2008-01-01

    The International Federation of Translators will hold its largest ever world congress in China on the eve of 2008 Olympic Games china’ s position as a powerhouse of the translation industry is to be cemented,

  5. Translating Europe

    Directory of Open Access Journals (Sweden)

    Yves Chevrel

    2007-07-01

    Europe thinks in many languages and Europe is a land of translation. Translation is a means of transmitting culture, a means of making it available to others and an invitation to share. It is a cement which binds Europe together.

  6. teaching translation

    Directory of Open Access Journals (Sweden)

    Sergio Bolaños Cuéllar

    2007-01-01

    Full Text Available The advance in cultural-oriented perspectives in Translation Studies has sometimes played down the text linguistic nature of translation. A pilot study in teaching translation was carried out to make students aware of the text linguistic character of translating and help them to improve their translation skills, particularly with an emphasis on self-awareness and self-correcting strategies. The theoretical background is provided by the Dynamic Translation Model (2004, 2005 proposed by the author, with relevant and important contributions taken from Genette’s (1982 transtextuality phenomena (hypertext, hypotext, metatext, paratext, intertext and House and Kasper’s (1981 pragmatic modality markers (downgraders, upgraders. The key conceptual role of equivalence as a defining feature of translation is also dealt with. The textual relationship between Source Language Text (slt is deemed to be pivotal for performing translation and correction tasks in the classroom. Finally, results of the pilot study are discussed and some conclusions are drawn.

  7. Literal Translation and Free Translation

    Institute of Scientific and Technical Information of China (English)

    彭佼

    2011-01-01

    @@ Dispute over the method of literal translation and that of free translation had a long history in China, in East Jin Dynasty Daoan(道安314-385),a well-known monk, was the representative of those who firmly advocated literal translation.Since he feared that free translation might not be true to the original, he advocated strict literal translation so as to preserve the true features.Works under his direction were typical of word-for-word translation, in which no alteration was made except accidental changes in word order.

  8. Manually Classified Errors in Czech-Slovak Translation

    OpenAIRE

    Galuščáková, Petra; Bojar, Ondřej

    2012-01-01

    Outputs of five Czech-Slovak machine translation systems (Česílko, Česílko 2, Google Translate and Moses with different settings) for first 50 sentences of WMT 2010 testing set. The translations were manually processed and the errors were marked and classified according to the scheme by Vilar et al. (David Vilar, Jia Xu, Luis Fernando D’Haro, Hermann Ney: Error Analysis of Statistical Machine Translation Output, Proceedings of LREC-2006, 2006)

  9. Gender Ideology of Translators Implicated in Feminist Novels

    Directory of Open Access Journals (Sweden)

    Fateme Mohammadi

    2014-07-01

    Full Text Available This study compares two Persian translations of Virginia Woolf `s Mrs Dalloway(1925; one done by a male (Darush and one by a female translator(Taheri. Regarding Farzaneh Farahzad`s translation criticism based on CDA and Farzan Sojoodi`s stratificational semiotics, this study analyzed the effect of gender ideology on lexical choices by the two translators. The manipulated words and phrases were analyzed as the reflection of fundamental theories based on the cognitive and personality psychology. It was revealed that translators are definitely affected by their cognitive and mental psychological areas in making gendered lexical choices. Furthermore, almost all the manipulated selected words were done by the male translator. Statistical data, on the other hand, revealed a difference between positive and negative gender values in which the negative choices were used more by the male and the positive ones more by the female translator.

  10. Handling Translation Divergences in Generation-Heavy Hybrid Machine Translation

    Science.gov (United States)

    2002-03-01

    currently valid OMB control number. 1. REPORT DATE MAR 2002 2. REPORT TYPE 3. DATES COVERED 00-03-2002 to 00-03-2002 4. TITLE AND SUBTITLE...n$<a>@?N+A< g?CE(CBIg HT?CN+`

  11. Understanding and Writing G & M Code for CNC Machines

    Science.gov (United States)

    Loveland, Thomas

    2012-01-01

    In modern CAD and CAM manufacturing companies, engineers design parts for machines and consumable goods. Many of these parts are cut on CNC machines. Whether using a CNC lathe, milling machine, or router, the ideas and designs of engineers must be translated into a machine-readable form called G & M Code that can be used to cut parts to precise…

  12. Understanding and Writing G & M Code for CNC Machines

    Science.gov (United States)

    Loveland, Thomas

    2012-01-01

    In modern CAD and CAM manufacturing companies, engineers design parts for machines and consumable goods. Many of these parts are cut on CNC machines. Whether using a CNC lathe, milling machine, or router, the ideas and designs of engineers must be translated into a machine-readable form called G & M Code that can be used to cut parts to precise…

  13. It's Not Just the "Heavy NP": Relative Phrase Length Modulates the Production of Heavy-NP Shift

    Science.gov (United States)

    Stallings, Lynne M.; MacDonald, Maryellen C.

    2011-01-01

    Heavy-NP shift is the tendency for speakers to place long direct object phrases at the end of a clause rather than next to the verb. Though some analyses have focused on length of the direct object phrase alone, results from two experiments demonstrate that the length of the direct object relative to that of other phrases, and not the length of…

  14. South African sign language assistive translation

    CSIR Research Space (South Africa)

    Olivrin, GJ

    2008-04-01

    Full Text Available The authors describe a novel approach and practical solution to an interactive sign language machine translation system from English to South African Sign Language. They interface the system with the GNApp application, which is an augmentative...

  15. Grammatical Metaphor, Controlled Languageand Machine Translation

    DEFF Research Database (Denmark)

    Møller, Margrethe

    2003-01-01

    , syntactic and pragmatic restrictions on the writing style of the author. We will focus on syntactic restrictions and investigate whether a writing style in which various forms of grammatical metaphors have been dissolved in order to make the text more accessible to human readers, will in fact make the texts...... more suitable for MT. The basis of our investigation is a small corpus of English LSP texts that have been evaluated regarding their accessibility and acceptability by human users. The MT-system we will introduce as an additional "user" of the texts is the English-Danish prototype of the Comprendium MT-system....

  16. Using Linguistic Knowledge in Statistical Machine Translation

    Science.gov (United States)

    2010-09-01

    48 2.4 Conjugation of verb ktb ’to write ’. The common verb stem for each of the two...is shared throughout the Arabic speaking region, with little local variation. It is almost always used for writing , to the exclusion of the spoken...connected to the adjacent letters. Arabic has three vowel (a, u, i). Both vowel and consonant duration are phonemic. Short vowels and consonant doubling are

  17. UNITRAN: An Interlingual Machine Translation System.

    Science.gov (United States)

    1987-12-01

    modification in the interlingual approach as embodied by CETA and Sharp. ’Grenoble Universitv, France , 1961. 4 4 English Spanish French Japanese Parameter...comer: Yo como el pan. gustar: El libro me gusta a mi. eat: I eat the bread. like: I like the book. Figure 9: There is no thematic divergence between

  18. A Survey of Statistical Machine Translation

    Science.gov (United States)

    2007-04-01

    are multilingual governments and news agencies and companies operating in the global marketplace. The internet enables them to easily publish...SMT research. Because they are the product of day-to-day 1 human activities, they are constantly growing. Multilingual governments interested in...for one language. 5 The monolingual syntax will look much like our example fragment CFG (Rules C1-C8). In order to use this monolingual syntax in an

  19. Machine translation its scope and limits

    CERN Document Server

    Wilks, Yorick

    2008-01-01

    Author won Antonio Zampolli prize for outstanding contributions to the language technology Comprehensive look at MT technology from the perspective of a participant in its history and developmentDetails the deep differences and struggles between rival groups on how best to do MTConsiders MT as a fundamental part of Artificial Intelligence and the ultimate test-bed for all computational linguistics 

  20. Dependency Structures for Statistical Machine Translation

    Science.gov (United States)

    Bach, Nguyen

    2012-01-01

    Dependency structures represent a sentence as a set of dependency relations. Normally the dependency structures from a tree connect all the words in a sentence. One of the most defining characters of dependency structures is the ability to bring long distance dependency between words to local dependency structures. Another the main attraction of…

  1. Toward Determining the Comprehensibility of Machine Translations

    Science.gov (United States)

    2012-01-01

    responses to a stimulus (Macmillan and Creelman , 1991). It has been applied in areas such as lie detection (truth/lie), inspection (ac- ceptable...1-1/(2N) (Macmil- lan and Creelman , 1991). Negative values, which usually indicate response confusion, were eliminated. The results of...Macmillan, Neil and C. Douglas Creelman . (1991). Detection theory: A User’s guide. Cambridge Univer- sity Press, pp. 10 &125. Marchant

  2. Machine on Trial

    Science.gov (United States)

    2012-06-01

    front of the Judge, how did we come to justify the ethical use of such a machine? The Judge called for a recess so that he could further educate ...commerce, where money would change hands, or for the purposes of controlling critical systems.”29 The initial group that set up the ARPANET did not... EBook of Fundamental Principals of the Metaphysic of Morals, May, 2004 [ EBook #5682 translated by Thomas Kingsmill Abbott, 12. http://manybooks.net

  3. Extracting Noun Phrases from Large-Scale Texts A Hybrid Approach and Its Automatic Evaluation

    CERN Document Server

    Chen, K; Chen, Kuang-hua; Chen, Hsin-Hsi

    1994-01-01

    To acquire noun phrases from running texts is useful for many applications, such as word grouping,terminology indexing, etc. The reported literatures adopt pure probabilistic approach, or pure rule-based noun phrases grammar to tackle this problem. In this paper, we apply a probabilistic chunker to deciding the implicit boundaries of constituents and utilize the linguistic knowledge to extract the noun phrases by a finite state mechanism. The test texts are SUSANNE Corpus and the results are evaluated by comparing the parse field of SUSANNE Corpus automatically. The results of this preliminary experiment are encouraging.

  4. Translation modalities: an investigation of the translated short story “Dez de dezembro”

    Directory of Open Access Journals (Sweden)

    Clara Peron da Silva Guedes

    2017-05-01

    Full Text Available During the translation process translators adopt linguistic strategies in order to make decisions that help to render a translated text suitable to the target language and culture. The translation modalities proposed by Aubert (105-10 constitute a tool that enables one to identify some of these strategies. In addition, they permit to measure the level of linguistic differentiation between a source text and a target text verifying the distance or the proximity of the target text to the linguistic and cultural issues of the source text. Thus, this paper aims to investigate the translation modalities in the short story “Dez de dezembro” (Saunders 204-38, a translation of the short story “Tenth of December” (Saunders 215-51. For quantifying the translation modalities in the translated text the noun phrases from the source text were selected and their counterparts in the target text were classified and annotated within Notepad++ software. The most recurrent translation modalities in the corpus were Literal Translation and Transposition, categories considered intermediate ones in the rank proposed by Aubert (105-10. Therefore, a relation of equivalence can be established between the target and the source texts.

  5. The translation of Psalm 89:13 and its implications

    Directory of Open Access Journals (Sweden)

    Pieter M. Venter

    2005-10-01

    Full Text Available Psalm 89:13a can be translated: “You created north and south.” However, it can also be translated as “Zaphon and Yemin you created,” In the former case the phrase is seen as an indication of the extensiveness of the earth. In the latter the translation points to the mythological form of the hymn section (Ps 89:2-19 in which it appears. This article chooses for the mythological alternative and substantiates this position from the northern toponyms used in the hymn and the hymn’s obvious mythological structure.

  6. Sustainable machining

    CERN Document Server

    2017-01-01

    This book provides an overview on current sustainable machining. Its chapters cover the concept in economic, social and environmental dimensions. It provides the reader with proper ways to handle several pollutants produced during the machining process. The book is useful on both undergraduate and postgraduate levels and it is of interest to all those working with manufacturing and machining technology.

  7. Revising Translations

    DEFF Research Database (Denmark)

    Rasmussen, Kirsten Wølch; Schjoldager, Anne

    2011-01-01

    out by specialised revisers, but by staff translators, who revise the work of colleagues and freelancers on an ad hoc basis. Corrections are mostly given in a peer-to-peer fashion, though the work of freelancers and inexperienced in-house translators is often revised in an authoritative (nonnegotiable......) way. Most respondents and interviewees are worried about increasing pressures on the translation market, which, combined with customers’ general lack of understanding of the translation process, mean that systematic, all-encompassing quality assurance is rarely financially viable....

  8. Structural organization and scale-free properties in Chinese Phrase Networks

    Institute of Scientific and Technical Information of China (English)

    LI Yong; WEI Luoxia; NIU Yi; YIN Junxun

    2005-01-01

    Monkey language models are defined for Chinese Phrase Networks, and scale-free features of Chinese Phrase Networks are uncovered. It is pointed out that the ratio of average degree to the total number of nodes ((K)IN) is close to a constant. Simulation for the evolution of phrase networks indicates that one of the important reasons for power law distributions is the word selection frequency, which, when tuned aptly, can make the monkey language present similar statistic traits as that of natural languages. Power law tails emerge at large k, and the exponent is about 6. Comparison between monkey model and natural language shows that humans are able to use Chinese words resources in more effective and compact ways to express their intentions. All the results demonstrate an important fact that the least effort principle is the basis of Chinese Phrase Networks.

  9. Increases in individualistic words and phrases in American books, 1960-2008.

    Science.gov (United States)

    Twenge, Jean M; Campbell, W Keith; Gentile, Brittany

    2012-01-01

    Cultural products such as song lyrics, television shows, and books reveal cultural differences, including cultural change over time. Two studies examine changes in the use of individualistic words (Study 1) and phrases (Study 2) in the Google Books Ngram corpus of millions of books in American English. Current samples from the general population generated and rated lists of individualistic words and phrases (e.g., "unique," "personalize," "self," "all about me," "I am special," "I'm the best"). Individualistic words and phrases increased in use between 1960 and 2008, even when controlling for changes in communal words and phrases. Language in American books has become increasingly focused on the self and uniqueness in the decades since 1960.

  10. The phrase “information storage and retrieval” (IS&R)

    DEFF Research Database (Denmark)

    Hjørland, Birger

    2015-01-01

    Scholars have uncovered abundant data about the history of the term “information”, as well as some of its many combined phrases (e.g. “information science”, “information retrieval” and “information technology”). Many other compounds involving “information” seem, however, not to have a known origi...... yet. In this article, further information about the phrase “information storage and retrieval” is provided. To know the history of terms and their associated concepts is an important prescription against poor terminological phrasing and theoretical confusion.......Scholars have uncovered abundant data about the history of the term “information”, as well as some of its many combined phrases (e.g. “information science”, “information retrieval” and “information technology”). Many other compounds involving “information” seem, however, not to have a known origin...

  11. An investigation into the correlation of cue phrases, unfilled pauses and the structuring of spoken discourse

    CERN Document Server

    Cahn, J E

    1992-01-01

    Expectations about the correlation of cue phrases, the duration of unfilled pauses and the structuring of spoken discourse are framed in light of Grosz and Sidner's theory of discourse and are tested for a directions-giving dialogue. The results suggest that cue phrase and discourse structuring tasks may align, and show a correlation for pause length and some of the modifications that speakers can make to discourse structure.

  12. The Role of Sustained Attention in the Production of Conjoined Noun Phrases: An Individual Differences Study.

    Science.gov (United States)

    Jongman, Suzanne R; Meyer, Antje S; Roelofs, Ardi

    2015-01-01

    It has previously been shown that language production, performed simultaneously with a nonlinguistic task, involves sustained attention. Sustained attention concerns the ability to maintain alertness over time. Here, we aimed to replicate the previous finding by showing that individuals call upon sustained attention when they plan single noun phrases (e.g., "the carrot") and perform a manual arrow categorization task. In addition, we investigated whether speakers also recruit sustained attention when they produce conjoined noun phrases (e.g., "the carrot and the bucket") describing two pictures, that is, when both the first and second task are linguistic. We found that sustained attention correlated with the proportion of abnormally slow phrase-production responses. Individuals with poor sustained attention displayed a greater number of very slow responses than individuals with better sustained attention. Importantly, this relationship was obtained both for the production of single phrases while performing a nonlinguistic manual task, and the production of noun phrase conjunctions in referring to two spatially separated objects. Inhibition and updating abilities were also measured. These scores did not correlate with our measure of sustained attention, suggesting that sustained attention and executive control are distinct. Overall, the results suggest that planning conjoined noun phrases involves sustained attention, and that language production happens less automatically than has often been assumed.

  13. Revising Translations

    DEFF Research Database (Denmark)

    Rasmussen, Kirsten Wølch; Schjoldager, Anne

    2011-01-01

    out by specialised revisers, but by staff translators, who revise the work of colleagues and freelancers on an ad hoc basis. Corrections are mostly given in a peer-to-peer fashion, though the work of freelancers and inexperienced in-house translators is often revised in an authoritative (nonnegotiable...

  14. Coping with Uncertainty: Noun Phrase Interpretation and Early Semantic Analysis

    OpenAIRE

    Mellish, Christopher Stuart

    1981-01-01

    A computer program which can "understand" natural language texts must have both syntactic knowledge about the language concerned and semantic knowledge of how what is written relates to its internal representation of the world. It has been a matter of some controversy how these sources of information can best be integrated to translate from an input text to a formal meaning representation. The controversy has concerned largely the question as to what degree of syntactic a...

  15. Binary translation using peephole translation rules

    Science.gov (United States)

    Bansal, Sorav; Aiken, Alex

    2010-05-04

    An efficient binary translator uses peephole translation rules to directly translate executable code from one instruction set to another. In a preferred embodiment, the translation rules are generated using superoptimization techniques that enable the translator to automatically learn translation rules for translating code from the source to target instruction set architecture.

  16. Simple machines

    CERN Document Server

    Graybill, George

    2007-01-01

    Just how simple are simple machines? With our ready-to-use resource, they are simple to teach and easy to learn! Chocked full of information and activities, we begin with a look at force, motion and work, and examples of simple machines in daily life are given. With this background, we move on to different kinds of simple machines including: Levers, Inclined Planes, Wedges, Screws, Pulleys, and Wheels and Axles. An exploration of some compound machines follows, such as the can opener. Our resource is a real time-saver as all the reading passages, student activities are provided. Presented in s

  17. A Hybrid Model For Phrase Chunking Employing Artificial Immunity System And Rule Based Methods

    Directory of Open Access Journals (Sweden)

    Bindu.M.S

    2011-11-01

    Full Text Available Natural language Understanding (NLU, an important field of Artificial Intelligence (AI is concerned with the speech and language understanding between human and computer. Understanding language means knowing what concept a word or phrase stands for and how to link them to form meaningful sentence. Identification of phrases or phrase chunking is an important step in natural language understanding (NLU. Chunker identifies and divides sentences into syntactically correlated word groups. Question Answering (QA systems, another important application of Artificial Intelligence (AImostly requires retrieval of nouns or noun phrases as answers to the questions raised by the users. Also Chunking is an important preprocessing step in full parsing. Due to high ambiguity of natural language, exact parsing of text may become very complex. This ambiguity may be partially resolved by using chunking as an intermediate step. To the best of our knowledge no known work or tag set is available for phrase chunking in Malayalam. To separate the chunks in a document it must be labeled with parts-ofspeech (POS tags. POS Tagging is a difficult task in Malayalam as it is a complex and compounding language. In this paper we describe the application of artificial immunity system (AIS for chunking which is implemented and obtained an accurate output with 96% precision and 93% recall. This system istested on corpuses collected from reputed news papers and magazines. These corpuses contained documents from five different domains such as sports, health, agriculture, science and politics and each document contained sentences –simple, compound, complex-of various levels of complexity. POS tag set with 52 tags is developed for preparing the tagged corpus for Malayalam. The phrase tag set contains 20 phrase tags.

  18. Electric machine

    Science.gov (United States)

    El-Refaie, Ayman Mohamed Fawzi [Niskayuna, NY; Reddy, Patel Bhageerath [Madison, WI

    2012-07-17

    An interior permanent magnet electric machine is disclosed. The interior permanent magnet electric machine comprises a rotor comprising a plurality of radially placed magnets each having a proximal end and a distal end, wherein each magnet comprises a plurality of magnetic segments and at least one magnetic segment towards the distal end comprises a high resistivity magnetic material.

  19. Literal Translation using Google Translate in Translating the Text from French to English in Digital Tourism Brochure Bienvenue Paris

    Directory of Open Access Journals (Sweden)

    Rila Hilma

    2011-04-01

    Full Text Available Translation is basically change of form. The form from which the translation is made will be called the source language and the form into which it is to be changed will be called the receptor language. Translation consists of transferring the meaning of the source language into the receptor language. Translating is not an easy job to do because many things to be considered to do this activity because translation means determining the meaning of a text, then reconstructing this same meaning using the appropriate structure and form in the receptor language. Translation is basically divided by two types of translation, one is literal and the other is idiomatic. Literal translation is really strict to the structure and form then often can not well express the true meaning of source language. Idiomatic translation makes every effort to communicate the meaning of the source language text in the natural forms of the receptor language. Then the most popular translation machine, Google Translate, in this study shows the results of translation which remain odd, unnatural, and nonsensical because the unsuccessful of message delivery, which is notably the typically error of literal translation.

  20. CONTEXTUAL MEANING STUDY OF TRANSLATION OF CHILDRENS STORY THE LION KING FROM ENGLISH INTO INDONESIAN

    Directory of Open Access Journals (Sweden)

    Djuria Suprato

    2010-04-01

    Full Text Available Article aimed to present study on the translation of words, phrases, expressions and sentences in a children's story.The story of "The Lion King" was used as case and content analysis was applied as a reference for the study. The focus of this analysis was aimed to observe the appearance of the terms or meaning of certain words that are considered important and supports the goals of the research, in this case is the translation of words, phrases, sentences in the source language (English into the target language (Indonesian. As its criterion, dictionaries was used and rules of contextual translation was applied. It can be concluded that children's stories emphasize the style and meaning contained in it, so the translation is done must consider the context of the story.

  1. Interactive Translation Prediction versus Conventional Post-editing in Practice

    DEFF Research Database (Denmark)

    Sanchis-Trilles, German; Alabau, Vicent; Buck, Christian;

    2014-01-01

    We conducted a field trial in computer-assisted professional translation to compare Interactive Translation Prediction (ITP) against conventional post- editing (PE) of machine translation (MT) output. In contrast to the conventional PE set-up, where an MT system first produces a static translatio...

  2. Sketch of a Noisy Channel Model for the Translation Process

    DEFF Research Database (Denmark)

    Carl, Michael

    The paper develops a Noisy Channel Model for the translation process that is based on actual user activity data. It builds on the monitor model and makes a distinction between early, automatic and late, conscious translation processes: while early priming processes are at the basis of a "literal ...... of the Center for Research and Innovation in Translation and Translation Technology (CRITT) at IBC....... of International Business Communication. His current research interests are related to the investigation of human translation processes and how advanced computer tools (such as machine translation) can fruitfully complement and support the human (translation) activities. Furthermore, he is the director...

  3. A "Living" Machine

    Institute of Scientific and Technical Information of China (English)

    N.R.Bogatyrev

    2004-01-01

    Biomimetics (or bionics) is the engineering discipline that constructs artificial systems using biological principles. The ideal final result in biomimetics is to create a living machine. But what are the desirable and non-desirable properties of biomimetic product? Where can natural prototypes be found? How can technical solutions be transferred from nature to technology? Can we use living nature like LEGO bricks for construction our machines? How can biology help us? What is a living machine? In biomimetic practice only some "part" (organ, part of organ, tissue) of the observed whole organism is utilized. A possible template for future super-organism extension for biomimetic methods might be drawn from experiments in holistic ecological agriculture (ecological design, permaculture, ecological engineering, etc. ). The necessary translation of these rules to practical action can be achieved with the Russian Theory of Inventive Problem Solving (TRIZ), specifically adjusted to biology. Thus, permaculture, reinforced by a TRIZ conceptual framework, might provide the basis for Super-Organismic Bionics, which is hypothesized as necessary for effective ecological engineering. This hypothesis is supported by a case study-the design of a sustainable artificial nature reserve for wild pollinators as a living machine.

  4. Parallel functional category deficits in clauses and nominal phrases: The case of English agrammatism.

    Science.gov (United States)

    Wang, Honglei; Yoshida, Masaya; Thompson, Cynthia K

    2014-01-01

    Individuals with agrammatic aphasia exhibit restricted patterns of impairment of functional morphemes, however, syntactic characterization of the impairment is controversial. Previous studies have focused on functional morphology in clauses only. This study extends the empirical domain by testing functional morphemes in English nominal phrases in aphasia and comparing patients' impairment to their impairment of functional morphemes in English clauses. In the linguistics literature, it is assumed that clauses and nominal phrases are structurally parallel but exhibit inflectional differences. The results of the present study indicated that aphasic speakers evinced similar impairment patterns in clauses and nominal phrases. These findings are consistent with the Distributed Morphology Hypothesis (DMH), suggesting that the source of functional morphology deficits among agrammatics relates to difficulty implementing rules that convert inflectional features into morphemes. Our findings, however, are inconsistent with the Tree Pruning Hypothesis (TPH), which suggests that patients have difficulty building complex hierarchical structures.

  5. Distinct ERP signatures of word frequency, phrase frequency, and prototypicality in speech production.

    Science.gov (United States)

    Hendrix, Peter; Bolger, Patrick; Baayen, Harald

    2017-01-01

    Recent studies have documented frequency effects for word n-grams, independently of word unigram frequency. Further studies have revealed constructional prototype effects, both at the word level as well as for phrases. The present speech production study investigates the time course of these effects for the production of prepositional phrases in English, using event related potentials (ERPs). For word frequency, oscillations in the theta range emerged. By contrast, phrase frequency showed a persistent effect over time. Furthermore, independent effects with different temporal and topographical signatures characterized phrasal prototypicality. In a simulation study we demonstrate that naive discrimination learning provides an alternative account of the data that is as least as powerful as a standard lexical predictor analysis. The implications of the current findings for models of language processing are discussed. (PsycINFO Database Record

  6. The Machine within the Machine

    CERN Multimedia

    Katarina Anthony

    2014-01-01

    Although Virtual Machines are widespread across CERN, you probably won't have heard of them unless you work for an experiment. Virtual machines - known as VMs - allow you to create a separate machine within your own, allowing you to run Linux on your Mac, or Windows on your Linux - whatever combination you need.   Using a CERN Virtual Machine, a Linux analysis software runs on a Macbook. When it comes to LHC data, one of the primary issues collaborations face is the diversity of computing environments among collaborators spread across the world. What if an institute cannot run the analysis software because they use different operating systems? "That's where the CernVM project comes in," says Gerardo Ganis, PH-SFT staff member and leader of the CernVM project. "We were able to respond to experimentalists' concerns by providing a virtual machine package that could be used to run experiment software. This way, no matter what hardware they have ...

  7. Translating democracy

    DEFF Research Database (Denmark)

    Doerr, Nicole

    2012-01-01

    grassroots activists in social movements use translation as a novel practice to debate political alternatives in the European Union's (EU) multilingual public sphere. In recent years, new cross-European protest movements have created the multilingual discursive democracy arena known as the European Social...... in institutionalized habits and norms of deliberation. Addressing democratic theorists, my findings suggest that translation could be a way to think about difference not as a hindrance but as a resource for democracy in linguistically heterogeneous societies and public spaces, without presupposing a shared language...

  8. Amharic Speech Recognition for Speech Translation

    OpenAIRE

    Melese, Michael; Besacier, Laurent; Meshesha, Million

    2016-01-01

    International audience; The state-of-the-art speech translation can be seen as a cascade of Automatic Speech Recognition, Statistical Machine Translation and Text-To-Speech synthesis. In this study an attempt is made to experiment on Amharic speech recognition for Amharic-English speech translation in tourism domain. Since there is no Amharic speech corpus, we developed a read-speech corpus of 7.43hr in tourism domain. The Amharic speech corpus has been recorded after translating standard Bas...

  9. Descriptive and discourse-referential modifiers in a layered model of the noun phrase

    DEFF Research Database (Denmark)

    Rijkhoff, Jan

    2008-01-01

    This article argues that adnominal modifiers in a layered model of the noun phrase can be divided into two major subcategories: descriptive modifiers and discourse-referential modifiers. Whereas descriptive modifiers can be subdivided into classifying, qualifying, quantifying and localizing......), (ii) the special relationship between localizing and discourse-referential modifiers (section 5), and (iii) semantic and morpho-syntactic parallels between modifier categories in the noun phrase and the clause (section 6). In addition this sample-based typological study shows (contra Hawkins...

  10. The phrase “information storage and retrieval” (IS&R):A historical note

    OpenAIRE

    Hjørland, Birger

    2015-01-01

    Scholars have uncovered abundant data about the history of the term “information”, as well as some of its many combined phrases (e.g. “information science”, “information retrieval” and “information technology”). Many other compounds involving “information” seem, however, not to have a known origin yet. In this article, further information about the phrase “information storage and retrieval” is provided. To know the history of terms and their associated concepts is an important prescription ag...

  11. Anatomie d'une pseudo-phrase complexe: le cas de heureusement que P

    Directory of Open Access Journals (Sweden)

    Gaatone David

    2012-07-01

    Full Text Available Les phrases du type "Heureusement que P" ont souvent intrigué les linguistes, qui les mentionnent, en règle générale, au chapitre de la phrase complexe. Elles soulèvent un certain nombre de questions, auxquelles ont été données des réponses pas toujours évidentes. S'agit-il vraiment d'une phrase complexe, introduite par un adverbe, qui ferait fonction de proposition principale elliptique, averbale, suivie d'une subordonnée, elle-même introduite par la conjonction de subordination par excellence "que"? Ou aurait-on plutot affaire à une phrase indépendante, ou "que", qui peut commuter avec une virgule, ne serait rien d'autre qu'une marque prosodique? Comment, dans chacune de ces optiques, rendre compte de la différence de mode du verbe dans les énoncés sémantiquement équivalents "Heureusement que Nir (est/*soit venu" et "Il est heureux que Nir (?est/soit venu"? Quels sont les adverbes susceptibles d'introduire de telles phrases? Appartiennent-ils à une famille semantique homogène et, le cas échéant, laquelle? Quelle est la structure thématique des phrases de ce type et leur rôle dans la communication? Sont-elle toujours permises ou leur emploi dépend-il d'un certain contexte? Et, lorsqu'elles le sont, peut-on toujours les faire commuter avec des phrases de même sens, mais formellement différentes, telles que, par exemple, les phrases correspondantes avec adverbe en tête, séparé du reste par une simple pause ("Heureusement, Nir est venu"? Enfin, avons-nous affaire a une structure productive, un moule dans lequel on peut introduire tout adverbe de la famille syntactico-sémantique de "heureusement", ou plutot d'une construction en quelque sorte figée, limitée à un relativement petit nombre d'adverbes?

  12. Machine Learning

    CERN Document Server

    CERN. Geneva

    2017-01-01

    Machine learning, which builds on ideas in computer science, statistics, and optimization, focuses on developing algorithms to identify patterns and regularities in data, and using these learned patterns to make predictions on new observations. Boosted by its industrial and commercial applications, the field of machine learning is quickly evolving and expanding. Recent advances have seen great success in the realms of computer vision, natural language processing, and broadly in data science. Many of these techniques have already been applied in particle physics, for instance for particle identification, detector monitoring, and the optimization of computer resources. Modern machine learning approaches, such as deep learning, are only just beginning to be applied to the analysis of High Energy Physics data to approach more and more complex problems. These classes will review the framework behind machine learning and discuss recent developments in the field.

  13. Content and Phrasing in Titles of Original Research and Review Articles in 2015: Range of Practice in Four Clinical Journals

    Directory of Open Access Journals (Sweden)

    Mary Ellen Kerans

    2016-04-01

    Full Text Available Reporting guidelines for clinical research designs emerged in the mid-1990s and have influenced various aspects of research articles, including titles, which have also been subject to changing uses with the growth of electronic database searching and efforts to reduce bias in literature searches. We aimed (1 to learn more about titles in clinical medicine today and (2 to develop an efficient, reliable way to study titles over time and on the fly—for quick application by authors, manuscript editors, translators and instructors. We compared content and form in titles from two general medical journals—the New England Journal of Medicine (NEJM and the British Medical Journal—and two anesthesiology journals (the European Journal of Anaesthesiology and Anesthesiology; we also analyzed the inter-rater reliability of our coding. Significant content differences were found in the frequencies of mentions of methods, results (between general and subspecialty titles, and geographic setting; phrasing differences were found in the prevalence of full-sentence and compound titles (and their punctuation. NEJM titles were significantly shorter, and this journal differed consistently on several features. We conclude that authors must learn to efficiently survey titles for form and content patterns when preparing manuscripts to submit to unfamiliar journals or on resubmitting to a new journal after rejection.

  14. 试论英文国名词汇的文化情结%A Study of the Cultural Emotion in English Phrases with Country Names

    Institute of Scientific and Technical Information of China (English)

    毛丽霞

    2013-01-01

    Language often objectively relfects speciifc things and cultures, but ethnocentrism often causes the proud praise of one’s own country and the demeaning of other countries. Therefore, care must be taken in understanding and translating phrases containing names of countries. To avoid possible misunderstanding, the meanings of phrases with country names must be f ully understood by analyzing their underlied cultural connotations.%语言客观反映一国所特有的东西和文化,但民族中心主义导致了对本国的褒扬和他国的贬低,因此,对含有国家名称的词汇如何理解和翻译要慎重,切莫直译乱翻、望文生义,而要在了解其文化背景的基础上理解其涵义,避免产生误解。

  15. Phonological Phrase Boundaries Constrain the Online Syntactic Analysis of Spoken Sentences

    Science.gov (United States)

    Millotte, Severine; Rene, Alice; Wales, Roger; Christophe, Anne

    2008-01-01

    Two experiments tested whether phonological phrase boundaries constrain online syntactic analysis in French. Pairs of homophones belonging to different syntactic categories (verb and adjective) were used to create sentences with a local syntactic ambiguity (e.g., [le petit chien "mort"], in English, the "dead" little dog, vs. [le petit chien]…

  16. A Developmental Study of Prepositional Phrases in Hebrew Written Text Construction

    Science.gov (United States)

    Brandes, Gilad; Ravid, Dorit

    2017-01-01

    Prepositional phrases (PPs) are considered an important feature of mature written expression. However, little is known about the development of PPs during the school years. The study examined the use of PPs in 160 narrative and expository texts, written by Hebrew-users in grades 4, 7, and 11, and adults. PPs were identified, counted, and…

  17. Children's Knowledge of Hierarchical Phrase Structure: Quantifier Floating in Japanese

    Science.gov (United States)

    Suzuki, Takaaki; Yoshinaga, Naoko

    2013-01-01

    The interpretation of floating quantifiers in Japanese requires knowledge of hierarchical phrase structure. However, the input to children is insufficient or even misleading, as our analysis indicates. This presents an intriguing question on learnability: do children interpret floating quantifiers based on a structure-dependent rule which is not…

  18. "THE BACON" Not "the Bacon": How Children and Adults Understand Accented and Unaccented Noun Phrases

    Science.gov (United States)

    Arnold, Jennifer E.

    2008-01-01

    Two eye-tracking experiments examine whether adults and 4- and 5-year-old children use the presence or absence of accenting to guide their interpretation of noun phrases (e.g., "the bacon") with respect to the discourse context. Unaccented nouns tend to refer to contextually accessible referents, while accented variants tend to be used for less…

  19. Acoustic patterns and communicative functions of phrase-final F0 rises in German

    DEFF Research Database (Denmark)

    Dombrowski, Ernst; Niebuhr, Oliver

    2006-01-01

    Acoustic features and communicative functions of phrase-final F(0) rises starting before an accented-vowel onset are analysed in a corpus of German unscripted speech. Two conversational conditions are examined: turn-yielding and turn-holding. The most important feature distinguishing rises...

  20. On Equivalence and Non-equivalence of Words and Phrases In Different Cultures

    Institute of Scientific and Technical Information of China (English)

    高新霞

    2008-01-01

    This paper is mainly focused on the study of the relationship among three concepts, namely, culture,psychological association and equivalence or non-equivalence of words and phrases with combination of Problems posed by my students and discuss three patterns of the relationship with an attempt to describe the importance of culture to equivalence.

  1. HPSG 2005 - The 12th International Conference on Head-Driven Phrase Structure Grammar: Conference Notes

    OpenAIRE

    Branco, António Horta; Costa, Francisco

    2005-01-01

    This technical report collects the extended abstracts selected for presentation at HPSG 2005 - The 12th International Conference on Head-Driven Phrase Structure Grammar (University of Lisbon, August 23-24, 2005 - http://hpsg2005.di.fc.ul.pt/) by the corresponding program committee

  2. The perception of musical phrase structure: a cross-cultural ERP study.

    Science.gov (United States)

    Nan, Yun; Knösche, Thomas R; Friederici, Angela D

    2006-06-13

    Electroencephalography (EEG) was used in a cross-cultural music study investigating phrase boundary perception. Chinese and German musicians performed a cultural categorization task under Chinese and Western music listening conditions. Western music was the major subject for both groups of musicians, while Chinese music was familiar to Chinese subjects only. By manipulating the presence of pauses between two phrases in the biphrasal melodies, EEG correlates for the perception of phrase boundaries were found in both groups under both music listening conditions. Between 450 and 600 ms, the music CPS (closure positive shift), which had been found in earlier studies with a false tone detection task, was replicated for the more global categorization task and for all combinations of subject group and musical style. At short latencies (100 and 450 ms post phrase boundary offset), EEG correlates varied as a function of musical styles and subject group. Both bottom-up (style properties of the music) and top-down (acculturation of the subjects) information interacted during this early processing stage.

  3. Adding and Subtracting Alternation: Resumption and Prepositional Phrase Chopping in Spanish Relative Clauses

    Science.gov (United States)

    Cerron-Palomino Lopez, Alvaro

    2010-01-01

    This dissertation is a variationist account of two non-standard relative clause (RC) structures in Spanish: resumptive pronouns (RPs) and prepositional-phrase (PP) chopping. Previous typological studies considered RP explanations based on difficulty of processing (Hawkins, 1994), while Spanish-specific quantitative studies proposed a number of…

  4. On the Law of Zipf-Mandelbrot for Multi-Word Phrases.

    Science.gov (United States)

    Egghe, L.

    1999-01-01

    Zipf's law states that if words in a text are arranged in decreasing order of occurrence, then the product of the rank of a word and the number of times it occurs is a constant of that text. Examination of the probabilities of occurrence of multi-word phrases in information retrieval in relation to the probabilities of occurrence of single words…

  5. The Economy of Fluent Speaking: Phrase-Level Reduction in a Patient with Pure Apraxia of Speech

    Science.gov (United States)

    Staiger, Anja; Ruttenauer, Anna; Ziegler, Wolfram

    2010-01-01

    The term "phrase-level reduction" refers to transformations of the phonetic forms of words in connected speech. They are a characteristic property of fluent speech in normal speakers. Phrase-level reductions contribute to a reduction of articulatory-motor effort and constitute an important aspect of speech naturalness. So far, these phenomena have…

  6. Improving Oral Reading Fluency through Response Opportunities: A Comparison of Phrase Drill Error Correction with Repeated Readings

    Science.gov (United States)

    Begeny, John C.; Daly, Edward J., III; Valleley, Rachel J.

    2006-01-01

    The purpose of this study was to compare two oral reading fluency treatments (repeated readings and phrase drill error correction) which differ in the way they prompt student responding. Repeated readings (RR) and phrase drill (PD) error correction were alternated with a baseline and a reward condition within an alternating treatments design with…

  7. 13 CFR 142.6 - What does the phrase “know or have reason to know” mean?

    Science.gov (United States)

    2010-01-01

    ... 13 Business Credit and Assistance 1 2010-01-01 2010-01-01 false What does the phrase âknow or have reason to knowâ mean? 142.6 Section 142.6 Business Credit and Assistance SMALL BUSINESS ADMINISTRATION PROGRAM FRAUD CIVIL REMEDIES ACT REGULATIONS Overview and Definitions § 142.6 What does the phrase...

  8. The Economy of Fluent Speaking: Phrase-Level Reduction in a Patient with Pure Apraxia of Speech

    Science.gov (United States)

    Staiger, Anja; Ruttenauer, Anna; Ziegler, Wolfram

    2010-01-01

    The term "phrase-level reduction" refers to transformations of the phonetic forms of words in connected speech. They are a characteristic property of fluent speech in normal speakers. Phrase-level reductions contribute to a reduction of articulatory-motor effort and constitute an important aspect of speech naturalness. So far, these phenomena have…

  9. Beyond Translation

    DEFF Research Database (Denmark)

    Olwig, Mette Fog

    2013-01-01

    This article contributes to the growing scholarship on local development practitioners by re-examining conceptualizations of practitioners as ‘brokers’ strategically translating between ‘travelling’ (development institution) rationalities and ‘placed’ (recipient area) rationalities in relation...... and practice spurred by new challenges deriving from climate change anxiety, the study shows how local practitioners often make local activities fit into travelling development rationalities as a matter of habit, rather than as a conscious strategy. They may therefore cease to ‘translate’ between different...... rationalities. This is shown to have important implications for theory, research and practice concerning disaster risk reduction and climate change adaptation in which such translation is often expected....

  10. Translational genomics

    Directory of Open Access Journals (Sweden)

    Martin Kussmann

    2014-09-01

    Full Text Available The term “Translational Genomics” reflects both title and mission of this new journal. “Translational” has traditionally been understood as “applied research” or “development”, different from or even opposed to “basic research”. Recent scientific and societal developments have triggered a re-assessment of the connotation that “translational” and “basic” are either/or activities: translational research nowadays aims at feeding the best science into applications and solutions for human society. We therefore argue here basic science to be challenged and leveraged for its relevance to human health and societal benefits. This more recent approach and attitude are catalyzed by four trends or developments: evidence-based solutions; large-scale, high dimensional data; consumer/patient empowerment; and systems-level understanding.

  11. Hodnocení automatického překladu SMT systémů (Google translate, Bing) z francouzštiny do češtiny: kolokace z oblasti "bezpečnost potravin"

    OpenAIRE

    ŠVARCOVÁ, Zora

    2016-01-01

    The aim of this bachelor thesis is to test and evaluate the translation quality of selected terminological collocations related to "food safety" using free available translators (Microsoft Bing Translator, Google Translate). The work is divided into two parts theoretical and practical. Theoretical part is divided into several chapters, which focus on the development, history and use of statistical machine translation, the basic strategies of machine translation systems (rule-based machine tra...

  12. Machine Learning

    Energy Technology Data Exchange (ETDEWEB)

    Chikkagoudar, Satish; Chatterjee, Samrat; Thomas, Dennis G.; Carroll, Thomas E.; Muller, George

    2017-04-21

    The absence of a robust and unified theory of cyber dynamics presents challenges and opportunities for using machine learning based data-driven approaches to further the understanding of the behavior of such complex systems. Analysts can also use machine learning approaches to gain operational insights. In order to be operationally beneficial, cybersecurity machine learning based models need to have the ability to: (1) represent a real-world system, (2) infer system properties, and (3) learn and adapt based on expert knowledge and observations. Probabilistic models and Probabilistic graphical models provide these necessary properties and are further explored in this chapter. Bayesian Networks and Hidden Markov Models are introduced as an example of a widely used data driven classification/modeling strategy.

  13. Applications and modelling of bulk HTSs in brushless ac machines

    Energy Technology Data Exchange (ETDEWEB)

    Barnes, G.J. [Department of Engineering Science, University of Oxford, Parks Road, Oxford OX1 3PJ (United Kingdom). E-mail: gary.barnes at eng.ox.ac.uk; McCulloch, M.D.; Dew-Hughes, D. [Department of Engineering Science, University of Oxford, Parks Road, Oxford OX1 3PJ (United Kingdom)

    2000-06-01

    The use of high temperature superconducting material in its bulk form for engineering applications is attractive due to the large power densities that can be achieved. In brushless electrical machines, there are essentially four properties that can be exploited; their hysteretic nature, their flux shielding properties, their ability to trap large flux densities and their ability to produce levitation. These properties translate to hysteresis machines, reluctance machines, trapped-field synchronous machines and linear motors respectively. Each one of these machines is addressed separately and computer simulations that reveal the current and field distributions within the machines are used to explain their operation. (author)

  14. Applications and modelling of bulk HTSs in brushless ac machines

    Science.gov (United States)

    Barnes, G. J.; McCulloch, M. D.; Dew-Hughes, D.

    2000-06-01

    The use of high temperature superconducting material in its bulk form for engineering applications is attractive due to the large power densities that can be achieved. In brushless electrical machines, there are essentially four properties that can be exploited; their hysteretic nature, their flux shielding properties, their ability to trap large flux densities and their ability to produce levitation. These properties translate to hysteresis machines, reluctance machines, trapped-field synchronous machines and linear motors respectively. Each one of these machines is addressed separately and computer simulations that reveal the current and field distributions within the machines are used to explain their operation.

  15. The Multilingual Translator: Words and Phrases in 15 Languages To Help You Communicate with Students of Diverse Backgrounds.

    Science.gov (United States)

    Moore, Helen H.

    The illustrated guide provides vocabulary for common classroom terminology in 15 languages: Arabic; Bengali; Chinese; Farsi; Haitian-Creole; Hindi; Italian; Korean; Polish; Russian; Serbo-Croatian; Spanish; Tagalog; Urdu; and Vietnamese. The languages are those most commonly spoken by immigrants to the United States. For each language there are…

  16. AN ANALYSIS TOWARDS CULTURAL FACTORS IN ENGLISH AND CHINESE TRANSLATION IN THE MATERIAL FIELD

    Institute of Scientific and Technical Information of China (English)

    Guo Ying

    2008-01-01

    Language is the carrier of culture, and translation is a cultural information exchanging process in which the connotation of one language is expressed in the other one, namely, translation is not only the process of language exchanging but also the process of culture exchanging. To a great extent, the transform of culture information is restricted by culture factors. Therefore, in order to reappear the original flavor naturally and properly during the process of translation, one must think highly of the fact that culture factors affect translation activities a lot. This thesis transilluminates the culture factors in E/C translation through the translation comparison among the Chinese and English set phrases, idioms, allusions, sayings, and colloquialism; It also tries to analyze the way of cultural information transaction in E/C translation and put forward the opinion to it.

  17. Theorizing About Translation and Translation Studies

    OpenAIRE

    Williams, Mukesh

    2012-01-01

    The act of translation between languages and cultures has been going on for centuries, but the act of theorizing about-translation is of recent origin. In the last forty years translation scholars have attempted to understand the process of translation and evaluate its merits giving rise to a whole range of conceptualizing which is now called translation studies. Translation studies , therefore, has grown within important scholastic enclosures of the west attempting to conduct political and i...

  18. Machine testning

    DEFF Research Database (Denmark)

    De Chiffre, Leonardo

    This document is used in connection with a laboratory exercise of 3 hours duration as a part of the course GEOMETRICAL METROLOGY AND MACHINE TESTING. The exercise includes a series of tests carried out by the student on a conventional and a numerically controled lathe, respectively. This document...

  19. Representational Machines

    DEFF Research Database (Denmark)

    Petersson, Dag; Dahlgren, Anna; Vestberg, Nina Lager

    to the enterprises of the medium. This is the subject of Representational Machines: How photography enlists the workings of institutional technologies in search of establishing new iconic and social spaces. Together, the contributions to this edited volume span historical epochs, social environments, technological...

  20. Contextual Meaning Study of Translation of Children’s Story “The Lion King” from English into Indonesian

    Directory of Open Access Journals (Sweden)

    Djuria Suprapto

    2010-05-01

    Full Text Available Article aimed to present study on the translation of words, phrases, expressions and sentences in a children's story.The story of "The Lion King" was used as case and content analysis was applied as a reference for the study. The focus of this analysis was aimed to observe the appearance of the terms or meaning of certain words that are considered important and supports the goals of the research, in this case is the translation of words, phrases, sentences in the source language (English into the target language (Indonesian. As its criterion, dictionaries was used and rules of contextual translation was applied. It can be concluded that children's stories emphasize the style and meaning contained in it, so the translation is done must consider the context of the story. 

  1. Automatic Generation of English-Japanese Translation Pattern Utilizing Genetic Programming Technique

    Science.gov (United States)

    Matsumura, Koki; Tamekuni, Yuji; Kimura, Shuhei

    There are a lot of constructional differences in an English-Japanese phrase template, and that often makes the act of translation difficult. Moreover, there exist various and tremendous phrase templates and sentence to be refered to. It is not easy to prepare the corpus that covers the all. Therefore, it is very significant to generate the translation pattern of the sentence pattern automatically from a viewpoint of the translation success rate and the capacity of the pattern dictionary. Then, for the purpose of realizing the automatic generation of the translation pattern, this paper proposed the new method for the generation of the translation pattern by using the genetic programming technique (GP). The technique tries to generate the translation pattern of various sentences which are not registered in the phrase template dictionary automatically by giving the genetic operation to the parsing tree of a basic pattern. The tree consists of the pair of the English-Japanese sentence generated as the first stage population. The analysis tree data base with 50,100,150,200 pairs was prepared as the first stage population. And this system was applied and executed for an English input of 1,555 sentences. As a result, the analysis tree increases from 200 to 517, and the accuracy rate of the translation pattern has improved from 42.57% to 70.10%. And, 86.71% of the generated translations was successfully done, whose meanings are enough acceptable and understandable. It seemed that this proposal technique became a clue to raise the translation success rate, and to find the possibility of the reduction of the analysis tree data base.

  2. A double-sided linear primary permanent magnet vernier machine.

    Science.gov (United States)

    Du, Yi; Zou, Chunhua; Liu, Xianxing

    2015-01-01

    The purpose of this paper is to present a new double-sided linear primary permanent magnet (PM) vernier (DSLPPMV) machine, which can offer high thrust force, low detent force, and improved power factor. Both PMs and windings of the proposed machine are on the short translator, while the long stator is designed as a double-sided simple iron core with salient teeth so that it is very robust to transmit high thrust force. The key of this new machine is the introduction of double stator and the elimination of translator yoke, so that the inductance and the volume of the machine can be reduced. Hence, the proposed machine offers improved power factor and thrust force density. The electromagnetic performances of the proposed machine are analyzed including flux, no-load EMF, thrust force density, and inductance. Based on using the finite element analysis, the characteristics and performances of the proposed machine are assessed.

  3. A Double-Sided Linear Primary Permanent Magnet Vernier Machine

    Directory of Open Access Journals (Sweden)

    Yi Du

    2015-01-01

    Full Text Available The purpose of this paper is to present a new double-sided linear primary permanent magnet (PM vernier (DSLPPMV machine, which can offer high thrust force, low detent force, and improved power factor. Both PMs and windings of the proposed machine are on the short translator, while the long stator is designed as a double-sided simple iron core with salient teeth so that it is very robust to transmit high thrust force. The key of this new machine is the introduction of double stator and the elimination of translator yoke, so that the inductance and the volume of the machine can be reduced. Hence, the proposed machine offers improved power factor and thrust force density. The electromagnetic performances of the proposed machine are analyzed including flux, no-load EMF, thrust force density, and inductance. Based on using the finite element analysis, the characteristics and performances of the proposed machine are assessed.

  4. Searching to Translate and Translating to Search: When Information Retrieval Meets Machine Translation

    Science.gov (United States)

    Ture, Ferhan

    2013-01-01

    With the adoption of web services in daily life, people have access to tremendous amounts of information, beyond any human's reading and comprehension capabilities. As a result, search technologies have become a fundamental tool for accessing information. Furthermore, the web contains information in multiple languages, introducing another barrier…

  5. Adding machine and calculating machine

    Institute of Scientific and Technical Information of China (English)

    2005-01-01

    In 1642 the French mathematician Blaise Pascal(1623-1662) invented a machine;.that could add and subtract. It had.wheels that each had: 1 to 10 marked off along its circumference. When the wheel at the right, representing units, made one complete circle, it engaged the wheel to its left, represents tens, and moved it forward one notch.

  6. Polysynthesis in Hueyapan Nahuatl: The Status of Noun Phrases, Basic Word Order, and Other Concerns

    DEFF Research Database (Denmark)

    Pharao Hansen, Magnus

    2010-01-01

    This article presents data showing that the syntax of the Nahuatl dialect spoken in Hueyapan, Morelos, Mexico has traits of nonconfigurationality: free word order and free pro-drop, with predicate-initial word order being pragmatically neutral. It permits discontinuous noun phrases and has...... no naturally occurring true quantifiers, suggesting that noun phrases in Hueyapan Nahuatl are adjuncts rather than actual arguments. These findings are contrasted with those of an earlier study by Jeffrey MacSwan, who concludes that Nahuatl syntax has relatively fixed subject-verb-object word order....... It is suggested that the differences observed between the two Nahuatl varieties may be a result of methodological problems in MacSwan's collection of data, skewing it in the direction of a more rigid syntax....

  7. Evolution of the most common English words and phrases over the centuries

    CERN Document Server

    Perc, Matjaz

    2012-01-01

    By determining which were the most common English words and phrases since the beginning of the 16th century, we obtain a unique large-scale view of the evolution of written text. We find that the most common words and phrases in any given year had a much shorter popularity lifespan in the 16th than they had in the 20th century. By measuring how their usage propagated across the years, we show that for the past two centuries the process has been governed by linear preferential attachment. Along with the steady growth of the English lexicon, this provides an empirical explanation for the ubiquity of the Zipf's law in language statistics and confirms that writing, although undoubtedly an expression of art and skill, is not immune to the same influences of self-organization that are known to regulate processes as diverse as the making of new friends and World Wide Web growth.

  8. What Constitutes a Phrase in Sound-Based Music? A Mixed-Methods Investigation of Perception and Acoustics

    Science.gov (United States)

    Olsen, Kirk N.; Dean, Roger T.; Leung, Yvonne

    2016-01-01

    Phrasing facilitates the organization of auditory information and is central to speech and music. Not surprisingly, aspects of changing intensity, rhythm, and pitch are key determinants of musical phrases and their boundaries in instrumental note-based music. Different kinds of speech (such as tone- vs. stress-languages) share these features in different proportions and form an instructive comparison. However, little is known about whether or how musical phrasing is perceived in sound-based music, where the basic musical unit from which a piece is created is commonly non-instrumental continuous sounds, rather than instrumental discontinuous notes. This issue forms the target of the present paper. Twenty participants (17 untrained in music) were presented with six stimuli derived from sound-based music, note-based music, and environmental sound. Their task was to indicate each occurrence of a perceived phrase and qualitatively describe key characteristics of the stimulus associated with each phrase response. It was hypothesized that sound-based music does elicit phrase perception, and that this is primarily associated with temporal changes in intensity and timbre, rather than rhythm and pitch. Results supported this hypothesis. Qualitative analysis of participant descriptions showed that for sound-based music, the majority of perceived phrases were associated with intensity or timbral change. For the note-based piano piece, rhythm was the main theme associated with perceived musical phrasing. We modeled the occurrence in time of perceived musical phrases with recurrent event ‘hazard’ analyses using time-series data representing acoustic predictors associated with intensity, spectral flatness, and rhythmic density. Acoustic intensity and timbre (represented here by spectral flatness) were strong predictors of perceived musical phrasing in sound-based music, and rhythm was only predictive for the piano piece. A further analysis including five additional spectral

  9. What Constitutes a Phrase in Sound-Based Music? A Mixed-Methods Investigation of Perception and Acoustics.

    Science.gov (United States)

    Olsen, Kirk N; Dean, Roger T; Leung, Yvonne

    2016-01-01

    Phrasing facilitates the organization of auditory information and is central to speech and music. Not surprisingly, aspects of changing intensity, rhythm, and pitch are key determinants of musical phrases and their boundaries in instrumental note-based music. Different kinds of speech (such as tone- vs. stress-languages) share these features in different proportions and form an instructive comparison. However, little is known about whether or how musical phrasing is perceived in sound-based music, where the basic musical unit from which a piece is created is commonly non-instrumental continuous sounds, rather than instrumental discontinuous notes. This issue forms the target of the present paper. Twenty participants (17 untrained in music) were presented with six stimuli derived from sound-based music, note-based music, and environmental sound. Their task was to indicate each occurrence of a perceived phrase and qualitatively describe key characteristics of the stimulus associated with each phrase response. It was hypothesized that sound-based music does elicit phrase perception, and that this is primarily associated with temporal changes in intensity and timbre, rather than rhythm and pitch. Results supported this hypothesis. Qualitative analysis of participant descriptions showed that for sound-based music, the majority of perceived phrases were associated with intensity or timbral change. For the note-based piano piece, rhythm was the main theme associated with perceived musical phrasing. We modeled the occurrence in time of perceived musical phrases with recurrent event 'hazard' analyses using time-series data representing acoustic predictors associated with intensity, spectral flatness, and rhythmic density. Acoustic intensity and timbre (represented here by spectral flatness) were strong predictors of perceived musical phrasing in sound-based music, and rhythm was only predictive for the piano piece. A further analysis including five additional spectral

  10. Translator's preface.

    Science.gov (United States)

    Lamiell, James T

    2013-08-01

    Presents a preface from James T. Lamiell, who translates Wilhelm Wundt's Psychology's Struggle for Existence (Die Psychologie im Kampf ums Dasein), in which Wundt advised against the impending divorce of psychology from philosophy, into English. Lamiell comments that more than a decade into the 21st century, it appears that very few psychologists have any interest at all in work at the interface of psychology and philosophy. He notes that one clear indication of this is that the Society for Theoretical and Philosophical Psychology, which is Division 24 of the American Psychological Association (APA), remains one of the smallest of the APA's nearly 60 divisions. (PsycINFO Database Record (c) 2013 APA, all rights reserved).

  11. Symmetry and phrase definition: Ginastera’s First Piano Sonata, op. 22, II mov.

    OpenAIRE

    Carballo, Erick

    2013-01-01

    The article evaluates the usefulness of the twelve-tone analytical approach when applied to the second movement of Alberto Ginastera’s first Piano Sonata, Op. 22 (1952). Based on the results from this evaluation, it postulates a new analytical approach grounded on two main principles. The first principle is the recognition of symmetrical pitch-class sets and their rapport with the establishment of phrase structures. The second principle deals with the combination of tonal and atonal gestures ...

  12. Polysynthesis in Hueyapan Nahuatl: The Status of Noun Phrases, Basic Word Order, and Other Concerns

    DEFF Research Database (Denmark)

    Pharao Hansen, Magnus

    2010-01-01

    This article presents data showing that the syntax of the Nahuatl dialect spoken in Hueyapan, Morelos, Mexico has traits of nonconfigurationality: free word order and free pro-drop, with predicate-initial word order being pragmatically neutral. It permits discontinuous noun phrases and has....... It is suggested that the differences observed between the two Nahuatl varieties may be a result of methodological problems in MacSwan's collection of data, skewing it in the direction of a more rigid syntax....

  13. Automatic extraction of syntactic patterns for dependency parsing in noun phrase chunks

    Directory of Open Access Journals (Sweden)

    Mihaela Colhon

    2014-05-01

    Full Text Available In this article we present a method for automatic extraction of syntactic patterns that are used to develop a dependency parsing method. The patterns have been extracted from a corpus automatically annotated for tokens, sentences’ borders, parts of speech and noun phrases, and manually annotated for dependency relations between words. The evaluation shows promising results in the case of an order-free language.

  14. Bidirectional Extraction of Phrases for Expanding Queries in Academic Paper Retrieval

    Directory of Open Access Journals (Sweden)

    Yuzana Win

    2016-01-01

    Full Text Available This paper proposes a new method for query expansion based on bidirectional extraction of phrases as word n-grams from research paper titles. The proposed method aims to extract information relevant to users’ needs and interests and thus to provide a useful system for technical paper retrieval. The outcome of proposed method are the trigrams as phrases that can be used for query expansion. First, word trigrams are extracted from research paper titles. Second, a co-occurrence graph of the extracted trigrams is constructed. To construct the co-occurrence graph, the direction of edges is considered in two ways: forward and reverse. In the forward and reverse co-occurrence graphs, the trigrams point to other trigrams appearing after and before them in a paper title, respectively. Third, Jaccard similarity is computed between trigrams as the weight of the graph edge. Fourth, the weighted version of PageRank is applied. Consequently, the following two types of phrases can be obtained as the trigrams associated with the higher PageRank scores. The trigrams of the one type, which are obtained from the forward co-occurrence graph, can form a more specific query when users add a technical word or words before them. Those of the other type, obtained from the reverse co-occurrence graph, can form a more specific query when users add a technical word or words after them. The extraction of phrases is evaluated as additional features in the paper title classification task using SVM. The experimental results show that the classification accuracy is improved than the accuracy achieved when the standard TF-IDF text features are only used. Moreover, the trigrams extracted by the proposed method can be utilized to expand query words in research paper retrieval.

  15. Genesis machines

    CERN Document Server

    Amos, Martyn

    2014-01-01

    Silicon chips are out. Today's scientists are using real, wet, squishy, living biology to build the next generation of computers. Cells, gels and DNA strands are the 'wetware' of the twenty-first century. Much smaller and more intelligent, these organic computers open up revolutionary possibilities. Tracing the history of computing and revealing a brave new world to come, Genesis Machines describes how this new technology will change the way we think not just about computers - but about life itself.

  16. On Feminist Translation Theory and Literary Translation

    Institute of Scientific and Technical Information of China (English)

    狄东睿

    2015-01-01

    Feminist translation theory emerged in the late 70s and early 80s of 20th Century. It is the combination of the feminist movement and the“cultural turn”of translation. It was introduced to China in the 1980s, and with the development of the transla⁃tion theory and translation practice, more and more Chinese translators want to study the feminist translation theory from the deep levels.

  17. Children with autism spectrum disorders who do not develop phrase speech in the preschool years.

    Science.gov (United States)

    Norrelgen, Fritjof; Fernell, Elisabeth; Eriksson, Mats; Hedvall, Åsa; Persson, Clara; Sjölin, Maria; Gillberg, Christopher; Kjellmer, Liselotte

    2015-11-01

    There is uncertainty about the proportion of children with autism spectrum disorders who do not develop phrase speech during the preschool years. The main purpose of this study was to examine this ratio in a population-based community sample of children. The cohort consisted of 165 children (141 boys, 24 girls) with autism spectrum disorders aged 4-6 years followed longitudinally over 2 years during which time they had received intervention at a specialized autism center. In this study, data collected at the 2-year follow-up were used. Three categories of expressive language were defined: nonverbal, minimally verbal, and phrase speech. Data from the Vineland Adaptive Behavior Scales-II were used to classify expressive language. A secondary objective of the study was to analyze factors that might be linked to verbal ability, namely, child age, cognitive level, autism subtype and severity of core autism symptoms, developmental regression, epilepsy or other medical conditions, and intensity of intervention. The proportion of children who met the criteria for nonverbal, minimally verbal, and phrase speech were 15%, 10%, and 75%, respectively. The single most important factor linked to expressive language was the child's cognitive level, and all children classified as being nonverbal or minimally verbal had intellectual disability. © The Author(s) 2014.

  18. Using phrases and document metadata to improve topic modeling of clinical reports.

    Science.gov (United States)

    Speier, William; Ong, Michael K; Arnold, Corey W

    2016-06-01

    Probabilistic topic models provide an unsupervised method for analyzing unstructured text, which have the potential to be integrated into clinical automatic summarization systems. Clinical documents are accompanied by metadata in a patient's medical history and frequently contains multiword concepts that can be valuable for accurately interpreting the included text. While existing methods have attempted to address these problems individually, we present a unified model for free-text clinical documents that integrates contextual patient- and document-level data, and discovers multi-word concepts. In the proposed model, phrases are represented by chained n-grams and a Dirichlet hyper-parameter is weighted by both document-level and patient-level context. This method and three other Latent Dirichlet allocation models were fit to a large collection of clinical reports. Examples of resulting topics demonstrate the results of the new model and the quality of the representations are evaluated using empirical log likelihood. The proposed model was able to create informative prior probabilities based on patient and document information, and captured phrases that represented various clinical concepts. The representation using the proposed model had a significantly higher empirical log likelihood than the compared methods. Integrating document metadata and capturing phrases in clinical text greatly improves the topic representation of clinical documents. The resulting clinically informative topics may effectively serve as the basis for an automatic summarization system for clinical reports. Copyright © 2016 Elsevier Inc. All rights reserved.

  19. English Words and Phrases in Croatian: A Small-Scale Study of Language Awareness and Attitudes

    Directory of Open Access Journals (Sweden)

    Marija Perić

    2015-12-01

    Full Text Available The focus of this paper is on language attitudes towards English words and phrases in the Croatian language. In order to prevent loanwords, linguistic purism has arisen as a theory about what languages should be like. The tradition of linguistic purism in Croatia has been shaped by various socio-historical factors. English may be viewed as a language of opportunity, or as a threat to the survival of other, usually minority and endangered, languages. In order to provide an insight into the use of English words and phrases in the Croatian context, a questionnaire about language attitudes and awareness was conducted on 534 participants. The aim of the questionnaire was to determine participants’ language attitudes and familiarity with English words and phrases. The results show that although people in Croatia generally like English, many of them are not familiar with English words, especially older participants and those with little or no knowledge of the English language. Moreover, the results indicate that the younger generation is more inclined towards English than the older generation; however, they are not as familiar with Croatian equivalents as they claim.

  20. ERP responses to processing prosodic phrasing of sentences in amplitude modulated noise.

    Science.gov (United States)

    Carroll, Rebecca; Ruigendijk, Esther

    2016-02-01

    Intonation phrase boundaries (IPBs) were hypothesized to be especially difficult to process in the presence of an amplitude modulated noise masker because of a potential rhythmic competition. In an event-related potential study, IPBs were presented in silence, stationary, and amplitude modulated noise. We elicited centro-parietal Closure Positive Shifts (CPS) in 23 young adults with normal hearing at IPBs in all acoustic conditions, albeit with some differences. CPS peak amplitudes were highest in stationary noise, followed by modulated noise, and lowest in silence. Both noise types elicited CPS delays, slightly more so in stationary compared to amplitude modulated noise. These data suggest that amplitude modulation is not tantamount to a rhythmic competitor for prosodic phrasing but rather supports an assumed speech perception benefit due to local release from masking. The duration of CPS time windows was, however, not only longer in noise compared to silence, but also longer for amplitude modulated compared to stationary noise. This is interpreted as support for additional processing load associated with amplitude modulation for the CPS component. Taken together, processing prosodic phrasing of sentences in amplitude modulated noise seems to involve the same issues that have been observed for the perception and processing of segmental information that are related to lexical items presented in noise: a benefit from local release from masking, even for prosodic cues, and a detrimental additional processing load that is associated with either stream segregation or signal reconstruction.

  1. Translating into Free Word Order Languages

    CERN Document Server

    Hoffman, B

    1996-01-01

    In this paper, I discuss machine translation of English text into Turkish, a relatively ``free'' word order language. I present algorithms that determine the topic and the focus of each target sentence (using salience (Centering Theory), old vs. new information, and contrastiveness in the discourse model) in order to generate the contextually appropriate word orders in the target language.

  2. Using Bilingual Parallel Corpora in Translation Memory Systems

    Directory of Open Access Journals (Sweden)

    Hossein Keshtkar

    2012-09-01

    Full Text Available Automatic word alignment techniques commonly used in Translation Memory systems tend basically to work at single word level where there is a one to one correspondence between words in subsequences of the two languages. This, results in not being able to fully use subsentential repetitions like clauses, phrases and expressions. In this paper, using spaces between words, a search method named "space-based reduction search" is introduced. The main goal is to maximize the use of parallel corpus resources. We want to show that this search method can significantly enhance the chance of finding matches for subsequences of input sentences; hence applicable in a Sub-Sentential Translation Memory (SSTM system without running automatic alignment tools. Keywords: Sub-Sentential Translation Memory, Parallel corpus, Alignment

  3. Heavy noun phrase constructions in the Afrikaans novel 'n Ander Land: a cognitive account of a stylistic feature

    National Research Council Canada - National Science Library

    Beard, Luna

    2005-01-01

    This article focusses on one grammatical construction, namely heavy noun phrase (NP) constructions in Afrikaans, and more specifically on those instances encountered in the Afrikaans novel 'n Ander Land by Karel Schoeman...

  4. Of Mice Moths and Men Machines

    Directory of Open Access Journals (Sweden)

    Susan Schuppli

    2008-10-01

    Full Text Available In 1947, Grace Murray Hopper a pioneer in early computing made an unusual entry into her daily logbook: lsquo;Relay #70 Panel F (moth in relay. First actual case of bug being found.rsquo; Accompanying this entry is an actual celluloid tape encrusted bug, or more specifically a moth, fastened to the page of the logbook. According to Hopper, one of the technicians in her team solved a glitch in the emHarvard Mark II/em computer by pulling an actual insect out from between the contacts of one of its relays. Word soon went out that they had lsquo;debugged the machinersquo; and the phrase quickly entered our lexicon. After languishing for years this mythic moth was eventually transported to the emSmithsonian/em where it now lies in archival state. The mothrsquo;s dynamic vitality had introduced a kind of surplus or aberrant code into the machine, which in effect pushed the machine towards a state of chaos and breakdown. Its failure to act as desired, to perform the coding sequences of its programmed history suggests that even a seemingly inert or lifeless machine can become lsquo;more and other than its historyrsquo;. (Elizabeth Grosz, 2005 Hopperrsquo;s bug is thus a material witness to the creative co-evolution of the machine with the living matter of the moth. Moreover, as a cipher for machinic defect the bug reminds us that mutations are in fact necessary for systems to change and evolve. The crisis introduced into a biological system or machine through the virulence of the bug is terminal only to the extent that it becomes the source for another kind of order, another kind of interaction. This is used as a casenbsp;study to argue that chaos is not only an animating force in the constitution of new systems but is necessary for the evolution of difference.

  5. Simulating Turing machines on Maurer machines

    NARCIS (Netherlands)

    Bergstra, J.A.; Middelburg, C.A.

    2008-01-01

    In a previous paper, we used Maurer machines to model and analyse micro-architectures. In the current paper, we investigate the connections between Turing machines and Maurer machines with the purpose to gain an insight into computability issues relating to Maurer machines. We introduce ways to

  6. Reference in English-Arabic Translation

    DEFF Research Database (Denmark)

    Ehrensvärd, Martin Gustaf

    2007-01-01

    The use of referring expressions (proper nouns, noun phrases [NPs] and pronouns) in a target language [TL] text may differ from the source language [SL] text with regard to how explicit they are: One may have a pronoun (less explicit) where the other has a proper noun (more explicit), or one may...... have an indefinite NP (less explicit) where the other has a definite NP (more explicit). But are these differences in any way systematic? In an article in Babel, Yowell Y. Aziz shows that remarkable differences obtain in the explicitness of various referring expressions when an Arabic text is compared...... to its English translation and that overall, the English text is the more explicit. Now, does this mean that English generally is more explicit in its use of referring expressions, or is the difference due, rather, to the need of the TL text to be more explicit in order to facilitate the understanding...

  7. Traduccion automatica mediante el ordenador (Automatic Translation Using a Computer).

    Science.gov (United States)

    Bueno, Julian L.

    This report on machine translation contains a brief history of the field; a description of the processes involved; a discussion of systems currently in use, including three software packages on the market (Teaching Assistant, Translate, and Globalink); reflections on implications for teaching; observations of results obtained when elements of…

  8. Environmentally Friendly Machining

    CERN Document Server

    Dixit, U S; Davim, J Paulo

    2012-01-01

    Environment-Friendly Machining provides an in-depth overview of environmentally-friendly machining processes, covering numerous different types of machining in order to identify which practice is the most environmentally sustainable. The book discusses three systems at length: machining with minimal cutting fluid, air-cooled machining and dry machining. Also covered is a way to conserve energy during machining processes, along with useful data and detailed descriptions for developing and utilizing the most efficient modern machining tools. Researchers and engineers looking for sustainable machining solutions will find Environment-Friendly Machining to be a useful volume.

  9. Korean to English Translation Using Synchronous TAGs

    CERN Document Server

    Egedi, D; Park, H S; Joshi, A K; Egedi, Dania; Palmer, Martha; Park, Hyun S.; Joshi, Aravind K.

    1994-01-01

    It is often argued that accurate machine translation requires reference to contextual knowledge for the correct treatment of linguistic phenomena such as dropped arguments and accurate lexical selection. One of the historical arguments in favor of the interlingua approach has been that, since it revolves around a deep semantic representation, it is better able to handle the types of linguistic phenomena that are seen as requiring a knowledge-based approach. In this paper we present an alternative approach, exemplified by a prototype system for machine translation of English and Korean which is implemented in Synchronous TAGs. This approach is essentially transfer based, and uses semantic feature unification for accurate lexical selection of polysemous verbs. The same semantic features, when combined with a discourse model which stores previously mentioned entities, can also be used for the recovery of topicalized arguments. In this paper we concentrate on the translation of Korean to English.

  10. A Comparison of Different Machine Transliteration Models

    CERN Document Server

    Choi, K; Oh, J; 10.1613/jair.1999

    2011-01-01

    Machine transliteration is a method for automatically converting words in one language into phonetically equivalent ones in another language. Machine transliteration plays an important role in natural language applications such as information retrieval and machine translation, especially for handling proper nouns and technical terms. Four machine transliteration models -- grapheme-based transliteration model, phoneme-based transliteration model, hybrid transliteration model, and correspondence-based transliteration model -- have been proposed by several researchers. To date, however, there has been little research on a framework in which multiple transliteration models can operate simultaneously. Furthermore, there has been no comparison of the four models within the same framework and using the same data. We addressed these problems by 1) modeling the four models within the same framework, 2) comparing them under the same conditions, and 3) developing a way to improve machine transliteration through this com...

  11. What can children tell us about the fixedness of idiomatic phrases? The psycholinguistic relevance of a linguistic model

    Directory of Open Access Journals (Sweden)

    Kleopatra Diakogiorgi

    2011-12-01

    Full Text Available The present paper contains two distinct yet intricately linked studies concerning idiomatic phrases in Greek: a linguistic study and a psycholinguistic study. The linguistic study aims at investigating the degree of fixedness of idiomatic phrases, through a semantic-lexical categorization of 470 phrases with fixed subject. Three phrase categories were distinguished: typical phrases characterized by a strong semantic-lexical articulation between their constituents, and non-typical phrases, in which the constituents have a certain semantic autonomy. Non-typical phrases were further categorized into a quasi-phrases and b conventionalized phrases, the constituents of which have the highest degree of semantic autonomy. Α linguistic model was developed named “graded fixedness model”, and the psycholinguistic relevance of the model assessed by examining idiom comprehension in Greek elementary school children, aged 7.5 and 9.5 years old. To achieve this, a psycholinguistic study was then carried out. Children were presented with 8 typical, 8 quasi and 8 conventionalized phrases. For each phrase, they were asked to choose the one they thought correct out of three interpretations proposed (idiomatic, literal and other. The research findings presented and discussed herein provide evidence supporting the psychological reality of the notion of semantic autonomy even for the younger children’s processing which was overall quite poor: thus, the greater the semantic autonomy of a phrase’s constituents, the easier the access to its idiomatic meaning.Le présent article comporte deux études distinctes mais étroitement liées concernant les phrases idomatiques en grec : une étude linguistique et une étude psycholinguistique. L’étude linguistique vise à examiner le degré de figement des phrases idiomatiques, à travers une catégorisation sémantico-lexicale de 470 phrases à sujet fixe. Trois catégories de phrases ont été distingu

  12. Terra: a Collection of Translation Error-Annotated Corpora

    OpenAIRE

    Popović, Maja; Fishel, Mark; Bojar, Ondřej

    2012-01-01

    Recently the first methods of automatic diagnostics of machine translation have emerged; since this area of research is relatively young, the efforts are not coordinated. We present a collection of translation error-annotated corpora, consisting of automatically produced trans- lations and their detailed manual translation error analysis. Using the collected corpora we evaluate the available state-of-the-art methods of MT diagnostics and assess, how well the methods perform, how they...

  13. Deconstructive Reconsideration of Translation

    Institute of Scientific and Technical Information of China (English)

    刘艳清

    2014-01-01

    This paper is on the deconstructive reconsideration of the definition of translation, the relation of source text and translation, as well as the relation of translator and author. It also points out the differences between tranditioanal concepts of translation and deconstructive reconsideration of translation.

  14. On Becoming a Translator

    Institute of Scientific and Technical Information of China (English)

    王莹

    2015-01-01

    Douglas Robinson's Becoming a Translator:An Introduction to the Theory and Practice of Translation has been well received around the world.Fusing translation theory with advice and information about the practicalities of translating,it is an essential resource for novice and practicing translators.

  15. Translator Studies in Retrospect

    Institute of Scientific and Technical Information of China (English)

    郭勤

    2009-01-01

    The concern about translator has run for thousands of years on which many translators and translation scholars have made comments.A rough examination of the translator within translation studies at home and abroad is expounded in a loosely chronological way as follows.

  16. On Becoming a Translator

    Institute of Scientific and Technical Information of China (English)

    王莹

    2015-01-01

    Douglas Robinson’s Becoming a Translator:An Introduction to the Theory and Practice of Translation has been well received around the world.Fusing translation theory with advice and information about the practicalities of translating,it is an essential resource for novice and practicing translators.

  17. Speaking your Translation

    DEFF Research Database (Denmark)

    Dragsted, Barbara; Mees, Inger M.; Gorm Hansen, Inge

    2011-01-01

    In this article we discuss the translation processes and products of 14 MA students who produced translations from Danish (L1) into English (L2) under different working conditions: (1) written translation, (2) sight translation, and (3) sight translation with a speech recognition (SR) tool. Audio...

  18. Equivalence in Translation

    Institute of Scientific and Technical Information of China (English)

    李良杰

    2013-01-01

    There are many researches about translation theories and methods in western translation history. Equivalence in transla⁃tion has always been the central issue for discussion. This paper gives a general review and comment on equivalence in translation in terms of three representative translation theorists and their views about equivalence in translation.

  19. MARKS OF ETHNICITY IN PURPLE HIBISCUS TRANSLATION

    Directory of Open Access Journals (Sweden)

    Fernanda de Oliveira Müller

    2016-10-01

    Full Text Available This study gives an analysis of the English – Brazilian Portuguese translation of Chimamanda Ngozi Adichie’s novel, Purple Hibiscus, made by Julia Romeu. It is an attempt to analyze how traces of ethnic identities marked in the source text are reproduced in the Brazilian version Hibisco Roxo, published in 2011. Initially, is a brief biography of the writer is presented together with her history towards the construction of a new paradigm for the literature about Africa and Nigeria. Adichie challenges Western stereotypes about that continent, which tend to report poverty, war and disease scenarios. Secondly, a summary of the story was made and the main characters were described. Thirdly, a collection of recorded words and phrases in the Igbo language was compiled from the original text and an analysis of the translation of those terms into Brazilian Portuguese was performed. Afterwards, the concept of ethnicity described by the sociologist Anthony Giddens was presented. Based on that concept, it was concluded that the terms previously selected could be considered as marks of ethnicity, reflecting the presence of the Igbo ethnic group in the British colonial culture. Finally, taking Antoine Berman’s proposition for an ethical translation, which embraces the foreign and rejects ethnocentrism, the conclusion to be drawn is that the translator’s option to keep Igbo terms in her work respected the author’s manifest intention of, through her work, showing the readers from other countries a bit of Nigeria’s culture and history.

  20. Machine Protection

    CERN Document Server

    Schmidt, R

    2014-01-01

    The protection of accelerator equipment is as old as accelerator technology and was for many years related to high-power equipment. Examples are the protection of powering equipment from overheating (magnets, power converters, high-current cables), of superconducting magnets from damage after a quench and of klystrons. The protection of equipment from beam accidents is more recent. It is related to the increasing beam power of high-power proton accelerators such as ISIS, SNS, ESS and the PSI cyclotron, to the emission of synchrotron light by electron–positron accelerators and FELs, and to the increase of energy stored in the beam (in particular for hadron colliders such as LHC). Designing a machine protection system requires an excellent understanding of accelerator physics and operation to anticipate possible failures that could lead to damage. Machine protection includes beam and equipment monitoring, a system to safely stop beam operation (e.g. dumping the beam or stopping the beam at low energy) and an ...

  1. Literal Translation using Google Translate in Translating the Text from French to English in Digital Tourism Brochure “Bienvenue À Paris”

    Directory of Open Access Journals (Sweden)

    Rila Hilma

    2011-05-01

    Full Text Available Translation is basically change of form. The form from which the translation is made will be called the source language and the form into which it is to be changed will be called the receptor language. Translation consists of transferring the meaning of the source language into the receptor language. Translating is not an easy job to do because many things to be considered to do this activity because translation means determining the meaning of a text, then reconstructing this same meaning using the appropriate structure and form in the receptor language. Translation is basically divided by two types of translation, one is literal and the other is idiomatic. Literal translation is really strict to the structure and form then often can not well express the true meaning of source language. Idiomatic translation makes every effort to communicate the meaning of the source language text in the natural forms of the receptor language. Then the most popular translation machine, Google Translate, in this study shows the results of translation which remain odd, unnatural, and nonsensical because the unsuccessful of message delivery, which is notably the typically error of literal translation.

  2. Analysis of machining and machine tools

    CERN Document Server

    Liang, Steven Y

    2016-01-01

    This book delivers the fundamental science and mechanics of machining and machine tools by presenting systematic and quantitative knowledge in the form of process mechanics and physics. It gives readers a solid command of machining science and engineering, and familiarizes them with the geometry and functionality requirements of creating parts and components in today’s markets. The authors address traditional machining topics, such as: single and multiple point cutting processes grinding components accuracy and metrology shear stress in cutting cutting temperature and analysis chatter They also address non-traditional machining, such as: electrical discharge machining electrochemical machining laser and electron beam machining A chapter on biomedical machining is also included. This book is appropriate for advanced undergraduate and graduate mechani cal engineering students, manufacturing engineers, and researchers. Each chapter contains examples, exercises and their solutions, and homework problems that re...

  3. Translation Theory and Translation Studies in China

    Science.gov (United States)

    Guo, Qin

    2012-01-01

    This dissertation is a comparative study of "translation theory" and "translation studies" in China and the West. Its focus is to investigate whether there is translation theory in the Chinese tradition. My study begins with an examination of the debate in China over whether there has already existed a system of translation…

  4. Identité stricte ou partielle et identification dans les phrases à copule. Comment les identifier ?

    Directory of Open Access Journals (Sweden)

    Amary-Coudreau Valérie

    2014-07-01

    Full Text Available Le terme d’identité est utilisé dans de nombreux domaines et notre étude pose la question de ce que peut être une identité en langue et plus particulièrement dans le traitement des phrases à copule. En s’appuyant sur la classification proposé par Higgins (1973, on se propose de distinguer l’identité de la spécification et de l’identification à l’aide de différents outils concernant deux problématiques : qu’elle peut être la forme d’une identité pour une phrase à copule et quel type de définition peut-on lui donner ? Pour ce qui concerne la forme, les phrases données comme des identités et traitées par les logiciens et philosophes en termes de vrai ou faux ne correspondent pas à ce qu’on peut supposer être des identités en langue : par exemple, en français, une phrase comme « Cicéron est Tullius » n’est pas valide et doit comporter c’est à la place de est (comme d’ailleurs les phrases spécificationnelles et identificationnelles pour ne pas paraître artificielle. On peut trouver une justification à cela dans la dichotomie entre prédicationnelles et équatives proposée par Heycock & Kroch (1999. Les équatives regroupent la spécification, l’identification et l’identité, mais ces deux dernières sous-catégories ne sont pas clairement distinguées dans la plupart des travaux sur le sujet. Pour ce qui concerne les définitions, l’identité mathématique, d’une part, et sur la Théorie des Opérations Enonciatives (cf. Culioli & Desclès, 1981, parmi d’autres références, d’autre part, ne permettent pas de lever totalement le voile sur cette problématique. On pose donc une hypothèse où la forme et l’interprétation sont liées : l’identité se présente sous la forme d’une phrase en X c’est Y où les deux termes X et Y possèdent exactement la même structure interne, et par conséquent le même degré de référentialité. Ceci nous permet de d’envisager les caract

  5. Inter-subject variability modulates phonological advance planning in the production of adjective-noun phrases

    Directory of Open Access Journals (Sweden)

    Violaine eMichel Lange

    2014-01-01

    Full Text Available The literature on advance phonological planning in adjective-noun phrases (NPs presents diverging results: while many experimental studies suggest that the entire NP is encoded before articulation, other results favour a span of encoding limited to the first word. Although cross-linguistic differences in the structure of adjective-noun phrases may account for some of these contrasting results, divergences have been reported even among similar languages and syntactic structures. Here we examined whether inter-individual differences account for variability in the span of phonological planning in the production of French NPs, where previous results indicated encoding limited to the first word. The span of phonological encoding is tested with the picture-word interference (PWI paradigm using phonological distractors related to the noun or to the adjective of the NPs. In Experiment 1, phonological priming effects were limited to the first word in adjective NPs whichever the position of the adjective (pre-nominal or post-nominal. Crucially, phonological priming effects on the second word interacted with speakers´ production speed suggesting different encoding strategies for participants. In Experiment 2, we tested this hypothesis further with a larger group of participants. Results clearly showed that slow and fast initialising participants presented different phonological priming patterns on the last element of adjective-noun phrases: while the first word was primed by a distractor for all speakers, only the slow speaker group presented a priming effect on the second element of the NP. These results show that the span of phonological encoding is modulated by inter-individual strategies: in experimental paradigms some speakers plan word by word whereas others encode beyond the initial word. We suggest that the diverging results reported in the literature on advance phonological planning may partly be reconciled in light of the present results.

  6. LA FRASE NOMINAL NGÄBÉRE The Nominal Phrase Ngäbére

    Directory of Open Access Journals (Sweden)

    José Manuel Murillo Miranda

    2009-12-01

    Full Text Available Este trabajo es el avance de una amplia investigación cuyo propósito es elaborar una gramática descriptiva de la lengua chibcha ngäbére. El artículo presenta las características particulares más relevantes de la frase nominal de dicha variante dialectal (hablada en Costa Rica, también en Panamá, y establece la relación con algunas de las características generales de otras lenguas chibchas ístmicas. El primer apartado del escrito es una introducción a la lengua ngäbére y a sus hablantes; el segundo presenta la morfología de los sustantivos, pronombres y otras clases de palabras que conforman la frase nominal; el tercer apartado trata de la sintaxis de la frase nominal y esboza, de manera sucinta, las relaciones gramaticales; el cuarto apartado es una conclusión valorativa de este trabajo.This work is a preview of a broader study whose goal is to create a descriptive grammar of the Chibchan language, Ngäbere. The article describes the most relevant features of the dialect's nominal phrase (spoken in Costa Rica and Panama and establishes the relationship with some of the general features of other Isthmian Chibchan languages. The first part of the report is an introduction to Ngäbere and it speakers; the second, describes the morphology of the nouns, pronouns, and other words that make up the nominal phrase; the third, is about the syntax of the nominal phrase and it outlines in a succinct manner, grammatical relationships; the last part is a critical conclusion of this work.

  7. Generating descriptive visual words and visual phrases for large-scale image applications.

    Science.gov (United States)

    Zhang, Shiliang; Tian, Qi; Hua, Gang; Huang, Qingming; Gao, Wen

    2011-09-01

    Bag-of-visual Words (BoWs) representation has been applied for various problems in the fields of multimedia and computer vision. The basic idea is to represent images as visual documents composed of repeatable and distinctive visual elements, which are comparable to the text words. Notwithstanding its great success and wide adoption, visual vocabulary created from single-image local descriptors is often shown to be not as effective as desired. In this paper, descriptive visual words (DVWs) and descriptive visual phrases (DVPs) are proposed as the visual correspondences to text words and phrases, where visual phrases refer to the frequently co-occurring visual word pairs. Since images are the carriers of visual objects and scenes, a descriptive visual element set can be composed by the visual words and their combinations which are effective in representing certain visual objects or scenes. Based on this idea, a general framework is proposed for generating DVWs and DVPs for image applications. In a large-scale image database containing 1506 object and scene categories, the visual words and visual word pairs descriptive to certain objects or scenes are identified and collected as the DVWs and DVPs. Experiments show that the DVWs and DVPs are informative and descriptive and, thus, are more comparable with the text words than the classic visual words. We apply the identified DVWs and DVPs in several applications including large-scale near-duplicated image retrieval, image search re-ranking, and object recognition. The combination of DVW and DVP performs better than the state of the art in large-scale near-duplicated image retrieval in terms of accuracy, efficiency and memory consumption. The proposed image search re-ranking algorithm: DWPRank outperforms the state-of-the-art algorithm by 12.4% in mean average precision and about 11 times faster in efficiency.

  8. U.S. Army Research Laboratory (ARL) Corporate Dari Document Transcription and Translation Guidelines

    Science.gov (United States)

    2012-10-01

    text file format. 15. SUBJECT TERMS Transcription, Translation, guidelines, ground truth, Optical character recognition, OCR, Machine Translation, MT...foreign language into a target language in order to train, test, and evaluate optical character recognition (OCR) and machine translation (MT) embedded...the final word in this sequence). 5. Spacing: When printed words give the illusion of having a space in between letters in the word due to

  9. Phrase-level speech simulation with an airway modulation model of speech production.

    Science.gov (United States)

    Story, Brad H

    2013-06-01

    Artificial talkers and speech synthesis systems have long been used as a means of understanding both speech production and speech perception. The development of an airway modulation model is described that simulates the time-varying changes of the glottis and vocal tract, as well as acoustic wave propagation, during speech production. The result is a type of artificial talker that can be used to study various aspects of how sound is generated by humans and how that sound is perceived by a listener. The primary components of the model are introduced and simulation of words and phrases are demonstrated.

  10. Minimal Interference from Possessor Phrases in the Production of Subject-Verb Agreement.

    Science.gov (United States)

    Nicol, Janet L; Barss, Andrew; Barker, Jason E

    2016-01-01

    We explore the language production process by eliciting subject-verb agreement errors. Participants were asked to create complete sentences from sentence beginnings such as The elf's/elves' house with the tiny window/windows and The statue in the elf's/elves' gardens. These are subject noun phrases containing a head noun and controller of agreement (statue) and two nonheads, a "local noun" (window(s)/garden(s)), and a possessor noun (elf's/elves'). Past research has shown that a plural nonhead noun (an "attractor") within a subject noun phrase triggers the production of verb agreement errors, and further, that the nearer the attractor to the head noun, the greater the interference. This effect can be interpreted in terms of relative hierarchical distance from the head noun, or via a processing window account, which claims that during production, there is a window in which the head and modifying material may be co-active, and an attractor must be active at the same time as the head to give rise to errors. Using possessors attached at different heights within the same window, we are able to empirically distinguish these accounts. Possessors also allow us to explore two additional issues. First, case marking of local nouns has been shown to reduce agreement errors in languages with "rich" inflectional systems, and we explore whether English speakers attend to case. Secondly, formal syntactic analyses differ regarding the structural position of the possessive marker, and we distinguish them empirically with the relative magnitude of errors produced by possessors and local nouns. Our results show that, across the board, plural possessors are significantly less disruptive to the agreement process than plural local nouns. Proximity to the head noun matters: a possessor directly modifying the head noun induce a significant number of errors, but a possessor within a modifying prepositional phrase did not, though the local noun did. These findings suggest that proximity to a

  11. Speaking your Translation

    DEFF Research Database (Denmark)

    Dragsted, Barbara; Mees, Inger M.; Gorm Hansen, Inge

    2011-01-01

    In this article we discuss the translation processes and products of 14 MA students who produced translations from Danish (L1) into English (L2) under different working conditions: (1) written translation, (2) sight translation, and (3) sight translation with a speech recognition (SR) tool. Audio...... output and keystrokes were recorded. Oral and written translation data were examined in order to investigate if task times and translation quality differed in the three modalities. Although task times were found to be highest in written translation, the quality was not consistently better. In addition...

  12. Expressive timing facilitates the neural processing of phrase boundaries in music: evidence from event-related potentials.

    Directory of Open Access Journals (Sweden)

    Eva Istók

    Full Text Available The organization of sound into meaningful units is fundamental to the processing of auditory information such as speech and music. In expressive music performance, structural units or phrases may become particularly distinguishable through subtle timing variations highlighting musical phrase boundaries. As such, expressive timing may support the successful parsing of otherwise continuous musical material. By means of the event-related potential technique (ERP, we investigated whether expressive timing modulates the neural processing of musical phrases. Musicians and laymen listened to short atonal scale-like melodies that were presented either isochronously (deadpan or with expressive timing cues emphasizing the melodies' two-phrase structure. Melodies were presented in an active and a passive condition. Expressive timing facilitated the processing of phrase boundaries as indicated by decreased N2b amplitude and enhanced P3a amplitude for target phrase boundaries and larger P2 amplitude for non-target boundaries. When timing cues were lacking, task demands increased especially for laymen as reflected by reduced P3a amplitude. In line, the N2b occurred earlier for musicians in both conditions indicating general faster target detection compared to laymen. Importantly, the elicitation of a P3a-like response to phrase boundaries marked by a pitch leap during passive exposure suggests that expressive timing information is automatically encoded and may lead to an involuntary allocation of attention towards significant events within a melody. We conclude that subtle timing variations in music performance prepare the listener for musical key events by directing and guiding attention towards their occurrences. That is, expressive timing facilitates the structuring and parsing of continuous musical material even when the auditory input is unattended.

  13. Expressive timing facilitates the neural processing of phrase boundaries in music: evidence from event-related potentials.

    Science.gov (United States)

    Istók, Eva; Friberg, Anders; Huotilainen, Minna; Tervaniemi, Mari

    2013-01-01

    The organization of sound into meaningful units is fundamental to the processing of auditory information such as speech and music. In expressive music performance, structural units or phrases may become particularly distinguishable through subtle timing variations highlighting musical phrase boundaries. As such, expressive timing may support the successful parsing of otherwise continuous musical material. By means of the event-related potential technique (ERP), we investigated whether expressive timing modulates the neural processing of musical phrases. Musicians and laymen listened to short atonal scale-like melodies that were presented either isochronously (deadpan) or with expressive timing cues emphasizing the melodies' two-phrase structure. Melodies were presented in an active and a passive condition. Expressive timing facilitated the processing of phrase boundaries as indicated by decreased N2b amplitude and enhanced P3a amplitude for target phrase boundaries and larger P2 amplitude for non-target boundaries. When timing cues were lacking, task demands increased especially for laymen as reflected by reduced P3a amplitude. In line, the N2b occurred earlier for musicians in both conditions indicating general faster target detection compared to laymen. Importantly, the elicitation of a P3a-like response to phrase boundaries marked by a pitch leap during passive exposure suggests that expressive timing information is automatically encoded and may lead to an involuntary allocation of attention towards significant events within a melody. We conclude that subtle timing variations in music performance prepare the listener for musical key events by directing and guiding attention towards their occurrences. That is, expressive timing facilitates the structuring and parsing of continuous musical material even when the auditory input is unattended.

  14. L'organizzazione di una lezione di traduzione basata sull'approccio dell'analisi testuale (The Organization of a Translation Lesson Based on a Textual Analysis Approach).

    Science.gov (United States)

    Horrakh, Livio

    1988-01-01

    Describes a three-phase approach to teaching translation: (1) decoding the "macro" (beyond sentence level) and the "micro" (sentence or phrase level) elements of the passage; (2) coordinating the elements of the original language text with the target language text; and (3) producing the text in the target language. (CFM)

  15. 基于特征比较和最大熵模型的统计机器翻译错误检测%Error Detection for Statistical Machine Translation Based on Feature Comparison and Maximum Entropy Model Classifier

    Institute of Scientific and Technical Information of China (English)

    杜金华; 王莎

    2013-01-01

    首先介绍3种典型的用于翻译错误检测和分类的单词后验概率特征,即基于固定位置的词后验概率、基于滑动窗的词后验概率和基于词对齐的词后验概率,分析其对错误检测性能的影响;然后,将其分别与语言学特征如词性、词及由LG句法分析器抽取的句法特征等进行组合,利用最大熵分类器预测翻译错误,并在汉英NIST数据集上进行实验验证和比较.实验结果表明,不同的单词后验概率对分类错误率的影响是显著的,并且在词后验概率基础上加入语言学特征的组合特征可以显著降低分类错误率,提高译文错误预测性能.%The authors firstly introduce three typical word posterior probabilities (WPP) for error detection and classification, which are fixed position WPP, sliding window WPP, and alignment-based WPP, and analyzes their impact on the detection performance. Then each WPP feature is combined with three linguistic features (Word, POS and LG Parsing knowledge) over the maximum entropy classifier to predict the translation errors. Experimental results on Chinese-to-English NIST datasets show that the influences of different WPP features on the classification error rate (CER) are significant, and the combination of WPP with linguistic features can significantly reduce the CER and improve the prediction capability of the classifier.

  16. Word Translation Prediction for Morphologically Rich Languages with Bilingual Neural Networks

    NARCIS (Netherlands)

    Tran, K.; Bisazza, A.; Monz, C.

    2014-01-01

    Translating into morphologically rich languages is a particularly difficult problem in machine translation due to the high degree of inflectional ambiguity in the target language, often only poorly captured by existing word translation models. We present a general approach that exploits source-side

  17. Mutual Disambiguation of Eye Gaze and Speech for Sight Translation and Reading

    DEFF Research Database (Denmark)

    Kulkarni, Rucha; Jain, Kritika; Bansal, Himanshu;

    Researchers are proposing interactive machine translation as a potential method to make language translation process more efficient and usable. Introduction of different modalities like eye gaze and speech are being explored to add to the interactivity of language translation system. Unfortunately...

  18. Towards improving English-Latvian translation: a system comparison and a new rescoring feature

    NARCIS (Netherlands)

    Khalilov, M.; Fonollosa, J.A.R.; Skadiņa, I.; Brālītis, E.; Pretkalniņa, L.; Calzolari, N.; Choukri, K.; Maegaard, B.; Mariani, J.; Odijk, J.; Piperidis, S.; Rosner, M.; Tapias, D.

    2010-01-01

    Translation into the languages with relatively free word order has received a lot less attention than translation into fixed word order languages (English), or into analytical languages (Chinese). At the same time this translation task is found among the most difficult challenges for machine transla

  19. Embedding Web-Based Statistical Translation Models in Cross-Language Information Retrieval

    NARCIS (Netherlands)

    Kraaij, W.; Nie, J.Y.; Simard, M.

    2003-01-01

    Although more and more language pairs are covered by machine translation (MT) services, there are still many pairs that lack translation resources. Cross-language information retrieval (CUR) is an application that needs translation functionality of a relatively low level of sophistication, since

  20. Embedding Web-Based Statistical Translation Models in Cross-Language Information Retrieval

    NARCIS (Netherlands)

    Kraaij, W.; Nie, J.Y.; Simard, M.

    2003-01-01

    Although more and more language pairs are covered by machine translation (MT) services, there are still many pairs that lack translation resources. Cross-language information retrieval (CUR) is an application that needs translation functionality of a relatively low level of sophistication, since cur

  1. Mutual Disambiguation of Eye Gaze and Speech for Sight Translation and Reading

    DEFF Research Database (Denmark)

    Kulkarni, Rucha; Jain, Kritika; Bansal, Himanshu

    2013-01-01

    Researchers are proposing interactive machine translation as a potential method to make language translation process more efficient and usable. Introduction of different modalities like eye gaze and speech are being explored to add to the interactivity of language translation system. Unfortunatel...

  2. Automation of printing machine

    OpenAIRE

    Sušil, David

    2016-01-01

    Bachelor thesis is focused on the automation of the printing machine and comparing the two types of printing machines. The first chapter deals with the history of printing, typesettings, printing techniques and various kinds of bookbinding. The second chapter describes the difference between sheet-fed printing machines and offset printing machines, the difference between two representatives of rotary machines, technological process of the products on these machines, the description of the mac...

  3. Electrocorticographic language mapping with a listening task consisting of alternating speech and music phrases.

    Science.gov (United States)

    Mooij, Anne H; Huiskamp, Geertjan J M; Gosselaar, Peter H; Ferrier, Cyrille H

    2016-02-01

    Electrocorticographic (ECoG) mapping of high gamma activity induced by language tasks has been proposed as a more patient friendly alternative for electrocortical stimulation mapping (ESM), the gold standard in pre-surgical language mapping of epilepsy patients. However, ECoG mapping often reveals more language areas than considered critical with ESM. We investigated if critical language areas can be identified with a listening task consisting of speech and music phrases. Nine patients with implanted subdural grid electrodes listened to an audio fragment in which music and speech alternated. We analysed ECoG power in the 65-95 Hz band and obtained task-related activity patterns in electrodes over language areas. We compared the spatial distribution of sites that discriminated between listening to speech and music to ESM results using sensitivity and specificity calculations. Our listening task of alternating speech and music phrases had a low sensitivity (0.32) but a high specificity (0.95). The high specificity indicates that this test does indeed point to areas that are critical to language processing. Our test cannot replace ESM, but this short and simple task can give a reliable indication where to find critical language areas, better than ECoG mapping using language tasks alone. Copyright © 2015 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.

  4. On Collocations and Their Interaction with Parsing and Translation

    Directory of Open Access Journals (Sweden)

    Violeta Seretan

    2013-10-01

    Full Text Available We address the problem of automatically processing collocations—a subclass of multi-word expressions characterized by a high degree of morphosyntactic flexibility—in the context of two major applications, namely, syntactic parsing and machine translation. We show that parsing and collocation identification are processes that are interrelated and that benefit from each other, inasmuch as syntactic information is crucial for acquiring collocations from corpora and, vice versa, collocational information can be used to improve parsing performance. Similarly, we focus on the interrelation between collocations and machine translation, highlighting the use of translation information for multilingual collocation identification, as well as the use of collocational knowledge for improving translation. We give a panorama of the existing relevant work, and we parallel the literature surveys with our own experiments involving a symbolic parser and a rule-based translation system. The results show a significant improvement over approaches in which the corresponding tasks are decoupled.

  5. The use of phrase-level prosodic information in lexical segmentation: evidence from word-spotting experiments in Korean.

    Science.gov (United States)

    Kim, Sahyang; Cho, Taehong

    2009-05-01

    This study investigated the role of phrase-level prosodic boundary information in word segmentation in Korean with two word-spotting experiments. In experiment 1, it was found that intonational cues alone helped listeners with lexical segmentation. Listeners paid more attention to local intonational cues (...H#L...) across the prosodic boundary than the intonational information within a prosodic phrase. The results imply that intonation patterns with high frequency are used, though not exclusively, in lexical segmentation. In experiment 2, final lengthening was added to see how multiple prosodic cues influence lexical segmentation. The results showed that listeners did not necessarily benefit from the presence of both intonational and final lengthening cues: Their performance was improved only when intonational information contained infrequent tonal patterns for boundary marking, showing only partially cumulative effects of prosodic cues. When the intonational information was optimal (frequent) for boundary marking, however, poorer performance was observed with final lengthening. This is arguably because the phrase-initial segmental allophonic cues for the accentual phrase were not matched with the prosodic cues for the intonational phrase. It is proposed that the asymmetrical use of multiple cues was due to interaction between prosodic and segmental information that are computed in parallel in lexical segmentation.

  6. Discussion on the Difference between Phrases and Compound Words%谈短语与合成词的区别

    Institute of Scientific and Technical Information of China (English)

    李立

    2012-01-01

    短语与合成词是两种大小不同的语法单位,大多数短语和词的界限是清楚的。由于语言的发展变化,有些短语逐渐向合成词转化,在转化过程中就产生了某些短语和某些合成词的区分问题。本文对有争议的短语和合成词进行区分,以达到正确区分的目的。%Phrases and compound words are two grammatical units between the most phrases and words is discernible. As the changes which have different sizes, and the boundary of Chinese language, some phrases transform gradually into the compound words, and the differentiated problem of some phrases and some compound words has appeared. In this paper, we differentiate the debatable phrases and compound words, in order to master the cor- rect?differentiating standard.

  7. Corpora and Translator

    Institute of Scientific and Technical Information of China (English)

    傅丽莉

    2010-01-01

    @@ 1 Introduction Recent years have witnessed a significant growth of corpora heat. One of the many fields where corpora have a growing impact is translation, both at a descriptive and a practical level. Chesterman pointed out that the focus of Translation Studies shifted from translation itself to translators, from regulative to descriptive studies, from philosophical to empirical studies (Chesterman, 1998).

  8. Sound Effects in Translation

    DEFF Research Database (Denmark)

    Mees, Inger M.; Dragsted, Barbara; Gorm Hansen, Inge

    2015-01-01

    implications for the curriculum, (3) be pedagogically motivating, and (4) prepare students for employing translation technology in their future practice as translators. In a two-phase study in which 14 MA students translated texts in three modalities (sight, written, and oral translation using an SR program...

  9. On translation equivalence

    Institute of Scientific and Technical Information of China (English)

    石雏凤

    2009-01-01

    Nida's translation theories, especially for his "Dynamic equivalence theory", are highly praised and adopted in Chinese translation circle. Howev-er, there are a lot of criticism and misunderstanding at the same time. This paper explores the issue on translation equivalence so as to benefit our translation studies on both theory and practice level.

  10. Machine musicianship

    Science.gov (United States)

    Rowe, Robert

    2002-05-01

    The training of musicians begins by teaching basic musical concepts, a collection of knowledge commonly known as musicianship. Computer programs designed to implement musical skills (e.g., to make sense of what they hear, perform music expressively, or compose convincing pieces) can similarly benefit from access to a fundamental level of musicianship. Recent research in music cognition, artificial intelligence, and music theory has produced a repertoire of techniques that can make the behavior of computer programs more musical. Many of these were presented in a recently published book/CD-ROM entitled Machine Musicianship. For use in interactive music systems, we are interested in those which are fast enough to run in real time and that need only make reference to the material as it appears in sequence. This talk will review several applications that are able to identify the tonal center of musical material during performance. Beyond this specific task, the design of real-time algorithmic listening through the concurrent operation of several connected analyzers is examined. The presentation includes discussion of a library of C++ objects that can be combined to perform interactive listening and a demonstration of their capability.

  11. FACTORS REGULATING LIBERAL TRANSLATION

    Institute of Scientific and Technical Information of China (English)

    龚海红

    2012-01-01

    Literal translation and liberal translation are two important methods and both play key roles in translation.However,some textbooks say that most translations are literal translations while others maintain most are liberal ones,besides,some others suggest a combination of the two.This paper focuses on the facts that regulate liberal translation.Because of the differences in culture,society,history,geography,and so on,there exists a great difference between Chinese language and English language,which does naturally lead to the liberal translation.

  12. Some Major Steps to Translation and Translator

    Directory of Open Access Journals (Sweden)

    Mohammad Reza Hojat Shamami

    2012-09-01

    Full Text Available This work is an overview of the main issues at the core of theorizing translation practice and the features of a good translator or how to be a good translator and of course what are the Skills to become a Freelance Translator and Translation process. In this world of science and technology there is knowledge explosion every day. This knowledge which is generally written in the English language needs to be transmitted in various languages so that people who do not know how to speak and write the original language can get the knowledge necessary for industrial development and technological innovation to keep up with the rest of the world. To transmit this knowledge effectively, there is a need for competent translators in various languages.

  13. Proses dan Hasil Penerjemahan Mahasiswa (Studi Kasus Pada Mahasiswa Translation Theory and Practice Di STAIN Pamekasan

    Directory of Open Access Journals (Sweden)

    Mosleh Habibullah

    2015-05-01

    , from translation technique in the research is found from the text which college students translate, error in table 1, there are 3 errors, in table 2 there are six language errors, in table 3 there are six errors. In table 4 there are seven errors.However, researcher finds inadequacy which is caused by mastering vocabulary, phrase, idiom, and students’ inadequacy in using grammatical Principal from BSu (NL. Besides that, it is possible that time allocation is not enough, 100 minutes in 2 sks translation theory and 100 minutes in 2 sks translation practice

  14. Cultural Context and Translation

    Institute of Scientific and Technical Information of China (English)

    张敏

    2009-01-01

    cultural context plays an important role in translation. Because translation is a cross-culture activity, the culture context that influ-ences translating is consisted of both the culture contexts of source language and target language. This article firstly analyzes the concept of context and cultural context, then according to the procedure of translating classifies cultural context into two stages and talks about how they respectively influence translating.

  15. Stylistic Requirement for Translation

    Institute of Scientific and Technical Information of China (English)

    HOU Yin-zhen

    2016-01-01

    Stylistic research is subordinate to language use research. The fast maturing modern stylistics has greatly boosted trans-lation studies. Translation has a close relationship with stylistics. Many problems can be solved in translation practice by stylis-tic theories and analysis methods. Based on a brief introduction of stylistics and the relationship between stylistics and transla-tion, this paper will give a specific analysis of the stylistic requirement for translation.

  16. Dialog between a Lexicographer and a Translator

    Directory of Open Access Journals (Sweden)

    Mark Kit

    2015-06-01

    Full Text Available Dialog between a Lexicographer and a Translator The discussion between the authors of the paper concerns the most pressing issues encountered in natural language semantics, as well as in corpus linguistics and computational linguistics. A broad range of knowledge, allowing linguists and information scientists to work together, is required in these areas. The paper describes some primary problems of human and machine translation caused by gaps between different fields of knowledge. The authors suggest that interdisciplinary approach is required when it comes to contrastive studies in linguistics.

  17. Translation lexicon acquisition from bilingual dictionaries

    Science.gov (United States)

    Doermann, David S.; Ma, Huanfeng; Karagol-Ayan, Burcu; Oard, Douglas W.

    2001-12-01

    Bilingual dictionaries hold great potential as a source of lexical resources for training automated systems for optical character recognition, machine translation and cross-language information retrieval. In this work we describe a system for extracting term lexicons from printed copies of bilingual dictionaries. We describe our approach to page and definition segmentation and entry parsing. We have used the approach to parse a number of dictionaries and demonstrate the results for retrieval using a French-English Dictionary to generate a translation lexicon and a corpus of English queries applied to French documents to evaluation cross-language IR.

  18. Chapter 16: text mining for translational bioinformatics.

    Directory of Open Access Journals (Sweden)

    K Bretonnel Cohen

    2013-04-01

    Full Text Available Text mining for translational bioinformatics is a new field with tremendous research potential. It is a subfield of biomedical natural language processing that concerns itself directly with the problem of relating basic biomedical research to clinical practice, and vice versa. Applications of text mining fall both into the category of T1 translational research-translating basic science results into new interventions-and T2 translational research, or translational research for public health. Potential use cases include better phenotyping of research subjects, and pharmacogenomic research. A variety of methods for evaluating text mining applications exist, including corpora, structured test suites, and post hoc judging. Two basic principles of linguistic structure are relevant for building text mining applications. One is that linguistic structure consists of multiple levels. The other is that every level of linguistic structure is characterized by ambiguity. There are two basic approaches to text mining: rule-based, also known as knowledge-based; and machine-learning-based, also known as statistical. Many systems are hybrids of the two approaches. Shared tasks have had a strong effect on the direction of the field. Like all translational bioinformatics software, text mining software for translational bioinformatics can be considered health-critical and should be subject to the strictest standards of quality assurance and software testing.

  19. Chapter 16: text mining for translational bioinformatics.

    Science.gov (United States)

    Cohen, K Bretonnel; Hunter, Lawrence E

    2013-04-01

    Text mining for translational bioinformatics is a new field with tremendous research potential. It is a subfield of biomedical natural language processing that concerns itself directly with the problem of relating basic biomedical research to clinical practice, and vice versa. Applications of text mining fall both into the category of T1 translational research-translating basic science results into new interventions-and T2 translational research, or translational research for public health. Potential use cases include better phenotyping of research subjects, and pharmacogenomic research. A variety of methods for evaluating text mining applications exist, including corpora, structured test suites, and post hoc judging. Two basic principles of linguistic structure are relevant for building text mining applications. One is that linguistic structure consists of multiple levels. The other is that every level of linguistic structure is characterized by ambiguity. There are two basic approaches to text mining: rule-based, also known as knowledge-based; and machine-learning-based, also known as statistical. Many systems are hybrids of the two approaches. Shared tasks have had a strong effect on the direction of the field. Like all translational bioinformatics software, text mining software for translational bioinformatics can be considered health-critical and should be subject to the strictest standards of quality assurance and software testing.

  20. Translation-coupling systems

    Science.gov (United States)

    Pfleger, Brian; Mendez-Perez, Daniel

    2013-11-05

    Disclosed are systems and methods for coupling translation of a target gene to a detectable response gene. A version of the invention includes a translation-coupling cassette. The translation-coupling cassette includes a target gene, a response gene, a response-gene translation control element, and a secondary structure-forming sequence that reversibly forms a secondary structure masking the response-gene translation control element. Masking of the response-gene translation control element inhibits translation of the response gene. Full translation of the target gene results in unfolding of the secondary structure and consequent translation of the response gene. Translation of the target gene is determined by detecting presence of the response-gene protein product. The invention further includes RNA transcripts of the translation-coupling cassettes, vectors comprising the translation-coupling cassettes, hosts comprising the translation-coupling cassettes, methods of using the translation-coupling cassettes, and gene products produced with the translation-coupling cassettes.

  1. Translation-coupling systems

    Energy Technology Data Exchange (ETDEWEB)

    Pfleger, Brian; Mendez-Perez, Daniel

    2015-05-19

    Disclosed are systems and methods for coupling translation of a target gene to a detectable response gene. A version of the invention includes a translation-coupling cassette. The translation-coupling cassette includes a target gene, a response gene, a response-gene translation control element, and a secondary structure-forming sequence that reversibly forms a secondary structure masking the response-gene translation control element. Masking of the response-gene translation control element inhibits translation of the response gene. Full translation of the target gene results in unfolding of the secondary structure and consequent translation of the response gene. Translation of the target gene is determined by detecting presence of the response-gene protein product. The invention further includes RNA transcripts of the translation-coupling cassettes, vectors comprising the translation-coupling cassettes, hosts comprising the translation-coupling cassettes, methods of using the translation-coupling cassettes, and gene products produced with the translation-coupling cassettes.

  2. Electrical machines mathematical fundamentals of machine topologies

    CERN Document Server

    Gerling, Dieter

    2015-01-01

    Electrical Machines and Drives play a powerful role in industry with an ever increasing importance. This fact requires the understanding of machine and drive principles by engineers of many different disciplines. Therefore, this book is intended to give a comprehensive deduction of these principles. Special attention is given to the precise mathematical derivation of the necessary formulae to calculate machines and drives and to the discussion of simplifications (if applied) with the associated limits. The book shows how the different machine topologies can be deduced from general fundamentals, and how they are linked together. This book addresses graduate students, researchers, and developers of Electrical Machines and Drives, who are interested in getting knowledge about the principles of machine and drive operation and in detecting the mathematical and engineering specialties of the different machine and drive topologies together with their mutual links. The detailed - but nevertheless compact - mat...

  3. Comparison and Translation of English and Chinese Idioms

    Institute of Scientific and Technical Information of China (English)

    马俊宁; 刘晴

    2016-01-01

    The so called idiom refers to a phrase where the words together have a meaning that is different from the dictionary definitions of the individual words, but must be learnt as a whole. Because of their rich associations and strong nationality, idioms are usually hard for learners to understand and master. But as you know, language is the carrier of culture, and idioms are the cream of a language. Since both English and Chinese idioms have their unique characteristics, the comparison of the two may help us understand the similarities and differences between western and Chinese cultures, bridge the cultural gaps and enhance the effectiveness of cross-cultural communication. Besides that, the comparison can also be instructional in translation theory and translation practice.

  4. Laser machining of advanced materials

    CERN Document Server

    Dahotre, Narendra B

    2011-01-01

    Advanced materialsIntroductionApplicationsStructural ceramicsBiomaterials CompositesIntermetallicsMachining of advanced materials IntroductionFabrication techniquesMechanical machiningChemical Machining (CM)Electrical machiningRadiation machining Hybrid machiningLaser machiningIntroductionAbsorption of laser energy and multiple reflectionsThermal effectsLaser machining of structural ceramicsIntrodu

  5. Translation Theory 'Translated': Three Perspectives on Translation in Organizational Research

    DEFF Research Database (Denmark)

    Wæraas, Arild; Nielsen, Jeppe

    2016-01-01

    Translation theory has proved to be a versatile analytical lens used by scholars working from different traditions. On the basis of a systematic literature review, this study adds to our understanding of the ‘translations’ of translation theory by identifying the distinguishing features of the mo......, but also overlapping. We discuss the ways in which the three versions of translation theory may be combined and enrich each other so as to inform future research, thereby offering a more complete understanding of translation in and across organizational settings.......Translation theory has proved to be a versatile analytical lens used by scholars working from different traditions. On the basis of a systematic literature review, this study adds to our understanding of the ‘translations’ of translation theory by identifying the distinguishing features of the most...... common theoretical approaches to translation within the organization and management discipline: actor-network theory, knowledge-based theory, and Scandinavian institutionalism. Although each of these approaches already has borne much fruit in research, the literature is diverse and somewhat fragmented...

  6. Writing Through: Practising Translation

    Directory of Open Access Journals (Sweden)

    Joel Scott

    2010-05-01

    Full Text Available This essay exists as a segment in a line of study and writing practice that moves between a critical theory analysis of translation studies conceptions of language, and the practical questions of what those ideas might mean for contemporary translation and writing practice. Although the underlying preoccupation of this essay, and my more general line of inquiry, is translation studies and practice, in many ways translation is merely a way into a discussion on language. For this essay, translation is the threshold of language. But the two trails of the discussion never manage to elude each other, and these concatenations have informed two experimental translation methods, referred to here as Live Translations and Series Translations. Following the essay are a number of poems in translation, all of which come from Blanco Nuclear by the contemporary Spanish poet, Esteban Pujals Gesalí. The first group, the Live Translations consist of transcriptions I made from audio recordings read in a public setting, in which the texts were translated in situ, either off the page of original Spanish-language poems, or through a process very much like that carried out by simultaneous translators, for which readings of the poems were played back to me through headphones at varying speeds to be translated before the audience. The translations collected are imperfect renderings, attesting to a moment in language practice rather than language objects. The second method involves an iterative translation process, by which three versions of any one poem are rendered, with varying levels of fluency, fidelity and servility. All three translations are presented one after the other as a series, with no version asserting itself as the primary translation. These examples, as well as the translation methods themselves, are intended as preliminary experiments within an endlessly divergent continuum of potential methods and translations, and not as a complete representation of

  7. The deleuzian abstract machines

    DEFF Research Database (Denmark)

    Werner Petersen, Erik

    2005-01-01

    production. In Kafka: Toward a Minor Literature, Deleuze and Guatari gave the most comprehensive explanation to the abstract machine in the work of art. Like the war-machines of Virilio, the Kafka-machine operates in three gears or speeds. Furthermore, the machine is connected to spatial diagrams...

  8. Sound Effects in Translation

    DEFF Research Database (Denmark)

    Mees, Inger M.; Dragsted, Barbara; Gorm Hansen, Inge

    2013-01-01

    On the basis of a pilot study using speech recognition (SR) software, this paper attempts to illustrate the benefits of adopting an interdisciplinary approach in translator training. It shows how the collaboration between phoneticians, translators and interpreters can (1) advance research, (2) have...... implications for the curriculum, (3) be pedagogically motivating, and (4) prepare students for employing translation technology in their future practice as translators. In a two-phase study in which 14 MA students translated texts in three modalities (sight, written, and oral translation using an SR program......), Translog was employed to measure task times. The quality of the products was assessed by three experienced translators, and the number and types of misrecognitions were identified by a phonetician. Results indicate that SR translation provides a potentially useful supplement to written translation...

  9. contemporary translation studies and bible translation

    African Journals Online (AJOL)

    the source text to the translation process, the product and/or reception of ... The methodological impact was a shift from normative linguistic-based ... ary Society Period, with formal-equivalent translations being made by mis- ... the theoretical foundation of the functional-equivalent approach problem- ... After a review of.

  10. Believing in God the Father: Interpreting a phrase from the Apostle’s Creed

    Directory of Open Access Journals (Sweden)

    Marcel Sarot

    2016-05-01

    Full Text Available In our days, the creedal phrase ‘I believe in God the Father almighty’ is interpreted primarilyalong Trinitarian lines: It is applied to God as the Father of Jesus Christ. Here I argue that ithas a dual background: in Jesus’ prayer practice, in which He consistently addressed God as‘Father’, and in the Hellenistic habit of referring to the Creator as ‘Father’. I discuss Jesus’ useof the term ‘Father’ against its Old Testament background, and argue that it primarily pointsto the intimacy of Jesus’ relationship with His father. Against the Hellenistic background,however, the metaphor ‘Father’ means ‘he who brings forth effortlessly’. Finally, I discusssome gender issues connected with the use of the term ‘Father’ for God.

  11. Error-Driven Pruning of Treebank Grammars for Base Noun Phrase Identification

    CERN Document Server

    Cardie, C; Cardie, Claire; Pierce, David

    1998-01-01

    Finding simple, non-recursive, base noun phrases is an important subtask for many natural language processing applications. While previous empirical methods for base NP identification have been rather complex, this paper instead proposes a very simple algorithm that is tailored to the relative simplicity of the task. In particular, we present a corpus-based approach for finding base NPs by matching part-of-speech tag sequences. The training phase of the algorithm is based on two successful techniques: first the base NP grammar is read from a ``treebank'' corpus; then the grammar is improved by selecting rules with high ``benefit'' scores. Using this simple algorithm with a naive heuristic for matching rules, we achieve surprising accuracy in an evaluation on the Penn Treebank Wall Street Journal.

  12. Testing the stem dominance hypothesis: meaning analysis of inflected words and prepositional phrases.

    Directory of Open Access Journals (Sweden)

    Minna Lehtonen

    Full Text Available We tested the hypothesis that lexical-semantic access of inflected words is governed by the word stem. Object drawings overlaid with a dot/arrow marking position/movement were matched with corresponding linguistic expressions like "from the house". To test whether the stem dominates lexical-semantic access irrespective of its position, we used Swedish prepositional phrases (locative information via preposition immediately preceding the stem or Finnish case-inflected words (locative information via suffix immediately following the stem. Both in monolingual Swedish and in bilingual Finnish-Swedish speakers, correct stems with incorrect prepositions/case-endings were hardest to reject. This finding supports the view that the stem is indeed the dominant unit in meaning access of inflected words.

  13. Testing the stem dominance hypothesis: meaning analysis of inflected words and prepositional phrases.

    Science.gov (United States)

    Lehtonen, Minna; Harrer, Gabor; Wande, Erling; Laine, Matti

    2014-01-01

    We tested the hypothesis that lexical-semantic access of inflected words is governed by the word stem. Object drawings overlaid with a dot/arrow marking position/movement were matched with corresponding linguistic expressions like "from the house". To test whether the stem dominates lexical-semantic access irrespective of its position, we used Swedish prepositional phrases (locative information via preposition immediately preceding the stem) or Finnish case-inflected words (locative information via suffix immediately following the stem). Both in monolingual Swedish and in bilingual Finnish-Swedish speakers, correct stems with incorrect prepositions/case-endings were hardest to reject. This finding supports the view that the stem is indeed the dominant unit in meaning access of inflected words.

  14. [Acquisition of German morphosyntax in the context of constituent movement into the complementizer phrase position].

    Science.gov (United States)

    Miller, S; Ptok, M

    2014-04-01

    German is a genuine verb-second (V2) language. For V2 languages it is assumed that all sentence structures can be derived from the underlying subordinate clause structure via the movement of individual components. The acquisition of German V2 structures represents a critical step in German first language acquisition and is usually mastered at the age of 3 years. However, not all children succeed in filtering the necessary information from the spoken input in order to acquire this key component of German syntax. Use of canonical sentence structures alone is not a confirmation of a successful V2 acquisition. Moreover, this structure often represents some kind of interim grammar. Only the use of complementizer phrase (CP)-related structures, such as wh-questions or topicalizations of objects is a real indication of a successful V2 acquisition.

  15. Early acquisition of gender agreement in the Spanish noun phrase: starting small.

    Science.gov (United States)

    Mariscal, Sonia

    2009-01-01

    Nativist and constructivist accounts differ in their characterization of children's knowledge of grammatical categories. In this paper we present research on the process of acquisition of a particular grammatical system, gender agreement in the Spanish noun phrase, in children under three years of age. The design of the longitudinal study employed presents some variations in relation to classical studies. The aim was to obtain a large corpus of NP data which would allow different types of analysis of the children's productions to be carried out. Intra-individual variability in early NP types was analyzed and measured, and an elicitation task for adjectives was used. Results show that the acquisition of NP and gender agreement is a complex process which advances as the children gradually integrate different pieces of evidence: phonological, distributional and functional. The reduction of variability as the grammatical process advances is a key feature for its explanation.

  16. Spiritualist Writing Machines: Telegraphy, Typtology, Typewriting

    Directory of Open Access Journals (Sweden)

    Anthony Enns

    2015-09-01

    Full Text Available This paper examines how religious concepts both reflected and informed the development of new technologies for encoding, transmitting, and printing written information. While many spiritualist writing machines were based on existing technologies that were repurposed for spirit communication, others prefigured or even inspired more advanced technological innovations. The history of spiritualist writing machines thus not only represents a response to the rise of new media technologies in the nineteenth century, but it also reflects a set of cultural demands that helped to shape the development of new technologies, such as the need to replace handwriting with discrete, uniform lettering, which accelerated the speed of composition; the need to translate written information into codes, which could be transmitted across vast distances; and the need to automate the process of transmitting, translating, and transcribing written information, which seemed to endow the machines themselves with a certain degree of autonomy or even intelligence. While spiritualists and inventors were often (but not always motivated by different goals, the development of spiritualist writing machines and the development of technological writing machines were nevertheless deeply interrelated and interdependent.

  17. Translation, Quality and Growth

    DEFF Research Database (Denmark)

    Petersen, Margrethe

    The paper investigates the feasibility and some of the possible consequences of applying quality management to translation. It first gives an introduction to two different schools of translation and to (total) quality management. It then examines whether quality management may, in theory......, be applied to translation and goes on to present a case study which involves a firm in the translation industry and which illustrates quality management in practice. The paper shows that applying quality management to translation is feasible and that doing so may translate into sustained growth....

  18. Translation, Quality and Growth

    DEFF Research Database (Denmark)

    Petersen, Margrethe

    The paper investigates the feasibility and some of the possible consequences of applying quality management to translation. It first gives an introduction to two different schools of translation and to (total) quality management. It then examines whether quality management may, in theory......, be applied to translation and goes on to present a case study which involves a firm in the translation industry and which illustrates quality management in practice. The paper shows that applying quality management to translation is feasible and that doing so may translate into sustained growth....

  19. On Advertisement Language Translation

    Institute of Scientific and Technical Information of China (English)

    LI Ya-lu

    2015-01-01

    Advertisement language is a special practical writing with abundant imagination, great creativity and instigation. During translation the diversity in social culture, language and ethnic psychology, etc. will be directly reflected into its effect, presenting both the trouble in business of translators and also significant influences on the business brand. Starting from the features of adver⁃tisement language itself, this paper integrates translation situations and measures from several schools over the latest 20 years, gives typical examples in advertising translation and analyzes from varies perspectives and points out some problems in today ’s ad⁃vertisement translation,aiming to provide some constructive opinions for translation of advertisements.

  20. Film Name Translation

    Institute of Scientific and Technical Information of China (English)

    师晓晓

    2014-01-01

    <正>1.Introduction A good translation of the name should convey the information of the film and attract the audience’s desire for going to the cinema.Translation of film names should have business,information,culture,aesthetic features,while a short eye-catching name aims to leave the audience an unforgettable impression.This thesis discusses the translation of English film names from the aspects of the importance of English film name translation,principles for translating English film names and methods of English film name translation.