WorldWideScience

Sample records for machine translation pages

  1. Machine Translation

    Institute of Scientific and Technical Information of China (English)

    张严心

    2015-01-01

    As a kind of ancillary translation tool, Machine Translation has been paid increasing attention to and received different kinds of study by a great deal of researchers and scholars for a long time. To know the definition of Machine Translation and to analyse its benefits and problems are significant for translators in order to make good use of Machine Translation, and helpful to develop and consummate Machine Translation Systems in the future.

  2. Machine Translation and Other Translation Technologies.

    Science.gov (United States)

    Melby, Alan

    1996-01-01

    Examines the application of linguistic theory to machine translation and translator tools, discusses the use of machine translation and translator tools in the real world of translation, and addresses the impact of translation technology on conceptions of language and other issues. Findings indicate that the human mind is flexible and linguistic…

  3. Machine Translation Effect on Communication

    DEFF Research Database (Denmark)

    Jensen, Mika Yasuoka; Bjørn, Pernille

    2011-01-01

    Intercultural collaboration facilitated by machine translation has gradually spread in various settings. Still, little is known as for the practice of machine-translation mediated communication. This paper investigates how machine translation affects intercultural communication in practice. Based...... determines communication process largely, our data indicates communication relies more on a dynamic process where participants establish common ground than on reproducibility and grammatical accuracy.......Intercultural collaboration facilitated by machine translation has gradually spread in various settings. Still, little is known as for the practice of machine-translation mediated communication. This paper investigates how machine translation affects intercultural communication in practice. Based...... on communication in which multilingual communication system is applied, we identify four communication types and its’ influences on stakeholders’ communication process, especially focusing on establishment and maintenance of common ground. Different from our expectation that quality of machine translation results...

  4. Machine Learning Algorithms in Web Page Classification

    Directory of Open Access Journals (Sweden)

    W.A.AWAD

    2012-11-01

    Full Text Available In this paper we use machine learning algorithms like SVM, KNN and GIS to perform a behaviorcomparison on the web pages classifications problem, from the experiment we see in the SVM with smallnumber of negative documents to build the centroids has the smallest storage requirement and the least online test computation cost. But almost all GIS with different number of nearest neighbors have an evenhigher storage requirement and on line test computation cost than KNN. This suggests that some futurework should be done to try to reduce the storage requirement and on list test cost of GIS.

  5. Automatic Evaluation of Machine Translation

    DEFF Research Database (Denmark)

    Martinez, Mercedes Garcia; Koglin, Arlene; Mesa-Lao, Bartolomé

    2015-01-01

    The availability of systems capable of producing fairly accurate translations has increased the popularity of machine translation (MT). The translation industry is steadily incorporating MT in their workflows engaging the human translator to post-edit the raw MT output in order to comply with a set...

  6. Parsing statistical machine translation output

    NARCIS (Netherlands)

    Carter, S.; Monz, C.; Vetulani, Z.

    2009-01-01

    Despite increasing research into the use of syntax during statistical machine translation, the incorporation of syntax into language models has seen limited success. We present a study of the discriminative abilities of generative syntax-based language models, over and above standard n-gram models,

  7. Automatic Evaluation of Machine Translation

    DEFF Research Database (Denmark)

    Martinez, Mercedes Garcia; Koglin, Arlene; Mesa-Lao, Bartolomé

    2015-01-01

    The availability of systems capable of producing fairly accurate translations has increased the popularity of machine translation (MT). The translation industry is steadily incorporating MT in their workflows engaging the human translator to post-edit the raw MT output in order to comply with a set...... of post-editing effort, namely i) temporal (time), ii) cognitive (mental processes) and iii) technical (keyboard activity). For the purposes of this research, TER scores were correlated with two different indicators of post-editing effort as computed in the CRITT Translation Process Database (TPR......-DB) *. On the one hand, post-editing temporal effort was measured using FDur values (duration of segment production time excluding keystroke pauses >_ 200 seconds) and KDur values (duration of coherent keyboard activity excluding keystroke pauses >_ 5 seconds). On the other hand, post-editing technical effort...

  8. On automatic machine translation evaluation

    Directory of Open Access Journals (Sweden)

    Darinka Verdonik

    2013-05-01

    Full Text Available An important task of developing machine translation (MT is evaluating system performance. Automatic measures are most commonly used for this task, as manual evaluation is time-consuming and costly. However, to perform an objective evaluation is not a trivial task. Automatic measures, such as BLEU, TER, NIST, METEOR etc., have their own weaknesses, while manual evaluations are also problematic since they are always to some extent subjective. In this paper we test the influence of a test set on the results of automatic MT evaluation for the subtitling domain. Translating subtitles is a rather specific task for MT, since subtitles are a sort of summarization of spoken text rather than a direct translation of (written text. Additional problem when translating language pair that does not include English, in our example Slovene-Serbian, is that commonly the translations are done from English to Serbian and from English to Slovenian, and not directly, since most of the TV production is originally filmed in English. All this poses additional challenges to MT and consequently to MT evaluation. Automatic evaluation is based on a reference translation, which is usually taken from an existing parallel corpus and marked as a test set. In our experiments, we compare the evaluation results for the same MT system output using three types of test set. In the first round, the test set are 4000 subtitles from the parallel corpus of subtitles SUMAT. These subtitles are not direct translations from Serbian to Slovene or vice versa, but are based on an English original. In the second round, the test set are 1000 subtitles randomly extracted from the first test set and translated anew, from Serbian to Slovenian, based solely on the Serbian written subtitles. In the third round, the test set are the same 1000 subtitles, however this time the Slovene translations were obtained by manually correcting the Slovene MT outputs so that they are correct translations of the

  9. Evaluating Arabic to English Machine Translation

    Directory of Open Access Journals (Sweden)

    Laith S. Hadla

    2014-11-01

    Full Text Available Online text machine translation systems are widely used throughout the world freely. Most of these systems use statistical machine translation (SMT that is based on a corpus full with translation examples to learn from them how to translate correctly. Online text machine translation systems differ widely in their effectiveness, and therefore we have to fairly evaluate their effectiveness. Generally the manual (human evaluation of machine translation (MT systems is better than the automatic evaluation, but it is not feasible to be used. The distance or similarity of MT candidate output to a set of reference translations are used by many MT evaluation approaches. This study presents a comparison of effectiveness of two free online machine translation systems (Google Translate and Babylon machine translation system to translate Arabic to English. There are many automatic methods used to evaluate different machine translators, one of these methods; Bilingual Evaluation Understudy (BLEU method. BLEU is used to evaluate translation quality of two free online machine translation systems under consideration. A corpus consists of more than 1000 Arabic sentences with two reference English translations for each Arabic sentence is used in this study. This corpus of Arabic sentences and their English translations consists of 4169 Arabic words, where the number of unique Arabic words is 2539. This corpus is released online to be used by researchers. These Arabic sentences are distributed among four basic sentence functions (declarative, interrogative, exclamatory, and imperative. The experimental results show that Google machine translation system is better than Babylon machine translation system in terms of precision of translation from Arabic to English.

  10. Machine Translation in Post-Contemporary Era

    Science.gov (United States)

    Lin, Grace Hui Chin

    2010-01-01

    This article focusing on translating techniques via personal computer or laptop reports updated artificial intelligence progresses before 2010. Based on interpretations and information for field of MT [Machine Translation] by Yorick Wilks' book, "Machine Translation, Its scope and limits," this paper displays understandable theoretical frameworks…

  11. Machine Translation Tools - Tools of The Translator's Trade

    DEFF Research Database (Denmark)

    Kastberg, Peter

    2012-01-01

    In this article three of the more common types of translation tools are presented, discussed and critically evaluated. The types of translation tools dealt with in this article are: Fully Automated Machine Translation (or FAMT), Human Aided Machine Translation (or HAMT) and Machine Aided Human...... Translation (or MAHT). The strengths and weaknesses of the different types of tools are discussed and evaluated by means of a number of examples. The article aims at two things: at presenting a sort of state of the art of what is commonly referred to as “machine translation” as well as at providing the reader...... with a sound basis for considering what translation tool (if any) is the most appropriate in order to meet his or her specific translation needs....

  12. Adding Statistical Machine Translation Adaptation to Computer-Assisted Translation

    Science.gov (United States)

    2013-09-01

    on Telecommunications. Tehran, 2012, 822–826. Bertoldi, N.; Federico, M. Domain Adaptation for Statistical Machine Translation with Monolingual ...for Interactive Machine Translation. ICMI’11. Alicante, Spain: ACM, 2011, 197–200. 14 Haffari, G.; Sarkar, A. Active Learning for Multilingual

  13. Socorro Students Translate NRAO Web Pages Into Spanish

    Science.gov (United States)

    2002-07-01

    Six Socorro High School students are spending their summer working at the National Radio Astronomy Observatory (NRAO) on a unique project that gives them experience in language translation, World Wide Web design, and technical communication. Under the project, called "Un puente a los cielos," the students are translating many of NRAO's Web pages on astronomy into Spanish. "These students are using their bilingual skills to help us make basic information about astronomy and radio telescopes available to the Spanish-speaking community," said Kristy Dyer, who works at NRAO as a National Science Foundation postdoctoral fellow and who developed the project and obtained funding for it from the National Aeronautics and Space Administration. The students are: Daniel Acosta, 16; Rossellys Amarante, 15; Sandra Cano, 16; Joel Gonzalez, 16; Angelica Hernandez, 16; and Cecilia Lopez, 16. The translation project, a joint effort of NRAO and the NM Tech physics department, also includes Zammaya Moreno, a teacher from Ecuador, Robyn Harrison, NRAO's education officer, and NRAO computer specialist Allan Poindexter. The students are translating NRAO Web pages aimed at the general public. These pages cover the basics of radio astronomy and frequently-asked questions about NRAO and the scientific research done with NRAO's telescopes. "Writing about science for non-technical audiences has to be done carefully. Scientific concepts must be presented in terms that are understandable to non-scientists but also that remain scientifically accurate," Dyer said. "When translating this type of writing from one language to another, we need to preserve both the understandability and the accuracy," she added. For that reason, Dyer recruited 14 Spanish-speaking astronomers from Argentina, Mexico and the U.S. to help verify the scientific accuracy of the Spanish translations. The astronomers will review the translations. The project is giving the students a broad range of experience. "They are

  14. Machine Translation Based on Translation Corresponding Tree Structure

    Institute of Scientific and Technical Information of China (English)

    2006-01-01

    A representation schema called translation corresponding tree (TCT) has been applied to a Portuguese to Chinese example-based machine translation system. The translation examples are annotated by the representation of the TCT structure. Each TCT describes not only the syntactic structure of the source sentence (i.e., Portuguese in our system) but also the translation correspondences (i.e., Chinese translation). In addition, the TCT nodes describe the corresponding linguistic relationships between the source and target languages. The translation examples can be effectively represented with this annotation schema and organized in the bilingual knowledge database or example base. In the real machine translation process, the target language is synthesized with higher quality by referring to the TCT translation information.

  15. Discriminative syntactic reranking for statistical machine translation

    NARCIS (Netherlands)

    Carter, S.; Monz, C.

    2010-01-01

    This paper describes a method that successfully exploits simple syntactic features for n-best translation candidate reranking using perceptrons. Our approach uses discriminative language modelling to rerank the n-best translations generated by a statistical machine translation system. The performanc

  16. Fundamentals of Machine Learning for Neural Machine Translation

    OpenAIRE

    Kelleher, John

    2016-01-01

    This paper presents a short introduction to neural networks and how they are used for machine translation and concludes with some discussion on the current research challenges being addressed by neural machine translation (NMT) research. The primary goal of this paper is to give a no-tears introduction to NMT to readers that do not have a computer science or mathematical background. The secondary goal is to provide the reader with a deep enough understanding of NMT that they can appreciate th...

  17. Using example-based machine translation to translate DVD subtitles

    DEFF Research Database (Denmark)

    Flanagan, Marian

    Audiovisual Translation (AVT), and in particular subtitling, has been recognised as an area that could potentially benefit from the introduction of machine translation (followed by post-editing). In recent years the demands on subtitlers have increased, while the payment to subtitlers and time al...

  18. Machine Learning Feature Selection for Tuning Memory Page Swapping

    Science.gov (United States)

    2013-09-01

    erroneous and generally results in useful pages being paged out too early, only to be paged back in shortly there after. [1] The first in/first out ( FIFO ...the tail of the queue are selected. This algorithm has been shown to have significant shortcomings. When using a FIFO PRA, it is possible to encounter a...page which was just paged out. FIFO is therefore, a sub-optimal page replacement algorithm. Least recently used (LRU) is incredibly simple in concept

  19. Project-specific Machine Translation

    Science.gov (United States)

    2011-12-01

    Linguistic Data Consortium’s Reflex Pack for Less Commonly Taught Languages (LCTL) (7). The corpora labeled Sada -e-Azadi (SeA) (11) and Afghan Recovery...Linguistics, Philadelphia, PA, 2002, 311–318. 11. Sada -e Azadi. http://www.sada-e-azadi.net (accessed 2011). 12. Stolcke, A. SRILM - An Extensible...Translation Center QAMO Qamoosuna (dictionaries) RH Ranger Handbook SCFG Synchronous Context Free Grammar SeA Sada -e-Azadi SME subject matter expert SMT

  20. Rule-based machine translation for Aymara

    NARCIS (Netherlands)

    Coler, Matthew; Homola, Petr; Jones, Mari

    2014-01-01

    This paper presents the ongoing result of an approach developed by the collaboration of a computational linguist with a field linguist that addresses one of the oft-overlooked keys to language maintenance: the development of modern language-learning tools. Although machine translation isn’t commonly

  1. Understanding scanlation: how to read one million fan-translated manga pages

    Directory of Open Access Journals (Sweden)

    Lev Manovich

    2011-03-01

    Full Text Available This article presents the first project in digital humanities which uses digital image analysis and visualization for the study of a massive image collection - one million manga pages. These pages correspond to 883 manga series that were available as “scanlations” (manga digitized and translated by fans in the fall 2009. Using computational techniques we were able to systematically analyze the visual language of special pages inserted by fans in scanlated versions, and also study visual differences between the pages from original Japanese publications and official English translations. The result is a better understanding of the multiple "mangas" which make up the "manga universe."

  2. Findings of the 2010 Joint Workshop on Statistical Machine Translation and Metrics for Machine Translation

    NARCIS (Netherlands)

    Callison-Burch, C.; Koehn, P.; Monz, C.; Peterson, K.; Przybocki, M.; Zaidan, O.F.

    2010-01-01

    This paper presents the results of the WMT10 and MetricsMATR10 shared tasks, which included a translation task, a system combination task, and an evaluation task. We conducted a large-scale manual evaluation of 104 machine translation systems and 41 system combination entries. We used the ranking of

  3. On Using Very Large Target Vocabulary for Neural Machine Translation

    OpenAIRE

    Jean, Sébastien; Cho, Kyunghyun; Memisevic, Roland; Bengio, Yoshua

    2014-01-01

    Neural machine translation, a recently proposed approach to machine translation based purely on neural networks, has shown promising results compared to the existing approaches such as phrase-based statistical machine translation. Despite its recent success, neural machine translation has its limitation in handling a larger vocabulary, as training complexity as well as decoding complexity increase proportionally to the number of target words. In this paper, we propose a method that allows us ...

  4. Unraveling the Contribution of Image Captioning and Neural Machine Translation for Multimodal Machine Translation

    Directory of Open Access Journals (Sweden)

    Lala Chiraag

    2017-06-01

    Full Text Available Recent work on multimodal machine translation has attempted to address the problem of producing target language image descriptions based on both the source language description and the corresponding image. However, existing work has not been conclusive on the contribution of visual information. This paper presents an in-depth study of the problem by examining the differences and complementarities of two related but distinct approaches to this task: textonly neural machine translation and image captioning. We analyse the scope for improvement and the effect of different data and settings to build models for these tasks. We also propose ways of combining these two approaches for improved translation quality.

  5. Resolution of Unidentified Words in Machine Translation

    CERN Document Server

    Ullah, Sana; Kwak, Kyung Sup

    2009-01-01

    This paper presents a mechanism of resolving unidentified lexical units in text-based machine translation (TBMT). In machine translation system it is unlikely to have a complete MT lexicon and hence there is a need of a mechanism to handle the problem of unidentified words. These unknown words could be abbreviations, names, acronyms and newly introduced terms. We have proposed an algorithm for the resolution of the unidentified words. This algorithm takes discourse unit (primitive discourse) as a unit of analysis and provides real time updates to the lexicon. We have manually applied the algorithm to news paper fragments. Along with anaphora and cataphora1 resolution, many unknown words especially names and abbreviations were updated to the lexicon. Moreover flowchart of the proposed algorithm is also presented.

  6. An Evaluation of Output Quality of Machine Translation (Padideh Software vs. Google Translate

    Directory of Open Access Journals (Sweden)

    Haniyeh Sadeghi Azer

    2015-08-01

    Full Text Available This study aims to evaluate the translation quality of two machine translation systems in translating six different text-types, from English to Persian. The evaluation was based on criteria proposed by Van Slype (1979. The proposed model for evaluation is a black-box type, comparative and adequacy-oriented evaluation. To conduct the evaluation, a questionnaire was assigned to end-users to evaluate the outputs to examine and determine, if the machine-generated translations are intelligible and acceptable from their point of view and which one of the machine-generated translations produced by Padideh software and Google Translate is more acceptable and useful from the end-users point of view. The findings indicate that, the machine-generated translations are intelligible and acceptable in translating certain text-types, for end-users and Google Translate is more acceptable from end-users point of view. Keywords: Machine Translation, Machine Translation Evaluation, Translation Quality

  7. Findings of the 2014 Workshop on Statistical Machine Translation

    NARCIS (Netherlands)

    Bojar, O.; Buck, C.; Federmann, C.; Haddow, B.; Koehn, P.; Leveling, J.; Monz, C.; Pecina, P.; Post, M.; Saint-Amand, H.; Soricut, R.; Specia, L.; Tamchyna, A.

    2014-01-01

    This paper presents the results of the WMT14 shared tasks, which included a standard news translation task, a separate medical translation task, a task for run-time estimation of machine translation quality, and a metrics task. This year, 143 machine translation systems from 23 institutions were sub

  8. Findings of the 2011 workshop on statistical machine translation

    NARCIS (Netherlands)

    Callison-Burch, C.; Koehn, P.; Monz, C.; Zaidan, O.F.

    2011-01-01

    This paper presents the results of the WMT11 shared tasks, which included a translation task, a system combination task, and a task for machine translation evaluation metrics. We conducted a large-scale manual evaluation of 148 machine translation systems and 41 system combination entries. We used t

  9. Machine translation of TV subtitles for large scale production

    OpenAIRE

    Volk, Martin; Sennrich, Rico; Hardmeier, Christian; Tidström, Frida

    2010-01-01

    This paper describes our work on building and employing Statistical Machine Translation systems for TV subtitles in Scandinavia. We have built translation systems for Danish, English, Norwegian and Swedish. They are used in daily subtitle production and translate large volumes. As an example we report on our evaluation results for three TV genres. We discuss our lessons learned in the system development process which shed interesting light on the practical use of Machine Translation technology.

  10. Controlling misses and false alarms in a machine learning framework for predicting uniformity of printed pages

    Science.gov (United States)

    Nguyen, Minh Q.; Allebach, Jan P.

    2015-01-01

    In our previous work1 , we presented a block-based technique to analyze printed page uniformity both visually and metrically. The features learned from the models were then employed in a Support Vector Machine (SVM) framework to classify the pages into one of the two categories of acceptable and unacceptable quality. In this paper, we introduce a set of tools for machine learning in the assessment of printed page uniformity. This work is primarily targeted to the printing industry, specifically the ubiquitous laser, electrophotographic printer. We use features that are well-correlated with the rankings of expert observers to develop a novel machine learning framework that allows one to achieve the minimum "false alarm" rate, subject to a chosen "miss" rate. Surprisingly, most of the research that has been conducted on machine learning does not consider this framework. During the process of developing a new product, test engineers will print hundreds of test pages, which can be scanned and then analyzed by an autonomous algorithm. Among these pages, most may be of acceptable quality. The objective is to find the ones that are not. These will provide critically important information to systems designers, regarding issues that need to be addressed in improving the printer design. A "miss" is defined to be a page that is not of acceptable quality to an expert observer that the prediction algorithm declares to be a "pass". Misses are a serious problem, since they represent problems that will not be seen by the systems designers. On the other hand, "false alarms" correspond to pages that an expert observer would declare to be of acceptable quality, but which are flagged by the prediction algorithm as "fails". In a typical printer testing and development scenario, such pages would be examined by an expert, and found to be of acceptable quality after all. "False alarm" pages result in extra pages to be examined by expert observers, which increases labor cost. But "false

  11. Evaluation of Hindi to Punjabi Machine Translation System

    CERN Document Server

    Goyal, Vishal

    2009-01-01

    Machine Translation in India is relatively young. The earliest efforts date from the late 80s and early 90s. The success of every system is judged from its evaluation experimental results. Number of machine translation systems has been started for development but to the best of author knowledge, no high quality system has been completed which can be used in real applications. Recently, Punjabi University, Patiala, India has developed Punjabi to Hindi Machine translation system with high accuracy of about 92%. Both the systems i.e. system under question and developed system are between same closely related languages. Thus, this paper presents the evaluation results of Hindi to Punjabi machine translation system. It makes sense to use same evaluation criteria as that of Punjabi to Hindi Punjabi Machine Translation System. After evaluation, the accuracy of the system is found to be about 95%.

  12. An efficient scheme for automatic web pages categorization using the support vector machine

    Science.gov (United States)

    Bhalla, Vinod Kumar; Kumar, Neeraj

    2016-07-01

    In the past few years, with an evolution of the Internet and related technologies, the number of the Internet users grows exponentially. These users demand access to relevant web pages from the Internet within fraction of seconds. To achieve this goal, there is a requirement of an efficient categorization of web page contents. Manual categorization of these billions of web pages to achieve high accuracy is a challenging task. Most of the existing techniques reported in the literature are semi-automatic. Using these techniques, higher level of accuracy cannot be achieved. To achieve these goals, this paper proposes an automatic web pages categorization into the domain category. The proposed scheme is based on the identification of specific and relevant features of the web pages. In the proposed scheme, first extraction and evaluation of features are done followed by filtering the feature set for categorization of domain web pages. A feature extraction tool based on the HTML document object model of the web page is developed in the proposed scheme. Feature extraction and weight assignment are based on the collection of domain-specific keyword list developed by considering various domain pages. Moreover, the keyword list is reduced on the basis of ids of keywords in keyword list. Also, stemming of keywords and tag text is done to achieve a higher accuracy. An extensive feature set is generated to develop a robust classification technique. The proposed scheme was evaluated using a machine learning method in combination with feature extraction and statistical analysis using support vector machine kernel as the classification tool. The results obtained confirm the effectiveness of the proposed scheme in terms of its accuracy in different categories of web pages.

  13. An Overall Perspective of Machine Translation with its Shortcomings

    Directory of Open Access Journals (Sweden)

    Alireza Akbari

    2014-01-01

    Full Text Available The petition for language translation has strikingly augmented recently due to cross-cultural communication and exchange of information. In order to communicate well, text should be translated correctly and completely in each field such as legal documents, technical texts, scientific texts, publicity leaflets, and instructional materials. In this connection, Machine translation is of great importance in translation. The term “Machine Translation” was first proposed by George Artsrouni and Smirnov Troyanski (1933 to design a storage design on paper tape. This paper sought to investigate an overall perspective of Machine Translation models and its metrics in detail. Finally, it scrutinized the ins and outs shortcomings of Machine Translation

  14. Quantum Neural Network Based Machine Translator for Hindi to English

    OpenAIRE

    Ravi Narayan; V. P. Singh; S. Chakraverty

    2014-01-01

    This paper presents the machine learning based machine translation system for Hindi to English, which learns the semantically correct corpus. The quantum neural based pattern recognizer is used to recognize and learn the pattern of corpus, using the information of part of speech of individual word in the corpus, like a human. The system performs the machine translation using its knowledge gained during the learning by inputting the pair of sentences of Devnagri-Hindi and English. To analyze t...

  15. Quantum Neural Network Based Machine Translator for Hindi to English

    OpenAIRE

    Ravi Narayan; Singh, V. P.; S. Chakraverty

    2014-01-01

    This paper presents the machine learning based machine translation system for Hindi to English, which learns the semantically correct corpus. The quantum neural based pattern recognizer is used to recognize and learn the pattern of corpus, using the information of part of speech of individual word in the corpus, like a human. The system performs the machine translation using its knowledge gained during the learning by inputting the pair of sentences of Devnagri-Hindi and English. To analyze t...

  16. LSTM Neural Reordering Feature for Statistical Machine Translation

    OpenAIRE

    Cui, Yiming; Wang, Shijin; Li, Jianfeng

    2015-01-01

    Artificial neural networks are powerful models, which have been widely applied into many aspects of machine translation, such as language modeling and translation modeling. Though notable improvements have been made in these areas, the reordering problem still remains a challenge in statistical machine translations. In this paper, we present a novel neural reordering model that directly models word pairs and alignment. By utilizing LSTM recurrent neural networks, much longer context could be ...

  17. Machine Translation as a Complex System: The Role of Esperanto

    Directory of Open Access Journals (Sweden)

    Federic Gobbo

    2015-04-01

    Full Text Available The history of machine translation and the history of Esperanto have long been connected, as they are two different ways to deal with the same problem: the problem of communication across language barriers. Language can be considered a Complex Adaptive System (CAS, and machine translation too. In fact, there are multiple agents (both natural and artificial involved, interacting with one another and committed to achieve a common goal, i.e., the machine translation task. The main characteristics of language as a CAS are also shared in machine translation, especially if we consider the example-based, statistical approach, which is nowadays paradigmatic and unavoidable. In fact, control is distributed, there is no ideal representing agent (intrinsic diversity, there are perpetual dynamics in performance, adapted through amplification and competition of new examples from the crowd of users. On the other hand, Esperanto, being a living language, can be considered a CAS, but of a special kind, because its intrinsic regularity in structure simplifies the task of machine translation, at least up to a certain level. This paper reviews how Esperanto has enhanced the development of human-machine communication in general and within machine translation in particular, tracing some prospects for further development of machine translation, where Esperanto could play a key role.

  18. Exact Decoding for Phrase-Based Statistical Machine Translation

    NARCIS (Netherlands)

    Aziz, W.; Dymetman, M.; Specia, L.

    2014-01-01

    The combinatorial space of translation derivations in phrase-based statistical machine translation is given by the intersection between a translation lattice and a target language model. We replace this intractable intersection by a tractable relaxation which incorporates a low-order upperbound on t

  19. Machine Translation in the German Classroom: Detection, Reaction, Prevention

    Science.gov (United States)

    Steding, Soren

    2009-01-01

    There are many websites today that offer free machine translations and although beginning students of German are not always proficient enough to judge the quality of these translations or to fully understand certain translation results, they use these services nonetheless for their assignments. The problem for the educator is to distinguish…

  20. Exploration and exploitation of multilingual data for statistical machine translation

    NARCIS (Netherlands)

    Carter, S.C.

    2012-01-01

    Shortly after the birth of computer science, researchers realised the importance of machine translation as a task worth of concentrated effort, but it is only recently that algorithms are able to provide automatic translations usable by the masses. Modern translation systems are dependent on bilingu

  1. Quantum Neural Network Based Machine Translator for Hindi to English

    Directory of Open Access Journals (Sweden)

    Ravi Narayan

    2014-01-01

    Full Text Available This paper presents the machine learning based machine translation system for Hindi to English, which learns the semantically correct corpus. The quantum neural based pattern recognizer is used to recognize and learn the pattern of corpus, using the information of part of speech of individual word in the corpus, like a human. The system performs the machine translation using its knowledge gained during the learning by inputting the pair of sentences of Devnagri-Hindi and English. To analyze the effectiveness of the proposed approach, 2600 sentences have been evaluated during simulation and evaluation. The accuracy achieved on BLEU score is 0.7502, on NIST score is 6.5773, on ROUGE-L score is 0.9233, and on METEOR score is 0.5456, which is significantly higher in comparison with Google Translation and Bing Translation for Hindi to English Machine Translation.

  2. Quantum neural network based machine translator for Hindi to English.

    Science.gov (United States)

    Narayan, Ravi; Singh, V P; Chakraverty, S

    2014-01-01

    This paper presents the machine learning based machine translation system for Hindi to English, which learns the semantically correct corpus. The quantum neural based pattern recognizer is used to recognize and learn the pattern of corpus, using the information of part of speech of individual word in the corpus, like a human. The system performs the machine translation using its knowledge gained during the learning by inputting the pair of sentences of Devnagri-Hindi and English. To analyze the effectiveness of the proposed approach, 2600 sentences have been evaluated during simulation and evaluation. The accuracy achieved on BLEU score is 0.7502, on NIST score is 6.5773, on ROUGE-L score is 0.9233, and on METEOR score is 0.5456, which is significantly higher in comparison with Google Translation and Bing Translation for Hindi to English Machine Translation.

  3. Syntactic Reordering for Arabic- English Phrase-Based Machine Translation

    Science.gov (United States)

    Hatem, Arwa; Omar, Nazlia

    Machine Translation (MT) refers to the use of a machine for performing translation task which converts text or speech in one Natural Language (Source Language (SL)) into another Natural Language (Target Language (TL)). The translation from Arabic to English is difficult task due to the Arabic languages are highly inflectional, rich morphology and relatively free word order. Word ordering plays an important part in the translation process. The paper proposes a transfer-based approach in Arabic to English MT to handle the word ordering problem. Preliminary tested indicate that our system, AE-TBMT is competitive when compared against other approaches from the literature.

  4. Machine Translation of Noun Phrases from English to Igala using ...

    African Journals Online (AJOL)

    PROF. OLIVER OSUAGWA

    2014-06-01

    Jun 1, 2014 ... for easier processing of concepts This clearly shows that ... research organizations and government agencies to .... first hybrid machine translation in 2009. English and ... sentence structure in the target language. There is no ...

  5. Web Page Classification using an ensemble of support vector machine classifiers

    Directory of Open Access Journals (Sweden)

    Shaobo Zhong

    2011-11-01

    Full Text Available Web Page Classification (WPC is both an important and challenging topic in data mining. The knowledge of WPC can help users to obtain useable information from the huge internet dataset automatically and efficiently. Many efforts have been made to WPC. However, there is still room for improvement of current approaches. One particular challenge in training classifiers comes from the fact that the available dataset is usually unbalanced. Standard machine learning algorithms tend to be overwhelmed by the major class and ignore the minor one and thus lead to high false negative rate. In this paper, a novel approach for Web page classification was proposed to address this problem by using an ensemble of support vector machine classifiers to perform this work. Principal Component Analysis (PCA is used for feature reduction and Independent Component Analysis (ICA for feature selection. The experimental results indicate that the proposed approach outperforms other existing classifiers widely used in WPC.

  6. ARABIC-MALAY MACHINE TRANSLATION USING RULE-BASED APPROACH

    Directory of Open Access Journals (Sweden)

    Ahmed Jumaa Alsaket

    2014-01-01

    Full Text Available Arabic machine translation has been taking place in machine translation projects in recent years. This study concentrates on the translation of Arabic text to its equivalent in Malay language. The problem of this research is the syntactic and morphological differences between Arabic and Malay adjective sentences. The main aim of this study is to design and develop Arabic-Malay machine translation model. First, we analyze the adjective role in the Arabic and Malay languages. Based on this analysis, we identify the transfer bilingual rules form source language to target language so that the translation of source language to target language can be performed by computers successfully. Then, we build and implement a machine translation prototype called AMTS to translate from Arabic to Malay based on rule based approach. The system is evaluated on set of simple Arabic sentences. The techniques used to evaluate the correctness of the system translation are the BLEU metric algorithm and the human judgment. The results of the BLEU algorithm show that the AMTS system performs better than Google in the translation of Arabic sentences into Malay. In addition, the average accuracy given by human judges is 92.3% for our system and 75.3% for Google.

  7. An Evaluation of Output Quality of Machine Translation (Padideh Software vs. Google Translate)

    Science.gov (United States)

    Azer, Haniyeh Sadeghi; Aghayi, Mohammad Bagher

    2015-01-01

    This study aims to evaluate the translation quality of two machine translation systems in translating six different text-types, from English to Persian. The evaluation was based on criteria proposed by Van Slype (1979). The proposed model for evaluation is a black-box type, comparative and adequacy-oriented evaluation. To conduct the evaluation, a…

  8. Bean Soup Translation: Flexible, Linguistically-Motivated Syntax for Machine Translation

    Science.gov (United States)

    Mehay, Dennis Nolan

    2012-01-01

    Machine translation (MT) systems attempt to translate texts from one language into another by translating words from a "source language" and rearranging them into fluent utterances in a "target language." When the two languages organize concepts in very different ways, knowledge of their general sentence structure, or…

  9. Machine Translation-Assisted Language Learning: Writing for Beginners

    Science.gov (United States)

    Garcia, Ignacio; Pena, Maria Isabel

    2011-01-01

    The few studies that deal with machine translation (MT) as a language learning tool focus on its use by advanced learners, never by beginners. Yet, freely available MT engines (i.e. Google Translate) and MT-related web initiatives (i.e. Gabble-on.com) position themselves to cater precisely to the needs of learners with a limited command of a…

  10. Chunk Alignment for Corpus-Based Machine Translation

    Science.gov (United States)

    Kim, Jae Dong

    2011-01-01

    Since sub-sentential alignment is critically important to the translation quality of an Example-Based Machine Translation (EBMT) system, which operates by finding and combining phrase-level matches against the training examples, we developed a new alignment algorithm for the purpose of improving the EBMT system's performance. This new…

  11. Machine Translation with Many Manually Labeled Discourse Connectives

    OpenAIRE

    Meyer, Thomas; Polakova, Lucie

    2013-01-01

    The paper presents machine translation experiments from English to Czech with a large amount of manually annotated discourse connectives. The gold-standard discourse relation annotation leads to better translation performance in ranges of 4–60% for some ambiguous English connectives and helps to find correct syntactical constructs in Czech for less ambiguous connectives. Automatic scoring confirms the stability of the newly built discourse-aware translation systems. Error analysis and human t...

  12. Approaches to handle scarce resources for Bengali statistical machine translation

    OpenAIRE

    Roy, Maxim

    2010-01-01

    Machine translation (MT) is a hard problem because of the highly complex, irregular and diverse nature of natural language. MT refers to computerized systems that utilize software to translate text from one natural language into another with or without human assistance. It is impossible to accurately model all the linguistic rules and relationships that shape the translation process, and therefore MT has to make decisions based on incomplete data. In order to handle this incomplete data, a pr...

  13. Integrating Automatic Speech Recognition and Machine Translation for Better Translation Outputs

    DEFF Research Database (Denmark)

    Liyanapathirana, Jeevanthi

    than typing, making the translation process faster. The spoken translation is analyzed and combined with the machine translation output of the same sentence using different methods. We study a number of different translation models in the context of n-best list rescoring methods. As an alternative...... to the n-best list rescoring, we also use word graphs with the expectation of arriving at a tighter integration of ASR and MT models. Integration methods include constraining ASR models using language and translation models of MT, and vice versa. We currently develop and experiment different methods...... on the Danish – English language pair, with the use of a speech corpora and parallel text. The methods are investigated to check ways that the accuracy of the spoken translation of the translator can be increased with the use of machine translation outputs, which would be useful for potential computer...

  14. Linguistically motivated statistical machine translation models and algorithms

    CERN Document Server

    Xiong, Deyi

    2015-01-01

    This book provides a wide variety of algorithms and models to integrate linguistic knowledge into Statistical Machine Translation (SMT). It helps advance conventional SMT to linguistically motivated SMT by enhancing the following three essential components: translation, reordering and bracketing models. It also serves the purpose of promoting the in-depth study of the impacts of linguistic knowledge on machine translation. Finally it provides a systematic introduction of Bracketing Transduction Grammar (BTG) based SMT, one of the state-of-the-art SMT formalisms, as well as a case study of linguistically motivated SMT on a BTG-based platform.

  15. Precise machine translation of computer science study

    CSIR Research Space (South Africa)

    Marais, L

    2015-07-01

    Full Text Available mobile (Android) application for translating discrete mathematics definitions between English and Afrikaans. The main component of the system is a Grammatical Framework (GF) application grammar which produces syntactically and semantically accurate...

  16. Machine Translation in the Hands of Trainee Translators – an Empirical Study

    Directory of Open Access Journals (Sweden)

    Sycz-Opoń Joanna

    2017-03-01

    Full Text Available Automated translation (machine translation, MT is systematically gaining popularity among professional translators, who claim that editing MT output requires less time and effort than translating from scratch. MT technology is also offered in leading translator’s workstations, e.g., SDL Trados Studio, memoQ, Déjà Vu and Wordfast. Therefore, the dilemma arises: should MT be introduced into formal translation training? In order to answer this question, first, it is necessary to understand how trainee translators actually use MT.

  17. Machine Translation Using Constraint-Based Synchronous Grammar

    Institute of Scientific and Technical Information of China (English)

    WONG Fai; DONG Mingchui; HU Dongcheng

    2006-01-01

    A synchronous grammar based on the formalism of context-free grammar was developed by generalizing the first component of production that models the source text. Unlike other synchronous grammars,the grammar allows multiple target productions to be associated to a single production rule which can be used to guide a parser to infer different possible translational equivalences for a recognized input string according to the feature constraints of symbols in the pattern. An extended generalized LR algorithm was adapted to the parsing of the proposed formalism to analyze the syntactic structure of a language. The grammar was used as the basis for building a machine translation system for Portuguese to Chinese translation. The empirical results show that the grammar is more expressive when modeling the translational equivalences of parallel texts for machine translation and grammar rewriting applications.

  18. English to Sanskrit Machine Translation Using Transfer Based approach

    Science.gov (United States)

    Pathak, Ganesh R.; Godse, Sachin P.

    2010-11-01

    Translation is one of the needs of global society for communicating thoughts and ideas of one country with other country. Translation is the process of interpretation of text meaning and subsequent production of equivalent text, also called as communicating same meaning (message) in another language. In this paper we gave detail information on how to convert source language text in to target language text using Transfer Based Approach for machine translation. Here we implemented English to Sanskrit machine translator using transfer based approach. English is global language used for business and communication but large amount of population in India is not using and understand the English. Sanskrit is ancient language of India most of the languages in India are derived from Sanskrit. Sanskrit can be act as an intermediate language for multilingual translation.

  19. English to Tamil machine translation system using universal networking language

    Indian Academy of Sciences (India)

    RAJESWARI SRIDHAR; PAVITHRA SETHURAMAN; KASHYAP KRISHNAKUMAR

    2016-06-01

    This paper proposes English to Tamil machine translation system, using the universal networking language (UNL) as the intermediate representation. The UNL approach is a hybrid approach of the rule and knowledge-based approaches to machine translation. UNL is a declarative formal language, specifically designed to represent semantic data extracted from a natural language text. The input English sentence is converted to UNL (enconversion), which is then converted to a Tamil sentence (deconversion) by ensuring thatthe meaning of the input sentence is preserved. The representation of UNL was modified to suit the translation process. A new sentence formation algorithm was also proposed to rearrange the translated Tamil words to sentences. The translation system was evaluated using bilingual evaluation understudy (BLEU) score. A BLEU score of 0.581 was achieved, which is an indication that most of the information in the input sentence is retained in the translated sentence. The scores obtained using the UNL based approach were compared with existingapproaches to translation, and it can be concluded that the UNL is a more suited approach to machine translation.

  20. Using example-based machine translation to translate DVD subtitles

    DEFF Research Database (Denmark)

    Flanagan, Marian

    allotted to produce the subtitles have both decreased. Therefore, this market is recognised as a potential real-world application of MT. Recent publications have introduced Corpus-Based MT approaches to translate subtitles. An SMT system has been implemented in a Swedish subtitling company to translate...... between Swedish and Danish and Swedish and Norwegian subtitles, with the company already reporting a successful return on their investment. The hybrid EBMT/SMT system used in the current research, on the other hand, remains within the confines of academic research, and the real potential of the system...... more accessible for use within the AVT industry, and to put the system in a position where it could compete commercially with the SMT approach....

  1. TRANSLATOR OF FINITE STATE MACHINE MODEL PARAMETERS FROM MATLAB ENVIRONMENT INTO HUMAN-MACHINE INTERFACE APPLICATION

    OpenAIRE

    2012-01-01

    Technology and means for automatic translation of FSM model parameters from Matlab application to human-machine interface application is proposed. The example of technology application to the electric apparatus model is described.

  2. Machine translation with minimal reliance on parallel resources

    CERN Document Server

    Tambouratzis, George; Sofianopoulos, Sokratis

    2017-01-01

    This book provides a unified view on a new methodology for Machine Translation (MT). This methodology extracts information from widely available resources (extensive monolingual corpora) while only assuming the existence of a very limited parallel corpus, thus having a unique starting point to Statistical Machine Translation (SMT). In this book, a detailed presentation of the methodology principles and system architecture is followed by a series of experiments, where the proposed system is compared to other MT systems using a set of established metrics including BLEU, NIST, Meteor and TER. Additionally, a free-to-use code is available, that allows the creation of new MT systems. The volume is addressed to both language professionals and researchers. Prerequisites for the readers are very limited and include a basic understanding of the machine translation as well as of the basic tools of natural language processing.

  3. Good Applications for Crummy Machine Translation

    Science.gov (United States)

    1991-07-01

    multilingual 16 computational linguistics __.__n _____ODE 16. PRICE CODE 17. SECURITY CLASSIFICTION 18. SECURITY CLASSIFICATION 19. SECURITY...relatively few keystrokes. Peter Brown (personal communication) once remarked that such a super-fast typewriter ought to be possible in the monolingual case... Multilingual System under Development, Computational Linguistics, 11:2-3, pp. 155-169. [81 Kay, M. (1980) "The Proper Place of Men and Machines in Language

  4. Translating DVD Subtitles English-German, English-Japanese, Using Example-based Machine Translation

    DEFF Research Database (Denmark)

    Armstrong, Stephen; Caffrey, Colm; Flanagan, Marian

    2006-01-01

    Due to limited budgets and an ever-diminishing time-frame for the production of subtitles for movies released in cinema and DVD, there is a compelling case for a technology-based translation solution for subtitles. In this paper we describe how an Example-Based Machine Translation (EBMT) approach...

  5. Transliteration normalization for Information Extraction and Machine Translation

    Directory of Open Access Journals (Sweden)

    Yuval Marton

    2014-12-01

    Full Text Available Foreign name transliterations typically include multiple spelling variants. These variants cause data sparseness and inconsistency problems, increase the Out-of-Vocabulary (OOV rate, and present challenges for Machine Translation, Information Extraction and other natural language processing (NLP tasks. This work aims to identify and cluster name spelling variants using a Statistical Machine Translation method: word alignment. The variants are identified by being aligned to the same “pivot” name in another language (the source-language in Machine Translation settings. Based on word-to-word translation and transliteration probabilities, as well as the string edit distance metric, names with similar spellings in the target language are clustered and then normalized to a canonical form. With this approach, tens of thousands of high-precision name transliteration spelling variants are extracted from sentence-aligned bilingual corpora in Arabic and English (in both languages. When these normalized name spelling variants are applied to Information Extraction tasks, improvements over strong baseline systems are observed. When applied to Machine Translation tasks, a large improvement potential is shown.

  6. Statistical Machine Translation Features with Multitask Tensor Networks

    OpenAIRE

    Setiawan, Hendra; Huang, Zhongqiang; Devlin, Jacob; Lamar, Thomas; Zbib, Rabih; Schwartz, Richard; Makhoul, John

    2015-01-01

    We present a three-pronged approach to improving Statistical Machine Translation (SMT), building on recent success in the application of neural networks to SMT. First, we propose new features based on neural networks to model various non-local translation phenomena. Second, we augment the architecture of the neural network with tensor layers that capture important higher-order interaction among the network units. Third, we apply multitask learning to estimate the neural network parameters joi...

  7. Neural Networks Classifier for Data Selection in Statistical Machine Translation

    OpenAIRE

    Peris, Álvaro; Chinea-Rios, Mara; Casacuberta, Francisco

    2016-01-01

    We address the data selection problem in statistical machine translation (SMT) as a classification task. The new data selection method is based on a neural network classifier. We present a new method description and empirical results proving that our data selection method provides better translation quality, compared to a state-of-the-art method (i.e., Cross entropy). Moreover, the empirical results reported are coherent across different language pairs.

  8. A Temporal Locality-Aware Page-Mapped Flash Translation Layer

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Youngjae [ORNL; Gupta, Aayush [IBM Corporation, Almaden Research Center; Urgaonkar, Bhuvan [Pennsylvania State University

    2013-01-01

    The poor performance of random writes has been a cause of major concern which needs to be addressed to better utilize the potential of flash in enterprise-scale environments. We examine one of the important causes of this poor performance: the design of the flash translation layer (FTL) which performs the virtual-to-physical address translations and hides the erase-before-write characteristics of flash. We propose a complete paradigm shift in the design of the core FTL engine from the existing techniques with our Demand-Based Flash Translation Layer (DFTL) which selectively caches page- level address mappings. Our experimental evaluation using FlashSim with realistic enterprise-scale workloads endorses the utility of DFTL in enterprise-scale storage systems by demonstrating: 1) improved performance, 2) reduced garbage collection overhead and 3) better overload behavior compared with hybrid FTL schemes which are the most popular implementation methods. For example, a predominantly random-write dominant I/O trace from an OLTP application running at a large financial institution shows a 78% improvement in average response time (due to a 3-fold reduction in operations of the garbage collector), compared with the hybrid FTL scheme. Even for the well-known read-dominant TPC-H benchmark, for which DFTL introduces additional overheads, we improve system response time by 56%. Moreover, interestingly, when write-back cache on DFTL-based SSD is enabled, DFTL even outperforms the page-based FTL scheme, improving their response time by 72% in Financial trace.

  9. Improving a Japanese-Spanish Machine Translation System Using Wikipedia Medical Articles

    Directory of Open Access Journals (Sweden)

    Jessica C. Ramirez

    2015-02-01

    Full Text Available The quality, length and coverage of a parallel corp us are fundamental features in the performance of a Statistical Machine Translation Sy stem (SMT. For some pair of languages there is a considerable lack of resources suitable for Natural Language Processing tasks. This paper introduces a technique for extracting medical information from the Wikipedia page. Using a medical ontological dictionary and then we evaluate on a Japanese-Spanish SMT system. The study shows an increment in the BLEU sc ore.

  10. On the Properties of Neural Machine Translation: Encoder-Decoder Approaches

    OpenAIRE

    Cho, Kyunghyun; van Merrienboer, Bart; Bahdanau, Dzmitry; Bengio, Yoshua

    2014-01-01

    Neural machine translation is a relatively new approach to statistical machine translation based purely on neural networks. The neural machine translation models often consist of an encoder and a decoder. The encoder extracts a fixed-length representation from a variable-length input sentence, and the decoder generates a correct translation from this representation. In this paper, we focus on analyzing the properties of the neural machine translation using two models; RNN Encoder--Decoder and...

  11. Complex Networks Analysis of Manual and Machine Translations

    Science.gov (United States)

    Amancio, Diego R.; Antiqueira, Lucas; Pardo, Thiago A. S.; da F. Costa, Luciano; Oliveira, Osvaldo N.; Nunes, Maria G. V.

    Complex networks have been increasingly used in text analysis, including in connection with natural language processing tools, as important text features appear to be captured by the topology and dynamics of the networks. Following previous works that apply complex networks concepts to text quality measurement, summary evaluation, and author characterization, we now focus on machine translation (MT). In this paper we assess the possible representation of texts as complex networks to evaluate cross-linguistic issues inherent in manual and machine translation. We show that different quality translations generated by MT tools can be distinguished from their manual counterparts by means of metrics such as in- (ID) and out-degrees (OD), clustering coefficient (CC), and shortest paths (SP). For instance, we demonstrate that the average OD in networks of automatic translations consistently exceeds the values obtained for manual ones, and that the CC values of source texts are not preserved for manual translations, but are for good automatic translations. This probably reflects the text rearrangements humans perform during manual translation. We envisage that such findings could lead to better MT tools and automatic evaluation metrics.

  12. Foreign Developments in Information Processing and Machine Translation

    Science.gov (United States)

    2007-11-02

    on developments in the following fields of language data processing: Machine translation studies; questions on structural linguistics, phonological ...34katakana" or in phonetic syllables without the use of any "kanji" or Chinese characters, the "kun"- pronunciation of Chinese characters was adopted as

  13. Human in the Loop Machine Translation of Medical Terminology

    Science.gov (United States)

    2010-04-01

    March 30–31, 2009, http://www.aclweb.org/anthology/W/W09/W09-0x24. 3. Chiang, D.; Lopez, A.; Madnani, N.; Monz, C.; Resnik , P.; Subotin, M. The Hiero... Resnik , P. S. Machine Translation by Pattern Matching; University of Maryland at College Park, College Park, MD, 2008, ISBN: 978-0-549-57255-8

  14. Translating DVD Subtitles English-German, English-Japanese, Using Example-based Machine Translation

    DEFF Research Database (Denmark)

    Armstrong, Stephen; Caffrey, Colm; Flanagan, Marian

    2006-01-01

    Due to limited budgets and an ever-diminishing time-frame for the production of subtitles for movies released in cinema and DVD, there is a compelling case for a technology-based translation solution for subtitles. In this paper we describe how an Example-Based Machine Translation (EBMT) approach...... to the translation of English DVD subtitles into German and Japanese can aid the subtitler. Our research focuses on an EBMT tool that produces fully automated translations, which in turn can be edited if required. To our knowledge this is the first time that any EBMT approach has been used with DVD subtitle...

  15. JTEC panel report on machine translation in Japan

    Science.gov (United States)

    Carbonell, Jaime; Rich, Elaine; Johnson, David; Tomita, Masaru; Vasconcellos, Muriel; Wilks, Yorick

    1992-01-01

    The goal of this report is to provide an overview of the state of the art of machine translation (MT) in Japan and to provide a comparison between Japanese and Western technology in this area. The term 'machine translation' as used here, includes both the science and technology required for automating the translation of text from one human language to another. Machine translation is viewed in Japan as an important strategic technology that is expected to play a key role in Japan's increasing participation in the world economy. MT is seen in Japan as important both for assimilating information into Japanese as well as for disseminating Japanese information throughout the world. Most of the MT systems now available in Japan are transfer-based systems. The majority of them exploit a case-frame representation of the source text as the basis of the transfer process. There is a gradual movement toward the use of deeper semantic representations, and some groups are beginning to look at interlingua-based systems.

  16. A Study on Automatic Scoring for Machine Translation Systems

    Institute of Scientific and Technical Information of China (English)

    Yao Jianmin(姚建民); Zhang Jing; Zhao Tiejun; Li Sheng

    2004-01-01

    String similarity measures of edit distance, cosine correlation and Dice coefficient are adopted to evaluate machine translation results. Experiment shows that the evaluation method distinguishes well between "good" and "bad" translations. Another experiment manifests a consistency between human and automatic scorings of 6 general-purpose MT systems. Equational analysis validates the experimental results. Although the data and graphs are very promising, correlation coefficient and significance tests at 0.01 level are made to ensure the reliability of the results. Linear regression is made to map the automatic scoring results to human scorings.

  17. Empirical Investigation of Optimization Algorithms in Neural Machine Translation

    Directory of Open Access Journals (Sweden)

    Bahar Parnia

    2017-06-01

    Full Text Available Training neural networks is a non-convex and a high-dimensional optimization problem. In this paper, we provide a comparative study of the most popular stochastic optimization techniques used to train neural networks. We evaluate the methods in terms of convergence speed, translation quality, and training stability. In addition, we investigate combinations that seek to improve optimization in terms of these aspects. We train state-of-the-art attention-based models and apply them to perform neural machine translation. We demonstrate our results on two tasks: WMT 2016 En→Ro and WMT 2015 De→En.

  18. INTEGRATING MACHINE TRANSLATION AND SPEECH SYNTHESIS COMPONENT FOR ENGLISH TO DRAVIDIAN LANGUAGE SPEECH TO SPEECH TRANSLATION SYSTEM

    Directory of Open Access Journals (Sweden)

    J. SANGEETHA

    2015-02-01

    Full Text Available This paper provides an interface between the machine translation and speech synthesis system for converting English speech to Tamil text in English to Tamil speech to speech translation system. The speech translation system consists of three modules: automatic speech recognition, machine translation and text to speech synthesis. Many procedures for incorporation of speech recognition and machine translation have been projected. Still speech synthesis system has not yet been measured. In this paper, we focus on integration of machine translation and speech synthesis, and report a subjective evaluation to investigate the impact of speech synthesis, machine translation and the integration of machine translation and speech synthesis components. Here we implement a hybrid machine translation (combination of rule based and statistical machine translation and concatenative syllable based speech synthesis technique. In order to retain the naturalness and intelligibility of synthesized speech Auto Associative Neural Network (AANN prosody prediction is used in this work. The results of this system investigation demonstrate that the naturalness and intelligibility of the synthesized speech are strongly influenced by the fluency and correctness of the translated text.

  19. Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation

    DEFF Research Database (Denmark)

    Liyanapathirana, Jeevanthi; Popescu-Belis, Andrei

    2016-01-01

    This paper presents a solution to evaluate spoken post-editing of imperfect machine translation output by a human translator. We compare two approaches to the combination of machine translation (MT) and automatic speech recognition (ASR): a heuristic algorithm and a machine learning method...

  20. Building a Large-Scale Knowledge Base for Machine Translation

    CERN Document Server

    Knight, K; Knight, Kevin; Luk, Steve K.

    1994-01-01

    Knowledge-based machine translation (KBMT) systems have achieved excellent results in constrained domains, but have not yet scaled up to newspaper text. The reason is that knowledge resources (lexicons, grammar rules, world models) must be painstakingly handcrafted from scratch. One of the hypotheses being tested in the PANGLOSS machine translation project is whether or not these resources can be semi-automatically acquired on a very large scale. This paper focuses on the construction of a large ontology (or knowledge base, or world model) for supporting KBMT. It contains representations for some 70,000 commonly encountered objects, processes, qualities, and relations. The ontology was constructed by merging various online dictionaries, semantic networks, and bilingual resources, through semi-automatic methods. Some of these methods (e.g., conceptual matching of semantic taxonomies) are broadly applicable to problems of importing/exporting knowledge from one KB to another. Other methods (e.g., bilingual match...

  1. Countability and Number in Japanese-to-English Machine Translation

    CERN Document Server

    Bond, F; Ikehara, S; Bond, Francis; Ogura, Kentaro; Ikehara, Satoru

    1994-01-01

    This paper presents a heuristic method that uses information in the Japanese text along with knowledge of English countability and number stored in transfer dictionaries to determine the countability and number of English noun phrases. Incorporating this method into the machine translation system ALT-J/E, helped to raise the percentage of noun phrases generated with correct use of articles and number from 65% to 73%.

  2. Analysis of MultiWord Expression Translation Errors in Statistical Machine Translation

    DEFF Research Database (Denmark)

    Klyueva, Natalia; Liyanapathirana, Jeevanthi

    2015-01-01

    In this paper, we analyse the usage of multiword expressions (MWE) in Statistical Machine Translation (SMT). We exploit the Moses SMT toolkit to train models for French-English and Czech-Russian language pairs. For each language pair, two models were built: a baseline model without additional MWE...

  3. Urdu to Punjabi Machine Translation: An Incremental Training Approach

    Directory of Open Access Journals (Sweden)

    Umrinderpal Singh

    2016-04-01

    Full Text Available The statistical machine translation approach is highly popular in automatic translation research area and promising approach to yield good accuracy. Efforts have been made to develop Urdu to Punjabi statistical machine translation system. The system is based on an incremental training approach to train the statistical model. In place of the parallel sentences corpus has manually mapped phrases which were used to train the model. In preprocessing phase, various rules were used for tokenization and segmentation processes. Along with these rules, text classification system was implemented to classify input text to predefined classes and decoder translates given text according to selected domain by the text classifier. The system used Hidden Markov Model(HMM for the learning process and Viterbi algorithm has been used for decoding. Experiment and evaluation have shown that simple statistical model like HMM yields good accuracy for a closely related language pair like Urdu-Punjabi. The system has achieved 0.86 BLEU score and in manual testing and got more than 85% accuracy.

  4. RM-structure alignment based statistical machine translation model

    Institute of Scientific and Technical Information of China (English)

    Sun Jiadong; Zhao Tiejun

    2008-01-01

    A novel model based on structure alignments is proposed for statistical machine translation in this paper.Meta-structure and sequence of meta-structure for a parse tree are defined.During the translation process, a parse tree is decomposed to deal with the structure divergence and the alignments can be constructed at different levels of recombination of meta-structure (RM).This method can perform the structure mapping across the sub-tree structure between languages.As a result, we get not only the translation for the target language, but sequence of meta-structure of its parse tree at the same time.Experiments show that the model in the framework of log-linear model has better generative ability and significantly outperforms Pharaoh, a phrase-based system.

  5. Word Transition Entropy as an Indicator for Expected Machine Translation Quality

    DEFF Research Database (Denmark)

    Carl, Michael; Schaeffer, Moritz

    2014-01-01

    While most machine translation evaluation techniques (BLEU, NIST, TER, METEOR) assess translation quality based on a set of reference translations, we suggest to evaluate the literality of a set of (human or machine generated) translations to infer their potential quality. We provide evidence which...... suggests that more literal translations are produced more easily, by humans and machine, and are also less error prone. Literal translations may not be appropriate or even possible for all languages, types of texts, and translation purposes. However, in this paper we show that an assessment...... of the literality of translations allows us to (1) evaluate human and machine translations in a similar fashion and (2) may be instrumental to predict machine translation quality scores,...

  6. Machine Translation Approaches and Survey for Indian Languages

    OpenAIRE

    Khan, Nadeem Jadoon; Anwar, Waqas; Durrani, Nadir

    2017-01-01

    In this study, we present an analysis regarding the performance of the state-of-art Phrase-based Statistical Machine Translation (SMT) on multiple Indian languages. We report baseline systems on several language pairs. The motivation of this study is to promote the development of SMT and linguistic resources for these language pairs, as the current state-of-the-art is quite bleak due to sparse data resources. The success of an SMT system is contingent on the availability of a large parallel c...

  7. Classifiers in Japanese-to-English Machine Translation

    CERN Document Server

    Bond, F; Ikehara, S; Bond, Francis; Ogura, Kentaro; Ikehara, Satoru

    1996-01-01

    This paper proposes an analysis of classifiers into four major types: UNIT, METRIC, GROUP and SPECIES, based on properties of both Japanese and English. The analysis makes possible a uniform and straightforward treatment of noun phrases headed by classifiers in Japanese-to-English machine translation, and has been implemented in the MT system ALT-J/E. Although the analysis is based on the characteristics of, and differences between, Japanese and English, it is shown to be also applicable to the unrelated language Thai.

  8. Topic-aware pivot language approach for statistical machine translation

    Institute of Scientific and Technical Information of China (English)

    Jin-song SU; Xiao-dong SHI; Yan-zhou HUANG; Yang LIU; Qing-qiang WU; Yi-dong CHEN; Huai-lin DONG

    2014-01-01

    The pivot language approach for statistical machine translation (SMT) is a good method to break the resource bottleneck for certain language pairs. However, in the implementation of conventional approaches, pivot-side context information is far from fully utilized, resulting in erroneous estimations of translation probabilities. In this study, we propose two topic-aware pivot language approaches to use different levels of pivot-side context. The fi rst method takes advantage of document-level context by assuming that the bridged phrase pairs should be similar in the document-level topic distributions. The second method focuses on the effect of local context. Central to this approach are that the phrase sense can be refl ected by local context in the form of probabilistic topics, and that bridged phrase pairs should be compatible in the latent sense distributions. Then, we build an interpolated model bringing the above methods together to further enhance the system performance. Experimental results on French-Spanish and French-German translations using English as the pivot language demonstrate the effectiveness of topic-based context in pivot-based SMT.

  9. Advances in Fully-Automatic and Interactive Phrase-Based Statistical Machine Translation

    OpenAIRE

    Ortiz Martínez, Daniel

    2011-01-01

    This thesis presents different contributions in the fields of fully-automatic statistical machine translation and interactive statistical machine translation. In the field of statistical machine translation there are three problems that are to be addressed, namely, the modelling problem, the training problem and the search problem. In this thesis we present contributions regarding these three problems. Regarding the modelling problem, an alternative derivation of phrase-based s...

  10. ScaleMT: a free/open-source framework for building scalable machine translation web services

    OpenAIRE

    Sánchez-Cartagena, Víctor M.; Pérez-Ortiz, Juan Antonio

    2009-01-01

    Machine translation web services usage is growing amazingly mainly because of the translation quality and reliability of the service provided by the Google Ajax Language API. To allow the open-source machine ranslation projects to compete with Google’s one and gain visibility on the internet, we have developed ScaleMT: a free/open-source framework that exposes existing machine translation engines as public web services. This framework is highly scalable as it can run coordinately on many serv...

  11. Quantitative evaluation of English-Japanese machine translation of medical literature.

    Science.gov (United States)

    Kiuchi, T; Kaihara, S

    1991-08-01

    Although many machine-translation programs are currently available, few evaluation methods of such translation exist for any given application area. It is difficult to evaluate machine-translation systems objectively because the quality of a translation depends on the combination of three factors: the translation program, the dictionary, and the original document. In this study, we developed a quantitative evaluation method for assessing machine translation, which evaluates these three factors separately. We applied this method to the translation of English to Japanese for medical literature and the method proved to be a good indicator for further system improvement. Using this method we also discovered other important points for machine translation, such as the examination of target documents for the construction of a better application dictionary.

  12. A discourse based approach in text-based machine translation

    CERN Document Server

    Ullah, Sana; Kwak, Kyung Sup

    2009-01-01

    This paper presents a theoretical research based approach to ellipsis resolution in machine translation. Moreover, the formula of discourse is applied in order to resolve ellipses. The validity of the discourse formula is analyzed by applying it to the real world text i.e. newspaper fragments. The source text is converted into mono-sentential discourses where complex discourses require further dissection either directly into primitive discourses or first into compound discourses and later into primitive ones. The procedure of dissection needs further improvement i.e. discovering as many primitive discourse forms as possible [1]. This work is further improvement to the concepts presented by Khan (Khan, 1995). Likewise, an attempt has been made to investigate new primitive discourses i.e. patterns from the given text.

  13. Machine Translation as a complex system, and the phenomenon of Esperanto

    NARCIS (Netherlands)

    Gobbo, F.

    2015-01-01

    The history of machine translation and the history of Esperanto have long been connected, as they are two different ways to deal with the same problem: the problem of communication across language barriers. Language can be considered a Complex Adaptive System (CAS), and machine translation too. In f

  14. Exploring the Further Integration of Machine Translation in English-Chinese Cross Language Information Access

    Science.gov (United States)

    Wu, Dan; He, Daqing

    2012-01-01

    Purpose: This paper seeks to examine the further integration of machine translation technologies with cross language information access in providing web users the capabilities of accessing information beyond language barriers. Machine translation and cross language information access are related technologies, and yet they have their own unique…

  15. Impact of Machine-Translated Text on Entity and Relationship Extraction

    Science.gov (United States)

    2014-12-01

    Impact of Machine-Translated Text on Entity and Relationship Extraction by Mark R Mittrick and John T Richardson ARL-TN-0649 December...2014 Impact of Machine-Translated Text on Entity and Relationship Extraction Mark R Mittrick and John T Richardson Computational and...

  16. Machine Translation as a complex system, and the phenomenon of Esperanto

    NARCIS (Netherlands)

    Gobbo, F.

    2015-01-01

    The history of machine translation and the history of Esperanto have long been connected, as they are two different ways to deal with the same problem: the problem of communication across language barriers. Language can be considered a Complex Adaptive System (CAS), and machine translation too. In f

  17. Analysing Quality of English-Hindi Machine Translation Engine outputs using Baysian Classification

    OpenAIRE

    Rashmi Gupta; Nisheeth Joshi; Iti Mathur

    2013-01-01

    This paper considers the problem for estimating the quality of machine translation outputs which are independent of human intervention and are generally addressed using machine learning techniques.There are various measures through which a machine learns translations quality. Automatic Evaluation metrics produce good co-relation at corpus level but cannot produce the same results at the same segment or sentence level. In this paper 16 features are extracted from the input sentence...

  18. An Efficient Machine Translation System for English to Indian Languages Using Hybrid Mechanism

    Directory of Open Access Journals (Sweden)

    J. Sangeetha

    2014-08-01

    Full Text Available Machine Translation is an essential approach for localization, and is especially appropriate in a linguistically diversenation like India. Automatic translation between languages which are morphologically rich and syntactically different is generally regarded as a complex task. A number of machine translation systems have been proposed in literature. But, conventional rule-based machine translation system is costly in terms of formulating rules. It introduces inconsistencies, and it is inflexible to be robust. Statistical MT is an approach that automatically attains knowledge from a vast amount of training data. This approach is characterized by the use of machine learning techniques. But, still there is scope for better performance of the system. In this paper, a Hybrid Machine Translation (HMT approach is proposed which is the combination of rule based and statistical technique for translating text from English to Indian languages such as Tamil, Malayalam and Hindi. The rule based machine translation technique, involves the formation of rules which helps to re-order the syntactic structures of the source language sentence along with its dependency information which brings close to the structure of the target sentence. The parser identifies the syntactical elements in English sentences and suggests its Indian languages translation taking into account various grammatical forms of those Indian languages. Context Free Grammars (CFG is used in generation of the language structures, and then the errors in the translated sentences are corrected by applying a statistical technique. Simplifying and segmenting an input language text becomes mandatory in order to improve the machine translation quality. The experimental results show that the proposed approach competes with the machine translation methods reported in the literature and it provides the best translated output in each language.

  19. Domain Adaptation for Machine Translation with Instance Selection

    Directory of Open Access Journals (Sweden)

    Biçici Ergun

    2015-04-01

    Full Text Available Domain adaptation for machine translation (MT can be achieved by selecting training instances close to the test set from a larger set of instances. We consider 7 different domain adaptation strategies and answer 7 research questions, which give us a recipe for domain adaptation in MT. We perform English to German statistical MT (SMT experiments in a setting where test and training sentences can come from different corpora and one of our goals is to learn the parameters of the sampling process. Domain adaptation with training instance selection can obtain 22% increase in target 2-gram recall and can gain up to 3:55 BLEU points compared with random selection. Domain adaptation with feature decay algorithm (FDA not only achieves the highest target 2-gram recall and BLEU performance but also perfectly learns the test sample distribution parameter with correlation 0:99. Moses SMT systems built with FDA selected 10K training sentences is able to obtain F1 results as good as the baselines that use up to 2M sentences. Moses SMT systems built with FDA selected 50K training sentences is able to obtain F1 point better results than the baselines.

  20. Can multilingual machine translation help make medical record content more comprehensible to patients?

    Science.gov (United States)

    Zeng-Treitler, Qing; Kim, Hyeoneui; Rosemblat, Graciela; Keselman, Alla

    2010-01-01

    With the development of electronic personal health records, more patients are gaining access to their own medical records. However, comprehension of medical record content remains difficult for many patients. Because each record is unique, it is also prohibitively costly to employ human translators to solve this problem. In this study, we investigated whether multilingual machine translation could help make medical record content more comprehensible to patients who lack proficiency in the language of the records. We used a popular general-purpose machine translation tool called Babel Fish to translate 213 medical record sentences from English into Spanish, Chinese, Russian and Korean. We evaluated the comprehensibility and accuracy of the translation. The text characteristics of the incorrectly translated sentences were also analyzed. In each language, the majority of the translations were incomprehensible (76% to 92%) and/or incorrect (77% to 89%). The main causes of the translation are vocabulary difficulty and syntactical complexity. A general-purpose machine translation tool like the Babel Fish is not adequate for the translation of medical records; however, a machine translation tool can potentially be improved significantly, if it is trained to target certain narrow domains in medicine.

  1. Quantifying the Efficiency of a Translator: The Effect of Syntactical and Literal Written Translations on Language Comprehension Using the Machine Translation System FALCon

    Science.gov (United States)

    McCulloh, Ian A.; Morton, Jillian; Jantzi, Jennifer K.; Rodriguez, Amy M.; Graham, John

    2008-01-01

    This study introduces a new method of evaluating human comprehension in the context of machine translation using a language translation program known as the FALCon (Forward Area Language Converter). The participants include 48 freshmen from the United States Military Academy enrolled in the General Psychology course, PL100. Results of this study…

  2. Quantifying the Efficiency of a Translator: The Effect of Syntactical and Literal Written Translations on Language Comprehension Using the Machine Translation System FALCon (Foreign Area Language Converter)

    Science.gov (United States)

    McCulloh, Ian A.; Morton, Jillian; Jantzi, Jennifer K.; Rodriguez, Amy M.; Graham, John

    2008-01-01

    The purpose of this study is to introduce a new method of evaluating human comprehension in the context of machine translation using a language translation program known as the FALCon (Forward Area Language Converter). The FALCon works by converting documents into digital images via scanner, and then converting those images to electronic text by…

  3. Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation

    DEFF Research Database (Denmark)

    Liyanapathirana, Jeevanthi; Popescu-Belis, Andrei

    2016-01-01

    This paper presents a solution to evaluate spoken post-editing of imperfect machine translation output by a human translator. We compare two approaches to the combination of machine translation (MT) and automatic speech recognition (ASR): a heuristic algorithm and a machine learning method....... To obtain a data set with spoken post-editing information, we use the French version of TED talks as the source texts submitted to MT, and the spoken English counterparts as their corrections, which are submitted to an ASR system. We experiment with various levels of artificial ASR noise and also...

  4. Translation of Untranslatable Words — Integration of Lexical Approximation and Phrase-Table Extension Techniques into Statistical Machine Translation

    Science.gov (United States)

    Paul, Michael; Arora, Karunesh; Sumita, Eiichiro

    This paper proposes a method for handling out-of-vocabulary (OOV) words that cannot be translated using conventional phrase-based statistical machine translation (SMT) systems. For a given OOV word, lexical approximation techniques are utilized to identify spelling and inflectional word variants that occur in the training data. All OOV words in the source sentence are then replaced with appropriate word variants found in the training corpus, thus reducing the number of OOV words in the input. Moreover, in order to increase the coverage of such word translations, the SMT translation model is extended by adding new phrase translations for all source language words that do not have a single-word entry in the original phrase-table but only appear in the context of larger phrases. The effectiveness of the proposed methods is investigated for the translation of Hindi to English, Chinese, and Japanese.

  5. The Dostoevsky Machine in Georgetown: scientific translation in the Cold War.

    Science.gov (United States)

    Gordin, Michael D

    2016-04-01

    Machine Translation (MT) is now ubiquitous in discussions of translation. The roots of this phenomenon - first publicly unveiled in the so-called 'Georgetown-IBM Experiment' on 9 January 1954 - displayed not only the technological utopianism still associated with dreams of a universal computer translator, but was deeply enmeshed in the political pressures of the Cold War and a dominating conception of scientific writing as both the goal of machine translation as well as its method. Machine translation was created, in part, as a solution to a perceived crisis sparked by the massive expansion of Soviet science. Scientific prose was also perceived as linguistically simpler, and so served as the model for how to turn a language into a series of algorithms. This paper follows the rise of the Georgetown program - the largest single program in the world - from 1954 to the (as it turns out, temporary) collapse of MT in 1964.

  6. ADAPTING HYBRID MACHINE TRANSLATION TECHNIQUES FOR CROSS-LANGUAGE TEXT RETRIEVAL SYSTEM

    Directory of Open Access Journals (Sweden)

    P. ISWARYA

    2017-03-01

    Full Text Available This research work aims in developing Tamil to English Cross - language text retrieval system using hybrid machine translation approach. The hybrid machine translation system is a combination of rule based and statistical based approaches. In an existing word by word translation system there are lot of issues and some of them are ambiguity, Out-of-Vocabulary words, word inflections, and improper sentence structure. To handle these issues, proposed architecture is designed in such a way that, it contains Improved Part-of-Speech tagger, machine learning based morphological analyser, collocation based word sense disambiguation procedure, semantic dictionary, and tense markers with gerund ending rules, and two pass transliteration algorithm. From the experimental results it is clear that the proposed Tamil Query based translation system achieves significantly better translation quality over existing system, and reaches 95.88% of monolingual performance.

  7. An Evaluative Study of Machine Translation in the EFL Scenario of Saudi Arabia

    Directory of Open Access Journals (Sweden)

    Raneem Khalid Al-Tuwayrish

    2016-02-01

    Full Text Available Artificial Intelligence or AI as it is popularly known and its corollary, Machine Translation (MT have long engaged scientists, thinkers and linguists alike in the twenty first century. However, the wider question that lies in the relation between technology and translation is, What does technology do to language? This is an important question in the current paradigm because new translation technologies, such as, translation memories, data-based machine translation, and collaborative translation, far from being just additional tools, are changing the very nature of the translators’ cognitive activity, social relations, and professional standing. In fact, in some translation situations such as when translating technical materials or subject matter that are not a specialization with human translators, one potentially needs technology.  The purview of this paper, however, is limited to the role of MT in day to day situations where the generic MT tools like Google Translate or Bing Translator are encouraged. Further, it endeavours to weigh and empirically demonstrate the pros and cons of MT with a view to recommending measures for better communication training in the EFL set up of Saudi Arabia. Keywords: AI, MT, translation, technology, necessity, communication

  8. Advancement in Productivity of Arabic into English Machine Translation Systems from 2008 to 2013

    Science.gov (United States)

    Abu-Al-Sha'r, Awatif M.; AbuSeileek, Ali F.

    2013-01-01

    This paper attempts to compare between the advancements in the productivity of Arabic into English Machine Translation Systems between two years, 2008 and 2013. It also aims to evaluate the progress achieved by various systems of Arabic into English electronic translation between the two years. For tracing such advancement, a comparative analysis…

  9. The Integration of Project-Based Methodology into Teaching in Machine Translation

    Science.gov (United States)

    Madkour, Magda

    2016-01-01

    This quantitative-qualitative analytical research aimed at investigating the effect of integrating project-based teaching methodology into teaching machine translation on students' performance. Data was collected from the graduate students in the College of Languages and Translation, at Imam Muhammad Ibn Saud Islamic University, Riyadh, Saudi…

  10. Integrating source-language context into phrase-based statistical machine translation

    NARCIS (Netherlands)

    Haque, R.; Kumar Naskar, S.; Bosch, A.P.J. van den; Way, A.

    2011-01-01

    The translation features typically used in Phrase-Based Statistical Machine Translation (PB-SMT) model dependencies between the source and target phrases, but not among the phrases in the source language themselves. A swathe of research has demonstrated that integrating source context modelling dire

  11. Recycling Texts: Human evaluation of example-based machine translation subtitles for DVD

    DEFF Research Database (Denmark)

    Flanagan, Marian

    2009-01-01

    This project focuses on translation reusability in audiovisual contexts. Specifically, the project seeks to establish (1) whether target language subtitles produced by an Example-Based Machine Translation (EBMT) system are considered intelligible and acceptable by viewers of movies on DVD, and (2...

  12. Machine-Aided Translation: From Terminology Banks to Interactive Translation Systems.

    Science.gov (United States)

    Greenfield, Concetta C.; Serain, Daniel

    The rapid growth of the need for technical translations in recent years has led specialists to utilize computer technology to improve the efficiency and quality of translation. The two approaches considered were automatic translation and terminology banks. Since the results of fully automatic translation were considered unsatisfactory by various…

  13. Statistical Sign Language Machine Translation: from English written text to American Sign Language Gloss

    CERN Document Server

    Othman, Achraf

    2011-01-01

    This works aims to design a statistical machine translation from English text to American Sign Language (ASL). The system is based on Moses tool with some modifications and the results are synthesized through a 3D avatar for interpretation. First, we translate the input text to gloss, a written form of ASL. Second, we pass the output to the WebSign Plug-in to play the sign. Contributions of this work are the use of a new couple of language English/ASL and an improvement of statistical machine translation based on string matching thanks to Jaro-distance.

  14. 互联网机器翻译%Web-based Machine Translation

    Institute of Scientific and Technical Information of China (English)

    王海峰; 吴华; 刘占一

    2011-01-01

    该文在回顾机器翻译发展的基础上,总结了主要的机器翻译方法,并主要阐述互联网机器翻译的特点及面临的挑战.面向互联网机器翻译的应用需求,并针对互联网资源具有海量、高噪声、时效性、稀疏的特点,提出了多策略混合翻译方法、资源挖掘和过滤以及分布式处理技术、领域自适应技术,针对数据稀疏论述枢轴语言技术和新语种快速部署技术;然后结合翻译与搜索技术,阐述翻译个性化特点和方案.最后,论述机器翻译技术和产品的应用.%This paper digs into the characteristics and challenges of web-based machine translation, and proposes possible solutions. First of all, we look back on the history of machine translation and summarize its methods. Next, we analyze the characteristics of internet bilingual corpora and monolingual corpora as: large scale, with lots of noise, real-time and sometimes sparse. Based on the features described above, we propose the hybrid machine translation method, corpus mining and filtering methods, and distributed computing methods. Furthermore, the pivot language approach is adopted to tackle the data sparseness problem, thus enabling the quick development of multilingual machine translation systems. We then discuss the approach to support the personalization of machine translation via the combination of translation technology and search technology. Finally the applications and products of machine translation technology are presented.

  15. Predicting post-translational lysine acetylation using support vector machines

    DEFF Research Database (Denmark)

    Gnad, Florian; Ren, Shubin; Choudhary, Chunaram

    2010-01-01

    spectrometry to identify 3600 lysine acetylation sites on 1750 human proteins covering most of the previously annotated sites and providing the most comprehensive acetylome so far. This dataset should provide an excellent source to train support vector machines (SVMs) allowing the high accuracy in silico...

  16. Evaluation of Machine Translation Errors in English and Iraqi Arabic

    Science.gov (United States)

    2010-05-01

    are made. Llitjós, Carbonell & Lavie (2005) created a hierarchical taxonomy of errors for use in refining rules of transfer-based MT systems...Translation. Proceedings of MT Summit XII. Llitjós, A., Carbonell , J., and Lavie, A. (2005). A framework for interactive and automatic refinement of

  17. Self-Organizing Machine Translation Example-Driven Induction of Transfer Functions

    CERN Document Server

    Juola, P

    1994-01-01

    With the advent of faster computers, the notion of doing machine translation from a huge stored database of translation examples is no longer unreasonable. This paper describes an attempt to merge the Example-Based Machine Translation (EBMT) approach with psycholinguistic principles. A new formalism for context- free grammars, called *marker-normal form*, is demonstrated and used to describe language data in a way compatible with psycholinguistic theories. By embedding this formalism in a standard multivariate optimization framework, a system can be built that infers correct transfer functions for a set of bilingual sentence pairs and then uses those functions to translate novel sentences. The validity of this line of reasoning has been tested in the development of a system called METLA-1. This system has been used to infer English->French and English->Urdu transfer functions from small corpora. The results of those experiments are examined, both in engineering terms as well as in more linguistic terms. In ge...

  18. A Lexical Conceptual Approach to Generation for Machine Translation

    Science.gov (United States)

    1988-01-01

    etc.). Once a verb has been selected to translate the predicate, the semantic arguments of the deep structure are filled with the instantiated...8217strid iii ts oft it, argumtents. Two examples are t he E nglish words shish and stil ilr: (4) (i) He slashed the woman * ’Dio cuchilladas a la mujer ...are simplified once rules and general inferencing are eliminated: LCS and $-role mappings obviate the need for complicated network searches and rule

  19. Climbing the Tower of Babel: Perfecting Machine Translation

    Science.gov (United States)

    2011-02-16

    Peter Norvig , and Fernando Pereira, "The Unreasonable Effectiveness of Data," IEEE Intelligence Systems, (2009): 8. So, this [trillion-word] corpus...Alon Halevy, Peter Norvig , and Fernando Pereira, Google assisted translation? Some experts believe technology can develop MT capabilities with...recognition.htm (accessed 28 September 2010). Halevy, Alon, Peter Norvig , and Fernando Pereira. "The Unreasonable Effectiveness of Data." IEEE

  20. Hindi to English Transfer Based Machine Translation System

    Directory of Open Access Journals (Sweden)

    Shashi Pal Singh

    2015-06-01

    Full Text Available In large societies like India there is a huge demand to convert one human language into another. Lots of work has been done in this area. Many transfer based MTS have developed for English to other languages, as MANTRA CDAC Pune, MATRA CDAC Pune, SHAKTI IISc Bangalore and IIIT Hyderabad. Still there is a little work done for Hindi to other languages. Currently we are working on it. In this paper we focus on designing a system, that translate the document from Hindi to English by using transfer based approach. This system takes an input text check its structure through parsing. Reordering rules are used to generate the text in target language. It is better than Corpus Based MTS because Corpus Based MTS require large amount of word aligned data for translation that is not available for many languages while Transfer Based MTS requires only knowledge of both the languages (source language and target language to make transfer rules. We get correct translation for simple assertive sentences and almost correct for complex and compound sentences.

  1. English-Lithuanian-English Machine Translation lexicon and engine: current state and future work

    CERN Document Server

    Barisevičius, G

    2011-01-01

    This article overviews the current state of the English-Lithuanian-English machine translation system. The first part of the article describes the problems that system poses today and what actions will be taken to solve them in the future. The second part of the article tackles the main issue of the translation process. Article briefly overviews the word sense disambiguation for MT technique using Google.

  2. An open-source highly scalable web service architecture for the Apertium machine translation engine

    OpenAIRE

    Sánchez-Cartagena, Víctor M.; Pérez-Ortiz, Juan Antonio

    2009-01-01

    Some machine translation services like Google Ajax Language API have become very popular as they make the collaboratively created contents of the web 2.0 available to speakers of many languages. One of the keys of its success is its clear and easy-to-use application programming interface (API) and a scalable and reliable service. This paper describes a highly scalable implementation of an Apertium-based translation web service, that aims to make contents available to speakers of lesser resour...

  3. Web page translating software program The Hon-yaku; Hon'yaku software 'The hon'yaku

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2000-03-01

    A translating software program The Hon-yaku (the translator) Internet V4.0 has been developed, which provides an optimum means to help those who have to perform English-Japanese and Japanese-English translation on the Internet in collecting information from abroad, e-mailing to other countries, and chatting (exchanging of messages or dialogues) with friends overseas. When the Shortcut Read function is used, the user is instantly provided with the gist of the page on the screen even if it is full of long sentences. The program, furthermore, is equipped with a function of translating documents prepared using the state-of-the-art XML (eXtensible Markup Language) and PDF (Portable Document Format) technologies. Another translation software program The Hon-yaku Office V2.0 has also been developed, which adds a translating function to Microsoft(reg sign) Office 2000. Using this program, translation is completed upon clicking on the translation button on a Word, Excel, or Power Point(reg sign) screen. A template etc. are also provided, which simplifies the preparation of business letters in English. (translated by NEDO)

  4. ANALYTICAL MODEL OF CALCULUS FOR INFLUENCE THE TRANSLATION GUIDE WEAR OVER THE MACHINING ACCURACY ON THE MACHINE TOOL

    Directory of Open Access Journals (Sweden)

    Ivona PETRE

    2010-10-01

    Full Text Available The wear of machine tools guides influences favorably to vibrations. As a result of guides wear, the initial trajectory of cutting tools motion will be modified, the generating dimensional accuracy discrepancies and deviations of geometrical shape of the work pieces. As it has already been known, the wear of mobile and rigid guides is determined by many parameters (pressure, velocity, friction length, lubrication, material. The choice of one or another analytic model and/or the experimental model of the wear is depending by the working conditions, assuming that the coupling material is known.The present work’s goal is to establish an analytic model of calculus showing the influence of the translation guides wear over the machining accuracy on machine-tools.

  5. Web-Based Machine Translation as a Tool for Promoting Electronic Literacy and Language Awareness

    Science.gov (United States)

    Williams, Lawrence

    2006-01-01

    This article addresses a pervasive problem of concern to teachers of many foreign languages: the use of Web-Based Machine Translation (WBMT) by students who do not understand the complexities of this relatively new tool. Although networked technologies have greatly increased access to many language and communication tools, WBMT is still…

  6. Four Generations of Machine Translation Research and Prospects for the Future.

    Science.gov (United States)

    Wilks, Yorick

    This paper begins with a description of four generations of research in machine translation: the original efforts of 1957 to 1965 and three types of surviving and sometimes competing present projects. The three types of present projects include those relying on "brute force" methods involving larger and faster computers; those based on a…

  7. Improving the quality of automated DVD subtitles via example-based machine translation

    DEFF Research Database (Denmark)

    Armstrong, Stephen; Caffrey, Colm; Flanagan, Marian

    Denoual (2005) discovered that, contrary to popular belief, an Example-Based Machine Translation system trained on heterogeneous data produced significantly better results than a system trained on homogeneous data. Using similar evaluation metrics and a few additional ones, in this paper we show ...

  8. Development of Chinese-English Machine Translation System. Fnal Technical Report.

    Science.gov (United States)

    Wang, William S-Y; Chan, Stephen W.

    The report documents progress and results of a 2-1/3 year effort to further the prototype Chinese-English Machine Translation System. Additional rules were incorporated into the existing grammar for Chinese analysis and interlingual transfer, with emphasis on the latter. CHIDIC was updated and revised. Approximately 16,000 new entries were added…

  9. Improvement of Machine Translation Evaluation by Simple Linguistically Motivated Features

    Institute of Scientific and Technical Information of China (English)

    Mu-Yun Yang; Shu-Qi Sun; Jun-Guo Zhu; Sheng Li; Tie-Jun Zhao; Xiao-Ning Zhu

    2011-01-01

    Adopting the regression SVM framework, this paper proposes a linguistically motivated feature engineering strategy to develop an MT evaluation metric with a better correlation with human assessments. In contrast to current practices of "greedy" combination of all available features, six features are suggested according to the human intuition for translation quality. Then the contribution of linguistic features is examined and analyzed via a hill-climbing strategy. Experiments indicate that, compared to either the SVM-ranking model or the previous attempts on exhaustive linguistic features, the regression SVM model with six linguistic information based features generalizes across different datasets better, and augmenting these linguistic features with proper non-linguistic metrics can achieve additional improvements.

  10. Finding Translation Examples for Under-Resourced Language Pairs or for Narrow Domains; the Case for Machine Translation

    Directory of Open Access Journals (Sweden)

    Dan Tufis

    2012-07-01

    Full Text Available The cyberspace is populated with valuable information sources, expressed in about 1500 different languages and dialects. Yet, for the vast majority of WEB surfers this wealth of information is practically inaccessible or meaningless. Recent advancements in cross-lingual information retrieval, multilingual summarization, cross-lingual question answering and machine translation promise to narrow the linguistic gaps and lower the communication barriers between humans and/or software agents. Most of these language technologies are based on statistical machine learning techniques which require large volumes of cross lingual data. The most adequate type of cross-lingual data is represented by parallel corpora, collection of reciprocal translations. However, it is not easy to find enough parallel data for any language pair might be of interest. When required parallel data refers to specialized (narrow domains, the scarcity of data becomes even more acute. Intelligent information extraction techniques from comparable corpora provide one of the possible answers to this lack of translation data.

  11. 基于页面写相关的闪存转换层策略%Page write-related scheme for flash translation layer

    Institute of Scientific and Technical Information of China (English)

    陈金忠; 姚念民; 蔡绍滨; 战福瑞; 孙美玲

    2013-01-01

      针对固态硬盘(SSD)的闪存转换层(FTL)策略诸如BAST、FAST增加了垃圾回收的成本,带来了固态硬盘性能的下降等缺点,提出了一种基于页面“写相关”的FTL策略PWRST。PWRST的基本思想是分析I/O请求的访问历史并找出“写相关”的页面,将“写相关”的页面存储到同一数据块。从而减少垃圾回收开销和I/O请求的平均响应时间。实验结果表明PWRST在Postmark和IOzone负载下的响应时间比BAST减少了35%,比FAST减少了26%。在TPC-C负载下的响应时间比BAST减少了12%,比FAST减少了10%。%  Most existing flash translation layer (FTL) schemes such as block associative sector translation (BAST) and fully associative sector translation (FAST) increase garbage collection overhead and degrade the performance of sol-id-state drive (SSD). A novel flash translation layer scheme called page write-related sector translation (PWRST) was proposed. PWRST finds write-related pages from request history set and writes these pages into the same data block, which reduces garbage collection overhead and average request response time. Experiment results indicate that PWRST reduces average response time by 35%compared to BAST and 26%compared to FAST for Postmark and IOzone work-load. For TPC-C workload, PWRST reduces response time by 12%compared to BAST and 10%compared to FAST.

  12. An overview of the EtsaTrans machine translation system: compilation of an administrative domain

    Directory of Open Access Journals (Sweden)

    L. Ehlers

    2008-07-01

    Full Text Available The EtsaTrans machine translation system has been in development at the University of the Free State for the last four years and is currently the only machine translation system being developed in South Africa for specialised and nongeneral translation needs. The purpose of this exposition is to present the program through its phases of development, and to report on current levels of performance. We analyse the output, the size of the database, and then propose the future implementation of a part of speech tagger and word stemmer into the program to improve its linguistic performance. Our goal with the system is not to translate all types of document, but to work in a specialised domain that will allow the system to translate documents that are repetitive in nature. This will enable translators to spend more time on non-repetitive subject matter. By capturing the nature of the language of such repetitive documents in the database, we are able to create a standardised language usage for the specialised domain.

  13. On the application of AntConc in pre-translation of machine translation

    Institute of Scientific and Technical Information of China (English)

    王培; 赵茫茫

    2016-01-01

    This article introduces the application of a green software of corpus tool named AntConc. The article mainly focuses on its glossary function, analysis of frequency and concordance of lexical chunks, which makes the translation work more formal and standard.

  14. Handbook of natural language processing and machine translation DARPA global autonomous language exploitation

    CERN Document Server

    Olive, Joseph P; McCary, John

    2011-01-01

    This comprehensive handbook, written by leading experts in the field, details the groundbreaking research conducted under the breakthrough GALE program - The Global Autonomous Language Exploitation within the Defense Advanced Research Projects Agency (DARPA), while placing it in the context of previous research in the fields of natural language and signal processing, artificial intelligence and machine translation. The most fundamental contrast between GALE and its predecessor programs was its holistic integration of previously separate or sequential processes. In earlier language research pro

  15. Myanmar-English Bidirectional Machine Translation System with Numerical Particles Identification

    Directory of Open Access Journals (Sweden)

    Yin Yin Win

    2016-06-01

    Full Text Available This paper the development of MyanmarEnglish bidirectional machine translation system is implemented applying Rule based machine translation approach. Stanford and ML2KR parsers are used for preprocessing step. From this step, parsers generate corresponding parse tree structures. Used parsers generate corresponding CFG rules which are collected and created as synchronous context free grammar SCFG rules. Myanmar language can be written free order style, but it must be verb final structure. Therefore, CFG rules are required for reordering the structure of the two languages. After that tree to tree transformation is carried on the source tree structure which corresponds with used parser (Stanford parser or ML2KR‟s parser. When source parse tree is transformed as target parse tree, it is changed according to the SCFG rules. And then system carries out the morphological synthesis. In this stage, we need to solve only for English to Myanmar machine translation because Myanmar language is morphologically rich language. Therefore, particles for Myanmar language can be solved in this system by proposed algorithm. After finishing morphological synthesis, this system generates meaningful and appropriate smoothing sentences

  16. Pre-editing and Recursive-Phrase Composites for a Better English-to-Arabic Machine Translation

    Directory of Open Access Journals (Sweden)

    Mansoor Al-A'ali

    2007-01-01

    Full Text Available This research presents an approach for an English-to-Arabic Machine Translation System based on Building correct grammar and phrase structures first and then automatically deriving Translation Rules for phrase translation. For every English phrase, the grammar is first analysed and then a corresponding Arabic translation is given which would be used by the machine learning system to produce a translation rule with the help of a dictionary and the user. These same derived rules can partially be used for other phrase sequences especially in the case of a phrase consisting of a number of smaller phrases and thus implemeting the idea of recusive phrase strucutres. The approach was implemented and tested on simple cases and the results are given which indicate that this approach is successful for small to medium phrases. Our approach is an enhancement on existing phrase translation techniques because it analyses the source language grammar first, then builds a syntactic structure before proceeding with the machine learning process of learning the translation rules. Our approach is enhancement on existing phrase based translations in two directions: the grammar editing before the translation rules and the derived translation rules can be complete for complete phrases or are rules for translating smaller phrases which are subsets of larger phrases. The approach has improved the speed and correctness of phrase translations.

  17. Adaptation of machine translation for multilingual information retrieval in the medical domain.

    Science.gov (United States)

    Pecina, Pavel; Dušek, Ondřej; Goeuriot, Lorraine; Hajič, Jan; Hlaváčová, Jaroslava; Jones, Gareth J F; Kelly, Liadh; Leveling, Johannes; Mareček, David; Novák, Michal; Popel, Martin; Rosa, Rudolf; Tamchyna, Aleš; Urešová, Zdeňka

    2014-07-01

    We investigate machine translation (MT) of user search queries in the context of cross-lingual information retrieval (IR) in the medical domain. The main focus is on techniques to adapt MT to increase translation quality; however, we also explore MT adaptation to improve effectiveness of cross-lingual IR. Our MT system is Moses, a state-of-the-art phrase-based statistical machine translation system. The IR system is based on the BM25 retrieval model implemented in the Lucene search engine. The MT techniques employed in this work include in-domain training and tuning, intelligent training data selection, optimization of phrase table configuration, compound splitting, and exploiting synonyms as translation variants. The IR methods include morphological normalization and using multiple translation variants for query expansion. The experiments are performed and thoroughly evaluated on three language pairs: Czech-English, German-English, and French-English. MT quality is evaluated on data sets created within the Khresmoi project and IR effectiveness is tested on the CLEF eHealth 2013 data sets. The search query translation results achieved in our experiments are outstanding - our systems outperform not only our strong baselines, but also Google Translate and Microsoft Bing Translator in direct comparison carried out on all the language pairs. The baseline BLEU scores increased from 26.59 to 41.45 for Czech-English, from 23.03 to 40.82 for German-English, and from 32.67 to 40.82 for French-English. This is a 55% improvement on average. In terms of the IR performance on this particular test collection, a significant improvement over the baseline is achieved only for French-English. For Czech-English and German-English, the increased MT quality does not lead to better IR results. Most of the MT techniques employed in our experiments improve MT of medical search queries. Especially the intelligent training data selection proves to be very successful for domain adaptation of

  18. Toward an example-based machine translation from written text to ASL using virtual agent animation

    CERN Document Server

    Boulares, Mehrez

    2012-01-01

    Modern computational linguistic software cannot produce important aspects of sign language translation. Using some researches we deduce that the majority of automatic sign language translation systems ignore many aspects when they generate animation; therefore the interpretation lost the truth information meaning. Our goals are: to translate written text from any language to ASL animation; to model maximum raw information using machine learning and computational techniques; and to produce a more adapted and expressive form to natural looking and understandable ASL animations. Our methods include linguistic annotation of initial text and semantic orientation to generate the facial expression. We use the genetic algorithms coupled to learning/recognized systems to produce the most natural form. To detect emotion we are based on fuzzy logic to produce the degree of interpolation between facial expressions. Roughly, we present a new expressive language Text Adapted Sign Modeling Language TASML that describes all ...

  19. A Novel Approach for English to South Dravidian Language Statistical Machine Translation System

    Directory of Open Access Journals (Sweden)

    Unnikrishnan P

    2010-11-01

    Full Text Available Development of a well fledged bilingual machine translation (MT system for any two natural languages with limited electronic resources and tools is a challenging and demanding task. This paper presents the development of a statistical machine translation (SMT system for English to South Dravidian languages like Malayalam and Kannada by incorporating syntactic and morphological information. SMT is a data oriented statistical framework for translating text from one natural language to another based on the knowledge extracted from bilingual corpus. Even though there are efforts towards building such an English to South Dravidian translation system ,unfortunately we do not have an efficient translation system till now. The first and most important step in SMT is creating a well aligned parallel corpus for training the system. Experimental research shows that the existing methodology for bilingual parallel corpus creation is not efficient for English to South Dravidian language in the SMT system. In order toincrease the performance of the translation system, we have introduced a new approach in creating parallel corpus. The mainideas which we have implemented and proven very effective forEnglish to south Dravidian languages SMT system are: (i reordering the English source sentence according to Dravidian syntax, (ii using the root suffix separation on both English and Dravidian words and iii use of morphological information which substantially reduce the corpus size required for training the system. Since the navailability of full fledged parsing and morphological tools for Malayalam and Kannada languages, sentence synthesis was done both manually and existing morph analyzer created by Amrita university. From the experiment we found that the performance of our systems are significantly well and achieves a very competitive accuracy for small sized bilingual corpora. The proposed ideas can be directly used for other south Dravidian languages like Tamil

  20. Domain adaptation of statistical machine translation with domain-focused web crawling.

    Science.gov (United States)

    Pecina, Pavel; Toral, Antonio; Papavassiliou, Vassilis; Prokopidis, Prokopis; Tamchyna, Aleš; Way, Andy; van Genabith, Josef

    In this paper, we tackle the problem of domain adaptation of statistical machine translation (SMT) by exploiting domain-specific data acquired by domain-focused crawling of text from the World Wide Web. We design and empirically evaluate a procedure for automatic acquisition of monolingual and parallel text and their exploitation for system training, tuning, and testing in a phrase-based SMT framework. We present a strategy for using such resources depending on their availability and quantity supported by results of a large-scale evaluation carried out for the domains of environment and labour legislation, two language pairs (English-French and English-Greek) and in both directions: into and from English. In general, machine translation systems trained and tuned on a general domain perform poorly on specific domains and we show that such systems can be adapted successfully by retuning model parameters using small amounts of parallel in-domain data, and may be further improved by using additional monolingual and parallel training data for adaptation of language and translation models. The average observed improvement in BLEU achieved is substantial at 15.30 points absolute.

  1. Our Policies, Their Text: German Language Students' Strategies with and Beliefs about Web-Based Machine Translation

    Science.gov (United States)

    White, Kelsey D.; Heidrich, Emily

    2013-01-01

    Most educators are aware that some students utilize web-based machine translators for foreign language assignments, however, little research has been done to determine how and why students utilize these programs, or what the implications are for language learning and teaching. In this mixed-methods study we utilized surveys, a translation task,…

  2. 基于极限学习机的网页分类应用%Classification of web pages based on extreme learning machine

    Institute of Scientific and Technical Information of China (English)

    陈先福; 李石君; 曾慧

    2015-01-01

    ELM extreme learning machine is different from traditional neural network learning algorithm(such as BP algo-rithm), is a highly efficient Single hidden Layer Feedforward Neural network(SLFNs)learning algorithm. In this paper, ELM is introduced to Chinese web page classification task. Trait tree of web page is formed after pre-processing the Chi-nese web and extracting its characteristic information. Fixed-length coding is produced and took as input data of ELM. Experimental results show that the method can effectively classify web pages.%极限学习机ELM不同于传统的神经网络学习算法(如BP算法),是一种高效的单隐层前馈神经网络(SLFNs)学习算法。将极限学习机引入到中文网页分类任务中。对中文网页进行预处理,提取其特性信息,从而形成网页特征树,产生定长编码作为极限学习机的输入数据。实验结果表明该方法能够有效地分类网页。

  3. Discriminative feature-rich models for syntax-based machine translation.

    Energy Technology Data Exchange (ETDEWEB)

    Dixon, Kevin R.

    2012-12-01

    This report describes the campus executive LDRD %E2%80%9CDiscriminative Feature-Rich Models for Syntax-Based Machine Translation,%E2%80%9D which was an effort to foster a better relationship between Sandia and Carnegie Mellon University (CMU). The primary purpose of the LDRD was to fund the research of a promising graduate student at CMU; in this case, Kevin Gimpel was selected from the pool of candidates. This report gives a brief overview of Kevin Gimpel's research.

  4. RuLearn: an Open-source Toolkit for the Automatic Inference of Shallow-transfer Rules for Machine Translation

    Directory of Open Access Journals (Sweden)

    Sánchez-Cartagena Víctor M.

    2016-10-01

    Full Text Available This paper presents ruLearn, an open-source toolkit for the automatic inference of rules for shallow-transfer machine translation from scarce parallel corpora and morphological dictionaries. ruLearn will make rule-based machine translation a very appealing alternative for under-resourced language pairs because it avoids the need for human experts to handcraft transfer rules and requires, in contrast to statistical machine translation, a small amount of parallel corpora (a few hundred parallel sentences proved to be sufficient. The inference algorithm implemented by ruLearn has been recently published by the same authors in Computer Speech & Language (volume 32. It is able to produce rules whose translation quality is similar to that obtained by using hand-crafted rules. ruLearn generates rules that are ready for their use in the Apertium platform, although they can be easily adapted to other platforms. When the rules produced by ruLearn are used together with a hybridisation strategy for integrating linguistic resources from shallow-transfer rule-based machine translation into phrase-based statistical machine translation (published by the same authors in Journal of Artificial Intelligence Research, volume 55, they help to mitigate data sparseness. This paper also shows how to use ruLearn and describes its implementation.

  5. Preliminary study of online machine translation use of nursing literature: quality evaluation and perceived usability

    Directory of Open Access Journals (Sweden)

    Anazawa Ryoko

    2012-11-01

    Full Text Available Abstract Background Japanese nurses are increasingly required to read published international research in clinical, educational, and research settings. Language barriers are a significant obstacle, and online machine translation (MT is a tool that can be used to address this issue. We examined the quality of Google Translate® (English to Japanese and Korean to Japanese, which is a representative online MT, using a previously verified evaluation method. We also examined the perceived usability and current use of online MT among Japanese nurses. Findings Randomly selected nursing abstracts were translated and then evaluated for intelligibility and usability by 28 participants, including assistants and research associates from nursing universities throughout Japan. They answered a questionnaire about their online MT use. From simple comparison of mean scores between two language pairs, translation quality was significantly better, with respect to both intelligibility and usability, for Korean-Japanese than for English-Japanese. Most respondents perceived a language barrier. Online MT had been used by 61% of the respondents and was perceived as not useful enough. Conclusion Nursing articles translated from Korean into Japanese by an online MT system could be read at an acceptable level of comprehension, but the same could not be said for English-Japanese translations. Respondents with experience using online MT used it largely to grasp the overall meanings of the original text. Enrichment in technical terms appeared to be the key to better usability. Users will be better able to use MT outputs if they improve their foreign language proficiency as much as possible. Further research is being conducted with a larger sample size and detailed analysis.

  6. A Conjoint Analysis Framework for Evaluating User Preferences in Machine Translation.

    Science.gov (United States)

    Kirchhoff, Katrin; Capurro, Daniel; Turner, Anne M

    2014-03-01

    Despite much research on machine translation (MT) evaluation, there is surprisingly little work that directly measures users' intuitive or emotional preferences regarding different types of MT errors. However, the elicitation and modeling of user preferences is an important prerequisite for research on user adaptation and customization of MT engines. In this paper we explore the use of conjoint analysis as a formal quantitative framework to assess users' relative preferences for different types of translation errors. We apply our approach to the analysis of MT output from translating public health documents from English into Spanish. Our results indicate that word order errors are clearly the most dispreferred error type, followed by word sense, morphological, and function word errors. The conjoint analysis-based model is able to predict user preferences more accurately than a baseline model that chooses the translation with the fewest errors overall. Additionally we analyze the effect of using a crowd-sourced respondent population versus a sample of domain experts and observe that main preference effects are remarkably stable across the two samples.

  7. Extracting Date/Time Expressions in Super-Function Based Japanese-English Machine Translation

    Science.gov (United States)

    Sasayama, Manabu; Kuroiwa, Shingo; Ren, Fuji

    Super-Function Based Machine Translation(SFBMT) which is a type of Example-Based Machine Translation has a feature which makes it possible to expand the coverage of examples by changing nouns into variables, however, there were problems extracting entire date/time expressions containing parts-of-speech other than nouns, because only nouns/numbers were changed into variables. We describe a method for extracting date/time expressions for SFBMT. SFBMT uses noun determination rules to extract nouns and a bilingual dictionary to obtain correspondence of the extracted nouns between the source and the target languages. In this method, we add a rule to extract date/time expressions and then extract date/time expressions from a Japanese-English bilingual corpus. The evaluation results shows that the precision of this method for Japanese sentences is 96.7%, with a recall of 98.2% and the precision for English sentences is 94.7%, with a recall of 92.7%.

  8. Clinical utility of machine learning approaches in schizophrenia: Improving diagnostic confidence for translational neuroimaging

    Directory of Open Access Journals (Sweden)

    Sarina eIwabuchi

    2013-08-01

    Full Text Available Machine-learning approaches are becoming commonplace in the neuroimaging literature as potential diagnostic and prognostic tools for the study of clinical populations. However, very few studies provide clinically informative measures to aid in decision-making and resource allocation. Head-to-head comparison of neuroimaging-based multivariate classifiers is an essential first step to promote translation of these tools to clinical practice. We systematically evaluated the classifier performance using back-to-back structural MRI in two field strengths (3-Tesla and 7-Tesla to discriminate patients with schizophrenia (n=19 from healthy controls (n=20. Grey (GM and white matter (WM images were used as inputs into a support vector machine (SVM to classify patients and control subjects. 7T classifiers outperformed the 3T classifiers with accuracy reaching as high as 77% for the 7T GM classifier compared to 66.6% for the 3T GM classifier. Furthermore, diagnostic odds ratio (a measure that is not affected by variations in sample characteristics and number needed to predict (a measure based on Bayesian certainty of a test result indicated superior performance of the 7T classifiers, whereby for each correct diagnosis made, the number of patients that need to be examined using the 7T GM classifier was one less than the number that need to be examined if a different classifier was used. Using a hypothetical example, we highlight how these findings could have significant implications for clinical decision-making. We encourage the reporting of measures proposed here in future studies utilizing machine-learning approaches. This will not only promote the search for an optimum diagnostic tool but also aid in the translation of neuroimaging to clinical use.

  9. Possessive Pronouns as Determiners in Japanese-to-English Machine Translation

    CERN Document Server

    Bond, F; Ikehara, S; Bond, Francis; Ogura, Kentaro; Ikehara, Satoru

    1996-01-01

    Possessive pronouns are used as determiners in English when no equivalent would be used in a Japanese sentence with the same meaning. This paper proposes a heuristic method of generating such possessive pronouns even when there is no equivalent in the Japanese. The method uses information about the use of possessive pronouns in English treated as a lexical property of nouns, in addition to contextual information about noun phrase referentiality and the subject and main verb of the sentence that the noun phrase appears in. The proposed method has been implemented in NTT Communication Science Laboratories' Japanese-to-English machine translation system ALT-J/E. In a test set of 6,200 sentences, the proposed method increased the number of noun phrases with appropriate possessive pronouns generated, by 263 to 609, at the cost of generating 83 noun phrases with inappropriate possessive pronouns.

  10. Offering a New Approach for Approximate Pattern Matching in Example-Based Machine Translation

    Directory of Open Access Journals (Sweden)

    Reza Akbari

    2015-01-01

    Full Text Available In this article, a new model is proposed in order to measure the degree of similarity between two sentences in machine translation based on example. The proposed model has applied genetic algorithm beside a new fitness function which is based on semantic load matching between the two sentences. Here, verbs are considered as the heart of a sentence because they are the main part of a sentence and carry the major part of the semantic load in the sentence; therefore more attention is paid to the verbs in the fitness function. It is noteworthy that the proposed model is largely dependent on the verbal part and the extracted synonyms from WordNet as well as the arrangement of words. The results are promising by precision and recall, indicating that the proposed method improves the quality of the retrieved matched sentences.

  11. Comparison of Parallel Kinematic Machines with Three Translational Degrees of Freedom and Linear Actuation

    Institute of Scientific and Technical Information of China (English)

    PRAUSE Isabel; CHARAF EDDINE Sami; CORVES Burkhard

    2015-01-01

    The development of new robot structures, in particular of parallel kinematic machines(PKM), is widely systematized by different structure synthesis methods. Recent research increasingly focuses on PKM with less than six degrees of freedom(DOF). However, an overall comparison and evaluation of these structures is missing. In order to compare symmetrical PKM with three translational DOF, different evaluation criteria are used. Workspace, maximum actuation forces and velocities, power, actuator stiffness, accuracy and transmission behavior are taken into account to investigate strengths and weaknesses of the PKMs. A selection scheme based on possible configurations of translational PKM including different frame configurations is presented. Moreover, an optimization method based on a genetic algorithm is described to determine the geometric parameters of the selected PKM for an exemplary load case and a prescribed workspace. The values of the mentioned criteria are determined for all considered PKM with respect to certain boundary conditions. The distribution and spreading of these values within the prescribed workspace is presented by using box plots for each criterion. Thereby, the performance characteristics of the different structures can be compared directly. The results show that there is no“best”PKM. Further inquiries such as dynamic or stiffness analysis are necessary to extend the comparison and to finally select a PKM.

  12. A Reordering Model Using a Source-Side Parse-Tree for Statistical Machine Translation

    Science.gov (United States)

    Hashimoto, Kei; Yamamoto, Hirofumi; Okuma, Hideo; Sumita, Eiichiro; Tokuda, Keiichi

    This paper presents a reordering model using a source-side parse-tree for phrase-based statistical machine translation. The proposed model is an extension of IST-ITG (imposing source tree on inversion transduction grammar) constraints. In the proposed method, the target-side word order is obtained by rotating nodes of the source-side parse-tree. We modeled the node rotation, monotone or swap, using word alignments based on a training parallel corpus and source-side parse-trees. The model efficiently suppresses erroneous target word orderings, especially global orderings. Furthermore, the proposed method conducts a probabilistic evaluation of target word reorderings. In English-to-Japanese and English-to-Chinese translation experiments, the proposed method resulted in a 0.49-point improvement (29.31 to 29.80) and a 0.33-point improvement (18.60 to 18.93) in word BLEU-4 compared with IST-ITG constraints, respectively. This indicates the validity of the proposed reordering model.

  13. OxLM: A Neural Language Modelling Framework for Machine Translation

    Directory of Open Access Journals (Sweden)

    Paul Baltescu

    2014-09-01

    Full Text Available This paper presents an open source implementation1 of a neural language model for machine translation. Neural language models deal with the problem of data sparsity by learning distributed representations for words in a continuous vector space. The language modelling probabilities are estimated by projecting a word's context in the same space as the word representations and by assigning probabilities proportional to the distance between the words and the context's projection. Neural language models are notoriously slow to train and test. Our framework is designed with scalability in mind and provides two optional techniques for reducing the computational cost: the so-called class decomposition trick and a training algorithm based on noise contrastive estimation. Our models may be extended to incorporate direct n-gram features to learn weights for every n-gram in the training data. Our framework comes with wrappers for the cdec and Moses translation toolkits, allowing our language models to be incorporated as normalized features in their decoders (inside the beam search.

  14. Toward an example-based machine translation from written text to ASL using virtual agent animation

    Directory of Open Access Journals (Sweden)

    Mehrez Boulares

    2012-01-01

    Full Text Available Modern computational linguistic software cannot produce important aspects of sign language translation. Using some researches we deduce that the majority of automatic sign language translation systems ignore many aspects when they generate animation; therefore the interpretation lost the truth information meaning. This problem is due to sign language consideration as a derivative language, but it is a complete language with its own unique grammar. This grammar is related to semantic-cognitive models of spatially, time, action and facial expression to represent complex information to make sign interpretation more efficiently, smooth, expressive and natural-looking human gestures. All this aspects give us useful insights into the design principles that have evolved in natural communication between people. In this work we are interested in American Sign Language, because it is the simplest and most standardized sign language. Our goals are: to translate written text from any language to ASL animation; to model maximum raw information using machine learning and computational techniques; and to produce a more adapted and expressive form to natural looking and understandable ASL animations. Our methods include linguistic annotation of initial text and semantic orientation to generate the facial expression. We use genetic algorithms coupled to learning/recognized systems to produce the most natural form. To detect emotion we based on fuzzy logic to produce the degree of interpolation between facial expressions. Roughly, we present a new expressive language Text Adapted Sign Modeling Language TASML that describes all maximum aspects related to a good sign language interpretation. This paper is organized as follow: the next section is devoted to present the comprehension effect of using Space/Time/SVO form in ASL animation based on experimentation. In section 3, we describe our technical considerations. We present the general approach we adopted to

  15. Prediction of post-translational modification sites using multiple kernel support vector machine

    Directory of Open Access Journals (Sweden)

    BingHua Wang

    2017-04-01

    Full Text Available Protein post-translational modification (PTM is an important mechanism that is involved in the regulation of protein function. Considering the high-cost and labor-intensive of experimental identification, many computational prediction methods are currently available for the prediction of PTM sites by using protein local sequence information in the context of conserved motif. Here we proposed a novel computational method by using the combination of multiple kernel support vector machines (SVM for predicting PTM sites including phosphorylation, O-linked glycosylation, acetylation, sulfation and nitration. To largely make use of local sequence information and site-modification relationships, we developed a local sequence kernel and Gaussian interaction profile kernel, respectively. Multiple kernels were further combined to train SVM for efficiently leveraging kernel information to boost predictive performance. We compared the proposed method with existing PTM prediction methods. The experimental results revealed that the proposed method performed comparable or better performance than the existing prediction methods, suggesting the feasibility of the developed kernels and the usefulness of the proposed method in PTM sites prediction.

  16. CloudLM: a Cloud-based Language Model for Machine Translation

    Directory of Open Access Journals (Sweden)

    Ferrández-Tordera Jorge

    2016-04-01

    Full Text Available Language models (LMs are an essential element in statistical approaches to natural language processing for tasks such as speech recognition and machine translation (MT. The advent of big data leads to the availability of massive amounts of data to build LMs, and in fact, for the most prominent languages, using current techniques and hardware, it is not feasible to train LMs with all the data available nowadays. At the same time, it has been shown that the more data is used for a LM the better the performance, e.g. for MT, without any indication yet of reaching a plateau. This paper presents CloudLM, an open-source cloud-based LM intended for MT, which allows to query distributed LMs. CloudLM relies on Apache Solr and provides the functionality of state-of-the-art language modelling (it builds upon KenLM, while allowing to query massive LMs (as the use of local memory is drastically reduced, at the expense of slower decoding speed.

  17. Apertium goes SOA: an efficient and scalable service based on the Apertium rule-based machine translation platform

    OpenAIRE

    Minervini, Pasquale

    2009-01-01

    Service Oriented Architecture (SOA) is a paradigm for organising and using distributed services that may be under the control of different ownership domains and implemented using various technology stacks. In some contexts, an organisation using an IT infrastructure implementing the SOA paradigm can take a great benefit from the integration, in its business processes, of efficient machine translation (MT) services to overcome language barriers. This paper describes the architecture and the de...

  18. Devised Guidelines of Rapid Post-Editing in Machine Translation Out⁃put of Mark on Some Aspects

    Institute of Scientific and Technical Information of China (English)

    屈亚媛; 王庆怡

    2012-01-01

      One of uses of machine translation (MT), is helping readers to read for the gist of a foreign text through a draft translation produced by MT engines. Rapid post-editing, as Jeffrey Allen defines it as a“strictly minimal editing on texts in order to remove blatant and significant errors without considering stylistic issues”, can help present the reader with a roughly comprehensible translation as quickly as possible. The purpose of this article is on a proposed set of rapid post-editing guidelines for Biblical Chinese-English MT, with its application on editing the English MT version of Chapter one of Mark (马尔谷福音) of the Chinese Catholic Bible (天主教思高本圣经) as an example.

  19. Introduction pages

    OpenAIRE

    2015-01-01

    Introduction Pages and Table of Contents Research ArticlesInsulin Requirements in Relation to Insulin Pump Indications in Type 1 DiabetesPDFGabriela GHIMPEŢEANU, Silvia Ş. IANCU, Gabriela ROMAN, Anca M. ALIONESCU259-263Comparative Antibacterial Efficacy of Vitellaria paradoxa (Shea Butter Tree) Extracts Against Some Clinical Bacterial IsolatesPDFKamoldeen Abiodun AJIJOLAKEWU, Fola Jose AWARUN264-268A Murine Effort Model for Studying the Influence of Trichinella on Muscular Activity of MicePDF...

  20. Rule Based Machine Translation of Noun Phrases from Punjabi to English

    Directory of Open Access Journals (Sweden)

    Kamaljeet Kaur Batra

    2010-09-01

    Full Text Available The paper presents automatic translation of noun phrases from Punjabi to English using transfer approach. The system has analysis, translation and synthesis component. The steps involved are pre processing, tagging, ambiguity resolution, translation and synthesis of words in target language. The accuracy is calculated for each step and the overall accuracy of the system is calculated to be about 85% for a particular type of noun phrases.

  1. Conception d'une methodologie generale d'evaluation de la traduction automatique (Conception of a General Methodology for Evaluating Machine Translation).

    Science.gov (United States)

    van Slype, Georges

    1982-01-01

    It is proposed that assessment of human translation versus machine translation programs use methods and criteria that measure efficiency and cost effectiveness and are efficient and cost-effective in themselves. A variety of methods and criteria are evaluated and discussed. (MSE)

  2. 中日两国机器翻译研究进展及比较%Machine Translation Research in China and Japan: Advances and Comparison

    Institute of Scientific and Technical Information of China (English)

    张均胜; 何彦青; 李颖; 王惠临

    2011-01-01

    Machine translation investigates the use of computer software to translate text or speech from one natural language to another. Since the first computer was invented, people have been studying and exploring high quality and high efficiency of machine translation technology. Recently, rule-based machine translation, example-based machine translation and statistical translation are the main three translation patterns. There are some approaches ofsystem combination for better machine translation results. With the development of science, technology, economy and culture, machine translation has become more important in breaking the language barrier between Chinese and Japanese for promoting China-Japanese exchanges and cooperation. Machine translation researchers in China and Japan have carried out a large number of Chinese-Japanese/Japanese-Chinese machine translation of theoretical research and system development They have achieved a lot of effective results, however, it is still far from the practical translation application of large-scale and high quality. Therefore, it is necessary for researchers in China and Japan to cooperate in machine translation technology and system development for Ch inese-to-Japanese and Japanese-to-Chinese, especially in the parallel corpus, dictionary, terminology, syntactic analysis and so on. This paper presents an overview of the China-Japanese machine translation research and rampares machine translation research in China and Japan. We also discuss the prospects of China-Japanese cooperation in machine translation research.%机器翻译研究用计算机实现不同自然语言之间的翻译.自第一台计算机诞生开始,人们一直在研究和探索高质量高效率的机器翻译技术.近年来,基于规则的机器翻译、基于实例的机器翻译和基于统计的机器翻译这几种主要的翻译模式共同存在且相互补充,并有不断融合之势.随着中国和日本在科技、经济和文化交流的不

  3. Breaking the language barrier: machine assisted diagnosis using the medical speech translator.

    Science.gov (United States)

    Starlander, Marianne; Bouillon, Pierrette; Rayner, Manny; Chatzichrisafis, Nikos; Hockey, Beth Ann; Isahara, Hitoshi; Kanzaki, Kyoko; Nakao, Yukie; Santaholma, Marianne

    2005-01-01

    In this paper, we describe and evaluate an Open Source medical speech translation system (MedSLT) intended for safety-critical applications. The aim of this system is to eliminate the language barriers in emergency situation. It translates spoken questions from English into French, Japanese and Finnish in three medical subdomains (headache, chest pain and abdominal pain), using a vocabulary of about 250-400 words per sub-domain. The architecture is a compromise between fixed-phrase translation on one hand and complex linguistically-based systems on the other. Recognition is guided by a Context Free Grammar Language Model compiled from a general unification grammar, automatically specialised for the domain. We present an evaluation of this initial prototype that shows the advantages of this grammar-based approach for this particular translation task in term of both reliability and use.

  4. 基于内存页面动态合并的旁路转换缓冲器设计%Translation lookaside buffer design based on dynamic memory page merging

    Institute of Scientific and Technical Information of China (English)

    陈志坚; 孟建熠; 葛海通; 严晓浪

    2012-01-01

    针对内存管理中虚拟页面和物理页面连续分配的特性,提出可对相邻页面进行动态合并的旁路转换缓冲器(TLB)设计方法.该方法的核心思想是在处理器运行过程中,通过对相邻页面的递归合并,动态扩展单个TLB表项的地址映射范围,提高TLB表项的利用率并降低TLB缺失率.在两级TLB架构中,提出基于快速uTLB(fuTLB)和影子uTLB(suTLB)动态切换的新型uTLB结构,作为两级TLB架构的一级缓存,为页面动态合并提供现场和载体,页面合并过程对软件透明.基于Mibench测试基准的实验结果表明,与filter-TLB架构相比,该页面动态合并方法可以平均降低TLB缺失率达27%.%Virtual memory pages and physical memory pages are often sequentially allocated in traditional memory management algorithms. A translation lookaside buffer (TLB) design method was proposed to merge two sequential small size memory pages into a large one during the processor execution. The mapping size of each TLB entry is automatically enlarged with recursive memory page merging. Consequently, the utilization efficiency of TLB can be improved and the TLB miss rate can be reduced. A new uTLB architecture composed of fuTLB and suTLB was proposed. Both fuTLB and suTLB are not only used as the first level address translation buffer of the two-level TLB architecture, but also provided as the temporary buffer for hardware based dynamic page merging. The page merging operation is processed by hardware and not affected by software. Experimental results from Mibench show that the TLB miss ratio can be reduced by 27% with the new TLB design method compared with the filter-TLB design method.

  5. Translation

    OpenAIRE

    2005-01-01

    "Translation" is a life narrative about the ways in which cultural histories shape personal stories, and the capacity of the imagination to develop alternative narratives about oneself and the world. It can also be read a way of addressing the effects of what Ato Quayson calls the global process of postcolonializing. Quaysons critical perspective might be used as an interpretive lens for seeing some of the ways in which  this autobiographical narrative complicates the jargon of race, cl...

  6. Translation

    OpenAIRE

    2005-01-01

    "Translation" is a life narrative about the ways in which cultural histories shape personal stories, and the capacity of the imagination to develop alternative narratives about oneself and the world. It can also be read a way of addressing the effects of what Ato Quayson calls the global process of postcolonializing. Quaysons critical perspective might be used as an interpretive lens for seeing some of the ways in which  this autobiographical narrative complicates the jargon of race, class, ...

  7. Improving Statistical Machine Translation Through N-best List Re-ranking and Optimization

    Science.gov (United States)

    2014-03-27

    Language Processing, 1352–1362. Association for Computational Linguistics, Edinburgh, Scotland , UK., July 2011. URL http://www.aclweb.org/anthology/D11-1125...Josef. “Minimum Error Rate Training in Statistical Machine Translation”. Erhard Hinrichs and Dan Roth (editors), Proceedings of the 41st Annual Meeting of

  8. Reading Strategies in a L2: A Study on Machine Translation

    Science.gov (United States)

    Karnal, Adriana Riess; Pereira, Vera Vanmacher

    2015-01-01

    This article aims at understanding cognitive strategies which are involved in reading academic texts in English as a L2/FL. Specifically, we focus on reading comprehension when a text is read either using Google translator or not. From this perspective we must consider the reading process in its complexity not only as a decoding process. We follow…

  9. Exeter at CLEF 2003: Experiments with machine translation for monolingual, bilingual and multilingual retrieval

    OpenAIRE

    Lam-Adesina, Adenike M.; Jones, Gareth J.F.

    2004-01-01

    The University of Exeter group participated in the monolingual, bilingual and multilingual-4 retrieval tasks this year. The main focus of our investigation this year was the small multilingual task comprising four languages, French, German, Spanish and English. We adopted a document translation strategy and tested four merging techniques to combine results from the separate document collections, as well as a merged collection strategy. For both the monolingual and bilingual tasks we explored ...

  10. Introduction pages

    Directory of Open Access Journals (Sweden)

    Radu E. Sestras

    2015-09-01

    Full Text Available Introduction Pages and Table of Contents Research ArticlesInsulin Requirements in Relation to Insulin Pump Indications in Type 1 DiabetesPDFGabriela GHIMPEŢEANU,\tSilvia Ş. IANCU,\tGabriela ROMAN,\tAnca M. ALIONESCU259-263Comparative Antibacterial Efficacy of Vitellaria paradoxa (Shea Butter Tree Extracts Against Some Clinical Bacterial IsolatesPDFKamoldeen Abiodun AJIJOLAKEWU,\tFola Jose AWARUN264-268A Murine Effort Model for Studying the Influence of Trichinella on Muscular Activity of MicePDFIonut MARIAN,\tCălin Mircea GHERMAN,\tAndrei Daniel MIHALCA269-271Prevalence and Antibiogram of Generic Extended-Spectrum β-Lactam-Resistant Enterobacteria in Healthy PigsPDFIfeoma Chinyere UGWU,\tMadubuike Umunna ANYANWU,\tChidozie Clifford UGWU,\tOgbonna Wilfred UGWUANYI272-280Index of Relative Importance of the Dietary Proportions of Sloth Bear (Melursus ursinus in Semi-Arid RegionPDFTana P. MEWADA281-288Bioaccumulation Potentials of Momordica charantia L. Medicinal Plant Grown in Lead Polluted Soil under Organic Fertilizer AmendmentPDFOjo Michael OSENI,\tOmotola Esther DADA,\tAdekunle Ajayi ADELUSI289-294Induced Chitinase and Chitosanase Activities in Turmeric Plants by Application of β-D-Glucan NanoparticlesPDFSathiyanarayanan ANUSUYA,\tMuthukrishnan SATHIYABAMA295-298Present or Absent? About a Threatened Fern, Asplenium adulterinum Milde, in South-Eastern Carpathians (RomaniaPDFAttila BARTÓK,\tIrina IRIMIA299-307Comparative Root and Stem Anatomy of Four Rare Onobrychis Mill. (Fabaceae Taxa Endemic in TurkeyPDFMehmet TEKİN,\tGülden YILMAZ308-312Propagation of Threatened Nepenthes khasiana: Methods and PrecautionsPDFJibankumar S. KHURAIJAM,\tRup K. ROY313-315Alleviate Seed Ageing Effects in Silybum marianum by Application of Hormone Seed PrimingPDFSeyed Ata SIADAT,\tSeyed Amir MOOSAVI,\tMehran SHARAFIZADEH316-321The Effect of Halopriming and Salicylic Acid on the Germination of Fenugreek (Trigonella foenum-graecum under Different Cadmium

  11. Web Page Design.

    Science.gov (United States)

    Lindsay, Lorin

    Designing a web home page involves many decisions that affect how the page will look, the kind of technology required to use the page, the links the page will provide, and kinds of patrons who can use the page. The theme of information literacy needs to be built into every web page; users need to be taught the skills of sorting and applying…

  12. A nonparametric Bayesian method of translating machine learning scores to probabilities in clinical decision support.

    Science.gov (United States)

    Connolly, Brian; Cohen, K Bretonnel; Santel, Daniel; Bayram, Ulya; Pestian, John

    2017-08-07

    Probabilistic assessments of clinical care are essential for quality care. Yet, machine learning, which supports this care process has been limited to categorical results. To maximize its usefulness, it is important to find novel approaches that calibrate the ML output with a likelihood scale. Current state-of-the-art calibration methods are generally accurate and applicable to many ML models, but improved granularity and accuracy of such methods would increase the information available for clinical decision making. This novel non-parametric Bayesian approach is demonstrated on a variety of data sets, including simulated classifier outputs, biomedical data sets from the University of California, Irvine (UCI) Machine Learning Repository, and a clinical data set built to determine suicide risk from the language of emergency department patients. The method is first demonstrated on support-vector machine (SVM) models, which generally produce well-behaved, well understood scores. The method produces calibrations that are comparable to the state-of-the-art Bayesian Binning in Quantiles (BBQ) method when the SVM models are able to effectively separate cases and controls. However, as the SVM models' ability to discriminate classes decreases, our approach yields more granular and dynamic calibrated probabilities comparing to the BBQ method. Improvements in granularity and range are even more dramatic when the discrimination between the classes is artificially degraded by replacing the SVM model with an ad hoc k-means classifier. The method allows both clinicians and patients to have a more nuanced view of the output of an ML model, allowing better decision making. The method is demonstrated on simulated data, various biomedical data sets and a clinical data set, to which diverse ML methods are applied. Trivially extending the method to (non-ML) clinical scores is also discussed.

  13. Based on HNC Theory of Machine Translation Research ---English-Chinese Translation%基于HNC理论机助人译--英汉翻译对比研究

    Institute of Scientific and Technical Information of China (English)

    陆小鹿

    2014-01-01

    [Abstract]Machine Translation is indispensable to modern information era to overcome barriers of the language communication . HNC theory by using the method of primitives, hierarchical, network, formal, integrates the sentence structures with semantics, Based on the HNC theory, interprets the conversion of sentence category and patterns between English and Chinese and Contrast between English and Chinese.%机器翻译是现代信息时代克服语言交流障碍不可缺少的的手段。 HNC理论采用基元化、层次化、网络化、形式化的方法,通过句类精妙地把自然语言的表层结构和深层语义联系起来。通过机器翻译,对比研究英汉翻译中的句类句式转换的问题。

  14. Integrated Features by Administering the Support Vector Machine (SVM of Translational Initiations Sites in Alternative Polymorphic Contex

    Directory of Open Access Journals (Sweden)

    Nurul Arneida Husin

    2012-04-01

    Full Text Available Many algorithms and methods have been proposed for classification problems in bioinformatics. In this study, the discriminative approach in particular support vector machines (SVM is employed to recognize the studied TIS patterns. The applied discriminative approach is used to learn about some discriminant functions of samples that have been labelled as positive or negative. After learning, the discriminant functions are employed to decide whether a new sample is true or false. In this study, support vector machines (SVM is employed to recognize the patterns for studied translational initiation sites in alternative weak context. The method has been optimized with the best parameters selected; c=100, E=10-6 and ex=2 for non linear kernel function. Results show that with top 5 features and non linear kernel, the best prediction accuracy achieved is 95.8%. J48 algorithm is applied to compare with SVM with top 15 features and the results show a good prediction accuracy of 95.8%. This indicates that the top 5 features selected by the IGR method and that are performed by SVM are sufficient to use in the prediction of TIS in weak contexts.

  15. 基于机器学习的批量网页篡改检测方法%Tamper detection of numerous web pages based on machine learning

    Institute of Scientific and Technical Information of China (English)

    赖清楠; 陈诗洋; 马皓; 张蓓

    2016-01-01

    A numerous web pages tamper detection method was designed to cope with page tamper problem .All the registered websites of a comprehensive university were studied ,and all the data in home pages were crawled and classified ,corresponding detection rules were built ,and an overall judg‐ment was given for each pages .The proposed method included learning and detecting phase .Each de‐tector standard value was trained from web pages history information in learning phase .In detecting phase ,each parameter was detected ,the detectors′ output were gathered ,the results were shown , and the website administrator was notified to confirm immediately if a webpage was detected be tampe‐red with ,and the system retrained to modify parameters when it was a false positive .Tampering sim‐ulates experimental results show that when the window size is 11 and the alarm threshold value is 2 , the false positive rate is 1 .183% and the false negative rate is 0 .878% ,so the optimal results were ob‐tained .%针对网页篡改问题,设计了一种基于机器学习的批量网页篡改检测方法.以一所综合性大学所有注册网站为研究对象,通过抓取网站首页面的所有信息,对抓取数据进行分类建立对应的检测规则,综合判断网页是否存在篡改.该方法分为学习阶段和检测阶段,学习阶段根据网页历史信息获取各个检测器的标准值,检测阶段对待检测网页的各个参数进行检测,综合多个检测器的输出,反馈检测结果,若结果为误报,则系统进行重新训练修正参数.以实际发生的网页篡改案例为依据,进行网页篡改模拟,并对误报率和漏报率进行了分析,结果表明:当检测数据集窗口大小为11,报警阈值为2时,误报率为1.183%,漏报率为0.878%,获得了最优的效果.

  16. 图式视角下的高校网页翻译%Colleges and universities' web page translation under the perspective of schema

    Institute of Scientific and Technical Information of China (English)

    潘彬彬

    2015-01-01

    图式形成于对外部世界的信息处理过程中,是世界知识的精神结构.中西方受众不同的图式要求高校网页翻译以受众为中心,保留对应图式,化解图式冲突,填补图式缺省,从而将源语所承载的信息最大限度传入译入语中并有效影响受众,实现高校外宣翻译目的.%The process of information processing pattern formed in the outside world, is the world's knowledge structure. Web translation between Chinese and western audience different schema requires colleges and universities to the audience as the center, keep the corresponding schema, dissolve the schema conflict, fill the schema of the default, thus the information carried by the source language is maximum incoming in the target language and effective influence audience, realize the heralded the translation purpose.

  17. Qualitative: Open Source Python Tool for Quality Estimation over Multiple Machine Translation Outputs

    Directory of Open Access Journals (Sweden)

    Eleftherios Avramidis

    2014-09-01

    Full Text Available “Qualitative” is a python toolkit for ranking and selection of sentence-level output by different MT systems using Quality Estimation. The toolkit implements a basic pipeline for annotating the given sentences with black-box features. Consequently, it applies a machine learning mechanism in order to rank data based on models pre-trained on human preferences. The preprocessing pipeline includes support for language models, PCFG parsing, language checking tools and various other pre-processors and feature generators. The code follows the principles of object-oriented programming to allow modularity and extensibility. The tool can operate by processing both batch-files and single sentences. An XML-RPC interface is provided for hooking up with web-services and a graphical animated web-based interface demonstrates its potential on-line use.

  18. Determination of referential property and number of nouns in Japanese sentences for machine translation into English

    CERN Document Server

    Murata, M; Murata, Masaki; Nagao, Makoto

    1994-01-01

    When translating Japanese nouns into English, we face the problem of articles and numbers which the Japanese language does not have, but which are necessary for the English composition. To solve this difficult problem we classified the referential property and the number of nouns into three types respectively. This paper shows that the referential property and the number of nouns in a sentence can be estimated fairly reliably by the words in the sentence. Many rules for the estimation were written in forms similar to rewriting rules in expert systems. We obtained the correct recognition scores of 85.5\\% and 89.0\\% in the estimation of the referential property and the number respectively for the sentences which were used for the construction of our rules. We tested these rules for some other texts, and obtained the scores of 68.9\\% and 85.6\\% respectively.

  19. Sleep Apnea Information Page

    Science.gov (United States)

    ... Page You are here Home » Disorders » All Disorders Sleep Apnea Information Page Sleep Apnea Information Page Search Disorders Search NINDS SEARCH ... Institutes of Health (NIH) conduct research related to sleep apnea in laboratories at the NIH, and also ...

  20. Translating for Marketing --On Marketing-oriented Translation of Enterprise Web Page%以营销为导向的企业网页信息翻译探究

    Institute of Scientific and Technical Information of China (English)

    董丽颖

    2012-01-01

    Recent years has witnessed China's rising as the world's second economy and greater than ever efforts made by our Chinese enterprises in attracting international publicity, very few of whom, however, have achieved expected perception and affinity of foreign markets. In this age of "All get done online", the translation of website texts also contributes to effective enterprise imaging and marketing in the target foreign market. This paper holds that marketing-oriented translation of enterprise website texts is yet to be fully appreciated and is urgently required to guarantee desired information to reach the foreign customers, whose needs must be addressed and whose reaction is pivotal to the success of translation for enterprise overseas marketing.%随着中国崛起为世界第二大经济体,中国企业正吸引世界目光,处于全球推广的黄金时期。企业英文网页无疑成为对外信息与形象传播的先锋平台。企业网页信息翻译的特殊性在于其不仅是翻译活动,同时还是企业国际营销战略的核心构成,因此除了翻译理论框架之外,营销学也是译者任务操作的重要视角,只有如此,企业英文网页才具有真正的传播力,成为企业对外信息发布、形象塑造、品牌推广的营销窗口。

  1. 基于支持向量机与无监督聚类相结合的中文网页分类器%A Chinese Web Page Classifier Based on Support Vector Machine and Unsupervised Clustering

    Institute of Scientific and Technical Information of China (English)

    李晓黎; 刘继敏; 史忠植

    2001-01-01

    This paper presents a new algorithm that combines Support VectorMachine (SVM) and unsupervised clustering. After analyzing the characteristics of web pages, it proposes a new vector representation of web pages and applies it to web page classification. Given a training set, the algorithm clusters positive and negative examples respectively by the unsupervised clustering algorithm (UC), which will produce a number of positive and negative centers. Then, it selects only some of the examples to input to SVM according to ISUC algorithm. At the end, it constructs a classifier through SVM learning. Any text can be classified by comparing the distance of clustering centers or by SVM. If the text nears one cluster center of a category and far away from all the cluster centers of other categories, UC can classify it rightly with high possibility, otherwise SVM is employed to decide the category it belongs. The algorithm utilizes the virtues of SVM and unsupervised clustering. The experiment shows that it not only improves training efficiency, but also has good precision.%提出了一种将支持向量机与无监督聚类相结合的新分类算法,给出了一种新的网页表示方法并应用于网页分类问题.该算法首先利用无监督聚类分别对训练集中正例和反例聚类,然后挑选一些例子训练SVM并获得SVM分类器.任何网页可以通过比较其与聚类中心的距离决定采用无监督聚类方法或SVM分类器进行分类.该算法充分利用了SVM准确率高与无监督聚类速度快的优点.实验表明它不仅具有较高的训练效率,而且有很高的精确度.

  2. Translation lexicon acquisition from bilingual dictionaries

    Science.gov (United States)

    Doermann, David S.; Ma, Huanfeng; Karagol-Ayan, Burcu; Oard, Douglas W.

    2001-12-01

    Bilingual dictionaries hold great potential as a source of lexical resources for training automated systems for optical character recognition, machine translation and cross-language information retrieval. In this work we describe a system for extracting term lexicons from printed copies of bilingual dictionaries. We describe our approach to page and definition segmentation and entry parsing. We have used the approach to parse a number of dictionaries and demonstrate the results for retrieval using a French-English Dictionary to generate a translation lexicon and a corpus of English queries applied to French documents to evaluation cross-language IR.

  3. The AFRL-MITLL WMT16 News-Translation Task Systems

    Science.gov (United States)

    2016-08-11

    Fast and robust neural network joint models for sta- tistical machine translation. In Proceedings of the 52nd Annual Meeting of the Association for Compu ... Compu - tational Linguistics, pages 423–430, Sapporo, Japan, July. Association for Computational Linguistics. Philipp Koehn, Hieu Hoang, Alexandra

  4. Creating Web Pages Simplified

    CERN Document Server

    Wooldridge, Mike

    2011-01-01

    The easiest way to learn how to create a Web page for your family or organization Do you want to share photos and family lore with relatives far away? Have you been put in charge of communication for your neighborhood group or nonprofit organization? A Web page is the way to get the word out, and Creating Web Pages Simplified offers an easy, visual way to learn how to build one. Full-color illustrations and concise instructions take you through all phases of Web publishing, from laying out and formatting text to enlivening pages with graphics and animation. This easy-to-follow visual guide sho

  5. Full page insight

    DEFF Research Database (Denmark)

    Cortsen, Rikke Platz

    2014-01-01

    Alan Moore and his collaborating artists often manipulate time and space by drawing upon the formal elements of comics and making alternative constellations. This article looks at an element that is used frequently in comics of all kinds – the full page – and discusses how it helps shape spatio......, something that it shares with the full page in comics. Through an analysis of several full pages from Moore titles like Swamp Thing, From Hell, Watchmen and Promethea, it is made clear why the full page provides an apt vehicle for an apocalypse in comics....

  6. Full page insight

    DEFF Research Database (Denmark)

    Cortsen, Rikke Platz

    2014-01-01

    Alan Moore and his collaborating artists often manipulate time and space by drawing upon the formal elements of comics and making alternative constellations. This article looks at an element that is used frequently in comics of all kinds – the full page – and discusses how it helps shape spatio-t......, something that it shares with the full page in comics. Through an analysis of several full pages from Moore titles like Swamp Thing, From Hell, Watchmen and Promethea, it is made clear why the full page provides an apt vehicle for an apocalypse in comics....

  7. Research of Web Pages Categorization

    Institute of Scientific and Technical Information of China (English)

    Zhongda Lin; Kun Deng; Yanfen Hong

    2006-01-01

    In this paper, we discuss several issues related to automated classification of web pages, especially text classification of web pages. We analyze features selection and categorization algorithms of web pages and give some suggestions for web pages categorization.

  8. Larry Page on Google

    Institute of Scientific and Technical Information of China (English)

    Miguel Helft

    2012-01-01

    Last month, Larry Page sat down with Fortune Senior Writer Miguel Helft for a lengthy interview for a forthcoming Fortune magazine article. It was only Page' s second wide-ranging conversation with a print publication since becoming CEO of Google in April 2011.

  9. Cuckoo Hashing with Pages

    CERN Document Server

    Dietzfelbinger, Martin; Rink, Michael

    2011-01-01

    Although cuckoo hashing has significant applications in both theoretical and practical settings, a relevant downside is that it requires lookups to multiple locations. In many settings, where lookups are expensive, cuckoo hashing becomes a less compelling alternative. One such standard setting is when memory is arranged in large pages, and a major cost is the number of page accesses. We propose the study of cuckoo hashing with pages, advocating approaches where each key has several possible locations, or cells, on a single page, and additional choices on a second backup page. We show experimentally that with k cell choices on one page and a single backup cell choice, one can achieve nearly the same loads as when each key has k+1 random cells to choose from, with most lookups requiring just one page access, even when keys are placed online using a simple algorithm. While our results are currently experimental, they suggest several interesting new open theoretical questions for cuckoo hashing with pages.

  10. On Page Rank

    NARCIS (Netherlands)

    Hoede, C.

    2008-01-01

    In this paper the concept of page rank for the world wide web is discussed. The possibility of describing the distribution of page rank by an exponential law is considered. It is shown that the concept is essentially equal to that of status score, a centrality measure discussed already in 1953 by Ka

  11. Page Styles on steroids

    DEFF Research Database (Denmark)

    Madsen, Lars

    2008-01-01

    Designing a page style has long been a pain for novice users. Some parts are easy; others need strong LATEX knowledge. In this article we will present the memoir way of dealing with page styles, including new code added to the recent version of memoir that will reduce the pain to a mild annoyance...

  12. New WWW Pages

    CERN Multimedia

    Pommes, K

    New WWW pages have been created in order to provide easy access to the many activities and pertaining information of the ATLAS Technical Coordination. The main entry point is available on the ATLAS Collaboration page by clicking the Technical Coordination link which leads to the page shown in the following picture. Each button links to a page listing all tasks of the corresponding activity, the responsible task leaders, schedules, work-packages, and action lists, etc... The "ATLAS Documentation Center" button will present the pop-up window shown in the next figure: Besides linking to the Technical Coordination Activities, this page provides direct access to the tools for Project Progress Tracking (PPT) and Engineering Data Management (EDMS), as well as to the main topics being coordinated by the Technical Coordination.

  13. A survey of statistical machine translation using paraphrasing technology%引入复述技术的统计机器翻译研究综述

    Institute of Scientific and Technical Information of China (English)

    2013-01-01

      基于对引入复述技术的统计机器翻译研究现状的分析,提出具有研究价值的课题方向。首先归纳了复述的概念,总结了引入复述技术的统计机器翻译各类方法。然后对复述知识在统计机器翻译中的模型训练、参数调整、待译语句改写和机器翻译自动评测等方面应用的主流方法进行了概括、比较和分析,说明了复述与统计机器翻译是紧密相关的,强调了复述在统计机器翻译应用中的关键问题是复述的正确性和多样性。最后指出提高复述资源的精确度、建立复述与机器翻译的联合模型、采用新方法解决稀疏问题等是有待进一步研究的课题。%In this paper, the research team discussed possible new prospective research directions of paraphrasing technology in statistical machine translation (SMT), based on reviews of state-of-the-art technology.First the re-search team introduced the concept of paraphrases , and next a summarization of the latest progress utilizing para-phrasing technology in SMT was conducted.Finally, conclusions were drawn, data was compared and an analysis of the main issues of incorporating paraphrases into SMT , including translation model training, parameter tuning, in-put sentences rewriting and machine translation evaluation was performed .The results proved that there is an inher-ent connection between paraphrasing and SMT .The results also point out that the correctness and diversity of para-phrasing are the key issues to apply paraphrasing to SMT .It was highly noted that the improvement in the quality of paraphrasing resource, the establishment of a joint model of paraphrasing and machine translation and the new pro -posed approach to solve data sparseness are problems which need further study .

  14. Exploiting link structure for web page genre identification

    KAUST Repository

    Zhu, Jia

    2015-07-07

    As the World Wide Web develops at an unprecedented pace, identifying web page genre has recently attracted increasing attention because of its importance in web search. A common approach for identifying genre is to use textual features that can be extracted directly from a web page, that is, On-Page features. The extracted features are subsequently inputted into a machine learning algorithm that will perform classification. However, these approaches may be ineffective when the web page contains limited textual information (e.g., the page is full of images). In this study, we address genre identification of web pages under the aforementioned situation. We propose a framework that uses On-Page features while simultaneously considering information in neighboring pages, that is, the pages that are connected to the original page by backward and forward links. We first introduce a graph-based model called GenreSim, which selects an appropriate set of neighboring pages. We then construct a multiple classifier combination module that utilizes information from the selected neighboring pages and On-Page features to improve performance in genre identification. Experiments are conducted on well-known corpora, and favorable results indicate that our proposed framework is effective, particularly in identifying web pages with limited textual information. © 2015 The Author(s)

  15. Machine-translatability and post-editing effort: an empirical study using translog and choice network analysis

    OpenAIRE

    O'Brien, Sharon

    2006-01-01

    Studies on Controlled Language (CL) suggest that by removing features that are known to be problematic for MT (termed here “negative translatability indicators”, or “NTIs”), the MT output can be improved. It is assumed that an improvement in the output will result in lower post-editing effort. This study tests that assumption by measuring the technical, temporal and cognitive post-editing effort (Krings 2001) for English sentences in a user manual that have been translated into German using a...

  16. Translation Techniques

    Directory of Open Access Journals (Sweden)

    Marcia Pinheiro

    2015-05-01

    Full Text Available In this paper, we discuss three translation techniques: literal, cultural, and artistic. Literal translation is a well-known technique, which means that it is quite easy to find sources on the topic. Cultural and artistic translation may be new terms. Whilst cultural translation focuses on matching contexts, artistic translation focuses on matching reactions. Because literal translation matches only words, it is not hard to find situations in which we should not use this technique.  Because artistic translation focuses on reactions, judging the quality of an artistic translation work is one of the most difficult things one can do. We end up having a score of complexity and humanity for each one of the mentioned techniques: Literal translation would be the closest thing we have to the machines world and artistic translation would be the closest thing we have to the purely human world. By creating these classifications and studying the subtleties of each one of them, we are adding degrees of quality to our courses and to translation as a professional field. The main contribution of this paper is then the formalization of such a piece of knowledge. We, however, also lay the foundations for studies of this type.

  17. Economic page turners

    OpenAIRE

    Frank, Björn

    2011-01-01

    Economic page turners like Freakonomics are well written and there is much to be learned from them - not only about economics, but also about writing techniques. Their authors know how to build up suspense, i.e., they make readers want to know what comes. An uncountable number of pages in books and magazines are filled with advice on writing reportages or suspense novels. While many of the tips are specific to the respective genres, some carry over to economic page turners in an instructive w...

  18. Lost in Translation

    Institute of Scientific and Technical Information of China (English)

    2011-01-01

    The intricacies of language still eludes even the most sophisticated technology IT is no surprise Jost Zetzsche,an English-to-German translator,raised the question of whether machine translation would ever replace the human variety in front of 700 interpreters and translators who gathered in San Francisco to discuss topical issues in the translation industry.

  19. Translators and Tools

    Institute of Scientific and Technical Information of China (English)

    2011-01-01

    Technology is not yet advanced enough to capture the nuances of language It is no surprise Jost Zetzsche,an English-to-German translator,raised the question of whether machine translation would ever replace the human variety in front of 700 interpreters and translators who gathered in San Francisco to discuss topical issues in the translation industry.

  20. Lost in Translation

    Institute of Scientific and Technical Information of China (English)

    Ding Zhitao; Chen Wen

    2011-01-01

    IT is no surprise Jost Zetzsche,an English-to-German translator,raised the question of whether machine translation would ever replace the human variety in front of 700 interpreters and translators who gathered in San Francisco to discuss topical issues in the translation industry.

  1. Translation: Aids, Robots, and Automation.

    Science.gov (United States)

    Andreyewsky, Alexander

    1981-01-01

    Examines electronic aids to translation both as ways to automate it and as an approach to solve problems resulting from shortage of qualified translators. Describes the limitations of robotic MT (Machine Translation) systems, viewing MAT (Machine-Aided Translation) as the only practical solution and the best vehicle for further automation. (MES)

  2. Towards a French/French Sign Language Machine-Translation System (TLF Vers un système de traduction automatique : français/langue des signes française (TLF

    Directory of Open Access Journals (Sweden)

    Gaspard Breton

    2009-04-01

    Full Text Available We present a work in the framewok of machine translation from French to French Sign Language (FSL with synthesis of gestures through a virtual agent. We first give some descriptive and theoretical elements about the FSL. Then we propose a formalization of the standard part of the FSL lexicon as well as propositions concerning some morpho-syntactic phenomena. Then we present the machine translation system based on interlingua representation system "TiLT" developed at France Telecom R&D and the adaptation of its generation module to FSL. We end with the avatar technology also developed at France Telecom R&D and its FSL treatment.

  3. Multilingual online machine translation research%多语言在线机器翻译研究

    Institute of Scientific and Technical Information of China (English)

    董兴华; 徐春; 王磊; 周喜

    2012-01-01

    This paper gives a description of implementing an online, high-powered multilingual translation engine, which consists of three language pairs, that is, Uyghur-Chinese, Kazakh-Chinese and Kyrgyz-Chinese, based on additional knowledge base and multithreading and task distributing technology. The translation engine is easy to be extended to other language pairs, which has an ability of translating words, phrases, sentences, files and webs in one language to those of another.%描述了通过使用外部知识库和基于短语的翻译模型,利用多线程、任务分发的技术实现了一个在线的、高性能的多语言翻译引擎,已初步实现了维汉、哈汉、柯汉三种语言间的翻译.翻译引擎很容易扩展到其他语言对,具有翻译词、短语、句子、文件和网页的功能.

  4. Users page feedback

    CERN Multimedia

    2010-01-01

    In October last year the Communication Group proposed an interim redesign of the users’ web pages in order to improve the visibility of key news items, events and announcements to the CERN community. The proposed update to the users' page (right), and the current version (left, behind) This proposed redesign was seen as a small step on the way to much wider reforms of the CERN web landscape proposed in the group’s web communication plan.   The results are available here. Some of the key points: - the balance between news / events / announcements and access to links on the users’ pages was not right - many people asked to see a reversal of the order so that links appeared first, news/events/announcements last; - many people felt that we should keep the primary function of the users’ pages as an index to other CERN websites; - many people found the sections of the front page to be poorly delineated; - people do not like scrolling; - there were performance...

  5. JERHRE's New Web Pages.

    Science.gov (United States)

    2006-06-01

    JERHRE'S WEBSITE, www.csueastbay.edu/JERHRE/ has two new pages. One of those pages is devoted to curriculum that may be used to educate students, investigators and ethics committee members about issues in the ethics of human subjects research, and to evaluate their learning. It appears at www.csueastbay.edu/JERHRE/cur.html. The other is devoted to emailed letters from readers. Appropriate letters will be posted as soon as they are received by the editor. Letters from readers appear at www.csueastbay.edu/JERHRE/let.html.

  6. Folding worlds between pages

    CERN Multimedia

    Meier, Matthias

    2010-01-01

    "We all remember pop-up books form our childhood. As fascinated as we were back then, we probably never imagined how much engineering know-how went into these books. Pop-up engineer Anton Radevsky has even managed to fold a 27-kilometre particle accelerator into a book" (4 pages)

  7. Automatic translation among spoken languages

    Science.gov (United States)

    Walter, Sharon M.; Costigan, Kelly

    1994-01-01

    The Machine Aided Voice Translation (MAVT) system was developed in response to the shortage of experienced military field interrogators with both foreign language proficiency and interrogation skills. Combining speech recognition, machine translation, and speech generation technologies, the MAVT accepts an interrogator's spoken English question and translates it into spoken Spanish. The spoken Spanish response of the potential informant can then be translated into spoken English. Potential military and civilian applications for automatic spoken language translation technology are discussed in this paper.

  8. Alzheimer's Disease Information Page

    Science.gov (United States)

    ... Translational Research Research at NINDS Focus on Research Alzheimer's & Related Dementias Bioengineering Epilepsy Health Disparities Neural Interfaces Parkinson's Disease Spinal Cord Injury Stem Cells Traumatic Brain Injury Trans-Agency Activities Interagency Research ...

  9. 理性主义与经验主义相结合的机器翻译研究策略%Rationalism and Empiricism on the Combination in Machine Translation

    Institute of Scientific and Technical Information of China (English)

    徐金安

    2011-01-01

    主要介绍了基于规则、基于实例和基于统计等3种主流机器翻译方法,探讨了自然语言处理技术和机器翻译中基于规则的理性主义方法和基于统计的经验主义方法的优缺点,结合机器翻译研究的现状和发展方向,提出了规则和统计相结合的机器翻译方法的基本思路,阐述了词义消歧中的理性主义方法和经验主义方法相结合的发展方向,对机器翻译的发展趋势进行了探讨.%This paper firstly discussed the development of machine translation, secondly described the advantages and disadvantages of the rationalism and empiricism in natural language processing and machine translation, and thirdly introduced our basic ideas of combination of the rule-based approach and statistical approach.Furthermore, the paper deliberated the tendency of combination of the rationalism and empiricism in word sense disambiguation, finally summarized the developmental tendency of machine translation.

  10. Machine Transliteration

    CERN Document Server

    Knight, K; Knight, Kevin; Graehl, Jonathan

    1997-01-01

    It is challenging to translate names and technical terms across languages with different alphabets and sound inventories. These items are commonly transliterated, i.e., replaced with approximate phonetic equivalents. For example, "computer" in English comes out as "konpyuutaa" in Japanese. Translating such items from Japanese back to English is even more challenging, and of practical interest, as transliterated items make up the bulk of text phrases not found in bilingual dictionaries. We describe and evaluate a method for performing backwards transliterations by machine. This method uses a generative model, incorporating several distinct stages in the transliteration process.

  11. Statistical Machine Translation of Japanese

    Science.gov (United States)

    2007-03-01

    spoken, it may be phonologically displayed as /W AA T AA SH IY/, where each of the six elements between the slashes is a written representation of...phonemes. A chart showing the possible assignment of kana to phonemes is in Figure 3.1. The complete phonetic symbol set in Table 6.4 is the set of...sequences output by the recognizer. 6.6 Acoustic Model 66 The acoustic model, P(O|S) from Bayes’ rule, maps acoustic features to distinct phonetic

  12. Multiplex PageRank

    CERN Document Server

    Halu, Arda; Pansaraza, Pietro; Bianconi, Ginestra

    2013-01-01

    Many complex systems can be described as multiplex networks in which the same nodes can interact with one another in different layers, thus forming a set of interacting and co-evolving networks. Examples of such multiplex systems are social networks where people are involved in different types of relationships and interact through various forms of communication media. The ranking of nodes in multiplex networks is one of the most pressing and challenging tasks that research on complex networks is currently facing. When pairs of nodes can be connected through multiple links and in multiple layers, the ranking of nodes should necessarily reflect the importance of nodes in one layer as well as their importance in other interdependent layers. In this paper, we draw on the idea of biased random walks to define the Multiplex PageRank centrality measure in which the effects of the interplay between networks on the centrality of nodes are directly taken into account. In particular, depending on the intensity of the in...

  13. Translator awareness Translator awareness

    Directory of Open Access Journals (Sweden)

    Wolfram Wilss

    2008-04-01

    Full Text Available If we want to encompass adequately the wide-ranging field of human translation, it is necessary to include in translation studies (TS the concept of translator awareness (or translator consciousness, for that matter. However, this is more easily said than done, because this concept does not easily lend itself to definition, let alone to measurement, e. g., by investigating translator behaviour. To put it bluntly: Translator awareness is a fuzzy concept. Like many obviously difficult-to-define concepts, with which dialogue in TS is burdened, translator awareness lacks an articulated theory within which different forms of translator behaviour can be convincingly related to, or distinguished from, one another. Hence, TS has so far not tackled, at least not systematically, the issue of translator awareness. If we want to encompass adequately the wide-ranging field of human translation, it is necessary to include in translation studies (TS the concept of translator awareness (or translator consciousness, for that matter. However, this is more easily said than done, because this concept does not easily lend itself to definition, let alone to measurement, e. g., by investigating translator behaviour. To put it bluntly: Translator awareness is a fuzzy concept. Like many obviously difficult-to-define concepts, with which dialogue in TS is burdened, translator awareness lacks an articulated theory within which different forms of translator behaviour can be convincingly related to, or distinguished from, one another. Hence, TS has so far not tackled, at least not systematically, the issue of translator awareness.

  14. Turning the page

    Science.gov (United States)

    2015-01-01

    in hand to allow for a more limited production of a paper version of the Annals for RCS fellows and members who continue to elect to receive their Annals in the traditional format. Medical colleges around the world are currently undergoing similar deliberations and for some a digital version may represent the only opportunity to maintain editorial independence – unhindered by the implications of a commercial publishing partner. It is however hoped that for the vast majority of fellows and members, the new and enhanced digital platform will offer significant advantages such that the digital version becomes the de facto medium of choice. Matt Whitaker and the team at the Annals should be congratulated for their sterling efforts in making this transition. The new site, now live at http://publishing.rcseng.ac.uk, will enhance the experience of finding, accessing, reading, citing, sharing and saving articles from the Annals, Bulletin and FDJ. Sign-on will be much easier; page load times quicker and the search engine more powerful and intuitive. The new platform boasts improved functionality, full in-page article text and multi-media, citation tracking, reference generators and advanced social media integration. We are simultaneously launching a new video library where we will be hosting our technical videos. It will, I am certain, become a huge resource for our surgical fraternity. Our new platform will be followed later this year by the inevitable and ubiquitous app, which will allow readers to download issues of the Annals and read them offline and at leisure on whatever their tablet of choice might be. It is my belief that these and forthcoming changes herald the transformation of the Annals into a truly modern journal with all the digital services that authors and readers now rightly expect from their RCS publication. Tim Lane Editor-in-Chief, rcsannalseditor@gmail.com

  15. Writing Through: Practising Translation

    Directory of Open Access Journals (Sweden)

    Joel Scott

    2010-05-01

    Full Text Available This essay exists as a segment in a line of study and writing practice that moves between a critical theory analysis of translation studies conceptions of language, and the practical questions of what those ideas might mean for contemporary translation and writing practice. Although the underlying preoccupation of this essay, and my more general line of inquiry, is translation studies and practice, in many ways translation is merely a way into a discussion on language. For this essay, translation is the threshold of language. But the two trails of the discussion never manage to elude each other, and these concatenations have informed two experimental translation methods, referred to here as Live Translations and Series Translations. Following the essay are a number of poems in translation, all of which come from Blanco Nuclear by the contemporary Spanish poet, Esteban Pujals Gesalí. The first group, the Live Translations consist of transcriptions I made from audio recordings read in a public setting, in which the texts were translated in situ, either off the page of original Spanish-language poems, or through a process very much like that carried out by simultaneous translators, for which readings of the poems were played back to me through headphones at varying speeds to be translated before the audience. The translations collected are imperfect renderings, attesting to a moment in language practice rather than language objects. The second method involves an iterative translation process, by which three versions of any one poem are rendered, with varying levels of fluency, fidelity and servility. All three translations are presented one after the other as a series, with no version asserting itself as the primary translation. These examples, as well as the translation methods themselves, are intended as preliminary experiments within an endlessly divergent continuum of potential methods and translations, and not as a complete representation of

  16. A reverse engineering approach for automatic annotation of Web pages

    NARCIS (Netherlands)

    R. de Virgilio (Roberto); F. Frasincar (Flavius); W. Hop (Wim); S. Lachner (Stephan)

    2013-01-01

    textabstractThe Semantic Web is gaining increasing interest to fulfill the need of sharing, retrieving, and reusing information. Since Web pages are designed to be read by people, not machines, searching and reusing information on the Web is a difficult task without human participation. To this aim

  17. #NoMorePage3

    DEFF Research Database (Denmark)

    Glozer, Sarah; McCarthy, Lauren; Whelan, Glen

    2015-01-01

    Fourth wave feminists are currently seeking to bring an end to The Sun’s Page 3, a British institution infamous for featuring a topless female model daily. This paper investigates the No More Page 3 (NMP3) campaign through which feminist activists have sought to disrupt the institutionalized obje...

  18. Web Page Design (Part Three).

    Science.gov (United States)

    Descy, Don E.

    1997-01-01

    Discusses fonts as well as design considerations that should be reviewed when designing World Wide Web pages and sites to make them easier for clients to use and easier to maintain. Also discusses the simplicity of names; organization of pages, folders, and files; and sites to help build Web sites. (LRW)

  19. Measuring Cognitive Translation Effort with Activity Units

    DEFF Research Database (Denmark)

    Schaeffer, Moritz; Carl, Michael; Lacruz, Isabel

    2016-01-01

    Despite the increased quality of Machine Translation output, human interaction will remain a crucial activity to guarantee the quality of the final translation products. Human-computer interaction in translation will likely be the more successful the more we understand the properties and compleme...... methods in empirical translation process research and suggests ngrams of Activity Units for measuring the translation process....

  20. Pattern recognition & machine learning

    CERN Document Server

    Anzai, Y

    1992-01-01

    This is the first text to provide a unified and self-contained introduction to visual pattern recognition and machine learning. It is useful as a general introduction to artifical intelligence and knowledge engineering, and no previous knowledge of pattern recognition or machine learning is necessary. Basic for various pattern recognition and machine learning methods. Translated from Japanese, the book also features chapter exercises, keywords, and summaries.

  1. On Yao's method of translation

    NARCIS (Netherlands)

    Liu, X.; Hoede, C.

    2002-01-01

    Machine Translation, i.e., translating one kind of natural language to another kind of natural language by using a computer system, is a very important research branch in Artificial Intelligence. Yao developed a method of translation that he called ``Lexical-Semantic Driven". In his system he introd

  2. Paging memory from random access memory to backing storage in a parallel computer

    Science.gov (United States)

    Archer, Charles J; Blocksome, Michael A; Inglett, Todd A; Ratterman, Joseph D; Smith, Brian E

    2013-05-21

    Paging memory from random access memory (`RAM`) to backing storage in a parallel computer that includes a plurality of compute nodes, including: executing a data processing application on a virtual machine operating system in a virtual machine on a first compute node; providing, by a second compute node, backing storage for the contents of RAM on the first compute node; and swapping, by the virtual machine operating system in the virtual machine on the first compute node, a page of memory from RAM on the first compute node to the backing storage on the second compute node.

  3. Translations and Translators.

    Science.gov (United States)

    Nida, Eugene A.

    1979-01-01

    The necessity for stylistic appropriateness in translation as well as correct content is discussed. To acquire this skill, translators must be trained in stylistics through close examination of their own language and must have practice in translating for different audiences at different levels. (PMJ)

  4. Production Machine Shop Employment Competencies. Part Four: The Milling Machine.

    Science.gov (United States)

    Bishart, Gus; Werner, Claire

    Competencies for production machine shop are provided for the fourth of four topic areas: the milling machine. Each competency appears in a one-page format. It is presented as a goal statement followed by one or more "indicator" statements, which are performance objectives describing an ability that, upon attainment, will establish…

  5. Distributed paging for general networks

    Energy Technology Data Exchange (ETDEWEB)

    Awerbuch, B.; Bartal, Y.; Fiat, A. [Tel-Aviv Univ. (Israel)

    1996-12-31

    Distributed paging deals with the dynamic allocation of copies of files in a distributed network as to minimize the total communication cost over a sequence of read and write requests. Most previous work deals with the file allocation problem where infinite nodal memory capacity is assumed. In contrast the distributed paging problem makes the more realistic assumption that nodal memory capacity is limited. Former work on distributed paging deals with the problem only in the case of a uniform network topology. This paper gives the first distributed paging algorithm for general networks. The algorithm is competitive in storage and communication. The competitive ratios are poly-logarithmic in the total number of network nodes and the diameter of the network.

  6. PageRank of integers

    CERN Document Server

    Frahm, K M; Shepelyansky, D L

    2012-01-01

    We build up a directed network tracing links from a given integer to its divisors and analyze the properties of the Google matrix of this network. The PageRank vector of this matrix is computed numerically and it is shown that its probability is inversely proportional to the PageRank index thus being similar to the Zipf law and the dependence established for the World Wide Web. The spectrum of the Google matrix of integers is characterized by a large gap and a relatively small number of nonzero eigenvalues. A simple semi-analytical expression for the PageRank of integers is derived that allows to find this vector for matrices of billion size. This network provides a new PageRank order of integers.

  7. Code AI Personal Web Pages

    Science.gov (United States)

    Garcia, Joseph A.; Smith, Charles A. (Technical Monitor)

    1998-01-01

    The document consists of a publicly available web site (george.arc.nasa.gov) for Joseph A. Garcia's personal web pages in the AI division. Only general information will be posted and no technical material. All the information is unclassified.

  8. Translating Words, Translating Cultures

    Directory of Open Access Journals (Sweden)

    Richard Whitaker

    2012-03-01

    Full Text Available What exactly does (or should translation from one language into another try to do? Attempt to convey to readers of the target language (the language into which one is translating something of the strangeness, difference and historicity of the original in the source language (the language from which one is translating? Or must translation try to bridge the gap between source and target language, by rendering the original in a thoroughly contemporary style and diction, as if this were a work being written now for the first time? And related to these the further questions: how closely should a translation render the genre, language, metre, style and content of the original? How far can a translation depart from the original without ceasing to be a translation – in other words, where is one to situate the border between “translation”, “version” and “adaptation”?

  9. The Implications of Virtual Machine Introspection for Digital Forensics on Nonquiescent Virtual Machines

    Science.gov (United States)

    2011-06-01

    TERMS Virtual Machine Introspection, VMI , Virtual Machine, Forensics, Xen 15. NUMBER OF PAGES 61 16. PRICE CODE 17. SECURITY CLASSIFICATION OF...taken for VMI memory dump utility to run with different loads ...........22 x THIS PAGE INTENTIONALLY LEFT BLANK xi LIST OF TABLES Table 1...Time to compile kernel without VMI ..............................................................20 Table 2. Time to compile kernel with VMI

  10. To Weaken the Language Barriers for the Chinese Minorities by the Machine Translation Technology%利用机器翻译技术消弱少数民族语言障碍

    Institute of Scientific and Technical Information of China (English)

    吴凤娟

    2011-01-01

    As a multi-ethnic region, there are both advantages and disadvantages for China. The language barrier is always a great obstruction for the development of the minority nationalities. The machine translation specific to the languages of Chinese minorities contribute to advance the culture communication, to drive the economic growth and to build a harmonious society. By now some development has been made in this field. But if we want to get the actual applying effect, there is a long road before us. To advance the machine translation specific to the languages of Chinese minorities is of great significance.%我国多民族聚居的局面有利有弊,语言障碍一直是少数民族地区发展的巨大阻力.针对少数民族语言的机器翻译有助于推进文化交流、带动经济发展、建立和谐社会.目前该领域的研究已取得了一定进展,但要达到实际应用效果仍有较长的路要走.大力推进针对少数民族的机器翻译工作意义重大.

  11. A Comparative Study of "Google Translate" Translations: An Error Analysis of English-to-Persian and Persian-to-English Translations

    Science.gov (United States)

    Ghasemi, Hadis; Hashemian, Mahmood

    2016-01-01

    Both lack of time and the need to translate texts for numerous reasons brought about an increase in studying machine translation with a history spanning over 65 years. During the last decades, Google Translate, as a statistical machine translation (SMT), was in the center of attention for supporting 90 languages. Although there are many studies on…

  12. 面向移动终端的统计机器翻译解码定点化方法%A Fixed Point Decoding Approach for Statistical Machine Translation on Mobile Terminals

    Institute of Scientific and Technical Information of China (English)

    李响; 徐金安; 姜文斌; 吕雅娟; 刘群

    2011-01-01

    面向移动终端的统计机器翻译需求越来越多,但无浮点运算单元的处理器限制了翻译速度.该文提出了一种对统计机器翻译解码运算的定点化运算方法,缓解了无浮点运算单元的处理器对翻译速度的影响.基于PC和移动终端的实验表明,在保证翻译质量的情况下,利用定点处理浮点运算的解码器的运算速度较编译器模拟的浮点运算速度提高135.6%.因此,该方法可以有效地提高浮点运算能力薄弱的移动终端统计机器翻译速度.%The demand for statistical machine translation (SMT) on mobile terminals is increasing, but the processor without floating point unit (FPU) restricts the translation speed. This paper proposes an approach to switch floating point operation to fixed point operation for decoder of SMT system on mobile terminals, and increase the translation speed on the processor without FPU. The experiments based on PC and mobile terminal show while this approach assures the quality of translation, the speed of our approach is 135.6% faster than the speed of floating point operation emulated by compiler. Therefore, this approach can efficiently increase the translation speed of SMT system on mobile terminals with weak ability in floating point operation.

  13. Completeness of Compositional Translation for Context-Free Grammars

    CERN Document Server

    Huijsen, W O

    1996-01-01

    A machine translation system is said to be *complete* if all expressions that are correct according to the source-language grammar can be translated into the target language. This paper addresses the completeness issue for compositional machine translation in general, and for compositional machine translation of context-free grammars in particular. Conditions that guarantee translation completeness of context-free grammars are presented.

  14. 从机器翻译历程看自然语言处理研究的发展策略%The Development Strategy for Natural Language Processing Research Inspired from a Historical View on Machine Translation

    Institute of Scientific and Technical Information of China (English)

    孙茂松; 周建设

    2016-01-01

    Machine translation (MT) is one of the major research fi elds of natural language processing (NLP), and it always spearheads the research frontier in NLP. In this paper, after a systematic survey of the development history of MT from a macroscopic perspective, with particular emphasis on the main development path of underlying methodologies and core technologies in MT, we drew a general picture of the milestones that marked the key points of a long journey for both theoretical study and practical accomplishment for the past seven decades. The latest fruitful development achieved in the area of MT application shows that, the paradigm shift from the tradi-tional linguistic rule-based approaches to the so-called empirical approach, based on increasingly available amounts of “raw data” in the form of massive collections of texts and their translations, compounded by the phenomenal advancement of computer technology, will become the driving force that will potentially lead to the breakthrough in MT. Based on the above observation and analysis, some sug-gestions on the short-term development strategy for machine translation as well as natural language processing in China are proposed.%本文试图从超脱细节的宏观角度,对机器翻译的发展历程进行扼要的总结和深刻的评介,着重于刻画各个时期在基本方法和核心技术上的主要特征,从而勾勒出机器翻译的全过程演进脉络。在上述考察和分析的基础上,文章对国内机器翻译乃至自然语言处理研究的近期发展策略提出了若干建议。

  15. A Taxonomy of Human Translation Styles

    DEFF Research Database (Denmark)

    Carl, Michael; Dragsted, Barbara; Lykke Jakobsen, Arnt

    2011-01-01

    While the translation profession becomes increasingly technological, we are still far from understanding how humans actually translate and how they could be best supported by machines. In this paper we outline a method which helps to uncover characteristics of human translation processes. Based o...... on the translators' activity data, we develop a taxonomy of translation styles. The taxonomy could serve to inform the development of advanced translation assistance tools and provide a basis for a felicitous and grounded integration of human machine interaction in translation....

  16. The Faculty Web Page: Contrivance or Continuation?

    Science.gov (United States)

    Lennex, Lesia

    2007-01-01

    In an age of Internet education, what does it mean for a tenure/tenure-track faculty to have a web page? How many professors have web pages? If they have a page, what does it look like? Do they really need a web page at all? Many universities have faculty web pages. What do those collective pages look like? In what way do they represent the…

  17. Instant PageSpeed optimization

    CERN Document Server

    Jaiswal, Sanjeev

    2013-01-01

    Filled with practical, step-by-step instructions and clear explanations for the most important and useful tasks. Instant PageSpeed Optimization is a hands-on guide that provides a number of clear, step-by-step exercises for optimizing your websites for better performance and improving their efficiency.Instant PageSpeed Optimization is aimed at website developers and administrators who wish to make their websites load faster without any errors and consume less bandwidth. It's assumed that you will have some experience in basic web technologies like HTML, CSS3, JavaScript, and the basics of netw

  18. Translation Method and Computer Programme for Assisting the Same

    DEFF Research Database (Denmark)

    2013-01-01

    The present invention relates to a translation method comprising the steps of: a translator speaking a translation of a written source text in a target language, an automatic speech recognition system converting the spoken translation into a set of phone and word hypotheses in the target language......, a machine translation system translating the written source text into a set of translations hypotheses in the target language, and an integration module combining the set of spoken word hypotheses and the set of machine translation hypotheses obtaining a text in the target language. Thereby obtaining...... a significantly improved and faster translation compared to what is achieved by known translation methods....

  19. What snippets say about pages

    NARCIS (Netherlands)

    Demeester, Thomas; Nguyen, Dong; Trieschnigg, Dolf; Develder, Chris; Hiemstra, Djoerd

    2013-01-01

    What is the likelihood that a Web page is considered relevant to a query, given the relevance assessment of the corresponding snippet? Using a new FederatedWeb Search test collection that contains search results from over a hundred search engines on the internet, we are able to investigate such rese

  20. Database-Based Web Page

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    Database-based web page which uses IIS4.0 + ASP + ADO + SQL7.0 isbriefly introduced. It has been successfully used in E-commerce , bulletin board system and chat room, and so on in the web site of Computer Center Hudong Campus, Tong ji University.

  1. Design of Educational Web Pages

    Science.gov (United States)

    Galan, Jose Gomez; Blanco, Soledad Mateos

    2004-01-01

    The methodological characteristics of teaching in primary and secondary education make it necessary to revise the pedagogical and instructive lines with which to introduce the new Information and Communication Technologies into the school context. The construction of Web pages that can be used to improve student learning is, therefore, fundamental…

  2. Web Page Design (Part One).

    Science.gov (United States)

    Descy, Don E.

    1997-01-01

    Discusses rules for Web page design: consider audiences' Internet skills and equipment; know your content; outline the material; map or sketch the site; be consistent; regulate size of graphics to control download time; place eye catching material in the first 300 pixels; moderate use of color to control file size and bandwidth; include a…

  3. Learning through Web Page Design.

    Science.gov (United States)

    Peel, Deborah

    2001-01-01

    Describes and evaluates the use of Web page design in an undergraduate course in the United Kingdom on town planning. Highlights include incorporating information and communication technologies into higher education; and a theoretical framework for the use of educational technology. (LRW)

  4. MT In Business English Translation

    Institute of Scientific and Technical Information of China (English)

    张志新

    2009-01-01

    In this article the operational principles of MT in business English translation is briefly introduced with an aim to point out that to improve the MT quality machine study is a key factor to work on.

  5. Translating China

    Institute of Scientific and Technical Information of China (English)

    2011-01-01

    sidney Shapiro, an American-born translator famous for his translation of Chinese literary works, received the Lifetime Achievement Award in Translation by the Translators Association of China on December 2, 2010.

  6. Translation Theory 'Translated'

    DEFF Research Database (Denmark)

    Wæraas, Arild; Nielsen, Jeppe

    2016-01-01

    Translation theory has proved to be a versatile analytical lens used by scholars working from different traditions. On the basis of a systematic literature review, this study adds to our understanding of the ‘translations’ of translation theory by identifying the distinguishing features of the mo......, but also overlapping. We discuss the ways in which the three versions of translation theory may be combined and enrich each other so as to inform future research, thereby offering a more complete understanding of translation in and across organizational settings.......Translation theory has proved to be a versatile analytical lens used by scholars working from different traditions. On the basis of a systematic literature review, this study adds to our understanding of the ‘translations’ of translation theory by identifying the distinguishing features of the most...... common theoretical approaches to translation within the organization and management discipline: actor-network theory, knowledge-based theory, and Scandinavian institutionalism. Although each of these approaches already has borne much fruit in research, the literature is diverse and somewhat fragmented...

  7. Understanding Translation

    DEFF Research Database (Denmark)

    Schjoldager, Anne Gram; Gottlieb, Henrik; Klitgård, Ida

    Understanding Translation is designed as a textbook for courses on the theory and practice of translation in general and of particular types of translation - such as interpreting, screen translation and literary translation. The aim of the book is to help you gain an in-depth understanding...... - translators, language teachers, translation users and literary, TV and film critics, for instance. Discussions focus on translation between Danish and English....

  8. Understanding Translation

    DEFF Research Database (Denmark)

    Schjoldager, Anne Gram; Gottlieb, Henrik; Klitgård, Ida

    Understanding Translation is designed as a textbook for courses on the theory and practice of translation in general and of particular types of translation - such as interpreting, screen translation and literary translation. The aim of the book is to help you gain an in-depth understanding of the...... - translators, language teachers, translation users and literary, TV and film critics, for instance. Discussions focus on translation between Danish and English....

  9. Innovating Web Page Classification Through Reducing Noise

    Institute of Scientific and Technical Information of China (English)

    LI Xiaoli (李晓黎); SHI Zhongzhi(史忠植)

    2002-01-01

    This paper presents a new method that eliminates noise in Web page classification. It first describes the presentation of a Web page based on HTML tags. Then through a novel distance formula, it eliminates the noise in similarity measure. After carefully analyzing Web pages, we design an algorithm that can distinguish related hyperlinks from noisy ones.We can utilize non-noisy hyperlinks to improve the performance of Web page classification (the CAWN algorithm). For any page, wecan classify it through the text and category of neighbor pages related to the page. The experimental results show that our approach improved classification accuracy.

  10. Market study: Tactile paging system

    Science.gov (United States)

    1977-01-01

    A market survey was conducted regarding the commercialization potential and key market factors relevant to a tactile paging system for deaf-blind people. The purpose of the tactile paging system is to communicate to the deaf-blind people in an institutional environment. The system consists of a main console and individual satellite wrist units. The console emits three signals by telemetry to the wrist com (receiving unit) which will measure approximately 2 x 4 x 3/4 inches and will be fastened to the wrist by a strap. The three vibration signals are fire alarm, time period indication, and a third signal which will alert the wearer of the wrist com to the fact that the pin on the top of the wrist is emitting a morse coded message. The Morse code message can be felt and recognized with the finger.

  11. Functional Multiplex PageRank

    CERN Document Server

    Iacovacci, Jacopo; Arenas, Alex; Bianconi, Ginestra

    2016-01-01

    Recently it has been recognized that many complex social, technological and biological networks have a multilayer nature and can be described by multiplex networks. Multiplex networks are formed by a set of nodes connected by links having different connotations forming the different layers of the multiplex. Characterizing the centrality of the nodes in a multiplex network is a challenging task since the centrality of the node naturally depends on the importance associated to links of a certain type. Here we propose to assign to each node of a multiplex network a centrality called Functional Multiplex PageRank that is a function of the weights given to every different pattern of connections (multilinks) existent in the multiplex network between any two nodes. Since multilinks distinguish all the possible ways in which the links in different layers can overlap, the Functional Multiplex PageRank can describe important non-linear effects when large relevance or small relevance is assigned to multilinks with overl...

  12. Synonymy and translation

    NARCIS (Netherlands)

    Jong, de Franciska; Appelo, Lisette

    1987-01-01

    This paper is meant to give some insight into the interaction between on the one hand theoretical concepts in the field of formal semantics, and on the other hand linguistic research directed towards an application, more specifically, the research in the machine translation project Rosetta. The cent

  13. Synonymy and Translation

    NARCIS (Netherlands)

    de Jong, Franciska M.G.; Appelo, Lisette

    1987-01-01

    This paper is meant to give some insight into the interaction between on the one hand theoretical concepts in the field of formal semantics, and on the other hand linguistic research directed towards an application, more specifically, the research in the machine translation project Rosetta. The

  14. Functional Multiplex PageRank

    Science.gov (United States)

    Iacovacci, Jacopo; Rahmede, Christoph; Arenas, Alex; Bianconi, Ginestra

    2016-10-01

    Recently it has been recognized that many complex social, technological and biological networks have a multilayer nature and can be described by multiplex networks. Multiplex networks are formed by a set of nodes connected by links having different connotations forming the different layers of the multiplex. Characterizing the centrality of the nodes in a multiplex network is a challenging task since the centrality of the node naturally depends on the importance associated to links of a certain type. Here we propose to assign to each node of a multiplex network a centrality called Functional Multiplex PageRank that is a function of the weights given to every different pattern of connections (multilinks) existent in the multiplex network between any two nodes. Since multilinks distinguish all the possible ways in which the links in different layers can overlap, the Functional Multiplex PageRank can describe important non-linear effects when large relevance or small relevance is assigned to multilinks with overlap. Here we apply the Functional Page Rank to the multiplex airport networks, to the neuronal network of the nematode C. elegans, and to social collaboration and citation networks between scientists. This analysis reveals important differences existing between the most central nodes of these networks, and the correlations between their so-called pattern to success.

  15. Translating Means Translating Meaning

    Institute of Scientific and Technical Information of China (English)

    李海燕

    2000-01-01

    美国著名翻译理论家尤金·奈达说 :“翻译即译意 (Translating m eans translating m eaning)。”就实质而言 ,翻译即译意。就是把一种语言表达的意义用另一种语言表达出来。翻译分理解与表达两个步骤。理解是翻译的基础 ,表达直接决定译文的成败与优劣 ,两者缺一不可

  16. Human Translator and Translation Technology

    Institute of Scientific and Technical Information of China (English)

    李辰

    2016-01-01

    With the great development of technology, translation technology exerts great influence on human translators because during their translation process, they may use many computer-aided translation tools, such as TRADOS, Snowman, WordFisher and etc. However, they always misunderstand the concept of computer-aided translation, so this thesis managed to providedetails about some translation technology and human translators' strengths so as to help them improve the productivity and the quality of theirtranslation works effectively and efficiently.

  17. Interstellar Initiative Web Page Design

    Science.gov (United States)

    Mehta, Alkesh

    1999-01-01

    This summer at NASA/MSFC, I have contributed to two projects: Interstellar Initiative Web Page Design and Lenz's Law Relative Motion Demonstration. In the Web Design Project, I worked on an Outline. The Web Design Outline was developed to provide a foundation for a Hierarchy Tree Structure. The Outline would help design a Website information base for future and near-term missions. The Website would give in-depth information on Propulsion Systems and Interstellar Travel. The Lenz's Law Relative Motion Demonstrator is discussed in this volume by Russell Lee.

  18. AUTOMATIC TAGGING OF PERSIAN WEB PAGES BASED ON N-GRAM LANGUAGE MODELS USING MAPREDUCE

    Directory of Open Access Journals (Sweden)

    Saeed Shahrivari

    2015-07-01

    Full Text Available Page tagging is one of the most important facilities for increasing the accuracy of information retrieval in the web. Tags are simple pieces of data that usually consist of one or several words, and briefly describe a page. Tags provide useful information about a page and can be used for boosting the accuracy of searching, document clustering, and result grouping. The most accurate solution to page tagging is using human experts. However, when the number of pages is large, humans cannot be used, and some automatic solutions should be used instead. We propose a solution called PerTag which can automatically tag a set of Persian web pages. PerTag is based on n-gram models and uses the tf-idf method plus some effective Persian language rules to select proper tags for each web page. Since our target is huge sets of web pages, PerTag is built on top of the MapReduce distributed computing framework. We used a set of more than 500 million Persian web pages during our experiments, and extracted tags for each page using a cluster of 40 machines. The experimental results show that PerTag is both fast and accurate

  19. Block Models and Personalized PageRank

    CERN Document Server

    Kloumann, Isabel; Kleinberg, Jon

    2016-01-01

    Methods for ranking the importance of nodes in a network have a rich history in machine learning and across domains that analyze structured data. Recent work has evaluated these methods though the seed set expansion problem: given a subset $S$ of nodes from a community of interest in an underlying graph, can we reliably identify the rest of the community? We start from the observation that the most widely used techniques for this problem, personalized PageRank and heat kernel methods, operate in the space of landing probabilities of a random walk rooted at the seed set, ranking nodes according to weighted sums of landing probabilities of different length walks. Both schemes, however, lack an a priori relationship to the seed set objective. In this work we develop a principled framework for evaluating ranking methods by studying seed set expansion applied to the stochastic block model. We derive the optimal gradient for separating the landing probabilities of two classes in a stochastic block model, and find, ...

  20. When Machines Design Machines!

    DEFF Research Database (Denmark)

    2011-01-01

    Until recently we were the sole designers, alone in the driving seat making all the decisions. But, we have created a world of complexity way beyond human ability to understand, control, and govern. Machines now do more trades than humans on stock markets, they control our power, water, gas...... and food supplies, manage our elevators, microclimates, automobiles and transport systems, and manufacture almost everything. It should come as no surprise that machines are now designing machines. The chips that power our computers and mobile phones, the robots and commercial processing plants on which we...... depend, all are now largely designed by machines. So what of us - will be totally usurped, or are we looking at a new symbiosis with human and artificial intelligences combined to realise the best outcomes possible. In most respects we have no choice! Human abilities alone cannot solve any of the major...

  1. Training Activity Summary Page (TASP) Campus

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Training Activity Summary Page (formerly the Training Exit Survey Cover Page) dataset contains data about each training event. This dataset includes information...

  2. Evaluation of the Importance of Web Pages

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    Google's algorithm on PageRank is analyzed in details. Some disadvantages of this algorithm is presented, for instance, preferring old pages, ignoring special sites and inaccurate judge of hyperlinks pointed out from one page. Furthermore, author's improved algorithm is described. Experiments show that the author's consideration on evaluating the importance of pages can make an improvement over the original algorithm. Based on this improved algorithm a topicspecific searching system have been developed.

  3. Web Page Design and Network Analysis.

    Science.gov (United States)

    Wan, Hakman A.; Chung, Chi-wai

    1998-01-01

    Examines problems in Web-site design from the perspective of network analysis. In view of the similarity between the hypertext structure of Web pages and a generic network, network analysis presents concepts and theories that provide insight for Web-site design. Describes the problem of home-page location and control of number of Web pages and…

  4. Classifying web pages with visual features

    NARCIS (Netherlands)

    de Boer, V.; van Someren, M.; Lupascu, T.; Filipe, J.; Cordeiro, J.

    2010-01-01

    To automatically classify and process web pages, current systems use the textual content of those pages, including both the displayed content and the underlying (HTML) code. However, a very important feature of a web page is its visual appearance. In this paper, we show that using generic visual fea

  5. Computer-aided Translation Technology and Its Applications

    Institute of Scientific and Technical Information of China (English)

    汪美侠; 何大顺

    2014-01-01

    This article begins with a brief analysis of the significance of translation technology in different spheres of modern life, followed by a distinction between machine translation (MT) and computer-aided translation (CAT). It then describes some trans-lation resources and tools and examines the negative and positive aspects of computer-aided translations. Finally it comes to a conclusion that it would be greatly efficient and productive for the translators to acquire the new skills in the translation work-place.

  6. On the Systematicity of Human Translation Processes

    DEFF Research Database (Denmark)

    Carl, Michael; Dragsted, Barbara; Lykke Jakobsen, Arnt

    While translation careers and the translation profession become more globalised and more technological, we are still far from understanding how humans actually translate and how they could be best supported by machines. In this paper we attempt to outline a method which helps to uncover character...

  7. 基于有限状态自动机的远程命令识别与解析%Recognition & Translation of Remote Commands Based on Finite State Machine

    Institute of Scientific and Technical Information of China (English)

    毛凤翔; 韩冰

    2013-01-01

    远程命令识别与解析是嵌入式环境中终端-控制台和上-下位机模式实现远程管控的基础和关键.文中分析水下探测智能终端的工作过程,提出了一种基于有限状态自动机的远程命令识别与解析方法,智能终端可以根据工作状态自动机模型对远程命令进行快速、准确地响应,避免了复杂的计算和繁琐的决策过程.实验发现,水下探测智能终端及时识别出控制台发送的管控指令,按要求转入相应的工作状态,该方法有效地提高了水下探测智能终端机的工作性能.%It is a fundamental and key problem of how to correctly recognize and translate remote commands for terminal-console and upper-low computer to conduct remote management and control in embedded systems.By carefully analyzing working procedure of underwater probe terminal in this paper,an effective method based on finite state machine model is put forward,which can quickly and accurately respond the remote instructions,and avoid complicated computation and decision.Applied practices show the underwater probe intelligent terminal can understand kinds of management-control instructions sent from console machine,and properly tune to corresponding working mode,which can greatly improve the performance of underwater probe intelligent terminal.

  8. Research on joint Chinese-Japanese word segmentation for phrase-based statistical machine translation%面向短语统计机器翻译的汉日联合分词研究

    Institute of Scientific and Technical Information of China (English)

    吴培昊; 徐金安; 张玉洁

    2015-01-01

    Unknown words and word segmentation granularity are two main problems for Chinese-Japanese machine translation. Word segmentation is the first important step for Chinese and Japanese natural language processing. As Chi-nese and Japanese word segmentation is processed with different tagging system and semantic performance, the granularity of word segmentation results should be readjusted to improve the performance of Statistical Machine Translation(SMT). This paper proposes an approach to adjust the word segmentation granularity for improving the performance of SMT, which combines Hanzi-Kanji comparison table and Japanese-Chinese dictionary. Experimental results express that the pro-posed method could adjust the granularity between Chinese and Japanese effectively and improve the performance of SMT. This paper analyses the experimental results and discusses the effect of joint Chinese-Japanese word segmentation granularity for phrase-based SMT.%未登录词与分词粒度是汉日日汉机器翻译研究的两个主要问题。与英语等西方语言不同,汉语与日语词语间不存在空格,分词为汉日双语处理的重要工作。由于词性标注体系、文法及语义表现上的差异,分词结果的粒度需要进一步调整,以改善统计机器翻译系统的性能。提出了面向统计机器翻译的基于汉日汉字对照表及日汉词典信息的汉语与日语的分词粒度调整方法。实验结果表明,该方法能有效地调节源语言和目标语言端的分词粒度,提高统计机器翻译系统的性能。通过对比实验结果,分析探讨分词粒度对汉日双语统计系统性能的影响。

  9. Museum: Multidimensional web page segment evaluation model

    CERN Document Server

    Kuppusamy, K S

    2012-01-01

    The evaluation of a web page with respect to a query is a vital task in the web information retrieval domain. This paper proposes the evaluation of a web page as a bottom-up process from the segment level to the page level. A model for evaluating the relevancy is proposed incorporating six different dimensions. An algorithm for evaluating the segments of a web page, using the above mentioned six dimensions is proposed. The benefits of fine-granining the evaluation process to the segment level instead of the page level are explored. The proposed model can be incorporated for various tasks like web page personalization, result re-ranking, mobile device page rendering etc.

  10. Web Page Watermarking for Tamper-Proof

    Institute of Scientific and Technical Information of China (English)

    2005-01-01

    This paper proposed a watermarking algorithm for tamper-proof of web pages. For a web page, it generates a watermark consisting of a sequence of Space and Tab. The watermark is then embedded into the web page after each word and each line. When a watermarked web page is tampered, the extracted watermark can detect and locate the modifications to the web page. Besides, the framework of watermarked Web Server system was given.Compared with traditional digital signature methods, this watermarking method is more transparent in that there is no necessary to detach the watermark before displaying web pages. The experimental results show that the proposed scheme is an effective tool for tamper-proof of web pages.

  11. Hidden Page WebCrawler Model for Secure Web Pages

    Directory of Open Access Journals (Sweden)

    K. F. Bharati

    2013-03-01

    Full Text Available The traditional search engines available over the internet are dynamic in searching the relevant content over the web. The search engine has got some constraints like getting the data asked from a varied source, where the data relevancy is exceptional. The web crawlers are designed only to more towards a specific path of the web and are restricted in moving towards a different path as they are secured or at times restricted due to the apprehension of threats. It is possible to design a web crawler that will have the capability of penetrating through the paths of the web, not reachable by the traditional web crawlers, in order to get a better solution in terms of data, time and relevancy for the given search query. The paper makes use of a newer parser and indexer for coming out with a novel idea of web crawler and a framework to support it. The proposed web crawler is designed to attend Hyper Text Transfer Protocol Secure (HTTPS based websites and web pages that needs authentication to view and index. User has to fill a search form and his/her creditionals will be used by the web crawler to attend secure web server for authentication. Once it is indexed the secure web server will be inside the web crawler’s accessible zone

  12. Categorization of web pages - Performance enhancement to search engine

    Digital Repository Service at National Institute of Oceanography (India)

    Lakshminarayana, S.

    are the major areas of research in IR and strive to improve the effectiveness of interactive IR and can be used as performance evaluation tool. The classification studies at early stages were with strong human interaction than machine learning. The term... and the location of the link. In the absence such works, the spider/worm either moves to the next page available at the least time or by network selection. This classification serves in judgment of traversal of web spider/worm and minimization. Such processes...

  13. Translational Creativity

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2010-01-01

    A long-established approach to legal translation focuses on terminological equivalence making translators strictly follow the words of source texts. Recent research suggests that there is room for some creativity allowing translators to deviate from the source texts. However, little attention...... is given to genre conventions in source texts and the ways in which they can best be translated. I propose that translators of statutes with an informative function in expert-to-expert communication may be allowed limited translational creativity when translating specific types of genre convention....... This creativity is a result of translators adopting either a source-language or a target-language oriented strategy and is limited by the pragmatic principle of co-operation. Examples of translation options are provided illustrating the different results in target texts. The use of a target-language oriented...

  14. Mimicked Web Page Detection over Internet

    Directory of Open Access Journals (Sweden)

    Y. Narasimha Rao

    2014-01-01

    Full Text Available Phishing is process of steeling valuable information such as ATM pins, Credit card details over internet. Where the attacker creates mimicked web pages from the legitimate web pages to fool users. In this paper, we propose an effective anti-phishing solution, by combining image based visual similarity based approach to detect plagiarized web pages. We used effective algorithms for our detection mechanism, speeded up Robust Features (SURF algorithm in order to generate signature based on extracting stable key points from the screen shot of the web page. When a legitimate web page is registered with our system, this algorithm applied on that web page in order to generate signatures, and these signatures are stored in the database for our trained system. When there is a suspected web page, this algorithm is applied to generate both the signatures of the suspected page and is verified against our database of corresponding legitimate web pages. Our results verified that our proposed system is very effective to detect the mimicked web pages with minimal false positives

  15. English Translation of Chinese Personal Names

    Institute of Scientific and Technical Information of China (English)

    徐洁

    2008-01-01

    This paper talks about the translation of personal names from Chinese into English, which is complicated by different factors, including orthographic, phonetic, geographic and social ones. To translate personal names appropriately, a wide range of knowledge is required. Although translation machine is more often used nowadays, it cannot take place of person in the end. Several types of name translation will be talked about in this paper.

  16. The Temple Translator’s Workstation Project

    Science.gov (United States)

    1996-01-01

    allows both translator/analysts and monolingual analysts to use the machine- translation function for assessing the relevance of a translated...using a multilingual editor (Figure2). Source documents and their translations are managed using the Tipster Document Manager developed at CRL...an English morphological generator [Penman 88]. • A multilingual document editor (the Tipster Editor for Documents developed at CRL under the

  17. Identifying Information Senders of Web Pages

    Science.gov (United States)

    Kato, Yoshikiyo; Kawahara, Daisuke; Inui, Kentaro; Kurohashi, Sadao; Shibata, Tomohide

    The source of information is one of the crucial elements when judging the credibility of the information. On the current Web, however, the information about the source is not readily available to the users. In this paper, we formulate the problem of identifying the information source as the problem of identifying the information sender configuration (ISC) of a Web page. An information sender of a Web page is an entity which is involved in the publication of the information on the page. An information sender configuration of a Web page describes the information senders of the page and the relationship among them. Information sender identification is a sub-problem of identifying ISC, and we present a method for extracting information senders from Web pages, along with its evaluation. ISC provides a basis for deeper analysis of information on the Web.

  18. Web Page Categorization Using Artificial Neural Networks

    CERN Document Server

    Kamruzzaman, S M

    2010-01-01

    Web page categorization is one of the challenging tasks in the world of ever increasing web technologies. There are many ways of categorization of web pages based on different approach and features. This paper proposes a new dimension in the way of categorization of web pages using artificial neural network (ANN) through extracting the features automatically. Here eight major categories of web pages have been selected for categorization; these are business & economy, education, government, entertainment, sports, news & media, job search, and science. The whole process of the proposed system is done in three successive stages. In the first stage, the features are automatically extracted through analyzing the source of the web pages. The second stage includes fixing the input values of the neural network; all the values remain between 0 and 1. The variations in those values affect the output. Finally the third stage determines the class of a certain web page out of eight predefined classes. This stage i...

  19. Realistic page-turning of electronic books

    Science.gov (United States)

    Fan, Chaoran; Li, Haisheng; Bai, Yannan

    2014-01-01

    The booming electronic books (e-books), as an extension to the paper book, are popular with readers. Recently, many efforts are put into the realistic page-turning simulation o f e-book to improve its reading experience. This paper presents a new 3D page-turning simulation approach, which employs piecewise time-dependent cylindrical surfaces to describe the turning page and constructs smooth transition method between time-dependent cylinders. The page-turning animation is produced by sequentially mapping the turning page into the cylinders with different radii and positions. Compared to the previous approaches, our method is able to imitate various effects efficiently and obtains more natural animation of turning page.

  20. Semantic Vector Machines

    CERN Document Server

    Vincent, Etter

    2011-01-01

    We first present our work in machine translation, during which we used aligned sentences to train a neural network to embed n-grams of different languages into an $d$-dimensional space, such that n-grams that are the translation of each other are close with respect to some metric. Good n-grams to n-grams translation results were achieved, but full sentences translation is still problematic. We realized that learning semantics of sentences and documents was the key for solving a lot of natural language processing problems, and thus moved to the second part of our work: sentence compression. We introduce a flexible neural network architecture for learning embeddings of words and sentences that extract their semantics, propose an efficient implementation in the Torch framework and present embedding results comparable to the ones obtained with classical neural language models, while being more powerful.

  1. Machine Learning and Radiology

    Science.gov (United States)

    Wang, Shijun; Summers, Ronald M.

    2012-01-01

    In this paper, we give a short introduction to machine learning and survey its applications in radiology. We focused on six categories of applications in radiology: medical image segmentation, registration, computer aided detection and diagnosis, brain function or activity analysis and neurological disease diagnosis from fMR images, content-based image retrieval systems for CT or MRI images, and text analysis of radiology reports using natural language processing (NLP) and natural language understanding (NLU). This survey shows that machine learning plays a key role in many radiology applications. Machine learning identifies complex patterns automatically and helps radiologists make intelligent decisions on radiology data such as conventional radiographs, CT, MRI, and PET images and radiology reports. In many applications, the performance of machine learning-based automatic detection and diagnosis systems has shown to be comparable to that of a well-trained and experienced radiologist. Technology development in machine learning and radiology will benefit from each other in the long run. Key contributions and common characteristics of machine learning techniques in radiology are discussed. We also discuss the problem of translating machine learning applications to the radiology clinical setting, including advantages and potential barriers. PMID:22465077

  2. Deriving Dynamics of Web Pages: A Survey

    OpenAIRE

    Oita, Marilena; Senellart, Pierre

    2011-01-01

    International audience; The World Wide Web is dynamic by nature: content is continuously added, deleted, or changed, which makes it challenging for Web crawlers to keep up-to-date with the current version of a Web page, all the more so since not all apparent changes are significant ones. We review major approaches to change detection in Web pages and extraction of temporal properties (especially, timestamps) of Web pages. We focus our attention on techniques and systems that have been propose...

  3. Optimization of web pages for search engines

    OpenAIRE

    Harej, Anže

    2011-01-01

    The thesis describes the most important elements of a Web Page and outside factors that affect Search Engine Optimization. The basic structure of a Web page, structure and functionality of a modern Search Engine is described at the beginning. The first section deals with the start of Search Engine Optimization, including planning, analysis of web space and the selection of the most important keywords for which the site will be optimized. The next section Web Page Optimization describes...

  4. Translation Nation

    Institute of Scientific and Technical Information of China (English)

    2008-01-01

    The International Federation of Translators will hold its largest ever world congress in China on the eve of 2008 Olympic Games china’ s position as a powerhouse of the translation industry is to be cemented,

  5. Translating Europe

    Directory of Open Access Journals (Sweden)

    Yves Chevrel

    2007-07-01

    Europe thinks in many languages and Europe is a land of translation. Translation is a means of transmitting culture, a means of making it available to others and an invitation to share. It is a cement which binds Europe together.

  6. teaching translation

    Directory of Open Access Journals (Sweden)

    Sergio Bolaños Cuéllar

    2007-01-01

    Full Text Available The advance in cultural-oriented perspectives in Translation Studies has sometimes played down the text linguistic nature of translation. A pilot study in teaching translation was carried out to make students aware of the text linguistic character of translating and help them to improve their translation skills, particularly with an emphasis on self-awareness and self-correcting strategies. The theoretical background is provided by the Dynamic Translation Model (2004, 2005 proposed by the author, with relevant and important contributions taken from Genette’s (1982 transtextuality phenomena (hypertext, hypotext, metatext, paratext, intertext and House and Kasper’s (1981 pragmatic modality markers (downgraders, upgraders. The key conceptual role of equivalence as a defining feature of translation is also dealt with. The textual relationship between Source Language Text (slt is deemed to be pivotal for performing translation and correction tasks in the classroom. Finally, results of the pilot study are discussed and some conclusions are drawn.

  7. Library links on medical school home pages.

    Science.gov (United States)

    Thomas, Sheila L

    2011-01-01

    The purpose of this study was to assess the websites of American Association of Medical Colleges (AAMC)-member medical schools for the presence of library links. Sixty-one percent (n = 92) of home pages of the 150 member schools of the AAMC contain library links. For the 58 home pages not offering such links, 50 provided a pathway of two or three clicks to a library link. The absence of library links on 39% of AAMC medical school home pages indicates that the designers of those pages did not consider the library to be a primary destination for their visitors.

  8. Literal Translation and Free Translation

    Institute of Scientific and Technical Information of China (English)

    彭佼

    2011-01-01

    @@ Dispute over the method of literal translation and that of free translation had a long history in China, in East Jin Dynasty Daoan(道安314-385),a well-known monk, was the representative of those who firmly advocated literal translation.Since he feared that free translation might not be true to the original, he advocated strict literal translation so as to preserve the true features.Works under his direction were typical of word-for-word translation, in which no alteration was made except accidental changes in word order.

  9. Clustering of Deep WebPages: A Comparative Study

    Directory of Open Access Journals (Sweden)

    Muhunthaadithya C

    2015-10-01

    Full Text Available The internethas massive amount of information. This information is stored in the form of zillions of webpages. The information that can be retrieved by search engines is huge, and this information constitutes the ‘surface web’.But the remaining information, which is not indexed by search engines – the ‘deep web’, is much bigger in size than the ‘surface web’, and remains unexploited yet. Several machine learning techniques have been commonly employed to access deep web content. Under machine learning, topic models provide a simple way to analyze large volumes of unlabeled text. A ‘topic’is a cluster of words that frequently occur together and topic models can connect words with similar meanings and distinguish between words with multiple meanings. In this paper, we cluster deep web databases employing several methods, and then perform a comparative study. In the first method, we apply Latent Semantic Analysis (LSA over the dataset. In the second method, we use a generative probabilistic model called Latent Dirichlet Allocation(LDA for modeling content representative of deep web databases.Both these techniques are implemented after preprocessing the set of web pages to extract page contents and form contents.Further, we propose another version of Latent Dirichlet Allocation (LDA to the dataset. Experimental results show that the proposed method outperforms the existing clustering methods.

  10. Arabic web pages clustering and annotation using semantic class features

    Directory of Open Access Journals (Sweden)

    Hanan M. Alghamdi

    2014-12-01

    Full Text Available To effectively manage the great amount of data on Arabic web pages and to enable the classification of relevant information are very important research problems. Studies on sentiment text mining have been very limited in the Arabic language because they need to involve deep semantic processing. Therefore, in this paper, we aim to retrieve machine-understandable data with the help of a Web content mining technique to detect covert knowledge within these data. We propose an approach to achieve clustering with semantic similarities. This approach comprises integrating k-means document clustering with semantic feature extraction and document vectorization to group Arabic web pages according to semantic similarities and then show the semantic annotation. The document vectorization helps to transform text documents into a semantic class probability distribution or semantic class density. To reach semantic similarities, the approach extracts the semantic class features and integrates them into the similarity weighting schema. The quality of the clustering result has evaluated the use of the purity and the mean intra-cluster distance (MICD evaluation measures. We have evaluated the proposed approach on a set of common Arabic news web pages. We have acquired favorable clustering results that are effective in minimizing the MICD, expanding the purity and lowering the runtime.

  11. Manually Classified Errors in Czech-Slovak Translation

    OpenAIRE

    Galuščáková, Petra; Bojar, Ondřej

    2012-01-01

    Outputs of five Czech-Slovak machine translation systems (Česílko, Česílko 2, Google Translate and Moses with different settings) for first 50 sentences of WMT 2010 testing set. The translations were manually processed and the errors were marked and classified according to the scheme by Vilar et al. (David Vilar, Jia Xu, Luis Fernando D’Haro, Hermann Ney: Error Analysis of Statistical Machine Translation Output, Proceedings of LREC-2006, 2006)

  12. Handling Translation Divergences in Generation-Heavy Hybrid Machine Translation

    Science.gov (United States)

    2002-03-01

    currently valid OMB control number. 1. REPORT DATE MAR 2002 2. REPORT TYPE 3. DATES COVERED 00-03-2002 to 00-03-2002 4. TITLE AND SUBTITLE...n$<a>@?N+A< g?CE(CBIg HT?CN+`

  13. Understanding and Writing G & M Code for CNC Machines

    Science.gov (United States)

    Loveland, Thomas

    2012-01-01

    In modern CAD and CAM manufacturing companies, engineers design parts for machines and consumable goods. Many of these parts are cut on CNC machines. Whether using a CNC lathe, milling machine, or router, the ideas and designs of engineers must be translated into a machine-readable form called G & M Code that can be used to cut parts to precise…

  14. Understanding and Writing G & M Code for CNC Machines

    Science.gov (United States)

    Loveland, Thomas

    2012-01-01

    In modern CAD and CAM manufacturing companies, engineers design parts for machines and consumable goods. Many of these parts are cut on CNC machines. Whether using a CNC lathe, milling machine, or router, the ideas and designs of engineers must be translated into a machine-readable form called G & M Code that can be used to cut parts to precise…

  15. 40 CFR 1502.7 - Page limits.

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 32 2010-07-01 2010-07-01 false Page limits. 1502.7 Section 1502.7 Protection of Environment COUNCIL ON ENVIRONMENTAL QUALITY ENVIRONMENTAL IMPACT STATEMENT § 1502.7 Page limits. The text of final environmental impact statements (e.g., paragraphs (d) through (g) of §...

  16. Textual Article Clustering in Newspaper Pages

    NARCIS (Netherlands)

    Aiello, Marco; Pegoretti, Andrea

    2006-01-01

    In the analysis of a newspaper page an important step is the clustering of various text blocks into logical units, i.e., into articles. We propose three algorithms based on text processing techniques to cluster articles in newspaper pages. Based on the complexity of the three algorithms and

  17. Textual Article Clustering in Newspaper Pages

    NARCIS (Netherlands)

    Aiello, Marco; Pegoretti, Andrea

    2004-01-01

    In the analysis of a newspaper page an important step is the clustering of various text blocks into logical units, i.e., into articles. We propose three algorithms based on text processing techniques to cluster articles in newspaper pages. Based on the complexity of the three algorithms and

  18. Customisation of Indico pages - Layout and Menus

    CERN Document Server

    CERN. Geneva; Ferreira, Pedro

    2017-01-01

    In this tutorial you are going to learn how to customize the layout of your Indico pages (for example you can change the color of the background images or change the logo) and the menus on your Indico pages  (for example you can add or hide certain blocks, or change their name and order).  

  19. Textual Article Clustering in Newspaper Pages

    NARCIS (Netherlands)

    Aiello, Marco; Pegoretti, Andrea

    2006-01-01

    In the analysis of a newspaper page an important step is the clustering of various text blocks into logical units, i.e., into articles. We propose three algorithms based on text processing techniques to cluster articles in newspaper pages. Based on the complexity of the three algorithms and experime

  20. Textual Article Clustering in Newspaper Pages

    NARCIS (Netherlands)

    Aiello, Marco; Pegoretti, Andrea

    2004-01-01

    In the analysis of a newspaper page an important step is the clustering of various text blocks into logical units, i.e., into articles. We propose three algorithms based on text processing techniques to cluster articles in newspaper pages. Based on the complexity of the three algorithms and experime

  1. Minimal Guidelines for Authors of Web Pages.

    Science.gov (United States)

    ADE Bulletin, 2002

    2002-01-01

    Presents guidelines that recommend the minimal reference information that should be provided on Web pages intended for use by students, teachers, and scholars in the modern languages. Suggests the inclusion of information about responsible parties, copyright declaration, privacy statements, and site information. Makes a note on Web page style. (SG)

  2. Web page classification on child suitability

    NARCIS (Netherlands)

    Eickhoff, C.; Serdyukov, P.; Vries, A.P. de

    2010-01-01

    Children spend significant amounts of time on the Internet. Recent studies showed, that during these periods they are often not under adult supervision. This work presents an automatic approach to identifying suitable web pages for children based on topical and non-topical web page aspects. We discu

  3. Veracity in in vitro fertilization Web pages.

    Science.gov (United States)

    Cowan, Bryan D

    2005-03-01

    Huang et al. described compliance of IVF websites against the American Medical Association online health information guidelines and reported that IVF websites scored poorly. We describe a protocol for IVF websites that would inform readers about truthfulness of the page, develop standards for page construction, and establish a review process.

  4. CERN Web Pages Receive a Makeover

    CERN Multimedia

    2001-01-01

    Asudden allergic reaction to the colour turquoise? Never fear, from Monday 2 April you'll be able to click in the pink box at the top of the CERN users' welcome page to go to the all-new welcome page, which is simpler and better organized. CERN's new-look intranet is the first step in a complete Web-makeover being applied by the Web Public Education (WPE) group of ETT Division. The transition will be progressive, to allow users to familiarize themselves with the new pages. Until 17 April, CERN users will still get the familiar turquoise welcome page by default, with the new pages operating in parallel. From then on, the default will switch to the new pages, with the old ones being finally switched off on 25 May. Some 400 pages have received the makeover treatment. For more information about the changes to your Web, take a look at: http://www.cern.ch/CERN/NewUserPages/ Happy surfing!

  5. Comparing classical and quantum PageRanks

    CERN Document Server

    Loke, T; Rodriguez, J; Small, M; Wang, J B

    2015-01-01

    Following recent developments in quantum PageRanking, we present a comparative analysis of discrete-time and continuous-time quantum-walk-based PageRank algorithms. For the discrete-time case, we introduce an alternative PageRank measure based on the maximum probabilities achieved by the walker on the nodes. We demonstrate that the required time of evolution does not scale significantly with increasing network size. We affirm that all three quantum PageRank measures considered here distinguish clearly between outerplanar hierarchical, scale-free, and Erd\\"os-R\\'enyi network types. Relative to classical PageRank and to different extents, the quantum measures better highlight secondary hubs and resolve ranking degeneracy among peripheral nodes for the networks we studied in this paper.

  6. South African sign language assistive translation

    CSIR Research Space (South Africa)

    Olivrin, GJ

    2008-04-01

    Full Text Available The authors describe a novel approach and practical solution to an interactive sign language machine translation system from English to South African Sign Language. They interface the system with the GNApp application, which is an augmentative...

  7. The effect of new links on Google PageRank

    NARCIS (Netherlands)

    Avrachenkov, Konstatin; Litvak, Nelly

    2004-01-01

    PageRank is one of the principle criteria according to which Google ranks Web pages. PageRank can be interpreted as a frequency of visiting a Web page by a random surfer and thus it reflects the popularity of a Web page. We study the effect of newly created links on Google PageRank. We discuss to wh

  8. Grammatical Metaphor, Controlled Languageand Machine Translation

    DEFF Research Database (Denmark)

    Møller, Margrethe

    2003-01-01

    , syntactic and pragmatic restrictions on the writing style of the author. We will focus on syntactic restrictions and investigate whether a writing style in which various forms of grammatical metaphors have been dissolved in order to make the text more accessible to human readers, will in fact make the texts...... more suitable for MT. The basis of our investigation is a small corpus of English LSP texts that have been evaluated regarding their accessibility and acceptability by human users. The MT-system we will introduce as an additional "user" of the texts is the English-Danish prototype of the Comprendium MT-system....

  9. Using Linguistic Knowledge in Statistical Machine Translation

    Science.gov (United States)

    2010-09-01

    48 2.4 Conjugation of verb ktb ’to write ’. The common verb stem for each of the two...is shared throughout the Arabic speaking region, with little local variation. It is almost always used for writing , to the exclusion of the spoken...connected to the adjacent letters. Arabic has three vowel (a, u, i). Both vowel and consonant duration are phonemic. Short vowels and consonant doubling are

  10. UNITRAN: An Interlingual Machine Translation System.

    Science.gov (United States)

    1987-12-01

    modification in the interlingual approach as embodied by CETA and Sharp. ’Grenoble Universitv, France , 1961. 4 4 English Spanish French Japanese Parameter...comer: Yo como el pan. gustar: El libro me gusta a mi. eat: I eat the bread. like: I like the book. Figure 9: There is no thematic divergence between

  11. A Survey of Statistical Machine Translation

    Science.gov (United States)

    2007-04-01

    are multilingual governments and news agencies and companies operating in the global marketplace. The internet enables them to easily publish...SMT research. Because they are the product of day-to-day 1 human activities, they are constantly growing. Multilingual governments interested in...for one language. 5 The monolingual syntax will look much like our example fragment CFG (Rules C1-C8). In order to use this monolingual syntax in an

  12. Machine translation its scope and limits

    CERN Document Server

    Wilks, Yorick

    2008-01-01

    Author won Antonio Zampolli prize for outstanding contributions to the language technology Comprehensive look at MT technology from the perspective of a participant in its history and developmentDetails the deep differences and struggles between rival groups on how best to do MTConsiders MT as a fundamental part of Artificial Intelligence and the ultimate test-bed for all computational linguistics 

  13. Dependency Structures for Statistical Machine Translation

    Science.gov (United States)

    Bach, Nguyen

    2012-01-01

    Dependency structures represent a sentence as a set of dependency relations. Normally the dependency structures from a tree connect all the words in a sentence. One of the most defining characters of dependency structures is the ability to bring long distance dependency between words to local dependency structures. Another the main attraction of…

  14. Toward Determining the Comprehensibility of Machine Translations

    Science.gov (United States)

    2012-01-01

    responses to a stimulus (Macmillan and Creelman , 1991). It has been applied in areas such as lie detection (truth/lie), inspection (ac- ceptable...1-1/(2N) (Macmil- lan and Creelman , 1991). Negative values, which usually indicate response confusion, were eliminated. The results of...Macmillan, Neil and C. Douglas Creelman . (1991). Detection theory: A User’s guide. Cambridge Univer- sity Press, pp. 10 &125. Marchant

  15. Machine on Trial

    Science.gov (United States)

    2012-06-01

    front of the Judge, how did we come to justify the ethical use of such a machine? The Judge called for a recess so that he could further educate ...commerce, where money would change hands, or for the purposes of controlling critical systems.”29 The initial group that set up the ARPANET did not... EBook of Fundamental Principals of the Metaphysic of Morals, May, 2004 [ EBook #5682 translated by Thomas Kingsmill Abbott, 12. http://manybooks.net

  16. The changing pages of comics : Page layouts across eight decades of American superhero comics

    NARCIS (Netherlands)

    Pederson, Kaitlin; Cohn, Neil

    2016-01-01

    Page layouts are one of the most overt features of comics’ structure. We hypothesized that American superhero comics have changed in their page layout over eight decades, and investigated this using a corpus analysis of 40 comics from 1940 through 2014. On the whole, we found that comics pages decre

  17. The changing pages of comics : Page layouts across eight decades of American superhero comics

    NARCIS (Netherlands)

    Pederson, Kaitlin; Cohn, Neil

    2016-01-01

    Page layouts are one of the most overt features of comics’ structure. We hypothesized that American superhero comics have changed in their page layout over eight decades, and investigated this using a corpus analysis of 40 comics from 1940 through 2014. On the whole, we found that comics pages

  18. Sustainable machining

    CERN Document Server

    2017-01-01

    This book provides an overview on current sustainable machining. Its chapters cover the concept in economic, social and environmental dimensions. It provides the reader with proper ways to handle several pollutants produced during the machining process. The book is useful on both undergraduate and postgraduate levels and it is of interest to all those working with manufacturing and machining technology.

  19. 一种改进的PageRank算法%An Improved PageRank Algorithm

    Institute of Scientific and Technical Information of China (English)

    王钟斐

    2011-01-01

    Aiming at the problems of Topic-drift and emphasizing on old web pages for PageRank algorithm, combined with anchor texts similarity and timing feedback factor, this article presents an improved algorithm STPR, and analyzs STPR algorithm by experiment. First this article compares the traditional PageRank algorithm and adding anchor text similarity PageRank algorithm, results show that by adding anchor text similarity PageRank algorithm helps to reduce the occurrence of the phenomenon of topic-drift. Second this article compares adding anchor text similarity PageRank algorithm and STPR algorithm, results show that STPR algorithm not only reduce topic-drift phenomenon, but also make up PageRank value for new web pages.%针对PageRank算法存在主题漂移以及偏重旧网页的问题,结合锚文本相似度和时间反馈因子提出了一种PageRank改进算法STPR,并对STPR算法进行实验分析.先比较了传统PageRank算法与加入锚文本相似度的PageRank算法,结果表明加入锚文本相似度的PageRank算法有利于减少主题漂移现象的发生;其次比较了加入锚文本相似度的PageRank算法与STPR算法,结果表明STPR算法不但减少了主题漂移现象,而且还弥补了新网页的PageRank值.

  20. The matrix method to calculate page rank

    Directory of Open Access Journals (Sweden)

    H. Barboucha, M. Nasri

    2014-06-01

    Full Text Available Choosing the right keywords is relatively easy, whereas getting a high PageRank is more complicated. The index Page Rank is what defines the position in the result pages of search engines (for Google of course, but the other engines are now using more or less the same kind of algorithm. It is therefore very important to understand how this type of algorithm functions to hope to appear on the first page of results (the only page read in 95 % of cases or at least be among the first. We propose in this paper to clarify the operation of this algorithm using a matrix method and a JavaScript program enabling to experience this type of analysis. It is of course a simplified version, but it can add value to the website and achieve a high ranking in the search results and reach a larger customer base. The interest is to disclose an algorithm to calculate the relevance of each page. This is in fact a mathematical algorithm based on a web graph. This graph is formed of all the web pages that are modeled by nodes, and hyperlinks that are modeled by arcs.

  1. Flash-Aware Page Replacement Algorithm

    Directory of Open Access Journals (Sweden)

    Guangxia Xu

    2014-01-01

    Full Text Available Due to the limited main memory resource of consumer electronics equipped with NAND flash memory as storage device, an efficient page replacement algorithm called FAPRA is proposed for NAND flash memory in the light of its inherent characteristics. FAPRA introduces an efficient victim page selection scheme taking into account the benefit-to-cost ratio for evicting each victim page candidate and the combined recency and frequency value, as well as the erase count of the block to which each page belongs. Since the dirty victim page often contains clean data that exist in both the main memory and the NAND flash memory based storage device, FAPRA only writes the dirty data within the victim page back to the NAND flash memory based storage device in order to reduce the redundant write operations. We conduct a series of trace-driven simulations and experimental results show that our proposed FAPRA algorithm outperforms the state-of-the-art algorithms in terms of page hit ratio, the number of write operations, runtime, and the degree of wear leveling.

  2. 基于MapReduce的并行PageRank算法实现%Implementation of Parallel PageRank Algoirthm Based on MapReduce

    Institute of Scientific and Technical Information of China (English)

    平宇; 向阳; 张波; 黄寅飞

    2014-01-01

    The emergence of distributed Web crawl largely expands the scale of related Web information. Since PageRank needs to process the topology of entire existed page set, the limitation of CPU, I/O and memory becomes the big issue when it confronts the data in TB or PB level. Aiming at these problems, this paper proposes a parallel PageRank algorithm based on MapReduce. In a certain iteration of algorithm, it processes the files containing the topology of Web page graph by Map function and calculates the pages’ scores by Reduce function. Using the global Web page score as convergence to control iterations and get more precise Web page sorting result. Experimental result shows that the improved algorithm has better clustering performance and faster execution speed on the basis of keeping the overall Web page sorting accuracy of single machine PageRank algorithm.%分布式网络爬虫的广泛应用使得搜索引擎的数据规模呈几何式增长,面对数以 TB 甚至 PB 量级的数据,单机模式下的PageRank 算法由于 CPU、I/O 和内存的开销过大导致效率低下。为此,提出一种基于 MapReduce 框架的并行 PageRank 算法。在算法的一次迭代过程中,利用 Map 函数对网页拓扑信息文件进行解析,使用 Reduce 函数计算网页得分,从而并行化 PageRank 算法的中间迭代过程。通过计算全局网页得分控制迭代次数,得到较精确的网页排序结果。实验结果表明,该算法在保持原有单机PageRank 算法整体网页排序精度的基础上,具有较好的集群性能和较快的执行速度。

  3. Revising Translations

    DEFF Research Database (Denmark)

    Rasmussen, Kirsten Wølch; Schjoldager, Anne

    2011-01-01

    out by specialised revisers, but by staff translators, who revise the work of colleagues and freelancers on an ad hoc basis. Corrections are mostly given in a peer-to-peer fashion, though the work of freelancers and inexperienced in-house translators is often revised in an authoritative (nonnegotiable......) way. Most respondents and interviewees are worried about increasing pressures on the translation market, which, combined with customers’ general lack of understanding of the translation process, mean that systematic, all-encompassing quality assurance is rarely financially viable....

  4. Web Page Recommendation Using Web Mining

    Directory of Open Access Journals (Sweden)

    Modraj Bhavsar

    2014-07-01

    Full Text Available On World Wide Web various kind of content are generated in huge amount, so to give relevant result to user web recommendation become important part of web application. On web different kind of web recommendation are made available to user every day that includes Image, Video, Audio, query suggestion and web page. In this paper we are aiming at providing framework for web page recommendation. 1 First we describe the basics of web mining, types of web mining. 2 Details of each web mining technique.3We propose the architecture for the personalized web page recommendation.

  5. Ensemble Enabled Weighted PageRank

    CERN Document Server

    Luo, Dongsheng; Hu, Renjun; Duan, Liang; Ma, Shuai

    2016-01-01

    This paper describes our solution for WSDM Cup 2016. Ranking the query independent importance of scholarly articles is a critical and challenging task, due to the heterogeneity and dynamism of entities involved. Our approach is called Ensemble enabled Weighted PageRank (EWPR). To do this, we first propose Time-Weighted PageRank that extends PageRank by introducing a time decaying factor. We then develop an ensemble method to assemble the authorities of the heterogeneous entities involved in scholarly articles. We finally propose to use external data sources to further improve the ranking accuracy. Our experimental study shows that our EWPR is a good choice for ranking scholarly articles.

  6. Broadening the translational immunology landscape.

    Science.gov (United States)

    Peakman, M

    2012-12-01

    It is just over 5 years since Clinical and Experimental Immunology came under the direction of a new team of Editors and made a concerted effort to refresh its approach to promoting clinical and applied immunology through its pages. There were two major objectives: to foster papers in a field which, at the time, we loosely termed 'translational immunology'; and to create a forum for the presentation and discussion of immunology that is relevant to clinicians operating in this space. So, how are we doing with these endeavours? This brief paper aims to summarize some of the key learning points and successes and highlight areas in which translational gaps remain.

  7. Generating Best Features for Web Page Classification

    Directory of Open Access Journals (Sweden)

    K. Selvakuberan

    2008-03-01

    Full Text Available As the Internet provides millions of web pages for each and every search term, getting interesting and required results quickly from the Web becomes very difficult. Automatic classification of web pages into relevant categories is the current research topic which helps the search engine to get relevant results. As the web pages contain many irrelevant, infrequent and stop words that reduce the performance of the classifier, extracting or selecting representative features from the web page is an essential pre-processing step. The goal of this paper is to find minimum number of highly qualitative features by integrating feature selection techniques. We conducted experiments with various numbers of features selected by different feature selection algorithms on a well defined initial set of features and show that cfssubset evaluator combined with term frequency method gives minimal qualitative features enough to attain considerable classification accuracy.

  8. Universal Emergence of PageRank

    CERN Document Server

    Frahm, K M; Shepelyansky, D L

    2011-01-01

    The PageRank algorithm enables to rank the nodes of a network through a specific eigenvector of the Google matrix, using a damping parameter $\\alpha \\in ]0,1[$. Using extensive numerical simulations of large web networks, we determine numerically and analytically the universal features of PageRank vector at its emergence when $\\alpha \\rightarrow 1$. The whole network can be divided into a core part and a group of invariant subspaces. For $ \\alpha \\rightarrow 1$ the PageRank converges to a universal power law distribution on the invariant subspaces whose size distribution also follows a universal power law. The convergence of PageRank at $ \\alpha \\rightarrow 1$ is controlled by eigenvalues of the core part of the Google matrix which are exponentially close to unity leading to large relaxation times as for example in spin glasses.

  9. Universal emergence of PageRank

    Energy Technology Data Exchange (ETDEWEB)

    Frahm, K M; Georgeot, B; Shepelyansky, D L, E-mail: frahm@irsamc.ups-tlse.fr, E-mail: georgeot@irsamc.ups-tlse.fr, E-mail: dima@irsamc.ups-tlse.fr [Laboratoire de Physique Theorique du CNRS, IRSAMC, Universite de Toulouse, UPS, 31062 Toulouse (France)

    2011-11-18

    The PageRank algorithm enables us to rank the nodes of a network through a specific eigenvector of the Google matrix, using a damping parameter {alpha} Element-Of ]0, 1[. Using extensive numerical simulations of large web networks, with a special accent on British University networks, we determine numerically and analytically the universal features of the PageRank vector at its emergence when {alpha} {yields} 1. The whole network can be divided into a core part and a group of invariant subspaces. For {alpha} {yields} 1, PageRank converges to a universal power-law distribution on the invariant subspaces whose size distribution also follows a universal power law. The convergence of PageRank at {alpha} {yields} 1 is controlled by eigenvalues of the core part of the Google matrix, which are extremely close to unity, leading to large relaxation times as, for example, in spin glasses. (paper)

  10. Best Practices for Searchable Collection Pages

    Science.gov (United States)

    Searchable Collection pages are stand-alone documents that do not have any web area navigation. They should not recreate existing content on other sites and should be tagged with quality metadata and taxonomy terms.

  11. Revising Translations

    DEFF Research Database (Denmark)

    Rasmussen, Kirsten Wølch; Schjoldager, Anne

    2011-01-01

    out by specialised revisers, but by staff translators, who revise the work of colleagues and freelancers on an ad hoc basis. Corrections are mostly given in a peer-to-peer fashion, though the work of freelancers and inexperienced in-house translators is often revised in an authoritative (nonnegotiable...

  12. A Web Page Summarization for Mobile Phones

    Science.gov (United States)

    Hasegawa, Takaaki; Nishikawa, Hitoshi; Imamura, Kenji; Kikui, Gen'ichiro; Okumur, Manabu

    Recently, web pages for mobile devices are widely spread on the Internet and a lot of people can access web pages through search engines by mobile devices as well as personal computers. A summary of a retrieved web page is important because the people judge whether or not the page would be relevant to their information need according to the summary. In particular, the summary must be not only compact but also grammatical and meaningful when the users retrieve information using a mobile phone with a small screen. Most search engines seem to produce a snippet based on the keyword-in-context (KWIC) method. However, this simple method could not generate a refined summary suitable for mobile phones because of low grammaticality and content overlap with the page title. We propose a more suitable method to generate a snippet for mobile devices using sentence extraction and sentence compression methods. First, sentences are biased based on whether they include the query terms from the users or words that are relevant to the queries, as well as whether they do not overlap with the page title based on maximal marginal relevance (MMR). Second, the selected sentences are compressed based on their phrase coverage, which is measured by the scores of words, and their phrase connection probability measured based on the language model, according to the dependency structure converted from the sentence. The experimental results reveal the proposed method outperformed the KWIC method in terms of relevance judgment, grammaticality, non-redundancy and content coverage.

  13. Banner Pages on the New Printing Infrastructure

    CERN Multimedia

    2006-01-01

    Changes to the printing service were announced in CERN Bulletin No. 37-38/2006. In the new infrastructure, the printing of the banner page has been disabled in order to reduce paper consumption. Statistics show that the average print job size is small and the paper savings by not printing the banner page could be up to 20 %. When each printer is moved onto the new infrastructure banner page printing will be disabled. In the case of corridor printers which are shared by several users, the Helpdesk can re-enable banner page printing upon request. We hope ultimately to arrive at a situation where banner page printing is enabled on fewer than 10% of printers registered on the network. You can still print banner pages on printers where it has been centrally disabled by using Linux. Simply add it to your print job on the client side by adding the -o job-sheets option to your lpr command. Detailed documentation is available on each SLC3/4 under the following link: http://localhost:631/sum.html#4_2 Please bea...

  14. Personal and Public Start Pages in a library setting

    NARCIS (Netherlands)

    Kieft-Wondergem, Dorine

    2009-01-01

    Personal and Public Start Pages are web-based resources. With these kind of tools it is possible to make your own free start page. A Start Page allows you to put all your web resources into one page, including blogs, email, podcasts, RSSfeeds. It is possible to share the content of the page with oth

  15. Personal and Public Start Pages in a library setting

    NARCIS (Netherlands)

    Kieft-Wondergem, Dorine

    Personal and Public Start Pages are web-based resources. With these kind of tools it is possible to make your own free start page. A Start Page allows you to put all your web resources into one page, including blogs, email, podcasts, RSSfeeds. It is possible to share the content of the page with

  16. Binary translation using peephole translation rules

    Science.gov (United States)

    Bansal, Sorav; Aiken, Alex

    2010-05-04

    An efficient binary translator uses peephole translation rules to directly translate executable code from one instruction set to another. In a preferred embodiment, the translation rules are generated using superoptimization techniques that enable the translator to automatically learn translation rules for translating code from the source to target instruction set architecture.

  17. Simple machines

    CERN Document Server

    Graybill, George

    2007-01-01

    Just how simple are simple machines? With our ready-to-use resource, they are simple to teach and easy to learn! Chocked full of information and activities, we begin with a look at force, motion and work, and examples of simple machines in daily life are given. With this background, we move on to different kinds of simple machines including: Levers, Inclined Planes, Wedges, Screws, Pulleys, and Wheels and Axles. An exploration of some compound machines follows, such as the can opener. Our resource is a real time-saver as all the reading passages, student activities are provided. Presented in s

  18. Uniform Page Migration Problem in Euclidean Space

    Directory of Open Access Journals (Sweden)

    Amanj Khorramian

    2016-08-01

    Full Text Available The page migration problem in Euclidean space is revisited. In this problem, online requests occur at any location to access a single page located at a server. Every request must be served, and the server has the choice to migrate from its current location to a new location in space. Each service costs the Euclidean distance between the server and request. A migration costs the distance between the former and the new server location, multiplied by the page size. We study the problem in the uniform model, in which the page has size D = 1 . All request locations are not known in advance; however, they are sequentially presented in an online fashion. We design a 2.75 -competitive online algorithm that improves the current best upper bound for the problem with the unit page size. We also provide a lower bound of 2.732 for our algorithm. It was already known that 2.5 is a lower bound for this problem.

  19. Electric machine

    Science.gov (United States)

    El-Refaie, Ayman Mohamed Fawzi [Niskayuna, NY; Reddy, Patel Bhageerath [Madison, WI

    2012-07-17

    An interior permanent magnet electric machine is disclosed. The interior permanent magnet electric machine comprises a rotor comprising a plurality of radially placed magnets each having a proximal end and a distal end, wherein each magnet comprises a plurality of magnetic segments and at least one magnetic segment towards the distal end comprises a high resistivity magnetic material.

  20. Model for Predicting End User Web Page Response Time

    OpenAIRE

    Nagarajan, Sathya Narayanan; Ravikumar, Srijith

    2012-01-01

    Perceived responsiveness of a web page is one of the most important and least understood metrics of web page design, and is critical for attracting and maintaining a large audience. Web pages can be designed to meet performance SLAs early in the product lifecycle if there is a way to predict the apparent responsiveness of a particular page layout. Response time of a web page is largely influenced by page layout and various network characteristics. Since the network characteristics vary widely...

  1. Literal Translation using Google Translate in Translating the Text from French to English in Digital Tourism Brochure Bienvenue Paris

    Directory of Open Access Journals (Sweden)

    Rila Hilma

    2011-04-01

    Full Text Available Translation is basically change of form. The form from which the translation is made will be called the source language and the form into which it is to be changed will be called the receptor language. Translation consists of transferring the meaning of the source language into the receptor language. Translating is not an easy job to do because many things to be considered to do this activity because translation means determining the meaning of a text, then reconstructing this same meaning using the appropriate structure and form in the receptor language. Translation is basically divided by two types of translation, one is literal and the other is idiomatic. Literal translation is really strict to the structure and form then often can not well express the true meaning of source language. Idiomatic translation makes every effort to communicate the meaning of the source language text in the natural forms of the receptor language. Then the most popular translation machine, Google Translate, in this study shows the results of translation which remain odd, unnatural, and nonsensical because the unsuccessful of message delivery, which is notably the typically error of literal translation.

  2. Web Page Recommendation Models Theory and Algorithms

    CERN Document Server

    Gündüz-Ögüdücü, Sule

    2010-01-01

    One of the application areas of data mining is the World Wide Web (WWW or Web), which serves as a huge, widely distributed, global information service for every kind of information such as news, advertisements, consumer information, financial management, education, government, e-commerce, health services, and many other information services. The Web also contains a rich and dynamic collection of hyperlink information, Web page access and usage information, providing sources for data mining. The amount of information on the Web is growing rapidly, as well as the number of Web sites and Web page

  3. Interactive Translation Prediction versus Conventional Post-editing in Practice

    DEFF Research Database (Denmark)

    Sanchis-Trilles, German; Alabau, Vicent; Buck, Christian;

    2014-01-01

    We conducted a field trial in computer-assisted professional translation to compare Interactive Translation Prediction (ITP) against conventional post- editing (PE) of machine translation (MT) output. In contrast to the conventional PE set-up, where an MT system first produces a static translatio...

  4. Page sample size in web accessibility testing: how many pages is enough?

    NARCIS (Netherlands)

    Velleman, Eric; Geest, van der Thea

    2013-01-01

    Various countries and organizations use a different sampling approach and sample size of web pages in accessibility conformance tests. We are conducting a systematic analysis to determine how many pages is enough for testing whether a website is compliant with standard accessibility guidelines. This

  5. Mission Connect Mild TBI Translational Research Consortium

    Science.gov (United States)

    2010-08-31

    TBI Translational Research Consortium Executive Committee Steering Committee Model of Injury Working Group Neuroprotection Working Group Regeneration ...Report, Holcomb Page 22 Specific aim #3.1: To study neuroprotection and enhanced neurological recovery with erythropoietin ( Epo ) and Epo ...derivatives after MTBI. - #3.1.1 To study the effects of Epo and Epo derivatives on neurogenesis, angiogenesis, and outcome after experimental MTBI

  6. Sketch of a Noisy Channel Model for the Translation Process

    DEFF Research Database (Denmark)

    Carl, Michael

    The paper develops a Noisy Channel Model for the translation process that is based on actual user activity data. It builds on the monitor model and makes a distinction between early, automatic and late, conscious translation processes: while early priming processes are at the basis of a "literal ...... of the Center for Research and Innovation in Translation and Translation Technology (CRITT) at IBC....... of International Business Communication. His current research interests are related to the investigation of human translation processes and how advanced computer tools (such as machine translation) can fruitfully complement and support the human (translation) activities. Furthermore, he is the director...

  7. Production Machine Shop Employment Competencies. Part One: Practices and Principles.

    Science.gov (United States)

    Bishart, Gus; Werner, Claire

    Competencies for production machine shop are provided for the first of four topic areas: principles and practice of machine shop. Each competency appears in a one-page format. It is presented as a goal statement followed by one or more "indicator" statements, which are performance objectives describing an ability that, upon attainment,…

  8. A "Living" Machine

    Institute of Scientific and Technical Information of China (English)

    N.R.Bogatyrev

    2004-01-01

    Biomimetics (or bionics) is the engineering discipline that constructs artificial systems using biological principles. The ideal final result in biomimetics is to create a living machine. But what are the desirable and non-desirable properties of biomimetic product? Where can natural prototypes be found? How can technical solutions be transferred from nature to technology? Can we use living nature like LEGO bricks for construction our machines? How can biology help us? What is a living machine? In biomimetic practice only some "part" (organ, part of organ, tissue) of the observed whole organism is utilized. A possible template for future super-organism extension for biomimetic methods might be drawn from experiments in holistic ecological agriculture (ecological design, permaculture, ecological engineering, etc. ). The necessary translation of these rules to practical action can be achieved with the Russian Theory of Inventive Problem Solving (TRIZ), specifically adjusted to biology. Thus, permaculture, reinforced by a TRIZ conceptual framework, might provide the basis for Super-Organismic Bionics, which is hypothesized as necessary for effective ecological engineering. This hypothesis is supported by a case study-the design of a sustainable artificial nature reserve for wild pollinators as a living machine.

  9. The Machine within the Machine

    CERN Multimedia

    Katarina Anthony

    2014-01-01

    Although Virtual Machines are widespread across CERN, you probably won't have heard of them unless you work for an experiment. Virtual machines - known as VMs - allow you to create a separate machine within your own, allowing you to run Linux on your Mac, or Windows on your Linux - whatever combination you need.   Using a CERN Virtual Machine, a Linux analysis software runs on a Macbook. When it comes to LHC data, one of the primary issues collaborations face is the diversity of computing environments among collaborators spread across the world. What if an institute cannot run the analysis software because they use different operating systems? "That's where the CernVM project comes in," says Gerardo Ganis, PH-SFT staff member and leader of the CernVM project. "We were able to respond to experimentalists' concerns by providing a virtual machine package that could be used to run experiment software. This way, no matter what hardware they have ...

  10. DESIGNING AN ENGLISH LEARNING WEB PAGE

    Institute of Scientific and Technical Information of China (English)

    Wu; Xiaozhen

    1999-01-01

    This paper reviews the development in CALL research and the currently acknowledged guide-lines for CALL designing.Following these guidelines,the author designed an English learning Web pageof her own.Target learners,rationale,designing aids,as well as the lesson plan using the Web page,areincluded.

  11. Upgrade of CERN OP Webtools IRRAD Page

    CERN Document Server

    Vik, Magnus Bjerke

    2017-01-01

    CERN Beams Department maintains a website with various tools for the Operations Group, with one of them being specific for the Proton Irradiation Facility (IRRAD). The IRRAD team use the tool to follow up and optimize the operation of the facility. The original version of the tool was difficult to maintain and adding new features to the page was challenging. Thus this summer student project is aimed to upgrade the web page by rewriting the web page with maintainability and flexibility in mind. The new application uses a server--client architecture with a REST API on the back end which is used by the front end to request data for visualization. PHP is used on the back end to implement the API's and Swagger is used to document them. Vue, Semantic UI, Webpack, Node and ECMAScript 5 is used on the fronted to visualize and administrate the data. The result is a new IRRAD operations web application with extended functionality, improved structure and an improved user interface. It includes a new Status Panel page th...

  12. 16 CFR 436.3 - Cover page.

    Science.gov (United States)

    2010-01-01

    ... with a cover page, in the order and form as follows: (a) The title “FRANCHISE DISCLOSURE DOCUMENT” in... begin operation of a franchise is . This includes that must be paid to the franchisor or affiliate. (2) This disclosure document summarizes certain provisions of your franchise agreement and...

  13. Modeling clicks beyond the first result page

    NARCIS (Netherlands)

    Chuklin, A.; Serdyukov, P.; de Rijke, M.

    2013-01-01

    Most modern web search engines yield a list of documents of a fixed length (usually 10) in response to a user query. The next ten search results are usually available in one click. These documents either replace the current result page or are appended to the end. Hence, in order to examine more

  14. Web Mining Using PageRank Algorithm

    Directory of Open Access Journals (Sweden)

    Vignesh. V

    2013-11-01

    Full Text Available Data mining is extracting and automatic discovering the web based information has been used as web mining. It is one of the most universal and a dominant application on the Internet and it becomes increasing in size and search tools that combine the results of multiple search engines are becoming more valuable. But, almost none of these studies deals with genetic relation algorithm (GRA, where GRA is one of the evolutionary methods with graph structure. GRA was designed to both increase the effectiveness of search engine and improve their efficiency. GRA considers the correlation coefficient between stock brands as strength, which indicates the relation between nodes in each individual of GRA. The reduced number of hyperlinks provided by GRA in the final generation consists of only the most similar hyperlinks with respect to the query. But, the end user’s not satisfied fully. To improve the satisfaction of user by using Page rank algorithm to measure the importance of a page and to prioritize pages returned from a GRA. It will reduce the user’s searching time. PageRank algorithm works to allocate rank for filtered links based on number of keyword occurred in the content.

  15. Reconfigurable Full-Page Braille Displays

    Science.gov (United States)

    Garner, H. Douglas

    1994-01-01

    Electrically actuated braille display cells of proposed type arrayed together to form full-page braille displays. Like other braille display cells, these provide changeable patterns of bumps driven by digitally recorded text stored on magnetic tapes or in solid-state electronic memories. Proposed cells contain electrorheological fluid. Viscosity of such fluid increases in strong electrostatic field.

  16. Modeling clicks beyond the first result page

    NARCIS (Netherlands)

    Chuklin, A.; Serdyukov, P.; de Rijke, M.

    2013-01-01

    Most modern web search engines yield a list of documents of a fixed length (usually 10) in response to a user query. The next ten search results are usually available in one click. These documents either replace the current result page or are appended to the end. Hence, in order to examine more docu

  17. Referencing web pages and e-journals.

    Science.gov (United States)

    Bryson, David

    2013-12-01

    One of the areas that can confuse students and authors alike is how to reference web pages and electronic journals (e-journals). The aim of this professional development article is to go back to first principles for referencing and see how with examples these should be referenced.

  18. Accounting Programs' Home Pages: What's Happening.

    Science.gov (United States)

    Peek, Lucia E.; Roxas, Maria L.

    2002-01-01

    Content analysis of 62 accounting programs' websites indicated the following: 53% include mission statements; 62.9% list accreditation; many faculty biographies and personal pages used inconsistent formats; provision of information on financial aid, student organizations, career services, and certified public accountant requirements varied. Many…

  19. Efficient Web Change Monitoring with Page Digest

    Energy Technology Data Exchange (ETDEWEB)

    Buttler, D J; Rocco, D; Liu, L

    2004-02-20

    The Internet and the World Wide Web have enabled a publishing explosion of useful online information, which has produced the unfortunate side effect of information overload: it is increasingly difficult for individuals to keep abreast of fresh information. In this paper we describe an approach for building a system for efficiently monitoring changes to Web documents. This paper has three main contributions. First, we present a coherent framework that captures different characteristics of Web documents. The system uses the Page Digest encoding to provide a comprehensive monitoring system for content, structure, and other interesting properties of Web documents. Second, the Page Digest encoding enables improved performance for individual page monitors through mechanisms such as short-circuit evaluation, linear time algorithms for document and structure similarity, and data size reduction. Finally, we develop a collection of sentinel grouping techniques based on the Page Digest encoding to reduce redundant processing in large-scale monitoring systems by grouping similar monitoring requests together. We examine how effective these techniques are over a wide range of parameters and have seen an order of magnitude speed up over existing Web-based information monitoring systems.

  20. Translating democracy

    DEFF Research Database (Denmark)

    Doerr, Nicole

    2012-01-01

    grassroots activists in social movements use translation as a novel practice to debate political alternatives in the European Union's (EU) multilingual public sphere. In recent years, new cross-European protest movements have created the multilingual discursive democracy arena known as the European Social...... in institutionalized habits and norms of deliberation. Addressing democratic theorists, my findings suggest that translation could be a way to think about difference not as a hindrance but as a resource for democracy in linguistically heterogeneous societies and public spaces, without presupposing a shared language...

  1. Amharic Speech Recognition for Speech Translation

    OpenAIRE

    Melese, Michael; Besacier, Laurent; Meshesha, Million

    2016-01-01

    International audience; The state-of-the-art speech translation can be seen as a cascade of Automatic Speech Recognition, Statistical Machine Translation and Text-To-Speech synthesis. In this study an attempt is made to experiment on Amharic speech recognition for Amharic-English speech translation in tourism domain. Since there is no Amharic speech corpus, we developed a read-speech corpus of 7.43hr in tourism domain. The Amharic speech corpus has been recorded after translating standard Bas...

  2. Machine Learning

    CERN Document Server

    CERN. Geneva

    2017-01-01

    Machine learning, which builds on ideas in computer science, statistics, and optimization, focuses on developing algorithms to identify patterns and regularities in data, and using these learned patterns to make predictions on new observations. Boosted by its industrial and commercial applications, the field of machine learning is quickly evolving and expanding. Recent advances have seen great success in the realms of computer vision, natural language processing, and broadly in data science. Many of these techniques have already been applied in particle physics, for instance for particle identification, detector monitoring, and the optimization of computer resources. Modern machine learning approaches, such as deep learning, are only just beginning to be applied to the analysis of High Energy Physics data to approach more and more complex problems. These classes will review the framework behind machine learning and discuss recent developments in the field.

  3. Predicting Networked Strategic Behavior via Machine Learning and Game Theory

    Science.gov (United States)

    2015-01-13

    Report: Predicting Networked Strategic Behavior via Machine Learning and Game Theory The views, opinions and/or findings contained in this report...2211 machine learning, game theory , microeconomics, behavioral data REPORT DOCUMENTATION PAGE 11. SPONSOR/MONITOR’S REPORT NUMBER(S) 10. SPONSOR...Strategic Behavior via Machine Learning and Game Theory Report Title The funding for this project was used to develop basic models, methodology

  4. The Importance of Prior Probabilities for Entry Page Search

    NARCIS (Netherlands)

    Kraaij, W.; Westerveld, T.H.W.; Hiemstra, D.

    2002-01-01

    An important class of searches on the world-wide-web has the goal to find an entry page (homepage) of an organisation. Entry page search is quite different from Ad Hoc search. Indeed a plain Ad Hoc system performs disappointingly. We explored three non-content features of web pages: page length, num

  5. Implementing database system for LHCb publications page

    CERN Document Server

    Abdullayev, Fakhriddin

    2017-01-01

    The LHCb is one of the main detectors of Large Hadron Collider, where physicists and scientists work together on high precision measurements of matter-antimatter asymmetries and searches for rare and forbidden decays, with the aim of discovering new and unexpected forces. The work does not only consist of analyzing data collected from experiments but also in publishing the results of those analyses. The LHCb publications are gathered on LHCb publications page to maximize their availability to both LHCb members and to the high energy community. In this project a new database system was implemented for LHCb publications page. This will help to improve access to research papers for scientists and better integration with current CERN library website and others.

  6. Personal home pages as an information resource

    Directory of Open Access Journals (Sweden)

    Shant Narsesian

    2004-12-01

    Full Text Available Nowadays, for many people, the World Wide Web (WWW is the first place to go to look something up, to find that bit of information. However, even though people have their favourite sites, and their favourite search engines, they often seem to miss that bit of information. This could very well be because it is hiding on a small, unpopular, enthusiast's Personal Home Page. The author believes that there is more information on the Web than that which one will find on the major, "commercial-style" sites. Hence, this paper looks at the possibility of using Personal Home Pages (PHP as an information resource, not only for the academic, but the web-surfing world in general.

  7. Developing a web page: bringing clinics online.

    Science.gov (United States)

    Peterson, Ronnie; Berns, Susan

    2004-01-01

    Introducing clinical staff education, along with new policies and procedures, to over 50 different clinical sites can be a challenge. As any staff educator will confess, getting people to attend an educational inservice session can be difficult. Clinical staff request training, but no one has time to attend training sessions. Putting the training along with the policies and other information into "neat" concise packages via the computer and over the company's intranet was the way to go. However, how do you bring the clinics online when some of the clinical staff may still be reluctant to turn on their computers for anything other than to gather laboratory results? Developing an easy, fun, and accessible Web page was the answer. This article outlines the development of the first training Web page at the University of Wisconsin Medical Foundation, Madison, WI.

  8. Perspectives on the consecutive pages problem

    Science.gov (United States)

    Srinivasan, V. K.

    2011-04-01

    This article presents different approaches to a problem, dubbed by the author as 'the consecutive pages problem'. The aim of this teaching-oriented article is to promote the teaching of abstract concepts in mathematics, by selecting a challenging amusement problem and then presenting various solutions in such a way that it can engage the attention of a fourth-grade student, a high school senior student, an average college student and scholars.

  9. Evaluating Multilingual Gisting of Web Pages

    CERN Document Server

    Resnik, P

    1997-01-01

    We describe a prototype system for multilingual gisting of Web pages, and present an evaluation methodology based on the notion of gisting as decision support. This evaluation paradigm is straightforward, rigorous, permits fair comparison of alternative approaches, and should easily generalize to evaluation in other situations where the user is faced with decision-making on the basis of information in restricted or alternative form.

  10. Borges and Textual Quality in Translation

    Directory of Open Access Journals (Sweden)

    Walter Carlos Costa

    1996-01-01

    Full Text Available THROUGHOUT HIS LIFE BORGES dealt with translation problems. In hischildhood translation was a daily experience for him since he spokeboth Spanish and English at home. Later on he learnt French and German in Lausane, Switzerland, where he spent three years whenhe was a teenager. Most of his reading was. done in foteign languages, especially in English but also in French, German, and to a lesser extent in Italian and Portuguese. Borges translated little but regularly through his entire literarycareer, which was very long indeed. Among others, he translated into Spanish some pages of Ulysses, Faulkner's Wild Palms, Virginia Woolf 's A Room of One's Own, a selection of Whitman's Leaves ofGrass. Moreover, he translated a great number of quotations of writersand philosophers that he inserted copiously in his writings, includinghis fiction and poems.

  11. A Mobile Agent-based Web Page Constructing Framework MiPage

    Science.gov (United States)

    Fukuta, Naoki; Ozono, Tadachika; Shintani, Toramatsu

    In this paper, we present a programming framework, `MiPage', for realizing intelligent WWW applications based on the mobile agent technology. On the framework, an agent is programmed by using hyper text markup language and logic programming language. To realize the framework, we designed a new logic programming environment `MiLog', and an agent program compiler `MiPage Compiler'. The framework enables us to enhance both richness of the services and manageability of the application.

  12. Coming to Life: A Review of Movie Comics: Page to Screen/Screen to Page

    Directory of Open Access Journals (Sweden)

    Nicolas Labarre

    2017-03-01

    Full Text Available This book review provides an overview of 'Movie Comics: Page to Screen/Screen to Page' by Blair Davis (Rutgers University Press, 2017 a book which examines the reciprocal adaptations of film into comics and comics into films from 1930 to 1960. This review argues that 'Movie Comics' provides a useful and finely-textured cultural history of that phenomenon, which help contextualize scholarly studies of contemporary adaptations and transmedia constructions.

  13. Improving Web Page Readability by Plain Language

    CERN Document Server

    Hussain, Walayat; Ali, Arif

    2011-01-01

    In today's world anybody who wants to access any information the first choice is to use the web because it is the only source to provide easy and instant access to information. However web readers face many hurdles from web which includes load of web pages, text size, finding related information, spelling and grammar etc. However understanding of web pages written in English language creates great problems for non native readers who have basic knowledge of English. In this paper, we propose a plain language for a local language (Urdu) using English alphabets for web pages in Pakistan. For this purpose we developed two websites, one with a normal English fonts and other in a local language text scheme using English alphabets. We also conducted a questionnaire from 40 different users with a different level of English language fluency in Pakistan to gain the evidence of the practicality of our approach. The result shows that the proposed plain language text scheme using English alphabets improved the reading com...

  14. Improving Web Page Readability by Plain Language

    Directory of Open Access Journals (Sweden)

    Walayat Hussain

    2011-05-01

    Full Text Available In todays world anybody who wants to access any information the first choice is to use the web because it is the only source to provide easy and instant access to information. However web readers face many hurdles from web which includes load of web pages, text size, finding related information, spelling and grammar etc. However understanding of web pages written in English language creates great problems for non native readers who have basic knowledge of English. In this paper, we propose a plain language for a local language (Urdu using English alphabets for web pages in Pakistan. For this purpose we developed two websites, one with a normal English fonts and other in a local language text scheme using English alphabets. We also conducted a questionnaire from 40 different users with a different level of English language fluency in Pakistan to gain the evidence of the practicality of our approach. The result shows that the proposed plain language text scheme using English alphabets improved the reading comprehension for non native English speakers in Pakistan.

  15. 新的PageRank优化算法%New PageRank optimization algorithm

    Institute of Scientific and Technical Information of China (English)

    蒋永辉; 吴洪丽

    2012-01-01

    Search engines repeatedly return currently popular pages at the top of search results, popular pages tend to get even more popular, while unpopular pages get ignored by an average user. In order to escape from this problem, an improved ranking function and effective Web user model are employed, and a New PageRank Optimization (NPRO) algorithm is provided. Experimental data show that the provided algorithm can attain unbiased Web ranking.%为了克服PageRank在搜索过程中重复性地把当前受欢迎的网页放在搜索结果的首要位置,而不受欢迎的网页被大多数用户忽略的问题,采用了一种改进的评估函数及有效的用户模型,获得了一个新的PageRank优化算法.实验结果表明,该算法达到了较好的公平性.

  16. Insights into Facebook Pages: an early adolescent health research study page targeted at parents.

    Science.gov (United States)

    Amon, Krestina L; Paxton, Karen; Klineberg, Emily; Riley, Lisa; Hawke, Catherine; Steinbeck, Katharine

    2016-02-01

    Facebook has been used in health research, but there is a lack of literature regarding how Facebook may be used to recruit younger adolescents. A Facebook Page was created for an adolescent cohort study on the effects of puberty hormones on well-being and behaviour in early adolescence. Used as a communication tool with existing participants, it also aimed to alert potential participants to the study. The purpose of this paper is to provide a detailed description of the development of the study Facebook Page and present the fan response to the types of posts made on the Page using the Facebook-generated Insights data. Two types of posts were made on the study Facebook Page. The first type was study-related update posts and events. The second was relevant adolescent and family research and current news posts. Observations on the use of and response to the Page were made over 1 year across three phases (phase 1, very low Facebook use; phase 2, high Facebook use; phase 3, low Facebook use). Most Page fans were female (88.6%), with the largest group of fans aged between 35 and 44 years. Study-related update posts with photographs were the most popular. This paper provides a model on which other researchers could base Facebook communication and potential recruitment in the absence of established guidelines.

  17. Beyond Translation

    DEFF Research Database (Denmark)

    Olwig, Mette Fog

    2013-01-01

    This article contributes to the growing scholarship on local development practitioners by re-examining conceptualizations of practitioners as ‘brokers’ strategically translating between ‘travelling’ (development institution) rationalities and ‘placed’ (recipient area) rationalities in relation...... and practice spurred by new challenges deriving from climate change anxiety, the study shows how local practitioners often make local activities fit into travelling development rationalities as a matter of habit, rather than as a conscious strategy. They may therefore cease to ‘translate’ between different...... rationalities. This is shown to have important implications for theory, research and practice concerning disaster risk reduction and climate change adaptation in which such translation is often expected....

  18. Translational genomics

    Directory of Open Access Journals (Sweden)

    Martin Kussmann

    2014-09-01

    Full Text Available The term “Translational Genomics” reflects both title and mission of this new journal. “Translational” has traditionally been understood as “applied research” or “development”, different from or even opposed to “basic research”. Recent scientific and societal developments have triggered a re-assessment of the connotation that “translational” and “basic” are either/or activities: translational research nowadays aims at feeding the best science into applications and solutions for human society. We therefore argue here basic science to be challenged and leveraged for its relevance to human health and societal benefits. This more recent approach and attitude are catalyzed by four trends or developments: evidence-based solutions; large-scale, high dimensional data; consumer/patient empowerment; and systems-level understanding.

  19. Hodnocení automatického překladu SMT systémů (Google translate, Bing) z francouzštiny do češtiny: kolokace z oblasti "bezpečnost potravin"

    OpenAIRE

    ŠVARCOVÁ, Zora

    2016-01-01

    The aim of this bachelor thesis is to test and evaluate the translation quality of selected terminological collocations related to "food safety" using free available translators (Microsoft Bing Translator, Google Translate). The work is divided into two parts theoretical and practical. Theoretical part is divided into several chapters, which focus on the development, history and use of statistical machine translation, the basic strategies of machine translation systems (rule-based machine tra...

  20. Machine Learning

    Energy Technology Data Exchange (ETDEWEB)

    Chikkagoudar, Satish; Chatterjee, Samrat; Thomas, Dennis G.; Carroll, Thomas E.; Muller, George

    2017-04-21

    The absence of a robust and unified theory of cyber dynamics presents challenges and opportunities for using machine learning based data-driven approaches to further the understanding of the behavior of such complex systems. Analysts can also use machine learning approaches to gain operational insights. In order to be operationally beneficial, cybersecurity machine learning based models need to have the ability to: (1) represent a real-world system, (2) infer system properties, and (3) learn and adapt based on expert knowledge and observations. Probabilistic models and Probabilistic graphical models provide these necessary properties and are further explored in this chapter. Bayesian Networks and Hidden Markov Models are introduced as an example of a widely used data driven classification/modeling strategy.

  1. Applications and modelling of bulk HTSs in brushless ac machines

    Energy Technology Data Exchange (ETDEWEB)

    Barnes, G.J. [Department of Engineering Science, University of Oxford, Parks Road, Oxford OX1 3PJ (United Kingdom). E-mail: gary.barnes at eng.ox.ac.uk; McCulloch, M.D.; Dew-Hughes, D. [Department of Engineering Science, University of Oxford, Parks Road, Oxford OX1 3PJ (United Kingdom)

    2000-06-01

    The use of high temperature superconducting material in its bulk form for engineering applications is attractive due to the large power densities that can be achieved. In brushless electrical machines, there are essentially four properties that can be exploited; their hysteretic nature, their flux shielding properties, their ability to trap large flux densities and their ability to produce levitation. These properties translate to hysteresis machines, reluctance machines, trapped-field synchronous machines and linear motors respectively. Each one of these machines is addressed separately and computer simulations that reveal the current and field distributions within the machines are used to explain their operation. (author)

  2. Applications and modelling of bulk HTSs in brushless ac machines

    Science.gov (United States)

    Barnes, G. J.; McCulloch, M. D.; Dew-Hughes, D.

    2000-06-01

    The use of high temperature superconducting material in its bulk form for engineering applications is attractive due to the large power densities that can be achieved. In brushless electrical machines, there are essentially four properties that can be exploited; their hysteretic nature, their flux shielding properties, their ability to trap large flux densities and their ability to produce levitation. These properties translate to hysteresis machines, reluctance machines, trapped-field synchronous machines and linear motors respectively. Each one of these machines is addressed separately and computer simulations that reveal the current and field distributions within the machines are used to explain their operation.

  3. Model for Predicting End User Web Page Response Time

    CERN Document Server

    Nagarajan, Sathya Narayanan

    2012-01-01

    Perceived responsiveness of a web page is one of the most important and least understood metrics of web page design, and is critical for attracting and maintaining a large audience. Web pages can be designed to meet performance SLAs early in the product lifecycle if there is a way to predict the apparent responsiveness of a particular page layout. Response time of a web page is largely influenced by page layout and various network characteristics. Since the network characteristics vary widely from country to country, accurately modeling and predicting the perceived responsiveness of a web page from the end user's perspective has traditionally proven very difficult. We propose a model for predicting end user web page response time based on web page, network, browser download and browser rendering characteristics. We start by understanding the key parameters that affect perceived response time. We then model each of these parameters individually using experimental tests and statistical techniques. Finally, we d...

  4. Weighted Page Content Rank for Ordering Web Search Result

    Directory of Open Access Journals (Sweden)

    POOJA SHARMA,

    2010-12-01

    Full Text Available With the explosive growth of information sources available on the World Wide Web, it has become increasingly necessary for user’s to utilize automated tools in order to find, extract, filter and evaluate the desired information and resources. Web structure mining and content mining plays an effective role in this approach. There are two Ranking algorithms PageRank and Weighted PageRank. PageRank is a commonly used algorithm in Web Structure Mining. Weighted Page Rank also takes the importance of the inlinks and outlinks of the pages but the rank score to all links is not equally distributed. i.e. unequal distribution is performed. In this paper we proposed a new algorithm, Weighted Page Content Rank (WPCRbased on web content mining and structure mining that shows the relevancy of the pages to a given query is better determined, as compared to the existing PageRank and Weighted PageRank algorithms.

  5. Theorizing About Translation and Translation Studies

    OpenAIRE

    Williams, Mukesh

    2012-01-01

    The act of translation between languages and cultures has been going on for centuries, but the act of theorizing about-translation is of recent origin. In the last forty years translation scholars have attempted to understand the process of translation and evaluate its merits giving rise to a whole range of conceptualizing which is now called translation studies. Translation studies , therefore, has grown within important scholastic enclosures of the west attempting to conduct political and i...

  6. OnlineMin: A Fast Strongly Competitive Randomized Paging Algorithm

    DEFF Research Database (Denmark)

    Brodal, Gerth Stølting; Moruz, Gabriel; Negoescu, Andrei

    2012-01-01

    n the field of online algorithms paging is one of the most studied problems. For randomized paging algorithms a tight bound of H k on the competitive ratio has been known for decades, yet existing algorithms matching this bound have high running times. We present the first randomized paging...... approach that both has optimal competitiveness and selects victim pages in subquadratic time. In fact, if k pages fit in internal memory the best previous solution required O(k 2) time per request and O(k) space, whereas our approach takes also O(k) space, but only O(logk) time in the worst case per page...

  7. Review Pages: Cities, Energy and Climate Change

    Directory of Open Access Journals (Sweden)

    Gennaro Angiello

    2015-04-01

    Full Text Available Starting from the relationship between urban planning and mobility management, TeMA has gradually expanded the view of the covered topics, always remaining in the groove of rigorous scientific in-depth analysis. During the last two years a particular attention has been paid on the Smart Cities theme and on the different meanings that come with it. The last section of the journal is formed by the Review Pages. They have different aims: to inform on the problems, trends and evolutionary processes; to investigate on the paths by highlighting the advanced relationships among apparently distant disciplinary fields; to explore the interaction’s areas, experiences and potential applications; to underline interactions, disciplinary developments but also, if present, defeats and setbacks. Inside the journal the Review Pages have the task of stimulating as much as possible the circulation of ideas and the discovery of new points of view. For this reason the section is founded on a series of basic’s references, required for the identification of new and more advanced interactions. These references are the research, the planning acts, the actions and the applications, analysed and investigated both for their ability to give a systematic response to questions concerning the urban and territorial planning, and for their attention to aspects such as the environmental sustainability and the innovation in the practices. For this purpose the Review Pages are formed by five sections (Web Resources; Books; Laws; Urban Practices; News and Events, each of which examines a specific aspect of the broader information storage of interest for TeMA.

  8. Review Pages: Cities, Energy and Built Environment

    Directory of Open Access Journals (Sweden)

    Gennaro Angiello

    2015-07-01

    Full Text Available Starting from the relationship between urban planning and mobility management, TeMA has gradually expanded the view of the covered topics, always remaining in the groove of rigorous scientific in-depth analysis. During the last two years a particular attention has been paid on the Smart Cities theme and on the different meanings that come with it. The last section of the journal is formed by the Review Pages. They have different aims: to inform on the problems, trends and evolutionary processes; to investigate on the paths by highlighting the advanced relationships among apparently distant disciplinary fields; to explore the interaction’s areas, experiences and potential applications; to underline interactions, disciplinary developments but also, if present, defeats and setbacks. Inside the journal the Review Pages have the task of stimulating as much as possible the circulation of ideas and the discovery of new points of view. For this reason the section is founded on a series of basic’s references, required for the identification of new and more advanced interactions. These references are the research, the planning acts, the actions and the applications, analysed and investigated both for their ability to give a systematic response to questions concerning the urban and territorial planning, and for their attention to aspects such as the environmental sustainability and the innovation in the practices. For this purpose the Review Pages are formed by five sections (Web Resources; Books; Laws; Urban Practices; News and Events, each of which examines a specific aspect of the broader information storage of interest for TeMA.

  9. Review Pages: Cities, Energy and Mobility

    Directory of Open Access Journals (Sweden)

    Gennaro Angiello

    2015-12-01

    Full Text Available Starting from the relationship between urban planning and mobility management, TeMA has gradually expanded the view of the covered topics, always remaining in the groove of rigorous scientific in-depth analysis. During the last two years a particular attention has been paid on the Smart Cities theme and on the different meanings that come with it. The last section of the journal is formed by the Review Pages. They have different aims: to inform on the problems, trends and evolutionary processes; to investigate on the paths by highlighting the advanced relationships among apparently distant disciplinary fields; to explore the interaction’s areas, experiences and potential applications; to underline interactions, disciplinary developments but also, if present, defeats and setbacks. Inside the journal the Review Pages have the task of stimulating as much as possible the circulation of ideas and the discovery of new points of view. For this reason the section is founded on a series of basic’s references, required for the identification of new and more advanced interactions. These references are the research, the planning acts, the actions and the applications, analysed and investigated both for their ability to give a systematic response to questions concerning the urban and territorial planning, and for their attention to aspects such as the environmental sustainability and the innovation in the practices. For this purpose the Review Pages are formed by five sections (Web Resources; Books; Laws; Urban Practices; News and Events, each of which examines a specific aspect of the broader information storage of interest for TeMA.

  10. Machine testning

    DEFF Research Database (Denmark)

    De Chiffre, Leonardo

    This document is used in connection with a laboratory exercise of 3 hours duration as a part of the course GEOMETRICAL METROLOGY AND MACHINE TESTING. The exercise includes a series of tests carried out by the student on a conventional and a numerically controled lathe, respectively. This document...

  11. Representational Machines

    DEFF Research Database (Denmark)

    Petersson, Dag; Dahlgren, Anna; Vestberg, Nina Lager

    to the enterprises of the medium. This is the subject of Representational Machines: How photography enlists the workings of institutional technologies in search of establishing new iconic and social spaces. Together, the contributions to this edited volume span historical epochs, social environments, technological...

  12. Discontinuation of the Bulletin's menu page

    CERN Multimedia

    Publications Section

    2005-01-01

    The menus of the various CERN restaurants will no longer be published in the Bulletin as of Monday 4 April (issue No. 14/2005). The menu pages are being discontinued both as a savings measure and due to the low level of interest in this section of the Bulletin. The most recent survey of Bulletin readers showed that only 13% of the people questioned regularly read the menu section, compared to between 40% and 85% in the case of the other sections. Publications Section SG/CO Tel. 79971

  13. Discontinuation of the Bulletin's menu page

    CERN Multimedia

    Publications Section

    2005-01-01

    The menus of the various CERN restaurants will no longer be published in the Bulletin as of Monday 4 April (issue No. 14/2005). The menu pages are being discontinued both as a savings measure and due to the low level of interest in this section of the Bulletin. The most recent survey of Bulletin readers showed that only 13% of the people questioned regularly read the menu section, compared to between 40% and 85% in the case of the other sections. Publications Section DSU-CO Tel. 79971

  14. Processing Web Pages for College English Reading

    Institute of Scientific and Technical Information of China (English)

    WangLixin; WangYang; YangMuyun

    2004-01-01

    ELT in the university is to further cultivate students' ability in listening, speaking, reading, writing as well as translation, so that they could effectively perform communication in English. Among the five, reading is now giving his chair to speaking: the circle is giving more and more emphasis on the capability of face-to-face communication.

  15. JavaScript DNA translator: DNA-aligned protein translations.

    Science.gov (United States)

    Perry, William L

    2002-12-01

    There are many instances in molecular biology when it is necessary to identify ORFs in a DNA sequence. While programs exist for displaying protein translations in multiple ORFs in alignment with a DNA sequence, they are often expensive, exist as add-ons to software that must be purchased, or are only compatible with a particular operating system. JavaScript DNA Translator is a shareware application written in JavaScript, a scripting language interpreted by the Netscape Communicator and Internet Explorer Web browsers, which makes it compatible with several different operating systems. While the program uses a familiar Web page interface, it requires no connection to the Internet since calculations are performed on the user's own computer. The program analyzes one or multiple DNA sequences and generates translations in up to six reading frames aligned to a DNA sequence, in addition to displaying translations as separate sequences in FASTA format. ORFs within a reading frame can also be displayed as separate sequences. Flexible formatting options are provided, including the ability to hide ORFs below a minimum size specified by the user. The program is available free of charge at the BioTechniques Software Library (www.Biotechniques.com).

  16. A double-sided linear primary permanent magnet vernier machine.

    Science.gov (United States)

    Du, Yi; Zou, Chunhua; Liu, Xianxing

    2015-01-01

    The purpose of this paper is to present a new double-sided linear primary permanent magnet (PM) vernier (DSLPPMV) machine, which can offer high thrust force, low detent force, and improved power factor. Both PMs and windings of the proposed machine are on the short translator, while the long stator is designed as a double-sided simple iron core with salient teeth so that it is very robust to transmit high thrust force. The key of this new machine is the introduction of double stator and the elimination of translator yoke, so that the inductance and the volume of the machine can be reduced. Hence, the proposed machine offers improved power factor and thrust force density. The electromagnetic performances of the proposed machine are analyzed including flux, no-load EMF, thrust force density, and inductance. Based on using the finite element analysis, the characteristics and performances of the proposed machine are assessed.

  17. A Double-Sided Linear Primary Permanent Magnet Vernier Machine

    Directory of Open Access Journals (Sweden)

    Yi Du

    2015-01-01

    Full Text Available The purpose of this paper is to present a new double-sided linear primary permanent magnet (PM vernier (DSLPPMV machine, which can offer high thrust force, low detent force, and improved power factor. Both PMs and windings of the proposed machine are on the short translator, while the long stator is designed as a double-sided simple iron core with salient teeth so that it is very robust to transmit high thrust force. The key of this new machine is the introduction of double stator and the elimination of translator yoke, so that the inductance and the volume of the machine can be reduced. Hence, the proposed machine offers improved power factor and thrust force density. The electromagnetic performances of the proposed machine are analyzed including flux, no-load EMF, thrust force density, and inductance. Based on using the finite element analysis, the characteristics and performances of the proposed machine are assessed.

  18. Searching to Translate and Translating to Search: When Information Retrieval Meets Machine Translation

    Science.gov (United States)

    Ture, Ferhan

    2013-01-01

    With the adoption of web services in daily life, people have access to tremendous amounts of information, beyond any human's reading and comprehension capabilities. As a result, search technologies have become a fundamental tool for accessing information. Furthermore, the web contains information in multiple languages, introducing another barrier…

  19. Facebook's personal page modelling and simulation

    Science.gov (United States)

    Sarlis, Apostolos S.; Sakas, Damianos P.; Vlachos, D. S.

    2015-02-01

    In this paper we will try to define the utility of Facebook's Personal Page marketing method. This tool that Facebook provides, is modelled and simulated using iThink in the context of a Facebook marketing agency. The paper has leveraged the system's dynamic paradigm to conduct Facebook marketing tools and methods modelling, using iThink™ system to implement them. It uses the design science research methodology for the proof of concept of the models and modelling processes. The following model has been developed for a social media marketing agent/company, Facebook platform oriented and tested in real circumstances. This model is finalized through a number of revisions and iterators of the design, development, simulation, testing and evaluation processes. The validity and usefulness of this Facebook marketing model for the day-to-day decision making are authenticated by the management of the company organization. Facebook's Personal Page method can be adjusted, depending on the situation, in order to maximize the total profit of the company which is to bring new customers, keep the interest of the old customers and deliver traffic to its website.

  20. A Syntactic Classification based Web Page Ranking Algorithm

    CERN Document Server

    Mukhopadhyay, Debajyoti; Kim, Young-Chon

    2011-01-01

    The existing search engines sometimes give unsatisfactory search result for lack of any categorization of search result. If there is some means to know the preference of user about the search result and rank pages according to that preference, the result will be more useful and accurate to the user. In the present paper a web page ranking algorithm is being proposed based on syntactic classification of web pages. Syntactic Classification does not bother about the meaning of the content of a web page. The proposed approach mainly consists of three steps: select some properties of web pages based on user's demand, measure them, and give different weightage to each property during ranking for different types of pages. The existence of syntactic classification is supported by running fuzzy c-means algorithm and neural network classification on a set of web pages. The change in ranking for difference in type of pages but for same query string is also being demonstrated.

  1. Training Activity Summary Page (TASP) State and Tribe

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Training Activity Summary Page (formerly the Training Exit Survey Cover Page) dataset contains data about each training event. This dataset includes information...

  2. Adding machine and calculating machine

    Institute of Scientific and Technical Information of China (English)

    2005-01-01

    In 1642 the French mathematician Blaise Pascal(1623-1662) invented a machine;.that could add and subtract. It had.wheels that each had: 1 to 10 marked off along its circumference. When the wheel at the right, representing units, made one complete circle, it engaged the wheel to its left, represents tens, and moved it forward one notch.

  3. Semantic Web Techniques for Yellow Page Service Providers

    Directory of Open Access Journals (Sweden)

    Raghu Anantharangachar

    2012-08-01

    Full Text Available Applications providing “yellow pages information” for use over the web should ideally be based on structured information. Use of web pages providing unstructured information poses variety of problems to the user, such as use of arbitrary formats, unsuitability for machine processing and likely incompleteness of information. Structured data alleviates these problems but we require more. Capturing the semantics of a domain in the form of an ontology is necessary to ensure that unforeseen application can easily be created at a later date. Very often yellow page systems are implemented using a centralized database. In some cases, human intermediaries accessible over the phone network examine a centralized database and use their reasoning ability to deal with the user’s need for information. Centralized operation and considerable central administration make these systems expensive to operate. Scaling up such systems is difficult. They behave like isolated systems and it is common for such systems to be highly domain specific, for instance systems dealing with accommodation and travel. This paper explores an alternative – a highly distributed system design meeting a variety of needs – considerably reducing efforts required at a central organization, enabling large numbers of vendors to enter information about their own products and services, enablingend-users to contribute information such as their own ratings, using an ontology to describe each domain of application in a flexible manner for uses foreseen and unforeseen, enabling distributed search and mashups, use of vendor independent standards, using reasoning to find the best matches to a given query, geospatial reasoning and a simple, interactive, mobile application/interface. We view this design as one in which vendors and end-users do the bulk of the work in building large distributed collections of information in a Web 2.0 style. We give importance to geo-spatial information and

  4. Digital Ethnography: Library Web Page Redesign among Digital Natives

    Science.gov (United States)

    Klare, Diane; Hobbs, Kendall

    2011-01-01

    Presented with an opportunity to improve Wesleyan University's dated library home page, a team of librarians employed ethnographic techniques to explore how its users interacted with Wesleyan's current library home page and web pages in general. Based on the data that emerged, a group of library staff and members of the campus' information…

  5. Required Discussion Web Pages in Psychology Courses and Student Outcomes

    Science.gov (United States)

    Pettijohn, Terry F., II; Pettijohn, Terry F.

    2007-01-01

    We conducted 2 studies that investigated student outcomes when using discussion Web pages in psychology classes. In Study 1, we assigned 213 students enrolled in Introduction to Psychology courses to either a mandatory or an optional Web page discussion condition. Students used the discussion Web page significantly more often and performed…

  6. Young Children's Interpretations of Page Breaks in Contemporary Picture Storybooks

    Science.gov (United States)

    Sipe, Lawrence R.; Brightman, Anne E.

    2009-01-01

    This article reports on a study of the responses of a second-grade class to the page breaks in contemporary picturebooks. In a picturebook, the text and accompanying illustrations are divided into a series of facing pages called openings, and the divisions between the openings are called page breaks or turns. Unlike a novel, in which the page…

  7. Translator's preface.

    Science.gov (United States)

    Lamiell, James T

    2013-08-01

    Presents a preface from James T. Lamiell, who translates Wilhelm Wundt's Psychology's Struggle for Existence (Die Psychologie im Kampf ums Dasein), in which Wundt advised against the impending divorce of psychology from philosophy, into English. Lamiell comments that more than a decade into the 21st century, it appears that very few psychologists have any interest at all in work at the interface of psychology and philosophy. He notes that one clear indication of this is that the Society for Theoretical and Philosophical Psychology, which is Division 24 of the American Psychological Association (APA), remains one of the smallest of the APA's nearly 60 divisions. (PsycINFO Database Record (c) 2013 APA, all rights reserved).

  8. Genesis machines

    CERN Document Server

    Amos, Martyn

    2014-01-01

    Silicon chips are out. Today's scientists are using real, wet, squishy, living biology to build the next generation of computers. Cells, gels and DNA strands are the 'wetware' of the twenty-first century. Much smaller and more intelligent, these organic computers open up revolutionary possibilities. Tracing the history of computing and revealing a brave new world to come, Genesis Machines describes how this new technology will change the way we think not just about computers - but about life itself.

  9. On Feminist Translation Theory and Literary Translation

    Institute of Scientific and Technical Information of China (English)

    狄东睿

    2015-01-01

    Feminist translation theory emerged in the late 70s and early 80s of 20th Century. It is the combination of the feminist movement and the“cultural turn”of translation. It was introduced to China in the 1980s, and with the development of the transla⁃tion theory and translation practice, more and more Chinese translators want to study the feminist translation theory from the deep levels.

  10. Migrating Multi-page Web Applications to Single-page AJAX Interfaces

    NARCIS (Netherlands)

    Mesbah, A.; Van Deursen, A.

    2006-01-01

    Recently, a new web development technique for creating interactive web applications, dubbed AJAX, has emerged. In this new model, the single-page web interface is composed of individual components which can be updated/replaced independently. With the rise of AJAX web applications classical multi-pag

  11. Efficient Checkpointing of Virtual Machines using Virtual Machine Introspection

    Energy Technology Data Exchange (ETDEWEB)

    Aderholdt, Ferrol [Tennessee Technological University; Han, Fang [Tennessee Technological University; Scott, Stephen L [ORNL; Naughton, III, Thomas J [ORNL

    2014-01-01

    Cloud Computing environments rely heavily on system-level virtualization. This is due to the inherent benefits of virtualization including fault tolerance through checkpoint/restart (C/R) mechanisms. Because clouds are the abstraction of large data centers and large data centers have a higher potential for failure, it is imperative that a C/R mechanism for such an environment provide minimal latency as well as a small checkpoint file size. Recently, there has been much research into C/R with respect to virtual machines (VM) providing excellent solutions to reduce either checkpoint latency or checkpoint file size. However, these approaches do not provide both. This paper presents a method of checkpointing VMs by utilizing virtual machine introspection (VMI). Through the usage of VMI, we are able to determine which pages of memory within the guest are used or free and are better able to reduce the amount of pages written to disk during a checkpoint. We have validated this work by using various benchmarks to measure the latency along with the checkpoint size. With respect to checkpoint file size, our approach results in file sizes within 24% or less of the actual used memory within the guest. Additionally, the checkpoint latency of our approach is up to 52% faster than KVM s default method.

  12. Machine Learning Based Statistical Prediction Model for Improving Performance of Live Virtual Machine Migration

    Directory of Open Access Journals (Sweden)

    Minal Patel

    2016-01-01

    Full Text Available Service can be delivered anywhere and anytime in cloud computing using virtualization. The main issue to handle virtualized resources is to balance ongoing workloads. The migration of virtual machines has two major techniques: (i reducing dirty pages using CPU scheduling and (ii compressing memory pages. The available techniques for live migration are not able to predict dirty pages in advance. In the proposed framework, time series based prediction techniques are developed using historical analysis of past data. The time series is generated with transferring of memory pages iteratively. Here, two different regression based models of time series are proposed. The first model is developed using statistical probability based regression model and it is based on ARIMA (autoregressive integrated moving average model. The second one is developed using statistical learning based regression model and it uses SVR (support vector regression model. These models are tested on real data set of Xen to compute downtime, total number of pages transferred, and total migration time. The ARIMA model is able to predict dirty pages with 91.74% accuracy and the SVR model is able to predict dirty pages with 94.61% accuracy that is higher than ARIMA.

  13. Translating into Free Word Order Languages

    CERN Document Server

    Hoffman, B

    1996-01-01

    In this paper, I discuss machine translation of English text into Turkish, a relatively ``free'' word order language. I present algorithms that determine the topic and the focus of each target sentence (using salience (Centering Theory), old vs. new information, and contrastiveness in the discourse model) in order to generate the contextually appropriate word orders in the target language.

  14. An Efficient PageRank Approach for Urban Traffic Optimization

    Directory of Open Access Journals (Sweden)

    Florin Pop

    2012-01-01

    to determine optimal decisions for each traffic light, based on the solution given by Larry Page for page ranking in Web environment (Page et al. (1999. Our approach is similar with work presented by Sheng-Chung et al. (2009 and Yousef et al. (2010. We consider that the traffic lights are controlled by servers and a score for each road is computed based on efficient PageRank approach and is used in cost function to determine optimal decisions. We demonstrate that the cumulative contribution of each car in the traffic respects the main constrain of PageRank approach, preserving all the properties of matrix consider in our model.

  15. A Model for Web Page Usage Mining Based on Segmentation

    CERN Document Server

    Kuppusamy, K S

    2012-01-01

    The web page usage mining plays a vital role in enriching the page's content and structure based on the feedbacks received from the user's interactions with the page. This paper proposes a model for micro-managing the tracking activities by fine-tuning the mining from the page level to the segment level. The proposed model enables the web-master to identify the segments which receives more focus from users comparing with others. The segment level analytics of user actions provides an important metric to analyse the factors which facilitate the increase in traffic for the page. The empirical validation of the model is performed through prototype implementation.

  16. Facebook Pages and the Effects of Reputation Management

    Directory of Open Access Journals (Sweden)

    Nicoleta Ciacu

    2013-07-01

    Full Text Available This paper aims to identify the categories which own the largest number of Facebook pages as well as the Romanian Facebook pages which have the largest number of fans. Another aim of this study is to analyse the categories generating the largest number of fans on their official Facebook pages, the increase of the number of fans in the last month and the Facebook pages generating the biggest feedback among their fans. The data was collected by using Social Media analysis sites, such as www.socialbakers.com and www.facebrands.ro. The study sample comprises 20 Facebook pages with the largest number of fans.

  17. A New Page Ranking Algorithm Based On WPRVOL Algorithm

    Directory of Open Access Journals (Sweden)

    Roja Javadian Kootenae

    2013-03-01

    Full Text Available The amount of information on the web is always growing, thus powerful search tools are needed to search for such a large collection. Search engines in this direction help users so they can find their desirable information among the massive volume of information in an easier way. But what is important in the search engines and causes a distinction between them is page ranking algorithm used in them. In this paper a new page ranking algorithm based on "Weighted Page Ranking based on Visits of Links (WPRVOL Algorithm" for search engines is being proposed which is called WPR'VOL for short. The proposed algorithm considers the number of visits of first and second level in-links. The original WPRVOL algorithm takes into account the number of visits of first level in-links of the pages and distributes rank scores based on the popularity of the pages whereas the proposed algorithm considers both in-links of that page (first level in-links and in-links of the pages that point to it (second level in-links in order to calculation of rank of the page, hence more related pages are displayed at the top of search result list. In the summary it is said that the proposed algorithm assigns higher rank to pages that both themselves and pages that point to them be important.

  18. PageRank算法的研究与改进

    Institute of Scientific and Technical Information of China (English)

    李青淋; 邵家玉

    2016-01-01

    Page ranking algorithm is the core of the search engine.Through the analysis of the traditional PageRank algo-rithm,the algorithm is based on the link between the pages,so it has deficiencies of emphasis on the old web pages,theme offset and so on.In order to improve the accuracy of the page ranking algorithm,an improved PageRank algorithm is pro-posed based on the analysis of the correlation factor of the page content and the time factor of the page update.%页面排序算法是搜索引擎的核心之一。通过分析传统PageRank算法可知该算法主要是依据页面之间的链接关系,容易出现偏重旧网页、主题偏移等不足之处。为了提高网页排序算法的准确率,结合网页内容相关性因子和网页更新时间因子,提出一种改进的PageRank算法。实验结果表明,改进后的PageRank算法提高了搜索的查全率与查准率,提高了网页排序的质量。

  19. PageRank optimization applied to spam detection

    CERN Document Server

    Fercoq, Olivier

    2012-01-01

    We give a new link spam detection and PageRank demotion algorithm called MaxRank. Like TrustRank and AntiTrustRank, it starts with a seed of hand-picked trusted and spam pages. We define the MaxRank of a page as the frequency of visit of this page by a random surfer minimizing an average cost per time unit. On a given page, the random surfer selects a set of hyperlinks and clicks with uniform probability on any of these hyperlinks. The cost function penalizes spam pages and hyperlink removals. The goal is to determine a hyperlink deletion policy that minimizes this score. The MaxRank is interpreted as a modified PageRank vector, used to sort web pages instead of the usual PageRank vector. The bias vector of this ergodic control problem, which is unique up to an additive constant, is a measure of the "spamicity" of each page, used to detect spam pages. We give a scalable algorithm for MaxRank computation that allowed us to perform experimental results on the WEBSPAM-UK2007 dataset. We show that our algorithm o...

  20. The Web Application Test Based on Page Coverage Criteria

    Institute of Scientific and Technical Information of China (English)

    CAI Li-zhi; TONG Wei-qin; YANG Gen-xing

    2008-01-01

    Software testing coverage criteria play an important role in the whole testing process. The current coverage criteria for web applications are based on program or URL. They are not suitable for black-box test or intuitional to use. This paper defines a kind of test criteria based on page coverage sequences only navigated by web application, including Page_Single, Page_Post, Page_Pre,Page_Seq2, Page_SeqK. The test criteria based on page coverage sequences made by interactions between web application and browser are being under consideration after that. In order to avoid ambiguity of natural language, these coverage criteria are depicted using Z formal language. The empirical result shows that the criteria complement traditional coverage and fault detection capability criteria.

  1. Simulating Turing machines on Maurer machines

    NARCIS (Netherlands)

    Bergstra, J.A.; Middelburg, C.A.

    2008-01-01

    In a previous paper, we used Maurer machines to model and analyse micro-architectures. In the current paper, we investigate the connections between Turing machines and Maurer machines with the purpose to gain an insight into computability issues relating to Maurer machines. We introduce ways to

  2. Machine learning in medicine cookbook

    CERN Document Server

    Cleophas, Ton J

    2014-01-01

    The amount of data in medical databases doubles every 20 months, and physicians are at a loss to analyze them. Also, traditional methods of data analysis have difficulty to identify outliers and patterns in big data and data with multiple exposure / outcome variables and analysis-rules for surveys and questionnaires, currently common methods of data collection, are, essentially, missing. Obviously, it is time that medical and health professionals mastered their reluctance to use machine learning and the current 100 page cookbook should be helpful to that aim. It covers in a condensed form the subjects reviewed in the 750 page three volume textbook by the same authors, entitled “Machine Learning in Medicine I-III” (ed. by Springer, Heidelberg, Germany, 2013) and was written as a hand-hold presentation and must-read publication. It was written not only to investigators and students in the fields, but also to jaded clinicians new to the methods and lacking time to read the entire textbooks. General purposes ...

  3. Traduccion automatica mediante el ordenador (Automatic Translation Using a Computer).

    Science.gov (United States)

    Bueno, Julian L.

    This report on machine translation contains a brief history of the field; a description of the processes involved; a discussion of systems currently in use, including three software packages on the market (Teaching Assistant, Translate, and Globalink); reflections on implications for teaching; observations of results obtained when elements of…

  4. The Sunspot Catalogues of Carrington, Peters, and de la Rue: Quality Control and Machine-readable Versions

    CERN Document Server

    Casas, R

    2013-01-01

    In the 19th century, several astronomers made observations of sunspots, recording their positions and sometimes their areas. These observations were published in the form of extensive tables, but have been unhelpful until now. Three of these observers were Richard C. Carrington, Christian H. F. Peters, and Warren de la Rue (and their respective collaborators). They published, in various articles the data corresponding to 26 641 sunspot positions (Carrington, Peters, and de la Rue registered 4 900, 14 040, and 7 701 sunspot positions, respectively). In this paper we present a translation of more than 400 pages of their printed numerical tables into a machine readable format, including an initial analysis targeted at detecting possible mistakes in the reading or in the original transcription. The observations carried out by these three astronomers have been made available at the Centre de Don\\'ees Astronomiques de Strasbourg (http://cdsarc.u-strasbg.fr/cgi-bin/VizieR?-source=VI/138).

  5. Environmentally Friendly Machining

    CERN Document Server

    Dixit, U S; Davim, J Paulo

    2012-01-01

    Environment-Friendly Machining provides an in-depth overview of environmentally-friendly machining processes, covering numerous different types of machining in order to identify which practice is the most environmentally sustainable. The book discusses three systems at length: machining with minimal cutting fluid, air-cooled machining and dry machining. Also covered is a way to conserve energy during machining processes, along with useful data and detailed descriptions for developing and utilizing the most efficient modern machining tools. Researchers and engineers looking for sustainable machining solutions will find Environment-Friendly Machining to be a useful volume.

  6. Korean to English Translation Using Synchronous TAGs

    CERN Document Server

    Egedi, D; Park, H S; Joshi, A K; Egedi, Dania; Palmer, Martha; Park, Hyun S.; Joshi, Aravind K.

    1994-01-01

    It is often argued that accurate machine translation requires reference to contextual knowledge for the correct treatment of linguistic phenomena such as dropped arguments and accurate lexical selection. One of the historical arguments in favor of the interlingua approach has been that, since it revolves around a deep semantic representation, it is better able to handle the types of linguistic phenomena that are seen as requiring a knowledge-based approach. In this paper we present an alternative approach, exemplified by a prototype system for machine translation of English and Korean which is implemented in Synchronous TAGs. This approach is essentially transfer based, and uses semantic feature unification for accurate lexical selection of polysemous verbs. The same semantic features, when combined with a discourse model which stores previously mentioned entities, can also be used for the recovery of topicalized arguments. In this paper we concentrate on the translation of Korean to English.

  7. A Comparison of Different Machine Transliteration Models

    CERN Document Server

    Choi, K; Oh, J; 10.1613/jair.1999

    2011-01-01

    Machine transliteration is a method for automatically converting words in one language into phonetically equivalent ones in another language. Machine transliteration plays an important role in natural language applications such as information retrieval and machine translation, especially for handling proper nouns and technical terms. Four machine transliteration models -- grapheme-based transliteration model, phoneme-based transliteration model, hybrid transliteration model, and correspondence-based transliteration model -- have been proposed by several researchers. To date, however, there has been little research on a framework in which multiple transliteration models can operate simultaneously. Furthermore, there has been no comparison of the four models within the same framework and using the same data. We addressed these problems by 1) modeling the four models within the same framework, 2) comparing them under the same conditions, and 3) developing a way to improve machine transliteration through this com...

  8. Calibrating page sized Gafchromic EBT3 films

    Energy Technology Data Exchange (ETDEWEB)

    Crijns, W.; Maes, F.; Heide, U. A. van der; Van den Heuvel, F. [Department of Radiation Oncology, University Hospitals Leuven, Herestraat 49, 3000 Leuven (Belgium); Department ESAT/PSI-Medical Image Computing, Medical Imaging Research Center, KU Leuven, Herestraat 49, 3000 Leuven (Belgium); Department of Radiation Oncology, The Netherlands Cancer Institute-Antoni van Leeuwenhoek Hospital, Plesmanlaan 121, 1066 CX Amsterdam (Netherlands); Department of Radiation Oncology, University Hospitals Leuven, Herestraat 49, 3000 Leuven (Belgium)

    2013-01-15

    Purpose: The purpose is the development of a novel calibration method for dosimetry with Gafchromic EBT3 films. The method should be applicable for pretreatment verification of volumetric modulated arc, and intensity modulated radiotherapy. Because the exposed area on film can be large for such treatments, lateral scan errors must be taken into account. The correction for the lateral scan effect is obtained from the calibration data itself. Methods: In this work, the film measurements were modeled using their relative scan values (Transmittance, T). Inside the transmittance domain a linear combination and a parabolic lateral scan correction described the observed transmittance values. The linear combination model, combined a monomer transmittance state (T{sub 0}) and a polymer transmittance state (T{sub {infinity}}) of the film. The dose domain was associated with the observed effects in the transmittance domain through a rational calibration function. On the calibration film only simple static fields were applied and page sized films were used for calibration and measurements (treatment verification). Four different calibration setups were considered and compared with respect to dose estimation accuracy. The first (I) used a calibration table from 32 regions of interest (ROIs) spread on 4 calibration films, the second (II) used 16 ROIs spread on 2 calibration films, the third (III), and fourth (IV) used 8 ROIs spread on a single calibration film. The calibration tables of the setups I, II, and IV contained eight dose levels delivered to different positions on the films, while for setup III only four dose levels were applied. Validation was performed by irradiating film strips with known doses at two different time points over the course of a week. Accuracy of the dose response and the lateral effect correction was estimated using the dose difference and the root mean squared error (RMSE), respectively. Results: A calibration based on two films was the optimal

  9. 搜索引擎PageRank算法的改进%Improvement of PageRank Algorithm for Search Engine

    Institute of Scientific and Technical Information of China (English)

    杨劲松; 凌培亮

    2009-01-01

    In order to solve the problems in information retrieval when enterprise making rapid decision, this paper proposes an improved PageRank algorithm. Considering the time factor by Web page, it distributes the forward link different PageRank value based on the proportion by the similarity analysis between anchor text and Web page text. The final PageRank value is more suitable for topic-specific search engine and keeps simplicity of algorithm. Experimental result shows that the improved algorithm can effectively reduce the phenomenon of topic-drift and enhance the PageRank value of new Web page.%为了解决企业快速决策时信息检索的问题,提出一种改进的PageRank算法.在考虑网页产生时间因素的同时,通过锚文本与网页主题的相似度分析按权重分配网页各正向链接PageRank值,产生的PageRank值更贴合主题搜索引擎的要求,并保持算法的简洁性.实验结果证明该改进算法能有效减少主题漂移现象,恰当提升新网页PageRank值.

  10. Terra: a Collection of Translation Error-Annotated Corpora

    OpenAIRE

    Popović, Maja; Fishel, Mark; Bojar, Ondřej

    2012-01-01

    Recently the first methods of automatic diagnostics of machine translation have emerged; since this area of research is relatively young, the efforts are not coordinated. We present a collection of translation error-annotated corpora, consisting of automatically produced trans- lations and their detailed manual translation error analysis. Using the collected corpora we evaluate the available state-of-the-art methods of MT diagnostics and assess, how well the methods perform, how they...

  11. Deconstructive Reconsideration of Translation

    Institute of Scientific and Technical Information of China (English)

    刘艳清

    2014-01-01

    This paper is on the deconstructive reconsideration of the definition of translation, the relation of source text and translation, as well as the relation of translator and author. It also points out the differences between tranditioanal concepts of translation and deconstructive reconsideration of translation.

  12. On Becoming a Translator

    Institute of Scientific and Technical Information of China (English)

    王莹

    2015-01-01

    Douglas Robinson's Becoming a Translator:An Introduction to the Theory and Practice of Translation has been well received around the world.Fusing translation theory with advice and information about the practicalities of translating,it is an essential resource for novice and practicing translators.

  13. Translator Studies in Retrospect

    Institute of Scientific and Technical Information of China (English)

    郭勤

    2009-01-01

    The concern about translator has run for thousands of years on which many translators and translation scholars have made comments.A rough examination of the translator within translation studies at home and abroad is expounded in a loosely chronological way as follows.

  14. On Becoming a Translator

    Institute of Scientific and Technical Information of China (English)

    王莹

    2015-01-01

    Douglas Robinson’s Becoming a Translator:An Introduction to the Theory and Practice of Translation has been well received around the world.Fusing translation theory with advice and information about the practicalities of translating,it is an essential resource for novice and practicing translators.

  15. Speaking your Translation

    DEFF Research Database (Denmark)

    Dragsted, Barbara; Mees, Inger M.; Gorm Hansen, Inge

    2011-01-01

    In this article we discuss the translation processes and products of 14 MA students who produced translations from Danish (L1) into English (L2) under different working conditions: (1) written translation, (2) sight translation, and (3) sight translation with a speech recognition (SR) tool. Audio...

  16. Equivalence in Translation

    Institute of Scientific and Technical Information of China (English)

    李良杰

    2013-01-01

    There are many researches about translation theories and methods in western translation history. Equivalence in transla⁃tion has always been the central issue for discussion. This paper gives a general review and comment on equivalence in translation in terms of three representative translation theorists and their views about equivalence in translation.

  17. Machine Protection

    CERN Document Server

    Schmidt, R

    2014-01-01

    The protection of accelerator equipment is as old as accelerator technology and was for many years related to high-power equipment. Examples are the protection of powering equipment from overheating (magnets, power converters, high-current cables), of superconducting magnets from damage after a quench and of klystrons. The protection of equipment from beam accidents is more recent. It is related to the increasing beam power of high-power proton accelerators such as ISIS, SNS, ESS and the PSI cyclotron, to the emission of synchrotron light by electron–positron accelerators and FELs, and to the increase of energy stored in the beam (in particular for hadron colliders such as LHC). Designing a machine protection system requires an excellent understanding of accelerator physics and operation to anticipate possible failures that could lead to damage. Machine protection includes beam and equipment monitoring, a system to safely stop beam operation (e.g. dumping the beam or stopping the beam at low energy) and an ...

  18. Machine Readable Passports & The Visa Waiver Programme

    CERN Document Server

    2003-01-01

    From 1 October 2003, all passengers intending to enter the USA on the Visa Waiver Programme (VWP) will be required to present a machine-readable passport (MRP). Passengers travelling to the USA with a non-machine readable passport will require a valid US entry visa. Applying for a US visa is a lengthy process, which can take several weeks or even months. Therefore it is strongly recommended that: • All Visa Waiver nationals who hold a non-machine readable passport should obtain a MRP before their next visit to the USA. • Children travelling on a parent's passport (be it machine readable or non-machine readable) cannot benefit from the Visa Waiver Programme and should obtain their own MRP prior to travelling to the USA or request a visa. What is a Machine Readable Passport (MRP)? A MRP has the holders' personal details, e.g. name, date of birth, nationality and their passport number contained in two lines of text at the base of the photo page. This text may be read by machine. These 2 lines ...

  19. Hierarchical Web Page Classification Based on a Topic Model and Neighboring Pages Integration

    OpenAIRE

    Sriurai, Wongkot; Meesad, Phayung; Haruechaiyasak, Choochart

    2010-01-01

    Most Web page classification models typically apply the bag of words (BOW) model to represent the feature space. The original BOW representation, however, is unable to recognize semantic relationships between terms. One possible solution is to apply the topic model approach based on the Latent Dirichlet Allocation algorithm to cluster the term features into a set of latent topics. Terms assigned into the same topic are semantically related. In this paper, we propose a novel hierarchical class...

  20. Striations in PageRank-Ordered Matrices

    CERN Document Server

    Pennycuff, Corey

    2016-01-01

    Patterns often appear in a variety of large, real-world networks, and interesting physical phenomena are often explained by network topology as in the case of the bow-tie structure of the World Wide Web, or the small world phenomenon in social networks. The discovery and modelling of such regular patterns has a wide application from disease propagation to financial markets. In this work we describe a newly discovered regularly occurring striation pattern found in the PageRank ordering of adjacency matrices that encode real-world networks. We demonstrate that these striations are the result of well-known graph generation processes resulting in regularities that are manifest in the typical neighborhood distribution. The spectral view explored in this paper encodes a tremendous amount about the explicit and implicit topology of a given network, so we also discuss the interesting network properties, outliers and anomalies that a viewer can determine from a brief look at the re-ordered matrix.

  1. SDS -PAGE and Western Blotting Techniques.

    Science.gov (United States)

    Blancher, C; Jones, A

    2001-01-01

    The goal of Western blotting, or more correctly, immunoblotting, is to identify with a specific antibody a particular antigen within a complex mixture of proteins that has been fractionated in a polyacrylamide gel and immobilized onto a membrane. Immunoblotting can be used to determine a number of important characteristics of protein antigens-the presence and quantity of an antigen, the relative molecular weight of the polypeptide chain, and the efficiency of extraction of the antigen.Immunoblotting occurs in six stages: (1) extraction and quantification of protein samples; (2) resolution of the protein sample in sodium dodecyl sulfatepolyacrylamide denaturing gel electrophoresis (SDS-PAGE); (3) transfer of the separated polypeptides to a membrane support; (4) blocking nonspecific binding sites on the membrane; (5) addition of antibodies; and (6) detection.Sample preparation is important for obtaining accurate separation of the proteins on the basis of molecular weight. Depending on whether an antigen is primarily extracellular, cytoplasmic, or membrane-associated different procedures might be required to prepare the sample initially. Although there are exceptions, many soluble nuclear and cytoplasmic proteins can be solubilized by lysis buffers that contain the nonionic detergent Nonidet P-40 (NP-40) and either no salt at all or relatively high concentrations of salt (e.g., 0.5 M NaCl). However, the efficiency of extraction is often greatly affected by pH of the buffer and the presence or absence of chelating agents such EDTA.

  2. Literal Translation using Google Translate in Translating the Text from French to English in Digital Tourism Brochure “Bienvenue À Paris”

    Directory of Open Access Journals (Sweden)

    Rila Hilma

    2011-05-01

    Full Text Available Translation is basically change of form. The form from which the translation is made will be called the source language and the form into which it is to be changed will be called the receptor language. Translation consists of transferring the meaning of the source language into the receptor language. Translating is not an easy job to do because many things to be considered to do this activity because translation means determining the meaning of a text, then reconstructing this same meaning using the appropriate structure and form in the receptor language. Translation is basically divided by two types of translation, one is literal and the other is idiomatic. Literal translation is really strict to the structure and form then often can not well express the true meaning of source language. Idiomatic translation makes every effort to communicate the meaning of the source language text in the natural forms of the receptor language. Then the most popular translation machine, Google Translate, in this study shows the results of translation which remain odd, unnatural, and nonsensical because the unsuccessful of message delivery, which is notably the typically error of literal translation.

  3. Analysis of machining and machine tools

    CERN Document Server

    Liang, Steven Y

    2016-01-01

    This book delivers the fundamental science and mechanics of machining and machine tools by presenting systematic and quantitative knowledge in the form of process mechanics and physics. It gives readers a solid command of machining science and engineering, and familiarizes them with the geometry and functionality requirements of creating parts and components in today’s markets. The authors address traditional machining topics, such as: single and multiple point cutting processes grinding components accuracy and metrology shear stress in cutting cutting temperature and analysis chatter They also address non-traditional machining, such as: electrical discharge machining electrochemical machining laser and electron beam machining A chapter on biomedical machining is also included. This book is appropriate for advanced undergraduate and graduate mechani cal engineering students, manufacturing engineers, and researchers. Each chapter contains examples, exercises and their solutions, and homework problems that re...

  4. Translation Theory and Translation Studies in China

    Science.gov (United States)

    Guo, Qin

    2012-01-01

    This dissertation is a comparative study of "translation theory" and "translation studies" in China and the West. Its focus is to investigate whether there is translation theory in the Chinese tradition. My study begins with an examination of the debate in China over whether there has already existed a system of translation…

  5. Extraction of Flat and Nested Data Records from Web Pages

    CERN Document Server

    Hiremath, P S

    2010-01-01

    This paper studies the problem of identification and extraction of flat and nested data records from a given web page. With the explosive growth of information sources available on the World Wide Web, it has become increasingly difficult to identify the relevant pieces of information, since web pages are often cluttered with irrelevant content like advertisements, navigation-panels, copyright notices etc., surrounding the main content of the web page. Hence, it is useful to mine such data regions and data records in order to extract information from such web pages to provide value-added services. Currently available automatic techniques to mine data regions and data records from web pages are still unsatisfactory because of their poor performance. In this paper a novel method to identify and extract the flat and nested data records from the web pages automatically is proposed. It comprises of two steps : (1) Identification and Extraction of the data regions based on visual clues information. (2) Identificatio...

  6. U.S. Army Research Laboratory (ARL) Corporate Dari Document Transcription and Translation Guidelines

    Science.gov (United States)

    2012-10-01

    text file format. 15. SUBJECT TERMS Transcription, Translation, guidelines, ground truth, Optical character recognition, OCR, Machine Translation, MT...foreign language into a target language in order to train, test, and evaluate optical character recognition (OCR) and machine translation (MT) embedded...the final word in this sequence). 5. Spacing: When printed words give the illusion of having a space in between letters in the word due to

  7. Speaking your Translation

    DEFF Research Database (Denmark)

    Dragsted, Barbara; Mees, Inger M.; Gorm Hansen, Inge

    2011-01-01

    In this article we discuss the translation processes and products of 14 MA students who produced translations from Danish (L1) into English (L2) under different working conditions: (1) written translation, (2) sight translation, and (3) sight translation with a speech recognition (SR) tool. Audio...... output and keystrokes were recorded. Oral and written translation data were examined in order to investigate if task times and translation quality differed in the three modalities. Although task times were found to be highest in written translation, the quality was not consistently better. In addition...

  8. An Efficient PageRank Approach for Urban Traffic Optimization

    OpenAIRE

    2012-01-01

    The cities are not static environments. They change constantly. When we talk about traffic in the city, the evolution of traffic lights is a journey from mindless automation to increasingly intelligent, fluid traffic management. In our approach, presented in this paper, reinforcement-learning mechanism based on cost function is introduced to determine optimal decisions for each traffic light, based on the solution given by Larry Page for page ranking in Web environment (Page et al. (1999))...

  9. One-Page Multimedia Interactive Map

    Directory of Open Access Journals (Sweden)

    Nicola Maiellaro

    2017-01-01

    Full Text Available The relevance of local knowledge in cultural heritage is by now acknowledged. It helps to determine many community-based projects by identifying the material to be digitally maintained in multimedia collections provided by communities of volunteers, rather than for-profit businesses or government entities. Considering that the search and browsing of texts, images, video, and 3D models related to places is more essential than using a simple text-based search, an interactive multimedia map was implemented in this study. The map, which is loaded on a single HyperText Markup Language (HTML page using AJAX (Asynchronous JavaScript and XML, with a client-side control mechanism utilising jQuery components that are both freely available and ad-hoc developed, is updated according to user interaction. To simplify the publication of geo-referenced information, the application stores all the data in a Geographic JavaScript Object Notation (GeoJSON file rather than in a database. The multimedia contents—associated with the selected Points of Interest (PoIs—can be selected through text search and list browsing as well as by viewing their previews one by one in a sequence all together in a scrolling window (respectively: “Table”, “Folder”, and “Tile” functions. PoIs—visualised on the map with multi-shape markers using a set of unambiguous colours—can be filtered through their categories and types, accessibility status and timeline, thus improving the system usability. The map functions are illustrated using data collected in a Comenius project. Notes on the application software and architecture are also presented in this paper.

  10. A thorough spring-clean for CERN's Web pages

    CERN Multimedia

    2001-01-01

    This coming Tuesday will see the unveiling of CERN's new user pages on the Web. Their simplified layout and design will make everybody's lives a whole lot easier. Stand by for Tuesday 17 April when, as announced in the Weekly Bulletin of 2 April (n°14/2001), the new newly-designed users welcome page will be hitting our screens as the default CERN home page. But don't worry, if you've got the blues for the good old blue-green home page it's still in service and, to ensure a smooth transition, will be maintained in parallel until 25 May. But in all likelihood you'll be quickly won over by the new-look pages, which are so much simpler to use. Welcome to the new Web! The aim of this revamp, led by the WPE (Web Public Education) group, is to simplify and introduce a more logical hierarchy into the menus and welcome pages on CERN's Intranet. In a second stage, the 'General Public' pages will get a similar makeover. The fact is that the number of links on the user pages, and in particular the welcome page...

  11. DISTRIBUTED APPROACH to WEB PAGE CATEGORIZATION USING MAPREDUCE PROGRAMMING MODEL

    Directory of Open Access Journals (Sweden)

    P.Malarvizhi

    2011-12-01

    Full Text Available The web is a large repository of information and to facilitate the search and retrieval of pages from it,categorization of web documents is essential. An effective means to handle the complexity of information retrieval from the internet is through automatic classification of web pages. Although lots of automatic classification algorithms and systems have been presented, most of the existing approaches are computationally challenging. In order to overcome this challenge, we have proposed a parallel algorithm, known as MapReduce programming model to automatically categorize the web pages. This approach incorporates three concepts. They are web crawler, MapReduce programming model and the proposed web page categorization approach. Initially, we have utilized web crawler to mine the World Wide Web and the crawled web pages are then directly given as input to the MapReduce programming model. Here the MapReduce programming model adapted to our proposed web page categorization approach finds the appropriate category of the web page according to its content. The experimental results show that our proposed parallel web page categorization approach achieves satisfactory results in finding the right category for any given web page.

  12. A Heuristic Algorithm for optimizing Page Selection Instructions

    CERN Document Server

    Li, Qing'an; Chen, Yong; Wu, Wei; Xu, Wenwen

    2010-01-01

    Page switching is a technique that increases the memory in microcontrollers without extending the address buses. This technique is widely used in the design of 8-bit MCUs. In this paper, we present an algorithm to reduce the overhead of page switching. To pursue small code size, we place the emphasis on the allocation of functions into suitable pages with a heuristic algorithm, thereby the cost-effective placement of page selection instructions. Our experimental results showed the optimization achieved a reduction in code size of 13.2 percent.

  13. Recognition of pornographic web pages by classifying texts and images.

    Science.gov (United States)

    Hu, Weiming; Wu, Ou; Chen, Zhouyao; Fu, Zhouyu; Maybank, Steve

    2007-06-01

    With the rapid development of the World Wide Web, people benefit more and more from the sharing of information. However, Web pages with obscene, harmful, or illegal content can be easily accessed. It is important to recognize such unsuitable, offensive, or pornographic Web pages. In this paper, a novel framework for recognizing pornographic Web pages is described. A C4.5 decision tree is used to divide Web pages, according to content representations, into continuous text pages, discrete text pages, and image pages. These three categories of Web pages are handled, respectively, by a continuous text classifier, a discrete text classifier, and an algorithm that fuses the results from the image classifier and the discrete text classifier. In the continuous text classifier, statistical and semantic features are used to recognize pornographic texts. In the discrete text classifier, the naive Bayes rule is used to calculate the probability that a discrete text is pornographic. In the image classifier, the object's contour-based features are extracted to recognize pornographic images. In the text and image fusion algorithm, the Bayes theory is used to combine the recognition results from images and texts. Experimental results demonstrate that the continuous text classifier outperforms the traditional keyword-statistics-based classifier, the contour-based image classifier outperforms the traditional skin-region-based image classifier, the results obtained by our fusion algorithm outperform those by either of the individual classifiers, and our framework can be adapted to different categories of Web pages.

  14. Fuzzy Clustering Method for Web User Based on Pages Classification

    Institute of Scientific and Technical Information of China (English)

    ZHAN Li-qiang; LIU Da-xin

    2004-01-01

    A new method for Web users fuzzy clustering based on analysis of user interest characteristic is proposed in this article.The method first defines page fuzzy categories according to the links on the index page of the site, then computes fuzzy degree of cross page through aggregating on data of Web log.After that, by using fuzzy comprehensive evaluation method, the method constructs user interest vectors according to page viewing times and frequency of hits, and derives the fuzzy similarity matrix from the interest vectors for the Web users.Finally, it gets the clustering result through the fuzzy clustering method.The experimental results show the effectiveness of the method.

  15. A Model for Web Page Usage Mining Based on Segmentation

    OpenAIRE

    Kuppusamy, K. S.; Aghila, G.

    2012-01-01

    The web page usage mining plays a vital role in enriching the page's content and structure based on the feedbacks received from the user's interactions with the page. This paper proposes a model for micro-managing the tracking activities by fine-tuning the mining from the page level to the segment level. The proposed model enables the web-master to identify the segments which receives more focus from users comparing with others. The segment level analytics of user actions provides an importan...

  16. Discovering author impact: A PageRank perspective

    CERN Document Server

    Yan, Erjia

    2010-01-01

    This article provides an alternative perspective for measuring author impact by applying PageRank algorithm to a coauthorship network. A weighted PageRank algorithm considering citation and coauthorship network topology is proposed. We test this algorithm under different damping factors by evaluating author impact in the informetrics research community. In addition, we also compare this weighted PageRank with the h-index, citation, and program committee (PC) membership of the International Society for Scientometrics and Informetrics (ISSI) conferences. Findings show that this weighted PageRank algorithm provides reliable results in measuring author impact.

  17. Machine learning approach for automatic quality criteria detection of health web pages.

    Science.gov (United States)

    Gaudinat, Arnaud; Grabar, Natalia; Boyer, Célia

    2007-01-01

    The number of medical websites is constantly growing [1]. Owing to the open nature of the Web, the reliability of information available on the Web is uneven. Internet users are overwhelmed by the quantity of information available on the Web. The situation is even more critical in the medical area, as the content proposed by health websites can have a direct impact on the users' well being. One way to control the reliability of health websites is to assess their quality and to make this assessment available to users. The HON Foundation has defined a set of eight ethical principles. HON's experts are working in order to manually define whether a given website complies with s the required principles. As the number of medical websites is constantly growing, manual expertise becomes insufficient and automatic systems should be used in order to help medical experts. In this paper we present the design and the evaluation of an automatic system conceived for the categorisation of medical and health documents according to he HONcode ethical principles. A first evaluation shows promising results. Currently the system shows 0.78 micro precision and 0.73 F-measure, with 0.06 errors.

  18. Introducing Considerations in the Translation of Chinese Medicine

    Institute of Scientific and Technical Information of China (English)

    Sonya E.Pritzker; Ka-Kit Hui

    2014-01-01

    This article introduces the document,Considerations in the Translation of Chinese Medicine, published in PDF form online in both Chinese and English. This 20-page document includes several sections describing why theConsiderations is necessary, the speciifcity of texts in Chinese medicine; the history of translation in Chinese medicine; who constitutes an ideal translator of Chinese medicine; what types of language exist in Chinese medicine; and speciifc issues in the translation of Chinese medicine, such as domestication versus foreignization, technical terminology, period-speciifc language, style, polysemy, and etymological translation. The final section offers a brief advisory for consumers, and concludes with a call to further discussion, and action, specifically in the development of international collaborative efforts towards the creation of more rigorous guidelines for the translation of Chinese medicine. The current article provides an overview of several of these sections, and includes links to the original document.

  19. Nuclear proteasomes carry a constitutive posttranslational modification which derails SDS-PAGE (but not CTAB-PAGE).

    Science.gov (United States)

    Pitcher, David S; de Mattos-Shipley, Kate; Wang, Ziming; Tzortzis, Konstantinos; Goudevenou, Katerina; Flynn, Helen; Bohn, Georg; Rahemtulla, Amin; Roberts, Irene; Snijders, Ambrosius P; Karadimitris, Anastasios; Kleijnen, Maurits F

    2014-12-01

    We report that subunits of human nuclear proteasomes carry a previously unrecognised, constitutive posttranslational modification. Subunits with this modification are not visualised by SDS-PAGE, which is used in almost all denaturing protein gel electrophoresis. In contrast, CTAB-PAGE readily visualises such modified subunits. Thus, under most experimental conditions, with identical samples, SDS-PAGE yielded gel electrophoresis patterns for subunits of nuclear proteasomes which were misleading and strikingly different from those obtained with CTAB-PAGE. Initial analysis indicates a novel modification of a high negative charge with some similarity to polyADP-ribose, possibly explaining compatibility with (positively-charged) CTAB-PAGE but not (negatively-charged) SDS-PAGE and providing a mechanism for how nuclear proteasomes may interact with chromatin, DNA and other nuclear components.

  20. The development of compact electroactive polymer actuators suitable for use in full page Braille displays

    Science.gov (United States)

    Gorny, Lee J.; Zellers, Brian C.; Lin, Minren; Liu, Sheng; Zhang, Qiming M.

    2010-04-01

    Piezoceramic actuators, presently used in commercial Braille displays, are limited by the material's relatively small strain and brittle nature. For this reason, it is a challenge to develop full page, compact, graphic Braille displays that are affordable. A newly developed material composed of P(VDF-TrFE-CFE) terpolymer blended with 5% P(VDF-CTFE) electrostrictive actuators exhibits large strains (~5% at 150V/μm), fast actuation (>5 mm/s), and has a relatively high elastic modulus (1.2 GPa). This material exhibits more than double the elastic energy density and a 50% higher modulus of the original electrostrictive terpolymer. Hence, the potential for viable actuators in compact, full page Braille displays is greater than ever, provided actuators can be manufactured reliably in quantity. This talk presents recent work in scaling production of such rolled actuators. Actuators extend .5 mm, are confined to the 2.5 mm grid spacing of conventional Braille text, generate >0.5 N force and operate at less than 200V, thus meeting the primary requirements for a commercialized Braille display. To manufacture these actuators, cast films are stretched using a roll-to-roll zone drawing machine that is capable of producing quantities of 2 μm thick film with high quality. What follows is a discussion of this machine, the roll-to-roll film stretching process and an assessment of the resulting stretched film for use as linear strain actuators, like those used in our Braille cell.

  1. 基于特征比较和最大熵模型的统计机器翻译错误检测%Error Detection for Statistical Machine Translation Based on Feature Comparison and Maximum Entropy Model Classifier

    Institute of Scientific and Technical Information of China (English)

    杜金华; 王莎

    2013-01-01

    首先介绍3种典型的用于翻译错误检测和分类的单词后验概率特征,即基于固定位置的词后验概率、基于滑动窗的词后验概率和基于词对齐的词后验概率,分析其对错误检测性能的影响;然后,将其分别与语言学特征如词性、词及由LG句法分析器抽取的句法特征等进行组合,利用最大熵分类器预测翻译错误,并在汉英NIST数据集上进行实验验证和比较.实验结果表明,不同的单词后验概率对分类错误率的影响是显著的,并且在词后验概率基础上加入语言学特征的组合特征可以显著降低分类错误率,提高译文错误预测性能.%The authors firstly introduce three typical word posterior probabilities (WPP) for error detection and classification, which are fixed position WPP, sliding window WPP, and alignment-based WPP, and analyzes their impact on the detection performance. Then each WPP feature is combined with three linguistic features (Word, POS and LG Parsing knowledge) over the maximum entropy classifier to predict the translation errors. Experimental results on Chinese-to-English NIST datasets show that the influences of different WPP features on the classification error rate (CER) are significant, and the combination of WPP with linguistic features can significantly reduce the CER and improve the prediction capability of the classifier.

  2. Word Translation Prediction for Morphologically Rich Languages with Bilingual Neural Networks

    NARCIS (Netherlands)

    Tran, K.; Bisazza, A.; Monz, C.

    2014-01-01

    Translating into morphologically rich languages is a particularly difficult problem in machine translation due to the high degree of inflectional ambiguity in the target language, often only poorly captured by existing word translation models. We present a general approach that exploits source-side

  3. Mutual Disambiguation of Eye Gaze and Speech for Sight Translation and Reading

    DEFF Research Database (Denmark)

    Kulkarni, Rucha; Jain, Kritika; Bansal, Himanshu;

    Researchers are proposing interactive machine translation as a potential method to make language translation process more efficient and usable. Introduction of different modalities like eye gaze and speech are being explored to add to the interactivity of language translation system. Unfortunately...

  4. Towards improving English-Latvian translation: a system comparison and a new rescoring feature

    NARCIS (Netherlands)

    Khalilov, M.; Fonollosa, J.A.R.; Skadiņa, I.; Brālītis, E.; Pretkalniņa, L.; Calzolari, N.; Choukri, K.; Maegaard, B.; Mariani, J.; Odijk, J.; Piperidis, S.; Rosner, M.; Tapias, D.

    2010-01-01

    Translation into the languages with relatively free word order has received a lot less attention than translation into fixed word order languages (English), or into analytical languages (Chinese). At the same time this translation task is found among the most difficult challenges for machine transla

  5. English-Latvian SMT: the challenge of translating into a free word order language

    NARCIS (Netherlands)

    Khalilov, M.; Fonollosa, J.A.R.; Skadiņa, I.; Brālītis, E.; Pretkalniņa, L.

    2010-01-01

    This paper presents a comparative study of two approaches to statistical machine translation (SMT) and their application to a task of English-to-Latvian translation, which is still an open research line in the field of automatic translation. We consider a state-of-the-art phrase-based SMT and an alt

  6. Embedding Web-Based Statistical Translation Models in Cross-Language Information Retrieval

    NARCIS (Netherlands)

    Kraaij, W.; Nie, J.Y.; Simard, M.

    2003-01-01

    Although more and more language pairs are covered by machine translation (MT) services, there are still many pairs that lack translation resources. Cross-language information retrieval (CUR) is an application that needs translation functionality of a relatively low level of sophistication, since

  7. Embedding Web-Based Statistical Translation Models in Cross-Language Information Retrieval

    NARCIS (Netherlands)

    Kraaij, W.; Nie, J.Y.; Simard, M.

    2003-01-01

    Although more and more language pairs are covered by machine translation (MT) services, there are still many pairs that lack translation resources. Cross-language information retrieval (CUR) is an application that needs translation functionality of a relatively low level of sophistication, since cur

  8. Mutual Disambiguation of Eye Gaze and Speech for Sight Translation and Reading

    DEFF Research Database (Denmark)

    Kulkarni, Rucha; Jain, Kritika; Bansal, Himanshu

    2013-01-01

    Researchers are proposing interactive machine translation as a potential method to make language translation process more efficient and usable. Introduction of different modalities like eye gaze and speech are being explored to add to the interactivity of language translation system. Unfortunatel...

  9. Automation of printing machine

    OpenAIRE

    Sušil, David

    2016-01-01

    Bachelor thesis is focused on the automation of the printing machine and comparing the two types of printing machines. The first chapter deals with the history of printing, typesettings, printing techniques and various kinds of bookbinding. The second chapter describes the difference between sheet-fed printing machines and offset printing machines, the difference between two representatives of rotary machines, technological process of the products on these machines, the description of the mac...

  10. Machine and Woodworking Tool Safety. Module SH-24. Safety and Health.

    Science.gov (United States)

    Center for Occupational Research and Development, Inc., Waco, TX.

    This student module on machine and woodworking tool safety is one of 50 modules concerned with job safety and health. This module discusses specific practices and precautions concerned with the efficient operation and use of most machine and woodworking tools in use today. Following the introduction, 13 objectives (each keyed to a page in the…

  11. On Collocations and Their Interaction with Parsing and Translation

    Directory of Open Access Journals (Sweden)

    Violeta Seretan

    2013-10-01

    Full Text Available We address the problem of automatically processing collocations—a subclass of multi-word expressions characterized by a high degree of morphosyntactic flexibility—in the context of two major applications, namely, syntactic parsing and machine translation. We show that parsing and collocation identification are processes that are interrelated and that benefit from each other, inasmuch as syntactic information is crucial for acquiring collocations from corpora and, vice versa, collocational information can be used to improve parsing performance. Similarly, we focus on the interrelation between collocations and machine translation, highlighting the use of translation information for multilingual collocation identification, as well as the use of collocational knowledge for improving translation. We give a panorama of the existing relevant work, and we parallel the literature surveys with our own experiments involving a symbolic parser and a rule-based translation system. The results show a significant improvement over approaches in which the corresponding tasks are decoupled.

  12. Production Machine Shop Employment Competencies. Part Three: The Engine Lathe.

    Science.gov (United States)

    Bishart, Gus; Werner, Claire

    Competencies for production machine shop are provided for the third of four topic areas: the engine lathe. Each competency appears in a one-page format. It is presented as a goal statement followed by one or more "indicator" statements, which are performance objectives describing an ability that, upon attainment, will establish…

  13. Gigantic particle collision machine does "mini Big Bangs"

    CERN Multimedia

    2007-01-01

    "The world's largest machine is reputed to be the Large Hadron Collider (LHC) at CERN in Geneva and everything about it is big. Designed to carry out high energy particle collisions, when completed next year, one of the collider's experiments includes a 10'000 ton detector." (1/2 page)

  14. Corpora and Translator

    Institute of Scientific and Technical Information of China (English)

    傅丽莉

    2010-01-01

    @@ 1 Introduction Recent years have witnessed a significant growth of corpora heat. One of the many fields where corpora have a growing impact is translation, both at a descriptive and a practical level. Chesterman pointed out that the focus of Translation Studies shifted from translation itself to translators, from regulative to descriptive studies, from philosophical to empirical studies (Chesterman, 1998).

  15. Sound Effects in Translation

    DEFF Research Database (Denmark)

    Mees, Inger M.; Dragsted, Barbara; Gorm Hansen, Inge

    2015-01-01

    implications for the curriculum, (3) be pedagogically motivating, and (4) prepare students for employing translation technology in their future practice as translators. In a two-phase study in which 14 MA students translated texts in three modalities (sight, written, and oral translation using an SR program...

  16. On translation equivalence

    Institute of Scientific and Technical Information of China (English)

    石雏凤

    2009-01-01

    Nida's translation theories, especially for his "Dynamic equivalence theory", are highly praised and adopted in Chinese translation circle. Howev-er, there are a lot of criticism and misunderstanding at the same time. This paper explores the issue on translation equivalence so as to benefit our translation studies on both theory and practice level.

  17. Machine musicianship

    Science.gov (United States)

    Rowe, Robert

    2002-05-01

    The training of musicians begins by teaching basic musical concepts, a collection of knowledge commonly known as musicianship. Computer programs designed to implement musical skills (e.g., to make sense of what they hear, perform music expressively, or compose convincing pieces) can similarly benefit from access to a fundamental level of musicianship. Recent research in music cognition, artificial intelligence, and music theory has produced a repertoire of techniques that can make the behavior of computer programs more musical. Many of these were presented in a recently published book/CD-ROM entitled Machine Musicianship. For use in interactive music systems, we are interested in those which are fast enough to run in real time and that need only make reference to the material as it appears in sequence. This talk will review several applications that are able to identify the tonal center of musical material during performance. Beyond this specific task, the design of real-time algorithmic listening through the concurrent operation of several connected analyzers is examined. The presentation includes discussion of a library of C++ objects that can be combined to perform interactive listening and a demonstration of their capability.

  18. FACTORS REGULATING LIBERAL TRANSLATION

    Institute of Scientific and Technical Information of China (English)

    龚海红

    2012-01-01

    Literal translation and liberal translation are two important methods and both play key roles in translation.However,some textbooks say that most translations are literal translations while others maintain most are liberal ones,besides,some others suggest a combination of the two.This paper focuses on the facts that regulate liberal translation.Because of the differences in culture,society,history,geography,and so on,there exists a great difference between Chinese language and English language,which does naturally lead to the liberal translation.

  19. JavaScript: Convenient Interactivity for the Class Web Page.

    Science.gov (United States)

    Gray, Patricia

    This paper shows how JavaScript can be used within HTML pages to add interactive review sessions and quizzes incorporating graphics and sound files. JavaScript has the advantage of providing basic interactive functions without the use of separate software applications and players. Because it can be part of a standard HTML page, it is…

  20. Toward a User-Centered Academic Library Home Page

    Science.gov (United States)

    McHale, Nina

    2008-01-01

    In the past decade, academic libraries have struggled with the design of an effective library home page. Since librarians' mental models of information architecture differ from those of their patrons, usability assessments are necessary in designing a user-centered home page. This study details a usability sequence of card sort and paper and…

  1. Dynamic Web Pages: Performance Impact on Web Servers.

    Science.gov (United States)

    Kothari, Bhupesh; Claypool, Mark

    2001-01-01

    Discussion of Web servers and requests for dynamic pages focuses on experimentally measuring and analyzing the performance of the three dynamic Web page generation technologies: CGI, FastCGI, and Servlets. Develops a multivariate linear regression model and predicts Web server performance under some typical dynamic requests. (Author/LRW)

  2. ONTOPARK: ONTOLOGY BASED PAGE RANKING FRAMEWORK USING RESOURCE DESCRIPTION FRAMEWORK

    Directory of Open Access Journals (Sweden)

    S. Yasodha

    2014-01-01

    Full Text Available Traditional search engines like Google and Yahoo fail to rank the relevant information for users’ query. This is because such search engines rely on keywords for searching and they fail to consider the semantics of the query. More sophisticated methods that do provide the relevant information for the query is the need of the time. The Semantic Web that stores metadata as ontology could be used to solve this problem. The major drawback of the PageRank algorithm of Google is that ranking is based not only on the page ranks produced but also on the number of hits to the Web page. This paved way for illegitimate means of boosting page ranks. As a result, Web pages whose page rank is zero are also ranked in top-order. This drawback of PageRank algorithm motivated us to contribute to the Web community to provide semantic search results. So we propose ONTOPARK, an ontology based framework for ranking Web pages. The proposed framework combines the Vector Space Model of Information Retrieval with Ontology. The framework constructs semantically annotated Resource Description Framework (RDF files which form the RDF knowledgebase for each query. The proposed framework has been evaluated by two measures, precision and recall. The proposed framework improves the precision of both single-word and multi-word queries which infer that replacing Web database by semantic knowledgebase will definitely improve the quality of search. The surfing time of the surfers will also be minimized.

  3. Evaluating Information Quality: Hidden Biases on the Children's Web Pages

    Science.gov (United States)

    Kurubacak, Gulsun

    2006-01-01

    As global digital communication continues to flourish, the Children's Web pages become more critical for children to realize not only the surface but also breadth and deeper meanings in presenting these milieus. These pages not only are very diverse and complex but also enable intense communication across social, cultural and political…

  4. An Analysis of Academic Library Web Pages for Faculty

    Science.gov (United States)

    Gardner, Susan J.; Juricek, John Eric; Xu, F. Grace

    2008-01-01

    Web sites are increasingly used by academic libraries to promote key services and collections to teaching faculty. This study analyzes the content, location, language, and technological features of fifty-four academic library Web pages designed especially for faculty to expose patterns in the development of these pages.

  5. Project Management - Development of course materiale as WEB pages

    DEFF Research Database (Denmark)

    Thorsteinsson, Uffe; Bjergø, Søren

    1997-01-01

    Development of Internet pages with lessons plans, slideshows, links, conference system and interactive student section for communication between students and to teacher as well.......Development of Internet pages with lessons plans, slideshows, links, conference system and interactive student section for communication between students and to teacher as well....

  6. TSMC: A Novel Approach for Live Virtual Machine Migration

    Directory of Open Access Journals (Sweden)

    Jiaxing Song

    2014-01-01

    Full Text Available Cloud computing attracted more and more attention in recent years, and virtualization technology is the key point for deploying infrastructure services in cloud environment. It allows application isolation and facilitates server consolidation, load balancing, fault management, and power saving. Live virtual machine migration can effectively relocate virtual resources and it has become an important management method in clusters and data centers. Existing precopy live migration approach has to iteratively copy redundant memory pages; another postcopy live migration approach would lead to a lot of page faults and application degradation. In this paper, we present a novel approach called TSMC (three-stage memory copy for live virtual machine migration. In TSMC, memory pages only need to be transmitted twice at most and page fault just occurred in small part of dirty pages. We implement it in Xen and compare it with Xen’s original precopy approach. The experimental results under various memory workloads show that TSMC approach can significantly reduce the cumulative migration time and total pages transferred and achieve better network IO performance in the same time.

  7. Resolving Acetylated and Phosphorylated Proteins by Neutral Urea Triton-Polyacrylamide Gel Electrophoresis, NUT-PAGE

    Science.gov (United States)

    Buehl, Christopher J.; Deng, Xiexiong; Liu, Mengyu; Hovde, Stacy; Xu, Xinjing; Kuo, Min-Hao

    2014-01-01

    Protein acetylation and phosphorylation can be key modifications that regulate both normal and pathological protein functions. Current gel systems used to analyze modified proteins require either expensive reagents or time–consuming second dimension electrophoresis. In this manuscript, we present a neutral pH gel system that allows the analysis of acetylated and phosphorylated proteins. This neutral pH urea Triton-polyacrylamide gel electrophoresis system, or NUT-PAGE, separates proteins based on their charge at pH 7 and generates discrete bands from each acetylated and phosphorylated species. In addition, the gel is composed of common and inexpensive laboratory reagents, and requires only a single dimension of electrophoresis. We are able to demonstrate the effectiveness of this system by analyzing phosphorylated species of an acidic protein, α-synuclein, and both acetylated and phosphorylated species of a basic protein, histone H3. NUT-PAGE thus provides a cost-effective alternative to resolving acetylated and phosphorylated proteins, and potentially proteins with other post-translational modifications that alter net charge. Method Summary Here we present a single-dimension neutral pH urea Triton-polyacrylamide gel electrophoresis (NUT-PAGE) system affording high-resolution separation of acetylated and phosphorylated proteins. PMID:25109292

  8. Some Major Steps to Translation and Translator

    Directory of Open Access Journals (Sweden)

    Mohammad Reza Hojat Shamami

    2012-09-01

    Full Text Available This work is an overview of the main issues at the core of theorizing translation practice and the features of a good translator or how to be a good translator and of course what are the Skills to become a Freelance Translator and Translation process. In this world of science and technology there is knowledge explosion every day. This knowledge which is generally written in the English language needs to be transmitted in various languages so that people who do not know how to speak and write the original language can get the knowledge necessary for industrial development and technological innovation to keep up with the rest of the world. To transmit this knowledge effectively, there is a need for competent translators in various languages.

  9. PageRank model of opinion formation on social networks

    Science.gov (United States)

    Kandiah, Vivek; Shepelyansky, Dima L.

    2012-11-01

    We propose the PageRank model of opinion formation and investigate its rich properties on real directed networks of the Universities of Cambridge and Oxford, LiveJournal, and Twitter. In this model, the opinion formation of linked electors is weighted with their PageRank probability. Such a probability is used by the Google search engine for ranking of web pages. We find that the society elite, corresponding to the top PageRank nodes, can impose its opinion on a significant fraction of the society. However, for a homogeneous distribution of two opinions, there exists a bistability range of opinions which depends on a conformist parameter characterizing the opinion formation. We find that the LiveJournal and Twitter networks have a stronger tendency to a totalitarian opinion formation than the university networks. We also analyze the Sznajd model generalized for scale-free networks with the weighted PageRank vote of electors.

  10. The ICAP (Interactive Course Assignment Pages Publishing System

    Directory of Open Access Journals (Sweden)

    Kim Griggs

    2008-03-01

    Full Text Available The ICAP publishing system is an open source custom content management system that enables librarians to easily and quickly create and manage library help pages for course assignments (ICAPs, without requiring knowledge of HTML or other web technologies. The system's unique features include an emphasis on collaboration and content reuse and an easy-to-use interface that includes in-line help, simple forms and drag and drop functionality. The system generates dynamic, attractive course assignment pages that blend Web 2.0 features with traditional library resources, and makes the pages easier to find by providing a central web page for the course assignment pages. As of December 2007, the code is available as free, open-source software under the GNU General Public License.

  11. Collective Behaviour Learning :A Concept For Filtering Web Pages

    Directory of Open Access Journals (Sweden)

    G. Mercy Bai

    2014-03-01

    Full Text Available The rapid growth of the WWW poses unprecedented challenges for general purpose crawlers and search engines. The Former technique used to crawl web pages was FOCUS (Forum Crawler Under Supervision.This project presents a collective behavior learning algorithm for web crawling. The collective behavior learning algorithm crawl the web pages based on particular keyword. Discriminative learning extracts only the related URL of the particular keyword based on filtering. The goal of this project is to crawl relevant forum content from the web with minimal overhead. The unwanted URL is removed from the web pages and the web page crawling is reduced by using the collective behavior learning. The web pages must be extracted based on certain learning techniques and can be used to collect the unwanted URL’S.

  12. An Improved Approach to the PageRank Problems

    Directory of Open Access Journals (Sweden)

    Yue Xie

    2013-01-01

    Full Text Available We introduce a partition of the web pages particularly suited to the PageRank problems in which the web link graph has a nested block structure. Based on the partition of the web pages, dangling nodes, common nodes, and general nodes, the hyperlink matrix can be reordered to be a more simple block structure. Then based on the parallel computation method, we propose an algorithm for the PageRank problems. In this algorithm, the dimension of the linear system becomes smaller, and the vector for general nodes in each block can be calculated separately in every iteration. Numerical experiments show that this approach speeds up the computation of PageRank.

  13. Cultural Context and Translation

    Institute of Scientific and Technical Information of China (English)

    张敏

    2009-01-01

    cultural context plays an important role in translation. Because translation is a cross-culture activity, the culture context that influ-ences translating is consisted of both the culture contexts of source language and target language. This article firstly analyzes the concept of context and cultural context, then according to the procedure of translating classifies cultural context into two stages and talks about how they respectively influence translating.

  14. Stylistic Requirement for Translation

    Institute of Scientific and Technical Information of China (English)

    HOU Yin-zhen

    2016-01-01

    Stylistic research is subordinate to language use research. The fast maturing modern stylistics has greatly boosted trans-lation studies. Translation has a close relationship with stylistics. Many problems can be solved in translation practice by stylis-tic theories and analysis methods. Based on a brief introduction of stylistics and the relationship between stylistics and transla-tion, this paper will give a specific analysis of the stylistic requirement for translation.

  15. In vitro translation of androgen receptor cRNA results in an activated androgen receptor protein

    NARCIS (Netherlands)

    G.G.J.M. Kuiper (George); P.E. de Ruiter (Petra); J. Trapman (Jan); G.W. Jenster (Guido); A.O. Brinkmann (Albert)

    1993-01-01

    textabstractTranslation of androgen receptor (AR) cRNA in a reticulocyte lysate and subsequent analysis of the translation products by SDS/PAGE showed a protein with an apparent molecular mass of 108 kDa. Scatchard-plot analysis revealed a single binding component with

  16. Dialog between a Lexicographer and a Translator

    Directory of Open Access Journals (Sweden)

    Mark Kit

    2015-06-01

    Full Text Available Dialog between a Lexicographer and a Translator The discussion between the authors of the paper concerns the most pressing issues encountered in natural language semantics, as well as in corpus linguistics and computational linguistics. A broad range of knowledge, allowing linguists and information scientists to work together, is required in these areas. The paper describes some primary problems of human and machine translation caused by gaps between different fields of knowledge. The authors suggest that interdisciplinary approach is required when it comes to contrastive studies in linguistics.

  17. The 'Calvinist Patriarch' Cyril Lucaris and His Bible Translations

    OpenAIRE

    Khokhar, Antony John

    2015-01-01

    Cyril Lucaris’ Bible translation is a curious case from the Middle Ages. This article attempts to bring to the fore Lucaris’ efforts in translating the Bible and its ensuing aftermath. We begin by unfolding a few pages from the life of Lucaris in order to situate him in the context of his so-called grand endeavour. Our paper then concentrates on his teachings on the Bible. Here focus is put on his initiatives to translate the Bible into Modern Greek, a language of the masses. This enormous ta...

  18. Chapter 16: text mining for translational bioinformatics.

    Directory of Open Access Journals (Sweden)

    K Bretonnel Cohen

    2013-04-01

    Full Text Available Text mining for translational bioinformatics is a new field with tremendous research potential. It is a subfield of biomedical natural language processing that concerns itself directly with the problem of relating basic biomedical research to clinical practice, and vice versa. Applications of text mining fall both into the category of T1 translational research-translating basic science results into new interventions-and T2 translational research, or translational research for public health. Potential use cases include better phenotyping of research subjects, and pharmacogenomic research. A variety of methods for evaluating text mining applications exist, including corpora, structured test suites, and post hoc judging. Two basic principles of linguistic structure are relevant for building text mining applications. One is that linguistic structure consists of multiple levels. The other is that every level of linguistic structure is characterized by ambiguity. There are two basic approaches to text mining: rule-based, also known as knowledge-based; and machine-learning-based, also known as statistical. Many systems are hybrids of the two approaches. Shared tasks have had a strong effect on the direction of the field. Like all translational bioinformatics software, text mining software for translational bioinformatics can be considered health-critical and should be subject to the strictest standards of quality assurance and software testing.

  19. Chapter 16: text mining for translational bioinformatics.

    Science.gov (United States)

    Cohen, K Bretonnel; Hunter, Lawrence E

    2013-04-01

    Text mining for translational bioinformatics is a new field with tremendous research potential. It is a subfield of biomedical natural language processing that concerns itself directly with the problem of relating basic biomedical research to clinical practice, and vice versa. Applications of text mining fall both into the category of T1 translational research-translating basic science results into new interventions-and T2 translational research, or translational research for public health. Potential use cases include better phenotyping of research subjects, and pharmacogenomic research. A variety of methods for evaluating text mining applications exist, including corpora, structured test suites, and post hoc judging. Two basic principles of linguistic structure are relevant for building text mining applications. One is that linguistic structure consists of multiple levels. The other is that every level of linguistic structure is characterized by ambiguity. There are two basic approaches to text mining: rule-based, also known as knowledge-based; and machine-learning-based, also known as statistical. Many systems are hybrids of the two approaches. Shared tasks have had a strong effect on the direction of the field. Like all translational bioinformatics software, text mining software for translational bioinformatics can be considered health-critical and should be subject to the strictest standards of quality assurance and software testing.

  20. Translation-coupling systems

    Science.gov (United States)

    Pfleger, Brian; Mendez-Perez, Daniel

    2013-11-05

    Disclosed are systems and methods for coupling translation of a target gene to a detectable response gene. A version of the invention includes a translation-coupling cassette. The translation-coupling cassette includes a target gene, a response gene, a response-gene translation control element, and a secondary structure-forming sequence that reversibly forms a secondary structure masking the response-gene translation control element. Masking of the response-gene translation control element inhibits translation of the response gene. Full translation of the target gene results in unfolding of the secondary structure and consequent translation of the response gene. Translation of the target gene is determined by detecting presence of the response-gene protein product. The invention further includes RNA transcripts of the translation-coupling cassettes, vectors comprising the translation-coupling cassettes, hosts comprising the translation-coupling cassettes, methods of using the translation-coupling cassettes, and gene products produced with the translation-coupling cassettes.