Báo cáo khoa học: "Using Noisy Bilingual Data for Statistical Machine Translation" pot
... Using Noisy Bilingual Data for Statistical Machine Translation Stephan Vogel Interactive Systems Lab Language Technologies ... translation model trained on bilingual data and a language model for the target language, trained on perhaps some larger monolingual data. Often the amount of clean parallel data is limited. This leads ... (Vogel et al., 1996) were trained for...
Ngày tải lên: 24/03/2014, 03:20
... all such subtrees for a hypothetical T and ¯ f h 1 . Fortunately, with a little analysis that ac- counts for ¯ f h+1 , we can show that at most two sub- trees need to be checked. For a given interruption-free ¯ f h 1 , ... constraint for word alignment. Therefore, we propose a soft version of our cohesion constraint. We perform our interruption check, but we do not invalidate any hypo...
Ngày tải lên: 08/03/2014, 01:20
... sta- tistical machine translation whose novel contributions are (a) support for linguisti- cally motivated factors, (b) confusion net- work decoding, and (c) efficient data for- mats for translation ... decoder. 5 Efficient Data Structures for Transla- tion Model and Language Models With the availability of ever-increasing amounts of training data, it has become a challenge...
Ngày tải lên: 23/03/2014, 18:20
Báo cáo khoa học: "A Polynomial-Time Algorithm for Statistical Machine Translation" pot
... polynomial-time algorithm for statistical machine translation. This algorithm can be used in place of the expensive, slow best-first search strate- gies in current statistical translation ar- ... that exploits bracket- ing information (Wu and Ng, 1995). If any brackets for the Chinese sentence can be supplied as addi- tional input information, produced for example by a prepr...
Ngày tải lên: 31/03/2014, 06:20
Tài liệu Báo cáo khoa học: "Refined Lexicon Models for Statistical Machine Translation using a Maximum Entropy Approach" pptx
... Refined Lexicon Models for Statistical Machine Translation using a Maximum Entropy Approach Ismael Garc ´ ıa Varea Dpto. de Inform´atica Univ. de Castilla-La Mancha Campus ... the lexicon models used in statistical machine translation systems do not include any kind of linguistic or contextual information, which often leads to problems in performing a cor- rect word ... this problem...
Ngày tải lên: 20/02/2014, 18:20
Tài liệu Báo cáo khoa học: "A Localized Prediction Model for Statistical Machine Translation" ppt
... ACL, pages 557–564, Ann Arbor, June 2005. c 2005 Association for Computational Linguistics A Localized Prediction Model for Statistical Machine Translation Christoph Tillmann and Tong Zhang IBM ... @us.ibm.com Abstract In this paper, we present a novel training method for a localized phrase-based predic- tion model for statistical machine translation (SMT). The model predicts...
Ngày tải lên: 20/02/2014, 15:20
Tài liệu Báo cáo khoa học: "ADP based Search Algorithm for Statistical Machine Translation" docx
... experimental results for a bilingual cor- pus are reported. 1.1 Statistical Machine Translation In statistical machine translation, the goal of the search strategy can be formulated as follows: ... additional parameter into the recursion formula for DP. In the following, we will explain this method in detail. 2.3 Recursion Formula for DP In the DP formalism, the sear...
Ngày tải lên: 20/02/2014, 18:20
Báo cáo khoa học: "Minimum Error Rate Training in Statistical Machine Translation" potx
... criteria are, for example, F-Measure for parsing, mean average precision for ranked retrieval, and BLEU or multi-reference word error rate for statistical machine translation. The use of statistical ... Minimum Error Rate Training in Statistical Machine Translation Franz Josef Och Information Sciences Institute University of Southern California 4676 Admiralty Way, Suite 1001...
Ngày tải lên: 23/03/2014, 19:20
Báo cáo khoa học: "Enriching Morphologically Poor Languages for Statistical Machine Translation" doc
... re- structuring for statistical machine translation. In ACL ’05: Proceedings of the 43rd Annual Meeting on Asso- ciation for Computational Linguistics, pages 531–540, Morristown, NJ, USA. Association for ... suffers from the lack of information about its role in the sen- tence, making it hard to choose the right inflected forms. Our method is based on factored phrase-based statistic...
Ngày tải lên: 31/03/2014, 00:20
Báo cáo khoa học: "Continuous Space Language Models for Statistical Machine Translation" pdf
... because can it they we that can can can be be have be have be have it it has forgotten has forgotten has has forgotten forgotten been forgotten been forgotten forgotten . . forgotten . . . . . . Figure 1: Example of a translation ... amount of papers investigating new approaches to language modeling for statis- tical machine translation. Traditionally, statistical machine translation...
Ngày tải lên: 31/03/2014, 01:20