Large-scale discriminative n-gram language models for statistical machine translation

Scientific report: "Refined Lexicon Models for Statistical Machine Translation using a Maximum Entropy Approach" pptx

... additional information could be: Simple context information: information of the words surrounding the word pair; Syntactic information: part-of-speech information, syntactic constituent, sentence mood; Semantic ... entropy approach is outlined in Section 3. 2 Statistical Machine Translation The goal of the translation process in statistical machine translation can be formulated as follows: A source language ... Tillmann, S. Vogel, H. Ney, and A. Zubiaga. 1997. A DP-based search using monotone alignments in statistical translation. In Proc. 35th Annual Conf. of the Association for Computational Linguistics, pages...

Scientific report: "Improved Smoothing for N-gram Language Models Based on Ordinary Counts" doc

... new method eliminating most of the gap between Kneser-Ney and those methods. 1 Introduction Statistical language models are potentially useful for any language technology task that produces natural-language ... is a single unknown probability distribution for the amount of quantization error in every N-gram count. If so, the total quantization error for a given context will tend to be proportional to ... 4-gram language models using English data from the WMT-06 Europarl corpus (Koehn and Monz, 2006). We took 1,003,349 sentences (27,493,499 words) for training, and 2000 sentences each for testing...

Scientific report: "Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation" docx

... Kaufmann, San Francisco, CA. Rion Snow, Brendan O’Connor, Daniel Jurafsky, and Andrew Ng. 2008. Cheap and fast – but is it good? evaluating non-expert annotations for natural language tasks. In ... solicits translations only for trigger n-grams and not for entire sentences. We provide sentential context, highlight the trigger n-gram that we want translated, and ask for a translation of just ... Wren Thornton, Jonathan Weese, and Omar Zaidan. 2009. Joshua: An open source toolkit for parsing-based machine translation. In Proceedings of the Fourth Workshop on Statistical Machine Translation, ...

Scientific report: "Incremental Syntactic Language Models for Phrase-based Translation" pptx

... 2009. Quadratic-time dependency parsing for machine translation. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language ... speech recognition, research in statistical machine translation has effectively used n-gram word sequence models as language models. Modern phrase-based translation using large scale n-gram language models ... Proceedings of the Ninth Machine Translation Summit of the International Association for Machine Translation. Ciprian Chelba and Frederick Jelinek. 1998. Exploiting syntactic structure for language...

Scientific report: "Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information" doc

... 2006 Evaluation. In Proc. of International Workshop on Spoken Language Translation, pages 103-110. Arne Mauser, Saša Hasan and Hermann Ney 2009. Extending Statistical Machine Translation with Discriminative ... Statistical Machine Translation. In Proc. of ACL 2006, pages 609-616. Yajuan Lv, Jin Huang and Qun Liu. 2007. Improving Statistical Machine Translation Performance by Training Data Selection and Optimization. ... obtain synthetic parallel sentences, and Wu et al. (2008) used an in-domain translation dictionary and monolingual corpora to adapt an out-of-domain translation model for the in-domain text. Differing...

Scientific report: "A Ranking-based Approach to Word Reordering for Statistical Machine Translation" doc

... Chunking. In Proc. CoNLL, pages 63-69. Young-Suk Lee, Bing Zhao and Xiaoqiang Luo. 2010. Constituent reordering and syntax models for English-to-Japanese statistical machine translation. In Proc. Coling. Chi-Ho ... can help English-Hindi Statistical Machine Translation. In Proc. IJCNLP. Roy Tromble. 2009. Search and Learning for the Linear Ordering Problem with an Application to Machine Translation. Ph.D. ... Subject-Object-Verb Languages. In Proc. HLT-NAACL, pages 376-384. Richard Zens and Hermann Ney. 2006. Discriminative Reordering Models for Statistical Machine Translation. In Proc. Workshop on Statistical Machine...

Scientific report: "Mixing Multiple Translation Models in Statistical Machine Translation" docx

... Ciprian Chelba, and Franz Och. 2010. Model combination for machine translation. In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for ... E. Hinton. 1999. Products of experts. In Artificial Neural Networks, 1999. ICANN 99. Ninth International Conference on (Conf. Publ. No. 470), volume 1, pages 1–6. Jing Jiang and ChengXiang Zhai. ... John Benjamins, Amsterdam/Philadelphia. Nicola Ueffing, Gholamreza Haffari, and Anoop Sarkar. 2007. Transductive learning for statistical machine translation. In Proceedings of the 45th Annual...

Scientific report: "Bilingual Sense Similarity for Statistical Machine Translation" ppt

... Association for Computational Linguistics Bilingual Sense Similarity for Statistical Machine Translation Boxing Chen, George Foster and Roland Kuhn National Research Council Canada 283 ... string consisting of terminal and non-terminal symbols. ~ defines a one-to-one correspondence between non-terminals in α and γ. 1 There has been a lot of work (more details in Section ... performance of Alg2 on Chinese-to-English NIST large data condition and German-to-English WMT task. We can see that IBM model 1 and cosine distance similarity function both obtained significant...

Scientific report: "Unsupervised Search for The Optimal Segmentation for Statistical Machine Translation" doc

... models for morpheme segmentation and morphology learning. ACM Transactions on Speech and Language Processing, 4(1):1–34. Sajib Dasgupta and Vincent Ng. 2007. High-performance, language-independent ... SRILM-an extensible language modeling toolkit. In Seventh International Conference on Spoken Language Processing, volume 3. David Talbot and Miles Osborne. 2006. Modelling lexical redundancy for machine ... Pre-Processing for Turkish to English Statistical Machine Translation. In Proc. of the International Workshop on Spoken Language Translation, pages 129–135, Tokyo, Japan. M.R. Brent. 1999. An efficient,...

Scientific report: "Corpus Expansion for Statistical Machine Translation with Semantic Role Label Substitution Rules" doc

... Koehn, and Miles Osborne. 2006. Improved statistical machine translation using paraphrases. In Proceedings of the main conference on Human Language Technology Conference of the North American ... phrase-based translation models, which can be used directly in translation tasks or combined with baseline models. Experimental results on Chinese-English machine translation tasks show an average ... and William Dolan. 2004. Monolingual machine translation for paraphrase generation. In Proceedings of EMNLP 2004, pages 142–149, Barcelona, Spain, July. Association for Computational Linguistics. 298 ...

