a maximum entropy segmentation model for statistical machine translation

Scientific report: "A Localized Prediction Model for Statistical Machine Translation" ppt

... paper, we present a block-based model for statistical machine translation. A block is a pair of phrases which are translations of each other. For example, Fig. 1 shows an Arabic-English translation ... Conference (HLT 04), pages 177–184, Boston, MA, May. Christoph Tillmann and Fei Xia. 2003. A Phrase-based Unigram Model for Statistical Machine Translation. In Companion Vol. of the Joint HLT and NAACL Conference ... generate a ‘translation graph’ for every input sentence using a procedure similar to (Ueffing et al., 2002): a translation graph is a compact way of representing candidate translations which are...

Uploaded: 20/02/2014, 15:20

8 578 0
Scientific report: "Refined Lexicon Models for Statistical Machine Translation using a Maximum Entropy Approach" pptx

... Lexicon Models for Statistical Machine Translation using a Maximum Entropy Approach Ismael García Varea Dpto. de Informática Univ. de Castilla-La Mancha Campus Universitario s/n 02071 Albacete, ... lexicon models for statistical machine translation by using maximum entropy models. We have been able to obtain a significant better test corpus perplexity and also a slight improvement in translation ... methods for incorporating information about the relative position of bilingual word pairs into a maximum entropy translation model. Other authors have applied this approach to language modeling...

Uploaded: 20/02/2014, 18:20

8 427 0
Scientific report: "Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information" doc

... 2007. Bilingual LSA-based adaptation for statistical machine translation. Machine Translation, pages 187-207. Nicola Ueffing, Gholamreza Haffari and Anoop Sarkar. 2008. Semi-supervised Model Adaptation for ... Statistical Machine Translation. Machine Translation, pages 77-94. Hua Wu, Haifeng Wang and Chengqing Zong. 2008. Domain Adaptation for Statistical Machine Translation with Domain Dictionary ... weblog. According to adaptation emphases, domain adaptation in SMT can be classified into translation model adaptation and language model adaptation. Here we focus on how to adapt a translation model, ...

Uploaded: 19/02/2014, 19:20

10 533 0
Scientific report: "A Ranking-based Approach to Word Reordering for Statistical Machine Translation" doc

... 271-279. A. Ramanathan, Pushpak Bhattacharyya, Jayprasad Hegde, Ritesh M. Shah and Sasikumar M. 2008. Simple syntactic and morphological processing can help English-Hindi Statistical Machine Translation. In ... integrated into a phrase-based decoder serving as additional distortion features. We evaluated our approach on large-scale Japanese-English and English-Japanese machine translation tasks, and ... ranking model is automatically derived from word aligned parallel data with a syntactic parser for source language based on both lexical and syntactical features. We evaluated our approach...

Uploaded: 19/02/2014, 19:20

9 616 0
Scientific report: "Unsupervised Search for The Optimal Segmentation for Statistical Machine Translation" doc

... key 6 anahtar + ımı anahtar + ımı my key (ACC.) 5 anahtarla anahtar + la with (the) key 4 anahtarı anahtar + ı 1 (the) key (ACC.); 2 his/her key 3 anahtarı + m anahtar + ım my key 3 anahtarı + n anahtar ... obtained after morphological preprocessing can improve the machine translation performance over a word-based system (Habash and Sadat, 2006; Oflazer and Durgar El-Kahlout, 2007; Bisazza and ... pages 93–110. MIT Press. Nizar Habash and Fatiha Sadat. 2006. Arabic preprocessing schemes for statistical machine translation. In Proc. of the HLT-NAACL, Companion Volume: Short Papers, pages...

Uploaded: 20/02/2014, 04:20

6 446 0
Scientific report: "Bilingually Motivated Domain-Adapted Word Segmentation for Statistical Machine Translation" pptx

... 2009. c 2009 Association for Computational Linguistics Bilingually Motivated Domain-Adapted Word Segmentation for Statistical Machine Translation Yanjun Ma Andy Way National Centre for Language Technology School ... on manu- ally segmented training data so that it can be automatically adapted for differ- ent domains. We evaluate the perfor- mance of our segmentation approach on PB-SMT tasks from two domains ... segmenters are usually trained on a manually segmented domain- specific corpus, which is not adapted for the spe- cific translation task at hand given that the manual segmentation is performed in a monolingual...

Ngày tải lên: 22/02/2014, 02:20

9 236 0
Scientific report: "Bilingual Sense Similarity for Statistical Machine Translation" ppt

... Sense Disambiguation Improves Statistical Machine Translation. In: Proceedings of ACL, Prague. D. Chiang. 2005. A hierarchical phrase-based model for statistical machine translation. In: Proceedings ... helpful for at least one multilingual application: statistical machine translation. Finally, although we described and evaluated bilingual sense similarity algorithms applied to a hierarchical ... Similarity for Statistical Machine Translation Boxing Chen, George Foster and Roland Kuhn National Research Council Canada 283 Alexandre-Taché Boulevard, Gatineau (Québec), Canada J8X 3X7...

Uploaded: 20/02/2014, 04:20

10 595 0
Scientific report: "Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation" docx

... Language, 22(3):295–312. Andreas Zollmann and Ashish Venugopal. 2006. Syntax augmented machine translation via chart parsing. In Proceedings of the NAACL-2006 Workshop on Statistical Machine Translation (WMT06), ... build machine translation evaluation sets. In Proceedings of the Workshop on Creating Speech and Language Data With Amazon’s Mechanical Turk, Los Angeles, California, June. Association for Computational ... annotated in LDC data Figure 11: Bucking the trend: performance of HNG-selected additional data from BBC web crawl data annotated via Amazon Mechanical Turk. y-axis measures BLEU. x-axis measures number...

Uploaded: 20/02/2014, 04:20

11 580 0
Scientific report: "Corpus Expansion for Statistical Machine Translation with Semantic Role Label Substitution Rules" doc

... of alternative translations of each source phrase (T/S) and the average source phrase length in the output (A. L.) -1.80 on average TER. An explanation is that using identical alignment model ... Dolan. 2004. Monolingual machine translation for paraphrase generation. In Proceedings of EMNLP 2004, pages 142–149, Barcelona, Spain, July. Association for Computational Linguistics. 298 ... North American Chapter of the Association of Computational Linguistics, HLT-NAACL ’06, pages 17–24. Chris Callison-Burch. 2008. Syntactic constraints on paraphrases extracted from parallel corpora....

Uploaded: 20/02/2014, 04:20

5 416 0
Scientific report: "A Syntax-Driven Bracketing Model for Phrase-Based Translation" pptx

... various language-pairs, one issue is that matching syntactic analysis cannot always guarantee a good translation, and violating syntactic structure does not always induce a bad translation. Marton and Resnik ... Reordering Model for Statistical Machine Translation. In Proceedings of ACL-COLING 2006. Deyi Xiong, Min Zhang, Aiti Aw, and Haizhou Li. 2008. Linguistically Annotated BTG for Statistical Machine Translation. ... Singapore, 2-7 August 2009. © 2009 ACL and AFNLP A Syntax-Driven Bracketing Model for Phrase-Based Translation Deyi Xiong, Min Zhang, Aiti Aw and Haizhou Li Human Language Technology Institute for...

Uploaded: 20/02/2014, 07:20

9 438 0
Scientific report: "Combination of Arabic Preprocessing Schemes for Statistical Machine Translation" ppt

... Council of Canada fatiha.sadat@cnrc-nrc.gc.ca Nizar Habash Center for Computational Learning Systems Columbia University habash@cs.columbia.edu Abstract Statistical machine translation is quite ... for Phrase-based Statistical Machine Translation Models. In Proc. of the Association for Machine Translation in the Americas (AMTA). P. Koehn. 2004b. Statistical Significance Tests for Machine Translation ... Association for Computational Linguistics (ACL), Ann Arbor, Michigan. N. Habash and F. Sadat. 2006. Arabic Preprocessing Schemes for Statistical Machine Translation. In Proc. of NAACL, Brooklyn,...

Uploaded: 20/02/2014, 11:21

8 295 0