Báo cáo khoa học: "A Clustered Global Phrase Reordering Model for Statistical Machine Translation" docx

Báo cáo khoa học: "A Clustered Global Phrase Reordering Model for Statistical Machine Translation" docx

Báo cáo khoa học: "A Clustered Global Phrase Reordering Model for Statistical Machine Translation" docx

... the reordering model is that it can better model global phrase reordering using a four-valued reordering pattern, and it can be easily 718 Figure 5: BLEU score for the clustered and lexical reordering ... their model, the global reordering is not explicitly modeled. In this paper, we present a global reordering model that explicitly models long distance re- orde...

Ngày tải lên: 31/03/2014, 01:20

8 346 0
Báo cáo khoa học: "Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation" docx

Báo cáo khoa học: "Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation" docx

... novel reordering model for phrase- based statistical machine transla- tion (SMT) that uses a maximum entropy (MaxEnt) model to predicate reorderings of neighbor blocks (phrase pairs). The model ... pages 521–528, Sydney, July 2006. c 2006 Association for Computational Linguistics Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation D...

Ngày tải lên: 08/03/2014, 02:21

8 390 0
Báo cáo khoa học: "A Comparative Study on Reordering Constraints in Statistical Machine Translation" potx

Báo cáo khoa học: "A Comparative Study on Reordering Constraints in Statistical Machine Translation" potx

... statistical machine translation (Brown et al., 1990). It allows an independent modeling of tar- get language model P r (e I 1 ) and translation model P r(f J 1 |e I 1 ). The target language model ... re- orderings for the discussed constraints is listed as a function of the sentence length. We see that for longer sentences the ITG constraints allow for more reorderings than th...

Ngày tải lên: 17/03/2014, 06:20

8 410 0
Tài liệu Báo cáo khoa học: "Bilingually Motivated Domain-Adapted Word Segmentation for Statistical Machine Translation" pptx

Tài liệu Báo cáo khoa học: "Bilingually Motivated Domain-Adapted Word Segmentation for Statistical Machine Translation" pptx

... 2009. c 2009 Association for Computational Linguistics Bilingually Motivated Domain-Adapted Word Segmentation for Statistical Machine Translation Yanjun Ma Andy Way National Centre for Language Technology School ... LDC seg- menter 2 and Stanford segmenter version 2006-05- 11 3 . Both ICTCLAS and Stanford segmenters utilise machine learning techniques, with Hidden Markov Models...

Ngày tải lên: 22/02/2014, 02:20

9 236 0
Tài liệu Báo cáo khoa học: "A Discriminative Syntactic Word Order Model for Machine Translation" pdf

Tài liệu Báo cáo khoa học: "A Discriminative Syntactic Word Order Model for Machine Translation" pdf

... 94305 pichuan@stanford.edu Kristina Toutanova Microsoft Research Redmond, WA kristout@microsoft.com Abstract We present a global discriminative statistical word order model for machine translation. Our model ... 2006). In this paper we develop an order model for machine translation which makes use of both syntactic and surface information. The framework for our statistica...

Ngày tải lên: 20/02/2014, 12:20

8 404 0
Báo cáo khoa học: "A Modified Joint Source-Channel Model for Transliteration" ppt

Báo cáo khoa học: "A Modified Joint Source-Channel Model for Transliteration" ppt

... The joint source-channel model with linguistic knowledge (Model D) has not performed well in the Bengali-English machine transliteration whereas the trigram model (Model E) needs further attention ... as follows. The machine transliteration problem has been formulated under both noisy-channel model and joint source-channel model in Section 2. A number of transliteration m...

Ngày tải lên: 17/03/2014, 04:20

8 312 0
Báo cáo khoa học: "A Syllable Based Word Recognition Model for Korean Noun Extraction" potx

Báo cáo khoa học: "A Syllable Based Word Recognition Model for Korean Noun Extraction" potx

... syllable-tag sequence of the input sentence by using statistical information and extracts nouns by detecting the word boundaries. The statistical in- formation is automatically acquired from a POS ... suitable for the word recognition model. The corpus can be modified through the fol- lowing steps: Step 1 For a given Eojeol, segment word bound- aries and assign word tags to each word....

Ngày tải lên: 17/03/2014, 06:20

8 368 0
Báo cáo khoa học: "A Feature-Rich Constituent Context Model for Grammar Induction" doc

Báo cáo khoa học: "A Feature-Rich Constituent Context Model for Grammar Induction" doc

... grammars from corpora. In Workshop Notes for Statistically- Based NLP Techniques, AAAI, pages 1–13. David Chiang. 2005. A hierarchical phrase- based model for statistical machine translation. In Proceedings ... using a log-linear model: P t (y|x) = exp w, f xyt   y  exp  w, f xy  t  (1) This technique for parameter transformation was shown to be effective in unsupervised...

Ngày tải lên: 23/03/2014, 14:20

6 308 0
Báo cáo khoa học: "A PROPER TREATMEMT OF SYNTAX AND SEMANTICS IN MACHINE TRANSLATION" potx

Báo cáo khoa học: "A PROPER TREATMEMT OF SYNTAX AND SEMANTICS IN MACHINE TRANSLATION" potx

... in a unified form without being affected by the syntactic variety. (2) Semantic representation can play a pivotal role for language transformation and can provide a basis for constructing ... collection of phrases• A phrase itself is a collection of words• In order to restructure the collection of phrases, the processor must first of all attach some sorts of labels to the phra...

Ngày tải lên: 31/03/2014, 17:20

8 309 0
Báo cáo khoa học: "A Unified Single Scan Algorithm for Japanese Base Phrase Chunking and Dependency Parsing" pdf

Báo cáo khoa học: "A Unified Single Scan Algorithm for Japanese Base Phrase Chunking and Dependency Parsing" pdf

... Sassano, 2004) for bunsetsu-based parsers. We use the fol- lowing features for each morpheme: 1. major POS, minor POS, conjugation type, conjugation form, surface form (lexicalized form) 2. Content ... code for base phrase chunking and dependency parsing. 4.2 Pseudo Code for the Proposed Algorithm The algorithm that we propose is based on (Sas- sano, 2004), which is considered to be...

Ngày tải lên: 17/03/2014, 02:20

4 287 0
w