Báo cáo khoa học: "A Joint Rule Selection Model for Hierarch

Báo cáo khoa học: "A Joint Rule Selection Model for Hierarchical Phrase-based Translation" pptx

... China {dozhang,muli,mingzhou}@microsoft.com Abstract In hierarchical phrase-based SMT sys- tems, statistical models are integrated to guide the hierarchical rule selection for better translation performance. Previous work mainly focused on the selection ... the joint probability model into two sub-models based on the Bayes formulation, where the ﬁrst sub -model is source-si...

Ngày tải lên: 23/03/2014, 16:20

6 314 0

Tài liệu Báo cáo khoa học: "A Syntax-Driven Bracketing Model for Phrase-Based Translation" pptx

... VP on the right, therefore CBMF is “VP-RC”. 3.3 The Integration of the SDB Model into Phrase-Based SMT We integrate the SDB model into phrase-based SMT to help decoder perform syntax-driven phrase ... to the binary BTG rules. The SDB model, however, is not only limited to phrase-based SMT using BTG rules. Since it is applied on a source span each time, any other hierarchical phr...

Ngày tải lên: 20/02/2014, 07:20

9 438 0

Báo cáo khoa học: "A Joint Source-Channel Model for Machine Transliteration" doc

... source-channel model In view of the close coupling of the source and target transliteration units, we propose to estimate P(E,C) by a joint source-channel model, or n-gram transliteration model (TM). For ... 12,742 Table 1. Modeling statistics The most common metric for evaluating an n- gram model is the probability that the model assigns to test data, or perplexity (Je...

Ngày tải lên: 31/03/2014, 03:20

8 289 0

Báo cáo khoa học: "A Discriminative Latent Variable Model for Statistical Machine Translation" pdf

... model performs better than a maximum likelihood model; (3) how the performance of our model compares with a frequency count based hierarchical system; and (4) how translation performance ... decoding for the model trained on single derivations has only a small positive effect, while for the latent variable model the impact is much larger. 6 For example, our max-derivation...

Ngày tải lên: 23/03/2014, 17:20

9 291 0

Báo cáo khoa học: "A MARKOV LANGUAGE LEARNING MODEL FOR FINITE PARAMETER SPACES" pptx

... diachronic change remains a topic for fu- ture investigation. As far as we know, the possibility for formally modeling the kind of saltation indicated by the Markov model has not been noted previously ... sociation for Computational Linguistics. Pitts- burgh, PA: Association for Computational Linguis- tics, 243-251. Dresher, Elan and Kaye, Jonathan (1990). "A Compu- tat...

Ngày tải lên: 23/03/2014, 20:21

10 264 0

Báo cáo khoa học: "A Stacked Sub-Word Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" potx

... often provides important clues for POS tagging, and the POS tags contain much syntactic information, which need context information within a large window for disambiguation. For example, Huang et al. ... 1385–1394, Portland, Oregon, June 19-24, 2011. c 2011 Association for Computational Linguistics A Stacked Sub-Word Model for Joint Chinese Word Segmentation and Part-of-Speech...

Ngày tải lên: 17/03/2014, 00:20

10 412 0

Báo cáo khoa học: "A Progressive Feature Selection Algorithm for Ultra Large Feature Spaces" doc

... efficient PFS algorithm. At the beginning of each round for feature selection, a uniform prior distribution is always assumed for the new CME model. A more pre- cise description of the PFS algorithm ... unlimited feature spaces for conditional maximum entropy (CME) modeling. Experi- mental results in edit region identification demonstrate the benefits of the progressive feature...

Ngày tải lên: 17/03/2014, 04:20

8 388 0

Báo cáo khoa học: "A Trainable Rule-based Algorithm for Word Segmentation" pdf

... adopting it for word segmentation. For example, since word segmentation is merely a preprocessing task for a wide variety of further tasks such as parsing, information extraction, and information ... acquire the rules, rather than expensive manual knowledge engineering. The rules produced can be inspected, which is useful for gain- ing insight into the nature of the rule seq...

Ngày tải lên: 17/03/2014, 23:20

8 470 0

Tài liệu Báo cáo khoa học: "A Uniﬁed Syntactic Model for Parsing Fluent and Disﬂuent Speech∗" ppt

... Right-corner transform Binarized trees 2 are then transformed into right- corner trees using transform rules similar to those described by Johnson(1998a). This right-corner transform is simply the ... repair above, the well-formedness rule says that the repair is well formed if the frag- ment a ﬂight to Boston and to Denver is gram- matical. In this case the repair is well formed since the...

Ngày tải lên: 20/02/2014, 09:20

4 582 0

Tài liệu Báo cáo khoa học: "A Phrase-based Statistical Model for SMS Text Normalization" ppt

... (3) This is the basic function of the channel model for the phrase-based SMS normalization model, where we used the maximum approximation for the sum over all segmentations. Then we further ... normalization model consists of two sub-models: a word-based language model (LM), characterized by 1 (| ) nn P ee − ) k and a phrase- based lexical mapping model (channel model...

Ngày tải lên: 20/02/2014, 12:20

8 400 0