Scientific report: "Log-linear Models for Word Alignment" ppt

Scientific report: "Hierarchical Search for Word Alignment" ppt

... worse to align two English words at different ends of the tree to the same foreign word, than it is to align two English words under the same NP to the same foreign word. To see why a string distance ... and Michael I. Jordan. 2006. Word alignment via Quadratic Assignment. In Proceedings of HLT-EMNLP. New York, NY. USA. Yang Liu, Qun Liu, and Shouxun Lin. 2005. Log-linear Models for...

Uploaded: 23/03/2014, 16:20

Scientific report: "Topic Models for Word Sense Disambiguation and Token-based Idiom Detection" pdf

... detailed information may not be available, for instance for languages for which such a resource does not exist or for expressions that are not very well covered in WordNet, such as idioms. For those ... topic-document vectors (one for the sense and one for the context). We apply these models to coarse- and fine-grained WSD and find that they outperform comparable systems fo...

Uploaded: 23/03/2014, 16:20

Scientific report: "Data Cleaning for Word Alignment" pdf

... mechanism to augment one source word into several source words or delete a source word, while a NULL insertion is a mechanism of generating several words from blank words. Fertility uses a conditional ... score S W B,X for each pair of sentences where X is 4, 3, 2, and 1 for word-based MT decoder. Step 3: Train phrase-based MT for full parallel corpus. Note that we do not need to...

Uploaded: 08/03/2014, 01:20

Scientific report: "Combining Clues for Word Alignment" pdf

... source language word per row and one target language word per column. The cells inside the matrix can be filled with the combined clue values for the corresponding word pairs. Henceforth, this ... 0.86 0 The matrix is simply filled with all values of combined clues for each word pair. For example, the total clue value for the word pair s = "baggage" and t = "...

Uploaded: 08/03/2014, 21:20

Document: Scientific report: "Topic Models for Dynamic Translation Model Adaptation" pptx

... Meeting of the Association for Computational Linguistics, pages 115–119, Jeju, Republic of Korea, 8-14 July 2012. © 2012 Association for Computational Linguistics Topic Models for Dynamic Translation ... explicitly smooth the resulting p_s(e|f), since many word pairs will be unseen for a given domain s, we are already performing an implicit form of smoothing (when computing the...

Uploaded: 19/02/2014, 19:20

Scientific report: "Distortion Models For Statistical Machine Translation" doc

... tied to absolute word position within sentences which tend to be different for the same words across sentences. IBM Models 4 and 5 alleviate this limitation by replacing absolute word positions ... positions with relative positions. The latter models define the distortion parameters for a cept (one or more words). This models phrasal movement better since words tend to move in...

Uploaded: 08/03/2014, 02:21

Scientific report: "Structured Models for Fine-to-Coarse Sentiment Analysis" pdf

... which helped improve performance. 2.2 Beyond Two-Level Models To this point, we have focused solely on a model for two-level fine-to-coarse sentiment analysis not only for simplicity, but because ... systems. 1.1 Related Work The models in this work fall into the broad class of global structured models, which are typically trained with structured learning algorithms. Hidden Markov mo...

Uploaded: 08/03/2014, 02:21

Scientific report: "A Method for Word Sense Disambiguation of Unrestricted Text" potx

... word PROCEDURE: STEP 1. Form a similarity list for each sense of one of the words. Pick one of the words, say W2, and using WordNet, form a similarity list for each sense of that word. For ... senses provided in WordNet. The senses are ranked using two sources of information: (1) the Internet for gathering statistics for word-word co-occurrences and (2) WordNet fo...

Uploaded: 08/03/2014, 06:20

Scientific report: "New Models for Improving Supertag Disambiguation" pdf

... Head Word Models Rather than head supertags, head words often seem to be more predictive of dependency relations. Based upon this reflection, we have implemented models where head words ... information extraction. We extend our supertagging models to perform this task in a fashion similar to that described in Srinivas (1997b). Selected models have been trained on...

Uploaded: 08/03/2014, 21:20

Scientific report: "Fertility Models for Statistical Natural Language Understanding" pdf

... clump. A headword language model uses two unigram models, a headword model and a non-headword model. Each clump is required to have a headword. All other words are non-headwords. The identity ... g(C), and the ai denote the formal language word to which each e in c~ align. The individual words in a clump c are represented by el el(~). For all fertility models, the fundamental pa...

Uploaded: 08/03/2014, 21:20
