Báo cáo khoa học: "Using Similarity Scoring To Improve the Bilingual Dictionary for Word Alignment" doc

Báo cáo khoa học: "Using Similarity Scoring To Improve the Bilingual Dictionary for Word Alignment" doc

Ngày tải lên : 08/03/2014, 07:20
... language word. is expressed as follows: a word qualifies for clus- tering if As before, are all the target language words that cooccur with source language word . Similarly to the most frequent words, ... Automatically-extracted thesauri for cross-language IR: When better is worse. In Proceed- ings of COMPUTERM’98. Eric Gaussier. 1998. Flow network models for word alignment and terminology extraction from bilingual corpora. ... allowed per word, . For example, if the maximum number is 2, then a word can align to 0, 1, or 2 words in the parallel sentence. In other settings, we en- forced a minimum score in the bilingual...
8 363 0
Báo cáo khoa học: "Data Cleaning for Word Alignment" pdf

Báo cáo khoa học: "Data Cleaning for Word Alignment" pdf

Ngày tải lên : 08/03/2014, 01:20
... are less than 20 percent. 2 1 : n Word Alignment Our discussion of uni-directional alignments of word alignment is limited to IBM Model 4. Definition 1 (Word alignment task) Let e i be the i-th ... two word alignments as an alignment point, 2) add new alignment points that exist in the union with the constraint that a new alignment point connects at least one previ- ously unaligned word, ... mechanism to aug- ment one source word into several source words or delete a source word, while a NULL insertion is a mechanism of generating several words from blank words. Fertility uses a conditional...
9 487 0
Báo cáo khoa học: "Soft Syntactic Constraints for Word Alignment through Discriminative Training" pot

Báo cáo khoa học: "Soft Syntactic Constraints for Word Alignment through Discriminative Training" pot

Ngày tải lên : 08/03/2014, 02:21
... bilin- gual word alignment finds word- to -word connec- tions across languages. Originally introduced as a byproduct of training statistical translation models in (Brown et al., 1993), word alignment ... Log-linear models for word alignment. In Meeting of the Association for Computa- tional Linguistics, pages 459–466, Ann Arbor, USA. I. D. Melamed. 2000. Models of translational equivalence among words. ... exer- cise for word alignment. In HLT-NAACL Workshop on Building and Using Parallel Texts, pages 1–10, Edmon- ton, Canada. R. Moore. 2005. A discriminative framework for bilingual word alignment. ...
8 325 0
Báo cáo khoa học: "Learning Expressive Models for Word Sense Disambiguation" pot

Báo cáo khoa học: "Learning Expressive Models for Word Sense Disambiguation" pot

Ngày tải lên : 08/03/2014, 02:21
... words to the right and left of the verb, identified using POS tags, represented by has_narrow(snt, word_ position, word) : has_narrow(snt 1 , 1st _word_ left, mind). has_narrow(snt 1 , 1st _word_ right, ... the positions of the words, represented by has_narrow_trns(snt, word_ position, portuguese _word) : has_narrow_trns(snt 1 , 1st _word_ right, como). has_narrow_trns(snt 1 , 2nd _word_ right, um). … ... for the disambigua- tion of verbs. We plan to further evaluate our approach for other sets of words, including other parts-of-speech to allow further comparisons with other approach- es. For...
8 380 0
Báo cáo khoa học: "Combining Clues for Word Alignment" pdf

Báo cáo khoa học: "Combining Clues for Word Alignment" pdf

Ngày tải lên : 08/03/2014, 21:20
... short words. 340 Combining Clues for Word Alignment Rirg Tiedemann Department of Linguistics Uppsala University Box 527 SE-751 20 Uppsala, Sweden joerg@stp.ling.uu.se Abstract In this paper, a word ... corpora carry a huge amount of bilingual lexical information. Word alignment approaches focus on the automatic identification of translation relations in translated texts. Alignments are usu- ally ... an alignment clue for the cor- responding word pairs. The likelihood of each translation alternative can be weighted, e.g., by frequency (if available). 2.3 Clue Combinations So far, word alignment...
8 579 0
Báo cáo khoa học: "Confidence Measure for Word Alignment" potx

Báo cáo khoa học: "Confidence Measure for Word Alignment" potx

Ngày tải lên : 17/03/2014, 01:20
... based on word alignment. In this paper we introduce a confidence mea- sure for word alignment, which is robust to extra or missing words in the bilingual sentence pairs, as well as word alignment ... sentence alignments and alignment links from mul- tiple word alignments of the same sen- tence pair. Additionally, we remove low confidence alignment links from the word alignment of a bilingual ... the same word does in- crease the confusion for word alignment and re- duce the link confidence. On the other hand, ad- ditional information (such as the distance of the word pair, the alignment...
9 317 0
Tài liệu Báo cáo khoa học: "Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs" pptx

Tài liệu Báo cáo khoa học: "Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs" pptx

Ngày tải lên : 20/02/2014, 12:20
... two parameters for the dis- tortion probability: one for head words and the other for non-head words. Distortion Probability for Head Words The distortion probability for head words represents ... two word alignment models for language pairs L1-L3 and L2-L3, respectively. And then, with L3 as a pivot language, we can build a word alignment model for L1 and L2 based on the above two models. ... language word similarity of the Chinese word c and the Japanese word given the English word );,( efcsim f e Figure 1. Similarity Calculation English word e. For the ambiguous English word e,...
8 359 0
Tài liệu Báo cáo khoa học: "Topic Models for Dynamic Translation Model Adaptation" pptx

Tài liệu Báo cáo khoa học: "Topic Models for Dynamic Translation Model Adaptation" pptx

Ngày tải lên : 19/02/2014, 19:20
... BiTAM model (Zhao and Xing, 2006), which uses a bilingual topic model for learning alignment. In our case, by building a topic distri- bution for the source side of the training data, we abstract ... probability under topic 1, topic 2, etc., or F 2 : What is the probability under the most probable topic, second most, etc. A model using F 1 learns whether a specific topic is useful for translation, ... relevant transla- tions based on topic- specific contexts, where topics are induced in an unsupervised way using topic models; this can be thought of as inducing subcorpora for adaptation with- out any...
5 532 0
Tài liệu Báo cáo khoa học: "Guiding Statistical Word Alignment Models With Prior Knowledge" pdf

Tài liệu Báo cáo khoa học: "Guiding Statistical Word Alignment Models With Prior Knowledge" pdf

Ngày tải lên : 20/02/2014, 12:20
... as 1. In building word alignment models, a special “NULL” word is usually introduced to address tar- get words that align to no source words. Since this physically non-existing word is not in the ... way. However, this is chosen for mostly computational 4 2 Constrained Word Alignment Models The framework that we propose to incorporate sta- tistical constraints into word alignment models is generic. ... candidate. This information is de- rived before word alignment model training and will act as soft constraints that need to be respected dur- ing training and alignments. For a given word pair, the...
8 495 0
Báo cáo khoa học: "Employing Topic Models for Pattern-based Semantic Class Discovery" doc

Báo cáo khoa học: "Employing Topic Models for Pattern-based Semantic Class Discovery" doc

Ngày tải lên : 08/03/2014, 00:20
... fewer “documents”, “words”, and “topics”. To further improve efficiency, we also perform preprocess- ing (refer to Section 3.4 for details) before build- ing topic models for C R (q), where some ... be related to multiple topics in some topic models (e.g., pLSI and LDA). Topic modeling Semantic class construction word item (word or phrase) document RASC topic semantic class Table ... pro- vides a formal and convenient way of grouping documents and words to topics. In order to apply topic models to our problem, we map RASCs to documents, items to words, and treat the output topics...
9 398 0
Báo cáo khoa học: "Improved Discriminative Bilingual Word Alignment" pdf

Báo cáo khoa học: "Improved Discriminative Bilingual Word Alignment" pdf

Ngày tải lên : 08/03/2014, 02:21
... Log- linear Models for Word Alignment. In Proceed- ings of the 43rd Annual Meeting of the ACL, pp. 459–466, Ann Arbor, Michigan. Rada Mihalcea and Ted Pedersen. 2003. An Eval- uation Exercise for Word Alignment. ... USA {bobmoore,scottyhi,abode}@microsoft.com Abstract For many years, statistical machine trans- lation relied on generative models to pro- vide bilingual word alignments. In 2005, several independent efforts showed that discriminative models ... equal or surpass the alignment accu- racy of the standard models, if the usual unla- beled bilingual training corpus is supplemented with human-annotated word alignments for only a small subset...
8 217 0
Báo cáo khoa học: "The S-Space Package: An Open Source Package for Word Space Models" pdf

Báo cáo khoa học: "The S-Space Package: An Open Source Package for Word Space Models" pdf

Ngày tải lên : 17/03/2014, 00:20
... with all word space models, which facilitates word space based applications. The package is written in Java and defines a standardized Java interface for word space algo- rithms. While other word ... July 2010. c 2010 Association for Computational Linguistics The S-Space Package: An Open Source Package for Word Space Models David Jurgens University of California, Los Angeles, 4732 Boelter ... algorithms, code documentation and mailing list archives. 2 Word Space Models Word space models are based on the contextual distribution in which a word occurs. This ap- proach has a long history in linguistics,...
6 410 0
Báo cáo khoa học: "Better Word Alignments with Supervised ITG Models" pdf

Báo cáo khoa học: "Better Word Alignments with Supervised ITG Models" pdf

Ngày tải lên : 17/03/2014, 01:20
... rea- sonable alignments, word alignment models must constrain the set of alignments considered. In this section, we discuss and compare alignment fami- lies used to train our discriminative models. Initially, ... many-to-one block alignment potential, and efficient pruning, ITG models can yield state-of-the art word alignments, even when the underlying gold alignments are highly non- ITG. Our models yielded ... across alignments. Specif- ically, for each alignment cell (i, j) which is not a possible alignment in a ∗ , we incur a loss of 1 when a ij = a ∗ ij ; note that if (i, j) is a possible alignment, ...
9 319 0

Xem thêm