rare word translation extraction

Tài liệu Báo cáo khoa học: "Effective Phrase Translation Extraction from Alignment Models" ppt

Tài liệu Báo cáo khoa học: "Effective Phrase Translation Extraction from Alignment Models" ppt

Ngày tải lên : 20/02/2014, 16:20
... of phrase length. Tra- ditional word lexicons propose translations for one source word, while with phrase translations, a single hypothesis pair can span several words in the source or target language. ... the lenth of the phrase, causing each translation to have a score compara- ble to the product of the word to word translations within the phrase. 7 HMM Phrase Extraction In order to compare our ... maximum approximation to score candidate translation hy- potheses. The translation lexicon generated by the IBM models provides translation estimates at the word level built on the complete training...
  • 8
  • 323
  • 0
Tài liệu Báo cáo khoa học: " Word Translation Disambiguation Using Bilingual Bootstrapping" doc

Tài liệu Báo cáo khoa học: " Word Translation Disambiguation Using Bilingual Bootstrapping" doc

Ngày tải lên : 20/02/2014, 21:20
... correct Chinese translation of the ambiguous English word, given an English sentence which contains the word. Word translation disambiguation is actually a special case of word sense disambiguation ... an English word usually has several Chinese words as translations (cf., Figure 5). As a translation dictionary, we used the HIT dictionary, which contains about 76000 Chinese words, 60000 ... Experiment 2: Yarowsky’s Words We also conducted translation on seven of the twelve English words studied in (Yarowsky, 1995). Table 5 shows the list of the words. For each of the words, we extracted...
  • 9
  • 480
  • 0
Tài liệu Báo cáo khoa học: "A Tool for Multi-Word Expression Extraction in Modern Greek Using Syntactic Parsing" pdf

Tài liệu Báo cáo khoa học: "A Tool for Multi-Word Expression Extraction in Modern Greek Using Syntactic Parsing" pdf

Ngày tải lên : 22/02/2014, 02:20
... modules and resources which enable the retrieval of Greek multi -word expressions and their translations: the Greek parser, its lexical database, the extraction and concordanc- ing system. 1 Introduction In ... multilingual society, there is a pressing need for building translation resources, such as large-coverage multilingual lexicons, translation systems or translation aid tools, especially due to the increasing ... Greece, 3 April 2009. c 2009 Association for Computational Linguistics A Tool for Multi -Word Expression Extraction in Modern Greek Using Syntactic Parsing Athina Michou University of Geneva Geneva,...
  • 4
  • 491
  • 0
Báo cáo khoa học: "A Bio-inspired Approach for Multi-Word Expression Extraction" doc

Báo cáo khoa học: "A Bio-inspired Approach for Multi-Word Expression Extraction" doc

Ngày tải lên : 08/03/2014, 02:21
... constantly in corpus(Taneli,2003). The extraction of MWE plays an important role in several areas, such as machine translation (Pas- cale,1997), information extraction (Kalliopi,2000) etc. On the ... also a need for MWE extraction in a much more widespread scenario namely that of human translation and technical writing. Many efforts have been de- voted to the study of MWE extraction (Beat- rice,2003; ... model. 2 Multi -word Expression Multi -word Expression( in general, term) as the linguistic representation of concepts, also has some special statistical features. The component words of terms...
  • 7
  • 376
  • 1
Strategies to deal with non - equivalence at word level in translation

Strategies to deal with non - equivalence at word level in translation

Ngày tải lên : 10/04/2013, 10:37
... as translation by a more general word and translation by cultural substitution while their English classmates did not use the following strategies: translation using a loan word or a loan word ... to solve non-equivalence. Giménez (2005) notes that translation using a related word; translation by paraphrase using unrelated words and translation by omission seem to be favored by Spanish ... Vietnamese word is impossible. Using Vietnamese words “sự tới dự”, the translation will sound non –Vietnamese. Pharaphasing it by a combination of Vietnamese items has brought out an acceptable translation...
  • 57
  • 1.9K
  • 14
Tài liệu Báo cáo khoa học: "A Ranking-based Approach to Word Reordering for Statistical Machine Translation" doc

Tài liệu Báo cáo khoa học: "A Ranking-based Approach to Word Reordering for Statistical Machine Translation" doc

Ngày tải lên : 19/02/2014, 19:20
... the word order in target language. To this end, we propose a simple but effective ranking-based ap- proach to word reordering. The ranking model is automatically derived from the word aligned ... the paper. 2 Word Reordering as Syntax Tree Node Ranking Given a source side parse tree T e , the task of word reordering is to transform T e to T  e , so that e  can match the word order in ... arcs for dependency tree; word alignment links are lines without arrow between the source and target sentences. parse tree, we can obtain the same word order of Japanese translation. It is true...
  • 9
  • 615
  • 0
Tài liệu Báo cáo khoa học: "Fine-grained Tree-to-String Translation Rule Extraction" docx

Tài liệu Báo cáo khoa học: "Fine-grained Tree-to-String Translation Rule Extraction" docx

Ngày tải lên : 20/02/2014, 04:20
... was achieved for Arabic-to-English translation. Birch et al. (2007) also reported a significant improve- ment for Dutch-English translation by applying CCG supertags at a word level to a factorized SMT system ... tree-to-string rule extraction, rather than string-to-string translation (Hassan et al., 2007; Birch et al., 2007). The Logon project 2 (Oepen et al., 2007) for Norwegian-English translation integrates ... of the extraction algorithms (Galley et al., 2006; Mi and Huang, 2008). 3.3 Predicate-argument structures In order to extract translation rules from PASs, we want to localize a predicate word and...
  • 10
  • 473
  • 0
Tài liệu Báo cáo khoa học: "Learning Word-Class Lattices for Definition and Hypernym Extraction" doc

Tài liệu Báo cáo khoa học: "Learning Word-Class Lattices for Definition and Hypernym Extraction" doc

Ngày tải lên : 20/02/2014, 04:20
... frequent words F to generalize words to word classes”. We define a word class as either a word itself or its part of speech. Given a sentence s = w 1 , w 2 , . . . , w |s| , where w i is the i-th word ... order of symbols like in word/ phoneme lat- tices, and nodes are clusters of salient words ag- gregated using synonymy, similarity, or subtrees of a thesaurus. However, salient word selection and aggregation ... where w i is its i-th word. Given the set F of most frequent words in T (cf. Section 3.1), the star pattern σ(s) associated with s is obtained by replacing with * all the words w i ∈ F, that...
  • 10
  • 567
  • 0
Tài liệu Báo cáo khoa học: "Robust Extraction of Named Entity Including Unfamiliar Word" doc

Tài liệu Báo cáo khoa học: "Robust Extraction of Named Entity Including Unfamiliar Word" doc

Ngày tải lên : 20/02/2014, 09:20
... USA, June 2008. c 2008 Association for Computational Linguistics Robust Extraction of Named Entity Including Unfamiliar Word Masatoshi Tsuchiya † Shinya Hida ‡ Seiichi Nakagawa ‡ † Information ... method consists of two steps. The first step is to assign the most simi- lar and familiar word to each unfamiliar word based on their context vectors calculated from a large unannotated corpus. After ... corpus show the effectiveness of the proposed method. 2 Extraction of Japanese Named Entity 2.1 Task of the IREX Workshop The task of NE extraction of the IREX workshop (Sekine and Eriguchi, 2000)...
  • 4
  • 384
  • 1
Tài liệu Báo cáo khoa học: "A Discriminative Syntactic Word Order Model for Machine Translation" pdf

Tài liệu Báo cáo khoa học: "A Discriminative Syntactic Word Order Model for Machine Translation" pdf

Ngày tải lên : 20/02/2014, 12:20
... Introduction The machine translation task can be viewed as con- sisting of two subtasks: predicting the collection of words in a translation, and deciding the order of the predicted words. For some language ... in Machine Translation We apply our model to machine translation by re- ordering the translation produced by a baseline MT system. Our baseline MT system constructs, for each target translation ... described in Section 4.1 Word Bigram: Word bigrams of the target sen- tence. Examples from Figure 2: “kore”+“niyori”, “niyori”+“roku”. DISP: Displacement feature. For each word posi- tion in the...
  • 8
  • 404
  • 0
Tài liệu Báo cáo khoa học: "Term-list Translation using Mono-lingual Word Co-occurrence Vectors*" doc

Tài liệu Báo cáo khoa học: "Term-list Translation using Mono-lingual Word Co-occurrence Vectors*" doc

Ngày tải lên : 20/02/2014, 18:20
... complicated. Some translation alternatives are polysemous in the target language. If a polysemous word has a very general meaning that co-occurs with various words, then this word is more likely ... multi-dimensional vector space, called WORD SPACE (Schuetze, 1997) for defin- ing the coherence of words. The starting point of WORD SPACE is to represent a word with an n- dimensional vector ... many times the word wi occurs close to the word. For simplicity, we consider w~ and wj to occur close in context if and only if they appear within an m -word distance (i.e., the words occur...
  • 5
  • 338
  • 0
Tài liệu Báo cáo khoa học: "Bilingually Motivated Domain-Adapted Word Segmentation for Statistical Machine Translation" pptx

Tài liệu Báo cáo khoa học: "Bilingually Motivated Domain-Adapted Word Segmentation for Statistical Machine Translation" pptx

Ngày tải lên : 22/02/2014, 02:20
... that it is very rare to get reliable alignments between one word and k consecutive words when k is high. 5 While in this paper we are primarily concerned with lan- guages where the word boundaries ... lan- guages marked with word boundaries to construct bilingually motivated “words”. 551 6.5 Using different word aligners The above experiments rely on GIZA++ to per- form word alignment. We next ... iterations). 4 Word Lattice Decoding 4.1 Word Lattices In the decoding stage, the various segmentation alternatives can be encoded into a compact rep- resentation of word lattices. A word lattice...
  • 9
  • 236
  • 0
Báo cáo khoa học: "Discriminative Modeling of Extraction Sets for Machine Translation" pptx

Báo cáo khoa học: "Discriminative Modeling of Extraction Sets for Machine Translation" pptx

Ngày tải lên : 07/03/2014, 22:20
... relationship between word alignments and extraction sets, then define our model. 2.1 Extraction Sets from Word Alignments Rule extraction is a standard concept in machine translation: word alignment ... sentences. Our extraction set model outperforms both un- supervised and supervised word aligners at pre- dicting word alignments and extraction sets. We also demonstrate that extraction sets ... demonstrate that the full extraction set model outperforms supervised and unsupervised baselines in evaluations of word alignment quality, extraction set quality, and translation. In word and bispan evaluations,...
  • 11
  • 420
  • 0