improving ibm word alignment model 1

Báo cáo khoa học: "Improving IBM Word-Alignment Model " pdf

Ngày tải lên: 23/03/2014, 19:20

8 260 0

Báo cáo khoa học: "Improving Domain-Specific Word Alignment for Computer Assisted Translation" potx

Ngày tải lên: 17/03/2014, 06:20

4 266 0

Báo cáo khoa học: "A Probability Model to Improve Word Alignment" ppt

Ngày tải lên: 17/03/2014, 06:20

8 300 0

Báo cáo khoa học: "Diversify and Combine: Improving Word Alignment for Machine Translation on Low-Resource Languages" docx

Ngày tải lên: 23/03/2014, 16:20

5 274 0

Báo cáo khoa học: "Alignment Model Adaptation for Domain-Specific Word Alignment" pptx

Ngày tải lên: 31/03/2014, 03:20

8 329 0

Tài liệu Báo cáo khoa học: "Word Alignment with Synonym Regularization" doc

... 0 .14 5 Proposed 0.947 0.824 0.8 81 0 .11 2 (b) 10 0k Precision Recall F-measure AER GIZA++ standard 0.925 0.7 91 0.853 0 .13 6 with SRH 0.934 0.803 0.864 0 .12 6 HM-BiTAM standard 0.898 0.8 51 0.874 0 .12 4 with ... representative word. For instance, the words ‘sick’ and ‘ill’ in the bilingual sentences # vocabularies 10 k 50k 10 0k English standard 8578 16 924 22 817 with SRH 5435 7235 13 978 French standard 10 7 91 218 72 ... of the ACL 2 010 Conference Short Papers, pages 13 7 14 1, Uppsala, Sweden, 11 -16 July 2 010 . c 2 010 Association for Computational Linguistics Word Alignment with Synonym Regularization Hiroyuki...

Ngày tải lên: 20/02/2014, 04:20

5 471 2

Tài liệu Báo cáo khoa học: "Smoothing a Tera-word Language Model" doc

... order model is taken to be the word frequencies in the Web 1T corpus. The Brown corpus was re- tokenized to match the tokenization style of the Web 1T dataset resulting in 1, 186,262 tokens in 52 ,10 8 sentences. ... 52 ,10 8 sentences. The Web 1T dataset has a 13 million word vocabulary consisting of words that appear 10 0 times or more in its corpus. 769 sentences in Brown that contained words outside this vocabulary ... and Linda C. Bauman Peto. 19 95. A hierarchical Dirichlet language model. Natural Lan- guage Engineering, 1( 3) :1 19 . Y.W. Teh. 2006. A hierarchical Bayesian language model based on Pitman-Yor...

Ngày tải lên: 20/02/2014, 09:20

4 425 1

Tài liệu Báo cáo khoa học: "Yet Another Word Alignment Tool" docx

... the term word alignment 1 Yawat was ﬁrst presented at the 2007 Linguistic Annota- tion Workshop (Germann, 2007). to refer to any form of alignment that identiﬁes words or groups of words as ... sub-sentential alignments of paral- lel text.” Linguistic Annotation Workshop (LAW ’07), 12 1 12 4. Prague, Czech Republic. Hwa, Rebecca and Nitin Madnani. 2004. “The umiacs word alignment interface.” http://www.umiacs.umd.edu/ ∼ nmadnani/ alignment/ forclip.htm. Lambert, ... translations by word alignment but also becaus e of such interface issues that aligning words manually has the reputa- tion of being a very tedious task. 3 Yawat Yawat (Yet Another Word Alignment Tool)...

Ngày tải lên: 20/02/2014, 09:20

4 417 1

Tài liệu Báo cáo khoa học: "Discriminative Word Alignment with Conditional Random Fields" ppt

... 7.73 –dictionary 27.72 7. 21 –sentence position 28.30 8. 01 –POS – 8 .19 Model 1 28.62 8.45 alignment word pair 32. 41 7.20 –Markov 32.75 12 .44 –Dice & Model 1 35.43 14 .10 Table 3. The resulting ... f-score AER Model 4 reﬁned 87.4 95 .1 91. 1 9. 81 Model 4 intersection 97.9 86.0 91. 6 7.42 French → English 96.7 85.0 90.5 9. 21 English → French 97.3 83.0 89.6 10 . 01 intersection 98.7 78.6 87.5 12 .02 reﬁned ... implementa- tion of the IBM alignment models (Brown et al., 19 93). These models treat word alignment as a hidden process, and maximise the probability of the observed (e, f ) sentence pairs 1 using the ex- pectation...

Ngày tải lên: 20/02/2014, 11:21

8 461 0

Tài liệu Báo cáo khoa học: "Using Word Support Model to Improve Chinese Input System" ppt

... usually 2, i.e. bigram model (Lin and Tsai, 19 87; Gu et al., 19 91; Fu et al., 19 96; Ho et al., 19 97; Sproat, 19 90; Gao et al., 2002; Lee 2003). From the studies (Hsu 19 94; Tsai and Hsu, 2002; ... 量刑/事實 /1, 關於/兩性 /1, 關與/實施 /1, 生殖/實施 /1, 關於/事實 /1, 關於/史實 /1 WSM Set 關於( guan yu)/7, 實施(shi shi)/4, 兩性(liang xing)/3, 量刑(liang xing)/2, 知識(zhi shi)/2, 事實(shi shi)/2, 失事( shi shi) /1, 關與(guan yu) /1, ... are (18 .9%, 10 .1% ) and (25.6%, 16 .6%), respectively. From Table 3b, the tonal and toneless STW improvements of the BiGram by using the WP identifier and the WSM are (8.6%, 11 .9%) and (17 .1% ,...