Báo cáo khoa học: "Confidence Measure for Word Alignment" potx

Báo cáo khoa học: "Confidence Measure for Word Alignment" potx

Báo cáo khoa học: "Confidence Measure for Word Alignment" potx

... AFNLP Confidence Measure for Word Alignment Fei Huang IBM T.J.Watson Research Center Yorktown Heights, NY 10598, USA huangfe@us.ibm.com Abstract In this paper we present a confidence mea- sure for word alignment ... learned based on word alignment. In this paper we introduce a confidence mea- sure for word alignment, which is robust to extra or missing words in the bilingual sent...

Ngày tải lên: 17/03/2014, 01:20

9 317 0
Báo cáo khoa học: "Soft Syntactic Constraints for Word Alignment through Discriminative Training" pot

Báo cáo khoa học: "Soft Syntactic Constraints for Word Alignment through Discriminative Training" pot

... bilin- gual word alignment finds word- to -word connec- tions across languages. Originally introduced as a byproduct of training statistical translation models in (Brown et al., 1993), word alignment ... Transduction Grammar or ITG formalism, described in (Wu, 1997), is well suited for our purposes. ITGs per- form string-to-string alignment, but do so through a parsing algorithm that w...

Ngày tải lên: 08/03/2014, 02:21

8 325 0
Báo cáo khoa học: "Flow Network Models for Word Alignment and Terminology Extraction from Bilingual Corpora" docx

Báo cáo khoa học: "Flow Network Models for Word Alignment and Terminology Extraction from Bilingual Corpora" docx

... and French words, an empty English word, and an empty French word, • E comprises edges from the source to all the English words (including the empty one), edges from all the French words (including ... 2The empty words account for the fact that words may not be aligned with other ones, i.e. they are not exphcitely translated for example. 445 • from the source to the empty Engl...

Ngày tải lên: 17/03/2014, 07:20

7 379 0
Báo cáo khoa học: "Adaptive Language Modeling for Word Prediction" potx

Báo cáo khoa học: "Adaptive Language Modeling for Word Prediction" potx

... number of words in the prediction window. We focus on 5 -word prediction windows. Many com- mercial devices provide optimized input for the most common words (called core vocabulary) and offer word ... context is utilized. For ex- ample, Lesher et al. (1999) demonstrate that bigram and trigram models for word prediction are not satu- rated even when trained on 3 million words, in c...

Ngày tải lên: 31/03/2014, 00:20

6 376 0
Báo cáo khoa học: "Log-linear Models for Word Alignment" ppt

Báo cáo khoa học: "Log-linear Models for Word Alignment" ppt

... Model 5 training. For log-linear models, POS information and an additional dictionary are used, which is not the case for GIZA++/IBM models. However, treated as a method for performing symmetrization, ... translation, word alignment plays a crucial role as word- aligned corpora have been found to be an excellent source of translation-related knowledge. Various methods have been propo...

Ngày tải lên: 31/03/2014, 03:20

8 283 0
Báo cáo khoa học: "Confidence-Weighted Learning of Factored Discriminative Language Models" pptx

Báo cáo khoa học: "Confidence-Weighted Learning of Factored Discriminative Language Models" pptx

... Passive- Aggressive algorithm for re-ranking. Input: Tr = {(P i , N i ), 1 ≤ i ≤ K} w 0 ← 0, t ← 0 for a predefined number of iterations do for i from 1 to K do for all (p j , n j ) ∈ (P i × N i ) ... training corpus used for creat- ing the phrase-table, and extended the phrase-table format so as to record, for each token, all its factors. 5.2 Results In the first experiment, we me...

Ngày tải lên: 17/03/2014, 00:20

6 300 0
Báo cáo khoa học: "Confidence Driven Unsupervised Semantic Parsing" pptx

Báo cáo khoa học: "Confidence Driven Unsupervised Semantic Parsing" pptx

... models performance across the different iterations fairly, a uniform scale, such as UNIGRAM and BIGRAM, is required. In the case of the COMBINED measure we used the BI- GRAM measure for performance ... to the deci- sion variables, forcing the resulting output formula to be syntactically legal, for example by restricting active β-variables to be type consistent, and force the resulting...

Ngày tải lên: 23/03/2014, 16:20

10 298 0
Tài liệu Báo cáo khoa học: "Conditional Random Fields for Word Hyphenation" docx

Tài liệu Báo cáo khoa học: "Conditional Random Fields for Word Hyphenation" docx

... overall word- level errors #words with at least one FP or FN swe serious word- level errors #words with at least one FP ower overall word- level error rate owe / (total #words) swer serious word- level ... per- formance expected when hyphenating unknown words, i.e. rare future words. However, in real documents common words appear repeatedly. Therefore, the second less- standard experiment...

Ngày tải lên: 20/02/2014, 04:20

9 608 0
Tài liệu Báo cáo khoa học: "Head-Driven Parsing for Word Lattices" ppt

Tài liệu Báo cáo khoa học: "Head-Driven Parsing for Word Lattices" ppt

... estimate. These measures, while informative, do not cap- ture success of extraction of high-level information from speech. Task-specific measures should be used in tandem with extensional measures such ... for speech recognition, discusses a mod- elling trade-off between producing parse trees and producing strings. Most models are evaluated ei- ther with measures of success for parsing or...

Ngày tải lên: 20/02/2014, 15:21

8 382 0
Báo cáo khoa học: "Learning Expressive Models for Word Sense Disambiguation" pot

Báo cáo khoa học: "Learning Expressive Models for Word Sense Disambiguation" pot

... words to the right and left of the verb, identified using POS tags, represented by has_narrow(snt, word_ position, word) : has_narrow(snt 1 , 1st _word_ left, mind). has_narrow(snt 1 , 1st _word_ right, ... the positions of the words, represented by has_narrow_trns(snt, word_ position, portuguese _word) : has_narrow_trns(snt 1 , 1st _word_ right, como). has_narrow_trns(snt 1 , 2nd _...

Ngày tải lên: 08/03/2014, 02:21

8 381 0
w