... Sarkar. 2008. Semi-supervised Model Adaptation for Statisti- cal Machine Translation. Machine Translation, pages 77-94. Hua Wu, Haifeng Wang and Chengqing Zong. 2008. Do- main Adaptation for Statistical Machine ... Distributed Language Modeling for N-best List Re-ranking. In Proc. of EMNLP 2006, pages 216-223. Bing Zhao, Matthias Eck and Stephan Vogel. 2004. Language Model Adap...
Ngày tải lên: 19/02/2014, 19:20
... NULL Foreign word penalty 8 translation table from intersection of both alignments 16 non-NULL Foreign word penalty Table 3: Sub-Models. Note that sub-models 1 to 5 are IBM Model 4, sub-models ... parameters used in generating Foreign words which are unaligned 11 backoff fertility for words with count <= 5 4 d 1 (j) movement probs of leftmost Foreign word translated from a parti...
Ngày tải lên: 31/03/2014, 01:20
Tài liệu Báo cáo khoa học: "Discriminative Lexicon Adaptation for Improved Character Accuracy – A New Direction in Chinese Language Modeling" pptx
... (CCN) Adaptation Corpus Lexicon Adaptation for Improved Character Accurac y Add/Delete words Lexicon ( Lex i ) Language Model ( LM i ) y (LAICA) Word Segmentation LM Trainin g ( Lex i ) Model ... new words can increase the ASR performance. Here we propose a dis- criminative lexicon adaptation approach for improved character accuracy, which not only adds new words but also...
Ngày tải lên: 20/02/2014, 07:20
Tài liệu Báo cáo khoa học: "Using Confidence Bands for Parallel Texts Alignment" pptx
... intercept (the value of y when x is 0), substituting x for the Portuguese word position. For Table 3, the ex- pected word position for the word I at pt word position 3877 is 0.9165 × 3877 + 141.65 = ... brackets). For average size texts (e.g. the Written Ques- tions), these words account for about 5% of the total (about 3k words / text). This number varies according to langu...
Ngày tải lên: 20/02/2014, 18:20
Báo cáo khoa học: "Mixture Model POMDPs for Efficient Handling of Uncertainty in Dialogue Management" doc
... uncertainty, but no formal semantics was provided for this list, and therefore only heuristic uses were suggested for it. 5 Initial Experiments We have implemented a Mixture Model POMDP ar- chitecture ... partitions of POMDP states. For any set of partitions, the mix- ture model approach could express the same model by defining one MDP state per partition and giving it a uniform di...
Ngày tải lên: 31/03/2014, 00:20
Tài liệu Báo cáo khoa học: " A Declarative Language for Implementing Dynamic Programs∗" pptx
... sentence and grammar by asserting values for certain items. If the input is John loves Mary, the user should assert values of 1 for word( John,0,1), word( loves,1,2), word( Mary,2,3), and end(3). If the ... induction, and finite-state modeling. 1 Introduction Computational linguistics has become a more experi- mental science. One often uses real-world data to test one’s formal models (g...
Ngày tải lên: 20/02/2014, 16:20
Báo cáo khoa học: "Semi-supervised Learning for Natural Language Processing" pptx
Ngày tải lên: 08/03/2014, 01:20
Báo cáo khoa học: "A Comparison of Syntactically Motivated Word Alignment Spaces" doc
... Introduction Bilingual word alignment finds word- level corre- spondences between parallel sentences. The task originally emerged as an intermediate result of training the IBM translation models (Brown ... models (Brown et al., 1993). These models use minimal linguistic intuitions; they essentially treat sentences as flat strings. They remain the dominant method for word alignment (Och a...
Ngày tải lên: 08/03/2014, 21:20
Báo cáo khoa học: "A Semantic Framework for Translation Quality Assessment" pptx
... Three parameters should be adjusted for the AM-FM implementation described in (1): the di- mensionality of the reduced space for AM, the or- der of n-gram model for FM, and the harmonic mean weighting ... it still performs well above random chance predictions, which, for the given average of 4 items per ranking, is about 25% for best and worst ranking predictions, and about...
Ngày tải lên: 17/03/2014, 00:20
Báo cáo khoa học: "A Portable Algorithm for Mapping Bitext Correspondence" pptx
... for the tokenizer to know which words are compounds. A word that has another word as a substring should result in one axis position for the substring and one for the su- perstring. When lexical ... is acceptable for the tokenization pro- gram to overgenerate just as it is acceptable for the matching predicate. For example, when tokenizing German text, it is not necessa...
Ngày tải lên: 17/03/2014, 23:20