... incorporate synonym information effectively into word alignment models To confirm the effect of the synonym pair model with latent topics, we also tested GIZA++ and HMBiTAM with what we call Synonym ... latent topics between the synonym pair model and the word alignment model, the synonym information incorporated in the synonym pair model is used directly for training word...
Ngày tải lên: 20/02/2014, 04:20
... many-to-one word alignments, where each source word is aligned with zero or one target words, and therefore each target word can be aligned with many source words Each source word is labelled with the ... one-to-many alignments, where each target word is aligned with zero or more source words Many-to-many alignments are recoverable using the standard techniques for superimpo...
Ngày tải lên: 20/02/2014, 11:21
Tài liệu Báo cáo khoa học: "Learning Word Senses With Feature Selection and Order Identification Capabilities" pdf
... ingredients: feature selection and order identification Feature selection was formalized as a constrained optimization problem, the output of which was a set of important features to determine word senses ... counting second order co-occurrence was 50 words 3.2 Evaluation method for feature selection For evaluation of feature selection, we used mutual information bet...
Ngày tải lên: 20/02/2014, 16:20
Tài liệu Báo cáo khoa học: "POS Disambiguation and Unknown Word Guessing with Decision Trees" pot
... etc ambiguity and to guess such attributes for unknown words Discussion and Future Goals We have shown a uniform approach to the dual problem of POS disambiguation and unknown word guessing as ... highcoverage lexicon and a set of empirically induced decision trees into a POS tagger achieving ~5,5% error rate for POS disambiguation and ~16% error rate for unknown wo...
Ngày tải lên: 22/02/2014, 03:20
Báo cáo khoa học: "Unsupervised Word Alignment with Arbitrary Features" potx
... algorithms), but is trained entirely from parallel sentences without gold-standard word alignments Thus, it addresses the two limitations of current word alignment approaches This paper is structured as ... trained to maximize likelihood: infrequent source words act as “garbage collectors”, with many target words aligned to them (the word dislike in the Model alignment in Figure i...
Ngày tải lên: 17/03/2014, 00:20
Báo cáo khoa học: "Bayesian Unsupervised Word Segmentation with Nested Pitman-Yor Language Modeling" doc
... so that word length will have a Poisson distribution whose parameter can now be estimated for a given language and word type We describe this in detail in Section 4.3 Nested Pitman-Yor Language ... probabilities over words ? If a lexicon is nite, we can use a uniform prior G0 (w) = 1/|V | for every word w in lexicon V However, with word segmentation every substring could b...
Ngày tải lên: 17/03/2014, 01:20
Báo cáo khoa học: "Better Word Alignments with Supervised ITG Models" pdf
... show that our ITG alignments yield improvements in translation quality thresholding (DeNero and Klein, 2007) The ITG Viterbi alignments are the Viterbi output of the ITG model with all features, ... MIRA ITG R AER 65.8 70.1 25.2 20.3 BITG R AER P 85.0 90.2 73.3 80.1 21.1 15.0 P 85.7 87.3 Likelihood BITG-S BITG-N R AER P R AER 73.7 82.8 20.6 14.9 85.3 88.2 74.8 83.0 20.1 14.4 Tab...
Ngày tải lên: 17/03/2014, 01:20
Báo cáo khoa học: "Extracting Word Sets with Non-Taxonomical Relation" potx
... thematic relation — among the words We then extracted those word sets that not agree with the thesaurus as word sets with a thematic relation We extracted word sets by utilizing inclusive relations ... constructed word sets consisting of these medical terms Then, we chose 977 word sets consisting of three or more terms from them, and removed word sets with a taxo...
Ngày tải lên: 17/03/2014, 04:20
Programming A Game With Unity: A Beginner's Guide
... These indie game development teams have demonstrated an agility and risk-tolerance that, in many cases, allows them to push gameplay innovation faster than their big budget counterparts A number ... actually declared the class and its name (“Mook”); private float health; -This declares a private class variable (which can only be changed from inside the class) The variable is given a v...
Ngày tải lên: 18/03/2014, 21:59
Báo cáo khoa học: "Smaller Alignment Models for Better Translations: Unsupervised Word Alignment with the 0" potx
... 2 Method We start with a brief review of the IBM and HMM word alignment models, then describe how to extend them with a smoothed prior and how to efficiently train them 2.1 IBM Models and HMM Given ... concern us here All three models, as well as IBM Models 3–5, share the same t For further details of these models, the reader is referred to the original papers describin...
Ngày tải lên: 30/03/2014, 17:20