Scientific report: "Exploring Asymmetric Clustering for Statistical Language Modeling" docx

... Exploring Asymmetric Clustering for Statistical Language Modeling Jianfeng Gao Microsoft Research, Asia Beijing, 100080, ... (1) clustering metrics, and (2) cluster numbers. In what follows, we will investigate the impact of each of the factors. 3.2 Asymmetric clustering The basic criterion for statistical clustering ... et al, [2001] give detailed descriptions...

Uploaded: 23/03/2014, 20:20

Scientific report: "Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining" ppt

... Association for Computational Linguistics, pages 175–183, Jeju, Republic of Korea, 8-14 July 2012. © 2012 Association for Computational Linguistics Fast Syntactic Analysis for Statistical Language ... syntactic information in both generative and discriminative language models. For generative LMs, the syntactic information must be part of the generative process. Structured...

Uploaded: 16/03/2014, 19:20

Scientific report: "SVD and Clustering for Unsupervised POS Tagging" docx

... Papers, pages 215–219, Uppsala, Sweden, 11-16 July 2010. © 2010 Association for Computational Linguistics SVD and Clustering for Unsupervised POS Tagging Michael Lamar* Division of Applied Mathematics ... Typically, only a few dozen iterations are required for full convergence of the clustering algorithm. We then apply a second pass of this entire SVD-and-clustering proce...

Uploaded: 07/03/2014, 22:20

Scientific report: "Improving Pronoun Translation for Statistical Machine Translation" docx

... Script-based Languages, pages 43–50. Kevin Gimpel and Noah A. Smith. 2008. Rich Source-Side Context for Statistical Machine Translation. In Proceedings of the Third Workshop on Statistical Machine ... 26 April 2012. © 2012 Association for Computational Linguistics Improving Pronoun Translation for Statistical Machine Translation Liane Guillou School of Informatics University of...

Uploaded: 17/03/2014, 22:20

Scientific report: "Semi-Supervised Training for Statistical Word Alignment" docx

... July 2006. c 2006 Association for Computational Linguistics Semi-Supervised Training for Statistical Word Alignment Alexander Fraser ISI / University of Southern California 4676 Admiralty Way, Suite ... is important for restricting total time used when producing align- ments for large training corpora. We performed two experiments. The first evalu- ates the number of search errors....

Uploaded: 31/03/2014, 01:20

Scientific report: "Exploring Entity Relations for Named Entity Disambiguation" pot

... challenges: Surface forms in text can be ambiguous, and the same entity can be referred to by different surface forms. For example, the surface form “George Bush” may denote either of two former U.S. ... where the majority of surface forms is unambiguous, but some surface forms are very ambiguous (Figure 1). This suggests that for a given set of distinct surface forms found in a...

Uploaded: 23/03/2014, 16:20

Scientific report: "Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation" docx

... specific clustering for the last word of each trigram but no clustering at all for the first two word positions. Generalizing this leads to arbitrary order class-based n-gram models of the form: P ... 755–762, Columbus, Ohio, USA, June 2008. © 2008 Association for Computational Linguistics Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Tran...

Uploaded: 31/03/2014, 00:20

Scientific report: "Clique-Based Clustering for improving Named Entity Recognition systems" pot

... to represent a precise annotation. For example, Oxford is an ambiguous NE but a clique such as <Cambridge, Oxford, Edinburgh University, Edinburgh, Oxford University> allows to focus ... of cluster for q = 1 to nbitr do for k = 1 to |CLI| do for l = 1 to κ do Compute the contribution of clique $cli_k$ with cluster $clu_l$: $cont_l = \sum_{cli_{k'} \in clu_l} (S_{kk'} - m)$ end for clu...

Uploaded: 31/03/2014, 20:20

Document: Scientific report: "Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information" doc

... context information at the sentence level, we adopt the topical context information in our method for the following reasons: (1) the topic information captures the context information beyond the ... 8-14 July 2012. © 2012 Association for Computational Linguistics Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information ∗ Jinsong Su 1,2 ,...

Uploaded: 19/02/2014, 19:20

Document: Scientific report: "Bilingual Sense Similarity for Statistical Machine Translation" ppt

... the Association for Computational Linguistics, pages 834–843, Uppsala, Sweden, 11-16 July 2010. © 2010 Association for Computational Linguistics Bilingual Sense Similarity for Statistical Machine ... units. Therefore, questions emerge: how good is the sense similarity computed via VSM for two units from parallel corpora? Is it useful for multilingual applications, such as s...

Uploaded: 20/02/2014, 04:20
