Báo cáo khoa học: "A Topic Similarity Model for Hierarchical

Báo cáo khoa học: "A Topic Similarity Model for Hierarchical Phrase-based Translation" ppt

... Association for Computational Linguistics, pages 750–758, Jeju, Republic of Korea, 8-14 July 2012. c 2012 Association for Computational Linguistics A Topic Similarity Model for Hierarchical Phrase-based ... calculate the dis- tance is using topic model. Gong et al. (2010) introduce topic model for ﬁl- tering topic- mismatched phrase pairs. They ﬁrst as- sign a spec...

Ngày tải lên: 23/03/2014, 14:20

9 399 0

Tài liệu Báo cáo khoa học: "A Localized Prediction Model for Statistical Machine Translation" ppt

... blocks for for which . 560 4 Online Training of Maximum-entropy Model The local model described in Section 3 leads to the fol- lowing abstract maximum entropy training formulation: (8) In this formulation, ... describes a phrase-based model for SMT similar to the models presented in (Koehn et al., 2003; Och et al., 1999; Tillmann and Xia, 2003). In our paper, phrase pairs are na...

Ngày tải lên: 20/02/2014, 15:20

8 578 0

Tài liệu Báo cáo khoa học: "A Phonotactic Language Model for Spoken Language Identification" pptx

... June 2005. c 2005 Association for Computational Linguistics A Phonotactic Language Model for Spoken Language Identification Haizhou Li and Bin Ma Institute for Infocomm Research Singapore ... acoustic to- kens to form a unified acoustic vocabulary in our voice tokenizer. Readers are referred to (Ma et al. , 2005) for details of acoustic modeling. 3.1 Vector Space Modeling...

Ngày tải lên: 20/02/2014, 15:20

8 437 0

Tài liệu Báo cáo khoa học: "A Unified Graph Model for Sentence-based Opinion Retrieval" pdf

... performance was very low on topic 8 and topic 11. Topic 8, i.e. ‘成龙’ (Jackie Chan), it was influenced by topic 7, i.e. ‘李连杰’ (Jet Lee) as there were a number of similar relevant targets for ... targets for the two topics, and therefore many word pairs ended up the same. As a result, documents belonging to topic 7 and topic 8 could not be differentiated, and they bot...

Ngày tải lên: 20/02/2014, 04:20

9 585 0

Tài liệu Báo cáo khoa học: "A probabilistic generative model for an intermediate constituency-dependency representation" pptx

... re-ranking model performs rather well for a limited number of candidate structures, and out- performs Charniak’s model when k = 5. In this case we observe a small boost in performance for the detection ... consistently outper- forms the PCFG model on this metric, as for UAS, and BAS. Concerning the other metrics, as the number of k-best candidates increases, the PCFG model outpe...

Ngày tải lên: 20/02/2014, 04:20

6 556 0

Tài liệu Báo cáo khoa học: "A Joint Statistical Model for Simultaneous Word Spacing and Spelling Error Correction for Korean" pdf

... 61–64, Prague, June 2007. c 2007 Association for Computational Linguistics A Joint Statistical Model for Simultaneous Word Spacing and Spelling Error Correction for Korean Hyungjong Noh* Jeong-Won ... network 8 4 Experiments and Analyses 4.1 Corpus Information Table 1: Corpus information Table 1 shows the information of corpus which is used for experiments. All corpora ar...

Ngày tải lên: 20/02/2014, 12:20

4 523 0

Tài liệu Báo cáo khoa học: "A SPEECH-FIRST MODEL FOR REPAIR DETECTION AND CORRECTION" docx

... cues for repair processing. Discussion In this paper, we have presented a"speech-first" model, the Repair Interval Model, for studying repairs in spon- taneous speech. This model ... AROA Air Travel Information System (ATIS) database. Our results are interpreted within our "speech-first" framework for investigating repairs, the REPAIR IN- TERVAL MODEL (...

Ngày tải lên: 20/02/2014, 21:20

8 502 0

Báo cáo khoa học: "A Cascaded Linear Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" pdf

... each feture. ALL: all features, PER: perceptron model, WLM: word language model, PLM: POS language model, GPR: generating model, LPR: labelling model, LEN: word count penalty. LM with Witten-Bell ... algorithm. 1: Input: character sequence C 1:n 2: for i ← 1 n do 3: L ← ∅ 4: for l ← 1 min(i, K) do 5: w ← C i−l+1:i 6: for t ∈ P OS do 7: p ← label w as t 8: for q ∈ V[i − l] do 9:...

Ngày tải lên: 08/03/2014, 01:20

8 445 0

Báo cáo khoa học: "A Discriminative Language Model with Pseudo-Negative Samples" pptx

... discussed except for the topic- based language model (David M. Blei, 2003; Wang et al., 2005), our result may encourage the study of the combination of features for language modeling. A contrastive ... problem is that NLMs cannot handle overlapping information or non-local information easily, which is important for more ac- curate sentence classiﬁcation. For example, a NLM could as...

Ngày tải lên: 08/03/2014, 02:21

8 315 0

Báo cáo khoa học: "A Unified Statistical Model for the Identification of English BaseNP" pptx

... calculation formulas are similar with equations (13) and (14) respectively. Before training trigram model (3), all possible baseNP rules should be extracted from the training corpus. For instance, ... describe the two-pass statistical model, parameters training and Viterbi algorithm for the search of the best sequences of POS tagging and baseNP identification. Before describing our alg...

Ngày tải lên: 08/03/2014, 05:20

8 482 0