Scientific report: "A Modified Joint Source-Channel Model for Transliteration"

... been formulated under both noisy-channel model and joint source-channel model in Section 2. A number of transliteration models based on collocation statistics including the modified joint source-channel ... TUs as the context (this is the joint source channel model), trigram model with previous and next source TUs as the context and the modified joint source-cha...
Uploaded: 17/03/2014, 04:20
Scientific report: "A Stacked Sub-Word Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging"

... often provides important clues for POS tagging, and the POS tags contain much syntactic information, which need context information within a large window for disambiguation. For example, Huang et al. ... 1385–1394, Portland, Oregon, June 19-24, 2011. © 2011 Association for Computational Linguistics A Stacked Sub-Word Model for Joint Chinese Word Segmentation and Part-of-Speech...
Uploaded: 17/03/2014, 00:20
Document: Scientific report: "A Syntax-Driven Bracketing Model for Phrase-Based Translation"

... then 8: Update bracketing instances for index j 9: end if 10: end if 11: end for 12: for each j ∈ c do 13:  :=  ∪ {bracketing instances from j} 14: end for 15: Output: bracketing instances ... VP on the right, therefore CBMF is “VP-RC”. 3.3 The Integration of the SDB Model into Phrase-Based SMT We integrate the SDB model into phrase-based SMT to help decoder perform syntax-drive...
Uploaded: 20/02/2014, 07:20
Document: Scientific report: "A Unified Syntactic Model for Parsing Fluent and Disfluent Speech"

... Right-corner transform Binarized trees are then transformed into right-corner trees using transform rules similar to those described by Johnson (1998a). This right-corner transform is simply the ... Minnesota schuler@cs.umn.edu Abstract This paper describes a syntactic representation for modeling speech repairs. This representation makes use of a right corner transform of syntax trees t...
Uploaded: 20/02/2014, 09:20
Document: Scientific report: "A Phrase-based Statistical Model for SMS Text Normalization"

... normalization model consists of two sub-models: a word-based language model (LM), characterized by $P(e_n \mid e_{n-1})$, and a phrase-based lexical mapping model (channel model), characterized ... substitution transformations, but inadequate to address the insertion transformation. For example, the lingoes “duno”, “ysnite” have to be normalized using an insertion transfo...
Uploaded: 20/02/2014, 12:20
Scientific report: "A DOM Tree Alignment Model for Mining Parallel Data from the Web"

... the DOM tree alignment model, sentence alignment model, and candidate web page pair verification model are introduced. 4 DOM Tree Alignment Model The Document Object Model (DOM) is an application ... alignment model for machine translation. For example, (Wu 1997; Alshawi, Bangalore, and Douglas, 2000; Yamada and Knight, 2001) have studied synchronous context free gram...
Uploaded: 08/03/2014, 02:21
Scientific report: "A Generalized Vector Space Model for Text Retrieval Based on Semantic Relatedness"

... retrieval performance when the retrieval model was based on WSD information. On the contrary, the construction of a sense-based retrieval model by Stokoe et al. (2003) improved performance, while ... Space Models (GVSM) extend the standard Vector Space Model (VSM) by embedding additional types of information, besides terms, in the representation of documents. An interesting type o...
Uploaded: 08/03/2014, 21:20
Scientific report: "A Class-Based Agreement Model for Generating Accurately Inflected Translations"

... M. Subotin. 2011. An exponential translation model for target language morphology. In ACL-HLT. C. Tillmann. 2004. A unigram orientation model for statistical machine translation. In NAACL. K. ... Association for Computational Linguistics, pages 146–155, Jeju, Republic of Korea, 8-14 July 2012. © 2012 Association for Computational Linguistics A Class-Based Agreement Model for...
Uploaded: 16/03/2014, 19:20
Scientific report: "A Language-Independent Unsupervised Model for Morphological Segmentation"

... results on an application. Performance improvements due to morphological information have been reported for example in MT, information retrieval, and speech recognition. For the latter task, morphological ... same problem also occurs for German nouns. Therefore, this first condition of the affix acquisition step needs to be replaced. We therefore introduced an additional step for bu...
Uploaded: 17/03/2014, 04:20
Scientific report: "A Hierarchical Phrase-Based Model for Statistical Machine Translation"

... system. For all three systems we trained the translation model on the FBIS corpus (7.2M+9.2M words); for the language model, we used the SRI Language Modeling Toolkit to train a trigram model ... standing for X spanning $f_i^j$. We choose b and β to balance speed and performance on our development set. For our experiments, we set b = 40, β = $10^{-1}$ for X cells, and b = 15, β...
Uploaded: 17/03/2014, 05:20