... Constituent- Context Model for Improved Grammar Induction Dan Klein and Christopher D. Manning Computer Science Department Stanford University Stanford, CA 94305-9040 {klein, manning}@cs.stanford.edu Abstract We ... is the basis for our algorithm. NP VP PP Usually a Constituent Rarely a Constituent (a) Constituent Types (b) Constituents vs. Distituents Figure 3: The most freq...
Ngày tải lên: 17/03/2014, 08:20
... 12,742 Table 1. Modeling statistics The most common metric for evaluating an n- gram model is the probability that the model assigns to test data, or perplexity (Jelinek, 1991). For a test set ... source-channel model by fully exploring orthographic contextual information, aiming at alleviating the imprecision introduced by the multiple-step phoneme-based approach. 3 Joint...
Ngày tải lên: 31/03/2014, 03:20
Báo cáo khoa học: "A Feature-Rich Constituent Context Model for Grammar Induction" doc
... Association for Computational Linguistics, pages 17–22, Jeju, Republic of Korea, 8-14 July 2012. c 2012 Association for Computational Linguistics A Feature-Rich Constituent Context Model for Grammar ... 1 1 SUFFIX-NN: 1 1 context BASIC--VBD: 1 0 BASIC-VBD-: 0 1 L -CONTEXT- : 1 0 L -CONTEXT- VBD: 0 1 R -CONTEXT- VBD: 1 0 R -CONTEXT- : 0 1 Table 1: Span and context feat...
Ngày tải lên: 23/03/2014, 14:20
Tài liệu Báo cáo khoa học: "A Syntax-Driven Bracketing Model for Phrase-Based Translation" pptx
... then 8: Update bracketing instances for index j 9: end if 10: end if 11: end for 12: for each j ∈ c do 13: := ∪ {bracketing instances from j} 14: end for 15: Output: bracketing instances ... Algo- rithm. 3 The Syntax-Driven Bracketing Model 3.1 The Model Our interest is to automatically detect phrase bracketing using rich contextual information. We consider this task as a bina...
Ngày tải lên: 20/02/2014, 07:20
Tài liệu Báo cáo khoa học: "A Unified Syntactic Model for Parsing Fluent and Disfluent Speech∗" ppt
... transform are similar in form and meaning to non -constituent categories used in Com- binatorial Categorial Grammars (CCGs) (Steedman, 2000). Unlike CCGs, however, a right corner trans- formed grammar ... right- corner trees using transform rules similar to those described by Johnson(1998a). This right-corner transform is simply the left-right dual of a left- corner transform. It transform...
Ngày tải lên: 20/02/2014, 09:20
Tài liệu Báo cáo khoa học: "A Phrase-based Statistical Model for SMS Text Normalization" ppt
... normalization model consists of two sub-models: a word-based language model (LM), characterized by 1 (| ) nn P ee − ) k and a phrase- based lexical mapping model (channel model) , characterized ... substitution transformations, but inadequate to address the insertion transforma- tion. For example, the lingoes “duno”, “ysnite” have to be normalized using an insertion trans- fo...
Ngày tải lên: 20/02/2014, 12:20
Báo cáo khoa học: "A DOM Tree Alignment Model for Mining Parallel Data from the Web" doc
... alignment model for machine translation. For example, (Wu 1997; Alshawi, Bangalore, and Douglas, 2000; Yamada and Knight, 2001) have studied syn- chronous context free grammar. This formalism ... the DOM tree alignment model, sen- tence alignment model, and candidate web page pair verification model are introduced. 4 DOM Tree Alignment Model The Document Object Model (DO...
Ngày tải lên: 08/03/2014, 02:21
Báo cáo khoa học: "A Generalized Vector Space Model for Text Retrieval Based on Semantic Relatedness" pot
... Space Models (GVSM) extend the standard Vector Space Model (VSM) by embedding addi- tional types of information, besides terms, in the representation of documents. An interesting type of information ... retrieval per- formance when the retrieval model was based on WSD information. On the contrary, the construc- tion of a sense-based retrieval model by Stokoe et al. (2003) improved perfo...
Ngày tải lên: 08/03/2014, 21:20
Báo cáo khoa học: "A Morphological Analysis Based Method for Spelling Correction" docx
Ngày tải lên: 09/03/2014, 01:20
Báo cáo khoa học: "A Class-Based Agreement Model for Generating Accurately Inflected Translations" pptx
... M. Subotin. 2011. An exponential translation model for target language morphology. In ACL-HLT. C. Tillmann. 2004. A unigram orientation model for statistical machine translation. In NAACL. K. ... Association for Computational Linguistics, pages 146–155, Jeju, Republic of Korea, 8-14 July 2012. c 2012 Association for Computational Linguistics A Class-Based Agreement Model for...
Ngày tải lên: 16/03/2014, 19:20