Báo cáo khoa học: "Insertion Operator for Bayesian Tree Substitution Grammars" pdf
... Association for Computational Linguistics:shortpapers, pages 206–211, Portland, Oregon, June 19-24, 2011. c 2011 Association for Computational Linguistics Insertion Operator for Bayesian Tree Substitution ... an insertion operator for combining subtrees as in tree adjoining grammars (TAG) (Joshi, 1985) or tree insertion grammars (TIG) (Schabes and Wa- ters, 1995). An in...
Ngày tải lên: 23/03/2014, 16:20
... improve parsing accuracy. 1 Introduction Tree Substitution Grammar (TSG) is a compelling grammar formalism which allows nonterminal rewrites in the form of trees, thereby enabling the modelling ... set of smaller subtrees for a given elementary tree (Sima’an and Buratto, 2003), which are used to smooth its probability estimate. 3 The transform assumes inside inference. For Viterbi r...
Ngày tải lên: 17/03/2014, 00:20
... perspective of linguistic information processing, because it em- ploys lexical information in a more direct way. 1. Introduction Tree Adjoining Grammars (TAGs) are a formal- ism for expressing grammatical ... symbol, 1 and A are two finite sets of trees, called initial trees and auxiliary trees respectively. The trees in the set IuA are called elementary trees. We assume that...
Ngày tải lên: 22/02/2014, 10:20
Báo cáo khoa học: "Native Language Detection with Tree Substitution Grammars" pptx
... model to perform au- thorship attribution, and Post (2011), which uses TSG features in a logistic regression model to per- form grammaticality detection. 3 Tree Substitution Grammars Tree Substitution ... dif- ferent language. We compare two state of the art methods for Tree Substitution Grammar induction and show that features from both methods outperform previous state of the...
Ngày tải lên: 07/03/2014, 18:20
Tài liệu Báo cáo khoa học: "Semantic Parsing with Bayesian Tree Transducers" doc
... a tree transformation based mapping, and (c) a tree trans- ducer that performs the mapping. volve tree transformations either between two trees or a tree and a string. The tree transducer, a formalism ... to tree transducers the benefits of the Bayesian frame- work for principled handling of data sparsity and 488 prior knowledge. Graehl et al. (2008) present an EM training proc...
Ngày tải lên: 19/02/2014, 19:20
Báo cáo khoa học: "Generalized Algorithms for Constructing Statistical Language Models" pdf
... finding all for a given is . Therefore, the total cost is . For all non-empty , we create a new state and for all we set . We create a transition , and for all such that , we set . For all such ... tech- nique for creating an exact representation of -gram lan- guage models by WFAs whose size is practical for offline use even in tasks with a vocabulary size of about 500,000 words a...
Ngày tải lên: 08/03/2014, 04:22
Báo cáo khoa học: "Fertility Models for Statistical Natural Language Understanding" pdf
... in- crease in performance of about 2-3% for most mod- els. For General-LM, results increased by 8-10%. The Poisson and general fertility models show a 2- 5% gain in performance over the basic ... English. For the ATIS task, our formal language is a mi- nor variant of the NL-Parse (Hemphill, Godfrey, and Doddington, 1990) used by ARPA to annotate the ATIS corpus. An example of a f...
Ngày tải lên: 08/03/2014, 21:20
Báo cáo khoa học: "Computer Backup for Field Work in Phonology" pdf
... communication as a system for realizing the output of the grammar. Although the phonology of the 1950s had its prob- lems, it would be foolish to discount it as all bad. For field work, in which ... provided the plan for an essential step that a modern linguist skips only at the risk of basing his generalizations on nothing but an ad hoc subset of a language that is convenient f...
Ngày tải lên: 16/03/2014, 19:20
Báo cáo khoa học: "Reinforcement Learning for Mapping Instructions to Actions" pdf
... divided into 70 for training, 18 for development, and 40 for test. In the puzzle game domain, we use 50 tutorials, divided into 40 for training and 10 for test. 9 Statistics for the datasets ... manuals. In this paper, we present a reinforcement learning framework for in- ducing mappings from text to actions without the need for annotated training examples. For concreteness,...
Ngày tải lên: 17/03/2014, 01:20
Báo cáo khoa học: "Statistical Models for Unsupervised Prepositional Phrase Attachment" pdf
... base forms, as opposed to attachment information. It is therefore less resource-intensive and more portable than pre- vious corpus-based algorithm proposed for this task. We present results for ... noisy but abun- dant substitute for the information that one might get from a treebank. Tables 2 and 3 list the most frequent extracted head word tu- ples for unambiguous verb and n...
Ngày tải lên: 17/03/2014, 07:20