... the word • s: a single-character word while for Joint S&T, a POS tag is attached to the tail of a boundary tag, to incorporate the word boundary information and POS information to- gether. For ... PD to CTB are con- ducted for two tasks: word segmentation alone, and joint segmentation and POS tagging (Joint S&T). The performance measurement indicators for word segm...
Ngày tải lên: 17/03/2014, 01:20
... improve performance. However, for grammar formalisms which use more fine-grained grammatical cate- gories, for example TAG and CCG, tagging accuracy is much lower. In fact, for these formalisms, ... multi- tagger for supertagging results in an effective pre- processor for CCG parsing, and that using a multi- tagger for POS tagging results in more accurate CCG supertagging. 2 Ma...
Ngày tải lên: 23/03/2014, 18:20
Tài liệu Báo cáo khoa học: "A Framework for Processing Partially Free Word Order" ppt
... referred to .as ID/LP format. And it is precisely this aspect of the for- malism that. makes the theory attractive for application to lan- guages with a high degree of word- urder freedom. The ... functioning of the formalism and some of its virtues. 4.2 The Analysis of German Word Order Uszkoreit (1982a) proposes a GPSG analysis of German word order that accounts for the fixe...
Ngày tải lên: 21/02/2014, 20:20
Báo cáo khoa học: "Variational Inference for Grammar Induction with Prior Knowledge" pdf
... position and (b) connect words which are nearby (in string distance). We experiment with six mixture com- ponents: (1) RIGHTATTACH: Each word s parent is to the word s right. The root, therefore, is al- ways ... We therefore report results with our method only for the logistic normal prior. We do inference on sections 1–270 and 301–1151 of CTB10 (4,909 sentences) by running the EM al- gor...
Ngày tải lên: 08/03/2014, 01:20
Báo cáo khoa học: "Structured Models for Fine-to-Coarse Sentiment Analysis" pdf
... possible to run a constrained forward- backward algorithm and learn the parameters for CRFs as well. 2.1.2 Feature Space In this section we define the feature representa- tion for each clique, f(y d , ... which helped improve performance. 2.2 Beyond Two-Level Models To this point, we have focused solely on a model for two-level fine-to-coarse sentiment analysis not only for simplicity,...
Ngày tải lên: 08/03/2014, 02:21
Báo cáo khoa học: "A Framework for Customizable Generation of Hypertext Presentations" pdf
... (producing the text string) and for- matting (determining the formatting marks to insert in the text string). Developing an appli- cation to present the information for a given domain is often ... syntactic struc- tures. • Linguistic grammar:, transformation rules specifying the transformation of syntactic struc- tures into surface word forms and punctuation marks. • Lexicon:...
Ngày tải lên: 08/03/2014, 05:21
Báo cáo khoa học: "ENGLISH GENERATOR FOR A CASE-LABELLED DEPENDENCY REPRESENING" pdf
... It places no restrictions on the form of the fillers for any slot in a gran~ node. The production rules ~,force categorial and order~,~ restrictions. So, for example, the templates reflect ... verbs; it is also used to cover sane other forms of attac~nent to, and modification of, nouns, for example by determiners ( like "a" ) and even for plural or singular number. I...
Ngày tải lên: 09/03/2014, 01:20
Báo cáo khoa học: "AN ALGORITHM FOR GENERATION IN UNIFICATION CATEGORIAL GRAMMAR" pdf
... of commutativity or associativity are available for testing logical equivalence 1. One of the 1Strictly speaking, we test for a very strict form of consistency. Two LFs are considered logically ... arguments. reduce (Sign0, Sign) :- transform(Sign0, Sign1), reduce (Sign1, Sign) . transform(Daughter, Mother) :- unary_rule(Mother, Daughter). transform(Sign0, Sign) :- path_value(...
Ngày tải lên: 09/03/2014, 01:20
Báo cáo khoa học: "Reranking Answers for Definitional QA Using Language Modeling" pdf
... intention. First, reformulate query via simply adding clue words to the questions. i.e., for “Who is ?” question, we add the word “biography”; and for “What is ?” question, we add the word “is usu- ally”, ... elements are the probabilities of word se- quences, denoted as P(w 1 , w 2 , , w n ) or P (w 1,n ) for short. Recently, language model has been successfully used for i...
Ngày tải lên: 23/03/2014, 18:20
Báo cáo khoa học: "Evaluation tool for rule-based anaphora resolution methods" pdf
... chose this format for several reasons: it is easily read, it allows a unified treatment of the files used for training and of those used for evaluation (which are already annotated in XML format) ... cases). It also performs a surface syntactic parsing of the text using dependency links that show the head-modifier relations between words. This kind of information is used for extracting c...
Ngày tải lên: 23/03/2014, 19:20