Scientific report: "Scaling Conditional Random Fields Using Error-Correcting Codes" docx

... 10–17, Ann Arbor, June 2005. © 2005 Association for Computational Linguistics Scaling Conditional Random Fields Using Error-Correcting Codes Trevor Cohn Department of Computer Science and Software Engineering University ... phrase chunking (NPC) and POS tagging using the CoNLL 2000 data-set (Sang and Buchholz, 2000). 2 Conditional random fields CRFs are undirected graphical mo...

Uploaded: 31/03/2014, 03:20

Scientific report: "Training Conditional Random Fields with Multivariate Evaluation Measures" potx

... 217–224, Sydney, July 2006. © 2006 Association for Computational Linguistics Training Conditional Random Fields with Multivariate Evaluation Measures Jun Suzuki, Erik McDermott and Hideki Isozaki NTT ... isozaki}@cslab.kecl.ntt.co.jp Abstract This paper proposes a framework for training Conditional Random Fields (CRFs) to optimize multivariate evaluation measures, including n...

Uploaded: 17/03/2014, 04:20

Scientific report: "Automatically Evaluating Text Coherence Using Discourse Relations" docx

... between S1 as Arg1, and S2 as Arg2 2. Explicit Comparison using “but” between S2 as Arg1, and S3 as Arg2 3. Explicit Temporal using “as” within S3 (Clause C3.1 as Arg1, and C3.2 as Arg2) 4. ... Resources and Evaluation (LREC 2008). Radu Soricut and Daniel Marcu. 2006. Discourse generation using utility-trained coherence models. In Proceedings of the COLING/ACL Main Conference...

Uploaded: 23/03/2014, 16:20

Document: Scientific report: "Sequential Conditional Generalized Iterative Scaling" pdf

... the perplexity of the form using two models is actually marginally lower (better) than the perplexity of the form using a single model, so there does not seem to be any disadvantage to using it. The word ... variation, which appears to have been missed by the conditional maxent community. We show that this fast variation can also be used for conditional probabilities, and that it is usef...

Uploaded: 20/02/2014, 21:20

Document: Scientific report: "Scaling up Automatic Cross-Lingual Semantic Role Annotation" docx

... semantic annotations using an automatic approach that does not rely on a semantic ontology for the target language. Furthermore, to our knowledge, we report the first results on using joint syntactic-semantic ... One-thousand French sentences are extracted randomly from our parallel corpus without any constraints on the semantic parallelism of the sentences, unlike much previous work....

Uploaded: 20/02/2014, 04:20

Scientific report: "Scaling up from Dialogue to Multilogue: some principles and benchmarks" doc

... To assess its reliability a pilot study of the taxonomy was performed using two additional non-expert coders. These annotated 50 randomly selected NSUs (containing a minimum of 2 instances of each ... contrast, BNC data indicates the prevalence in multilogue of short answers that are resolved using material from an antecedent question located several turns back, whereas in dialogue...

Uploaded: 08/03/2014, 04:22

Scientific report: "Scaling Phrase-Based Statistical Machine Translation to Larger Corpora and Longer Phrases" pptx

... of the 35,313 word long test set which can be covered using only phrases of the specified length or greater. The table shows the efficacy of using phrases of different lengths. The table shows ... consider two common ways of calculating the translation probability: using the maximum likelihood estimator (MLE) and smoothing the MLE using lexical weighting. The maximum likelihood esti...

Uploaded: 17/03/2014, 05:20

Scientific report: "Scaling to Very Very Large Corpora for Natural Language Disambiguation" potx

... sampling technique where a part of speech tagger is trained using an annotated seed corpus. A family of taggers is then generated by randomly permuting the tagger probabilities, and the disparity ... harvested. We are able to attain improvements in accuracy for free using unsupervised learning, but unlike our learning curve experiments using correctly labeled data, accuracy doe...

Uploaded: 23/03/2014, 19:20

Scientific report: "Scaling Distributional Similarity to Large Corpora" doc

... 1997). 4.1 Random Indexing Random Indexing (RI) is a hashing technique based on Sparse Distributed Memory (Kanerva, 1993). Karlgren and Sahlgren (2001) showed RI produces results similar to LSA using ... approximation of the Jaccard similarity function using min-wise independent functions. Charikar (2002) proposed an approximation of the cosine measure using random hyperplanes...

Uploaded: 31/03/2014, 01:20
