Scientific report: "Learning with Annotation Noise" docx

... person’s annotations. Another possibility, recently explored by Beigman Klebanov and Beigman (2009), is that some items are really quite clear-cut for an annotator with any bias, belonging squarely within ... bias when confronted with annotation noise in training data, irrespective of the size of the dataset. Finally, we discuss the implications of our findings for the practice of annot...

Uploaded: 30/03/2014, 23:20

Scientific report document: "Learning with Unlabeled Data for Text Categorization Using Bootstrapping and Feature Projection Techniques" doc

... title word with the maximum similarity score with a word W, c_max is the category of the title word T_max, and T_secondmax is other title word with the second high similarity score with the ... formula means that a word with high ranking in a category has a high similarity score with the title word of the category and a high similarity score difference with other title w...

Uploaded: 20/02/2014, 16:20

Scientific report: "Learning Efficient Parsing" docx

... the prefix filter with τ = 0. We performed experiments with two parts of the D-Coi corpus. The first data set, P-P-H, contains newspaper data, and is therefore comparable both with the Alpino ... good results. 4.4 Comparison with link table The filter we developed is reminiscent of the link predicate of (Pereira and Shieber, 1987). An important difference with the filter developed...

Uploaded: 24/03/2014, 03:20

Scientific report: "Learning 5000 Relational Extractors" docx

... too slow; we must extract and index lists prior to learning. We begin with a 5 billion page Web crawl. LUCHS can be combined with any list harvesting technique, but we choose a simple approach, ... performed spell check. The “distance” between two attributes was calculated with a combination of edit distance and IR metrics with WordNet synonyms; then hierarchical agglomerative clust...

Uploaded: 30/03/2014, 21:20

Scientific report document: "Learning to Translate with Multiple Objectives" doc

... of translation edit rate with targeted human annotation. In AMTA. Valentin I. Spitkovsky, Hiyan Alshawi, and Daniel Jurafsky. 2011. Lateen EM: Unsupervised training with multiple objectives, ... different metrics M_k(h) for evaluating the quality of h. Without loss of generality, we assume metric scores are bounded between 0 and 1, with 1 being perfect. Each hypothesis h can b...

Uploaded: 19/02/2014, 19:20

Scientific report document: "Learning Hierarchical Translation Structure with Linguistic Annotations" ppt

... the baseline at the 95% confidence level are labelled with a single star, at the 99% level with two. with a 3-gram language model smoothed with modified Kneser-Ney discounting (Chen and Goodman, 1998), ... the impact of the linguistic annotations in the LTS system (lts), when compared with an instance not employing such annotations (lts-nolabels) and (b) decoding with a 4th-o...

Uploaded: 20/02/2014, 04:20

Scientific report document: "Learning Accurate, Compact, and Interpretable Tree Annotation" ppt

... More RBR-2 earlier Earlier later IN IN-0 In With After IN-1 In For At IN-2 in for on IN-3 of for on IN-4 from on with IN-5 at for by IN-6 by in with IN-7 for with on IN-8 If While As IN-9 because if ... Splitting Beginning with this baseline grammar, we repeatedly split and re-train the grammar. In each iteration we initialize EM with the results of the smaller grammar, splittin...

Uploaded: 20/02/2014, 12:20

Scientific report document: "Learning Word Senses With Feature Selection and Order Identification Capabilities" pdf

... e(D, T) be the coverage rate of the feature set T with respect to a set of contexts D, i.e., the ratio of the number of the occurrences with at least one feature in their local contexts ... helps to avoid the bias toward the selection of fewer features, since with fewer features, there are more occurrences without features in contexts, and their context vectors will be zero valu...

Uploaded: 20/02/2014, 16:20

Scientific report document: "Learning Parse and Translation Decisions From Examples With Rich Context" pdf

... the complexity of parse grammars with hand-coded rules turned out to be much more difficult than expected, if not impossible. Newer statistical approaches with often only very limited context ... trained on very large corpora. To cope with the complexity of unrestricted text, parse rules in any kind of formalism will have to consider a complex context with many different mor...

Uploaded: 22/02/2014, 03:20

Scientific report: "Learning Script Knowledge with Web Experiments" doc

... scenarios, there were significant numbers of ESDs both with the minimum length of 5 and the maximum length of 16 and everything in between. Combined with the fact that 93% of all individual event descriptions ... two phrases if they are semantically similar, i.e. it should cost more to align ‘exit’ with ‘eat’ than ‘exit’ with ‘leave’. Thus we take a measure of semantic (dis)simil...

Uploaded: 07/03/2014, 22:20
