Báo cáo khoa học: "Domain Adaptation with Active Learning fo

Báo cáo khoa học: "Domain Adaptation with Active Learning for Word Sense Disambiguation" pdf

... to use active learning for domain adaptation for WSD. A similar work is the recent research by Chen et al. (2006), where active learning was used successfully to reduce the annotation effort for ... 49–56, Prague, Czech Republic, June 2007. c 2007 Association for Computational Linguistics Domain Adaptation with Active Learning for Word Sense Disambiguation Yee...

Ngày tải lên: 08/03/2014, 02:21

8 363 0

Báo cáo khoa học: "SenseRelate::TargetWord – A Generalized Framework for Word Sense Disambiguation" doc

... the word senses. In our system, this module ﬁrst decides the base (uninﬂected) form of each of the n words. It then retrieves all the senses for each word from the sense inventory. We use WordNet ... Measure Context Target Sense Preprocessing Format Filter Sense Inventory Context Selection Postprocessing Pick Sense Figure 1: A generalized framework for Word Sense Disambig...

Ngày tải lên: 08/03/2014, 04:22

4 350 0

Báo cáo khoa học: "An Empirical Study on Class-based Word Sense Disambiguation" pdf

... semantic classes defined for a word. In the sense approach, one classifier is generated for each word sense, and the classifiers choose between the possible senses for the word. The examples to train ... for grouping senses of the same word, thus producing coarser word sense groupings for better disambiguation. Wikipedia 3 has been also recently used to over- come some prob...

Ngày tải lên: 24/03/2014, 03:20

9 423 0

Báo cáo khoa học: "Multi-Criteria-based Active Learning for Named Entity Recognition" ppt

... local context of the target word w, is also used to classify w. However, for active learning in NER, it is not reasonable to select a single word without context for human to label. Even if ... sent. (223K words) PER 5 sent. (131 words) 7809 sent. (157K words) LOC 5 sent. (130 words) 7809 sent. (157K words) Newswire ORG MUC-6 5 sent. (113 words) 602 sent. (14...

Ngày tải lên: 31/03/2014, 03:20

8 204 0

Tài liệu Báo cáo khoa học: "An Equivalent Pseudoword Solution to Chinese Word Sense Disambiguation" ppt

... supervised learning or semi-supervised learning method. This rein- forcement algorithm dates back to Gale et al. (1992a). Their investigation was based on a 6- word test set with 2 senses for each word. ... two words. An ambiguous word has the same number of EPs as of senses. Each EP's sense maps to a sense of ambiguous word. The semantic equivalence demands further...

Ngày tải lên: 20/02/2014, 12:20

8 414 0

Báo cáo khoa học: "Relieving The Data Acquisition Bottleneck In Word Sense Disambiguation" ppt

... clusters; SALAAM identiﬁes the appropriate senses for the words in those clusters based on the words senses’ proximity in WordNet. The word sense proximity is measured in information theo- retic terms based ... Resnik (Resnik, 1999); A sense selection criterion is applied to choose the appropriate sense label or set of sense la- bels for each word in the cluster; The chosen s...

Ngày tải lên: 08/03/2014, 04:22

8 393 0

Báo cáo khoa học: "An Unsupervised Morpheme-Based HMM for Hebrew Morphological Disambiguation" pdf

... word. As a result, for this case, 75% of the words remain with one analysis with 95% accuracy, 20% with two analyses and 5% with three analyses. Segal (2000) built a transformation-based tag- ger ... method for languages with aﬃxational morphology in which the knowledge of word formation rules (which are quite restricted in He- brew) helps in the disambiguation. We adapt HMM...

Ngày tải lên: 23/03/2014, 18:20

8 309 0

Báo cáo khoa học: "Domain Adaptation for Machine Translation by Mining Unseen Words" doc

... same pre- processng for all words, the training words and the OOV words. And the resulting feature vectors for each word are used for learning the CCA projections Since a word can have multiple ... trans- lations for the OOV German words (Haghighi et al., 2008). From the target domain corpus we extract the most frequent words (approximately 5000) for both the languages. Of the...

Ngày tải lên: 17/03/2014, 00:20

6 349 0

Báo cáo khoa học: "Domain Adaptation of Maximum Entropy Language Models" potx

... optimized on the heldout data. Usually, larger values are used for global parameters and for domains with more data, while for domains with less data, the variance is typically set to be smaller, ... this ∗ Currently with Tallinn University of Technology, Esto- nia paper is that we show how the suggested hierar- chical adaptation can be used with suitable pri- ors and combined...

Ngày tải lên: 23/03/2014, 16:20

6 297 0

Báo cáo khoa học: "A Combination of Active Learning and Semi-supervised Learning Starting with Positive and Unlabeled Examples for Word Sense Disambiguation: An Empirical Study on Japanese Web Search Query" pdf

... 2009. c 2009 ACL and AFNLP A Combination of Active Learning and Semi-supervised Learning Starting with Positive and Unlabeled Examples for Word Sense Disambiguation: An Empirical Study on ... data for word sense disambiguation (WSD) in the domain of web queries, where a complete set of ambiguous word senses are unknown. In this paper, we present a combination of...

Ngày tải lên: 23/03/2014, 17:20

4 441 1