Automatic generation of labelled data for word sense disambiguation

... disambiguate the word sense of words in the context of their usage. This is the task of Word Sense Disambiguation. Given an occurrence of a word w in a natural language text, the task of word sense disambiguation ... Mapping from SENSEVAL to WordNet: Since we need to use the information of WordNet, we have to map the senses from SENSEVAL format to WordNet format. SE...

Uploaded: 30/09/2015, 14:24

Scientific report: "A Method for Word Sense Disambiguation of Unrestricted Text"

... untagged word1 - word2 pair (W1 - W2). OUTPUT: ranking the senses of one word. PROCEDURE: STEP: Form a similarity list for each sense of one of the words. Pick one of the words, say W2, and using WordNet, ... using WordNet, form a similarity list for each sense of that word. For this, use the words from the synset of each sense and the words from the hypernym...

Uploaded: 08/03/2014, 06:20

Scientific report: "A Combination of Active Learning and Semi-supervised Learning Starting with Positive and Unlabeled Examples for Word Sense Disambiguation: An Empirical Study on Japanese Web Search Query"

... 2006. An Empirical Study of the Behavior of Active Learning for Word Sense Disambiguation, Proc. of the main conference on Human Language Technology Conference of the North American Chapter of ACL, ... A combination of active learning and semi-supervised learning starting with positive and unlabeled examples. Proposed Algorithm: At the beginni...

Uploaded: 23/03/2014, 17:20

Scientific report: "Automatic Generation of Domain Models for Call Centers from Noisy Transcriptions"

... transcript of a help desk dialog minutes. For 125 of these calls, call topics were manually assigned. Generation of Domain Model: Fig shows the steps for generating a domain model in the call center ... from a collection of noisy call transcriptions. 4.2 Taxonomy Generation: As mentioned in section 3, automatically transcribed data is noisy and requires a good amoun...

Uploaded: 31/03/2014, 01:20

Document: Scientific report: "ParaSense or How to Use Parallel Corpora for Word Sense Disambiguation"

... instances in the five supported languages. Online machine translation tools have already been used to create artificial parallel corpora that were used for NLP tasks such as, for instance, Named ... features related to the focus word itself, being the word form of the focus word, the lemma, Part-of-Speech and chunk information • local context features related to a window...

Uploaded: 20/02/2014, 05:20

Scientific report: "Estimating Class Priors in Domain Adaptation for Word Sense Disambiguation"

... class membership estimates, for the instances in the target domain. These probabilities were then used by the machine learning methods to estimate the sense priors of each word in the target domain ... of predominant sense is often indicative of a change in domain, as different corpora drawn from different domains usually give different predominant senses. For example,...

Uploaded: 08/03/2014, 02:21

Scientific report: "Learning Expressive Models for Word Sense Disambiguation"

... Table For the monolingual scenario, we use the sense-tagged corpus and sense repositories provided for verbs in Senseval-3. There are 32 verbs with between 40 and 398 examples each. The number of senses ... has_narrow(snt, word_position, word): has_narrow(snt1, 1st_word_left, mind) has_narrow(snt1, 1st_word_right, back) … KS4: POS tags of words to the right and left of the verb,...

Uploaded: 08/03/2014, 02:21

Scientific report: "Domain Adaptation with Active Learning for Word Sense Disambiguation"

... active learning for domain adaptation, followed by count-merging. Next, we describe an EM-based algorithm to estimate the sense priors in the new domain. Performance of domain adaptation using active ... adaptation examples added (%). Figure 3: Using true predominant sense for the nouns the sense priors in WSJ. Using this new set of training examples, we perform domain adapta...

Uploaded: 08/03/2014, 02:21

Scientific report: "SenseRelate::TargetWord – A Generalized Framework for Word Sense Disambiguation"

... disambiguated. 3.1 Command Line: The command-line interface disamb.pl takes as input a SENSEVAL-2 formatted lexical sample file. The program disambiguates the marked up word in each instance and ... http://search.cpan.org/dist/WordNet-QueryData Related Work: There is a long history of work in Word Sense Disambiguation that uses Machine Readable Dictionaries, and are highly related to o...

Uploaded: 08/03/2014, 04:22

Scientific report: "Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study"

... multiple and distinct Chinese translations appear in the aligned Chinese sentence. For example, for an English occurrence channel, both “频道” (sense translation) and “途径” (sense translation) ... lumped into one sense (i.e., they are all translated into one Chinese word), we do not perform WSD on these words. The average number of senses before and after sense lumping is 5.07...

Uploaded: 08/03/2014, 04:22

Scientific report: "Topic Models for Word Sense Disambiguation and Token-based Idiom Detection"

... neighbour for the chosen word and then assigns a sense based on the word, its neighbour and the topic. Boyd-Graber and Blei (2007) test their method on WSD and information retrieval tasks and find ... corrected any bad tags and lemmas for the target instances. Sense Paraphrases: For word sense disambiguation tasks, the paraphrases of the sense keys are represent...

Uploaded: 23/03/2014, 16:20

Scientific report: "Domain Kernels for Word Sense Disambiguation"

... of the text in which the word is located is crucial information for WSD. For example, the (domain) polysemy among the COMPUTER SCIENCE and the MEDICINE senses of the word virus can be solved ... abstraction and term similarity for word sense disambiguation: IRST at Senseval-3. In Proc. of SENSEVAL-3 Third International Workshop on Evaluation of Systems for the Semantic Analy...

Uploaded: 23/03/2014, 19:20

Scientific report: "Learning Semantic Classes for Word Sense Disambiguation"

... Ehsanul Faruque. 2004. SenseLearner: Minimally supervised word sense disambiguation for all words in open text. In Senseval-3: Third Intl. Workshop on the Evaluation of Systems for the Semantic Analysis ... fact that the primes are common for words, and training data can hence be reused. Yarowsky (1992) used Roget’s Thesaurus categories as classes for word senses. These classes...

Uploaded: 31/03/2014, 03:20

Scientific report: "Personalizing PageRank for Word Sense Disambiguation"

... graph for each target word in the context: for each target word Wi, we concentrate the initial probability mass in the senses of the words surrounding Wi, but not in the senses of the target word ... the MFS in both Senseval-2 and Senseval-3 datasets. The results for the supervised system are given for reference, and we can see that the gap is relatively small, especially for...

Uploaded: 31/03/2014, 20:20
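The snippet for "Personalizing PageRank for Word Sense Disambiguation" mentions concentrating the initial probability mass on the senses of the words surrounding the target word. The following is only a minimal sketch of that idea, not the paper's implementation: the sense graph, the sense labels such as `bank#finance`, the damping value, and the function name `personalized_pagerank` are all assumptions made for illustration.

```python
def personalized_pagerank(graph, seeds, damping=0.85, iterations=50):
    """Power iteration for PageRank with a non-uniform teleport vector.

    graph: dict mapping each node to a list of neighbouring nodes.
    seeds: dict mapping nodes to (unnormalised) initial probability mass.
    """
    total = sum(seeds.values())
    if total <= 0:
        raise ValueError("seeds must carry positive mass")
    nodes = list(graph)
    teleport = {n: seeds.get(n, 0.0) / total for n in nodes}
    rank = dict(teleport)
    for _ in range(iterations):
        new_rank = {n: (1.0 - damping) * teleport[n] for n in nodes}
        for node in nodes:
            neighbours = graph[node]
            if not neighbours:
                # Dangling node: hand its mass back to the teleport vector.
                for n in nodes:
                    new_rank[n] += damping * rank[node] * teleport[n]
                continue
            share = damping * rank[node] / len(neighbours)
            for neighbour in neighbours:
                new_rank[neighbour] += share
        rank = new_rank
    return rank


# Hypothetical toy sense graph; edges are listed in both directions.
toy_graph = {
    "bank#finance": ["money#1", "deposit#1"],
    "bank#river": ["water#1"],
    "money#1": ["bank#finance", "deposit#1"],
    "deposit#1": ["bank#finance", "money#1", "water#1"],
    "water#1": ["bank#river", "deposit#1"],
}

# Mass goes to the senses of the context words only,
# not to the senses of the target word "bank".
context_senses = {"money#1": 1.0, "deposit#1": 1.0}

ranks = personalized_pagerank(toy_graph, context_senses)
best_sense = max(["bank#finance", "bank#river"], key=ranks.get)
print(best_sense)
```

In this toy setting the target sense is chosen by comparing the final ranks of the candidate senses of the target word only; a real system would build the graph from a lexical knowledge base rather than by hand.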
Scientific report: "Similarity-Based Methods For Word Sense Disambiguation"

... the two words that make up the pseudo-word. 3.2 Task: Pseudo-word Sense Disambiguation. In the usual word sense disambiguation problem, the method to be tested is presented with an ambiguous word ... deciding which word pairs require a similarity-based estimate, a method for combining information from similar words, and, of course, a function measuring the similarity between wor...

Uploaded: 31/03/2014, 21:20
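The snippet for "Similarity-Based Methods For Word Sense Disambiguation" refers to pseudo-word sense disambiguation, where two words are merged into one artificial ambiguous token and a method is scored on recovering the original word. The sketch below shows only how such test data could be built; the function name `conflate`, the underscore-joined token, and the example sentences are assumptions, not taken from the paper.

```python
def conflate(sentences, word_a, word_b):
    """Replace every occurrence of word_a or word_b with one pseudo-word.

    Returns the rewritten sentences and a gold list of
    (sentence_index, token_index, original_word) triples.
    """
    pseudo = f"{word_a}_{word_b}"
    conflated, gold = [], []
    for sent_index, sentence in enumerate(sentences):
        new_sentence = []
        for token_index, token in enumerate(sentence):
            if token in (word_a, word_b):
                new_sentence.append(pseudo)
                gold.append((sent_index, token_index, token))
            else:
                new_sentence.append(token)
        conflated.append(new_sentence)
    return conflated, gold


# Hypothetical tokenised sentences.
sentences = [["eat", "banana"], ["open", "door"]]
conflated, gold = conflate(sentences, "banana", "door")
print(conflated)
print(gold)
```

Because the original word at each position is known, the gold labels come for free, which is what makes the setup usable without hand-annotated sense data.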