Scientific report: "Personalizing PageRank for Word Sense Disambiguation" docx
... the graph for each target word in the context: for each target word W_i, we concentrate the initial probability mass in the senses of the words surrounding W_i, but not in the senses of the ... subgraph of WordNet which connects the senses of the words in the input text, and then apply traditional PageRank over the subgraph. • To use Personalized PageRank, initializing v with...
Uploaded: 31/03/2014, 20:20
... using the senses provided in WordNet. The senses are ranked using two sources of information: (1) the Internet for gathering statistics for word-word co-occurrences and (2) WordNet for measuring ... and using WordNet, form a similarity list for each sense of that word. For this, use the words from the synset of each sense and the words from the hypernym synsets....
Uploaded: 08/03/2014, 06:20
... exploiting paraphrase information for the target senses rather than relying on the structure of WordNet as a whole. Topic models have also been applied to the related task of word sense induction. ... semantic relations between senses, etc.). Sometimes such detailed information may not be available, for instance for languages for which such a resource does not exist or fo...
Uploaded: 23/03/2014, 16:20
Scientific report: "Domain Kernels for Word Sense Disambiguation" ppt
... of the text in which the word is located is a crucial information for WSD. For example the (domain) polysemy among the COMPUTER SCIENCE and the MEDICINE senses of the word virus can be solved ... this is clearly unfeasible for all-words WSD tasks, in which all the words of an open text should be disambiguated. On the other hand, the word expert approach works very well for l...
Uploaded: 23/03/2014, 19:20
Scientific report: "Unsupervised Relation Discovery with Sense Disambiguation" docx
... features. For example, for pattern “A play B”, pairs which contain B argument “Mozart” could be in one sense, whereas pairs which have “Mets” could be in another sense. Words: The words between ... produced by sense disambiguation. For each sense, we randomly sample 5 entity pairs. We also show top features for each sense. Each row shows one feature type, where “num” stands fo...
Uploaded: 23/03/2014, 14:20
Scientific report: "Data Cleaning for Word Alignment" pdf
... mechanism to augment one source word into several source words or delete a source word, while a NULL insertion is a mechanism of generating several words from blank words. Fertility uses a conditional ... score S W B,X for each pair of sentences where X is 4, 3, 2, and 1 for word-based MT decoder. Step 3: Train phrase-based MT for full parallel corpus. Note that we do not need to...
Uploaded: 08/03/2014, 01:20
Scientific report: "Combining Clues for Word Alignment" pdf
... source language word per row and one target language word per column. The cells inside the matrix can be filled with the combined clue values for the corresponding word pairs. Henceforth, this ... 0.86 0 The matrix is simply filled with all values of combined clues for each word pair. For example, the total clue value for the word pair s = "baggage" and t = "...
Uploaded: 08/03/2014, 21:20
Scientific report: "Hierarchical Search for Word Alignment" ppt
... worse to align two English words at different ends of the tree to the same foreign word, than it is to align two English words under the same NP to the same foreign word. To see why a string distance ... Log-linear Models for Word Alignment In Proceedings of the 43rd Annual Meeting of the ACL. Ann Arbor, Michigan. USA. Robert C. Moore. 2005. A Discriminative Framework for Word Alig...
Uploaded: 23/03/2014, 16:20
Document: Scientific report: "EM Works for Pronoun Anaphora Resolution" docx
... 2009. © 2009 Association for Computational Linguistics EM Works for Pronoun Anaphora Resolution Eugene Charniak and Micha Elsner Brown Laboratory for Linguistic Information Processing (BLLIP) Brown ... 1998). The data annotated for the Ge research is used here for testing and development data. Also, there are many overlaps between their formulation of the problem and ours. For...
Uploaded: 22/02/2014, 02:20
Scientific report: "New Models for Improving Supertag Disambiguation" pdf
... statistical information, in the form of a trigram model based on the distribution of supertags in an LTAG parsed corpus, can be used to choose the most appropriate supertag for any given word. ... supertags for the word ]eared: a more frequent one corresponding to a subcategorization of NP object (as ~n of Figure 1) and a less frequent one to a S complement. The supertag...
Uploaded: 08/03/2014, 21:20