Building word sense taxonomy and automatic annotation for mandarin chinese 2

Báo cáo khoa học: "Topic Models for Word Sense Disambiguation and Token-based Idiom Detection" pdf

Báo cáo khoa học: "Topic Models for Word Sense Disambiguation and Token-based Idiom Detection" pdf

... neighbour for the chosen word and then assigns a sense based on the word, its neighbour and the topic Boyd-Graber and Blei (2007) test their method on WSD and information retrieval tasks and find ... corrected any bad tags and lemmas for the target instances.4 Sense Paraphrases For word sense disambiguation tasks, the paraphrases of the sense keys are represent...

Ngày tải lên: 23/03/2014, 16:20

10 371 0
Báo cáo khoa học: "Automatic Annotation for All Semantic Layers in FrameNet" potx

Báo cáo khoa học: "Automatic Annotation for All Semantic Layers in FrameNet" potx

... word in italics Semantic Entities in FrameNet The semantic annotation in FrameNet consists of a set of layers One of the layers defines the target, and the other layers provide additional information ... another predicate in the sentence: cling on In FrameNet, such predicates are annotated using the G OV label The constituent that contains the slot filler in question is...

Ngày tải lên: 08/03/2014, 21:20

4 414 0
Báo cáo khoa học: "Latent Semantic Word Sense Induction and Disambiguation" pdf

Báo cáo khoa học: "Latent Semantic Word Sense Induction and Disambiguation" pdf

... perform word sense induction, our model is capable of performing both word sense induction and disambiguation 3.1 find the matrices W and H for which the KullbackLeibler divergence between A and WH ... Ide and Yorick Wilks 2007 Making Sense About Sense In Eneko Agirre and Philip Edmonds, editors, Word Sense Disambiguation, Algorithms and Applications, pages 47–73...

Ngày tải lên: 23/03/2014, 16:20

10 312 0
Domain adaptation and training data acquisition in wide coverage word sense disambiguation and its application to information retrieval

Domain adaptation and training data acquisition in wide coverage word sense disambiguation and its application to information retrieval

... 4.2 In -Domain and Out-of -Domain Evaluation 47 4.2.1 47 Training and Evaluating on OntoNotes ii 4.2.2 4.3 Using Out-of -Domain Training Data 49 Concatenating ... attempt to use the existing training data of one word as the training data for other words Kohomban and Lee (2005) tried to use training examples of words different from the actual word to be cla...

Ngày tải lên: 09/09/2015, 10:06

132 227 0
Tài liệu Báo cáo khoa học: "Unsupervized Word Segmentation: the case for Mandarin Chinese" doc

Tài liệu Báo cáo khoa học: "Unsupervized Word Segmentation: the case for Mandarin Chinese" doc

... distributions for words vs non-words, we observed that the VBE at both boundaries were the most discriminative value Therefore, we decided to take in account the VBE only at the word- candidate ... measure without the need for fine-tuning the balance between the two The evolution of the results w.r.t word length is consistent with the supervized cross-evaluation resu...

Ngày tải lên: 19/02/2014, 19:20

5 467 1
Báo cáo khoa học: "An HMM-Based Approach to Automatic Phrasing for Mandarin Textto-Speech Synthesis" doc

Báo cáo khoa học: "An HMM-Based Approach to Automatic Phrasing for Mandarin Textto-Speech Synthesis" doc

... syllables of disyllabic word in Mandarin Chinese”, Proc ICSLP, Hirschberg, J., 1996 “Training intonational phrasing rules automatically for English and Spanish text -to- speech”, Speech Communication, ... word format and some POS-based ones on the same training set and test set Overall, HMMpath-I can achieve high accuracy by about 10% Conclusions/Discussions We described an approach...

Ngày tải lên: 17/03/2014, 04:20

6 342 0
Báo cáo khoa học: "TBL-Improved Non-Deterministic Segmentation and POS Tagging for a Chinese Parser" pdf

Báo cáo khoa học: "TBL-Improved Non-Deterministic Segmentation and POS Tagging for a Chinese Parser" pdf

... tag :A> B tag :A> B tag :A> B tag :A> B tag :A> B tag :A> B

Ngày tải lên: 17/03/2014, 22:20

9 357 0
Grammar and vocabulary games for children - part 2 ppsx

Grammar and vocabulary games for children - part 2 ppsx

... or D ents it Car er pent Labour er Buider l Sci i t ents L andsca es B each D es ' el t M ount n G r s and as l Clf if ls and l Far l m and Gl e aci r Game #1 ) VowelLengt hs: R em i t chidr t ... he hidr t f l ,pr ded t com m and i begun by Esi on says ' l a com m and i gi olow ovi he s t m ' f s ven w ihout tsi on s 'bef e i and a chid per or s t com m and, he i out The t t m a...

Ngày tải lên: 08/08/2014, 08:22

10 359 1
Generation of prosody and speech for mandarin chinese

Generation of prosody and speech for mandarin chinese

... research is an investigation of the problem of prosody generation for Mandarin Chinese text-to -speech system I mainly work on two issues of prosody: (1) The prediction of prosodic phrase breaks, ... machine-readable form, such as a text file The subject in this research is Mandarin Chinese TTS Therefore, the input of the system is Chinese text in the form of Ch...

Ngày tải lên: 17/09/2015, 17:20

208 2,7K 0
Báo cáo khoa học: "A Taxonomy, Dataset, and Classifier for Automatic Noun Compound Interpretation" potx

Báo cáo khoa học: "A Taxonomy, Dataset, and Classifier for Automatic Noun Compound Interpretation" potx

... noun compounds by Nakov (2008) to collect short phrases for linking the nouns within noun compounds For the Mechanical Turk annotation tests, we created five sets of 100 noun compounds from noun compounds ... 100 noun compounds was uploaded along with category defini- Classification The approaches used for automatic classification are also varied Vanderwende (1994) presents one...

Ngày tải lên: 23/03/2014, 16:20

10 475 0
Báo cáo khoa học: "A Combination of Active Learning and Semi-supervised Learning Starting with Positive and Unlabeled Examples for Word Sense Disambiguation: An Empirical Study on Japanese Web Search Query" pdf

Báo cáo khoa học: "A Combination of Active Learning and Semi-supervised Learning Starting with Positive and Unlabeled Examples for Word Sense Disambiguation: An Empirical Study on Japanese Web Search Query" pdf

... 2006 An Empirical Study of the Behavior of Active Learning for Word Sense Disambiguation, Proc of the main conference on Human Language Technology Conference of the North American Chapter of ACL, ... A combination of active learning and semi-supervised learning starting with positive and unlabeled examples Proposed Algorithm At the beginni...

Ngày tải lên: 23/03/2014, 17:20

4 441 1
EXPLOITING TAGGED AND UNTAGGED CORPORA FOR WORD SENSE DISAMBIGUATION

EXPLOITING TAGGED AND UNTAGGED CORPORA FOR WORD SENSE DISAMBIGUATION

... a domain specific sense) in the sense tagged corpus and there is a large amount of untagged corpora that contain instances for both general senses and the missed sense, then a sense tagger built ... predefined sense inventories for target words The information for semi-supervised sense disambiguation is usually obtained from bilingual corpora (e.g parallel corpora...

Ngày tải lên: 12/09/2015, 11:05

99 128 0
Automatic generation of labelled data for word sense disambiguation

Automatic generation of labelled data for word sense disambiguation

... disambiguate the word sense of words in the context of their usage This is the task of Word Sense Disambiguation Given an occurrence of a word w in a natural language text, the task of word sense disambiguation ... Mapping from SENSEVAL to WordNet Since we need to use the information of WordNet, we have to map the sense of SENSEVAL format to WordNet format SE...

Ngày tải lên: 30/09/2015, 14:24

77 151 0
w