Scientific report: "Learning Surface Text Patterns for a Question Answering System" doc
... Learning Surface Text Patterns for a Question Answering System Deepak Ravichandran and Eduard Hovy Information Sciences Institute University of Southern California 4676 Admiralty Way Marina ... few hand-crafted examples of each question type to Altavista. Patterns are then automatically extracted from the returned documents and standardized. We calculate the preci...
Uploaded: 08/03/2014, 07:20
... Figure 3: FSM representing all segmentations for the word ANJANI ... of detection task performance. In Eurospeech. Carolina Parada, Abhinav Sethy, and Bhuvana Ramabhadran. 2009. Query-by-example spoken term detection for OOV terms. In ASRU. Carolin...
Uploaded: 20/02/2014, 04:20
... mining, where each sales transaction in a database can be considered as a sentence in the corpora, and each item in a transaction denotes a word in a sentence. An association language pattern ... Traditional approaches to sentence classification (Khoo et al., 2006; Naughton et al., 2008) or text categorization (Sebastiani 2002) usually adopt bag-of-words as baseline featu...
Uploaded: 08/03/2014, 01:20
Scientific report: "Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations" docx
... Generic Patterns for Automatically Harvesting Semantic Relations Patrick Pantel Information Sciences Institute University of Southern California 4676 Admiralty Way Marina del Rey, CA 90292 ... Web and syntactic expansions to compensate for lacks of redundancy in small corpora; • Generality: Espresso is amenable to a wide variety of binary relations, from classical is-a...
Uploaded: 08/03/2014, 02:21
Scientific report: "Dealing with distinguishing descriptions in a guided composition system" doc
... contrary, 'l'oiseau' ('the bird') is correct and can lead to at least three different a~ 'l'oiseau qui chante' ('the bird that sings') which designate ... and only one entity among others in a context set. Many natural language interfaces need to deal with this sort of NPs. We develop an application where these NPs occur in a particula...
Uploaded: 31/03/2014, 04:20
Document: Scientific report: "Learning with Unlabeled Data for Text Categorization Using Bootstrapping and Feature Projection Techniques" doc
... data (Ko and Seo, 2004). How can labeled training data be automatically created from unlabeled data and title words? Maybe unlabeled data don’t have any information for building a text classifier ... labeled data. While labeled data are difficult to obtain, unlabeled data are readily available and plentiful. Therefore, this paper advocates using a bootstrapping framework and a...
Uploaded: 20/02/2014, 16:20
Document: Scientific report: "Learning to Find Translations and Transliterations on the Web" doc
... Phrase translation and transliteration is important for cross-language tasks. For example, Knight and Graehl (1998) describe and evaluate a multi-stage machine translation method for back transliterating ... to prepare the training data. 3 Method To find translations for a given term on the Web, a promising approach is automatically learning to extract phrasal translation...
Uploaded: 19/02/2014, 19:20
Document: Scientific report: "Learning the Latent Semantics of a Concept from its Definition" pptx
... the last 10 sampling iterations. We also set a threshold to elesk similarity values, which yields better performance. Same as (Sinha and Mihalcea, 2007), values of elesk larger than 240 are set ... senses across dictionaries, hence Wik is only used as augmented data for WMF to better learn the semantics of words. All data is tokenized, POS tagged (Toutanova et al., 2003) and lemmatized, ....
Uploaded: 19/02/2014, 19:20
Document: Scientific report: "Learning Word-Class Lattices for Definition and Hypernym Extraction" doc
... but are rather uninformative, like “dynamic programming was the brainchild of an american mathematician”, as well as informative sentences that are not definitional (e.g., they do not have a hypernym), ... UK. Judith Klavans and Smaranda Muresan. 2001. Evaluation of the DEFINDER system for fully automatic glossary construction. In Proc. of the American Medical Informatics Association...
Uploaded: 20/02/2014, 04:20
Scientific report: "Learning Context-Dependent Mappings from Sentences to Logical Form" doc
... to logical form. The training examples are sequences of sentences annotated with lambda-calculus meaning representations. We develop an algorithm that maintains explicit, lambda-calculus ... logical form for Example 1(a), which is directly available in C, produces the desired final meaning. Elaborations Later statements can expand the meaning of previous ones in ways that are diffi-...
Uploaded: 23/03/2014, 16:21