Scientific report: "Exploiting Multiple Treebanks for Parsing with Quasi-synchronous Grammars" doc
... Association for Computational Linguistics, pages 675–684, Jeju, Republic of Korea, 8-14 July 2012. © 2012 Association for Computational Linguistics Exploiting Multiple Treebanks for Parsing with Quasi-synchronous Grammars Zhenghua ... present a simple and effective framework for exploiting multiple monolingual treebanks with different annotation guidelines for pars-...
Uploaded: 16/03/2014, 19:20
... heterogeneous treebanks for target grammar parsing. Here heterogeneous treebanks refer to two or more treebanks with different grammar formalisms, e.g., one treebank annotated with dependency ... heterogeneous treebanks for parsing by breaking it down into two sub-problems, converting grammar formalisms of the treebanks to the same one, and parsing on these homo...
Uploaded: 17/03/2014, 01:20
... the use of the system for implementing different types of tree based grammars. Section 5 concludes with pointers for further research and improvements. 2 Linguistic formalism As mentioned ... tree description and/or of semantic formulas. The XMG formalism furthermore supports the sharing of identifiers across dimension hence allowing for a straightforward encoding of the synt...
Uploaded: 22/02/2014, 02:20
Scientific report: "Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study" potx
... problem is particular severe for WSD, since sense-tagged data must be collected separately for each word in a language. One source to look for potential training data for WSD is parallel texts, ... channel together with the 3-sentence context in English surrounding channel then forms a training example for a supervised WSD program in the next step. The average time taken...
Uploaded: 08/03/2014, 04:22
Scientific report: "Exploiting Web Redundancy for Answer Validation" pptx
... off non-content words with a stop-words filter. The remaining words are expanded with both synonyms and morphological forms in order to maximize the recall of retrieved documents. Synonyms ... disturbing elements. As for morphology, verbs are expanded with all their tense forms (i.e. present, present continuous, past tense and past participle). Synonyms and morphological forms are...
Uploaded: 08/03/2014, 07:20
Scientific report: "Exploiting Feature Hierarchy for Transfer Learning in Named Entity Recognition" ppt
... datasets similarly labeled with person names, we are additionally adding biological corpora (UT & YAPEX), labeled not with person names but with protein names instead, along with the CSPACE e-mail ... techniques employed, along with specifications for the design and training of our hierarchical prior. Finally, in §3 we present an empirical investigation of our prior’s per-...
Uploaded: 23/03/2014, 17:20
Scientific report: "EXPLOITING CONVERSATIONAL IMPLICATURE FOR GENERATING CONCISE EXPLANATIONS" pdf
... concentrated our efforts on presenting different types of knowledge and their interrelations because this kind of information is typically relevant for explanations. We formally reconstruct ... context so that the subset obtained still conveys the same information - in a partially implicit and more concise form, but without leading to wrong implications. The intuition behind ....
Uploaded: 24/03/2014, 05:21
Document: Scientific report: "Mixing Multiple Translation Models in Statistical Machine Translation" docx
... for Fr2En when using uniform weights, tuned weights and normalization heuristic. The tuned BLEU scores are averaged over three runs with multiple initial points, as in (Clark et al., 2011), with ... sentence. We use the bottom-up CKY parsing algorithm for decoding. For each sentence, a CKY chart is constructed. The cells of the CKY chart are populated with appropriate rules f...
Uploaded: 19/02/2014, 19:20
Document: Scientific report: "A Gibbs Sampler for Phrasal Synchronous Grammar Induction" docx
... p_null = 10^−10 for this value in the experiments we report below. either φ P z i for phrase pairs or φ null for single language phrases. We choose Dirichlet process (DP) priors for these parameters: φ P z i ∼ ... translation units. 4.2 A Gibbs sampler for derivations Markov chain Monte Carlo sampling allows us to perform inference for the model described in 4.1 without rest...
Uploaded: 20/02/2014, 07:20
Scientific report: "A speech interface for open-domain question-answering" doc
... as hyperlinks to the source documents. This is convenient for Web users but should be transformed for spoken output. It now offers plain text as an alternative to HTML for output. We have also ... implemented systems for the Pocket PC that interpret queries spoken in English or Chinese. This last group appears to be at the forefront of current research in spoken interfaces for...
Uploaded: 08/03/2014, 04:22