... small-scale corpus that comes from FBIS corpus consisting of 239K bilingual sentence pairs, the other is a large-scale corpus that includes 1.55M bilingual sentence pairs from LDC. The 2002 NIST ... reordering models play a crucial role in phrase-based translation systems. They are usually learned from the word-aligned bilingual corpus by examining the reordering relations of adjacent...
Uploaded: 23/03/2014, 16:20
... information in translation systems. Syntax-based MT often suffers from inadequate constraints in the translation rules extracted, or from striving to combine these rules together towards a full derivation. ... synchronous span, such as that of Figure 3. Each label describes different properties of the phrase pair (syntactic, semantic etc.), possibly in relation to its context, or...
Uploaded: 20/02/2014, 04:20
Scientific report: "Learning Condensed Feature Representations from Large Unsupervised Data Sets for Supervised Learning"
... 3.1: Feature potency estimation [figure annotation: features mapped into this area will be zeroed by the effect of C] Section 3.2: Feature potency discounting ... condensed feature h_m ∈ H is charac- ... [figure annotation: potencies are multiplied by a positive constant δ] Section 3.3: Feature potency quantization Section 3.4: Condens...
Uploaded: 07/03/2014, 22:20
Scientific report: "Learning Expressive Models for Word Sense Disambiguation"
... experiment with a monolingual task by using the verbs from Senseval-3 lexical sample task. We explore knowledge from 12 syntactic, semantic and pragmatic sources. In principle, the proposed ... of verbs generally benefits from more specific knowledge sources, such as the verb’s relation to other items in the sentence (for example, by analysing the semantic type of its subje...
Uploaded: 08/03/2014, 02:21
Scientific report: "Learning High-Level Planning from Text"
... can be made from wood. (a) Low Level Actions for: wood → pickaxe → stone step 1: move from (0,0) to (2,0) step 2: chop tree at: (2,0) step 3: get wood at: (2,0) step 4: craft plank from wood step ... relations from text and to perform high-level planning guided by those relations. We implement this idea in the reinforcement learning framework using feedback automatically obtai...
Uploaded: 16/03/2014, 19:20
Scientific report: "Building Accurate Semantic Taxonomies from Monolingual MRDs"
... order to accurately build large taxonomies from any machine-readable dictionary (MRD). Our aim is to profit from conventional MRDs, with no explicit semantic coding. We propose a system that ... accurate taxonomies from DGILE. Finally, some conclusions are drawn. 2 Acquiring taxonomies from MRDs A straightforward way to obtain a LKB acquiring taxonomic relations from dic...
Uploaded: 17/03/2014, 07:20
Scientific report: "Learning Translation Consensus with Structured Label Propagation"
... systems. Hypothesis mixture decoding (Duan et al., 2011) performs a second decoding process where the search space is enriched with new hypotheses composed out of existing hypotheses from multiple ... decoding for lattices and hypergraphs. Others extend consensus among translations from the same MT system to those from different MT systems. Collaborative decoding (Li et al., 200...
Uploaded: 23/03/2014, 14:20
Scientific report: "Learning Context-Dependent Mappings from Sentences to Logical Form"
... ex- Example #1: (a) show me the flights from boston to philly λx.flight(x) ∧ from(x, bos) ∧ to(x, phi) (b) show me the ones that leave in the morning λx.flight(x) ∧ from(x, bos) ∧ to(x, phi) ∧ during(x, ... aircraft(x) = y Example #2: (a) show me flights from milwaukee to orlando λx.flight(x) ∧ from(x, mil) ∧ to(x, orl) (b) cheapest argmin(λx.flight(x) ∧ from(x, mil) ∧ to(x, or...
Uploaded: 23/03/2014, 16:21
Scientific report: "Learning to Extract Relations from the Web using Minimal Supervision"
... experimental results demonstrating that our approach is able to accurately extract relations from the web by learning from such weak supervision. 2 Problem Description We address the task of learning a ... be seen as an instance of multiple instance learning. However, there are important properties that set it apart from problems previously considered in MIL. The most distinguishing...
Uploaded: 23/03/2014, 18:20
Scientific report: "Learning PP attachment for filtering prosodic phrasing"
... Dutch PP attachment instances from a syntactic treebank. Collection of this data is described in Section 2. We extract lexical head features (unigram and bigram) from the treebank occurrences, and ... Cooccurrence strength values are typically estimated from a very large corpus. Often, these corpora are static and do not contain neologisms and names from later periods. In this pap...
Uploaded: 24/03/2014, 03:20