Scientific report: "Bootstrapping Word Alignment via Word Packing" doc

Scientific report: "Bootstrapping Coreference Resolution Using Word Associations" potx

... features), named entity features and semantic word class features (e.g., from WordNet) that do not distinguish, say, Obama from Hawking. In our approach, word association information is used for ... bootstraps a complete coreference resolution (CoRe) system from word associations mined from a large unlabeled corpus. We show that word associations are useful for CoRe – e....
Uploaded: 07/03/2014, 22:20
Scientific report: "Bootstrapping Word Alignment via Word Packing" doc

... Chinese–English word alignment. 2.1 The Treatment of 1-to-n Alignments Fertility-based models such as IBM models 3, 4, and 5 allow for alignments between one word and several words (1-to-n or 1:n alignments ... introduce a simple method to pack words for statistical word alignment. Our goal is to simplify the task of automatic word alignment by packing several consecutive wo...
Uploaded: 31/03/2014, 01:20
Document: Scientific report: "Finding Synonyms Using Automatic Word Alignment and Measures of Distributional Similarity" pdf

... automatic word alignment. Context vectors are built from the alignments found in a parallel corpus. Each aligned word type is a feature in the vector of the target word under consideration. The alignment ... for the automatic word alignment described below. 5.2.2 Alignment Context Context vectors are populated with the links to words in other languages extracted from automa...
Uploaded: 20/02/2014, 12:20
Scientific report: "Soft Syntactic Constraints for Word Alignment through Discriminative Training" pot

... bilingual word alignment finds word-to-word connections across languages. Originally introduced as a byproduct of training statistical translation models in (Brown et al., 1993), word alignment ... improved alignments. 2 Constrained Alignment Let an alignment be the complete structure that connects two parallel sentences, and a link be one of the word-to-word connecti...
Uploaded: 08/03/2014, 02:21
Scientific report: "Bootstrapping via Graph Propagation" potx

... task. The tasks of Eisner and Karakos (2005) are word sense disambiguation on several English words which have two senses corresponding to two different words in French. Data was extracted from the ... used all examples. cent to the word to be disambiguated, and original and lemmatized context words in the same sentence. Their seeds are pairs of adjacent word features, with...
Uploaded: 16/03/2014, 19:20
Scientific report: "Confidence Measure for Word Alignment" potx

... confidence sentence alignments and alignment links from multiple word alignments of the same sentence pair. Additionally, we remove low confidence alignment links from the word alignment of a bilingual ... based on word alignment. In this paper we introduce a confidence measure for word alignment, which is robust to extra or missing words in the bilingual sentence pairs, a...
Uploaded: 17/03/2014, 01:20
Scientific report: "Flow Network Models for Word Alignment and Terminology Extraction from Bilingual Corpora" docx

... and French words, an empty English word, and an empty French word, • E comprises edges from the source to all the English words (including the empty one), edges from all the French words (including ... edge from an English word $e_i$ (excluding the empty word) to a French word $f_j$ (excluding the empty word) to be $-\ln p(e_i, f_j)$, the cost of an edge involving an empty word...
Uploaded: 17/03/2014, 07:20
Scientific report: "Diversify and Combine: Improving Word Alignment for Machine Translation on Low-Resource Languages" docx

... of word alignment models. In this work, we leverage existing aligners and generate multiple sets of word alignments based on complementary information, then combine them to get the final alignment ... and Pashto word to generate one more alternative for the word alignment. 3 Confidence-Based Alignment Combination Now we describe the algorithm to combine multiple sets of w...
Uploaded: 23/03/2014, 16:20
Scientific report: "Log-linear Models for Word Alignment" ppt

... translation, word alignment plays a crucial role as word-aligned corpora have been found to be an excellent source of translation-related knowledge. Various methods have been proposed for finding word alignments ... alignment of words within idiomatic expressions, free translations, and missing content or function words is problematic. When two languages widely differ in word order...
Uploaded: 31/03/2014, 03:20
Document: Scientific report: "Conditional Random Fields for Word Hyphenation" docx

... overall word-level errors: #words with at least one FP or FN; swe: serious word-level errors, #words with at least one FP; ower: overall word-level error rate, owe / (total #words); swer: serious word-level ... 89,019 hyphenated English words from CELEX, we get 4,000 words. The words that are omitted are proper names, contractions, incomplete words containing apostrophes, and abbreviations s...
Uploaded: 20/02/2014, 04:20