... a family of word alignment. Definition 1. The ITG alignment family is a set of word alignments that has at least one BTG derivation. ITG alignment family is only a subset of word alignments because ... ambiguity in word alignment is the case where two or more derivations d_1, d_2, d_k of G have the same underlying word alignment A. A grammar G is non-spurious if for any given word alignment, ... Null-word Attachment Ambiguity. Definition 4. For any given sentence pair (e, f) and its alignment A, let (e , f ) be the sentence pairs with all null-aligned words removed from (e, f). The alignment...
Uploaded: 07/03/2014, 22:20
Uploaded: 30/03/2014, 21:20
Document: Scientific report: "Word Alignment with Synonym Regularization" doc
... models into low-frequency word pairs in bilingual sentences, and then improved the word alignment performance. The SRH regards all of the different words coupled with the same word in the synonym pairs ... sen- (Figure 1: Graphical model of HM-BiTAM) alignment quality. 2 Bilingual Word Alignment Model. In this section, we review a conventional generative word alignment model, HM-BiTAM (Zhao and Xing, ... (f_{jn} | E_n, a_{jn}, z_n; B): sample a target word f_{jn} given an aligned source word and topic, where alignment a_{jn} = i denotes that source word e_i and target word f_{jn} are aligned. α is a parameter...
Uploaded: 20/02/2014, 04:20
Document: Scientific report: "Learning Sub-Word Units for Open Vocabulary Speech Recognition" doc
... coherence. Hybrid word/sub-word recognizers can produce a sequence of sub-word units in place of OOV words. Ideally, the recognizer outputs a complete word for in-vocabulary (IV) utterances, and sub-word ... sub-word units can expand full-words, we refer to both words and sub-words simply as units. (Footnote 2: The model can also take multiple pronunciations (§3.1).) to phone models (Chen, 2003). Since sub-words ... recognize words beyond their vocabulary, many of which are information rich terms, like named entities or foreign words. Hybrid word/sub-word systems solve this problem by adding sub-word units...
Uploaded: 20/02/2014, 04:20
Document: Scientific report: "Yet Another Word Alignment Tool" docx
... translations by word alignment but also because of such interface issues that aligning words manually has the reputation of being a very tedious task. 3 Yawat. Yawat (Yet Another Word Alignment Tool) ... Explorer. Figure 3: Alignment visualization with Yawat. As the mouse is moved over a word, the word and all words linked with it are highlighted. The highlighting is removed when the mouse leaves the word ... the term word alignment (Footnote 1: Yawat was first presented at the 2007 Linguistic Annotation Workshop (Germann, 2007).) to refer to any form of alignment that identifies words or groups of words as...
Uploaded: 20/02/2014, 09:20
Document: Scientific report: "Going Beyond AER: An Extensive Analysis of Word Alignments and Their Impact on MT" pdf
... improvements on word alignments (Ayan et al., 2005; Moore, 2005; Ittycheriah and Roukos, 2005; Taskar et al., 2005). The standard technique for evaluating word alignments is to represent alignments ... algorithms to generate word alignments. However, evaluating word alignments is difficult because even humans have difficulty performing this task. The state-of-the-art evaluation metric, alignment error ... pairs of words) and to compare the generated alignment against manual alignment of the same data at the level of links. Manual alignments are represented by two sets: Probable (P) alignments...
Uploaded: 20/02/2014, 11:21
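The snippet above is cut off before it defines the metric it names. As a rough illustration only, the sketch below assumes the commonly used definition of alignment error rate, AER = 1 - (|A ∩ S| + |A ∩ P|) / (|A| + |S|), where A is the generated alignment, S the Sure links and P the Probable links (with S treated as a subset of P). The function name and the toy link sets are hypothetical and are not taken from the listed paper.

```python
def alignment_error_rate(hypothesis, sure, probable):
    """Compute AER over sets of (source_index, target_index) links.

    Assumes the usual definition: 1 - (|A & S| + |A & P|) / (|A| + |S|).
    """
    a = set(hypothesis)
    s = set(sure)
    p = set(probable) | s  # every sure link also counts as probable
    if not a and not s:
        return 0.0  # nothing proposed and nothing required: no error
    return 1.0 - (len(a & s) + len(a & p)) / (len(a) + len(s))


# Toy example (made-up links, for illustration only).
sure_links = {(0, 0), (1, 1)}
probable_links = {(0, 0), (1, 1), (2, 1)}
proposed = {(0, 0), (2, 1), (2, 2)}
aer = alignment_error_rate(proposed, sure_links, probable_links)
print(round(aer, 3))
```

A proposed alignment that matches the Sure set exactly scores 0 under this definition, which is why the metric is computed at the level of individual links.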
Document: Scientific report: "Discriminative Word Alignment with Conditional Random Fields" ppt
... model many-to-one word alignments, where each source word is aligned with zero or one target words, and therefore each target word can be aligned with many source words. Each source word is labelled ... one-to-many alignments, where each target word is aligned with zero or more source words. Many-to-many alignments are recoverable using the standard techniques for superimposing predicted alignments ... null, denoting no alignment. An example word alignment is shown in Figure 1, where the hollow squares and circles indicate the correct alignments. In this example the French words une and autre...
Uploaded: 20/02/2014, 11:21
Document: Scientific report: "Finding Synonyms Using Automatic Word Alignment and Measures of Distributional Similarity" pdf
... automatic word alignment. Context vectors are built from the alignments found in a parallel corpus. Each aligned word type is a feature in the vector of the target word under consideration. The alignment ... for the automatic word alignment described below. 5.2.2 Alignment Context. Context vectors are populated with the links to words in other languages extracted from automatic word alignment. We applied ... the target word; P(W) is the probability of seeing the word; P(f) is the probability of seeing the feature; P(W,f) is the probability of seeing the word and the feature together. 3.3 Word Alignment. The...
Uploaded: 20/02/2014, 12:20
Document: Scientific report: "Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs" pptx
... language word similarity sim(c, f; e) of the Chinese word c and the Japanese word f given the English word e. (Figure 1. Similarity Calculation) For the ambiguous English word e, ... context word e_j. ct_{ij} = 0 if e_j does not occur in Set_i. (4) Given the English word e, calculate the cross-language word similarity between the Chinese word and the Japanese word ... one for head words and the other for non-head words. Distortion Probability for Head Words. The distortion probability for head words represents the relative position of the head word of the...
Uploaded: 20/02/2014, 12:20
Document: Scientific report: "Guiding Statistical Word Alignment Models With Prior Knowledge" pdf
... as 1. In building word alignment models, a special "NULL" word is usually introduced to address target words that align to no source words. Since this physically non-existing word is not in the ... a_1^m specifies the indices of source words that target words are aligned to. In an HMM-based word alignment model, source words are treated as Markov states while target words are observations that are ... generative word alignment models. Prior knowledge serves as soft constraints that shall be placed on translation lexicon to guide word alignment model training and disambiguation during Viterbi alignment...
Uploaded: 20/02/2014, 12:20
Scientific report: "Data Cleaning for Word Alignment" pdf
... are less than 20 percent. 2 1:n Word Alignment. Our discussion of uni-directional alignments of word alignment is limited to IBM Model 4. Definition 1 (Word alignment task). Let e_i be the i-th ... two word alignments as an alignment point, 2) add new alignment points that exist in the union with the constraint that a new alignment point connects at least one previously unaligned word, ... mechanism to augment one source word into several source words or delete a source word, while a NULL insertion is a mechanism of generating several words from blank words. Fertility uses a conditional...
Uploaded: 08/03/2014, 01:20
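The two-step combination hinted at in the snippet above can be sketched as follows. This is a simplified illustration, not the paper's procedure: step 1 is elided in the snippet, so the sketch assumes it means starting from the intersection of the two directional alignments, and it omits the neighbourhood checks that full symmetrization heuristics usually add. The function name `symmetrize` and the example links are hypothetical.

```python
def symmetrize(e2f, f2e):
    """Combine two directional alignments given as sets of (i, j) links.

    Step 1 (assumed): keep links present in both directions.
    Step 2: add links from the union only if they connect at least one
    word that is still unaligned.
    """
    e2f, f2e = set(e2f), set(f2e)
    alignment = e2f & f2e
    aligned_e = {i for i, _ in alignment}
    aligned_f = {j for _, j in alignment}
    # Sorted iteration keeps the result deterministic.
    for i, j in sorted((e2f | f2e) - alignment):
        if i not in aligned_e or j not in aligned_f:
            alignment.add((i, j))
            aligned_e.add(i)
            aligned_f.add(j)
    return alignment


forward = {(0, 0), (1, 1), (0, 1)}
backward = {(0, 0), (1, 1), (2, 2)}
print(sorted(symmetrize(forward, backward)))
```

In this toy case the link (0, 1) is rejected because both of its words are already aligned, while (2, 2) is accepted because it connects previously unaligned words.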
Scientific report: "Improved Discriminative Bilingual Word Alignment" pdf
... sums, for each word w, the number of words not linked to w that fall between the first and last words linked to w. The other feature counts only such words that are linked to some word other than w. ... have a function word not linked to anything, between two words linked to the same word. Exact match feature: We have a feature that sums the number of words linked to identical words. This is motivated ... association with respect to a word in a sentence pair to be the number of association types (word-type to word-type) for that word that have higher association scores, such that words of both types occur...
Uploaded: 08/03/2014, 02:21
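The first feature described in the snippet above (for each word w, count the words not linked to w that fall between the first and last words linked to w) lends itself to a small worked example. The sketch below is one possible reading under stated assumptions: links are (source_index, target_index) pairs, and the count is taken over words on both sides of the sentence pair. The function name is hypothetical and the paper's exact definition may differ.

```python
from collections import defaultdict


def unlinked_words_in_span(links):
    """Sum, over every word w, the words inside w's link span not linked to w."""
    by_source = defaultdict(set)
    by_target = defaultdict(set)
    for i, j in links:
        by_source[i].add(j)
        by_target[j].add(i)
    total = 0
    for linked in list(by_source.values()) + list(by_target.values()):
        span = max(linked) - min(linked) + 1  # positions from first to last link
        total += span - len(linked)  # positions in the span not linked to w
    return total


# Source word 0 links to target words 0 and 2; target word 1 sits in between.
print(unlinked_words_in_span({(0, 0), (0, 2), (1, 1)}))
```

The second feature in the snippet would further restrict the count to in-between words that are themselves linked to some other word; it is not implemented here.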
Scientific report: "Soft Syntactic Constraints for Word Alignment through Discriminative Training" pot
... improved alignments. 2 Constrained Alignment. Let an alignment be the complete structure that connects two parallel sentences, and a link be one of the word-to-word connections that make up an alignment. ... to use a discriminative learning method to train an ITG bitext parser. 1 Introduction. Given a parallel sentence pair, or bitext, bilingual word alignment finds word-to-word connections across ... traditional word alignment techniques. Otherwise, the features remain the same, including distance features that measure abs(j/|E| − k/|F|); orthographic features; word frequencies; common-word...
Uploaded: 08/03/2014, 02:21
Scientific report: "Boosting Statistical Word Alignment Using Labeled and Unlabeled Data" ppt
... methods for word alignment. In addition, we improve the word alignment results by combining the results of the two semi-supervised boosting methods. Experimental results on word alignment ... Statistical Word Alignment. In Proc. of the 10th Machine Translation Summit, pages 313-320. Hua Wu, Haifeng Wang, and Zhanyi Liu. 2005. Alignment Model Adaptation for Domain-Specific Word Alignment. ... train the alignment models with unlabeled data. A question about word alignment is whether we can further improve the performances of the word aligners with available data and available alignment...
Uploaded: 08/03/2014, 02:21
Scientific report: "Tailoring Word Alignments to Syntactic Machine Translation" docx
... (Figure residue: example sentence pair "The jobs are career oriented ." / "les emplois sont axés sur la carrière ." with part-of-speech tags; legend: correct proposed word alignment consistent with human annotation; proposed word alignment error inconsistent with human annotation; word alignment constellation that renders ...) ... word alignment. This dependence runs deep; for example, Galley et al. (2006) requires word alignments to project trees from the target language to the source, while Chiang (2005) requires alignments ... compatibility with the word alignment. For a constituent c of t, we consider the set of source words s_c that are aligned to c. If none of the source words in the linear closure s*_c (the words between...
Uploaded: 08/03/2014, 02:21
Scientific report: "Domain Adaptation with Active Learning for Word Sense Disambiguation" pdf
... noun. 3 Active Learning. For our experiments, we use naive Bayes as the learning algorithm. The knowledge sources we use include parts-of-speech, local collocations, and surrounding words. These ... Nouns. The WordNet Domains resource (Magnini and Cavaglia, 2000) assigns domain labels to synsets in WordNet. Since the focus of the WSJ corpus is on business and financial news, we can make use of WordNet ... Introduction. In natural language, a word often assumes different meanings, and the task of determining the correct meaning, or sense, of a word in different contexts is known as word sense disambiguation...
Uploaded: 08/03/2014, 02:21
Scientific report: "Using Similarity Scoring To Improve the Bilingual Dictionary for Word Alignment" doc
Uploaded: 08/03/2014, 07:20
Scientific report: "A Comparison of Syntactically Motivated Word Alignment Spaces" doc
Uploaded: 08/03/2014, 21:20
Scientific report: "Unsupervised Word Alignment with Arbitrary Features" potx
Uploaded: 17/03/2014, 00:20