
word alignment using labeled

Scientific report: "Boosting Statistical Word Alignment Using Labeled and Unlabeled Data" ppt

Scientific report

... statistical word alignment with limited labeled data and large amounts of unlabeled data. In this algorithm, we built an interpolated model by using both the labeled data and the unlabeled ... each word aligner using only the labeled data. Based on this semi-supervised boosting algorithm, we investigate two boosting methods for word alignment. In addition, we improve the word alignment ... incorporating the unlabeled data. In this algorithm, we build a word aligner by using both the labeled data and the unlabeled data. Then we build a pseudo reference set for the unlabeled data,...
  • 8
  • 451
  • 1
Scientific report: "Finding Synonyms Using Automatic Word Alignment and Measures of Distributional Similarity" pdf

Scientific report

... automatic word alignment. Context vectors are built from the alignments found in a parallel corpus. Each aligned word type is a feature in the vector of the target word under consideration. The alignment ... for the automatic word alignment described below. 5.2.2 Alignment Context: Context vectors are populated with the links to words in other languages extracted from automatic word alignment. We applied ... the target word; P(W) is the probability of seeing the word, P(f) is the probability of seeing the feature, and P(W,f) is the probability of seeing the word and the feature together. 3.3 Word Alignment: The...
  • 8
  • 516
  • 0
Scientific report: "Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs" pptx

Scientific report

... language word similarity sim(c, f; e) of the Chinese word c and the Japanese word f given the English word e. Figure 1. Similarity Calculation. For the ambiguous English word e, ... equation (7). The word similarity of the Chinese word "河岸 (bank2)" and the Japanese word "銀行 (bank1)" given the English word "bank" is low. Thus, using the updated ... given English word e and Japanese word f, and put the English sentences in the pairs into Set 2. (3) Construct the feature vectors of the given English word using all other words as context...
  • 8
  • 359
  • 0
Scientific report: "Using Similarity Scoring To Improve the Bilingual Dictionary for Word Alignment" doc

Scientific report

... language word. is expressed as follows: a word qualifies for clustering if As before, are all the target language words that cooccur with source language word. Similarly to the most frequent words, ... assumedly variants of the same word, then further refines the clusters using a cooccurrence-related measure. Word variants are found via a stemmer or by clustering all words that begin with the same ... contain one word. Then the similarity score of the merged cluster will be the similarity score of the word pair. 2. Merge a cluster that contains a single word and a cluster that contains words and...
  • 8
  • 363
  • 0
Measuring Word Recognition Using a Picture

Other materials

... (Pause). Look at the words around the picture (Pause.) Find the biggest word. What is that word? (Students will say, "Table.") A line has been drawn from that word to the TABLE to ... pattern of lining. • Have one example word that is familiar to all students and is bigger than the rest of the words. Difficulty of Items: The task will contain words of varying degrees of difficulty, ... basis of the frequency of word use, complexity of the graphic image, and the meaning of the word. Response Attributes: • Response: Students will look at the picture and the words and then draw lines...
  • 3
  • 322
  • 1
Scientific report: "Exploring Syntactic Structural Features for Sub-Tree Alignment using Bilingual Tree Kernels" docx

Scientific report

... system. For sub-tree alignment, we use the above word alignment to learn lexical/word alignment features, and train with the FBIS training corpus (200) using the composite kernel of Plain+dBTK-Root+iBTK-RdSTT. ... bilingual phrases to be consistent with Either Word alignment or Sub-tree alignment (EWoS) instead of being originally consistent with the word alignment only. The method helps tailor the ... refers to the word set for the external span of the source sub-tree, and its counterpart to that of the target sub-tree. Internal Word Alignment Features: The word alignment links...
  • 10
  • 467
  • 0
Scientific report: "Word Alignment with Synonym Regularization" doc

Scientific report

... a method for using synonym information effectively to improve word alignment quality. In general, synonym relations are defined in terms of word sense, not in terms of word form. In other words, synonym ... of the word alignment model by using synonym pairs including such ambiguous synonym words. Finally, we discuss the data set size used for unsupervised training. As shown in Table 1, using a ... sen- ... Figure 1: Graphical model of HM-BiTAM ... alignment quality. 2 Bilingual Word Alignment Model: In this section, we review a conventional generative word alignment model, HM-BiTAM (Zhao and Xing,...
  • 5
  • 470
  • 2
Scientific report: "Yet Another Word Alignment Tool" docx

Scientific report

... translations by word alignment but also because of such interface issues that aligning words manually has the reputation of being a very tedious task. 3 Yawat: Yawat (Yet Another Word Alignment Tool) ... Explorer. Figure 3: Alignment visualization with Yawat. As the mouse is moved over a word, the word and all words linked with it are highlighted. The highlighting is removed when the mouse leaves the word ... the term word alignment (footnote 1: Yawat was first presented at the 2007 Linguistic Annotation Workshop (Germann, 2007)) to refer to any form of alignment that identifies words or groups of words as...
  • 4
  • 416
  • 1
Scientific report: "Discriminative Word Alignment with Conditional Random Fields" ppt

Scientific report

... model many-to-one word alignments, where each source word is aligned with zero or one target words, and therefore each target word can be aligned with many source words. Each source word is labelled ... one-to-many alignments, where each target word is aligned with zero or more source words. Many-to-many alignments are recoverable using the standard techniques for superimposing predicted alignments ... null, denoting no alignment. An example word alignment is shown in Figure 1, where the hollow squares and circles indicate the correct alignments. In this example the French words une and autre...
  • 8
  • 460
  • 0
Scientific report: "Guiding Statistical Word Alignment Models With Prior Knowledge" pdf

Scientific report

... as 1. In building word alignment models, a special “NULL” word is usually introduced to address target words that align to no source words. Since this physically non-existing word is not in the ... a_1^m specifies the indices of source words that target words are aligned to. In an HMM-based word alignment model, source words are treated as Markov states while target words are observations that are ... generative word alignment models. Prior knowledge serves as soft constraints that shall be placed on translation lexicon to guide word alignment model training and disambiguation during Viterbi alignment...
  • 8
  • 495
  • 0
Scientific report: "Dealing with Spurious Ambiguity in Learning ITG-based Word Alignment" pdf

Scientific report

... a family of word alignment. Definition 1. The ITG alignment family is a set of word alignments that has at least one BTG derivation. ITG alignment family is only a subset of word alignments because ... ambiguity in word alignment is the case where two or more derivations d1, d2, dk of G have the same underlying word alignment A. A grammar G is non-spurious if for any given word alignment, ... Null-word Attachment Ambiguity. Definition 4. For any given sentence pair (e, f) and its alignment A, let (e, f) be the sentence pairs with all null-aligned words removed from (e, f). The alignment...
  • 5
  • 499
  • 0
Scientific report: "Data Cleaning for Word Alignment" pdf

Scientific report

... are less than 20 percent. 2 1:n Word Alignment: Our discussion of uni-directional alignments of word alignment is limited to IBM Model 4. Definition 1 (Word alignment task): Let e_i be the i-th ... two word alignments as an alignment point, 2) add new alignment points that exist in the union with the constraint that a new alignment point connects at least one previously unaligned word, ... mechanism to augment one source word into several source words or delete a source word, while a NULL insertion is a mechanism of generating several words from blank words. Fertility uses a conditional...
  • 9
  • 487
  • 0
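
The two refinement steps quoted in the excerpt above can be illustrated with a small sketch. It is a simplified reading, not the paper's procedure: it assumes step 1 seeds the result with the intersection of the two directional alignments (the excerpt is truncated at that point), it omits the adjacency checks that common grow heuristics apply, and the name `symmetrize` and the set-of-pairs representation are hypothetical.

```python
def symmetrize(src2tgt, tgt2src):
    """Combine two directional alignments, each a set of (src_idx, tgt_idx) pairs.

    Assumed step 1: keep every point found in both directions (intersection).
    Step 2: add a point from the union only if it connects at least one
    word that is still unaligned.
    """
    intersection = src2tgt & tgt2src
    union = src2tgt | tgt2src

    aligned = set(intersection)
    src_aligned = {s for s, _ in aligned}
    tgt_aligned = {t for _, t in aligned}

    # Visit candidates in sorted order so the result is deterministic.
    for s, t in sorted(union - intersection):
        if s not in src_aligned or t not in tgt_aligned:
            aligned.add((s, t))
            src_aligned.add(s)
            tgt_aligned.add(t)
    return aligned


# Hypothetical toy alignments for a three-word sentence pair.
forward = {(0, 0), (1, 1), (2, 1)}
backward = {(0, 0), (1, 1), (1, 2)}
combined = symmetrize(forward, backward)
```

In this toy case both extra points are accepted, because each one links a word that the intersection left unaligned.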
Scientific report: "Improved Discriminative Bilingual Word Alignment" pdf

Scientific report

... sums, for each word w, the number of words not linked to w that fall between the first and last words linked to w. The other feature counts only such words that are linked to some word other than w. ... have a function word not linked to anything, between two words linked to the same word. Exact match feature: We have a feature that sums the number of words linked to identical words. This is motivated ... association with respect to a word in a sentence pair to be the number of association types (word-type to word-type) for that word that have higher association scores, such that words of both types occur...
  • 8
  • 217
  • 0
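
The first feature described in the excerpt above (for each word w, the number of words not linked to w that fall between the first and last words linked to w) can be sketched as follows. This is a minimal illustration under assumptions: links are (source index, target index) pairs, the count is taken from the source side only, and the name `backward_gap_count` is hypothetical rather than taken from the paper.

```python
from collections import defaultdict


def backward_gap_count(links):
    """Sum, over source words, the target positions lying between the first
    and last target words linked to that source word but not linked to it."""
    partners = defaultdict(set)
    for src, tgt in links:
        partners[src].add(tgt)

    total = 0
    for tgts in partners.values():
        first, last = min(tgts), max(tgts)
        # Count positions inside the span that are not linked to this word.
        total += sum(1 for j in range(first, last + 1) if j not in tgts)
    return total


# Hypothetical example: source word 0 links to targets 0 and 2, skipping 1.
example_links = {(0, 0), (0, 2), (1, 1)}
gap_total = backward_gap_count(example_links)
```

Here target position 1 falls inside the span of source word 0 without being linked to it, so it contributes one to the total.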
Scientific report: "Soft Syntactic Constraints for Word Alignment through Discriminative Training" pot

Scientific report

... bilingual word alignment finds word-to-word connections across languages. Originally introduced as a byproduct of training statistical translation models in (Brown et al., 1993), word alignment ... improved alignments. 2 Constrained Alignment: Let an alignment be the complete structure that connects two parallel sentences, and a link be one of the word-to-word connections that make up an alignment. ... exercise for word alignment. In HLT-NAACL Workshop on Building and Using Parallel Texts, pages 1–10, Edmonton, Canada. R. Moore. 2005. A discriminative framework for bilingual word alignment. ...
  • 8
  • 325
  • 0
Scientific report: "Combining Clues for Word Alignment" pdf

Scientific report

... optimal alignment. Section 2 describes the clue alignment model and ways of estimating parameters from association scores. Section 3 introduces the alignment approach which is based on word alignment ... therefore, they can be dismissed in the alignment process. 3 Clue Alignment: Word alignment clues as described above can be used to model the relations between words of translated texts. Parameters ... short words. Combining Clues for Word Alignment. Jörg Tiedemann, Department of Linguistics, Uppsala University, Box 527, SE-751 20 Uppsala, Sweden, joerg@stp.ling.uu.se. Abstract: In this paper, a word...
  • 8
  • 579
  • 0
