xử lý ngôn ngữ tự nhiên,regina barzilay,ocw mit edu

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang	61
Dung lượng	595,42 KB

Nội dung

xử lý ngôn ngữ tự nhiên,regina barzilay,ocw mit edu 6 864 Lecture 16 (November 8th, 2005) Machine Translation Part II CuuDuongThanCong com https //fb com/tailieudientucntt http //cuuduongthancong com?[.]

6.864: Lecture 16 (November 8th, 2005) Machine Translation Part II CuuDuongThanCong.com https://fb.com/tailieudientucntt Overview • The Structure of IBM Models and • EM Training of Models and • Some examples of training Models and • Decoding CuuDuongThanCong.com https://fb.com/tailieudientucntt Recap: IBM Model • Aim is to model the distribution P (f | e) where e is an English sentence e1 el where f is a French sentence f1 fm • Only parameters in Model are translation parameters: T(f | e) where f is a French word, e is an English word • e.g., T(le | the) = 0.7 T(la | the) = 0.2 T(l� | the) = 0.1 CuuDuongThanCong.com https://fb.com/tailieudientucntt Recap: Alignments in IBM Model • Aim is to model the distribution P (f | e) where e is an English sentence e1 el where f is a French sentence f1 fm • An alignment a identifies which English word each French word originated from • Formally, an alignment a is {a1 , am }, where each aj � {0 l} • There are (l + 1)m possible alignments In IBM model all alignments a are equally likely: P (a | e) = C × (l + 1) m where C = prob(length(f ) = m) is a constant CuuDuongThanCong.com https://fb.com/tailieudientucntt IBM Model 1: The Generative Process To generate a French string f from an English string e: • Step 1: Pick the length of f (all lengths equally probable, probability C) • Step 2: Pick an alignment a with probability (l+1)m • Step 3: Pick the French words with probability P (f | a, e) = m � T(fj | eaj ) j =1 The final result: m � C P (f , a | e) = P (a | e)P (f | a, e) = T(fj | eaj ) m (l + 1) j =1 CuuDuongThanCong.com https://fb.com/tailieudientucntt IBM Model • Only difference: we now introduce alignment or distortion parameters D(i | j, l, m) = Probability that j’th French word is connected to i’th English word, given sentence lengths of e and f are l and m respectively • Define P (a | e, l, m) = m � D(aj | j, l, m) j=1 where a = {a1 , am } • Gives P (f , a | e, l, m) = m � D(aj | j, l, m)T(fj | eaj ) j=1 CuuDuongThanCong.com https://fb.com/tailieudientucntt • Note: Model is a special case of Model 2, where D(i | j, l, m) = for all i, j CuuDuongThanCong.com https://fb.com/tailieudientucntt l+1 An Example l = m = e = And the program has been implemented f = Le programme a ete mis en application a = {2, 3, 4, 5, 6, 6, 6} P (a | e, 6, 7) = D(2 | 1, 6, 7) × D(3 | 2, 6, 7) × D(4 | 3, 6, 7) × D(5 | 4, 6, 7) × D(6 | 5, 6, 7) × D(6 | 6, 6, 7) × D(6 | 7, 6, 7) CuuDuongThanCong.com https://fb.com/tailieudientucntt P (f | a, e) = T(Le | the) × T(programme | program) × T(a | has) × T(ete | been) × T(mis | implemented) × T(en | implemented) × T(application | implemented) CuuDuongThanCong.com https://fb.com/tailieudientucntt IBM Model 2: The Generative Process To generate a French string f from an English string e: • Step 1: Pick the length of f (all lengths equally probable, probability C) • Step 2: Pick an alignment a = {a1 , a2 am } with probability m � D(aj | j, l, m) j=1 • Step 3: Pick the French words with probability P (f | a, e) = m � T(fj | eaj ) j=1 The final result: P (f , a | e) = P (a | e)P (f | a, e) = C m � D(aj | j, l, m)T(fj | eaj ) j=1 CuuDuongThanCong.com https://fb.com/tailieudientucntt

Ngày đăng: 27/11/2022, 21:17