statistical machine translation with word and sentence aligned parallel corpora

Scientific report: "Statistical Machine Translation with Word- and Sentence-Aligned Parallel Corpora" potx

Category: Scientific reports

Scientific report: "Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers" ppt

Category: Scientific reports

... Setup. Our training corpora consist of 96.9M Chinese words and 109.5M English words in 3.8M sentence pairs. We used all corpora to train our translation model and smaller corpora without the United ... 1288–1297, Portland, Oregon, June 19-24, 2011. © 2011 Association for Computational Linguistics. Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information ... July. Arne Mauser, Saša Hasan, and Hermann Ney. 2009. Extending statistical machine translation with discriminative and trigger-based lexicon models. In Proceedings of the 2009 Conference...

Scientific report document: "Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information" doc

Category: Scientific reports

... Statistical Machine Translation. Machine Translation, pages 77-94. Hua Wu, Haifeng Wang and Chengqing Zong. 2008. Domain Adaptation for Statistical Machine Translation with Domain Dictionary and Monolingual ... (Och and Ney, 2004) with the alignment if and only if: (1) there must be at least one word inside one phrase aligned to a word inside the other phrase and (2) no words inside one phrase can be aligned ... FBIS corpus and the Hansards part of LDC2004T07 corpus (54.6K documents with 1M parallel sentences, 25.2M Chinese words and 29M English words). We use the Chinese Sohu weblog in 2009 and the...
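
The consistency condition quoted in this excerpt can be illustrated with a minimal Python sketch. The function name `is_consistent`, the representation of the alignment as a set of (source index, target index) links, and the inclusive span arguments are assumptions made for illustration and do not come from the paper. Condition (2) is truncated in the excerpt, so the sketch assumes the commonly used formulation in which no word inside one phrase may be aligned to a word outside the other phrase.

```python
from typing import Set, Tuple

Alignment = Set[Tuple[int, int]]  # (source index, target index) links


def is_consistent(alignment: Alignment,
                  src_span: Tuple[int, int],
                  tgt_span: Tuple[int, int]) -> bool:
    """Check whether a phrase pair is consistent with a word alignment.

    Spans are inclusive (start, end) word positions (an assumed convention).
    """
    s_lo, s_hi = src_span
    t_lo, t_hi = tgt_span
    has_inside_link = False
    for s, t in alignment:
        s_in = s_lo <= s <= s_hi
        t_in = t_lo <= t <= t_hi
        if s_in and t_in:
            # Condition (1): at least one link joins the two phrases.
            has_inside_link = True
        elif s_in != t_in:
            # Condition (2), as assumed: a word inside one phrase is
            # linked to a word outside the other phrase.
            return False
    return has_inside_link


# Hypothetical toy alignment over three source and three target words.
toy_alignment = {(0, 0), (1, 2), (2, 1)}
print(is_consistent(toy_alignment, (1, 2), (1, 2)))  # True
print(is_consistent(toy_alignment, (0, 1), (0, 1)))  # False
```

In the second call, source word 1 lies inside the source span but is linked to target word 2, which lies outside the target span, so the pair is rejected.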

Scientific report document: "Improving Statistical Machine Translation with Monolingual Collocation" pdf

Category: Scientific reports

... employs the word translation model to calculate the probabilities of alignments. In IBM Model 2, both the word translation model and position distribution model are used. IBM Model 3, 4 and 5 consider ... model in addition to the word translation model and position distribution model. And these three models are similar, except for the word distortion models. One-to-one and many-to-one alignments ... probability. Given the monolingual word aligned corpus, we calculate the frequency of two words aligned in the corpus, denoted as freq(w_i, w_j). We filtered the aligned words occurring only once....
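
The counting and filtering step described at the end of this excerpt can be sketched in a few lines of Python. The function name `aligned_pair_frequencies`, the treatment of pairs as ordered tuples, and the toy data are assumptions for illustration only; the excerpt does not say how the authors implemented this step.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

WordPair = Tuple[str, str]


def aligned_pair_frequencies(pairs: Iterable[WordPair]) -> Dict[WordPair, int]:
    """Count aligned word pairs and drop those seen only once."""
    counts = Counter(pairs)
    return {pair: n for pair, n in counts.items() if n > 1}


# Hypothetical aligned word pairs from a monolingual word-aligned corpus.
toy_pairs = [("strong", "tea"), ("strong", "tea"), ("heavy", "rain"),
             ("heavy", "rain"), ("heavy", "rain"), ("big", "tea")]
print(aligned_pair_frequencies(toy_pairs))
```

Here the pair ("big", "tea") occurs once and is filtered out, while the other two pairs keep their counts of 2 and 3.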

Scientific report document: "Corpus Expansion for Statistical Machine Translation with Semantic Role Label Substitution Rules" doc

Category: Scientific reports

... system, GS: System trained with only generated sentence pairs, IT: Interpolated phrase table with GS and BL. GA and IA are GS and IT systems trained with baseline word alignment models accordingly. ... 294–298, Portland, Oregon, June 19-24, 2011. © 2011 Association for Computational Linguistics. Corpus Expansion for Statistical Machine Translation with Semantic Role Label Substitution Rules. Qin Gao and ... classifier with features derived from standard phrase based translation models and bilingual language models to identify high quality sentence pairs, and use these sentence pairs in the SMT training....

Scientific report: "Boosting Statistical Machine Translation by Lemmatization and Linear Interpolation" ppt

Category: Scientific reports

Scientific report: "Statistical Machine Translation through Global Lexical Selection and Sentence Reconstruction" doc

Category: Scientific reports

Scientific report: "Fast and Scalable Decoding with Language Model Look-Ahead for Phrase-based Statistical Machine Translation" potx

Category: Scientific reports

Scientific report document: "A Ranking-based Approach to Word Reordering for Statistical Machine Translation" doc

Category: Scientific reports

... one sentence pair in our training data. Consider the subtree rooted at word “Problem”. With the gold alignment, “Problem” is aligned to the 5th target word, and “with latter procedure” are aligned ... can help English-Hindi Statistical Machine Translation. In Proc. IJCNLP. Roy Tromble. 2009. Search and Learning for the Linear Ordering Problem with an Application to Machine Translation. Ph.D. ... Galley and Manning, 2008). Long-distance word reordering between language pairs with substantial word order difference, such as Japanese with Subject-Object-Verb (SOV) structure and English with...

Scientific report document: "Fixed Length Word Suffix for Factored Statistical Machine Translation" pdf

Category: Scientific reports

Scientific report document: "Bilingually Motivated Domain-Adapted Word Segmentation for Statistical Machine Translation" pptx

Category: Scientific reports

... Domain-Adapted Word Segmentation for Statistical Machine Translation. Yanjun Ma, Andy Way. National Centre for Language Technology, School of Computing, Dublin City University, Dublin 9, Ireland. {yma, away}@computing.dcu.ie. Abstract: We ... domain-specific corpora to train segmenters, we make use of bilingual corpora and statistical word alignment techniques. First of all, our approach is adapted for the specific translation task at hand ... the segmentation of the respective sentences in the parallel corpus according to these candidate words; these modified sentences are then given back to the word aligner, which produces new alignments....

Scientific report: "Word Sense Disambiguation Improves Statistical Machine Translation" docx

Category: Scientific reports

... that the most number of words any candidate translation has is two words. Since among all the 2-word candidate translations, the translation “every month” has the highest translation probability ... model’s accuracy on two simplified translation tasks: word translation and blank-filling. Recently, Cabezas and Resnik (2005) experimented with incorporating WSD translations into Pharaoh, a state-of-the-art ... English words provided by Hiero+WSD, though appropriate, do not ... translation choices for a word w were defined as the set of words or phrases aligned to w, as gathered from a word-aligned parallel...

Scientific report: "Improving Statistical Natural Language Translation with Categories and Rules" potx

Category: Scientific reports

... TL word is generated by exactly one SL word. We use a matrix Z for every sentence pair, whose fields describe whether or not two words are aligned. In this approach, multiple words can be aligned ... lines and j rows with binary values. The value z_ij = 1 (z_ij = 0) means that the word i influences (does not influence) the word j. In figure 1 every link stands for z_ij = 1. The models 1, 2 and 2 ~ and ... contain words with similar syntactic/semantic properties. To arrive at WCs having both (method COMB), we determine TL WCs with the first method and afterwards we determine SL WCs with the...

Scientific report: "Modeling with Structures in Statistical Machine Translation" pot

Category: Scientific reports

... all) statistical machine translation systems employ a word-based alignment model (Brown et al., 1993; Vogel, Ney, and Tillman, 1996; Wang and Waibel, 1997), which treats words in a sentence ... sentence, align it with a source word. 3. Produce a word at each target word position according to the source word with which the target word position has been aligned. IBM Alignment ... edu. Abstract: Most statistical machine translation systems employ a word-based alignment model. In this paper we demonstrate that word-based alignment is a major cause of translation errors....

Scientific report: "Bridging Morpho-Syntactic Gap between Source and Target Sentences for English-Korean Statistical Machine Translation" pot

Category: Scientific reports

Scientific report: "Scaling Phrase-Based Statistical Machine Translation to Larger Corpora and Longer Phrases" pptx

Category: Scientific reports

Scientific report: "Phrase Linguistic Classification and Generalization for Improving Statistical Machine Translation" docx

Category: Scientific reports

Scientific report: "Pre- and Postprocessing for Statistical Machine Translation into Germanic Languages" docx

Category: Scientific reports

Scientific report: "Syntax-based Statistical Machine Translation using Tree Automata and Tree Transducers" doc

Category: Scientific reports

Scientific report: "N-gram-based Statistical Machine Translation versus Syntax Augmented Machine Translation: comparison and system combination" potx

Category: Scientific reports
