functional unification grammar a formalism for machine translation

Document - Scientific report: "A Large Scale Distributed Syntactic, Semantic and Lexical Language Model for Machine Translation" (doc)

... Syntax-based language models for statistical machine translation. MT Summit IX., Intl. Assoc. for Machine Translation. C. Chelba and F. Jelinek. 1998. Exploiting syntactic structure for language ... Language, 14(4):283-332. D. Chiang. 2005. A hierarchical phrase-based model for statistical machine translation. The 43th Annual Conference on Association of Computational Linguistics (ACL), ... Stochastic analysis of lexical and semantic enhanced structural language model. The 8th International Colloquium on Grammatical Inference (ICGI), 97-111. K. Yamada and K. Knight. 2001. A syntax-based...

Uploaded: 20/02/2014, 04:20

Document - Scientific report: "A Discriminative Syntactic Word Order Model for Machine Translation" (pdf)

... models for statistical machine translation. In ACL. D. Chiang. 2005. A hierarchical phrase-based model for statistical machine translation. In ACL. M. Collins. 2000. Discriminative reranking for ... Discriminative training and maximum entropy models for statistical machine translation. In ACL. F. J. Och and H. Ney. 2004. The alignment template approach to statistical machine translation. ... inference and training of context-rich syntactic translation models. In ACL. P. Koehn. 2004. Pharaoh: A beam search decoder for phrase-based statistical machine translation models. In AMTA. R....

Uploaded: 20/02/2014, 12:20

Scientific report: "MaxSim: A Maximum Similarity Metric for Machine Translation Evaluation" (doc)

... parse tree of the source sentence are translated, and human judges have to rank these translations). During the workshop, Kappa values measured for inter- and intra-annotator agreement for rank ... compared (a system translation against a reference translation), we perform tokenization, lemmatization using WordNet, and part-of-speech (POS) tagging with the MXPOST tagger (Ratnaparkhi, 1996). ... each n-gram by its sequence of lemma and POS-tag pairs, we first try to perform an exact match in both lemma and POS-tag. In all our n-gram matching, each n-gram in the system translation can...

Uploaded: 08/03/2014, 01:20

Scientific report: "A Comparative Study of Hypothesis Alignment and its Improvement for Machine Translation System Combination" (pot)

... Sumita, F. Sugaya, H. Yamamoto, and S. Yamamoto. 2002. Toward a broad-coverage bilingual corpus for speech translation of travel conversations in the real world. In Proceeding of LREC-2002, Las ... statistical machine translation. In Proceedings of ACL-2003. Sapporo, Japan. F. J. Och and H. Ney. 2003. A systematic comparison of various statistical alignment models. Computational Linguistics, ... LREC-2002, Las Palmas de Gran Canaria, Spain. D. Xiong, Q. Liu and S. Lin. 2006. Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation. In Proceeding of ACL-2006. pp.521-528....

Uploaded: 17/03/2014, 01:20

Document - Scientific report: "A Framework for Syntactic Translation" (docx)

... Development Command), and the U.S. Navy (Office of Naval Research); and in part by the National Science Foundation. 1. Warren Weaver, "Translation," Machine Translation of Languages, edited ... intrinsically capable of producing correct and accurate translation. We are attempting to go beyond simple word-for-word translation; beyond translation using empirical, ad hoc, or pragmatic ... Massachusetts Institute of Technology, Cambridge, Massachusetts Adequate mechanical translation can be based only on adequate structural descriptions of the languages involved and on an adequate...

Uploaded: 19/02/2014, 19:20

Document - Scientific report: "Quadratic-Time Dependency Parsing for Machine Translation" (pdf)

... translation performance. In Proc. of the ACL Workshop on Statistical Machine Translation, pages 224–232. D. Chiang. 2005. A hierarchical phrase-based model for statistical machine translation. ... alignment template approach to statistical machine translation. Computational Linguistics, 30(4):417–449. F. Och, D. Gildea, S. Khudanpur, A. Sarkar, K. Yamada, A. Fraser, S. Kumar, L. Shen, ... Jain, Z. Jin, and D. Radev. 2004. A smorgasbord of features for statistical machine translation. In Proceedings of HLT-NAACL. F. Och. 2003. Minimum error rate training for statistical machine...

Uploaded: 20/02/2014, 07:20

Document - Scientific report: "Fast Decoding and Optimal Decoding for Machine Translation" (doc)

... program. Here we adapt a subtour elimination strategy used in standard TSP. We create a binary (0/1) integer variable for each pair of hotels Fast Decoding and Optimal Decoding for Machine Translation Ulrich ... University 4676 Admiralty Way, Suite 1001 Stanford, CA 94305 Marina del Rey, CA 90292 jahr@cs.stanford.edu {germann,knight,marcu,kyamada}@isi.edu Abstract A good decoding algorithm is critical to the ... transform a decoding problem instance into a TSP instance? If so, we may take great advantage of previous research into efficient TSP algorithms. We may also take advantage of existing software...

Uploaded: 20/02/2014, 18:20

Document - Scientific report: "Phrase-Based Backoff Models for Machine Translation of Highly Inflected Languages" (docx)

... backoff model produced a good translation, but the translation was a paraphrase rather than an identical match to the reference translation. Since only a single reference translation is available ... ton Seattle, WA, USA katrin@ee.washington.edu Abstract We propose a backoff model for phrase-based machine translation that translates unseen word forms in foreign-language text by hierarchical ... päästäisiin sellaiseen lopputulokseen, että kyproksen kreikkalainen ja turkkilainen väestönosa voisivat yhdessä nauttia liittymisen mukanaan tuomista eduista yhdistetyssä tasavallassa. BASE:...

Uploaded: 22/02/2014, 02:20

Document - Scientific report: "Resolution for Machine Translation of Telegraphic Messages" (docx)

... syntactic and semantic grammar as proposed here has advantages over either a syntactic grammar or a lexicalized semantic grammar. Compared with a syntactic grammar, the proposed grammar achieves ... Rate Semantic Grammar 34.8% 8.7% Syntactic Grammar 75.7% 29% Mixed Grammar 77% 10% Table 8: TEST Data Evaluation Results on the Three Types of Grammar Grammar Type Parsing Rate Misparse ... the syntactic grammar or the semantic grammar. Compared with a lexicalized semantic grammar, this grammar achieves a higher parsing coverage without increasing the amount of ambiguity/misparsing....

Uploaded: 22/02/2014, 03:20

Scientific report: "Research Methodology for Machine Translation" (pptx)

... language): a list of all the words in the language, each represented by one form; grammatical and semantic information may also appear. A dictionary changes as the language expands and contracts. ... "large" are forms of different words. Glossary (of a corpus): a list of all the forms that occur in a corpus; grammatical and semantic information may also appear. Dictionary (of a language): ... card (the translation text card) contains glossary information (see Glossary Development); the second card (the analytic text card) contains analytic information (see Translation and Analysis)....

Uploaded: 07/03/2014, 18:20

Scientific report: "Discriminative Modeling of Extraction Sets for Machine Translation" (pptx)

... inference is organized around search states associated with a grammar symbol and a bispan; augmenting grammar symbols also augments this state space. To parse quickly, we prune away search states using ... links have a precise interpretation that dictates what phrasal translation rules can be extracted from a sentence pair. This mapping allows us to train with existing annotated data sets and use ... semi-supervised estimation has also contributed evidence that hand-annotations are useful for training alignment models (Fraser and Marcu, 2006; Fraser and Marcu, 2007). The ITG grammar formalism, the...

Uploaded: 07/03/2014, 22:20

Scientific report: "Stochastic Iterative Alignment for Machine Translation Evaluation" (doc)

... 2005. Syntactic features for evaluation of machine translation. In ACL 2005 Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization. Bo Pang, Kevin ... extraction tools are available for English, but are not always available for other languages. In order to take advantage of loose-sequence-based metrics and avoid the problems in ROUGE and METEOR, we ... the Association for Computational Linguistics (NAACL-03). Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. Bleu: a method for automatic evaluation of machine translation. ...

Uploaded: 08/03/2014, 02:21

Scientific report: "Automatically Creating Bilingual Lexicons for Machine Translation from Bilingual Text" (ppt)

... bilingual lexical entries are used instead of templates, when possible. In this way, only missing bilingual lexical entries are created. The algorithm could err when two units in the same bag have ... according to such a model. The algorithm for creating new bilingual lexical entries consists of five steps: (i-ii) Source and target sentences are analysed. Source-analysis and target-analysis bags ... the result is a transfer target bag; (iv) The transfer target bag and the target-analysis bag are matched. Successful unification entails that the bilingual entries used in the transfer...

Uploaded: 08/03/2014, 06:20

Scientific report: "Domain Adaptation for Machine Translation by Mining Unseen Words" (doc)

... in translation quality (between 0.5 and 1.5 Bleu points) on four domains and two language pairs. 1 Introduction Large amounts of data are currently available to train statistical machine translation ... domain. As expected, we shall see that unseen words pose a major challenge for adapting translation systems to distant domains. No machine learning approach to adaptation could hope to attenuate ... Adaptation for Machine Translation by Mining Unseen Words Hal Daumé III University of Maryland College Park, USA hal@umiacs.umd.edu Jagadeesh Jagarlamudi University of Maryland College Park, USA jags@umiacs.umd.edu Abstract We...

Uploaded: 17/03/2014, 00:20

Scientific report: "Handling phrase reorderings for machine translation" (pdf)

... Koehn. 2004. Pharaoh: a beam search decoder for phrase-based statistical machine translation models. In Proc. of AMTA 2004, Washington DC, October. P. Koehn, A. Axelrod, A. B. Mayne, C. Callison-Burch, ... classified with a large margin while nearby candidates are allowed to be classified with a smaller margin. At training time, we used a perceptron-based structure learning (PSL) algorithm to learn $\{w_o\}_{o \in \Omega}$ which ... Conf. Spoken Language Processing, Colorado, September. A. Stolcke. 2002. Minimum error rate training in statistical machine translation. In Proc. ACL, Japan. B. Taskar, C. Guestrin, and D. Koller....

Uploaded: 17/03/2014, 02:20
