Scientific report: "Research Methodology for Machine Translation" pptx
... different occurrences of the same form. Word: a form that represents a set of forms differing only in inflection. For example, "great" ... poration for conducting research on MT is that of convergence by successive refinements. At each stage, automatic computing machinery is used for some aspects of translation, and...
Uploaded: 07/03/2014, 18:20
... S1, which contains all of the information that is in the input sentence. The part of the information that is implicit in the sentence (tense, voice, and so forth) is made explicit in S1. ... certain German verb form into English, it is necessary to understand the German verb form as part of a complex of features of German structure including possibly other verb forms within t...
Uploaded: 19/02/2014, 19:20
... unseen for a given domain s, we are already performing an implicit form of smoothing (when computing the expected counts), since each document has a distribution over all topics, and therefore we ... Meeting of the Association for Computational Linguistics, pages 115–119, Jeju, Republic of Korea, 8-14 July 2012. © 2012 Association for Computational Linguistics Topic Models for Dyna...
Uploaded: 19/02/2014, 19:20
Document - Scientific report: "HINDI TO PUNJABI MACHINE TRANSLATION SYSTEM" pdf
... module finds the root word for the token and its morphological features. The morphological analyzer developed by IIT-H has been ported to the Windows platform to make it usable for this system. (Goyal ... analysis without using morphology has been performed for all those tokens that are not processed by morphological analysis module. Thus, for performing inflectional analysis, ru...
Uploaded: 20/02/2014, 05:20
Document - Scientific report: "Phrase-Based Statistical Machine Translation as a Traveling Salesman Problem" docx
... BLEU scores as functions of time for a bigram LM; (c), (d): the same for a trigram LM. The x axis corresponds to the cumulative time for processing the test set; for (a) and (c), the y axis corresponds ... algorithm using a small value for the stack size and then use it as initial point, both for the LK algorithm and for further Beam-Search optimization (where as before we v...
Uploaded: 20/02/2014, 07:20
Document - Scientific report: "Asynchronous Binarization for Synchronous Grammars" pptx
... Transforming the entire forest to n-ary form is intractable, however, because the number of hyperedges would be exponential in n. Instead, we include only the top k n-ary backtraces for each forest ... Collapsing Binarization To facilitate a change in binarization, we transform the translation forest into n-ary form. In the n-ary forest, each hyperedge corresponds to an original gramm...
Uploaded: 20/02/2014, 09:20
Document - Scientific report: "Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics" doc
... structure. Therefore, systems optimized for these unigram-based measures might generate adequate but not fluent target language. Since BLEU has been used to report the performance of many machine ... is allowed to form a skip-bigram. Applying such a constraint, we limit skip-bigram formation to a fixed window size. Therefore, computation time can be reduced and hopefully performa...
Uploaded: 20/02/2014, 16:20
Document - Scientific report: "Lexical Morphology in Machine Translation: a Feasibility Study" potx
... precisely the phenomena that have to be formalized and then the prototype built up for the experiment. 4.1 Phenomena to be formalized Like in any MT project, the formalisation work has to face different ... lexicon does not contain such information. Consequently, we looked for a simple way to automatically extend the Italian lexicon. For example, we looked for a way to automat...
Uploaded: 22/02/2014, 02:20
Scientific report: "The Work on Machine Translation in the Soviet Union" pot
... for the recording of the intermediary language can be used also for the recording of information in information machines. Along with work on the algorithms of machine translation from foreign ... language is strictly deductive and formal. This is just what determines its importance both for general linguistics and for machine translation. Naturally the formal descrip...
Uploaded: 07/03/2014, 18:20
Scientific report: "Syntactic Stylometry for Deception Detection" pptx
... the Association for Computational Linguistics, pages 171–175, Jeju, Republic of Korea, 8-14 July 2012. © 2012 Association for Computational Linguistics Syntactic Stylometry for Deception Detection Song ... node, e.g., PRP^NP 4 → “you”. 4 Experimental Results For all classification tasks, we use SVM classifier, 80% of data for training and 20% for testing, with 5-fold cross vali...
Uploaded: 07/03/2014, 18:20