Scientific report: "Maximum Entropy Based Restoration of Arabic Diacritics" ppt

... of lexical, segment-based and part-of-speech tag features. The combination of these feature types leads to a state-of-the-art diacritization model. Using a publicly available corpus (LDC’s Arabic ... Native speakers of Arabic are able, in most cases, to accurately vocalize words in text based on their context, the speaker’s knowledge of the grammar, and the lexicon o...

Uploaded: 17/03/2014, 04:20

8 337 0

Scientific report: "Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation" docx

... Linguistics Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation Deyi Xiong Institute of Computing Technology Chinese Academy of Sciences Beijing, China, 100080 Graduate School of ... sentence of average length 28.3 words on a 2GHz Linux system with 4G RAM memory. 3 Maximum Entropy Based Reordering Model In this section, we discuss how to create a...

Uploaded: 08/03/2014, 02:21

8 390 0

Scientific report: "Maximum Entropy Model Learning of the Translation Rules" pot

... value of λ_i according to: λ_i ← λ_i + Δλ_i 4 Maximum Entropy Model Learning of the Translation Rules The art of modeling with the maximum entropy method is to define an informative set of ... collection ~" of candidate features, and because of the limit of machine resources, we cannot expect to obtain all iS(f) estimated in real-life. However, the Maximum E...

Uploaded: 23/03/2014, 19:20

5 195 0

Scientific report: "Machine-Learning-Based Transformation of Passive Japanese Sentences into Active by Separating Training Data into Each Input Particle" doc

... 1: Features F1 part of speech (POS) of P F2 main word of P F3 word of P F4 first 1, 2, 3, 4, 5, and 7 digits of category number of P 5 F5 auxiliary verb attached to P F6 word of N F7 first 1, 2, ... digits of category number of N F8 case particles and words of nominals that have dependency relationship with P and are other than N F9 first 1, 2, 3, 4, 5, and 7 digits of ca...

Uploaded: 17/03/2014, 04:20

8 426 0

Scientific report: "Machine-Learning-Based Transformation of Passive Japanese Sentences into Active by Separating Training Data into Each Input Particle" ppt

... of texts were used to generate training data. We evaluated performance of prediction by accuracy. We defined accuracy by the ratio of the number of correct predictions to that of instances of ... singular/plural usage in the writing of Japanese learners of English. They also have shown that it is likely that performance of the error detection will improve as accuracy o...

Uploaded: 17/03/2014, 04:20

8 305 0

Scientific report: "Evaluating Centering-based metrics of coherence for text structuring using a reliably annotated corpus" doc

... Informatics, University of Edinburgh, UK, {nikiforo,jon}@ed.ac.uk ♦ Dept. of Computer Science, University of Essex, UK, poesio at essex dot ac dot uk ♠ Dept. of Computing Science, University of Aberdeen, ... of coherence based on Centering Theory with respect to their potential usefulness for text structuring in natural language generation. Previous corpus-based evaluations...

Uploaded: 17/03/2014, 06:20

8 608 0

Scientific report: "Loosely Tree-Based Alignment for Machine Translation" pptx

... case of wh-movement in the English sentence that could not be realized by any reordering of subtrees of the Korean parse. The probability of adding a clone of original node ε i as a child of node ... number of children of any node in the input tree T , and N the length of the input string. By storing partially completed arcs in the chart and interleaving the inner two loop...

Uploaded: 17/03/2014, 06:20

8 274 0

Scientific report: "Semantic-Head Based Resolution of Scopal Ambiguities* Björn Gambäck" docx

... argument structure of an utterance and provide a possible resolution of the scopal ambiguities. This resolution should be built up during the construction of (the rest of) the semantic repre- ... nonhead daughter of a binary branching node contains a hole. In that case, the hole is plugged with the sb-label of the head daughter and the sb-label of the mother node is that...

Uploaded: 17/03/2014, 07:20

5 249 0

Scientific report: "Maximum Expected BLEU Training of Phrase and Lexicon Translation Models" pptx

... Li Deng Microsoft Research Microsoft Research One Microsoft Way, Redmond, WA, USA One Microsoft Way, Redmond, WA, USA xiaohe@microsoft.com deng@microsoft.com Abstract This paper ... Lin. 2011. Fast Generation of Translation Forest for Large-Scale SMT Discriminative Training. In Proc. Of EMNLP 2011. 301 Proceedings of the 50th Annual Meeting of the Association fo...

Uploaded: 23/03/2014, 14:20

10 396 0

Scientific report: "Coherent Citation-Based Summarization of Scientific Papers" potx

... summary of Eisner’s paper. Teufel (2007) reported that a significant number of citation sentences (67% of the sentences in her dataset) were of this type. Likewise, the comprehension of sentence ... equipped with a set of guessing rules that had been hand-crafted using knowledge of English morphology and intuitions. The precision of rule-based taggers may exceed that of t...

Uploaded: 23/03/2014, 16:20

10 319 0