Báo cáo khoa học: "DICTIONARY ORGANIZATION FOR MACHINE TRANSLATION: THE EXPERIENCE AND IMPLICATIONS OF THEUMIST JAPANESE PROJECT" ppt
... DICTIONARY ORGANIZATION FOR MACHINE TRANSLATION: THE EXPERIENCE AND IMPLICATIONS OFTHEUMIST JAPANESE PROJECT Mary McGee Wood, Elaine Pollard, Heather Horsfall, Natsuko Holden, Brian Chandler. and ... reference to machine translation: the optimum grain-size for lexical entries, the division of information about separate languages, and the level of abs...
Ngày tải lên: 01/04/2014, 00:20
... indicating the order of the occurrence in the corpus, punctuation marks before and after the occurrence, and the Russian form of the oc- currence. In order to record all of the information ... Selection of Articles The present RAND corpus consists of ar- ticles in the fields of physics and mathematics. These fields were chosen because of their im-...
Ngày tải lên: 07/03/2014, 18:20
... dev and test data. The “source” domain of Europarl has 996k sentences and 2130k words.) We count the number of words and sentences in the English side of the parallel data, which is the same for ... reduce the dimension- ality of the kernel matrices. We do the same pre- processng for all words, the training words and the OOV words. And the resulting...
Ngày tải lên: 17/03/2014, 00:20
Báo cáo khoa học: "Tricolor DAGs for Machine Translation" ppt
... yellow of the definite node and the def arc to green. 2. Apply the painter to change the yellow of the singular node and the hum arc to green. The resulting TDAG is shown in Figure 4. 3. Run the ... correspond to the forms (1) Boston Office, (2) office of Boston, and (3) office in Boston. Inference of the "*OFFICE in *BOSTON" relation is easi...
Ngày tải lên: 31/03/2014, 06:20
Tài liệu Báo cáo khoa học: "Organizing Encyclopedic Knowledge based on the Web and its Application to Question Answering" ppt
... description. For the second type of question, human examinees would search their knowledge base for the description of each of four candidate terms. Then they would choose the candidate term ... both resources, the accuracy was noticeably improved, and the coverage was comparable with that for the Nichi- gai dictionary. On the other hand, in the case where random...
Ngày tải lên: 20/02/2014, 18:20
Báo cáo khóa học: Surface nucleolin participates in both the binding and endocytosis of lactoferrin in target cells potx
... Properties of the iron-binding site of the N-terminal lobe of human and bovine lactotransferrins. Importance of the glycan moiety and of the non-covalent interactions between the N- and C-terminal ... attach cells and induce the fusion of virus and cell membranes. The receptor complex for HIV entry consists of the CD4 molecule and at least one of...
Ngày tải lên: 07/03/2014, 15:20
Tài liệu Báo cáo khoa học: "A Framework for Syntactic Translation" docx
... of one word to another in patterns. The third type — the ability of the program to determine the syn- tactic function of each word — is basic to the others. It is basic to the first: If the ... other words, these boxes repre- sent in an abstract fashion our understanding of the structures of the languages and of the translation equivalences. In an actual...
Ngày tải lên: 19/02/2014, 19:20
Tài liệu Báo cáo khoa học: "Topic Models for Dynamic Translation Model Adaptation" pptx
... gain of about 0.5 and 0.6 BLEU and 2 TER. Although the performance on BLEU for both the 20 topic models LTM-20 and GTM-20 is suboptimal, the TER im- provement is better. Interestingly, the difference ... The Chinese data were segmented us- ing the Stanford segmenter. We trained a trigram LM on the English side of the corpus with an addi- tional 150M words randomly...
Ngày tải lên: 19/02/2014, 19:20
Tài liệu Báo cáo khoa học: "HINDI TO PUNJABI MACHINE TRANSLATION SYSTEM" pdf
... by Prem Chand, Yashwant jain etc.), Official language for office letters (The Language Officially used on the files in Government offices) and blogs (Posted by general public in forums etc.). ... Punjabi and did not know Hindi and 30 persons were from different professions having knowledge of both Hindi and Punjabi. Average ratings for the sentences of the indiv...
Ngày tải lên: 20/02/2014, 05:20
Tài liệu Báo cáo khoa học: "Phrase-Based Statistical Machine Translation as a Traveling Salesman Problem" docx
... is the source side of the biphrase and ˜ t the target side, and the so-called “phrase penalty” and “word penalty” features, which count the number of phrases and words in the alignment. Other ... and N of the graph are defined as follows: (a) If M is of the form (w, b) and N of the form (w , b), in which b is a single biphrase, and w and w are co...
Ngày tải lên: 20/02/2014, 07:20