Scientific report: "A Method for Effective and Scalable Mining of Named Entity Transliterations from Large Comparable Corpora" doc
... In this paper, we detail an effective and scalable mining method, called MINT (MIning Named-entity Transliteration equivalents), for mining of NETEs from large comparable corpora. MINT addresses ... Method for Effective and Scalable Mining of Named Entity Transliterations from Large Comparable Corpora Raghavendra Udupa K Saravanan A Kumar...
Uploaded: 24/03/2014, 03:20
... other information such as on the right hand side the next two phrases are “ayda” and “tshyr” or the sequence of source target POS on the right hand side is “RB VBP”. An example of this type of feature ... output Source you totally different from zaid amr , and not to deprive yourself in a basement of imitation and assimilation . We predict and visualize Human correction...
Uploaded: 20/02/2014, 04:20
... length of a string, therefore obtaining 16655 strings. 3.2 Two Factors for Evaluation We evaluated the following two factors before and after correction: (1) the counting of errors, and (2) ... Unable to understand, and unable to imagine the actual utterance. 4. Results and Discussions 4.1 Decrease in the Number of Errors Table 4-1 shows the number of errors before...
Uploaded: 20/02/2014, 18:20
Scientific report: "A Method for Relating Multiple Newspaper Articles by Using Graphs, and Its Application to Webcasting" pptx
... describes methods for relating (threading) multiple newspaper articles, and for visualizing various characteristics of them by using a directed graph. A set of articles is represented by a set of ... Introduction The vast quantity of information available today makes it difficult to search for and understand the information that we want. If there are many related do...
Uploaded: 08/03/2014, 06:20
Scientific report: "A Method for Word Sense Disambiguation of Unrestricted Text" potx
... sense of one of the words. Pick one of the words, say W2, and using WordNet, form a similarity list for each sense of that word. For this, use the words from the synset of each sense and the ... word-word co-occurrences and (2) WordNet for measuring the semantic density for a pair of words. We report an average accuracy of 80% for the first ranked sense,...
Uploaded: 08/03/2014, 06:20
Scientific report: "a Method for Automatic Evaluation of Machine Translation" pot
... phenomenon, and not an artifact of a few toy examples. The primary programming task for a BLEU implementor is to compare n-grams of the candidate with the n-grams of the reference translation and count the ... n-gram counts for all the candidate sentences and divide by the number of candidate n-grams in the test corpus to compute a modified precision score, p_n, for the...
Uploaded: 23/03/2014, 20:20
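The snippet above is truncated, so the following is only a minimal sketch of the corpus-level modified n-gram precision it describes, not the authors' implementation. It assumes whitespace-tokenized sentences and that each candidate n-gram count is clipped to the maximum count seen in any single reference. The function names `ngrams` and `modified_precision` are hypothetical.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(candidates, references, n):
    """Corpus-level modified n-gram precision (illustrative sketch).

    candidates: list of token lists, one per candidate sentence.
    references: list (parallel to candidates) of lists of reference token lists.
    """
    clipped_total = 0    # sum of clipped n-gram matches over all candidates
    candidate_total = 0  # total number of candidate n-grams in the corpus
    for cand, refs in zip(candidates, references):
        cand_counts = Counter(ngrams(cand, n))
        # Assumption: clip each count by its maximum count in any one reference.
        max_ref_counts = Counter()
        for ref in refs:
            for gram, count in Counter(ngrams(ref, n)).items():
                max_ref_counts[gram] = max(max_ref_counts[gram], count)
        clipped_total += sum(
            min(count, max_ref_counts[gram]) for gram, count in cand_counts.items()
        )
        candidate_total += sum(cand_counts.values())
    return clipped_total / candidate_total if candidate_total else 0.0


candidate = "the the the the the the the".split()
refs = ["the cat is on the mat".split(), "there is a cat on the mat".split()]
print(modified_precision([candidate], [refs], 1))
```

Clipping is what keeps a degenerate candidate that repeats one common word from scoring well: here only two of the seven occurrences of "the" are credited, because no single reference contains the word more than twice.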
Document: Scientific report: A role for the intersubunit disulfides of seminal RNase in the mechanism of its antitumor action docx
... isoforms, isoenzymes, monomeric forms; assay for selective cytotoxicity of the enzyme. Methods Enzymol. 341, 248–263. 13. Kunitz, M. (1946) A spectrophotometric method for the measurement of ... in the presence of 10 mM IAM, or (C) of 50 mM IAM. D and M mark the elution volumes of BS-RNase and monomeric BS-RNase, respectively. © FEBS 2003 Disulfides and antitumor acti...
Uploaded: 20/02/2014, 11:20
Scientific report: "a system for tutoring and computational linguistics experimentation" pptx
... present was built to serve as a platform for research in computational linguistics and tutoring, and can be used for task-based evaluation of algorithms developed for other domains. We are currently ... scaffolding and potentially suggesting additional problems. The disadvantage is a lack of adaptivity and generality: students often get the same remediation for the same...
Uploaded: 17/03/2014, 00:20
... cal boundaries of the citation form and the inflected forms, and of the forms derived from these inflected forms, and so on recursively. Our present understanding of Dutch morphophonology has ... morpho-syntactic codes of the verb form werkte (worked). (Records for citation forms contain pointers to the different forms belonging to their paradigm, and information...
Uploaded: 24/03/2014, 05:21
Scientific report: "Self-Training for Enhancement and Domain Adaptation of Statistical Parsers Trained on Small Datasets" ppt
... 2006b). The test and training sections consist of sentences from all of the genres that form the corpus. The training division consists of 90% (9 of each 10 consecutive sentences) of the data, and the ... create. Furthermore, the performance of these parsers decreases as the distance between the genres of their training and test data increases. Therefore, enhancing...
Uploaded: 23/03/2014, 18:20
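The 9-of-each-10 division mentioned in the snippet above can be illustrated with a short sketch. The snippet is cut off before it says what happens to the remaining sentence, so two things here are assumptions: that the held-out sentence of each block goes to the test division, and that it is the last sentence of the block. The function name `split_nine_of_ten` and the parameter `held_out_index` are hypothetical.

```python
def split_nine_of_ten(sentences, held_out_index=9):
    """Split sentences so that 9 of every 10 consecutive ones go to training.

    Assumption: the sentence at position held_out_index within each block of
    10 goes to the test division; the source snippet does not specify this.
    """
    train, test = [], []
    for i, sentence in enumerate(sentences):
        if i % 10 == held_out_index:
            test.append(sentence)
        else:
            train.append(sentence)
    return train, test


sentences = [f"sentence {i}" for i in range(100)]
train, test = split_nine_of_ten(sentences)
print(len(train), len(test))
```

Because the split walks through the corpus in order, every genre contributes to both divisions in roughly the same 90/10 proportion, which matches the snippet's statement that both sections contain sentences from all genres.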