Báo cáo khoa học: "Automatic Evaluation of Chinese Translati

Báo cáo khoa học: "Automatic Evaluation of Chinese Translation Output: Word-Level or Character-Level" doc

... Linguistics Automatic Evaluation of Chinese Translation Output: Word-Level or Character-Level? Maoxi Li Chengqing Zong Hwee Tou Ng National Laboratory of Pattern Recognition Institute of Automation, Chinese ... translation segmented into words: Translation: 多少_钱_的_ 伞 _吗_？ Reference: 这些_ 雨伞 _多少_钱_？ The word “ 伞 ” is a synonym for the word “ 雨伞...

Ngày tải lên: 17/03/2014, 00:20

6 344 1

Tài liệu Báo cáo khoa học: "Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics" doc

... construction of N-best translation lexicons from parallel text. Melamed (1995) used the ratio (LCSR) between the length of the LCS of two words and the length of the longer word of the two words ... automatic evaluation measure. To apply LCS in machine translation evaluation, we view a translation as a sequence of words. The intuition is that the longer the LCS...

Ngày tải lên: 20/02/2014, 16:20

8 443 0

Tài liệu Báo cáo khoa học: "Automatic Evaluation of Sentence-Level Fluency Andrew Mutton∗" pdf

... to the main purpose of the SVM, using distance from support vector as a metric. Results are given for correlation of GLEU against sequence sizes for all data (Table 2) and for the human trial ... ‘moderate’ range for correlation in- terpretation. In particular, for the GLEU–human correlation, the score of 0.4014 is approaching the min- imum pairwise human correlation of 0.4710. 5 Di...

Ngày tải lên: 20/02/2014, 12:20

8 508 0

Báo cáo khoa học: "Automatic Evaluation of Linguistic Quality in Multi-Document Summarization" pptx

... of con- tent selection, which allows for frequent evaluation during system development and for report- ing results of experiments performed outside of the annual NIST-led evaluations, the Document Understanding ... of systems, ignoring the linguistic quality of the system output. Part of the reason for this imbalance is the existence of ROUGE (Lin and Hovy, 2003; Lin, 2004),...

Ngày tải lên: 16/03/2014, 23:20

11 409 0

Báo cáo khoa học: "Automatic Construction of Machine Translation Knowledge Using Translation Literalness" docx

... Language Mean # of Translation Words per Source Word Mean Context-freeness (# of Word Link = 4) 28,300 words (49.5%) 3,107 words 1.51 trans./word 4.45 20,722 words (34.0%) 3,601 words 1.94 trans./word 4.21 Rewritten ... For example, word-level statistical MT (Brown et al., 1993) translates a source sentence with a combination of word trans- fer and word order adjustment. Thus, word-...

Ngày tải lên: 08/03/2014, 21:20

8 345 0

Báo cáo khoa học: "AUTOMATIC ACQUISITION OF A LARGE SUBCATEGORIZATION DICTIONARY FROM CORPORA" doc

... training corpus). PREVIOUS WORK While work has been done on various sorts of col- location information that can be obtained from text corpora, the only research that I am aware of that has ... ACQUISITION OF A LARGE SUBCATEGORIZATION DICTIONARY FROM CORPORA Christopher D. Manning Xerox PARC and Stanford University Stanford University Dept. of Linguistics, Bldg. 100 Stanford,...

Ngày tải lên: 23/03/2014, 20:20

8 342 0

Báo cáo khoa học: "Automatic Acquisition of Script Knowledge from a Text Collection" docx

... paragraphs in clusters based on the date of issue of the report. We used only the first paragraphs of the news reports because they tend to describe facts in time order. After that, the sentences in ... objects. A 'pair of actions' consists of two actions that occur in time order. A 'sequence of actions' can be defined as a transitive closure of all the pairs...

Ngày tải lên: 31/03/2014, 20:20

4 351 0

Báo cáo khoa học: "Feedback Cleaning of Machine Translation Rules Using Automatic Evaluation" pot

... Score Number of Rules Number of Iterations Test Corpus BLEU Score Evaluation Corpus BLEU Score Number of Rules Figure 5: Relationship between Number of Itera- tions and BLEU Scores/Number of ... Cross-cleaning In general, most evaluation corpora are smaller than training corpora. Therefore, omissions of cleaning Training Corpus Training Evaluation Training Evaluation Train...

Ngày tải lên: 08/03/2014, 04:22

8 313 0

Báo cáo khoa học: "Automatic Adaptation of Annotation Standards: Chinese Word Segmentation and POS Tagging – A Case Study" potx

... representation of Ng and Low (2004). For word segmentation only, there are four boundary tags: • b: the begin of the word • m: the middle of the word • e: the end of the word • s: a single-character word while ... point for word segmentation and 1 point for Joint S&T, with corresponding error reductions of 30.2% and 14%. The ﬁnal result outperforms the latest work on the same...

Ngày tải lên: 17/03/2014, 01:20

9 404 0

Tài liệu Báo cáo khoa học: "Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes" pdf

... next performed a simple transformation of the derived rule set. If all children of a rule tree node are of type *scope* or * (i.e. non- cue words), the node label is replaced by *scope* or * respectively, ... tree; neighboring identical siblings of type *scope* or * are replaced by a single node of the corresponding type. Figure 3 shows an example of this transformation. (...

Ngày tải lên: 20/02/2014, 04:20

5 544 1