Báo cáo khoa học: "AUTOMATIC ALIGNMENT IN PARALLEL CORPORA" potx

Báo cáo khoa học: "AUTOMATIC ALIGNMENT IN PARALLEL CORPORA" potx

Báo cáo khoa học: "AUTOMATIC ALIGNMENT IN PARALLEL CORPORA" potx

... concern the testing of the scheme's efficiency at lower levels endowed with necessary bilingual information about potential delimiters. INTRODUCTION Parallel linguistically meaningful text ... clause, and phrase level. Alignment at any of these levels has to invoke a different set of textual and linguistic information (acting as unit delimiters). In this paper, alignment...

Ngày tải lên: 08/03/2014, 07:20

3 193 0
Tài liệu Báo cáo khoa học: "ALIGNING SENTENCES IN PARALLEL CORPORA" doc

Tài liệu Báo cáo khoa học: "ALIGNING SENTENCES IN PARALLEL CORPORA" doc

... statistical tech- nique for aligning sentences with their translations in two parallel corpora. In addition to certain anchor points that are available in our da.ta, the only information about the sentences ... are shown in Table 1. ALIGNING ANCHOR POINTS After examining the Hansard corpora, we realized that the comments laced throughout would serve as uscflll anchor points...

Ngày tải lên: 20/02/2014, 21:20

8 387 0
Báo cáo khoa học: "Word Alignment in English-Hindi Parallel Corpus Using Recency-Vector Approach: Some Studies" ppt

Báo cáo khoa học: "Word Alignment in English-Hindi Parallel Corpus Using Recency-Vector Approach: Some Studies" ppt

... Algorithms In order to reduce the complexity of the dynamic programming algorithm certain constraints have been proposed in (Fung and McKeown, 1994). 1. Starting Point Constraint: The constraint im- posed ... main padtaa hoon (Masculine) but main padtii hoon (Feminine) You read. → tum padte ho (Masculine) or tum padtii ho (Feminine) He will read. → wah padegaa. Due to the presence of mult...

Ngày tải lên: 23/03/2014, 18:20

8 388 0
Tài liệu Báo cáo khoa học: "Text Alignment in a Tool for Translating Revised Documents" docx

Tài liệu Báo cáo khoa học: "Text Alignment in a Tool for Translating Revised Documents" docx

... text) within a certain number of segments, it is interpreted as a case of compensation; if it occurs farther away the situation is interpreted as involving two indepen- dent editing operations. ... The window is set to 4, since the dynamic programming approach is very fast in recovering from local errors. When such a sequence is found, all the segments included in it are marked as...

Ngày tải lên: 22/02/2014, 10:20

5 456 0
Báo cáo khoa học: "Automatic Editing in a Back-End Speech-to-Text System" doc

Báo cáo khoa học: "Automatic Editing in a Back-End Speech-to-Text System" doc

... aim to eliminate it by integrating the punctuation features into the transformation step. In the future we plan to inte- grate additional knowledge sources into our statis- tical method in order ... interactions and non-local changes in the mini- mum edit distance alignment. A subset of the top- ranked non-overlapping rules satisfying frequency and minimum impact constraints are select...

Ngày tải lên: 17/03/2014, 02:20

7 363 0
Báo cáo khoa học: "Automatic Paraphrasing in Essay Format" pdf

Báo cáo khoa học: "Automatic Paraphrasing in Essay Format" pdf

... text, routines for generating relative clauses, although, again, none may occur in the input text, and a routine for converting source text verbs to output text forms end- ing in '-ing.' ... occurring in the outline. The verbs selected still include those in the main text as well as the ones in the outline. Theoretically, the main text could consist of a large libra...

Ngày tải lên: 30/03/2014, 17:20

16 394 0
Báo cáo khoa học: "Paraphrasing with Bilingual Parallel Corpora" pot

Báo cáo khoa học: "Paraphrasing with Bilingual Parallel Corpora" pot

... these points below. 4.3 Using multiple corpora Work in statistical machine translation suggests that, like many other machine learning problems, perfor- mance increases as the amount of training ... try improving the alignments by simply adding more German-English training data. However, there is nothing that limits our paraphrase extraction method to drawing on candidate paraphrases from a...

Ngày tải lên: 08/03/2014, 04:22

8 308 0
Báo cáo khoa học: "Automatic Labeling of Semantic Roles" potx

Báo cáo khoa học: "Automatic Labeling of Semantic Roles" potx

... Elements: Categorization Cognizer Item Category Criterion S NP PRP VP VBD NP SBAR IN S NNP VP VBD NP PP PRP IN NP NN Goal Source Theme Target NP He heard the sound of liquid slurping in a metal container as approached him from behindFarrell

Ngày tải lên: 08/03/2014, 05:20

9 397 0
Báo cáo khoa học: "Parsing Idioms in Lexicalized TAGs" potx

Báo cáo khoa học: "Parsing Idioms in Lexicalized TAGs" potx

... We distinguish two kinds of dis- continuities: discontinuities that come from inter- nal structures and discontinuities that come from the insertion of modifiers. 5.1 Internal Discontinuities ... on the object, and so on. In the first pass, the parser loads all the trees in the tree family corresponding to an item in the input string (unless certain trees in that family do not...

Ngày tải lên: 24/03/2014, 05:21

9 263 0
Báo cáo khoa học: "Subgroup Detection in Ideological Discussions" potx

Báo cáo khoa học: "Subgroup Detection in Ideological Discussions" potx

... 2004a. Mining and summa- rizing customer reviews. In KDD’04, pages 168–177. Minqing Hu and Bing Liu. 2004b. Mining and summa- rizing customer reviews. In Proceedings of the tenth ACM SIGKDD international ... 1115–1118. Soo-Min Kim and Eduard Hovy. 2004. Determining the sentiment of opinions. In COLING, pages 1367–1373. Dan Klein and Christopher D. Manning. 2003. Accu- rate unlexicalized...

Ngày tải lên: 30/03/2014, 17:20

11 430 0
Từ khóa:
w