Scientific report: "Clustering Comparable Corpora For Bilingual Lexicon Extraction" ppt

... Association for Computational Linguistics: short papers, pages 473–478, Portland, Oregon, June 19-24, 2011. © 2011 Association for Computational Linguistics Clustering Comparable Corpora For Bilingual Lexicon ... to automatically extract bilingual lexicons from various corpora. Compared with parallel corpora, it is much easier to build high-volume comparable corpora, i....

Uploaded: 23/03/2014, 16:20

6 308 0
Scientific report: "Exploiting Comparable Corpora and Bilingual Dictionaries for Cross-Language Text Categorization" potx

... pages 553–560, Sydney, July 2006. © 2006 Association for Computational Linguistics Exploiting Comparable Corpora and Bilingual Dictionaries for Cross-Language Text Categorization Alfio Gliozzo ... Models from comparable corpora. Experiments show the effectiveness of our approach, providing a low-cost solution for the Cross-Language Text Categorization task. In particular, when...

Uploaded: 17/03/2014, 04:20

8 361 0
Document: Scientific report: "Minimum Cut Model for Spoken Lecture Segmentation" ppt

... three lectures is used for estimating the optimal word block length for representing nodes, the threshold distances for discarding node edges, the number of uniform chunks for estimating tf-idf ... lexical weights, the alpha parameter for smoothing, and the length of the smoothing window. We use a simple greedy search procedure for optimizing the parameters. 5.2 Corpora We eval...

Uploaded: 20/02/2014, 11:21

8 495 0
Document: Scientific report: "A Declarative Language for Implementing Dynamic Programs" pptx

... acquire more over time: we intend for it to generalize and encapsulate best practices, and serve as a testbed for new practices. Dyna is now being used for parsing, machine translation, morphological analysis, ... probabilities, which permits heuristic early stopping before the agenda is empty. With Viterbi values, it amounts to uniform-cost search for the best parse, and an item’...

Uploaded: 20/02/2014, 16:20

4 560 0
Document: Scientific report: "Using Confidence Bands for Parallel Texts Alignment" pptx

... Parallel Corpora across Language Groups. Machine Translation, 12/1–2 (Special issue), pp. 53–87. William Gale and Kenneth Church (1991) A Program for Aligning Sentences in Bilingual Corpora. In ... intermingled. Parallel texts (texts that are mutual translations) are valuable sources of information for bilingual lexicography. However, they are not of much use unless a comp...

Uploaded: 20/02/2014, 18:20

8 464 0
Document: Scientific report: "AN INDEXING TECHNIQUE FOR IMPLEMENTING COMMAND RELATIONS" ppt

... requirements for the formalism used. Hence, the indexing technique has a wide spectrum of applications for testing command relations in syntactic analysis. Furthermore, this method can also be used for ... special cases of definitions 1.1 and 1.2 for the property of being a set of maximal projections. Before I formulate the general command definition in a formal way, I will now...

Uploaded: 22/02/2014, 10:20

6 456 0
Scientific report: "Hypothesis Mixture Decoding for Statistical Machine Translation" ppt

... end for 12: end for 13: for each hypothesis […] do 14: compute HM decoding features for […] 15: add […] to […] 16: end for 17: for ... BTG-based HM Decoding 1: for each component model […] do 2: output the search space […] for the input […] 3: end for 4: for […] to […] do 5: for all […] s...

Uploaded: 07/03/2014, 22:20

10 389 0
Scientific report: "A Bayesian Model for Discovering Typological Implications" ppt

... single m variable, we have three: one for IE, one for Austronesian and one for “all languages.” For a general tree, we assign one implication variable for each node (including the root and ... of performing a simple Gibbs sample for m in Step (4), we first sample the m values for the internal nodes using simple Gibbs updates. For the leaf nodes, we use rejection sampling....

Uploaded: 08/03/2014, 02:21

8 472 0
Scientific report: "Mining the Web for Bilingual Text" pot

... Mining the Web for Bilingual Text Philip Resnik* Dept. of Linguistics/Institute for Advanced Computer Studies University of Maryland, College ... quantities of bilingual text aligned at the document or sentence level, and a number of approaches in the burgeoning field of cross-language information retrieval exploit parallel corpora either ... on information from bilingua...

Uploaded: 08/03/2014, 06:20

8 229 0