... lexicography The tradition of British Contextualism defines collocations on the basis of statistical assumptions about the probability of the cooccurrence of two lexemes. Particularly ... in machine-readable form in German and French by IBM; cf. [RAAB 1988]. 3.2 The generation of paraphrases One of the aims in the development of the "how-to-say"-component of a g...
Uploaded: 01/04/2014, 00:20
... 469–477, Jeju, Republic of Korea, 8-14 July 2012. © 2012 Association for Computational Linguistics A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining Hassan Sajjad Alexander ... English/Russian respectively. Sajjad11 is computationally expensive. For instance, a phrase-based statistical MT system is built once in every iteration of the heuristic procedure...
Uploaded: 19/02/2014, 19:20
Document: Scientific report: "Improving Statistical Machine Translation with Monolingual Collocation" pdf
... 825–833, Uppsala, Sweden, 11-16 July 2010. © 2010 Association for Computational Linguistics Improving Statistical Machine Translation with Monolingual Collocation Zhanyi Liu 1, Haifeng Wang 2, ... lisheng@hit.edu.cn Abstract This paper proposes to use monolingual collocations to improve Statistical Machine Translation (SMT). We make use of the collocation probabilities,...
Uploaded: 20/02/2014, 04:20
Document: Scientific report: "Guiding Statistical Word Alignment Models With Prior Knowledge" pdf
... better word alignments in the application of statistical machine translation. We propose a simple framework that can integrate prior knowledge into statistical word alignment model training. ... general framework to incorporate prior knowledge such as heuristics or linguistic features in statistical generative word alignment models. Prior knowledge plays a role of probabilistic so...
Uploaded: 20/02/2014, 12:20
Document: Scientific report: "A Statistical Analysis of Morphemes in Japanese Terminology" docx
... A Statistical Analysis of Morphemes in Japanese Terminology Kyo KAGEURA National Center for Science ... in the LNRE zone. 3 The LNRE Framework When a sample is located in the LNRE zone, values of statistical measures such as type-token ratio, the parameters of 'laws' (e.g. of Mandel- ... fact, the results of the term-level and morpheme-level permutations almost coincide,...
Uploaded: 20/02/2014, 18:20
Document: Scientific report: "Generating statistical language models from interpretation grammars in dialogue systems" potx
... Generating statistical language models from interpretation grammars in dialogue systems Rebecca Jonson Dept. ... of Linguistics, Göteborg University and GSLT rj@ling.gu.se Abstract In this paper, we explore statistical language modelling for a speech-enabled MP3 player application by generating a corpus ... grammar written for the application with the Grammatical Framework (GF) (Ra...
Uploaded: 22/02/2014, 02:20
Scientific report: "A Statistical Tree Annotator and Its Applications" pptx
... 1230–1238, Portland, Oregon, June 19-24, 2011. © 2011 Association for Computational Linguistics A Statistical Tree Annotator and Its Applications Xiaoqiang Luo and Bing Zhao IBM T.J. Watson Research ... natural language applications, there is a need to enrich syntactical parse trees. We present a statistical tree annotator augmenting nodes with additional information. The annotator is...
Uploaded: 07/03/2014, 22:20
Scientific report: "Combining Statistical and Knowledge-based Spoken Language Understanding in Conditional Models" pptx
... statistical learning framework, has previously been introduced to reduce data requirement. The major contribution of this paper is the investigation of integrating prior knowledge and statistical ... recognition, and best practices in language engineering for every new domain. On the other hand, a statistical learning approach needs a large amount of annotated data for model tra...
Uploaded: 08/03/2014, 02:21
Scientific report: "Boosting Statistical Word Alignment Using Labeled and Unlabeled Data" ppt
... experimental results. Finally, we conclude in section 6. 2 Statistical Word Alignment Model According to the IBM models (Brown et al., 1993), the statistical word alignment model can be generally ... liuzhanyi}@rdc.toshiba.com.cn Abstract This paper proposes a semi-supervised boosting approach to improve statistical word alignment with limited labeled data and large amo...
Uploaded: 08/03/2014, 02:21
Scientific report: "Improving Statistical Natural Language Translation with Categories and Rules" potx
... This paper describes an all-level approach on statistical natural language translation (SNLT). Without any predefined knowledge the system learns a statistical translation lexicon (STL), word ... Improving Statistical Natural Language Translation with Categories and Rules Franz Josef Och and Hans Weber ... itself is realized as a beam search. In our method example-based techni...
Uploaded: 08/03/2014, 05:21