unsupervised language model adaptation for handwritten chinese text recognition

Tài liệu Báo cáo khoa học: "Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information" doc

Tài liệu Báo cáo khoa học: "Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information" doc

... of the context in- formation at the sentence level, we adopt the topical context information in our method for the following reasons: (1) the topic informa- tion captures the context information ... Vogel. 2006. Distributed Language Modeling for N-best List Re-ranking. In Proc. of EMNLP 2006, pages 216-223. Bing Zhao, Matthias Eck and Stephan Vogel. 2004. Language Model Adaptation for Statistical ... translation mod- el adaptation and language model adaptation. Here we focus on how to adapt a translation model, which is trained from the large-scale out-of-domain bilin- gual corpus, for domain-specific...

Ngày tải lên: 19/02/2014, 19:20

10 533 0
Báo cáo khoa học: "On-line Language Model Biasing for Statistical Machine Translation" docx

Báo cáo khoa học: "On-line Language Model Biasing for Statistical Machine Translation" docx

... USA. Association for Computational Linguistics. Bing Zhao, Matthias Eck, and Stephan Vogel. 2004. Language model adaptation for statistical machine translation with structured query models. In Proceed- ings ... parameter estimation. Computational Linguistics, 19:263–311. Woosung Kim. 2005. Language Model Adaptation for Automatic Speech Recognition and Statistical Machine Translation. Ph.D. thesis, The Johns ... Association for Computational Linguistics:shortpapers, pages 445–449, Portland, Oregon, June 19-24, 2011. c 2011 Association for Computational Linguistics On-line Language Model Biasing for Statistical...

Ngày tải lên: 17/03/2014, 00:20

5 311 0
Tài liệu Báo cáo khoa học: "Discriminative Lexicon Adaptation for Improved Character Accuracy – A New Direction in Chinese Language Modeling" pptx

Tài liệu Báo cáo khoa học: "Discriminative Lexicon Adaptation for Improved Character Accuracy – A New Direction in Chinese Language Modeling" pptx

... to statis- tical language modeling for Chinese. ACM Trans- action on Asian Language Information Processing, 1(1):3–33. Jianfeng Gao, Mu Li, Andi Wu, and Chang-Ning Huang. 2004. Chinese word segmentation: ... ex- traction for Chinese information retrieval. In SIGIR, pages 50–58. Sabine Deligne and Yoshinori Sagisaka. 2000. Sta- tistical language modeling with a class-based n- multigram model. Comp. ... can be amended by involving the discriminative language model adaptation in the iteration, which results in a unified language model and lexicon adaptation framework. This can be our future work....

Ngày tải lên: 20/02/2014, 07:20

9 466 0
Tài liệu Báo cáo khoa học: "Topic Models for Dynamic Translation Model Adaptation" pptx

Tài liệu Báo cáo khoa học: "Topic Models for Dynamic Translation Model Adaptation" pptx

... 115–119, Jeju, Republic of Korea, 8-14 July 2012. c 2012 Association for Computational Linguistics Topic Models for Dynamic Translation Model Adaptation Vladimir Eidelman Computer Science and UMIACS University ... transla- tions based on topic-specific contexts, where topics are induced in an unsupervised way using topic models; this can be thought of as inducing subcorpora for adaptation with- out any human annotation. ... for the source it came from, many word pairs will be unobserved for a given table. This sparsity requires smoothing. Sec- ond, we may not know the (sub)corpora our training 1 Language model adaptation...

Ngày tải lên: 19/02/2014, 19:20

5 532 0
Tài liệu Báo cáo khoa học: "A Large Scale Distributed Syntactic, Semantic and Lexical Language Model for Machine Translation" doc

Tài liệu Báo cáo khoa học: "A Large Scale Distributed Syntactic, Semantic and Lexical Language Model for Machine Translation" doc

... Syntax- based language models for statistical machine transla- tion. MT Summit IX., Intl. Assoc. for Machine Trans- lation. C. Chelba and F. Jelinek. 1998. Exploiting syntactic structure for language modeling. ... Dis- tributed language modeling for N-best list re-ranking. The 2006 Conference on Empirical Methods in Natu- ral Language Processing (EMNLP), 216-223. Y. Zhang, 2008. Structured language models for statisti- cal ... n-gram/m- SLM/PLSA language model. The composite n-gram/m-SLM/PLSA lan- guage model can be formulated as a directed MRF model (Wang et al., 2006) with lo- cal normalization constraints for the param- eters...

Ngày tải lên: 20/02/2014, 04:20

10 568 0
Tài liệu Báo cáo khoa học: "Unsupervised Segmentation of Chinese Text by Use of Branching Entropy" pdf

Tài liệu Báo cáo khoa học: "Unsupervised Segmentation of Chinese Text by Use of Branching Entropy" pdf

... COLING/ACL 2006 Main Conference Poster Sessions, pages 428–435, Sydney, July 2006. c 2006 Association for Computational Linguistics 428 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.55 0.6 0.65...

Ngày tải lên: 20/02/2014, 12:20

8 395 0
Tài liệu Báo cáo khoa học: "A Phonotactic Language Model for Spoken Language Identification" pptx

Tài liệu Báo cáo khoa học: "A Phonotactic Language Model for Spoken Language Identification" pptx

... June 2005. c 2005 Association for Computational Linguistics A Phonotactic Language Model for Spoken Language Identification Haizhou Li and Bin Ma Institute for Infocomm Research Singapore ... the 1996 NIST Language Recognition Evaluation database. 1 Introduction Spoken language and written language are similar in many ways. Therefore, much of the research in spoken language identification, ... n-gram Language Modeling, or PRLM (Zissman, 1996) . Orthographic forms of language, ranging from Latin alphabet to Cyrillic script to Chinese charac- ters, are far more unique to the language...

Ngày tải lên: 20/02/2014, 15:20

8 437 0
Báo cáo khoa học: "A Cascaded Linear Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" pdf

Báo cáo khoa học: "A Cascaded Linear Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" pdf

... each feture. ALL: all features, PER: perceptron model, WLM: word language model, PLM: POS language model, GPR: generating model, LPR: labelling model, LEN: word count penalty. LM with Witten-Bell ... processing of Chinese and other Asian languages. Several mod- els were introduced for these problems, for example, the Hidden Markov Model (HMM) (Rabiner, 1989), Maximum Entropy Model (ME) (Ratnaparkhi ... cascaded linear model for joint Chinese word segmentation and part- of-speech tagging. With a character-based perceptron as the core, combined with real- valued features such as language models, the cascaded...

Ngày tải lên: 08/03/2014, 01:20

8 445 0
Báo cáo khoa học: "A Preference-first Language Processor Integrating the Unification Grammar and Markov Language Model for Speech Recognition-ApplicationS" potx

Báo cáo khoa học: "A Preference-first Language Processor Integrating the Unification Grammar and Markov Language Model for Speech Recognition-ApplicationS" potx

... Markov language model, and a simple set of unification grammar rules for the Chinese language, although the present model is in fact language independent. The system is written in C language ... signal preprocessor is included to form a complete speech recognition system. The language processor consists of a language model and a parser. The language model properly integrates the unification ... all of the sentences used in the primary school Chinese text books. The Markov language model is trained using the primary school Chinese text books as training corpus. Since there are no...

Ngày tải lên: 08/03/2014, 07:20

6 393 0
Báo cáo khoa học: "Combining a Statistical Language Model with Logistic Regression to Predict the Lexical and Syntactic Difficulty of Texts for FFL" potx

Báo cáo khoa học: "Combining a Statistical Language Model with Logistic Regression to Predict the Lexical and Syntactic Difficulty of Texts for FFL" potx

... also ensures that the language resembles present-day spoken French. • The target population for our formula is young people and adults. Therefore, only textbooks intended for this public were ... a statistical language model and a measure of tense difficulty. 4.1 The language model The lexical difficulty of a text is quite an elaborate phenomenon to parameterise. The logistic regres- sion models ... of tokens in a text. 2. Deciding what is the best linguistic unit to consider. The equations introduced above use 21 6.1 The language model: probabilities and smoothing For our language model, we...

Ngày tải lên: 08/03/2014, 21:20

9 514 0
Báo cáo khoa học: "A Stacked Sub-Word Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" potx

Báo cáo khoa học: "A Stacked Sub-Word Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" potx

... often provides important clues for POS tagging, and the POS tags contain much syntactic information, which need context information within a large window for disambiguation. For example, Huang et al. ... contribution of context information for the disambiguation. A first order Max-Margin Markov Networks model is used to resolve the sequence tagging problem. We use the SVM-HMM 3 implementation for the experiments ... sub-word structure for joint segmentation and tagging. Since the sub-words are large enough in practice, the decoding for POS tagging over sub- words is efficient. Finally, the Chinese language is characterized...

Ngày tải lên: 17/03/2014, 00:20

10 412 0
Báo cáo khoa học: "An Error-Driven Word-Character Hybrid Model for Joint Chinese Word Segmentation and POS Tagging" docx

Báo cáo khoa học: "An Error-Driven Word-Character Hybrid Model for Joint Chinese Word Segmentation and POS Tagging" docx

... cascaded linear model for joint chinese word segmentation and part-of-speech tagging. In Proceedings of ACL. Wenbin Jiang, Haitao Mi, and Qun Liu. 2008b. Word lattice reranking for chinese word ... F 1 results on CTB 3.0. Our baseline model outperforms all prior approaches for both Seg and Seg & Tag, and we hope that our error-driven model can further improve perfor- mance. 6 Related work In ... their representatives. As for search space representation, Ng and Low (2004) found that for Chinese, the character- based model yields better results than the word- based model. Nakagawa and Uchimoto...

Ngày tải lên: 17/03/2014, 01:20

9 338 0
Báo cáo khoa học: "Bilingual-LSA Based LM Adaptation for Spoken Language Translation" pot

Báo cáo khoa học: "Bilingual-LSA Based LM Adaptation for Spoken Language Translation" pot

... marginal adaptation (Kneser et al., 1997) In this paper, we propose a framework to per- form LM adaptation across languages, enabling the adaptation of a LM from one language based on the adaptation text ... propose a bilingual LSA model (bLSA) for crosslingual LM adaptation that can be applied before translation. The bLSA model consists of two LSA models: one for each side of the language trained on ... seman- tic model for unsupervised language model adaptation. In Proc. of ICASSP. A. Venugopal, A. Zollmann, and A. Waibel. 2005. Train- ing and evaluation error minimization rules for statis- tical...

Ngày tải lên: 17/03/2014, 04:20

8 279 0
w