Unsupervised language model adaptation for handwritten Chinese text recognition


Scientific report: "Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information"

... of the context information at the sentence level, we adopt the topical context information in our method for the following reasons: (1) the topic information captures the context information ... Vogel. 2006. Distributed Language Modeling for N-best List Re-ranking. In Proc. of EMNLP 2006, pages 216-223. Bing Zhao, Matthias Eck and Stephan Vogel. 2004. Language Model Adaptation for Statistical ... translation model adaptation and language model adaptation. Here we focus on how to adapt a translation model, which is trained from the large-scale out-of-domain bilingual corpus, for domain-specific...

Scientific report: "On-line Language Model Biasing for Statistical Machine Translation"

... USA. Association for Computational Linguistics. Bing Zhao, Matthias Eck, and Stephan Vogel. 2004. Language model adaptation for statistical machine translation with structured query models. In Proceedings ... parameter estimation. Computational Linguistics, 19:263–311. Woosung Kim. 2005. Language Model Adaptation for Automatic Speech Recognition and Statistical Machine Translation. Ph.D. thesis, The Johns ... Association for Computational Linguistics: shortpapers, pages 445–449, Portland, Oregon, June 19-24, 2011. ©2011 Association for Computational Linguistics On-line Language Model Biasing for Statistical...

Scientific report: "Discriminative Lexicon Adaptation for Improved Character Accuracy – A New Direction in Chinese Language Modeling"

... to statistical language modeling for Chinese. ACM Transaction on Asian Language Information Processing, 1(1):3–33. Jianfeng Gao, Mu Li, Andi Wu, and Chang-Ning Huang. 2004. Chinese word segmentation: ... extraction for Chinese information retrieval. In SIGIR, pages 50–58. Sabine Deligne and Yoshinori Sagisaka. 2000. Statistical language modeling with a class-based n-multigram model. Comp. ... can be amended by involving the discriminative language model adaptation in the iteration, which results in a unified language model and lexicon adaptation framework. This can be our future work....

Scientific report: "Topic Models for Dynamic Translation Model Adaptation"

... 115–119, Jeju, Republic of Korea, 8-14 July 2012. ©2012 Association for Computational Linguistics Topic Models for Dynamic Translation Model Adaptation Vladimir Eidelman Computer Science and UMIACS University ... translations based on topic-specific contexts, where topics are induced in an unsupervised way using topic models; this can be thought of as inducing subcorpora for adaptation without any human annotation. ... for the source it came from, many word pairs will be unobserved for a given table. This sparsity requires smoothing. Second, we may not know the (sub)corpora our training 1 Language model adaptation...

Scientific report: "A Large Scale Distributed Syntactic, Semantic and Lexical Language Model for Machine Translation"

... Syntax-based language models for statistical machine translation. MT Summit IX., Intl. Assoc. for Machine Translation. C. Chelba and F. Jelinek. 1998. Exploiting syntactic structure for language modeling. ... Distributed language modeling for N-best list re-ranking. The 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP), 216-223. Y. Zhang, 2008. Structured language models for statistical ... n-gram/m-SLM/PLSA language model. The composite n-gram/m-SLM/PLSA language model can be formulated as a directed MRF model (Wang et al., 2006) with local normalization constraints for the parameters...

Scientific report: "Unsupervised Segmentation of Chinese Text by Use of Branching Entropy"

... COLING/ACL 2006 Main Conference Poster Sessions, pages 428–435, Sydney, July 2006. ©2006 Association for Computational Linguistics...

Scientific report: "A Phonotactic Language Model for Spoken Language Identification"

... June 2005. ©2005 Association for Computational Linguistics A Phonotactic Language Model for Spoken Language Identification Haizhou Li and Bin Ma Institute for Infocomm Research Singapore ... the 1996 NIST Language Recognition Evaluation database. 1 Introduction Spoken language and written language are similar in many ways. Therefore, much of the research in spoken language identification, ... n-gram Language Modeling, or PRLM (Zissman, 1996). Orthographic forms of language, ranging from Latin alphabet to Cyrillic script to Chinese characters, are far more unique to the language...

Scientific report: "A Cascaded Linear Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging"

... each feature. ALL: all features, PER: perceptron model, WLM: word language model, PLM: POS language model, GPR: generating model, LPR: labelling model, LEN: word count penalty. LM with Witten-Bell ... processing of Chinese and other Asian languages. Several models were introduced for these problems, for example, the Hidden Markov Model (HMM) (Rabiner, 1989), Maximum Entropy Model (ME) (Ratnaparkhi ... cascaded linear model for joint Chinese word segmentation and part-of-speech tagging. With a character-based perceptron as the core, combined with real-valued features such as language models, the cascaded...

Scientific report: "A Preference-first Language Processor Integrating the Unification Grammar and Markov Language Model for Speech Recognition Applications"

... Markov language model, and a simple set of unification grammar rules for the Chinese language, although the present model is in fact language independent. The system is written in C language ... signal preprocessor is included to form a complete speech recognition system. The language processor consists of a language model and a parser. The language model properly integrates the unification ... all of the sentences used in the primary school Chinese text books. The Markov language model is trained using the primary school Chinese text books as training corpus. Since there are no...

Scientific report: "Combining a Statistical Language Model with Logistic Regression to Predict the Lexical and Syntactic Difficulty of Texts for FFL"

... also ensures that the language resembles present-day spoken French. • The target population for our formula is young people and adults. Therefore, only textbooks intended for this public were ... a statistical language model and a measure of tense difficulty. 4.1 The language model The lexical difficulty of a text is quite an elaborate phenomenon to parameterise. The logistic regression models ... of tokens in a text. 2. Deciding what is the best linguistic unit to consider. The equations introduced above use 216.1 The language model: probabilities and smoothing For our language model, we...

Scientific report: "A Stacked Sub-Word Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging"

... often provides important clues for POS tagging, and the POS tags contain much syntactic information, which need context information within a large window for disambiguation. For example, Huang et al. ... contribution of context information for the disambiguation. A first order Max-Margin Markov Networks model is used to resolve the sequence tagging problem. We use the SVM-HMM implementation for the experiments ... sub-word structure for joint segmentation and tagging. Since the sub-words are large enough in practice, the decoding for POS tagging over sub-words is efficient. Finally, the Chinese language is characterized...

Scientific report: "An Error-Driven Word-Character Hybrid Model for Joint Chinese Word Segmentation and POS Tagging"

... cascaded linear model for joint Chinese word segmentation and part-of-speech tagging. In Proceedings of ACL. Wenbin Jiang, Haitao Mi, and Qun Liu. 2008b. Word lattice reranking for Chinese word ... F1 results on CTB 3.0. Our baseline model outperforms all prior approaches for both Seg and Seg & Tag, and we hope that our error-driven model can further improve performance. 6 Related work In ... their representatives. As for search space representation, Ng and Low (2004) found that for Chinese, the character-based model yields better results than the word-based model. Nakagawa and Uchimoto...

Scientific report: "Bilingual-LSA Based LM Adaptation for Spoken Language Translation"

... marginal adaptation (Kneser et al., 1997) In this paper, we propose a framework to perform LM adaptation across languages, enabling the adaptation of a LM from one language based on the adaptation text ... propose a bilingual LSA model (bLSA) for crosslingual LM adaptation that can be applied before translation. The bLSA model consists of two LSA models: one for each side of the language trained on ... semantic model for unsupervised language model adaptation. In Proc. of ICASSP. A. Venugopal, A. Zollmann, and A. Waibel. 2005. Training and evaluation error minimization rules for statistical...