... of the context in-formation at the sentence level, we adopt thetopical context information in our method for the following reasons: (1) the topic informa-tion captures the context information ... Vogel.2006. Distributed Language Modeling for N-best ListRe-ranking. In Proc. of EMNLP 2006, pages 216-223.Bing Zhao, Matthias Eck and Stephan Vogel. 2004. Language ModelAdaptationfor Statistical ... translation mod-el adaptation and languagemodel adaptation. Herewe focus on how to adapt a translation model, whichis trained from the large-scale out-of-domain bilin-gual corpus, for domain-specific...
... USA. Association for Computational Linguistics.Bing Zhao, Matthias Eck, and Stephan Vogel. 2004. Language modeladaptationfor statistical machinetranslation with structured query models. In Proceed-ings ... parameterestimation. Computational Linguistics, 19:263–311.Woosung Kim. 2005. LanguageModelAdaptation for Automatic Speech Recognition and Statistical MachineTranslation. Ph.D. thesis, The Johns ... Association for Computational Linguistics:shortpapers, pages 445–449,Portland, Oregon, June 19-24, 2011.c2011 Association for Computational LinguisticsOn-line LanguageModel Biasing for Statistical...
... to statis-tical language modeling for Chinese. ACM Trans-action on Asian Language Information Processing,1(1):3–33.Jianfeng Gao, Mu Li, Andi Wu, and Chang-NingHuang. 2004. Chinese word segmentation: ... ex-traction forChinese information retrieval. In SIGIR,pages 50–58.Sabine Deligne and Yoshinori Sagisaka. 2000. Sta-tistical language modeling with a class-based n-multigram model. Comp. ... can beamended by involving the discriminative language modeladaptation in the iteration, which results ina unified languagemodel and lexicon adaptation framework. This can be our future work....
... 115–119,Jeju, Republic of Korea, 8-14 July 2012.c2012 Association for Computational LinguisticsTopic Models for Dynamic Translation Model Adaptation Vladimir EidelmanComputer Scienceand UMIACSUniversity ... transla-tions based on topic-specific contexts, wheretopics are induced in an unsupervised wayusing topic models; this can be thought ofas inducing subcorpora foradaptation with-out any human annotation. ... for the source itcame from, many word pairs will be unobserved for a given table. This sparsity requires smoothing. Sec-ond, we may not know the (sub)corpora our training1 Language model adaptation...
... Syntax-based language models for statistical machine transla-tion. MT Summit IX., Intl. Assoc. for Machine Trans-lation.C. Chelba and F. Jelinek. 1998. Exploiting syntacticstructure forlanguage modeling. ... Dis-tributed language modeling for N-best list re-ranking.The 2006 Conference on Empirical Methods in Natu-ral Language Processing (EMNLP), 216-223.Y. Zhang, 2008. Structured language models for statisti-cal ... n-gram/m-SLM/PLSA language model. The composite n-gram/m-SLM/PLSA lan-guage model can be formulated as a directedMRF model (Wang et al., 2006) with lo-cal normalization constraints for the param-eters...
... June 2005.c2005 Association for Computational LinguisticsA Phonotactic LanguageModelfor Spoken Language Identification Haizhou Li and Bin Ma Institute for Infocomm Research Singapore ... the 1996 NIST LanguageRecognition Evaluation database. 1 Introduction Spoken language and written language are similar in many ways. Therefore, much of the research in spoken language identification, ... n-gram Language Modeling, or PRLM (Zissman, 1996) . Orthographic forms of language, ranging from Latin alphabet to Cyrillic script to Chinese charac-ters, are far more unique to the language...
... each feture. ALL: all features,PER: perceptron model, WLM: word language model, PLM: POS language model, GPR: generating model, LPR: labelling model, LEN: word count penalty.LM with Witten-Bell ... processing of Chinese and other Asian languages. Several mod-els were introduced for these problems, for example,the Hidden Markov Model (HMM) (Rabiner, 1989),Maximum Entropy Model (ME) (Ratnaparkhi ... cascaded linear model for joint Chinese word segmentation and part-of-speech tagging. With a character-basedperceptron as the core, combined with real-valued features such as language models, thecascaded...
... Markov language model, and a simple set of unification grammar rules for the Chinese language, although the present model is in fact language independent. The system is written in C language ... signal preprocessor is included to form a complete speech recognition system. The language processor consists of a languagemodel and a parser. The languagemodel properly integrates the unification ... all of the sentences used in the primary school Chinesetext books. The Markov language model is trained using the primary school Chinese text books as training corpus. Since there are no...
... also ensures thatthe language resembles present-day spokenFrench.• The target population for our formula isyoung people and adults. Therefore, onlytextbooks intended for this public were ... astatistical languagemodel and a measure of tensedifficulty.4.1 The language model The lexical difficulty of a text is quite an elaboratephenomenon to parameterise. The logistic regres-sion models ... oftokens in a text. 2. Deciding what is the best linguistic unit toconsider. The equations introduced above use216.1 The language model: probabilities andsmoothing For our language model, we...
... oftenprovides important clues for POS tagging, and thePOS tags contain much syntactic information, whichneed context information within a large window for disambiguation. For example, Huang et al. ... contribution ofcontext information for the disambiguation. A firstorder Max-Margin Markov Networks model is usedto resolve the sequence tagging problem. We use theSVM-HMM3implementation for the experiments ... sub-word structure for joint segmentationand tagging. Since the sub-words are large enoughin practice, the decoding for POS tagging over sub-words is efficient. Finally, the Chineselanguage ischaracterized...
... cascaded linear modelfor joint chinese word segmentation and part-of-speech tagging. InProceedings of ACL.Wenbin Jiang, Haitao Mi, and Qun Liu. 2008b. Wordlattice reranking forchinese word ... F1results on CTB 3.0. Ourbaseline model outperforms all prior approaches for both Seg and Seg & Tag, and we hope thatour error-driven model can further improve perfor-mance.6 Related workIn ... their representatives.As for search space representation, Ng andLow (2004) found that for Chinese, the character-based model yields better results than the word-based model. Nakagawa and Uchimoto...
... marginal adaptation (Kneser et al., 1997)In this paper, we propose a framework to per-form LM adaptation across languages, enabling the adaptation of a LM from one language based on the adaptation text ... propose a bilingualLSA model (bLSA) for crosslingual LM adaptation that can be applied before translation. The bLSA model consists of two LSA models: one for eachside of the language trained on ... seman-tic modelforunsupervisedlanguagemodel adaptation. In Proc. of ICASSP.A. Venugopal, A. Zollmann, and A. Waibel. 2005. Train-ing and evaluation error minimization rules for statis-tical...