... smoothing techniques forlanguage model- ing. Computer Speech and Language, 13:359–394.Joshua T. Goodman. 2001. A bit of progress in lan-guage modeling. Computer Speech and Language, 15:403–434.Slava ... rates [%].ture models, whereas the best in-domain modelsare 4- or 5-grams. For every language and model size, the webmixture model performs better than the corre-sponding in-domain model. The ... data wereselected for each language. The adaptation wasthought to take place off-line on a server.3.2.1 Data sets For each language, the adaptation takes place ontwo baseline models, which are...
... an information retrieval task. In future work, we will examine the ability of grounded language models to improve perform-ance for other natural language tasks that exploit text based language ... error rates for ASR sys-tems using a grounded language model, a text based language model trained on the switchboard corpus, and the switchboard model interpolated with a text based model trained ... perplexity seen when using the grounded languagemodel compared to the in-terpolated model. Note that these two language models are generated using the same speech tran-scriptions, i.e. the closed...
... Vogel.2006. Distributed Language Modeling for N-best ListRe-ranking. In Proc. of EMNLP 2006, pages 216-223.Bing Zhao, Matthias Eck and Stephan Vogel. 2004. Language ModelAdaptationfor Statistical ... translation mod-el adaptation and languagemodel adaptation. Herewe focus on how to adapt a translation model, whichis trained from the large-scale out-of-domain bilin-gual corpus, for domain-specific ... enforces one-to-one topic corre-spondence and enables latent topic distributions tobe efficiently transferred across languages, to cross-lingual language modeling and translation lexicon adaptation. ...
... parameterestimation. Computational Linguistics, 19:263–311.Woosung Kim. 2005. LanguageModelAdaptation for Automatic SpeechRecognition and Statistical MachineTranslation. Ph.D. thesis, The Johns ... USA. Association for Computational Linguistics.Bing Zhao, Matthias Eck, and Stephan Vogel. 2004. Language modeladaptationfor statistical machinetranslation with structured query models. In Proceed-ings ... Association for Computational Linguistics:shortpapers, pages 445–449,Portland, Oregon, June 19-24, 2011.c2011 Association for Computational LinguisticsOn-line LanguageModel Biasing for Statistical...
... However, for the model to use these language- specific error statistics, a separate classi-fier for each source language needs to be trained.We propose a novel adaptation method, whichshows performance ... comparing systems devel-oped for ESL correction tasks. A language model was found to outperform a maximum entropy classi-fier (Gamon, 2010). However, the language model was trained on the Gigaword ... Perceptron is the best performing model (Sec. 3). Our results do not support earlier conclu-sions with respect to the performance of count-basedmodels (Bergsma et al., 2009) and language mod-els (Gamon,...
... improve the alignment for general words and use the in-domain bilingual corpus for domain-specific words. We implement this by using alignment model adaptation. Although the adaptation technology ... trained models. In other words, we make use of the out-of-domain training data and the in-domain training data by interpolating the trained alignment models. One method to perform modeladaptation ... is the distortion probability in model 3, and the other is the distortion probability in model 4. The interpolation modelfor the distortion probability in model 3 is shown in (10). Since the...
... signal preprocessor is included to form a complete speech recognition system. The language processor consists of a languagemodel and a parser. The languagemodel properly integrates the unification ... Markov language model, and a simple set of unification grammar rules for the Chinese language, although the present model is in fact language independent. The system is written in C language ... summarized. The Laneua~e Model The goal of the languagemodel is to participate in the selection of candidate constituents for a sentence to be identified. The proposed language model is composed...
... syntactic languagemodel has the taskof modeling a distribution over strings in the lan-guage, in a very similar way to traditional n-gram language models. The Structured Language Model (Chelba ... Jelinek. 2000. Structured language modeling. Computer Speech and Language, 14(4):283–332.Ciprian Chelba. 2000. Exploiting Syntactic Structure for Nat-ural Language Modeling. Ph.D. thesis, The ... Previous WorkTechniques for exploiting stochastic context-freegrammars forlanguage modeling have been ex-plored for more than a decade. Early approachesincluded algorithms for efficiently calculating...
... phoneme models, we can build models for words, or for word strings, simply by concatenating the Markov sources of the corresponding phonemes. Figure 1 shows a typical structure for Markov model ... ABSTRACT 4. An automatic speechrecognition system for Italian language has been developed at IBM Italy Scientific Center in Rome. It is able to recognize in real time natural language sentences, ... the Markov modelfor a word. The structure of Markov models is completely defined by the number of states and by interconneetions among them. It is unique for all the phonemes and for all the...
... 115–119,Jeju, Republic of Korea, 8-14 July 2012.c2012 Association for Computational LinguisticsTopic Models for Dynamic Translation Model Adaptation Vladimir EidelmanComputer Scienceand UMIACSUniversity ... for the source itcame from, many word pairs will be unobserved for a given table. This sparsity requires smoothing. Sec-ond, we may not know the (sub)corpora our training1 Language modeladaptation ... topic-specific contexts, wheretopics are induced in an unsupervised wayusing topic models; this can be thought ofas inducing subcorpora foradaptation with-out any human annotation. We use these...