unsupervised language model adaptation for lecture speech recognition

Tài liệu Báo cáo khoa học: "Web augmentation of language models for continuous speech recognition of SMS text messages" docx

Tài liệu Báo cáo khoa học: "Web augmentation of language models for continuous speech recognition of SMS text messages" docx

... smoothing techniques for language model- ing. Computer Speech and Language, 13:359–394. Joshua T. Goodman. 2001. A bit of progress in lan- guage modeling. Computer Speech and Language, 15:403–434. Slava ... rates [%]. ture models, whereas the best in-domain models are 4- or 5-grams. For every language and model size, the web mixture model performs better than the corre- sponding in-domain model. The ... data were selected for each language. The adaptation was thought to take place off-line on a server. 3.2.1 Data sets For each language, the adaptation takes place on two baseline models, which are...

Ngày tải lên: 22/02/2014, 02:20

9 301 0
Báo cáo khoa học: "Grounded Language Modeling for Automatic Speech Recognition of Sports Video" doc

Báo cáo khoa học: "Grounded Language Modeling for Automatic Speech Recognition of Sports Video" doc

... an information retrieval task. In future work, we will examine the ability of grounded language models to improve perform- ance for other natural language tasks that exploit text based language ... error rates for ASR sys- tems using a grounded language model, a text based language model trained on the switchboard corpus, and the switchboard model interpolated with a text based model trained ... perplexity seen when using the grounded language model compared to the in- terpolated model. Note that these two language models are generated using the same speech tran- scriptions, i.e. the closed...

Ngày tải lên: 17/03/2014, 02:20

9 395 0
Tài liệu Báo cáo khoa học: "Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information" doc

Tài liệu Báo cáo khoa học: "Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information" doc

... Vogel. 2006. Distributed Language Modeling for N-best List Re-ranking. In Proc. of EMNLP 2006, pages 216-223. Bing Zhao, Matthias Eck and Stephan Vogel. 2004. Language Model Adaptation for Statistical ... translation mod- el adaptation and language model adaptation. Here we focus on how to adapt a translation model, which is trained from the large-scale out-of-domain bilin- gual corpus, for domain-specific ... enforces one-to-one topic corre- spondence and enables latent topic distributions to be efficiently transferred across languages, to cross- lingual language modeling and translation lexicon adaptation. ...

Ngày tải lên: 19/02/2014, 19:20

10 533 0
Báo cáo khoa học: "On-line Language Model Biasing for Statistical Machine Translation" docx

Báo cáo khoa học: "On-line Language Model Biasing for Statistical Machine Translation" docx

... parameter estimation. Computational Linguistics, 19:263–311. Woosung Kim. 2005. Language Model Adaptation for Automatic Speech Recognition and Statistical Machine Translation. Ph.D. thesis, The Johns ... USA. Association for Computational Linguistics. Bing Zhao, Matthias Eck, and Stephan Vogel. 2004. Language model adaptation for statistical machine translation with structured query models. In Proceed- ings ... Association for Computational Linguistics:shortpapers, pages 445–449, Portland, Oregon, June 19-24, 2011. c 2011 Association for Computational Linguistics On-line Language Model Biasing for Statistical...

Ngày tải lên: 17/03/2014, 00:20

5 311 0
Báo cáo khoa học: "Algorithm Selection and Model Adaptation for ESL Correction Tasks" doc

Báo cáo khoa học: "Algorithm Selection and Model Adaptation for ESL Correction Tasks" doc

... However, for the model to use these language- specific error statistics, a separate classi- fier for each source language needs to be trained. We propose a novel adaptation method, which shows performance ... comparing systems devel- oped for ESL correction tasks. A language model was found to outperform a maximum entropy classi- fier (Gamon, 2010). However, the language model was trained on the Gigaword ... Perceptron is the best performing model (Sec. 3). Our results do not support earlier conclu- sions with respect to the performance of count-based models (Bergsma et al., 2009) and language mod- els (Gamon,...

Ngày tải lên: 23/03/2014, 16:20

10 518 0
Báo cáo khoa học: "Alignment Model Adaptation for Domain-Specific Word Alignment" pptx

Báo cáo khoa học: "Alignment Model Adaptation for Domain-Specific Word Alignment" pptx

... improve the alignment for general words and use the in-domain bilingual corpus for domain-specific words. We implement this by using alignment model adaptation. Although the adaptation technology ... trained models. In other words, we make use of the out-of-domain training data and the in-domain training data by interpolating the trained alignment models. One method to perform model adaptation ... is the distortion probability in model 3, and the other is the distortion probability in model 4. The interpolation model for the distortion probability in model 3 is shown in (10). Since the...

Ngày tải lên: 31/03/2014, 03:20

8 329 0
Báo cáo khoa học: "A Preference-first Language Processor Integrating the Unification Grammar and Markov Language Model for Speech Recognition-ApplicationS" potx

Báo cáo khoa học: "A Preference-first Language Processor Integrating the Unification Grammar and Markov Language Model for Speech Recognition-ApplicationS" potx

... signal preprocessor is included to form a complete speech recognition system. The language processor consists of a language model and a parser. The language model properly integrates the unification ... Markov language model, and a simple set of unification grammar rules for the Chinese language, although the present model is in fact language independent. The system is written in C language ... summarized. The Laneua~e Model The goal of the language model is to participate in the selection of candidate constituents for a sentence to be identified. The proposed language model is composed...

Ngày tải lên: 08/03/2014, 07:20

6 393 0
Tài liệu Báo cáo khoa học: "Discriminative Syntactic Language Modeling for Speech Recognition" pdf

Tài liệu Báo cáo khoa học: "Discriminative Syntactic Language Modeling for Speech Recognition" pdf

... syntactic language model has the task of modeling a distribution over strings in the lan- guage, in a very similar way to traditional n-gram language models. The Structured Language Model (Chelba ... Jelinek. 2000. Structured language modeling. Computer Speech and Language, 14(4):283–332. Ciprian Chelba. 2000. Exploiting Syntactic Structure for Nat- ural Language Modeling. Ph.D. thesis, The ... Previous Work Techniques for exploiting stochastic context-free grammars for language modeling have been ex- plored for more than a decade. Early approaches included algorithms for efficiently calculating...

Ngày tải lên: 20/02/2014, 15:20

8 410 0
Báo cáo khoa học: "AN AUTOMATIC SPEECH RECOGNITION SYSTEM FOR ITALIAN LANGUAGE" doc

Báo cáo khoa học: "AN AUTOMATIC SPEECH RECOGNITION SYSTEM FOR ITALIAN LANGUAGE" doc

... phoneme models, we can build models for words, or for word strings, simply by concatenating the Markov sources of the corresponding phonemes. Figure 1 shows a typical structure for Markov model ... ABSTRACT 4. An automatic speech recognition system for Italian language has been developed at IBM Italy Scientific Center in Rome. It is able to recognize in real time natural language sentences, ... the Markov model for a word. The structure of Markov models is completely defined by the number of states and by interconneetions among them. It is unique for all the phonemes and for all the...

Ngày tải lên: 01/04/2014, 00:20

4 308 0
Tài liệu Báo cáo khoa học: "Topic Models for Dynamic Translation Model Adaptation" pptx

Tài liệu Báo cáo khoa học: "Topic Models for Dynamic Translation Model Adaptation" pptx

... 115–119, Jeju, Republic of Korea, 8-14 July 2012. c 2012 Association for Computational Linguistics Topic Models for Dynamic Translation Model Adaptation Vladimir Eidelman Computer Science and UMIACS University ... for the source it came from, many word pairs will be unobserved for a given table. This sparsity requires smoothing. Sec- ond, we may not know the (sub)corpora our training 1 Language model adaptation ... topic-specific contexts, where topics are induced in an unsupervised way using topic models; this can be thought of as inducing subcorpora for adaptation with- out any human annotation. We use these...

Ngày tải lên: 19/02/2014, 19:20

5 532 0
w