continuous space language models using restricted boltzmann machines

Tài liệu Báo cáo khoa học: "Temporal Restricted Boltzmann Machines for Dependency Parsing" pdf

Tài liệu Báo cáo khoa học: "Temporal Restricted Boltzmann Machines for Dependency Parsing" pdf

Ngày tải lên : 20/02/2014, 04:20
... propose a generative model based on Temporal Restricted Boltzmann Machines for transition based dependency parsing. The parse tree is built incrementally using a shift- reduce parse and an RBM is ... propose to address the problem of inference in a high-dimensional latent space by using an undi- rected graphical model, Restricted Boltzmann Ma- chines (RBMs), to model the individual parsing decisions. ... Oregon, June 19-24, 2011. c 2011 Association for Computational Linguistics Temporal Restricted Boltzmann Machines for Dependency Parsing Nikhil Garg Department of Computer Science University...
  • 7
  • 414
  • 0
Tài liệu Báo cáo khoa học: "Reading Level Assessment Using Support Vector Machines and Statistical Language Models" pdf

Tài liệu Báo cáo khoa học: "Reading Level Assessment Using Support Vector Machines and Statistical Language Models" pdf

Ngày tải lên : 20/02/2014, 15:20
... of using statistical language models. In this paper, we also use support vector machines to combine features from tradi- tional reading level measures, statistical language models, and other language ... of syntax. Our approach uses n- gram language models as a low-cost automatic ap- proximation of both syntactic and semantic analy- sis. Statistical language models (LMs) are used suc- cessfully ... 2005. c 2005 Association for Computational Linguistics Reading Level Assessment Using Support Vector Machines and Statistical Language Models Sarah E. Schwarm Dept. of Computer Science and Engineering University...
  • 8
  • 446
  • 0
Tài liệu Báo cáo khoa học: "Web augmentation of language models for continuous speech recognition of SMS text messages" docx

Tài liệu Báo cáo khoa học: "Web augmentation of language models for continuous speech recognition of SMS text messages" docx

Ngày tải lên : 22/02/2014, 02:20
... for query selection and its applications in LM aug- mentation and adaptation using web data. The language models are part of a continuous speech recognition system that enables users to use speech as ... 2007. Large language models in machine translation. In Proceedings of the 2007 Joint Conference on Empirical Meth- ods in Natural Language Processing and Com- putational Natural Language Learning ... Kneser- Ney smoothed n-gram models. IEEE Transac- tions on Audio, Speech and Language Processing, 15(5):1617–1624. A. Stolcke. 1998. Entropy-based pruning of backoff language models. In Proc. DARPA...
  • 9
  • 301
  • 0
Tài liệu Báo cáo khoa học: "Incremental Syntactic Language Models for Phrase-based Translation" pptx

Tài liệu Báo cáo khoa học: "Incremental Syntactic Language Models for Phrase-based Translation" pptx

Ngày tải lên : 20/02/2014, 04:20
... trans- lation has effectively used n-gram word sequence models as language models. Modern phrase-based translation using large scale n-gram language models generally performs well in terms of lexical ... to incorporate large- scale n-gram language models in conjunction with incremental syntactic language models. The added decoding time cost of our syntactic language model is very high. By increasing ... use supertag n-gram LMs. Syntactic language models have also been explored with tree-based translation models. Charniak et al. (2003) use syntactic lan- guage models to rescore the output of a...
  • 12
  • 510
  • 0
Tài liệu Báo cáo khoa học: "The impact of language models and loss functions on repair disfluency detection" pptx

Tài liệu Báo cáo khoa học: "The impact of language models and loss functions on repair disfluency detection" pptx

Ngày tải lên : 20/02/2014, 04:20
... language models trained from text or speech corpora of vari- ous genres and sizes. The largest available language models are based on written text: we investigate the effect of written text language models ... dif- ferences among the different language models when extended features are present are relatively small. We assume that much of the information expressed in the language models overlaps with the lexical ... repairs, at least when used with simple language models like a bigram language model. In this paper we first identify the 25 most likely analyses of each sentence using the TAG channel model together...
  • 9
  • 609
  • 0
Tài liệu Báo cáo khoa học: "An Empirical Investigation of Discounting in Cross-Domain Language Models" ppt

Tài liệu Báo cáo khoa học: "An Empirical Investigation of Discounting in Cross-Domain Language Models" ppt

Ngày tải lên : 20/02/2014, 04:20
... likelihood of the test corpus under the train corpus language model (using basic Kneser-Ney) and the likelihood of the test corpus under a jackknife language model from the test itself, which holds ... of English Bigrams. Computer Speech & Language, 5(1):19–54. Joshua Goodman. 2001. A Bit of Progress in Language Modeling. Computer Speech & Language, 15(4):403– 434. Bo-June (Paul) Hsu ... extent possible. 3 A Growing Discount Language Model We now implement and evaluate a language model that incorporates growing discounts. 3.1 Methods Instead of using a fixed discount for most n-gram counts,...
  • 6
  • 444
  • 0
Tài liệu Báo cáo khoa học: "Improved Smoothing for N-gram Language Models Based on Ordinary Counts" doc

Tài liệu Báo cáo khoa học: "Improved Smoothing for N-gram Language Models Based on Ordinary Counts" doc

Ngày tải lên : 20/02/2014, 09:20
... Kneser-Ney and those methods. 1 Introduction Statistical language models are potentially useful for any language technology task that produces natural -language text as a final (or intermediate) output. ... perplexity of any known method for estimating N-gram language models. Kneser-Ney smoothing, however, requires nonstandard N-gram counts for the lower- order models used to smooth the highest- order model. ... using a sequence of lower- order to higher-order language models has been shown to be an efficient way of constraining high- dimensional search spaces for speech recognition (Murveit et al., 1993)...
  • 4
  • 365
  • 0
Tài liệu Báo cáo khoa học: "Generating statistical language models from interpretation grammars in dialogue systems" potx

Tài liệu Báo cáo khoa học: "Generating statistical language models from interpretation grammars in dialogue systems" potx

Ngày tải lên : 22/02/2014, 02:20
... Statistical Language Modeling Using the CMU-Cambridge Toolkit. In Proceedings of Eurospeech. Fosler-Lussier E. and Kuo H K. J. 2001. Using Se- mantic Class Information for Rapid Development of Language Models ... statistical language models (DM-SLMs) by using GF to generate all utterances that are specific to certain dialogue moves from our in- terpretation grammar. In this way we can pro- duce models that ... comparison of in- grammar recognition performance. 3 Language modelling To generate the different trigram language models we used the SRI language modelling toolkit (Stol- cke, 2002) with Good-Turing...
  • 8
  • 381
  • 0
Tài liệu ENGLISH SECOND LANGUAGE LEARNERS: USING MUSIC TO ENHANCE THE LISTENING ABILITIES OF GRADE ONES ppt

Tài liệu ENGLISH SECOND LANGUAGE LEARNERS: USING MUSIC TO ENHANCE THE LISTENING ABILITIES OF GRADE ONES ppt

Ngày tải lên : 24/02/2014, 18:20
... the indigenous languages of South Africa. She also finds, that very few are proficient in their home language, thus causing barriers in acquiring a second language. Second language acquisition ... Yule (1999: 12), acquisition of language means the gradual development of ability in a language by using it naturally in communicative situations. Learning a second language therefore, means the ... of a second language and has a positive effect on the language skills. Adequate hearing is the first step in listening. Language is learnt by ear and the vocabulary and skills in language structure...
  • 242
  • 648
  • 1
Gene Selection for Cancer Classification using Support Vector Machines pot

Gene Selection for Cancer Classification using Support Vector Machines pot

Ngày tải lên : 06/03/2014, 00:22
... about a single feature. III. Feature ranking with Support Vector Machines III.1. Support Vector Machines (SVM) To test the idea of using the weights of a classifier to produce a feature ranking, we ... the test set using the top 40 genes. They were able to classify 32 of 34 cases correctly using 5 genes. In (Chapelle, 2000), the authors achieve 1 error on the test set with 5 genes using the same ... overfitting of the data to some extent without requiring space dimensionality reduction. Such is the case, for instance, of Support Vector Machines (SVMs) ((Boser, 1992), (Vapnik, 1998), 29 Figure...
  • 39
  • 430
  • 0
Báo cáo khoa học: "The use of formal language models in the typology of the morphology of Amerindian languages" potx

Báo cáo khoa học: "The use of formal language models in the typology of the morphology of Amerindian languages" potx

Ngày tải lên : 07/03/2014, 22:20
... morphophonology was implemented using PC-KIMMO (Antworth, 1990) 3 The Toba morphology The Toba language belongs, with the languages pilaga, mocovi and kaduveo, to the guaycuru language family (Messineo, ... 2010. c 2010 Association for Computational Linguistics The use of formal language models in the typology of the morphology of Amerindian languages Andr ´ es Osvaldo Porta Universidad de Buenos Aires hugporta@yahoo.com.ar Abstract The ... the construction is straight- forward using two level morphology and then, describes in a very natural way the Argentinean Quechua morphology using a regular language. On the contrary, the Tobaverbs...
  • 6
  • 439
  • 0
Báo cáo khoa học: "Faster and Smaller N -Gram Language Models" pptx

Báo cáo khoa học: "Faster and Smaller N -Gram Language Models" pptx

Ngày tải lên : 07/03/2014, 22:20
... novel language model caching technique that improves the query speed of our language models (and SRILM) by up to 300%. 1 Introduction For modern statistical machine translation systems, language models ... with two different language models. Our first language model, WMT2010, was a 5- gram Kneser-Ney language model which stores probability/back-off pairs as values. We trained this language model on ... 2010. Storing the web in memory: space efficient language models with con- stant time retrieval. In Proceedings of the Conference on Empirical Methods in Natural Language Process- ing. Boulos Harb,...
  • 10
  • 463
  • 0
Báo cáo khoa học: "Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers" ppt

Báo cáo khoa học: "Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers" ppt

Ngày tải lên : 07/03/2014, 22:20
... or even trillions of English words, huge language models are built in a distributed man- ner (Zhang et al., 2006; Brants et al., 2007). Such language models yield better translation results but at ... explore a dependency language model to improve translation quality. To some ex- tent, these syntactically-informed language models are consistent with syntax-based translation models in capturing ... integrate backward n-grams and mu- tual information (MI) triggers into language models in SMT. In conventional n-gram language models, we look at the preceding n − 1 words when calculating the probability...
  • 10
  • 415
  • 0
Báo cáo khoa học: "Randomized Language Models via Perfect Hash Functions" pptx

Báo cáo khoa học: "Randomized Language Models via Perfect Hash Functions" pptx

Ngày tải lên : 08/03/2014, 01:20
... 2007. Compressing trigram language models with golomb coding. In Proceedings of EMNLP-CoNLL 2007, Prague, Czech Republic, June. P. Clarkson and R. Rosenfeld. 1997 . Statistical language modeling using the CMU-Cambridge ... lossless language model representation can be achieved when using 12 ‘error’ bits, resulting in approx. 3 bytes per n-gram (this includes one byte to store parameter values). 512 2 Scaling Language Models In ... multiple language model servers. We encode the model stored on each lan- guagage model server using the randomized scheme. The proposed randomized LM can encode param- eters estimated using any...
  • 9
  • 273
  • 0
Báo cáo khoa học: "Generalized Algorithms for Constructing Statistical Language Models" pdf

Báo cáo khoa học: "Generalized Algorithms for Constructing Statistical Language Models" pdf

Ngày tải lên : 08/03/2014, 04:22
... . Class-based models. In many applications, it is nat- ural and convenient to construct class-based language models, that is models based on classes of words (Brown et al., 1992). Such models are ... classical definitions of - gram language models and several smoothing techniques commonly used. We then describe a natural representa- tion of -gram language models using failure transitions. This ... re- lated to the construction of language models. We present new and efficient algorithms to address these more gen- eral problems. Counting. Classical language models are constructed by deriving...
  • 8
  • 389
  • 0

Xem thêm