Scientific report: "Discriminative Pruning of Language Models for Chinese Word Segmentation" ppt

... discriminative pruning method of n-gram language model for Chinese word segmentation. To reduce the size of the language model that is used in a Chinese word segmentation system, importance of each ... To the best of our knowledge, it has not been applied to language model pruning. In this paper, we propose a discriminative pruning method of n-gram langu...
Upload date: 17/03/2014, 04:20
  • 8
  • 294
  • 0
Document - Scientific report: "Web augmentation of language models for continuous speech recognition of SMS text messages" docx

... contain the acoustic models, language model and lexicon, but the LM makes up for most of the size. The availability of data varies for the different languages, and therefore the FST sizes are ... statistical language model (LM). The LM learns the probabilities of word sequences from text corpora available for training. The performance of the model depends on the amount...
Upload date: 22/02/2014, 02:20
  • 9
  • 301
  • 0
Document - Scientific report: "The impact of language models and loss functions on repair disfluency detection" pptx

... of a variety of language models trained from text or speech corpora of various genres and sizes. The largest available language models are based on written text: we investigate the effect of ... that using written language data for modelling spoken language can improve performance. We turn to three other bodies of text and investigate the use of these corpora...
Upload date: 20/02/2014, 04:20
  • 9
  • 609
  • 0
Scientific report: "Discriminative Modeling of Extraction Sets for Machine Translation" pptx

... Cherry and Dekang Lin. 2006. Soft syntactic constraints for word alignment through discriminative training. In Proceedings of the Annual Conference of the Association for Computational Linguistics. Colin ... Conference of the Association for Computational Linguistics. John DeNero and Dan Klein. 2008. The complexity of phrase alignment problems. In Proceedings of the...
Upload date: 07/03/2014, 22:20
  • 11
  • 420
  • 0
Scientific report: "Automatic Generation of Domain Models for Call Centers from Noisy Transcriptions" pdf

... questions, etc. The {topic→information} index requires identification of the topic for each call to make use of information available in the model. Below we show examples of the use of the model for topic identification. 5.1 ... Component to perform noise removal. We performed a sequence of cleansing operations to remove stopwords such as the, of, seven, dot, january, hello....
Upload date: 31/03/2014, 01:20
  • 8
  • 397
  • 0
Scientific report: "A Comparison of Event Models for Naive Bayes Anti-Spam E-Mail Filtering" potx

... the document. From a linguistic point of view, a document is made up of words, and the semantics of the document is determined by the meaning of the words and the linguistic structure of the document. The Naive ... which words are used in a document, but not the number of times each word is used, nor the order of the words in the document. In the second model, a document is g...
Upload date: 31/03/2014, 20:20
  • 8
  • 514
  • 0
Scientific report: "Self-Organizing n-gram Model for Automatic Word Spacing" ppt

... incorrect word spacing is a critical task in Korean information processing. One of the most simple and strong models for automatic word spacing is n-gram model. In spite of the advantages of the n-gram ... mistakes of word spacing even in news articles. The problem of the inaccurate word spacing is that they are fatal in language processing and information retrieva...
Upload date: 23/03/2014, 18:20
  • 8
  • 278
  • 0
Document - Scientific report: "Discriminative Pruning for Discriminative ITG Alignment" pdf

... the list of alignment hypotheses of minimal number of span pairs. The first type of pruning is equivalent to minimizing the number of hypernodes in a hypergraph. The task of ITG pruning ... if word alignment is the sole purpose of applying ITG. For instance, there are two parses for three consecutive word pairs, viz. [/ [/ / ] ] and [[/ /] /...
Upload date: 20/02/2014, 04:20
  • 9
  • 429
  • 0
Scientific report: "Intelligent Selection of Language Model Training Data" ppt

... the entire Gigaword corpus, we trained the Gigaword language model for data selection on a random sample of the Gigaword corpus of a similar size to that of the Europarl training data: 1,874,051 sentences, ... perplexity for each of these modified language models is compared to that of the original version of the model in Table 2. It can be seen that adjusting the voc...
Upload date: 07/03/2014, 22:20
  • 5
  • 348
  • 0
Scientific report: "Automatic Acquisition of Language Model based on Head-Dependent Relation between Words" pdf

... the probability of a sentence as the product of the probability of each word in the sentence. It assumes that probability of the nth word is dependent on the previous n-1 words. The n-gram ... • Every inner word has a head in the word sequence. • Neither crossing nor cycle of dependency relations is allowed. We use w_i for ith word in a sentence and w_{i,j} for...
Upload date: 08/03/2014, 05:21
  • 5
  • 334
  • 0
