Báo cáo khoa học: "Synchronous Models of Language" pptx
... application of synchronous productions to linked nonterminals. Each vector v of G is constructed from a pair of synchronous vectors (v', v") of Gs as follows. First, all instances of nonterminals ... application of v' and v" to a set of (occurrences of) nonterminals in a particular link configuration in a sentential form of Gs. • We now introduce a re...
Ngày tải lên: 31/03/2014, 06:20
... of a variety of language models trained from text or speech corpora of vari- ous genres and sizes. The largest available language models are based on written text: we investigate the effect of ... set of reranker features, which consisted of features for all of the language models plus the extended (i.e., indicator) features, and used this model to analyse the test data. T...
Ngày tải lên: 20/02/2014, 04:20
... system consists of acoustic models of speech sounds and of a statistical language model (LM). The LM learns the probabilities of word sequences from text corpora available for training. The perfor- mance of ... afford from the top of the list. However, the relevance of a query is dependent on the sequence of past queries (because of the decay factor). Find- ing the optimal...
Ngày tải lên: 22/02/2014, 02:20
Báo cáo khoa học: "Discriminative Pruning of Language Models for Chinese Word Segmentation" ppt
... F-Measure of 96.33%, number of bigrams decreases by up to 90%. # of bigrams % of KLD KLD 100,000 100% Step-10K 25,000 25% Step-5K 15,000 15% Step-2K 10,000 10% Table 1. Comparison of Number of ... and Figure 6. Per- plexities of KLD models are much lower than that of the other models, but their F-Measures are much worse than that of step-by-step growing 1006...
Ngày tải lên: 17/03/2014, 04:20
Tài liệu Báo cáo khoa học: Animal models of amyloid-b-related pathologies in Alzheimer’s disease docx
... 1391 Transgenic animal models Models devoid of any disease-causing APP mutations Animal models expressing wild-type (wt) human APP are of interest because the great majority of sporadic AD patients ... dominant mode of inheritance, account for < 2% of all AD cases. Onset is most often before 65 years of age, and the penetrance is nearly always complete. The purifica- tion an...
Ngày tải lên: 16/02/2014, 09:20
Báo cáo khoa học: "Intelligent Selection of Language Model Training Data" ppt
... added to the probability of <UNK>. A count cutoff of 2 occurrences was applied to the trigrams and 4-grams in estimating these models. We computed the cross-entropy of each sen- tence in the ... a random sample of the Gigaword corpus of a similar size to that of the Europarl training data: 1,874,051 sen- tences, 48,459,945 tokens. To further increase the comparability of...
Ngày tải lên: 07/03/2014, 22:20
Báo cáo khoa học: "Automatic Acquisition of Language Model based on Head-Dependent Relation between Words" pdf
... grammar-based. N-gram model estimates the probability of a sentence as the product of the probability of each word in the sentence. It assumes that probability of the nth word is dependent on the previous ... probability of the sentence using the probabilities of the struc- tures. Long distance dependencies can be rep- resented well by means of the structures. The appro...
Ngày tải lên: 08/03/2014, 05:21
Báo cáo khoa học: "Co-evolution of Language and of the Language Acquisition Device" docx
... relative fitness of a LAgt is a func- tion of the proportion of its linguistic interactions which have been successful, the expressivity of the language(s) spoken, and, optionally, of the mean ... form part of the encoding (A/D/?). Figure 3 shows several equiva- lent and equally correct sequential encodings of the fragment of the English type system outlined above. A se...
Ngày tải lên: 31/03/2014, 21:20
Báo cáo khoa học: "STOCHASTIC MODELING OF LANGUAGE VIA SENTENCE SPACE PARTITIONING" potx
... on such a model of the Italian language, which is a part of the prototype for the recognition of spoken Italian built at the IBM Rome Scintific Center. STOCHASTIC MODELS OF LANGUAGE In some ... ~'t) i=1 In other terms, the probability of a sequence of words is the product of the conditional probability of each word, given all of the previous ones. As a formal...
Ngày tải lên: 01/04/2014, 00:20
Báo cáo khoa học: "The use of formal language models in the typology of the morphology of Amerindian languages" potx
... typology of the morphol- ogy of the native South American lan- guages from the point of view of the for- mal language theory. With this object, we give two contrasting examples of de- scriptions of ... althought capable of modeling the morphology of the toba, would not work effectively. The effectiveness of a grammar is a measure of their productivity (Heintz, 1991). Takin...
Ngày tải lên: 07/03/2014, 22:20