Báo cáo khoa học: "Improved Discriminative Bilingual Word Alignment" pdf

Tài liệu Báo cáo khoa học: "A Discriminative Syntactic Word Order Model for Machine Translation" pdf

Tài liệu Báo cáo khoa học: "A Discriminative Syntactic Word Order Model for Machine Translation" pdf

... mapping, every word on the target side is asso- ciated with some word on the source side. 10 subtree in the target rooted at the rightmost of these words and attaches the other word( s) to it. ... among possible word orders. We show that com- bining discriminative training with features to detect these two different kinds of move- ment phenomena leads to substantial im- provements in...
Ngày tải lên : 20/02/2014, 12:20
  • 8
  • 404
  • 0
Tài liệu Báo cáo khoa học: "Improved Unsupervised POS Induction through Prototype Discovery" ppt

Tài liệu Báo cáo khoa học: "Improved Unsupervised POS Induction through Prototype Discovery" ppt

... to words in- cluding more than one stem (like weatherman), to words that have a null affix (i.e., where the word is identical to its stem) and to words whose stem is not shared by any other word ... as unknown words. We consider two schemes for handling unknown words. One randomly maps each such word to a cluster, using a probabil- ity proportional to the number of unique known words alr...
Ngày tải lên : 20/02/2014, 04:20
  • 10
  • 330
  • 0
Tài liệu Báo cáo khoa học: "Smoothing a Tera-word Language Model" doc

Tài liệu Báo cáo khoa học: "Smoothing a Tera-word Language Model" doc

... probability of each word using the context made up of the previ- ous n−1 words. Let abc represent an n-gram where a is the first word, c is the last word, and b repre- sents zero or more words in between. ... The Web 1T dataset has a 13 million word vocabulary consisting of words that appear 100 times or more in its corpus. 769 sentences in Brown that contained words outside this vocabul...
Ngày tải lên : 20/02/2014, 09:20
  • 4
  • 425
  • 1
Tài liệu Báo cáo khoa học: "Improved Smoothing for N-gram Language Models Based on Ordinary Counts" doc

Tài liệu Báo cáo khoa học: "Improved Smoothing for N-gram Language Models Based on Ordinary Counts" doc

... language, simple N-gram models, which estimate the probability of each word in a text string based on the N −1 preceding words, remain the most widely used type of model. The simplest possible ... a word w n , given the preceding context w 1 . w n−1 , to be the ratio of the num- ber of occurrences in a training corpus of the N- gram w 1 . w n to the total number of occurrences of any wor...
Ngày tải lên : 20/02/2014, 09:20
  • 4
  • 365
  • 0
Tài liệu Báo cáo khoa học: "Deriving an Ambiguous Word’s Part-of-Speech Distribution from Unannotated Text" doc

Tài liệu Báo cáo khoa học: "Deriving an Ambiguous Word’s Part-of-Speech Distribution from Unannotated Text" doc

... occa- sionally allow for words from other clusters. 4 Results As our test vocabulary we chose a sample of 50 words taken from a previous study (Rapp, 2005). The list of words is included in Table ... automatically in- duce a system of word classes that is in agreement with human intuition, and then to assign all possi- ble parts of speech to a given ambiguous or unam- biguous word. Tw...
Ngày tải lên : 20/02/2014, 12:20
  • 4
  • 389
  • 0
Tài liệu Báo cáo khoa học: Improved ecdysone receptor-based inducible gene regulation system doc

Tài liệu Báo cáo khoa học: Improved ecdysone receptor-based inducible gene regulation system doc

... ligand resulted in 50% and 80% reduction in reporter gene activity by 12 h and 24 h, respectively. Keywords: gene switch; ponasterone A; receptors; EcR; RXR. Twenty hydroxyecdysone (20E) is a steroid
Ngày tải lên : 21/02/2014, 00:20
  • 8
  • 331
  • 0
Tài liệu Báo cáo khoa học: "Discovering Corpus-Specific Word Senses" pot

Tài liệu Báo cáo khoa học: "Discovering Corpus-Specific Word Senses" pot

... h,)  Cik 4111)  11‘ 41 4Wit ler,1110.1/. 1 7, cgtoserek■Ilt Figure 1: Local graph of the word mouse Figure 2: Local graph of the word wing 3 Markov Clustering Ambiguous words link otherwise unrelated areas of meaning E.g. rat ... them. Sense clusters are iteratively computed by clustering the local graph of similar words around an ambiguous word. Dis- crimination against previously...
Ngày tải lên : 22/02/2014, 02:20
  • 4
  • 329
  • 0
Tài liệu Báo cáo khoa học: Sensor of phospholipids inStreptomycesphospholipase D pdf

Tài liệu Báo cáo khoa học: Sensor of phospholipids inStreptomycesphospholipase D pdf

... mentioned above, previous experimental studies have focused on the relationship between HKD motifs Keywords phospholipase D; phospholipid; substrate recognition; SPR; Streptomyces Correspondence T.
Ngày tải lên : 19/02/2014, 02:20
  • 10
  • 425
  • 1
Tài liệu Báo cáo khoa học: "Lemmatisation as a Tagging Task" pdf

Tài liệu Báo cáo khoa học: "Lemmatisation as a Tagging Task" pdf

... regularity of word forms that is shared by all languages: infrequent words tend to be formed according to a regular pattern, while ir- regular word forms tend to occur in frequent words. The described ... set that covers most of the word occur- rences in a text: a specialized label is learnt for fre- quent irregular words, while a generic label is learnt to handle words that follow a re...
Ngày tải lên : 19/02/2014, 19:20
  • 5
  • 456
  • 0
Tài liệu Báo cáo khoa học: "Learning to Follow Navigational Directions" pdf

Tài liệu Báo cáo khoa học: "Learning to Follow Navigational Directions" pdf

... features organized by spatial word. The top row shows the weights of allocentric (landmark-centered) features. For example, the top left figure shows that when the word above occurs, our policy ... accounts of learning typically rely on lin- guistic annotation (Zettlemoyer and Collins, 2009) or word distributions (Curran, 2003). In con- trast, we present an apprenticeship learning sys- te...
Ngày tải lên : 20/02/2014, 04:20
  • 9
  • 526
  • 0