Scientific report: "Word Clustering and Word Selection based Feature Reduction for MaxEnt based Hindi NER" ppt
... effectiveness of word clustering and selection as feature reduction techniques for MaxEnt based NER. For clustering we use a number of word similarities like cosine similarity among words and co-occurrence, ... Ohio, USA, June 2008. © 2008 Association for Computational Linguistics Word Clustering and Word Selection based Feature Reduction for...
Uploaded: 08/03/2014, 01:20
... (Lakoff and Johnson, 1980; Fass, 1997; Stern, 1931)). In a place-for-people pattern, a place stands for any persons/organisations associated with it, e.g., for sports teams in (2), (3), and (4), and ... Syntactic Features and Word Similarity for Supervised Metonymy Resolution Malvina Nissim ICCS, School of Informatics University of Edinburgh mnissim@inf.ed.ac.uk Katja Mark...
Uploaded: 08/03/2014, 04:22
... dGTP and dCTP, 1.2 mM MgCl2, 2 pmol·mL⁻¹ primers, 1 ng·mL⁻¹ template DNA, and 0.05 units·mL⁻¹ TaKaRa Taq DNA polymerase. A successive reaction at 95 °C for 30 s, 45 °C for 60 s and 72 °C for 90 ... acids of the HdEG66 was amplified. Finally, 5′- and 3′-RACE PCRs were performed using primers shown in Table 2, and Hd5RACE-DNA and Hd3RACE-DNA for 5′- and 3′-terminal regions were amp...
Uploaded: 20/02/2014, 23:20
Scientific report: "COMMON TOPICS AND COHERENT SITUATIONS: INTERPRETING ELLIPSIS IN THE CONTEXT OF DISCOURSE INFERENCE" ppt
... and bi are similar; (1) p0 = ¬p1, ai and bi are similar; (2) p0 = p1, ai and bi are dissimilar for some i; p0 = p1, bi ∈ ai or bi ⊂ ai; p0 = p1, ai = bi; "and", "but", "for example", "in other words" ... antecedent and elided clauses (Levin and Prince, 1982; Kehler, 1993b). In this section we review the relevant facts for two such forms of ellipsis, namely gapping and VP-ellip...
Uploaded: 08/03/2014, 07:20
Scientific report document: "Combining Lexical Semantic Resources with Question & Answer Archives for Translation-Based Answer Finding" doc
... steps performed for all parallel datasets were tokenisation and stop word removal. 3.4 Comparison of Word-to-Word Translations. Table 1 gives some examples of word-to-word translations obtained for ... senses, and hence different glosses, it is possible to extract several gloss pairs for one and the same lexeme and one and the same pair of resources. It is there...
Uploaded: 20/02/2014, 07:20
Scientific report document: "POS Disambiguation and Unknown Word Guessing with Decision Trees" pot
... The POS of an unknown word is guessed by traversing the decision tree for unknown words, which examines contextual features along with the word ending and capitalization and returns an open-class ... Disambiguator/Guesser is a 'forest' of decision trees, one tree for each ambiguity scheme present in M. Greek and one tree for unknown word guessing. When a w...
Uploaded: 22/02/2014, 03:20
Scientific report: "Automatic Identification of Word Translations from Unrelated English and German Corpora" pot
... that word A is two words ahead of another word B, a second vector for the case that word A is one word ahead of word B, a third vector for A directly following B, and a fourth vector for A ... coefficient, and the Dice coefficient (see also Jones & Furnas, 1987). For the computation of related terms and synonyms, Ruge (1995), Landauer and Dumais (1997), a...
Uploaded: 08/03/2014, 06:20
Scientific report document: "Smoothing a Tera-word Language Model" doc
... achieves its best performance with a 4-gram model and gives 8.23 bits on Brown. 4.4 Dirichlet Form. MacKay and Peto (1995) show that based on Dirichlet priors a reasonable form for a smoothed distribution ... the first word, c is the last word, and b represents zero or more words in between. One way to estimate Pr(c|ab) is to look at the number of times word c has followed t...
Uploaded: 20/02/2014, 09:20
Scientific report document: "A Discriminative Syntactic Word Order Model for Machine Translation" pdf
... formulated as the task of ordering a target bag of words, given a source sentence and word alignments between target and source words. In this work we also assume a source dependency tree and ... target word. POSs and POSt: POS tags on the source and target sides. For Japanese, we have a set of 19 POS tags. ’+’ means making conjunction of features and prev() means using th...
Uploaded: 20/02/2014, 12:20
Scientific report document: "Deriving an Ambiguous Word’s Part-of-Speech Distribution from Unannotated Text" doc
... distributional and morphological information (Clark, 2003; Freitag, 2004), or by clustering words and projecting them to POS vectors (Rapp, 2005). Whereas all these studies are based on global ... consider, and the columns are suitable middle words as we find them in a corpus. Most words in our example are syntactically unambiguous. Only link can be either a noun or a verb...
Uploaded: 20/02/2014, 12:20