Scientific report: "SVD and Clustering for Unsupervised POS Tagging" docx
... 215–219, Uppsala, Sweden, 11-16 July 2010. © 2010 Association for Computational Linguistics SVD and Clustering for Unsupervised POS Tagging Michael Lamar* Division of Applied Mathematics ... greedy 1-to-1 map, and VI, for the full PTB45 tagset and the reduced PTB17 tagset. HMM-EM, HMM-VB and HMM-GS show the best results from Gao and Johnson (2008); HMM-Sparse(32)...
Uploaded: 07/03/2014, 22:20
... Association for Computational Linguistics:shortpapers, pages 323–328, Portland, Oregon, June 19-24, 2011. © 2011 Association for Computational Linguistics Models and Training for Unsupervised Preposition ... this paper, we present our unsupervised framework and show results for preposition disambiguation. We hope to present results for the joint disambiguation of preposit...
Uploaded: 20/02/2014, 05:20
... l, and a pruning threshold $t_c$ for $P(PW_i^l \mid CW_{i-2}^j\, CW_{i-1}^j)$, computing sizes and perplexities of each, and a similarly large number of values of l, k, and a separate threshold $t_w$ for ... techniques symmetric clustering, and the resulting clusters both clusters. In constructing the ACM, we used asymmetric clustering, in which different clusters are used for pre...
Uploaded: 23/03/2014, 20:20
Document: Scientific report: Pathways and products for the metabolism of vitamin D3 by cytochrome P450scc docx
... groups are at positions 22 and 24, as two methyl groups (C26 and C27) and a methine group (C25) are in this spin network. Therefore, this hydroxylation must be at C23. The TOCSY spectrum for the ... correlations for 3-CH and 23-CH; (B) expansion of proton–proton TOCSY correlations for 3-CH and 23-CH; (C) expansion of proton–carbon HSQC showing groups having correlation t...
Uploaded: 18/02/2014, 17:20
Document: Scientific report: "A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining" pptx
... unsupervised system, and unlike the supervised and semi-supervised systems we mentioned, our model can be used for both unsupervised and semi-supervised mining in a consistent way. 3 Unsupervised Transliteration ... labelled information for training. Our system extracts transliteration pairs in an unsupervised fashion. It is also able to utilize labelled information if a...
Uploaded: 19/02/2014, 19:20
Scientific report: "Pre- and Postprocessing for Statistical Machine Translation into Germanic Languages" docx
... reordering, and error correction. Initial results are positive within all four areas, and there are promising possibilities for extending these approaches. In addition I also focus on methods for performing ... previous research, and Swedish and other Scandinavian languages, where there has been little previous research. I believe that both language-pair dependent and in...
Uploaded: 23/03/2014, 16:20
Scientific report: "Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation" docx
... related clustering algorithms (Whittaker and Woodland, 2001). In (Emami and Jelinek, 2005) a clustering algorithm is introduced which outputs a separate clustering for each word position ... different types of mixed word and class models have been proposed for the purpose of improving the performance of the model (Goodman, 2000), reducing its size (Goodman and Gao, 2000)...
Uploaded: 31/03/2014, 00:20
Scientific report: "Clique-Based Clustering for improving Named Entity Recognition systems" pot
... dependencies for measuring similarities between named entities. The specificity of the presented method however, is to combine a clique-based approach and a clustering technique that amounts to a soft clustering method. ... for a NE. On the other hand, clustering methods aim at structuring a data set and such techniques can be seen as data compression processes. However, a si...
Uploaded: 31/03/2014, 20:20
Document: Scientific report: "Fast and Robust Part-of-Speech Tagging Using Dynamic Model Selection" pptx
... Second, a POS tagger should be tested for its speed. POS tagging is often performed as a pre-processing step to other tasks (e.g., parsing, chunking) and it should not be a bottleneck for those ... inspired by Giménez and Màrquez (2004) although ambiguity classes are derived selectively for our case. Given a word-form, we count how often each POS tag is used with the f...
Uploaded: 19/02/2014, 19:20
Document: Scientific report: "A CONNECTIONIST PARSER FOR STRUCTURE UNIFICATION GRAMMAR" docx
... competence and performance phenomena. The remaining constraints imposed at the level of the parser are traditionally treated as performance constraints. For example, the parser's bounded ... architecture proposed by Shastri and Ajjanagadde (1990) as a special purpose module for syntactic constituent structure parsing. An SUG description is stored in the module's mem-...
Uploaded: 20/02/2014, 21:20