Báo cáo khoa học: "Fast Methods for Kernel-based Text Analys

Báo cáo khoa học: "Fast Methods for Kernel-based Text Analysis" pot

... inadequate for Information Retrieval, Question Answering, or Text Mining, where fast analysis of large quantities of text is indispensable. This paper presents two novel methods that make the kernel-based ... computational costs. Kernel-based text analysis shows an excellent performance in terms in accuracy; however, these methods are usually too slow to apply to large-scale...

Ngày tải lên: 08/03/2014, 04:22

8 370 0

Tài liệu Báo cáo khoa học: "Generalization Methods for In-Domain and Cross-Domain Opinion Holder Extraction" pdf

... (sub)domains, we chose some text type that is not even news text in order to have a very distant domain. There- fore, we had to use some text not included in the MPQA corpus. Existing text collections contain- ing ... from that of news texts and they contain a large number of different opinion holders (therefore opinion holder extraction is a meaning- ful task on this text type)....

Ngày tải lên: 22/02/2014, 02:20

11 428 0

Báo cáo khoa học: "Ensemble Methods for Unsupervised WSD" doc

... Association for Computational Linguistics Ensemble Methods for Unsupervised WSD Samuel Brody School of Informatics University of Edinburgh s.brody@sms.ed.ac.uk Roberto Navigli Dipartimento di Informatica Universit ` a ... Informatics University of Edinburgh mlap@inf.ed.ac.uk Abstract Combination methods are an effective way of improving system performance. This paper examines the bene...

Ngày tải lên: 08/03/2014, 02:21

8 343 0

Báo cáo khoa học: "Hybrid Methods for POS Guessing of Chinese Unknown Words" pot

... algorithm for calcu- lating the weights for context-independent linear in- terpolation when the n-gram frequencies are known. 4.3 Wu and Jiang’s (2000) Statistical Model There are several reasons for ... recall for disyllabic words is low. The results for the trigram model are listed in Ta- ble 5. Candidates are restricted to the eight POS cat- egories listed in Table 2 for this m...

Ngày tải lên: 08/03/2014, 04:22

6 349 0

Báo cáo khoa học: "Empirical Methods for Compound Splitting" ppt

... options for the German word Aktionsplan Aktionsplan Empirical Methods for Compound Splitting Philipp Koehn Information Sciences Institute Department of Computer Science University of Southern California koehn@isi.edu Kevin ... This poses challenges for a number of NLP applications such as machine translation, speech recognition, text classification, information extraction, or info...

Ngày tải lên: 08/03/2014, 21:20

8 315 0

Báo cáo khoa học: "Unsupervised Methods for Head Assignments" potx

... March – 3 April 2009. c 2009 Association for Computational Linguistics Unsupervised Methods for Head Assignments Federico Sangati, Willem Zuidema Institute for Logic, Language and Computation University ... syntactic information for transforming constituency tree- banks to dependency structures (Nivre et al., 2007) or richer syntactic representations (e.g., Hocken- maier and Stee...

Ngày tải lên: 08/03/2014, 21:20

9 262 0

Báo cáo khoa học: "COMPUTER METHODS FOR MORPHOLOGICAL ANALYSIS" doc

... word recognition mechanisms for these dictionaries. One of the more important issues in word recognition for all morphologically complex languages involves mechanisms for dealing with affixes. ... the other hand, to reject ill-formed words. On the other hand, we want to use our existing word-recognition and analysis programs as tools for gathering further information about Eng...

Ngày tải lên: 24/03/2014, 02:20

8 417 0

Báo cáo khoa học: "Machine Methods for Proving Logical Arguments Expressed in Englis" ppt

... the form NP + $1/ PRNAME; go to 5. 4.2. No: go to 5. 5. Enter SFORM, and determine quasi-logical form of parsed formula; enter LF, and determine logical translation of quasi-logical formula; ... mark end of formula; go to 2. 7. Combine formulae on Shelf 24 into a single formula of conditional form, in which the conjunction of the premisses implies the conclusion; store formula on...

Ngày tải lên: 30/03/2014, 17:20

27 469 0

Báo cáo khoa học: "Transductive learning for statistical machine translation" potx

... solve it. Our hypothesis is that adding information from source language text can also provide improvements. Unlike adding target language text, this hypothesis is a natural semi-supervised learning ... is that for many language pairs the amount of available bilingual text is very limited. In this work, we will address this problem and propose a general frame- work to solve it. Ou...

Ngày tải lên: 08/03/2014, 02:21

8 417 0

Báo cáo khoa học: "CONCEPTUAL ASSOCIATION FOR COMPOUND NOUN ANALYSIS" ppt

... bracketing for a given noun sequence, known to form a compound noun, without knowledge of the context. E.G.: (pottery (coffee mug)); ((coffee mug) holder) Corpus Statistics: The need for wide ... it modifies. So, when CA(pottery, mug) >> CA(pottery, coffee), we prefer (pottery (coffee mug)). First though, we must choose concepts for the words. For each wi (i = 2 or...

Ngày tải lên: 08/03/2014, 07:20

3 250 0