Tài liệu Báo cáo khoa học: "Automatic learning of textual en

Tài liệu Báo cáo khoa học: "Automatic learning of textual entailments with cross-pair similarities" ppt

... Linguistics Automatic learning of textual entailments with cross-pair similarities Fabio Massimo Zanzotto DISCo University of Milano-Bicocca Milan, Italy zanzotto@disco.unimib.it Alessandro Moschitti Department of ... trivial set of entailment cases. The experiments with the data sets of the RTE 2005 challenge show an improvement of 4.4% over the state -of- the-art met...

Ngày tải lên: 20/02/2014, 12:20

8 413 0

Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx

... application of the method is automatic or semi-automatic compilation of a glossary or technical-term dictionary for a certain domain. Re- cursive application of the method enables to collect a list of ... number of pages that sat- isfy a given query. In case the query is a term, its hit is the number of pages that contain the term on the Web. We use the following notation. H(x)=...

Ngày tải lên: 20/02/2014, 16:20

4 437 0

Tài liệu Báo cáo khoa học: "Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes" pdf

... Statistics of the BioScope corpus. The 2nd and 3d columns show the total number of cues within the datasets; the 4th and 5th columns show the percentage of negated and spec- ulative sentences. 70% of ... (possible) medical conditions. The importance of the task of negation and speculation (a.k.a. hedge) detection is attested by a number of research initiatives. The creation...

Ngày tải lên: 20/02/2014, 04:20

5 544 1

Tài liệu Báo cáo khoa học: "Automatic Satire Detection: Are You Having a Laugh?" ppt

... knowl- edge that humans make use of. The primary contributions of this research are as follows: (1) we introduce a novel task to the arena of computational linguistics and machine learning, and make available ... the ﬁrst line of an article. In this way the heading tokens are represented twice: once in the overall set of uni- grams in the article, and once in the set of heading...

Ngày tải lên: 20/02/2014, 09:20

4 408 0

Tài liệu Báo cáo khoa học: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" docx

... the polarity of words There are some works that discuss learning the polarity of words instead of sentences. Hatzivassiloglou and McKeown proposed a method of learning the polarity of adjectives ... there are two sentences in each of the 454 (1) kono software-no riten-ha hayaku ugoku koto this software-POST advantage-POS T quickly run to The advantage of this software is t...

Ngày tải lên: 20/02/2014, 12:20

8 409 0

Tài liệu Báo cáo khoa học: "Automatic Identification of Pro and Con Reasons in Online Reviews" ppt

... specific and tangible features. Also, there are somewhat a fixed set of features of a specific type of product, for exam- ple, ease of use, durability, battery life, photo quality, and shutter lag ... system without involving a human judge, we annotated a small set of data manually for evaluation purposes. Gold Standard Annotation: Four humans annotated 3 sets of test sets:...

Ngày tải lên: 20/02/2014, 12:20

8 461 1

Tài liệu Báo cáo khoa học: "Automatic Evaluation of Sentence-Level Fluency Andrew Mutton∗" pdf

... Methods PoStag In the ﬁrst of these, we constructed a rough approximation of typical sentence grammar structure by taking bigrams over part -of- speech tags. 6 Then, given a string of PoS tags of length n, t 1 . ... might be an indicator of some degree of ungrammaticality and possibly disﬂuency. In that work, however, correlation with human judgements was left uninvestigated. Th...

Ngày tải lên: 20/02/2014, 12:20

8 508 0

Tài liệu Báo cáo khoa học: "Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics" doc

... construction of N-best translation lexicons from parallel text. Melamed (1995) used the ratio (LCSR) between the length of the LCS of two words and the length of the longer word of the two ... skip-bigram matches, SKIP2(X,Y), within the maximum skip distance and replace de- nominators of Equations 13, C(m,2), and 14, C(n,2), with the actual numbers of within distance sk...

Ngày tải lên: 20/02/2014, 16:20

8 443 0

Tài liệu Báo cáo khoa học: "Automatic clustering of collocation for detecting practical sense boundary" ppt

... numbered i of the word x. I x is the word sense indexing function of x that gives an index to each sense of the word x. All contextual words x i ±j of a central word x have their own contextual ... collocation of the central word x, window size w and corpus c is expressed with function f: V N C Æ 2P C/V . In this formula, V means a set of vocabulary, N is the size of th...

Ngày tải lên: 20/02/2014, 16:20

4 425 0

Tài liệu Báo cáo khoa học: "AUTOMATIC ACQUISITION OF SUBCATEGORIZATION FRAMES FROM UNTAGGED TEXT" doc

... The completeness of the output list increases monotonically with the total number of occurrences of each verb in the corpus. False positive rates are one to three percent of observa- tions. ... currences of the verb and occurrences of infinitives following within a certain number of words. Unlike our system, Church's approach does not aim to de- cide whether or not...

Ngày tải lên: 20/02/2014, 21:20

6 416 0