Báo cáo khoa học: "Automatic Acquisition of Ranked Qualia Structures from the Web" potx
... as the number of web pages containing the words the and ’and’. 891 matched. On the basis of these, we then calculate the probability of a certain qualia element given a certain role on the ... inspecting the qualia structures. In contrast to our previous work, the fo- cus of this paper lies in analyzing different measures for ranking the qualia elements in the...
Ngày tải lên: 08/03/2014, 02:21
... '~vhat you then do is you make them think (These examples are actual text from the Penn corpus.) The extraordinary accuracy of verb detection within a tiny fraction of the rate achieved ... (1980). The completeness of the output list increases monotonically with the total number of occurrences of each verb in the corpus. False positive rates are one to th...
Ngày tải lên: 20/02/2014, 21:20
... match the frame adj-obj-for-to-inf VP is the NP of the PP. The only part of the feature structure which is not represented by the GRs is coin- dexation between the omitted direct object 1 of the VP-complement ... please then we know that he is the recipient of pleasure in the first instance and desirous of providing it in the second, but a computational system c...
Ngày tải lên: 08/03/2014, 04:22
Báo cáo khoa học: "Automatic Acquisition of English Topic Signatures Based on a Second Language" potx
... vector is the sum of the vectors of concepts that occur in a context win- dow. If many of the concepts in a window have a strong component for one of the topics, then the sum of the vectors, the context ... w are vectors and N is the dimen- sion of the vector space. The more overlap there is between the neighbours of the two words whose vectors are compa...
Ngày tải lên: 08/03/2014, 04:22
Báo cáo khoa học: "Automatic Acquisition of Named Entity Tagged Corpus from World Wide Web" pot
... to the size of the manual corpus. When we trained with that size of the automatic corpus, the performance was very low compared to the performance of the manual cor- pus. The reason is that the ... the seeds and comparable to that with the manual corpus.Moreover, the domain of the manual training corpus is same with that of the test corpus, i.e., news and n...
Ngày tải lên: 08/03/2014, 04:22
Báo cáo khoa học: "Automatic Acquisition of Language Model based on Head-Dependent Relation between Words" pdf
... gram-based and the other is grammar-based. N-gram model estimates the probability of a sentence as the product of the probability of each word in the sentence. It assumes that probability of the nth ... syntactic structures to a sentence and computes the probability of the sentence using the probabilities of the struc- tures. Long distance dependencie...
Ngày tải lên: 08/03/2014, 05:21
Báo cáo khoa học: "AUTOMATIC ACQUISITION OF A LARGE SUBCATEGORIZATION DICTIONARY FROM CORPORA" doc
... list of elements occurring after the verb, and this list together with the record of whether the verb is passive yields the overall con- text in which the verb appears. The parser skips to the ... pieces of text from other parts of the New York Times newswire, a portion of which is shown in Fig. 1, out of 200 verbs, the acquired subcatego- rization dictio...
Ngày tải lên: 23/03/2014, 20:20
Báo cáo khoa học: "AUTOMATIC ACQUISITION OF THE LEXICAL SEMANTICS OF VERBS FROM SENTENCE FRAMES*" doc
... describe syntactic properties of the verb (e.g."Takes an Object"), others de- scribe aspects of the theta-structure (the predi- cate/argument structure) of the verb (e.g."Takes 178 ... This is the class of DIE, one of the toplevel verb classes. Next, suppose it sees (7) John broke the window. and sees from observation that the referent of &qu...
Ngày tải lên: 31/03/2014, 18:20
Báo cáo khoa học: "Automatic Acquisition of Script Knowledge from a Text Collection" docx
... only the first paragraph from each report, and arranged the paragraphs in clusters based on the date of issue of the report. We used only the first paragraphs of the news reports because they tend ... (pairs) of actions that occur in time order. We then chose among these actions the ones that are typical by ranking them in terms of the fre- quency of their occurr...
Ngày tải lên: 31/03/2014, 20:20
Tài liệu Báo cáo khoa học: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" docx
... Learning the polarity of words There are some works that discuss learning the po- larity of words instead of sentences. Hatzivassiloglou and McKeown proposed a method of learning the polarity of adjectives ... that the judges agree with each other in 467 out of 500 sentences (93.4%). The Kappa value was 0.901. From this result, we can say that the goldstandard was re...
Ngày tải lên: 20/02/2014, 12:20