Báo cáo khoa học: "Weakly-Supervised Acquisition of Open-Domain Classes and Class Attributes from Web Documents and Query Logs" pot
... contents of query logs during the extraction of labeled classes of instances from Web documents, we acquire thousands (4,583, to be exact) of open-domain classes covering a wide range of topics and ... period, symptoms, ] Query logs Web documents (1) (2) Figure 1: Overview of weakly-supervised extraction of class instances, class labels and class at...
Ngày tải lên: 08/03/2014, 01:20
... nique based on the Case Filter of Rouvret and Vergnaud (1980). The completeness of the output list increases monotonically with the total number of occurrences of each verb in the corpus. False ... immediately to the left of a tensed verb, immediately to the right of a preposition, or immediately to the right of a main verb. Adverbs and adverbial phrases (including day...
Ngày tải lên: 20/02/2014, 21:20
... are made up of ” NP QT is made up of NP’ C “p(x) are made of NP QT are made of NP’ C “p(x) comprise” NP QT comprise (of) ? NP’ C “p(x) consist of NP QT consist of NP’ C Table 2: Clues and Patterns ... (Web- Jac) measure relies on the web search engine to calculate the number of documents in which x and y co-occur close to each other, divided by the number of documents...
Ngày tải lên: 08/03/2014, 02:21
Báo cáo khoa học: "Automatic Acquisition of Adjectival Subcategorization from Corpora" docx
... for subcategorization acquisition. 1 Introduction Research into automatic acquisition of lexical in- formation from large repositories of unannotated text (such as the web, corpora of published text, etc.) ... University of Edinburgh Laboratory for Foundations of Computer Science. state -of- art statistical systems and for improving the portability of these systems betwee...
Ngày tải lên: 08/03/2014, 04:22
Báo cáo khoa học: "Automatic Acquisition of English Topic Signatures Based on a Second Language" potx
... Chinese-English and English- Chinese bilingual lexicons and a large amount of Chinese text, which can be collected either from the Web or from Chinese corpora. Since topic sig- natures are potentially ... instances of the fi- nancial sense of interest. One set was extracted from a hand-tagged corpus (Bruce and Wiebe, 1994) and the other by our algorithm. 3 Application on...
Ngày tải lên: 08/03/2014, 04:22
Báo cáo khoa học: "Automatic Acquisition of Named Entity Tagged Corpus from World Wide Web" pot
... Collecting Web Documents It is not appropriate for our purpose to randomly col- lect documents from the web. This is because not all web documents actually contain some NE instances and we also ... web to be used for learning of Named En- tity Recognition systems. We use an NE list and an web search engine to col- lect web documents which contain the NE instances. T...
Ngày tải lên: 08/03/2014, 04:22
Báo cáo khoa học: "Automatic Acquisition of Language Model based on Head-Dependent Relation between Words" pdf
... complete-sequence composed of two complete-links, and (b) is a leftward one. (c) is a complete-sequence composed of zero complete- links, and it can be both leftward and rightward. The word of "complete" ... list of all pairs of unique words in the corpus. The initial pairs represent the ten- tative head-dependent relations of the words. And the initial pro...
Ngày tải lên: 08/03/2014, 05:21
Báo cáo khoa học: "AUTOMATIC ACQUISITION OF A LARGE SUBCATEGORIZATION DICTIONARY FROM CORPORA" doc
... basic measures of results are the in- formation retrieval notions of recall and precision: How many of the subcategorization frames of the verbs were learned and what percentage of the things ... many of the uses of verbs in a text are captured by our subcate- gorization dictionary. For two randomly selected pieces of text from other parts of the New York Times...
Ngày tải lên: 23/03/2014, 20:20
Báo cáo khoa học: "AUTOMATIC ACQUISITION OF THE LEXICAL SEMANTICS OF VERBS FROM SENTENCE FRAMES*" doc
... AUTOMATIC ACQUISITION OF THE LEXICAL SEMANTICS OF VERBS FROM SENTENCE FRAMES* Mort Webster and Mitch Marcus Department of Computer and Information Science University of Pennsylvania ... analysis of the classes currently handled. It is interesting to note that although the partial ordering of verb classes is defined in terms of fea- tures defined over syntactic...
Ngày tải lên: 31/03/2014, 18:20
Báo cáo khoa học: "Automatic Acquisition of Script Knowledge from a Text Collection" docx
... Automatic Acquisition of Script Knowledge from a Text Collection Toshiaki Fujiki Hidetsugu Nanba Interdisciplinary Graduate School of Graduate School of Science and Engineering Information ... automatic acquisition of script knowl- edge and investigated the effectiveness of our method. We used issues of Nihon Keizai Shim- bun for the past 11 years (1990-2000) as a ne...
Ngày tải lên: 31/03/2014, 20:20