Báo cáo khoa học: "Automatic Acquisition of Named Entity Tagged Corpus from World Wide Web" pot
... Automatic Acquisition of Named Entity Tagged Corpus from World Wide Web Joohui An Dept. of CSE POSTECH Pohang, Korea 790-784 minnie@postech.ac.kr Seungwoo Lee Dept. of CSE POSTECH Pohang, ... the con- structed NE tagged corpus, we apply it to a learn- ing of NER system and compare the results with the manually tagged corpus. 2 Automatic Acquisition of an NE...
Ngày tải lên: 08/03/2014, 04:22
... mainly from the Mandarin portion of the Chinese Gigaword Corpus (CGC), produced by the LDC 3 , which contains 1.3GB of newswire text drawn from Xinhua newspaper. Some Chi- nese translations of English ... instances of the fi- nancial sense of interest. One set was extracted from a hand -tagged corpus (Bruce and Wiebe, 1994) and the other by our algorithm. 3 Application on...
Ngày tải lên: 08/03/2014, 04:22
... Taiwan shukai@gmail.com Abstract Identification of transliterated names is a particularly difficult task of Named Entity Recognition (NER), especially in the Chi- nese context. Of all possible variations of transliterated named entities, ... to the automatic extraction of diverging transliterations of foreign named entities by bootstrapping co- occurrence statistics from...
Ngày tải lên: 17/03/2014, 04:20
Báo cáo khoa học: "AUTOMATIC ACQUISITION OF A LARGE SUBCATEGORIZATION DICTIONARY FROM CORPORA" doc
... be obtained from text corpora, the only research that I am aware of that has dealt directly with the problem of the automatic acquisition of subcategorization frames is a series of papers by ... many of the uses of verbs in a text are captured by our subcate- gorization dictionary. For two randomly selected pieces of text from other parts of the New York Times news...
Ngày tải lên: 23/03/2014, 20:20
Tài liệu Báo cáo khoa học: "AUTOMATIC ACQUISITION OF SUBCATEGORIZATION FRAMES FROM UNTAGGED TEXT" doc
... on the Case Filter of Rouvret and Vergnaud (1980). The completeness of the output list increases monotonically with the total number of occurrences of each verb in the corpus. False positive ... detected so far The SF acquisition program has been tested on a corpus of 2.6 million words of the Wall Street Journal (kindly provided by the Penn Tree Bank project). On this...
Ngày tải lên: 20/02/2014, 21:20
Báo cáo khoa học: "Automatic Acquisition of Ranked Qualia Structures from the Web" potx
... Pattern Singular “a(x) x is made up of ” NP QT is made up of NP’ C “a(x) x is made of NP QT is made of NP’ C “a(x) x comprises” NP QT comprises (of) ? NP’ C “a(x) x consists of NP QT consists of NP’ C Plural “p(x) ... NP’ C Plural “p(x) are made up of ” NP QT is made up of NP’ C “p(x) are made of NP QT are made of NP’ C “p(x) comprise” NP QT comprise (of) ? NP’ C “p(x) consis...
Ngày tải lên: 08/03/2014, 02:21
Báo cáo khoa học: "Automatic Acquisition of Adjectival Subcategorization from Corpora" docx
... Proceedings of the 43rd Annual Meeting of the ACL, pages 614–621, Ann Arbor, June 2005. c 2005 Association for Computational Linguistics Automatic Acquisition of Adjectival Subcategorization from Corpora Jeremy ... acquisition. 1 Introduction Research into automatic acquisition of lexical in- formation from large repositories of unannotated text (such as the web, corpora...
Ngày tải lên: 08/03/2014, 04:22
Báo cáo khoa học: "Automatic Acquisition of Language Model based on Head-Dependent Relation between Words" pdf
... (DEP) on a raw corpus extracted from KAIST corpus 3. The raw corpus consists of 1,589 sentences with 13,139 words, describing animal life in nature. We randomly divided the corpus into two ... Given a training corpus, the initial grammar is just a list of all pairs of unique words in the corpus. The initial pairs represent the ten- tative head-dependent relations of...
Ngày tải lên: 08/03/2014, 05:21
Báo cáo khoa học: "AUTOMATIC ACQUISITION OF THE LEXICAL SEMANTICS OF VERBS FROM SENTENCE FRAMES*" doc
... AUTOMATIC ACQUISITION OF THE LEXICAL SEMANTICS OF VERBS FROM SENTENCE FRAMES* Mort Webster and Mitch Marcus Department of Computer and Information Science University of Pennsylvania ... thank Beth Levin and the anonymotm reviewers of this paper for many helpful com- ments. We ~ b~efit~l greatly from disctumion of issues of verb acquisition in children with Lila G...
Ngày tải lên: 31/03/2014, 18:20
Báo cáo khoa học: "Automatic Acquisition of Script Knowledge from a Text Collection" docx
... Automatic Acquisition of Script Knowledge from a Text Collection Toshiaki Fujiki Hidetsugu Nanba Interdisciplinary Graduate School of Graduate School of Science and Engineering Information ... sequences (pairs) of actions from the text collection. 3. Selecting typical sequences. We show the outline of our method in Figure 1, where the process of automatic acquisition...
Ngày tải lên: 31/03/2014, 20:20