Báo cáo khoa học: "AUTOMATIC ACQUISITION OF A LARGE SUBCATEGORIZATION DICTIONARY FROM CORPORA" doc
... computer manuals. 3. Hand-coded lists are expensive to make, and in- variably incomplete. 4. A subcategorization dictionary obtained auto- matically from corpora can be updated quickly and easily ... learning a foreign language. A subcategorization frame is a statement of what types of syntactic arguments a verb (or ad- jective) takes, such as objects, infinitive...
Ngày tải lên: 23/03/2014, 20:20
... learning approach, which is more attractive because it is trainable and adaptable, and subsequently the porting of a machine learning sys- tem to another domain is much easier than that of a rule-based ... procedures and NE instances are finally annotated with the appropriate NE categories. This automatically tagged corpus may have lower quality than the manually tagged ones but its si...
Ngày tải lên: 08/03/2014, 04:22
... sense-tagged corpus, the TWA Sense Tagged Data Set, manually pro- duced by Rada Mihalcea and Li Yang (Mihalcea, 2003), from text drawn from the British National Corpus. We calculated a ‘supervised’ ... of the most influential newspaper in mainland China. It maintains a vast database of news stories, available to search by the public. Among other reasons, we chose this website be- c...
Ngày tải lên: 08/03/2014, 04:22
Báo cáo khoa học: "Automatic Acquisition of Script Knowledge from a Text Collection" docx
... first explain what we mean by 'action', 'pair of actions', and 'sequence of actions' in this paper. In this work, an action is defined as a tuple of a transitive ... For example, when we go to a restaurant, we usually 'enter the restaurant', 'wait', 'sit down', 'get the menu and decide what to eat', 'order...
Ngày tải lên: 31/03/2014, 20:20
Tài liệu Báo cáo khoa học: "AUTOMATIC ACQUISITION OF SUBCATEGORIZATION FRAMES FROM UNTAGGED TEXT" doc
... AUTOMATIC ACQUISITION OF SUBCATEGORIZATION FRAMES FROM UNTAGGED TEXT Michael R. Brent MIT AI Lab 545 Technology Square Cambridge, Massachusetts 02139 michael@ai.mit.edu ABSTRACT This ... immediately to the right of a main verb. Adverbs and adverbial phrases (including days and dates) are ignored for the pur- poses of case adjacency. A noun-phrase that sat- isfies th...
Ngày tải lên: 20/02/2014, 21:20
Báo cáo khoa học: "Automatic Acquisition of Ranked Qualia Structures from the Web" potx
... explore the impact of qualia struc- tures for natural language processing at a larger scale. The approach builds on ear- lier work based on the idea of matching spe- cific lexico-syntactic patterns conveying ... easier to validate existing qualia structures than to create them from scratch, which already corroborates the usefulness of our automatic approach. The qualia structure for...
Ngày tải lên: 08/03/2014, 02:21
Báo cáo khoa học: "Automatic Acquisition of Adjectival Subcategorization from Corpora" docx
... Proceedings of the 43rd Annual Meeting of the ACL, pages 614–621, Ann Arbor, June 2005. c 2005 Association for Computational Linguistics Automatic Acquisition of Adjectival Subcategorization from Corpora Jeremy ... recall rate. A new tool for linguistic annotation of SCFs in corpus data is also introduced which can considerably alleviate the pro- cess of obtaining training...
Ngày tải lên: 08/03/2014, 04:22
Báo cáo khoa học: "Automatic Acquisition of Language Model based on Head-Dependent Relation between Words" pdf
... which is a key part of many natural language applications such as speech recognition and statistical ma- chine translation. In this paper, we present a language modeling based on a kind of simple ... associate a priori prob- ability to a sentence. It is a key part of many natural language applications such as speech recognition and statistical machine translation. Pr...
Ngày tải lên: 08/03/2014, 05:21
Báo cáo khoa học: "Automatic construction of a hypernym-labeled noun hierarchy from text" docx
... would initially require a 50,000 x 50,000 array of values (or a trian- gular array of about half this size). With our current hardware, the largest array we can comfortably handle is about 100 ... they approximated this data by just looking at the nearest NP on each side of a particular NP. Roark and Charniak (1998) built on that work by actu- ally using conjunction and ap...
Ngày tải lên: 08/03/2014, 06:20
Báo cáo khoa học: "Automatic Induction of a CCG Grammar for Turkish" pptx
... Combinatory Categorial Grammar Combinatory Categorial Grammar (Ades and Steed- man, 1982; Steedman, 2000) is an extension to the classical Categorial Grammar (CG) of Aj- dukiewicz (1935) and Bar-Hillel ... is added from “araba” to “uyudu˘gum” to emphasize that the predicate is intransitive and it may have a locative adjunct. Similarly, a T.OBJECT link is added from “kitap” to “okudu...
Ngày tải lên: 17/03/2014, 06:20