Scientific report: "Automatic Acquisition of Language Model based on Head-Dependent Relation between Words" pdf
... Automatic Acquisition of Language Model based on Head-Dependent Relation between Words Seungmi Lee and Key-Sun Choi Department of Computer Science Center for Artificial ... grammar, grammar-based language modeling is expected to be more powerful and compact in model size than n-gram-based one. In this paper we present a language modeling based on a kin...
Uploaded: 08/03/2014, 05:21
... Most language models fall into the class of n-gram models, which approximate the distribution over sentences using the conditional distribution of each word given a context consisting of only ... are examples of nonparametric Bayesian models. Here we give a quick description of the Pitman-Yor process in the context of a unigram language model; good tutorials on such m...
Uploaded: 17/03/2014, 04:20
... Space. A context vector is the sum of the vectors of concepts that occur in a context window. If many of the concepts in a window have a strong component for one of the topics, then the sum of the ... Automatic Acquisition of English Topic Signatures Based on a Second Language Xinglong Wang Department of Informatics University of Sussex Brighton, BN1 9QH, UK xw20@sus...
Uploaded: 08/03/2014, 04:22
Document: Scientific report: "AUTOMATIC ACQUISITION OF SUBCATEGORIZATION FRAMES FROM UNTAGGED TEXT" doc
... novel technique based on the Case Filter of Rouvret and Vergnaud (1980). The completeness of the output list increases monotonically with the total number of occurrences of each verb in the ... subcategorization frames (SFs) detected so far The SF acquisition program has been tested on a corpus of 2.6 million words of the Wall Street Journal (kindly provided by the...
Uploaded: 20/02/2014, 21:20
Scientific report: "Automatic Acquisition of Ranked Qualia Structures from the Web" potx
... fixed number of basic components", "data mining comprises a range of data analysis techniques", "books consist of a series of dots", or "a conversation is made up of a series of observable interpersonal ... qualia role. 4.4 Conditional Probability (P) The non web-based conditional probability essentially differs from the Web-based conditional probability in that we only r...
Uploaded: 08/03/2014, 02:21
Scientific report: "Automatic Acquisition of Adjectival Subcategorization from Corpora" docx
... instantiations of slots. A match is considered successful if the set of GRs can be unified with any of the disjuncts. Unification of a sentence-relation and a pattern-relation occurs when there is a one-to-one correspondence ... for subcategorization acquisition. 1 Introduction Research into automatic acquisition of lexical information from large repositories of unanno...
Uploaded: 08/03/2014, 04:22
Scientific report: "Automatic Acquisition of Named Entity Tagged Corpus from World Wide Web" pot
... these erroneous matches. Sentence refinement is accomplished by three different processes: separation of functional words, segmentation of compound nouns, and verification of the usefulness of the ... other contexts to reduce errors of automatic annotation. For example, ‘경기(kyunggi, Kyunggi/business conditions/a game)’ is filtered out because it means a location (proper noun) i...
Uploaded: 08/03/2014, 04:22
Scientific report: "AUTOMATIC ACQUISITION OF A LARGE SUBCATEGORIZATION DICTIONARY FROM CORPORA" doc
... subcategorization dictionary from on-line corpora of unrestricted text: 1. Dictionaries with subcategorization information are unavailable for most languages (only a few recent dictionaries, generally ... location information that can be obtained from text corpora, the only research that I am aware of that has dealt directly with the problem of the automatic acquisition of...
Uploaded: 23/03/2014, 20:20
Scientific report: "AUTOMATIC ACQUISITION OF THE LEXICAL SEMANTICS OF VERBS FROM SENTENCE FRAMES*" doc
... or forbidden is merely optional. This model is built upon a classification of verbs based upon a simple three-valued set of features which represents key aspects of a verb's syntactic ... conjunctions of a finite number of predefined quasi-independent features with no need for disjunction or complex boolean combinations of features. Given such a feature set, the P...
Uploaded: 31/03/2014, 18:20
Scientific report: "Automatic Acquisition of Script Knowledge from a Text Collection" docx
... A 'pair of actions' consists of two actions that occur in time order. A 'sequence of actions' can be defined as a transitive closure of all the pairs of actions. 1. Cases ... of actions that occur in time order. We then chose among these actions the ones that are typical by ranking them in terms of the frequency of their occurrence. To extract sequences...
Uploaded: 31/03/2014, 20:20