... in this study. Training Test Num of Files 728 110 Num of Sentences 9,878 5,290 Num of Words 238,906 165,862 Num of Phrases 141,426 101,449 Table 2: Information of the CTB4 Corpus 3 Chinese Chunking 3.1 ... Comparison of Voting Methods In this section, we compared the performance of the voting methods of four basic systems, which were used in Section 6.2.2. Table 5 shows the re...
Ngày tải lên: 08/03/2014, 02:21
... presence of alternative splicing around the 5¢-end of exon 6 of Meis2 and Meis3 was tested by RT-PCR. The positions of molecular mass markers are shown to the left, and the size in base pairs of the ... DNA-binding cofactors [10–12]. Meis2 is a member of the TALE superfamily of HD proteins, which are characterized by the presence of a three amino acid loop insertion between he...
Ngày tải lên: 16/02/2014, 15:20
Tài liệu Báo cáo khoa học: "An Empirical Investigation of Discounting in Cross-Domain Language Models" ppt
... k train (w) denote the number of occurrences of w in the training corpus, and k test (w) denote the number of occurrences of w in the test corpus. We define the empirical discount of w to be d(w) = k train (w) ... one of 110M words of NYT from 2000 and 2001 (NYT00+01), and one of 110M words of AFP from 2002, 2005, and 2006 (AFP02+05+06). In both cases, we compute ¯ d(i) and t...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: "An Implemented Description of Japanese: The Lexeed Dictionary and the Hinoki Treebank" ppt
... description of the most familiar 28,000 words of Japanese. 1 Introduction In this paper we describe the current state of a new lexical resource: the Hinoki treebank. The ultimate goal of our research ... will look at ways of extending our lexicon and ontology to less familiar words. 2 The Lexeed Semantic Database of Japanese The Lexeed Semantic Database of Japanese con- sists o...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "An Empirical Study of Information Synthesis Tasks" doc
... summaries consist of a) picking the first n sentences out of a set of selected documents (with different val- ues for n and different sets of documents) and b) taking the full content of a few doc- uments. ... conclusions of this work. 2 Creation of an Information Synthesis testbed We refer to Information Synthesis as the process of generating a topic-oriented report from a non...
Ngày tải lên: 20/02/2014, 15:20
Tài liệu Báo cáo khoa học: "An Evaluation Method of Words Tendency using Decision " docx
... classes of the input analysis data (test data). 2. POPULARITY OF WORDS CONSIDERING TIME-SERIES VARIATION 2.1 Stability Classes of the Words: To judge the index of popularity of words ... than that of straight line (2). The value of the slice of regression straight line (1) is also higher than that of regression straight line (2). So, we can decide that the words...
Ngày tải lên: 20/02/2014, 16:20
Tài liệu Báo cáo khoa học: "An Empirical Investigation of Proposals in Collaborative Dialogues" docx
... exist to the set of constraint equations, each varl in the set of equations must have a solution. For exam- ple, if 5 instances of sofas are known for varsola, but every assignment of a value to ... de- grees of strength) to some future course of action. The only distinction is whether the commitment is conditional on H's agreement (Offer) or not (Com- mit). With an O...
Ngày tải lên: 20/02/2014, 18:20
Tài liệu Báo cáo khoa học: "An Alternative Conception of Tree-Adjoining Derivation*" ppt
... of the same derived tree. This can be easily seen in that any adjunetion of/ 32 at a node at which an adjunction of/ 31 occurs could instead be replaced by an adjunction of/ 32 at the root of/ 31. ... thesis, Department of Computer and Information Science, Univer- sity of Pennsylvania. An Alternative Conception of Tree-Adjoining Derivation* Yves Schabes Department of Co...
Ngày tải lên: 20/02/2014, 21:20
Tài liệu Báo cáo khoa học: "DESIGN AND IMPLEMENTATION OF A LLXICAL DATA BASE " docx
... out some of the details of the implementation of this model. OVERVIEW OF THE PROBLSM One of the well-known characteristic features of natural languages is the size and the complexity of their ... amounts of text necessitates dictionaries of substantial size, of the order of at least tens of thousands of entries, perhaps even more than I00,000 lexical entries...
Ngày tải lên: 22/02/2014, 09:20
Báo cáo khoa học: "An Empirical Study of the Influence of Argument Conciseness on Argument Effectiveness" docx
... theory, the selection of what evidence to mention in an argument should be based on a measure of the evidence strength of support (or opposition) to the main claim of the argument (Mayberry ... primitive attribute of the entity. A value tree is a decomposition of the value of an entity into a hierarchy of aspects of the entity 2 , in which the leaves correspond to th...
Ngày tải lên: 08/03/2014, 05:20