Scientific report: "Evaluation of Importance of Sentences based on Connectivity to Title"
... Evaluation of Importance of Sentences based on Connectivity to Title. Takehiko Yoshimi and Toshiyuki Okunishi, Takahiro Yamaji and Yoji Fukumochi. Software Business Development ... Corporation, 492 Minosho-cho, Yamatokoriyama, Nara, Japan. Abstract: This paper proposes a method of selecting important sentences from a text based on the evaluation of the connectivity...
Uploaded: 23/03/2014, 19:20
... asymptotic growth of the number of unique words. In the last two panels, we show the proportion of words appearing only once among the unique words; this gives an indication of the proportion of ... .5 (middle) and .9 (top). Third panel: proportion of words appearing only once, as a function of the number of words drawn, with d = .5 and θ = 1 (bottom), 10 (middle), 100 (top)...
Uploaded: 17/03/2014, 04:20
... method to that of the conventional methods through a common document retrieval task. Furthermore, as an application of our method, we apply it to a query-biased document summarization (Tombros ... topics of a document. For example, in case of classifying opinions to objects in a document, we have to identify what sort of opinion is assigned to the target objects, the...
Uploaded: 23/03/2014, 16:20
Scientific report: "A FLEXIBLE NATURAL LANGUAGE PARSER BASED ON A TWO-LEVEL REPRESENTATION OF SYNTAX"
... PARSER BASED ON A TWO-LEVEL REPRESENTATION OF SYNTAX. Leonardo Lesmo and Pietro Torasso, Istituto di Scienze dell'Informazione, Università di Torino, C.so Massimo D'Azeglio 42 - 10125 TORINO ... REL (Relation): verbs, copulas; REF (Referent): nouns, pronouns; CONN (Connector): prepositions, conjunctions; DET (Determiner): articles, demonstrative adjectives, adjectival question wor...
Uploaded: 18/03/2014, 02:20
Document: Scientific report: "A Pronoun Anaphora Resolution System based on Factorial Hidden Markov Models"
... 2003) decompose the task into a collection of pairwise or mention set coreference decisions. Decisions for each pair or each group of mentions are based on probabilities of features extracted ... few considerations made us reconsider. First, exceptions are found in the corpus. Personal pronouns such as she or he are used to refer to country, regions, states or organization...
Uploaded: 20/02/2014, 04:20
Scientific report: "a Chat-oriented Dialogue System based on the Vector Space Model"
... is conducted, tokenization and vectorization of the user input are carried out. During tokenization, an additional check is conducted by the dialogue manager. It looks for any adaptation ... vectorization, word tokenization was conducted. In this step, all punctuation marks were removed, with the exception of the question “?” and exclamation “!” marks. Similarly, all other no...
Uploaded: 07/03/2014, 18:20
Scientific report: "Chinese-English Term Translation Mining Based on Semantic Prediction"
... the process of the following estimation. 4 Translation candidate construction and noise solution. The goal of translation candidate construction is to construct and mine all kinds of possible ... is based on the clue that the context of the source term is very similar to that of the target translation in a large amount of corpora. 3) Acquiring translations from a combinati...
Uploaded: 17/03/2014, 04:20
Scientific report: "Automated Japanese Essay Scoring System based on Articles Written by Experts"
... that of the others because we consider it an index contributing to not only “rhetoric” but to “content” as well. 2.1 Ease of reading. The following items are considered indexes of “ease of reading.” 1. ... transition that takes on a conversational structure in the case of concession or compromise. Typical expressions indicating this relationship are “certainly” and of cour...
Uploaded: 23/03/2014, 18:20
Scientific report: "Correcting Errors in a Treebank Based on Synchronous Tree Substitution Grammar"
... demonstrates that we can obtain error correction rules with high precision. ... by the score function described in Section 3.3, since it is time-consuming and expensive to evaluate all of the rules. These 100 rules were applied at 331 positions. The precision of t...
Uploaded: 30/03/2014, 21:20
Scientific report: "A Statistical Machine Translation Model Based on a Synthetic Synchronous Grammar"
... The SSG-based Translation Model. The translation in our SSG-based translation model can be treated as an SSG derivation. A derivation consists of a sequence of grammar rule applications. To model ... corresponding hypotheses until all nonterminal leaf nodes are processed. The key feature of our decoder is that the derivations are based on synthetic grammar, so that one derivatio...
Uploaded: 31/03/2014, 00:20