Báo cáo khoa học: "Template-Based Information Extraction without the Templates" pot
... Entity Extraction Once documents are labeled with templates, we next extract entities into the template slots. Extraction oc- curs in the trigger sentences from the previous sec- tion. The extraction ... versa. We thus take the max- imum of either cosine score as our final similarity metric between two relations. We then back off to the average of the two cosine scores if...
Ngày tải lên: 23/03/2014, 16:20
... instance, the error in the first pass may be transferred to the second pass when determining the extraction range of detailed information. Therefore the precision and recall of detailed information ... type of them appears in the text, then the weight of this feature is 1, otherwise is 0. 3.3 Block Selection Block selection is used to select the blocks generated fr...
Ngày tải lên: 20/02/2014, 15:20
... This way, the user can, while reading a text, immediately link up textual information to the Internet or to any other docu- ment base without accessing a search engine. The quality of the link ... Further- more, coreferences serve as a means of information transport into the output description on the RHS of the rule. Finally, the choice of feature structures as primar...
Ngày tải lên: 20/02/2014, 16:20
Báo cáo khoa học: "Open Information Extraction using Wikipedia" pdf
... Patterns of the <type>”: The matcher first identifies the type of the entity (e.g., “city” for “Ithaca”), then instantiates the pattern to create the string the city.” Since the first sentence ... sentence if the subject and/or attribute value are not heads of the noun phrases containing them. Third, it discards the sentence if the subject and the attribute value do...
Ngày tải lên: 23/03/2014, 16:20
Báo cáo khoa học: "Sparse Information Extraction: Unsupervised Language Models to the Rescue" pptx
... be correct based on the conjunction of the KnowItAll and the distributional hypotheses. The contributions of the paper are as follows: • The paper introduces the insight that the sub- field of language ... methodology, and then present our re- sults. The first experiment tests the hypothesis that HMM-T outperforms an n-gram-based method on the task of type checking. The...
Ngày tải lên: 31/03/2014, 01:20
Báo cáo khoa học: "Learning Information Structure in The Prague Treebank" docx
... MOD, MANN, ATT, OTHER. The derived features are computed using the de- pendency information from the tectogrammatical level of the treebank and the surface order of the words corresponding to the nodes 5 . ... items belong to the Topic and f items to the Focus. Before the manual annotation, the corpus has been preprocessed to mark all nodes with the TFA attribute of...
Ngày tải lên: 23/03/2014, 19:20
Báo cáo khoa học: "Extracting Hypernym Pairs from the Web" potx
... in em- ploying the web for the extraction of hypernym re- lations. We are especially curious about whether the size of the web allows to achieve meaningful results with basic extraction techniques. In ... make the same assumption as Snow et al. (2005): the hy- pernymy relations in the WordNets are complete for the terms that they contain. This means that if two words are p...
Ngày tải lên: 17/03/2014, 04:20
Báo cáo khoa học: "Synonymous Collocation Extraction Using Translation Information" ppt
... 1998; Gasperin et al., 2001). The methods used the contexts around the investigated words to discover synonyms. The problem of the methods is that the precision of the extracted synonymous words ... are the highest. In order to compare our methods with other methods under the same recall value, we conduct another experiment on the type <verb, OBJ, noun> 4 . We...
Ngày tải lên: 17/03/2014, 06:20
Báo cáo khoa học: "Unsupervised Relation Extraction by Mining Wikipedia Texts Using Information from the Web" pdf
... leveraging the vast size of the Web. Our hypothesis is that there exist some key terms and patterns that provide clues to the rela- tions between pairs. From the snippets retrieved by the search ... information contribute greatly to the coverage of relation extraction. • The combination of these patterns produces a clustering method to achieve high pre- cision for different...
Ngày tải lên: 23/03/2014, 16:21
Tài liệu Báo cáo khoa học: "Semantic Information and Derivation Rules for Robust Dialogue Act Detection in a Spoken Dialogue System" pptx
... λ A is the weight for the ASR score and the lexical score, λ L is the weight of the history score, and λ A + λ L = 1. Table 4 shows the results that history information will effect on the DA ... pivotal in the computa- tion of the lexical score. Finally, the lexical, the his- tory, and the ASR scores are combined to decide the 603 optimal dialogue act, and a proper...
Ngày tải lên: 20/02/2014, 05:20