Báo cáo khoa học: "Using Document Level Cross-Event Inference to Improve Event Extraction" potx
... attacks. 5 Cross -event Approach In this section we present our approach to using document- level event and role information to improve sentence -level ACE event extraction. Our event extraction ... a document- level statistical model for event trigger and argument (role) classification to achieve document level within -event and cross -event consisten...
Ngày tải lên: 30/03/2014, 21:20
... the Corpus-based validator is able to improve the results for the In- ventor relation, which the other two validators are completely unable to do. It is also of interest to compare the performance ... Extractor The Instance Extractor applies the patterns gener- ated by the Pattern Learner to the text corpus. In order to be able to match the slots of the patterns, the Insta...
Ngày tải lên: 23/03/2014, 18:20
... ence - 6: A brief history. In COLING, pages 466–471. IBM. 2010. IBM LanguageWare. P. G. Ipeir otis, E. Agichtein, P. Jain, and L. Gravano. 2006. To search or to crawl?: towards a query opti- mizer ... Patterns 50 20 10 10 10 50 50 10 Priority P 2 R 1 P 2 R 2 P 2 R 3 P 2 R 4 P 2 R 5 P 1 R 1 P 1 R 2 P 1 R 3 RuleId Input First Last Caps Token Output Person Input Lookup Token Output First Las...
Ngày tải lên: 23/03/2014, 16:20
Báo cáo khoa học: "Learning Document-Level Semantic Properties from Free-text Annotations" pot
... values h – document keyphrases η – document keyphrase topics λ – probability of selecting η instead of φ c – selects between η and φ for word topics φ – document topic model z – word topic assignment θ ... the documents are drawn from language models indexed by a set of topics, where the topics correspond to the keyphrase clusters. Crucially, we bias the assignment of hid- den topics...
Ngày tải lên: 23/03/2014, 17:20
Tài liệu Báo cáo khoa học: "Using Cycles and Quasi-Cycles to Disambiguate Dictionary Glosses" pdf
... ambiguous words to be disambiguated in the part-of-speech tagged gloss of sense s. Given a word w ∈ a(s), our aim is to disambiguate w according to the sense inven- tory of D, i.e. to assign it ... objective is to assign the right Italian sense from the Italian-English section to corsa n and gara n . To apply the CQC algorithm, a simple adapta- tion is needed, so as to all...
Ngày tải lên: 22/02/2014, 02:20
Báo cáo khoa học: "Using lexical and relational similarity to classify semantic relations" pptx
... equivalent to requiring that the kernel function equate to an inner product in some vector space. The kernel can be expressed in terms of a mapping function φ from the input space X to a feature ... practice, however, the vector length will be lower due to subsequences occurring more than once and many strings being shorter than s max . One way to reduce the memory load is to re-...
Ngày tải lên: 08/03/2014, 21:20
Báo cáo khoa học: "A Cross-Lingual ILP Solution to Zero Anaphora Resolution" potx
... often used non-anaphorically, to refer to situationally introduced entities, as in I went to John’s office, but they told me that he had left. 806 #instances (anaphoric/total) language type #docs ... Solution to Zero Anaphora Resolution Ryu Iida Tokyo Institute of Technology 2-12-1, ˆ Ookayama, Meguro, Tokyo 152-8552, Japan ryu-i@cl.cs.titech.ac.jp Massimo Poesio Universit`a di Trento,...
Ngày tải lên: 07/03/2014, 22:20
Báo cáo khoa học: "Reducing Wrong Labels in Distant Supervision for Relation Extraction" potx
... Technologies Laboratories Sony Corporation 5-1-12 Kitashinagawa, Shinagawa-ku, Tokyo Shingo.Takamatsu@jp.sony.com Issei Sato and Hiroshi Nakagawa Information Technology Center The University of Tokyo 7-3-1 ... Center The University of Tokyo 7-3-1 Hongo, Bunkyo-ku, Tokyo {sato@r., n3@}dl.itc.u-tokyo.ac.jp Abstract In relation extraction, distant supervision seeks to extract relations between...
Ngày tải lên: 23/03/2014, 14:20
Báo cáo khoa học: "Exploiting Web-Derived Selectional Preference to Improve Statistical Dependency Parsing" docx
... F score relative to dependency length. beyond the annotated corpora are needed to capture the bi-lexical relationship at the word -to- word level. Our purpose in this paper is to exploit web- derived ... Selectional Preference to Improve Statistical Dependency Parsing Guangyou Zhou, Jun Zhao ∗ , Kang Liu, and Li Cai National Laboratory of Pattern Recognition Institute of Automatio...
Ngày tải lên: 30/03/2014, 21:20
Tài liệu Báo cáo khoa học: "Ensemble Document Clustering Using Weighted Hypergraph Generated by NMF" docx
... the term -document matrix to the matrix and the transposed matrix of the matrix (Xuet al., 2003), where is the number of clusters; that is, The -th document corresponds to the -th row vector of ... weighted hypergraph. In our experiment, we use 18 document data sets provided at http://glaros.dtc.umn.edu/ gkhome/cluto/cluto/download . The document vector is not normalized for each d...
Ngày tải lên: 20/02/2014, 12:20