Báo cáo khoa học: "Temporally Anchored Relation Extraction" doc
... system’s 107 Training (1) IR candidate document retrieval (3) Distant supervised learning (5) Relation Extraction (6) Temporal Anchoring Document Collection Document Index (2) Document Representation (4) ... Knowledge Base (KB), we extract a set of relation triples or seeds: entity, relation, value, where the relation is one of the target relations. Our document-level distant sup...
Ngày tải lên: 07/03/2014, 18:20
... selected another 25 documents and added them to the previous 50 documents to get 75 documents. We made sure that every document participated in this experiment. The training documents for each ... The training documents for the 12 All the improvements of F in Table 7, 8 and 9 were significant at confidence levels >= 95%. 527 # docs F of Relation Classification F of Relation...
Ngày tải lên: 30/03/2014, 21:20
... second step, we propose a novel Relational Adaptive bootstraPping (RAP) algorithm to expand the seeds in the target domain by exploiting the labeled source domain data and the relation- ships between ... domain by mining some general syntactic relation patterns between the sen- timent and topic words from the source domain. In the second step, we propose a Relational Adaptive bootstraPpin...
Ngày tải lên: 19/02/2014, 19:20
Tài liệu Báo cáo khoa học: "Effective Phrase Translation Extraction from Alignment Models" ppt
... with diverse con- texts. 9 Conclusions We have presented a method to efficiently ex- tract phrase relationships from IBM word alignment models by leveraging the maximum approximation as well as
Ngày tải lên: 20/02/2014, 16:20
Báo cáo khoa học: "Discriminative Modeling of Extraction Sets for Machine Translation" pptx
... rules. Word-level alignments are gen- erated as a byproduct of inference. We first spec- ify the relationship between word alignments and extraction sets, then define our model. 2.1 Extraction Sets ... heuristic makes no errors, and the time required to compute pseudo-gold alignments is negligible. 5 Relationship to Previous Work Our model is certainly not the first alignment ap- proach to inclu...
Ngày tải lên: 07/03/2014, 22:20
Báo cáo khoa học: "Automatic Set Instance Extraction using the Web" pptx
... Ranker constructs a graph that models all the relations between documents, wrappers, and candidate in- stances. Figure 3 shows an example graph where each node d i represents a document, w i a wrapper, and m i a ... finds other probable instances (e.g., “apple”, “banana”) of S in web documents. As its name implies, SEAL is independent of document languages: both the written (e.g., English) and...
Ngày tải lên: 08/03/2014, 00:20
Báo cáo khoa học: "Rare Word Translation Extraction from Aligned Comparable Documents" doc
... appear together often in documents on the same topic, and rarely in non-related documents. This is the gen- eral assumption behind early work on bilingual lex- icon extraction from parallel documents using ... co- occurrence computation, we suggest to extend it to aligned comparable documents using document as the context window. This document context is too large for co-occurrence computatio...
Ngày tải lên: 17/03/2014, 00:20
Báo cáo khoa học: "A Declarative Information Extraction System" pdf
... Environment Optimizer Rules (XQL) Execution Engine Sample Documents Runtime Environment Runtime Environment Input Document Stream Annotated Document Stream Plan (Algebra) User Interface Publi sh Figure ... tuples in terms of the docu- ment text, or the content of other views. The input to the annotator is a special view called Document containing a single tuple with the document text. The AQL...
Ngày tải lên: 17/03/2014, 00:20
Báo cáo khoa học: "A Beam-Search Extraction Algorithm for Comparable Data" pptx
... all the English documents that have been published within a 7 day window of the source document. We select the 20 highest scoring English docu- ments for each source document . These 20 docu- ments ... used in (Quirk et al., 2007; Utiyama and Isahara, 2003). We split the Spanish data into documents. Each Spanish document is translated into a bag of En- glish words using Model-1 lexicon probab...
Ngày tải lên: 17/03/2014, 02:20
Báo cáo khoa học: "Learning Better Rule Extraction with Translation Span Alignment" pptx
... by the National Science Foundation of China (61073140), the Spe- cialized Research Fund for the Doctoral Program of Higher Education (20100042110031) and the Fundamental Research Funds for the
Ngày tải lên: 23/03/2014, 14:20