Báo cáo khoa học: "Discovering Relations among Named Entities from Large Corpora" pot
... a named entity (NE) tagger to focus on the named entities which should be the arguments of relations. Recently developed named entity tag- gers work quite well and are able to extract named entities ... follows: 1. tagging named entities in text corpora 2. getting co-occurrence pairs of named entities and their context 3. measuring context similarities among pairs of n...
Ngày tải lên: 17/03/2014, 06:20
... 15000 clin- ical named entities in 11 entity types. This paper reports on the challenges involved in creating the annotation schema, and recog- nising and annotating clinical named enti- ties. ... step to the extraction of structured in- formation from these clinical notes is to achieve accurate identification of clinical concepts or named entities. An entity may refer to a concrete...
Ngày tải lên: 08/03/2014, 01:20
... extraction of entities, relations and events from various text sources, such as newswire documents and broadcast transcripts. One such task, relation detection, finds instances of predefined relations ... in- formation extraction that finds predefined relations between pairs of entities in text. This paper describes a relation detection approach that combines clues from dif...
Ngày tải lên: 31/03/2014, 03:20
Tài liệu Báo cáo khoa học: "Creating a Multilingual Collocation Dictionary from Large Text Corpora" docx
... The originality of our approach comes from the fact that collocations are not extracted from raw texts, but rather from syntactically parsed texts. The lin- guistic analysis selects potential pairs of words, ... Creating a Multilingual Collocation Dictionary from Large Text Corpora Luka Nerima, Violeta Seretan, Eric Wehrli Language Technology Laboratory (LATL), ... textual corpora...
Ngày tải lên: 22/02/2014, 02:20
Báo cáo khoa học: "Creating a Multilingual Collocation Dictionary from Large Text Corpora" ppt
... The originality of our approach comes from the fact that collocations are not extracted from raw texts, but rather from syntactically parsed texts. The lin- guistic analysis selects potential pairs of words, ... Creating a Multilingual Collocation Dictionary from Large Text Corpora Luka Nerima, Violeta Seretan, Eric Wehrli Language Technology Laboratory (LATL), ... textual corpora...
Ngày tải lên: 08/03/2014, 21:20
Báo cáo khoa học: "Joint Bilingual Sentiment Classification with Unlabeled Parallel Corpora" potx
... resources (e.g. lexicons) from resource- rich languages (typically English) to other languages, with the goal of transferring sentiment or subjectivity analysis capabilities from English to other ... adaptation from one language (usually English) to other languages with few sentiment resources. Mihalcea et al. (2007), for example, generate subjectivity analysis resources in a new...
Ngày tải lên: 17/03/2014, 00:20
Tài liệu Báo cáo khoa học: "Inducing Gazetteers for Named Entity Recognition by Large-scale Clustering of Dependency Relations" ppt
... en- tries from a large amount of dependency relations in Web documents. To our knowledge, no one else has performed this type of clustering on such a large scale. Wikipedia also produced a large ... the clustering with a vocabulary that is large enough to cover the many named entities required to improve the accuracy of NER is difficult. We enabled such large- scale clustering...
Ngày tải lên: 20/02/2014, 09:20
Tài liệu Báo cáo khoa học: "Discovering asymmetric entailment relations between verbs using selectional preferences" doc
... Language Processing applications often need to rely on large amount of lexical semantic knowledge to achieve good performances. Asym- metric verb relations are part of it. Consider for example the ... resources focus on symmetric semantic relations, such as verb similarity. Yet, not enough attention has been paid so far to the study of asymmetric verb relations, that are often the only...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "Robust Extraction of Named Entity Including Unfamiliar Word" doc
... calculated from a large unannotated corpus. After that, tra- ditional machine learning approaches are em- ployed as the second step. The experiments of extracting Japanese named entities from IREX corpus ... paper proposes a novel method to extract named entities including unfamiliar words which do not occur or occur few times in a training corpus using a large unannotated cor...
Ngày tải lên: 20/02/2014, 09:20
Tài liệu Báo cáo khoa học: "Discourse Relations: A Structural and Presuppositional Account Using Lexicalised TAG*" docx
... of discourse relations. This is achieved using the same semantic machinery used in deriving clause-level semantics. 1 Introduction Research on discourse structure has, by and large, attempted ... informational (semantic) and inten- tional relations can hold between clauses simultan- eously and independently. This suggests that factor- ing the two kinds of relations might lead to...
Ngày tải lên: 20/02/2014, 18:20