Báo cáo khoa học: "Creating a Multilingual Collocation Dictionary from Large Text Corpora" ppt
... corpora are available, also the translation equivalents of the collocation context are displayed, thus allowing the user to see how a given collocation was translated in different lan- guages, and ... is length-based and integrates a shal- low content analysis. It begins by individuating a paragraph in the target text which is a first candi- date as target paragraph, and which w...
Ngày tải lên: 08/03/2014, 21:20
... corpora are available, also the translation equivalents of the collocation context are displayed, thus allowing the user to see how a given collocation was translated in different lan- guages, and ... is length-based and integrates a shal- low content analysis. It begins by individuating a paragraph in the target text which is a first candi- date as target paragraph, and which w...
Ngày tải lên: 22/02/2014, 02:20
... corpus Ryo Nagata Konan University 8-9-1 Okamoto, Kobe 658-0072 Japan rnagata @ konan-u.ac.jp. Edward Whittaker Vera Sheinman The Japan Institute for Educational Measurement Inc. 3-2-4 Kita-Aoyama, Tokyo, ... Lee and Seneff, 2008; Nagata et al., 2004; Nagata et al., 2005; Nagata et al., 2006; Tetreault et al., 2010b). This is one of the most active research areas in natural language processin...
Ngày tải lên: 20/02/2014, 04:20
Báo cáo khoa học: "Creating a Gold Standard for Sentence Clustering in Multi-Document Summarization" potx
... Minneapolis” that he actu- ally is from Minnesota. 5 Evaluation measures The evaluation measures will compare a set of clusters to a set of classes. An ideal evaluation measure should reward a set ... very similar, almost paraphrases. For our task sentences that are not paraphrases can be in the same cluster (see rule 5, 8, 9). In general there are several constraints that pull agains...
Ngày tải lên: 08/03/2014, 01:20
Báo cáo khoa học: "Creating a Corpus of Parse-Annotated Questions" docx
... re- search established that even a small amount of ad- ditional training data can give a substantial im- provement in question analysis in terms of both CFG parse accuracy and LFG grammatical func- tional ... for a given language or task. Large treebanks are available for major languages, however these are often based on a specific text type or genre, e.g. financial newspaper text...
Ngày tải lên: 08/03/2014, 02:21
Báo cáo khoa học: "Creating a CCGbank and a wide-coverage CCG lexicon for German" pdf
... Buch. (2) a. Gibt Peter Maria das Buch? b. Gib Maria das Buch! (3) a. dass Peter Maria das Buch gi bt. b. das Buch, das Peter Maria gibt. Local Scrambling In the so-called “Mittelfeld” all orders of arguments ... treats as sentential modifiers with an anaphoric depen- dency. Arguments that are moved u p are marked as extracted, and an additional “extraction” edge (explained below) from the...
Ngày tải lên: 08/03/2014, 02:21
Báo cáo khoa học: "JaBot: a multilingual Java-based intelligent agent for Web sites" pdf
... JaBot: a multilingual Java-based intelligent agent for Web sites Tim READ & Elena BARCENA Departamento de Filologias Extranjeras y sus Lingi isticas, UNED Senda del Rey s/n, Madrid ... Spain timread@sr.uned.es, ebarcena@sr.uned.es Abstract This paper presents a novel type of intelligent agent with a multilingual natural language interface, which retrieves information fr...
Ngày tải lên: 17/03/2014, 07:20
Tài liệu Báo cáo khoa học: "Automatic clustering of collocation for detecting practical sense boundary" ppt
... boundary in the existing dictionaries with practical senses from the large- scaled corpus. The collocation from the large- scaled corpus contains semantic information. The collocation for ambiguous ... collocation also have their collocation. A target word for collocation is called the ‘central word’, and a word in a collocation is referred to as the ‘contextual...
Ngày tải lên: 20/02/2014, 16:20
Tài liệu Báo cáo khoa học: "DiMLex: A lexicon of discourse markers for text generation and understanding" docx
... markers, we do not regard this distinction as particularly helpful, though. As we have illustrated above and will elaborate below, these words can carry a wide variety of semantic and pragmatic ... that the proper place for describing discourse markers is a dedicated lexicon that provides a classification of their syntactic, semantic and pragmatic features and characterizes the...
Ngày tải lên: 20/02/2014, 18:20
Tài liệu Báo cáo khoa học: "GPSM: A GENERALIZED PROBABILISTIC SEMANTIC MODEL FOR AMBIGUITY RESOLUTION" pptx
... In a large natural language processing system, such as a machine translation system (MTS), am- biguity resolution is a critical problem. Various rule-based and probabilistic approaches had ... LexD from a semantic representation. In general, a particular interpretation of a sentence can be represented by an annotated syntax tree (AST), which is a syntax tree annot...
Ngày tải lên: 20/02/2014, 21:20