Báo cáo khoa học: "Automatic Segmentation of Multiparty Dialogue" pot
... Automatic Segmentation of Multiparty Dialogue Pei-Yun Hsueh School of Informatics University of Edinburgh Edinburgh, EH8 9LW, GB p.hsueh@ed.ac.uk Johanna D. Moore School of Informatics University of ... different levels of granularity of segmentation, we explore the performance of our models for two tasks: hypothesizing where ma- jor topic changes occur and hypothesizing...
Ngày tải lên: 24/03/2014, 03:20
... Elements: Categorization Cognizer Item Category Criterion S NP PRP VP VBD NP SBAR IN S NNP VP VBD NP PP PRP IN NP NN Goal Source Theme Target NP He heard the sound of liquid slurping in a metal container as approached him from behindFarrell
Ngày tải lên: 08/03/2014, 05:20
... the efficacy of this method in the context of Chinese word segmentation and part -of- speech tagging, where no segmentation and POS tagging standards are widely accepted due to the lack of morphology ... latter problem often implies the former, as in our case study. To test the efficacy of our method we choose Chinese word segmentation and part -of- speech tagging, where the probl...
Ngày tải lên: 17/03/2014, 01:20
Tài liệu Báo cáo khoa học: "Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes" pdf
... (possible) medical conditions. The importance of the task of negation and spec- ulation (a.k.a. hedge) detection is attested by a num- ber of research initiatives. The creation of the Bio- Scope corpus (Vincze ... Statistics of the BioScope corpus. The 2nd and 3d columns show the total number of cues within the datasets; the 4th and 5th columns show the percentage of negated and...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: "Automatic learning of textual entailments with cross-pair similarities" ppt
... ex- amples of the previous section. From the point of view of bag -of- word methods, the pairs (T 1 , H 1 ) and (T 1 , H 2 ) have both the same intra-pair simi- larity since the sentences of T 1 and ... divi- dends.” entails the hypothesis H 1 : “At the end of the year, all solid insurance companies pay divi- dends.” but it does not entail the hypothesis H 2 : “At the end of the y...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "Unsupervised Segmentation of Chinese Text by Use of Branching Entropy" pdf
... Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, pages 428–435, Sydney, July 2006. c 2006 ... Computational Linguistics 428 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 1 2 3 4 5 6 7 8 entropy offset 429 430 431 432 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.55 0.6 0.65 0.7 0.75
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" docx
... there are two sentences in each of the 454 (1) kono software-no riten-ha hayaku ugoku koto this software-POST advantage-POS T quickly run to The advantage of this software is to run quickly. (2) ... the polarity of words There are some works that discuss learning the po- larity of words instead of sentences. Hatzivassiloglou and McKeown proposed a method of learning the polarity...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "Automatic Identification of Pro and Con Reasons in Online Reviews" ppt
... specific and tangible features. Also, there are somewhat a fixed set of features of a specific type of product, for exam- ple, ease of use, durability, battery life, photo quality, and shutter lag ... examples of sen- tences that our system identified as reasons of complaints. (1) Unfortunately, I find that I am no longer comfortable in your establishment because of the...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "Automatic Evaluation of Sentence-Level Fluency Andrew Mutton∗" pdf
... Methods PoStag In the first of these, we constructed a rough approximation of typical sentence grammar structure by taking bigrams over part -of- speech tags. 6 Then, given a string of PoS tags of length n, t 1 . ... magnitude of the first three parser metrics, however, lends support to the idea of Wan et al. (2005) to use something like these as indicators of generated sentence fl...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics" doc
... construction of N-best translation lexicons from parallel text. Melamed (1995) used the ratio (LCSR) between the length of the LCS of two words and the length of the longer word of the two ... WLCS score ending at word x i of X and y j of Y, w is the table storing the length of consecu- tive matches ended at c table position i and j, and f is a function of consecutiv...
Ngày tải lên: 20/02/2014, 16:20