Báo cáo khoa học: "Joint Bilingual Sentiment Classification with Unlabeled Parallel Corpora" potx

Báo cáo khoa học: "Unsupervised Event Coreference Resolution with Rich Linguistic Features" potx

Báo cáo khoa học: "Unsupervised Event Coreference Resolution with Rich Linguistic Features" potx

... then associate each event mention with only one cluster from each set. The first set uses the transitive closure of the WordNet SYNONYMOUS relation to form clusters with all the words from WordNet ... alignment of semantic roles, we run both parsers on a large amount of unlabeled text. The result of this process is a map with all frame elements statistically aligned to all predi- cate...

Ngày tải lên: 07/03/2014, 22:20

11 336 0
Tài liệu Báo cáo khoa học: "Experiments in Semantic Classification" pptx

Tài liệu Báo cáo khoa học: "Experiments in Semantic Classification" pptx

... refinement of the row classification, but we could easily have several rows for a word in one clump, with quite a crude classification. Perhaps the best way of dealing with this result is to ... have dealt with 500 rows, but 2000 have been prepared. For the initial sample of 500 a small number of words that we have called “starting words,”* with varying ranges of uses, but wi...

Ngày tải lên: 19/02/2014, 19:20

16 472 0
Tài liệu Báo cáo khoa học: "Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT" pdf

Tài liệu Báo cáo khoa học: "Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT" pdf

... University Pittsburgh, PA, 15213, USA cdyer@cs.cmu.edu Abstract With a few exceptions, discriminative train- ing in statistical machine translation (SMT) has been content with tuning weights for large feature sets ... types: The first type explicitly counters overestimates of rule counts, or rules with bad overlap points, bad rewrites, or with undesired insertions of target-side termin...

Ngày tải lên: 19/02/2014, 19:20

11 549 0
Tài liệu Báo cáo khoa học: "Subjectivity and Sentiment Analysis of Modern Standard Arabic" doc

Tài liệu Báo cáo khoa học: "Subjectivity and Sentiment Analysis of Modern Standard Arabic" doc

... lemmatization settings, the Stem was found to perform best with 73.17% F (with 1g+2g), compared to 71.97% F (with 1g+2g+3g) for Sur- face and 72.74% F (with 1g+2g) for Lemma. In ad- dition, adding the ... slightly with the two settings. In addition, the UNIQUE feature helps classification with the Lemma, but it hurts with the Stem+Morph. Table 2 shows that although performance on th...

Ngày tải lên: 20/02/2014, 05:20

5 581 0
Tài liệu Báo cáo khoa học: "MemeTube: A Sentiment-based Audiovisual System for Analyzing and Displaying Microblog Messages" pdf

Tài liệu Báo cáo khoa học: "MemeTube: A Sentiment-based Audiovisual System for Analyzing and Displaying Microblog Messages" pdf

... annotated with sentiment labels, we train an n-gram language model for each sentiment. Then, we use such mod- el to calculate the probability that a post expresses the sentiment s associated with ... Summary of related works that detect sentiments in microblogs. 3 Sentiment Analysis of Microblog Posts First, we develop a classification model as our basic sentiment recogni...

Ngày tải lên: 20/02/2014, 05:20

6 449 0
Tài liệu Báo cáo khoa học: "Joint Word Segmentation and POS Tagging using a Single Perceptron" docx

Tài liệu Báo cáo khoa học: "Joint Word Segmentation and POS Tagging using a Single Perceptron" docx

... t 6 word w with tag t and previous character c 7 word w with tag t and next character c 8 tag t on single-character word w in charac- ter trigram c 1 wc 2 9 tag t on a word starting with char ... ending with char c 11 tag t on a word containing char c (not the starting or ending character) 12 tag t on a word starting with char c 0 and containing char c 13 tag t on a word ending wit...

Ngày tải lên: 20/02/2014, 09:20

9 576 0
Tài liệu Báo cáo khoa học: "Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs" pptx

Tài liệu Báo cáo khoa học: "Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs" pptx

... amounts of bilingual data are available for the desired lan- guage pair L1-L2, large-scale bilingual corpora in L1-L3 and L2-L3 are available. Based on these two additional corpora and with L3 ... links with bilingual corpora (Wu, 1997; Och and Ney, 2003; Cherry and Lin, 2003; Zhang and Gildea, 2005). In order to achieve satisfactory results, all of these methods require a l...

Ngày tải lên: 20/02/2014, 12:20

8 359 0
Báo cáo khoa học: "Joint Inference of Named Entity Recognition and Normalization for Tweets" doc

Báo cáo khoa học: "Joint Inference of Named Entity Recognition and Normalization for Tweets" doc

... “···without you is like an iphone without apps; Lady gaga with- out her telephone···”, the labeled sequence us- ing the BILOU schema is: “···without O you O is O like O an O iphone U−P RODUCT without O apps O ; Lady B−P ... connected with a “1” label. Note that there are no NEN labels for pairs like “her 1 1 ” and “her 1 2 ” or with 1 1 and with 1 2 ”, since words like “her” and with ar...

Ngày tải lên: 07/03/2014, 18:20

10 444 0
Báo cáo khoa học: "A Bilingual Concordancer for Domain-Specific Computer Assisted Translation" potx

Báo cáo khoa học: "A Bilingual Concordancer for Domain-Specific Computer Assisted Translation" potx

... web-based bilingual concordancer, DOMCAT 1 , for domain-specific computer assisted translation. Given a multi-word expression as a query, the system involves retrieving sentence pairs from a bilingual ... measures and coverage rate respectively. 1 Introduction A bilingual concordancer is a tool that can retrieve aligned sentence pairs in a parallel corpus whose source sentenc...

Ngày tải lên: 07/03/2014, 18:20

6 371 0
Báo cáo khoa học: "Joint Identification and Segmentation of Domain-Specific Dialogue Acts for Conversational Dialogue Systems" doc

Báo cáo khoa học: "Joint Identification and Segmentation of Domain-Specific Dialogue Acts for Conversational Dialogue Systems" doc

... any utterances with multiple dialogue acts. This makes it possible to create new conversational dialogue sys- tem scenarios that allow and encourage users to ex- press themselves with fewer restrictions, without ... average number of words in utterances with only a single dialogue act is 7.5 (with a maximum of 34, and minimum of 1), and the average length of utterances with multiple...

Ngày tải lên: 07/03/2014, 22:20

6 354 0
w