Tài liệu Báo cáo khoa học: "Automatic Identification of Pro

Tài liệu Báo cáo khoa học: "Automatic Identification of Pro and Con Reasons in Online Reviews" ppt

... set of pro and con sentences in online reviews using clue phrases for pros and cons in epinions.com in order to train our system. We applied it to label sentences both on epinions.com and ... describes experimental and results and finally, in Section 6, we conclude with future work. 2 Pros and Cons in Online Reviews This section describes how we define...

Ngày tải lên: 20/02/2014, 12:20

8 461 1

Tài liệu Báo cáo khoa học: "Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes" pdf

... adaptation of the rule set to new domains and corpora. 1 Motivation Information Extraction (IE) systems often face the problem of distinguishing between afﬁrmed, negated, and speculative information in ... of phrases split into subsets (preceding vs. following their scope) to identify cues using string matching. The cue scopes extend from the cue to the beginning or end of the s...

Ngày tải lên: 20/02/2014, 04:20

5 544 1

Tài liệu Báo cáo khoa học: "Automatic learning of textual entailments with cross-pair similarities" ppt

... lines con- necting placeholders of two texts (hypotheses) in- dicate structurally equivalent nodes. For instance, the dashed line between 3 and b links the main verbs both in the texts T 1 and ... the point of view of bag -of- word methods, the pairs (T 1 , H 1 ) and (T 1 , H 2 ) have both the same intra-pair similarity since the sentences of T 1 and H 1 as well as tho...

Ngày tải lên: 20/02/2014, 12:20

8 413 0

Tài liệu Báo cáo khoa học: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" docx

... the indicator, and it is important to recognize indicators in HTML documents. To do this, we manually crafted lexicon, in which positive and negative indicators are listed. This lexicon consists ... consists of 303 positive and 433 negative indicators. Using this lexicon, the polarity-tagged corpus is constructed from HTML documents. The method consists of the following...

Ngày tải lên: 20/02/2014, 12:20

8 409 0

Tài liệu Báo cáo khoa học: "Automatic Evaluation of Sentence-Level Fluency Andrew Mutton∗" pdf

... (as in the treatment in Jurafsky and Martin (2000)). Similarly, the ultrasummarisa- tion model of Witbrock and Mittal (1999) consists of a content model, modelling the probability that a word in ... were interested in the level of agreement of intuitive un- derstanding of ﬂuency. We instructed them also that they should evaluate the sentence without consider- ing its con...

Ngày tải lên: 20/02/2014, 12:20

8 508 0

Tài liệu Báo cáo khoa học: "Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics" doc

... of the current consecutive matches ending at words x i and y j . Given two sentences X and Y, the WLCS score of X and Y can be computed using the following dynamic programming procedure: ... dynamic programming table, c(i,j) stores the WLCS score ending at word x i of X and y j of Y, w is the table storing the length of consecutive matches ended at c table positi...

Ngày tải lên: 20/02/2014, 16:20

8 443 0

Tài liệu Báo cáo khoa học: "Automatic clustering of collocation for detecting practical sense boundary" ppt

... paper explains the collocation ambiguity in chapter 2, defines the extracted collocation and proposes the used clustering methods and the labeling algorithms in chapter 3. After explaining the ... used in a similar context - where used and interrelated in same sense of the central word - in the sentence. If contextual words are clustered according to the similarity i...

Ngày tải lên: 20/02/2014, 16:20

4 425 0

Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx

... (text processing) √ 研究開発 (research and development) 情報処理学会 (Information Processing Society of Japan; IPSJ) √√ 意味処理 (semantic processing) √√ 音声処理 (speech processing) √ 音声情報処理 (speech information processing) √√ 情報処理 ... terms T ✛ Filtering ✛ ✓ ✒ ✏ ✑ candidates X Figure 1: System con guration Automatic acquisition of technical terms in a certain domain...

Ngày tải lên: 20/02/2014, 16:20

4 437 0

Tài liệu Báo cáo khoa học: "AUTOMATIC ACQUISITION OF SUBCATEGORIZATION FRAMES FROM UNTAGGED TEXT" doc

... for certain how many verbs occur in a sentence. Finding some of the verbs in a text reliably is hard enough; finding all of them reliably is well beyond the scope of this work. Finally, ... infinitive marker and to as a preposition. Then he measures the mutual information between occurrences of the verb and occurrences of infinitives following within a certain...

Ngày tải lên: 20/02/2014, 21:20

6 416 0

Tài liệu Báo cáo khoa học: "Automatic Detection of Nonreferential It in Spoken Multi-Party Dialog" doc

... machine learning. He represents instances of it as vectors of 35 features. These features encode, among other things, information about the parts of speech and lemmata of words in the context of ... constitutes a mi- nority of all instances of it. Evans (2001) reports that his corpus of approx. 370.000 words from the SUSANNE corpus and the BNC contains 3.171 examples of...

Ngày tải lên: 22/02/2014, 02:20

8 436 0