Scientific report: "Automatic Headline Generation using Character Cross-Correlation" (doc)

... the generated headlines as well as the original headline. Also, they were asked to generate 1 headline each for every document. These new 3 headlines will be used as reference headlines in ... of documents. 1 Introduction A headline is considered as a condensed summary of a document. It can be classified as the acme of text summarization. The necessity for automatic headline g...

Uploaded: 20/02/2014, 05:20

Scientific report: "Word Translation Disambiguation Using Bilingual Bootstrapping" (doc)

... (a label). Many methods for word sense disambiguation using a supervised learning technique have been proposed. They include those using Naïve Bayes (Gale et al. 1992a), Decision List (Yarowsky ... way, we estimate $P(e \mid t_\varepsilon)$ using information from not only English but also Chinese. For $P^{(E)}(e \mid t_\varepsilon)$, we estimate it with MLE (Maximum Likelihood Estimation) using $L_\varepsilon$ as...

Uploaded: 20/02/2014, 21:20

Scientific report: "Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes" (pdf)

... 2).

Corpus Type      Sentences  Documents  Mean Document Size
Clinical              7520       1954                3.85
Full Papers           3352          9              372.44
Paper Abstracts      14565       1273               11.44

Table 1: Statistics of the BioScope corpus. Document sizes represent ... contrast, {D-mib appears to be uniformly expressed in imaginal discs}. 2) Differentiation assays using water soluble phorbol esters reveal that differentiation becomes irreversi...

Uploaded: 20/02/2014, 04:20

Scientific report: "Automatic Satire Detection: Are You Having a Laugh?" (ppt)

... embody characteristics of satire news documents. Headline features: Most of the articles in the corpus have a headline as their first line. To a human reader, the vast majority of the satire documents ... (4) manually filtering out “non-newsy”, irrelevant and overly-offensive documents from the top-10 returned documents (i.e. documents not containing satire news articles, or containing...

Uploaded: 20/02/2014, 09:20

Scientific report: "Automatic learning of textual entailments with cross-pair similarities" (ppt)

... idf(w) is the inverse document frequency of the word w. For the sake of comparison, we also consider the corresponding more classical version that does not apply the inverse document frequency: s_2(T, ... extract relevant syntactic subtrees between pairs of text and hypothesis. We also use them to characterize the syntactic information expressed by such subtrees. Indeed, Eq. 6 depends on...

Uploaded: 20/02/2014, 12:20

Scientific report: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" (docx)

... building a polarity-tagged corpus from HTML documents. The characteristic of this method is that it is fully automatic and can be applied to arbitrary HTML documents. The idea behind our method ... level rather than document level. The reason is that one document often includes both positive and negative sentences, and hence it is difficult to learn the polarity from the corpus tagged at do...

Uploaded: 20/02/2014, 12:20

Scientific report: "Automatic Identification of Pro and Con Reasons in Online Reviews" (ppt)

... opinion holders and topics of opinion sentences. Document level opinion analysis has been mostly applied to review classification, in which a whole document written for a review is judged as carrying ... Wilson et al., 2005). Document level sentiment classification is mostly applied to reviews, where systems assign a positive or negative sentiment for a whole review document (Pang e...

Uploaded: 20/02/2014, 12:20

Scientific report: "Automatic Evaluation of Sentence-Level Fluency", Andrew Mutton (pdf)

... depending on generation method, the machine learner provides a consistent estimator of fluency. 1 Introduction Intrinsic evaluation of the output of many language technologies can be characterised ... text in another language for machine translation (MT), a natural language generation (NLG) input representation, a document to be summarised, and so on; and how well it conforms to no...

Uploaded: 20/02/2014, 12:20

Scientific report: "Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics" (doc)

... Automatic Evaluation of Machine Translation Quality using N-gram Co-Occurrence Statistics. http://www.nist.gov/speech/tests/mt/doc/ngram-study.pdf Pantel, P. and Lin, D. 2002. ... BLEU with unigram and bigram, i.e. N=2, for the purpose of explanation and call this BLEU-2. Using S1 as the reference and S2 and S3 as the candidate translations, S2 and S3 would have...

Uploaded: 20/02/2014, 16:20

Scientific report: "Automatic clustering of collocation for detecting practical sense boundary" (ppt)

... language processing systems. This paper proposes a method for discovering sense boundaries using collocations from large corpora and clustering methods. In the experiments, the ... continuously. This paper discusses an analysis method for detecting practical senses using collocations. 2.2 Homonymous collocation The words in the collocation also have th...

Uploaded: 20/02/2014, 16:20
