Báo cáo khoa học: "Summarizing multiple spoken documents: ﬁn

Báo cáo khoa học: "Summarizing multiple spoken documents: ﬁnding evidence from untranscribed audio" potx

... 549–557, Suntec, Singapore, 2-7 August 2009. c 2009 ACL and AFNLP Summarizing multiple spoken documents: ﬁnding evidence from untranscribed audio Xiaodan Zhu, Gerald Penn and Frank Rudzicz University ... Canada {xzhu,gpenn,frank}@cs.toronto.edu Abstract This paper presents a model for summarizing multiple untranscribed spoken documents. Without assuming the availabi...

Ngày tải lên: 23/03/2014, 16:21

9 152 0

Báo cáo khoa học: "Automatic Acquisition of Ranked Qualia Structures from the Web" potx

... randomly selected words (17.7%) does not differ substantially from the overall average F-Measure (17.1%) to be sure that we have chosen words from all F-Measure ranges. In particular, we asked different ... Karlsruhe jowenderoth@googlemail.com Abstract This paper presents an approach for the automatic acquisition of qualia structures for nouns from the Web and thus opens the pos- sibi...

Ngày tải lên: 08/03/2014, 02:21

8 379 0

Báo cáo khoa học: "Automatic Single-Document Key Fact Extraction from Newswire Articles" potx

... hand, most of CNN’s story highlights are not taken from the beginning of the articles. In fact, more than 50% of the highlights stem from sentences that are not among the ﬁrst 100 words of ... story highlights, we gathered statistics from 1,200 CNN newswire articles. An additional 300 articles were set aside to serve as a test set later on. The articles were taken from a wide ra...

Ngày tải lên: 24/03/2014, 03:20

9 280 0

Tài liệu Báo cáo khoa học: "Mixing Multiple Translation Models in Statistical Machine Translation" docx

... language modeling toolkit. In Proceedings International Con- ference on Spoken Language Processing, pages 257– 286. Jorg Tiedemann. 2009. News from opus - a collection of multilingual parallel corpora with ... theory, it is assumed that the training and test datasets are drawn from the same distribu- tion, or in other words, they are from the same domain. However, bilingual corpora...

Ngày tải lên: 19/02/2014, 19:20

10 456 0

Tài liệu Báo cáo khoa học: "Using Multiple Sources to Construct a Sentiment Sensitive Thesaurus for Cross-Domain Sentiment Classiﬁcation" doc

... some labeled data for multiple other domains, des- ignated as the source domains. We automatically create a sentiment sensitive thesaurus using both labeled and unlabeled data from multiple source ... automatically created sentiment sensitive thesaurus. We use labeled data from multiple source domains and unlabeled data from source and target domains to rep- resent the distr...

Ngày tải lên: 20/02/2014, 04:20

10 556 0

Tài liệu Báo cáo khoa học: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" docx

... In addition, the corpus created from re- views is often noisy as we discuss in Section 2. This paper proposes a novel method of building polarity-tagged corpus from HTML documents. The idea behind ... that express opinion (opinion sentences) from HTML documents. Because this method is fully automatic and can be applied to arbitrary HTML documents, it does not suffer from the same...

Ngày tải lên: 20/02/2014, 12:20

8 409 0

Tài liệu Báo cáo khoa học: "Coherence in Spoken Discourse*" pptx

... discourse. Nevertheless, we believe that the entire spoken discourse is to be represented by one discourse structure. Evidence for this assumption comes from the observation that anaphoric refer- ences ... the producer. In order to obtain a better understand- ing of the planning process we analyse spoken discourse elicited in an experimental setting. Subjects describe the pi...

Ngày tải lên: 20/02/2014, 18:20

5 453 0

Tài liệu Báo cáo khoa học: "Summarizing Neonatal Time Series Data" doc

... we collected from NEONATE, to build the micro planner and realizer. 4.1 Content Selection and Segmentation The most important question in summarization is 'what data points from the input ... the domain constraints such as limits on parameter values. It is clear from our own studies on data summarization and also from the earlier studies by others (Shahar, 1997; Boyd, 1998; K...

Ngày tải lên: 22/02/2014, 02:20

4 337 0

Báo cáo khoa học: "Building Practical Spoken Dialog Systems" pdf

Ngày tải lên: 08/03/2014, 01:20

1 128 0

Báo cáo khoa học: "PARALLEL MULTIPLE CONTEXT-FREE GRAMMARS, FINITE-STATE TRANSLATION SYSTEMS, AND POLYNOMIAL-TIME RECOGNIZABLE SUBCLASSES OF LEXICAL-FUNCTIONAL GRAMMARS" docx

... Context-Free Grammars and Multiple Context-Free Grammars", Trans. IEICE, J71-D-I, 5:758-765. Kasami, T. et al. 1988b. "On the Membership Prob- lem for Head Language and Multiple Context-Free ... languages. Among them are parallel multiple context-free grammars (pmcfg's) and lexical-functional grammars (lfg's). Pmcfg's and their subclass called multiple...

Ngày tải lên: 08/03/2014, 07:20

10 397 0