Báo cáo khoa học: "Discourse Cues for Broadcast News Segmentation" potx
... other news programs such as CNN Prime News (See Figure 1) or MS-NBC. For example, the structure for the Jim Lehrer News Hour provides not only segmentation information but also content information ... 819 2.1 News Story Discourse Structure Broadcast news has a prevalent structure with often explicit cues to signal story shifts. For example, analysis of the structur...
Ngày tải lên: 23/03/2014, 19:20
... there may be no derivational cues for the lexical semantics of a particular word. This is not the case for other surface cues, e.g., distributional cues exist for every word in a corpus. ... identifiable abundant sur- face cues exist for the needed lexical semantic infor- mation. The next section explores the possibility of using derivational affixes as surface cues...
Ngày tải lên: 17/03/2014, 09:20
... the Request schema for inducing actions, the Evidence schema for making claims credible, the/nform schema for causing the reader to know particular information, and so forth. Our knowledge ... particular kinds of purposes which they serve: questions for obtaining information, marked syntactic constructions for creating emphasis, and so forth. At the schema level as well, it...
Ngày tải lên: 24/03/2014, 01:21
Tài liệu Báo cáo khoa học: "Unsupervised Search for The Optimal Segmentation for Statistical Machine Translation" doc
... 31–36, Uppsala, Sweden, 13 July 2010. c 2010 Association for Computational Linguistics Unsupervised Search for The Optimal Segmentation for Statistical Machine Translation Cos¸kun Mermer 1,3 and ... segmentation schemes can outperform a straightforward linguis- tic morphological segmentation, e.g., (Habash and Sadat, 2006), and (ii) it may result in even worse performance than a word-b...
Ngày tải lên: 20/02/2014, 04:20
Báo cáo khoa học: "Transductive learning for statistical machine translation" potx
... genres: editorials, newswire and political speeches in the 2004 test set, and broadcast conversations, broad- cast news, newsgroups and newswire in the 2006 test set. Most of these domains have characteristics which ... Algorithm 1 we run it for a fixed number of iterations and instead focus on finding useful def- initions for Estimate, Score and Select that can be experimentally shown t...
Ngày tải lên: 08/03/2014, 02:21
Báo cáo khoa học: "Pronunciation Modeling for Improved Spelling Correction" potx
... presents a method for incor- porating word pronunciation information in a noisy channel model for spelling cor- rection. The proposed method builds an explicit error model for word pronuncia- tions. ... correspond to nothing (i.e., are silent). For example, the entry in NETtalk (when we remove the empties, which contain information for phone level alignment) for the word able is A...
Ngày tải lên: 17/03/2014, 08:20
Báo cáo khoa học: "Extractive Summaries for Educational Science Content" potx
... basis for the construction of knowledge maps useful both as computational knowledge representations and as learning re- sources for presentation to the student. 2 Related Work Our work is informed ... of very fine granularity and therefore graphs that may not be suitable for presentation to a student. Multi-document summarization (MDS) re- search also informs our work. XDoX analy...
Ngày tải lên: 23/03/2014, 17:20
Báo cáo khoa học: "Topic Analysis for Psychiatric Document Retrieval" potx
... to different suggestions decided by experts. Therefore, an ideal retrieval system for consultation documents should consider such topic information so as to improve the retrieval precision. ... extraction of topic informa- tion is described briefly. The detailed process is described in (Wu et al. 2005a) for symptom and relation identification, and in (Yu et al., 2007) for event id...
Ngày tải lên: 23/03/2014, 18:20
Báo cáo khoa học: "AN ALGORITHM FOR PLAN RECOGNITION COLLABORATIVE DISCOURSE*" pot
... recipe rep- resents information about the performance, in the ab- stract, of act-types, an rgraph represents more spe- cialized information by including act-type performance agents and times. ... assumptions made by his formal model, we directly compare our algorithm to his. ACTION P~EPRESENTATION We use the action representation formally defined by Balkanski (1990) for modelling c...
Ngày tải lên: 23/03/2014, 20:20
Báo cáo khoa học: "Genre distinctions for Discourse in the Penn TreeBank" pot
... comprise the “right” set of genres for future use of the PTB for discourse-related 676 language technology, just that some sensitivity to genre will lead to better performance. Some simple differences ... mean that the latter have to be learned independently of the former. 8 Conclusion This paper has, for the first time, provided genre information about the articles in the Penn Tree- Ban...
Ngày tải lên: 30/03/2014, 23:20