Scientific report: "Automatic Labelling of Topic Models" doc
... not directly related to topic labelling, Chang et al. (2009) were one of the first to propose human labelling of topic models, in the form of synthetic intruder word and topic detection tasks. ... to approach topic labelling via best term selection, i.e. selecting one of the top-10 topic terms to label the overall topic. While it is often possible to label topics...
Uploaded: 30/03/2014, 21:20
... earthquakes up to 156 documents for one of the terrorist attacks). This variation in the number of documents per topic is typical for the TDT corpus. Many of the current approaches of domain modeling ... 6 instances of presidential elections and 3 instances of terrorist attacks. The number of the documents corresponding to the instances varies greatly (from two documents for o...
Uploaded: 17/03/2014, 04:20
... identification of the topic for each call to make use of information available in the model. Below we show examples of the use of the model for topic identification. 5.1 Topic Identification Many of the ... because of its hierarchical nature where information can be obtained for topics of various granularity unlike (Mishne et al., 2005) where there is no concept of topics...
Uploaded: 31/03/2014, 01:20
Scientific report: "Automatic Recognition of Intonation Patterns" docx
... in terms of a sequence of discrete elements whose relation to the quantitative level of description is not transparent. An analysis of a contour thus relates heterogeneous levels of description, ... recognition. The domain of this project is English intonation. The system I will describe analyzes fundamental frequency contours (F0 contours) of speech in terms of the theo...
Uploaded: 31/03/2014, 17:20
Scientific report: "Automatic Detection of Text Genre" doc
... properties of a text that are associated with facets. 2.1 Structural Cues Examples of structural cues are passives, nominalizations, topicalized sentences, and counts of the frequency of syntactic ... cost of training on a large set of cues. Variation measures capture the amount of variation of a certain count cue in a text (e.g. the standard deviation in sente...
Uploaded: 31/03/2014, 21:20
Scientific report: "Automatic Acquisition of English Topic Signatures Based on a Second Language" potx
... 1. {English topic signature 1} 2. {English topic signature 2} 1. {English topic signature 1} 2. {English topic signature 2} Figure 1: Process of automatic acquisition of topic signatures. For ... results show that our topic signatures are useful for WSD. The remainder of the paper is organised as follows. Section 2 describes the process of acquisition of t...
Uploaded: 08/03/2014, 04:22
Document: Scientific report: "Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes" pdf
... (possible) medical conditions. The importance of the task of negation and speculation (a.k.a. hedge) detection is attested by a number of research initiatives. The creation of the BioScope corpus (Vincze ... Statistics of the BioScope corpus. The 2nd and 3rd columns show the total number of cues within the datasets; the 4th and 5th columns show the percentage of negated and...
Uploaded: 20/02/2014, 04:20
Document: Scientific report: "Automatic learning of textual entailments with cross-pair similarities" ppt
... examples of the previous section. From the point of view of bag-of-word methods, the pairs (T1, H1) and (T1, H2) have both the same intra-pair similarity since the sentences of T1 and ... rules that describe a non-trivial set of entailment cases. The experiments with the data sets of the RTE 2005 challenge show an improvement of 4.4% over the state-of-the-art me...
Uploaded: 20/02/2014, 12:20
Document: Scientific report: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" docx
... there are two sentences in each of the 454 (1) kono software-no riten-ha hayaku ugoku koto this software-POST advantage-POST quickly run to The advantage of this software is to run quickly. (2) ... the polarity of words There are some works that discuss learning the polarity of words instead of sentences. Hatzivassiloglou and McKeown proposed a method of learning the polarity...
Uploaded: 20/02/2014, 12:20
Document: Scientific report: "Automatic Identification of Pro and Con Reasons in Online Reviews" ppt
... computational model because of the difficulty of determining the unit of an opinion. In general, researchers study opinion at three different levels: word level, sentence level, and document level. ... a minimum unit of opinion. Researchers try to identify opinion-bearing sentences, classify their sentiment, and identify opinion holders and topics of opinion sentences. D...
Uploaded: 20/02/2014, 12:20