Scientific report: "Structural Topic Model for Latent Topical Structure Analysis"
... coherent, modeling and discovering latent topical transition structures within documents would be beneficial for many text analysis tasks. In this work, we propose a new topic model, Structural Topic Model, ... transitional structures among topics, i.e., how likely one topic would follow another topic, are not captured in this model. In this paper, we propose a new...
Uploaded: 30/03/2014, 21:20
... used, and subsection 2.2 describes the model. 2.1 The Formalism In order to handle the non-linear phenomenon of Arabic, our model adopts the two-level formalism presented by (Pulman and Hepple, ... first to appear on the left of LEx. In our morphographemic model, we add a similar formalism for expressing error rules (3). (3) ERROR FORMALISM ErrSurf =~ Surf { PLC- PRC } whe...
Uploaded: 08/03/2014, 07:20
... coherent topics. Summarization is often used for a long document that includes multiple topics. A summary of such a document can be composed of summaries of the component topics. Identification of topics ... Markov model (HMM), whose states correspond to topics. Given a word sequence, their system assigns each word a topic so that the maximum-probability topic sequence is obtained....
Uploaded: 31/03/2014, 04:20
Scientific report: "A Topic-Model based approach for update summarization"
... training set for tuning the hyperparameters for the model, namely the pseudo-counts for the two Dirichlet priors that affect the topic mix assignment for each document. By performing a grid ... sentence with the highest probability for each topic. While hierarchical topic modeling approaches have shown remarkable effectiveness in learning the latent topics of document...
Uploaded: 31/03/2014, 20:20
Document: Scientific report: "A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining"
... present a novel model of transliteration mining defined as a mixture of a transliteration model and a non-transliteration model. The transliteration model is a joint source channel model (Li et ... labelled information for training. Our system extracts transliteration pairs in an unsupervised fashion. It is also able to utilize labelled information if available, obtaining improv...
Uploaded: 19/02/2014, 19:20
Document: Scientific report: "An Unsupervised Model for Joint Phrase Alignment and Extraction"
... all models, and for the LM we use an interpolated Kneser-Ney 5-gram model. For GIZA++, we use the standard training regimen up to Model 4, and combine alignments with grow-diag-final-and. For ... noted that forcing alignments smaller than the model suggests is only used for generating alignments for use in heuristic extraction, and does not affect the training process. 5...
Uploaded: 20/02/2014, 04:20
Document: Scientific report: "Unsupervised Topic Modelling for Multi-Party Spoken Discourse"
... Association for Computational Linguistics Unsupervised Topic Modelling for Multi-Party Spoken Discourse Matthew Purver CSLI Stanford University Stanford, CA 94305, USA mpurver@stanford.edu Konrad ... Of these lists, 40 contained the most indicative words for each of the 10 topics from different models: the topic segmentation model; a topic model that had the same number of segm...
Uploaded: 20/02/2014, 11:21
Document: Scientific report: "Minimum Cut Model for Spoken Lecture Segmentation"
... three lectures is used for estimating the optimal word block length for representing nodes, the threshold distances for discarding node edges, the number of uniform chunks for estimating tf-idf ... linear sequence of topically coherent segments and thereby induce a content structure of the text. The applications of the derived representation are broad, encompassing information re...
Uploaded: 20/02/2014, 11:21
Scientific report: "A mouse model for in vivo tracking of the major dust mite allergen Der p 2 after inhalation"
... University) for excellent technical assistance. This work was supported by grants from the Swedish Foundation for Health Care Sciences and Allergy Research, the Swedish Research Council for Medicine ... the fate of an allergen upon inhalation, we addressed this issue for a major dust mite allergen, Der p 2. First, a model for Der p 2-sensitization was established in C57BL/6J...
Uploaded: 07/03/2014, 21:20
Scientific report: "Employing Topic Models for Pattern-based Semantic Class Discovery"
... in existing topic modeling applications. Thus we expand the application scope of topic modeling. 2 Topic Models In this section we briefly introduce the two widely used topic models which ... the topics corresponding to the document (see Section 2 for details). Given a corpus, the latent topics can be obtained by a parameter estimation procedure. Topic modeling provide...
Uploaded: 08/03/2014, 00:20