Topic models MDA Topic model

Tài liệu Báo cáo khoa học: "Unsupervised Topic Modelling for Multi-Party Spoken Discourse" ppt

Tài liệu Báo cáo khoa học: "Unsupervised Topic Modelling for Multi-Party Spoken Discourse" ppt

... the topic assignment for each word, zu,i , conditioned on all other topic assignments, z−(u,i) , all topic change indicators, c, and all words, w; and then drawing the topic change indicator for ... on the distribution over topics: with high probability, the distribution for utterance u is the same as for utterance u−1; otherwise, we sample a new distribution over topics This...
Ngày tải lên : 20/02/2014, 11:21
8 366 0
Báo cáo khoa học: "Interactive Topic Modeling" docx

Báo cáo khoa học: "Interactive Topic Modeling" docx

... were able to infer from the discovered topics Common constraint themes that Relative Error 1.00 Best Session 0.98 10 Topics 0.96 20 Topics 50 Topics 0.94 75 Topics Round The constraint size ranged ... by finding 20 initial topics with no constraints, as shown in Table (left) Notice that topics and 20 both deal with Russia Topic 20 seems to be about the Soviet Union, with topic about the po...
Ngày tải lên : 07/03/2014, 22:20
10 231 0
Báo cáo khoa học: "Efficient Tree-Based Topic Modeling" docx

Báo cáo khoa học: "Efficient Tree-Based Topic Modeling" docx

... is topic k’s count in the document d; αk is topic k’s prior; Z− and L− are topic and path assignments excluding wd,n ; βi→j is the prior for edge i → j, ni→j|t is the count of edge i → j in topic ... non-zero counts in a topic To sample from the conditional distribution, first sample which bucket you need and then (and only then) select a topic within that bucket Because the topic...
Ngày tải lên : 23/03/2014, 14:20
5 350 0
Báo cáo khoa học: "Pattern Learning for Relation Extraction with a Hierarchical Topic Model" pptx

Báo cáo khoa học: "Pattern Learning for Relation Extraction with a Hierarchical Topic Model" pptx

... relations φD captures patterns that are specific about a certain entity pair, but which are not generalizable across all pairs with the same relation Finally A contains the patterns that are ... 2011 Relation extraction with relation topics In Proceedings of Empirical Methods in Natural Language Processing Daniel S Weld, Fei Wu, Eytan Adar, Saleema Amershi, James Fogarty, Raphae...
Ngày tải lên : 30/03/2014, 17:20
6 373 0
Báo cáo khoa học: "Structural Topic Model for Latent Topical Structure Analysis" pot

Báo cáo khoa học: "Structural Topic Model for Latent Topical Structure Analysis" pot

... In this paper, we propose a new topic model, named Structural Topic Model (strTM) to model and analyze both latent topics and topical structures in text documents To so, strTM assumes: ... compared with the baseline methods that don’t explicitly model the topical structure The results confirm the necessity of modeling the latent topical structures inside documents, and a...
Ngày tải lên : 30/03/2014, 21:20
10 466 0
Báo cáo khoa học: "a Topic-Model based approach for update summarization" potx

Báo cáo khoa học: "a Topic-Model based approach for update summarization" potx

... novel information in a collection with respect to another one, which is the primary focus of update summarization 2.2 Update Summarization The goal of update summarization is to generate an update ... as training set for tuning the hyperparameters for the model, namely the pseudocounts for the two Dirichlet priors that affects the topic mix assignment for each document By perf...
Ngày tải lên : 31/03/2014, 20:20
10 341 0
Tài liệu Báo cáo khoa học: "Topic Models for Dynamic Translation Model Adaptation" pptx

Tài liệu Báo cáo khoa học: "Topic Models for Dynamic Translation Model Adaptation" pptx

... framework for finitestate and context-free translation models In Proceedings of ACL System Demonstrations Vladimir Eidelman 2012 Optimization strategies for online large-margin learning in machine translation ... subdomains consistent within a document Results Results for both settings are shown in Table GTM models the latent topics at the document level, while LTM models each...
Ngày tải lên : 19/02/2014, 19:20
5 532 0
Báo cáo khoa học: "Employing Topic Models for Pattern-based Semantic Class Discovery" doc

Báo cáo khoa học: "Employing Topic Models for Pattern-based Semantic Class Discovery" doc

... 3.4 for details) before building topic models for CR(q), where some lowfrequency items are removed Determine the number of topics: Most topic models require the number of topics to be known beforehand1 ... Item q Topic modeling Semantic class construction word item (word or phrase) document RASC topic semantic class Table The mapping from the concepts in topic modeli...
Ngày tải lên : 08/03/2014, 00:20
9 398 0
Báo cáo khoa học: "Bilingual Topic AdMixture Models for Word Alignment" ppt

Báo cáo khoa học: "Bilingual Topic AdMixture Models for Word Alignment" ppt

... over two topics In all our following experiments, we use both Null word and Laplace smoothing for the BiTAM models We train, for comparison, IBM-1&4 and HMM models with iterations of IBM-1, for HMM ... this paper, we proposed novel formalism for statistical word alignment based on bilingual admixture (BiTAM) models Three BiTAM models were proposed and evaluated on word...
Ngày tải lên : 17/03/2014, 04:20
8 354 0
Báo cáo khoa học: "Authorship Attribution with Author-aware Topic Models" pptx

Báo cáo khoa học: "Authorship Attribution with Author-aware Topic Models" pptx

... in DADT author topics are disjoint from document topics, with different priors for each topic set Thus, the number of author topics can be different from the number of document topics, enabling ... the topic- based methods, we used the same overall number of topics for all the topic models We present only the results obtained with the best topic settings: 100 for PAN’11 and 400 for...
Ngày tải lên : 23/03/2014, 14:20
6 230 0
Báo cáo khoa học: "Topic Models for Word Sense Disambiguation and Token-based Idiom Detection" pdf

Báo cáo khoa học: "Topic Models for Word Sense Disambiguation and Token-based Idiom Detection" pdf

... neighbour for the chosen word and then assigns a sense based on the word, its neighbour and the topic Boyd-Graber and Blei (2007) test their method on WSD and information retrieval tasks and find ... corrected any bad tags and lemmas for the target instances.4 Sense Paraphrases For word sense disambiguation tasks, the paraphrases of the sense keys are represent...
Ngày tải lên : 23/03/2014, 16:20
10 371 0
Báo cáo khoa học: "PCFGs, Topic Models, Adaptor Grammars and Learning Topical Collocations and the Structure of Proper Names" ppt

Báo cáo khoa học: "PCFGs, Topic Models, Adaptor Grammars and Learning Topical Collocations and the Structure of Proper Names" ppt

... distribution of topics in the document The corresponding Bayesian PCFG associates probabilities with each of the rules in the CFG The probabilities θ Topici associated with the rules expanding the Topici ... for proper names that don’t fit either of the first two expansions) We extracted all of the proper names (i.e., phrases of category NNP and NNPS) in the...
Ngày tải lên : 23/03/2014, 16:20
10 437 0
Báo cáo khoa học: "Identifying Word Translations from Comparable Corpora Using Latent Topic Models" potx

Báo cáo khoa học: "Identifying Word Translations from Comparable Corpora Using Latent Topic Models" potx

... framework for mining translations of words from latent topic models We have proven that topical knowledge is useful and improves the quality of word translations The quality of translations depends ... sampling a new token as word wi ∈ W S from a topic zk can be obtained as follows: (w ) n i +β P (wi |zk ) = φk,i = |W S | k (w ) (1) nk j + W S β j=1 (w ) where, for a word...
Ngày tải lên : 23/03/2014, 16:20
6 449 0
Báo cáo khoa học: "Multi-Document Summarization using Sentence-based Topic Models" docx

Báo cáo khoa học: "Multi-Document Summarization using Sentence-based Topic Models" docx

... our BSTM model leads to better summarization results term-document matrix term-sentence matrix the number of latent topics sentence -topic matrix auxiliary document -topic matrix 1: Randomly initialize ... them 2: repeat 3: Update U using Eq (3); 4: Update V using Eq (4); 5: Compute f using Eq (2); 6: until f converges Experimental Results 5.1 Data Set To evaluate the summarization...
Ngày tải lên : 23/03/2014, 17:20
4 381 0
Báo cáo khoa học: "Automatic Labelling of Topic Models" doc

Báo cáo khoa học: "Automatic Labelling of Topic Models" doc

... proposed to approach topic labelling via best term selection, i.e selecting one of the top-10 topic terms to label the overall topic While it is often possible to label topics with topic terms (as ... not directly related to topic labelling, Chang et al (2009) were one of the first to propose human labelling of topic models, in the form of synthetic intruder word and...
Ngày tải lên : 30/03/2014, 21:20
10 402 0