employing topic models for pattern based semantic class discovery

Báo cáo khoa học: "Employing Topic Models for Pattern-based Semantic Class Discovery" doc

... organizing them into semantic classes. For example, {red, white, black…} is a semantic class consisting of color instances. A popular way for semantic class discovery is pattern- based approach, ... related to multiple topics in some topic models (e.g., pLSI and LDA). Topic modeling Semantic class construction word item (word or phrase) document RASC topic semantic class Table 3. ... 459–467, Suntec, Singapore, 2-7 August 2009. c 2009 ACL and AFNLP Employing Topic Models for Pattern- based Semantic Class Discovery Huibin Zhang 1* Mingjie Zhu 2* Shuming Shi 3 Ji-Rong Wen 3 ...

Ngày tải lên: 08/03/2014, 00:20

9 398 0

Báo cáo khoa học: "Topic Models for Word Sense Disambiguation and Token-based Idiom Detection" pdf

Ngày tải lên: 23/03/2014, 16:20

10 371 0

Tài liệu Báo cáo khoa học: "Topic Models for Dynamic Translation Model Adaptation" pptx

... relevant translations based on topic- speciﬁc contexts, where topics are induced in an unsupervised way using topic models; this can be thought of as inducing subcorpora for adaptation with- out ... probability under topic 1, topic 2, etc., or F 2 : What is the probability under the most probable topic, second most, etc. A model using F 1 learns whether a speciﬁc topic is useful for translation, ... utilize information from any topic distribution if it helps us cre- ate topic relevant translations. F 2 is useful for dynamic adaptation, where the adapted feature weight changes based on...

Ngày tải lên: 19/02/2014, 19:20

5 532 0

Tài liệu Báo cáo khoa học: "Incremental Syntactic Language Models for Phrase-based Translation" pptx

... Association for Computational Linguistics, pages 620–631, Portland, Oregon, June 19-24, 2011. c 2011 Association for Computational Linguistics Incremental Syntactic Language Models for Phrase -based ... are therefore a good mechanism for incorporat- ing syntax into phrase -based translation. We give a formal deﬁnition of one such linear- time syntactic language model, detail its re- lation to phrase -based ... has effectively used n-gram word sequence models as language models. Modern phrase -based translation using large scale n-gram language models generally performs well in terms of lexical choice,...

Ngày tải lên: 20/02/2014, 04:20

12 511 0

Báo cáo khoa học: "Utilizing Dependency Language Models for Graph-based Dependency Parsing Models" pptx

Ngày tải lên: 23/03/2014, 14:20

10 418 0

Báo cáo khoa học: "HAL-based Cascaded Model for Variable-Length Semantic Pattern Induction from Psychiatry Web Resources" pdf

... information indicates that the k words are more likely to form a semantic pattern of length k. Here the length k also ranges from 2 to 4. For each k, we compute the mutual information for ... each of the better semantic patterns. After the initial set for a particular length is created, each semantic pattern and the seed pattern are represented in the HAL space for further computing ... +1 Cascaded Induction Process Induced Patterns Relevant Patterns Figure 1. Framework for variable-length semantic pattern induction. semantically related to the seed pattern. According to the distance...

Ngày tải lên: 08/03/2014, 02:21

8 376 0

Tài liệu Báo cáo khoa học: "Combining Lexical Semantic Resources with Question & Answer Archives for Translation-Based Answer Finding" doc

... encode some form of semantic knowledge. The semantic similarity and relatedness of words has traditionally been assessed through corpus- based and knowledge -based measures. Corpus- based measures ... determined empirically based on MAP and R-Prec). The performance gaps between the translation -based models and the baseline models are statistically signiﬁcant, except for those marked with a ... terms “mosquito” (for question 2) and “form” (for question 3), but only their inﬂected forms “mosquitoes” and “formed”. 6 Conclusion and Future Work We have presented three datasets for training statistical...

Ngày tải lên: 20/02/2014, 07:20

9 527 0

Tài liệu Báo cáo khoa học: "A FrameNet-based Semantic Role Labeler for Swedish" pdf

... the final classifiers consisted of one million training instances for the start classifier, 500,000 for the end classifier, and 272,000 for the role classifier. The features used by the classifiers ... assume that the frame was known a priori. We used the available semantic roles for all senses of the target word as features for the classifier. On a test set from FrameNet, we estimated that the ... SRL system for Swedish text. Like most previous systems, it consists of two parts: a FE bracketer and a classifier that assigns semantic roles to FEs. Both parts are implemented as SVM classifiers...

Ngày tải lên: 20/02/2014, 12:20

8 470 0

Tài liệu Báo cáo khoa học: "Statistical phrase-based models for interactive computer-assisted translation" pdf

... framework, real-time is re- quired. 4 Phrase -based models The usual statistical translation models can be classiﬁed as single-word based alignment models. Models of this kind assume that an input ... Association for Computational Linguistics Statistical phrase -based models for interactive computer-assisted translation Jes ´ us Tom ´ as and Francisco Casacuberta Instituto Tecnol ´ ogico de Inform ´ atica Universidad ... framework (Och et al., 2003). Phrase -based models have proved to be very ad- equate statistical models for MT (Tom ´ as et al., 2005). In this work, the use of these models has been extended to interactive...

Ngày tải lên: 20/02/2014, 12:20

7 308 0

Tài liệu Báo cáo khoa học: "Towards History-based Grammars: Using Richer Models for Probabilistic Parsing*" docx

... describe a technique for estimating the parameters for this model using decision trees. The history -based grammar model provides a mechanism for taking advantage of contextual information from ... contextual information used by their models is because of the difficulty in estimating very rich probabilistic models of context. In this work, we present a model, the history -based grammar ... h Grammar and parse tree for aabb. dexed the non-terminal (NT) nodes of the tree with this leftmost order. We denote by ~- the sen- tential form obtained just before we expand node i. Hence,...

Ngày tải lên: 20/02/2014, 21:20

7 372 0

Tài liệu Báo cáo khoa học: "Phrase-Based Backoff Models for Machine Translation of Highly Inﬂected Languages" docx

... (Zitouni et al., 2003) for language modeling and by (Gildea, 2001) for semantic role labeling. (Resnik et al., 2001) used backoff translation lexicons for cross-language information re- trieval. ... re- trieval. More recently, (Xi and Hwa, 2005) have used backoff models for combining in-domain and 42 Phrase -Based Backoff Models for Machine Translation of Highly Inﬂected Languages Mei Yang Department ... accordingly. The phrase table will thus include entries for phrases based on full word forms as well as for their stemmed and/or split counterparts. For each entry with decomposed morphological i i...

Ngày tải lên: 22/02/2014, 02:20

8 379 0

Báo cáo khoa học: "Semantic Class Learning from the Web with Hyponym Pattern Linkage Graphs" pdf

... developed for automatic semantic class identiﬁcation, under the rubrics of lexical acquisition, hyponym acquisition, semantic lexicon induction, semantic class learning, and web -based information ... just one doubly-anchored pattern to identify candidate instances for a semantic class: < ;class name> such as < ;class member> and * This pattern has two variables: the name of the semantic class to ... both the name of the semantic class as well as a class member. For example, the pattern “CARS such as FORD and *” will extract automobiles, and the pattern “PRESIDENTS such as FORD and *” will extract presidents....

Ngày tải lên: 17/03/2014, 02:20

9 340 0

Báo cáo khoa học: "Exploring Distributional Similarity Based Models for Query Spelling Correction" docx

... Table 2. Performance results for different models Table 2 details the performance scores for the experiments, which shows that both of the two distributional similarity -based models boost ... methods for error model estimation are all based on the similarity between the character strings of q i and c i as described in 3.1. Here we describe a distributional similarity- based method for ... Similarity -Based Mod- els for Query Spelling Correction 3.1 Motivation Most of the previous work on spelling correction concentrates on the problem of designing better error models based on...

Ngày tải lên: 17/03/2014, 04:20

8 309 0

Báo cáo khoa học: "Integrating Pattern-based and Distributional Similarity Methods for Lexical Entailment Acquisition" doc

... pairs for the input term by pattern- based and distributional similarity methods (Section 3.2); 2: Constructing a feature set for all candidates based on pattern- based and distributional information ... features, suitable for supervised classification. To this end we developed a novel feature set based on both pattern- based and distributional data. To obtain pattern statistics for each pair, ... found in such patterns. In addition to their use for learning lexical semantic relations, patterns were commonly used to learn instances of concrete semantic relations for Information Extraction...

Ngày tải lên: 17/03/2014, 04:20

8 356 0

Báo cáo khoa học: "Bilingual Topic AdMixture Models for Word Alignment" ppt

... of a topic. Given a speciﬁc topic- weight vector θ d for a document-pair, each sentence-pair draws its conditionally independent topics from a mixture of topics. This generative process, for a document-pair ... on topical translation models concern mainly explicit logical representations of semantics for machine translation. This include knowledge -based (Nyberg and Mitamura, 1992) and interlingua -based ... success. We propose a new statistical formalism: Bilingual Topic AdMixture model, or BiTAM, to facilitate topic- based word alignment in SMT. Variants of admixture models have appeared in population...

Ngày tải lên: 17/03/2014, 04:20

8 354 0

Báo cáo khoa học: "WordNet-based Semantic Relatedness Measures in Automatic Speech Recognition for Meetings" doc

... long-term information. In this paper the best performing measures from (Pucher, 2005), which outperform baseline models on word prediction for conversational tele- phone speech are used for Automatic ... dependencies. Therefore (Bellegarda, 2000) proposed combining n-gram language models, which are effective for predicting local dependencies, with Latent Semantic Analysis (LSA) based models for covering ... WordNet -based models can be used for rescoring of ‘real’ N-best lists in a difﬁcult task. 1.1 Word prediction by semantic similarity The standard n-gram approach in language modeling for speech...

Ngày tải lên: 17/03/2014, 04:20

4 204 0

Báo cáo khoa học: "A Semantic-Head-Driven Generation Algorithm for Unification-Based Formalisms" potx

... information is used, it is unclear what semantic information is derived by the rules themselves, and what semantic information is available because of unifications with the original semantics. ... category. For example, here is the rule for sen- tences: s(Form, GO-G, Store)/quant(Q,X,R,S) > s(Form, GO-G, [qterm(Q,X,R) JStore])/S. The term quant (C~, X, R, S) represents a quantified formula ... vp(nonfinite,[np(_)/you])/S. s(Form)/S > Subj, vp(Fona,[Subj])/S. (2) vp(Form,Subcat)/S > vp(Form,[Compl[Subcat])/S, Compl. (3) vp(Form,[Subj])/S > vp(Forl,[Subj])/VP, adv(VP)/S. vp(finite,[np(_)/O,np(3-sing)/S])/love(S,O)...

Ngày tải lên: 17/03/2014, 20:20

11 273 0

Báo cáo khoa học: "Latent Variable Models for Semantic Orientations of Phrases" pdf

Ngày tải lên: 17/03/2014, 22:20

8 404 0

A Survey of Activity Network-Based Process Models for Managing Product Development Projects doc

Ngày tải lên: 23/03/2014, 04:20

24 459 0

Báo cáo khoa học: "Multi-Document Summarization using Sentence-based Topic Models" docx

Ngày tải lên: 23/03/2014, 17:20

4 381 0